Query psy460
Match_columns 117
No_of_seqs 54 out of 56
Neff 4.3
Searched_HMMs 46136
Date Fri Aug 16 21:04:11 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy460.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/460hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF14712 Snapin_Pallidin: Snap 99.9 2.2E-23 4.9E-28 142.1 11.0 90 25-114 1-92 (92)
2 PF07889 DUF1664: Protein of u 92.1 3.7 8.1E-05 30.4 11.2 73 35-113 44-116 (126)
3 PF05531 NPV_P10: Nucleopolyhe 89.6 2.8 6.1E-05 28.8 7.1 64 30-110 3-66 (75)
4 PHA03386 P10 fibrous body prot 88.9 3 6.5E-05 29.9 7.0 28 32-59 6-33 (94)
5 KOG2911|consensus 88.9 3 6.4E-05 36.9 8.4 75 39-113 227-307 (439)
6 PRK11637 AmiB activator; Provi 88.4 14 0.0003 31.3 12.0 23 86-108 89-111 (428)
7 PRK11637 AmiB activator; Provi 85.7 12 0.00026 31.7 10.1 66 42-113 44-109 (428)
8 PF04420 CHD5: CHD5-like prote 84.7 4.2 9.1E-05 30.6 6.3 55 48-110 36-90 (161)
9 PHA03395 p10 fibrous body prot 84.0 7.9 0.00017 27.4 7.0 63 30-109 3-65 (87)
10 COG3883 Uncharacterized protei 83.3 8.3 0.00018 32.0 8.0 36 38-73 52-87 (265)
11 PF10458 Val_tRNA-synt_C: Valy 83.2 9.8 0.00021 24.5 7.8 63 43-105 2-65 (66)
12 PF06160 EzrA: Septation ring 83.1 16 0.00036 32.4 10.2 62 32-99 373-434 (560)
13 PF04100 Vps53_N: Vps53-like, 82.9 15 0.00032 31.4 9.5 78 32-109 26-118 (383)
14 PRK14011 prefoldin subunit alp 82.0 9.9 0.00021 28.6 7.3 47 36-88 86-132 (144)
15 PRK10803 tol-pal system protei 81.6 7.5 0.00016 31.3 7.0 34 40-73 42-75 (263)
16 PF10158 LOH1CR12: Tumour supp 81.4 19 0.00042 26.6 10.3 88 29-116 26-117 (131)
17 PF05531 NPV_P10: Nucleopolyhe 81.2 15 0.00032 25.2 7.9 35 39-73 19-56 (75)
18 PF03357 Snf7: Snf7; InterPro 79.9 18 0.00038 25.9 7.9 73 40-114 3-76 (171)
19 PF01519 DUF16: Protein of unk 79.7 21 0.00045 25.9 8.0 24 90-113 78-101 (102)
20 PF11382 DUF3186: Protein of u 79.0 8 0.00017 31.9 6.5 47 27-73 17-67 (308)
21 PF15027 DUF4525: Domain of un 78.8 6.7 0.00015 29.8 5.4 58 52-113 38-99 (138)
22 PF05377 FlaC_arch: Flagella a 77.5 17 0.00036 23.6 6.4 34 40-73 9-42 (55)
23 COG4942 Membrane-bound metallo 77.2 28 0.0006 30.7 9.5 67 36-108 43-109 (420)
24 COG0255 RpmC Ribosomal protein 75.6 9.9 0.00021 25.5 5.1 54 52-107 11-64 (69)
25 PF05837 CENP-H: Centromere pr 74.3 28 0.0006 24.5 9.8 69 37-105 9-84 (106)
26 PF04912 Dynamitin: Dynamitin 73.9 36 0.00078 28.7 9.2 79 36-114 244-326 (388)
27 PF10392 COG5: Golgi transport 73.8 23 0.0005 25.5 7.1 45 56-105 76-122 (132)
28 TIGR02231 conserved hypothetic 73.3 58 0.0013 28.3 10.5 79 36-114 69-166 (525)
29 smart00787 Spc7 Spc7 kinetocho 73.3 54 0.0012 27.4 11.7 76 31-112 172-251 (312)
30 PF12998 ING: Inhibitor of gro 73.0 15 0.00033 24.5 5.6 51 21-71 55-105 (105)
31 PF02646 RmuC: RmuC family; I 72.3 34 0.00073 28.1 8.5 67 31-107 3-69 (304)
32 PF02601 Exonuc_VII_L: Exonucl 72.0 51 0.0011 26.6 11.0 82 33-114 138-227 (319)
33 PF04111 APG6: Autophagy prote 71.9 43 0.00094 27.8 9.1 81 34-114 53-134 (314)
34 PHA02562 46 endonuclease subun 70.8 50 0.0011 28.3 9.5 93 16-110 286-382 (562)
35 PF07889 DUF1664: Protein of u 70.7 41 0.00088 24.9 8.0 68 32-112 55-122 (126)
36 PF03961 DUF342: Protein of un 70.2 59 0.0013 27.9 9.8 31 43-73 332-362 (451)
37 PF08700 Vps51: Vps51/Vps67; 69.2 28 0.00062 22.6 7.8 64 47-112 21-84 (87)
38 PF15233 SYCE1: Synaptonemal c 68.3 36 0.00079 25.8 7.2 55 34-95 2-63 (134)
39 PF06632 XRCC4: DNA double-str 66.1 84 0.0018 26.8 9.9 85 20-107 121-208 (342)
40 TIGR03185 DNA_S_dndD DNA sulfu 65.5 87 0.0019 28.1 10.2 60 55-114 394-456 (650)
41 cd00427 Ribosomal_L29_HIP Ribo 65.3 24 0.00052 22.2 4.9 49 53-103 7-55 (57)
42 PF10552 ORF6C: ORF6C domain; 65.0 46 0.00099 23.5 6.9 51 46-98 2-52 (116)
43 TIGR00012 L29 ribosomal protei 64.5 22 0.00048 22.3 4.7 50 52-103 5-54 (55)
44 KOG0971|consensus 63.1 58 0.0012 32.1 8.9 73 35-110 466-549 (1243)
45 PF15035 Rootletin: Ciliary ro 62.8 55 0.0012 25.4 7.5 69 36-105 14-89 (182)
46 PF00831 Ribosomal_L29: Riboso 62.6 18 0.00039 22.9 4.0 50 52-103 7-56 (58)
47 PF04799 Fzo_mitofusin: fzo-li 61.8 76 0.0016 24.8 9.1 72 14-106 93-164 (171)
48 PRK04778 septation ring format 61.2 1.2E+02 0.0026 27.0 11.0 25 40-64 385-409 (569)
49 PF13166 AAA_13: AAA domain 61.1 1.2E+02 0.0026 26.9 11.4 90 15-110 256-346 (712)
50 PF15458 NTR2: Nineteen comple 60.9 30 0.00065 27.9 6.0 39 36-74 206-244 (254)
51 PF10779 XhlA: Haemolysin XhlA 60.2 38 0.00082 22.1 5.4 38 37-74 12-49 (71)
52 cd00584 Prefoldin_alpha Prefol 59.8 44 0.00096 23.5 6.1 40 35-74 84-123 (129)
53 COG1421 CRISPR system related 58.7 11 0.00023 28.7 2.8 20 24-43 86-107 (137)
54 PRK02224 chromosome segregatio 58.5 1.2E+02 0.0025 27.8 9.9 59 54-113 622-680 (880)
55 KOG3046|consensus 57.8 30 0.00065 26.6 5.1 58 24-85 6-80 (147)
56 PF03980 Nnf1: Nnf1 ; InterPr 57.7 60 0.0013 22.4 6.4 47 19-65 61-107 (109)
57 cd00890 Prefoldin Prefoldin is 57.1 54 0.0012 22.5 6.1 38 36-73 85-122 (129)
58 PRK03918 chromosome segregatio 56.9 96 0.0021 28.2 9.0 57 55-112 636-692 (880)
59 PRK03947 prefoldin subunit alp 56.7 51 0.0011 23.6 6.1 39 35-73 91-129 (140)
60 KOG3684|consensus 56.6 37 0.0008 30.6 6.2 43 37-79 426-468 (489)
61 PF06120 Phage_HK97_TLTM: Tail 56.6 1.2E+02 0.0026 25.5 10.5 35 38-72 74-108 (301)
62 PF05377 FlaC_arch: Flagella a 56.1 53 0.0012 21.3 5.6 35 40-74 2-36 (55)
63 PRK02224 chromosome segregatio 56.0 1.1E+02 0.0024 28.0 9.3 46 24-71 150-199 (880)
64 PF13514 AAA_27: AAA domain 55.4 70 0.0015 30.6 8.2 91 25-115 116-210 (1111)
65 PRK01203 prefoldin subunit alp 54.3 58 0.0013 24.3 6.1 41 24-66 2-42 (130)
66 TIGR02169 SMC_prok_A chromosom 54.2 1.8E+02 0.0039 26.8 11.4 23 86-108 469-491 (1164)
67 TIGR02680 conserved hypothetic 54.2 1.5E+02 0.0032 29.5 10.3 74 36-113 747-820 (1353)
68 PRK10920 putative uroporphyrin 53.6 93 0.002 27.0 8.1 72 42-113 96-173 (390)
69 PRK14549 50S ribosomal protein 53.3 45 0.00097 22.0 4.8 51 52-104 12-63 (69)
70 PF12001 DUF3496: Domain of un 52.6 76 0.0016 23.2 6.3 63 39-108 8-72 (111)
71 TIGR00293 prefoldin, archaeal 52.5 70 0.0015 22.3 6.1 39 35-73 83-121 (126)
72 PF11014 DUF2852: Protein of u 52.4 16 0.00034 27.0 2.8 27 65-94 77-103 (115)
73 PHA02562 46 endonuclease subun 52.3 1.5E+02 0.0033 25.4 10.2 86 24-111 151-245 (562)
74 PF11932 DUF3450: Protein of u 51.8 1.1E+02 0.0024 24.1 7.7 44 40-84 86-129 (251)
75 PF09748 Med10: Transcription 51.2 25 0.00054 25.6 3.7 34 42-75 14-49 (128)
76 PRK00461 rpmC 50S ribosomal pr 50.9 69 0.0015 22.3 5.7 51 52-104 8-58 (87)
77 TIGR00634 recN DNA repair prot 50.6 1.6E+02 0.0036 25.9 9.3 66 40-105 170-238 (563)
78 TIGR02169 SMC_prok_A chromosom 50.4 2.1E+02 0.0045 26.5 10.7 21 91-111 467-487 (1164)
79 PRK00306 50S ribosomal protein 50.4 67 0.0014 20.7 5.3 53 52-106 9-61 (66)
80 PF12325 TMF_TATA_bd: TATA ele 50.3 98 0.0021 22.6 11.1 74 29-112 14-87 (120)
81 PF06160 EzrA: Septation ring 50.1 1.3E+02 0.0029 26.8 8.7 59 55-113 375-434 (560)
82 PF04156 IncA: IncA protein; 49.6 1.1E+02 0.0023 22.8 10.1 19 53-71 82-100 (191)
83 COG0497 RecN ATPase involved i 49.4 2E+02 0.0043 26.4 9.7 75 34-112 160-234 (557)
84 PF14728 PHTB1_C: PTHB1 C-term 49.3 1.7E+02 0.0037 25.1 10.6 80 36-116 199-284 (377)
85 CHL00154 rpl29 ribosomal prote 49.2 67 0.0015 21.2 5.2 50 52-104 12-62 (67)
86 PF12757 DUF3812: Protein of u 48.9 57 0.0012 23.8 5.3 54 12-68 61-114 (126)
87 PF07106 TBPIP: Tat binding pr 48.2 1.1E+02 0.0024 22.6 9.1 62 40-105 74-135 (169)
88 PF06013 WXG100: Proteins of 1 47.9 63 0.0014 19.7 10.6 71 41-112 7-77 (86)
89 COG1196 Smc Chromosome segrega 47.9 2.7E+02 0.0059 27.0 10.9 67 41-113 761-827 (1163)
90 PF04912 Dynamitin: Dynamitin 47.5 1.7E+02 0.0037 24.6 9.5 53 45-113 336-388 (388)
91 PF04977 DivIC: Septum formati 47.5 70 0.0015 20.1 5.3 32 40-71 19-50 (80)
92 smart00338 BRLZ basic region l 47.3 70 0.0015 20.1 5.9 36 38-73 26-61 (65)
93 PF12329 TMF_DNA_bd: TATA elem 47.2 83 0.0018 20.9 5.9 18 93-110 54-71 (74)
94 COG4477 EzrA Negative regulato 46.3 1.5E+02 0.0032 27.4 8.4 61 33-99 377-437 (570)
95 smart00126 IL6 Interleukin-6 h 46.1 81 0.0017 24.2 5.9 93 17-109 42-146 (154)
96 PRK00846 hypothetical protein; 46.0 95 0.0021 21.2 6.4 42 32-73 21-62 (77)
97 TIGR02168 SMC_prok_B chromosom 46.0 2.4E+02 0.0052 25.9 10.7 12 92-103 823-834 (1179)
98 PRK10803 tol-pal system protei 45.8 1.4E+02 0.003 24.0 7.6 46 30-75 46-91 (263)
99 TIGR02338 gimC_beta prefoldin, 45.7 1E+02 0.0022 21.4 6.6 39 33-71 69-107 (110)
100 PF10157 DUF2365: Uncharacteri 45.5 1.3E+02 0.0029 22.8 6.9 45 30-74 87-131 (149)
101 PF10186 Atg14: UV radiation r 45.3 1.4E+02 0.0031 23.0 10.1 26 43-68 68-93 (302)
102 PF12718 Tropomyosin_1: Tropom 45.3 1.2E+02 0.0027 22.4 9.7 54 40-99 16-69 (143)
103 COG3879 Uncharacterized protei 45.2 1.8E+02 0.0038 24.1 8.9 71 34-113 32-102 (247)
104 COG1730 GIM5 Predicted prefold 44.9 83 0.0018 23.8 5.7 38 36-73 92-129 (145)
105 PF14257 DUF4349: Domain of un 44.3 1.5E+02 0.0033 23.3 7.4 65 47-115 127-191 (262)
106 PF06103 DUF948: Bacterial pro 44.1 93 0.002 20.6 11.4 43 32-74 13-55 (90)
107 PF13815 Dzip-like_N: Iguana/D 44.0 1.1E+02 0.0025 21.6 6.1 51 15-73 44-94 (118)
108 PF14644 DUF4456: Domain of un 43.7 1.5E+02 0.0033 22.9 10.6 77 16-98 18-96 (208)
109 PF10846 DUF2722: Protein of u 43.4 1.2E+02 0.0026 26.6 7.2 64 22-85 33-107 (416)
110 PF13600 DUF4140: N-terminal d 43.3 90 0.0019 21.1 5.3 34 36-69 68-101 (104)
111 PF05064 Nsp1_C: Nsp1-like C-t 43.2 57 0.0012 23.2 4.5 38 37-74 56-93 (116)
112 PF10368 YkyA: Putative cell-w 42.6 89 0.0019 24.5 5.8 43 32-74 148-193 (204)
113 PF11853 DUF3373: Protein of u 42.6 28 0.0006 31.3 3.3 24 52-75 24-47 (489)
114 PRK10722 hypothetical protein; 42.0 69 0.0015 26.5 5.3 30 44-73 175-204 (247)
115 PF14282 FlxA: FlxA-like prote 41.2 1.2E+02 0.0027 21.2 6.9 46 57-104 24-69 (106)
116 PF05911 DUF869: Plant protein 41.1 1.1E+02 0.0023 29.1 6.9 71 36-106 678-760 (769)
117 KOG3850|consensus 40.5 82 0.0018 28.1 5.7 57 5-66 232-288 (455)
118 CHL00198 accA acetyl-CoA carbo 40.4 43 0.00094 28.4 4.0 48 34-84 10-61 (322)
119 PRK10869 recombination and rep 40.0 2.7E+02 0.0058 24.9 9.0 66 40-105 166-234 (553)
120 COG1196 Smc Chromosome segrega 39.8 3.6E+02 0.0079 26.2 11.9 23 44-66 827-849 (1163)
121 PF00170 bZIP_1: bZIP transcri 39.7 95 0.0021 19.4 5.8 33 40-72 28-60 (64)
122 KOG4643|consensus 39.5 4E+02 0.0086 26.8 10.5 74 33-113 435-508 (1195)
123 PF08651 DASH_Duo1: DASH compl 39.4 1.2E+02 0.0026 20.5 6.2 41 68-114 10-50 (78)
124 PF06694 Plant_NMP1: Plant nuc 38.8 2.6E+02 0.0056 24.1 8.8 72 33-105 103-194 (325)
125 PF04977 DivIC: Septum formati 38.6 1E+02 0.0022 19.4 5.2 27 38-64 24-50 (80)
126 PF02996 Prefoldin: Prefoldin 38.3 1.3E+02 0.0028 20.5 6.0 38 36-73 75-112 (120)
127 PF10241 KxDL: Uncharacterized 37.7 1.3E+02 0.0028 20.4 7.5 63 47-109 13-80 (88)
128 PF04102 SlyX: SlyX; InterPro 37.7 1.1E+02 0.0025 19.8 5.5 41 32-72 12-52 (69)
129 PF11932 DUF3450: Protein of u 37.3 2E+02 0.0044 22.5 11.1 27 39-65 22-48 (251)
130 PHA03395 p10 fibrous body prot 37.2 64 0.0014 22.8 3.8 35 39-73 19-56 (87)
131 KOG0982|consensus 37.1 3.2E+02 0.007 24.8 9.3 69 34-108 300-368 (502)
132 COG1382 GimC Prefoldin, chaper 36.9 1.3E+02 0.0029 22.2 5.6 35 39-73 78-112 (119)
133 COG2900 SlyX Uncharacterized p 36.8 1.4E+02 0.003 20.5 5.4 40 35-74 19-58 (72)
134 PF09969 DUF2203: Uncharacteri 36.6 1.6E+02 0.0036 21.3 8.4 16 29-44 8-23 (120)
135 PRK10869 recombination and rep 36.6 2.7E+02 0.0059 24.8 8.5 31 43-73 259-289 (553)
136 PF13852 DUF4197: Protein of u 36.3 94 0.002 24.5 5.1 53 26-86 115-167 (202)
137 KOG2196|consensus 36.1 2.6E+02 0.0056 23.4 9.9 38 37-74 112-149 (254)
138 PF10473 CENP-F_leu_zip: Leuci 36.1 1.9E+02 0.0041 21.8 10.7 62 39-113 36-100 (140)
139 TIGR01807 CM_P2 chorismate mut 36.0 1.2E+02 0.0027 19.6 4.9 18 56-73 3-20 (76)
140 PRK04778 septation ring format 35.9 2.9E+02 0.0063 24.6 8.6 68 45-112 369-437 (569)
141 PF07028 DUF1319: Protein of u 35.8 1.9E+02 0.0041 21.7 6.9 37 39-75 47-83 (126)
142 cd08063 MPN_CSN6 Mpr1p, Pad1p 35.5 1.5E+02 0.0033 24.0 6.3 56 23-80 165-220 (288)
143 PF11945 WASH_WAHD: WAHD domai 35.4 1.2E+02 0.0027 25.3 5.9 34 81-114 38-71 (297)
144 PF04625 DEC-1_N: DEC-1 protei 35.4 1.6E+02 0.0034 25.9 6.6 23 53-75 290-312 (407)
145 PF05055 DUF677: Protein of un 34.5 2.9E+02 0.0063 23.5 9.8 71 41-112 83-172 (336)
146 PRK10929 putative mechanosensi 34.5 2.7E+02 0.0059 27.6 8.7 52 37-91 332-383 (1109)
147 PF12777 MT: Microtubule-bindi 34.3 1.6E+02 0.0036 24.3 6.5 55 54-114 230-284 (344)
148 PRK15422 septal ring assembly 34.1 1.6E+02 0.0035 20.5 9.5 35 38-72 4-38 (79)
149 PRK06835 DNA replication prote 34.0 1.1E+02 0.0024 25.5 5.4 43 38-80 36-93 (329)
150 PF06148 COG2: COG (conserved 33.8 62 0.0013 23.0 3.4 19 46-64 70-88 (133)
151 TIGR01791 CM_archaeal chorisma 33.8 1.3E+02 0.0029 19.6 4.8 18 56-73 3-20 (83)
152 PF08317 Spc7: Spc7 kinetochor 33.2 2.8E+02 0.006 22.9 9.6 28 28-55 146-173 (325)
153 PF10498 IFT57: Intra-flagella 33.1 3.1E+02 0.0068 23.4 11.8 66 42-113 249-314 (359)
154 TIGR01843 type_I_hlyD type I s 33.0 2.6E+02 0.0057 22.6 9.1 32 37-68 143-174 (423)
155 PRK07857 hypothetical protein; 32.8 1.3E+02 0.0028 21.8 4.9 17 36-52 33-49 (106)
156 PRK07737 fliD flagellar cappin 32.7 3.5E+02 0.0076 23.9 8.8 47 27-73 414-462 (501)
157 PF12451 VPS11_C: Vacuolar pro 32.6 92 0.002 19.2 3.6 25 45-69 5-29 (49)
158 TIGR00634 recN DNA repair prot 32.6 3.5E+02 0.0076 23.9 10.2 30 44-73 265-294 (563)
159 PRK10929 putative mechanosensi 32.6 2E+02 0.0043 28.5 7.4 59 40-104 104-162 (1109)
160 PHA02675 ORF104 fusion protein 32.5 99 0.0021 22.0 4.1 15 21-35 28-42 (90)
161 PRK00888 ftsB cell division pr 32.3 1.8E+02 0.0039 20.4 6.9 43 39-88 28-70 (105)
162 TIGR03007 pepcterm_ChnLen poly 32.2 3.2E+02 0.0069 23.3 9.7 16 38-53 275-290 (498)
163 TIGR01801 CM_A chorismate muta 32.1 1.5E+02 0.0032 20.8 5.0 15 57-71 9-23 (102)
164 PF12221 HflK_N: Bacterial mem 32.0 42 0.00092 20.5 2.0 15 76-90 18-32 (42)
165 PF02388 FemAB: FemAB family; 32.0 1.8E+02 0.0039 24.7 6.4 50 54-114 244-294 (406)
166 PF04375 HemX: HemX; InterPro 31.7 3.2E+02 0.0069 23.1 8.1 11 102-112 156-166 (372)
167 KOG3436|consensus 31.4 2.3E+02 0.0049 21.3 7.2 52 52-104 12-63 (123)
168 PLN03246 26S proteasome regula 31.1 2.2E+02 0.0047 23.8 6.6 41 39-80 182-222 (303)
169 COG5509 Uncharacterized small 30.8 1.1E+02 0.0023 20.6 3.9 25 40-71 27-51 (65)
170 PRK04325 hypothetical protein; 30.7 1.6E+02 0.0036 19.5 5.3 41 32-72 17-57 (74)
171 PRK09039 hypothetical protein; 30.7 3.2E+02 0.007 22.9 11.6 72 32-109 131-203 (343)
172 PF05478 Prominin: Prominin; 30.6 4.5E+02 0.0098 24.5 10.3 57 36-92 251-329 (806)
173 PRK07248 hypothetical protein; 30.2 1.7E+02 0.0036 19.4 5.0 28 36-63 6-33 (87)
174 cd08062 MPN_RPN7_8 Mpr1p, Pad1 30.2 2.6E+02 0.0056 22.9 6.9 38 42-80 179-216 (280)
175 PRK06664 fliD flagellar hook-a 30.1 1.8E+02 0.0039 27.0 6.4 44 29-72 581-627 (661)
176 PF06840 DUF1241: Protein of u 30.1 2.6E+02 0.0056 21.5 7.1 57 31-99 12-68 (154)
177 COG3074 Uncharacterized protei 29.9 1.9E+02 0.0042 20.0 9.9 64 39-112 5-72 (79)
178 cd00890 Prefoldin Prefoldin is 29.5 1.9E+02 0.0041 19.7 5.8 39 33-71 89-127 (129)
179 PF01608 I_LWEQ: I/LWEQ domain 29.3 2.6E+02 0.0056 21.3 7.7 77 38-115 70-147 (152)
180 PRK11281 hypothetical protein; 29.3 4.5E+02 0.0098 26.1 9.2 58 38-95 353-415 (1113)
181 PF06103 DUF948: Bacterial pro 29.2 1.7E+02 0.0038 19.3 9.7 38 36-73 24-61 (90)
182 KOG0396|consensus 29.0 4.1E+02 0.0089 23.5 9.3 80 33-113 9-99 (389)
183 PF09006 Surfac_D-trimer: Lung 28.8 1.6E+02 0.0034 18.6 4.6 19 56-74 3-21 (46)
184 KOG0018|consensus 28.7 4.3E+02 0.0093 26.5 8.9 67 40-112 685-751 (1141)
185 PF10359 Fmp27_WPPW: RNA pol I 28.5 1.6E+02 0.0036 25.7 5.7 88 13-108 139-229 (475)
186 TIGR02894 DNA_bind_RsfA transc 28.5 1.7E+02 0.0037 22.7 5.3 34 40-73 99-132 (161)
187 COG1579 Zn-ribbon protein, pos 28.4 3.3E+02 0.0072 22.3 12.0 78 37-114 44-124 (239)
188 PF14584 DUF4446: Protein of u 28.2 2.1E+02 0.0045 21.6 5.6 35 36-70 44-78 (151)
189 TIGR01005 eps_transp_fam exopo 27.9 4.6E+02 0.01 23.8 9.9 71 41-117 183-268 (754)
190 PRK06975 bifunctional uroporph 27.3 4.9E+02 0.011 23.8 9.9 14 100-113 445-458 (656)
191 KOG0979|consensus 27.2 6.4E+02 0.014 25.2 10.1 75 37-114 624-699 (1072)
192 PLN03230 acetyl-coenzyme A car 27.2 96 0.0021 27.6 4.1 34 40-73 82-119 (431)
193 TIGR00513 accA acetyl-CoA carb 27.1 1.2E+02 0.0026 25.7 4.5 39 34-73 7-49 (316)
194 PRK06034 hypothetical protein; 26.9 1.4E+02 0.003 24.9 4.8 33 55-93 12-44 (279)
195 PF13942 Lipoprotein_20: YfhG 26.8 1.5E+02 0.0032 23.6 4.6 33 41-73 126-158 (179)
196 PF02181 FH2: Formin Homology 26.4 3.2E+02 0.0069 22.3 6.8 56 31-90 309-364 (370)
197 KOG0976|consensus 26.3 4E+02 0.0087 26.5 8.1 82 28-113 345-430 (1265)
198 PHA03386 P10 fibrous body prot 26.1 2.4E+02 0.0053 20.2 5.3 45 55-102 15-59 (94)
199 PRK15366 type III secretion sy 26.0 2.3E+02 0.0051 19.7 5.4 46 38-86 2-47 (80)
200 PRK10884 SH3 domain-containing 25.9 3.3E+02 0.0072 21.5 9.7 28 87-114 147-174 (206)
201 smart00787 Spc7 Spc7 kinetocho 25.9 4E+02 0.0086 22.3 9.5 34 80-113 205-238 (312)
202 KOG3438|consensus 25.5 1.4E+02 0.0031 21.8 4.1 28 21-48 67-97 (105)
203 PF10267 Tmemb_cc2: Predicted 25.5 4.6E+02 0.01 22.9 11.1 20 94-113 299-318 (395)
204 PF01544 CorA: CorA-like Mg2+ 25.4 3.1E+02 0.0066 20.8 8.2 63 32-109 112-174 (292)
205 PTZ00419 valyl-tRNA synthetase 25.1 3.1E+02 0.0067 26.2 7.3 64 43-106 927-991 (995)
206 PRK05729 valS valyl-tRNA synth 24.9 2.6E+02 0.0056 26.3 6.7 62 43-104 809-871 (874)
207 PHA01750 hypothetical protein 24.9 2.3E+02 0.0051 19.4 6.1 9 34-42 30-38 (75)
208 PRK10378 inactive ferrous ion 24.7 2.4E+02 0.0052 24.5 6.0 36 32-73 301-336 (375)
209 PF04156 IncA: IncA protein; 24.6 2.9E+02 0.0063 20.4 10.4 22 93-114 158-179 (191)
210 PF02865 STAT_int: STAT protei 24.6 2.8E+02 0.006 20.1 7.1 70 23-97 50-120 (124)
211 PF08614 ATG16: Autophagy prot 24.6 1.5E+02 0.0031 22.6 4.2 75 36-116 72-146 (194)
212 PF06120 Phage_HK97_TLTM: Tail 24.6 4.3E+02 0.0093 22.3 9.3 33 43-75 39-71 (301)
213 PF05103 DivIVA: DivIVA protei 24.5 1.5E+02 0.0031 20.5 3.9 32 42-73 22-53 (131)
214 PRK10884 SH3 domain-containing 24.5 3.4E+02 0.0075 21.4 6.4 10 100-109 146-155 (206)
215 PF12128 DUF3584: Protein of u 24.5 6.8E+02 0.015 24.6 11.3 32 81-112 308-339 (1201)
216 PF07851 TMPIT: TMPIT-like pro 24.4 4.6E+02 0.0099 22.5 9.0 69 42-116 1-77 (330)
217 COG1813 Predicted transcriptio 24.3 2.3E+02 0.005 22.1 5.3 45 35-80 72-120 (165)
218 PF07047 OPA3: Optic atrophy 3 24.2 2.9E+02 0.0062 20.1 6.4 64 10-73 61-126 (134)
219 PF08838 DUF1811: Protein of u 24.1 1E+02 0.0022 22.3 3.1 40 51-92 6-46 (102)
220 KOG0996|consensus 23.8 6.3E+02 0.014 25.8 9.1 78 33-116 860-951 (1293)
221 TIGR01797 CM_P_1 chorismate mu 23.7 2.3E+02 0.0049 18.7 5.0 18 56-73 3-20 (83)
222 PF10168 Nup88: Nuclear pore c 23.5 6.2E+02 0.013 23.7 9.2 77 33-112 538-619 (717)
223 PRK12855 hypothetical protein; 23.5 2.3E+02 0.0051 20.2 4.9 46 20-72 28-73 (103)
224 TIGR01803 CM-like chorismate m 23.4 2.3E+02 0.0049 18.7 4.6 16 56-71 3-18 (82)
225 TIGR01843 type_I_hlyD type I s 22.9 4.1E+02 0.0089 21.4 12.5 43 30-72 129-171 (423)
226 KOG0804|consensus 22.8 5.9E+02 0.013 23.2 9.0 22 91-112 373-394 (493)
227 PRK10132 hypothetical protein; 22.7 2.9E+02 0.0064 19.7 6.5 21 87-110 42-62 (108)
228 PF11944 DUF3461: Protein of u 22.6 3.2E+02 0.0068 20.6 5.5 31 82-112 95-125 (125)
229 smart00498 FH2 Formin Homology 22.5 4.9E+02 0.011 22.2 9.1 66 48-114 272-345 (432)
230 PF07716 bZIP_2: Basic region 22.5 1.9E+02 0.0042 17.5 4.8 22 43-64 30-51 (54)
231 PF15466 DUF4635: Domain of un 22.4 3.5E+02 0.0076 20.5 6.9 51 13-64 74-124 (135)
232 PRK09039 hypothetical protein; 22.4 4.7E+02 0.01 21.9 11.5 61 40-106 125-185 (343)
233 KOG0249|consensus 22.2 4E+02 0.0086 25.9 7.2 59 56-114 167-237 (916)
234 PF02403 Seryl_tRNA_N: Seryl-t 22.0 2.6E+02 0.0057 18.9 10.5 70 41-113 32-101 (108)
235 cd00632 Prefoldin_beta Prefold 22.0 2.7E+02 0.0058 19.0 6.3 31 40-70 65-95 (105)
236 PRK05771 V-type ATP synthase s 21.9 2.6E+02 0.0057 25.1 5.9 56 44-101 214-269 (646)
237 PF08656 DASH_Dad3: DASH compl 21.9 2.7E+02 0.0059 19.1 6.8 57 43-105 4-60 (78)
238 TIGR01242 26Sp45 26S proteasom 21.9 3.2E+02 0.0069 22.4 6.0 37 37-73 5-41 (364)
239 PRK09343 prefoldin subunit bet 21.9 3E+02 0.0065 19.6 5.2 34 40-73 80-113 (121)
240 PF04849 HAP1_N: HAP1 N-termin 21.7 5E+02 0.011 22.1 7.5 73 37-111 159-231 (306)
241 PF12805 FUSC-like: FUSC-like 21.7 4.1E+02 0.009 21.1 7.0 67 38-104 86-154 (284)
242 PF15237 PTRF_SDPR: PTRF/SDPR 21.7 4.7E+02 0.01 21.7 9.8 73 33-111 8-83 (246)
243 PF09304 Cortex-I_coil: Cortex 21.7 3.3E+02 0.0071 19.9 10.3 49 42-96 20-68 (107)
244 PF10267 Tmemb_cc2: Predicted 21.6 5.5E+02 0.012 22.5 7.6 31 37-67 211-241 (395)
245 PF01806 Paramyxo_P: Paramyxov 21.6 4.7E+02 0.01 21.6 8.1 49 27-75 48-96 (248)
246 PRK05724 acetyl-CoA carboxylas 21.5 1.9E+02 0.0041 24.6 4.6 39 34-73 7-49 (319)
247 PF10224 DUF2205: Predicted co 21.5 2.8E+02 0.0061 19.0 5.9 48 41-91 19-66 (80)
248 PF03955 Adeno_PIX: Adenovirus 21.4 1.2E+02 0.0026 22.2 3.0 34 31-71 76-109 (109)
249 PF07061 Swi5: Swi5; InterPro 21.4 2.8E+02 0.006 18.9 5.5 10 81-90 38-47 (83)
250 PRK01156 chromosome segregatio 21.4 6.1E+02 0.013 23.5 8.2 79 36-116 573-666 (895)
251 PF07568 HisKA_2: Histidine ki 21.3 2.5E+02 0.0053 18.3 5.2 37 55-91 34-71 (76)
252 PRK05771 V-type ATP synthase s 21.2 3.3E+02 0.0072 24.5 6.4 36 39-74 94-129 (646)
253 PF06698 DUF1192: Protein of u 21.2 1.4E+02 0.0031 19.4 3.1 20 53-72 22-41 (59)
254 PF07083 DUF1351: Protein of u 21.2 4.1E+02 0.0088 20.8 6.7 49 36-90 146-194 (215)
255 PF03148 Tektin: Tektin family 21.1 2.5E+02 0.0053 23.9 5.3 33 41-73 46-78 (384)
256 TIGR03761 ICE_PFL4669 integrat 21.1 2.8E+02 0.006 22.3 5.3 52 40-94 62-139 (216)
257 TIGR03513 GldL_gliding gliding 21.0 4.4E+02 0.0096 21.1 9.3 48 49-105 127-174 (202)
258 KOG4367|consensus 20.9 6.8E+02 0.015 23.2 8.2 35 57-97 313-347 (699)
259 PTZ00454 26S protease regulato 20.9 5.4E+02 0.012 22.0 7.7 38 36-73 27-64 (398)
260 PF03670 UPF0184: Uncharacteri 20.9 2.6E+02 0.0057 19.5 4.5 14 58-71 32-45 (83)
261 PF04375 HemX: HemX; InterPro 20.8 5.2E+02 0.011 21.8 11.8 24 89-112 96-119 (372)
262 cd07638 BAR_ACAP2 The Bin/Amph 20.7 4.3E+02 0.0093 20.8 11.1 78 26-112 40-127 (200)
263 PF10046 BLOC1_2: Biogenesis o 20.7 2.9E+02 0.0064 18.9 10.3 25 81-105 61-85 (99)
264 PRK06285 chorismate mutase; Pr 20.7 2.9E+02 0.0062 18.8 5.0 17 55-71 10-26 (96)
265 PF00458 WHEP-TRS: WHEP-TRS do 20.6 2.1E+02 0.0046 18.2 3.8 38 58-95 2-39 (56)
266 PRK13182 racA polar chromosome 20.6 4E+02 0.0087 20.5 9.1 54 52-113 92-145 (175)
267 PF08181 DegQ: DegQ (SacQ) fam 20.5 2.3E+02 0.005 17.7 4.3 30 36-65 9-38 (46)
268 PF10392 COG5: Golgi transport 20.4 3.3E+02 0.0072 19.4 8.1 44 53-98 34-77 (132)
269 TIGR02338 gimC_beta prefoldin, 20.3 3E+02 0.0066 19.0 5.4 36 37-72 2-37 (110)
270 PF07926 TPR_MLP1_2: TPR/MLP1/ 20.3 3.3E+02 0.0072 19.4 6.5 72 40-111 12-84 (132)
271 PLN02678 seryl-tRNA synthetase 20.3 6E+02 0.013 22.4 10.7 72 38-112 33-104 (448)
272 PF12495 Vip3A_N: Vegetative i 20.2 4.1E+02 0.0089 20.5 7.6 71 32-103 99-173 (177)
273 PRK06798 fliD flagellar cappin 20.2 5.8E+02 0.013 22.2 8.3 19 55-73 382-400 (440)
274 PF06298 PsbY: Photosystem II 20.2 1.6E+02 0.0034 17.7 2.9 18 28-45 14-33 (36)
275 PRK07122 RNA polymerase sigma 20.2 97 0.0021 24.6 2.6 24 21-45 64-87 (264)
276 PRK13689 hypothetical protein; 20.1 1.9E+02 0.0041 20.0 3.6 27 58-84 9-35 (75)
No 1
>PF14712 Snapin_Pallidin: Snapin/Pallidin
Probab=99.90 E-value=2.2e-23 Score=142.08 Aligned_cols=90 Identities=34% Similarity=0.521 Sum_probs=84.3
Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH-HhcC-CCChhHHHHHHHHhhHHHHHHHHHH
Q psy460 25 QDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA-EQQT-AVPLDSYTRKLVDAQQKISIVGNIL 102 (117)
Q Consensus 25 rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e-~q~~-p~dLd~yvkKL~~ak~RV~~vnniL 102 (117)
|+.||+||+++|+|.|++++++|++++.||.+|.++|+++.++|+++++ .+.. |+++++|++||.++|+||..+|+++
T Consensus 1 ~~~La~Gl~~~l~P~l~~~~~~l~el~~sQ~~L~~~i~~~~~~L~~~~~~~~~~~~~~~~~y~~KL~~ikkrm~~l~~~l 80 (92)
T PF14712_consen 1 REALAEGLLSLLEPDLDRLDQQLQELRQSQEELLQQIDRLNEKLKELNEVEQINEPFDLDPYVKKLVNIKKRMSNLHERL 80 (92)
T ss_pred ChHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHhhHHHHHHHHHHHHHHHHHHHH
Confidence 5789999999999999999999999999999999999999999999998 4334 6788999999999999999999999
Q ss_pred HHHHHHHHhhhh
Q psy460 103 QTTQVAMHKKFF 114 (117)
Q Consensus 103 q~~Q~RL~rl~~ 114 (117)
+.+++|+.+|.+
T Consensus 81 ~~lk~R~~~L~q 92 (92)
T PF14712_consen 81 QKLKKRADKLQQ 92 (92)
T ss_pred HHHHHHHHhhcC
Confidence 999999999863
No 2
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=92.15 E-value=3.7 Score=30.44 Aligned_cols=73 Identities=11% Similarity=0.211 Sum_probs=52.2
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 35 LIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 35 lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
-...+-++||.=-.+.+...++|.++||.|...|++..|.+ ..--.....++..|..++..++.+|..+..|+
T Consensus 44 A~~~v~kql~~vs~~l~~tKkhLsqRId~vd~klDe~~ei~------~~i~~eV~~v~~dv~~i~~dv~~v~~~V~~Le 116 (126)
T PF07889_consen 44 AVASVSKQLEQVSESLSSTKKHLSQRIDRVDDKLDEQKEIS------KQIKDEVTEVREDVSQIGDDVDSVQQMVEGLE 116 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHH------HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Confidence 34455567777667777788899999999999999998863 33344556666777777777777766665554
No 3
>PF05531 NPV_P10: Nucleopolyhedrovirus P10 protein; InterPro: IPR008702 This family consists of several nucleopolyhedrovirus P10 proteins which are thought to be involved in the morphogenesis of the polyhedra [].; GO: 0019028 viral capsid
Probab=89.57 E-value=2.8 Score=28.76 Aligned_cols=64 Identities=17% Similarity=0.321 Sum_probs=41.3
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy460 30 EGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAM 109 (117)
Q Consensus 30 ~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL 109 (117)
.-|+.+++-.|+.+|.||-+.+. +++.|.+ .-+++..--+||-..-.++..+++.+.++|+-|
T Consensus 3 ~NILl~Ir~dIk~vd~KVdaLq~-------~V~~l~~----------~~~~v~~l~~klDa~~~~l~~l~~~V~~I~~iL 65 (75)
T PF05531_consen 3 QNILLVIRQDIKAVDDKVDALQT-------QVDDLES----------NLPDVTELNKKLDAQSAQLTTLNTKVNEIQDIL 65 (75)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHH-------HHHHHHh----------cCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34777888888888888764443 4444433 234555556677777777777777777777655
Q ss_pred H
Q psy460 110 H 110 (117)
Q Consensus 110 ~ 110 (117)
.
T Consensus 66 ~ 66 (75)
T PF05531_consen 66 N 66 (75)
T ss_pred C
Confidence 3
No 4
>PHA03386 P10 fibrous body protein; Provisional
Probab=88.87 E-value=3 Score=29.89 Aligned_cols=28 Identities=25% Similarity=0.185 Sum_probs=23.4
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRL 59 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~ 59 (117)
|+.++.-+|+.+|.||-+.+.+=..|++
T Consensus 6 ILl~Ir~dIkavd~KVdaLQ~qV~dv~~ 33 (94)
T PHA03386 6 VLTQILDAVQEVDTKVDALQTQLNGLEE 33 (94)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHh
Confidence 6788899999999999998886666664
No 5
>KOG2911|consensus
Probab=88.86 E-value=3 Score=36.93 Aligned_cols=75 Identities=15% Similarity=0.144 Sum_probs=52.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCC-----CCh-hHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTA-----VPL-DSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p-----~dL-d~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
-|.+.|..|...++++..|..||+.|.+|+.+..+.--+. =-+ --|.++.--+-+++......|.+++.=|++|
T Consensus 227 ~it~~D~~V~~L~~~~~~L~kqie~L~qeie~~~~~~r~~~k~g~K~iA~~ylr~rk~~eK~~er~~~~l~~l~~vl~~I 306 (439)
T KOG2911|consen 227 IITEIDGSVADLIQARAKLAKQIEFLEQEIEKSKEKLRQALKEGKKQIAITYLRARKLLEKDLERKVSSLNNLETVLSQI 306 (439)
T ss_pred CCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Confidence 3789999999999999999999999999999998852111 111 1344444444445555556666666666665
Q ss_pred h
Q psy460 113 F 113 (117)
Q Consensus 113 ~ 113 (117)
+
T Consensus 307 d 307 (439)
T KOG2911|consen 307 D 307 (439)
T ss_pred H
Confidence 4
No 6
>PRK11637 AmiB activator; Provisional
Probab=88.45 E-value=14 Score=31.34 Aligned_cols=23 Identities=17% Similarity=0.346 Sum_probs=8.7
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHH
Q psy460 86 RKLVDAQQKISIVGNILQTTQVA 108 (117)
Q Consensus 86 kKL~~ak~RV~~vnniLq~~Q~R 108 (117)
++|....+.+..++.-+...+..
T Consensus 89 ~~i~~~~~~i~~~~~ei~~l~~e 111 (428)
T PRK11637 89 RKLRETQNTLNQLNKQIDELNAS 111 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333
No 7
>PRK11637 AmiB activator; Provisional
Probab=85.71 E-value=12 Score=31.72 Aligned_cols=66 Identities=11% Similarity=0.104 Sum_probs=31.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 42 LLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 42 ~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
++..++...+..-.++...|..+..++..+... +...-++|....+++..+..-+..++.++.++.
T Consensus 44 ~~~~~l~~l~~qi~~~~~~i~~~~~~~~~~~~~------l~~l~~qi~~~~~~i~~~~~~i~~~~~ei~~l~ 109 (428)
T PRK11637 44 DNRDQLKSIQQDIAAKEKSVRQQQQQRASLLAQ------LKKQEEAISQASRKLRETQNTLNQLNKQIDELN 109 (428)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444444444444444444444332 444455555555555555555555555555443
No 8
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=84.67 E-value=4.2 Score=30.60 Aligned_cols=55 Identities=22% Similarity=0.336 Sum_probs=41.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHH
Q psy460 48 RATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMH 110 (117)
Q Consensus 48 ~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~ 110 (117)
...-.-|.+|+.+|.+|.+|+..++. |+ ++..|+|. +|++.++..-|+..++.+.
T Consensus 36 ~~~~~~~~~l~~Ei~~l~~E~~~iS~-qD---eFAkwaKl----~Rk~~kl~~el~~~~~~~~ 90 (161)
T PF04420_consen 36 SKSSKEQRQLRKEILQLKRELNAISA-QD---EFAKWAKL----NRKLDKLEEELEKLNKSLS 90 (161)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHTTS-T-TT---SHHHHHHH----HHHHHHHHHHHHHHHHHHH
T ss_pred ccccHHHHHHHHHHHHHHHHHHcCCc-HH---HHHHHHHH----HHHHHHHHHHHHHHHHHHH
Confidence 34456678899999999999999965 33 68888873 8888888888887766554
No 9
>PHA03395 p10 fibrous body protein; Provisional
Probab=84.05 E-value=7.9 Score=27.36 Aligned_cols=63 Identities=13% Similarity=0.263 Sum_probs=37.5
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy460 30 EGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAM 109 (117)
Q Consensus 30 ~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL 109 (117)
..|+-+++-.|+.+|.||-+.+. +++.+.+ .-+|+..--.||..--..+..+++.+.++++-|
T Consensus 3 qNILl~Ir~dIkavd~KVdalQ~-------~V~~l~~----------nlpdv~~l~~kLdaq~~~Ltti~tkv~~I~diL 65 (87)
T PHA03395 3 QNILLLIRQDIKAVSDKVDALQA-------AVDDVRA----------NLPDVTEINEKLDAQSASLDTISSAVDNITDIL 65 (87)
T ss_pred chHHHHHHHHHHHHhhHHHHHHH-------HHHHHHh----------cCCcHHHHHHHHHhHHHHHHHHHHHHHHHHHcc
Confidence 34778889999999999887665 3333321 333444444444444445566666666666543
No 10
>COG3883 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=83.34 E-value=8.3 Score=32.00 Aligned_cols=36 Identities=17% Similarity=0.339 Sum_probs=26.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
-=|+.||++|..+...+.++...|+++.++++++..
T Consensus 52 ~ei~~L~~qi~~~~~k~~~~~~~i~~~~~eik~l~~ 87 (265)
T COG3883 52 NEIESLDNQIEEIQSKIDELQKEIDQSKAEIKKLQK 87 (265)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 346777777888888777777777777777776654
No 11
>PF10458 Val_tRNA-synt_C: Valyl tRNA synthetase tRNA binding arm; InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=83.20 E-value=9.8 Score=24.48 Aligned_cols=63 Identities=13% Similarity=0.211 Sum_probs=44.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCCh-hHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPL-DSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL-d~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
++.-+.-...-...+..+|+++..-|..=++...+|.++ +..-.||.++...+..+...|...
T Consensus 2 ~~~E~~rL~Kel~kl~~~i~~~~~kL~n~~F~~kAP~eVve~er~kl~~~~~~~~~l~~~l~~L 65 (66)
T PF10458_consen 2 VEAEIERLEKELEKLEKEIERLEKKLSNENFVEKAPEEVVEKEREKLEELEEELEKLEEALEQL 65 (66)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCSTTHHHHS-CCHHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 455555666666777778888888887655555689987 677888888888888777776654
No 12
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=83.07 E-value=16 Score=32.42 Aligned_cols=62 Identities=13% Similarity=0.260 Sum_probs=39.9
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVG 99 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vn 99 (117)
-++.+..-++++..++......|.++...+..|..+=.+-.+. |..|-.+|.+.||+|...|
T Consensus 373 ~yS~i~~~l~~~~~~l~~ie~~q~~~~~~l~~L~~dE~~Ar~~------l~~~~~~l~~ikR~lek~n 434 (560)
T PF06160_consen 373 PYSEIQEELEEIEEQLEEIEEEQEEINESLQSLRKDEKEAREK------LQKLKQKLREIKRRLEKSN 434 (560)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHcC
Confidence 4556666666677777777777777777777776655555553 6666666666666665544
No 13
>PF04100 Vps53_N: Vps53-like, N-terminal ; InterPro: IPR007234 Vps53 complexes with Vps52 and Vps54 to form a multi-subunit complex involved in regulating membrane trafficking events [].
Probab=82.93 E-value=15 Score=31.37 Aligned_cols=78 Identities=18% Similarity=0.273 Sum_probs=57.1
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHhhHHHh----cCCCChhHHHHHHHHhhHHHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQL-----------ELRLQIEALTSQLDKISAEQ----QTAVPLDSYTRKLVDAQQKIS 96 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~-----------eL~~qId~L~~eL~~l~e~q----~~p~dLd~yvkKL~~ak~RV~ 96 (117)
+..-+.--+.++|+.+..+-..|. +.+..|..|-+.+..+.+-. ...-++=.=+++|..||+.++
T Consensus 26 ~i~~l~~~i~~ld~eI~~~v~~q~~~~~~~~~~l~~a~~~i~~L~~~i~~ik~kA~~sE~~V~~it~dIk~LD~AKrNLT 105 (383)
T PF04100_consen 26 LIAKLRKEIRELDEEIKELVREQSSSGQDAEEDLEEAQEAIQELFEKISEIKSKAEESEQMVQEITRDIKQLDNAKRNLT 105 (383)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555666677888888887777775 55666777777666665321 224466678999999999999
Q ss_pred HHHHHHHHHHHHH
Q psy460 97 IVGNILQTTQVAM 109 (117)
Q Consensus 97 ~vnniLq~~Q~RL 109 (117)
...++|++.|==+
T Consensus 106 ~SIT~LkrL~MLv 118 (383)
T PF04100_consen 106 QSITTLKRLQMLV 118 (383)
T ss_pred HHHHHHHHHHHHH
Confidence 9999999887433
No 14
>PRK14011 prefoldin subunit alpha; Provisional
Probab=82.02 E-value=9.9 Score=28.58 Aligned_cols=47 Identities=15% Similarity=0.298 Sum_probs=41.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKL 88 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL 88 (117)
+.-+++-|+.++.....+..+|...|+.+..+++++... |.+.++.+
T Consensus 86 ~~eA~~~~~~ri~~l~~~~~~l~~~i~~~~~~~~~l~~~------L~~k~~~~ 132 (144)
T PRK14011 86 VSEVIEDFKKSVEELDKTKKEGNKKIEELNKEITKLRKE------LEKRAQAI 132 (144)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHH
Confidence 566899999999999999999999999999999999875 66666654
No 15
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=81.64 E-value=7.5 Score=31.35 Aligned_cols=34 Identities=21% Similarity=0.225 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
|.+|+..+..-.++|.+|..||+.|..|+.+|..
T Consensus 42 ~~~le~~~~~~~~~~~~l~~ql~~lq~ev~~LrG 75 (263)
T PRK10803 42 VTQLERISNAHSQLLTQLQQQLSDNQSDIDSLRG 75 (263)
T ss_pred HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhh
Confidence 4566666666667777777777777777777764
No 16
>PF10158 LOH1CR12: Tumour suppressor protein; InterPro: IPR018780 This entry represents a region of 130 amino acids that is the most conserved part of some hypothetical proteins involved in loss of heterozygosity, and thus, tumour suppression []. The exact function of these proteins is not known.
Probab=81.42 E-value=19 Score=26.64 Aligned_cols=88 Identities=16% Similarity=0.142 Sum_probs=66.4
Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH----HhcCCCChhHHHHHHHHhhHHHHHHHHHHHH
Q psy460 29 AEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA----EQQTAVPLDSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 29 a~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e----~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~ 104 (117)
..|++.|-.=.-+-|..-+.++-.-|..|...|-+...++.++.. .|..--=+..++.|+......+..++.+|..
T Consensus 26 s~~~l~Lc~R~Q~HL~~cA~~Va~~Q~~L~~riKevd~~~~~l~~~~~erqk~~~k~ae~L~kv~els~~L~~~~~lL~~ 105 (131)
T PF10158_consen 26 SRPVLRLCSRYQEHLNQCAEAVAFDQNALAKRIKEVDQEIAKLLQQMVERQKRFAKFAEQLEKVNELSQQLSRCQSLLNQ 105 (131)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345667766667777777888888999999998888777766654 2333334567778888888888889999998
Q ss_pred HHHHHHhhhhhc
Q psy460 105 TQVAMHKKFFMG 116 (117)
Q Consensus 105 ~Q~RL~rl~~~~ 116 (117)
+-.-+++|+-++
T Consensus 106 ~v~~ie~LN~~L 117 (131)
T PF10158_consen 106 TVPSIETLNEIL 117 (131)
T ss_pred HHHHHHHHHhhC
Confidence 888888888765
No 17
>PF05531 NPV_P10: Nucleopolyhedrovirus P10 protein; InterPro: IPR008702 This family consists of several nucleopolyhedrovirus P10 proteins which are thought to be involved in the morphogenesis of the polyhedra [].; GO: 0019028 viral capsid
Probab=81.22 E-value=15 Score=25.21 Aligned_cols=35 Identities=20% Similarity=0.348 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhhHH
Q psy460 39 TVDLLDEKFRATRASQLE---LRLQIEALTSQLDKISA 73 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~e---L~~qId~L~~eL~~l~e 73 (117)
=|+.|..+|..++.+... |...+|.++..|..++.
T Consensus 19 KVdaLq~~V~~l~~~~~~v~~l~~klDa~~~~l~~l~~ 56 (75)
T PF05531_consen 19 KVDALQTQVDDLESNLPDVTELNKKLDAQSAQLTTLNT 56 (75)
T ss_pred HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHH
Confidence 356677777777776655 77777777777777654
No 18
>PF03357 Snf7: Snf7; InterPro: IPR005024 This is a family of eukaryotic proteins which are variously described as either hypothetical protein, developmental protein or related to yeast SNF7. The family contains human CHMP1. CHMP1 (CHromatin Modifying Protein; CHarged Multivesicular body Protein), is encoded by an alternative open reading frame in the PRSM1 gene [] and is conserved in both complex and simple eukaryotes. CHMP1 contains a predicted bipartite nuclear localisation signal and distributes as distinct forms to the cytoplasm and the nuclear matrix in all cell lines tested. Human CHMP1 is strongly implicated in multivesicular body formation. A multivesicular body is a vesicle-filled endosome that targets proteins to the interior of lysosomes. Immunocytochemistry and biochemical fractionation localise CHMP1 to early endosomes and CHMP1 physically interacts with SKD1/VPS4, a highly conserved protein directly linked to multivesicular body sorting in yeast. Similar to the action of a mutant SKD1 protein, over expression of a fusion derivative of human CHMP1 dilates endosomal compartments and disrupts the normal distribution of several endosomal markers. Genetic studies in Saccharomyces cerevisiae (Baker's yeast) further support a conserved role of CHMP1 in vesicle trafficking. Deletion of CHM1, the budding yeast homologue of CHMP1, results in defective sorting of carboxypeptidases S and Y and produces abnormal, multi-lamellar prevacuolar compartments. This phenotype classifies CHM1 as a member of the class E vacuolar protein sorting genes []. ; GO: 0015031 protein transport; PDB: 2V6X_B 2W2U_D 2GD5_D 3FRT_B 3FRV_A 4ABM_D 3EAB_H 3HTU_D.
Probab=79.90 E-value=18 Score=25.93 Aligned_cols=73 Identities=12% Similarity=0.097 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCCh-hHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPL-DSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL-d~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
|.+|.............|...|+.+..+++++... ..-.. --|.++....++.+..+.+.+.+++.-+.+|+.
T Consensus 3 i~~Lk~~~~~L~~~~~~le~~i~~~~~~~k~~~~~--~~~~~A~~~lk~~k~~~k~~~~~~~~~~~l~~~~~~ie~ 76 (171)
T PF03357_consen 3 ILKLKKTIRRLEKQIKRLEKKIKKLEKKAKKAIKK--GNKERAKIYLKRKKRLEKQLEKLLNQLSNLESVLLQIET 76 (171)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHCT--T-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc--CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455555555555556666677766666666542 22223 478888888888888888887777776666654
No 19
>PF01519 DUF16: Protein of unknown function DUF16; InterPro: IPR002862 Proteins that contain this domain are of unknown function. It appears to be confined to proteins from Mycoplasma pneumoniae [].; PDB: 2BA2_C.
Probab=79.72 E-value=21 Score=25.92 Aligned_cols=24 Identities=13% Similarity=0.167 Sum_probs=19.1
Q ss_pred HhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 90 DAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 90 ~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
..-..+-.+.+.|+.+-.||++|.
T Consensus 78 ~qgktL~~I~~~L~~inkRLD~~E 101 (102)
T PF01519_consen 78 AQGKTLQLILKTLQSINKRLDKME 101 (102)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHHHHHHHhhcc
Confidence 344677888999999999998874
No 20
>PF11382 DUF3186: Protein of unknown function (DUF3186); InterPro: IPR021522 This bacterial family of proteins has no known function.
Probab=78.95 E-value=8 Score=31.94 Aligned_cols=47 Identities=30% Similarity=0.568 Sum_probs=37.5
Q ss_pred HHHHHHH---Hhhh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 27 TIAEGLL---GLIK-PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 27 ~La~GL~---~lL~-P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+|+-||+ +++. |.++.|+.++...|....+|+.+++++..++.....
T Consensus 17 ALavGI~lG~~~l~~~l~~~l~~~~~~lr~e~~~l~~~~~~~~~~~~~~d~ 67 (308)
T PF11382_consen 17 ALAVGIVLGSGPLQPNLIDSLEDQFDSLREENDELRAELDALQAQLNAADQ 67 (308)
T ss_pred HHHHHHHhcchhhchhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566654 4454 578999999999999999999999999988866544
No 21
>PF15027 DUF4525: Domain of unknown function (DUF4525)
Probab=78.77 E-value=6.7 Score=29.80 Aligned_cols=58 Identities=22% Similarity=0.281 Sum_probs=40.8
Q ss_pred HHHHHHHHHHHHHHH-HHHhhHHHhcCCCChhHHHHHHHHh---hHHHHHHHHHHHHHHHHHHhhh
Q psy460 52 ASQLELRLQIEALTS-QLDKISAEQQTAVPLDSYTRKLVDA---QQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~-eL~~l~e~q~~p~dLd~yvkKL~~a---k~RV~~vnniLq~~Q~RL~rl~ 113 (117)
+|=.+||+||-.|+. +.+-+.|++...+|- ||..-+..+ |+-+.+ +|+.+-.||.+|.
T Consensus 38 ~ss~~LR~QILdLSkrYVKaLAeEn~~~~dg-p~~~smagYaDLKktiAV---LLddiLqRl~kLE 99 (138)
T PF15027_consen 38 QSSAELREQILDLSKRYVKALAEENKNVVDG-PYGASMAGYADLKKTIAV---LLDDILQRLVKLE 99 (138)
T ss_pred cChHHHHHHHHHHHHHHHHHHHHHcCCCCCC-CcchhhHHHHHHhhhhhh---hhhHHHHHHHHHH
Confidence 688999999999975 677888877666553 676666655 444433 4666667777664
No 22
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=77.51 E-value=17 Score=23.64 Aligned_cols=34 Identities=6% Similarity=0.248 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+.+++..+..++....+++..|+.+.+-.+++-.
T Consensus 9 ~~~~~~~i~tvk~en~~i~~~ve~i~envk~ll~ 42 (55)
T PF05377_consen 9 LPRIESSINTVKKENEEISESVEKIEENVKDLLS 42 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555556666555666666665555555544
No 23
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=77.21 E-value=28 Score=30.74 Aligned_cols=67 Identities=19% Similarity=0.340 Sum_probs=53.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVA 108 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~R 108 (117)
..--|.+++.++...+..+.+|..+|-++.+++.++... |-.....|...+++|...+.-|...+..
T Consensus 43 ~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~i~~~~~q------l~~s~~~l~~~~~~I~~~~~~l~~l~~q 109 (420)
T COG4942 43 IQKEIAALEKKIREQQDQRAKLEKQLKSLETEIASLEAQ------LIETADDLKKLRKQIADLNARLNALEVQ 109 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHhHHHHHHhhHHHHHHHHHHHHHH
Confidence 344577788888888888999999999999999999775 7777888888888888888877766543
No 24
>COG0255 RpmC Ribosomal protein L29 [Translation, ribosomal structure and biogenesis]
Probab=75.61 E-value=9.9 Score=25.54 Aligned_cols=54 Identities=17% Similarity=0.265 Sum_probs=42.6
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQV 107 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~ 107 (117)
.|..+|...+.+|-.||=++.. |.+.=.++ --.++-.+|+.|.-|+.++..-..
T Consensus 11 ~s~eeL~~~l~eLK~ELf~LR~-q~a~g~l~-n~~~ir~vRr~IARi~Tv~~E~~~ 64 (69)
T COG0255 11 KSVEELEEELRELKKELFNLRF-QLATGQLE-NPHRIREVRRDIARILTVLREKEL 64 (69)
T ss_pred CCHHHHHHHHHHHHHHHHHHHH-HHHhCCCC-CcHHHHHHHHHHHHHHHHHHHHHH
Confidence 5788899999999999998887 44444444 556888999999999999986543
No 25
>PF05837 CENP-H: Centromere protein H (CENP-H); InterPro: IPR008426 Chromosome segregation in eukaryotes requires the kinetochore, a multi-protein structure that assembles on centromeric DNA, and which acts to link chromosomes to spindle microtubules. Kinetochore structure and composition is highly conserved among vertebrates. The inner kinetochore is essential for kinetochore assembly, and is involved in chromosome segregation via regulation of the spindle. Inner kinetochore components include the multi-subunit CENP-H/I complex, which may function, in part, in directing centromere protein A (CENP-A) deposition to centromeres, where CENP-A is a centromere-specific histone H3 variant required for the organisation of centromeric chromatin during interphase. The CENP-H/I complex contains three functional classes of proteins [, ]: CENP-H class (includes CENP-H, -I, -K, -L) CENP-M class (includes CENP-M) CENP-O class (includes CENP-O, -P, -Q, -R, -50) CENP-H is required for the localisation of CENP-C, but not CENP-A, to the centromere. However, it may be involved in the incorporation of newly synthesised CENP-A into centromeres via its interaction with the CENP-A/CENP-HI complex. CENP-H contains a coiled-coil structure and a nuclear localisation signal. CENP-H is specifically and constitutively localised in kinetochores throughout the cell cycle, and may play a role in kinetochore organisation and function throughout the cell cycle []. Studies show that CENP-H may be associated with certain human cancers [, ]. This entry also includes Kinetochore protein Fta3 which is a subunit of the Sim4 complex. This complex is required for loading the DASH complex onto the kinetochore via interaction with dad1. Fta2, Fta3 and Fta4 associate with the central core and inner repeat region of the centromere [].; GO: 0043515 kinetochore binding, 0007059 chromosome segregation, 0051301 cell division, 0000777 condensed chromosome kinetochore, 0005634 nucleus
Probab=74.30 E-value=28 Score=24.55 Aligned_cols=69 Identities=19% Similarity=0.347 Sum_probs=49.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCC-------hhHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVP-------LDSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~d-------Ld~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
.-....+..++..++..=.++..+-.+|..++..+.+.....-. +...-+++-.+|+|...+=||++++
T Consensus 9 ~~~~~~l~~~L~~v~~~~l~l~~~n~el~~el~~l~~~~~~~~~~~~~~~~l~~~~~~lk~~r~~~~v~k~v~q~l 84 (106)
T PF05837_consen 9 QQESRSLQEKLSDVEKKRLRLKRRNQELAQELLELAEKQKSQREDEELSEKLEKLEKELKKSRQRWRVMKNVFQAL 84 (106)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556677777777777777888888888888877765544433 3445566778889999998888875
No 26
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=73.85 E-value=36 Score=28.66 Aligned_cols=79 Identities=10% Similarity=0.177 Sum_probs=55.9
Q ss_pred hhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhhHHHhc---CCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHh
Q psy460 36 IKPTVDLLDEKFRATRASQ-LELRLQIEALTSQLDKISAEQQ---TAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHK 111 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ-~eL~~qId~L~~eL~~l~e~q~---~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~r 111 (117)
|-|+|..|..++......+ ..+...+..|..++.+|.+... ...+...-|..|-+.=.++..+..+|=.+-+||.-
T Consensus 244 l~~~l~~L~~~lslL~~~~Ld~i~~rl~~L~~~~~~l~~~~~~~~~~~~~e~KI~eLy~~l~~~~~~~~~lP~lv~RL~t 323 (388)
T PF04912_consen 244 LLPALNELERQLSLLDPAKLDSIERRLKSLLSELEELAEKRKEAKEDAEQESKIDELYEILPRWDPYAPSLPSLVERLKT 323 (388)
T ss_pred HHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHhccccccccccchhHHHHHHHHHHHHHHHhhhhhHHHHHHHH
Confidence 5578888888888775554 4556666777777777765422 23455677777777778888888888888888877
Q ss_pred hhh
Q psy460 112 KFF 114 (117)
Q Consensus 112 l~~ 114 (117)
|..
T Consensus 324 L~~ 326 (388)
T PF04912_consen 324 LKS 326 (388)
T ss_pred HHH
Confidence 654
No 27
>PF10392 COG5: Golgi transport complex subunit 5; InterPro: IPR019465 The conserved oligomeric Golgi (COG) complex is a peripheral membrane complex involved in intra-Golgi protein trafficking. Subunit 5 is located in the smaller, B lobe, together with subunits 6-8, and has been shown to bind subunits 1 and 7 [].
Probab=73.85 E-value=23 Score=25.52 Aligned_cols=45 Identities=20% Similarity=0.516 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHHHHhhHHHhcCCCChhHH--HHHHHHhhHHHHHHHHHHHHH
Q psy460 56 ELRLQIEALTSQLDKISAEQQTAVPLDSY--TRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 56 eL~~qId~L~~eL~~l~e~q~~p~dLd~y--vkKL~~ak~RV~~vnniLq~~ 105 (117)
.++.+|+.|..-.+||... . ++|| ++++..+=+|+-....+|+.+
T Consensus 76 ~v~~~v~~L~~s~~RL~~e---V--~~Py~~~~~~~~~L~rl~~t~~LLR~~ 122 (132)
T PF10392_consen 76 AVRSSVESLQSSYERLRSE---V--IEPYEKIQKLTSQLERLHQTSDLLRSV 122 (132)
T ss_pred HHHHHHHHHHHHHHHHHHH---H--hhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555555555543 1 2344 334444444444444444443
No 28
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=73.31 E-value=58 Score=28.31 Aligned_cols=79 Identities=9% Similarity=0.007 Sum_probs=52.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhc------------CCCChhH-------HHHHHHHhhHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQ------------TAVPLDS-------YTRKLVDAQQKIS 96 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~------------~p~dLd~-------yvkKL~~ak~RV~ 96 (117)
-.|.|.+|+++++..+.-..+++.+++.+...++-+..... ...++.. |-..+...+.+..
T Consensus 69 ~~~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 148 (525)
T TIGR02231 69 DPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAKRNEPDLKEWFQAFDFNGSEIERLLTEDR 148 (525)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccCCCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56789999999999999999999999999887766554321 1224333 3345555555555
Q ss_pred HHHHHHHHHHHHHHhhhh
Q psy460 97 IVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 97 ~vnniLq~~Q~RL~rl~~ 114 (117)
.+..-+...++++.++..
T Consensus 149 ~~~~~~~~~~~~l~~l~~ 166 (525)
T TIGR02231 149 EAERRIRELEKQLSELQN 166 (525)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 666666666666655543
No 29
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=73.29 E-value=54 Score=27.43 Aligned_cols=76 Identities=13% Similarity=0.202 Sum_probs=36.0
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHH
Q psy460 31 GLLGLIKPTVDLLDEKFRATRASQLELRL----QIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQ 106 (117)
Q Consensus 31 GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~----qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q 106 (117)
+++.-+.---..|...+...++.+.++.. .++++-++|..... ++....+++.+.+..+..++..+....
T Consensus 172 ~~~~~l~~~~~~L~~e~~~L~~~~~e~~~~d~~eL~~lk~~l~~~~~------ei~~~~~~l~e~~~~l~~l~~~I~~~~ 245 (312)
T smart00787 172 SIKPKLRDRKDALEEELRQLKQLEDELEDCDPTELDRAKEKLKKLLQ------EIMIKVKKLEELEEELQELESKIEDLT 245 (312)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHhCCHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333444444444444444422 33344333333333 255666666666666666666665555
Q ss_pred HHHHhh
Q psy460 107 VAMHKK 112 (117)
Q Consensus 107 ~RL~rl 112 (117)
+....+
T Consensus 246 ~~k~e~ 251 (312)
T smart00787 246 NKKSEL 251 (312)
T ss_pred HHHHHH
Confidence 554443
No 30
>PF12998 ING: Inhibitor of growth proteins N-terminal histone-binding; InterPro: IPR024610 Histones undergo numerous post-translational modifications, including acetylation and methylation, at residues which are then probable docking sites for various chromatin remodelling complexes. Inhibitor of growth proteins (INGs) specifically bind to residues that have been thus modified. INGs carry a well-characterised C-terminal PHD-type zinc-finger domain, binding with lysine 4-tri-methylated histone H3 (H3K4me3), as well as this N-terminal domain that binds unmodified H3 tails. Although these two regions can bind histones independently, together they increase the apparent association of the ING for the H3 tail. This entry represents the N-terminal histone binding domain found in inhibitor proteins.; PDB: 4AFL_A.
Probab=72.97 E-value=15 Score=24.53 Aligned_cols=51 Identities=16% Similarity=0.191 Sum_probs=42.3
Q ss_pred CCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy460 21 DNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 21 ~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l 71 (117)
+++.+..+..-|...+.=++.-.|+||+-+.+...-+..+|.+|.++|.+|
T Consensus 55 ~~~~~~~~~~~I~~~~~~~~~l~deKv~lA~~~~d~v~~hi~rLD~dl~~f 105 (105)
T PF12998_consen 55 SPEKRRELLKEIQEEYERALELSDEKVALAQQAYDLVDRHIRRLDQDLKKF 105 (105)
T ss_dssp -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCT---
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 334667788889999999999999999999999999999999999887764
No 31
>PF02646 RmuC: RmuC family; InterPro: IPR003798 This protein contains several bacterial RmuC DNA recombination proteins. The function of the RMUC protein is unknown but it is suspected that it is either a structural protein that protects DNA against nuclease action, or is itself involved in DNA cleavage at the regions of DNA secondary structures []. Proteins in this family are predicted to contain a central endonuclease-like fold domain, surrounded by coiled coils, consistent with a direct role in DNA cleavage [, ].
Probab=72.35 E-value=34 Score=28.06 Aligned_cols=67 Identities=19% Similarity=0.228 Sum_probs=41.0
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy460 31 GLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQV 107 (117)
Q Consensus 31 GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~ 107 (117)
.|-.+++|+=++|+.--+.+..+......+...|.++|+.+.+.+ +++......+..+.++|.+.+.
T Consensus 3 ~l~~l~~pl~e~l~~~~~~l~~~~~~~~~~~~~L~~~l~~l~~~~----------~~~~~l~~~~~~L~~aL~~~k~ 69 (304)
T PF02646_consen 3 QLEQLLKPLKEQLEKFEKRLEESFEQRSEEFGSLKEQLKQLSEAN----------GEIQQLSQEASNLTSALKNSKT 69 (304)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh----------hHHHHHHHHHHHHHHHHhCCCc
Confidence 466788888777775555555666666667777777777666642 2233444556666666665443
No 32
>PF02601 Exonuc_VII_L: Exonuclease VII, large subunit; InterPro: IPR020579 Exonuclease VII 3.1.11.6 from EC is composed of two nonidentical subunits; one large subunit and 4 small ones []. Exonuclease VII catalyses exonucleolytic cleavage in either 5'-3' or 3'-5' direction to yield 5'-phosphomononucleotides. The large subunit also contains the OB-fold domains (IPR004365 from INTERPRO) that bind to nucleic acids at the N terminus. This entry represents Exonuclease VII, large subunit, C-terminal. ; GO: 0008855 exodeoxyribonuclease VII activity
Probab=71.98 E-value=51 Score=26.58 Aligned_cols=82 Identities=16% Similarity=0.225 Sum_probs=55.2
Q ss_pred HHhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhHHHhcCCC------ChhHHHHHHHHhhHHHHH-HHHHHHH
Q psy460 33 LGLIKPTVDLLDEKFRATRA-SQLELRLQIEALTSQLDKISAEQQTAV------PLDSYTRKLVDAQQKISI-VGNILQT 104 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~tr~-SQ~eL~~qId~L~~eL~~l~e~q~~p~------dLd~yvkKL~~ak~RV~~-vnniLq~ 104 (117)
..++-|...++..++...+. -....+..++.....|..+...-..+. -+..+-.+|...++|+.. ++..|+.
T Consensus 138 Ae~~~~~~~~~~~~l~~~~~~l~~~~~~~l~~~~~~L~~l~~~l~~~~~~~p~~~l~~~~~~Ld~l~~rL~~~~~~~l~~ 217 (319)
T PF02601_consen 138 AELIVPDRRELLQRLDELRQRLNRAMRNRLQRKRQRLNQLAKRLQLQSRRLPERKLEQQQQRLDELKQRLKQAIQQKLQR 217 (319)
T ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34555666666666665553 344456666666666766665433322 167888999999999855 8888888
Q ss_pred HHHHHHhhhh
Q psy460 105 TQVAMHKKFF 114 (117)
Q Consensus 105 ~Q~RL~rl~~ 114 (117)
.+.+|.++..
T Consensus 218 ~~~~L~~l~~ 227 (319)
T PF02601_consen 218 KRQRLQNLSN 227 (319)
T ss_pred HHHHHHHHHH
Confidence 8888888763
No 33
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=71.95 E-value=43 Score=27.80 Aligned_cols=81 Identities=12% Similarity=0.179 Sum_probs=57.8
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCC-ChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 34 GLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAV-PLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~-dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
.-++--.+++.+.+..+..--.+|..+|..|..++..+.+....-. +...+-..+.......-.+++-++.+++.|++|
T Consensus 53 ~~le~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~~~l~~eE~~~~~~~n~~~~~l~~~~~e~~sl~~q~~~~~~~L~~L 132 (314)
T PF04111_consen 53 EKLEQEEEELLQELEELEKEREELDQELEELEEELEELDEEEEEYWREYNELQLELIEFQEERDSLKNQYEYASNQLDRL 132 (314)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3366677777788888888888888888888888888875432211 234555666666677777888888888888887
Q ss_pred hh
Q psy460 113 FF 114 (117)
Q Consensus 113 ~~ 114 (117)
.+
T Consensus 133 ~k 134 (314)
T PF04111_consen 133 RK 134 (314)
T ss_dssp HT
T ss_pred Hh
Confidence 65
No 34
>PHA02562 46 endonuclease subunit; Provisional
Probab=70.77 E-value=50 Score=28.26 Aligned_cols=93 Identities=12% Similarity=0.108 Sum_probs=52.8
Q ss_pred CCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcC----CCChhHHHHHHHHh
Q psy460 16 TEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQT----AVPLDSYTRKLVDA 91 (117)
Q Consensus 16 ~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~----p~dLd~yvkKL~~a 91 (117)
.=+.|..+-.+. .+-+.-|.--+..++.++.....+..++...++++.+...++.+.... --++...+++....
T Consensus 286 ~Cp~C~~~~~~~--~~~~~~l~d~i~~l~~~l~~l~~~i~~~~~~~~~~~~~~~~i~el~~~i~~~~~~i~~~~~~~~~l 363 (562)
T PHA02562 286 VCPTCTQQISEG--PDRITKIKDKLKELQHSLEKLDTAIDELEEIMDEFNEQSKKLLELKNKISTNKQSLITLVDKAKKV 363 (562)
T ss_pred CCCCCCCcCCCc--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345565555554 566666777788888888887777776666665554444444432111 11255566666666
Q ss_pred hHHHHHHHHHHHHHHHHHH
Q psy460 92 QQKISIVGNILQTTQVAMH 110 (117)
Q Consensus 92 k~RV~~vnniLq~~Q~RL~ 110 (117)
+..|..+++.-....++|.
T Consensus 364 ~~ei~~l~~~~~~~~~~l~ 382 (562)
T PHA02562 364 KAAIEELQAEFVDNAEELA 382 (562)
T ss_pred HHHHHHHHhhhhchHHHHH
Confidence 6666665554444333333
No 35
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=70.67 E-value=41 Score=24.94 Aligned_cols=68 Identities=6% Similarity=0.125 Sum_probs=34.5
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHh
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHK 111 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~r 111 (117)
+..-|.-+=+.|-.+|..+-.++.+-.+-......+...+.+. +.+.+.-|..|+...+....+|..
T Consensus 55 vs~~l~~tKkhLsqRId~vd~klDe~~ei~~~i~~eV~~v~~d-------------v~~i~~dv~~v~~~V~~Le~ki~~ 121 (126)
T PF07889_consen 55 VSESLSSTKKHLSQRIDRVDDKLDEQKEISKQIKDEVTEVRED-------------VSQIGDDVDSVQQMVEGLEGKIDE 121 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHhh-------------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555555554444444444444444444332 233334466666666666666665
Q ss_pred h
Q psy460 112 K 112 (117)
Q Consensus 112 l 112 (117)
|
T Consensus 122 i 122 (126)
T PF07889_consen 122 I 122 (126)
T ss_pred H
Confidence 5
No 36
>PF03961 DUF342: Protein of unknown function (DUF342); InterPro: IPR005646 This family of bacterial proteins has no known function. The proteins are in the region of 500-600 amino acid residues in length.
Probab=70.21 E-value=59 Score=27.86 Aligned_cols=31 Identities=29% Similarity=0.419 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+..+.+..+.-..+++.+++++...|..+..
T Consensus 332 l~~~~~~l~~~~~~~~~~l~~l~~~l~~l~~ 362 (451)
T PF03961_consen 332 LKEKLEELEEELEELKEELEKLKKNLKKLKK 362 (451)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Confidence 3444455555555566677777777777765
No 37
>PF08700 Vps51: Vps51/Vps67; InterPro: IPR014812 The VFT tethering complex (also known as GARP complex, Golgi associated retrograde protein complex, Vps53 tethering complex) is a conserved eukaryotic docking complex which is involved in recycling of proteins from endosomes to the late Golgi. Vps51 (also known as Vps67) is a subunit of VFT and interacts with the SNARE Tlg1 [].
Probab=69.18 E-value=28 Score=22.56 Aligned_cols=64 Identities=19% Similarity=0.246 Sum_probs=41.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 47 FRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 47 V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
+..++.-...|+..|+....+|+.+--. .--+|-.-.+-+...+.-+..+.+.|...+..+.++
T Consensus 21 ~~~i~~~~~~L~~~i~~~~~eLr~~V~~--nY~~fI~as~~I~~m~~~~~~l~~~l~~l~~~~~~l 84 (87)
T PF08700_consen 21 IKEIRQLENKLRQEIEEKDEELRKLVYE--NYRDFIEASDEISSMENDLSELRNLLSELQQSIQSL 84 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh--hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 5666777788889999999999988643 222344444555555555555666666666555554
No 38
>PF15233 SYCE1: Synaptonemal complex central element protein 1
Probab=68.33 E-value=36 Score=25.81 Aligned_cols=55 Identities=18% Similarity=0.343 Sum_probs=42.2
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHH
Q psy460 34 GLIKPTVDLLDEKFRATRASQLELRLQ-------IEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKI 95 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~SQ~eL~~q-------Id~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV 95 (117)
|-|.|=|+.|=++|..+++..+...+. .++|..||+.++.+ ..++..+.|.|+-.
T Consensus 2 gsLeP~iE~LInrInelQQaKKk~~EELgEa~~l~eaL~~ELDsL~~E-------kvhLeeilnkKqe~ 63 (134)
T PF15233_consen 2 GSLEPQIEDLINRINELQQAKKKSSEELGEAQALWEALQRELDSLNGE-------KVHLEEILNKKQET 63 (134)
T ss_pred CCccchHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhhh-------HHHHHHHHHHHHHH
Confidence 457899999999999999999877665 46788999999875 23666666666543
No 39
>PF06632 XRCC4: DNA double-strand break repair and V(D)J recombination protein XRCC4; InterPro: IPR010585 This entry represents the DNA double-strand break repair and V(D)J recombination protein XRCC4, which is found in certain Metazoans, fungi and plants. XRCC4 binds to DNA, and to DNA ligase IV (LIG4) to form the LIG4-XRCC4 complex []. The LIG4-XRCC4 complex is responsible for the ligation step in the non-homologous end joining (NHEJ) pathway of DNA double-strand break repair. XRCC4 enhances the joining activity of LIG4. It is thought that XRCC4 and LIG4 are essential for alignment-based gap filling, as well as for final ligation of the breaks []. Binding of the LIG4-XRCC4 complex to DNA ends is dependent on the assembly of the DNA-dependent protein kinase complex DNA-PK to these DNA ends. ; GO: 0003677 DNA binding, 0006302 double-strand break repair, 0006310 DNA recombination, 0005634 nucleus; PDB: 3RWR_R 1IK9_B 3SR2_E 3Q4F_H 3II6_B 1FU1_A 3MUD_B.
Probab=66.11 E-value=84 Score=26.81 Aligned_cols=85 Identities=18% Similarity=0.241 Sum_probs=61.7
Q ss_pred CCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCC--Ch-hHHHHHHHHhhHHHH
Q psy460 20 CDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAV--PL-DSYTRKLVDAQQKIS 96 (117)
Q Consensus 20 ~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~--dL-d~yvkKL~~ak~RV~ 96 (117)
+.+|+. +...|++.+--.+.++..++......-..|+..+..+..+|.++... ..-. +| .-|+.-|+.-|.||-
T Consensus 121 ~~~p~e--~i~el~d~~l~~~~~l~~~~~~L~~enerL~~e~~~~~~qlE~~v~~-K~~~E~~L~~KF~~vLNeKK~KIR 197 (342)
T PF06632_consen 121 VDNPAE--VIRELFDWCLDANSRLQAENEHLQKENERLESEANKLLKQLEKFVNA-KEEHEEDLYAKFVLVLNEKKAKIR 197 (342)
T ss_dssp -S-HHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCChHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhHHHHHH
Confidence 455543 33358888888999999999888777778888888888888888753 2222 23 466777777788888
Q ss_pred HHHHHHHHHHH
Q psy460 97 IVGNILQTTQV 107 (117)
Q Consensus 97 ~vnniLq~~Q~ 107 (117)
.+...|..+++
T Consensus 198 ~lq~~L~~~~~ 208 (342)
T PF06632_consen 198 ELQRLLASAKE 208 (342)
T ss_dssp HHHHHHHHHHH
T ss_pred HHHHHHHHhhc
Confidence 88888877775
No 40
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=65.46 E-value=87 Score=28.12 Aligned_cols=60 Identities=12% Similarity=0.095 Sum_probs=42.4
Q ss_pred HHHHHHHHHHHHHHHhhHHHhcCC---CChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 55 LELRLQIEALTSQLDKISAEQQTA---VPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l~e~q~~p---~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
.+|..+|..+..+|+.++..-... -++..|..++..+++++..++.-+...++++.++..
T Consensus 394 ~~~~~~~~~~e~el~~l~~~l~~~~~~e~i~~l~e~l~~l~~~l~~~~~~~~~~~~~~~~~~~ 456 (650)
T TIGR03185 394 SQLLKELRELEEELAEVDKKISTIPSEEQIAQLLEELGEAQNELFRSEAEIEELLRQLETLKE 456 (650)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666777777777776533222 246788888888899999888888888777776543
No 41
>cd00427 Ribosomal_L29_HIP Ribosomal L29 protein/HIP. L29 is a protein of the large ribosomal Subunit. A homolog, called heparin/heparan sulfate interacting protein (HIP), has also been identified in mammals. L29 is located on the surface of the large ribosomal subunit, where it participates in forming a protein ring that surrounds the polypeptide exit channel, providing structural support for the ribosome. L29 is involved in forming the translocon binding site, along with L19, L22, L23, L24, and L31e. In addition, L29 and L23 form the interaction site for trigger factor (TF) on the ribosomal surface, adjacent to the exit tunnel. L29 forms numerous interactions with L23 and with the 23S rRNA. In some eukaryotes, L29 is referred to as L35, which is distinct from L35 found in bacteria and some eukaryotes (primarily plastids and mitochondria). The mammalian homolog, HIP, is found on the surface of many tissues and cell lines. It is believed to play a role in cell adhesion and modulat
Probab=65.30 E-value=24 Score=22.21 Aligned_cols=49 Identities=18% Similarity=0.333 Sum_probs=36.6
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHH
Q psy460 53 SQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQ 103 (117)
Q Consensus 53 SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq 103 (117)
|-.+|..+|.+|-.||-++......-. + .-..++-..|+.|+-++.++.
T Consensus 7 s~~eL~~~l~~l~~elf~Lr~q~~~~~-~-~~~~~~~~~Rr~IARi~Til~ 55 (57)
T cd00427 7 SDEELQEKLDELKKELFNLRFQKATGQ-L-ENPHRIRKVRKDIARIKTVLN 55 (57)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHCC-C-cCcHHHHHHHHHHHHHHHHHH
Confidence 557899999999999999985322211 2 244678889999999998875
No 42
>PF10552 ORF6C: ORF6C domain; InterPro: IPR018878 This entry represents the carboxy-terminal domain from ORF6 (Q9B012 from SWISSPROT), an antirepressor protein from Lactococcus phage bIL285 [].
Probab=65.01 E-value=46 Score=23.49 Aligned_cols=51 Identities=8% Similarity=0.135 Sum_probs=35.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHH
Q psy460 46 KFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIV 98 (117)
Q Consensus 46 ~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~v 98 (117)
.+..+-+...++.+.|+.+...+..|.+. .|++-..+-.=--.+++||..+
T Consensus 2 ~i~l~~~~~~~~~~ki~~ve~~V~~l~~~--~~i~~~q~~~i~~~v~~rv~~~ 52 (116)
T PF10552_consen 2 QIKLLMQATEEHNEKIEEVENRVDDLEEN--MPIDPGQQKEIQKAVKSRVYEL 52 (116)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCCHHHHHHHHHHHHHHHHHH
Confidence 35667778889999999999999999875 7776543332223345566655
No 43
>TIGR00012 L29 ribosomal protein L29. called L29 in prokaryotic (50S) large subunits and L35 in eukaryotic (60S) large subunits.
Probab=64.54 E-value=22 Score=22.28 Aligned_cols=50 Identities=18% Similarity=0.273 Sum_probs=37.1
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQ 103 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq 103 (117)
.|-.+|.+.|..|-.||-++.. |.+.=.+. --.++-..|+.|+-+..++.
T Consensus 5 ~s~~EL~~~l~~lr~eLf~Lr~-~~~~~~~~-~~~~i~~~Rk~IARi~Tvl~ 54 (55)
T TIGR00012 5 KSKEELAKKLDELKKELFELRF-QKATGQLA-KPHRIRQVRRDIARLLTVLR 54 (55)
T ss_pred CCHHHHHHHHHHHHHHHHHHHH-HHHhCCcc-cchHHHHHHHHHHHHHHHHh
Confidence 3667899999999999999985 32222232 35677889999999988874
No 44
>KOG0971|consensus
Probab=63.06 E-value=58 Score=32.15 Aligned_cols=73 Identities=21% Similarity=0.277 Sum_probs=50.6
Q ss_pred hhhHHHHHHHH------H-HHHHHHHHHHHHHHHHHHHH----HHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHH
Q psy460 35 LIKPTVDLLDE------K-FRATRASQLELRLQIEALTS----QLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQ 103 (117)
Q Consensus 35 lL~P~I~~lD~------~-V~~tr~SQ~eL~~qId~L~~----eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq 103 (117)
+|.-+|.+|++ + +..-+.--..|+++||.+.. -.++++.+|...+|.+..++| .|.+|..+|..||
T Consensus 466 lLeetv~dlEalee~~EQL~Esn~ele~DLreEld~~~g~~kel~~r~~aaqet~yDrdqTI~K---fRelva~Lqdqlq 542 (1243)
T KOG0971|consen 466 LLEETVGDLEALEEMNEQLQESNRELELDLREELDMAKGARKELQKRVEAAQETVYDRDQTIKK---FRELVAHLQDQLQ 542 (1243)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHhHHHHHHH---HHHHHHHHHHHHH
Confidence 45555555543 2 23344455678999999844 345566677788999988887 5889999999998
Q ss_pred HHHHHHH
Q psy460 104 TTQVAMH 110 (117)
Q Consensus 104 ~~Q~RL~ 110 (117)
..-++..
T Consensus 543 e~~dq~~ 549 (1243)
T KOG0971|consen 543 ELTDQQE 549 (1243)
T ss_pred HHHhhhh
Confidence 8776543
No 45
>PF15035 Rootletin: Ciliary rootlet component, centrosome cohesion
Probab=62.81 E-value=55 Score=25.39 Aligned_cols=69 Identities=25% Similarity=0.242 Sum_probs=44.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHH---HHH-HHHhhHHHhcCCCChhHHHHHHHHhhHHHHH---HHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEA---LTS-QLDKISAEQQTAVPLDSYTRKLVDAQQKISI---VGNILQTT 105 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~---L~~-eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~---vnniLq~~ 105 (117)
....|.+|.++|..-+.-=.+|..++.. |.. ...+..+ ....+||+..+.+|-.-++|... ||++|..=
T Consensus 14 qa~Lv~~LQ~KV~qYr~rc~ele~~l~~~~~l~~~~~~~~~~-~e~s~dLe~~l~rLeEEqqR~~~L~qvN~lLReQ 89 (182)
T PF15035_consen 14 QAQLVQRLQAKVLQYRKRCAELEQQLSASQVLESPSQRRRSE-EEHSPDLEEALIRLEEEQQRSEELAQVNALLREQ 89 (182)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcCccccccccc-ccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 3567888889988877744444444410 000 0001122 23568999999999999999887 99999543
No 46
>PF00831 Ribosomal_L29: Ribosomal L29 protein; InterPro: IPR001854 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. Ribosomal protein L29 is one of the proteins from the large ribosomal subunit. L29 belongs to a family of ribosomal proteins of 63 to 138 amino-acid residues which, on the basis of sequence similarities [], groups: Red algal L29. Bacterial L29. Mammalian L35 Caenorhabditis elegans L35 (ZK652.4). Yeast L35. ; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 1VSP_W 3MS1_Y 3MRZ_Y 3F1H_2 3PYT_Y 3PYO_Y 3D5D_2 3D5B_2 3PYR_Y 1VSA_W ....
Probab=62.59 E-value=18 Score=22.90 Aligned_cols=50 Identities=18% Similarity=0.292 Sum_probs=36.9
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQ 103 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq 103 (117)
.|-.+|...|.+|-.+|-++.... +.=.+ .-..++-..|+.|+-++.++.
T Consensus 7 ls~~eL~~~l~elk~eL~~Lr~q~-~~~~l-~n~~~ir~~Rr~IARi~Tvl~ 56 (58)
T PF00831_consen 7 LSDEELQEKLEELKKELFNLRFQK-ATGQL-ENPHRIREIRRDIARILTVLR 56 (58)
T ss_dssp SHHHHHHHHHHHHHHHHHHHHHHH-HHSSS-SCCHHHHHHHHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHH-Hhccc-ccccHHHHHHHHHHHHHHHHh
Confidence 366789999999999999998632 22222 233567788999999998875
No 47
>PF04799 Fzo_mitofusin: fzo-like conserved region; InterPro: IPR006884 This entry represents the heptad repeat domain which is conserved at the C terminus of Fzo/mitofusion family of GTPases. Fzo is a mediator of mitochondrial fusion during spermatogenesis []. This conserved region is also found in the human mitofusin protein []. This domain forms a dimeric antiparallel coiled coil structure, which has been proposed to act as a mitochodrial tether before vesicle fusion [].; GO: 0003924 GTPase activity, 0006184 GTP catabolic process, 0008053 mitochondrial fusion, 0005741 mitochondrial outer membrane, 0016021 integral to membrane; PDB: 1T3J_A.
Probab=61.84 E-value=76 Score=24.85 Aligned_cols=72 Identities=19% Similarity=0.250 Sum_probs=36.3
Q ss_pred CCCCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhH
Q psy460 14 DKTEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQ 93 (117)
Q Consensus 14 ~~~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~ 93 (117)
+.|...|..--.-.|+ -+..+|...|. ..+.+|..+|++|..++.++.+.| .++-.-|.
T Consensus 93 ~~tsancs~QVqqeL~--------~tf~rL~~~Vd---~~~~eL~~eI~~L~~~i~~le~~~----------~~~k~Lrn 151 (171)
T PF04799_consen 93 SFTSANCSHQVQQELS--------STFARLCQQVD---QTKNELEDEIKQLEKEIQRLEEIQ----------SKSKTLRN 151 (171)
T ss_dssp -------------------------HHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHH
T ss_pred HHHhcchHHHHHHHHH--------HHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHH
Confidence 3455556544443333 34455555544 567799999999999999987753 34444456
Q ss_pred HHHHHHHHHHHHH
Q psy460 94 KISIVGNILQTTQ 106 (117)
Q Consensus 94 RV~~vnniLq~~Q 106 (117)
|..-+.+-|..-+
T Consensus 152 Ka~~L~~eL~~F~ 164 (171)
T PF04799_consen 152 KANWLESELERFQ 164 (171)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 6666666555443
No 48
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=61.18 E-value=1.2e+02 Score=26.96 Aligned_cols=25 Identities=12% Similarity=0.206 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEAL 64 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L 64 (117)
.+++.+++......|.++...|..|
T Consensus 385 leel~e~leeie~eq~ei~e~l~~L 409 (569)
T PRK04778 385 LEEILKQLEEIEKEQEKLSEMLQGL 409 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444444
No 49
>PF13166 AAA_13: AAA domain
Probab=61.09 E-value=1.2e+02 Score=26.93 Aligned_cols=90 Identities=20% Similarity=0.261 Sum_probs=43.5
Q ss_pred CCCCCCCC-chHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhH
Q psy460 15 KTEDFCDN-PTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQ 93 (117)
Q Consensus 15 ~~~~~~~~-p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~ 93 (117)
.+=+||+. |--..+.+=|-..|.-..+++-. .+......+...++.+...+..+... ..-...+.......+.
T Consensus 256 ~~CpfC~q~~l~~~~~~~l~~~f~~~~~~~~~---~l~~~~~~~~~~~~~~~~~l~~~~~~---~~~~~~~~~~~~~~~~ 329 (712)
T PF13166_consen 256 DTCPFCQQEPLSEERKERLEKYFDEEYEKLIE---ELEKAIKKLEKAIENIIEQLESILSE---NDFYEEFEEDKEELKS 329 (712)
T ss_pred CcCCCCCCcCCcHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhhhh---hhhhhHHHHHHHHHHH
Confidence 46789999 55544544455555544333322 23333444444444444444443321 1112334444555555
Q ss_pred HHHHHHHHHHHHHHHHH
Q psy460 94 KISIVGNILQTTQVAMH 110 (117)
Q Consensus 94 RV~~vnniLq~~Q~RL~ 110 (117)
.+..++..|......|.
T Consensus 330 ~~~~l~~~l~~l~~~L~ 346 (712)
T PF13166_consen 330 AIEALKEELEELKKALE 346 (712)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 66666666555554443
No 50
>PF15458 NTR2: Nineteen complex-related protein 2
Probab=60.94 E-value=30 Score=27.92 Aligned_cols=39 Identities=21% Similarity=0.367 Sum_probs=34.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
|.+++..|...+.....+...+..+|+.|..|...|.+-
T Consensus 206 L~~~~~rL~~~l~~le~~~~~~~~~l~~l~~E~~~I~~r 244 (254)
T PF15458_consen 206 LSECLERLRESLSSLEDSKSQLQQQLESLEKEKEEIEER 244 (254)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456668999999999999999999999999999999874
No 51
>PF10779 XhlA: Haemolysin XhlA; InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes [].
Probab=60.16 E-value=38 Score=22.06 Aligned_cols=38 Identities=21% Similarity=0.308 Sum_probs=28.3
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
+--++.++.++.........+..+|..+...|.+|.+.
T Consensus 12 e~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~l~~I~~n 49 (71)
T PF10779_consen 12 ETKLDNHEERIDKLEKRDAANEKDIKNLNKQLEKIKSN 49 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33455566677777778888888888888888888775
No 52
>cd00584 Prefoldin_alpha Prefoldin alpha subunit; Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=59.80 E-value=44 Score=23.45 Aligned_cols=40 Identities=20% Similarity=0.385 Sum_probs=32.2
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 35 LIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 35 lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
=+.-+++-|+.++.....+..+|..+|..+..++..+.+.
T Consensus 84 ~~~eA~~~l~~r~~~l~~~~~~l~~~l~~l~~~~~~~~~~ 123 (129)
T cd00584 84 DLEEAIEFLDKKIEELTKQIEKLQKELAKLKDQINTLEAE 123 (129)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455778888888888888888888888888888887654
No 53
>COG1421 CRISPR system related protein [Defense mechanisms]
Probab=58.72 E-value=11 Score=28.74 Aligned_cols=20 Identities=25% Similarity=0.476 Sum_probs=16.3
Q ss_pred hHHHHH--HHHHHhhhHHHHHH
Q psy460 24 TQDTIA--EGLLGLIKPTVDLL 43 (117)
Q Consensus 24 ~rd~La--~GL~~lL~P~I~~l 43 (117)
||+... +||+++|+|.|+++
T Consensus 86 gR~~~~~~e~lyevle~~idki 107 (137)
T COG1421 86 GRDGKEALEGLYEVLEEMIDKI 107 (137)
T ss_pred hhcchhhHHHHHHHHHHHHHHh
Confidence 344444 89999999999999
No 54
>PRK02224 chromosome segregation protein; Provisional
Probab=58.49 E-value=1.2e+02 Score=27.84 Aligned_cols=59 Identities=12% Similarity=0.069 Sum_probs=33.7
Q ss_pred HHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 54 QLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 54 Q~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
..+++.+|..+.+.+.++... -.+-++.....++..+...+..+..-|.....++.++.
T Consensus 622 ~~~~~~~l~~~r~~i~~l~~~-~~~~~~e~l~~~~~~~~~~~~~l~~~l~~~~~~~~~l~ 680 (880)
T PRK02224 622 NDERRERLAEKRERKRELEAE-FDEARIEEAREDKERAEEYLEQVEEKLDELREERDDLQ 680 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHH-hCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 335555666666666665432 23445566667777777777777665655555554443
No 55
>KOG3046|consensus
Probab=57.77 E-value=30 Score=26.60 Aligned_cols=58 Identities=22% Similarity=0.296 Sum_probs=37.1
Q ss_pred hHHHHHHHHHHhhhHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHhhHHHh------cCCCChhHHH
Q psy460 24 TQDTIAEGLLGLIKPTVDLLDEKFRATR-----------ASQLELRLQIEALTSQLDKISAEQ------QTAVPLDSYT 85 (117)
Q Consensus 24 ~rd~La~GL~~lL~P~I~~lD~~V~~tr-----------~SQ~eL~~qId~L~~eL~~l~e~q------~~p~dLd~yv 85 (117)
+++.+++-|..+ +.++++=++..+ .||..|.+.|++|-.+|+++...+ +.|.++-+|+
T Consensus 6 ~~~q~~ekl~~l----~~~le~~~e~~~~Lgl~vs~F~~tsq~~L~qrl~tLv~~L~~l~~~s~k~n~i~IPleVl~yI 80 (147)
T KOG3046|consen 6 NNDQMQEKLAQL----ENSLEKFLENFRQLGLIVSNFQPTSQDALNQRLNTLVRGLQDLDKLSSKLNDIQIPLEVLEYI 80 (147)
T ss_pred hhHHHHHHHHHH----HHHHHHHHHHHHHHhHhhhcCCCCcHHHHHHHHHHHHHHhhhhHHHHHhhccccCcHHHHHHH
Confidence 345666655443 334444444433 379999999999999999887654 2355555665
No 56
>PF03980 Nnf1: Nnf1 ; InterPro: IPR007128 NNF1 is an essential yeast gene required for proper spindle orientation, nucleolar and nuclear envelope structure and mRNA export [].
Probab=57.72 E-value=60 Score=22.37 Aligned_cols=47 Identities=13% Similarity=0.116 Sum_probs=36.4
Q ss_pred CCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 19 FCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALT 65 (117)
Q Consensus 19 ~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~ 65 (117)
.+..+....+..=|+..+.-.+++|+.++..+...=..|.++|..+-
T Consensus 61 ~~~l~P~~~i~a~l~~~~~~~~~~L~~~l~~l~~eN~~L~~~i~~~r 107 (109)
T PF03980_consen 61 RHSLTPEEDIRAHLAPYKKKEREQLNARLQELEEENEALAEEIQEQR 107 (109)
T ss_pred CCCCChHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34445555667779999999999999999999987777777776653
No 57
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=57.09 E-value=54 Score=22.49 Aligned_cols=38 Identities=16% Similarity=0.386 Sum_probs=28.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+.-+++-++.++..+.....+|..+|+.+..++..+..
T Consensus 85 ~~eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~ 122 (129)
T cd00890 85 LEEAIEFLKKRLETLEKQIEKLEKQLEKLQDQITELQE 122 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456778888888888878888888888777777765
No 58
>PRK03918 chromosome segregation protein; Provisional
Probab=56.95 E-value=96 Score=28.19 Aligned_cols=57 Identities=9% Similarity=0.184 Sum_probs=28.8
Q ss_pred HHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 55 LELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
.+|+.+|+.|.+++..+... -.+-++..+-.++...+.++..++.-+...+.++..+
T Consensus 636 ~~l~~~i~~l~~~~~~l~~~-~~~~~~~~l~~~~~~l~~~l~~l~~~~~~l~~~i~~l 692 (880)
T PRK03918 636 AETEKRLEELRKELEELEKK-YSEEEYEELREEYLELSRELAGLRAELEELEKRREEI 692 (880)
T ss_pred HHHHHHHHHHHHHHHHHHhh-cCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555666666666655421 1223344555555555555555555555555444443
No 59
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=56.70 E-value=51 Score=23.55 Aligned_cols=39 Identities=13% Similarity=0.307 Sum_probs=31.4
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 35 LIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 35 lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
-+.-+++-|+.++.....+...|..+|..+..++..+..
T Consensus 91 ~~~eA~~~l~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~ 129 (140)
T PRK03947 91 DLDEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQ 129 (140)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467788888999888888888888888888777777654
No 60
>KOG3684|consensus
Probab=56.60 E-value=37 Score=30.63 Aligned_cols=43 Identities=23% Similarity=0.282 Sum_probs=35.2
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCC
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAV 79 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~ 79 (117)
.-+.....+-++.....|.+|..||+.|...|..|.+.-.+-|
T Consensus 426 aK~~~~myd~~~~l~~~q~~le~qI~~Le~kl~~l~~~l~s~~ 468 (489)
T KOG3684|consen 426 AKTQNDMYDLLQELHSRQEELEKQIDTLESKLEALTASLSSLP 468 (489)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCc
Confidence 4455667788999999999999999999999999987654433
No 61
>PF06120 Phage_HK97_TLTM: Tail length tape measure protein; InterPro: IPR009302 This entry consists of the tail length tape measure protein from Bacteriophage HK97 and related sequences from Escherichia coli (strain K12).
Probab=56.58 E-value=1.2e+02 Score=25.55 Aligned_cols=35 Identities=17% Similarity=0.328 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
-++.++.+.+.+...-=.+|+.+|+.|..++..+.
T Consensus 74 ~~~~k~~~si~~q~~~i~~l~~~i~~l~~~i~~y~ 108 (301)
T PF06120_consen 74 ANIAKAEESIAAQKRAIEDLQKKIDSLKDQIKNYQ 108 (301)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444444444444444455556666666665443
No 62
>PF05377 FlaC_arch: Flagella accessory protein C (FlaC); InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=56.13 E-value=53 Score=21.29 Aligned_cols=35 Identities=17% Similarity=0.333 Sum_probs=30.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
|++|++++-..-.+=..++.+++.+..++.++.+.
T Consensus 2 i~elEn~~~~~~~~i~tvk~en~~i~~~ve~i~en 36 (55)
T PF05377_consen 2 IDELENELPRIESSINTVKKENEEISESVEKIEEN 36 (55)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67899999999998888899999998888888774
No 63
>PRK02224 chromosome segregation protein; Provisional
Probab=55.99 E-value=1.1e+02 Score=27.96 Aligned_cols=46 Identities=15% Similarity=0.261 Sum_probs=27.6
Q ss_pred hHHHHHHHHHHhhhHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHhh
Q psy460 24 TQDTIAEGLLGLIKPTVDLLDEKFRA----TRASQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 24 ~rd~La~GL~~lL~P~I~~lD~~V~~----tr~SQ~eL~~qId~L~~eL~~l 71 (117)
.|..+-+.|+++ -.+++++.++.. +...+..++.+++.+..+|...
T Consensus 150 ~R~~ii~~l~~l--~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 199 (880)
T PRK02224 150 DRQDMIDDLLQL--GKLEEYRERASDARLGVERVLSDQRGSLDQLKAQIEEK 199 (880)
T ss_pred HHHHHHHHHhCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555666665 334555555555 4447777777777777766653
No 64
>PF13514 AAA_27: AAA domain
Probab=55.36 E-value=70 Score=30.64 Aligned_cols=91 Identities=13% Similarity=0.189 Sum_probs=70.6
Q ss_pred HHHHHHHH-HHhhhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHH
Q psy460 25 QDTIAEGL-LGLIKPTVDLLDEKFRATRA---SQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGN 100 (117)
Q Consensus 25 rd~La~GL-~~lL~P~I~~lD~~V~~tr~---SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnn 100 (117)
.-.|+.|. +|=|.++.++|+........ +--.+++.|.++.+-=+++.+....+-+.....+.+..+..++..+..
T Consensus 116 ~~Lf~agaG~~~l~~~~~~L~~ea~~Lfkprg~~~~in~~l~~l~e~~~~l~~~~~~~~~y~~l~~~~~~~~~~~~~l~~ 195 (1111)
T PF13514_consen 116 QLLFSAGAGLGSLSQVLKQLDKEADELFKPRGRKPEINQALKELKELERELREAEVRAAEYQELQQALEEAEEELEELRA 195 (1111)
T ss_pred HHHHHhccccccHHHHHHHHHHHHHHhhCCCCCChHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33555554 33389999999988877654 344677777777777777777666777888888999999999999999
Q ss_pred HHHHHHHHHHhhhhh
Q psy460 101 ILQTTQVAMHKKFFM 115 (117)
Q Consensus 101 iLq~~Q~RL~rl~~~ 115 (117)
-++..+.++.++.++
T Consensus 196 ~~~~l~~~~~~ler~ 210 (1111)
T PF13514_consen 196 ELKELRAELRRLERL 210 (1111)
T ss_pred HHHHHHHHHHHHHHH
Confidence 999999998888775
No 65
>PRK01203 prefoldin subunit alpha; Provisional
Probab=54.27 E-value=58 Score=24.30 Aligned_cols=41 Identities=20% Similarity=0.310 Sum_probs=35.0
Q ss_pred hHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 24 TQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTS 66 (117)
Q Consensus 24 ~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~ 66 (117)
+++..++ +.+++--|+.|..++...+.+|.++.+.|+.|..
T Consensus 2 ~~~~~~~--~~~~~~q~e~l~~ql~~L~~a~se~~~~ie~L~~ 42 (130)
T PRK01203 2 ARDVEAQ--LNYIESLISSVDSQIDSLNKTLSEVQQTISFLSD 42 (130)
T ss_pred hhhHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 3444444 6889999999999999999999999999999976
No 66
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=54.19 E-value=1.8e+02 Score=26.85 Aligned_cols=23 Identities=22% Similarity=0.224 Sum_probs=8.9
Q ss_pred HHHHHhhHHHHHHHHHHHHHHHH
Q psy460 86 RKLVDAQQKISIVGNILQTTQVA 108 (117)
Q Consensus 86 kKL~~ak~RV~~vnniLq~~Q~R 108 (117)
.++...+..+..++.-+...+.+
T Consensus 469 ~~l~~~~~~l~~l~~~l~~l~~~ 491 (1164)
T TIGR02169 469 QELYDLKEEYDRVEKELSKLQRE 491 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333344444433333333
No 67
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=54.17 E-value=1.5e+02 Score=29.47 Aligned_cols=74 Identities=12% Similarity=0.145 Sum_probs=50.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
|.=-|.+|+..+........+|.+++++|..+...+.. .-++.+-...+..++..+..-...+...++++....
T Consensus 747 l~~~IaeL~~~i~~l~~~l~~l~~r~~~L~~e~~~~Ps----~~dL~~A~~~l~~A~~~~~~a~~~l~~a~~~l~~a~ 820 (1353)
T TIGR02680 747 LDARLAAVDDELAELARELRALGARQRALADELAGAPS----DRSLRAAHRRAAEAERQAESAERELARAARKAAAAA 820 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC----chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555556666666666555666666666555555533 246788888888888888888888888888776643
No 68
>PRK10920 putative uroporphyrinogen III C-methyltransferase; Provisional
Probab=53.62 E-value=93 Score=27.00 Aligned_cols=72 Identities=14% Similarity=0.210 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCCh--hHHHHHHHHhhHH----HHHHHHHHHHHHHHHHhhh
Q psy460 42 LLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPL--DSYTRKLVDAQQK----ISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 42 ~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL--d~yvkKL~~ak~R----V~~vnniLq~~Q~RL~rl~ 113 (117)
+++.++..+...|.+|..++.+|.+.+.++.......--| ..|.-+|.+-|-. |..-=..|+++-.||.+++
T Consensus 96 ~~~~~l~~~e~~~~~l~~q~~~Lq~~~~~ls~~~~~dWlLaEaeyLlrlA~qkL~l~~Dv~tA~alLksAD~rLa~~~ 173 (390)
T PRK10920 96 QQAKALDQANRQQAALAKQLDELQQKVATISGSDAKTWLLAQADFLVKLAGRKLWSDQDVTTAAALLKSADASLADMN 173 (390)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCChhhHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhcC
Confidence 3344455555567777777777777777775431111122 2566666554422 2233345888888888775
No 69
>PRK14549 50S ribosomal protein L29P; Provisional
Probab=53.26 E-value=45 Score=22.00 Aligned_cols=51 Identities=18% Similarity=0.137 Sum_probs=37.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCC-hhHHHHHHHHhhHHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVP-LDSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~d-Ld~yvkKL~~ak~RV~~vnniLq~ 104 (117)
.|-.+|.+.|..|-.||-.+...+ +.-. ++ --.++-.+|+.|.-+..+|..
T Consensus 12 ls~~eL~~~l~elk~eLf~LR~q~-~~~~~l~-n~~~ir~~Rk~IARi~Tvl~e 63 (69)
T PRK14549 12 MSPEEREEKLEELKLELLKERAQA-AMGGAPE-NPGRIREIRRTIARILTIQRE 63 (69)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHH-HhCcCcc-ccHHHHHHHHHHHHHHHHHHH
Confidence 366789999999999999998532 2222 21 224677889999999999875
No 70
>PF12001 DUF3496: Domain of unknown function (DUF3496); InterPro: IPR021885 This presumed domain is functionally uncharacterised. This domain is found in eukaryotes. This domain is about 110 amino acids in length.
Probab=52.56 E-value=76 Score=23.18 Aligned_cols=63 Identities=17% Similarity=0.245 Sum_probs=42.4
Q ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhhHHHhcCCCCh-hHHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy460 39 TVDLLDEKFRATRASQLEL-RLQIEALTSQLDKISAEQQTAVPL-DSYTRKLVDAQQKISIVGNILQTTQVA 108 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL-~~qId~L~~eL~~l~e~q~~p~dL-d~yvkKL~~ak~RV~~vnniLq~~Q~R 108 (117)
.|++|+..+--.+.||.+- +.+++ ++.+.+-.-+-+ ..--.||..+..|+..+++.|-.-.+|
T Consensus 8 rIkdLeselsk~Ktsq~d~~~~eLE-------kYkqly~eElk~r~SLs~kL~ktnerLaevstkLl~Ekeq 72 (111)
T PF12001_consen 8 RIKDLESELSKMKTSQEDSNKTELE-------KYKQLYLEELKLRKSLSNKLNKTNERLAEVSTKLLVEKEQ 72 (111)
T ss_pred HHHHHHHHHHHhHhHhhhhhHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHh
Confidence 5788888888888787776 44443 333322111222 244678999999999999999877776
No 71
>TIGR00293 prefoldin, archaeal alpha subunit/eukaryotic subunit 5. This model finds a set of small proteins from the Archaea and from Aquifex aeolicus that may represent two orthologous groups. The proteins are predicted to be mostly coiled coil, and may hit large numbers of proteins that contain coiled coil regions.
Probab=52.47 E-value=70 Score=22.33 Aligned_cols=39 Identities=10% Similarity=0.241 Sum_probs=32.3
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 35 LIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 35 lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
-+.-+++-|+.++.....+..+|...|..+..++..+..
T Consensus 83 ~~~eA~~~l~~~~~~l~~~~~~l~~~l~~l~~~~~~i~~ 121 (126)
T TIGR00293 83 DAEEAIEFLKKRIEELEKAIEKLQEALAELASRAQQLEQ 121 (126)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 356678889999999999888888888888888887765
No 72
>PF11014 DUF2852: Protein of unknown function (DUF2852); InterPro: IPR021273 This bacterial family of proteins has no known function.
Probab=52.39 E-value=16 Score=26.97 Aligned_cols=27 Identities=22% Similarity=0.481 Sum_probs=21.9
Q ss_pred HHHHHhhHHHhcCCCChhHHHHHHHHhhHH
Q psy460 65 TSQLDKISAEQQTAVPLDSYTRKLVDAQQK 94 (117)
Q Consensus 65 ~~eL~~l~e~q~~p~dLd~yvkKL~~ak~R 94 (117)
..+|++|.|+|. .++.|+.+|-.+|-+
T Consensus 77 ~~tL~RLEeEq~---eF~~Fl~rLR~AKDk 103 (115)
T PF11014_consen 77 EDTLRRLEEEQR---EFEDFLERLRRAKDK 103 (115)
T ss_pred HHHHHHHHHHHH---HHHHHHHHHHHhhhH
Confidence 457888888766 899999999888754
No 73
>PHA02562 46 endonuclease subunit; Provisional
Probab=52.34 E-value=1.5e+02 Score=25.36 Aligned_cols=86 Identities=12% Similarity=0.198 Sum_probs=35.5
Q ss_pred hHHHHHHHHHH--hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------hhHHHhcCCCChhHHHHHHHHhhHH
Q psy460 24 TQDTIAEGLLG--LIKPTVDLLDEKFRATRASQLELRLQIEALTSQLD-------KISAEQQTAVPLDSYTRKLVDAQQK 94 (117)
Q Consensus 24 ~rd~La~GL~~--lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~-------~l~e~q~~p~dLd~yvkKL~~ak~R 94 (117)
.|..+-+-|+| .++.+-...-.+++..+.-..+|..+|..+..++. ++.+. ...+++.+=+.+......
T Consensus 151 er~~il~~l~~~~~~~~~~~~~k~~~~e~~~~i~~l~~~i~~l~~~i~~~~~~i~~~~~~--~~~~i~~l~~e~~~l~~~ 228 (562)
T PHA02562 151 ARRKLVEDLLDISVLSEMDKLNKDKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKK--NGENIARKQNKYDELVEE 228 (562)
T ss_pred hHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH--HHHHHHHHHHHHHHHHHH
Confidence 34444443442 33333333344444444444444444444444443 33332 122333444444444444
Q ss_pred HHHHHHHHHHHHHHHHh
Q psy460 95 ISIVGNILQTTQVAMHK 111 (117)
Q Consensus 95 V~~vnniLq~~Q~RL~r 111 (117)
...+..-+...++.+..
T Consensus 229 ~~~l~~~l~~l~~~i~~ 245 (562)
T PHA02562 229 AKTIKAEIEELTDELLN 245 (562)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 44444444444444433
No 74
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=51.77 E-value=1.1e+02 Score=24.08 Aligned_cols=44 Identities=14% Similarity=0.243 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSY 84 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~y 84 (117)
+.+|+.++......+.+|..-+.++..+|..+-+. +-|+..+..
T Consensus 86 l~~L~~qi~~~~~~~~~l~p~m~~m~~~L~~~v~~-d~Pf~~~eR 129 (251)
T PF11932_consen 86 LASLEQQIEQIEETRQELVPLMEQMIDELEQFVEL-DLPFLLEER 129 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCChHHH
Confidence 33344444444444444444444444444444441 344444433
No 75
>PF09748 Med10: Transcription factor subunit Med10 of Mediator complex; InterPro: IPR019145 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med10 is one of the protein subunits of the Mediator complex, tethered to Med14 (Rgr1) protein. Med10 specifically mediates basal-level HIS4 transcription via Gcn4. In addition, there is a putative requirement for Med10 in Bas2-mediated transcription []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=51.21 E-value=25 Score=25.60 Aligned_cols=34 Identities=21% Similarity=0.190 Sum_probs=26.7
Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHHhhHHHh
Q psy460 42 LLDEKFRATR--ASQLELRLQIEALTSQLDKISAEQ 75 (117)
Q Consensus 42 ~lD~~V~~tr--~SQ~eL~~qId~L~~eL~~l~e~q 75 (117)
++--.|.+-+ .||..|...|..|...|.++....
T Consensus 14 el~~~v~d~~~~~s~~~L~~ki~~lv~~L~~l~~~~ 49 (128)
T PF09748_consen 14 ELGVIVSDFQGPPSQEALNQKINQLVTSLQELDKLA 49 (128)
T ss_pred HHHHHHHcCCCCCcHHHHHHHHHHHHHHHHHHHHHh
Confidence 3444555554 899999999999999999998654
No 76
>PRK00461 rpmC 50S ribosomal protein L29; Reviewed
Probab=50.93 E-value=69 Score=22.31 Aligned_cols=51 Identities=25% Similarity=0.292 Sum_probs=38.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~ 104 (117)
.|-.||.+.|..|-.||-++......- .+. --.++-.+|+.|+-+..++..
T Consensus 8 lS~eEL~e~L~elkkELf~LR~q~atg-ql~-n~~~ir~iRR~IARilTvl~E 58 (87)
T PRK00461 8 KSVEELEKLVIELKAELFTLRFKNATG-SLD-QTHKIKEIRKDIARILTILNE 58 (87)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhC-ccc-ccHHHHHHHHHHHHHHHHHHH
Confidence 467899999999999999998532222 232 234578899999999999875
No 77
>TIGR00634 recN DNA repair protein RecN. All proteins in this family for which functions are known are ATP binding proteins involved in the initiation of recombination and recombinational repair.
Probab=50.59 E-value=1.6e+02 Score=25.93 Aligned_cols=66 Identities=14% Similarity=0.196 Sum_probs=46.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCC---hhHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVP---LDSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~d---Ld~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
..++..+.+..+..+.++..+++.|..++.+|.+....|-+ |..-.++|.|+.+-...++..++..
T Consensus 170 ~~~~~~~L~~l~~~~~~~~~eld~L~~ql~ELe~~~l~~~E~e~L~~e~~~L~n~e~i~~~~~~~~~~L 238 (563)
T TIGR00634 170 WLKARQQLKDRQQKEQELAQRLDFLQFQLEELEEADLQPGEDEALEAEQQRLSNLEKLRELSQNALAAL 238 (563)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHhCCcCCCcHHHHHHHHHHHhCHHHHHHHHHHHHHHH
Confidence 34445555566666777788888888888888776443332 5677888888888777777776543
No 78
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=50.44 E-value=2.1e+02 Score=26.46 Aligned_cols=21 Identities=10% Similarity=0.096 Sum_probs=8.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHh
Q psy460 91 AQQKISIVGNILQTTQVAMHK 111 (117)
Q Consensus 91 ak~RV~~vnniLq~~Q~RL~r 111 (117)
.+.++..+..-+...+.++.+
T Consensus 467 ~~~~l~~~~~~l~~l~~~l~~ 487 (1164)
T TIGR02169 467 YEQELYDLKEEYDRVEKELSK 487 (1164)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 333333333334444444333
No 79
>PRK00306 50S ribosomal protein L29; Reviewed
Probab=50.35 E-value=67 Score=20.72 Aligned_cols=53 Identities=21% Similarity=0.302 Sum_probs=38.5
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQ 106 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q 106 (117)
.|-.+|...|..|-.||-++......- .+ .-..++-..|+.|.-+..++..-+
T Consensus 9 ls~~eL~~~l~~lkkeL~~lR~~~~~~-~~-~n~~~i~~~rk~IARi~Tvl~er~ 61 (66)
T PRK00306 9 LSVEELNEKLLELKKELFNLRFQKATG-QL-ENTHRLREVRRDIARIKTVLRERE 61 (66)
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHhC-CC-cCcHHHHHHHHHHHHHHHHHHHHH
Confidence 466789999999999999888532111 12 234577789999999999987643
No 80
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=50.34 E-value=98 Score=22.61 Aligned_cols=74 Identities=12% Similarity=0.191 Sum_probs=49.7
Q ss_pred HHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy460 29 AEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVA 108 (117)
Q Consensus 29 a~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~R 108 (117)
+-+++.-|+..|+++|..+...+.-=..|..+=+.+++|+-++... .-.+.....++..+..-++..+.|
T Consensus 14 ~~~~ve~L~s~lr~~E~E~~~l~~el~~l~~~r~~l~~Eiv~l~~~----------~e~~~~~~~~~~~L~~el~~l~~r 83 (120)
T PF12325_consen 14 SVQLVERLQSQLRRLEGELASLQEELARLEAERDELREEIVKLMEE----------NEELRALKKEVEELEQELEELQQR 83 (120)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3458888999999999999988887777777777777777776653 122233344555555555555555
Q ss_pred HHhh
Q psy460 109 MHKK 112 (117)
Q Consensus 109 L~rl 112 (117)
...+
T Consensus 84 y~t~ 87 (120)
T PF12325_consen 84 YQTL 87 (120)
T ss_pred HHHH
Confidence 5443
No 81
>PF06160 EzrA: Septation ring formation regulator, EzrA ; InterPro: IPR010379 During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerises into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation [].; GO: 0000921 septin ring assembly, 0005940 septin ring, 0016021 integral to membrane
Probab=50.06 E-value=1.3e+02 Score=26.78 Aligned_cols=59 Identities=22% Similarity=0.332 Sum_probs=36.0
Q ss_pred HHHHHHHHHHHHHHHhhHHHhcCCC-ChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 55 LELRLQIEALTSQLDKISAEQQTAV-PLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l~e~q~~p~-dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
.++...++.+...|..+.+.|..-. .|...-+.=..||.++......|.+++.++.|.+
T Consensus 375 S~i~~~l~~~~~~l~~ie~~q~~~~~~l~~L~~dE~~Ar~~l~~~~~~l~~ikR~lek~n 434 (560)
T PF06160_consen 375 SEIQEELEEIEEQLEEIEEEQEEINESLQSLRKDEKEAREKLQKLKQKLREIKRRLEKSN 434 (560)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence 3444444444444444444321111 1455555667888899999999999988887754
No 82
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=49.58 E-value=1.1e+02 Score=22.76 Aligned_cols=19 Identities=32% Similarity=0.525 Sum_probs=8.6
Q ss_pred HHHHHHHHHHHHHHHHHhh
Q psy460 53 SQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 53 SQ~eL~~qId~L~~eL~~l 71 (117)
...+++.++..|.+||.++
T Consensus 82 e~~~~~~~l~~l~~el~~l 100 (191)
T PF04156_consen 82 ELSELQQQLQQLQEELDQL 100 (191)
T ss_pred hHHhHHHHHHHHHHHHHHH
Confidence 3344444444444444444
No 83
>COG0497 RecN ATPase involved in DNA repair [DNA replication, recombination, and repair]
Probab=49.38 E-value=2e+02 Score=26.41 Aligned_cols=75 Identities=11% Similarity=0.080 Sum_probs=53.0
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 34 GLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
.-.--+-++++.++...+.++.+..+++|.|.=++.+|.+...+|- .| ..|.+-++|+.....+.+.++.-+.-|
T Consensus 160 ~~~y~~w~~~~~~l~~~~~~~~e~~~~~d~L~fq~~Ele~~~l~~g---E~-e~L~~e~~rLsn~ekl~~~~~~a~~~L 234 (557)
T COG0497 160 QEAYQAWKQARRELEDLQEKERERAQRADLLQFQLEELEELNLQPG---ED-EELEEERKRLSNSEKLAEAIQNALELL 234 (557)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCc---hH-HHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 3344456788889999999999999999999999999988533333 22 346666666666666666665544443
No 84
>PF14728 PHTB1_C: PTHB1 C-terminus
Probab=49.35 E-value=1.7e+02 Score=25.15 Aligned_cols=80 Identities=15% Similarity=0.275 Sum_probs=53.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH------hcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE------QQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAM 109 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~------q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL 109 (117)
|.+..+.+|.|++ .|.--.++...|+..+.+++-+-.. ...|-+|+..=-=|-..=+.+..+..-+..++.++
T Consensus 199 l~~~~~~id~H~~-lr~~~~~~~~~L~~~a~QfRaIQrrlL~r~kd~~p~~l~~L~~LLe~ty~~l~~~~d~~~~~~~~l 277 (377)
T PF14728_consen 199 LQEYFEIIDQHFE-LRQELKELEEELDERAQQFRAIQRRLLTRFKDKNPAPLDNLDTLLEGTYRQLIALADEIEELQANL 277 (377)
T ss_pred HHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6778889999987 8888999999999999887654321 14666776433334444455666666666666666
Q ss_pred Hhhhhhc
Q psy460 110 HKKFFMG 116 (117)
Q Consensus 110 ~rl~~~~ 116 (117)
.+...++
T Consensus 278 ~~a~~~L 284 (377)
T PF14728_consen 278 KRAGASL 284 (377)
T ss_pred HHHhhhH
Confidence 6655443
No 85
>CHL00154 rpl29 ribosomal protein L29; Validated
Probab=49.23 E-value=67 Score=21.18 Aligned_cols=50 Identities=8% Similarity=0.099 Sum_probs=38.7
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChh-HHHHHHHHhhHHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLD-SYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd-~yvkKL~~ak~RV~~vnniLq~ 104 (117)
.|-.+|...|.+|-.||-++.. |.+.=.++ | .++-.+|+.|+-|..++..
T Consensus 12 ls~~eL~~~l~elk~elf~LRf-q~atgql~n~--~~ir~~RrdIARikTil~e 62 (67)
T CHL00154 12 LTDSEISEEIIKTKKELFDLRL-KKATRQNFKP--HLFKHKKHRLAQLLTLLSS 62 (67)
T ss_pred CCHHHHHHHHHHHHHHHHHHHH-HHHhCcccCh--HHHHHHHHHHHHHHHHHHH
Confidence 4668899999999999999986 44433443 4 5777889999999999864
No 86
>PF12757 DUF3812: Protein of unknown function (DUF3812); InterPro: IPR024527 This family of fungal proteins represents the eisosome 1 family. Eisosome protein 1 is required for normal formation of eisosomes, large cytoplasmic protein assemblies that localize to specialised domains on plasma membrane and mark the site of endocytosis [].
Probab=48.93 E-value=57 Score=23.76 Aligned_cols=54 Identities=28% Similarity=0.354 Sum_probs=38.4
Q ss_pred cCCCCCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 12 ADDKTEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQL 68 (117)
Q Consensus 12 ~~~~~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL 68 (117)
||=..+-|=+..-.+.+|. +.+.|+++++++++...|.-..+++.+-++.-.+.
T Consensus 61 V~lGGGl~m~~~evd~IA~---~rVqPvLdeI~erae~qRa~d~e~k~~~~~~k~~~ 114 (126)
T PF12757_consen 61 VNLGGGLFMDQEEVDAIAR---KRVQPVLDEIDERAEAQRARDEEIKLDEEERKREH 114 (126)
T ss_pred eeCCCCcccCHHHHHHHHH---HhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444555555667766 56899999999999999998888877666654443
No 87
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=48.24 E-value=1.1e+02 Score=22.59 Aligned_cols=62 Identities=16% Similarity=0.320 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
+..+|..+...+.-=.+|+..+-.|..+|..++. .|. .+.-...+...+..+..+...|+..
T Consensus 74 l~~ld~ei~~L~~el~~l~~~~k~l~~eL~~L~~---~~t-~~el~~~i~~l~~e~~~l~~kL~~l 135 (169)
T PF07106_consen 74 LAELDAEIKELREELAELKKEVKSLEAELASLSS---EPT-NEELREEIEELEEEIEELEEKLEKL 135 (169)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc---CCC-HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5566666666666666677777777777777765 222 3333333334444444444444443
No 88
>PF06013 WXG100: Proteins of 100 residues with WXG; InterPro: IPR010310 ESAT-6 is a small protein appears to be of fundamental importance in virulence and protective immunity in Mycobacterium tuberculosis. Homologues have been detected in other Gram-positive bacterial species. It may represent a novel secretion system potentially driven by the PF01580 from PFAM domains in the YukA-like proteins []. Members of this protein family include secretion targets for type main variants of type VII secretion systems (T7SS), one found in the Actinobacteria, one found in the Firmicutes. This model was derived through iteration from PF06013 from PFAM. The best characterised member of this family is ESAT-6 from Mycobacterium tuberculosis. Members of this family usually are ~100 amino acids in length but occasionally have long C-terminal extension. ; PDB: 3FAV_A 1WA8_A 3Q4H_B 2KG7_A 2VRZ_B 2VS0_B 3OGI_A 3H6P_B 3GVM_B 3GWK_C ....
Probab=47.93 E-value=63 Score=19.72 Aligned_cols=71 Identities=20% Similarity=0.272 Sum_probs=49.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 41 DLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
+.|...+........+|+..+.+|......+.. .=.---=+.|..+.......+..++..|..+.+.|...
T Consensus 7 ~~l~~~a~~~~~~~~~l~~~~~~l~~~~~~l~~-~W~G~a~~af~~~~~~~~~~~~~~~~~L~~~~~~l~~~ 77 (86)
T PF06013_consen 7 EQLRAAAQQLQAQADELQSQLQQLESSIDSLQA-SWQGEAADAFQDKFEEWNQAFRQLNEALEELSQALRQA 77 (86)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHGG-GBTSSTSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-hCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456666667777777777788888777777743 22333346777777777788888888888887777654
No 89
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=47.87 E-value=2.7e+02 Score=27.01 Aligned_cols=67 Identities=16% Similarity=0.304 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 41 DLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
..+...+.....+-..|...++.+...++.+.+. ++.+=..+..++.++..+..-+.+.+.|..++.
T Consensus 761 ~~~~~~l~~~~~~~~~l~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 827 (1163)
T COG1196 761 EELEEELESLEEALAKLKEEIEELEEKRQALQEE------LEELEEELEEAERRLDALERELESLEQRRERLE 827 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444444444444444444433 555666666666666666666666666655543
No 90
>PF04912 Dynamitin: Dynamitin ; InterPro: IPR006996 Dynamitin is a subunit of the microtubule-dependent motor complex, it is also implicated in cell adhesion by binding to macrophage-enriched myristoylated alanine-rice C kinase substrate (MacMARCKS) []. It is also thought to modulate cytoplasmic dynein binding to an organelle, and plays a role in prometaphase chromosome alignment and spindle organisation during mitosis. Dynamitin is also involved in anchoring microtubules to centrosomes and may play a role in synapse formation during brain development []. ; GO: 0007017 microtubule-based process, 0005869 dynactin complex
Probab=47.46 E-value=1.7e+02 Score=24.61 Aligned_cols=53 Identities=11% Similarity=0.218 Sum_probs=35.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 45 EKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 45 ~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
..+...-..|..+..+|....+-|..+.+. .+..+..|++-+..+.+||.+|+
T Consensus 336 ~~l~~le~~q~~l~~~l~~~~~~L~~ve~~----------------~~~N~~~i~~n~~~le~Ri~~L~ 388 (388)
T PF04912_consen 336 QTLSELESQQSDLQSQLKKWEELLNKVEEK----------------FKENMETIEKNVKKLEERIAKLQ 388 (388)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------HHHHHHHHHHHHHHHHHHHhccC
Confidence 344444555556666666666555555542 55778888889999999998874
No 91
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=47.45 E-value=70 Score=20.09 Aligned_cols=32 Identities=16% Similarity=0.283 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l 71 (117)
+.++..++...+.-..+|+.+.+.|..++.++
T Consensus 19 ~~~~~~ei~~l~~~i~~l~~e~~~L~~ei~~l 50 (80)
T PF04977_consen 19 YYQLNQEIAELQKEIEELKKENEELKEEIERL 50 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 56777778887777788888888888888877
No 92
>smart00338 BRLZ basic region leucin zipper.
Probab=47.29 E-value=70 Score=20.06 Aligned_cols=36 Identities=25% Similarity=0.430 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
.-+..|+.+|.....-=.+|+.+++.|..++..+..
T Consensus 26 ~~~~~Le~~~~~L~~en~~L~~~~~~l~~e~~~lk~ 61 (65)
T smart00338 26 AEIEELERKVEQLEAENERLKKEIERLRRELEKLKS 61 (65)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456778888888877777788888888777777654
No 93
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=47.16 E-value=83 Score=20.89 Aligned_cols=18 Identities=11% Similarity=0.235 Sum_probs=8.3
Q ss_pred HHHHHHHHHHHHHHHHHH
Q psy460 93 QKISIVGNILQTTQVAMH 110 (117)
Q Consensus 93 ~RV~~vnniLq~~Q~RL~ 110 (117)
.|+..+..-+.+.++|+.
T Consensus 54 ~~~~~~e~~~~~l~~~l~ 71 (74)
T PF12329_consen 54 KKLEELEKELESLEERLK 71 (74)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 444444444444444443
No 94
>COG4477 EzrA Negative regulator of septation ring formation [Cell division and chromosome partitioning]
Probab=46.32 E-value=1.5e+02 Score=27.41 Aligned_cols=61 Identities=8% Similarity=0.149 Sum_probs=43.1
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHH
Q psy460 33 LGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVG 99 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vn 99 (117)
|+.+.-.+..++.++.++..-|...++.+-+|-.+=..- -=++..|..||...||++-+-|
T Consensus 377 yS~lq~~l~~~~~~l~~i~~~q~~~~e~L~~LrkdEl~A------re~l~~~~~~l~eikR~mek~n 437 (570)
T COG4477 377 YSELQDNLEEIEKALTDIEDEQEKVQEHLTSLRKDELEA------RENLERLKSKLHEIKRYMEKSN 437 (570)
T ss_pred HHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHcC
Confidence 677888888888888888888888888777774422222 2247777777777777776544
No 95
>smart00126 IL6 Interleukin-6 homologues. Family includes granulocyte colony-stimulating factor (G-CSF) and myelomonocytic growth factor (MGF). IL-6 is also known as B-cell stimulatory factor 2.
Probab=46.15 E-value=81 Score=24.17 Aligned_cols=93 Identities=17% Similarity=0.154 Sum_probs=61.5
Q ss_pred CCCCCCchH------HHHHHHHH---HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh-cCCCChhHHHH
Q psy460 17 EDFCDNPTQ------DTIAEGLL---GLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQ-QTAVPLDSYTR 86 (117)
Q Consensus 17 ~~~~~~p~r------d~La~GL~---~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q-~~p~dLd~yvk 86 (117)
++-|..++- --|+.||+ .+|+-+=..+.+....++.-|...+.=|-.+-++.++..+.. ..|..=..-..
T Consensus 42 ~dgC~~~gfn~e~CL~ri~~GL~~yq~~L~~l~~~f~~~~~~v~~l~~~~~~L~~~l~~k~k~~~~v~~p~p~~~~~ll~ 121 (154)
T smart00126 42 KDGCFQSGFNQEICLVKITAGLLEYQVYLEYLQNEFPENKENVDTLQLDTKTLIQIIQQEMKDLGKITYPTPTANRGLLP 121 (154)
T ss_pred CCCCCcccCCHhHHHHHHHHHHHHHHHHHHHHHHhCccccccHHHHHHHHHHHHHHHHHHhhCcCccccCCCccchhhhh
Confidence 345666642 36899976 566655566665555556555555555666666777776532 22333346677
Q ss_pred HHHHhhH--HHHHHHHHHHHHHHHH
Q psy460 87 KLVDAQQ--KISIVGNILQTTQVAM 109 (117)
Q Consensus 87 KL~~ak~--RV~~vnniLq~~Q~RL 109 (117)
+|..... |-...+.||++.|+=|
T Consensus 122 ~l~s~~~W~r~~t~~lILr~Le~FL 146 (154)
T smart00126 122 KLQSQNQWVRNATGFLILSSLERFL 146 (154)
T ss_pred hccCccHHHHHHHHHHHHHHHHHHH
Confidence 8888888 9999999999998754
No 96
>PRK00846 hypothetical protein; Provisional
Probab=46.03 E-value=95 Score=21.23 Aligned_cols=42 Identities=19% Similarity=0.210 Sum_probs=33.6
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
-+.+..-+|++|+.-|..-+.--..|+.+|..|.+.|+.+..
T Consensus 21 rlAfQe~tIe~LN~~v~~qq~~I~~L~~ql~~L~~rL~~~~~ 62 (77)
T PRK00846 21 RLSFQEQALTELSEALADARLTGARNAELIRHLLEDLGKVRS 62 (77)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 456777889999999988888777888888888888877754
No 97
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=46.03 E-value=2.4e+02 Score=25.88 Aligned_cols=12 Identities=0% Similarity=0.241 Sum_probs=4.3
Q ss_pred hHHHHHHHHHHH
Q psy460 92 QQKISIVGNILQ 103 (117)
Q Consensus 92 k~RV~~vnniLq 103 (117)
+.++..+..-+.
T Consensus 823 ~~~~~~l~~~~~ 834 (1179)
T TIGR02168 823 RERLESLERRIA 834 (1179)
T ss_pred HHHHHHHHHHHH
Confidence 333333333333
No 98
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=45.80 E-value=1.4e+02 Score=24.03 Aligned_cols=46 Identities=20% Similarity=0.219 Sum_probs=33.0
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh
Q psy460 30 EGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQ 75 (117)
Q Consensus 30 ~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q 75 (117)
+.+..--.-..-+|-.++...+.=-.+||++||++.-+|+++.+-|
T Consensus 46 e~~~~~~~~~~~~l~~ql~~lq~ev~~LrG~~E~~~~~l~~~~~rq 91 (263)
T PRK10803 46 ERISNAHSQLLTQLQQQLSDNQSDIDSLRGQIQENQYQLNQVVERQ 91 (263)
T ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 4455544555667777777777777788888888888888877655
No 99
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=45.66 E-value=1e+02 Score=21.45 Aligned_cols=39 Identities=15% Similarity=0.308 Sum_probs=26.2
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy460 33 LGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l 71 (117)
..-|+--++.+|..+......+..|+.++..+...|+.+
T Consensus 69 ~~~l~~r~e~ie~~i~~lek~~~~l~~~l~e~q~~l~~~ 107 (110)
T TIGR02338 69 IQELKEKKETLELRVKTLQRQEERLREQLKELQEKIQEA 107 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334555566677777777777777777777777766655
No 100
>PF10157 DUF2365: Uncharacterized conserved protein (DUF2365); InterPro: IPR019314 This entry is found in a highly conserved family of proteins which have no known function.
Probab=45.54 E-value=1.3e+02 Score=22.78 Aligned_cols=45 Identities=20% Similarity=0.158 Sum_probs=31.3
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 30 EGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 30 ~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
-|-+..++-+|+++=..|.+.-++.+.|=..+++|.++++-+-+.
T Consensus 87 v~~~~~y~~sv~~~cdsvD~sik~~y~liakceELn~~M~~v~~L 131 (149)
T PF10157_consen 87 VEHMETYKDSVDKLCDSVDASIKSMYTLIAKCEELNESMKPVYKL 131 (149)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556667777777777777777777777777777777766553
No 101
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=45.35 E-value=1.4e+02 Score=23.03 Aligned_cols=26 Identities=31% Similarity=0.381 Sum_probs=9.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQL 68 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL 68 (117)
+..++...+..=.+++.+|+...+++
T Consensus 68 ~~~r~~~l~~~i~~~~~~i~~~r~~l 93 (302)
T PF10186_consen 68 LRERLERLRERIERLRKRIEQKRERL 93 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333333
No 102
>PF12718 Tropomyosin_1: Tropomyosin like; InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=45.30 E-value=1.2e+02 Score=22.36 Aligned_cols=54 Identities=19% Similarity=0.305 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVG 99 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vn 99 (117)
++.+++++..+-..-..+..+|.+|...+..+... |+.+-.+|..++..+-...
T Consensus 16 ~e~~e~~~K~le~~~~~~E~EI~sL~~K~~~lE~e------ld~~~~~l~~~k~~lee~~ 69 (143)
T PF12718_consen 16 AEELEAKVKQLEQENEQKEQEITSLQKKNQQLEEE------LDKLEEQLKEAKEKLEESE 69 (143)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHhHH
Confidence 34555566666555555666666665555555442 4555555554444444333
No 103
>COG3879 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.19 E-value=1.8e+02 Score=24.10 Aligned_cols=71 Identities=17% Similarity=0.219 Sum_probs=42.8
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 34 GLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
+++.-++.+-...=...|.-+..|...+.++.+++.+|..+ ...|=+++..+++.+.. ....+++||.++.
T Consensus 32 ~~~~a~~~q~~k~~~~~~~r~~~L~~e~~s~Q~~~~~L~~e------v~~~~~~~~s~~~~~~t---~~~~ie~~l~~l~ 102 (247)
T COG3879 32 GVMLAAVFQTSKGESVRRARDLDLVKELRSLQKKVNTLAAE------VEDLENKLDSVRRSVLT---DDAALEDRLEKLR 102 (247)
T ss_pred HHHHHHHHhhccCcchhhhhhhHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHhHHh---HHHHHHHHHHHHH
Confidence 33333443333332233333447777888888877777765 77888888888844444 4444555887764
No 104
>COG1730 GIM5 Predicted prefoldin, molecular chaperone implicated in de novo protein folding [Posttranslational modification, protein turnover, chaperones]
Probab=44.91 E-value=83 Score=23.80 Aligned_cols=38 Identities=8% Similarity=0.320 Sum_probs=29.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+.-+|+-|+.++...-.+...|.+.|+.|++....+..
T Consensus 92 ~~eAie~l~k~~~~l~~~~~~l~~~l~~l~~~~~~l~~ 129 (145)
T COG1730 92 ADEAIEFLKKRIEELEKAIEKLQQALAELAQRIEQLEQ 129 (145)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678888888888888888888888888777766654
No 105
>PF14257 DUF4349: Domain of unknown function (DUF4349)
Probab=44.32 E-value=1.5e+02 Score=23.28 Aligned_cols=65 Identities=9% Similarity=0.119 Sum_probs=41.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhh
Q psy460 47 FRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFFM 115 (117)
Q Consensus 47 V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~~ 115 (117)
.+++-..=..+..+|..|..+.++|.+....+-+ +.-++...+++..|..-+..++.++..|+.+
T Consensus 127 ~~DvT~~y~D~~arl~~l~~~~~rl~~ll~ka~~----~~d~l~ie~~L~~v~~eIe~~~~~~~~l~~~ 191 (262)
T PF14257_consen 127 SEDVTEQYVDLEARLKNLEAEEERLLELLEKAKT----VEDLLEIERELSRVRSEIEQLEGQLKYLDDR 191 (262)
T ss_pred ccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3333333345666777777777777776444333 4456667777777777777777777776654
No 106
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=44.14 E-value=93 Score=20.59 Aligned_cols=43 Identities=12% Similarity=0.245 Sum_probs=35.0
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
|...+-|++.++-.-+..+...-..+..+++.+..|...+...
T Consensus 13 Lvi~l~~~l~~l~~~l~~~~~ti~~l~~~~~~i~~e~~~ll~~ 55 (90)
T PF06103_consen 13 LVIFLIKVLKKLKKTLDEVNKTIDTLQEQVDPITKEINDLLHN 55 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 4566778888888888888888888888888888888877764
No 107
>PF13815 Dzip-like_N: Iguana/Dzip1-like DAZ-interacting protein N-terminal
Probab=44.01 E-value=1.1e+02 Score=21.56 Aligned_cols=51 Identities=20% Similarity=0.305 Sum_probs=26.7
Q ss_pred CCCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 15 KTEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 15 ~~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
-+...|..++-..+.. |+.|..-+|+=| -.+|..|..++..|.++++...+
T Consensus 44 l~~e~~~~~~dp~~~k-lfrLaQl~ieYL-------l~~q~~L~~~~~~l~~~~~~~~~ 94 (118)
T PF13815_consen 44 LENEDCQHFVDPNFLK-LFRLAQLSIEYL-------LHCQEYLSSQLEQLEERLQELQQ 94 (118)
T ss_pred cChhhccCCCCHHHHH-HHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455565554444433 666666555443 34555555555555555555544
No 108
>PF14644 DUF4456: Domain of unknown function (DUF4456)
Probab=43.74 E-value=1.5e+02 Score=22.93 Aligned_cols=77 Identities=23% Similarity=0.176 Sum_probs=47.8
Q ss_pred CCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHhhHHHhcCCCCh-hHHHHHHHHhhH
Q psy460 16 TEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATR-ASQLELRLQIEALTSQLDKISAEQQTAVPL-DSYTRKLVDAQQ 93 (117)
Q Consensus 16 ~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr-~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL-d~yvkKL~~ak~ 93 (117)
|-+-|-.++-|.+++++...+. .+..+...-+ ++=.+|+.|+.++...|.++.+. ..-.+ ..|..++..+.+
T Consensus 18 ~rP~~i~~t~e~~~d~~~~~l~----~~~~qa~~y~~~~~~elR~qv~~l~~~l~~v~~l--v~~~~~~~~~~~~~~~~~ 91 (208)
T PF14644_consen 18 TRPEMIPETFEQCADNLVQKLQ----SYQEQADEYHNSCLQELRNQVERLEELLPKVPEL--VFESLLKRHWQKLCEAMK 91 (208)
T ss_pred CChhhchHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH
Confidence 4445556666777776655544 4444444433 34468999999999999998875 22222 566677666555
Q ss_pred HHHHH
Q psy460 94 KISIV 98 (117)
Q Consensus 94 RV~~v 98 (117)
-|...
T Consensus 92 ~i~~~ 96 (208)
T PF14644_consen 92 AIQEE 96 (208)
T ss_pred HHHHH
Confidence 55443
No 109
>PF10846 DUF2722: Protein of unknown function (DUF2722); InterPro: IPR021216 This eukaryotic family of proteins has no known function.
Probab=43.36 E-value=1.2e+02 Score=26.64 Aligned_cols=64 Identities=14% Similarity=0.122 Sum_probs=50.6
Q ss_pred CchHHHHHHHHHHhhhHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHH
Q psy460 22 NPTQDTIAEGLLGLIKPT-----------VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYT 85 (117)
Q Consensus 22 ~p~rd~La~GL~~lL~P~-----------I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yv 85 (117)
++...+.-.||..||+|- ++-|+-|++.=|.=|.-+|.++-..+-||=++......|+++-|++
T Consensus 33 ~~~~~~~~~~l~~LlG~~V~~~p~se~al~eai~LK~EQEKTKQEyyRLE~~~~~lelLr~Al~~gIPp~lIP~L 107 (416)
T PF10846_consen 33 GSNKSISDSLLESLLGPNVSSWPFSEEALIEAIKLKQEQEKTKQEYYRLEQRNKELELLRTALQAGIPPNLIPLL 107 (416)
T ss_pred cccccchHHHHHHHhCCCcccCcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCChhhccee
Confidence 334567778999999874 5678888888888888888888888888888887778888887654
No 110
>PF13600 DUF4140: N-terminal domain of unknown function (DUF4140)
Probab=43.34 E-value=90 Score=21.07 Aligned_cols=34 Identities=26% Similarity=0.367 Sum_probs=24.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLD 69 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~ 69 (117)
..|.+.+|+++++..+.-...++.+++.+..++.
T Consensus 68 ~~~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~ 101 (104)
T PF13600_consen 68 DSPELKELEEELEALEDELAALQDEIQALEAQIA 101 (104)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3567888888888877777777777777766654
No 111
>PF05064 Nsp1_C: Nsp1-like C-terminal region; InterPro: IPR007758 The NSP1-like protein appears to be an essential component of the nuclear pore complex, for example preribosome nuclear export requires the Nup82p-Nup159p-Nsp1p complex. The C-terminal of Nsp1 is involved in binding Nup82 [], probably via coiled-coil formation [, ]. The family is related to the rotavirus nonstructural protein NSP1 which is the least conserved protein in the rotavirus genome. Its function in the replication process is not fully understood.; GO: 0017056 structural constituent of nuclear pore, 0005643 nuclear pore; PDB: 3T97_C.
Probab=43.21 E-value=57 Score=23.23 Aligned_cols=38 Identities=11% Similarity=0.185 Sum_probs=31.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
+.-|..|-..|..+...|.+|..+|+-+..+=..|...
T Consensus 56 ~~~I~~L~~~v~~~~~~Q~~ld~~L~~ie~qQ~eLe~~ 93 (116)
T PF05064_consen 56 GEKISKLYSEVQKAESEQKRLDQELDFIEAQQKELEEL 93 (116)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45678889999999999999999999987776666553
No 112
>PF10368 YkyA: Putative cell-wall binding lipoprotein; InterPro: IPR019454 The YkyA family of proteins contain a lipoprotein signal and a hydrolase domain. They are similar to cell wall binding proteins and might also be recognisable by a host immune defence system. It is thus likely that they function in pathways important for pathogenicity []. ; PDB: 2AP3_A.
Probab=42.64 E-value=89 Score=24.49 Aligned_cols=43 Identities=33% Similarity=0.359 Sum_probs=36.4
Q ss_pred HHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 32 LLGLIK---PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 32 L~~lL~---P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
||.+|+ +..++|+++|..+-.+..++..+.+..++.+.++|+.
T Consensus 148 ly~~L~~~d~~~~~l~~ki~~iN~~y~~~~~~~~~fn~~t~~yN~~ 193 (204)
T PF10368_consen 148 LYEMLKDKDTTQKQLDEKIKAINQSYKEVNKQKEKFNEYTKKYNEE 193 (204)
T ss_dssp HHHHHTSTT--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555553 4789999999999999999999999999999999986
No 113
>PF11853 DUF3373: Protein of unknown function (DUF3373); InterPro: IPR021803 This family of proteins are functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 472 to 574 amino acids in length.
Probab=42.62 E-value=28 Score=31.32 Aligned_cols=24 Identities=42% Similarity=0.632 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHh
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQ 75 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q 75 (117)
..+.++.+|||+|..||.+|.+.+
T Consensus 24 ~~~~~~~qkie~L~kql~~Lk~q~ 47 (489)
T PF11853_consen 24 ADDIDLLQKIEALKKQLEELKAQQ 47 (489)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHhh
Confidence 344444558999999999998753
No 114
>PRK10722 hypothetical protein; Provisional
Probab=42.02 E-value=69 Score=26.53 Aligned_cols=30 Identities=30% Similarity=0.473 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 44 DEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 44 D~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
|+++...++.|.+|+.+++.....|..|.+
T Consensus 175 D~qlD~lrqq~~~Lq~~L~~t~rKLEnLTd 204 (247)
T PRK10722 175 DSELDALRQQQQRLQYQLELTTRKLENLTD 204 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 777777777777777777777777777664
No 115
>PF14282 FlxA: FlxA-like protein
Probab=41.25 E-value=1.2e+02 Score=21.19 Aligned_cols=46 Identities=20% Similarity=0.292 Sum_probs=24.0
Q ss_pred HHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHH
Q psy460 57 LRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 57 L~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~ 104 (117)
|..+|..|.++|..|... ...+-+..-.+.-....-|..+..-|..
T Consensus 24 L~~Qi~~Lq~ql~~l~~~--~~~~~e~k~~q~q~Lq~QI~~LqaQI~q 69 (106)
T PF14282_consen 24 LQKQIKQLQEQLQELSQD--SDLDAEQKQQQIQLLQAQIQQLQAQIAQ 69 (106)
T ss_pred HHHHHHHHHHHHHHHHcc--cCCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455777777777777653 3444444444444444444444444433
No 116
>PF05911 DUF869: Plant protein of unknown function (DUF869); InterPro: IPR008587 This family consists of a number of sequences found in plants. The function of this family is unknown.
Probab=41.09 E-value=1.1e+02 Score=29.08 Aligned_cols=71 Identities=21% Similarity=0.395 Sum_probs=58.8
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhc------------CCCChhHHHHHHHHhhHHHHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQ------------TAVPLDSYTRKLVDAQQKISIVGNILQ 103 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~------------~p~dLd~yvkKL~~ak~RV~~vnniLq 103 (117)
+.--|..|+..++.-|..+.++...+..|..+|.+...... +--+|..-..||..++.=|.++..-|+
T Consensus 678 l~~Ki~~Le~Ele~er~~~~e~~~kc~~Le~el~r~~~~~~~~~~~~~~~k~kqe~EiaaAA~KLAECQeTI~sLGkQLk 757 (769)
T PF05911_consen 678 LQSKISSLEEELEKERALSEELEAKCRELEEELERMKKEESLQQLANEDKKIKQEKEIAAAAEKLAECQETIASLGKQLK 757 (769)
T ss_pred HHHHHHHHHHHHHHHHhcchhhhhHHHHHHHHHHhhhcccchhhccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55678899999999999999999999999999998875311 112566888999999999999999998
Q ss_pred HHH
Q psy460 104 TTQ 106 (117)
Q Consensus 104 ~~Q 106 (117)
...
T Consensus 758 sLa 760 (769)
T PF05911_consen 758 SLA 760 (769)
T ss_pred hcC
Confidence 764
No 117
>KOG3850|consensus
Probab=40.47 E-value=82 Score=28.12 Aligned_cols=57 Identities=23% Similarity=0.198 Sum_probs=30.8
Q ss_pred ccccccccCCCCCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 5 TVSTNTSADDKTEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTS 66 (117)
Q Consensus 5 ~~~~~ts~~~~~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~ 66 (117)
+.++.+|+++++.+.-...+ +. +++==.-+.+.+=++++..+.-|..|.+.+|+|-+
T Consensus 232 ~S~~~gSa~~~~n~~~g~~~----~n-~~~s~~~~l~aileeL~eIk~~q~~Leesye~Lke 288 (455)
T KOG3850|consen 232 SSSRPGSAADETNNVPGHGG----AN-PYHSQGAALDAILEELREIKETQALLEESYERLKE 288 (455)
T ss_pred ccCCCCccccccCCCCCCCC----CC-cccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34678888887665433332 22 22211222444555666666667777766666643
No 118
>CHL00198 accA acetyl-CoA carboxylase carboxyltransferase alpha subunit; Provisional
Probab=40.38 E-value=43 Score=28.44 Aligned_cols=48 Identities=13% Similarity=0.261 Sum_probs=34.2
Q ss_pred HhhhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHhhHHHhcCCCChhHH
Q psy460 34 GLIKPTVDLLDEKFRATRA----SQLELRLQIEALTSQLDKISAEQQTAVPLDSY 84 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~----SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~y 84 (117)
.+=+| |.+|+.++...+. +...+..+|.+|...+.++... ..-+|.||
T Consensus 10 ~fe~~-i~el~~~i~~l~~~~~~~~~~~~~~i~~l~~~~~~~~~~--~~~~l~~w 61 (322)
T CHL00198 10 DFMKP-LAELESQVEELSKLAPKNDKVINNKLKSFQRKLRILKKE--IFYSLTPL 61 (322)
T ss_pred chhhh-HHHHHHHHHHHHhhhcccccCHHHHHHHHHHHHHHHHHH--HHhcCCHH
Confidence 34455 5789999999887 3456899999999988887653 23345555
No 119
>PRK10869 recombination and repair protein; Provisional
Probab=39.97 E-value=2.7e+02 Score=24.89 Aligned_cols=66 Identities=15% Similarity=0.118 Sum_probs=46.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCC---hhHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVP---LDSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~d---Ld~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
..++..++...+..+.+..+++|-|.-++++|.+....|-+ |..-.++|.|+.+-...++.+++..
T Consensus 166 ~~~~~~~l~~l~~~~~~~~~~~d~l~fql~Ei~~~~l~~gE~eeL~~e~~~L~n~e~i~~~~~~~~~~L 234 (553)
T PRK10869 166 WHQSCRDLAQHQQQSQERAARKQLLQYQLKELNEFAPQPGEFEQIDEEYKRLANSGQLLTTSQNALQLL 234 (553)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHhCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34456666777777888889999999999999886544443 4566777777777666666665543
No 120
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=39.84 E-value=3.6e+02 Score=26.17 Aligned_cols=23 Identities=13% Similarity=0.261 Sum_probs=8.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Q psy460 44 DEKFRATRASQLELRLQIEALTS 66 (117)
Q Consensus 44 D~~V~~tr~SQ~eL~~qId~L~~ 66 (117)
..+++..+.-..++..+++.|..
T Consensus 827 ~~ei~~l~~~~~~~~~~~~~l~~ 849 (1163)
T COG1196 827 EQEIEELEEEIEELEEKLDELEE 849 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333333333333333333333
No 121
>PF00170 bZIP_1: bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature; InterPro: IPR011616 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=39.71 E-value=95 Score=19.43 Aligned_cols=33 Identities=21% Similarity=0.446 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
+++|+.+|.....-=..|...++.|..++..|.
T Consensus 28 ~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~ 60 (64)
T PF00170_consen 28 IEELEEKVEELESENEELKKELEQLKKEIQSLK 60 (64)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455555555555555555555555555555543
No 122
>KOG4643|consensus
Probab=39.47 E-value=4e+02 Score=26.79 Aligned_cols=74 Identities=15% Similarity=0.125 Sum_probs=48.4
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 33 LGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
-++.|-...+.+.--.++..-+..+..+ ..+.++++++++. .++.++-....+.++-..|.+|++.+.=+.|+
T Consensus 435 ed~~K~L~~E~ekl~~e~~t~~~s~~rq-~~e~e~~~q~ls~------~~Q~~~et~el~~~iknlnk~L~~r~~elsrl 507 (1195)
T KOG4643|consen 435 EDLEKKLQFELEKLLEETSTVTRSLSRQ-SLENEELDQLLSL------QDQLEAETEELLNQIKNLNKSLNNRDLELSRL 507 (1195)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHH-HHHhHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444444444 4455566666664 67888888889999999999999988777765
Q ss_pred h
Q psy460 113 F 113 (117)
Q Consensus 113 ~ 113 (117)
.
T Consensus 508 ~ 508 (1195)
T KOG4643|consen 508 H 508 (1195)
T ss_pred H
Confidence 3
No 123
>PF08651 DASH_Duo1: DASH complex subunit Duo1; InterPro: IPR013960 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ].
Probab=39.39 E-value=1.2e+02 Score=20.51 Aligned_cols=41 Identities=20% Similarity=0.285 Sum_probs=29.9
Q ss_pred HHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 68 LDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 68 L~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
|++|+.. +..++.-|-.++..+..|+....++..=|++--+
T Consensus 10 Lr~IN~~------ie~~~~~L~~a~~~~~~v~~~~~~t~~LLd~w~~ 50 (78)
T PF08651_consen 10 LRKINPV------IEGLIETLRSAKSNMNRVQETVESTNTLLDKWIR 50 (78)
T ss_pred HHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4666665 7788888888888888888888877766655433
No 124
>PF06694 Plant_NMP1: Plant nuclear matrix protein 1 (NMP1); InterPro: IPR010604 This family consists of several plant specific nuclear matrix protein 1 (NMP1) sequences. Nuclear Matrix Protein 1 is a ubiquitously expressed 36 kDa protein, which has no homologues in animals and fungi, but is highly conserved among flowering and non-flowering plants. NMP1 is located both in the cytoplasm and nucleus and that the nuclear fraction is associated with the nuclear matrix. NMP1 is a candidate for a plant-specific structural protein with a function both in the nucleus and cytoplasm [].
Probab=38.79 E-value=2.6e+02 Score=24.15 Aligned_cols=72 Identities=15% Similarity=0.226 Sum_probs=42.6
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcC--------------------CCChhHHHHHHHHhh
Q psy460 33 LGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQT--------------------AVPLDSYTRKLVDAQ 92 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~--------------------p~dLd~yvkKL~~ak 92 (117)
..+|.-.|+-.++...+.. +-..|.+|...=..-++.|.|.+++ |||.....+++...-
T Consensus 103 ~~~l~~IVDlVeas~~~~n-~e~Sl~eQ~~kD~~LiD~IaE~~~QvFs~~ckLFP~DVqi~S~~~lPD~seLe~~~s~~s 181 (325)
T PF06694_consen 103 AEFLRLIVDLVEASMYADN-PEWSLDEQFAKDIQLIDAIAEKQQQVFSEECKLFPPDVQIQSIYPLPDVSELEKKASELS 181 (325)
T ss_pred HHHHHHHHHHHHHHHhhcC-chhhHHHHHHHHHHHHHHHHHhHHHHHhhhcCcCCHHHhhccCCCCCCHHHHHHHHHHHH
Confidence 3444455666666665555 5556777766656666666655433 556666666666666
Q ss_pred HHHHHHHHHHHHH
Q psy460 93 QKISIVGNILQTT 105 (117)
Q Consensus 93 ~RV~~vnniLq~~ 105 (117)
+++...+..+...
T Consensus 182 k~Lq~lqq~v~~L 194 (325)
T PF06694_consen 182 KQLQSLQQQVAEL 194 (325)
T ss_pred HHHHHHHHHHHHH
Confidence 6655555555443
No 125
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=38.63 E-value=1e+02 Score=19.35 Aligned_cols=27 Identities=22% Similarity=0.343 Sum_probs=13.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEAL 64 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L 64 (117)
--+.+++.++...+.-..+|+.+|+.|
T Consensus 24 ~ei~~l~~~i~~l~~e~~~L~~ei~~l 50 (80)
T PF04977_consen 24 QEIAELQKEIEELKKENEELKEEIERL 50 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 334455555555555555555555555
No 126
>PF02996 Prefoldin: Prefoldin subunit; InterPro: IPR004127 This entry comprises of several prefoldin subunits. Prefoldin (PFD) is a chaperone that interacts exclusively with type II chaperonins, hetero-oligomers lacking an obligate co-chaperonin that are found only in eukaryotes (chaperonin-containing T-complex polypeptide-1 (CCT)) and archaea. Eukaryotic PFD is a multi-subunit complex containing six polypeptides in the molecular mass range of 14-23 kDa. In archaea, on the other hand, PFD is composed of two types of subunits, two alpha and four beta. The six subunits associate to form two back-to-back up-and-down eight-stranded barrels, from which hang six coiled coils. Each subunit contributes one (beta subunits) or two (alpha subunits) beta hairpin turns to the barrels. The coiled coils are formed by the N and C termini of an individual subunit. Overall, this unique arrangement resembles a jellyfish. The eukaryotic PFD hexamer is composed of six different subunits; however, these can be grouped into two alpha-like (PFD3 and -5) and four beta-like (PFD1, -2, -4, and -6) subunits based on amino acid sequence similarity with their archaeal counterparts. Eukaryotic PFD has a six-legged structure similar to that seen in the archaeal homologue [, ]. This family contains the archaeal alpha subunit, eukaryotic prefoldin subunits 3 and 5 and the UXT (ubiquitously expressed transcript) family. Eukaryotic PFD has been shown to bind both actin and tubulin co-translationally. The chaperone then delivers the target protein to CCT, interacting with the chaperonin through the tips of the coiled coils. No authentic target proteins of any archaeal PFD have been identified, to date.; GO: 0051082 unfolded protein binding, 0006457 protein folding, 0016272 prefoldin complex; PDB: 1FXK_C 2ZDI_C.
Probab=38.32 E-value=1.3e+02 Score=20.50 Aligned_cols=38 Identities=11% Similarity=0.392 Sum_probs=27.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+.-|++-++.++...+....+|+.++..+...+..+..
T Consensus 75 ~~eA~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~~~~ 112 (120)
T PF02996_consen 75 LEEAIEFLKKRIKELEEQLEKLEKELAELQAQIEQLEQ 112 (120)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34577778888888877777777777777776666543
No 127
>PF10241 KxDL: Uncharacterized conserved protein; InterPro: IPR019371 This entry represents a conserved region of 80 residues which defines a family of short proteins. There is a characteristic KxDL motif towards the C terminus. The function is unknown.
Probab=37.72 E-value=1.3e+02 Score=20.42 Aligned_cols=63 Identities=10% Similarity=0.102 Sum_probs=36.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhHHHhcC-----CCChhHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy460 47 FRATRASQLELRLQIEALTSQLDKISAEQQT-----AVPLDSYTRKLVDAQQKISIVGNILQTTQVAM 109 (117)
Q Consensus 47 V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~-----p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL 109 (117)
|......|.+..+.++.=.++|..+++.-.. --+|..|.+-|...|+-+-.+-..+..+..++
T Consensus 13 ~~~~l~~Q~~~l~~ln~tn~~L~~~n~~s~~rl~~~~~~f~~~~~~l~~mK~DLd~i~krir~lk~kl 80 (88)
T PF10241_consen 13 LDEILALQAQTLGRLNKTNEELLNLNDLSQQRLAEARERFARHTKLLKEMKKDLDYIFKRIRSLKAKL 80 (88)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4456667777777777777777777764211 22455566666665555544444444444443
No 128
>PF04102 SlyX: SlyX; InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=37.69 E-value=1.1e+02 Score=19.77 Aligned_cols=41 Identities=17% Similarity=0.267 Sum_probs=20.8
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
-+....-+|++|+.-|-.-+.==..|..++..|...|+.+.
T Consensus 12 ~la~qe~~ie~Ln~~v~~Qq~~I~~L~~~l~~L~~rl~~~~ 52 (69)
T PF04102_consen 12 KLAFQEDTIEELNDVVTEQQRQIDRLQRQLRLLRERLRELE 52 (69)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT-----
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34556677777777776655533444445555555554444
No 129
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=37.30 E-value=2e+02 Score=22.55 Aligned_cols=27 Identities=11% Similarity=0.073 Sum_probs=11.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALT 65 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~ 65 (117)
.++++.+.++.+.....+....|+.+.
T Consensus 22 ~~~~~~~~~~~~~~~~~~sQ~~id~~~ 48 (251)
T PF11932_consen 22 TLDQAQQVQQQWVQAAQQSQKRIDQWD 48 (251)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444444444444444444444443
No 130
>PHA03395 p10 fibrous body protein; Provisional
Probab=37.24 E-value=64 Score=22.80 Aligned_cols=35 Identities=31% Similarity=0.446 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhHH
Q psy460 39 TVDLLDEKFRATRASQ---LELRLQIEALTSQLDKISA 73 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ---~eL~~qId~L~~eL~~l~e 73 (117)
=|+.|..+|..++... .+|++++|.+++.|..+..
T Consensus 19 KVdalQ~~V~~l~~nlpdv~~l~~kLdaq~~~Ltti~t 56 (87)
T PHA03395 19 KVDALQAAVDDVRANLPDVTEINEKLDAQSASLDTISS 56 (87)
T ss_pred HHHHHHHHHHHHHhcCCcHHHHHHHHHhHHHHHHHHHH
Confidence 3667777888887665 4788888888888776653
No 131
>KOG0982|consensus
Probab=37.13 E-value=3.2e+02 Score=24.81 Aligned_cols=69 Identities=17% Similarity=0.144 Sum_probs=43.0
Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHH
Q psy460 34 GLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVA 108 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~R 108 (117)
..+.-.+++|+..+-+.|.-=.-|+..+|.|+++..+..+. |+.---.|..-.+--.-.|.+|...|++
T Consensus 300 Enlqmr~qqleeentelRs~~arlksl~dklaee~qr~sd~------LE~lrlql~~eq~l~~rm~d~Lrrfq~e 368 (502)
T KOG0982|consen 300 ENLQMRDQQLEEENTELRSLIARLKSLADKLAEEDQRSSDL------LEALRLQLICEQKLRVRMNDILRRFQEE 368 (502)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34556677777777777777777888888888888777764 4433333333333333455666555543
No 132
>COG1382 GimC Prefoldin, chaperonin cofactor [Posttranslational modification, protein turnover, chaperones]
Probab=36.93 E-value=1.3e+02 Score=22.20 Aligned_cols=35 Identities=17% Similarity=0.310 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
=++.|+-+|......+..|++++++|.++|.+...
T Consensus 78 r~E~Le~ri~tLekQe~~l~e~l~eLq~~i~~~l~ 112 (119)
T COG1382 78 RKETLELRIKTLEKQEEKLQERLEELQSEIQKALG 112 (119)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 34556777788888888899999999888877654
No 133
>COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.83 E-value=1.4e+02 Score=20.45 Aligned_cols=40 Identities=18% Similarity=0.308 Sum_probs=33.5
Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 35 LIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 35 lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
+=.-+|+.|...|-..+..=..++.+++.|.+.|+.+.+.
T Consensus 19 fQE~tieeLn~~laEq~~~i~k~q~qlr~L~~kl~~~~~~ 58 (72)
T COG2900 19 FQEQTIEELNDALAEQQLVIDKLQAQLRLLTEKLKDLQPS 58 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 3456888899999999998889999999999999888763
No 134
>PF09969 DUF2203: Uncharacterized conserved protein (DUF2203); InterPro: IPR018699 This family has no known function.
Probab=36.65 E-value=1.6e+02 Score=21.27 Aligned_cols=16 Identities=31% Similarity=0.644 Sum_probs=11.0
Q ss_pred HHHHHHhhhHHHHHHH
Q psy460 29 AEGLLGLIKPTVDLLD 44 (117)
Q Consensus 29 a~GL~~lL~P~I~~lD 44 (117)
|..++=+++|.++++-
T Consensus 8 A~~lLP~l~~~~~~~~ 23 (120)
T PF09969_consen 8 ANALLPLLRPILEEIR 23 (120)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 4557777777777773
No 135
>PRK10869 recombination and repair protein; Provisional
Probab=36.58 E-value=2.7e+02 Score=24.84 Aligned_cols=31 Identities=19% Similarity=0.217 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+|.++......=.++...++.+..+|+.+.+
T Consensus 259 ~d~~~~~~~~~l~~~~~~l~~~~~~l~~~~~ 289 (553)
T PRK10869 259 MDSKLSGVLDMLEEALIQIQEASDELRHYLD 289 (553)
T ss_pred hCHhHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3555554444444455556666666665554
No 136
>PF13852 DUF4197: Protein of unknown function (DUF4197)
Probab=36.33 E-value=94 Score=24.54 Aligned_cols=53 Identities=26% Similarity=0.302 Sum_probs=37.6
Q ss_pred HHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHH
Q psy460 26 DTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTR 86 (117)
Q Consensus 26 d~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvk 86 (117)
.....-|..-|.|.|++-=+++..++.-+ .|.+....+.-......||+.||-
T Consensus 115 ~~t~~~L~~~f~PiV~~~l~~~g~~~~~~--------~l~~~~~~~~~~~~~~~dL~~yvt 167 (202)
T PF13852_consen 115 RKTSAQLAEAFRPIVKKALEKVGATQYYN--------QLISKYNKIPFVKKVDADLDDYVT 167 (202)
T ss_pred HhhHHHHHHHhccHHHHHHHHhCHHHHHH--------HHHHHHHcCCCCCCCCCCHHHHHH
Confidence 44556689999999999888887776543 444444555544457789999984
No 137
>KOG2196|consensus
Probab=36.08 E-value=2.6e+02 Score=23.36 Aligned_cols=38 Identities=16% Similarity=0.146 Sum_probs=29.5
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
+|-|..|=..|..+..+|+.|-+.++-.....++|...
T Consensus 112 gekI~~Ly~e~~~vk~~qkrLdq~L~~I~sqQ~ELE~~ 149 (254)
T KOG2196|consen 112 GEKISGLYNEVVKVKLDQKRLDQELEFILSQQQELEDL 149 (254)
T ss_pred cHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 57777788888888888888888888777777776554
No 138
>PF10473 CENP-F_leu_zip: Leucine-rich repeats of kinetochore protein Cenp-F/LEK1; InterPro: IPR019513 Cenp-F, a centromeric kinetochore, microtubule-binding protein consisting of two 1,600-amino acid-long coils, is essential for the full functioning of the mitotic checkpoint pathway [, ]. There are several leucine-rich repeats along the sequence of LEK1 that are considered to be zippers, though they do not appear to be binding DNA directly in this instance []. ; GO: 0008134 transcription factor binding, 0042803 protein homodimerization activity, 0045502 dynein binding
Probab=36.05 E-value=1.9e+02 Score=21.77 Aligned_cols=62 Identities=21% Similarity=0.194 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 39 TVDLLDEKFRATRASQLE---LRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~e---L~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
+=+....-+..+-.|+.+ |.++|..++++|+++..+ |.+.+.---.+...|+.-|+|+.-|.
T Consensus 36 ~q~~~e~~~~daEn~k~eie~L~~el~~lt~el~~L~~E-------------L~~l~sEk~~L~k~lq~~q~kv~eLE 100 (140)
T PF10473_consen 36 SQENKECLILDAENSKAEIETLEEELEELTSELNQLELE-------------LDTLRSEKENLDKELQKKQEKVSELE 100 (140)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444455555556665 788888888888888764 45555666667777777777776553
No 139
>TIGR01807 CM_P2 chorismate mutase domain of proteobacterial P-protein, clade 2. This model represents one of two separate clades of the chorismate mutase domain of the gamma and beta and epsilon proteobacterial "P-protein" which contains an N-terminal chorismate mutase domain and a C-terminal prephenate dehydratase domain. It is also found in Aquifex aolicus.
Probab=36.03 E-value=1.2e+02 Score=19.62 Aligned_cols=18 Identities=28% Similarity=0.479 Sum_probs=9.6
Q ss_pred HHHHHHHHHHHHHHhhHH
Q psy460 56 ELRLQIEALTSQLDKISA 73 (117)
Q Consensus 56 eL~~qId~L~~eL~~l~e 73 (117)
+||.+||++..+|=.+..
T Consensus 3 ~lR~~ID~iD~~iv~Ll~ 20 (76)
T TIGR01807 3 ELRNKIDAIDDRILDLLS 20 (76)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 455566666555544443
No 140
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=35.91 E-value=2.9e+02 Score=24.57 Aligned_cols=68 Identities=22% Similarity=0.319 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCC-ChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 45 EKFRATRASQLELRLQIEALTSQLDKISAEQQTAV-PLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 45 ~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~-dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
..+..-..+-.++..+++.+.+.|..+.+.+..-- .+...-+.-..+|.++..+.+.|..++.++.+-
T Consensus 369 ~~i~~~~~~ysel~e~leel~e~leeie~eq~ei~e~l~~Lrk~E~eAr~kL~~~~~~L~~ikr~l~k~ 437 (569)
T PRK04778 369 ERIAEQEIAYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKLERYRNKLHEIKRYLEKS 437 (569)
T ss_pred HHHHcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 33333333444455555555555555554321111 133444445566666666666666666665553
No 141
>PF07028 DUF1319: Protein of unknown function (DUF1319); InterPro: IPR010746 This entry is represented by Commelina yellow mottle virus, Orf1. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family contains a number of viral proteins of unknown function approximately 200 residues long. Family members seem to be restricted to badnaviruses.
Probab=35.82 E-value=1.9e+02 Score=21.71 Aligned_cols=37 Identities=19% Similarity=0.325 Sum_probs=29.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQ 75 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q 75 (117)
.|+.+-.=++.-|....+|...++.|..||..|..+.
T Consensus 47 ~lk~~~ki~~~Qr~~l~~l~~~l~~l~~eL~~Lr~~~ 83 (126)
T PF07028_consen 47 ELKNLSKIQESQRSELKELKQELDVLSKELQALRKEY 83 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566666666667778888999999999999988765
No 142
>cd08063 MPN_CSN6 Mpr1p, Pad1p N-terminal (MPN) domains without catalytic isopeptidase activity, found in COP9 signalosome complex subunit 6. CSN6 (COP9 signalosome subunit 6; COP9 subunit 6; MOV34 homolog, 34 kD) is one of the eight subunits of COP9 signalosome, a highly conserved protein complex with diverse functions, including several important intracellular pathways such as the ubiquitin/proteasome system, DNA repair, cell cycle, developmental changes, and some aspects of immune responses. CSN6 is an MPN-domain protein that directly interacts with the MPN+-domain subunit CSN5. It is cleaved during apoptosis by activated caspases. CSN6 processing occurs in CSN/CRL (cullin-RING Ub ligase) complexes and is followed by the cleavage of Rbx1, the direct interaction partner of CSN6. CSN6 cleavage enhances CSN-mediated deneddylating activity (i.e. cleavage of ubiquitin-like protein Nedd8 (neural precursor cell expressed, developmentally downregulated 8)) in the cullin 1 in cells. The cleav
Probab=35.51 E-value=1.5e+02 Score=24.04 Aligned_cols=56 Identities=13% Similarity=0.115 Sum_probs=41.4
Q ss_pred chHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCC
Q psy460 23 PTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVP 80 (117)
Q Consensus 23 p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~d 80 (117)
-+.+.++.+... ..+....+.+++.....|=..|...|+.+.++|+....- ..|+|
T Consensus 165 Igv~~l~~~~~~-~~~~~~~~~~~l~~~~~al~~L~~rl~~i~~Yl~~V~~g-~~~~d 220 (288)
T cd08063 165 IGVDHVARGGAS-GSSEKSTVAAHLQAQHNAIKMLNSRVELILEYLKAVPVG-EVPPD 220 (288)
T ss_pred eeHHHHHhcCCC-CCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-CCCCC
Confidence 334556554321 255666778889999999999999999999999999873 55665
No 143
>PF11945 WASH_WAHD: WAHD domain of WASH complex; InterPro: IPR021854 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. This entry represents the WASH subunit of the WASH complex. WASH genes duplicated to multiple chromosomal ends during primate evolution, with highest copy number reached in humans, whose WASH repertoires probably vary extensively among individuals []. It is therefore difficult to determine which gene is functional or not. The telomeric region of chromosome 9p is paralogous to the pericentromeric regions of chromosome 9 as well as to 2q. Paralogous regions contain 7 transcriptional units. Duplicated WASH genes are also present in the Xq/Yq pseudoautosomal region, as well as on chromosome 1 and 15. The chromosome 16 copy seems to be a pseudogene.
Probab=35.43 E-value=1.2e+02 Score=25.32 Aligned_cols=34 Identities=6% Similarity=0.166 Sum_probs=27.0
Q ss_pred hhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 81 LDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 81 Ld~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
|.---.|+-..|.|+..||.-+..+|.++.+|.-
T Consensus 38 F~rI~~Rv~~~~~~l~~i~~Ri~~~qaKi~~l~g 71 (297)
T PF11945_consen 38 FSRISARVERNRERLQAIQQRIEVAQAKIEKLQG 71 (297)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 4556677788888888888888888888888753
No 144
>PF04625 DEC-1_N: DEC-1 protein, N-terminal region; InterPro: IPR006719 The defective chorion-1 gene (dec-1) in Drosophila encodes follicle cell proteins necessary for proper eggshell assembly. Multiple products of the dec-1 gene are formed by alternative RNA splicing and proteolytic processing []. Cleavage products include S80 (80 kDa) which is incorporated into the eggshell, and further proteolysis of S80 gives S60 (60 kDa). This domain is present at the N-terminal of these proteins.; GO: 0005213 structural constituent of chorion, 0007304 chorion-containing eggshell formation, 0005576 extracellular region, 0042600 chorion
Probab=35.43 E-value=1.6e+02 Score=25.92 Aligned_cols=23 Identities=22% Similarity=0.387 Sum_probs=16.7
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHh
Q psy460 53 SQLELRLQIEALTSQLDKISAEQ 75 (117)
Q Consensus 53 SQ~eL~~qId~L~~eL~~l~e~q 75 (117)
+-...=+|||+|+.|++.-...+
T Consensus 290 ~EL~~LQqIEr~AkeMRasA~~~ 312 (407)
T PF04625_consen 290 AELKMLQQIERMAKEMRASATAQ 312 (407)
T ss_pred HHHHHHHHHHHHHHHHHhhhccc
Confidence 34445579999999999865444
No 145
>PF05055 DUF677: Protein of unknown function (DUF677); InterPro: IPR007749 This entry contains proteins belonging to the UPF0496 family, found in plants. This family includes AT14A like proteins from Arabidopsis thaliana. At14a contains a small domain that has sequence similarities to integrins from fungi, insects and humans. Transcripts of At14a are found in all Arabidopsis tissues and the protein localises partly to the plasma membrane [].
Probab=34.52 E-value=2.9e+02 Score=23.49 Aligned_cols=71 Identities=15% Similarity=0.233 Sum_probs=51.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-------------------HHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHH
Q psy460 41 DLLDEKFRATRASQLELRLQIEAL-------------------TSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNI 101 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L-------------------~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnni 101 (117)
..|+.-|+.+|.+|.-++.-|..+ -+||.++.. ..+|++-..+......+..+-..+=..
T Consensus 83 ~~L~k~I~~aR~~~~~I~~al~~~~~e~~~~d~g~~~~~~~~tl~eL~~F~~-~~NPFs~~~~~~~F~~i~~~~~~Ll~k 161 (336)
T PF05055_consen 83 EALLKCIHRARDNYLPIRRALKQFEKESLDTDVGVSQKKYDKTLEELKKFKA-AGNPFSDEEFFHQFQSIHDQQSSLLEK 161 (336)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHhhhhccccccccccchhHHHHHHHHHhhhh-cCCCCCchhHHHHHHHHHHHHHHHHHH
Confidence 356788999999998888666544 356666665 367876665777777777777777777
Q ss_pred HHHHHHHHHhh
Q psy460 102 LQTTQVAMHKK 112 (117)
Q Consensus 102 Lq~~Q~RL~rl 112 (117)
|+..+.+++|-
T Consensus 162 L~~~k~Kl~kk 172 (336)
T PF05055_consen 162 LDSRKKKLRKK 172 (336)
T ss_pred HHHHHHHHHHH
Confidence 77777776653
No 146
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=34.51 E-value=2.7e+02 Score=27.57 Aligned_cols=52 Identities=21% Similarity=0.343 Sum_probs=38.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHh
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDA 91 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~a 91 (117)
.|.+++++.++-+.|.-|.++.++.|++.. ++++... ...++.+-.++..++
T Consensus 332 ~~~~~~l~~~IAdlRl~~f~~~q~~~~l~~-i~~~~~~--~~~~~t~~~~~~l~~ 383 (1109)
T PRK10929 332 MPKPQQLDTEMAQLRVQRLRYEDLLNKQPQ-LRQIRQA--DGQPLTAEQNRILDA 383 (1109)
T ss_pred CcccchhHHHHHHHHHHHHHHHHHHHHhhh-hHHHHhh--ccCCCCHHHHHHHHH
Confidence 477899999999999999999999999875 6666543 334454555554443
No 147
>PF12777 MT: Microtubule-binding stalk of dynein motor; InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=34.29 E-value=1.6e+02 Score=24.34 Aligned_cols=55 Identities=13% Similarity=0.167 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 54 QLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 54 Q~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
-.+.+.++.....+|.++.+. |..+-+.+..+.+....+=.-...++.||.|-.+
T Consensus 230 l~~~~~~L~~~~~~l~~l~~~------l~~l~~~~~~~~~e~~~l~~~~~~~~~kl~rA~~ 284 (344)
T PF12777_consen 230 LEEAEEQLAEKQAELAELEEK------LAALQKEYEEAQKEKQELEEEIEETERKLERAEK 284 (344)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHH
Confidence 333344444445555555443 4444444444444444444444555555555443
No 148
>PRK15422 septal ring assembly protein ZapB; Provisional
Probab=34.14 E-value=1.6e+02 Score=20.45 Aligned_cols=35 Identities=20% Similarity=0.312 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
.+.++|++||+.+--.=.-|+..|++|-++=..+.
T Consensus 4 EvleqLE~KIqqAvdtI~LLqmEieELKekn~~L~ 38 (79)
T PRK15422 4 EVFEKLEAKVQQAIDTITLLQMEIEELKEKNNSLS 38 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36789999999888877777777777755433333
No 149
>PRK06835 DNA replication protein DnaC; Validated
Probab=33.99 E-value=1.1e+02 Score=25.50 Aligned_cols=43 Identities=14% Similarity=0.228 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHH--HH-------------HHHHHHHHHHHHHHHHHhhHHHhcCCCC
Q psy460 38 PTVDLLDEKFRAT--RA-------------SQLELRLQIEALTSQLDKISAEQQTAVP 80 (117)
Q Consensus 38 P~I~~lD~~V~~t--r~-------------SQ~eL~~qId~L~~eL~~l~e~q~~p~d 80 (117)
|-|.++|+.|..+ +. .=.+|++.|..|.++-.++.....-|.|
T Consensus 36 P~~~~id~~i~~~~~~~~~~~l~~~~~~~~~~~~l~~~~~~l~~~~~~lL~~~g~~~d 93 (329)
T PRK06835 36 PEIAEIDDEIAKLGIKLSRAILKNPDKKEETLKELKEKITDLRVKKAELLVSNGYPPD 93 (329)
T ss_pred ccHHHHHHHHHHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHHHHHHHHcCCChh
Confidence 8899999988765 22 2246666777777776666665344433
No 150
>PF06148 COG2: COG (conserved oligomeric Golgi) complex component, COG2; InterPro: IPR024602 This entry represents the uncharacterised N-terminal domain of subunit 2 of the COG complex. The COG complex comprises eight proteins COG1-8 and plays critical roles in Golgi structure and function [].; PDB: 2JQQ_A.
Probab=33.83 E-value=62 Score=23.02 Aligned_cols=19 Identities=21% Similarity=0.373 Sum_probs=5.8
Q ss_pred HHHHHHHHHHHHHHHHHHH
Q psy460 46 KFRATRASQLELRLQIEAL 64 (117)
Q Consensus 46 ~V~~tr~SQ~eL~~qId~L 64 (117)
.|...+..-..++.++...
T Consensus 70 ~i~~l~~~L~~~~~~v~~~ 88 (133)
T PF06148_consen 70 KIEELRKPLSQFREEVESV 88 (133)
T ss_dssp ----HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3333333333333343333
No 151
>TIGR01791 CM_archaeal chorismate mutase, archaeal type. This model represents a clade of archaeal chorismate mutases. Chorismate mutase catalyzes the conversion of chorismate into prephenate which is subsequently converted into either phenylalanine or tyrosine. In Sulfolobus this gene is found as a fusion with prephenate dehydrogenase (although the non-TIGR annotation contains a typographical error indicating it as a dehydratase) which is the next enzyme in the tyrosine biosynthesis pathway. The Archaeoglobus gene contains an N-terminal prephenate dehydrogenase domain and a C-terminal prephenate dehydratase domain followed by a regulatory amino acid-binding ACT domain. The Thermoplasma volcanium gene is adjacent to prephenate dehydratase.
Probab=33.82 E-value=1.3e+02 Score=19.61 Aligned_cols=18 Identities=28% Similarity=0.434 Sum_probs=10.7
Q ss_pred HHHHHHHHHHHHHHhhHH
Q psy460 56 ELRLQIEALTSQLDKISA 73 (117)
Q Consensus 56 eL~~qId~L~~eL~~l~e 73 (117)
+||.+||.+..+|-.+..
T Consensus 3 ~lR~~Id~iD~~i~~Ll~ 20 (83)
T TIGR01791 3 ELRQEIEEIDKSILDLIE 20 (83)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 466666666666555543
No 152
>PF08317 Spc7: Spc7 kinetochore protein; InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=33.23 E-value=2.8e+02 Score=22.87 Aligned_cols=28 Identities=21% Similarity=0.334 Sum_probs=12.2
Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHH
Q psy460 28 IAEGLLGLIKPTVDLLDEKFRATRASQL 55 (117)
Q Consensus 28 La~GL~~lL~P~I~~lD~~V~~tr~SQ~ 55 (117)
+-+||...|.--++.|+.-...+.....
T Consensus 146 ll~gl~~~L~~~~~~L~~D~~~L~~~~~ 173 (325)
T PF08317_consen 146 LLEGLKEGLEENLELLQEDYAKLDKQLE 173 (325)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455554444444444444433333333
No 153
>PF10498 IFT57: Intra-flagellar transport protein 57 ; InterPro: IPR019530 Eukaryotic cilia and flagella are specialised organelles found at the periphery of cells of diverse organisms. Intra-flagellar transport (IFT) is required for the assembly and maintenance of eukaryotic cilia and flagella, and consists of the bi-directional movement of large protein particles between the base and the distal tip of the organelle. IFT particles contain multiple copies of two distinct protein complexes, A and B, which contain at least 6 and 11 protein subunits. IFT57 is part of complex B but is not, however, required for the core subunits to stay associated []. This protein is known as Huntington-interacting protein-1 in humans.
Probab=33.11 E-value=3.1e+02 Score=23.45 Aligned_cols=66 Identities=14% Similarity=0.130 Sum_probs=44.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 42 LLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 42 ~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
++.........-.+.|+.|++.|.++.+..... |..--.+...+-..|+...+.|..+.+.|+++-
T Consensus 249 ~i~~~lekI~sREk~iN~qle~l~~eYr~~~~~------ls~~~~~y~~~s~~V~~~t~~L~~IseeLe~vK 314 (359)
T PF10498_consen 249 DISKTLEKIESREKYINNQLEPLIQEYRSAQDE------LSEVQEKYKQASEGVSERTRELAEISEELEQVK 314 (359)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH------HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Confidence 333444445555667777777777777777664 666666777777777777777777777777654
No 154
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=33.01 E-value=2.6e+02 Score=22.56 Aligned_cols=32 Identities=25% Similarity=0.397 Sum_probs=13.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQL 68 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL 68 (117)
.--+..++.++...+..-..++.+++.+..++
T Consensus 143 ~~~~~~l~~~i~~~~~~i~~~~~~l~~~~~~l 174 (423)
T TIGR01843 143 RAQLELILAQIKQLEAELAGLQAQLQALRQQL 174 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444444444444444444444333
No 155
>PRK07857 hypothetical protein; Provisional
Probab=32.82 E-value=1.3e+02 Score=21.77 Aligned_cols=17 Identities=24% Similarity=0.352 Sum_probs=9.6
Q ss_pred hhHHHHHHHHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRA 52 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~ 52 (117)
++--|+++|.++-..-.
T Consensus 33 lR~eID~ID~eIl~LL~ 49 (106)
T PRK07857 33 LREEIDRLDAEILALVK 49 (106)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 55556666666555544
No 156
>PRK07737 fliD flagellar capping protein; Validated
Probab=32.67 E-value=3.5e+02 Score=23.90 Aligned_cols=47 Identities=19% Similarity=0.345 Sum_probs=21.4
Q ss_pred HHHHHHHHhhhHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 27 TIAEGLLGLIKPTVDLLDEKFR--ATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 27 ~La~GL~~lL~P~I~~lD~~V~--~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
-|+.-|..++.|.+..++.+.- .....+.-|..+|+++..+..++..
T Consensus 414 Gia~~l~~~l~~~~~~~~~~~g~g~~~~~~~~l~~~i~~l~~~i~~~~~ 462 (501)
T PRK07737 414 GIARRLRDTLKETIKSIEQKAGNTTMTNQQFAIGKDLNQIETQIDRFQD 462 (501)
T ss_pred cHHHHHHHHHHHHHhhhhhhcCCccccchhHHHHHHHHHHHHHHHHHHH
Confidence 4555566666666654332221 1112333444455555554444443
No 157
>PF12451 VPS11_C: Vacuolar protein sorting protein 11 C terminal; InterPro: IPR024763 Vps 11 is one of the evolutionarily conserved class C vacuolar protein sorting genes (c-vps: vps11, vps16, vps18, and vps33), whose products physically associate to form the c-vps protein complex required for vesicle docking and fusion. This entry represents the C-terminal domain of vps11.
Probab=32.59 E-value=92 Score=19.19 Aligned_cols=25 Identities=32% Similarity=0.296 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 45 EKFRATRASQLELRLQIEALTSQLD 69 (117)
Q Consensus 45 ~~V~~tr~SQ~eL~~qId~L~~eL~ 69 (117)
..|++.+.+|.+...+-|.+-.+|.
T Consensus 5 ~~i~~~~r~q~~~~~~~d~F~~~L~ 29 (49)
T PF12451_consen 5 ETIREIRRAQEESADQHDLFFKQLE 29 (49)
T ss_pred HHHHHHHHHHHHHhhcHHHHHHHHH
Confidence 4688999999999999999999883
No 158
>TIGR00634 recN DNA repair protein RecN. All proteins in this family for which functions are known are ATP binding proteins involved in the initiation of recombination and recombinational repair.
Probab=32.58 E-value=3.5e+02 Score=23.89 Aligned_cols=30 Identities=17% Similarity=0.148 Sum_probs=14.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 44 DEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 44 D~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
|.++......=.+...+++.+..+|+++.+
T Consensus 265 d~~~~~~~~~l~~~~~~l~d~~~~l~~~~~ 294 (563)
T TIGR00634 265 DGSLRELAEQVGNALTEVEEATRELQNYLD 294 (563)
T ss_pred hHhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444333333333455555666665554
No 159
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=32.57 E-value=2e+02 Score=28.50 Aligned_cols=59 Identities=19% Similarity=0.226 Sum_probs=36.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~ 104 (117)
..+|+.+...+..-..+++.++.+....++.+....++-| ++..++++|...+++.|++
T Consensus 104 ~~~Leq~l~~~~~~L~~~q~~l~~~~~~~~~~~~~l~~~p------q~~~~~~~~l~~i~~~L~~ 162 (1109)
T PRK10929 104 TDALEQEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLP------QQQTEARRQLNEIERRLQT 162 (1109)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHhhch------hhHHHHHHHHHHHHHHHhC
Confidence 3666666666666666666666666666655554322222 4557788888888876655
No 160
>PHA02675 ORF104 fusion protein; Provisional
Probab=32.55 E-value=99 Score=21.99 Aligned_cols=15 Identities=27% Similarity=0.678 Sum_probs=6.5
Q ss_pred CCchHHHHHHHHHHh
Q psy460 21 DNPTQDTIAEGLLGL 35 (117)
Q Consensus 21 ~~p~rd~La~GL~~l 35 (117)
..|++..+-+-|..|
T Consensus 28 g~~~~esle~RL~~L 42 (90)
T PHA02675 28 GAPSKESVEERLVSL 42 (90)
T ss_pred CCCcHHHHHHHHHHH
Confidence 344444444444333
No 161
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=32.32 E-value=1.8e+02 Score=20.45 Aligned_cols=43 Identities=9% Similarity=0.193 Sum_probs=34.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHH
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKL 88 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL 88 (117)
...++..++.+.++.-.+|+++-+.|..++.++... ..|+.++
T Consensus 28 ~~~~l~~q~~~~~~e~~~l~~~n~~L~~eI~~L~~~-------~dyiEe~ 70 (105)
T PRK00888 28 DYWRVNDQVAAQQQTNAKLKARNDQLFAEIDDLKGG-------QEAIEER 70 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCc-------HHHHHHH
Confidence 577888899998888888999999999998888652 2577776
No 162
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=32.25 E-value=3.2e+02 Score=23.26 Aligned_cols=16 Identities=13% Similarity=0.133 Sum_probs=11.2
Q ss_pred HHHHHHHHHHHHHHHH
Q psy460 38 PTVDLLDEKFRATRAS 53 (117)
Q Consensus 38 P~I~~lD~~V~~tr~S 53 (117)
|.|..++.++.+++..
T Consensus 275 P~v~~l~~qi~~l~~~ 290 (498)
T TIGR03007 275 PDVIATKREIAQLEEQ 290 (498)
T ss_pred hHHHHHHHHHHHHHHH
Confidence 7777777777666654
No 163
>TIGR01801 CM_A chorismate mutase domain of gram positive AroA protein. This model represents a small clade of chorismate mutase domains N-terminally fused to the first enzyme in the chorismate pathway, 2-dehydro-3-deoxyphosphoheptanoate aldolase (DAHP synthetase, AroA) which are found in some gram positive species and Deinococcus. Only in Deinococcus, where this domain is the sole CM domain in the genome can a trusted assignment of function be made. In the other species there is at least one other trusted CM domain present. The similarity between the Deinococcus gene and the others in this clade is sufficiently strong (~44% identity), that the whole clade can be trusted to be functional. The possibility exists, however, that in the gram positive species the fusion to the first enzyme in the pathway has evolved a separate, regulatory role.
Probab=32.15 E-value=1.5e+02 Score=20.81 Aligned_cols=15 Identities=27% Similarity=0.512 Sum_probs=6.8
Q ss_pred HHHHHHHHHHHHHhh
Q psy460 57 LRLQIEALTSQLDKI 71 (117)
Q Consensus 57 L~~qId~L~~eL~~l 71 (117)
||.+||++..+|=++
T Consensus 9 lR~~ID~ID~eIl~L 23 (102)
T TIGR01801 9 LRAEVDQLNRQILAL 23 (102)
T ss_pred HHHHHHHHHHHHHHH
Confidence 444444444444433
No 164
>PF12221 HflK_N: Bacterial membrane protein N terminal; InterPro: IPR020980 HflK is a bacterial membrane protein which is thought, together with the HflC protein, to form a membrane protease complex whose activity is modulated by the GTPase HflX []. This entry represents the N-terminal, membrane-spanning, region of of HflK responsible for anchoring the protein in the bacterial membrane. It is often found in association with PF01145 from PFAM.
Probab=31.99 E-value=42 Score=20.47 Aligned_cols=15 Identities=47% Similarity=0.472 Sum_probs=12.1
Q ss_pred cCCCChhHHHHHHHH
Q psy460 76 QTAVPLDSYTRKLVD 90 (117)
Q Consensus 76 ~~p~dLd~yvkKL~~ 90 (117)
+.|+|||..+++|.+
T Consensus 18 ~gPPDLdel~r~l~~ 32 (42)
T PF12221_consen 18 QGPPDLDELFRKLQD 32 (42)
T ss_pred CCCCCHHHHHHHHHH
Confidence 469999998888763
No 165
>PF02388 FemAB: FemAB family; InterPro: IPR003447 The femAB operon codes for two nearly identical approximately 50kDa proteins involved in the formation of the Staphylococcal pentaglycine interpeptide bridge in peptidoglycan []. These proteins are also considered as a factor influencing the level of methicillin resistance [].; GO: 0016755 transferase activity, transferring amino-acyl groups; PDB: 1XE4_A 1NE9_A 3GKR_A 1XIX_A 1P4N_A 1XF8_A 1LRZ_A.
Probab=31.98 E-value=1.8e+02 Score=24.73 Aligned_cols=50 Identities=12% Similarity=0.226 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHHHHHHhhHHH-hcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 54 QLELRLQIEALTSQLDKISAE-QQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 54 Q~eL~~qId~L~~eL~~l~e~-q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
...|.++|+.+..++.++.+. ...| ..++++..+..-|...+.|+..+..
T Consensus 244 ~~~l~~~~~~~~~~i~~l~~~l~~~~-----------k~~~k~~~~~~q~~~~~k~~~~~~~ 294 (406)
T PF02388_consen 244 LESLQEKLEKLEKEIEKLEEKLEKNP-----------KKKNKLKELEEQLASLEKRIEEAEE 294 (406)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH-T-----------HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCc-----------chhhHHHHHHHHHHHHHHHHHHHHH
Confidence 445566666666666666553 1233 3445555555555555555555443
No 166
>PF04375 HemX: HemX; InterPro: IPR007470 The majority of proteins in this family are annotated as uroporphyrin-III C-methyltransferase (2.1.1.107 from EC) []; however, there is no direct evidence to support this annotation for these proteins, which come from mainly pathogenic Gram-negative organisms. There is some evidence to suggest that the proteins are membrane anchored as they have a predicted N-terminal signal peptide and transmembrane domain and may be involved in haem transport [].
Probab=31.67 E-value=3.2e+02 Score=23.07 Aligned_cols=11 Identities=18% Similarity=0.265 Sum_probs=5.2
Q ss_pred HHHHHHHHHhh
Q psy460 102 LQTTQVAMHKK 112 (117)
Q Consensus 102 Lq~~Q~RL~rl 112 (117)
|+++-.||..+
T Consensus 156 L~~AD~rLa~~ 166 (372)
T PF04375_consen 156 LQSADQRLAEL 166 (372)
T ss_pred HHHHHHHHHhc
Confidence 44444444443
No 167
>KOG3436|consensus
Probab=31.39 E-value=2.3e+02 Score=21.28 Aligned_cols=52 Identities=17% Similarity=0.194 Sum_probs=33.2
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHH
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~ 104 (117)
.|+.+|..|++.|-.||..|.- +...=+-..-..|+--+|+-|.-|=.+...
T Consensus 12 ~~ke~L~~ql~dLK~ELa~LRv-~K~tgg~~~klskik~vrKsiArvLTVine 63 (123)
T KOG3436|consen 12 KSKEQLLKQLDDLKVELAQLRV-AKVTGGAASKLSKIKVVRKSIARVLTVINE 63 (123)
T ss_pred hHHHHHHHHHHHHHHHHHHHHH-HHhhccchhhhHHHHHHHHHHHHHHHHhhh
Confidence 4889999999999999999875 333333333444555555655444444433
No 168
>PLN03246 26S proteasome regulatory subunit; Provisional
Probab=31.15 E-value=2.2e+02 Score=23.76 Aligned_cols=41 Identities=17% Similarity=0.205 Sum_probs=31.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCC
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVP 80 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~d 80 (117)
+...+-.++.+...|=+.|...|+.+..+|+...+. ..|+|
T Consensus 182 ~~s~l~~~l~~~~~al~~L~~rl~~i~~Yl~~V~~g-~~~~d 222 (303)
T PLN03246 182 TVSTLATEVTGKLTALKGLDARLREIRSYLDLVVEG-KLPLN 222 (303)
T ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC-CCCCC
Confidence 344555666677777788999999999999998773 56665
No 169
>COG5509 Uncharacterized small protein containing a coiled-coil domain [Function unknown]
Probab=30.82 E-value=1.1e+02 Score=20.60 Aligned_cols=25 Identities=32% Similarity=0.426 Sum_probs=19.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l 71 (117)
|.+|+++|-..+ ..|++|..||.+=
T Consensus 27 V~El~eRIalLq-------~EIeRlkAe~~kK 51 (65)
T COG5509 27 VAELEERIALLQ-------AEIERLKAELAKK 51 (65)
T ss_pred HHHHHHHHHHHH-------HHHHHHHHHHHhh
Confidence 788999887654 4899998887653
No 170
>PRK04325 hypothetical protein; Provisional
Probab=30.74 E-value=1.6e+02 Score=19.49 Aligned_cols=41 Identities=17% Similarity=0.183 Sum_probs=23.0
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
-+....-+|++|+.-|-.-+.-=..|..++..|...|+.+.
T Consensus 17 klAfQE~tIe~LN~vv~~Qq~~I~~L~~ql~~L~~rl~~~~ 57 (74)
T PRK04325 17 QLAFQEDLIDGLNATVARQQQTLDLLQAQLRLLYQQMRDAN 57 (74)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34566677777777665544422445555555555555543
No 171
>PRK09039 hypothetical protein; Validated
Probab=30.68 E-value=3.2e+02 Score=22.89 Aligned_cols=72 Identities=15% Similarity=0.155 Sum_probs=33.9
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhH-HHHHHHHHHHHHHHHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQ-KISIVGNILQTTQVAM 109 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~-RV~~vnniLq~~Q~RL 109 (117)
.++=-.|-|..|-.+|.+.|.....|..+|+...+..+..... ++.+-++|.++.. ||..+...=.+.-.||
T Consensus 131 ~~se~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~~~~~~~------i~~L~~~L~~a~~~~~~~l~~~~~~~~~~l 203 (343)
T PRK09039 131 VSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAK------IADLGRRLNVALAQRVQELNRYRSEFFGRL 203 (343)
T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3444455566666666655555445555554444443333221 4444455555542 2444444444443333
No 172
>PF05478 Prominin: Prominin; InterPro: IPR008795 The prominins are an emerging family of proteins that, among the multispan membrane proteins, display a novel topology. Mouse and Homo sapiens prominin and (Mus musculus) prominin-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain []. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration [].; GO: 0016021 integral to membrane
Probab=30.60 E-value=4.5e+02 Score=24.53 Aligned_cols=57 Identities=12% Similarity=0.151 Sum_probs=34.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH----------------------hcCCCChhHHHHHHHHhh
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE----------------------QQTAVPLDSYTRKLVDAQ 92 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~----------------------q~~p~dLd~yvkKL~~ak 92 (117)
.+..+..+++.+...|..+.+|+..+......|...... ..+-|++++|+..+.++-
T Consensus 251 ~~~~L~~v~~~~~~L~~~~~qL~~~L~~vK~~L~~~l~~~C~~~~C~~i~~~~~~l~l~~~~~qLP~v~~~l~~l~~v~ 329 (806)
T PF05478_consen 251 TKELLQNVNSSLKDLQEYQSQLRDGLRGVKRDLNNTLQDLCTNRECNSILSSLDILQLDADFSQLPNVTSQLNNLEEVI 329 (806)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCChhhHHHHHhccccccCCCcccCCChHHHHHHHHHHH
Confidence 344555666777777777777777775544443333221 124567788888887653
No 173
>PRK07248 hypothetical protein; Provisional
Probab=30.23 E-value=1.7e+02 Score=19.42 Aligned_cols=28 Identities=25% Similarity=0.395 Sum_probs=14.5
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEA 63 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~ 63 (117)
++--|+++|.++-..-.=-.++..+|-.
T Consensus 6 lR~~ID~iD~~i~~Ll~~R~~l~~~I~~ 33 (87)
T PRK07248 6 IRQEIDQIDDQLVALLEKRMALVEQVVA 33 (87)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445566666665555544444444433
No 174
>cd08062 MPN_RPN7_8 Mpr1p, Pad1p N-terminal (MPN) domains without catalytic isopeptidase activity, found in 19S proteasomal subunits Rpn7 and Rpn8. This family includes lid subunits of the 26 S proteasome regulatory particles, Rpn7 (PSMD7; proteasome 26S non-ATPase subunit 7; p44), and Rpn8 (PSMD8; proteasome 26S non-ATPase subunit 8; p40; Mov34). Rpn7 is known to be critical for the integrity of the 26 S proteasome complex by establishing a correct lid structure. It is necessary for the incorporation/anchoring of Rpn3 and Rpn12 to the lid and essential for viability and normal mitosis. Rpn7 and Rpn8 are ATP-independent components of the 19S regulator subunit, and contain the MPN structural motif on its N-terminal region. However, while they show a typical MPN metalloprotease fold, they lack the canonical JAMM motif, and therefore do not show catalytic isopeptidase activity. It is suggested that Rpn7 function is primarily structural.
Probab=30.20 E-value=2.6e+02 Score=22.88 Aligned_cols=38 Identities=11% Similarity=0.161 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCC
Q psy460 42 LLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVP 80 (117)
Q Consensus 42 ~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~d 80 (117)
.+-.++.....|=+.|...|+.+..+|+...+. ..|+|
T Consensus 179 ~l~~~l~~~~~al~~L~~~l~~i~~Yl~~V~~g-~~~~d 216 (280)
T cd08062 179 TLSTRVTNKLNSLKGLQSKLKEIKDYLQLVVEG-KLPIN 216 (280)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCC
Confidence 344566666666688999999999999998763 56666
No 175
>PRK06664 fliD flagellar hook-associated protein FliD; Validated
Probab=30.13 E-value=1.8e+02 Score=26.96 Aligned_cols=44 Identities=18% Similarity=0.271 Sum_probs=19.9
Q ss_pred HHHHHHhhhHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 29 AEGLLGLIKPTVD---LLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 29 a~GL~~lL~P~I~---~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
|.-|..+++|.++ -+..+.......-..+..+|+++...|.++.
T Consensus 581 a~~l~~~l~~~t~~~G~i~~r~~~l~~~i~~l~~~i~~~e~rl~~~e 627 (661)
T PRK06664 581 AKMLLEYLSPYTQAGGIIYNKVKGLDERIADNNKKIEEYEKKLESKE 627 (661)
T ss_pred HHHHHHHHHHHHcCCCceehHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334445555441 2334444444444445555555555444443
No 176
>PF06840 DUF1241: Protein of unknown function (DUF1241); InterPro: IPR009652 This family consists of several programmed cell death 10 protein (PDCD10 or TFAR15) sequences. The function of this family is unknown.; PDB: 3L8I_A 3RQG_B 3RQE_B 3L8J_A 3RQF_B 3AJM_B.
Probab=30.07 E-value=2.6e+02 Score=21.51 Aligned_cols=57 Identities=21% Similarity=0.338 Sum_probs=39.5
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHH
Q psy460 31 GLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVG 99 (117)
Q Consensus 31 GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vn 99 (117)
-|-.++.|++++|..+ ..- .+++|-..+.+.... .|==.+.+|+.|+.--..-+.++
T Consensus 12 ~L~~li~Pvl~eL~~~--d~~--------A~q~Lr~Af~kAE~~--~PGft~d~v~~ll~~~~~~vnl~ 68 (154)
T PF06840_consen 12 ALQCLIRPVLDELEQK--DSD--------AIQTLRAAFTKAEKS--SPGFTDDFVKGLLERSKLNVNLT 68 (154)
T ss_dssp HHHHTHHHHHHHHHTT--HHH--------HHHHHHHHHHHHHHH--STTHHHHHHHHHHHTHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHh--hHH--------HHHHHHHHHHHHHHh--CCCcHHHHHHHHHHhcCCCCChh
Confidence 3678999999999883 222 667777777777553 55555899999887664444444
No 177
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.92 E-value=1.9e+02 Score=20.00 Aligned_cols=64 Identities=14% Similarity=0.181 Sum_probs=41.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHH----HHHHHHhh
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQT----TQVAMHKK 112 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~----~Q~RL~rl 112 (117)
+.++|+.+|+.+-..=.-|...|++|.+.=..+. .-+..+..+|.-+..=|..|++ -|+||..|
T Consensus 5 v~ekLE~KiqqAvdTI~LLQmEieELKEknn~l~----------~e~q~~q~~reaL~~eneqlk~e~~~WQerlrsL 72 (79)
T COG3074 5 VFEKLEAKVQQAIDTITLLQMEIEELKEKNNSLS----------QEVQNAQHQREALERENEQLKEEQNGWQERLRAL 72 (79)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5788999998888777777788887765433333 3344455555556666666654 56777654
No 178
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=29.48 E-value=1.9e+02 Score=19.75 Aligned_cols=39 Identities=18% Similarity=0.358 Sum_probs=32.6
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy460 33 LGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l 71 (117)
...|+--++.++.++......-..++.++..+...|.++
T Consensus 89 ~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~~l~~~ 127 (129)
T cd00890 89 IEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEELQQL 127 (129)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 467888888888888888888888888999888887764
No 179
>PF01608 I_LWEQ: I/LWEQ domain; InterPro: IPR002558 I/LWEQ domains bind to actin. It has been shown that the I/LWEQ domains from mouse talin P26039 from SWISSPROT and yeast Sla2p P33338 from SWISSPROT interact with F-actin []. The domain has four conserved blocks, the name of the domain is derived from the initial conserved amino acid of each of the four blocks []. I/LWEQ domains can be placed into four major groups based on sequence similarity: Metazoan talin. Dictyostelium discoideum (Slime mould) TalA/TalB P54633 from SWISSPROT and SLA110. Metazoan Hip1p O00291 from SWISSPROT. Saccharomyces cerevisiae Sla2p P33338 from SWISSPROT. ; GO: 0003779 actin binding; PDB: 2QDQ_A 2JSW_A 1R0D_B.
Probab=29.27 E-value=2.6e+02 Score=21.34 Aligned_cols=77 Identities=18% Similarity=0.185 Sum_probs=43.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh-cCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhh
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQ-QTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFFM 115 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q-~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~~ 115 (117)
|+-.+|+..-+++..+-..|=.+.-.-...... .+.. -....+...-+..++++-+|..+=+-|+..+.||..|-++
T Consensus 70 ~~~~~L~~As~~V~~At~~LV~av~~~~~~~~~-~~~~d~s~ls~~~~k~~eMe~Qv~iL~lE~eLe~ar~kL~~lRk~ 147 (152)
T PF01608_consen 70 KTQDRLEQASKAVKKATENLVAAVKAAIEQEEE-QEEVDFSKLSLHQAKRQEMEAQVRILKLEKELEKARKKLAELRKA 147 (152)
T ss_dssp CCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH-HHH----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh-hcccchhhhcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444555555555555444443333332222222 1110 1223455788889999999999999999999999998764
No 180
>PRK11281 hypothetical protein; Provisional
Probab=29.25 E-value=4.5e+02 Score=26.06 Aligned_cols=58 Identities=22% Similarity=0.367 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHhhHHHhcCCCChhH--HHHHHHHhhHHH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEAL---TSQLDKISAEQQTAVPLDS--YTRKLVDAQQKI 95 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L---~~eL~~l~e~q~~p~dLd~--yvkKL~~ak~RV 95 (117)
|.++.++.++-++|..|-++.++-|+| ..+++++...+..++.-.. -..+|...|+.+
T Consensus 353 ~~~~~l~~~iAdlrl~~f~~~q~~~~l~~~~~y~~~l~~~~~~~~~~~~~~~l~~ll~~r~~L 415 (1113)
T PRK11281 353 DLIEGLADRIADLRLEQFEINQQRDALFQPDAYIDKLEAGHKSEVTDEVRDALLQLLDERREL 415 (1113)
T ss_pred cccchHHHHHHHHHHHHHHHHHHHHHhcChhhHHHHHHhcccCCCCHHHHHHHHHHHHHHHHH
Confidence 567889999999999999999988877 4566676543333332222 244677777665
No 181
>PF06103 DUF948: Bacterial protein of unknown function (DUF948); InterPro: IPR009293 This family consists of bacterial sequences several of which are thought to be general stress proteins.
Probab=29.17 E-value=1.7e+02 Score=19.26 Aligned_cols=38 Identities=13% Similarity=0.225 Sum_probs=24.7
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
++++++++...+...+.-=..+..+++.+..+-.++.+
T Consensus 24 l~~~l~~~~~ti~~l~~~~~~i~~e~~~ll~~~n~l~~ 61 (90)
T PF06103_consen 24 LKKTLDEVNKTIDTLQEQVDPITKEINDLLHNTNELLE 61 (90)
T ss_pred HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHH
Confidence 46667777766666666666666666666666666655
No 182
>KOG0396|consensus
Probab=28.97 E-value=4.1e+02 Score=23.50 Aligned_cols=80 Identities=14% Similarity=0.145 Sum_probs=44.8
Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCCh-------hHHHHHHHHhhHHHHHHHH----H
Q psy460 33 LGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPL-------DSYTRKLVDAQQKISIVGN----I 101 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL-------d~yvkKL~~ak~RV~~vnn----i 101 (117)
+-+|+|-.+.+-.+++.++++=..=-+.+-.-.++|.+....+..|. + +.-|.|+...|+++.-... +
T Consensus 9 y~l~ripye~l~kr~r~~qk~i~re~~~v~~~~~~l~~~~~sn~~~~-~d~~~~~id~Li~kv~~~krk~e~~iq~e~~~ 87 (389)
T KOG0396|consen 9 YQLFRIPYELLNKRIRHNQKVIDRETSHVLMVVAELQETLISNIVPH-LDSTVSLIDRLIRKVQCLKRKLEEYIQSEEEQ 87 (389)
T ss_pred chhhcCCHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHhcccChh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35778888888888887776443333333334555555555443444 3 3445566667776655444 3
Q ss_pred HHHHHHHHHhhh
Q psy460 102 LQTTQVAMHKKF 113 (117)
Q Consensus 102 Lq~~Q~RL~rl~ 113 (117)
++-+.-|++.+.
T Consensus 88 ~~~iksRid~m~ 99 (389)
T KOG0396|consen 88 LKRIKSRIDFMH 99 (389)
T ss_pred HHHHHHHHHHHH
Confidence 444444444443
No 183
>PF09006 Surfac_D-trimer: Lung surfactant protein D coiled-coil trimerisation; InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=28.80 E-value=1.6e+02 Score=18.60 Aligned_cols=19 Identities=42% Similarity=0.646 Sum_probs=14.1
Q ss_pred HHHHHHHHHHHHHHhhHHH
Q psy460 56 ELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 56 eL~~qId~L~~eL~~l~e~ 74 (117)
.|++|++.|..+++.|-..
T Consensus 3 aLrqQv~aL~~qv~~Lq~~ 21 (46)
T PF09006_consen 3 ALRQQVEALQGQVQRLQAA 21 (46)
T ss_dssp HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 5777888888888777654
No 184
>KOG0018|consensus
Probab=28.66 E-value=4.3e+02 Score=26.54 Aligned_cols=67 Identities=13% Similarity=0.276 Sum_probs=41.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
|.-|+.++.-.+.==..++-+++.+-.||.++... ++-+-.++.+.++++-..-+-.++.++|++++
T Consensus 685 i~~le~~~~~~~~~~~~~k~~l~~~~~El~~~~~~------i~~~~p~i~~i~r~l~~~e~~~~~L~~~~n~v 751 (1141)
T KOG0018|consen 685 IHGLEMRLKYSKLDLEQLKRSLEQNELELQRTESE------IDEFGPEISEIKRKLQNREGEMKELEERMNKV 751 (1141)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHhhCchHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444333333334455677777777777664 44555566777777777777777777777665
No 185
>PF10359 Fmp27_WPPW: RNA pol II promoter Fmp27 protein domain; InterPro: IPR019449 The function of the FMP27 protein is not known. FMP27 is the product of a nuclear encoded gene but it is detected in highly purified mitochondria in high-throughput studies []. This entry represents a domain within FMP27 that contains characteristic HQR and WPPW sequence motifs.
Probab=28.53 E-value=1.6e+02 Score=25.67 Aligned_cols=88 Identities=20% Similarity=0.353 Sum_probs=48.8
Q ss_pred CCCCCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH---HhcCCCChhHHHHHHH
Q psy460 13 DDKTEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA---EQQTAVPLDSYTRKLV 89 (117)
Q Consensus 13 ~~~~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e---~q~~p~dLd~yvkKL~ 89 (117)
++.+..|-..|+.+.+- |-- .|. +-+.+-++.=-.+|.++|+.+.+.|..+.. .+...-++..+.+.+.
T Consensus 139 ~p~~~~FG~E~tH~C~m-~~~---~p~----~vQ~~L~~~Rl~~L~~qi~~~~~~l~~~~~~~~~~~~~~~~~~l~~~~~ 210 (475)
T PF10359_consen 139 DPKRSSFGNEPTHDCLM-GDN---DPR----RVQIELIQERLDELEEQIEKHEEKLGELELNPDDPELKSDIEELERHIS 210 (475)
T ss_pred CCCcCcCCCCCceeeEe-ecC---Ccc----hHHHHHHHHHHHHHHHHHHHHHHhhhccccccccHHHHHHHHHHHHHHH
Confidence 33333366666666554 322 222 122233333334566677777777666542 1111234567777788
Q ss_pred HhhHHHHHHHHHHHHHHHH
Q psy460 90 DAQQKISIVGNILQTTQVA 108 (117)
Q Consensus 90 ~ak~RV~~vnniLq~~Q~R 108 (117)
..++|+..|.+.|+.....
T Consensus 211 ~l~~~~~~l~~~l~~l~~~ 229 (475)
T PF10359_consen 211 SLKERIEFLENMLEDLEDS 229 (475)
T ss_pred HHHHHHHHHHHHHHHHhhh
Confidence 8888888888888776554
No 186
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=28.52 E-value=1.7e+02 Score=22.73 Aligned_cols=34 Identities=18% Similarity=0.236 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
...+...-...+.-..+|+.+++.|..++.++..
T Consensus 99 ~~~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~ 132 (161)
T TIGR02894 99 DQALQKENERLKNQNESLQKRNEELEKELEKLRQ 132 (161)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444445566666666666666644
No 187
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=28.43 E-value=3.3e+02 Score=22.25 Aligned_cols=78 Identities=10% Similarity=0.155 Sum_probs=51.9
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCC---hhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVP---LDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~d---Ld~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
.-.+..++..+.+....=..+...|..+.+.+++..+.+...-+ +..+=+.+..+++|...++.-|..+++++.++.
T Consensus 44 ~~~~~~~~~e~e~le~qv~~~e~ei~~~r~r~~~~e~kl~~v~~~~e~~aL~~E~~~ak~r~~~le~el~~l~~~~~~l~ 123 (239)
T COG1579 44 NKALEALEIELEDLENQVSQLESEIQEIRERIKRAEEKLSAVKDERELRALNIEIQIAKERINSLEDELAELMEEIEKLE 123 (239)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444444444454444445555566666666666555434334 456778889999999999999999999998876
Q ss_pred h
Q psy460 114 F 114 (117)
Q Consensus 114 ~ 114 (117)
.
T Consensus 124 ~ 124 (239)
T COG1579 124 K 124 (239)
T ss_pred H
Confidence 4
No 188
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=28.22 E-value=2.1e+02 Score=21.57 Aligned_cols=35 Identities=20% Similarity=0.286 Sum_probs=22.2
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDK 70 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~ 70 (117)
|..++.+.-.+++..+....+++.+++.+...++.
T Consensus 44 lE~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 78 (151)
T PF14584_consen 44 LEDLLNELFDQIDELKEELEELEKRIEELEEKLRN 78 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45566666666666666666666666666665553
No 189
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=27.91 E-value=4.6e+02 Score=23.78 Aligned_cols=71 Identities=17% Similarity=0.078 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHH---------------HHHHHHhhHHHHHHHHHHHHH
Q psy460 41 DLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSY---------------TRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~y---------------vkKL~~ak~RV~~vnniLq~~ 105 (117)
..++.+...+.....-|..||+.+.++|..-... +..| -.+|.....++...-.-+..+
T Consensus 183 ~~~~~k~~~~~~a~~~L~~ql~~l~~~l~~aE~~------l~~fk~~~~l~~~~~~~~~~~~L~~l~~ql~~a~~~~~~a 256 (754)
T TIGR01005 183 GQGAAKSESNTAAADFLAPEIADLSKQSRDAEAE------VAAYRAQSDLLMGNNATLATQQLAELNTELSRARANRAAA 256 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHcCCcccCCccchHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHhhhhhcC
Q psy460 106 QVAMHKKFFMGS 117 (117)
Q Consensus 106 Q~RL~rl~~~~~ 117 (117)
+.|+..+..++.
T Consensus 257 ~a~~~~l~~~l~ 268 (754)
T TIGR01005 257 EGTADSVKKALQ 268 (754)
T ss_pred HHHHHHHHHHHh
No 190
>PRK06975 bifunctional uroporphyrinogen-III synthetase/uroporphyrin-III C-methyltransferase; Reviewed
Probab=27.27 E-value=4.9e+02 Score=23.83 Aligned_cols=14 Identities=14% Similarity=0.057 Sum_probs=9.3
Q ss_pred HHHHHHHHHHHhhh
Q psy460 100 NILQTTQVAMHKKF 113 (117)
Q Consensus 100 niLq~~Q~RL~rl~ 113 (117)
..|+++-.||.+++
T Consensus 445 ~~L~~AD~~La~~~ 458 (656)
T PRK06975 445 IALQNADARLATSD 458 (656)
T ss_pred HHHHHHHHHHHhcC
Confidence 34777777776665
No 191
>KOG0979|consensus
Probab=27.20 E-value=6.4e+02 Score=25.21 Aligned_cols=75 Identities=17% Similarity=0.236 Sum_probs=48.4
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcC-CCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQT-AVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~-p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
.|+..+++..+....+- |+.+|+.++++++++.+.-+- -=++..-.+++..-++--...|..+.+.|.|..||..
T Consensus 624 ~~~l~~~~~~~ee~~~~---~~~~~~~~~~~~r~lee~~~k~~k~le~~~~~~~~~~~er~~~~~~~~~~~~r~~~ie~ 699 (1072)
T KOG0979|consen 624 SPVLEELDNRIEEEIQK---LKAEIDIRSSTLRELEEKKQKERKELEEEQKKLKLLKRERTKLNSELKSYQQRKERIEN 699 (1072)
T ss_pred chHHHHHHHHHHHHHHH---HHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 56666666666655542 333455555555555442111 1136677778888888888999999999999988865
No 192
>PLN03230 acetyl-coenzyme A carboxylase carboxyl transferase; Provisional
Probab=27.19 E-value=96 Score=27.60 Aligned_cols=34 Identities=18% Similarity=0.377 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHhhHH
Q psy460 40 VDLLDEKFRATRA----SQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 40 I~~lD~~V~~tr~----SQ~eL~~qId~L~~eL~~l~e 73 (117)
|.+|+.+|...+. +...+..+|.+|.+.+.++.+
T Consensus 82 i~ele~ki~el~~~~~~~~~~~~~ei~~l~~~~~~~~~ 119 (431)
T PLN03230 82 IVDLENRIDEVRELANKTGVDFSAQIAELEERYDQVRR 119 (431)
T ss_pred HHHHHHHHHHHHhhhhcccccHHHHHHHHHHHHHHHHH
Confidence 6889999988876 456788899999888888755
No 193
>TIGR00513 accA acetyl-CoA carboxylase, carboxyl transferase, alpha subunit. The enzyme acetyl-CoA carboxylase contains a biotin carboxyl carrier protein or domain, a biotin carboxylase, and a carboxyl transferase. This model represents the alpha chain of the carboxyl transferase for cases in which the architecture of the protein is as in E. coli, in which the carboxyltransferase portion consists of two non-identical subnits, alpha and beta.
Probab=27.11 E-value=1.2e+02 Score=25.68 Aligned_cols=39 Identities=26% Similarity=0.423 Sum_probs=29.7
Q ss_pred HhhhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHhhHH
Q psy460 34 GLIKPTVDLLDEKFRATRA----SQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~----SQ~eL~~qId~L~~eL~~l~e 73 (117)
.+-+| |.+|++++...+. ++..+..+|.+|.+.+.++..
T Consensus 7 ~fe~~-i~~l~~~~~~l~~~~~~~~~~~~~~~~~l~~~~~~~~~ 49 (316)
T TIGR00513 7 DFEKP-IAELEAKIESLRARSRDEDVDLSEEIERLEKRSVELTK 49 (316)
T ss_pred hhhHH-HHHHHHHHHHHHhhhhcccccHHHHHHHHHHHHHHHHH
Confidence 33455 5789999998775 456799999999888888754
No 194
>PRK06034 hypothetical protein; Provisional
Probab=26.94 E-value=1.4e+02 Score=24.92 Aligned_cols=33 Identities=24% Similarity=0.454 Sum_probs=17.7
Q ss_pred HHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhH
Q psy460 55 LELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQ 93 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~ 93 (117)
.+||.+||++..+|-+|... =..|++++..+|+
T Consensus 12 ~eLR~eID~ID~eLl~LL~e------R~~lv~~Va~~K~ 44 (279)
T PRK06034 12 AELRWEIDAIDEELHQLLME------RGDIIDRLIAVKR 44 (279)
T ss_pred HHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHhh
Confidence 45666666666666665543 2344444444443
No 195
>PF13942 Lipoprotein_20: YfhG lipoprotein
Probab=26.83 E-value=1.5e+02 Score=23.59 Aligned_cols=33 Identities=24% Similarity=0.406 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 41 DLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
..-|+++...|+.|..|+.+++.-...|.+|..
T Consensus 126 qssD~~lD~Lr~qq~~Lq~qL~~T~RKLEnLTD 158 (179)
T PF13942_consen 126 QSSDSELDALRQQQQRLQYQLDTTTRKLENLTD 158 (179)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhH
Confidence 345777777777777777777777777776654
No 196
>PF02181 FH2: Formin Homology 2 Domain; InterPro: IPR015425 Formin homology (FH) proteins play a crucial role in the reorganisation of the actin cytoskeleton, which mediates various functions of the cell cortex including motility, adhesion, and cytokinesis []. Formins are multidomain proteins that interact with diverse signalling molecules and cytoskeletal proteins, although some formins have been assigned functions within the nucleus. Formins are characterised by the presence of three FH domains (FH1, FH2 and FH3), although members of the formin family do not necessarily contain all three domains []. The proline-rich FH1 domain mediates interactions with a variety of proteins, including the actin-binding protein profilin, SH3 (Src homology 3) domain proteins, and WW domain proteins. The FH2 domain is required for the self-association of formin proteins through the ability of FH2 domains to directly bind each other [], and may also act to inhibit actin polymerisation []. The FH3 domain (IPR010472 from INTERPRO) is less well conserved and may be important for determining intracellular localisation of formin family proteins. In addition, some formins can contain a GTPase-binding domain (GBD) (IPR010473 from INTERPRO) required for binding to Rho small GTPases, and a C-terminal conserved Dia-autoregulatory domain (DAD). This entry represents the FH2 domain, which was shown by X-ray crystallography to have an elongated, crescent shape containing three helical subdomains [].; PDB: 1Y64_B 1UX4_A 1UX5_A 3O4X_H 3OBV_E 1V9D_D 2Z6E_B 2J1D_G.
Probab=26.39 E-value=3.2e+02 Score=22.30 Aligned_cols=56 Identities=11% Similarity=0.141 Sum_probs=42.9
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHH
Q psy460 31 GLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVD 90 (117)
Q Consensus 31 GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ 90 (117)
++...++|.++..+.++..++....++.....++..+. .|. ...++.+.+++-|..
T Consensus 309 ~f~~~~~~f~~~~~~~~~~l~~~~~~~~~~~~~~~~yf---ge~-~~~~~~~~ff~~l~~ 364 (370)
T PF02181_consen 309 KFKEKMKEFLEEAETKLDELQELYEELEEAFKQLLQYF---GED-PKKMSPEEFFKILSQ 364 (370)
T ss_dssp THHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT---T---TTCCHHHHHHHHHHH
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---CCC-CCCCCHHHHHHHHHH
Confidence 46778899999999999999998888888888887776 332 346777777776654
No 197
>KOG0976|consensus
Probab=26.34 E-value=4e+02 Score=26.48 Aligned_cols=82 Identities=17% Similarity=0.109 Sum_probs=46.4
Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh---cCCCC-hhHHHHHHHHhhHHHHHHHHHHH
Q psy460 28 IAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQ---QTAVP-LDSYTRKLVDAQQKISIVGNILQ 103 (117)
Q Consensus 28 La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q---~~p~d-Ld~yvkKL~~ak~RV~~vnniLq 103 (117)
=++|+-..+.-.=++-|...-++|.-|..++ ..-.+|++|.++| +.-+| +..++-.|---|+---++-|-|+
T Consensus 345 k~egfddk~~eLEKkrd~al~dvr~i~e~k~----nve~elqsL~~l~aerqeQidelKn~if~~e~~~~dhe~~kneL~ 420 (1265)
T KOG0976|consen 345 KAEGFDDKLNELEKKRDMALMDVRSIQEKKE----NVEEELQSLLELQAERQEQIDELKNHIFRLEQGKKDHEAAKNELQ 420 (1265)
T ss_pred hhcchhHHHHHHHHHHHHHHHhHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhccchhHHHHHHHH
Confidence 3566666655555555555555555443332 2233333333322 11222 35666677777777778888899
Q ss_pred HHHHHHHhhh
Q psy460 104 TTQVAMHKKF 113 (117)
Q Consensus 104 ~~Q~RL~rl~ 113 (117)
.+++|++.+-
T Consensus 421 ~a~ekld~mg 430 (1265)
T KOG0976|consen 421 EALEKLDLMG 430 (1265)
T ss_pred HHHHHHHHHh
Confidence 9998887653
No 198
>PHA03386 P10 fibrous body protein; Provisional
Probab=26.08 E-value=2.4e+02 Score=20.21 Aligned_cols=45 Identities=29% Similarity=0.522 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHH
Q psy460 55 LELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNIL 102 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniL 102 (117)
+++....+.|.+.+..+.+. .-.||.--..|...--+|+.+++||
T Consensus 15 kavd~KVdaLQ~qV~dv~~n---~~~LDa~~~qL~~l~tkV~~Iq~iL 59 (94)
T PHA03386 15 QEVDTKVDALQTQLNGLEED---SQPLDGLPAQLTELDTKVSDIQSIL 59 (94)
T ss_pred HHHhhHHHHHHHHHHHHHhc---chhhhhHHHHHHHHHHHHHHHHHhc
Confidence 45556667777777766653 1225555555544444444444444
No 199
>PRK15366 type III secretion system chaperone SsaE; Provisional
Probab=25.97 E-value=2.3e+02 Score=19.74 Aligned_cols=46 Identities=15% Similarity=0.162 Sum_probs=34.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTR 86 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvk 86 (117)
|++..||...+ .|=.+...-|+.|...=.+|...-.+|.+.+.|-+
T Consensus 2 ~TLT~LEDsLr---~~~~~a~~~i~~L~aa~~rL~~al~~P~sp~qYqq 47 (80)
T PRK15366 2 TTLTRLEDLLL---HSREEAKGIILQLRAARKQLEENNGKLQDPQQYQQ 47 (80)
T ss_pred cchhHHHHHHh---cCHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHH
Confidence 67888887654 34445577788888777788777789999998854
No 200
>PRK10884 SH3 domain-containing protein; Provisional
Probab=25.93 E-value=3.3e+02 Score=21.45 Aligned_cols=28 Identities=25% Similarity=0.373 Sum_probs=17.2
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 87 KLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 87 KL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
.|..++.++..+..-+...++.+..=-|
T Consensus 147 ~l~~~~~~~~~l~~~~~~~~~~~~~~wf 174 (206)
T PRK10884 147 QLIVAQKKVDAANLQLDDKQRTIIMQWF 174 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555667777767777766666544333
No 201
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=25.86 E-value=4e+02 Score=22.31 Aligned_cols=34 Identities=24% Similarity=0.169 Sum_probs=22.8
Q ss_pred ChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 80 PLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 80 dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
.|...-.+|......+..-.+.+...++.+..+.
T Consensus 205 eL~~lk~~l~~~~~ei~~~~~~l~e~~~~l~~l~ 238 (312)
T smart00787 205 ELDRAKEKLKKLLQEIMIKVKKLEELEEELQELE 238 (312)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455556777777777777777777777776654
No 202
>KOG3438|consensus
Probab=25.49 E-value=1.4e+02 Score=21.77 Aligned_cols=28 Identities=18% Similarity=0.445 Sum_probs=22.4
Q ss_pred CCchHHHHHHH---HHHhhhHHHHHHHHHHH
Q psy460 21 DNPTQDTIAEG---LLGLIKPTVDLLDEKFR 48 (117)
Q Consensus 21 ~~p~rd~La~G---L~~lL~P~I~~lD~~V~ 48 (117)
..|+++.|..| |+++..-+..+|++.+.
T Consensus 67 ~~~A~evl~kgl~el~~~c~~v~~kF~~~i~ 97 (105)
T KOG3438|consen 67 GDPAVEVLKKGLEELMQLCDHVRSKFEEEIE 97 (105)
T ss_pred CCcchHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 77889999999 56777777788877765
No 203
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=25.47 E-value=4.6e+02 Score=22.95 Aligned_cols=20 Identities=10% Similarity=0.137 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHHHHHHhhh
Q psy460 94 KISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 94 RV~~vnniLq~~Q~RL~rl~ 113 (117)
|.--|+-.+.+.|.|+.||.
T Consensus 299 RaRdi~E~~Es~qtRisklE 318 (395)
T PF10267_consen 299 RARDIWEVMESCQTRISKLE 318 (395)
T ss_pred HHhHHHHHHHHHHHHHHHHH
Confidence 44456778888888888886
No 204
>PF01544 CorA: CorA-like Mg2+ transporter protein; InterPro: IPR002523 The CorA transport system is the primary Mg2+ influx system of Salmonella typhimurium and Escherichia coli [, ]. CorA is virtually ubiquitous in the Bacteria and Archaea. There are also eukaryotic relatives of this protein. Transporter ZntB mediates efflux of zinc ions [].; GO: 0046873 metal ion transmembrane transporter activity, 0030001 metal ion transport, 0055085 transmembrane transport, 0016020 membrane; PDB: 2HN1_A 3NWI_D 3NVO_B 3CK6_A 2IUB_E 2BBJ_E 2HN2_A 2BBH_A.
Probab=25.35 E-value=3.1e+02 Score=20.85 Aligned_cols=63 Identities=19% Similarity=0.313 Sum_probs=33.0
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAM 109 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL 109 (117)
++.++.-+++.+..-++.... +++.+..++ ..-+-....+.|...|+.+..+...+...++-+
T Consensus 112 l~~il~~~~~~~~~~l~~l~~-------~l~~le~~~--------~~~~~~~~~~~l~~l~~~l~~l~~~l~~~~~~l 174 (292)
T PF01544_consen 112 LYAILDEIVDDYFEVLEELED-------ELDELEDEL--------DDRPSNELLRELFDLRRELSRLRRSLSPLREVL 174 (292)
T ss_dssp HHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHH--------THTTTHHHCCHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHH-------HHHHHHhhc--------ccccchhhHHHHHHHHHHHHHHHHHhhhHHHHH
Confidence 344555555555554444333 555555544 111223455566666666666666666655555
No 205
>PTZ00419 valyl-tRNA synthetase-like protein; Provisional
Probab=25.07 E-value=3.1e+02 Score=26.24 Aligned_cols=64 Identities=9% Similarity=0.172 Sum_probs=43.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCCh-hHHHHHHHHhhHHHHHHHHHHHHHH
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPL-DSYTRKLVDAQQKISIVGNILQTTQ 106 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL-d~yvkKL~~ak~RV~~vnniLq~~Q 106 (117)
+++.+....+-...|..+|+++...|..-+....+|.++ ..--+||.+....+..+...|++..
T Consensus 927 ~~~E~~rL~K~l~kl~~ei~~~~~kL~N~~F~~kAp~~vve~e~~kl~~~~~~l~~l~~~l~~l~ 991 (995)
T PTZ00419 927 LKKELAKLEKKLAKLQKSLESYLKKISIPNYEDKVPEDVRKLNDEKIDELNEEIKQLEQAIEELK 991 (995)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCchhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 555566666666677778888877777766655677766 5666777777777777766666544
No 206
>PRK05729 valS valyl-tRNA synthetase; Reviewed
Probab=24.93 E-value=2.6e+02 Score=26.26 Aligned_cols=62 Identities=13% Similarity=0.205 Sum_probs=38.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCCh-hHHHHHHHHhhHHHHHHHHHHHH
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPL-DSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL-d~yvkKL~~ak~RV~~vnniLq~ 104 (117)
+++.+....+-..+|..+|+++...|..=+....+|.++ ...-+||.+....+..+...|.+
T Consensus 809 ~~~e~~rL~K~l~kl~~ei~~~~~kL~n~~F~~KAP~~vve~e~~kl~~~~~~~~~l~~~l~~ 871 (874)
T PRK05729 809 VEAELARLEKELAKLEKEIERVEKKLSNEGFVAKAPEEVVEKEREKLAEYEEKLAKLKERLAR 871 (874)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhCCchhhhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555555555566667777777776655555677765 45666666666666666555544
No 207
>PHA01750 hypothetical protein
Probab=24.89 E-value=2.3e+02 Score=19.37 Aligned_cols=9 Identities=22% Similarity=0.180 Sum_probs=4.3
Q ss_pred HhhhHHHHH
Q psy460 34 GLIKPTVDL 42 (117)
Q Consensus 34 ~lL~P~I~~ 42 (117)
..|+.+|++
T Consensus 30 q~lkdAvke 38 (75)
T PHA01750 30 QALKDAVKE 38 (75)
T ss_pred HHHHHHHHH
Confidence 444555543
No 208
>PRK10378 inactive ferrous ion transporter periplasmic protein EfeO; Provisional
Probab=24.70 E-value=2.4e+02 Score=24.46 Aligned_cols=36 Identities=14% Similarity=0.361 Sum_probs=22.3
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
++.+|+|.|.+-|...-. ++..+++.+...|.++..
T Consensus 301 ~~~~l~P~l~~~dp~l~~------~id~~f~~v~~lL~~~~~ 336 (375)
T PRK10378 301 IVDLLRPLLEKANPELLA------KVDANFKKVDTILAKYRT 336 (375)
T ss_pred HHHHhhhhhhhcCHHHHH------HHHHHHHHHHHHHHHhcC
Confidence 889999999866554321 233455666666666543
No 209
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=24.63 E-value=2.9e+02 Score=20.36 Aligned_cols=22 Identities=9% Similarity=0.057 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHHHHHHhhhh
Q psy460 93 QKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 93 ~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
.++...-..+...+....++..
T Consensus 158 ~~~~~~~~~~~~~~~~~~~l~~ 179 (191)
T PF04156_consen 158 EEVQELRSQLERLQENLQQLEE 179 (191)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 4444444444444444444443
No 210
>PF02865 STAT_int: STAT protein, protein interaction domain; InterPro: IPR013799 The STAT protein (Signal Transducers and Activators of Transcription) family contains transcription factors that are specifically activated to regulate gene transcription when cells encounter cytokines and growth factors, hence they act as signal transducers in the cytoplasm and transcription activators in the nucleus []. Binding of these factors to cell-surface receptors leads to receptor autophosphorylation at a tyrosine, the phosphotyrosine being recognised by the STAT SH2 domain, which mediates the recruitment of STAT proteins from the cytosol and their association with the activated receptor. The STAT proteins are then activated by phosphorylation via members of the JAK family of protein kinases, causing them to dimerise and translocated to the nucleus, where they bind to specific promoter sequences in target genes. In mammals, STATs comprise a family of seven structurally and functionally related proteins: Stat1, Stat2, Stat3, Stat4, Stat5a and Stat5b, Stat6. STAT proteins play a critical role in regulating innate and acquired host immune responses. Dysregulation of at least two STAT signalling cascades (i.e. Stat3 and Stat5) is associated with cellular transformation. Signalling through the JAK/STAT pathway is initiated when a cytokine binds to its corresponding receptor. This leads to conformational changes in the cytoplasmic portion of the receptor, initiating activation of receptor associated members of the JAK family of kinases. The JAKs, in turn, mediate phosphorylation at the specific receptor tyrosine residues, which then serve as docking sites for STATs and other signalling molecules. Once recruited to the receptor, STATs also become phosphorylated by JAKs, on a single tyrosine residue. Activated STATs dissociate from the receptor, dimerise, translocate to the nucleus and bind to members of the GAS (gamma activated site) family of enhancers. The seven STAT proteins identified in mammals range in size from 750 and 850 amino acids. The chromosomal distribution of these STATs, as well as the identification of STATs in more primitive eukaryotes, suggest that this family arose from a single primordial gene. STATs share structurally and functionally conserved domains including: an N-terminal domain that strengthens interactions between STAT dimers on adjacent DNA-binding sites; a coiled-coil STAT domain that is implicated in protein-protein interactions; a DNA-binding domain with an immunoglobulin-like fold similar to p53 tumour suppressor protein; an EF-hand-like linker domain connecting the DNA-binding and SH2 domains; an SH2 domain (IPR000980 from INTERPRO) that acts as a phosphorylation-dependent switch to control receptor recognition and DNA-binding; and a C-terminal transactivation domain []. The crystal structure of the N terminus of Stat4 reveals a dimer. The interface of this dimer is formed by a ring-shaped element consisting of five short helices. Several studies suggest that this N-terminal dimerisation promotes cooperativity of binding to tandem GAS elements and with the transcriptional coactivator CBP/p300. This entry represents the N-terminal domain, which is responsible for protein interactions. This domain has a multi-helical structure that can be subdivided into two structural sub-domains.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0004871 signal transducer activity, 0006355 regulation of transcription, DNA-dependent, 0007165 signal transduction; PDB: 1BGF_A 1YVL_A.
Probab=24.62 E-value=2.8e+02 Score=20.13 Aligned_cols=70 Identities=14% Similarity=0.267 Sum_probs=46.0
Q ss_pred chHHHHHHHHHHhhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHH
Q psy460 23 PTQDTIAEGLLGLIKPTVDLLDEKFRAT-RASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISI 97 (117)
Q Consensus 23 p~rd~La~GL~~lL~P~I~~lD~~V~~t-r~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~ 97 (117)
|.-...|. .+|.-.|++|+.+.... ..+-.-++-.|-+....+...= +..|..|..+|+..+.--+|++.
T Consensus 50 ~~~~~~A~---~l~~~ll~~Lq~~~~~~~~~~~fl~~~~l~~~~~~~q~~y--~~~P~~L~~~I~~~L~~E~~iv~ 120 (124)
T PF02865_consen 50 PQDESQAS---VLFQNLLQELQQQASRQSQEDNFLLQHNLREIAQNFQNRY--QQNPLELARIIRNCLQEEKRIVQ 120 (124)
T ss_dssp ---HHHHH---HHHHHHHHHHHHHHHHHHHHT-HHHHHHHHHHHHHHHHHH--CT-HHHHHHHHHHHHHHHHHHHH
T ss_pred cccHHHHH---HHHHHHHHHHHHHHHHhcCcchhHHHHHHHHHHHHHHHHh--ccCHHHHHHHHHHHHHHHHHHHH
Confidence 44445555 44556678888886554 3466667777777777776532 35899999999999988888763
No 211
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=24.59 E-value=1.5e+02 Score=22.59 Aligned_cols=75 Identities=17% Similarity=0.277 Sum_probs=0.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhh
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFFM 115 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~~ 115 (117)
+.+.+-++-.-...++.++-+|.++|-.+..+|..+... +...-..|...+..+..+..-+....+.|.-..+.
T Consensus 72 le~~~~~l~~ELael~r~~~el~~~L~~~~~~l~~l~~~------~~~~~~~l~~l~~~~~~L~~~~~~l~~~l~ek~k~ 145 (194)
T PF08614_consen 72 LEQKLAKLQEELAELYRSKGELAQQLVELNDELQELEKE------LSEKERRLAELEAELAQLEEKIKDLEEELKEKNKA 145 (194)
T ss_dssp ------------------------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cccccccccccccccccccccccccccccccccchhhhh------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred c
Q psy460 116 G 116 (117)
Q Consensus 116 ~ 116 (117)
+
T Consensus 146 ~ 146 (194)
T PF08614_consen 146 N 146 (194)
T ss_dssp H
T ss_pred H
No 212
>PF06120 Phage_HK97_TLTM: Tail length tape measure protein; InterPro: IPR009302 This entry consists of the tail length tape measure protein from Bacteriophage HK97 and related sequences from Escherichia coli (strain K12).
Probab=24.57 E-value=4.3e+02 Score=22.30 Aligned_cols=33 Identities=18% Similarity=0.346 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQLDKISAEQ 75 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q 75 (117)
|=.+.+.+|++=.+....||++.+.|+..+..|
T Consensus 39 ~yQ~~EQAr~~A~~fA~~ld~~~~kl~~Ms~~q 71 (301)
T PF06120_consen 39 FYQNAEQARQEAIEFADSLDELKEKLKEMSSTQ 71 (301)
T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHhcCHHH
Confidence 347888888888999999999998888887653
No 213
>PF05103 DivIVA: DivIVA protein; InterPro: IPR007793 The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate minicells []. Inactivation of divIVA produces a minicell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype []. These proteins appear to contain coiled-coils.; PDB: 2WUK_C 2WUJ_A.
Probab=24.54 E-value=1.5e+02 Score=20.49 Aligned_cols=32 Identities=13% Similarity=0.204 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 42 LLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 42 ~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+.|.-+..+...=..|..++..|..++..+..
T Consensus 22 eVD~fl~~l~~~~~~l~~e~~~L~~~~~~l~~ 53 (131)
T PF05103_consen 22 EVDDFLDELAEELERLQRENAELKEEIEELQA 53 (131)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34444444444444455555555555555443
No 214
>PRK10884 SH3 domain-containing protein; Provisional
Probab=24.50 E-value=3.4e+02 Score=21.36 Aligned_cols=10 Identities=30% Similarity=0.228 Sum_probs=3.7
Q ss_pred HHHHHHHHHH
Q psy460 100 NILQTTQVAM 109 (117)
Q Consensus 100 niLq~~Q~RL 109 (117)
.-|+.++..+
T Consensus 146 ~~l~~~~~~~ 155 (206)
T PRK10884 146 NQLIVAQKKV 155 (206)
T ss_pred HHHHHHHHHH
Confidence 3333333333
No 215
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=24.50 E-value=6.8e+02 Score=24.55 Aligned_cols=32 Identities=9% Similarity=0.139 Sum_probs=23.9
Q ss_pred hhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 81 LDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 81 Ld~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
-+.|-+.+..++..+..+++-|..++++-..-
T Consensus 308 ~~~~~~~~~~~~~~l~~~~~~L~~i~~~~~~y 339 (1201)
T PF12128_consen 308 RDELNKELSALNADLARIKSELDEIEQQKKDY 339 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35677888888888888888888877765543
No 216
>PF07851 TMPIT: TMPIT-like protein; InterPro: IPR012926 A number of members of this family are annotated as being transmembrane proteins induced by tumour necrosis factor alpha, but no literature was found to support this. ; GO: 0016021 integral to membrane
Probab=24.35 E-value=4.6e+02 Score=22.50 Aligned_cols=69 Identities=12% Similarity=0.117 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHH--------HHHHHhhh
Q psy460 42 LLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTT--------QVAMHKKF 113 (117)
Q Consensus 42 ~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~--------Q~RL~rl~ 113 (117)
++++...+......+|.+--....+.|.++.+. .+...+.+..-|+|+..+...|+.. .+.+.+|.
T Consensus 1 e~~eEW~eL~~efq~Lqethr~Y~qKleel~~l------Q~~C~ssI~~QkkrLk~L~~sLk~~~~~~~~e~~~~i~~L~ 74 (330)
T PF07851_consen 1 ECEEEWEELQKEFQELQETHRSYKQKLEELSKL------QDKCSSSISHQKKRLKELKKSLKRCKKSLSAEERELIEKLE 74 (330)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHhccCCChhHHHHHHHHH
Q ss_pred hhc
Q psy460 114 FMG 116 (117)
Q Consensus 114 ~~~ 116 (117)
..+
T Consensus 75 ~~I 77 (330)
T PF07851_consen 75 EDI 77 (330)
T ss_pred HHH
No 217
>COG1813 Predicted transcription factor, homolog of eukaryotic MBF1 [Transcription]
Probab=24.25 E-value=2.3e+02 Score=22.06 Aligned_cols=45 Identities=20% Similarity=0.271 Sum_probs=37.0
Q ss_pred hhhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHhhHHHhcCCCC
Q psy460 35 LIKPTVDLLDEKFRATRA----SQLELRLQIEALTSQLDKISAEQQTAVP 80 (117)
Q Consensus 35 lL~P~I~~lD~~V~~tr~----SQ~eL~~qId~L~~eL~~l~e~q~~p~d 80 (117)
.+.-+|+.+...|+.+|+ ||.+|.+.|..=.+-+++|... ..-|+
T Consensus 72 ~~~elvedY~e~Ir~ARE~~G~SqedLA~ki~ek~svI~~iE~g-~~~P~ 120 (165)
T COG1813 72 ELPELVEDYGERIREAREKRGLSQEDLAAKLKEKVSVIRRIERG-EATPN 120 (165)
T ss_pred hHHHHHHHHHHHHHHHHHHcCCCHHHHHHHhcccHHHHHHHHhc-ccCcc
Confidence 455688999999999997 8999999999999999999654 34444
No 218
>PF07047 OPA3: Optic atrophy 3 protein (OPA3); InterPro: IPR010754 OPA3 deficiency causes type III 3-methylglutaconic aciduria (MGA) in humans. This disease manifests with early bilateral optic atrophy, spasticity, extrapyramidal dysfunction, ataxia, and cognitive deficits, but normal longevity []. This family consists of several optic atrophy 3 (OPA3) proteins and related proteins from other eukaryotic species, the function is unknown.
Probab=24.17 E-value=2.9e+02 Score=20.10 Aligned_cols=64 Identities=11% Similarity=0.145 Sum_probs=33.3
Q ss_pred cccCCCCCCCCCCchHHHHHHH-HHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 10 TSADDKTEDFCDNPTQDTIAEG-LLGLIKPTV-DLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 10 ts~~~~~~~~~~~p~rd~La~G-L~~lL~P~I-~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+.+-+.+|.-.-.-|.|.|+|+ ||++..-+| =+.--..+.-..-..++++.++.|.+++..|.+
T Consensus 61 ~~i~pL~e~~Aie~Gaell~E~fiF~Va~~li~~E~~Rs~~ke~~Ke~~~~~~l~~L~~~i~~L~~ 126 (134)
T PF07047_consen 61 RKIRPLNEEKAIELGAELLGEAFIFSVAAGLIIYEYWRSARKEAKKEEELQERLEELEERIEELEE 126 (134)
T ss_pred CcCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555556678899999 445544333 222222222233334455566666555555543
No 219
>PF08838 DUF1811: Protein of unknown function (DUF1811); InterPro: IPR014938 This entry consists uncharacterised bacterial proteins. Some of the proteins are annotated as being transcriptional regulators (see Q4MQL7 from SWISSPROT, Q65MA2 from SWISSPROT). The structure of one of the proteins has revealed a beta-barrel like structure with helix-turn-helix like motif. ; PDB: 2YXY_A 1SF9_A.
Probab=24.14 E-value=1e+02 Score=22.33 Aligned_cols=40 Identities=30% Similarity=0.366 Sum_probs=28.0
Q ss_pred HHHHHHHHHHHHHHHHHHHhhHHHhcCCC-ChhHHHHHHHHhh
Q psy460 51 RASQLELRLQIEALTSQLDKISAEQQTAV-PLDSYTRKLVDAQ 92 (117)
Q Consensus 51 r~SQ~eL~~qId~L~~eL~~l~e~q~~p~-dLd~yvkKL~~ak 92 (117)
..|+.||++.|..|.++-++-.. .--+ .++.|-+|..=||
T Consensus 6 eMs~~EL~~Ei~~L~ekarKAEq--~G~~nE~aV~erK~~mAk 46 (102)
T PF08838_consen 6 EMSEEELRQEIARLKEKARKAEQ--LGIVNEYAVYERKIIMAK 46 (102)
T ss_dssp C--HHHHHHHHHHHHHHHHHHHH--CT-HHHHHHHHHHHHHHH
T ss_pred hcCHHHHHHHHHHHHHHHHHHHH--cCCccHHHHHHHHHHHHH
Confidence 35788999999999988877543 2333 5788888887665
No 220
>KOG0996|consensus
Probab=23.77 E-value=6.3e+02 Score=25.77 Aligned_cols=78 Identities=15% Similarity=0.207 Sum_probs=0.0
Q ss_pred HHhhhHHHHHHHHHHHHH-----H-HHHHHHHHHHHHH--------HHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHH
Q psy460 33 LGLIKPTVDLLDEKFRAT-----R-ASQLELRLQIEAL--------TSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIV 98 (117)
Q Consensus 33 ~~lL~P~I~~lD~~V~~t-----r-~SQ~eL~~qId~L--------~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~v 98 (117)
+..++--|..++..+.+. . +-+.+|+..||++ ..+++++++- ++-...++....-+|..-
T Consensus 860 l~~~~~~ie~l~kE~e~~qe~~~Kk~~i~~lq~~i~~i~~e~~q~qk~kv~~~~~~------~~~l~~~i~k~~~~i~~s 933 (1293)
T KOG0996|consen 860 LKELEEQIEELKKEVEELQEKAAKKARIKELQNKIDEIGGEKVQAQKDKVEKINEQ------LDKLEADIAKLTVAIKTS 933 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhchhhHHhHHHHHHHHHH------HHHHHHHHHHhHHHHhcC
Q ss_pred HHHHHHHHHHHHhhhhhc
Q psy460 99 GNILQTTQVAMHKKFFMG 116 (117)
Q Consensus 99 nniLq~~Q~RL~rl~~~~ 116 (117)
+.-++.+|+-+.++.+++
T Consensus 934 ~~~i~k~q~~l~~le~~~ 951 (1293)
T KOG0996|consen 934 DRNIAKAQKKLSELEREI 951 (1293)
T ss_pred cccHHHHHHHHHHHHHHH
No 221
>TIGR01797 CM_P_1 chorismate mutase domain of proteobacterial P-protein, clade 1. This model represents the chorismate mutase domain of the gamma and beta proteobacterial "P-protein" which contains an N-terminal chorismate mutase domain and a C-terminal prephenate dehydratase domain.
Probab=23.66 E-value=2.3e+02 Score=18.74 Aligned_cols=18 Identities=39% Similarity=0.460 Sum_probs=10.3
Q ss_pred HHHHHHHHHHHHHHhhHH
Q psy460 56 ELRLQIEALTSQLDKISA 73 (117)
Q Consensus 56 eL~~qId~L~~eL~~l~e 73 (117)
++|.+||++..+|=.|-.
T Consensus 3 ~lR~~ID~ID~~lv~Ll~ 20 (83)
T TIGR01797 3 ALREKISAIDEKLLKLLA 20 (83)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 456666666665555443
No 222
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=23.53 E-value=6.2e+02 Score=23.73 Aligned_cols=77 Identities=16% Similarity=0.245 Sum_probs=0.0
Q ss_pred HHhhhHHHHHHHHH-HHHHHHHHHHHHHHHHHH----HHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHH
Q psy460 33 LGLIKPTVDLLDEK-FRATRASQLELRLQIEAL----TSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQV 107 (117)
Q Consensus 33 ~~lL~P~I~~lD~~-V~~tr~SQ~eL~~qId~L----~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~ 107 (117)
+.+|.=+++.|=++ ++-......++..++..| ..+|+++.+.+..--.|..-..+|.+ |+..+...=..+..
T Consensus 538 l~lL~~a~~vlreeYi~~~~~ar~ei~~rv~~Lk~~~e~Ql~~L~~l~e~~~~l~~~ae~Lae---R~e~a~d~Qe~L~~ 614 (717)
T PF10168_consen 538 LELLSQATKVLREEYIEKQDLAREEIQRRVKLLKQQKEQQLKELQELQEERKSLRESAEKLAE---RYEEAKDKQEKLMK 614 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHH
Q ss_pred HHHhh
Q psy460 108 AMHKK 112 (117)
Q Consensus 108 RL~rl 112 (117)
|+.++
T Consensus 615 R~~~v 619 (717)
T PF10168_consen 615 RVDRV 619 (717)
T ss_pred HHHHH
No 223
>PRK12855 hypothetical protein; Provisional
Probab=23.52 E-value=2.3e+02 Score=20.18 Aligned_cols=46 Identities=13% Similarity=0.243 Sum_probs=36.6
Q ss_pred CCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 20 CDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 20 ~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
+-|-.+|.+| ||..+++-=+..+++.+..+|. +.|++|.++=+++-
T Consensus 28 ~~n~~kd~~a-~lr~ivGG~~~~Y~~~l~~aR~------~A~~rm~~~A~~lG 73 (103)
T PRK12855 28 GANIVRDLFA-SVRDVVGGRSGAYESKLKEARD------IAMEEMKTLARQKN 73 (103)
T ss_pred eeeeHHHHHH-HHHHHhcCchHHHHHHHHHHHH------HHHHHHHHHHHHcC
Confidence 4566788777 8999999999999999999886 57777777666554
No 224
>TIGR01803 CM-like chorismate mutase related enzymes. This subfamily includes two enzymes which are variants on the mechanism of chorismate mutase and are likely to have evolved from an ancestral chorismate mutase enzyme. 4-amino-4-deoxy-chorismate mutase produces amino-deoxy-prephenate which is subsequently converted to para-dimethylamino-phenylalanine, a component of the natural product pristinamycin. Isochorismate-pyruvate lyase presumably catalyzes the same type of 2+2+2 cyclo-rearrangement as chorismate mutase, but acting on isochorismate, this results in two broken bonds instead of one broken and one made. The product of this reaction is salicylate (2-hydroxy-benzoate) which is also incorporated into various natural products.
Probab=23.44 E-value=2.3e+02 Score=18.65 Aligned_cols=16 Identities=19% Similarity=0.326 Sum_probs=7.4
Q ss_pred HHHHHHHHHHHHHHhh
Q psy460 56 ELRLQIEALTSQLDKI 71 (117)
Q Consensus 56 eL~~qId~L~~eL~~l 71 (117)
++|.+||++..+|=.|
T Consensus 3 ~lR~~ID~ID~~lv~L 18 (82)
T TIGR01803 3 DIREAIDRIDLALVQA 18 (82)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3445555554444333
No 225
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=22.91 E-value=4.1e+02 Score=21.45 Aligned_cols=43 Identities=19% Similarity=0.280 Sum_probs=21.2
Q ss_pred HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 30 EGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 30 ~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
.+....+.--...+..++...+.-...++.+|..+..++..+.
T Consensus 129 ~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~i~~~~~~l~~~~ 171 (423)
T TIGR01843 129 KGQQSLFESRKSTLRAQLELILAQIKQLEAELAGLQAQLQALR 171 (423)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444445555555555544555555555555555443
No 226
>KOG0804|consensus
Probab=22.83 E-value=5.9e+02 Score=23.23 Aligned_cols=22 Identities=27% Similarity=0.241 Sum_probs=9.6
Q ss_pred hhHHHHHHHHHHHHHHHHHHhh
Q psy460 91 AQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 91 ak~RV~~vnniLq~~Q~RL~rl 112 (117)
.......+-+.|+..|.++.|+
T Consensus 373 ~e~~kk~~e~k~~q~q~k~~k~ 394 (493)
T KOG0804|consen 373 LEAEKKIVERKLQQLQTKLKKC 394 (493)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3333344444444444444443
No 227
>PRK10132 hypothetical protein; Provisional
Probab=22.74 E-value=2.9e+02 Score=19.72 Aligned_cols=21 Identities=29% Similarity=0.430 Sum_probs=10.8
Q ss_pred HHHHhhHHHHHHHHHHHHHHHHHH
Q psy460 87 KLVDAQQKISIVGNILQTTQVAMH 110 (117)
Q Consensus 87 KL~~ak~RV~~vnniLq~~Q~RL~ 110 (117)
++...|.|+.. .|..+++|+.
T Consensus 42 ~~~~lR~r~~~---~L~~ar~~l~ 62 (108)
T PRK10132 42 EAEAARRKAQA---LLKETRARMH 62 (108)
T ss_pred HHHHHHHHHHH---HHHHHHHHHh
Confidence 34555555433 5555555554
No 228
>PF11944 DUF3461: Protein of unknown function (DUF3461); InterPro: IPR020911 This entry describes proteins of unknown function.
Probab=22.58 E-value=3.2e+02 Score=20.56 Aligned_cols=31 Identities=16% Similarity=0.163 Sum_probs=20.7
Q ss_pred hHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 82 DSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 82 d~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
|.--+=|.+-++-=.+||+.+..+...|.+|
T Consensus 95 d~K~kiL~dL~HLE~Vv~~KIaEIe~dlekL 125 (125)
T PF11944_consen 95 DLKQKILDDLRHLEKVVNSKIAEIERDLEKL 125 (125)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3334445555565567888888888888765
No 229
>smart00498 FH2 Formin Homology 2 Domain. FH proteins control rearrangements of the actin cytoskeleton, especially in the context of cytokinesis and cell polarisation. Members of this family have been found to interact with Rho-GTPases, profilin and other actin-assoziated proteins. These interactions are mediated by the proline-rich FH1 domain, usually located in front of FH2 (but not listed in SMART). Despite this cytosolic function, vertebrate formins have been assigned functions within the nucleus. A set of Formin-Binding Proteins (FBPs) has been shown to bind FH1 with their WW domain.
Probab=22.55 E-value=4.9e+02 Score=22.20 Aligned_cols=66 Identities=17% Similarity=0.210 Sum_probs=46.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhhHHH---hcCCCCh-hHHHHH----HHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 48 RATRASQLELRLQIEALTSQLDKISAE---QQTAVPL-DSYTRK----LVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 48 ~~tr~SQ~eL~~qId~L~~eL~~l~e~---q~~p~dL-d~yvkK----L~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
.+.+.| ..|...+.+|...++.+... -..+-+. +.|..+ +..++.++..+...+..++.++.++-.
T Consensus 272 ~askvs-~~l~~~~~~l~~~~~~~e~~~~~l~~~~~~~d~f~~~m~~F~~~a~~~~~~l~~~~~~~~~~~~~~~~ 345 (432)
T smart00498 272 KAKVNL-EQLEKDVKQLERQIKNLETDLGGLSDPENLDDKFIEVMKPFLKAAKEKYDKLQKDLSDLKTRFEKLVE 345 (432)
T ss_pred HHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456667 77777777777766655321 1122222 677777 899999999999999999998887643
No 230
>PF07716 bZIP_2: Basic region leucine zipper; InterPro: IPR011700 The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotes are proteins that contain a basic region mediating sequence-specific DNA-binding, followed by a leucine zipper region (see IPR002158 from INTERPRO), which is required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1NWQ_A 1H89_B 1H88_A 1GTW_B 2E43_A 1IO4_A 1GU4_B 2E42_A 1H8A_B 1GU5_B ....
Probab=22.47 E-value=1.9e+02 Score=17.54 Aligned_cols=22 Identities=23% Similarity=0.406 Sum_probs=8.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHHH
Q psy460 43 LDEKFRATRASQLELRLQIEAL 64 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L 64 (117)
++..|.....--..|..+|..|
T Consensus 30 le~~~~~L~~en~~L~~~i~~L 51 (54)
T PF07716_consen 30 LEQEVQELEEENEQLRQEIAQL 51 (54)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3333333333333333333333
No 231
>PF15466 DUF4635: Domain of unknown function (DUF4635)
Probab=22.45 E-value=3.5e+02 Score=20.47 Aligned_cols=51 Identities=18% Similarity=0.247 Sum_probs=37.1
Q ss_pred CCCCCCCCCCchHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 13 DDKTEDFCDNPTQDTIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEAL 64 (117)
Q Consensus 13 ~~~~~~~~~~p~rd~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L 64 (117)
.-.|..++..|.+ ++-..|..=|.-..++|+..|+...+-=..|..=+|.|
T Consensus 74 EyQt~epseEPik-~~r~WLkenLhvflEkLE~EvreLEQlV~DLE~WLDal 124 (135)
T PF15466_consen 74 EYQTKEPSEEPIK-AIRNWLKENLHVFLEKLEKEVRELEQLVRDLEEWLDAL 124 (135)
T ss_pred ccccCCCccchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3478888999987 66777777777778899999987666555555555554
No 232
>PRK09039 hypothetical protein; Validated
Probab=22.41 E-value=4.7e+02 Score=21.94 Aligned_cols=61 Identities=28% Similarity=0.257 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQ 106 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q 106 (117)
+..........+.-=..|++||+.|-.+|..+... |+..-.+....+.++..+-.-|+.+-
T Consensus 125 L~~~k~~~se~~~~V~~L~~qI~aLr~Qla~le~~------L~~ae~~~~~~~~~i~~L~~~L~~a~ 185 (343)
T PRK09039 125 LDSEKQVSARALAQVELLNQQIAALRRQLAALEAA------LDASEKRDRESQAKIADLGRRLNVAL 185 (343)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33333344444444456788888888888888775 77777777777777777777777665
No 233
>KOG0249|consensus
Probab=22.17 E-value=4e+02 Score=25.92 Aligned_cols=59 Identities=14% Similarity=0.182 Sum_probs=45.0
Q ss_pred HHHHHHHHHHHHHHhhHHHh------------cCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhh
Q psy460 56 ELRLQIEALTSQLDKISAEQ------------QTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKFF 114 (117)
Q Consensus 56 eL~~qId~L~~eL~~l~e~q------------~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~~ 114 (117)
.|..+++++++||.+.+... +++==|+.+.++.+.+..+...+=.++..+..++..+.+
T Consensus 167 kl~~~~qe~naeL~rarqreemneeh~~rlsdtvdErlqlhlkermaAle~kn~L~~e~~s~kk~l~~~~~ 237 (916)
T KOG0249|consen 167 KLEEQLEELNAELQRARQREKMNEEHNKRLSDTVDERLQLHLKERMAALEDKNRLEQELESVKKQLEEMRH 237 (916)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhhhhccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 57889999999998765321 112125789999999999999999999999888876543
No 234
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=21.99 E-value=2.6e+02 Score=18.87 Aligned_cols=70 Identities=14% Similarity=0.225 Sum_probs=48.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 41 DLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
-++|.+.+.....=.+|+.+--.++.++..... +-.+......+....+..+..+...+..+.+.+..+-
T Consensus 32 ~~ld~~~r~l~~~~e~lr~~rN~~sk~I~~~~~---~~~~~~~l~~e~~~lk~~i~~le~~~~~~e~~l~~~l 101 (108)
T PF02403_consen 32 IELDQERRELQQELEELRAERNELSKEIGKLKK---AGEDAEELKAEVKELKEEIKELEEQLKELEEELNELL 101 (108)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH---TTCCTHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhh---CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555555555444555555555555555544 2367889999999999999999999999999888763
No 235
>cd00632 Prefoldin_beta Prefoldin beta; Prefoldin is a hexameric molecular chaperone complex, composed of two evolutionarily related subunits (alpha and beta), which are found in both eukaryotes and archaea. Prefoldin binds and stabilizes newly synthesized polypeptides allowing them to fold correctly. The hexameric structure consists of a double beta barrel assembly with six protruding coiled-coils. The alpha prefoldin subunits have two beta hairpin structures while the beta prefoldin subunits (this CD) have only one hairpin that is most similar to the second hairpin of the alpha subunit. The prefoldin hexamer consists of two alpha and four beta subunits and is assembled from the beta hairpins of all six subunits. The alpha subunits initially dimerize providing a structural nucleus for the assembly of the beta subunits. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the st
Probab=21.99 E-value=2.7e+02 Score=19.01 Aligned_cols=31 Identities=23% Similarity=0.224 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDK 70 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~ 70 (117)
+..|+.++.....-=..|..+++.+..++..
T Consensus 65 ~~~Le~~~e~le~~i~~l~~~~~~l~~~~~e 95 (105)
T cd00632 65 RTELKERLETIELRIKRLERQEEDLQEKLKE 95 (105)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444333333344444444443333
No 236
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=21.90 E-value=2.6e+02 Score=25.11 Aligned_cols=56 Identities=14% Similarity=0.152 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHH
Q psy460 44 DEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNI 101 (117)
Q Consensus 44 D~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnni 101 (117)
++.++..+.-..+++.+++.+.++++++.+.. ...+..+...|.+.+.+....+..
T Consensus 214 ~~~l~~l~~~l~~l~~~~~~~~~~l~~~~~~~--~~~~~~~~~~l~~~~~~~~~~~~~ 269 (646)
T PRK05771 214 SELIREIKEELEEIEKERESLLEELKELAKKY--LEELLALYEYLEIELERAEALSKF 269 (646)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44444445555555556666666666665531 111222444555555555555544
No 237
>PF08656 DASH_Dad3: DASH complex subunit Dad3; InterPro: IPR013965 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ].
Probab=21.89 E-value=2.7e+02 Score=19.06 Aligned_cols=57 Identities=14% Similarity=0.295 Sum_probs=40.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 43 LDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 43 lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
|+..|-.-++ .|..++.+|+..|+.++.. ..|.+ .-..-|-..-+|+..|...++.+
T Consensus 4 LEq~VL~eY~---~La~~L~~L~~~l~~L~~~-~~~~~--~lL~~LR~LE~K~glV~TL~KaS 60 (78)
T PF08656_consen 4 LEQEVLDEYQ---RLADNLKTLSDTLKDLNSS-NSPSE--ELLDGLRELERKIGLVYTLFKAS 60 (78)
T ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHHHHcc-CCChH--HHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555544443 6889999999999999542 23332 56667777788999998888764
No 238
>TIGR01242 26Sp45 26S proteasome subunit P45 family. Many proteins may score above the trusted cutoff because an internal
Probab=21.88 E-value=3.2e+02 Score=22.40 Aligned_cols=37 Identities=16% Similarity=0.408 Sum_probs=29.0
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
+-.+++++++.++.+.-...++.+++.+.++++++..
T Consensus 5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 41 (364)
T TIGR01242 5 DVRIRKLEDEKRSLEKEKIRLERELERLRSEIERLRS 41 (364)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 4467888888888888888888888888887776644
No 239
>PRK09343 prefoldin subunit beta; Provisional
Probab=21.87 E-value=3e+02 Score=19.63 Aligned_cols=34 Identities=18% Similarity=0.388 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
++-++.++......+..|+.++..+...|+.+..
T Consensus 80 ~E~ie~~ik~lekq~~~l~~~l~e~q~~l~~ll~ 113 (121)
T PRK09343 80 KELLELRSRTLEKQEKKLREKLKELQAKINEMLS 113 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445555555555555566666666655555543
No 240
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=21.75 E-value=5e+02 Score=22.07 Aligned_cols=73 Identities=16% Similarity=0.267 Sum_probs=58.7
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHh
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHK 111 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~r 111 (117)
.+.++.|..|++....==..||.....|..+-..+.+. -+-+ +...|+-|..|...|..++.-|-.=.+-..+
T Consensus 159 ~~~le~Lq~Klk~LEeEN~~LR~Ea~~L~~et~~~Eek-EqqL-v~dcv~QL~~An~qia~LseELa~k~Ee~~r 231 (306)
T PF04849_consen 159 CIQLEALQEKLKSLEEENEQLRSEASQLKTETDTYEEK-EQQL-VLDCVKQLSEANQQIASLSEELARKTEENRR 231 (306)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhhccHH-HHHH-HHHHHHHhhhcchhHHHHHHHHHHHHHHHHH
Confidence 46788899999999998889999999999988888764 3333 6679999999999999998888665554443
No 241
>PF12805 FUSC-like: FUSC-like inner membrane protein yccS
Probab=21.73 E-value=4.1e+02 Score=21.06 Aligned_cols=67 Identities=16% Similarity=0.020 Sum_probs=46.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh--cCCCChhHHHHHHHHhhHHHHHHHHHHHH
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQ--QTAVPLDSYTRKLVDAQQKISIVGNILQT 104 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q--~~p~dLd~yvkKL~~ak~RV~~vnniLq~ 104 (117)
-++.=+=-.++--|..|..|.+-..+|+.+|+.-.+.. +.+-+.+.+-.+|...+.+|+.-.+.++.
T Consensus 86 ~~lsl~~~~l~p~r~~rqaLa~~y~~lA~yl~~ka~~~~p~~~~~~~~~~~~l~~~q~~v~~~~~~~R~ 154 (284)
T PF12805_consen 86 LLLSLLWWPLRPYRPVRQALAECYRALADYLRAKARFFDPDQHDDDEQLRIELAQQQIKVNEALEQARE 154 (284)
T ss_pred HHHHHHHHHHcCCCHHHHHHHHHHHHHHHHHHHHHhcCCCCCccchhHHHHHHHHHHHHHHHHHHHHHH
Confidence 33444455667778899999999999999999866543 45556677777777766666554444443
No 242
>PF15237 PTRF_SDPR: PTRF/SDPR family
Probab=21.67 E-value=4.7e+02 Score=21.71 Aligned_cols=73 Identities=19% Similarity=0.299 Sum_probs=46.9
Q ss_pred HHhhhHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHH
Q psy460 33 LGLIKPTVDLLD---EKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAM 109 (117)
Q Consensus 33 ~~lL~P~I~~lD---~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL 109 (117)
+.||.-++.-+| ..=+..-.-|.+|...+-....+|-+|... ...-.--|-||+.--+||+ .=...+.+|+
T Consensus 8 ltLLdKl~~~vD~vQ~~Q~~mE~RQ~emE~sV~~IQ~dl~KLsk~---H~~TsnTV~KLLeK~RKVS---~~vk~Vr~r~ 81 (246)
T PF15237_consen 8 LTLLDKLAGMVDSVQETQQRMEERQREMEGSVKGIQGDLTKLSKS---HSTTSNTVNKLLEKTRKVS---VNVKEVRERL 81 (246)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHh---cccHHHHHHHHHHHHHHhh---hhHHHHHHHH
Confidence 344444444333 333333345667777777888899999873 4556778999997666654 4567778887
Q ss_pred Hh
Q psy460 110 HK 111 (117)
Q Consensus 110 ~r 111 (117)
+|
T Consensus 82 ek 83 (246)
T PF15237_consen 82 EK 83 (246)
T ss_pred HH
Confidence 76
No 243
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=21.65 E-value=3.3e+02 Score=19.89 Aligned_cols=49 Identities=27% Similarity=0.354 Sum_probs=33.9
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHH
Q psy460 42 LLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKIS 96 (117)
Q Consensus 42 ~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~ 96 (117)
.|+..+.....||.+|..+=|.|..-+..|... -+..++++..-+.+|.
T Consensus 20 ~Le~slE~~K~S~~eL~kqkd~L~~~l~~L~~q------~~s~~qr~~eLqaki~ 68 (107)
T PF09304_consen 20 SLERSLEDEKTSQGELAKQKDQLRNALQSLQAQ------NASRNQRIAELQAKID 68 (107)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhhHHHHHHhHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHH
Confidence 467788888888888888888888888887763 3444555555444443
No 244
>PF10267 Tmemb_cc2: Predicted transmembrane and coiled-coil 2 protein; InterPro: IPR019394 This family of transmembrane coiled-coil containing proteins is conserved from worms to humans. Its function is unknown.
Probab=21.63 E-value=5.5e+02 Score=22.49 Aligned_cols=31 Identities=29% Similarity=0.320 Sum_probs=23.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQ 67 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~e 67 (117)
.-.+..+.+.+++.+..|..|...||+|..+
T Consensus 211 ~~~l~~~~~el~eik~~~~~L~~~~e~Lk~~ 241 (395)
T PF10267_consen 211 NLGLQKILEELREIKESQSRLEESIEKLKEQ 241 (395)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445566677788899999999999988763
No 245
>PF01806 Paramyxo_P: Paramyxovirinae P phosphoprotein C-terminal region; InterPro: IPR002693 Sendai virus is a member of the Paramyxovirinae family. Its negative-sense ssRNA genome is packaged by the viral nucleoprotein (N) within a helical nucleocapsid. Paramyxovirinae use this N-RNA (nucleoprotein-RNA) complex as a template for both transcription and replication. During viral genome replication, the synthesis of viral RNA and its encapsidation by N are concomitant. Viral transcription and replication are carried out by viral RNA-dependent RNA polymerase, which consists of two proteins: L polymerase and phosphoprotein P. The L polymerase carries the enzyme activity. Phosphoprotein P binds the viral nucleocapsid, and positions the L polymerase on the template for transcription and replication formed by nucleoprotein-RNA (N-RNA) []. This entry represents phosphoprotein P from Sendai virus as well as from close family members. Phosphoprotein P, an indispensable subunit of the viral polymerase complex, is a modular protein organised into two moieties that are both functionally and structurally distinct: a well-conserved C-terminal moiety that contains all the regions required for transcription, and a poorly conserved, intrinsically unstructured N-terminal moiety that provides several additional functions required for replication. The N-terminal moiety is responsible for binding to newly synthesized free N(0) (nucleoprotein that has not yet bound RNA), in order to prevent the binding of N(0) to cellular RNA. The C-terminal moiety consists of an oligomerisation domain, an N-RNA (nucleoprotein-RNA)-binding domain and an L polymerase-binding domain [, ]. ; GO: 0003723 RNA binding, 0003968 RNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent, 0019079 viral genome replication; PDB: 1R4G_A 1EZJ_A.
Probab=21.57 E-value=4.7e+02 Score=21.61 Aligned_cols=49 Identities=12% Similarity=0.194 Sum_probs=38.4
Q ss_pred HHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHh
Q psy460 27 TIAEGLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQ 75 (117)
Q Consensus 27 ~La~GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q 75 (117)
....-|.||+-.+=..+|.+|...+.-=+.|++.+....+--+||.|.|
T Consensus 48 ~~~~~l~Gl~i~v~~~~~nk~~q~q~~l~~ik~~~~~~~e~hrR~~E~Q 96 (248)
T PF01806_consen 48 SMIDFLCGLMISVSMDHDNKLNQIQQELKQIKEDLKKMDESHRRFIENQ 96 (248)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455688999999999999999988877777777777776667777654
No 246
>PRK05724 acetyl-CoA carboxylase carboxyltransferase subunit alpha; Validated
Probab=21.51 E-value=1.9e+02 Score=24.57 Aligned_cols=39 Identities=28% Similarity=0.503 Sum_probs=29.4
Q ss_pred HhhhHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHhhHH
Q psy460 34 GLIKPTVDLLDEKFRATRAS----QLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 34 ~lL~P~I~~lD~~V~~tr~S----Q~eL~~qId~L~~eL~~l~e 73 (117)
.+=+| |.+|+.++...+.. ...+..+|.+|...+.++.+
T Consensus 7 ~fe~~-i~~l~~~i~~l~~~~~~~~~~~~~~~~~l~~~~~~~~~ 49 (319)
T PRK05724 7 DFEKP-IAELEAKIEELRAVAEDSDVDLSEEIERLEKKLEELTK 49 (319)
T ss_pred hhhhH-HHHHHHHHHHHHhhhccccccHHHHHHHHHHHHHHHHH
Confidence 34455 58899999988773 45688999999888887754
No 247
>PF10224 DUF2205: Predicted coiled-coil protein (DUF2205); InterPro: IPR019357 This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown.
Probab=21.48 E-value=2.8e+02 Score=19.01 Aligned_cols=48 Identities=19% Similarity=0.284 Sum_probs=32.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHh
Q psy460 41 DLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDA 91 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~a 91 (117)
++|-.+|.+.|.+=..|-..||..-+|=.+|..+ .-+ |-.||.-|.+.
T Consensus 19 ~~Li~ei~~LQ~sL~~L~~Rve~Vk~E~~kL~~E--N~~-Lq~YI~nLm~~ 66 (80)
T PF10224_consen 19 EELIQEILELQDSLEALSDRVEEVKEENEKLESE--NEY-LQQYIGNLMSS 66 (80)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHH-HHHHHHHHHHh
Confidence 3555677777777777778888887777777654 222 67888877553
No 248
>PF03955 Adeno_PIX: Adenovirus hexon-associated protein (IX); InterPro: IPR005641 Hexon (IPR000736 from INTERPRO) is the major coat protein from adenovirus type 2. Hexon forms a homo-trimer, 240 copies of which are present in the capsid, organised so that 12 lie on each of the 20 facets of this structure. The central 9 hexons in a facet are cemented together by 12 copies of protein IX []. Protein IX is not neccessarily required for viral replication, but has been shown to affect several processes including DNA-packaging capacity, thermostability, and the transcriptional activity of several promoters. For more information see [].; GO: 0031423 hexon binding, 0044423 virion part; PDB: 3IYN_T.
Probab=21.44 E-value=1.2e+02 Score=22.24 Aligned_cols=34 Identities=32% Similarity=0.470 Sum_probs=19.1
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy460 31 GLLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKI 71 (117)
Q Consensus 31 GL~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l 71 (117)
|+++=|+.-+..+++++... ..++++|.++|..+
T Consensus 76 ~~~~~l~~~~~~~~~~l~~l-------~a~Le~l~~~L~~~ 109 (109)
T PF03955_consen 76 GSYSELKANLTALEDKLTAL-------LAQLEALKQQLAEL 109 (109)
T ss_dssp ---SSTTSTHHHHHHHHHHH-------HHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHhcC
Confidence 45555666666665555544 44778888777653
No 249
>PF07061 Swi5: Swi5; InterPro: IPR010760 This entry represents Swi5 and is involved in meiotic DNA repair synthesis and meiotic joint molecule formation []. It is known to interact with Swi2, Rhp51 and Swi6 [].
Probab=21.39 E-value=2.8e+02 Score=18.93 Aligned_cols=10 Identities=10% Similarity=0.155 Sum_probs=5.6
Q ss_pred hhHHHHHHHH
Q psy460 81 LDSYTRKLVD 90 (117)
Q Consensus 81 Ld~yvkKL~~ 90 (117)
+..|+++|-.
T Consensus 38 v~~hI~lLhe 47 (83)
T PF07061_consen 38 VKRHIKLLHE 47 (83)
T ss_pred HHHHHHHHHH
Confidence 4566666543
No 250
>PRK01156 chromosome segregation protein; Provisional
Probab=21.36 E-value=6.1e+02 Score=23.48 Aligned_cols=79 Identities=14% Similarity=0.187 Sum_probs=0.0
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH---------------HHhcCCCChhHHHHHHHHhhHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS---------------AEQQTAVPLDSYTRKLVDAQQKISIVGN 100 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~---------------e~q~~p~dLd~yvkKL~~ak~RV~~vnn 100 (117)
+.+.++.+ ++......-.++..++..+..++..+. +.....-++....+++...+.++..+..
T Consensus 573 ~~~~~~~~--~~~~~~~~l~e~~~~l~~l~~~l~~le~~~~~~~~~~~~~~~~le~~~~~le~~~~~l~~~~~~i~~~~~ 650 (895)
T PRK01156 573 ALAVISLI--DIETNRSRSNEIKKQLNDLESRLQEIEIGFPDDKSYIDKSIREIENEANNLNNKYNEIQENKILIEKLRG 650 (895)
T ss_pred HHhHHHHh--hHHHHHHHHHHHHHHHHHHHhhHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHHHHHHHHhhhhhc
Q psy460 101 ILQTTQVAMHKKFFMG 116 (117)
Q Consensus 101 iLq~~Q~RL~rl~~~~ 116 (117)
-+..+..++..+..+.
T Consensus 651 ~i~~l~~~i~~l~~~~ 666 (895)
T PRK01156 651 KIDNYKKQIAEIDSII 666 (895)
T ss_pred HHHHHHHHHHHHHhhh
No 251
>PF07568 HisKA_2: Histidine kinase; InterPro: IPR011495 Two-component signal transduction systems enable bacteria to sense, respond, and adapt to a wide range of environments, stressors, and growth conditions []. Some bacteria can contain up to as many as 200 two-component systems that need tight regulation to prevent unwanted cross-talk []. These pathways have been adapted to response to a wide variety of stimuli, including nutrients, cellular redox state, changes in osmolarity, quorum signals, antibiotics, and more []. Two-component systems are comprised of a sensor histidine kinase (HK) and its cognate response regulator (RR) []. The HK catalyses its own auto-phosphorylation followed by the transfer of the phosphoryl group to the receiver domain on RR; phosphorylation of the RR usually activates an attached output domain, which can then effect changes in cellular physiology, often by regulating gene expression. Some HK are bifunctional, catalysing both the phosphorylation and dephosphorylation of their cognate RR. The input stimuli can regulate either the kinase or phosphatase activity of the bifunctional HK. A variant of the two-component system is the phospho-relay system. Here a hybrid HK auto-phosphorylates and then transfers the phosphoryl group to an internal receiver domain, rather than to a separate RR protein. The phosphoryl group is then shuttled to histidine phosphotransferase (HPT) and subsequently to a terminal RR, which can evoke the desired response [, ]. Signal transducing histidine kinases are the key elements in two-component signal transduction systems, which control complex processes such as the initiation of development in microorganisms [, ]. Examples of histidine kinases are EnvZ, which plays a central role in osmoregulation [], and CheA, which plays a central role in the chemotaxis system []. Histidine kinases usually have an N-terminal ligand-binding domain and a C-terminal kinase domain, but other domains may also be present. The kinase domain is responsible for the autophosphorylation of the histidine with ATP, the phosphotransfer from the kinase to an aspartate of the response regulator, and (with bifunctional enzymes) the phosphotransfer from aspartyl phosphate back to ADP or to water []. The kinase core has a unique fold, distinct from that of the Ser/Thr/Tyr kinase superfamily. HKs can be roughly divided into two classes: orthodox and hybrid kinases [, ]. Most orthodox HKs, typified by the Escherichia coli EnvZ protein, function as periplasmic membrane receptors and have a signal peptide and transmembrane segment(s) that separate the protein into a periplasmic N-terminal sensing domain and a highly conserved cytoplasmic C-terminal kinase core. Members of this family, however, have an integral membrane sensor domain. Not all orthodox kinases are membrane bound, e.g., the nitrogen regulatory kinase NtrB (GlnL) is a soluble cytoplasmic HK []. Hybrid kinases contain multiple phosphodonor and phosphoacceptor sites and use multi-step phospho-relay schemes instead of promoting a single phosphoryl transfer. In addition to the sensor domain and kinase core, they contain a CheY-like receiver domain and a His-containing phosphotransfer (HPt) domain. This is the dimerisation and phosphoacceptor domain of a subfamily of histidine kinases. It shares sequence similarity with IPR003661 from INTERPRO and IPR011102 from INTERPRO. It is usually found adjacent to a C-terminal ATPase domain (IPR003594 from INTERPRO). This domain is found in a wide range of bacteria and also several archaea.
Probab=21.25 E-value=2.5e+02 Score=18.25 Aligned_cols=37 Identities=16% Similarity=0.294 Sum_probs=24.3
Q ss_pred HHHHHHHHHHHHHHHhhHHHh-cCCCChhHHHHHHHHh
Q psy460 55 LELRLQIEALTSQLDKISAEQ-QTAVPLDSYTRKLVDA 91 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l~e~q-~~p~dLd~yvkKL~~a 91 (117)
.+....|.+++.-=+.|.... ...+|+.+|+.+|...
T Consensus 34 ~~~~~RI~aia~vh~~L~~~~~~~~v~l~~yl~~L~~~ 71 (76)
T PF07568_consen 34 EDAQNRIQAIALVHEQLYQSEDLSEVDLREYLEELCED 71 (76)
T ss_pred HHHHHHHHHHHHHHHHHhcCCCCCeecHHHHHHHHHHH
Confidence 344556666665555555433 3688999999999763
No 252
>PRK05771 V-type ATP synthase subunit I; Validated
Probab=21.19 E-value=3.3e+02 Score=24.47 Aligned_cols=36 Identities=11% Similarity=0.317 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHH
Q psy460 39 TVDLLDEKFRATRASQLELRLQIEALTSQLDKISAE 74 (117)
Q Consensus 39 ~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~ 74 (117)
.++++.+++.+...--.+|+++++++.+++..+..-
T Consensus 94 ~~~~~~~~i~~l~~~~~~L~~~~~~l~~~~~~l~~~ 129 (646)
T PRK05771 94 ELEKIEKEIKELEEEISELENEIKELEQEIERLEPW 129 (646)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh
Confidence 345566667777777777777777777777766544
No 253
>PF06698 DUF1192: Protein of unknown function (DUF1192); InterPro: IPR009579 This family consists of several short, hypothetical, bacterial proteins of around 60 residues in length. The function of this family is unknown.
Probab=21.18 E-value=1.4e+02 Score=19.42 Aligned_cols=20 Identities=25% Similarity=0.373 Sum_probs=9.7
Q ss_pred HHHHHHHHHHHHHHHHHhhH
Q psy460 53 SQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 53 SQ~eL~~qId~L~~eL~~l~ 72 (117)
|..+|.+.|..|..|..++.
T Consensus 22 Sv~EL~~RIa~L~aEI~R~~ 41 (59)
T PF06698_consen 22 SVEELEERIALLEAEIARLE 41 (59)
T ss_pred CHHHHHHHHHHHHHHHHHHH
Confidence 44444445555555555443
No 254
>PF07083 DUF1351: Protein of unknown function (DUF1351); InterPro: IPR009785 This entry is represented by Lactobacillus prophage Lj928, Orf309. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several bacterial and phage proteins of around 230 residues in length. The function of this family is unknown.
Probab=21.16 E-value=4.1e+02 Score=20.77 Aligned_cols=49 Identities=18% Similarity=0.344 Sum_probs=34.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVD 90 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ 90 (117)
++.+++++++.|.....-+.++...+..-..+ .. ..-++.+||++-|-.
T Consensus 146 ~kk~~eei~~~i~~~~~~~~~~~~~~~~i~~~----A~--~~~l~~~~yi~~l~~ 194 (215)
T PF07083_consen 146 LKKIEEEIDDQIDKIKQDLEEIKAAKQAIEEK----AE--EYGLPADPYIRMLDY 194 (215)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HH--HcCCCcHHHHHHHHc
Confidence 45588999999988888777777766655443 22 256677899955543
No 255
>PF03148 Tektin: Tektin family; InterPro: IPR000435 Tektin heteropolymers form unique protofilaments of flagellar microtubules []. The proteins are predicted to form extended rods composed of 2 alpha- helical segments (~180 residues long) capable of forming coiled coils, interrupted by non-helical linkers []. The 2 segments are similar in sequence, indicating a gene duplication event. Along each tektin rod, cysteine residues occur with a periodicity of ~8nm, coincident with the axial repeat of tubulin dimers in microtubules []. It is proposed that the assembly of tektin heteropolymers produces filaments with repeats of 8, 16, 24, 32, 40, 48 and 96nm, generating the basis for the complex spatial arrangements of axonemal components [].; GO: 0000226 microtubule cytoskeleton organization, 0005874 microtubule
Probab=21.07 E-value=2.5e+02 Score=23.87 Aligned_cols=33 Identities=24% Similarity=0.375 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 41 DLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 41 ~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
.+|..+|.++..-..+|...|+.+..|.+.|..
T Consensus 46 ~~L~~Ri~di~~wk~eL~~~l~~~~~Ei~~L~~ 78 (384)
T PF03148_consen 46 KRLRQRIRDIRFWKNELERELEELDEEIDLLEE 78 (384)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 256
>TIGR03761 ICE_PFL4669 integrating conjugative element protein, PFL_4669 family. Members of this protein family, such as PFL4669, are found in integrating conjugative elements (ICE) of the PFGI-1 class as in Pseudomonas fluorescens.
Probab=21.06 E-value=2.8e+02 Score=22.25 Aligned_cols=52 Identities=21% Similarity=0.355 Sum_probs=32.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh------hHHHhc-CC-------------------CChhHHHHHHHHhhH
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDK------ISAEQQ-TA-------------------VPLDSYTRKLVDAQQ 93 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~------l~e~q~-~p-------------------~dLd~yvkKL~~ak~ 93 (117)
+-++|+++..++. +|.+.++++.+.|.. +.+.+. +| +|+|.+|+++..+.+
T Consensus 62 Ll~~E~~l~~~~~---~l~~~~~~l~~~l~~~p~~l~ls~~~s~~P~~i~L~~~splGy~~v~LL~d~D~l~r~~l~a~h 138 (216)
T TIGR03761 62 LLRIEEKLLSARQ---EMQALLQRLDDLLAQLPPALDLSENLSVSPLTVPLFFRSPLGYRAVYLLVDYDQLARRVLLAHH 138 (216)
T ss_pred HHHHHHHHHHHHH---HHHHHHHHHHHHHHhCCCccchhhccCCCCceeeeecCChHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4578888888877 566666666666665 333321 22 356777777777665
Q ss_pred H
Q psy460 94 K 94 (117)
Q Consensus 94 R 94 (117)
.
T Consensus 139 ~ 139 (216)
T TIGR03761 139 Y 139 (216)
T ss_pred H
Confidence 4
No 257
>TIGR03513 GldL_gliding gliding motility-associated protein GldL. This protein family, GldL, is named for the member from Flavobacterium johnsoniae, which is required for a type of rapid gliding motility found in certain members of the Bacteriodetes. However, members are found also in several members of the Bacteriodetes that appear not to be motile
Probab=20.95 E-value=4.4e+02 Score=21.12 Aligned_cols=48 Identities=17% Similarity=0.230 Sum_probs=33.2
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 49 ATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 49 ~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
+...++.+-.+|+..++..|..+|.. |-..|-++.+.+...|.+-.++
T Consensus 127 ~~~~~~~~Y~eqm~~aa~~l~~LN~~---------Ye~QL~~as~q~~~~~~i~~na 174 (202)
T TIGR03513 127 DSYAQQKKYIEQMSSLAANMEGLNTI---------YEAQLKGASSHADANNEIAINS 174 (202)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHH
Confidence 34566778888888888888888875 5555666666666666664443
No 258
>KOG4367|consensus
Probab=20.91 E-value=6.8e+02 Score=23.21 Aligned_cols=35 Identities=26% Similarity=0.236 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHH
Q psy460 57 LRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISI 97 (117)
Q Consensus 57 L~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~ 97 (117)
|-.|||.|..+|.+-.+. |..+|.|=.+-|.||..
T Consensus 313 l~~q~d~lid~l~~rk~q------ll~~v~~e~e~k~kv~r 347 (699)
T KOG4367|consen 313 LVAQCDALIDALNRRKAQ------LLARVNKEHEHKLKVVR 347 (699)
T ss_pred HHHHHHHHHHHHHHHHHH------HHHHhhhhhhhHHHHHH
Confidence 557899999999888774 77888887777777754
No 259
>PTZ00454 26S protease regulatory subunit 6B-like protein; Provisional
Probab=20.87 E-value=5.4e+02 Score=22.03 Aligned_cols=38 Identities=13% Similarity=0.284 Sum_probs=33.4
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISA 73 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e 73 (117)
++-.++.++.+.+....-+..++.++..+.+||.++..
T Consensus 27 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 64 (398)
T PTZ00454 27 LEKELEFLDIQEEYIKEEQKNLKRELIRAKEEVKRIQS 64 (398)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 45678899999999999999999999999999998844
No 260
>PF03670 UPF0184: Uncharacterised protein family (UPF0184); InterPro: IPR022788 This family of proteins has no known function.
Probab=20.85 E-value=2.6e+02 Score=19.48 Aligned_cols=14 Identities=29% Similarity=0.451 Sum_probs=5.8
Q ss_pred HHHHHHHHHHHHhh
Q psy460 58 RLQIEALTSQLDKI 71 (117)
Q Consensus 58 ~~qId~L~~eL~~l 71 (117)
+.++|.|...|+.|
T Consensus 32 ns~LD~Lns~LD~L 45 (83)
T PF03670_consen 32 NSMLDQLNSCLDHL 45 (83)
T ss_pred HHHHHHHHHHHHHH
Confidence 33444444444444
No 261
>PF04375 HemX: HemX; InterPro: IPR007470 The majority of proteins in this family are annotated as uroporphyrin-III C-methyltransferase (2.1.1.107 from EC) []; however, there is no direct evidence to support this annotation for these proteins, which come from mainly pathogenic Gram-negative organisms. There is some evidence to suggest that the proteins are membrane anchored as they have a predicted N-terminal signal peptide and transmembrane domain and may be involved in haem transport [].
Probab=20.76 E-value=5.2e+02 Score=21.79 Aligned_cols=24 Identities=17% Similarity=0.131 Sum_probs=13.4
Q ss_pred HHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 89 VDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 89 ~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
....+++..+-.-+...+.|+.++
T Consensus 96 ~~l~~~l~~~~~~l~~l~~~~~~l 119 (372)
T PF04375_consen 96 QQLQQELAQLQQQLAELQQQLAAL 119 (372)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334445555555666666666554
No 262
>cd07638 BAR_ACAP2 The Bin/Amphiphysin/Rvs (BAR) domain of ArfGAP with Coiled-coil, ANK repeat and PH domain containing protein 2. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. ACAP2 (ArfGAP with Coiled-coil, ANK repeat and PH domain containing protein 2), also called centaurin beta-2, is an Arf6-specific GTPase activating protein (GAP) which mediates Arf6 signaling. Arf6 is involved in the regulation of endocytosis, phagocytosis, cell adhesion and migration. ACAP2 contains an N-terminal BAR domain, followed by a Pleckstrin homology (PH) domain, an Arf GAP domain, and C-terminal ankyrin (ANK) repeats. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=20.66 E-value=4.3e+02 Score=20.83 Aligned_cols=78 Identities=18% Similarity=0.227 Sum_probs=57.8
Q ss_pred HHHHHHHHHh---------hhHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHH
Q psy460 26 DTIAEGLLGL---------IKPTVDLLDEKFRATRASQLELRLQIE-ALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKI 95 (117)
Q Consensus 26 d~La~GL~~l---------L~P~I~~lD~~V~~tr~SQ~eL~~qId-~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV 95 (117)
..|+.||..+ +.|++.+|=.-++..-..+..|..|++ .|..-|.++.+. -+++.-++|++-
T Consensus 40 ~~F~~~l~d~~~~~~~De~i~~~l~kF~~~l~ei~~~~~~L~~q~~~~l~~~L~~F~k~---------dl~~vke~kk~F 110 (200)
T cd07638 40 KQFMNGIRDLAQYSSKDAVIETSLTKFSDTLQEMINYHTILFDQAQRSIKAQLQTFVKE---------DLRKFKDAKKQF 110 (200)
T ss_pred HHHHHHHHHHHHhCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HhHHHHHHHHHH
Confidence 4778888777 789999999999999999999999995 667777777654 245566777777
Q ss_pred HHHHHHHHHHHHHHHhh
Q psy460 96 SIVGNILQTTQVAMHKK 112 (117)
Q Consensus 96 ~~vnniLq~~Q~RL~rl 112 (117)
-+++.-.-.+..|-.++
T Consensus 111 dK~s~~~~~aL~K~~~~ 127 (200)
T cd07638 111 DKVSEEKENALVKNAQV 127 (200)
T ss_pred HHHhHHHHHHHHHhccC
Confidence 77776666665555444
No 263
>PF10046 BLOC1_2: Biogenesis of lysosome-related organelles complex-1 subunit 2 ; InterPro: IPR019269 This entry represents a family of proteins that play a role in cellular proliferation, as well as in the biogenesis of specialised organelles of the endosomal-lysosomal system [].
Probab=20.66 E-value=2.9e+02 Score=18.94 Aligned_cols=25 Identities=8% Similarity=0.287 Sum_probs=16.3
Q ss_pred hhHHHHHHHHhhHHHHHHHHHHHHH
Q psy460 81 LDSYTRKLVDAQQKISIVGNILQTT 105 (117)
Q Consensus 81 Ld~yvkKL~~ak~RV~~vnniLq~~ 105 (117)
|.||+++|...-..|..+=.+....
T Consensus 61 l~~~l~~Id~Ie~~V~~LE~~v~~L 85 (99)
T PF10046_consen 61 LQPYLQQIDQIEEQVTELEQTVYEL 85 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6777777777777776655544433
No 264
>PRK06285 chorismate mutase; Provisional
Probab=20.65 E-value=2.9e+02 Score=18.80 Aligned_cols=17 Identities=24% Similarity=0.436 Sum_probs=8.2
Q ss_pred HHHHHHHHHHHHHHHhh
Q psy460 55 LELRLQIEALTSQLDKI 71 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l 71 (117)
.+||.+||.+..+|-.|
T Consensus 10 ~elR~~ID~ID~~iv~L 26 (96)
T PRK06285 10 NEIRKRIDEIDEQIIDL 26 (96)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34555555554444433
No 265
>PF00458 WHEP-TRS: WHEP-TRS domain; InterPro: IPR000738 A conserved domain of 46 amino acids, called WHEP-TRS has been shown [] to exist in a number of higher eukaryote aminoacyl-transfer RNA synthetases. This domain is present one to six times in the several enzymes. There are three copies in mammalian multifunctional aminoacyl-tRNA synthetase in a region that separates the N-terminal glutamyl-tRNA synthetase domain from the C-terminal prolyl-tRNA synthetase domain, and six copies in the intercatalytic region of the Drosophila enzyme. The domain is found at the N-terminal extremity of the mammalian tryptophanyl- tRNA synthetase and histidyl-tRNA synthetase, and the mammalian, insect, nematode and plant glycyl- tRNA synthetases []. This domain could contain a central alpha-helical region and may play a role in the association of tRNA-synthetases into multienzyme complexes.; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding, 0006418 tRNA aminoacylation for protein translation; PDB: 1R1B_A 1D2D_A 1X59_A 1FYJ_A 2ZT8_A 2ZXF_A 2ZT7_A 2ZT6_A 2PME_A 2ZT5_A ....
Probab=20.64 E-value=2.1e+02 Score=18.25 Aligned_cols=38 Identities=24% Similarity=0.364 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHH
Q psy460 58 RLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKI 95 (117)
Q Consensus 58 ~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV 95 (117)
..+|...-...++|.......-++++-|.+|..-|...
T Consensus 2 ~~~I~~QG~~VR~LKa~ka~k~~i~~aV~~Ll~LK~~~ 39 (56)
T PF00458_consen 2 EAQIAAQGDKVRKLKAEKADKEEIDAAVAKLLELKAEL 39 (56)
T ss_dssp HHHHHHHHHHHHHHHHTT--HHHHHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHccCCcHHHHHHHHHHHHHHHHHH
Confidence 34444445555555443222224566666666655544
No 266
>PRK13182 racA polar chromosome segregation protein; Reviewed
Probab=20.61 E-value=4e+02 Score=20.48 Aligned_cols=54 Identities=15% Similarity=0.193 Sum_probs=30.4
Q ss_pred HHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhh
Q psy460 52 ASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKKF 113 (117)
Q Consensus 52 ~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl~ 113 (117)
.=...+..+|+.|.+.|..+.+. -..+. |+.-|+-+--+..-|+....|+.++.
T Consensus 92 ~~~~~l~~ri~eLe~~l~~kad~-vvsYq-------ll~hr~e~ee~~~~l~~le~~~~~~e 145 (175)
T PRK13182 92 AQLNTITRRLDELERQLQQKADD-VVSYQ-------LLQHRREMEEMLERLQKLEARLKKLE 145 (175)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhh-hhhHH-------HHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344555666776666666653 23333 44445555566666666666666643
No 267
>PF08181 DegQ: DegQ (SacQ) family; InterPro: IPR012554 This family consists of the DegQ (formerly sacQ) regulatory peptides. The DegQ family of peptides control the rates of synthesis of a class of both secreted and intracellular degradative enzymes in Bacillus subtilis. DegQ is 46 amino acids long and activates the synthesis of degradative enzymes. The expression of this peptide was shown to be subjected both to catabolite repression and DegS-DegU-mediated control. Thus allowing an increase in the rate of synthesis of degQ under conditions of nitrogen starvation [].
Probab=20.55 E-value=2.3e+02 Score=17.67 Aligned_cols=30 Identities=20% Similarity=0.251 Sum_probs=20.1
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy460 36 IKPTVDLLDEKFRATRASQLELRLQIEALT 65 (117)
Q Consensus 36 L~P~I~~lD~~V~~tr~SQ~eL~~qId~L~ 65 (117)
++-.+=+|+..|+.|-.|=...+..||++.
T Consensus 9 lkqll~rle~eirett~sl~ninksidq~d 38 (46)
T PF08181_consen 9 LKQLLWRLENEIRETTDSLRNINKSIDQYD 38 (46)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHHHHh
Confidence 344555677777777777777777776653
No 268
>PF10392 COG5: Golgi transport complex subunit 5; InterPro: IPR019465 The conserved oligomeric Golgi (COG) complex is a peripheral membrane complex involved in intra-Golgi protein trafficking. Subunit 5 is located in the smaller, B lobe, together with subunits 6-8, and has been shown to bind subunits 1 and 7 [].
Probab=20.39 E-value=3.3e+02 Score=19.44 Aligned_cols=44 Identities=14% Similarity=0.178 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHH
Q psy460 53 SQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIV 98 (117)
Q Consensus 53 SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~v 98 (117)
....|...|+++..+|+.+-.. .+.+|-..+..+-.+..-+..|
T Consensus 34 ~l~kL~~~i~eld~~i~~~v~~--~~~~LL~q~~~~~~~~~~l~~v 77 (132)
T PF10392_consen 34 PLKKLNFDIQELDKRIRSQVTS--NHEDLLSQASSIEELESVLQAV 77 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh--CHHHHHHHHHhHHHHHHHHHHH
Confidence 3344444444444444444442 3334444444444443333333
No 269
>TIGR02338 gimC_beta prefoldin, beta subunit, archaeal. Chaperonins are cytosolic, ATP-dependent molecular chaperones, with a conserved toroidal architecture, that assist in the folding of nascent and/or denatured polypeptide chains. The group I chaperonin system consists of GroEL and GroES, and is found (usually) in bacteria and organelles of bacterial origin. The group II chaperonin system, called the thermosome in Archaea and TRiC or CCT in the Eukaryota, is structurally similar but only distantly related. Prefoldin, also called GimC, is a complex in Archaea and Eukaryota, that works with group II chaperonins. Members of this protein family are the archaeal clade of the beta class of prefoldin subunit. Closely related, but outside the scope of this family are the eukaryotic beta-class prefoldin subunits, Gim-1,3,4 and 6. The alpha class prefoldin subunits are more distantly related.
Probab=20.35 E-value=3e+02 Score=18.99 Aligned_cols=36 Identities=14% Similarity=0.160 Sum_probs=26.8
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH
Q psy460 37 KPTVDLLDEKFRATRASQLELRLQIEALTSQLDKIS 72 (117)
Q Consensus 37 ~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~ 72 (117)
.|-+..+=.+++.++.-...+..++..|..++.+..
T Consensus 2 ~~~~q~~~~~~q~~q~~~~~l~~q~~~le~~~~E~~ 37 (110)
T TIGR02338 2 PPQVQNQLAQLQQLQQQLQAVATQKQQVEAQLKEAE 37 (110)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467777777788888888888888888877666543
No 270
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=20.32 E-value=3.3e+02 Score=19.43 Aligned_cols=72 Identities=8% Similarity=0.119 Sum_probs=38.8
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChh-HHHHHHHHhhHHHHHHHHHHHHHHHHHHh
Q psy460 40 VDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLD-SYTRKLVDAQQKISIVGNILQTTQVAMHK 111 (117)
Q Consensus 40 I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd-~yvkKL~~ak~RV~~vnniLq~~Q~RL~r 111 (117)
+..+...+.....-...++..+...+...+.....+..-+-++ .-++.|...|.....+...+...+.....
T Consensus 12 ~~~~~~~~~~~~~~~~~~~~dl~~q~~~a~~Aq~~YE~El~~Ha~~~~~L~~lr~e~~~~~~~~~~l~~~~~~ 84 (132)
T PF07926_consen 12 LQRLKEQEEDAEEQLQSLREDLESQAKIAQEAQQKYERELVKHAEDIKELQQLREELQELQQEINELKAEAES 84 (132)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444444444444433332333333 45788888888888877777777666554
No 271
>PLN02678 seryl-tRNA synthetase
Probab=20.27 E-value=6e+02 Score=22.41 Aligned_cols=72 Identities=17% Similarity=0.211 Sum_probs=51.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCChhHHHHHHHHhhHHHHHHHHHHHHHHHHHHhh
Q psy460 38 PTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPLDSYTRKLVDAQQKISIVGNILQTTQVAMHKK 112 (117)
Q Consensus 38 P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dLd~yvkKL~~ak~RV~~vnniLq~~Q~RL~rl 112 (117)
..|-++|.+-+..+.-=.+|+.+.-.++.++..+.. ..-+....+.+....|..+..+...+..+++.+..+
T Consensus 33 d~il~ld~~~r~l~~~~e~lr~erN~~sk~I~~~k~---~~~~~~~l~~~~~~Lk~ei~~le~~~~~~~~~l~~~ 104 (448)
T PLN02678 33 DEVIALDKEWRQRQFELDSLRKEFNKLNKEVAKLKI---AKEDATELIAETKELKKEITEKEAEVQEAKAALDAK 104 (448)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445567777777766666677777777777766533 233456667777788899999999999998888765
No 272
>PF12495 Vip3A_N: Vegetative insecticide protein 3A N terminal ; InterPro: IPR022180 This family of proteins is found in bacteria. Proteins in this family are typically between 170 and 789 amino acids in length. The family is found in association with PF02018 from PFAM. Vip3A represents a novel class of proteins insecticidal to lepidopteran insect larvae.
Probab=20.23 E-value=4.1e+02 Score=20.47 Aligned_cols=71 Identities=21% Similarity=0.299 Sum_probs=46.2
Q ss_pred HHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhcCCCCh----hHHHHHHHHhhHHHHHHHHHHH
Q psy460 32 LLGLIKPTVDLLDEKFRATRASQLELRLQIEALTSQLDKISAEQQTAVPL----DSYTRKLVDAQQKISIVGNILQ 103 (117)
Q Consensus 32 L~~lL~P~I~~lD~~V~~tr~SQ~eL~~qId~L~~eL~~l~e~q~~p~dL----d~yvkKL~~ak~RV~~vnniLq 103 (117)
+-.+|..-+-++-+-.-.+-+--..|.-||+-|+++|.++++. -..+++ ....-.+--+-+|+.-||....
T Consensus 99 in~~l~~ylpkitsmls~vmkqny~lslqie~ls~qlqeisdk-ldiin~nvlinstlteitpayqrikyvn~kf~ 173 (177)
T PF12495_consen 99 INSMLNTYLPKITSMLSDVMKQNYVLSLQIEFLSKQLQEISDK-LDIINVNVLINSTLTEITPAYQRIKYVNDKFD 173 (177)
T ss_pred HHHHHHHHhHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhh-cceeeeeeeeccchhhcCHHHHHHHHHHHHHH
Confidence 3344444444555555666677788999999999999999874 233333 3444555556678877776543
No 273
>PRK06798 fliD flagellar capping protein; Validated
Probab=20.22 E-value=5.8e+02 Score=22.20 Aligned_cols=19 Identities=11% Similarity=0.244 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHhhHH
Q psy460 55 LELRLQIEALTSQLDKISA 73 (117)
Q Consensus 55 ~eL~~qId~L~~eL~~l~e 73 (117)
..|..+|+++..++.++.+
T Consensus 382 ~~l~~~i~~l~~~~~~~e~ 400 (440)
T PRK06798 382 KSIDNRVSKLDLKITDIDT 400 (440)
T ss_pred hHHHHHHHHHHHHHHHHHH
Confidence 3444555555555555443
No 274
>PF06298 PsbY: Photosystem II protein Y (PsbY); InterPro: IPR009388 Oxygenic photosynthesis uses two multi-subunit photosystems (I and II) located in the cell membranes of cyanobacteria and in the thylakoid membranes of chloroplasts in plants and algae. Photosystem II (PSII) has a P680 reaction centre containing chlorophyll 'a' that uses light energy to carry out the oxidation (splitting) of water molecules, and to produce ATP via a proton pump. Photosystem I (PSI) has a P700 reaction centre containing chlorophyll that takes the electron and associated hydrogen donated from PSII to reduce NADP+ to NADPH. Both ATP and NADPH are subsequently used in the light-independent reactions to convert carbon dioxide to glucose using the hydrogen atom extracted from water by PSII, releasing oxygen as a by-product. PSII is a multisubunit protein-pigment complex containing polypeptides both intrinsic and extrinsic to the photosynthetic membrane [, ]. Within the core of the complex, the chlorophyll and beta-carotene pigments are mainly bound to the antenna proteins CP43 (PsbC) and CP47 (PsbB), which pass the excitation energy on to the reaction centre proteins D1 (Qb, PsbA) and D2 (Qa, PsbD) that bind all the redox-active cofactors involved in the energy conversion process. The PSII oxygen-evolving complex (OEC) oxidises water to provide protons for use by PSI, and consists of OEE1 (PsbO), OEE2 (PsbP) and OEE3 (PsbQ). The remaining subunits in PSII are of low molecular weight (less than 10 kDa), and are involved in PSII assembly, stabilisation, dimerisation, and photo-protection []. This family represents the low molecular weight transmembrane protein PsbY found in PSII. In higher plants, two related PsbY proteins exist, PsbY-1 and PsbY-2, which appear to function as a heterodimer. In spinach and Arabidopsis, these two proteins arise from a single-copy nuclear gene that is processed in the chloroplast. By contrast, prokaryotic and organellar chromosomes encode a single PsbY protein, as found in cyanobacteria and red algae, indicating a duplication event in the evolution of higher plants []. PsbY has two low manganese-dependent activities: a catalase-like activity and an L-arginine metabolising activity that converts L-arginine into ornithine and urea []. In addition, a redox-active group is thought to be present in the protein. In cyanobacteria, PsbY deletion mutants have a slightly impaired PSII that is less capable of coping with low levels of calcium ions than the wild-type.; GO: 0030145 manganese ion binding, 0015979 photosynthesis, 0009523 photosystem II, 0016021 integral to membrane
Probab=20.20 E-value=1.6e+02 Score=17.66 Aligned_cols=18 Identities=22% Similarity=0.416 Sum_probs=14.3
Q ss_pred HHHH--HHHhhhHHHHHHHH
Q psy460 28 IAEG--LLGLIKPTVDLLDE 45 (117)
Q Consensus 28 La~G--L~~lL~P~I~~lD~ 45 (117)
+|-| ++...+|++.|++.
T Consensus 14 ~A~gWa~fNIg~~Al~Q~~~ 33 (36)
T PF06298_consen 14 PAAGWALFNIGRAALNQLQR 33 (36)
T ss_pred HHHHHHHHHhHHHHHHHHHH
Confidence 3445 89999999999875
No 275
>PRK07122 RNA polymerase sigma factor SigF; Reviewed
Probab=20.20 E-value=97 Score=24.63 Aligned_cols=24 Identities=25% Similarity=0.516 Sum_probs=19.6
Q ss_pred CCchHHHHHHHHHHhhhHHHHHHHH
Q psy460 21 DNPTQDTIAEGLLGLIKPTVDLLDE 45 (117)
Q Consensus 21 ~~p~rd~La~GL~~lL~P~I~~lD~ 45 (117)
.-+--|.+.+|.+||++ ++++||-
T Consensus 64 g~~~~DLiQeG~iGLi~-AierFDp 87 (264)
T PRK07122 64 GEPRDDLVQVARVGLVN-AVNRFDV 87 (264)
T ss_pred CCCHHHHHHHHHHHHHH-HHHHcCC
Confidence 34567999999999998 7888874
No 276
>PRK13689 hypothetical protein; Provisional
Probab=20.09 E-value=1.9e+02 Score=19.98 Aligned_cols=27 Identities=19% Similarity=0.230 Sum_probs=22.4
Q ss_pred HHHHHHHHHHHHhhHHHhcCCCChhHH
Q psy460 58 RLQIEALTSQLDKISAEQQTAVPLDSY 84 (117)
Q Consensus 58 ~~qId~L~~eL~~l~e~q~~p~dLd~y 84 (117)
.+|+|.|-.||-...|.-++|+||.--
T Consensus 9 d~qvE~il~el~~VLeKH~Ap~DLSLM 35 (75)
T PRK13689 9 DEQVEQLLAELLAVLEKHKAPTDLSLM 35 (75)
T ss_pred HHHHHHHHHHHHHHHHhcCCCccHHHH
Confidence 468999999999988877899998643
Done!