Query 018721
Match_columns 351
No_of_seqs 73 out of 75
Neff 3.7
Searched_HMMs 46136
Date Fri Mar 29 03:29:06 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/018721.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/018721hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF14802 TMEM192: TMEM192 fami 100.0 6.8E-60 1.5E-64 439.7 22.1 194 40-233 25-236 (236)
2 KOG0999 Microtubule-associated 94.4 0.79 1.7E-05 49.3 13.4 121 226-349 63-202 (772)
3 PF09730 BicD: Microtubule-ass 93.6 0.19 4.1E-06 54.9 7.3 101 242-345 11-125 (717)
4 PRK09039 hypothetical protein; 92.3 1.9 4.2E-05 43.0 11.8 56 285-340 123-181 (343)
5 PF14362 DUF4407: Domain of un 91.1 14 0.0003 35.5 15.8 29 73-101 46-75 (301)
6 TIGR01010 BexC_CtrB_KpsE polys 91.0 6.5 0.00014 38.6 13.7 122 213-338 170-306 (362)
7 TIGR03752 conj_TIGR03752 integ 88.8 1.9 4E-05 45.4 8.4 60 243-323 52-111 (472)
8 PRK09039 hypothetical protein; 88.0 14 0.00031 36.9 13.7 111 212-322 66-184 (343)
9 PRK11519 tyrosine kinase; Prov 84.3 32 0.0007 37.4 15.1 40 212-251 266-305 (719)
10 PF14802 TMEM192: TMEM192 fami 84.0 10 0.00022 36.5 10.1 33 207-239 203-235 (236)
11 COG2433 Uncharacterized conser 82.8 14 0.0003 40.4 11.4 101 207-338 423-530 (652)
12 PF06008 Laminin_I: Laminin Do 82.7 29 0.00063 32.9 12.5 99 220-319 94-201 (264)
13 TIGR03007 pepcterm_ChnLen poly 81.5 40 0.00086 34.4 13.7 37 212-248 160-196 (498)
14 PRK09841 cryptic autophosphory 81.4 47 0.001 36.2 15.0 120 214-338 268-398 (726)
15 PRK10884 SH3 domain-containing 78.3 20 0.00043 33.8 9.7 21 296-316 139-159 (206)
16 KOG0977 Nuclear envelope prote 77.7 60 0.0013 35.1 14.0 22 215-236 58-79 (546)
17 PF04859 DUF641: Plant protein 77.6 11 0.00023 33.6 7.3 79 228-319 53-131 (131)
18 PF15070 GOLGA2L5: Putative go 77.5 21 0.00046 38.8 10.8 118 218-335 20-147 (617)
19 PF07888 CALCOCO1: Calcium bin 76.3 74 0.0016 34.5 14.2 79 262-340 312-408 (546)
20 PF15035 Rootletin: Ciliary ro 73.8 76 0.0016 29.4 14.7 123 210-339 13-156 (182)
21 COG3074 Uncharacterized protei 72.8 11 0.00024 30.8 5.6 46 262-307 26-75 (79)
22 PF12325 TMF_TATA_bd: TATA ele 72.7 50 0.0011 28.9 10.0 59 251-309 20-81 (120)
23 PF13870 DUF4201: Domain of un 72.7 46 0.001 29.7 10.2 26 294-319 150-175 (177)
24 PF05622 HOOK: HOOK protein; 72.5 7.8 0.00017 42.0 6.1 122 218-339 205-340 (713)
25 PRK10361 DNA recombination pro 71.5 29 0.00064 36.8 9.8 68 271-338 39-109 (475)
26 TIGR02168 SMC_prok_B chromosom 70.0 1.4E+02 0.0031 32.7 15.0 22 298-319 882-903 (1179)
27 PF00038 Filament: Intermediat 69.6 1.1E+02 0.0023 29.3 13.4 68 215-287 20-87 (312)
28 TIGR01005 eps_transp_fam exopo 69.4 77 0.0017 34.3 12.7 35 214-248 195-229 (754)
29 KOG0249 LAR-interacting protei 69.1 54 0.0012 37.0 11.4 94 214-309 164-257 (916)
30 KOG0249 LAR-interacting protei 66.9 62 0.0014 36.5 11.3 75 208-284 100-186 (916)
31 TIGR00219 mreC rod shape-deter 66.3 12 0.00027 36.4 5.5 42 263-307 68-109 (283)
32 COG1579 Zn-ribbon protein, pos 65.3 1.4E+02 0.0031 29.1 12.7 32 256-287 84-115 (239)
33 KOG0250 DNA repair protein RAD 62.8 3.1E+02 0.0068 32.3 16.4 109 166-289 607-724 (1074)
34 PLN03188 kinesin-12 family pro 62.6 68 0.0015 38.1 11.1 31 301-331 1070-1100(1320)
35 TIGR03495 phage_LysB phage lys 61.3 86 0.0019 28.1 9.4 73 260-332 18-97 (135)
36 PF09730 BicD: Microtubule-ass 60.5 1.5E+02 0.0033 33.2 13.0 76 260-338 358-433 (717)
37 PF02050 FliJ: Flagellar FliJ 59.8 82 0.0018 24.6 12.1 66 219-284 11-82 (123)
38 PF03268 DUF267: Caenorhabditi 59.7 1.1E+02 0.0023 31.6 10.9 255 28-300 12-316 (353)
39 KOG1962 B-cell receptor-associ 59.4 1.7E+02 0.0038 28.2 17.7 106 207-334 84-189 (216)
40 PF09304 Cortex-I_coil: Cortex 59.2 12 0.00026 32.5 3.5 52 268-319 9-60 (107)
41 PRK11637 AmiB activator; Provi 58.4 1.8E+02 0.0038 29.6 12.3 25 256-280 91-115 (428)
42 PF09726 Macoilin: Transmembra 57.8 62 0.0014 35.8 9.5 51 74-134 119-169 (697)
43 KOG0993 Rab5 GTPase effector R 56.2 2.6E+02 0.0055 30.0 13.0 84 207-292 289-387 (542)
44 PF06785 UPF0242: Uncharacteri 55.9 92 0.002 32.3 9.6 77 253-329 91-181 (401)
45 PRK13922 rod shape-determining 55.2 43 0.00094 31.7 7.0 38 266-307 74-111 (276)
46 PRK02224 chromosome segregatio 54.6 2.7E+02 0.0059 30.6 13.7 57 265-321 513-569 (880)
47 PF04111 APG6: Autophagy prote 54.4 1E+02 0.0023 30.6 9.7 25 216-240 12-36 (314)
48 PRK10884 SH3 domain-containing 54.3 88 0.0019 29.5 8.7 82 207-290 80-161 (206)
49 KOG0804 Cytoplasmic Zn-finger 53.6 2.1E+02 0.0046 30.7 12.0 31 296-326 403-433 (493)
50 TIGR02231 conserved hypothetic 53.1 92 0.002 32.4 9.5 25 260-284 77-101 (525)
51 COG1792 MreC Cell shape-determ 53.1 20 0.00044 35.0 4.5 37 258-308 73-109 (284)
52 PF13870 DUF4201: Domain of un 51.9 1.8E+02 0.0038 26.0 15.2 123 210-340 3-135 (177)
53 PF07111 HCR: Alpha helical co 51.5 2.7E+02 0.0058 31.4 12.8 80 260-339 477-564 (739)
54 PF14817 HAUS5: HAUS augmin-li 51.5 2E+02 0.0043 31.7 11.9 120 207-335 31-156 (632)
55 TIGR01843 type_I_hlyD type I s 51.4 2.4E+02 0.0052 27.4 12.9 38 243-284 189-226 (423)
56 PF02403 Seryl_tRNA_N: Seryl-t 50.7 1.2E+02 0.0026 24.8 8.0 65 251-320 27-91 (108)
57 PF04849 HAP1_N: HAP1 N-termin 49.7 1.6E+02 0.0034 29.8 10.1 37 304-340 207-243 (306)
58 KOG0980 Actin-binding protein 49.6 93 0.002 35.7 9.2 63 257-319 329-409 (980)
59 COG2433 Uncharacterized conser 49.4 1.4E+02 0.003 33.1 10.3 73 262-338 437-509 (652)
60 TIGR03017 EpsF chain length de 49.2 2.9E+02 0.0062 27.7 15.2 36 213-248 171-206 (444)
61 PF11368 DUF3169: Protein of u 49.2 2.4E+02 0.0051 26.7 13.1 46 83-128 63-112 (248)
62 TIGR02808 short_TIGR02808 cons 49.2 13 0.00028 27.5 1.8 21 46-66 19-39 (42)
63 KOG0977 Nuclear envelope prote 48.1 79 0.0017 34.3 8.2 74 263-336 58-132 (546)
64 PF06638 Strabismus: Strabismu 47.8 39 0.00084 36.2 5.8 27 194-220 238-267 (505)
65 PF04156 IncA: IncA protein; 47.1 2.1E+02 0.0045 25.5 10.7 12 225-236 100-111 (191)
66 cd08915 V_Alix_like Protein-in 47.0 2.9E+02 0.0063 27.1 14.0 90 248-338 57-161 (342)
67 TIGR02894 DNA_bind_RsfA transc 46.8 69 0.0015 29.7 6.6 53 284-336 89-144 (161)
68 PF09574 DUF2374: Protein of 45.9 17 0.00038 26.8 2.1 21 46-66 19-39 (42)
69 PF09755 DUF2046: Uncharacteri 44.9 3.6E+02 0.0077 27.5 12.1 80 251-337 95-200 (310)
70 TIGR03007 pepcterm_ChnLen poly 44.8 3E+02 0.0065 28.1 11.5 33 305-337 350-382 (498)
71 PRK11637 AmiB activator; Provi 44.5 3.6E+02 0.0077 27.4 12.9 88 207-307 164-251 (428)
72 COG1196 Smc Chromosome segrega 43.6 4.7E+02 0.01 30.4 13.9 27 294-320 826-852 (1163)
73 PF06156 DUF972: Protein of un 42.6 1.2E+02 0.0026 26.0 7.0 44 266-309 13-56 (107)
74 PHA02562 46 endonuclease subun 42.1 4E+02 0.0088 27.4 13.3 26 215-240 257-282 (562)
75 PF10186 Atg14: UV radiation r 41.9 2.9E+02 0.0063 25.6 13.3 120 207-333 22-156 (302)
76 PF07926 TPR_MLP1_2: TPR/MLP1/ 41.7 1.6E+02 0.0034 25.4 7.8 66 275-340 3-68 (132)
77 PF08618 Opi1: Transcription f 41.6 60 0.0013 34.1 6.0 30 213-242 235-264 (427)
78 PF08581 Tup_N: Tup N-terminal 41.1 1.7E+02 0.0037 23.9 7.3 23 285-307 53-75 (79)
79 PF06156 DUF972: Protein of un 41.0 1E+02 0.0022 26.4 6.3 53 264-316 4-56 (107)
80 PF04156 IncA: IncA protein; 41.0 2.6E+02 0.0056 24.8 18.7 18 221-238 89-106 (191)
81 PF07099 DUF1361: Protein of u 40.6 58 0.0013 29.5 5.1 31 153-183 109-141 (168)
82 KOG0161 Myosin class II heavy 40.3 5.7E+02 0.012 32.1 14.3 41 222-268 1430-1470(1930)
83 PF08614 ATG16: Autophagy prot 40.1 1.2E+02 0.0026 27.7 7.1 73 262-334 96-168 (194)
84 TIGR03185 DNA_S_dndD DNA sulfu 40.1 1.6E+02 0.0036 31.6 9.1 27 293-319 259-285 (650)
85 PF12128 DUF3584: Protein of u 40.0 4.4E+02 0.0096 30.8 13.1 72 251-322 466-537 (1201)
86 PF07798 DUF1640: Protein of u 40.0 2.8E+02 0.0061 25.0 9.5 39 297-338 114-152 (177)
87 PF09726 Macoilin: Transmembra 39.9 1.7E+02 0.0036 32.6 9.3 72 267-338 551-629 (697)
88 COG3524 KpsE Capsule polysacch 39.6 2.2E+02 0.0048 29.4 9.3 104 213-317 179-292 (372)
89 PF15619 Lebercilin: Ciliary p 39.5 3.2E+02 0.007 25.5 14.1 83 232-320 45-149 (194)
90 PHA02562 46 endonuclease subun 39.2 4.5E+02 0.0097 27.0 11.9 90 214-309 300-389 (562)
91 COG4942 Membrane-bound metallo 39.0 5E+02 0.011 27.5 14.9 110 230-339 115-239 (420)
92 TIGR01843 type_I_hlyD type I s 38.6 3.8E+02 0.0082 26.1 13.6 19 265-283 155-173 (423)
93 PF00769 ERM: Ezrin/radixin/mo 38.6 2.2E+02 0.0048 27.3 8.9 76 262-337 41-116 (246)
94 KOG4673 Transcription factor T 38.1 5.5E+02 0.012 29.5 12.6 130 207-339 465-629 (961)
95 PF05010 TACC: Transforming ac 37.2 3.8E+02 0.0081 25.6 12.4 69 258-335 132-200 (207)
96 PF10168 Nup88: Nuclear pore c 37.1 2.4E+02 0.0051 31.5 9.9 63 277-339 556-622 (717)
97 PF07856 Orai-1: Mediator of C 37.1 1.4E+02 0.003 27.7 7.0 20 156-175 146-165 (175)
98 KOG0288 WD40 repeat protein Ti 37.0 3.1E+02 0.0068 29.3 10.2 57 256-314 50-106 (459)
99 PF11003 DUF2842: Protein of u 36.8 1.1E+02 0.0023 24.0 5.3 25 37-61 1-26 (62)
100 PF10174 Cast: RIM-binding pro 36.5 6.9E+02 0.015 28.4 14.0 74 267-340 435-509 (775)
101 PF09486 HrpB7: Bacterial type 36.5 3.4E+02 0.0074 24.9 11.3 56 261-323 86-141 (158)
102 PF14197 Cep57_CLD_2: Centroso 36.4 2.2E+02 0.0047 22.6 8.2 41 260-307 25-65 (69)
103 PF06548 Kinesin-related: Kine 36.1 5.9E+02 0.013 27.5 12.3 35 299-333 298-332 (488)
104 PF04977 DivIC: Septum formati 36.0 96 0.0021 23.4 5.0 16 293-308 35-50 (80)
105 PF10186 Atg14: UV radiation r 35.7 3.6E+02 0.0079 25.0 12.6 47 262-308 57-103 (302)
106 PF00038 Filament: Intermediat 35.6 4E+02 0.0086 25.4 11.5 53 285-337 198-250 (312)
107 PF05667 DUF812: Protein of un 35.5 1.7E+02 0.0038 31.8 8.5 82 256-337 323-421 (594)
108 PF05961 Chordopox_A13L: Chord 35.5 60 0.0013 26.3 3.8 47 152-204 6-52 (68)
109 PRK15178 Vi polysaccharide exp 35.3 3.4E+02 0.0073 28.7 10.3 88 210-301 239-337 (434)
110 PRK14127 cell division protein 35.3 1E+02 0.0022 26.8 5.5 45 294-338 49-99 (109)
111 PF10212 TTKRSYEDQ: Predicted 35.3 3.9E+02 0.0085 29.0 10.9 248 71-325 167-484 (518)
112 PF01166 TSC22: TSC-22/dip/bun 35.1 36 0.00078 26.9 2.4 32 273-304 12-43 (59)
113 PF09971 DUF2206: Predicted me 35.1 5.1E+02 0.011 26.5 12.9 83 76-176 121-205 (367)
114 PF07888 CALCOCO1: Calcium bin 35.1 3.8E+02 0.0081 29.3 10.7 96 207-318 133-228 (546)
115 TIGR00870 trp transient-recept 34.9 2.9E+02 0.0063 29.8 10.0 23 39-61 355-377 (743)
116 KOG4403 Cell surface glycoprot 34.4 2E+02 0.0044 30.9 8.4 69 216-284 238-325 (575)
117 PF15254 CCDC14: Coiled-coil d 34.2 5.3E+02 0.011 29.7 11.9 114 215-328 389-526 (861)
118 PF08614 ATG16: Autophagy prot 34.0 2.5E+02 0.0053 25.6 8.1 19 223-241 91-109 (194)
119 PF14661 HAUS6_N: HAUS augmin- 33.8 4.2E+02 0.0091 25.2 11.5 35 285-319 174-208 (247)
120 KOG0161 Myosin class II heavy 32.9 6.2E+02 0.013 31.9 13.1 67 272-338 1066-1132(1930)
121 PF04094 DUF390: Protein of un 32.8 2.4E+02 0.0051 32.1 9.0 89 253-341 536-634 (828)
122 PF03653 UPF0093: Uncharacteri 32.8 3.5E+02 0.0076 23.9 9.2 93 78-177 15-116 (147)
123 TIGR00606 rad50 rad50. This fa 32.3 9E+02 0.02 28.5 14.9 63 225-287 797-862 (1311)
124 KOG0976 Rho/Rac1-interacting s 32.3 1.4E+02 0.0031 34.4 7.3 27 281-307 557-583 (1265)
125 KOG4637 Adaptor for phosphoino 32.2 4.9E+02 0.011 27.7 10.6 100 197-304 79-207 (464)
126 PF05557 MAD: Mitotic checkpoi 31.4 16 0.00034 39.6 0.0 90 228-318 313-407 (722)
127 TIGR00606 rad50 rad50. This fa 31.3 5.6E+02 0.012 30.2 12.2 35 208-242 205-239 (1311)
128 PRK02224 chromosome segregatio 31.2 7.5E+02 0.016 27.3 15.6 76 265-340 318-400 (880)
129 TIGR02209 ftsL_broad cell divi 31.1 2.1E+02 0.0046 22.2 6.3 38 281-319 30-67 (85)
130 cd08318 Death_NMPP84 Death dom 30.6 94 0.002 25.1 4.3 39 280-321 31-75 (86)
131 PRK00888 ftsB cell division pr 30.5 1.2E+02 0.0027 25.6 5.2 17 294-310 46-62 (105)
132 PF12128 DUF3584: Protein of u 30.4 9.5E+02 0.021 28.2 13.7 111 210-320 246-369 (1201)
133 PF04799 Fzo_mitofusin: fzo-li 29.9 1.4E+02 0.0029 28.0 5.7 13 308-320 149-161 (171)
134 COG1988 Predicted membrane-bou 29.9 97 0.0021 28.4 4.8 33 29-61 59-92 (190)
135 COG5102 SFT2 Membrane protein 29.9 5E+02 0.011 24.8 11.5 36 101-136 117-152 (201)
136 PF07760 DUF1616: Protein of u 29.8 1.7E+02 0.0036 28.5 6.6 49 120-171 59-110 (287)
137 PF09323 DUF1980: Domain of un 29.6 1.4E+02 0.003 27.0 5.7 44 46-90 4-50 (182)
138 KOG3402 Predicted membrane pro 29.5 36 0.00079 29.1 1.8 32 24-55 40-77 (101)
139 PF07106 TBPIP: Tat binding pr 29.4 2.2E+02 0.0047 25.3 6.8 17 210-226 2-18 (169)
140 PRK15396 murein lipoprotein; P 29.4 1.9E+02 0.0041 23.8 5.8 47 246-301 23-69 (78)
141 PRK04863 mukB cell division pr 29.3 8.7E+02 0.019 29.7 13.4 35 207-242 302-336 (1486)
142 PF05392 COX7B: Cytochrome C o 29.2 47 0.001 27.6 2.4 33 39-71 42-74 (80)
143 TIGR00219 mreC rod shape-deter 29.2 77 0.0017 31.0 4.3 15 294-308 71-85 (283)
144 PF08687 ASD2: Apx/Shroom doma 29.2 4.9E+02 0.011 25.9 9.7 67 255-321 94-172 (264)
145 PF07407 Seadorna_VP6: Seadorn 29.2 74 0.0016 32.9 4.2 30 296-329 39-69 (420)
146 PF10174 Cast: RIM-binding pro 28.8 2.6E+02 0.0057 31.6 8.7 67 262-328 4-71 (775)
147 PF12896 Apc4: Anaphase-promot 28.8 1.1E+02 0.0025 27.5 5.0 49 211-259 29-85 (210)
148 KOG1853 LIS1-interacting prote 28.7 6.4E+02 0.014 25.6 10.4 84 216-322 16-99 (333)
149 PF15022 DUF4522: Protein of u 28.6 50 0.0011 28.9 2.6 30 309-338 71-100 (117)
150 COG3879 Uncharacterized protei 28.5 1.4E+02 0.003 29.4 5.8 43 261-303 57-103 (247)
151 PRK11281 hypothetical protein; 28.5 8.6E+02 0.019 28.8 12.9 59 224-282 91-156 (1113)
152 PF09755 DUF2046: Uncharacteri 28.4 6.2E+02 0.013 25.8 10.4 29 210-238 175-203 (310)
153 PRK13169 DNA replication intia 28.4 2.1E+02 0.0046 24.8 6.3 40 269-308 16-55 (110)
154 PF14142 YrzO: YrzO-like prote 28.3 50 0.0011 24.6 2.1 15 207-221 27-41 (46)
155 KOG2129 Uncharacterized conser 28.3 7.9E+02 0.017 26.6 11.8 98 209-311 197-308 (552)
156 PF02932 Neur_chan_memb: Neuro 28.2 2E+02 0.0044 23.4 6.1 24 295-318 197-220 (237)
157 PF01763 Herpes_UL6: Herpesvir 28.0 70 0.0015 34.7 4.0 39 208-246 372-410 (557)
158 PF10251 PEN-2: Presenilin enh 28.0 41 0.00088 28.6 1.9 21 36-56 53-73 (94)
159 KOG0982 Centrosomal protein Nu 27.9 6.1E+02 0.013 27.4 10.6 46 287-332 302-354 (502)
160 PF08826 DMPK_coil: DMPK coile 27.8 3E+02 0.0065 21.6 6.8 43 265-307 15-57 (61)
161 PF07782 DC_STAMP: DC-STAMP-li 27.6 4.7E+02 0.01 23.8 10.2 31 143-173 143-173 (191)
162 PF08657 DASH_Spc34: DASH comp 27.5 2.2E+02 0.0048 27.9 7.0 66 209-274 177-259 (259)
163 cd09236 V_AnPalA_UmRIM20_like 27.2 6.4E+02 0.014 25.2 13.9 87 253-339 62-163 (353)
164 PF11833 DUF3353: Protein of u 26.8 1E+02 0.0023 28.8 4.5 55 32-86 132-192 (194)
165 PF14931 IFT20: Intraflagellar 26.6 4.4E+02 0.0095 23.1 8.9 88 218-309 18-107 (120)
166 COG4792 EscU Type III secretor 26.4 4E+02 0.0087 27.5 8.7 68 42-115 151-218 (349)
167 PF14584 DUF4446: Protein of u 26.2 1.7E+02 0.0036 26.5 5.5 53 231-284 24-76 (151)
168 PF10654 DUF2481: Protein of u 25.9 65 0.0014 28.7 2.8 34 208-242 9-42 (126)
169 PF11239 DUF3040: Protein of u 25.6 2.2E+02 0.0049 22.6 5.7 43 16-61 20-62 (82)
170 PF04420 CHD5: CHD5-like prote 25.4 1.6E+02 0.0034 26.5 5.2 59 265-339 37-95 (161)
171 COG4372 Uncharacterized protei 25.4 7.2E+02 0.016 26.7 10.5 71 260-330 94-178 (499)
172 KOG0996 Structural maintenance 25.3 1.2E+03 0.027 28.1 13.2 79 260-338 485-584 (1293)
173 KOG0288 WD40 repeat protein Ti 25.1 3.3E+02 0.0071 29.1 8.0 56 249-304 15-77 (459)
174 PF10066 DUF2304: Uncharacteri 25.0 3.9E+02 0.0085 22.5 7.3 64 34-98 24-92 (115)
175 PRK13922 rod shape-determining 25.0 93 0.002 29.5 3.9 24 293-316 73-96 (276)
176 PF11559 ADIP: Afadin- and alp 25.0 4.6E+02 0.0099 22.7 10.7 30 299-328 83-112 (151)
177 PF04728 LPP: Lipoprotein leuc 24.5 2.3E+02 0.0051 22.1 5.3 24 262-285 11-34 (56)
178 PRK11877 psaI photosystem I re 24.4 50 0.0011 24.0 1.5 27 55-81 6-32 (38)
179 PRK10245 adrA diguanylate cycl 24.3 7.1E+02 0.015 24.7 10.3 53 40-97 44-96 (366)
180 PRK11281 hypothetical protein; 24.3 8.5E+02 0.019 28.8 11.9 43 298-340 137-179 (1113)
181 KOG0243 Kinesin-like protein [ 24.2 9.9E+02 0.021 28.3 12.2 120 200-319 435-555 (1041)
182 TIGR02559 HrpB7 type III secre 24.1 5.9E+02 0.013 23.7 9.1 72 245-323 64-141 (158)
183 TIGR01005 eps_transp_fam exopo 24.0 9.6E+02 0.021 26.1 13.3 32 308-339 374-405 (754)
184 KOG4673 Transcription factor T 23.9 1.2E+03 0.025 27.0 12.5 106 212-323 452-564 (961)
185 KOG0933 Structural maintenance 23.8 6.2E+02 0.013 30.1 10.4 52 256-307 810-861 (1174)
186 TIGR03495 phage_LysB phage lys 23.7 3.2E+02 0.0069 24.5 6.8 80 217-303 16-96 (135)
187 PF12325 TMF_TATA_bd: TATA ele 23.6 5E+02 0.011 22.7 11.8 87 217-320 20-106 (120)
188 KOG0980 Actin-binding protein 23.6 1.2E+03 0.025 27.5 12.3 50 221-276 467-516 (980)
189 KOG3973 Uncharacterized conser 23.4 53 0.0011 34.3 2.0 28 234-261 247-274 (465)
190 PF07926 TPR_MLP1_2: TPR/MLP1/ 23.2 4.9E+02 0.011 22.4 10.6 29 304-332 92-120 (132)
191 PF09731 Mitofilin: Mitochondr 22.9 9.1E+02 0.02 25.5 13.2 28 209-236 247-274 (582)
192 PRK05349 Na(+)-translocating N 22.8 3.2E+02 0.0069 28.8 7.5 52 51-102 350-401 (405)
193 PF03980 Nnf1: Nnf1 ; InterPr 22.5 2.3E+02 0.0051 23.3 5.4 38 247-284 65-103 (109)
194 PRK04778 septation ring format 22.5 5.9E+02 0.013 27.1 9.6 21 215-235 319-339 (569)
195 TIGR02209 ftsL_broad cell divi 22.4 3.5E+02 0.0077 20.9 6.1 39 260-300 30-68 (85)
196 KOG4807 F-actin binding protei 22.1 3.4E+02 0.0073 29.1 7.5 62 220-281 470-539 (593)
197 PF10003 DUF2244: Integral mem 22.0 1.8E+02 0.004 25.3 4.9 24 72-95 37-60 (140)
198 PF03245 Phage_lysis: Bacterio 22.0 2.3E+02 0.0051 24.5 5.5 17 281-297 27-43 (125)
199 PRK10929 putative mechanosensi 21.9 1.4E+03 0.03 27.2 15.8 74 211-284 43-139 (1109)
200 PRK12821 aspartyl/glutamyl-tRN 21.7 6.3E+02 0.014 27.3 9.4 121 39-175 218-354 (477)
201 PF11932 DUF3450: Protein of u 21.6 5.1E+02 0.011 24.4 8.1 30 265-294 46-75 (251)
202 PF12911 OppC_N: N-terminal TM 21.6 61 0.0013 23.4 1.6 26 39-64 15-40 (56)
203 PF09679 TraQ: Type-F conjugat 21.4 87 0.0019 26.7 2.6 20 303-322 57-77 (93)
204 PF13815 Dzip-like_N: Iguana/D 21.2 2.5E+02 0.0054 23.8 5.4 53 213-278 66-118 (118)
205 PF10224 DUF2205: Predicted co 21.1 4.5E+02 0.0096 21.7 6.6 48 260-307 14-62 (80)
206 PF10329 DUF2417: Region of un 21.1 1.3E+02 0.0028 29.2 4.1 40 82-131 52-91 (232)
207 KOG0018 Structural maintenance 21.0 7.4E+02 0.016 29.5 10.4 85 251-339 729-815 (1141)
208 PF12329 TMF_DNA_bd: TATA elem 21.0 4.3E+02 0.0094 21.0 8.6 69 251-319 2-70 (74)
209 KOG2302 T-type voltage-gated C 20.9 1.3E+03 0.028 28.2 12.1 133 26-174 1147-1283(1956)
210 TIGR02680 conserved hypothetic 20.8 1.5E+03 0.032 27.2 15.2 61 275-338 875-935 (1353)
211 PF08961 DUF1875: Domain of un 20.7 33 0.00072 33.5 0.0 35 284-318 131-165 (243)
212 PF10234 Cluap1: Clusterin-ass 20.6 3.9E+02 0.0084 26.6 7.3 43 265-307 194-236 (267)
213 PHA02246 hypothetical protein 20.4 1E+02 0.0022 29.0 3.1 83 106-189 51-147 (192)
214 PF06818 Fez1: Fez1; InterPro 20.3 7.8E+02 0.017 23.7 11.2 47 228-274 32-79 (202)
215 KOG0963 Transcription factor/C 20.2 1.2E+03 0.027 26.0 13.6 117 207-323 180-330 (629)
216 PRK11578 macrolide transporter 20.1 8.3E+02 0.018 24.0 9.8 28 262-289 107-134 (370)
217 PF15099 PIRT: Phosphoinositid 20.1 1.6E+02 0.0034 26.6 4.1 40 40-82 55-95 (129)
No 1
>PF14802 TMEM192: TMEM192 family
Probab=100.00 E-value=6.8e-60 Score=439.71 Aligned_cols=194 Identities=34% Similarity=0.468 Sum_probs=180.9
Q ss_pred hhhHHHHHHHHHHHHHHHhhhhhhccccc--cchhHHHHHHHHHHHHHHHHHHHHHHHHhHHHhhhHHHHHHHhhccccc
Q 018721 40 FGSVVYCFVLAGYAILAAGTTWIFHPIHY--LIPPLLCSCGVILLALTGIFQQYFVYQVQKIRLQGYYSFSQKLKHIVRL 117 (351)
Q Consensus 40 ~g~~~y~~~L~~yA~~~~~~pw~~~~~~~--~~~~lL~y~~v~LWlLt~l~d~yv~~qH~KlRl~GYl~FYR~Tr~lkRl 117 (351)
.-+.+.+++.++.+++++++||.+-+..+ ...++++|+||+||++|+++|+|+|+||+|+|++||++|||+|+++||+
T Consensus 25 ~~~~l~ll~~v~l~~~~~vl~~~~~~~~~~C~~y~iily~~v~lW~lt~l~d~y~k~~H~klr~~GY~~fyr~t~~~rr~ 104 (236)
T PF14802_consen 25 PIFSLLLLLSVVLAIVGFVLCWYPPPDEDKCDVYFIILYLHVALWLLTYLFDRYIKHQHQKLRLQGYLDFYRKTKRLRRL 104 (236)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcccCccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHhcc
Confidence 34566677778999999999998887644 3467999999999999999999999999999999999999999999999
Q ss_pred chhhhhHHHHHHHHHHHhhhccc-----------cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhc
Q 018721 118 PFAITAYGTAAMLLVIVWRPHIS-----------ILSISTLLRIIMLIEAICAASFMSVYIGYVHQYNSLNSQPDVMKSL 186 (351)
Q Consensus 118 Pl~IvSlGNa~LLLI~a~~~~~~-----------~Ls~~~lLriil~lElv~~l~~li~YIvkV~rFNk~kp~PDVl~ee 186 (351)
||+|||+||++||++++|++++. +++|..++++++++|++|++||++.||+||+||||+||+|||++++
T Consensus 105 Pl~ivS~gna~LLlv~~~~~~~~~~~~~~~c~~~~ls~~~~l~i~~~lE~~~~~~~~i~Yiv~V~kFN~~~~~PDv~~~~ 184 (236)
T PF14802_consen 105 PLQIVSLGNAVLLLVQAWQHHYFGPDFAEYCSVAPLSPQLYLQILCSLELLVLLPFLIIYIVKVRKFNKARPPPDVLREE 184 (236)
T ss_pred hHHHHHHHHHHHHHHHHHHHhcccccchhhcccccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCchhh
Confidence 99999999999999999998772 4899999999999999999999999999999999999999999999
Q ss_pred cC--CCCCCCCCCCCcccCCCc---hhHhhHHHHHHHHHHhHHhhHHHHHHH
Q 018721 187 YS--PLQPSSSLEGLRYHDGGR---LSDEQMALLQYQRENLHFLSEEILRLQ 233 (351)
Q Consensus 187 ~s--~~~ps~~~~ElGfrd~g~---LlEKQADLIrYLkdHNa~LSkrIL~Lq 233 (351)
++ +.+|+++++|+||+++++ ++|||||||+||||||++||+|||+||
T Consensus 185 ~~~~~~~~~~~~~e~g~r~~~~~eellEkQadlI~yLk~hn~~L~~ril~l~ 236 (236)
T PF14802_consen 185 YSRSYLYPSSSSSELGFRDGSSLEELLEKQADLIRYLKEHNARLSRRILALT 236 (236)
T ss_pred hccccCCCCCCccccCCcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 99 788999999999999887 999999999999999999999999985
No 2
>KOG0999 consensus Microtubule-associated protein Bicaudal-D [Intracellular trafficking, secretion, and vesicular transport]
Probab=94.39 E-value=0.79 Score=49.35 Aligned_cols=121 Identities=31% Similarity=0.405 Sum_probs=89.0
Q ss_pred hHHHHHHHHHhhhhccc-----CCCCChhhhHHHhhhhhhhHHH----hhHHHHHhHHHHHHHHHHHHHHH-------Hh
Q 018721 226 SEEILRLQECLSKYEQS-----DDGSTPQVDLAHLLAARDQELR----TLSAEMNQLQSELRLARSFVLRV-------RN 289 (351)
Q Consensus 226 SkrIL~Lq~~l~kye~~-----~~g~t~qvdl~h~la~r~qelR----a~~Ae~~q~~~el~~ar~li~~~-------r~ 289 (351)
-.+|=.+++.+.+|.+. .||-+---.|-.==|++|++.- .+.+|+.|+..||...+++..++ ..
T Consensus 63 R~Eldqtkeal~q~~s~hkk~~~~g~e~EesLLqESaakE~~yl~kI~eleneLKq~r~el~~~q~E~erl~~~~sd~~e 142 (772)
T KOG0999|consen 63 RTELDQTKEALGQYRSQHKKVARDGEEREESLLQESAAKEEYYLQKILELENELKQLRQELTNVQEENERLEKVHSDLKE 142 (772)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhccchhhHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Confidence 45888899999999876 3444444445444578887543 78889999999998888886444 44
Q ss_pred hhhhhhhHhHHHHHHHhhhhhhHHHHHH---HHHHhhhccHHHHHHHHHhhhcCCCCCccccc
Q 018721 290 TNNQYVEENERLRAILGEWSTRAAKLER---ALEVERMSNIELQKKISTRRNQHGPAESNEHD 349 (351)
Q Consensus 290 ~n~q~~eEn~rlRa~l~Ews~raakle~---ale~er~~~~~~~~~~~~~r~~~~~~~~~~~~ 349 (351)
.|..-..+--|||+.|.|..-|-|+|=- -||.| |+-|||+++.+|+++.+-|.-.|+
T Consensus 143 ~~~~~E~qR~rlr~elKe~KfRE~RllseYSELEEE---NIsLQKqVs~LR~sQVEyEglkhe 202 (772)
T KOG0999|consen 143 SNAAVEDQRRRLRDELKEYKFREARLLSEYSELEEE---NISLQKQVSNLRQSQVEYEGLKHE 202 (772)
T ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh---cchHHHHHHHHhhhhhhhhHHHHH
Confidence 4444455667899999999999998843 46666 778999999999998776655554
No 3
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=93.58 E-value=0.19 Score=54.95 Aligned_cols=101 Identities=31% Similarity=0.388 Sum_probs=74.5
Q ss_pred cCCCCChhhhHHHhhhhhhhHHH----hhHHHHHhHHHHHHHHHHHHHHHHhhhh-------hhhhHhHHHHHHHhhhhh
Q 018721 242 SDDGSTPQVDLAHLLAARDQELR----TLSAEMNQLQSELRLARSFVLRVRNTNN-------QYVEENERLRAILGEWST 310 (351)
Q Consensus 242 ~~~g~t~qvdl~h~la~r~qelR----a~~Ae~~q~~~el~~ar~li~~~r~~n~-------q~~eEn~rlRa~l~Ews~ 310 (351)
+.||-+---+|-.==|+||.++. .+-+|+.|.+.++..+++...|+-..|+ ...-|-.+||+-+.|...
T Consensus 11 ~~~g~~~Ee~Ll~esa~~E~~~~~~i~~l~~elk~~~~~~~~~~~e~~rl~~~~~~~~~~~~~~e~~~~~lr~e~ke~K~ 90 (717)
T PF09730_consen 11 AKDGEEREESLLQESASKEAYLQQRILELENELKQLRQELSNVQAENERLSQLNQELRKECEDLELERKRLREEIKEYKF 90 (717)
T ss_pred HhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45666555556555577887666 7778888888888888888755555555 445556689999999999
Q ss_pred hHHHHHH---HHHHhhhccHHHHHHHHHhhhcCCCCCc
Q 018721 311 RAAKLER---ALEVERMSNIELQKKISTRRNQHGPAES 345 (351)
Q Consensus 311 raakle~---ale~er~~~~~~~~~~~~~r~~~~~~~~ 345 (351)
|-++|=. -||.| |+-|||+++.+|+++.+-|.
T Consensus 91 rE~rll~dyselEeE---NislQKqvs~Lk~sQvefE~ 125 (717)
T PF09730_consen 91 REARLLQDYSELEEE---NISLQKQVSVLKQSQVEFEG 125 (717)
T ss_pred HHHHHhhhhHHHHHH---HHHHHHHHHHHHHhHHHHHH
Confidence 9988853 46776 88899999999998765443
No 4
>PRK09039 hypothetical protein; Validated
Probab=92.31 E-value=1.9 Score=42.99 Aligned_cols=56 Identities=20% Similarity=0.177 Sum_probs=42.2
Q ss_pred HHHHhhhhhhhhH---hHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhcC
Q 018721 285 LRVRNTNNQYVEE---NERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQH 340 (351)
Q Consensus 285 ~~~r~~n~q~~eE---n~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~~ 340 (351)
.++......|-|. -++|++-++....+-|.||.+|++-.-..-|.+.+|.++....
T Consensus 123 ~~L~~~k~~~se~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~~~~~~~i~~L~~~L 181 (343)
T PRK09039 123 QELDSEKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGRRL 181 (343)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455555555554 4678888888888999999998887777788888888886543
No 5
>PF14362 DUF4407: Domain of unknown function (DUF4407)
Probab=91.14 E-value=14 Score=35.50 Aligned_cols=29 Identities=7% Similarity=-0.040 Sum_probs=19.8
Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHhHHHh
Q 018721 73 LLCSCGVILLA-LTGIFQQYFVYQVQKIRL 101 (351)
Q Consensus 73 lL~y~~v~LWl-Lt~l~d~yv~~qH~KlRl 101 (351)
+.+..=.++|. +.+.+|+++.....|.+.
T Consensus 46 ~~ai~~glvwgl~I~~lDR~ivss~~~~~~ 75 (301)
T PF14362_consen 46 WAAIPFGLVWGLVIFNLDRFIVSSIRKSDG 75 (301)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccc
Confidence 44444446775 567799999998776555
No 6
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=90.96 E-value=6.5 Score=38.56 Aligned_cols=122 Identities=17% Similarity=0.208 Sum_probs=74.4
Q ss_pred HHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH--------
Q 018721 213 ALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFV-------- 284 (351)
Q Consensus 213 DLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li-------- 284 (351)
+-+.|+.+....+.+++-..+..+..|++.....+|+-... +=.+.+..+.+++.+++.|+...++-.
T Consensus 170 ~a~~fl~~ql~~~~~~l~~ae~~l~~fr~~~~~~d~~~~~~----~~~~~i~~L~~~l~~~~~~l~~l~~~~~~~~P~v~ 245 (362)
T TIGR01010 170 DTIAFAENEVKEAEQRLNATKAELLKYQIKNKVFDPKAQSS----AQLSLISTLEGELIRVQAQLAQLRSITPEQNPQVP 245 (362)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcChHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHhhCCCCCCchH
Confidence 56678888888888899999999999999866666654321 112234455666666666666655432
Q ss_pred ---HHHHhhhhhhhhHhHHHHHH----HhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 285 ---LRVRNTNNQYVEENERLRAI----LGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 285 ---~~~r~~n~q~~eEn~rlRa~----l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
.++.....|--+|+.++-+. +..-..+-..|++..|.-+-.-..+.+++.+.|-
T Consensus 246 ~l~~~i~~l~~~i~~e~~~i~~~~~~~l~~~~~~~~~L~re~~~a~~~y~~~l~r~~~a~~ 306 (362)
T TIGR01010 246 SLQARIKSLRKQIDEQRNQLSGGLGDSLNEQTADYQRLVLQNELAQQQLKAALTSLQQTRV 306 (362)
T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55666666666666666442 3333344456666666554444444444444443
No 7
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=88.82 E-value=1.9 Score=45.43 Aligned_cols=60 Identities=32% Similarity=0.452 Sum_probs=41.8
Q ss_pred CCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHh
Q 018721 243 DDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVE 322 (351)
Q Consensus 243 ~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~e 322 (351)
-+|+||..- +|+|.|++.+++.|+ +.+..-|.+..+||+|||+-..-=.. +++.|++.|
T Consensus 52 iegDTP~DT-----------lrTlva~~k~~r~~~-------~~l~~~N~~l~~eN~~L~~r~~~id~---~i~~av~~~ 110 (472)
T TIGR03752 52 IEGDTPADT-----------LRTLVAEVKELRKRL-------AKLISENEALKAENERLQKREQSIDQ---QIQQAVQSE 110 (472)
T ss_pred CCCCCccch-----------HHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHhhhhHHH---HHHHHHHhh
Confidence 589999864 577777776666654 56777899999999999985443333 344555554
Q ss_pred h
Q 018721 323 R 323 (351)
Q Consensus 323 r 323 (351)
|
T Consensus 111 ~ 111 (472)
T TIGR03752 111 T 111 (472)
T ss_pred h
Confidence 4
No 8
>PRK09039 hypothetical protein; Validated
Probab=87.99 E-value=14 Score=36.91 Aligned_cols=111 Identities=17% Similarity=0.114 Sum_probs=49.5
Q ss_pred HHHHHHHHHHhHHhhHHHHHHHHHhhhhccc-------CCCCCh-hhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHH
Q 018721 212 MALLQYQRENLHFLSEEILRLQECLSKYEQS-------DDGSTP-QVDLAHLLAARDQELRTLSAEMNQLQSELRLARSF 283 (351)
Q Consensus 212 ADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~-------~~g~t~-qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~l 283 (351)
++++..=+..+.-|..+|-.|+.+++--+.. ..|... .-.+...++.=+++|....++....+.++.+.++.
T Consensus 66 ~e~L~le~~~~~~l~~~l~~l~~~l~~a~~~r~~Le~~~~~~~~~~~~~~~~~~~l~~~L~~~k~~~se~~~~V~~L~~q 145 (343)
T PRK09039 66 ADLLSLERQGNQDLQDSVANLRASLSAAEAERSRLQALLAELAGAGAAAEGRAGELAQELDSEKQVSARALAQVELLNQQ 145 (343)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhcchHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 4455566666777777777776655421111 011100 00111111111345555555555556666666666
Q ss_pred HHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHh
Q 018721 284 VLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVE 322 (351)
Q Consensus 284 i~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~e 322 (351)
|..+|.-..+=.++-+-..+...|-..+-+.|+.+|+.-
T Consensus 146 I~aLr~Qla~le~~L~~ae~~~~~~~~~i~~L~~~L~~a 184 (343)
T PRK09039 146 IAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNVA 184 (343)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 655554422222222222222234444555555555543
No 9
>PRK11519 tyrosine kinase; Provisional
Probab=84.25 E-value=32 Score=37.36 Aligned_cols=40 Identities=13% Similarity=0.182 Sum_probs=31.9
Q ss_pred HHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhh
Q 018721 212 MALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVD 251 (351)
Q Consensus 212 ADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvd 251 (351)
..-+.|+.+....+.+++=..+..+.+|++...-.+++.+
T Consensus 266 ~~a~~fL~~ql~~l~~~L~~aE~~l~~fr~~~~~vd~~~e 305 (719)
T PRK11519 266 SKSLAFLAQQLPEVRSRLDVAENKLNAFRQDKDSVDLPLE 305 (719)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCchHH
Confidence 3678899999999999999999999999987443344443
No 10
>PF14802 TMEM192: TMEM192 family
Probab=84.01 E-value=10 Score=36.46 Aligned_cols=33 Identities=21% Similarity=0.233 Sum_probs=30.1
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhh
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKY 239 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~ky 239 (351)
-.+.+.|++++|.|-.++|.++..+||+++.++
T Consensus 203 ~~~~~eellEkQadlI~yLk~hn~~L~~ril~l 235 (236)
T PF14802_consen 203 DGSSLEELLEKQADLIRYLKEHNARLSRRILAL 235 (236)
T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 445999999999999999999999999998765
No 11
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=82.79 E-value=14 Score=40.39 Aligned_cols=101 Identities=24% Similarity=0.259 Sum_probs=64.7
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLR 286 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~ 286 (351)
-..+....+.-|+++|..|..++.+|+....+++. +++++..|++ .. .+
T Consensus 423 ~i~~~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~---------------------------~l~~~~r~~~---~~-~~ 471 (652)
T COG2433 423 RIKKLEETVERLEEENSELKRELEELKREIEKLES---------------------------ELERFRREVR---DK-VR 471 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------------------HHHHHHHHHH---HH-Hh
Confidence 56777888888899999999888888866544433 2222222222 00 11
Q ss_pred HHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHH-hhhccHHHH------HHHHHhhh
Q 018721 287 VRNTNNQYVEENERLRAILGEWSTRAAKLERALEV-ERMSNIELQ------KKISTRRN 338 (351)
Q Consensus 287 ~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~-er~~~~~~~------~~~~~~r~ 338 (351)
.+..=.+--+++++|+.-|.|-+.+...||+.|+. ++|-.+|++ |.+.+++.
T Consensus 472 ~~rei~~~~~~I~~L~~~L~e~~~~ve~L~~~l~~l~k~~~lE~sG~g~pvk~ve~~t~ 530 (652)
T COG2433 472 KDREIRARDRRIERLEKELEEKKKRVEELERKLAELRKMRKLELSGKGTPVKVVEKLTL 530 (652)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCCCcceehhhhhhH
Confidence 11112234568889999999999999999998875 346666665 44555553
No 12
>PF06008 Laminin_I: Laminin Domain I; InterPro: IPR009254 Laminins are glycoproteins that are major constituents of the basement membrane of cells. Laminins are trimeric molecules; laminin-1 is an alpha1 beta1 gamma1 trimer. It has been suggested that the domains I and II from laminin A, B1 and B2 may come together to form a triple helical coiled-coil structure []. Binding to cells via a high affinity receptor, laminin is thought to mediate the attachment, migration and organisation of cells into tissues during embryonic development by interacting with other extracellular matrix components.; GO: 0005102 receptor binding, 0030155 regulation of cell adhesion, 0030334 regulation of cell migration, 0045995 regulation of embryonic development, 0005606 laminin-1 complex
Probab=82.73 E-value=29 Score=32.91 Aligned_cols=99 Identities=24% Similarity=0.375 Sum_probs=66.2
Q ss_pred HHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhh---HHH--hhHHHHHhHHHHHHHHHHHHHHHHhhhhhh
Q 018721 220 ENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQ---ELR--TLSAEMNQLQSELRLARSFVLRVRNTNNQY 294 (351)
Q Consensus 220 dHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~q---elR--a~~Ae~~q~~~el~~ar~li~~~r~~n~q~ 294 (351)
.-...+...|-.+-+++..|.. .+..+|.-|+...|+-=+. |+| .+......+..|++.|..|+.+|+..=...
T Consensus 94 ~~i~~l~~~i~~l~~~~~~l~~-~~~~~~~~~l~~~l~ea~~mL~emr~r~f~~~~~~Ae~El~~A~~LL~~v~~~~~~~ 172 (264)
T PF06008_consen 94 QFIQNLQDNIQELIEQVESLNE-NGDQLPSEDLQRALAEAQRMLEEMRKRDFTPQRQNAEDELKEAEDLLSRVQKWFQKP 172 (264)
T ss_pred HHHHHHHHHHHHHHHHHHHhCc-ccCCCCHHHHHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHHHhhH
Confidence 3444566677777888888877 5667888899877765544 443 566667777799999999999998863444
Q ss_pred hhHhHHH----HHHHhhhhhhHHHHHHHH
Q 018721 295 VEENERL----RAILGEWSTRAAKLERAL 319 (351)
Q Consensus 295 ~eEn~rl----Ra~l~Ews~raakle~al 319 (351)
.+||+.| +..|.+-+.+-.-|.-+|
T Consensus 173 ~~~~~~l~~~i~~~L~~~~~kL~Dl~~~l 201 (264)
T PF06008_consen 173 QQENESLAEAIRDDLNDYNAKLQDLRDLL 201 (264)
T ss_pred HHhhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555554 555566565554444333
No 13
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=81.49 E-value=40 Score=34.36 Aligned_cols=37 Identities=14% Similarity=0.296 Sum_probs=30.4
Q ss_pred HHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCCh
Q 018721 212 MALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTP 248 (351)
Q Consensus 212 ADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~ 248 (351)
.+.+.|+.+.+..+.+++-..+..+.+|++......|
T Consensus 160 ~~~~~fl~~ql~~~~~~L~~ae~~l~~f~~~~~~~~~ 196 (498)
T TIGR03007 160 DSAQRFIDEQIKTYEKKLEAAENRLKAFKQENGGILP 196 (498)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCcccCc
Confidence 3578899999999999999999999999876444444
No 14
>PRK09841 cryptic autophosphorylating protein tyrosine kinase Etk; Provisional
Probab=81.38 E-value=47 Score=36.17 Aligned_cols=120 Identities=15% Similarity=0.215 Sum_probs=63.3
Q ss_pred HHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH---------
Q 018721 214 LLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFV--------- 284 (351)
Q Consensus 214 LIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li--------- 284 (351)
-++|+.+....+.+++-..+.++.+|++..+-.++..+....+. ++..+.+++.+++.+........
T Consensus 268 a~~fL~~qL~~l~~~L~~aE~~l~~fr~~~~~~d~~~ea~~~l~----~~~~l~~ql~~l~~~~~~l~~~~~~~hP~v~~ 343 (726)
T PRK09841 268 SLEFLQRQLPEVRSELDQAEEKLNVYRQQRDSVDLNLEAKAVLE----QIVNVDNQLNELTFREAEISQLYKKDHPTYRA 343 (726)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhcccCchHHH
Confidence 47899999999999999999999999998554455444433332 22333333333333322222221
Q ss_pred --HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 285 --LRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 285 --~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
.++...+++-.+.+.++ ..+.+.......|||..+.-|--=..|-++..+.+-
T Consensus 344 l~~~~~~L~~~~~~l~~~~-~~~p~~e~~~~~L~R~~~~~~~lY~~lL~r~~e~~i 398 (726)
T PRK09841 344 LLEKRQTLEQERKRLNKRV-SAMPSTQQEVLRLSRDVEAGRAVYLQLLNRQQELSI 398 (726)
T ss_pred HHHHHHHHHHHHHHHHHHH-HhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 22222332222222222 233445555566666666655444444444444443
No 15
>PRK10884 SH3 domain-containing protein; Provisional
Probab=78.34 E-value=20 Score=33.82 Aligned_cols=21 Identities=24% Similarity=0.256 Sum_probs=9.9
Q ss_pred hHhHHHHHHHhhhhhhHHHHH
Q 018721 296 EENERLRAILGEWSTRAAKLE 316 (351)
Q Consensus 296 eEn~rlRa~l~Ews~raakle 316 (351)
+||++|+..|.+=......||
T Consensus 139 ~~n~~L~~~l~~~~~~~~~l~ 159 (206)
T PRK10884 139 EENQKLKNQLIVAQKKVDAAN 159 (206)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 455555555555443333333
No 16
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=77.73 E-value=60 Score=35.13 Aligned_cols=22 Identities=14% Similarity=0.226 Sum_probs=17.4
Q ss_pred HHHHHHHhHHhhHHHHHHHHHh
Q 018721 215 LQYQRENLHFLSEEILRLQECL 236 (351)
Q Consensus 215 IrYLkdHNa~LSkrIL~Lq~~l 236 (351)
.|||...|+.|...|=-|+..+
T Consensus 58 VR~LEaqN~~L~~di~~lr~~~ 79 (546)
T KOG0977|consen 58 VRFLEAQNRKLEHDINLLRGVV 79 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHhhc
Confidence 5788888998888887777666
No 17
>PF04859 DUF641: Plant protein of unknown function (DUF641); InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=77.65 E-value=11 Score=33.60 Aligned_cols=79 Identities=30% Similarity=0.388 Sum_probs=45.9
Q ss_pred HHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 228 EILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 228 rIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
++=+|++-...|.+...+..|++-. +++ +.+..|+.++.=...+.++....+..-.|-.+||.-|+|
T Consensus 53 EL~~Ls~LK~~y~~~~~~~~~~~~~---l~a----------~~~e~qsli~~yE~~~~kLe~e~~~Kdsei~~Lr~~L~~ 119 (131)
T PF04859_consen 53 ELRRLSELKRRYRKKQSDPSPQVAR---LAA----------EIQEQQSLIKTYEIVVKKLEAELRAKDSEIDRLREKLDE 119 (131)
T ss_pred HHHHHHHHHHHHHcCCCCCCccccc---ccc----------chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555566677775555566532 222 333333333333333344444444555667888899999
Q ss_pred hhhhHHHHHHHH
Q 018721 308 WSTRAAKLERAL 319 (351)
Q Consensus 308 ws~raakle~al 319 (351)
=...+.+||.+|
T Consensus 120 ~~~~n~~Lekrl 131 (131)
T PF04859_consen 120 LNRANKSLEKRL 131 (131)
T ss_pred HHHHHHHhhccC
Confidence 999999998764
No 18
>PF15070 GOLGA2L5: Putative golgin subfamily A member 2-like protein 5
Probab=77.48 E-value=21 Score=38.79 Aligned_cols=118 Identities=22% Similarity=0.235 Sum_probs=65.9
Q ss_pred HHHHhHHhhHHHHHHHHHhhhhcccCCCCChhh-hHHHhhhhh--------hhHHHhhHHHHH-hHHHHHHHHHHHHHHH
Q 018721 218 QRENLHFLSEEILRLQECLSKYEQSDDGSTPQV-DLAHLLAAR--------DQELRTLSAEMN-QLQSELRLARSFVLRV 287 (351)
Q Consensus 218 LkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qv-dl~h~la~r--------~qelRa~~Ae~~-q~~~el~~ar~li~~~ 287 (351)
++..++.+.+|+=.|++++..-....+-...+| .|.+.|+-= ..+--+-..|.. +++.|++.-+..+..+
T Consensus 20 lk~e~a~~qqr~~qmseev~~L~eEk~~~~~~V~eLE~sL~eLk~q~~~~~~~~~pa~pse~E~~Lq~E~~~L~kElE~L 99 (617)
T PF15070_consen 20 LKEESAQWQQRMQQMSEEVRTLKEEKEHDISRVQELERSLSELKNQMAEPPPPEPPAGPSEVEQQLQAEAEHLRKELESL 99 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccCCccccccchHHHHHHHHHHHHHHHHHHHH
Confidence 455555555555555555544444433333344 333322210 001112222333 5677777767777667
Q ss_pred HhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHH
Q 018721 288 RNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKIST 335 (351)
Q Consensus 288 r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~ 335 (351)
..-....+++|++|=....|-..|-+-||+.|+.-.-...|..|-+.+
T Consensus 100 ~~qlqaqv~~ne~Ls~L~~EqEerL~ELE~~le~~~e~~~D~~kLLe~ 147 (617)
T PF15070_consen 100 EEQLQAQVENNEQLSRLNQEQEERLAELEEELERLQEQQEDRQKLLEQ 147 (617)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 766667788888887777788888888888877655544444444443
No 19
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=76.29 E-value=74 Score=34.46 Aligned_cols=79 Identities=24% Similarity=0.343 Sum_probs=55.8
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHH-----------HHHHhhhhhhhhHhHHHHH-------HHhhhhhhHHHHHHHHHHhh
Q 018721 262 ELRTLSAEMNQLQSELRLARSFV-----------LRVRNTNNQYVEENERLRA-------ILGEWSTRAAKLERALEVER 323 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li-----------~~~r~~n~q~~eEn~rlRa-------~l~Ews~raakle~ale~er 323 (351)
||+++.+-.|...+||..||-.- ..+|..+-|+..|-+.|.- -+.+=+.+--++|..|-.||
T Consensus 312 EL~~~~~~RDrt~aeLh~aRLe~aql~~qLad~~l~lke~~~q~~qEk~~l~~~~e~~k~~ie~L~~el~~~e~~lqEer 391 (546)
T PF07888_consen 312 ELSDAVNVRDRTMAELHQARLEAAQLKLQLADASLELKEGRSQWAQEKQALQHSAEADKDEIEKLSRELQMLEEHLQEER 391 (546)
T ss_pred HHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 88888888899999999988543 4566666666666655543 34455555566667777788
Q ss_pred hccHHHHHHHHHhhhcC
Q 018721 324 MSNIELQKKISTRRNQH 340 (351)
Q Consensus 324 ~~~~~~~~~~~~~r~~~ 340 (351)
+.+.-|++++.+.+.+.
T Consensus 392 ~E~qkL~~ql~ke~D~n 408 (546)
T PF07888_consen 392 MERQKLEKQLGKEKDCN 408 (546)
T ss_pred HHHHHHHHHHHHhhhhh
Confidence 88888888887665543
No 20
>PF15035 Rootletin: Ciliary rootlet component, centrosome cohesion
Probab=73.83 E-value=76 Score=29.41 Aligned_cols=123 Identities=21% Similarity=0.269 Sum_probs=61.0
Q ss_pred hhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccC-----CCCC--hhhhHHHhhhhhhhHHH---hhHHHHHhHHHHHHH
Q 018721 210 EQMALLQYQRENLHFLSEEILRLQECLSKYEQSD-----DGST--PQVDLAHLLAARDQELR---TLSAEMNQLQSELRL 279 (351)
Q Consensus 210 KQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~-----~g~t--~qvdl~h~la~r~qelR---a~~Ae~~q~~~el~~ 279 (351)
+||+|+.-|+.=+...=+++=.|..+++.-...+ .++. +-.||...|..=|.|-+ .|+.--+.+..
T Consensus 13 ~qa~Lv~~LQ~KV~qYr~rc~ele~~l~~~~~l~~~~~~~~~~~e~s~dLe~~l~rLeEEqqR~~~L~qvN~lLRe---- 88 (182)
T PF15035_consen 13 RQAQLVQRLQAKVLQYRKRCAELEQQLSASQVLESPSQRRRSEEEHSPDLEEALIRLEEEQQRSEELAQVNALLRE---- 88 (182)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcCcccccccccccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHH----
Confidence 5777777766666666666666666663221100 1111 12455544433333332 22222222222
Q ss_pred HHHHHHHHHhhhhhhhhHhHH-------HHHHHh----hhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhc
Q 018721 280 ARSFVLRVRNTNNQYVEENER-------LRAILG----EWSTRAAKLERALEVERMSNIELQKKISTRRNQ 339 (351)
Q Consensus 280 ar~li~~~r~~n~q~~eEn~r-------lRa~l~----Ews~raakle~ale~er~~~~~~~~~~~~~r~~ 339 (351)
-+.+.+..|++--+|+++ +|..|. +|-.+-.++..-+..|--.=+.|=+++..+|.+
T Consensus 89 ---QLEq~~~~N~~L~~dl~klt~~~~~l~~eL~~ke~~~~~ee~~~~~y~~~eh~rll~LWr~v~~lRr~ 156 (182)
T PF15035_consen 89 ---QLEQARKANEALQEDLQKLTQDWERLRDELEQKEAEWREEEENFNQYLSSEHSRLLSLWREVVALRRQ 156 (182)
T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccHHHHHHHHHHHHHHH
Confidence 235555666666666665 444454 566666666666666644445555555555544
No 21
>COG3074 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=72.80 E-value=11 Score=30.83 Aligned_cols=46 Identities=37% Similarity=0.492 Sum_probs=36.2
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhH----hHHHHHHHhh
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEE----NERLRAILGE 307 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eE----n~rlRa~l~E 307 (351)
|+-.+..+-|+++.|..-|+.-...++..|+|--+| .+|||+.||.
T Consensus 26 EieELKEknn~l~~e~q~~q~~reaL~~eneqlk~e~~~WQerlrsLLGk 75 (79)
T COG3074 26 EIEELKEKNNSLSQEVQNAQHQREALERENEQLKEEQNGWQERLRALLGK 75 (79)
T ss_pred HHHHHHHHhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 555667777888888888887777788888887777 5899999874
No 22
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=72.75 E-value=50 Score=28.87 Aligned_cols=59 Identities=25% Similarity=0.312 Sum_probs=37.8
Q ss_pred hHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhh---hhhhHhHHHHHHHhhhh
Q 018721 251 DLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNN---QYVEENERLRAILGEWS 309 (351)
Q Consensus 251 dl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~---q~~eEn~rlRa~l~Ews 309 (351)
-+..-+..+|.|+-++..|+..+..+=..++.+|.++-..|+ ....+.+-|++-+.+-.
T Consensus 20 ~L~s~lr~~E~E~~~l~~el~~l~~~r~~l~~Eiv~l~~~~e~~~~~~~~~~~L~~el~~l~ 81 (120)
T PF12325_consen 20 RLQSQLRRLEGELASLQEELARLEAERDELREEIVKLMEENEELRALKKEVEELEQELEELQ 81 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445566677888888888888888888888888866655553 33344444444444433
No 23
>PF13870 DUF4201: Domain of unknown function (DUF4201)
Probab=72.70 E-value=46 Score=29.72 Aligned_cols=26 Identities=31% Similarity=0.314 Sum_probs=15.6
Q ss_pred hhhHhHHHHHHHhhhhhhHHHHHHHH
Q 018721 294 YVEENERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 294 ~~eEn~rlRa~l~Ews~raakle~al 319 (351)
.+++.+.||..+.+|..+-+.+|.++
T Consensus 150 ~~~~~~~l~~~i~~l~rk~~~l~~~i 175 (177)
T PF13870_consen 150 TKEEVEELRKEIKELERKVEILEMRI 175 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 44555666666666666666665544
No 24
>PF05622 HOOK: HOOK protein; InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=72.55 E-value=7.8 Score=41.95 Aligned_cols=122 Identities=26% Similarity=0.350 Sum_probs=0.0
Q ss_pred HHHHhHHhhHHHHHHHHHhhhhcccCCCCChh--h---hHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhh
Q 018721 218 QRENLHFLSEEILRLQECLSKYEQSDDGSTPQ--V---DLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNN 292 (351)
Q Consensus 218 LkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~q--v---dl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~ 292 (351)
|++-...|..+.=.|+..++..+.+.+|.... + |+-.-+..=.+|+..+.+.++....++......|..+|..|+
T Consensus 205 L~~e~~~L~~e~~~l~~~~~~~~~~~~~~~~~~~~~~~~l~~ql~~L~~el~~~e~~~~d~~~~~e~le~ei~~L~q~~~ 284 (713)
T PF05622_consen 205 LQEEKESLQSENEELQERLSQLEGSSEEPSQHLSVELADLRAQLRRLREELERLEEQRDDLKIELEELEKEIDELRQENE 284 (713)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHhHHHhhhhhhhhhcccCCCCCCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444445555555666665555222222111 1 111112221335555555555566666566666778888887
Q ss_pred hhh---hHhHHHHHHHhhhhhhHHH---HHHHHHH--hhhcc-HHHHHHHHHhhhc
Q 018721 293 QYV---EENERLRAILGEWSTRAAK---LERALEV--ERMSN-IELQKKISTRRNQ 339 (351)
Q Consensus 293 q~~---eEn~rlRa~l~Ews~raak---le~ale~--er~~~-~~~~~~~~~~r~~ 339 (351)
... +|...||.-||+|..+|.| +|..+|. +++.. -++++++..|+.+
T Consensus 285 eL~~~A~~a~~LrDElD~lR~~a~r~~klE~~ve~YKkKLed~~~lk~qvk~Lee~ 340 (713)
T PF05622_consen 285 ELQAEAREARALRDELDELREKADRADKLENEVEKYKKKLEDLEDLKRQVKELEED 340 (713)
T ss_dssp --------------------------------------------------------
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 665 6778899999999886665 5555543 23332 2455566666553
No 25
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=71.51 E-value=29 Score=36.77 Aligned_cols=68 Identities=12% Similarity=0.143 Sum_probs=39.7
Q ss_pred HhHHHHHHHHHHHH---HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 271 NQLQSELRLARSFV---LRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 271 ~q~~~el~~ar~li---~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
.++.+++..++..+ ........+..+|-+.+|+.+.+=..+.+.+|..||.+|...-|..+.+.+.+.
T Consensus 39 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~le~~~~~~~ek~~~l~~~~~ 109 (475)
T PRK10361 39 EEMVAELSAAKQQITQSEHWRAECELLNNEVRSLQSINTSLEADLREVTTRMEAAQQHADDKIRQMINSEQ 109 (475)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444444444443 223333344555556666666666677777888888888777776666655554
No 26
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=70.02 E-value=1.4e+02 Score=32.74 Aligned_cols=22 Identities=18% Similarity=0.229 Sum_probs=8.9
Q ss_pred hHHHHHHHhhhhhhHHHHHHHH
Q 018721 298 NERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 298 n~rlRa~l~Ews~raakle~al 319 (351)
-+.++..+.++..+...++..+
T Consensus 882 ~~~l~~~~~~~~~~~~~~~~~~ 903 (1179)
T TIGR02168 882 RASLEEALALLRSELEELSEEL 903 (1179)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444443333
No 27
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=69.64 E-value=1.1e+02 Score=29.30 Aligned_cols=68 Identities=18% Similarity=0.173 Sum_probs=32.8
Q ss_pred HHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHH
Q 018721 215 LQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRV 287 (351)
Q Consensus 215 IrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~ 287 (351)
.++|.+.|..|..+|-.+.... ...++........-++.=.+.+-.++.|..+++.|+..++..+..+
T Consensus 20 Vr~LE~~N~~Le~~i~~~~~~~-----~~~~~~~~~~ye~el~~lr~~id~~~~eka~l~~e~~~l~~e~~~~ 87 (312)
T PF00038_consen 20 VRFLEQENKRLESEIEELREKK-----GEEVSRIKEMYEEELRELRRQIDDLSKEKARLELEIDNLKEELEDL 87 (312)
T ss_dssp HHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHhhhhHHHHHHHHhcc-----cccCcccccchhhHHHHhHHhhhhHHHHhhHHhhhhhhHHHHHHHH
Confidence 5778888888887776665543 0011111111111111112344466777777777777777666433
No 28
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=69.42 E-value=77 Score=34.31 Aligned_cols=35 Identities=11% Similarity=0.086 Sum_probs=28.9
Q ss_pred HHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCCh
Q 018721 214 LLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTP 248 (351)
Q Consensus 214 LIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~ 248 (351)
-++||.+....+.+++-.-..++.+|++..+..++
T Consensus 195 a~~~L~~ql~~l~~~l~~aE~~l~~fk~~~~l~~~ 229 (754)
T TIGR01005 195 AADFLAPEIADLSKQSRDAEAEVAAYRAQSDLLMG 229 (754)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccc
Confidence 47899999999999999999999999987544433
No 29
>KOG0249 consensus LAR-interacting protein and related proteins [General function prediction only]
Probab=69.13 E-value=54 Score=36.95 Aligned_cols=94 Identities=20% Similarity=0.216 Sum_probs=64.4
Q ss_pred HHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhh
Q 018721 214 LLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQ 293 (351)
Q Consensus 214 LIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q 293 (351)
-++.+...+..++.++.+....+..-+-...|-..+|| ..|+---+|.-+.-.+.|-|-+|++-.+.-+...+-..++
T Consensus 164 r~~kl~~~~qe~naeL~rarqreemneeh~~rlsdtvd--ErlqlhlkermaAle~kn~L~~e~~s~kk~l~~~~~~k~r 241 (916)
T KOG0249|consen 164 RTRKLEEQLEELNAELQRARQREKMNEEHNKRLSDTVD--ERLQLHLKERMAALEDKNRLEQELESVKKQLEEMRHDKDK 241 (916)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35677788888888888888777333333333356666 4444444566666667777777777777777777777778
Q ss_pred hhhHhHHHHHHHhhhh
Q 018721 294 YVEENERLRAILGEWS 309 (351)
Q Consensus 294 ~~eEn~rlRa~l~Ews 309 (351)
...++++||..++.-.
T Consensus 242 l~~d~E~Lr~e~~qL~ 257 (916)
T KOG0249|consen 242 LRTDIEDLRGELDQLR 257 (916)
T ss_pred HhhhHHHHHHHHHHHH
Confidence 8888888888777655
No 30
>KOG0249 consensus LAR-interacting protein and related proteins [General function prediction only]
Probab=66.89 E-value=62 Score=36.48 Aligned_cols=75 Identities=24% Similarity=0.243 Sum_probs=55.2
Q ss_pred hHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhh------hHHHhhhhhhh------HHHhhHHHHHhHHH
Q 018721 208 SDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQV------DLAHLLAARDQ------ELRTLSAEMNQLQS 275 (351)
Q Consensus 208 lEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qv------dl~h~la~r~q------elRa~~Ae~~q~~~ 275 (351)
++.|-..++-+.+-|+.|.+++=.....+.-=-+ .|.-|-| |++.+.++++. -+|.+-.+.+.+.+
T Consensus 100 Lankda~lrq~eekn~slqerLelaE~~l~qs~r--ae~lpeveael~qr~~al~~aee~~~~~eer~~kl~~~~qe~na 177 (916)
T KOG0249|consen 100 LANKDADLRQNEEKNRSLQERLELAEPKLQQSLR--AETLPEVEAELAQRNAALTKAEEHSGNIEERTRKLEEQLEELNA 177 (916)
T ss_pred HhCcchhhchhHHhhhhhhHHHHHhhHhhHhHHh--hhhhhhhHHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHH
Confidence 4445556677888899999988877777754444 5565666 77888888744 45678888888889
Q ss_pred HHHHHHHHH
Q 018721 276 ELRLARSFV 284 (351)
Q Consensus 276 el~~ar~li 284 (351)
||..||...
T Consensus 178 eL~rarqre 186 (916)
T KOG0249|consen 178 ELQRARQRE 186 (916)
T ss_pred HHHHHHHHH
Confidence 999998765
No 31
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=66.25 E-value=12 Score=36.37 Aligned_cols=42 Identities=26% Similarity=0.310 Sum_probs=22.6
Q ss_pred HHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 263 LRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 263 lRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
++.+.+|-.+|+.|+...+.. +...++.+.+||+|||+.|+-
T Consensus 68 ~~~l~~EN~~Lr~e~~~l~~~---~~~~~~~l~~EN~rLr~LL~~ 109 (283)
T TIGR00219 68 VNNLEYENYKLRQELLKKNQQ---LEILTQNLKQENVRLRELLNS 109 (283)
T ss_pred HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHhcC
Confidence 334444444444444333222 222233378999999998864
No 32
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=65.25 E-value=1.4e+02 Score=29.11 Aligned_cols=32 Identities=16% Similarity=0.260 Sum_probs=24.4
Q ss_pred hhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHH
Q 018721 256 LAARDQELRTLSAEMNQLQSELRLARSFVLRV 287 (351)
Q Consensus 256 la~r~qelRa~~Ae~~q~~~el~~ar~li~~~ 287 (351)
-++++.|+++|.-|++.++......+..|.++
T Consensus 84 ~v~~~~e~~aL~~E~~~ak~r~~~le~el~~l 115 (239)
T COG1579 84 AVKDERELRALNIEIQIAKERINSLEDELAEL 115 (239)
T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34456788888888888888888888887444
No 33
>KOG0250 consensus DNA repair protein RAD18 (SMC family protein) [Replication, recombination and repair]
Probab=62.78 E-value=3.1e+02 Score=32.26 Aligned_cols=109 Identities=17% Similarity=0.121 Sum_probs=64.5
Q ss_pred HHHHHHHhcCCCCCCchhhhccCCC----CCCC---CCCCCcccCC--CchhHhhHHHHHHHHHHhHHhhHHHHHHHHHh
Q 018721 166 YIGYVHQYNSLNSQPDVMKSLYSPL----QPSS---SLEGLRYHDG--GRLSDEQMALLQYQRENLHFLSEEILRLQECL 236 (351)
Q Consensus 166 YIvkV~rFNk~kp~PDVl~ee~s~~----~ps~---~~~ElGfrd~--g~LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l 236 (351)
=+.+++++|+ |||-|..-.-... ++++ ..+..|.+.+ +.+-.-=.|-|++|+.-...|-++++.++.++
T Consensus 607 Ea~~~m~s~~--~p~n~~~aytldg~~~~~~g~~~~~ySt~~~~~r~~~~~~~s~d~~ie~le~e~~~l~~~~~~l~~~~ 684 (1074)
T KOG0250|consen 607 EAREFMQSDK--PPANVTKAYTLDGRQIFAGGPNYRVYSTRGTRARRPGVDEFSFDDEIEDLEREASRLQKEILELENQR 684 (1074)
T ss_pred HHHHHHhcCC--CCccceeeeccCccccccCCCCcceeccCCCCCCCccccchhHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3457888887 7776655432211 1111 0112222222 33333334789999999999999999999999
Q ss_pred hhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHh
Q 018721 237 SKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRN 289 (351)
Q Consensus 237 ~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~ 289 (351)
++++..-++ -+..+|-++-..-|....++..|..+.+++.
T Consensus 685 ~~~e~~l~e-------------~~~~~~~l~~~~~q~~~~~~~~~~em~el~n 724 (1074)
T KOG0250|consen 685 REAEKNLEE-------------LEKKLRELSEHIEQIKRRIRKKRAEMTELKN 724 (1074)
T ss_pred HHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 999987333 3444555555555555555555555555544
No 34
>PLN03188 kinesin-12 family protein; Provisional
Probab=62.55 E-value=68 Score=38.05 Aligned_cols=31 Identities=29% Similarity=0.325 Sum_probs=27.9
Q ss_pred HHHHHhhhhhhHHHHHHHHHHhhhccHHHHH
Q 018721 301 LRAILGEWSTRAAKLERALEVERMSNIELQK 331 (351)
Q Consensus 301 lRa~l~Ews~raakle~ale~er~~~~~~~~ 331 (351)
||..|+-|.++|.||+.-||.|+.-.-||..
T Consensus 1070 lr~eles~r~l~Ekl~~EL~~eK~c~eel~~ 1100 (1320)
T PLN03188 1070 LRTELDASRALAEKQKHELDTEKRCAEELKE 1100 (1320)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 6888999999999999999999998877754
No 35
>TIGR03495 phage_LysB phage lysis regulatory protein, LysB family. Members of this protein family are phage lysis regulatory protein, including the well-studied protein LysB (lysis protein B) of Enterobacteria phage P2. For members of this family, genes are found in phage or in prophage regions of bacterial genomes, typically near a phage lysozyme or phage holin.
Probab=61.27 E-value=86 Score=28.09 Aligned_cols=73 Identities=19% Similarity=0.227 Sum_probs=50.1
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHH----HHHH---hhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHH
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFV----LRVR---NTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKK 332 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li----~~~r---~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~ 332 (351)
-.+++.+.+++++.+.+++..+.-| .+|- .....-.++..+||..++-=+.-++.=|..+|...--|-+||+=
T Consensus 18 ~~~~~~l~~~~~~a~~~~~~~~~~l~~~~~qL~~l~~~a~~~~~~Q~~Lr~~~~~~~~~l~~re~~i~rL~~ENe~lR~W 97 (135)
T TIGR03495 18 SQRLRNARADLERANRVLKAQQAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARALLAQREQRIERLKRENEDLRRW 97 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCHHHHHH
Confidence 3566777777777777777777766 2222 22223345667899888888888888888888777777777653
No 36
>PF09730 BicD: Microtubule-associated protein Bicaudal-D; InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=60.52 E-value=1.5e+02 Score=33.15 Aligned_cols=76 Identities=20% Similarity=0.309 Sum_probs=55.4
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
+...+..-.|.-+++.||+..|+.+..... .|-+|-+++++.+.+-..+-..+|......+=.-.+|++++..++.
T Consensus 358 e~Ky~vav~Ev~~Lk~ELk~Lk~k~~~~~~---~~~~ek~~~~~e~q~L~ekl~~lek~~re~qeri~~LE~ELr~l~~ 433 (717)
T PF09730_consen 358 ECKYKVAVSEVIQLKAELKALKSKYNELEE---RYKQEKDRLESEVQNLKEKLMSLEKSSREDQERISELEKELRALSK 433 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHH
Confidence 445566777888888888888887755555 7888888999888888888888888665554444556666666554
No 37
>PF02050 FliJ: Flagellar FliJ protein; InterPro: IPR012823 Many flagellar proteins are exported by a flagellum-specific export pathway. Attempts have been made to characterise the apparatus responsible for this process, by designing assays to screen for mutants with export defects []. Experiments involving filament removal from temperature-sensitive flagellar mutants of Salmonella typhimurium have shown that, while most mutants were able to regrow filaments, flhA, fliH, fliI and fliN mutants showed no or greatly reduced regrowth. This suggests that the corresponding gene products are involved in the process of flagellum-specific export. The sequences of fliH, fliI and the adjacent gene, fliJ, have been deduced. FliJ was shown to encode a protein of molecular mass 17,302 Da []. It is a membrane-associated protein that affects chemotactic events, mutations in FliJ result in failure to respond to chemotactic stimuli.; GO: 0003774 motor activity, 0001539 ciliary or flagellar motility, 0006935 chemotaxis, 0009288 bacterial-type flagellum, 0016020 membrane, 0044461 bacterial-type flagellum part; PDB: 3AJW_A.
Probab=59.84 E-value=82 Score=24.62 Aligned_cols=66 Identities=23% Similarity=0.336 Sum_probs=33.9
Q ss_pred HHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhh------hhHHHhhHHHHHhHHHHHHHHHHHH
Q 018721 219 RENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAAR------DQELRTLSAEMNQLQSELRLARSFV 284 (351)
Q Consensus 219 kdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r------~qelRa~~Ae~~q~~~el~~ar~li 284 (351)
.+.......+|-.|+.....|...-.+...+|++..+...+ ++.......+++.++.++..+|..+
T Consensus 11 ~~~~~~~~~~l~~L~~~~~~~~~~~~~~~~~~s~~~~~~~~~~~~~l~~~i~~~~~~~~~~~~~~~~~r~~l 82 (123)
T PF02050_consen 11 QQELQEAEEQLEQLQQERQEYQEQLSESQQGVSVAQLRNYQRYISALEQAIQQQQQELERLEQEVEQAREEL 82 (123)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHT-----SGGGHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34455566777888888888865544443356555444433 3334444444555555555554443
No 38
>PF03268 DUF267: Caenorhabditis protein of unknown function, DUF267; InterPro: IPR004950 This family of proteins, from Caenorhabditis species, have not been characterised though a number are annotated as 'serpentine receptor, class r' proteins.
Probab=59.67 E-value=1.1e+02 Score=31.61 Aligned_cols=255 Identities=15% Similarity=0.118 Sum_probs=131.7
Q ss_pred CcccCCCCcccchhhHHHHHHHHHHHHHHHhhhhhhccccccchh----------------HHHHHHHHHHHHHHHHHHH
Q 018721 28 APLLGHRKSHSIFGSVVYCFVLAGYAILAAGTTWIFHPIHYLIPP----------------LLCSCGVILLALTGIFQQY 91 (351)
Q Consensus 28 ~p~~~~r~~~~~~g~~~y~~~L~~yA~~~~~~pw~~~~~~~~~~~----------------lL~y~~v~LWlLt~l~d~y 91 (351)
.||-++.+. ++. +++=+++.+..-++.+..-|+|--+.+...| +.|..-+.-|-=.+.+.++
T Consensus 12 s~ldCs~~~-~~~-~~~t~~~ai~ii~~~f~r~~~l~~~~g~~lSf~WAEsn~fgF~~~~s~~c~~cl~~wT~~~fi~~f 89 (353)
T PF03268_consen 12 SGLDCSAKA-KIR-GIFTRLIAIIIIALIFRRCWMLMQIEGKSLSFGWAESNMFGFMAMQSFVCAICLFGWTKNGFIPKF 89 (353)
T ss_pred CCcCcCccc-chH-hHHHHHHHHHHHHHHHHHHHHHHhcCCceeeeehhhcchhHHHHHHHHHHHHHHHHHhhcccHHHH
Confidence 466665543 433 3444556666666667777877766655543 4666677789999999999
Q ss_pred HHH--HHhHHHhhhHHH--HHHHhhc---ccccchhhhhHHHHHHHHHHHh-hhccc-cccHHHHHH-HHHHHHHHHHHH
Q 018721 92 FVY--QVQKIRLQGYYS--FSQKLKH---IVRLPFAITAYGTAAMLLVIVW-RPHIS-ILSISTLLR-IIMLIEAICAAS 161 (351)
Q Consensus 92 v~~--qH~KlRl~GYl~--FYR~Tr~---lkRlPl~IvSlGNa~LLLI~a~-~~~~~-~Ls~~~lLr-iil~lElv~~l~ 161 (351)
.+. +.+++|...+.+ =|++.+. +-.+|-.++-.++++.+.+.-- ...-. .-++.+++- ++..+=..++..
T Consensus 90 ~~~L~~lR~LRv~~n~~~D~Y~~lh~kafi~s~pw~v~~~s~aiy~~~~~ki~~~g~~~~~~~~~~~~~i~~l~~~is~i 169 (353)
T PF03268_consen 90 EKKLARLRTLRVEPNQEIDDYRILHRKAFIFSIPWFVAFMSTAIYNAVHGKIIYGGAETSSWYYILDPFINFLCWYISFI 169 (353)
T ss_pred HHHHHHHHHHhCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhceEEeCCCCcccchhhHHHHHHHHHHHHHHH
Confidence 998 777888777654 2333332 2234555555555555555431 11111 112222221 222223333555
Q ss_pred HHHHHHH-------HHHHhcCCCCCCchhhhccCCCCCCCCCCCCcccCCCchhHhhHHHHHHHHHHhHHhhH-----HH
Q 018721 162 FMSVYIG-------YVHQYNSLNSQPDVMKSLYSPLQPSSSLEGLRYHDGGRLSDEQMALLQYQRENLHFLSE-----EI 229 (351)
Q Consensus 162 ~li~YIv-------kV~rFNk~kp~PDVl~ee~s~~~ps~~~~ElGfrd~g~LlEKQADLIrYLkdHNa~LSk-----rI 229 (351)
|+.+|.- -+..||.+=-. +.++.... .....+ .....|.+|+++-+-=|..||. =+
T Consensus 170 ~L~~y~lv~~al~REi~yFN~ELe~--A~keK~L~--n~~vL~--------~F~~RQ~eL~~lv~~~ne~L~~f~~~aPl 237 (353)
T PF03268_consen 170 CLAIYFLVNSALNREIEYFNEELEK--ASKEKKLK--NPQVLE--------KFSHRQIELFELVNFANESLSSFMTFAPL 237 (353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHhcccc--ChHHHH--------HHhHHHHHHHHHHHHHHHhhhhhhhHHHH
Confidence 6666654 26778862110 11111110 011111 2777899999999988888887 34
Q ss_pred HHHHHHhh-hhcccCCCC-Chhh-------hHHHhhhhhhhHHHhhHHHHHhHHHHH-HHHHHHH--HHHHhhhhhhhhH
Q 018721 230 LRLQECLS-KYEQSDDGS-TPQV-------DLAHLLAARDQELRTLSAEMNQLQSEL-RLARSFV--LRVRNTNNQYVEE 297 (351)
Q Consensus 230 L~Lq~~l~-kye~~~~g~-t~qv-------dl~h~la~r~qelRa~~Ae~~q~~~el-~~ar~li--~~~r~~n~q~~eE 297 (351)
..+-..++ =|=.++-++ +|-+ .+..-+.--=--|+-.++= |..+ +.+|=|+ ..+...+|.-|=+
T Consensus 238 f~f~a~iN~~Yi~s~F~~~~p~~y~i~li~~l~a~i~~t~~~L~P~a~V----Qe~l~~Ts~ILmn~~efe~s~Dp~vyq 313 (353)
T PF03268_consen 238 FCFYALINAVYIVSSFFSSVPLLYFICLIFNLIAIIFITFFLLYPAANV----QEHLQHTSRILMNSDEFECSKDPQVYQ 313 (353)
T ss_pred HHHHHHHHHHHhhhhhccCCchHHHHHHHHHHHHHHHHHHHHhCchHHH----HHHHHHHHHHHhCChhhhcCCCHHHHH
Confidence 44444443 244442222 3433 2233333233344433332 2222 2344444 6666666665555
Q ss_pred hHH
Q 018721 298 NER 300 (351)
Q Consensus 298 n~r 300 (351)
+-|
T Consensus 314 tYr 316 (353)
T PF03268_consen 314 TYR 316 (353)
T ss_pred HHH
Confidence 544
No 39
>KOG1962 consensus B-cell receptor-associated protein and related proteins [Defense mechanisms]
Probab=59.45 E-value=1.7e+02 Score=28.22 Aligned_cols=106 Identities=19% Similarity=0.220 Sum_probs=60.9
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLR 286 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~ 286 (351)
....|..+--|..+.|.++|==+|=|.-++.++-+- =.+-..+|+. .+.+.|-..++ ..
T Consensus 84 ~a~~~~~~~l~raqrn~YisGf~LFL~lvI~R~~~l--------------l~~l~~l~~~----~~~~~~~~~lk---~~ 142 (216)
T KOG1962|consen 84 LARTHLLEALFRAQRNLYISGFVLFLSLVIRRLHTL--------------LRELATLRAN----EKAMKENEALK---KQ 142 (216)
T ss_pred HHHHHHHHHHHHHHhhhHHhHHHHHHHHHHHHHHHH--------------HHHHHHHHhh----HHHHHHHHHHH---Hh
Confidence 677888888899999999998888887777444322 1111122221 22333333333 33
Q ss_pred HHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHH
Q 018721 287 VRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKIS 334 (351)
Q Consensus 287 ~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~ 334 (351)
+.+.++ +.|||+-+-+-++--.....+.+-.||..-=++.+|+|+..
T Consensus 143 ~~~~~~-~~~~~~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~Kq~e 189 (216)
T KOG1962|consen 143 LENSSK-LEEENDKLKADLEKLETELEKKQKKLEKAQKKVDALKKQSE 189 (216)
T ss_pred hhcccc-hhhhHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444 77777777666665555555555555555445555555543
No 40
>PF09304 Cortex-I_coil: Cortexillin I, coiled coil; InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=59.21 E-value=12 Score=32.52 Aligned_cols=52 Identities=17% Similarity=0.270 Sum_probs=21.9
Q ss_pred HHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHH
Q 018721 268 AEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 268 Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~al 319 (351)
|-.+.++.-|+.--..|.-..+.+.+-+.|.+-|++++..--..++-+.-+.
T Consensus 9 as~~el~n~La~Le~slE~~K~S~~eL~kqkd~L~~~l~~L~~q~~s~~qr~ 60 (107)
T PF09304_consen 9 ASQNELQNRLASLERSLEDEKTSQGELAKQKDQLRNALQSLQAQNASRNQRI 60 (107)
T ss_dssp -----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hhHHHHHHHHHHHHHHHHHHHhhHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444444444555555555555555544444444444333
No 41
>PRK11637 AmiB activator; Provisional
Probab=58.40 E-value=1.8e+02 Score=29.59 Aligned_cols=25 Identities=16% Similarity=0.355 Sum_probs=9.8
Q ss_pred hhhhhhHHHhhHHHHHhHHHHHHHH
Q 018721 256 LAARDQELRTLSAEMNQLQSELRLA 280 (351)
Q Consensus 256 la~r~qelRa~~Ae~~q~~~el~~a 280 (351)
+...++++..+.+++++++.|+..+
T Consensus 91 i~~~~~~i~~~~~ei~~l~~eI~~~ 115 (428)
T PRK11637 91 LRETQNTLNQLNKQIDELNASIAKL 115 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3333344444444444444333333
No 42
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=57.79 E-value=62 Score=35.76 Aligned_cols=51 Identities=20% Similarity=0.163 Sum_probs=33.5
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhHHHhhhHHHHHHHhhcccccchhhhhHHHHHHHHHHH
Q 018721 74 LCSCGVILLALTGIFQQYFVYQVQKIRLQGYYSFSQKLKHIVRLPFAITAYGTAAMLLVIV 134 (351)
Q Consensus 74 L~y~~v~LWlLt~l~d~yv~~qH~KlRl~GYl~FYR~Tr~lkRlPl~IvSlGNa~LLLI~a 134 (351)
+|+.-+.||++.+-++--++..+-| ..-|+++.| ||..|+.|-.++.+...
T Consensus 119 ~~~~~~~~~~~~~~~e~~~~~~~~~--~~~~~~~~~--------~~~ah~igypvv~~g~~ 169 (697)
T PF09726_consen 119 ICLPTVSLWILFVYVEASVRLKDLK--SMPHLDLCR--------PFAAHCIGYPVVTLGFG 169 (697)
T ss_pred ccHHHHHHHHHHHHHHHHHhhcccC--CCcchhhcc--------cHHHhhcCCceeEeecc
Confidence 5677788888888777777665443 223455544 88888888777555444
No 43
>KOG0993 consensus Rab5 GTPase effector Rabaptin-5 [Intracellular trafficking, secretion, and vesicular transport]
Probab=56.18 E-value=2.6e+02 Score=30.04 Aligned_cols=84 Identities=32% Similarity=0.332 Sum_probs=46.5
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhc-ccCCCCChhhhHHHhhh---hhhhHHHhhHHHHHhHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYE-QSDDGSTPQVDLAHLLA---ARDQELRTLSAEMNQLQSELRLARS 282 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye-~~~~g~t~qvdl~h~la---~r~qelRa~~Ae~~q~~~el~~ar~ 282 (351)
++|.|-.++..=-+....==+++++=...+-.++ ++.+.+|-|. +.+|+ .+|+-|+++.++++|+|+++..--+
T Consensus 289 v~dt~w~~lqkegrqlqrdlE~~~~~r~ele~~~~qs~ed~t~q~--~~ll~~~q~sE~ll~tlq~~iSqaq~~vq~qma 366 (542)
T KOG0993|consen 289 VLDTLWFILQKEGRQLQRDLEELIETRAELEHTEQQSQEDITVQR--AQLLEERQHSEDLLVTLQAEISQAQSEVQKQMA 366 (542)
T ss_pred ChHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHH--HHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6776666543221111111223333333333333 2333334333 33444 4588999999999999999875444
Q ss_pred HH-----------HHHHhhhh
Q 018721 283 FV-----------LRVRNTNN 292 (351)
Q Consensus 283 li-----------~~~r~~n~ 292 (351)
.+ .++|..|+
T Consensus 367 ~lv~a~e~i~~e~~rl~q~nd 387 (542)
T KOG0993|consen 367 RLVVASETIADEDSRLRQIND 387 (542)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 33 77888887
No 44
>PF06785 UPF0242: Uncharacterised protein family (UPF0242); InterPro: IPR009623 This is a group of proteins of unknown function.
Probab=55.89 E-value=92 Score=32.33 Aligned_cols=77 Identities=22% Similarity=0.248 Sum_probs=53.7
Q ss_pred HHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH-------HHHHhhhhhhhhHhHHHHHHHhhhh-------hhHHHHHHH
Q 018721 253 AHLLAARDQELRTLSAEMNQLQSELRLARSFV-------LRVRNTNNQYVEENERLRAILGEWS-------TRAAKLERA 318 (351)
Q Consensus 253 ~h~la~r~qelRa~~Ae~~q~~~el~~ar~li-------~~~r~~n~q~~eEn~rlRa~l~Ews-------~raakle~a 318 (351)
-..+..|.+|...|.-.-+++-.||-.+|+.+ |++..+=.+.-|||.+|-.-|++-+ -+++.|-|.
T Consensus 91 ~es~~e~q~e~~qL~~qnqkL~nqL~~~~~vf~k~k~~~q~LE~li~~~~EEn~~lqlqL~~l~~e~~Ekeeesq~LnrE 170 (401)
T PF06785_consen 91 RESVEERQQESEQLQSQNQKLKNQLFHVREVFMKTKGDIQHLEGLIRHLREENQCLQLQLDALQQECGEKEEESQTLNRE 170 (401)
T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHhHHHHHHHHhHhHHHHHHHHHH
Confidence 34556666666666666666777777788765 8999999999999999998886554 466777665
Q ss_pred HHHhhhccHHH
Q 018721 319 LEVERMSNIEL 329 (351)
Q Consensus 319 le~er~~~~~~ 329 (351)
|-.++--..+|
T Consensus 171 LaE~layqq~L 181 (401)
T PF06785_consen 171 LAEALAYQQEL 181 (401)
T ss_pred HHHHHHHHHHH
Confidence 55544433333
No 45
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=55.17 E-value=43 Score=31.74 Aligned_cols=38 Identities=34% Similarity=0.383 Sum_probs=20.0
Q ss_pred hHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 266 LSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 266 ~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
+.+|-.+|+.|+..-+..++++ ++..+||+|||+.|+-
T Consensus 74 l~~en~~L~~e~~~l~~~~~~~----~~l~~en~~L~~lL~~ 111 (276)
T PRK13922 74 LREENEELKKELLELESRLQEL----EQLEAENARLRELLNL 111 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHhcC
Confidence 3344444444444433333333 2446788888888763
No 46
>PRK02224 chromosome segregation protein; Provisional
Probab=54.59 E-value=2.7e+02 Score=30.59 Aligned_cols=57 Identities=21% Similarity=0.169 Sum_probs=30.2
Q ss_pred hhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHH
Q 018721 265 TLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEV 321 (351)
Q Consensus 265 a~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~ 321 (351)
.+....+++..++...+.-+...........+|-+.|++-+++|-.++..+|..++.
T Consensus 513 ~l~~~~~~l~~~~~~~~e~le~~~~~~~~l~~e~~~l~~~~~~~~~~~~~~~~~~~~ 569 (880)
T PRK02224 513 RLEERREDLEELIAERRETIEEKRERAEELRERAAELEAEAEEKREAAAEAEEEAEE 569 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHH
Confidence 333333334333333333333333334456666777788888887766666655554
No 47
>PF04111 APG6: Autophagy protein Apg6; InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=54.38 E-value=1e+02 Score=30.56 Aligned_cols=25 Identities=28% Similarity=0.207 Sum_probs=0.0
Q ss_pred HHHHHHhHHhhHHHHHHHHHhhhhc
Q 018721 216 QYQRENLHFLSEEILRLQECLSKYE 240 (351)
Q Consensus 216 rYLkdHNa~LSkrIL~Lq~~l~kye 240 (351)
.-|+++.....++.=..+..+.+-+
T Consensus 12 ~~l~~~~~~~~~E~~~Y~~fL~~l~ 36 (314)
T PF04111_consen 12 EQLDKQLEQAEKERDTYQEFLKKLE 36 (314)
T ss_dssp -------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3445555555555555555554444
No 48
>PRK10884 SH3 domain-containing protein; Provisional
Probab=54.33 E-value=88 Score=29.53 Aligned_cols=82 Identities=16% Similarity=0.200 Sum_probs=46.7
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLR 286 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~ 286 (351)
+..+|-.--.=.++-+..|.+++-.|+.++++-+.+-+ +-.-++...++.++++.-.|..|-.+++.|+..++..+..
T Consensus 80 V~~~~Ls~~p~~~~rlp~le~el~~l~~~l~~~~~~~~--~~~~~l~~~~~~~~~~~~~L~~~n~~L~~~l~~~~~~~~~ 157 (206)
T PRK10884 80 IPLKQLSTTPSLRTRVPDLENQVKTLTDKLNNIDNTWN--QRTAEMQQKVAQSDSVINGLKEENQKLKNQLIVAQKKVDA 157 (206)
T ss_pred EEHHHhcCCccHHHHHHHHHHHHHHHHHHHHHHHhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 55554443444556667777888888888866443311 1222445556666666555666666666666665555544
Q ss_pred HHhh
Q 018721 287 VRNT 290 (351)
Q Consensus 287 ~r~~ 290 (351)
++..
T Consensus 158 l~~~ 161 (206)
T PRK10884 158 ANLQ 161 (206)
T ss_pred HHHH
Confidence 4433
No 49
>KOG0804 consensus Cytoplasmic Zn-finger protein BRAP2 (BRCA1 associated protein) [General function prediction only]
Probab=53.60 E-value=2.1e+02 Score=30.71 Aligned_cols=31 Identities=26% Similarity=0.284 Sum_probs=28.2
Q ss_pred hHhHHHHHHHhhhhhhHHHHHHHHHHhhhcc
Q 018721 296 EENERLRAILGEWSTRAAKLERALEVERMSN 326 (351)
Q Consensus 296 eEn~rlRa~l~Ews~raakle~ale~er~~~ 326 (351)
|||.-|+.-++-|.....++|..++.++-+-
T Consensus 403 E~n~~l~knq~vw~~kl~~~~e~~~~~~~s~ 433 (493)
T KOG0804|consen 403 EENKKLIKNQDVWRGKLKELEEREKEALGSK 433 (493)
T ss_pred HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 8999999999999999999999999887653
No 50
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=53.08 E-value=92 Score=32.44 Aligned_cols=25 Identities=20% Similarity=0.400 Sum_probs=12.0
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHH
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFV 284 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li 284 (351)
+++++.+.+|++.+++++....+.+
T Consensus 77 ~~~l~~l~~~~~~~~~~~~~~~~~~ 101 (525)
T TIGR02231 77 RKQIRELEAELRDLEDRGDALKALA 101 (525)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444455555555555544444443
No 51
>COG1792 MreC Cell shape-determining protein [Cell envelope biogenesis, outer membrane]
Probab=53.07 E-value=20 Score=35.02 Aligned_cols=37 Identities=35% Similarity=0.463 Sum_probs=28.2
Q ss_pred hhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhh
Q 018721 258 ARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEW 308 (351)
Q Consensus 258 ~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ew 308 (351)
...|+||+.-+|++|.+.|+ .++.+||+|||..|+.-
T Consensus 73 ~en~~Lk~~l~~~~~~~~~~--------------~~l~~EN~~Lr~lL~~~ 109 (284)
T COG1792 73 LENEELKKELAELEQLLEEV--------------ESLEEENKRLKELLDFK 109 (284)
T ss_pred HHhHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHhCCc
Confidence 34567777777877777665 57889999999998753
No 52
>PF13870 DUF4201: Domain of unknown function (DUF4201)
Probab=51.88 E-value=1.8e+02 Score=26.02 Aligned_cols=123 Identities=22% Similarity=0.274 Sum_probs=86.5
Q ss_pred hhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHh----------hhhhhhHHHhhHHHHHhHHHHHHH
Q 018721 210 EQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHL----------LAARDQELRTLSAEMNQLQSELRL 279 (351)
Q Consensus 210 KQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~----------la~r~qelRa~~Ae~~q~~~el~~ 279 (351)
++-+.|.-++--|..|-..+-.++.++.+.+..++|-+. ||...+ ++.|..| +..++.-...
T Consensus 3 ~k~~~i~~~Rl~~~~lk~~l~k~~~ql~~ke~lge~L~~-iDFeqLkien~~l~~kIeERn~e-------L~~Lk~~~~~ 74 (177)
T PF13870_consen 3 QKRNEISKLRLKNITLKHQLAKLEEQLRQKEELGEGLHL-IDFEQLKIENQQLNEKIEERNKE-------LLKLKKKIGK 74 (177)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccH-HHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHH
Confidence 345667778888889999999999999999999888762 343333 2222222 2222222222
Q ss_pred HHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhcC
Q 018721 280 ARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQH 340 (351)
Q Consensus 280 ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~~ 340 (351)
+=-.+.++|.--.--.+|++.++..|.+-...-++++..|.......-.+++...++|.+.
T Consensus 75 ~v~~L~h~keKl~~~~~~~~~l~~~l~~~~~~~~~~r~~l~~~k~~r~k~~~~~~~l~~~~ 135 (177)
T PF13870_consen 75 TVQILTHVKEKLHFLSEELERLKQELKDREEELAKLREELYRVKKERDKLRKQNKKLRQQG 135 (177)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 2222367777777778888888888888888888888888888888888888888888776
No 53
>PF07111 HCR: Alpha helical coiled-coil rod protein (HCR); InterPro: IPR009800 This family consists of several mammalian alpha helical coiled-coil rod HCR proteins. The function of HCR is unknown but it has been implicated in psoriasis in humans and is thought to affect keratinocyte proliferation [].; GO: 0030154 cell differentiation, 0005634 nucleus, 0005737 cytoplasm
Probab=51.54 E-value=2.7e+02 Score=31.45 Aligned_cols=80 Identities=29% Similarity=0.347 Sum_probs=63.1
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHH-HHH-------HhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHH
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFV-LRV-------RNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQK 331 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li-~~~-------r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~ 331 (351)
.-||+-+-.|.|.+-+||++.=.+| ++| .+.-++|.+.+..|+..|.+--.--+.++.-|++-|-+-.|-.+
T Consensus 477 ~~ELqqLReERdRl~aeLqlSa~liqqeV~~ArEqgeaE~~~Lse~aqqLE~~Lq~~qe~la~l~~QL~~Ar~~lqes~e 556 (739)
T PF07111_consen 477 SLELQQLREERDRLDAELQLSARLIQQEVGRAREQGEAERQQLSEVAQQLEQELQEKQESLAELEEQLEAARKSLQESTE 556 (739)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHH
Confidence 5688999999999999999999999 443 34445677777778888888777778888888888888777777
Q ss_pred HHHHhhhc
Q 018721 332 KISTRRNQ 339 (351)
Q Consensus 332 ~~~~~r~~ 339 (351)
+-..+|+.
T Consensus 557 ea~~lR~E 564 (739)
T PF07111_consen 557 EAAELRRE 564 (739)
T ss_pred HHHHHHHH
Confidence 77777763
No 54
>PF14817 HAUS5: HAUS augmin-like complex subunit 5
Probab=51.46 E-value=2e+02 Score=31.74 Aligned_cols=120 Identities=13% Similarity=0.187 Sum_probs=62.1
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHH---hhHHHHHhHHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELR---TLSAEMNQLQSELRLARSF 283 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelR---a~~Ae~~q~~~el~~ar~l 283 (351)
-+=..+++-.|+-+|.+.-- .+=.....+-=|.-.+++..++ ++|.+++ .+.+|+.+|++|++.-++.
T Consensus 31 C~G~~~~IWkfli~~V~s~r-tV~~iRgNl~~~~~~~~~~~~~--------~~e~~~~~r~~L~~everLraei~~l~~~ 101 (632)
T PF14817_consen 31 CRGNMAPIWKFLIQHVRSQR-TVRKIRGNLLWYGHQQSKERKK--------SRENEARRRRELEKEVERLRAEIQELDKE 101 (632)
T ss_pred hccCChHHHHHHHHHcCcHh-HHHHHHcceeeccccccccchh--------hhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34467889999999998642 2333334444455555554455 5555543 4555666666666655555
Q ss_pred HHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHH---HHHHhhhccHHHHHHHHH
Q 018721 284 VLRVRNTNNQYVEENERLRAILGEWSTRAAKLER---ALEVERMSNIELQKKIST 335 (351)
Q Consensus 284 i~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~---ale~er~~~~~~~~~~~~ 335 (351)
|......+---..+-+|-..-+.+=.+|.+-|+. +-+.++-.-.|-.|++..
T Consensus 102 I~~~e~e~~~~e~~~~q~~~~~~~~~~k~~LL~Ay~q~c~~~~~~l~e~~~rl~~ 156 (632)
T PF14817_consen 102 IESREREVSRQEASREQMLDKISDSRHKQLLLEAYSQQCEEQRRILREYTKRLQG 156 (632)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5333322222222333444445555556655542 444444444454444443
No 55
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=51.38 E-value=2.4e+02 Score=27.42 Aligned_cols=38 Identities=18% Similarity=0.406 Sum_probs=18.5
Q ss_pred CCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH
Q 018721 243 DDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFV 284 (351)
Q Consensus 243 ~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li 284 (351)
+.|..++.++.. -+.++..+.++.+.+++++..+++.+
T Consensus 189 ~~g~is~~~~~~----~~~~~~~~~~~l~~~~~~l~~~~~~l 226 (423)
T TIGR01843 189 EKGLVSRLELLE----LERERAEAQGELGRLEAELEVLKRQI 226 (423)
T ss_pred HcCCCCHHHHHH----HHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 556666665542 22334444444444555555444444
No 56
>PF02403 Seryl_tRNA_N: Seryl-tRNA synthetase N-terminal domain; InterPro: IPR015866 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the N-terminal domain of Seryl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Seryl-tRNA synthetase (6.1.1.11 from EC) exists as monomer and belongs to class IIa [].; GO: 0000166 nucleotide binding, 0004828 serine-tRNA ligase activity, 0005524 ATP binding, 0006434 seryl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 3QO8_A 3QO5_A 3QO7_A 3QNE_A 3LSQ_A 3LSS_A 2DQ3_B 1SET_A 1SER_A 1SRY_B ....
Probab=50.71 E-value=1.2e+02 Score=24.81 Aligned_cols=65 Identities=26% Similarity=0.400 Sum_probs=44.1
Q ss_pred hHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHH
Q 018721 251 DLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALE 320 (351)
Q Consensus 251 dl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale 320 (351)
|++.++ .-+++.|.+-.+++.+++|-...-..|....... ++.+.|.+-..+.+.....+|..+.
T Consensus 27 ~vd~i~-~ld~~~r~l~~~~e~lr~~rN~~sk~I~~~~~~~----~~~~~l~~e~~~lk~~i~~le~~~~ 91 (108)
T PF02403_consen 27 DVDEII-ELDQERRELQQELEELRAERNELSKEIGKLKKAG----EDAEELKAEVKELKEEIKELEEQLK 91 (108)
T ss_dssp HHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHTT----CCTHHHHHHHHHHHHHHHHHHHHHH
T ss_pred hHHHHH-HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHhhCc----ccHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344443 3477778888888888888888777776555544 6666677777777766666665543
No 57
>PF04849 HAP1_N: HAP1 N-terminal conserved region; InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=49.65 E-value=1.6e+02 Score=29.85 Aligned_cols=37 Identities=24% Similarity=0.197 Sum_probs=29.0
Q ss_pred HHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhcC
Q 018721 304 ILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQH 340 (351)
Q Consensus 304 ~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~~ 340 (351)
-|.+=+...|.|..-|....--+...|++|+.+..|.
T Consensus 207 QL~~An~qia~LseELa~k~Ee~~rQQEEIt~Llsqi 243 (306)
T PF04849_consen 207 QLSEANQQIASLSEELARKTEENRRQQEEITSLLSQI 243 (306)
T ss_pred HhhhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4566677888888888888888888888888887754
No 58
>KOG0980 consensus Actin-binding protein SLA2/Huntingtin-interacting protein Hip1 [Cytoskeleton]
Probab=49.56 E-value=93 Score=35.75 Aligned_cols=63 Identities=29% Similarity=0.390 Sum_probs=44.6
Q ss_pred hhhhhHHHhhHHHHHhHHHHH----HHHHHHH--------------HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHH
Q 018721 257 AARDQELRTLSAEMNQLQSEL----RLARSFV--------------LRVRNTNNQYVEENERLRAILGEWSTRAAKLERA 318 (351)
Q Consensus 257 a~r~qelRa~~Ae~~q~~~el----~~ar~li--------------~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~a 318 (351)
+.||...+-+..|..|+..++ -.||.-| ++.+-.+++-.+|+++||..+..--.----+|.+
T Consensus 329 d~~~~~~~~~~~e~~~~~~~l~~~~~ear~~~~q~~~ql~~le~~~~e~q~~~qe~~~e~eqLr~elaql~a~r~q~eka 408 (980)
T KOG0980|consen 329 DPRELQIEQLSREVAQLKAQLENLKEEARRRIEQYENQLLALEGELQEQQREAQENREEQEQLRNELAQLLASRTQLEKA 408 (980)
T ss_pred ChhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 566666666666666665544 4667666 5666677788999999999988766555567777
Q ss_pred H
Q 018721 319 L 319 (351)
Q Consensus 319 l 319 (351)
.
T Consensus 409 ~ 409 (980)
T KOG0980|consen 409 Q 409 (980)
T ss_pred H
Confidence 6
No 59
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=49.42 E-value=1.4e+02 Score=33.07 Aligned_cols=73 Identities=26% Similarity=0.349 Sum_probs=45.2
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
|.+.|.+++.+++.|+-.-++.+.+++.. ++.-.+.+.-+..-..|-.+||+.|+.++..--+|.+++..+|.
T Consensus 437 e~~~L~~~~ee~k~eie~L~~~l~~~~r~----~~~~~~~~rei~~~~~~I~~L~~~L~e~~~~ve~L~~~l~~l~k 509 (652)
T COG2433 437 ENSELKRELEELKREIEKLESELERFRRE----VRDKVRKDREIRARDRRIERLEKELEEKKKRVEELERKLAELRK 509 (652)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344455555444444444444333321 11233444445555667899999999999999999999999984
No 60
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=49.23 E-value=2.9e+02 Score=27.68 Aligned_cols=36 Identities=22% Similarity=0.258 Sum_probs=29.5
Q ss_pred HHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCCh
Q 018721 213 ALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTP 248 (351)
Q Consensus 213 DLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~ 248 (351)
..+.|+.+....+.+++-..+..+.+|++..+-..|
T Consensus 171 ~~~~fl~~ql~~~~~~l~~ae~~l~~fr~~~~i~~~ 206 (444)
T TIGR03017 171 KAALWFVQQIAALREDLARAQSKLSAYQQEKGIVSS 206 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCccc
Confidence 347889999999999999999999999998554443
No 61
>PF11368 DUF3169: Protein of unknown function (DUF3169); InterPro: IPR021509 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently there is no known function.
Probab=49.17 E-value=2.4e+02 Score=26.67 Aligned_cols=46 Identities=9% Similarity=-0.081 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHHHHhHHHh----hhHHHHHHHhhcccccchhhhhHHHHH
Q 018721 83 ALTGIFQQYFVYQVQKIRL----QGYYSFSQKLKHIVRLPFAITAYGTAA 128 (351)
Q Consensus 83 lLt~l~d~yv~~qH~KlRl----~GYl~FYR~Tr~lkRlPl~IvSlGNa~ 128 (351)
++++.+....++.|++... ..-.++|+++.+--..+..+.+.....
T Consensus 63 ~~~~~~~~~~~k~~~~~~~~~deD~~~~~~~~~~r~~~~~~i~~~i~~i~ 112 (248)
T PF11368_consen 63 LLTFYFIYKSRKYKKLYEEEEDEDENEEYYRKMNRKLEYATIFFNISIII 112 (248)
T ss_pred HHHHHHHHHHHHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444555555565554 355668888877655544444443333
No 62
>TIGR02808 short_TIGR02808 conserved hypothetical protein TIGR02808. This very small protein (about 46 amino acids) consists largely of a single predicted membrane-spanning region. It is found in Photobacterium profundum SS9 and in three species of Vibrio, always near periplasmic nitrate reductase genes, but far from the periplasmic nitrate reductase genes in Aeromonas hydrophila ATCC7966.
Probab=49.16 E-value=13 Score=27.46 Aligned_cols=21 Identities=14% Similarity=0.479 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHhhhhhhccc
Q 018721 46 CFVLAGYAILAAGTTWIFHPI 66 (351)
Q Consensus 46 ~~~L~~yA~~~~~~pw~~~~~ 66 (351)
.|+|.||++++++..|++...
T Consensus 19 vIil~GF~~Va~~si~lLs~~ 39 (42)
T TIGR02808 19 FIILSGFVAVAVTSILLLNAF 39 (42)
T ss_pred hHHhhhhHHHHHHHHHHHHhh
Confidence 578999999999999987654
No 63
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=48.10 E-value=79 Score=34.27 Aligned_cols=74 Identities=28% Similarity=0.346 Sum_probs=39.7
Q ss_pred HHhhHHHHHhHHHHHHHHHHHH-HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHh
Q 018721 263 LRTLSAEMNQLQSELRLARSFV-LRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTR 336 (351)
Q Consensus 263 lRa~~Ae~~q~~~el~~ar~li-~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~ 336 (351)
+|-|.||=..|+.++...|+-. ...=..+..|..|-..+|.+++|=-..-||+|.-+-.=+-..-|+++++.+.
T Consensus 58 VR~LEaqN~~L~~di~~lr~~~~~~ts~ik~~ye~El~~ar~~l~e~~~~ra~~e~ei~kl~~e~~elr~~~~~~ 132 (546)
T KOG0977|consen 58 VRFLEAQNRKLEHDINLLRGVVGRETSGIKAKYEAELATARKLLDETARERAKLEIEITKLREELKELRKKLEKA 132 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCcchhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 3445555555555555555443 1222334556666666666666665555666665555555555555555543
No 64
>PF06638 Strabismus: Strabismus protein; InterPro: IPR009539 This family consists of several strabismus (STB) or Van Gogh-like (VANGL) proteins 1 and 2. The exact function of this family is unknown. It is thought, however that STB1 gene and STB2 may be potent tumour suppressor gene candidates [].; GO: 0007275 multicellular organismal development, 0016021 integral to membrane
Probab=47.77 E-value=39 Score=36.24 Aligned_cols=27 Identities=26% Similarity=0.368 Sum_probs=19.2
Q ss_pred CCCCCCcccCCCchhHhhHHHH---HHHHH
Q 018721 194 SSLEGLRYHDGGRLSDEQMALL---QYQRE 220 (351)
Q Consensus 194 ~~~~ElGfrd~g~LlEKQADLI---rYLkd 220 (351)
+..||.+|..-|.++=|.|++- .|.+|
T Consensus 238 SpDGesR~Y~iG~lSIQrAAv~vLe~Yy~d 267 (505)
T PF06638_consen 238 SPDGESRFYNIGQLSIQRAAVWVLEKYYKD 267 (505)
T ss_pred CCCCceeeeecCchhHHHHHHHHHHHHhhc
Confidence 5667888888888777777754 35555
No 65
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=47.11 E-value=2.1e+02 Score=25.46 Aligned_cols=12 Identities=42% Similarity=0.498 Sum_probs=5.4
Q ss_pred hhHHHHHHHHHh
Q 018721 225 LSEEILRLQECL 236 (351)
Q Consensus 225 LSkrIL~Lq~~l 236 (351)
+.+++-.++.++
T Consensus 100 l~~~~~~~~~~l 111 (191)
T PF04156_consen 100 LQERIQELESEL 111 (191)
T ss_pred HHHHHHHHHHHH
Confidence 444444444444
No 66
>cd08915 V_Alix_like Protein-interacting V-domain of mammalian Alix and related domains. This superfamily contains the V-shaped (V) domain of mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, and related domains. Alix, HD-PTP, Bro1, and Rim20 all interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Alix, also known as apoptosis-linked gene-2 interacting protein 1 (AIP1), participates in membrane remodeling processes during the budding of enveloped viruses, vesicle budding inside late endosomal multivesicular bodies (MVBs), and the abscission reactions of mammalian cell division. It also functions in apoptosis. HD-PTP functions in cell migration and endosomal trafficking, Bro1 in endosomal trafficking, and Rim20 in the response to the external pH via the Rim101 pathway. The Alix V-domain contains
Probab=46.99 E-value=2.9e+02 Score=27.11 Aligned_cols=90 Identities=18% Similarity=0.213 Sum_probs=70.9
Q ss_pred hhhhHHHhhhhh-hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh-hhhh-------------H
Q 018721 248 PQVDLAHLLAAR-DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE-WSTR-------------A 312 (351)
Q Consensus 248 ~qvdl~h~la~r-~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E-ws~r-------------a 312 (351)
|+- +.|.=+-+ .+.+..+...++.+..-...++..|......-+.=..|.+++|+-.|. |..| .
T Consensus 57 P~~-~~~~~~i~~~gg~~~l~~~~~~l~~l~~~~~~~l~~~~~~L~~E~~ed~~~R~k~g~~~w~~~~S~~~~~~l~~~~ 135 (342)
T cd08915 57 PDS-IQHSQEIIEEGGLDNIEQSFKELSKLRQNVEELLQECEELLEEEAAEDDQLRAKFGTLRWRRPSSDEAAKELYEKV 135 (342)
T ss_pred Cch-HHHHHHHHccCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhCcccCCCCChHHHHHHHHHHH
Confidence 444 66655544 456778888899998888888888888888888889999999999998 6655 4
Q ss_pred HHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 313 AKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 313 akle~ale~er~~~~~~~~~~~~~r~ 338 (351)
.|+.-.|+.-.=|+-.+++++..-+.
T Consensus 136 ~k~~~~L~~A~~sD~~l~~~~~~~~~ 161 (342)
T cd08915 136 TKLRGYLEQASNSDNEVLQCYESIDP 161 (342)
T ss_pred HHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 67888888888888888888887654
No 67
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=46.78 E-value=69 Score=29.68 Aligned_cols=53 Identities=26% Similarity=0.302 Sum_probs=38.3
Q ss_pred HHHHHhhhhhh---hhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHh
Q 018721 284 VLRVRNTNNQY---VEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTR 336 (351)
Q Consensus 284 i~~~r~~n~q~---~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~ 336 (351)
+|.+...+.+. ..||++|+.-+.+|..+...||.-++.-.-....++..|.++
T Consensus 89 Lq~l~~~~~~~~~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~~~~~eDY~~L 144 (161)
T TIGR02894 89 LQNLKTTNPSDQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTIEEDYQTL 144 (161)
T ss_pred HHHHHhcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 37777665544 469999999999999999999888766555555555555554
No 68
>PF09574 DUF2374: Protein of unknown function (Duf2374); InterPro: IPR014175 This very small protein (about 46 amino acids) consists largely of a single predicted membrane-spanning region. It is found in Photobacterium profundum SS9 and in three species of Vibrio, always near periplasmic nitrate reductase genes, but far from the periplasmic nitrate reductase genes in Aeromonas hydrophila ATCC 7966.
Probab=45.94 E-value=17 Score=26.80 Aligned_cols=21 Identities=24% Similarity=0.700 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHhhhhhhccc
Q 018721 46 CFVLAGYAILAAGTTWIFHPI 66 (351)
Q Consensus 46 ~~~L~~yA~~~~~~pw~~~~~ 66 (351)
.|+|.||+++++++.|++...
T Consensus 19 vI~L~GF~~Vav~~~~lL~~~ 39 (42)
T PF09574_consen 19 VIILSGFAAVAVASIWLLSLT 39 (42)
T ss_pred HHHHhhHHHHHHHHHHHHHhh
Confidence 578999999999999987654
No 69
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=44.90 E-value=3.6e+02 Score=27.52 Aligned_cols=80 Identities=30% Similarity=0.363 Sum_probs=49.1
Q ss_pred hHHHhhhhhhhHHH-hhHHHHHhHHHHHHH-HHHH--------------HHHHHhhhhhhhhHhHHHHHHHhhhhhhHHH
Q 018721 251 DLAHLLAARDQELR-TLSAEMNQLQSELRL-ARSF--------------VLRVRNTNNQYVEENERLRAILGEWSTRAAK 314 (351)
Q Consensus 251 dl~h~la~r~qelR-a~~Ae~~q~~~el~~-ar~l--------------i~~~r~~n~q~~eEn~rlRa~l~Ews~raak 314 (351)
||++-+...|.-|. +|+.-++|++.|--. ...+ |.+++.--..+-.+-+|||+.- -.
T Consensus 95 ~L~~~~e~EEE~ltn~L~rkl~qLr~EK~~lE~~Le~EqE~~V~kL~k~i~~Le~e~~~~q~~le~Lr~EK-------Vd 167 (310)
T PF09755_consen 95 TLALKYEQEEEFLTNDLSRKLNQLRQEKVELENQLEQEQEYLVNKLQKKIERLEKEKSAKQEELERLRREK-------VD 167 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH-------Hh
Confidence 67777777776665 888889988887432 2222 2444443445556667777632 23
Q ss_pred HHHHHHHh----------hhccHHHHHHHHHhh
Q 018721 315 LERALEVE----------RMSNIELQKKISTRR 337 (351)
Q Consensus 315 le~ale~e----------r~~~~~~~~~~~~~r 337 (351)
||.+||.| ||+.++-.|...+-+
T Consensus 168 lEn~LE~EQE~lvN~L~Kqm~~l~~eKr~Lq~~ 200 (310)
T PF09755_consen 168 LENTLEQEQEALVNRLWKQMDKLEAEKRRLQEK 200 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 88899887 455555555444333
No 70
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=44.79 E-value=3e+02 Score=28.09 Aligned_cols=33 Identities=18% Similarity=0.260 Sum_probs=17.1
Q ss_pred HhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhh
Q 018721 305 LGEWSTRAAKLERALEVERMSNIELQKKISTRR 337 (351)
Q Consensus 305 l~Ews~raakle~ale~er~~~~~~~~~~~~~r 337 (351)
+......-..|+|..+..+=-=..+.+++.+.+
T Consensus 350 ~~~~~~el~~L~Re~~~~~~~Y~~l~~r~eea~ 382 (498)
T TIGR03007 350 IPEVEAELTQLNRDYEVNKSNYEQLLTRRESAE 382 (498)
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334444455666666665544444555554444
No 71
>PRK11637 AmiB activator; Provisional
Probab=44.50 E-value=3.6e+02 Score=27.43 Aligned_cols=88 Identities=16% Similarity=0.168 Sum_probs=39.3
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLR 286 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~ 286 (351)
+.+.+.++|..+++.-..|....-.|...+. ++.-+++..+++...|.+++++-+.++..-.+.++.
T Consensus 164 i~~~d~~~l~~l~~~~~~L~~~k~~le~~~~-------------~l~~~~~e~~~~k~~L~~~k~e~~~~l~~L~~~~~~ 230 (428)
T PRK11637 164 LNQARQETIAELKQTREELAAQKAELEEKQS-------------QQKTLLYEQQAQQQKLEQARNERKKTLTGLESSLQK 230 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5555566666666655555554444444442 333444444444444444433322222222222222
Q ss_pred HHhhhhhhhhHhHHHHHHHhh
Q 018721 287 VRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 287 ~r~~n~q~~eEn~rlRa~l~E 307 (351)
....-.+...+..+|.+.|.+
T Consensus 231 ~~~~l~~l~~~~~~L~~~I~~ 251 (428)
T PRK11637 231 DQQQLSELRANESRLRDSIAR 251 (428)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 222223334455566666654
No 72
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=43.58 E-value=4.7e+02 Score=30.38 Aligned_cols=27 Identities=30% Similarity=0.363 Sum_probs=13.2
Q ss_pred hhhHhHHHHHHHhhhhhhHHHHHHHHH
Q 018721 294 YVEENERLRAILGEWSTRAAKLERALE 320 (351)
Q Consensus 294 ~~eEn~rlRa~l~Ews~raakle~ale 320 (351)
...|-+.++..+.+|..+.+-++..++
T Consensus 826 ~~~ei~~l~~~~~~~~~~~~~l~~~~~ 852 (1163)
T COG1196 826 LEQEIEELEEEIEELEEKLDELEEELE 852 (1163)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhHH
Confidence 344444555555555555554444443
No 73
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=42.60 E-value=1.2e+02 Score=26.02 Aligned_cols=44 Identities=30% Similarity=0.334 Sum_probs=23.9
Q ss_pred hHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhh
Q 018721 266 LSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWS 309 (351)
Q Consensus 266 ~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews 309 (351)
+...+.++.+|+..-+..+..+=..|..-.-||+.||..|++-.
T Consensus 13 le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~ 56 (107)
T PF06156_consen 13 LEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEELE 56 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33344444444444444455565666666666666666666543
No 74
>PHA02562 46 endonuclease subunit; Provisional
Probab=42.09 E-value=4e+02 Score=27.37 Aligned_cols=26 Identities=19% Similarity=0.276 Sum_probs=18.2
Q ss_pred HHHHHHHhHHhhHHHHHHHHHhhhhc
Q 018721 215 LQYQRENLHFLSEEILRLQECLSKYE 240 (351)
Q Consensus 215 IrYLkdHNa~LSkrIL~Lq~~l~kye 240 (351)
+.=+++....+..++-.++.....|+
T Consensus 257 L~~l~~~~~~~~~~l~~~~~~~~~~~ 282 (562)
T PHA02562 257 LNKLNTAAAKIKSKIEQFQKVIKMYE 282 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 33455666677777888888887775
No 75
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=41.92 E-value=2.9e+02 Score=25.63 Aligned_cols=120 Identities=22% Similarity=0.232 Sum_probs=56.6
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCC-CChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDG-STPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVL 285 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g-~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~ 285 (351)
|.+++.++.+=. ..+..|.++|-..=+.- .+| -.+-..+..-....+..+..+..++.+++.++...|..|.
T Consensus 22 L~~~~~~l~~~~-~~~~~l~~~i~~~l~~~------~~~~~~~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~ 94 (302)
T PF10186_consen 22 LLELRSELQQLK-EENEELRRRIEEILESD------SNGQLLEIQQLKREIEELRERLERLRERIERLRKRIEQKRERLE 94 (302)
T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHhh------hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 677777666543 34445555444332211 011 0111122222223333444555666666666666666654
Q ss_pred HHHhhhhh--------------hhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHH
Q 018721 286 RVRNTNNQ--------------YVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKI 333 (351)
Q Consensus 286 ~~r~~n~q--------------~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~ 333 (351)
.+|..+.+ ..+..+.+...+.++..+-.+++..+..-|..-+..-.+|
T Consensus 95 ~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~l~~~r~~l~~~l~~i 156 (302)
T PF10186_consen 95 ELRESLEQRRSRLSASQDLVESRQEQLEELQNELEERKQRLSQLQSQLARRRRQLIQELSEI 156 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44444433 2233344555566666666666666665555444433333
No 76
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=41.69 E-value=1.6e+02 Score=25.38 Aligned_cols=66 Identities=23% Similarity=0.212 Sum_probs=45.5
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhcC
Q 018721 275 SELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQH 340 (351)
Q Consensus 275 ~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~~ 340 (351)
+|+....+.+++++.....|.+.-..+|.-|+.=+.++..-+.-.|.|=+...+.-+.+.++|.+.
T Consensus 3 ~e~~~l~~e~~~~~~~~~~~~~~~~~~~~dl~~q~~~a~~Aq~~YE~El~~Ha~~~~~L~~lr~e~ 68 (132)
T PF07926_consen 3 SELSSLQSELQRLKEQEEDAEEQLQSLREDLESQAKIAQEAQQKYERELVKHAEDIKELQQLREEL 68 (132)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence 344444555555566666677777777777777777777777777777777777777777777653
No 77
>PF08618 Opi1: Transcription factor Opi1; InterPro: IPR013927 Opi1 is a leucine zipper containing yeast transcription factor that negatively regulates phospholipid biosynthesis []. It represses the expression of several UAS(INO) cis acting element containing genes and its activity is mediated by phosphorylations catalysed by protein kinase A, protein kinase C and casein kinase II [].
Probab=41.65 E-value=60 Score=34.13 Aligned_cols=30 Identities=27% Similarity=0.361 Sum_probs=27.3
Q ss_pred HHHHHHHHHhHHhhHHHHHHHHHhhhhccc
Q 018721 213 ALLQYQRENLHFLSEEILRLQECLSKYEQS 242 (351)
Q Consensus 213 DLIrYLkdHNa~LSkrIL~Lq~~l~kye~~ 242 (351)
=.|++||--|.+|+.+|..||..|.+|++.
T Consensus 235 yCL~~Lr~AN~~i~~~i~~Lq~~l~e~e~~ 264 (427)
T PF08618_consen 235 YCLHWLRLANAHIDSKINFLQDVLEEYERD 264 (427)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 357899999999999999999999999953
No 78
>PF08581 Tup_N: Tup N-terminal; InterPro: IPR013890 The N-terminal region of the Tup protein has been shown to interact with the Ssn6 transcriptional co-repressor []. ; PDB: 3VP9_B 3VP8_B.
Probab=41.10 E-value=1.7e+02 Score=23.94 Aligned_cols=23 Identities=39% Similarity=0.439 Sum_probs=18.5
Q ss_pred HHHHhhhhhhhhHhHHHHHHHhh
Q 018721 285 LRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 285 ~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
+.-+.+..+|.||-.|||+.|+.
T Consensus 53 ~~h~kmK~~YEeEI~rLr~eLe~ 75 (79)
T PF08581_consen 53 QAHRKMKQQYEEEIARLRRELEQ 75 (79)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHCH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHh
Confidence 44566778999999999998874
No 79
>PF06156 DUF972: Protein of unknown function (DUF972); InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=41.03 E-value=1e+02 Score=26.44 Aligned_cols=53 Identities=25% Similarity=0.305 Sum_probs=40.4
Q ss_pred HhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHH
Q 018721 264 RTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLE 316 (351)
Q Consensus 264 Ra~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle 316 (351)
+.+-..++++.+.+..-=..|..|+..=..=+|||.+||-.-..-..|..+++
T Consensus 4 ~~l~~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~ 56 (107)
T PF06156_consen 4 KELFDRLDQLEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEELE 56 (107)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 45677889999999888888888888888999999999865544444444443
No 80
>PF04156 IncA: IncA protein; InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=41.01 E-value=2.6e+02 Score=24.84 Aligned_cols=18 Identities=39% Similarity=0.506 Sum_probs=8.5
Q ss_pred HhHHhhHHHHHHHHHhhh
Q 018721 221 NLHFLSEEILRLQECLSK 238 (351)
Q Consensus 221 HNa~LSkrIL~Lq~~l~k 238 (351)
+...|.+++-.+++..+.
T Consensus 89 ~l~~l~~el~~l~~~~~~ 106 (191)
T PF04156_consen 89 QLQQLQEELDQLQERIQE 106 (191)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 334444555555555433
No 81
>PF07099 DUF1361: Protein of unknown function (DUF1361); InterPro: IPR009793 This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown although some members are annotated as being putative integral membrane proteins.
Probab=40.56 E-value=58 Score=29.45 Aligned_cols=31 Identities=26% Similarity=0.420 Sum_probs=23.1
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCC--CCchh
Q 018721 153 LIEAICAASFMSVYIGYVHQYNSLNS--QPDVM 183 (351)
Q Consensus 153 ~lElv~~l~~li~YIvkV~rFNk~kp--~PDVl 183 (351)
.+=.++.+++.++|++|.-|+|+=+- +|+..
T Consensus 109 ~~~~~~~Lss~GIYlGRflR~NSWDi~~~P~~l 141 (168)
T PF07099_consen 109 FIILISFLSSFGIYLGRFLRLNSWDILTNPQSL 141 (168)
T ss_pred HHHHHHHHHHHHHHHHhhcccchhHHhCCHHHH
Confidence 33456678889999999999999654 45443
No 82
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=40.35 E-value=5.7e+02 Score=32.13 Aligned_cols=41 Identities=27% Similarity=0.362 Sum_probs=19.1
Q ss_pred hHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHH
Q 018721 222 LHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSA 268 (351)
Q Consensus 222 Na~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~A 268 (351)
.+.|.+++-++...++.+++.. -|+++.+++-.+++|...+
T Consensus 1430 ~~~le~k~k~f~k~l~e~k~~~------e~l~~Eld~aq~e~r~~~t 1470 (1930)
T KOG0161|consen 1430 VAALEKKQKRFEKLLAEWKKKL------EKLQAELDAAQRELRQLST 1470 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH------HHHHHHHHHHHHHHHHhHH
Confidence 4445555555555555555442 2444444444444443333
No 83
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=40.14 E-value=1.2e+02 Score=27.66 Aligned_cols=73 Identities=21% Similarity=0.220 Sum_probs=30.9
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHH
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKIS 334 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~ 334 (351)
.|-.+..++..+..++..-...|..++..+.+.-++...|...|.|+..-...|=-.+.+=.+.+--+.+++.
T Consensus 96 ~L~~~~~~l~~l~~~~~~~~~~l~~l~~~~~~L~~~~~~l~~~l~ek~k~~e~l~DE~~~L~l~~~~~e~k~~ 168 (194)
T PF08614_consen 96 QLVELNDELQELEKELSEKERRLAELEAELAQLEEKIKDLEEELKEKNKANEILQDELQALQLQLNMLEEKLR 168 (194)
T ss_dssp ------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred cccccccccchhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556666666666666666666666666666666666666666666665555444444443333333333333
No 84
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=40.08 E-value=1.6e+02 Score=31.59 Aligned_cols=27 Identities=26% Similarity=0.244 Sum_probs=12.8
Q ss_pred hhhhHhHHHHHHHhhhhhhHHHHHHHH
Q 018721 293 QYVEENERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 293 q~~eEn~rlRa~l~Ews~raakle~al 319 (351)
...+|-+.|.+-+.++..+....+..+
T Consensus 259 ~~~~~r~~Le~ei~~le~e~~e~~~~l 285 (650)
T TIGR03185 259 DLFEEREQLERQLKEIEAARKANRAQL 285 (650)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444455555555554444444444
No 85
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=40.02 E-value=4.4e+02 Score=30.78 Aligned_cols=72 Identities=24% Similarity=0.263 Sum_probs=43.1
Q ss_pred hHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHh
Q 018721 251 DLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVE 322 (351)
Q Consensus 251 dl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~e 322 (351)
+...-++.=+.++.....+.++...++..++...+..+...++-.++-..+|..+.+=..+-+.|++.|..+
T Consensus 466 e~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~~~~~~~l~~~L~p~ 537 (1201)
T PF12128_consen 466 EEKEQLEQADKRLEQAQEQQNQAQQAVEELQAEEQELRKERDQAEEELRQARRELEELRAQIAELQRQLDPQ 537 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCC
Confidence 344444444445545555555555555555555566666666666666777777777777777777776544
No 86
>PF07798 DUF1640: Protein of unknown function (DUF1640); InterPro: IPR024461 This family consists of uncharacterised proteins.
Probab=40.00 E-value=2.8e+02 Score=24.98 Aligned_cols=39 Identities=18% Similarity=0.344 Sum_probs=22.5
Q ss_pred HhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 297 ENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 297 En~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
|-.|.|....+=..+-..++..++.| ++ .|+.+|...|.
T Consensus 114 eK~~~r~e~~~~~~ki~e~~~ki~~e-i~--~lr~~iE~~K~ 152 (177)
T PF07798_consen 114 EKGRIREEQAKQELKIQELNNKIDTE-IA--NLRTEIESLKW 152 (177)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHH-HH--HHHHHHHHHHH
Confidence 34456666666666666666666665 33 36666665554
No 87
>PF09726 Macoilin: Transmembrane protein; InterPro: IPR019130 This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=39.94 E-value=1.7e+02 Score=32.56 Aligned_cols=72 Identities=18% Similarity=0.318 Sum_probs=44.2
Q ss_pred HHHHHhHHHHHHHHHHH-------HHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 267 SAEMNQLQSELRLARSF-------VLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 267 ~Ae~~q~~~el~~ar~l-------i~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
-.|..+|+.||+..... ++.+|..+.-=..|.|-|-..|.-=...++.||.-|-+|=.=.+||=.-+-.-|+
T Consensus 551 E~E~~~lr~elk~kee~~~~~e~~~~~lr~~~~e~~~~~e~L~~aL~amqdk~~~LE~sLsaEtriKldLfsaLg~akr 629 (697)
T PF09726_consen 551 ESELKKLRRELKQKEEQIRELESELQELRKYEKESEKDTEVLMSALSAMQDKNQHLENSLSAETRIKLDLFSALGDAKR 629 (697)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHH
Confidence 34444444444444433 3444443211123678888888888889999999999997666776554444444
No 88
>COG3524 KpsE Capsule polysaccharide export protein [Cell envelope biogenesis, outer membrane]
Probab=39.62 E-value=2.2e+02 Score=29.42 Aligned_cols=104 Identities=18% Similarity=0.333 Sum_probs=65.5
Q ss_pred HHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCC-C--hhhhH-HHhhhhhhhHHHhhHHHHHhHHHHH----HHHHHHH
Q 018721 213 ALLQYQRENLHFLSEEILRLQECLSKYEQSDDGS-T--PQVDL-AHLLAARDQELRTLSAEMNQLQSEL----RLARSFV 284 (351)
Q Consensus 213 DLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~-t--~qvdl-~h~la~r~qelRa~~Ae~~q~~~el----~~ar~li 284 (351)
|.++|-.+-...=-+|+-.-+..|..|.-. .|+ + .|-++ ..|.+.=+.||-.+.|.++|++|-. -.-+.|.
T Consensus 179 dt~r~Ae~eV~~~eerv~kAs~~L~~yr~k-ngvfdp~~qaevq~~Lvs~Le~eL~~iqaqL~tvks~m~~~nPqi~~Lk 257 (372)
T COG3524 179 DTVRFAEEEVQKAEERVKKASNDLTDYRIK-NGVFDPKAQAEVQMSLVSKLEDELIVIQAQLDTVKSVMNPENPQIPGLK 257 (372)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHHHhh-cCccChhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcchhHH
Confidence 556666666666666777777778888665 666 3 34566 4555555788888888888888765 2234455
Q ss_pred HHHHhhhhhhhhHhHHHHHHHh--hhhhhHHHHHH
Q 018721 285 LRVRNTNNQYVEENERLRAILG--EWSTRAAKLER 317 (351)
Q Consensus 285 ~~~r~~n~q~~eEn~rlRa~l~--Ews~raakle~ 317 (351)
.|+.+...|-.-|-.++-+-=. .-+.|||+.+|
T Consensus 258 arieSlrkql~qe~q~isag~~~~sl~~qaAefq~ 292 (372)
T COG3524 258 ARIESLRKQLLQEKQAISAGGSSQSLSNQAAEFQR 292 (372)
T ss_pred HHHHHHHHHHHHHHHHhcCCCCccchhHHHHHHHH
Confidence 6666665555555444433111 34678888765
No 89
>PF15619 Lebercilin: Ciliary protein causing Leber congenital amaurosis disease
Probab=39.51 E-value=3.2e+02 Score=25.52 Aligned_cols=83 Identities=24% Similarity=0.415 Sum_probs=48.0
Q ss_pred HHHHhhhhcccCCCCChhhhHHHhhhhhhhHHH--------------hhHHHHHhHHHHHHHHHHHHHHHHhhhh-hhh-
Q 018721 232 LQECLSKYEQSDDGSTPQVDLAHLLAARDQELR--------------TLSAEMNQLQSELRLARSFVLRVRNTNN-QYV- 295 (351)
Q Consensus 232 Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelR--------------a~~Ae~~q~~~el~~ar~li~~~r~~n~-q~~- 295 (351)
-...+++|+.+ ++.=|| +++.-..|+| +++..+.....|+..++.-++++..... ...
T Consensus 45 q~kAL~k~e~~-e~~Lpq-----ll~~h~eEvr~Lr~~LR~~q~~~r~~~~klk~~~~el~k~~~~l~~L~~L~~dknL~ 118 (194)
T PF15619_consen 45 QEKALQKYEDT-EAELPQ-----LLQRHNEEVRVLRERLRKSQEQERELERKLKDKDEELLKTKDELKHLKKLSEDKNLA 118 (194)
T ss_pred HHHHHHHHHhh-hhhHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCch
Confidence 33469999987 444454 4444444444 4555555555666666666655544322 222
Q ss_pred ------hHhHHHHHHHhhhhhhHHHHHHHHH
Q 018721 296 ------EENERLRAILGEWSTRAAKLERALE 320 (351)
Q Consensus 296 ------eEn~rlRa~l~Ews~raakle~ale 320 (351)
.+-+.+.+.|.+--.+...||+-+|
T Consensus 119 eReeL~~kL~~~~~~l~~~~~ki~~Lek~le 149 (194)
T PF15619_consen 119 EREELQRKLSQLEQKLQEKEKKIQELEKQLE 149 (194)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344566777777777777777655
No 90
>PHA02562 46 endonuclease subunit; Provisional
Probab=39.17 E-value=4.5e+02 Score=27.04 Aligned_cols=90 Identities=13% Similarity=0.220 Sum_probs=38.3
Q ss_pred HHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhh
Q 018721 214 LLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQ 293 (351)
Q Consensus 214 LIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q 293 (351)
.+..+.+....|..++=.+.+.+.+++...+ ---.+...+...+++++.....++.+..+....++.|.++...
T Consensus 300 ~~~~l~d~i~~l~~~l~~l~~~i~~~~~~~~---~~~~~~~~i~el~~~i~~~~~~i~~~~~~~~~l~~ei~~l~~~--- 373 (562)
T PHA02562 300 RITKIKDKLKELQHSLEKLDTAIDELEEIMD---EFNEQSKKLLELKNKISTNKQSLITLVDKAKKVKAAIEELQAE--- 373 (562)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---
Confidence 3334444444555555555544444433321 0001122233333344444444444444444444444444444
Q ss_pred hhhHhHHHHHHHhhhh
Q 018721 294 YVEENERLRAILGEWS 309 (351)
Q Consensus 294 ~~eEn~rlRa~l~Ews 309 (351)
..+-+++|+...++|-
T Consensus 374 ~~~~~~~l~~l~~~l~ 389 (562)
T PHA02562 374 FVDNAEELAKLQDELD 389 (562)
T ss_pred hhchHHHHHHHHHHHH
Confidence 4444455555555553
No 91
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=38.96 E-value=5e+02 Score=27.53 Aligned_cols=110 Identities=20% Similarity=0.236 Sum_probs=64.8
Q ss_pred HHHHHHhhhhcccCCCCChhh-----h------HHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH----HHHHhhhhhh
Q 018721 230 LRLQECLSKYEQSDDGSTPQV-----D------LAHLLAARDQELRTLSAEMNQLQSELRLARSFV----LRVRNTNNQY 294 (351)
Q Consensus 230 L~Lq~~l~kye~~~~g~t~qv-----d------l~h~la~r~qelRa~~Ae~~q~~~el~~ar~li----~~~r~~n~q~ 294 (351)
-.|.+.|+-+.++++-..|.+ | ++-++.+=-++.+....++.+.+.+|..-+.-| ..++..=-+.
T Consensus 115 ~~La~~L~A~~r~g~~p~~~ll~~~eda~~~~R~ai~~~~l~~~~~~~i~~l~~~~~~l~~~~~~iaaeq~~l~~~~~eq 194 (420)
T COG4942 115 RRLAEQLAALQRSGRNPPPALLVSPEDAQRSVRLAIYYGALNPARAERIDALKATLKQLAAVRAEIAAEQAELTTLLSEQ 194 (420)
T ss_pred HHHHHHHHHHHhccCCCCchhhcChhhhhHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 445555666667755554433 1 122222223344444555555566666666666 4444445566
Q ss_pred hhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhc
Q 018721 295 VEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQ 339 (351)
Q Consensus 295 ~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~ 339 (351)
.++.++|-..+.|-..=-++|+..++.++-+.-||+.|=+.+++.
T Consensus 195 ~~q~~kl~~~~~E~kk~~~~l~~~l~~~q~~l~eL~~~~~~L~~~ 239 (420)
T COG4942 195 RAQQAKLAQLLEERKKTLAQLNSELSADQKKLEELRANESRLKNE 239 (420)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 777777777777777777777777777777777777666666553
No 92
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=38.63 E-value=3.8e+02 Score=26.06 Aligned_cols=19 Identities=37% Similarity=0.468 Sum_probs=8.0
Q ss_pred hhHHHHHhHHHHHHHHHHH
Q 018721 265 TLSAEMNQLQSELRLARSF 283 (351)
Q Consensus 265 a~~Ae~~q~~~el~~ar~l 283 (351)
.+.+++.++++++..++..
T Consensus 155 ~~~~~i~~~~~~l~~~~~~ 173 (423)
T TIGR01843 155 QLEAELAGLQAQLQALRQQ 173 (423)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444433
No 93
>PF00769 ERM: Ezrin/radixin/moesin family; InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=38.62 E-value=2.2e+02 Score=27.32 Aligned_cols=76 Identities=25% Similarity=0.276 Sum_probs=38.8
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhh
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRR 337 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r 337 (351)
+++-+-.|...|..+..-|...+++|+.....-.+|.++|-+-+.|-...+++|+.+-+.----.-+|++++..-|
T Consensus 41 k~k~aeeea~~Le~k~~eaee~~~rL~~~~~~~~eEk~~Le~e~~e~~~~i~~l~ee~~~ke~Ea~~lq~el~~ar 116 (246)
T PF00769_consen 41 KLKQAEEEAEELEQKRQEAEEEKQRLEEEAEMQEEEKEQLEQELREAEAEIARLEEESERKEEEAEELQEELEEAR 116 (246)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444445555666666667788888888888888888888888888888887665543333344444444333
No 94
>KOG4673 consensus Transcription factor TMF, TATA element modulatory factor [Transcription]
Probab=38.13 E-value=5.5e+02 Score=29.46 Aligned_cols=130 Identities=24% Similarity=0.233 Sum_probs=84.3
Q ss_pred hhHh---hHHHHHHHHHH-------hHHhhHHHHHHHHHhhhhcccCCCCC------------------hhh-hHH---H
Q 018721 207 LSDE---QMALLQYQREN-------LHFLSEEILRLQECLSKYEQSDDGST------------------PQV-DLA---H 254 (351)
Q Consensus 207 LlEK---QADLIrYLkdH-------Na~LSkrIL~Lq~~l~kye~~~~g~t------------------~qv-dl~---h 254 (351)
|+-| |..+|+.||-. ..++++.|-.|+.+.++-++.-+|.+ -|- |.. .
T Consensus 465 LSK~ql~qs~iIkKLRAk~ke~etl~~K~ge~i~~L~sE~~~lk~il~~Kee~Ek~~~E~I~k~~ae~~rq~~~~~~sr~ 544 (961)
T KOG4673|consen 465 LSKKQLAQSAIIKKLRAKIKEAETLEEKKGELITKLQSEENKLKSILRDKEETEKLLQETIEKHQAELTRQKDYYSNSRA 544 (961)
T ss_pred hHHHHHHHHHHHHHHHHHhhhhhHHHHHhhhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHH
Confidence 5554 45788888743 46888999999999999999877773 222 111 2
Q ss_pred hhhhhhhHHHhhHHHHHhHHHHHHHHHHHH-HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHH
Q 018721 255 LLAARDQELRTLSAEMNQLQSELRLARSFV-LRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKI 333 (351)
Q Consensus 255 ~la~r~qelRa~~Ae~~q~~~el~~ar~li-~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~ 333 (351)
+.++-+..++|+.|-.|-+.+.|..+-.+= ...|.--.-||---+-||-.|.--...||+=|--+-.| +-+||+.+
T Consensus 545 ~~~~le~~~~a~qat~d~a~~Dlqk~nrlkQdear~~~~~lvqqv~dLR~~L~~~Eq~aarrEd~~R~E---i~~LqrRl 621 (961)
T KOG4673|consen 545 LAAALEAQALAEQATNDEARSDLQKENRLKQDEARERESMLVQQVEDLRQTLSKKEQQAARREDMFRGE---IEDLQRRL 621 (961)
T ss_pred HHHHHHHHHHHHHHhhhhhhhhHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHH
Confidence 233334556677777777777664433332 35566666677777789999998888898877766544 44555544
Q ss_pred --HHhhhc
Q 018721 334 --STRRNQ 339 (351)
Q Consensus 334 --~~~r~~ 339 (351)
++.|.+
T Consensus 622 qaaE~R~e 629 (961)
T KOG4673|consen 622 QAAERRCE 629 (961)
T ss_pred HHHHHHHH
Confidence 344443
No 95
>PF05010 TACC: Transforming acidic coiled-coil-containing protein (TACC); InterPro: IPR007707 This family contains the proteins TACC 1, 2 and 3, found concentrated in the centrosomes of eukaryotes which may play a conserved role in organising centrosomal microtubules. The human TACC proteins have been linked to cancer and TACC2 has been identified as a possible tumour suppressor (AZU-1) [].
Probab=37.16 E-value=3.8e+02 Score=25.58 Aligned_cols=69 Identities=29% Similarity=0.276 Sum_probs=54.5
Q ss_pred hhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHH
Q 018721 258 ARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKIST 335 (351)
Q Consensus 258 ~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~ 335 (351)
.|=+-|++++.| .|..|-..|.+||. .+-.|...|+|.|.-=..+..-||+.|+..---|-||-|=-..
T Consensus 132 qry~aLK~hAee------kL~~ANeei~~v~~---~~~~e~~aLqa~lkk~e~~~~SLe~~LeQK~kEn~ELtkICDe 200 (207)
T PF05010_consen 132 QRYQALKAHAEE------KLEKANEEIAQVRS---KHQAELLALQASLKKEEMKVQSLEESLEQKTKENEELTKICDE 200 (207)
T ss_pred HHHHHHHHHHHH------HHHHHHHHHHHHHH---HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444566666554 67788888887775 4567999999999999999999999999998888888764433
No 96
>PF10168 Nup88: Nuclear pore component; InterPro: IPR019321 Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells [].
Probab=37.15 E-value=2.4e+02 Score=31.49 Aligned_cols=63 Identities=21% Similarity=0.216 Sum_probs=37.5
Q ss_pred HHHHHHHH-HH---HHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhc
Q 018721 277 LRLARSFV-LR---VRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQ 339 (351)
Q Consensus 277 l~~ar~li-~~---~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~ 339 (351)
..+||.+| .| +...-+|+.+|-+.++...+.=+.+|.+|..++|.-+=.--.|.|++.++.+.
T Consensus 556 ~~~ar~ei~~rv~~Lk~~~e~Ql~~L~~l~e~~~~l~~~ae~LaeR~e~a~d~Qe~L~~R~~~vl~~ 622 (717)
T PF10168_consen 556 QDLAREEIQRRVKLLKQQKEQQLKELQELQEERKSLRESAEKLAERYEEAKDKQEKLMKRVDRVLQL 622 (717)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34667776 33 34444466667777777777777777777666655444444555555555443
No 97
>PF07856 Orai-1: Mediator of CRAC channel activity; InterPro: IPR012446 This entry includes Drosophila Orai and human Orai1, Orai2 and Orai3. ORAI-1 GFP reporters are co-expressed with STIM-1 (ER CA(2+) sensors) in the gonad and intestine. The protein has four predicted transmembrane domains with a highly conserved region between TM2 ad TM3. This conserved domain is thought to function in channel regulation. ORAI1-related proteins are required for the production of the calcium channel, CRAC, along with STIM1-related proteins [].
Probab=37.13 E-value=1.4e+02 Score=27.69 Aligned_cols=20 Identities=0% Similarity=0.313 Sum_probs=12.9
Q ss_pred HHHHHHHHHHHHHHHHHhcC
Q 018721 156 AICAASFMSVYIGYVHQYNS 175 (351)
Q Consensus 156 lv~~l~~li~YIvkV~rFNk 175 (351)
.++++|..++|++.+..|.+
T Consensus 146 t~i~~~~~li~~~~~~~~wr 165 (175)
T PF07856_consen 146 TAILVPVLLIFVVFIQHFWR 165 (175)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 34566777777777666654
No 98
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=37.01 E-value=3.1e+02 Score=29.25 Aligned_cols=57 Identities=26% Similarity=0.261 Sum_probs=30.3
Q ss_pred hhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHH
Q 018721 256 LAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAK 314 (351)
Q Consensus 256 la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raak 314 (351)
++-+|-||+-|..|..|+..|.-.++.. .=+.+-+-..+||+|.|-..++-..|-.|
T Consensus 50 ~~~~E~~l~~Lq~e~~~l~e~~v~~~a~--~~~~t~~~~~~en~~~r~~~eir~~~~q~ 106 (459)
T KOG0288|consen 50 LQEKELELNRLQEENTQLNEERVREEAT--EKTLTVDVLIAENLRIRSLNEIRELREQK 106 (459)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 4445556666666666666555442222 22233445566777777666655554443
No 99
>PF11003 DUF2842: Protein of unknown function (DUF2842); InterPro: IPR021265 This bacterial family of proteins have no known function.
Probab=36.75 E-value=1.1e+02 Score=24.01 Aligned_cols=25 Identities=32% Similarity=0.587 Sum_probs=19.6
Q ss_pred ccchhhHHHHHHHHHHHHHHHh-hhh
Q 018721 37 HSIFGSVVYCFVLAGYAILAAG-TTW 61 (351)
Q Consensus 37 ~~~~g~~~y~~~L~~yA~~~~~-~pw 61 (351)
|..+|..+-++.++.|++++++ +.+
T Consensus 1 Rk~ig~v~ll~~l~vY~~~a~~l~~~ 26 (62)
T PF11003_consen 1 RKLIGLVLLLVGLPVYAVLAVTLADW 26 (62)
T ss_pred CceehhHHHHHHHHHHHHHHHHHHHH
Confidence 4567888888899999999888 444
No 100
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=36.50 E-value=6.9e+02 Score=28.43 Aligned_cols=74 Identities=14% Similarity=0.135 Sum_probs=59.5
Q ss_pred HHHHHhHHHHHHHHHHHHHHHHhhhh-hhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhcC
Q 018721 267 SAEMNQLQSELRLARSFVLRVRNTNN-QYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQH 340 (351)
Q Consensus 267 ~Ae~~q~~~el~~ar~li~~~r~~n~-q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~~ 340 (351)
...++....|....+.-|...|.... .+.||.+..+.-+.+|...-..|+.-|..-=++-.+++-..+++.++.
T Consensus 435 ~~~lEea~~eker~~e~l~e~r~~~e~e~~Eele~~~~e~~~lk~~~~~LQ~eLsEk~~~l~~~kee~s~l~s~~ 509 (775)
T PF10174_consen 435 LETLEEALREKERLQERLEEQRERAEKERQEELETYQKELKELKAKLESLQKELSEKELQLEDAKEEASKLASSQ 509 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHhhhHHHHHhhcc
Confidence 35666777777777777755655555 556999999999999999999999999887788888888888888776
No 101
>PF09486 HrpB7: Bacterial type III secretion protein (HrpB7); InterPro: IPR013392 This entry represents proteins encoded by genes which are found in type III secretion operons in a narrow range of species including Xanthomonas, Burkholderia and Ralstonia.
Probab=36.50 E-value=3.4e+02 Score=24.91 Aligned_cols=56 Identities=27% Similarity=0.319 Sum_probs=37.6
Q ss_pred hHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhh
Q 018721 261 QELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVER 323 (351)
Q Consensus 261 qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er 323 (351)
.+++.+.++..++++.+..+...|..++. .-.|+++.++--+-|.++|.+++|+.+
T Consensus 86 ~~~~~~e~~~a~l~~~l~~~~~~ia~~~r-------aIarn~a~id~~~er~~~l~r~~ea~~ 141 (158)
T PF09486_consen 86 ERVRAAEAELAALRQALRAAEDEIAATRR-------AIARNDARIDVCRERIDRLRRAAEAAA 141 (158)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHhhhhHHHHHHHHHHHHHHHHHhH
Confidence 46667777777887777777777744432 235666667777777777777776654
No 102
>PF14197 Cep57_CLD_2: Centrosome localisation domain of PPC89
Probab=36.43 E-value=2.2e+02 Score=22.63 Aligned_cols=41 Identities=32% Similarity=0.308 Sum_probs=25.5
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
+.++..|..|.|-.-..|.-|...+..++ +||+.|+.-|++
T Consensus 25 ~~~~k~L~~ERd~~~~~l~~a~~e~~~Lk-------~E~e~L~~el~~ 65 (69)
T PF14197_consen 25 EIENKRLRRERDSAERQLGDAYEENNKLK-------EENEALRKELEE 65 (69)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHH
Confidence 35666777777777677766666555544 566666665543
No 103
>PF06548 Kinesin-related: Kinesin-related; InterPro: IPR010544 This entry represents a domain within kinesin-related proteins from higher plants. Many proteins containing this domain also contain the IPR001752 from INTERPRO domain. Kinesins are ATP-driven microtubule motor proteins that produce directed force []. Some family members are associated with the phragmoplast, a structure composed mainly of microtubules that executes cytokinesis in higher plants [].
Probab=36.09 E-value=5.9e+02 Score=27.52 Aligned_cols=35 Identities=34% Similarity=0.378 Sum_probs=29.4
Q ss_pred HHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHH
Q 018721 299 ERLRAILGEWSTRAAKLERALEVERMSNIELQKKI 333 (351)
Q Consensus 299 ~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~ 333 (351)
+-||.-|+--..+|-|+|.-|+.|+..+-||..-+
T Consensus 298 eeLR~dle~~r~~aek~~~EL~~Ek~c~eEL~~al 332 (488)
T PF06548_consen 298 EELRVDLESSRSLAEKLEMELDSEKKCTEELDDAL 332 (488)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 44777788888999999999999999999887644
No 104
>PF04977 DivIC: Septum formation initiator; InterPro: IPR007060 DivIC, from the spore-forming, Gram-positive bacterium Bacillus subtilis, is necessary for both vegetative and sporulation septum formation []. These proteins are mainly composed of an N-terminal coiled-coil. DivIB, DivIC and FtsL inter-depend on each other for stabilisation and localisation. The latter two form a heterodimer. DivIC is always centre cell but the other two associate with it during septation [].; GO: 0007049 cell cycle
Probab=35.99 E-value=96 Score=23.41 Aligned_cols=16 Identities=25% Similarity=0.383 Sum_probs=6.4
Q ss_pred hhhhHhHHHHHHHhhh
Q 018721 293 QYVEENERLRAILGEW 308 (351)
Q Consensus 293 q~~eEn~rlRa~l~Ew 308 (351)
+..+||+.|.+.++.+
T Consensus 35 ~l~~e~~~L~~ei~~l 50 (80)
T PF04977_consen 35 ELKKENEELKEEIERL 50 (80)
T ss_pred HHHHHHHHHHHHHHHh
Confidence 3334444444443333
No 105
>PF10186 Atg14: UV radiation resistance protein and autophagy-related subunit 14; InterPro: IPR018791 Class III phosphatidylinositol 3-kinase (PI3-kinase) regulates multiple membrane trafficking. In yeast, two distinct PI3-kinase complexes are known: complex I (Vps34, Vps15, Vps30/Atg6, and Atg14) is involved in autophagy, and complex II (Vps34, Vps15, Vps30/Atg6, and Vps38) functions in the vacuolar protein sorting pathway. In mammals, the counterparts of Vps34, Vps15, and Vps30/Atg6 are Vps34, p150, and Beclin 1, respectively. Mammalian UV irradiation resistance-associated gene (UVRAG) has been identified as identical to yeast Vps38 []. The Atg14 (autophagy-related protein 14) proteins are hydrophilic proteins and have a coiled-coil motif at the N terminus region. Yeast cells with mutant Atg14 are defective not only in autophagy but also in sorting of carboxypeptidase Y (CPY), a vacuolar-soluble hydrolase, to the vacuole []. This entry represents Atg14 and UVRAG, which bind Beclin 1 to forms two distinct PI3-kinase complexes. This entry also includes Bakor (beclin-1-associated autophagy-related key regulator), also known as autophagy-related protein 14-like protein, which share sequence similarity to the yeast Atg14 protein []. Barkor positively regulates autophagy through its interaction with Beclin-1, with decreased levels of autophagosome formation observed when Barkor expression is eliminated []. Autophagy mediates the cellular response to nutrient deprivation, protein aggregation, and pathogen invasion in humans, and malfunction of autophagy has been implicated in multiple human diseases including cancer. ; GO: 0010508 positive regulation of autophagy
Probab=35.69 E-value=3.6e+02 Score=24.98 Aligned_cols=47 Identities=30% Similarity=0.377 Sum_probs=25.5
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhh
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEW 308 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ew 308 (351)
..+.+..|..+.+..+..-+..|.+++....+--++-+.+|..+..-
T Consensus 57 ~~~~~~~~~~~~~~r~~~l~~~i~~~~~~i~~~r~~l~~~~~~l~~~ 103 (302)
T PF10186_consen 57 EIQQLKREIEELRERLERLRERIERLRKRIEQKRERLEELRESLEQR 103 (302)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44455666666666666666666555555544444444444444443
No 106
>PF00038 Filament: Intermediate filament protein; InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups: Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C. All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=35.62 E-value=4e+02 Score=25.41 Aligned_cols=53 Identities=23% Similarity=0.281 Sum_probs=27.9
Q ss_pred HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhh
Q 018721 285 LRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRR 337 (351)
Q Consensus 285 ~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r 337 (351)
..++....+..++...++..+.++...-..|+..|+.=+-.|..|.+.+..+.
T Consensus 198 ~~l~~~~~~~~~~~~~~~~E~~~~r~~~~~l~~el~~l~~~~~~Le~~l~~le 250 (312)
T PF00038_consen 198 EELRQQSEKSSEELESAKEELKELRRQIQSLQAELESLRAKNASLERQLRELE 250 (312)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ccccccccccccccchhHhHHHHHHhhhhHhhhhhhccccchhhhhhhHHHHH
Confidence 44444444455555555555555555555555555555555555555554443
No 107
>PF05667 DUF812: Protein of unknown function (DUF812); InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=35.53 E-value=1.7e+02 Score=31.83 Aligned_cols=82 Identities=22% Similarity=0.336 Sum_probs=44.8
Q ss_pred hhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhh-------hhhhHhHHHH----------HHHhhhhhhHHHHHHH
Q 018721 256 LAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNN-------QYVEENERLR----------AILGEWSTRAAKLERA 318 (351)
Q Consensus 256 la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~-------q~~eEn~rlR----------a~l~Ews~raakle~a 318 (351)
...+++|+..+..+++.+..++......+..+...-. +...+++++- .+|.+=..--+|||.-
T Consensus 323 ~~~~~~el~~l~~~l~~l~~~i~~~~~~~~~l~~~~~q~~~e~~~~~~~~~~le~~~~l~~k~~~lL~d~e~ni~kL~~~ 402 (594)
T PF05667_consen 323 QEEQEQELEELQEQLDELESQIEELEAEIKMLKSSLKQLEEELEEKEAENEELEEELKLKKKTVELLPDAEENIAKLQAL 402 (594)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcHHHHHHHHHH
Confidence 4566778888888888888777777777633333333 3333333322 2233333444666666
Q ss_pred HHHhhhccHHHHHHHHHhh
Q 018721 319 LEVERMSNIELQKKISTRR 337 (351)
Q Consensus 319 le~er~~~~~~~~~~~~~r 337 (351)
.++..-.=.+|+.+-.+-|
T Consensus 403 v~~s~~rl~~L~~qWe~~R 421 (594)
T PF05667_consen 403 VEASEQRLVELAQQWEKHR 421 (594)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 6654444444444444333
No 108
>PF05961 Chordopox_A13L: Chordopoxvirus A13L protein; InterPro: IPR009236 This family consists of A13L proteins from the Chordopoxviruses. A13L or p8 is one of the three most abundant membrane proteins of the intracellular mature Vaccinia virus [].
Probab=35.47 E-value=60 Score=26.28 Aligned_cols=47 Identities=21% Similarity=0.215 Sum_probs=32.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCCCchhhhccCCCCCCCCCCCCcccCC
Q 018721 152 MLIEAICAASFMSVYIGYVHQYNSLNSQPDVMKSLYSPLQPSSSLEGLRYHDG 204 (351)
Q Consensus 152 l~lElv~~l~~li~YIvkV~rFNk~kp~PDVl~ee~s~~~ps~~~~ElGfrd~ 204 (351)
+.+=+|+++..+|+|-++-++=+.+.++|+- +++... -.-.+||.|.
T Consensus 6 iLi~ICVaii~lIlY~iYnr~~~~q~~~~~~--e~y~~~----~~~kT~yVd~ 52 (68)
T PF05961_consen 6 ILIIICVAIIGLILYGIYNRKKTTQNTNPST--ENYEKM----ENLKTGYVDK 52 (68)
T ss_pred HHHHHHHHHHHHHHHHHHhcccccCCCCCch--hhcCCc----cccchhHHhc
Confidence 3344566777888999999999998899987 333221 1224788773
No 109
>PRK15178 Vi polysaccharide export inner membrane protein VexD; Provisional
Probab=35.28 E-value=3.4e+02 Score=28.73 Aligned_cols=88 Identities=11% Similarity=0.139 Sum_probs=58.6
Q ss_pred hhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH-----
Q 018721 210 EQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFV----- 284 (351)
Q Consensus 210 KQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li----- 284 (351)
.+.|.++|-++-...--+|+-.-+..|..|.....-.+|+-+.+..+. -+=.|.+|+-.++.||...++-+
T Consensus 239 ar~D~v~~Ae~ev~~Ae~rl~~Ar~aL~~fRn~~gvlDP~~~a~~~~~----lI~~Le~qLa~~~aeL~~L~~~~~p~sP 314 (434)
T PRK15178 239 MQKERILWLENDVKSAQENLGAARLELLKIQHIQKDIDPKETITAIYQ----LIAGFETQLAEAKAEYAQLMVNGLDQNP 314 (434)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcChHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHhhcCCCCC
Confidence 367999999999999999999999999999988666688877654332 22234444444444444433321
Q ss_pred ------HHHHhhhhhhhhHhHHH
Q 018721 285 ------LRVRNTNNQYVEENERL 301 (351)
Q Consensus 285 ------~~~r~~n~q~~eEn~rl 301 (351)
.+|.+...|=-+|..++
T Consensus 315 qV~~l~~rI~aLe~QIa~er~kl 337 (434)
T PRK15178 315 LIPRLSAKIKVLEKQIGEQRNRL 337 (434)
T ss_pred chhHHHHHHHHHHHHHHHHHHHh
Confidence 55556666666666665
No 110
>PRK14127 cell division protein GpsB; Provisional
Probab=35.25 E-value=1e+02 Score=26.76 Aligned_cols=45 Identities=20% Similarity=0.283 Sum_probs=32.1
Q ss_pred hhhHhHHHHHHHhhhhhhHHHHHHHH------HHhhhccHHHHHHHHHhhh
Q 018721 294 YVEENERLRAILGEWSTRAAKLERAL------EVERMSNIELQKKISTRRN 338 (351)
Q Consensus 294 ~~eEn~rlRa~l~Ews~raakle~al------e~er~~~~~~~~~~~~~r~ 338 (351)
--+||.||++.|+||+.|.+..+..- ...=.+|.++-|.++.|=.
T Consensus 49 Lk~e~~~l~~~l~e~~~~~~~~~~~~~~~~~~~~~~~tn~DiLKRls~LEk 99 (109)
T PRK14127 49 LQQENARLKAQVDELTKQVSVGASSSSVATTQPSSSATNYDILKRLSNLEK 99 (109)
T ss_pred HHHHHHHHHHHHHHHHHhhccccccccccccCCCCCcchHHHHHHHHHHHH
Confidence 34678888889999999888665332 1224688998888887744
No 111
>PF10212 TTKRSYEDQ: Predicted coiled-coil domain-containing protein; InterPro: IPR019348 This entry represents a C-terminal 500 residue region, which contains a conserved TTKRSYEDQ motif. It is found in a family of coiled-coil domain-containing proteins that are conserved from nematodes to humans. These proteins also contain an N-terminal domain with a KLRAQ motif (IPR019343 from INTERPRO). The function of these proteins is not known.
Probab=35.25 E-value=3.9e+02 Score=29.03 Aligned_cols=248 Identities=19% Similarity=0.164 Sum_probs=123.9
Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHhHHHhhhHHHHHHHhhcccccchhhhhHHHH----HHHHHHHhhh--cc--c--
Q 018721 71 PPLLCSCGVILLALTGIFQQYFVYQVQKIRLQGYYSFSQKLKHIVRLPFAITAYGTA----AMLLVIVWRP--HI--S-- 140 (351)
Q Consensus 71 ~~lL~y~~v~LWlLt~l~d~yv~~qH~KlRl~GYl~FYR~Tr~lkRlPl~IvSlGNa----~LLLI~a~~~--~~--~-- 140 (351)
..++.-.-++|==++-++...-+|-++|+..- ++|=..|++++..-=-|.|.+-. ..=+..++.. +| +
T Consensus 167 s~v~~~l~~~l~~l~d~~k~~sk~y~~k~~~E--~~~pt~tqkl~t~~~c~~~sl~~l~~~~~k~a~f~~nnld~~~~~~ 244 (518)
T PF10212_consen 167 SAVFTQLAASLHKLHDVLKDLSKHYNQKASLE--HRLPTATQKLRTTNECILSSLVSLTNGTGKIAAFFSNNLDFFTSSS 244 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--hhcchhhheeeccHHHHHHHHHHHHhhhHHHHHHHhcchHHhhccc
Confidence 34666677777778888888888888888775 56667788877755444333222 1111112211 11 0
Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--CCCCCchhhhc-cCC-----CCCCCCCCCCc------------
Q 018721 141 ILSISTLLRIIMLIEAICAASFMSVYIGYVHQYNS--LNSQPDVMKSL-YSP-----LQPSSSLEGLR------------ 200 (351)
Q Consensus 141 ~Ls~~~lLriil~lElv~~l~~li~YIvkV~rFNk--~kp~PDVl~ee-~s~-----~~ps~~~~ElG------------ 200 (351)
-..|..- .++=--...+||+-|=.|-.-|=+ .+|.||-.-.+ -.. ..++.+-++++
T Consensus 245 ~y~~~~~----~~~~~~~~~~~~~~~~~~a~~~~~~~~~p~~~svpy~~a~~n~ril~sstes~e~L~qqV~qs~EKIa~ 320 (518)
T PF10212_consen 245 GYGPKGA----TTFTNPLSAECMLQYKKRAAAYMSSLKKPCPESVPYEEALANRRILLSSTESREGLAQQVQQSQEKIAK 320 (518)
T ss_pred ccCCCcc----cccCCccchHHHHHHHHHHHHHHHHhcCCCCccCChHHHHhhhHHHhhhHHhHHHHHHHHHHHHHHHHH
Confidence 0001110 000001367888866555444322 36666643221 111 11111111111
Q ss_pred -ccCCCc-hhHhhHHHHHHHHHHhH--HhhHHHHHHHHHhhhhcccCC-------C---------------------CCh
Q 018721 201 -YHDGGR-LSDEQMALLQYQRENLH--FLSEEILRLQECLSKYEQSDD-------G---------------------STP 248 (351)
Q Consensus 201 -frd~g~-LlEKQADLIrYLkdHNa--~LSkrIL~Lq~~l~kye~~~~-------g---------------------~t~ 248 (351)
..+.+- ++|+|-+=|+.-|++.. .+.+++...+.+-..=+.... + .+.
T Consensus 321 LEqEKEHw~LEaQL~kIKLEKEnkRiadLekevak~~v~~s~~e~~~l~~~~e~~se~s~~~~~e~~~~t~l~gml~~~~ 400 (518)
T PF10212_consen 321 LEQEKEHWMLEAQLAKIKLEKENKRIADLEKEVAKGQVAESSQESSVLSEASEQQSEASSQSVDEPLQPTSLSGMLTSTS 400 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhhhhhhcccccccccccccccccccccccccccccccc
Confidence 111111 88888888887777655 355555444332221111111 0 011
Q ss_pred hhhHHHhhhhhhhHHH-hhHHHHHhHHHHHHHHHHHH----HHHHhhhhh---hhhHhHHHHHHHhhhhhhHHHHHHHHH
Q 018721 249 QVDLAHLLAARDQELR-TLSAEMNQLQSELRLARSFV----LRVRNTNNQ---YVEENERLRAILGEWSTRAAKLERALE 320 (351)
Q Consensus 249 qvdl~h~la~r~qelR-a~~Ae~~q~~~el~~ar~li----~~~r~~n~q---~~eEn~rlRa~l~Ews~raakle~ale 320 (351)
..++ +=-.+||+.++ --...++++.+++..|.|=- ..-+.+..+ =.+|.+-+...|.+=..+-.+||--|+
T Consensus 401 ~~~~-~E~esRE~LIk~~Y~~RI~eLt~qlQ~adSKa~~f~~Ec~aL~~rL~~aE~ek~~l~eeL~~a~~~i~~LqDEL~ 479 (518)
T PF10212_consen 401 EQES-PEEESREQLIKSYYMSRIEELTSQLQHADSKAVHFYAECRALQKRLESAEKEKESLEEELKEANQNISRLQDELE 479 (518)
T ss_pred cccC-CchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 1121 12267788777 45677888888888777543 111221111 133444456666777777777777777
Q ss_pred Hhhhc
Q 018721 321 VERMS 325 (351)
Q Consensus 321 ~er~~ 325 (351)
.-|.+
T Consensus 480 TTr~N 484 (518)
T PF10212_consen 480 TTRRN 484 (518)
T ss_pred HHHhh
Confidence 77654
No 112
>PF01166 TSC22: TSC-22/dip/bun family; InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include: Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis. Caenorhabditis elegans hypothetical protein T18D3.7. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=35.06 E-value=36 Score=26.87 Aligned_cols=32 Identities=22% Similarity=0.359 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHH
Q 018721 273 LQSELRLARSFVLRVRNTNNQYVEENERLRAI 304 (351)
Q Consensus 273 ~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~ 304 (351)
++.|+.--|.-|..+..-|++...||..||..
T Consensus 12 VrEEVevLK~~I~eL~~~n~~Le~EN~~Lk~~ 43 (59)
T PF01166_consen 12 VREEVEVLKEQIAELEERNSQLEEENNLLKQN 43 (59)
T ss_dssp -TTSHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 34455666677888899999999999999875
No 113
>PF09971 DUF2206: Predicted membrane protein (DUF2206); InterPro: IPR018701 This family of predicted membrane proteins from archaea has no known function.
Probab=35.06 E-value=5.1e+02 Score=26.51 Aligned_cols=83 Identities=16% Similarity=0.281 Sum_probs=51.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHhhhHHHHHHHhhcccccchhhhhHHHHHHHHHHHhhhccc-cccHHHHHHHHHHH
Q 018721 76 SCGVILLALTGIFQQYFVYQVQKIRLQGYYSFSQKLKHIVRLPFAITAYGTAAMLLVIVWRPHIS-ILSISTLLRIIMLI 154 (351)
Q Consensus 76 y~~v~LWlLt~l~d~yv~~qH~KlRl~GYl~FYR~Tr~lkRlPl~IvSlGNa~LLLI~a~~~~~~-~Ls~~~lLriil~l 154 (351)
+.-+.+.++.|++-..++..+.|.+-+ .-..+.|..+.++|+.....|-|+ .+.+.++.++.+
T Consensus 121 ~l~~~~~i~IG~l~~~~~~~~~k~~~~--------------~~Yl~fs~~~~iiLia~i~lP~fa~~mn~~RLy~itl-- 184 (367)
T PF09971_consen 121 HLFIQFFIIIGFLALILKRIYKKIKFN--------------IEYLAFSLVSLIILIASIVLPFFASVMNPTRLYQITL-- 184 (367)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhh--------------HHHHHHHHHHHHHHHHHHhccchhhhcCHHHHHHHHH--
Confidence 333456777888888888888776622 344667777777777666677774 788777764333
Q ss_pred HHHHHHHHHHH-HHHHHHHhcCC
Q 018721 155 EAICAASFMSV-YIGYVHQYNSL 176 (351)
Q Consensus 155 Elv~~l~~li~-YIvkV~rFNk~ 176 (351)
+..+|++++ ++.-+.-+||-
T Consensus 185 --i~LAPf~iiG~~~~~~~i~k~ 205 (367)
T PF09971_consen 185 --IFLAPFFIIGGITLFKLINKL 205 (367)
T ss_pred --HHHHHHHHHHHHHHHHHHHHh
Confidence 445666655 44333333333
No 114
>PF07888 CALCOCO1: Calcium binding and coiled-coil domain (CALCOCO1) like; InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region [].
Probab=35.05 E-value=3.8e+02 Score=29.33 Aligned_cols=96 Identities=28% Similarity=0.338 Sum_probs=44.9
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLR 286 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~ 286 (351)
+|-.++.+++-.-+-...--++|+.....| +... .++-.-+..-+.+|+....++.++..+.+ .
T Consensus 133 vV~~ka~~lQ~qlE~~qkE~eeL~~~~~~L---e~e~------~~l~~~v~~l~~eL~~~~ee~e~L~~~~k-------e 196 (546)
T PF07888_consen 133 VVTTKAQLLQNQLEECQKEKEELLKENEQL---EEEV------EQLREEVERLEAELEQEEEEMEQLKQQQK-------E 196 (546)
T ss_pred EEehhHHHHHHHHHHHHHHHHHHHHHHHHH---HHHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------H
Confidence 333345555544444445555566666655 2221 22222222223344444444444444433 3
Q ss_pred HHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHH
Q 018721 287 VRNTNNQYVEENERLRAILGEWSTRAAKLERA 318 (351)
Q Consensus 287 ~r~~n~q~~eEn~rlRa~l~Ews~raakle~a 318 (351)
+...+..-.+|++-|..-+.+...|-..||--
T Consensus 197 l~~~~e~l~~E~~~L~~q~~e~~~ri~~LEed 228 (546)
T PF07888_consen 197 LTESSEELKEERESLKEQLAEARQRIRELEED 228 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33334444556666666666666666666633
No 115
>TIGR00870 trp transient-receptor-potential calcium channel protein. after chronic exposure to capsaicin. (McCleskey and Gold, 1999).
Probab=34.94 E-value=2.9e+02 Score=29.82 Aligned_cols=23 Identities=17% Similarity=0.050 Sum_probs=16.3
Q ss_pred chhhHHHHHHHHHHHHHHHhhhh
Q 018721 39 IFGSVVYCFVLAGYAILAAGTTW 61 (351)
Q Consensus 39 ~~g~~~y~~~L~~yA~~~~~~pw 61 (351)
-+.+.+|+++|+.|+.++..-|.
T Consensus 355 ~~~~~~~fl~l~~~~~~~~~~~~ 377 (743)
T TIGR00870 355 HSASYLYFLYLIIFTSVAYYRPT 377 (743)
T ss_pred HHHHHHHHHHHHHHHHHHhhccc
Confidence 34567777777888887777664
No 116
>KOG4403 consensus Cell surface glycoprotein STIM, contains SAM domain [General function prediction only]
Probab=34.44 E-value=2e+02 Score=30.95 Aligned_cols=69 Identities=26% Similarity=0.324 Sum_probs=47.2
Q ss_pred HHHHHHhHHhhH----------HHHHHHHHhhhhcccCCCC-ChhhhHHHhhh--hhhhHHH------hhHHHHHhHHHH
Q 018721 216 QYQRENLHFLSE----------EILRLQECLSKYEQSDDGS-TPQVDLAHLLA--ARDQELR------TLSAEMNQLQSE 276 (351)
Q Consensus 216 rYLkdHNa~LSk----------rIL~Lq~~l~kye~~~~g~-t~qvdl~h~la--~r~qelR------a~~Ae~~q~~~e 276 (351)
+|-|+|..++++ .+..||++|.|-......+ +-.+||...+. -|=.|+| ..-.|+.||+-+
T Consensus 238 k~akehv~km~kdle~Lq~aEqsl~dlQk~Lekar~e~rnvavek~~lerkl~ea~rl~elreg~e~e~~rkelE~lR~~ 317 (575)
T KOG4403|consen 238 KKAKEHVNKMMKDLEGLQRAEQSLEDLQKRLEKAREEQRNVAVEKLDLERKLDEAPRLSELREGVENETSRKELEQLRVA 317 (575)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhHHHHHhhhhhhhhhhcchhHHHHHHHHHHHHHH
Confidence 477888888887 5677888887765554444 67778877765 2222333 333577888888
Q ss_pred HHHHHHHH
Q 018721 277 LRLARSFV 284 (351)
Q Consensus 277 l~~ar~li 284 (351)
|+.|...+
T Consensus 318 L~kAEkel 325 (575)
T KOG4403|consen 318 LEKAEKEL 325 (575)
T ss_pred HHHHHHHH
Confidence 88887665
No 117
>PF15254 CCDC14: Coiled-coil domain-containing protein 14
Probab=34.23 E-value=5.3e+02 Score=29.67 Aligned_cols=114 Identities=20% Similarity=0.211 Sum_probs=69.5
Q ss_pred HHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCC-hhhhH-HHhhhhh-------------hhH-HHh----hHHHHHhHH
Q 018721 215 LQYQRENLHFLSEEILRLQECLSKYEQSDDGST-PQVDL-AHLLAAR-------------DQE-LRT----LSAEMNQLQ 274 (351)
Q Consensus 215 IrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t-~qvdl-~h~la~r-------------~qe-lRa----~~Ae~~q~~ 274 (351)
.+=||--|+.|-+++=-|..+|.+-|++...+. +..++ .-.|.+. .|| |.. +...++...
T Consensus 389 ~QplrsENaqLrRrLrilnqqlreqe~~~k~~~~~~~n~El~sLqSlN~~Lq~ql~es~k~~e~lq~kneellk~~e~q~ 468 (861)
T PF15254_consen 389 MQPLRSENAQLRRRLRILNQQLREQEKAEKTSGSQDCNLELFSLQSLNMSLQNQLQESLKSQELLQSKNEELLKVIENQK 468 (861)
T ss_pred hhhhhhhhHHHHHHHHHHHHHHHHHHhhcccCCCcccchhhHHHHHHHHHHHHHHHHHHHhHHHHHHhHHHHHHHHHHHH
Confidence 456788899999999999999998888865553 33333 1111111 111 111 222233344
Q ss_pred HHHHHHHHHH----HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHH
Q 018721 275 SELRLARSFV----LRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIE 328 (351)
Q Consensus 275 ~el~~ar~li----~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~ 328 (351)
.|=+..+.+| +.+-...+||--|++|++-.++|=-.-..-+--.||+-..-|.-
T Consensus 469 ~Enk~~~~~~~ekd~~l~~~kq~~d~e~~rik~ev~eal~~~k~~q~kLe~sekEN~i 526 (861)
T PF15254_consen 469 EENKRLRKMFQEKDQELLENKQQFDIETTRIKIEVEEALVNVKSLQFKLEASEKENQI 526 (861)
T ss_pred HHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhhhhH
Confidence 5666666666 67777777888888888777777555555455555555555543
No 118
>PF08614 ATG16: Autophagy protein 16 (ATG16); InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=34.02 E-value=2.5e+02 Score=25.65 Aligned_cols=19 Identities=21% Similarity=0.406 Sum_probs=0.0
Q ss_pred HHhhHHHHHHHHHhhhhcc
Q 018721 223 HFLSEEILRLQECLSKYEQ 241 (351)
Q Consensus 223 a~LSkrIL~Lq~~l~kye~ 241 (351)
..+..+|..++..+.+-+.
T Consensus 91 ~el~~~L~~~~~~l~~l~~ 109 (194)
T PF08614_consen 91 GELAQQLVELNDELQELEK 109 (194)
T ss_dssp -------------------
T ss_pred ccccccccccccccchhhh
Confidence 4455566666666644444
No 119
>PF14661 HAUS6_N: HAUS augmin-like complex subunit 6 N-terminus
Probab=33.81 E-value=4.2e+02 Score=25.16 Aligned_cols=35 Identities=26% Similarity=0.279 Sum_probs=28.6
Q ss_pred HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHH
Q 018721 285 LRVRNTNNQYVEENERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 285 ~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~al 319 (351)
++--...++|.+.++-|+....+.+.|.+.++..+
T Consensus 174 ~~~~~~~~~~~~~aq~L~~k~r~l~~~~~~~~~~~ 208 (247)
T PF14661_consen 174 QEKDAARQKYQEFAQLLRKKYRELSAECAELQAQL 208 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 33334445699999999999999999999999988
No 120
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=32.93 E-value=6.2e+02 Score=31.86 Aligned_cols=67 Identities=21% Similarity=0.276 Sum_probs=45.2
Q ss_pred hHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 272 QLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 272 q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
.++.+++-....++++-+-++.--.+-.-+-..+.|=+.|-+.|+--||.||-+..++.|+.+.+..
T Consensus 1066 el~~~l~kke~El~~l~~k~e~e~~~~~~l~k~i~eL~~~i~el~e~le~er~~r~K~ek~r~dL~~ 1132 (1930)
T KOG0161|consen 1066 ELDNQLKKKESELSQLQSKLEDEQAEVAQLQKQIKELEARIKELEEELEAERASRAKAERQRRDLSE 1132 (1930)
T ss_pred HHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3334444444444444444444444445556677888999999999999999999888888877765
No 121
>PF04094 DUF390: Protein of unknown function (DUF390); InterPro: IPR007228 This domain is found in a family of long proteins that are currently found only in rice. They have no known function. However they may be some kind of transposable element. There is a putative gypsy type transposon domain (IPR007321 from INTERPRO) towards the N terminus of the proteins.
Probab=32.79 E-value=2.4e+02 Score=32.08 Aligned_cols=89 Identities=31% Similarity=0.345 Sum_probs=63.4
Q ss_pred HHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH-HHHHhhhh-----hhhhHhHHHH-HHHhhhhhh---HHHHHHHHHHh
Q 018721 253 AHLLAARDQELRTLSAEMNQLQSELRLARSFV-LRVRNTNN-----QYVEENERLR-AILGEWSTR---AAKLERALEVE 322 (351)
Q Consensus 253 ~h~la~r~qelRa~~Ae~~q~~~el~~ar~li-~~~r~~n~-----q~~eEn~rlR-a~l~Ews~r---aakle~ale~e 322 (351)
++.++.||+.|.++.+......+.||+=-.-+ .|=+++|. +.-.|+=+|| ..-.+|..| .|+-|||--+.
T Consensus 536 e~a~a~Re~TLAahEaa~AE~E~aLRLREeA~aER~~~~~~aEaaa~Rlae~L~lREeA~~~~~~r~le~araeraa~~~ 615 (828)
T PF04094_consen 536 ERAAAQREATLAAHEAAAAEEESALRLREEALAERDRALNRAEAAAQRLAEQLALREEAVEERERRHLESARAERAAMAA 615 (828)
T ss_pred HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccHHHHHHHHHhhhhhhHHHHHHHHH
Confidence 77888888888888777666667776644444 33333433 5556777777 445566555 56788988899
Q ss_pred hhccHHHHHHHHHhhhcCC
Q 018721 323 RMSNIELQKKISTRRNQHG 341 (351)
Q Consensus 323 r~~~~~~~~~~~~~r~~~~ 341 (351)
|.+.+|-|.|-..-|.+..
T Consensus 616 ra~eleArekel~a~~~~g 634 (828)
T PF04094_consen 616 RASELEAREKELAARGQSG 634 (828)
T ss_pred HHHHHHHHHHhhccccccC
Confidence 9999999999888888863
No 122
>PF03653 UPF0093: Uncharacterised protein family (UPF0093); InterPro: IPR005265 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are predicted to be integral membrane proteins (with several transmembrane segments). There is some indirect indication of a link between this protein and the function or assembly of cytochromes: in Escherichia coli, strains overproducing this protein turn pink, perhaps because of an excess of accumulated haems [].
Probab=32.77 E-value=3.5e+02 Score=23.91 Aligned_cols=93 Identities=14% Similarity=0.176 Sum_probs=50.9
Q ss_pred HHHHHHHHHHHH-HHHHHHHhHHHh--hhHHHHHHHhhccc---ccchhhhhHHHHHHHHHHHhh---hccccccHHHHH
Q 018721 78 GVILLALTGIFQ-QYFVYQVQKIRL--QGYYSFSQKLKHIV---RLPFAITAYGTAAMLLVIVWR---PHISILSISTLL 148 (351)
Q Consensus 78 ~v~LWlLt~l~d-~yv~~qH~KlRl--~GYl~FYR~Tr~lk---RlPl~IvSlGNa~LLLI~a~~---~~~~~Ls~~~lL 148 (351)
-++.|+...+.- +.+.+|.+..-. ..+-.|-..++++- -.|-++.+.+...++++.... ...+|+.....+
T Consensus 15 avi~W~aGl~yLprl~v~~~~~~~~~~~~~~~l~~m~rrl~~~I~~PAmilt~~~G~~ll~~~~~~~~~~~~Wl~vKL~~ 94 (147)
T PF03653_consen 15 AVISWMAGLFYLPRLFVYHAEASDGSEEFYERLAIMERRLYRIIMTPAMILTWISGLLLLFLNPGLGWFSSPWLHVKLVL 94 (147)
T ss_pred HHHHHHHHHHHHHHHHHHHHhCcCCCccHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhhhhhhcCcHHHHHHHH
Confidence 356677666643 444444333221 23555544444433 459999999888666555432 234788876554
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Q 018721 149 RIIMLIEAICAASFMSVYIGYVHQYNSLN 177 (351)
Q Consensus 149 riil~lElv~~l~~li~YIvkV~rFNk~k 177 (351)
+++ ...+.+..-...+||++..
T Consensus 95 -V~~------L~~~H~~~~~~~k~~~~~~ 116 (147)
T PF03653_consen 95 -VLL------LVAYHGWCGRYLKRLARGE 116 (147)
T ss_pred -HHH------HHHHHHHHHHHHHHHhCCC
Confidence 222 3334444455677787744
No 123
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=32.30 E-value=9e+02 Score=28.54 Aligned_cols=63 Identities=10% Similarity=0.141 Sum_probs=33.6
Q ss_pred hhHHHHHHHHHhhhhcccCCCCCh---hhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHH
Q 018721 225 LSEEILRLQECLSKYEQSDDGSTP---QVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRV 287 (351)
Q Consensus 225 LSkrIL~Lq~~l~kye~~~~g~t~---qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~ 287 (351)
+.+++-.++.++..-+....++++ -.++..-+.+-+.+++++.++.+.++.+.......|.++
T Consensus 797 ~~~ei~~l~~qie~l~~~l~~~~~~~s~~ele~ei~~~~~el~~l~~~~e~l~~e~e~~~~eI~~L 862 (1311)
T TIGR00606 797 FQMELKDVERKIAQQAAKLQGSDLDRTVQQVNQEKQEKQHELDTVVSKIELNRKLIQDQQEQIQHL 862 (1311)
T ss_pred HHHHHHHHHHHHHHHHHHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345666666666553333222222 234555566666677777555555566655555555333
No 124
>KOG0976 consensus Rho/Rac1-interacting serine/threonine kinase Citron [Signal transduction mechanisms]
Probab=32.29 E-value=1.4e+02 Score=34.37 Aligned_cols=27 Identities=30% Similarity=0.391 Sum_probs=17.3
Q ss_pred HHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 281 RSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 281 r~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
.+++..+|..-..-.|||.+||.-+.+
T Consensus 557 q~lL~evrq~q~k~leenv~lRkgma~ 583 (1265)
T KOG0976|consen 557 QSLLAEVRQRQKKSLEENVFLRKGMAR 583 (1265)
T ss_pred HHHhhchhhhhhhccChHHHHHHHHHH
Confidence 333344444444667999999987764
No 125
>KOG4637 consensus Adaptor for phosphoinositide 3-kinase [Signal transduction mechanisms]
Probab=32.25 E-value=4.9e+02 Score=27.69 Aligned_cols=100 Identities=18% Similarity=0.219 Sum_probs=59.4
Q ss_pred CCCcccCCC---chhHhhHHHHHHHH-----HHhHHhhHHHHHHHHHhhhhcccCCCCChhhhH---HHh-------hhh
Q 018721 197 EGLRYHDGG---RLSDEQMALLQYQR-----ENLHFLSEEILRLQECLSKYEQSDDGSTPQVDL---AHL-------LAA 258 (351)
Q Consensus 197 ~ElGfrd~g---~LlEKQADLIrYLk-----dHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl---~h~-------la~ 258 (351)
+--||-|+- |+|| ||.|-| +.|..|--+ |.-=+++|..++.|.. ..|+ ..- |..
T Consensus 79 gKyGF~d~ltf~SVVe----lIn~yr~~SL~~yN~~LDvr---LlyPVs~~r~dq~~kp-~~~~~~~~~~~~~~~q~lq~ 150 (464)
T KOG4637|consen 79 GKYGFSDPLTFNSVVE----LINHYRNESLAQYNPKLDVR---LLYPVSRYRQDQNVKP-DFNINAVGKKLREYHQQLQE 150 (464)
T ss_pred CccCCCCchhhHHHHH----HHHHHhhhHHHhhCccccee---eechHHHhhhccccCC-CcchhhhhHHHHHHHHHHHH
Confidence 344566543 2666 888855 345555544 4456889998877653 3333 222 222
Q ss_pred hhhHHHhhHHHHHhHHHHHHHHHHHH-----------HHHHhhhhhhhhHhHHHHHH
Q 018721 259 RDQELRTLSAEMNQLQSELRLARSFV-----------LRVRNTNNQYVEENERLRAI 304 (351)
Q Consensus 259 r~qelRa~~Ae~~q~~~el~~ar~li-----------~~~r~~n~q~~eEn~rlRa~ 304 (351)
-++|+--+..|-+-.++|+..+|+.| -++...|+---++.+|.|.+
T Consensus 151 ~~~~~er~~~~y~~~~qElq~k~t~~~afn~tikife~q~~~~e~~~ka~~d~~~~e 207 (464)
T KOG4637|consen 151 KSLEYERLYEEYTRTSQELQMKRTAIEAFNETIKIFEEQCGTQENLSKAYIDRFRRE 207 (464)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 23344455566667788899999887 34445555455566777766
No 126
>PF05557 MAD: Mitotic checkpoint protein; InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=31.42 E-value=16 Score=39.65 Aligned_cols=90 Identities=30% Similarity=0.367 Sum_probs=0.0
Q ss_pred HHHHHHHHhhhhcccCCC-----CChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHH
Q 018721 228 EILRLQECLSKYEQSDDG-----STPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLR 302 (351)
Q Consensus 228 rIL~Lq~~l~kye~~~~g-----~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlR 302 (351)
+...|...+..+..-... +|| .|++..+..=.++-=.+..+.+.++++++.-...|+.+....+++-.+++-++
T Consensus 313 e~~~Le~el~sW~sl~~~~~~~~~sP-e~l~~~l~~lq~~~~~L~ek~g~~~~~~~~l~~~~~~Le~e~~~l~~~~~~l~ 391 (722)
T PF05557_consen 313 ENEKLEDELNSWESLLQDIGLEFDSP-EDLARALVQLQQENASLTEKLGSLQSELRELEEEIQELEQEKEQLLKEIEELE 391 (722)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHHHHHHhcCCCCCCCH-HHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344444445444443221 234 46666665544444477777778888888777777777777777777777777
Q ss_pred HHHhhhhhhHHHHHHH
Q 018721 303 AILGEWSTRAAKLERA 318 (351)
Q Consensus 303 a~l~Ews~raakle~a 318 (351)
+.+..=..-.++|||.
T Consensus 392 ~~~~~~~~~~~RLerq 407 (722)
T PF05557_consen 392 ASLEALKKLIRRLERQ 407 (722)
T ss_dssp ----------------
T ss_pred HHHHHHHHHHHHHHHH
Confidence 7666544444444443
No 127
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=31.26 E-value=5.6e+02 Score=30.17 Aligned_cols=35 Identities=11% Similarity=0.145 Sum_probs=18.1
Q ss_pred hHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhccc
Q 018721 208 SDEQMALLQYQRENLHFLSEEILRLQECLSKYEQS 242 (351)
Q Consensus 208 lEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~ 242 (351)
.+.+-..+...|++-..+-.+|-.++..+..+...
T Consensus 205 ~~~~~~~~~~~~~~~~~ir~~l~~~q~kie~~~~~ 239 (1311)
T TIGR00606 205 HQMELKYLKQYKEKACEIRDQITSKEAQLESSREI 239 (1311)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33344444445555555555666666655444443
No 128
>PRK02224 chromosome segregation protein; Provisional
Probab=31.20 E-value=7.5e+02 Score=27.25 Aligned_cols=76 Identities=21% Similarity=0.220 Sum_probs=34.7
Q ss_pred hhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHH-------HHHHHHHHhhhccHHHHHHHHHhh
Q 018721 265 TLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAA-------KLERALEVERMSNIELQKKISTRR 337 (351)
Q Consensus 265 a~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raa-------kle~ale~er~~~~~~~~~~~~~r 337 (351)
.+......++.++...+..+..+......+.++.+.+..-+.+...+++ .++..++..+-+--.+++++.+++
T Consensus 318 ~l~~k~~el~~~l~~~~~~l~~~~~~~e~~~~~~~~le~~~~~l~~~~~~l~~~~~~~~~~l~~~~~~l~~l~~el~el~ 397 (880)
T PRK02224 318 ELEDRDEELRDRLEECRVAAQAHNEEAESLREDADDLEERAEELREEAAELESELEEAREAVEDRREEIEELEEEIEELR 397 (880)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344444444444444444444433344444555555555555554444 334444444434344555555544
Q ss_pred hcC
Q 018721 338 NQH 340 (351)
Q Consensus 338 ~~~ 340 (351)
.+.
T Consensus 398 ~~l 400 (880)
T PRK02224 398 ERF 400 (880)
T ss_pred HHH
Confidence 433
No 129
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=31.10 E-value=2.1e+02 Score=22.16 Aligned_cols=38 Identities=13% Similarity=0.168 Sum_probs=21.2
Q ss_pred HHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHH
Q 018721 281 RSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 281 r~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~al 319 (351)
...++.+.....+--+||++|+.....|+. -.++|..-
T Consensus 30 ~~~~~~~~~~~~~l~~en~~L~~ei~~l~~-~~rIe~~A 67 (85)
T TIGR02209 30 NNELQKLQLEIDKLQKEWRDLQLEVAELSR-HERIEKIA 67 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHcC-HHHHHHHH
Confidence 333344444444556677888877777775 33344433
No 130
>cd08318 Death_NMPP84 Death domain of Nuclear Matrix Protein P84. Death domain (DD) found in the Nuclear Matrix Protein P84 (also known as HPR1 or THOC1). HPR1/p84 resides in the nuclear matrix and is part of the THO complex, also called TREX (transcription/export) complex, which functions in mRNP biogenesis at the interface between transcription and export of mRNA from the nucleus. Mice lacking THOC1 have abnormal testis development and are sterile. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into signaling complexes.
Probab=30.60 E-value=94 Score=25.06 Aligned_cols=39 Identities=15% Similarity=0.303 Sum_probs=27.8
Q ss_pred HHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHH------HHHHHHHH
Q 018721 280 ARSFVLRVRNTNNQYVEENERLRAILGEWSTRAA------KLERALEV 321 (351)
Q Consensus 280 ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raa------kle~ale~ 321 (351)
....|.+++..|+ .-.++-+++|..|..|.. +|..||..
T Consensus 31 s~~dI~~i~~~~~---~~~eq~~~mL~~W~~r~g~~AT~~~L~~aL~~ 75 (86)
T cd08318 31 KDKEIRAIESDSE---DIKMQAKQLLVAWQDREGSQATPETLITALNA 75 (86)
T ss_pred CHHHHHHHHhcCC---CHHHHHHHHHHHHHHhcCccccHHHHHHHHHH
Confidence 3334677775443 347889999999999854 67777765
No 131
>PRK00888 ftsB cell division protein FtsB; Reviewed
Probab=30.53 E-value=1.2e+02 Score=25.57 Aligned_cols=17 Identities=18% Similarity=0.220 Sum_probs=8.5
Q ss_pred hhhHhHHHHHHHhhhhh
Q 018721 294 YVEENERLRAILGEWST 310 (351)
Q Consensus 294 ~~eEn~rlRa~l~Ews~ 310 (351)
-..||++|++-+..|..
T Consensus 46 l~~~n~~L~~eI~~L~~ 62 (105)
T PRK00888 46 LKARNDQLFAEIDDLKG 62 (105)
T ss_pred HHHHHHHHHHHHHHhhC
Confidence 34455555555555544
No 132
>PF12128 DUF3584: Protein of unknown function (DUF3584); InterPro: IPR021979 This family consist of uncharacterised bacterial proteins.
Probab=30.37 E-value=9.5e+02 Score=28.19 Aligned_cols=111 Identities=22% Similarity=0.313 Sum_probs=55.1
Q ss_pred hhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhh--HHHhhhhhhhHHH-------hhHHHHHhHHHHHHHH
Q 018721 210 EQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVD--LAHLLAARDQELR-------TLSAEMNQLQSELRLA 280 (351)
Q Consensus 210 KQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvd--l~h~la~r~qelR-------a~~Ae~~q~~~el~~a 280 (351)
.+++-+.-.-..+..+-+++-.|...+...+......-|+.. +.+....+++..+ .+..+.+.+.+++...
T Consensus 246 ~~~~~l~~~~~~l~~~~~~L~~l~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~e~~~~~~~~~~~~~~~l~~~ 325 (1201)
T PF12128_consen 246 PEFDKLQQQYRQLQALEQQLCHLHAELNADEQQLEQEQPELKEELNELNEELEKLEDEIKELRDELNKELSALNADLARI 325 (1201)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333444444444444445555555666666665555555542 2444444433222 4455555556666666
Q ss_pred HHHHHHHHhhhhhh----hhHhHHHHHHHhhhhhhHHHHHHHHH
Q 018721 281 RSFVLRVRNTNNQY----VEENERLRAILGEWSTRAAKLERALE 320 (351)
Q Consensus 281 r~li~~~r~~n~q~----~eEn~rlRa~l~Ews~raakle~ale 320 (351)
.+.|..+..--..| +++-...=.-+++|..+...++..+.
T Consensus 326 ~~~L~~i~~~~~~ye~~~i~~~~~~~~~l~~~~~~~~~l~~~~~ 369 (1201)
T PF12128_consen 326 KSELDEIEQQKKDYEDADIEQLIARVDQLPEWRNELENLQEQLD 369 (1201)
T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 66664444444434 33334444555666666555555443
No 133
>PF04799 Fzo_mitofusin: fzo-like conserved region; InterPro: IPR006884 This entry represents the heptad repeat domain which is conserved at the C terminus of Fzo/mitofusion family of GTPases. Fzo is a mediator of mitochondrial fusion during spermatogenesis []. This conserved region is also found in the human mitofusin protein []. This domain forms a dimeric antiparallel coiled coil structure, which has been proposed to act as a mitochodrial tether before vesicle fusion [].; GO: 0003924 GTPase activity, 0006184 GTP catabolic process, 0008053 mitochondrial fusion, 0005741 mitochondrial outer membrane, 0016021 integral to membrane; PDB: 1T3J_A.
Probab=29.95 E-value=1.4e+02 Score=27.99 Aligned_cols=13 Identities=38% Similarity=0.409 Sum_probs=7.4
Q ss_pred hhhhHHHHHHHHH
Q 018721 308 WSTRAAKLERALE 320 (351)
Q Consensus 308 ws~raakle~ale 320 (351)
+..+|.-||..||
T Consensus 149 LrnKa~~L~~eL~ 161 (171)
T PF04799_consen 149 LRNKANWLESELE 161 (171)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH
Confidence 4455666666655
No 134
>COG1988 Predicted membrane-bound metal-dependent hydrolases [General function prediction only]
Probab=29.92 E-value=97 Score=28.44 Aligned_cols=33 Identities=15% Similarity=0.095 Sum_probs=17.6
Q ss_pred ccc-CCCCcccchhhHHHHHHHHHHHHHHHhhhh
Q 018721 29 PLL-GHRKSHSIFGSVVYCFVLAGYAILAAGTTW 61 (351)
Q Consensus 29 p~~-~~r~~~~~~g~~~y~~~L~~yA~~~~~~pw 61 (351)
+++ |||+..--+.+......++++.......|+
T Consensus 59 ~i~~gHRGfTHSl~~~~~~~~l~~~~~~~~~~~~ 92 (190)
T COG1988 59 YIRLGHRGFTHSLLFAAVALLLLGLLLFLFFLPL 92 (190)
T ss_pred HHHhcccceeehhHHHHHHHHHHHHHHHHhcCcc
Confidence 344 999987644444433344444444444444
No 135
>COG5102 SFT2 Membrane protein involved in ER to Golgi transport [Intracellular trafficking and secretion]
Probab=29.87 E-value=5e+02 Score=24.81 Aligned_cols=36 Identities=25% Similarity=0.481 Sum_probs=28.1
Q ss_pred hhhHHHHHHHhhcccccchhhhhHHHHHHHHHHHhh
Q 018721 101 LQGYYSFSQKLKHIVRLPFAITAYGTAAMLLVIVWR 136 (351)
Q Consensus 101 l~GYl~FYR~Tr~lkRlPl~IvSlGNa~LLLI~a~~ 136 (351)
++|+..+.+.+---+|+|+-.-=.|+..+-++.++.
T Consensus 117 l~Gf~ayl~~Lts~erlp~s~~ff~t~l~Tiy~~~k 152 (201)
T COG5102 117 LLGFRAYLEGLTSKERLPHSSWFFGTTLLTIYVVLK 152 (201)
T ss_pred HHhHHHHHHhhhhhhccchhHHHHHHHHHHHHHHHH
Confidence 468888888888889999877777777777766654
No 136
>PF07760 DUF1616: Protein of unknown function (DUF1616); InterPro: IPR011674 This is a group of sequences from hypothetical archaeal proteins. The region in question is approximately 330 amino acid residues long.
Probab=29.81 E-value=1.7e+02 Score=28.46 Aligned_cols=49 Identities=8% Similarity=0.121 Sum_probs=23.8
Q ss_pred hhhhHHHHHHHHHHH-hhhcc-c-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 018721 120 AITAYGTAAMLLVIV-WRPHI-S-ILSISTLLRIIMLIEAICAASFMSVYIGYVH 171 (351)
Q Consensus 120 ~IvSlGNa~LLLI~a-~~~~~-~-~Ls~~~lLriil~lElv~~l~~li~YIvkV~ 171 (351)
...|.|-++.++.+. +..++ . .++... +++++-.+..+.|++.|+-|-+
T Consensus 59 ~~ls~glSi~~~~~~g~~l~~~~~~i~~~~---i~~~l~~~t~~~~~~a~~rr~~ 110 (287)
T PF07760_consen 59 LALSVGLSIAIVPLIGLLLNYTPWGIRLIP---ILISLSIFTLVLSIIAYIRRRR 110 (287)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCcchhH---HHHHHHHHHHHHHHHHHHhccc
Confidence 367777766444443 33333 2 233332 3345555555666655554433
No 137
>PF09323 DUF1980: Domain of unknown function (DUF1980); InterPro: IPR015402 Members of this occur in gene pairs with members of PF03773 from PFAM. The N-terminal region contains several predicted transmembrane helix regions while the few invariant residues (G, CxxD, and W) occur in the C-terminal region. Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function has not, as yet, been defined.
Probab=29.61 E-value=1.4e+02 Score=27.03 Aligned_cols=44 Identities=32% Similarity=0.464 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHHhhhhhhccccccch---hHHHHHHHHHHHHHHHHHH
Q 018721 46 CFVLAGYAILAAGTTWIFHPIHYLIP---PLLCSCGVILLALTGIFQQ 90 (351)
Q Consensus 46 ~~~L~~yA~~~~~~pw~~~~~~~~~~---~lL~y~~v~LWlLt~l~d~ 90 (351)
+++|+||+++.+..-+-=......+| |++..+-++|+++ .+++-
T Consensus 4 ~liL~~~~~l~~~l~~sG~i~~YI~P~~~~~~~~a~i~l~il-ai~q~ 50 (182)
T PF09323_consen 4 FLILLGFGILLFYLILSGKILLYIHPRYIPLLYFAAILLLIL-AIVQL 50 (182)
T ss_pred HHHHHHHHHHHHHHHHhCcHHHHhCccHHHHHHHHHHHHHHH-HHHHH
Confidence 67889999998887553333344444 3444344444433 34443
No 138
>KOG3402 consensus Predicted membrane protein [Function unknown]
Probab=29.52 E-value=36 Score=29.15 Aligned_cols=32 Identities=25% Similarity=0.512 Sum_probs=25.5
Q ss_pred hhccCcccCCCC------cccchhhHHHHHHHHHHHHH
Q 018721 24 ILHEAPLLGHRK------SHSIFGSVVYCFVLAGYAIL 55 (351)
Q Consensus 24 ~~~e~p~~~~r~------~~~~~g~~~y~~~L~~yA~~ 55 (351)
.+--.|-+.||. -||++|..++.|+|-+|+..
T Consensus 40 ~af~~pa~~~r~QIr~YVvrSavGf~fw~ivLsaW~~~ 77 (101)
T KOG3402|consen 40 VAFHSPAFPHRRQIRNYVVRSAVGFSFWTIVLSAWALT 77 (101)
T ss_pred HHHcCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344568888875 38999999999999999864
No 139
>PF07106 TBPIP: Tat binding protein 1(TBP-1)-interacting protein (TBPIP); InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=29.43 E-value=2.2e+02 Score=25.26 Aligned_cols=17 Identities=29% Similarity=0.382 Sum_probs=12.1
Q ss_pred hhHHHHHHHHHHhHHhh
Q 018721 210 EQMALLQYQRENLHFLS 226 (351)
Q Consensus 210 KQADLIrYLkdHNa~LS 226 (351)
-...+++||++.|+-.|
T Consensus 2 Ae~~Il~y~~~qNRPys 18 (169)
T PF07106_consen 2 AEDAILEYMKEQNRPYS 18 (169)
T ss_pred hHHHHHHHHHHcCCCCc
Confidence 34667788888887555
No 140
>PRK15396 murein lipoprotein; Provisional
Probab=29.37 E-value=1.9e+02 Score=23.78 Aligned_cols=47 Identities=28% Similarity=0.446 Sum_probs=0.0
Q ss_pred CChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHH
Q 018721 246 STPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERL 301 (351)
Q Consensus 246 ~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rl 301 (351)
+||+|| .=..++-+|.++.+|+.++...+|+-++. ..+-=.--|+||
T Consensus 23 s~~kvd------~LssqV~~L~~kvdql~~dv~~~~~~~~~---a~~eA~raN~Rl 69 (78)
T PRK15396 23 SNAKID------QLSSDVQTLNAKVDQLSNDVNAMRSDVQA---AKDDAARANQRL 69 (78)
T ss_pred CchhHH------HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHH
No 141
>PRK04863 mukB cell division protein MukB; Provisional
Probab=29.31 E-value=8.7e+02 Score=29.69 Aligned_cols=35 Identities=23% Similarity=0.227 Sum_probs=22.2
Q ss_pred hhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhccc
Q 018721 207 LSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQS 242 (351)
Q Consensus 207 LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~ 242 (351)
+-+++..+.+. .+-...|.+++=.|+.+..+++.-
T Consensus 302 Le~tE~nL~rI-~diL~ELe~rL~kLEkQaEkA~ky 336 (1486)
T PRK04863 302 LAAEQYRLVEM-ARELAELNEAESDLEQDYQAASDH 336 (1486)
T ss_pred HHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555555554 445577777777777777766654
No 142
>PF05392 COX7B: Cytochrome C oxidase chain VIIB; InterPro: IPR008433 Cytochrome oxidase subunit VIIB is one of the nuclear-coded polypeptide chains of cytochrome c oxidase, the terminal oxidase in mitochondrial electron transport. The X-ray structure of azide-bound fully oxidized cytochrome c oxidase from bovine heart at 2.9 A resolution has been determined [].; GO: 0004129 cytochrome-c oxidase activity, 0005746 mitochondrial respiratory chain; PDB: 3AG2_X 3ASO_K 3ABL_X 1V55_K 1OCR_K 2DYS_X 1OCO_X 2EIK_X 3AG1_K 2Y69_X ....
Probab=29.23 E-value=47 Score=27.59 Aligned_cols=33 Identities=18% Similarity=0.561 Sum_probs=22.3
Q ss_pred chhhHHHHHHHHHHHHHHHhhhhhhccccccch
Q 018721 39 IFGSVVYCFVLAGYAILAAGTTWIFHPIHYLIP 71 (351)
Q Consensus 39 ~~g~~~y~~~L~~yA~~~~~~pw~~~~~~~~~~ 71 (351)
++||+.||+..-+|...-.+.-|-+.|+-...|
T Consensus 42 L~~Ga~FC~~~W~y~~TQ~GIeWNlSPVGRVtP 74 (80)
T PF05392_consen 42 LASGATFCVAVWTYVATQIGIEWNLSPVGRVTP 74 (80)
T ss_dssp HHHHHHHHHHHHHHHHHHSS------STTTS--
T ss_pred eecccchhhhhHhhhheecceeecCCcccccCc
Confidence 679999999999999999999999999865554
No 143
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=29.22 E-value=77 Score=30.98 Aligned_cols=15 Identities=33% Similarity=0.133 Sum_probs=11.1
Q ss_pred hhhHhHHHHHHHhhh
Q 018721 294 YVEENERLRAILGEW 308 (351)
Q Consensus 294 ~~eEn~rlRa~l~Ew 308 (351)
-.+||++||+-+.+-
T Consensus 71 l~~EN~~Lr~e~~~l 85 (283)
T TIGR00219 71 LEYENYKLRQELLKK 85 (283)
T ss_pred HHHHHHHHHHHHHHH
Confidence 357899999876654
No 144
>PF08687 ASD2: Apx/Shroom domain ASD2; InterPro: IPR014799 Cell shape changes require the coordination of actin and microtubule cytoskeletons. The Shroom family is a small group of related proteins that are defined by sequence similarity and in most cases by some link to the actin cytoskeleton. The Shroom (Shrm) protein family is found only in animals. Proteins of this family are predicted to be utilised in multiple morphogenic and developmental processes across animal phyla to regulate cells shape or intracellular architecture in an actin and myosin-dependent manner []. While the founding member of the Shrm family is Shrm1 (formerly Apx), it appears that this protein is found only in Xenopus []. In mice and humans, the Shrm family of proteins consists of: Shrm2 (formerly Apxl), a protein involved in the morphogenesis, maintenance, and/or function of vascular endothelial cells. Shrm3 (formerly Shroom), a protein necessary for neural tube closure in vertebrate development as deficiency in Shrm results in spina bifida. Shrm3 is also conserved in some invertebrates, as orthologues can be found in sea urchins. Shrm4, a regulator of cyto-skeletal architecture that may play an important role in vertebrate development. It is implicated in X-linked mental retardation in humans. This protein family is based on the conservation of a specific arrangement of an N-terminal PDZ domain, a centrally positioned sequence motif termed ASD1 (Apx/Shrm Domain 1) and a C-terminal motif termed ASD2 [, , ]. Shrm2 and Shrm3 contain all three domains, while Shrm4 contains the PDZ and ASD2 domains, but lacks a discernible ASD1 element. To date, the ASD1 and ASD2 elements have only been found in Shrm-related proteins and do not appear in combination with other conserved domains. ASD1 is required for targeting actin, while ASD2 is capable of eliciting an actomyosin based constriction event [, ]. ASD2 is the most highly conserved sequence element shared by Shrm1, Shrm2, Shrm3, and Shrm4. It possesses a well conserved series of leucine residues that exhibit spacing consistent with that of a leucine zipper motif []. Shroom2 is both necessary and sufficient to govern the localization of pigment granules at the apical surface of epithelial cells. Shroom2 is a central regulator of RPE pigmentation. Despite their diverse biological roles, Shroom family proteins share a common activity. Since the locus encoding human SHROOM2 lies within the critical region for two distinct forms of ocular albinism, it is possible that SHROOM2 mutations may contribute to human visual system disorders [].; GO: 0000902 cell morphogenesis, 0005737 cytoplasm; PDB: 3THF_B.
Probab=29.22 E-value=4.9e+02 Score=25.86 Aligned_cols=67 Identities=25% Similarity=0.370 Sum_probs=50.5
Q ss_pred hhhhhhhHHHhhHHHHHhHHHHHHHHHHHH-------HHHHhhh-----hhhhhHhHHHHHHHhhhhhhHHHHHHHHHH
Q 018721 255 LLAARDQELRTLSAEMNQLQSELRLARSFV-------LRVRNTN-----NQYVEENERLRAILGEWSTRAAKLERALEV 321 (351)
Q Consensus 255 ~la~r~qelRa~~Ae~~q~~~el~~ar~li-------~~~r~~n-----~q~~eEn~rlRa~l~Ews~raakle~ale~ 321 (351)
|..+=.+.+..+..|.+.++.|.+.-.++= +++=.-| +-||.+.++.=--|=-+|+|-|+.|.+|..
T Consensus 94 Li~~l~~kl~~L~~eqe~l~ee~~~n~~lG~~ve~~v~~~c~p~E~~Ky~~fi~Dl~kv~~LLLsLs~RLaRve~aL~~ 172 (264)
T PF08687_consen 94 LIESLSKKLEVLQEEQEALQEEIQANEALGAEVEALVQEVCKPNEFEKYRMFIGDLEKVVNLLLSLSGRLARVENALSS 172 (264)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 344446678888999999999988776662 2221122 268999999999999999999999999964
No 145
>PF07407 Seadorna_VP6: Seadornavirus VP6 protein; InterPro: IPR009982 This family consists of several VP6 proteins from the Banna virus as well as a related protein VP5 from the Kadipiro virus. Members of this family are typically of around 420 residues in length. The function of this family is unknown.
Probab=29.18 E-value=74 Score=32.94 Aligned_cols=30 Identities=23% Similarity=0.247 Sum_probs=14.3
Q ss_pred hHhHHHHHHHhhhhhhHHHHHHHHHHhhh-ccHHH
Q 018721 296 EENERLRAILGEWSTRAAKLERALEVERM-SNIEL 329 (351)
Q Consensus 296 eEn~rlRa~l~Ews~raakle~ale~er~-~~~~~ 329 (351)
+||++|++.-++-+.--++| |+|++ |++..
T Consensus 39 ~EN~~LKkEN~~Lk~eVerL----E~e~l~s~V~E 69 (420)
T PF07407_consen 39 MENHSLKKENNDLKIEVERL----ENEMLRSHVCE 69 (420)
T ss_pred HHhHHHHHHHHHHHHHHHHH----HHHhhhhhhhh
Confidence 45555555555554433333 55555 44433
No 146
>PF10174 Cast: RIM-binding protein of the cytomatrix active zone; InterPro: IPR019323 This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains [].
Probab=28.82 E-value=2.6e+02 Score=31.60 Aligned_cols=67 Identities=25% Similarity=0.268 Sum_probs=53.5
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHHHHHh-hhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHH
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVLRVRN-TNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIE 328 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~~~r~-~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~ 328 (351)
.|+-+.+|.+.|+.|+..+.+-+..... .+.....|-+|-|+.-.|++++...+..-|..--..|.-
T Consensus 4 ql~~~q~E~e~L~~ele~~~~~l~~~~~~i~~fwspElkrer~~rkee~a~l~~~k~qlr~~q~e~q~ 71 (775)
T PF10174_consen 4 QLERLQRENERLRRELERKQSKLGSSMNSIKTFWSPELKRERALRKEEAAELSRLKEQLRVTQEENQK 71 (775)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHhHhcccchhhHHHHHHHHHHHHHHHhHHHHHHHHHhhHHH
Confidence 5778899999999999999999954444 455689999999999999999999998888554444433
No 147
>PF12896 Apc4: Anaphase-promoting complex, cyclosome, subunit 4; InterPro: IPR024790 Apc4 is one of the larger of the subunits of the anaphase-promoting complex (APC) or cyclosome. The anaphase-promoting complex is a multiprotein subunit E3 ubiquitin ligase complex that controls segregation of chromosomes and exit from mitosis in eukaryotes [, ]. Results in Caenorhabditis elegans show that the primary essential role of the spindle assembly checkpoint is not in the chromosome segregation process itself but rather in delaying anaphase onset until all chromosomes are properly attached to the spindle. The APC is likely to be required for all metaphase-to-anaphase transitions in a multicellular organism []. This entry represents the long domain downstream of the WD40 repeat/s that are present on the Apc4 subunits.
Probab=28.76 E-value=1.1e+02 Score=27.48 Aligned_cols=49 Identities=22% Similarity=0.327 Sum_probs=36.1
Q ss_pred hHHHHHHHHHHhHHhhHHH----HHHHHHhhhhccc----CCCCChhhhHHHhhhhh
Q 018721 211 QMALLQYQRENLHFLSEEI----LRLQECLSKYEQS----DDGSTPQVDLAHLLAAR 259 (351)
Q Consensus 211 QADLIrYLkdHNa~LSkrI----L~Lq~~l~kye~~----~~g~t~qvdl~h~la~r 259 (351)
=..+++|+++|...+.++. ..+.+.+++|..+ +.+.+++.|+-|+|..=
T Consensus 29 i~~ll~yi~~~l~~i~~~w~~~~~~~~~~l~~~~~~l~~~~~~~~~~~el~~lLltG 85 (210)
T PF12896_consen 29 IQSLLRYIKDTLDAIQEEWEEALQEFDRKLTNLADELQEKGGEGSLQDELLDLLLTG 85 (210)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHhc
Confidence 3468899999999998875 3446677777754 55667888888877654
No 148
>KOG1853 consensus LIS1-interacting protein NUDE [Cytoskeleton]
Probab=28.69 E-value=6.4e+02 Score=25.63 Aligned_cols=84 Identities=21% Similarity=0.283 Sum_probs=0.0
Q ss_pred HHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhh
Q 018721 216 QYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYV 295 (351)
Q Consensus 216 rYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~ 295 (351)
+|-|+--+.+-...+.+++++++|... -|.+.|||+ +||..+.--...+.+-|++--
T Consensus 16 ~ywk~l~~~ykq~f~~~reEl~EFQeg--------------------SrE~Eaele---sqL~q~etrnrdl~t~nqrl~ 72 (333)
T KOG1853|consen 16 QYWKLLHHEYKQHFLQMREELNEFQEG--------------------SREIEAELE---SQLDQLETRNRDLETRNQRLT 72 (333)
T ss_pred HHHhhhHHHHHHHHHHHHHHHHHHhhh--------------------hHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH
Q ss_pred hHhHHHHHHHhhhhhhHHHHHHHHHHh
Q 018721 296 EENERLRAILGEWSTRAAKLERALEVE 322 (351)
Q Consensus 296 eEn~rlRa~l~Ews~raakle~ale~e 322 (351)
-|.+|+..-++---..-++-|-+||.+
T Consensus 73 ~E~e~~Kek~e~q~~q~y~q~s~Ledd 99 (333)
T KOG1853|consen 73 TEQERNKEKQEDQRVQFYQQESQLEDD 99 (333)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH
No 149
>PF15022 DUF4522: Protein of unknown function (DUF4522)
Probab=28.64 E-value=50 Score=28.90 Aligned_cols=30 Identities=30% Similarity=0.472 Sum_probs=25.3
Q ss_pred hhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 309 STRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 309 s~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
|+-.-||||.-|..|+.+++.|+|.++-=+
T Consensus 71 saesiklEreyE~KrL~~lk~qena~eeiQ 100 (117)
T PF15022_consen 71 SAESIKLEREYEMKRLTKLKCQENASEEIQ 100 (117)
T ss_pred cHHHHHHHHHHHHHHHHHHHhhhhhHHHHH
Confidence 556679999999999999999999876433
No 150
>COG3879 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.53 E-value=1.4e+02 Score=29.44 Aligned_cols=43 Identities=28% Similarity=0.386 Sum_probs=27.3
Q ss_pred hHHHhhHHHHHhHHHHHHHHHHHHHHHH----hhhhhhhhHhHHHHH
Q 018721 261 QELRTLSAEMNQLQSELRLARSFVLRVR----NTNNQYVEENERLRA 303 (351)
Q Consensus 261 qelRa~~Ae~~q~~~el~~ar~li~~~r----~~n~q~~eEn~rlRa 303 (351)
+|+|.+.++.++|++|.+..-..+..++ +.+++=.+|-+-||-
T Consensus 57 ~e~~s~Q~~~~~L~~ev~~~~~~~~s~~~~~~t~~~~ie~~l~~l~~ 103 (247)
T COG3879 57 KELRSLQKKVNTLAAEVEDLENKLDSVRRSVLTDDAALEDRLEKLRM 103 (247)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHHHHHHHHHHHH
Confidence 4677778888888888887777776655 444444333444443
No 151
>PRK11281 hypothetical protein; Provisional
Probab=28.48 E-value=8.6e+02 Score=28.82 Aligned_cols=59 Identities=15% Similarity=0.195 Sum_probs=36.0
Q ss_pred HhhHHHHHHHHHhhhhcccCC-------CCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHH
Q 018721 224 FLSEEILRLQECLSKYEQSDD-------GSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARS 282 (351)
Q Consensus 224 ~LSkrIL~Lq~~l~kye~~~~-------g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~ 282 (351)
..-+++=..++++.+.++..+ ...+..+|+..|+..+++|-+..+.+++..+++..+++
T Consensus 91 ~Ap~~l~~a~~~Le~Lk~~~~~~~~~~~~~~Sl~qLEq~L~q~~~~Lq~~Q~~La~~NsqLi~~qT 156 (1113)
T PRK11281 91 QAPAKLRQAQAELEALKDDNDEETRETLSTLSLRQLESRLAQTLDQLQNAQNDLAEYNSQLVSLQT 156 (1113)
T ss_pred HhHHHHHHHHHHHHHhhccccccccccccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhc
Confidence 334455556666666665322 22344568888888888888777777777555544443
No 152
>PF09755 DUF2046: Uncharacterized conserved protein H4 (DUF2046); InterPro: IPR019152 This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain.
Probab=28.41 E-value=6.2e+02 Score=25.84 Aligned_cols=29 Identities=38% Similarity=0.362 Sum_probs=23.5
Q ss_pred hhHHHHHHHHHHhHHhhHHHHHHHHHhhh
Q 018721 210 EQMALLQYQRENLHFLSEEILRLQECLSK 238 (351)
Q Consensus 210 KQADLIrYLkdHNa~LSkrIL~Lq~~l~k 238 (351)
-|-.|+.-|......|..+==.||..|..
T Consensus 175 EQE~lvN~L~Kqm~~l~~eKr~Lq~~l~~ 203 (310)
T PF09755_consen 175 EQEALVNRLWKQMDKLEAEKRRLQEKLEQ 203 (310)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 47778888888888888888889999854
No 153
>PRK13169 DNA replication intiation control protein YabA; Reviewed
Probab=28.40 E-value=2.1e+02 Score=24.79 Aligned_cols=40 Identities=25% Similarity=0.297 Sum_probs=24.4
Q ss_pred HHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhh
Q 018721 269 EMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEW 308 (351)
Q Consensus 269 e~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ew 308 (351)
.++++..|+..-...++.+=..|..=--||+-||..|++-
T Consensus 16 ~l~~l~~el~~LK~~~~el~EEN~~L~iEN~~Lr~~l~~~ 55 (110)
T PRK13169 16 NLGVLLKELGALKKQLAELLEENTALRLENDKLRERLEEL 55 (110)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3334444444444445566667777777888888888753
No 154
>PF14142 YrzO: YrzO-like protein
Probab=28.31 E-value=50 Score=24.60 Aligned_cols=15 Identities=27% Similarity=0.257 Sum_probs=13.3
Q ss_pred hhHhhHHHHHHHHHH
Q 018721 207 LSDEQMALLQYQREN 221 (351)
Q Consensus 207 LlEKQADLIrYLkdH 221 (351)
=+-|||+||+-|||.
T Consensus 27 ~ikqqaeliqllkel 41 (46)
T PF14142_consen 27 KIKQQAELIQLLKEL 41 (46)
T ss_pred HHHHHHHHHHHHHHH
Confidence 678999999999985
No 155
>KOG2129 consensus Uncharacterized conserved protein H4 [Function unknown]
Probab=28.25 E-value=7.9e+02 Score=26.60 Aligned_cols=98 Identities=31% Similarity=0.349 Sum_probs=56.4
Q ss_pred HhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhh--hHHHhhhhh----h---hHHHhhHHHHHhHHHHHHH
Q 018721 209 DEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQV--DLAHLLAAR----D---QELRTLSAEMNQLQSELRL 279 (351)
Q Consensus 209 EKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qv--dl~h~la~r----~---qelRa~~Ae~~q~~~el~~ 279 (351)
..|-.|..-|-.-..+|..+==-||..+ ++.-+||-. |++..=|-- + .-.+-|.+|+.-+++.++.
T Consensus 197 QEqEalvN~LwKrmdkLe~ekr~Lq~Kl-----Dqpvs~p~~prdia~~~~~~gD~a~~~~~hi~~l~~EveRlrt~l~~ 271 (552)
T KOG2129|consen 197 QEQEALVNSLWKRMDKLEQEKRYLQKKL-----DQPVSTPSLPRDIAKIPDVHGDEAAAEKLHIDKLQAEVERLRTYLSR 271 (552)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHh-----cCcccCCCchhhhhcCccccCchHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3466666666666777776666666655 333333322 444221111 1 1233556666666666666
Q ss_pred HHHH----HHHHHhhhhhhhhHhHHHHHHHh-hhhhh
Q 018721 280 ARSF----VLRVRNTNNQYVEENERLRAILG-EWSTR 311 (351)
Q Consensus 280 ar~l----i~~~r~~n~q~~eEn~rlRa~l~-Ews~r 311 (351)
|.-. +++.|....-.-|||+||-.-|- |-..|
T Consensus 272 Aqk~~~ek~~qy~~Ee~~~reen~rlQrkL~~e~erR 308 (552)
T KOG2129|consen 272 AQKSYQEKLMQYRAEEVDHREENERLQRKLINELERR 308 (552)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHH
Confidence 6544 28888888888899999866553 44333
No 156
>PF02932 Neur_chan_memb: Neurotransmitter-gated ion-channel transmembrane region ion channel family signature gamma-aminobutyric acid (GABA) receptor signature nicotinic acetylcholine receptor signature; InterPro: IPR006029 Neurotransmitter ligand-gated ion channels are transmembrane receptor-ion channel complexes that open transiently upon binding of specific ligands, allowing rapid transmission of signals at chemical synapses [, ]. Five of these ion channel receptor families have been shown to form a sequence-related superfamily: Nicotinic acetylcholine receptor (AchR), an excitatory cation channel in vertebrates and invertebrates; in vertebrate motor endplates it is composed of alpha, beta, gamma and delta/epsilon subunits; in neurons it is composed of alpha and non-alpha (or beta) subunits []. Glycine receptor, an inhibitory chloride ion channel composed of alpha and beta subunits []. Gamma-aminobutyric acid (GABA) receptor, an inhibitory chloride ion channel; at least four types of subunits (alpha, beta, gamma and delta) are known []. Serotonin 5HT3 receptor, of which there are seven major types (5HT3-5HT7) []. Glutamate receptor, an excitatory cation channel of which at least three types have been described (kainate, N-methyl-D-aspartate (NMDA) and quisqualate) []. These receptors possess a pentameric structure (made up of varying subunits), surrounding a central pore. All known sequences of subunits from neurotransmitter-gated ion-channels are structurally related. They are composed of a large extracellular glycosylated N-terminal ligand-binding domain, followed by three hydrophobic transmembrane regions which form the ionic channel, followed by an intracellular region of variable length. A fourth hydrophobic region is found at the C-terminal of the sequence [, ]. This domain represents four transmembrane helices of a variety of neurotransmitter-gated ion-channels.; GO: 0006811 ion transport, 0016020 membrane; PDB: 1DXZ_A 3MRA_A 1EQ8_C 1OED_C 2PR9_P 1A11_A 1CEK_A 2BG9_E 2KSR_A 2K59_B ....
Probab=28.24 E-value=2e+02 Score=23.40 Aligned_cols=24 Identities=25% Similarity=0.432 Sum_probs=15.8
Q ss_pred hhHhHHHHHHHhhhhhhHHHHHHH
Q 018721 295 VEENERLRAILGEWSTRAAKLERA 318 (351)
Q Consensus 295 ~eEn~rlRa~l~Ews~raakle~a 318 (351)
..+.+.-..+-+||..-|-+++|-
T Consensus 197 ~~~~~~~~~~~~ew~~~A~viDR~ 220 (237)
T PF02932_consen 197 LREQDEEEEIKEEWKFVAMVIDRL 220 (237)
T ss_dssp HHHHHHHHHHHHHHHSTSHHHHHH
T ss_pred cccccccccccccccccHHHHHHH
Confidence 333344445667899988888864
No 157
>PF01763 Herpes_UL6: Herpesvirus UL6 like; InterPro: IPR002660 This family consists of various proteins from the Herpesviridae that are similar to Human herpesvirus 1 (HHV-1) UL6 virion protein. UL6 is essential for cleavage and packaging of the viral genome [].; GO: 0006323 DNA packaging
Probab=28.01 E-value=70 Score=34.72 Aligned_cols=39 Identities=18% Similarity=0.302 Sum_probs=35.1
Q ss_pred hHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCC
Q 018721 208 SDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGS 246 (351)
Q Consensus 208 lEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~ 246 (351)
+..|.|-|+=||+-|..+-+|+=.+..+|.+|++.+...
T Consensus 372 In~qf~tIe~Lk~~n~~~~~kl~~~e~~L~r~~~~~~~~ 410 (557)
T PF01763_consen 372 INNQFDTIEDLKEENQDLEKKLRELESELSRYREEAQRA 410 (557)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 667999999999999999999999999999999974433
No 158
>PF10251 PEN-2: Presenilin enhancer-2 subunit of gamma secretase; InterPro: IPR019379 This entry is a short, 101 peptide protein, which is the smallest subunit of the gamma-secretase aspartyl protease complex. It catalyses the intra-membrane cleavage of a subset of type I transmembrane proteins. The other active constituents of the complex are presenilin (PS) nicastrin and anterior pharynx defective-1 (APH-1) protein. Presenilin enhancer-2 (PEN-2) adopts a hairpin orientation in the membrane with its N- and C-terminal domains facing the luminal/extracellular space. The C-terminal domain maintains PS stability within the complex [].
Probab=28.00 E-value=41 Score=28.57 Aligned_cols=21 Identities=24% Similarity=0.437 Sum_probs=17.7
Q ss_pred cccchhhHHHHHHHHHHHHHH
Q 018721 36 SHSIFGSVVYCFVLAGYAILA 56 (351)
Q Consensus 36 ~~~~~g~~~y~~~L~~yA~~~ 56 (351)
-+|.+|.++|.++|++|+++-
T Consensus 53 i~SaiG~~vw~v~l~~W~~~F 73 (94)
T PF10251_consen 53 IRSAIGFLVWTVVLISWILIF 73 (94)
T ss_pred HHHHHHHHHHHHHHHHHHHHH
Confidence 468999999999999988753
No 159
>KOG0982 consensus Centrosomal protein Nuf [Cell cycle control, cell division, chromosome partitioning; Cytoskeleton]
Probab=27.89 E-value=6.1e+02 Score=27.39 Aligned_cols=46 Identities=33% Similarity=0.367 Sum_probs=24.9
Q ss_pred HHhhhhhhhhHhHHHHHHHhhhhhhHHHH-------HHHHHHhhhccHHHHHH
Q 018721 287 VRNTNNQYVEENERLRAILGEWSTRAAKL-------ERALEVERMSNIELQKK 332 (351)
Q Consensus 287 ~r~~n~q~~eEn~rlRa~l~Ews~raakl-------e~ale~er~~~~~~~~~ 332 (351)
+.+..+|-.+||++||-..---..-+.|| +-.||+-|+--.+.||.
T Consensus 302 lqmr~qqleeentelRs~~arlksl~dklaee~qr~sd~LE~lrlql~~eq~l 354 (502)
T KOG0982|consen 302 LQMRDQQLEEENTELRSLIARLKSLADKLAEEDQRSSDLLEALRLQLICEQKL 354 (502)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHH
Confidence 33444566677777776554444444443 34556655554444444
No 160
>PF08826 DMPK_coil: DMPK coiled coil domain like; InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=27.83 E-value=3e+02 Score=21.60 Aligned_cols=43 Identities=21% Similarity=0.263 Sum_probs=21.7
Q ss_pred hhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 265 TLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 265 a~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
++.-|++.+++.-..+-+-++.-..-|..-..|-++|+.-++|
T Consensus 15 ~~~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e~ee 57 (61)
T PF08826_consen 15 AIQEELTKVKSANLAFESKLQEAEKRNRELEQEIERLKKEMEE 57 (61)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444444444445555555555555555554
No 161
>PF07782 DC_STAMP: DC-STAMP-like protein; InterPro: IPR012858 This group of sequences is similar to a region of the dendritic cell-specific transmembrane protein (DC-STAMP, Q9H295 from SWISSPROT). This is thought to be a novel receptor protein that shares no identity with other multimembrane-spanning proteins []. It is thought to have seven putative transmembrane regions [], two of which are found in the region featured in this family. DC-STAMP is also described as having potential N-linked glycosylation sites and a potential phosphorylation site for PKC [], but these are not conserved. ; GO: 0016021 integral to membrane
Probab=27.62 E-value=4.7e+02 Score=23.76 Aligned_cols=31 Identities=13% Similarity=0.143 Sum_probs=21.9
Q ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 018721 143 SISTLLRIIMLIEAICAASFMSVYIGYVHQY 173 (351)
Q Consensus 143 s~~~lLriil~lElv~~l~~li~YIvkV~rF 173 (351)
....+..+.+..=+++.++++-.|+.|.|+-
T Consensus 143 ~~~~~~~i~~l~~l~~ll~~le~Y~~RLR~~ 173 (191)
T PF07782_consen 143 DYSVYIQIGLLYLLLWLLVLLEPYALRLRRV 173 (191)
T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555666666777778888899998874
No 162
>PF08657 DASH_Spc34: DASH complex subunit Spc34 ; InterPro: IPR013966 The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules [].
Probab=27.53 E-value=2.2e+02 Score=27.90 Aligned_cols=66 Identities=18% Similarity=0.305 Sum_probs=42.8
Q ss_pred HhhHHHHHHHHHHhH------HhhHHHHHHHHHhhhhcccCCCC-----------ChhhhHHHhhhhhhhHHHhhHHHHH
Q 018721 209 DEQMALLQYQRENLH------FLSEEILRLQECLSKYEQSDDGS-----------TPQVDLAHLLAARDQELRTLSAEMN 271 (351)
Q Consensus 209 EKQADLIrYLkdHNa------~LSkrIL~Lq~~l~kye~~~~g~-----------t~qvdl~h~la~r~qelRa~~Ae~~ 271 (351)
+-+..+-.+..+|+. +|=.+|-.-+.+|.++.++..-. .+.+|.+.++..=+.|.|.|-++++
T Consensus 177 ga~eki~~Lr~~y~~l~~~i~~lE~~VaeQ~~qL~~~n~~~~~~~~~~~~~~~~~~~~~~~de~I~rEeeEIreLE~k~~ 256 (259)
T PF08657_consen 177 GAREKIAALRQRYNQLSNSIAYLEAEVAEQEAQLERMNRSSSDSSSDDEESEESSEDSVDTDEDIRREEEEIRELERKKR 256 (259)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHH
Confidence 334455556666654 33346666667777765542222 2345778888888999999999988
Q ss_pred hHH
Q 018721 272 QLQ 274 (351)
Q Consensus 272 q~~ 274 (351)
++|
T Consensus 257 ~Lq 259 (259)
T PF08657_consen 257 ELQ 259 (259)
T ss_pred hcC
Confidence 764
No 163
>cd09236 V_AnPalA_UmRIM20_like Protein-interacting V-domains of Aspergillus nidulans PalA/RIM20, Ustilago maydis RIM20, and related proteins. This family belongs to the V_Alix_like superfamily which includes the V-shaped (V) domains of Bro1 and Rim20 from Saccharomyces cerevisiae, mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), and related domains. Aspergillus nidulas PalA/RIM20 and Ustilago maydis RIM20, like Saccharomyces cerevisiae Rim20, participate in the response to the external pH via the Pal/Rim101 pathway; however, Saccharomyces cerevisiae Rim20 does not belong to this family. This pathway is a signaling cascade resulting in the activation of the transcription factor PacC/Rim101. The mammalian Alix V-domain (belonging to a different family) contains a binding site, partially conserved in the superfamily, for the retroviral late assembly (L) domain YPXnL motif. Aspergillus nidulas Pa
Probab=27.24 E-value=6.4e+02 Score=25.21 Aligned_cols=87 Identities=21% Similarity=0.326 Sum_probs=70.7
Q ss_pred HHhhhhh-hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHh--hhh------------hhHHHHHH
Q 018721 253 AHLLAAR-DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILG--EWS------------TRAAKLER 317 (351)
Q Consensus 253 ~h~la~r-~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~--Ews------------~raakle~ 317 (351)
.|.-+-| .+.+..+...|+.+..-...++..+......=+.=.+|.+++|+--| .|. .+..+++-
T Consensus 62 ~~~~~v~~~~g~~~l~~~l~~l~~l~~~~~~~L~e~~~~L~~E~~ed~~~R~k~g~~~Wtr~~S~~~~~~l~~~~~~~~~ 141 (353)
T cd09236 62 RHAEEIRQEDGLERIRASLDDVARLAASDRAILEEAMDILDDEASEDESLRRKFGTDRWTRPDSHEANPKLYTQAAEYEG 141 (353)
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHcCCCCCCCCCcHHHHHHHHHHHHHHHH
Confidence 3333333 45677888999999988889999997777777778889999999999 595 67788999
Q ss_pred HHHHhhhccHHHHHHHHHhhhc
Q 018721 318 ALEVERMSNIELQKKISTRRNQ 339 (351)
Q Consensus 318 ale~er~~~~~~~~~~~~~r~~ 339 (351)
.|+.-.=|+-.+++++...+..
T Consensus 142 ~L~~A~~sD~~v~~k~~~~~~~ 163 (353)
T cd09236 142 YLKQAGASDELVRRKLDEWEDL 163 (353)
T ss_pred HHHHHHhhHHHHHHHHHHHHHH
Confidence 9999999999999999887663
No 164
>PF11833 DUF3353: Protein of unknown function (DUF3353); InterPro: IPR021788 This family of proteins are functionally uncharacterised. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 205 to 258 amino acids in length.
Probab=26.81 E-value=1e+02 Score=28.78 Aligned_cols=55 Identities=15% Similarity=0.172 Sum_probs=37.3
Q ss_pred CCCCcccchhhHHHHHH--HHHHHHHHHhhhhhhccc-c-ccchhHHH--HHHHHHHHHHH
Q 018721 32 GHRKSHSIFGSVVYCFV--LAGYAILAAGTTWIFHPI-H-YLIPPLLC--SCGVILLALTG 86 (351)
Q Consensus 32 ~~r~~~~~~g~~~y~~~--L~~yA~~~~~~pw~~~~~-~-~~~~~lL~--y~~v~LWlLt~ 86 (351)
=+|+-++...+.++.+. .+||.+-.+..+|+...+ + ...|..+. ..-+.||+...
T Consensus 132 l~~K~~~~~rA~~~~~~~L~~G~~lGs~l~~~l~~~~~p~~~s~~~~~sl~~~i~lwl~s~ 192 (194)
T PF11833_consen 132 LNRKERKLGRAFLWTLGGLVVGLILGSLLASWLPVDIVPGPWSPEQLVSLFTYILLWLVSL 192 (194)
T ss_pred HHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCHHHHHHHHHHHHHHHHHh
Confidence 36778888888888765 678888888999988765 4 23333322 23467787654
No 165
>PF14931 IFT20: Intraflagellar transport complex B, subunit 20
Probab=26.63 E-value=4.4e+02 Score=23.07 Aligned_cols=88 Identities=23% Similarity=0.289 Sum_probs=56.2
Q ss_pred HHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhh--hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhh
Q 018721 218 QRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAAR--DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYV 295 (351)
Q Consensus 218 LkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r--~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~ 295 (351)
..++...|..+.-.....++.|.+-..| -++...-+|.+ ..-+||+.+ .|+|.++.+-..+--+.+-..=.---
T Consensus 18 ~~~~t~~Lk~ec~~F~~ki~~F~~iv~~---~~~~~~~~A~~VE~eKlkAIG~-RN~l~s~~k~R~~~~q~lq~~I~Ek~ 93 (120)
T PF14931_consen 18 KADQTQELKEECKEFVEKISEFQKIVKG---FIEILDELAKRVENEKLKAIGA-RNLLKSEAKQREAQQQQLQALIAEKK 93 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHh-HHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 3466677777777777777777666333 34555566666 446788765 58888887666665544444433444
Q ss_pred hHhHHHHHHHhhhh
Q 018721 296 EENERLRAILGEWS 309 (351)
Q Consensus 296 eEn~rlRa~l~Ews 309 (351)
.|-+|||...+-+.
T Consensus 94 ~eLERl~~E~~sL~ 107 (120)
T PF14931_consen 94 MELERLRSEYESLQ 107 (120)
T ss_pred HHHHHHHHHHHHHH
Confidence 56777777766553
No 166
>COG4792 EscU Type III secretory pathway, component EscU [Intracellular trafficking and secretion]
Probab=26.39 E-value=4e+02 Score=27.52 Aligned_cols=68 Identities=25% Similarity=0.291 Sum_probs=38.9
Q ss_pred hHHHHHHHHHHHHHHHhhhhhhccccccchhHHHHHHHHHHHHHHHHHHHHHHHHhHHHhhhHHHHHHHhhccc
Q 018721 42 SVVYCFVLAGYAILAAGTTWIFHPIHYLIPPLLCSCGVILLALTGIFQQYFVYQVQKIRLQGYYSFSQKLKHIV 115 (351)
Q Consensus 42 ~~~y~~~L~~yA~~~~~~pw~~~~~~~~~~~lL~y~~v~LWlLt~l~d~yv~~qH~KlRl~GYl~FYR~Tr~lk 115 (351)
+.+|.+.+-|||......|. -++.--..+.+-...|+..++.-.|+....-.-=-+ |+.|++++|.=|
T Consensus 151 slif~f~l~~~~~t~~~lp~-----CG~~C~~~Vv~~~~~~L~~g~~~~ylv~sv~Dy~fq-r~~~~K~lkMSK 218 (349)
T COG4792 151 SLIFWFMLHGYANTFLYLPG-----CGLYCALPVVSFLLRLLWVGVAVGYLVFSVADYAFQ-RYQILKELKMSK 218 (349)
T ss_pred HHHHHHHHHHHHHHHhhccc-----cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhccCH
Confidence 45666777888888888887 444444444555555655555555554432222222 556666666533
No 167
>PF14584 DUF4446: Protein of unknown function (DUF4446)
Probab=26.19 E-value=1.7e+02 Score=26.49 Aligned_cols=53 Identities=17% Similarity=0.316 Sum_probs=39.1
Q ss_pred HHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH
Q 018721 231 RLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFV 284 (351)
Q Consensus 231 ~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li 284 (351)
+++....+|+.--.|.+++ |++..+...-++++.+..+.++++.+++......
T Consensus 24 kl~kl~r~Y~~lm~g~~~~-~lE~~l~~~~~~~~~~~~~~~~~~~~~~~l~~~~ 76 (151)
T PF14584_consen 24 KLRKLKRRYDALMRGKDGK-NLEDLLNELFDQIDELKEELEELEKRIEELEEKL 76 (151)
T ss_pred HHHHHHHHHHHHhCCCCcc-cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555677776777766 8999998888888888888888887777665543
No 168
>PF10654 DUF2481: Protein of unknown function (DUF2481) ; InterPro: IPR018916 This entry is represented by Bacteriophage A500, Gp59. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=25.95 E-value=65 Score=28.71 Aligned_cols=34 Identities=21% Similarity=0.242 Sum_probs=28.1
Q ss_pred hHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhccc
Q 018721 208 SDEQMALLQYQRENLHFLSEEILRLQECLSKYEQS 242 (351)
Q Consensus 208 lEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~ 242 (351)
=|+|..+|.|+-.++-.+|++ ..||.+|+.-...
T Consensus 9 KerQreIIsyl~n~dl~~~~~-k~LqkeLn~Lm~~ 42 (126)
T PF10654_consen 9 KERQREIISYLVNNDLSFSKR-KELQKELNQLMNE 42 (126)
T ss_pred HHHHHHHHHHHHhCCCChHHH-HHHHHHHHHHHhc
Confidence 489999999999999999875 6788888765543
No 169
>PF11239 DUF3040: Protein of unknown function (DUF3040); InterPro: IPR021401 Some members in this family of proteins with unknown function are annotated as membrane proteins however this cannot be confirmed.
Probab=25.62 E-value=2.2e+02 Score=22.62 Aligned_cols=43 Identities=21% Similarity=0.396 Sum_probs=20.5
Q ss_pred cchhhhhhhhccCcccCCCCcccchhhHHHHHHHHHHHHHHHhhhh
Q 018721 16 EENAMFLDILHEAPLLGHRKSHSIFGSVVYCFVLAGYAILAAGTTW 61 (351)
Q Consensus 16 ~~~~~f~d~~~e~p~~~~r~~~~~~g~~~y~~~L~~yA~~~~~~pw 61 (351)
+||..|-.-+...+-..++..+.+.|.++ +++|.++++++...
T Consensus 20 ~~DP~fa~~l~~~~~~~~~~r~~~~~~~~---~v~gl~llv~G~~~ 62 (82)
T PF11239_consen 20 ADDPRFAARLRSGRPRRPSRRRRVLGVLL---VVVGLALLVAGVVL 62 (82)
T ss_pred hcCcHHHHHhccCCCCCCchhHHHHHHHH---HHHHHHHHHHHHHH
Confidence 34555655555433333333333344333 55666666666554
No 170
>PF04420 CHD5: CHD5-like protein; InterPro: IPR007514 Members of this family are probably coiled-coil proteins that are similar to the CHD5 (Congenital heart disease 5) protein. The exact molecular function of these eukaryotic proteins is unknown.; PDB: 3SJA_H 3SJC_D 3SJB_D 3ZS8_D 3VLC_E.
Probab=25.42 E-value=1.6e+02 Score=26.48 Aligned_cols=59 Identities=22% Similarity=0.238 Sum_probs=0.0
Q ss_pred hhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhc
Q 018721 265 TLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQ 339 (351)
Q Consensus 265 a~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~ 339 (351)
..+.|..+++.|+...+.+...+-..++. +|.|||+|.++.---.-.+++++.++-|.+
T Consensus 37 ~~~~~~~~l~~Ei~~l~~E~~~iS~qDeF----------------AkwaKl~Rk~~kl~~el~~~~~~~~~~~~~ 95 (161)
T PF04420_consen 37 KSSKEQRQLRKEILQLKRELNAISAQDEF----------------AKWAKLNRKLDKLEEELEKLNKSLSSEKSS 95 (161)
T ss_dssp HHHHHHHHHHHHHHHHHHHHTTS-TTTSH----------------HHHHHHHHHHHHHHHHHHHHHHHHHHTCHH
T ss_pred cccHHHHHHHHHHHHHHHHHHcCCcHHHH----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
No 171
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=25.36 E-value=7.2e+02 Score=26.67 Aligned_cols=71 Identities=21% Similarity=0.227 Sum_probs=38.0
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHH----HHHHhhhhhh----------hhHhHHHHHHHhhhhhhHHHHHHHHHHhhhc
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFV----LRVRNTNNQY----------VEENERLRAILGEWSTRAAKLERALEVERMS 325 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li----~~~r~~n~q~----------~eEn~rlRa~l~Ews~raakle~ale~er~~ 325 (351)
+++.|++.-|.|-+.+||..+|-.= ++.+..|..| .+...+|+--|+--+..-++||.-+..---+
T Consensus 94 ~~~k~~~e~er~~~~~El~~~r~e~~~v~~~~~~a~~n~~kAqQ~lar~t~Q~q~lqtrl~~l~~qr~ql~aq~qsl~a~ 173 (499)
T COG4372 94 QGEKRAAETEREAARSELQKARQEREAVRQELAAARQNLAKAQQELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQAS 173 (499)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3455555555566666665555543 3333333322 2223456666667777777777665554444
Q ss_pred cHHHH
Q 018721 326 NIELQ 330 (351)
Q Consensus 326 ~~~~~ 330 (351)
.-+||
T Consensus 174 ~k~LQ 178 (499)
T COG4372 174 QKQLQ 178 (499)
T ss_pred HHHHH
Confidence 44555
No 172
>KOG0996 consensus Structural maintenance of chromosome protein 4 (chromosome condensation complex Condensin, subunit C) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=25.32 E-value=1.2e+03 Score=28.11 Aligned_cols=79 Identities=20% Similarity=0.275 Sum_probs=42.4
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHH-----------HH-------HHhhhhhhhhHhHHHHHHH---hhhhhhHHHHHHH
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFV-----------LR-------VRNTNNQYVEENERLRAIL---GEWSTRAAKLERA 318 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li-----------~~-------~r~~n~q~~eEn~rlRa~l---~Ews~raakle~a 318 (351)
+.+|--.....|++.+|++-|+|.+ .+ +-..-.-+.|.++.|.+.. .-|..-....|.-
T Consensus 485 ekel~~~~~~~n~~~~e~~vaesel~~L~~~~~~~~~~~e~lk~~L~~~~~~~~e~~~~l~~~k~~l~~~k~e~~~~~k~ 564 (1293)
T KOG0996|consen 485 EKELMPLLKQVNEARSELDVAESELDILLSRHETGLKKVEELKGKLLASSESLKEKKTELDDLKEELPSLKQELKEKEKE 564 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHh
Confidence 3455566777788888888888886 11 1122222444444444433 3344445555555
Q ss_pred HHHhhhccHHHHHHHHHhhh
Q 018721 319 LEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 319 le~er~~~~~~~~~~~~~r~ 338 (351)
|+.-|-.--.++++..++|+
T Consensus 565 l~~~~~e~~~~~~~~~~~rq 584 (1293)
T KOG0996|consen 565 LPKLRKEERNLKSQLNKLRQ 584 (1293)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 55545444445555555554
No 173
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=25.12 E-value=3.3e+02 Score=29.12 Aligned_cols=56 Identities=30% Similarity=0.331 Sum_probs=40.1
Q ss_pred hhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHH-------HHHHHhhhhhhhhHhHHHHHH
Q 018721 249 QVDLAHLLAARDQELRTLSAEMNQLQSELRLARSF-------VLRVRNTNNQYVEENERLRAI 304 (351)
Q Consensus 249 qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~l-------i~~~r~~n~q~~eEn~rlRa~ 304 (351)
-.|+.|-||-=++..-.++|++..+..|-++-++- |+++.+.|.|--||--|.++.
T Consensus 15 ~~~~~~~laq~~k~~s~~~aq~~~~~a~~~ai~a~~~~~E~~l~~Lq~e~~~l~e~~v~~~a~ 77 (459)
T KOG0288|consen 15 LIDLNTELAQCEKAQSRLSAQLVILRAESRAIKAKLQEKELELNRLQEENTQLNEERVREEAT 77 (459)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 35777777776766667777777777776665554 488888888888887775554
No 174
>PF10066 DUF2304: Uncharacterized conserved protein (DUF2304); InterPro: IPR019277 This entry represents hypothetical archaeal and bacterial proteins that have no known function.
Probab=25.02 E-value=3.9e+02 Score=22.47 Aligned_cols=64 Identities=22% Similarity=0.173 Sum_probs=36.5
Q ss_pred CCcccchhhHHHHHHHHHHHHHHHhhhhhhcccc----ccchh-HHHHHHHHHHHHHHHHHHHHHHHHhH
Q 018721 34 RKSHSIFGSVVYCFVLAGYAILAAGTTWIFHPIH----YLIPP-LLCSCGVILLALTGIFQQYFVYQVQK 98 (351)
Q Consensus 34 r~~~~~~g~~~y~~~L~~yA~~~~~~pw~~~~~~----~~~~~-lL~y~~v~LWlLt~l~d~yv~~qH~K 98 (351)
|+.+--..-.+..+++....+.++..|.+...+- --.|| ++.|+ .++.++...+..|.+-.+..
T Consensus 24 r~~~l~~~~~l~Wl~~~i~~l~~~ifP~~~~~vA~~lGi~~~~n~lf~~-~i~~ll~~~~~l~~~is~le 92 (115)
T PF10066_consen 24 RKRKLRLKYSLLWLVFSIILLILSIFPNILDWVAKLLGIGRPPNLLFYL-GILFLLVIIFSLYVRISRLE 92 (115)
T ss_pred HHhhccHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHCCCchhHHHHHH-HHHHHHHHHHHHHHHHHHHH
Confidence 4444445555666677777778888888777642 22333 44444 34444445666666654443
No 175
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=25.01 E-value=93 Score=29.51 Aligned_cols=24 Identities=42% Similarity=0.470 Sum_probs=16.1
Q ss_pred hhhhHhHHHHHHHhhhhhhHHHHH
Q 018721 293 QYVEENERLRAILGEWSTRAAKLE 316 (351)
Q Consensus 293 q~~eEn~rlRa~l~Ews~raakle 316 (351)
+..+||++|++.+.+-..+.++++
T Consensus 73 ~l~~en~~L~~e~~~l~~~~~~~~ 96 (276)
T PRK13922 73 DLREENEELKKELLELESRLQELE 96 (276)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456777888777776666666553
No 176
>PF11559 ADIP: Afadin- and alpha -actinin-Binding; InterPro: IPR021622 This family is found in mammals where it is localised at cell-cell adherens junctions [], and in Sch. pombe and other fungi where it anchors spindle-pole bodies to spindle microtubules []. It is a coiled-coil structure, and in pombe, it is required for anchoring the minus end of spindle microtubules to the centrosome equivalent, the spindle-pole body. The name ADIP derives from the family being composed of Afadin- and alpha -Actinin-Binding Proteins Localised at Cell-Cell Adherens Junctions.
Probab=24.99 E-value=4.6e+02 Score=22.72 Aligned_cols=30 Identities=17% Similarity=0.179 Sum_probs=11.8
Q ss_pred HHHHHHHhhhhhhHHHHHHHHHHhhhccHH
Q 018721 299 ERLRAILGEWSTRAAKLERALEVERMSNIE 328 (351)
Q Consensus 299 ~rlRa~l~Ews~raakle~ale~er~~~~~ 328 (351)
+-+-..+.-+..+...++..+..+.-.+-.
T Consensus 83 ~~~ere~~~~~~~~~~l~~~~~~~~~~~k~ 112 (151)
T PF11559_consen 83 EELERELASAEEKERQLQKQLKSLEAKLKQ 112 (151)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 333333333444444444444443333333
No 177
>PF04728 LPP: Lipoprotein leucine-zipper; InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=24.50 E-value=2.3e+02 Score=22.08 Aligned_cols=24 Identities=29% Similarity=0.475 Sum_probs=18.2
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHH
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVL 285 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~ 285 (351)
++-+|.+..+||++++...|+.++
T Consensus 11 dVq~L~~kvdqLs~dv~~lr~~v~ 34 (56)
T PF04728_consen 11 DVQTLNSKVDQLSSDVNALRADVQ 34 (56)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455778888888888888887764
No 178
>PRK11877 psaI photosystem I reaction center subunit VIII; Reviewed
Probab=24.37 E-value=50 Score=24.00 Aligned_cols=27 Identities=22% Similarity=0.248 Sum_probs=19.9
Q ss_pred HHHhhhhhhccccccchhHHHHHHHHH
Q 018721 55 LAAGTTWIFHPIHYLIPPLLCSCGVIL 81 (351)
Q Consensus 55 ~~~~~pw~~~~~~~~~~~lL~y~~v~L 81 (351)
++...||||.|.-++..|...++-..+
T Consensus 6 aas~LPsI~VPlVGlvfPai~Mallf~ 32 (38)
T PRK11877 6 AASWLPWIFVPLVGWVFPAVFMVLLGR 32 (38)
T ss_pred hHHhCchHHHHHHHHHHHHHHHHHHHH
Confidence 456789999999888888766544333
No 179
>PRK10245 adrA diguanylate cyclase AdrA; Provisional
Probab=24.33 E-value=7.1e+02 Score=24.73 Aligned_cols=53 Identities=6% Similarity=-0.046 Sum_probs=37.7
Q ss_pred hhhHHHHHHHHHHHHHHHhhhhhhccccccchhHHHHHHHHHHHHHHHHHHHHHHHHh
Q 018721 40 FGSVVYCFVLAGYAILAAGTTWIFHPIHYLIPPLLCSCGVILLALTGIFQQYFVYQVQ 97 (351)
Q Consensus 40 ~g~~~y~~~L~~yA~~~~~~pw~~~~~~~~~~~lL~y~~v~LWlLt~l~d~yv~~qH~ 97 (351)
.|..+||+..+.+.--.+...|.+ +.+-.++|-|+++|+..-..+.+-...++
T Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~-----l~~~~~~~p~~a~~~~~~~~~~~~~~~~~ 96 (366)
T PRK10245 44 LAGMFLPIASTLVSHPPPGWWWLL-----LVGWAFVWPHLAWQIASRAVDPLSREIYN 96 (366)
T ss_pred HHHhHHHHHHHHHhcccchHHHHH-----HHHHHHHhHHHHHHHHHhCCChhHHHHHH
Confidence 366777887777777666667744 34556777888888888877877766655
No 180
>PRK11281 hypothetical protein; Provisional
Probab=24.29 E-value=8.5e+02 Score=28.84 Aligned_cols=43 Identities=14% Similarity=0.158 Sum_probs=20.6
Q ss_pred hHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhhcC
Q 018721 298 NERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRNQH 340 (351)
Q Consensus 298 n~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~~~ 340 (351)
-+...+.|.+|..+.+-++-+.|+-+=.=.|-++.+.+.|++.
T Consensus 137 Lq~~Q~~La~~NsqLi~~qT~PERAQ~~lsea~~RlqeI~~~L 179 (1113)
T PRK11281 137 LQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLL 179 (1113)
T ss_pred HHHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHH
Confidence 3344455555555555555444444333334455555555544
No 181
>KOG0243 consensus Kinesin-like protein [Cytoskeleton]
Probab=24.19 E-value=9.9e+02 Score=28.32 Aligned_cols=120 Identities=18% Similarity=0.254 Sum_probs=80.5
Q ss_pred cccCCCchhHhhHHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhH-HHhhhhhhhHHHhhHHHHHhHHHHHH
Q 018721 200 RYHDGGRLSDEQMALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDL-AHLLAARDQELRTLSAEMNQLQSELR 278 (351)
Q Consensus 200 Gfrd~g~LlEKQADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl-~h~la~r~qelRa~~Ae~~q~~~el~ 278 (351)
.|++...-...+++.|+=|.++...+-++|-.++++...=...-..-+-..|. ..-|.-+.+|++.+.-|..|++..|+
T Consensus 435 ~y~~~e~e~~~~~~~ieele~el~~~~~~l~~~~e~~~~~~~~~~~l~~~~~~~k~~L~~~~~el~~~~ee~~~~~~~l~ 514 (1041)
T KOG0243|consen 435 RYTQEEKEKKEMAEQIEELEEELENLEKQLKDLTELYMNQLEIKELLKEEKEKLKSKLQNKNKELESLKEELQQAKATLK 514 (1041)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44433335667888899999999999998888888873211111111222222 44456667788888888888888888
Q ss_pred HHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHH
Q 018721 279 LARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 279 ~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~al 319 (351)
..--.|++....-.--++-+.-||..+++-.....-|=..|
T Consensus 515 ~~e~ii~~~~~se~~l~~~a~~l~~~~~~s~~d~s~l~~kl 555 (1041)
T KOG0243|consen 515 EEEEIISQQEKSEEKLVDRATKLRRSLEESQDDLSSLFEKL 555 (1041)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 88777866666655555558888888888877766444333
No 182
>TIGR02559 HrpB7 type III secretion protein HrpB7. This family of genes is found in type III secretion operons in a narrow range of species including Xanthomonas, Burkholderia and Ralstonia.
Probab=24.11 E-value=5.9e+02 Score=23.69 Aligned_cols=72 Identities=22% Similarity=0.254 Sum_probs=0.0
Q ss_pred CCChhhhHHHhhhhh------hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHH
Q 018721 245 GSTPQVDLAHLLAAR------DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERA 318 (351)
Q Consensus 245 g~t~qvdl~h~la~r------~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~a 318 (351)
|.++-+-++.+++-| +-.+....++..++++.|......|.-.+. +-.|+++.++--+.|++.|-++
T Consensus 64 tG~~~f~id~~la~~ryr~vl~~~~~~aE~~~aaa~~al~~~~~~laa~~r-------~iaRn~a~id~c~eR~~~l~ra 136 (158)
T TIGR02559 64 TGTASFTIDAYLQCRAYRDVLEAHLGAAEQAEAAARAALQALAAALAAKKR-------EIARLDAQIDVCRERAERLRRA 136 (158)
T ss_pred cCCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH
Q ss_pred HHHhh
Q 018721 319 LEVER 323 (351)
Q Consensus 319 le~er 323 (351)
.|+++
T Consensus 137 ~ea~~ 141 (158)
T TIGR02559 137 GEAAR 141 (158)
T ss_pred HHHHH
No 183
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=24.00 E-value=9.6e+02 Score=26.13 Aligned_cols=32 Identities=9% Similarity=0.032 Sum_probs=16.6
Q ss_pred hhhhHHHHHHHHHHhhhccHHHHHHHHHhhhc
Q 018721 308 WSTRAAKLERALEVERMSNIELQKKISTRRNQ 339 (351)
Q Consensus 308 ws~raakle~ale~er~~~~~~~~~~~~~r~~ 339 (351)
...+-..|+|..+.-|-.=..+.+++.+.+-+
T Consensus 374 ~~~e~~~L~Re~~~~~~~Y~~ll~r~~e~~~~ 405 (754)
T TIGR01005 374 QQVDLDALQRDAAAKRQLYESYLTNYRQAASR 405 (754)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 34444556666665554444455555555443
No 184
>KOG4673 consensus Transcription factor TMF, TATA element modulatory factor [Transcription]
Probab=23.86 E-value=1.2e+03 Score=27.03 Aligned_cols=106 Identities=15% Similarity=0.117 Sum_probs=55.9
Q ss_pred HHHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHH-------HHHH
Q 018721 212 MALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLA-------RSFV 284 (351)
Q Consensus 212 ADLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~a-------r~li 284 (351)
-++|+-|+..--+|||++|.=+....|-.--+.-+ +.+++.-..-+--|.-|.|-|.+=|+.- +--|
T Consensus 452 dE~I~~lm~EGEkLSK~ql~qs~iIkKLRAk~ke~------etl~~K~ge~i~~L~sE~~~lk~il~~Kee~Ek~~~E~I 525 (961)
T KOG4673|consen 452 DEIINQLMAEGEKLSKKQLAQSAIIKKLRAKIKEA------ETLEEKKGELITKLQSEENKLKSILRDKEETEKLLQETI 525 (961)
T ss_pred HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHhhhh------hHHHHHhhhHHHHHHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 46788888888899999988877776544321111 1111111111112222222222222110 1112
Q ss_pred HHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhh
Q 018721 285 LRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVER 323 (351)
Q Consensus 285 ~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er 323 (351)
-+.-+.---.-++|.++|+-+++-..|++-+++++..-|
T Consensus 526 ~k~~ae~~rq~~~~~~sr~~~~~le~~~~a~qat~d~a~ 564 (961)
T KOG4673|consen 526 EKHQAELTRQKDYYSNSRALAAALEAQALAEQATNDEAR 564 (961)
T ss_pred HHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHhhhhhh
Confidence 333333334567788888888888888877777765443
No 185
>KOG0933 consensus Structural maintenance of chromosome protein 2 (chromosome condensation complex Condensin, subunit E) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=23.76 E-value=6.2e+02 Score=30.08 Aligned_cols=52 Identities=19% Similarity=0.262 Sum_probs=39.1
Q ss_pred hhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 256 LAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 256 la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
+..|++|.-.+..|..+++.|++...+.++.+...=.-+.+|++-|++.+.-
T Consensus 810 ~ek~~~e~e~l~lE~e~l~~e~~~~k~~l~~~~~~~~~l~~e~~~l~~kv~~ 861 (1174)
T KOG0933|consen 810 LEKRENEYERLQLEHEELEKEISSLKQQLEQLEKQISSLKSELGNLEAKVDK 861 (1174)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4455667777888888888888888887777777777788888888877753
No 186
>TIGR03495 phage_LysB phage lysis regulatory protein, LysB family. Members of this protein family are phage lysis regulatory protein, including the well-studied protein LysB (lysis protein B) of Enterobacteria phage P2. For members of this family, genes are found in phage or in prophage regions of bacterial genomes, typically near a phage lysozyme or phage holin.
Probab=23.65 E-value=3.2e+02 Score=24.54 Aligned_cols=80 Identities=21% Similarity=0.246 Sum_probs=35.3
Q ss_pred HHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhh-hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhh
Q 018721 217 YQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAAR-DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYV 295 (351)
Q Consensus 217 YLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r-~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~ 295 (351)
|++.+|..|+..+=.-+..++.=...-.-...|.......+.+ ++.-..|-.+..++.+.+......|.++ .
T Consensus 16 ~~~~~~~~l~~~~~~a~~~~~~~~~~l~~~~~qL~~l~~~a~~~~~~Q~~Lr~~~~~~~~~l~~re~~i~rL-------~ 88 (135)
T TIGR03495 16 WQSQRLRNARADLERANRVLKAQQAELASKANQLIVLLALAKRNEEAQAQLRQQLAQARALLAQREQRIERL-------K 88 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------H
Confidence 5666666666666666655533222211112233222222222 2222333344444444444444444443 3
Q ss_pred hHhHHHHH
Q 018721 296 EENERLRA 303 (351)
Q Consensus 296 eEn~rlRa 303 (351)
.||+-||+
T Consensus 89 ~ENe~lR~ 96 (135)
T TIGR03495 89 RENEDLRR 96 (135)
T ss_pred HcCHHHHH
Confidence 46777664
No 187
>PF12325 TMF_TATA_bd: TATA element modulatory factor 1 TATA binding; InterPro: IPR022091 This is the C-terminal conserved coiled coil region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes []. The proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMF1_TATA_bd is the most conserved part of the TMFs []. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant cells. The Rab6-binding domain appears to be the same region as this C-terminal family [].
Probab=23.62 E-value=5e+02 Score=22.71 Aligned_cols=87 Identities=23% Similarity=0.318 Sum_probs=49.8
Q ss_pred HHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhh
Q 018721 217 YQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVE 296 (351)
Q Consensus 217 YLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~e 296 (351)
.|.-..+.+--++-.++.++++-+..-|. -.+|+=.+..+.+.+++.......+=.++...+..|
T Consensus 20 ~L~s~lr~~E~E~~~l~~el~~l~~~r~~-------------l~~Eiv~l~~~~e~~~~~~~~~~~L~~el~~l~~ry-- 84 (120)
T PF12325_consen 20 RLQSQLRRLEGELASLQEELARLEAERDE-------------LREEIVKLMEENEELRALKKEVEELEQELEELQQRY-- 84 (120)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--
Confidence 66666666666666666666443332110 022444555555555555544444445555555555
Q ss_pred HhHHHHHHHhhhhhhHHHHHHHHH
Q 018721 297 ENERLRAILGEWSTRAAKLERALE 320 (351)
Q Consensus 297 En~rlRa~l~Ews~raakle~ale 320 (351)
+-+--+|||.+-+..-|+.-++
T Consensus 85 --~t~LellGEK~E~veEL~~Dv~ 106 (120)
T PF12325_consen 85 --QTLLELLGEKSEEVEELRADVQ 106 (120)
T ss_pred --HHHHHHhcchHHHHHHHHHHHH
Confidence 3466788888888777766554
No 188
>KOG0980 consensus Actin-binding protein SLA2/Huntingtin-interacting protein Hip1 [Cytoskeleton]
Probab=23.62 E-value=1.2e+03 Score=27.51 Aligned_cols=50 Identities=26% Similarity=0.278 Sum_probs=26.9
Q ss_pred HhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHH
Q 018721 221 NLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSE 276 (351)
Q Consensus 221 HNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~e 276 (351)
-|..|...|.++|.+..+.++. -=+.+..+.+-+||++.+.-|..++|-+
T Consensus 467 ~~~~L~d~le~~~~~~~~~~~K------~e~~~~~le~l~~El~~l~~e~~~lq~~ 516 (980)
T KOG0980|consen 467 ENTNLNDQLEELQRAAGRAETK------TESQAKALESLRQELALLLIELEELQRT 516 (980)
T ss_pred HHHHHHHHHHHHHHHHHHHHHh------hHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444444444444444433333 1234556666677777777777666666
No 189
>KOG3973 consensus Uncharacterized conserved glycine-rich protein [Function unknown]
Probab=23.43 E-value=53 Score=34.27 Aligned_cols=28 Identities=36% Similarity=0.372 Sum_probs=20.5
Q ss_pred HHhhhhcccCCCCChhhhHHHhhhhhhh
Q 018721 234 ECLSKYEQSDDGSTPQVDLAHLLAARDQ 261 (351)
Q Consensus 234 ~~l~kye~~~~g~t~qvdl~h~la~r~q 261 (351)
..++.-.|+-+-...+.|||||||||.-
T Consensus 247 ~~~~~~~rei~~~K~~~dvahLLaArsd 274 (465)
T KOG3973|consen 247 SILSARVREIGRVKANSDVAHLLAARSD 274 (465)
T ss_pred HHHHHHHHHhccccchhHHHHHHHhhhh
Confidence 3455555666666778899999999954
No 190
>PF07926 TPR_MLP1_2: TPR/MLP1/MLP2-like protein; InterPro: IPR012929 This domain is found in a number of proteins, including TPR protein (P12270 from SWISSPROT) and yeast myosin-like proteins 1 (MLP1, Q02455 from SWISSPROT) and 2 (MLP2, P40457 from SWISSPROT). These proteins share a number of features; for example, they all have coiled-coil regions and all three are associated with nuclear pores [, , ]. TPR is thought to be a component of nuclear pore complex- attached intranuclear filaments [], and is implicated in nuclear protein import []. Moreover, its N-terminal region is involved in the activation of oncogenic kinases, possibly by mediating the dimerisation of kinase domains or by targeting these kinases to the nuclear pore complex []. MLP1 and MLP2 are involved in the process of telomere length regulation, where they are thought to interact with proteins such as Tel1p and modulate their activity []. ; GO: 0006606 protein import into nucleus, 0005643 nuclear pore
Probab=23.16 E-value=4.9e+02 Score=22.40 Aligned_cols=29 Identities=17% Similarity=0.307 Sum_probs=17.8
Q ss_pred HHhhhhhhHHHHHHHHHHhhhccHHHHHH
Q 018721 304 ILGEWSTRAAKLERALEVERMSNIELQKK 332 (351)
Q Consensus 304 ~l~Ews~raakle~ale~er~~~~~~~~~ 332 (351)
.=..|..+-..||..++.-.-.--||.++
T Consensus 92 ~e~sw~~qk~~le~e~~~~~~r~~dL~~Q 120 (132)
T PF07926_consen 92 SEASWEEQKEQLEKELSELEQRIEDLNEQ 120 (132)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33569999888888876533333344333
No 191
>PF09731 Mitofilin: Mitochondrial inner membrane protein; InterPro: IPR019133 Mitofilin controls mitochondrial cristae morphology. Mitofilin is enriched in the narrow space between the inner boundary and the outer membranes, where it forms a homotypic interaction and assembles into a large multimeric protein complex []. The first 78 amino acids contain a typical amino-terminal-cleavable mitochondrial presequence (residues 1-43) rich in positive-charged and hydroxylated residues and a membrane anchor domain (residues 47-66). In addition, it has three centrally located coiled coil domains (residues 200-240,280-310 and 400-420) []. ; GO: 0031305 integral to mitochondrial inner membrane
Probab=22.93 E-value=9.1e+02 Score=25.45 Aligned_cols=28 Identities=21% Similarity=0.317 Sum_probs=20.8
Q ss_pred HhhHHHHHHHHHHhHHhhHHHHHHHHHh
Q 018721 209 DEQMALLQYQRENLHFLSEEILRLQECL 236 (351)
Q Consensus 209 EKQADLIrYLkdHNa~LSkrIL~Lq~~l 236 (351)
++...+|...++....|+++|-.++...
T Consensus 247 ~~~~~~i~~a~~~i~~L~~~l~~l~~~~ 274 (582)
T PF09731_consen 247 SDLNSLIAHAKERIDALQKELAELKEEE 274 (582)
T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5667778888888888888877666544
No 192
>PRK05349 Na(+)-translocating NADH-quinone reductase subunit B; Provisional
Probab=22.84 E-value=3.2e+02 Score=28.78 Aligned_cols=52 Identities=17% Similarity=0.133 Sum_probs=39.9
Q ss_pred HHHHHHHhhhhhhccccccchhHHHHHHHHHHHHHHHHHHHHHHHHhHHHhh
Q 018721 51 GYAILAAGTTWIFHPIHYLIPPLLCSCGVILLALTGIFQQYFVYQVQKIRLQ 102 (351)
Q Consensus 51 ~yA~~~~~~pw~~~~~~~~~~~lL~y~~v~LWlLt~l~d~yv~~qH~KlRl~ 102 (351)
.|++.|-+..+++...-...|.=.+|+-.+.=+++=++|+|++.++.|.|.+
T Consensus 350 IyGi~iG~lt~iIR~~g~~yPEGV~fAILlmN~~~PlID~~~v~~~~~~r~~ 401 (405)
T PRK05349 350 IYGALIGFMTVLIRVVNPAYPEGMMLAILFANLFAPLFDYFVVQANIKRRLA 401 (405)
T ss_pred HHHHHHHHHHhhHhhcCCCCchHHHHHHHHHHhhHHHHHHHHHHHHHHHHHh
Confidence 7888888888877754435776666666666778899999999988887765
No 193
>PF03980 Nnf1: Nnf1 ; InterPro: IPR007128 NNF1 is an essential yeast gene required for proper spindle orientation, nucleolar and nuclear envelope structure and mRNA export [].
Probab=22.54 E-value=2.3e+02 Score=23.32 Aligned_cols=38 Identities=29% Similarity=0.434 Sum_probs=25.4
Q ss_pred ChhhhH-HHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHH
Q 018721 247 TPQVDL-AHLLAARDQELRTLSAEMNQLQSELRLARSFV 284 (351)
Q Consensus 247 t~qvdl-~h~la~r~qelRa~~Ae~~q~~~el~~ar~li 284 (351)
+|..|+ +|+...+.++.-.|.+.++.++.|-..-...|
T Consensus 65 ~P~~~i~a~l~~~~~~~~~~L~~~l~~l~~eN~~L~~~i 103 (109)
T PF03980_consen 65 TPEEDIRAHLAPYKKKEREQLNARLQELEEENEALAEEI 103 (109)
T ss_pred ChHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 688888 78888877777677777666666554433333
No 194
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=22.52 E-value=5.9e+02 Score=27.14 Aligned_cols=21 Identities=33% Similarity=0.400 Sum_probs=11.1
Q ss_pred HHHHHHHhHHhhHHHHHHHHH
Q 018721 215 LQYQRENLHFLSEEILRLQEC 235 (351)
Q Consensus 215 IrYLkdHNa~LSkrIL~Lq~~ 235 (351)
+..+++.|..|..++-+|+..
T Consensus 319 l~~~~e~~~~l~~Ei~~l~~s 339 (569)
T PRK04778 319 LEHAKEQNKELKEEIDRVKQS 339 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHc
Confidence 344455555555555555544
No 195
>TIGR02209 ftsL_broad cell division protein FtsL. This model represents FtsL, both forms similar to that in E. coli and similar to that in B. subtilis. FtsL is one of the later proteins active in cell division septum formation. FtsL is small, low in complexity, and highly divergent. The scope of this model is broader than that of the Pfam model pfam04999.3 for FtsL, as this one includes FtsL from Bacillus subtilis and related species.
Probab=22.35 E-value=3.5e+02 Score=20.90 Aligned_cols=39 Identities=26% Similarity=0.369 Sum_probs=17.1
Q ss_pred hhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHH
Q 018721 260 DQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENER 300 (351)
Q Consensus 260 ~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~r 300 (351)
.+++..+-++.+++++|-..-+..|..+- +..|+|+-.|
T Consensus 30 ~~~~~~~~~~~~~l~~en~~L~~ei~~l~--~~~rIe~~Ar 68 (85)
T TIGR02209 30 NNELQKLQLEIDKLQKEWRDLQLEVAELS--RHERIEKIAK 68 (85)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHc--CHHHHHHHHH
Confidence 34444444444444444444444443322 3455555443
No 196
>KOG4807 consensus F-actin binding protein, regulates actin cytoskeletal organization [Cytoskeleton]
Probab=22.14 E-value=3.4e+02 Score=29.11 Aligned_cols=62 Identities=24% Similarity=0.315 Sum_probs=41.6
Q ss_pred HHhHHhhHHHHHHHHHhhhhcccCCCCC--------hhhhHHHhhhhhhhHHHhhHHHHHhHHHHHHHHH
Q 018721 220 ENLHFLSEEILRLQECLSKYEQSDDGST--------PQVDLAHLLAARDQELRTLSAEMNQLQSELRLAR 281 (351)
Q Consensus 220 dHNa~LSkrIL~Lq~~l~kye~~~~g~t--------~qvdl~h~la~r~qelRa~~Ae~~q~~~el~~ar 281 (351)
.||..|+-|+-.=-.+|.-+-+++.|.| +--.|.-+|.-.|-|+.-|..|+.-+..||..|-
T Consensus 470 aHNQELnnRLaaEItrLRtlltgdGgGtGsplaqgkdayELEVLLRVKEsEiQYLKqEissLkDELQtal 539 (593)
T KOG4807|consen 470 AHNQELNNRLAAEITRLRTLLTGDGGGTGSPLAQGKDAYELEVLLRVKESEIQYLKQEISSLKDELQTAL 539 (593)
T ss_pred HHHHHHhhHHHHHHHHHHHHhccCCCCCCCccccCcchhhHHHHHHhhHHHHHHHHHHHHHHHHHHHHHH
Confidence 4565555544433334433334444333 3347788999999999999999999999998874
No 197
>PF10003 DUF2244: Integral membrane protein (DUF2244); InterPro: IPR019253 This entry consists of various bacterial putative membrane proteins with no known function.
Probab=22.03 E-value=1.8e+02 Score=25.34 Aligned_cols=24 Identities=29% Similarity=0.375 Sum_probs=16.6
Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHH
Q 018721 72 PLLCSCGVILLALTGIFQQYFVYQ 95 (351)
Q Consensus 72 ~lL~y~~v~LWlLt~l~d~yv~~q 95 (351)
|++.++-+-++++.+.|..+.++.
T Consensus 37 ~Vl~F~glev~~l~~a~~~~~r~~ 60 (140)
T PF10003_consen 37 PVLPFAGLEVLALWYAFRRNYRHA 60 (140)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhC
Confidence 566666677777777777776653
No 198
>PF03245 Phage_lysis: Bacteriophage Rz lysis protein; InterPro: IPR004929 Many bacteriophages with Gram-negative hosts contain two auxiliary lysis genes Rz and Rz1. These genes are nested, with Rz1 occupying the last third of Rz in a +1 reading frame. Both of these genes are required for host cell lysis if the outer membrane is stabilised by millimolar concentrations of divalent cations, but are otherwise uneccessary []. The Rz protein is believed to posses endopeptidase activity, while Rz1 encodes a prolipoprotein which, after cleavage by a signal peptidase, is located in the outer membrane. It has been suggested that these two proteins may form a complex which cleaves the oligopeptide crosslinks between glycosidic strands in the peptidoglycan and the Lpp lipoproteins of the outer bacterial membrane. For more information see []. This entry represents the Rz protein and related sequences. This family is not considered to be a peptidase according to the MEROPs database.; GO: 0019835 cytolysis
Probab=21.96 E-value=2.3e+02 Score=24.52 Aligned_cols=17 Identities=12% Similarity=0.206 Sum_probs=10.3
Q ss_pred HHHHHHHHhhhhhhhhH
Q 018721 281 RSFVLRVRNTNNQYVEE 297 (351)
Q Consensus 281 r~li~~~r~~n~q~~eE 297 (351)
...+++|-.++.+|-+|
T Consensus 27 ~~~~~~la~ld~k~tkE 43 (125)
T PF03245_consen 27 QQRQQALAALDAKYTKE 43 (125)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 33346666677777665
No 199
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=21.86 E-value=1.4e+03 Score=27.22 Aligned_cols=74 Identities=16% Similarity=0.142 Sum_probs=44.7
Q ss_pred hHHHHHHHHHHhHHhhH--------------------HHHHHHHHhhhhcccCC---CCChhhhHHHhhhhhhhHHHhhH
Q 018721 211 QMALLQYQRENLHFLSE--------------------EILRLQECLSKYEQSDD---GSTPQVDLAHLLAARDQELRTLS 267 (351)
Q Consensus 211 QADLIrYLkdHNa~LSk--------------------rIL~Lq~~l~kye~~~~---g~t~qvdl~h~la~r~qelRa~~ 267 (351)
|++.++.+++...+|.+ ++-.+++++.+.+.... .+.|.-||+..+.....+|-.+.
T Consensus 43 ~k~~~~~l~~tl~~l~~~~~~~~~~~~~~~~i~~ap~~~~~~~~~l~~~~~~~~~~~~~~s~~~Leq~l~~~~~~L~~~q 122 (1109)
T PRK10929 43 QAEIVEALQSALNWLEERKGSLERAKQYQQVIDNFPKLSAELRQQLNNERDEPRSVPPNMSTDALEQEILQVSSQLLEKS 122 (1109)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhcccccccccCCHHHHHHHHHHHHHHHHHHH
Confidence 66666666665555544 45566666665443321 33467788888887777777777
Q ss_pred HHHHhHHHHHHHHHHHH
Q 018721 268 AEMNQLQSELRLARSFV 284 (351)
Q Consensus 268 Ae~~q~~~el~~ar~li 284 (351)
.+..+.++.++.++.-+
T Consensus 123 ~~l~~~~~~~~~~~~~l 139 (1109)
T PRK10929 123 RQAQQEQDRAREISDSL 139 (1109)
T ss_pred HHHHHHhhhhHHHHHHH
Confidence 77766666664444333
No 200
>PRK12821 aspartyl/glutamyl-tRNA amidotransferase subunit C-like protein; Provisional
Probab=21.69 E-value=6.3e+02 Score=27.30 Aligned_cols=121 Identities=18% Similarity=0.187 Sum_probs=70.6
Q ss_pred chhhHHHHHHHHHHHHHHHhhhhhhccccccchhHHHHHHHHHHHHHHHHHHHHHHHHhH-----HHhhhHHHHHHHhhc
Q 018721 39 IFGSVVYCFVLAGYAILAAGTTWIFHPIHYLIPPLLCSCGVILLALTGIFQQYFVYQVQK-----IRLQGYYSFSQKLKH 113 (351)
Q Consensus 39 ~~g~~~y~~~L~~yA~~~~~~pw~~~~~~~~~~~lL~y~~v~LWlLt~l~d~yv~~qH~K-----lRl~GYl~FYR~Tr~ 113 (351)
-+..++|.-+..||++..+.+-|....+..+.-|=..|.+-. +...+|.+..|+. .+-+=|+.|
T Consensus 218 ~is~~~~~~i~~~f~~~~i~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~nh~~~~~~~~k~n~f~~~------ 286 (477)
T PRK12821 218 KISLVVYTWIILYFGIGIIIFMWVLYLVWKLKQPHNAYSLSG-----FFHRRYKHANHQFTLFVLTKENWFYLI------ 286 (477)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCccchhhhhh-----hhhhhhhhcccchhhhhhhhHHHHHHH------
Confidence 356788998999999999999998877766666655555543 3445565555552 122222222
Q ss_pred ccccchhhhhHHHHHHHHHHHhhhcc----------ccccHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhcC
Q 018721 114 IVRLPFAITAYGTAAMLLVIVWRPHI----------SILSISTLL-RIIMLIEAICAASFMSVYIGYVHQYNS 175 (351)
Q Consensus 114 lkRlPl~IvSlGNa~LLLI~a~~~~~----------~~Ls~~~lL-riil~lElv~~l~~li~YIvkV~rFNk 175 (351)
-..++-.|++.+++=++..|.| -|+.+-.+. -++..+.++++-|.+.+ +.-++++|=
T Consensus 287 ----~~~~~~~~~~~~~in~~~~p~~d~~~~~~~y~~w~~~r~~~~p~~~~~~~~vi~pi~~~-~~p~~~~~~ 354 (477)
T PRK12821 287 ----LNVITLAGTSLLMINIAFIPIFDTQTTGQTYAFWLLIRLLFAPAIFLLDIIVIYPILLL-LTPIMLKGF 354 (477)
T ss_pred ----HHHHHHHHHHHHHHHHHhhccccccccCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHhhh
Confidence 2345556666666656666666 244432211 14446677777776543 334555543
No 201
>PF11932 DUF3450: Protein of unknown function (DUF3450); InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=21.61 E-value=5.1e+02 Score=24.41 Aligned_cols=30 Identities=20% Similarity=0.174 Sum_probs=13.8
Q ss_pred hhHHHHHhHHHHHHHHHHHHHHHHhhhhhh
Q 018721 265 TLSAEMNQLQSELRLARSFVLRVRNTNNQY 294 (351)
Q Consensus 265 a~~Ae~~q~~~el~~ar~li~~~r~~n~q~ 294 (351)
.+..|...+.+|++.....+..++.-|+++
T Consensus 46 ~~~~e~~~L~~e~~~l~~e~e~L~~~~~~l 75 (251)
T PF11932_consen 46 QWDDEKQELLAEYRQLEREIENLEVYNEQL 75 (251)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 444444444444444444444444444433
No 202
>PF12911 OppC_N: N-terminal TM domain of oligopeptide transport permease C
Probab=21.60 E-value=61 Score=23.43 Aligned_cols=26 Identities=19% Similarity=0.024 Sum_probs=16.7
Q ss_pred chhhHHHHHHHHHHHHHHHhhhhhhc
Q 018721 39 IFGSVVYCFVLAGYAILAAGTTWIFH 64 (351)
Q Consensus 39 ~~g~~~y~~~L~~yA~~~~~~pw~~~ 64 (351)
--.+++=+++++.+.++|+.+||+.-
T Consensus 15 nk~a~~gl~il~~~vl~ai~~p~~~p 40 (56)
T PF12911_consen 15 NKLAVIGLIILLILVLLAIFAPFISP 40 (56)
T ss_pred CchHHHHHHHHHHHHHHHHHHHHcCC
Confidence 33444445567777778888898644
No 203
>PF09679 TraQ: Type-F conjugative transfer system pilin chaperone (TraQ); InterPro: IPR014112 This entry represents TraQ, a protein that makes a specific interaction with pilin (TraA) to aid its transfer through the inner membrane during the process of F-type conjugative pilus assembly [, ].
Probab=21.45 E-value=87 Score=26.66 Aligned_cols=20 Identities=45% Similarity=0.752 Sum_probs=16.5
Q ss_pred HHHhhhhhhHHHHHHH-HHHh
Q 018721 303 AILGEWSTRAAKLERA-LEVE 322 (351)
Q Consensus 303 a~l~Ews~raakle~a-le~e 322 (351)
.|||+|-+|+.|=||+ ||++
T Consensus 57 rILda~iarv~~eer~~~ear 77 (93)
T PF09679_consen 57 RILDAWIARVSREERAALEAR 77 (93)
T ss_pred HHHHHHHHHHhHHHHHHHHHH
Confidence 5899999999999997 4443
No 204
>PF13815 Dzip-like_N: Iguana/Dzip1-like DAZ-interacting protein N-terminal
Probab=21.23 E-value=2.5e+02 Score=23.85 Aligned_cols=53 Identities=23% Similarity=0.394 Sum_probs=31.0
Q ss_pred HHHHHHHHHhHHhhHHHHHHHHHhhhhcccCCCCChhhhHHHhhhhhhhHHHhhHHHHHhHHHHHH
Q 018721 213 ALLQYQRENLHFLSEEILRLQECLSKYEQSDDGSTPQVDLAHLLAARDQELRTLSAEMNQLQSELR 278 (351)
Q Consensus 213 DLIrYLkdHNa~LSkrIL~Lq~~l~kye~~~~g~t~qvdl~h~la~r~qelRa~~Ae~~q~~~el~ 278 (351)
-.|+||-.-...|+..+-.|++.+..-... .+.-.+++....+|+..++.|+|
T Consensus 66 l~ieYLl~~q~~L~~~~~~l~~~~~~~~~~-------------~~~l~~~~~~~~~~~k~lk~E~k 118 (118)
T PF13815_consen 66 LSIEYLLHCQEYLSSQLEQLEERLQELQQE-------------IEKLKQKLKKQKEEIKKLKKESK 118 (118)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHhcC
Confidence 456787777777777777777766222111 12224455566666666666653
No 205
>PF10224 DUF2205: Predicted coiled-coil protein (DUF2205); InterPro: IPR019357 This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown.
Probab=21.15 E-value=4.5e+02 Score=21.71 Aligned_cols=48 Identities=29% Similarity=0.400 Sum_probs=37.6
Q ss_pred hhHHH-hhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 260 DQELR-TLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 260 ~qelR-a~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
+.|-| .+-.+...+|.-|..-..-|..|+..|++...||+=|..-++-
T Consensus 14 ~~e~k~~Li~ei~~LQ~sL~~L~~Rve~Vk~E~~kL~~EN~~Lq~YI~n 62 (80)
T PF10224_consen 14 EKEEKEELIQEILELQDSLEALSDRVEEVKEENEKLESENEYLQQYIGN 62 (80)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 33434 5556777788888888888899999999999999988776654
No 206
>PF10329 DUF2417: Region of unknown function (DUF2417); InterPro: IPR019431 This entry represents a family of fungal proteins with no known function. In some cases these proteins also contain an alpha/beta hydrolase fold (IPR000073 from INTERPRO).
Probab=21.13 E-value=1.3e+02 Score=29.19 Aligned_cols=40 Identities=15% Similarity=0.170 Sum_probs=21.6
Q ss_pred HHHHHHHHHHHHHHHhHHHhhhHHHHHHHhhcccccchhhhhHHHHHHHH
Q 018721 82 LALTGIFQQYFVYQVQKIRLQGYYSFSQKLKHIVRLPFAITAYGTAAMLL 131 (351)
Q Consensus 82 WlLt~l~d~yv~~qH~KlRl~GYl~FYR~Tr~lkRlPl~IvSlGNa~LLL 131 (351)
|.+..++.-|+---.-.-|+.||++|- +.++|+++-++-+
T Consensus 52 w~v~llvS~F~s~Pg~~~Rg~~F~~~~----------l~~lsl~~~~~~L 91 (232)
T PF10329_consen 52 WWVLLLVSDFFSPPGFNNRGSGFLDFD----------LTLLSLITNLFNL 91 (232)
T ss_pred HHHHHHHHHHhCCCCCCCCCCchHHHH----------HHHHHHHHHHHHH
Confidence 344444444553333455788888874 4555555544433
No 207
>KOG0018 consensus Structural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) [Cell cycle control, cell division, chromosome partitioning]
Probab=21.04 E-value=7.4e+02 Score=29.52 Aligned_cols=85 Identities=20% Similarity=0.379 Sum_probs=45.3
Q ss_pred hHHHhhhhhhhHHHhhHHHHHhHHHHHHH--HHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHH
Q 018721 251 DLAHLLAARDQELRTLSAEMNQLQSELRL--ARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIE 328 (351)
Q Consensus 251 dl~h~la~r~qelRa~~Ae~~q~~~el~~--ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~ 328 (351)
++.--|..||.+.-.|.-.||++.+.+=. |+..=-++|.-.+-.+ ++-.-.-+-|.+..-+|||-.|+-|+= -+
T Consensus 729 ~i~r~l~~~e~~~~~L~~~~n~ved~if~~f~~~igv~ir~Yee~~~--~~~~a~k~~ef~~q~~~l~~~l~fe~~--~d 804 (1141)
T KOG0018|consen 729 EIKRKLQNREGEMKELEERMNKVEDRIFKGFCRRIGVRIREYEEREL--QQEFAKKRLEFENQKAKLENQLDFEKQ--KD 804 (1141)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCeeeehHHHHHH--HHHHHHHHHHHHHHHHHHhhhhhheec--cc
Confidence 44444555555555555555555554321 1111011332222211 222222345788999999999999876 67
Q ss_pred HHHHHHHhhhc
Q 018721 329 LQKKISTRRNQ 339 (351)
Q Consensus 329 ~~~~~~~~r~~ 339 (351)
.++++...+..
T Consensus 805 ~~~~ve~~~~~ 815 (1141)
T KOG0018|consen 805 TQRRVERWERS 815 (1141)
T ss_pred HHHHHHHHHHH
Confidence 77777766653
No 208
>PF12329 TMF_DNA_bd: TATA element modulatory factor 1 DNA binding; InterPro: IPR022092 This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells.
Probab=20.96 E-value=4.3e+02 Score=21.00 Aligned_cols=69 Identities=20% Similarity=0.265 Sum_probs=45.6
Q ss_pred hHHHhhhhhhhHHHhhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHH
Q 018721 251 DLAHLLAARDQELRTLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERAL 319 (351)
Q Consensus 251 dl~h~la~r~qelRa~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~al 319 (351)
++.-.|+..|.....|-.|-..|...--.-+..|-.+|..|..-.++...|..-+++=..-...|+..|
T Consensus 2 sl~~~l~EKDe~Ia~L~eEGekLSk~el~~~~~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~~~l 70 (74)
T PF12329_consen 2 SLEKKLAEKDEQIAQLMEEGEKLSKKELKLNNTIKKLRAKIKELEKQIKELKKKLEELEKELESLEERL 70 (74)
T ss_pred hHHHHHHhHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 345567777777777777776666655555666777887777666677777666666555555555544
No 209
>KOG2302 consensus T-type voltage-gated Ca2+ channel, pore-forming alpha1I subunit [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=20.94 E-value=1.3e+03 Score=28.19 Aligned_cols=133 Identities=14% Similarity=0.167 Sum_probs=85.0
Q ss_pred ccCcccCCCCcccchhhHHHHHHHHHHHHHHHhhhhhhccccccchhHHHHHHH--HHHHHHHHHHHHHHH-HHhHHHhh
Q 018721 26 HEAPLLGHRKSHSIFGSVVYCFVLAGYAILAAGTTWIFHPIHYLIPPLLCSCGV--ILLALTGIFQQYFVY-QVQKIRLQ 102 (351)
Q Consensus 26 ~e~p~~~~r~~~~~~g~~~y~~~L~~yA~~~~~~pw~~~~~~~~~~~lL~y~~v--~LWlLt~l~d~yv~~-qH~KlRl~ 102 (351)
.+.|-+|-|-+-++--.++.||+++---+=+++.-|.|----.+..+ -++ .+.+++.++|-.+-- +-.-.|..
T Consensus 1147 ~i~~~s~EriFltlsnyIFtaIfV~Em~lKVVALGl~fge~aYl~ss----WN~LDgflv~vsviDilvs~asa~g~kIL 1222 (1956)
T KOG2302|consen 1147 AIVEGSTERIFLTLSNYIFTAIFVVEMTLKVVALGLYFGEQAYLRSS----WNVLDGFLVAVSVIDILVSQASAGGAKIL 1222 (1956)
T ss_pred ccccCcceEEEEEecchHHHHHHHHHHHHHHHhhhhccchHHHHHHH----HHhhhHHHHHHHHHHHHHHHhhhhhHHHH
Confidence 35677888888888889999999998888888888866532222222 121 344555556654433 33456778
Q ss_pred hHHHHHHHhhcccccchhhhhHHHHHHHHHHHhhhcccccc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q 018721 103 GYYSFSQKLKHIVRLPFAITAYGTAAMLLVIVWRPHISILS-ISTLLRIIMLIEAICAASFMSVYIGYVHQYN 174 (351)
Q Consensus 103 GYl~FYR~Tr~lkRlPl~IvSlGNa~LLLI~a~~~~~~~Ls-~~~lLriil~lElv~~l~~li~YIvkV~rFN 174 (351)
|-++-.|-+|-++ |+.++|--+.+=|++-++..+ |. +-+|+ ++|...++|.=|+-|.-|-
T Consensus 1223 gVlrvLRlLRtlR--pLRviSra~glklVveTL~sS---LkpIgnIv-------liccaffiiFgilgvqLFk 1283 (1956)
T KOG2302|consen 1223 GVLRVLRLLRTLR--PLRVISRAPGLKLVVETLISS---LKPIGNIV-------LICCAFFIIFGILGVQLFK 1283 (1956)
T ss_pred HHHHHHHHHHHhh--HHHHHhhcccHHHHHHHHHhc---cccHHHHH-------HHHHHHHHHHHHHHHHHhc
Confidence 8888777777766 788899988888888777653 33 33333 2344444444455555553
No 210
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=20.79 E-value=1.5e+03 Score=27.17 Aligned_cols=61 Identities=20% Similarity=0.200 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHHHHHhhhccHHHHHHHHHhhh
Q 018721 275 SELRLARSFVLRVRNTNNQYVEENERLRAILGEWSTRAAKLERALEVERMSNIELQKKISTRRN 338 (351)
Q Consensus 275 ~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~ale~er~~~~~~~~~~~~~r~ 338 (351)
.++..++.-+.+++..-.+..+|.+.+...+.+.+.+.+.++..++.. .-+++.++.+.|.
T Consensus 875 ~~~~~a~~~le~ae~~l~~~~~e~~~~~~e~~~a~~~l~~l~e~l~~~---~eel~a~L~e~r~ 935 (1353)
T TIGR02680 875 TRAAEQRARAARAESDAREAAEDAAEARAEAEEASLRLRTLEESVGAM---VDEIRARLAETRA 935 (1353)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence 333333333344444444445555556666666666666666666543 2344444444443
No 211
>PF08961 DUF1875: Domain of unknown function (DUF1875); InterPro: IPR015056 MIT can be found in the Nuclear receptor-binding factor 2, it has no known function. ; PDB: 2CRB_A.
Probab=20.68 E-value=33 Score=33.48 Aligned_cols=35 Identities=26% Similarity=0.319 Sum_probs=0.0
Q ss_pred HHHHHhhhhhhhhHhHHHHHHHhhhhhhHHHHHHH
Q 018721 284 VLRVRNTNNQYVEENERLRAILGEWSTRAAKLERA 318 (351)
Q Consensus 284 i~~~r~~n~q~~eEn~rlRa~l~Ews~raakle~a 318 (351)
|..+|..=.--++||+|||..-..|+++.|+|...
T Consensus 131 I~dLrrlVe~L~aeNErLr~EnkqL~ae~arL~k~ 165 (243)
T PF08961_consen 131 IADLRRLVEFLLAENERLRRENKQLKAENARLLKG 165 (243)
T ss_dssp -----------------------------------
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 34555555667899999999999999999998443
No 212
>PF10234 Cluap1: Clusterin-associated protein-1; InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell [].
Probab=20.63 E-value=3.9e+02 Score=26.56 Aligned_cols=43 Identities=19% Similarity=0.342 Sum_probs=32.9
Q ss_pred hhHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhHhHHHHHHHhh
Q 018721 265 TLSAEMNQLQSELRLARSFVLRVRNTNNQYVEENERLRAILGE 307 (351)
Q Consensus 265 a~~Ae~~q~~~el~~ar~li~~~r~~n~q~~eEn~rlRa~l~E 307 (351)
+|.+.+..-.+||..++.=++.+..+.-.|++|=|+|.+.|.+
T Consensus 194 ~Le~KIekkk~ELER~qKRL~sLq~vRPAfmdEyEklE~EL~~ 236 (267)
T PF10234_consen 194 NLEAKIEKKKQELERNQKRLQSLQSVRPAFMDEYEKLEEELQK 236 (267)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHH
Confidence 4555566666677777776788888888999999999888765
No 213
>PHA02246 hypothetical protein
Probab=20.39 E-value=1e+02 Score=28.99 Aligned_cols=83 Identities=17% Similarity=0.233 Sum_probs=52.9
Q ss_pred HHHHHhhcccccchhhhhHHHHHHHHHHHhh-------hccc-----cccHHHHHH--HHHHHHHHHHHHHHHHHHHHHH
Q 018721 106 SFSQKLKHIVRLPFAITAYGTAAMLLVIVWR-------PHIS-----ILSISTLLR--IIMLIEAICAASFMSVYIGYVH 171 (351)
Q Consensus 106 ~FYR~Tr~lkRlPl~IvSlGNa~LLLI~a~~-------~~~~-----~Ls~~~lLr--iil~lElv~~l~~li~YIvkV~ 171 (351)
.||.-.+- -...|+|+|.|--..|-+.|.. +-|. .+|...++- ..-.-+.+.....+..|+-++.
T Consensus 51 SfyNlL~T-~~~~fqi~svg~nl~lgivcLlv~~~rkkd~f~~~fiiifSLllfll~~~~evtQtVat~tIiLaYi~QII 129 (192)
T PHA02246 51 SFYNLLLT-DASVFQIVSVGLNLTLGIVCLLVASYRKKDYFSIPFIIVFSLLLFLLSDFTALTQTVATITIILAYVTQIT 129 (192)
T ss_pred HHHHHHhc-CCceEEEeeeehhhhhhhhheeeehhhccccccchHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Confidence 34443332 3447899999988887777752 1111 122211110 0123467777888889999999
Q ss_pred HhcCCCCCCchhhhccCC
Q 018721 172 QYNSLNSQPDVMKSLYSP 189 (351)
Q Consensus 172 rFNk~kp~PDVl~ee~s~ 189 (351)
+|=|.|..-|.-...|.-
T Consensus 130 qfyKTK~SEg~n~~l~li 147 (192)
T PHA02246 130 TFYKTKSAEGTNRFLFLI 147 (192)
T ss_pred HHhhhcccCCCChhHHHH
Confidence 999999998888776653
No 214
>PF06818 Fez1: Fez1; InterPro: IPR009638 This family represents the eukaryotic Fez1 protein. Fez1 contains a leucine-zipper region with similarity to the DNA-binding domain of the cAMP-responsive activating-transcription factor 5 []. There is evidence that Fez1 inhibits cancer cell growth through regulation of mitosis, and that its alterations result in abnormal cell growth []. Note that some family members contain more than one copy of this region.; GO: 0005737 cytoplasm, 0016020 membrane
Probab=20.27 E-value=7.8e+02 Score=23.66 Aligned_cols=47 Identities=21% Similarity=0.322 Sum_probs=26.6
Q ss_pred HHHHHHHHhhhhcccCCCCChhh-hHHHhhhhhhhHHHhhHHHHHhHH
Q 018721 228 EILRLQECLSKYEQSDDGSTPQV-DLAHLLAARDQELRTLSAEMNQLQ 274 (351)
Q Consensus 228 rIL~Lq~~l~kye~~~~g~t~qv-dl~h~la~r~qelRa~~Ae~~q~~ 274 (351)
+|+.|..++..-...-+++.++. ++...+-++..|+-....|+.+..
T Consensus 32 Eiv~Lr~ql~e~~~~l~~~~~~~~~l~~~~~~K~~ELE~ce~ELqr~~ 79 (202)
T PF06818_consen 32 EIVSLRAQLRELRAELRNKESQIQELQDSLRTKQLELEVCENELQRKK 79 (202)
T ss_pred HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhHhHHHhHHHHHHHh
Confidence 68888888866555555555554 445555555554444444433333
No 215
>KOG0963 consensus Transcription factor/CCAAT displacement protein CDP1 [Transcription]
Probab=20.20 E-value=1.2e+03 Score=26.01 Aligned_cols=117 Identities=26% Similarity=0.303 Sum_probs=70.8
Q ss_pred hhHhhHHHHHHH---HHHhHHhhHHHHHHHHH-------h----hhhcccCCCCChhhhH-HHhhhhhhhHHHhhHHHHH
Q 018721 207 LSDEQMALLQYQ---RENLHFLSEEILRLQEC-------L----SKYEQSDDGSTPQVDL-AHLLAARDQELRTLSAEMN 271 (351)
Q Consensus 207 LlEKQADLIrYL---kdHNa~LSkrIL~Lq~~-------l----~kye~~~~g~t~qvdl-~h~la~r~qelRa~~Ae~~ 271 (351)
.-|+|++|+.=. ++.|..+-++|..||.. + ++|+....+-..+|.+ -.=|+.-.+-++.+..|..
T Consensus 180 ~~e~e~~L~~~~~~~~~q~~~le~ki~~lq~a~~~t~~el~~~~s~~dee~~~k~aev~lim~eLe~aq~ri~~lE~e~e 259 (629)
T KOG0963|consen 180 WAEREAGLKDEEQNLQEQLEELEKKISSLQSAIEDTQNELFDLKSKYDEEVAAKAAEVSLIMTELEDAQQRIVFLEREVE 259 (629)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 667777776543 44455555567666433 2 2333333333455544 2223444556677777777
Q ss_pred hHHHHHHHHHHHH---------------HHHHhhhhhhhhHhHHHHHH----HhhhhhhHHHHHHHHHHhh
Q 018721 272 QLQSELRLARSFV---------------LRVRNTNNQYVEENERLRAI----LGEWSTRAAKLERALEVER 323 (351)
Q Consensus 272 q~~~el~~ar~li---------------~~~r~~n~q~~eEn~rlRa~----l~Ews~raakle~ale~er 323 (351)
++..++..+-+-. -+.-..|.|-..+-+|+++. ++.|-..-+.||+-+++-+
T Consensus 260 ~L~~ql~~~N~~~~~~~~~~i~~~~~~L~~kd~~i~~L~~di~~~~~S~~~e~e~~~~qI~~le~~l~~~~ 330 (629)
T KOG0963|consen 260 QLREQLAKANSSKKLAKIDDIDALGSVLNQKDSEIAQLSNDIERLEASLVEEREKHKAQISALEKELKAKI 330 (629)
T ss_pred HHHHHHHhhhhhhhhccCCchHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 7777777766653 22344455666666776665 5779999999999988754
No 216
>PRK11578 macrolide transporter subunit MacA; Provisional
Probab=20.13 E-value=8.3e+02 Score=23.97 Aligned_cols=28 Identities=36% Similarity=0.432 Sum_probs=15.1
Q ss_pred HHHhhHHHHHhHHHHHHHHHHHHHHHHh
Q 018721 262 ELRTLSAEMNQLQSELRLARSFVLRVRN 289 (351)
Q Consensus 262 elRa~~Ae~~q~~~el~~ar~li~~~r~ 289 (351)
++..+.+++.+.++++..|+....|.+.
T Consensus 107 ~l~~~~~~l~~a~~~l~~a~~~~~r~~~ 134 (370)
T PRK11578 107 TLMELRAQRQQAEAELKLARVTLSRQQR 134 (370)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444555555566666666555544443
No 217
>PF15099 PIRT: Phosphoinositide-interacting protein family
Probab=20.08 E-value=1.6e+02 Score=26.56 Aligned_cols=40 Identities=35% Similarity=0.601 Sum_probs=24.3
Q ss_pred hhhHHHHHHHHHHHHHHHhhhhhhcc-ccccchhHHHHHHHHHH
Q 018721 40 FGSVVYCFVLAGYAILAAGTTWIFHP-IHYLIPPLLCSCGVILL 82 (351)
Q Consensus 40 ~g~~~y~~~L~~yA~~~~~~pw~~~~-~~~~~~~lL~y~~v~LW 82 (351)
+|+++ +++|-++.|++..+--+. +..+..|.+++.-..|+
T Consensus 55 fG~vi---li~GvvvT~vays~n~~~si~~~~G~vlLs~GLmlL 95 (129)
T PF15099_consen 55 FGVVI---LIAGVVVTAVAYSFNSHGSIISIFGPVLLSLGLMLL 95 (129)
T ss_pred ehHHH---HHHhhHhheeeEeecCCcchhhhehHHHHHHHHHHH
Confidence 57776 888888888876663333 24455565554443333
Done!