Query psy16295
Match_columns 128
No_of_seqs 103 out of 210
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 17:42:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16295.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16295hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3364|consensus 100.0 5.6E-45 1.2E-49 272.9 10.5 109 2-112 34-142 (149)
2 PF14853 Fis1_TPR_C: Fis1 C-te 99.9 7.4E-26 1.6E-30 144.5 6.7 53 41-93 1-53 (53)
3 PF14852 Fis1_TPR_N: Fis1 N-te 99.6 2.3E-15 5.1E-20 88.9 3.7 32 2-33 3-34 (35)
4 PF13432 TPR_16: Tetratricopep 98.8 4.2E-08 9.2E-13 61.8 7.6 65 4-75 1-65 (65)
5 PF14559 TPR_19: Tetratricopep 98.7 3.7E-08 8E-13 62.1 6.3 64 17-84 5-68 (68)
6 PF13371 TPR_9: Tetratricopept 98.6 1.3E-07 2.8E-12 60.4 6.4 66 16-85 8-73 (73)
7 PRK10370 formate-dependent nit 98.5 4.4E-07 9.6E-12 70.1 8.5 79 3-85 110-188 (198)
8 COG4235 Cytochrome c biogenesi 98.5 3.5E-07 7.5E-12 75.8 7.8 77 4-84 194-270 (287)
9 PF07719 TPR_2: Tetratricopept 98.3 1E-06 2.3E-11 48.7 4.3 33 42-74 2-34 (34)
10 PF13414 TPR_11: TPR repeat; P 98.3 5.4E-06 1.2E-10 52.3 7.2 62 4-72 7-69 (69)
11 TIGR02552 LcrH_SycD type III s 98.3 1.1E-05 2.4E-10 56.4 9.4 76 3-85 20-95 (135)
12 PLN03088 SGT1, suppressor of 98.2 1.7E-05 3.7E-10 66.3 10.3 82 3-91 39-120 (356)
13 PRK15359 type III secretion sy 98.2 2.3E-05 5.1E-10 57.3 9.3 66 18-87 73-138 (144)
14 PF00515 TPR_1: Tetratricopept 98.1 3.9E-06 8.5E-11 46.9 3.7 33 42-74 2-34 (34)
15 TIGR02552 LcrH_SycD type III s 98.1 5E-05 1.1E-09 53.0 9.5 79 4-89 55-133 (135)
16 PF12895 Apc3: Anaphase-promot 98.0 1E-05 2.2E-10 53.5 5.2 65 16-83 2-66 (84)
17 TIGR02795 tol_pal_ybgF tol-pal 98.0 5.2E-05 1.1E-09 50.9 8.0 69 3-75 5-73 (119)
18 cd00189 TPR Tetratricopeptide 97.9 0.00039 8.4E-09 42.0 10.2 69 3-78 3-71 (100)
19 TIGR02795 tol_pal_ybgF tol-pal 97.8 0.00031 6.7E-09 47.1 8.7 75 3-81 42-116 (119)
20 PF13174 TPR_6: Tetratricopept 97.8 6.2E-05 1.3E-09 41.0 4.2 33 42-74 1-33 (33)
21 PRK10803 tol-pal system protei 97.7 0.00021 4.5E-09 58.1 8.5 75 3-80 145-219 (263)
22 PF13428 TPR_14: Tetratricopep 97.7 0.00013 2.7E-09 43.5 5.2 40 42-81 2-41 (44)
23 PRK15359 type III secretion sy 97.7 0.0003 6.5E-09 51.5 8.2 77 4-87 28-104 (144)
24 PRK10866 outer membrane biogen 97.7 0.00017 3.7E-09 57.5 7.3 61 17-78 46-106 (243)
25 cd00189 TPR Tetratricopeptide 97.6 0.00048 1E-08 41.6 7.5 63 4-73 38-100 (100)
26 TIGR02521 type_IV_pilW type IV 97.6 0.00069 1.5E-08 48.8 9.5 70 3-79 34-103 (234)
27 PRK02603 photosystem I assembl 97.6 0.00078 1.7E-08 50.0 9.5 80 3-86 38-117 (172)
28 PF12895 Apc3: Anaphase-promot 97.6 0.00028 6.1E-09 46.5 6.3 56 3-66 28-83 (84)
29 PF13429 TPR_15: Tetratricopep 97.6 0.00016 3.5E-09 57.0 5.8 71 3-80 183-253 (280)
30 PF13181 TPR_8: Tetratricopept 97.6 0.00013 2.8E-09 40.4 3.6 33 42-74 2-34 (34)
31 PF13525 YfiO: Outer membrane 97.6 0.00039 8.4E-09 53.5 7.5 72 3-78 8-79 (203)
32 PLN03088 SGT1, suppressor of 97.5 0.0005 1.1E-08 57.5 8.1 68 17-88 16-83 (356)
33 TIGR03302 OM_YfiO outer membra 97.5 0.00082 1.8E-08 51.2 8.6 72 2-77 35-106 (235)
34 PF13432 TPR_16: Tetratricopep 97.5 0.00031 6.7E-09 43.9 5.2 45 45-89 1-45 (65)
35 PRK10370 formate-dependent nit 97.5 0.00074 1.6E-08 52.1 8.3 64 16-83 52-115 (198)
36 TIGR02521 type_IV_pilW type IV 97.4 0.0029 6.3E-08 45.6 10.1 77 3-84 102-178 (234)
37 PF13414 TPR_11: TPR repeat; P 97.4 0.0007 1.5E-08 42.5 5.8 50 40-89 2-51 (69)
38 TIGR00990 3a0801s09 mitochondr 97.2 0.0025 5.4E-08 56.1 9.8 69 4-79 369-437 (615)
39 smart00028 TPR Tetratricopepti 97.2 0.00089 1.9E-08 33.7 4.2 33 42-74 2-34 (34)
40 PRK09782 bacteriophage N4 rece 97.2 0.0021 4.6E-08 60.8 9.3 73 3-82 646-718 (987)
41 PRK11189 lipoprotein NlpI; Pro 97.1 0.0043 9.2E-08 50.2 9.4 60 19-82 80-139 (296)
42 TIGR03302 OM_YfiO outer membra 97.1 0.0066 1.4E-07 46.2 9.7 74 4-81 74-155 (235)
43 PRK10049 pgaA outer membrane p 97.1 0.0036 7.9E-08 57.0 9.5 63 19-85 409-471 (765)
44 PF12688 TPR_5: Tetratrico pep 97.1 0.0082 1.8E-07 43.6 9.5 79 2-84 3-84 (120)
45 PF13429 TPR_15: Tetratricopep 97.0 0.0023 5E-08 50.4 6.9 69 6-79 116-184 (280)
46 PRK10803 tol-pal system protei 97.0 0.0065 1.4E-07 49.4 9.6 78 3-84 183-260 (263)
47 PRK12370 invasion protein regu 97.0 0.0048 1E-07 54.1 9.2 57 18-78 353-409 (553)
48 PRK15363 pathogenicity island 97.0 0.0045 9.8E-08 47.4 7.9 68 4-78 39-106 (157)
49 CHL00033 ycf3 photosystem I as 97.0 0.012 2.7E-07 43.2 9.9 79 4-86 39-117 (168)
50 TIGR02917 PEP_TPR_lipo putativ 97.0 0.0055 1.2E-07 52.7 9.1 69 4-80 808-876 (899)
51 TIGR02917 PEP_TPR_lipo putativ 96.9 0.0071 1.5E-07 52.0 9.4 70 3-79 162-231 (899)
52 PRK12370 invasion protein regu 96.9 0.0086 1.9E-07 52.5 9.7 65 15-83 316-380 (553)
53 PF13176 TPR_7: Tetratricopept 96.9 0.0017 3.7E-08 37.2 3.5 32 43-74 1-34 (36)
54 PRK10747 putative protoheme IX 96.8 0.0053 1.2E-07 51.6 7.7 66 17-86 308-373 (398)
55 KOG0543|consensus 96.8 0.01 2.3E-07 51.4 9.4 81 3-90 260-344 (397)
56 PF13512 TPR_18: Tetratricopep 96.8 0.0031 6.6E-08 47.6 5.3 71 4-78 14-84 (142)
57 PRK15174 Vi polysaccharide exp 96.8 0.011 2.4E-07 53.1 9.7 78 3-84 249-327 (656)
58 PF06552 TOM20_plant: Plant sp 96.6 0.0088 1.9E-07 47.1 7.0 81 4-88 29-137 (186)
59 COG4105 ComL DNA uptake lipopr 96.6 0.0049 1.1E-07 50.6 5.6 63 16-79 47-109 (254)
60 PRK11189 lipoprotein NlpI; Pro 96.6 0.026 5.5E-07 45.7 9.6 56 17-76 112-167 (296)
61 PRK10049 pgaA outer membrane p 96.5 0.02 4.3E-07 52.3 9.6 71 4-81 363-433 (765)
62 TIGR00540 hemY_coli hemY prote 96.5 0.012 2.5E-07 49.6 7.6 66 16-85 312-381 (409)
63 PF07720 TPR_3: Tetratricopept 96.5 0.0062 1.3E-07 35.7 4.2 34 41-74 1-36 (36)
64 PRK11788 tetratricopeptide rep 96.4 0.034 7.5E-07 44.9 9.7 52 19-74 196-247 (389)
65 PRK02603 photosystem I assembl 96.4 0.021 4.5E-07 42.3 7.7 66 3-75 75-154 (172)
66 TIGR00990 3a0801s09 mitochondr 96.4 0.025 5.5E-07 49.8 9.4 59 18-80 414-472 (615)
67 PRK15174 Vi polysaccharide exp 96.4 0.031 6.6E-07 50.3 9.8 70 3-79 287-356 (656)
68 COG4783 Putative Zn-dependent 96.3 0.025 5.5E-07 50.1 8.8 82 3-91 377-458 (484)
69 PF13424 TPR_12: Tetratricopep 96.3 0.012 2.7E-07 37.7 5.2 65 3-70 8-75 (78)
70 PF13371 TPR_9: Tetratricopept 96.3 0.008 1.7E-07 37.9 4.1 43 48-90 2-44 (73)
71 KOG1126|consensus 96.2 0.012 2.7E-07 53.5 6.1 78 3-84 453-532 (638)
72 PRK15179 Vi polysaccharide bio 96.1 0.027 5.7E-07 51.7 8.2 58 21-82 104-161 (694)
73 TIGR03504 FimV_Cterm FimV C-te 96.1 0.016 3.5E-07 35.5 4.7 40 45-85 3-42 (44)
74 PRK11447 cellulose synthase su 96.1 0.018 3.8E-07 54.8 7.2 57 17-77 283-339 (1157)
75 PRK15363 pathogenicity island 96.1 0.06 1.3E-06 41.3 8.8 81 3-88 70-150 (157)
76 PRK09782 bacteriophage N4 rece 96.1 0.032 6.9E-07 53.0 8.7 76 4-84 46-121 (987)
77 cd05804 StaR_like StaR_like; a 96.1 0.044 9.5E-07 43.8 8.3 74 2-79 8-81 (355)
78 PLN02789 farnesyltranstransfer 96.0 0.031 6.7E-07 46.6 7.5 62 20-85 125-186 (320)
79 COG4235 Cytochrome c biogenesi 96.0 0.035 7.6E-07 46.3 7.7 67 13-83 132-198 (287)
80 PRK15179 Vi polysaccharide bio 95.9 0.059 1.3E-06 49.5 9.3 69 3-78 123-191 (694)
81 PF13374 TPR_10: Tetratricopep 95.9 0.016 3.4E-07 32.5 3.7 29 42-70 3-31 (42)
82 CHL00033 ycf3 photosystem I as 95.9 0.082 1.8E-06 38.8 8.5 68 3-77 75-156 (168)
83 PLN02789 farnesyltranstransfer 95.8 0.077 1.7E-06 44.2 9.0 80 4-89 40-120 (320)
84 PRK11447 cellulose synthase su 95.7 0.038 8.2E-07 52.5 7.7 61 16-80 474-534 (1157)
85 PRK11788 tetratricopeptide rep 95.7 0.036 7.8E-07 44.8 6.4 32 47-78 186-217 (389)
86 KOG2002|consensus 95.6 0.059 1.3E-06 51.3 8.3 80 3-87 683-762 (1018)
87 PF04733 Coatomer_E: Coatomer 95.5 0.032 7E-07 45.8 5.6 70 3-77 168-237 (290)
88 PF13525 YfiO: Outer membrane 95.5 0.21 4.5E-06 38.3 9.6 81 2-86 44-135 (203)
89 PRK14720 transcript cleavage f 95.4 0.038 8.3E-07 52.3 6.5 56 21-80 100-155 (906)
90 PLN03098 LPA1 LOW PSII ACCUMUL 95.3 0.053 1.1E-06 47.8 6.5 39 39-77 73-111 (453)
91 PF14559 TPR_19: Tetratricopep 95.2 0.048 1E-06 33.8 4.4 32 52-83 2-33 (68)
92 PF09976 TPR_21: Tetratricopep 95.2 0.17 3.7E-06 36.4 7.9 54 20-74 28-81 (145)
93 KOG0553|consensus 95.0 0.14 2.9E-06 43.2 7.9 67 19-89 131-197 (304)
94 PLN03098 LPA1 LOW PSII ACCUMUL 95.0 0.095 2.1E-06 46.2 7.3 62 2-70 77-141 (453)
95 COG1729 Uncharacterized protei 95.0 0.12 2.6E-06 42.6 7.4 71 3-77 144-214 (262)
96 PF14863 Alkyl_sulf_dimr: Alky 95.0 0.12 2.6E-06 38.8 6.7 51 40-90 69-119 (141)
97 cd05804 StaR_like StaR_like; a 94.9 0.12 2.6E-06 41.3 7.1 51 19-73 130-180 (355)
98 PRK14574 hmsH outer membrane p 94.8 0.08 1.7E-06 49.5 6.7 61 16-80 115-175 (822)
99 PF13424 TPR_12: Tetratricopep 94.8 0.05 1.1E-06 34.8 3.8 32 39-70 3-34 (78)
100 PF13431 TPR_17: Tetratricopep 94.8 0.027 5.8E-07 32.1 2.2 32 27-62 3-34 (34)
101 KOG0543|consensus 94.6 0.047 1E-06 47.4 4.3 39 40-78 256-294 (397)
102 KOG1125|consensus 94.6 0.16 3.5E-06 46.0 7.6 68 5-79 290-357 (579)
103 KOG2076|consensus 94.4 0.17 3.8E-06 47.7 7.7 70 4-79 418-487 (895)
104 PF14561 TPR_20: Tetratricopep 94.4 0.44 9.4E-06 32.8 7.9 59 22-84 7-67 (90)
105 PF09976 TPR_21: Tetratricopep 94.2 0.17 3.7E-06 36.4 5.9 65 16-82 61-125 (145)
106 KOG1156|consensus 94.2 0.24 5.2E-06 45.6 7.9 65 21-89 59-123 (700)
107 PF12569 NARP1: NMDA receptor- 94.1 0.13 2.8E-06 45.9 6.0 45 43-87 196-240 (517)
108 PF09986 DUF2225: Uncharacteri 94.1 0.28 6E-06 38.8 7.3 72 4-75 122-199 (214)
109 PRK10747 putative protoheme IX 93.9 0.41 8.9E-06 40.3 8.5 65 16-84 131-196 (398)
110 KOG4162|consensus 93.7 0.2 4.4E-06 46.7 6.6 65 6-75 724-788 (799)
111 PRK11906 transcriptional regul 93.6 0.55 1.2E-05 41.6 8.8 74 4-84 342-415 (458)
112 KOG0548|consensus 92.8 0.65 1.4E-05 41.9 8.3 71 16-90 371-441 (539)
113 PRK10866 outer membrane biogen 92.8 1.8 3.8E-05 34.5 10.0 79 3-85 72-168 (243)
114 COG5010 TadD Flp pilus assembl 92.8 0.34 7.3E-06 40.0 6.0 56 19-78 116-171 (257)
115 PRK11906 transcriptional regul 92.5 1.1 2.4E-05 39.7 9.3 75 7-85 262-348 (458)
116 PF09295 ChAPs: ChAPs (Chs5p-A 92.1 0.25 5.5E-06 42.7 4.7 44 20-67 251-294 (395)
117 KOG0553|consensus 92.1 0.39 8.4E-06 40.5 5.6 56 17-76 95-150 (304)
118 KOG3081|consensus 91.9 0.77 1.7E-05 38.6 7.2 68 6-77 175-243 (299)
119 TIGR00540 hemY_coli hemY prote 91.9 2.5 5.3E-05 35.6 10.3 50 20-73 170-219 (409)
120 PF12688 TPR_5: Tetratrico pep 91.8 1.2 2.7E-05 32.2 7.4 63 3-69 41-103 (120)
121 PF03704 BTAD: Bacterial trans 91.7 1.7 3.6E-05 30.9 7.9 39 41-79 62-100 (146)
122 COG1729 Uncharacterized protei 91.6 0.87 1.9E-05 37.6 7.1 79 4-86 182-260 (262)
123 PF03704 BTAD: Bacterial trans 91.4 0.61 1.3E-05 33.2 5.4 50 15-68 74-123 (146)
124 PRK15331 chaperone protein Sic 91.4 1.3 2.8E-05 34.3 7.5 64 5-75 42-105 (165)
125 PRK10941 hypothetical protein; 91.3 1.7 3.6E-05 35.7 8.5 74 7-87 188-261 (269)
126 COG4976 Predicted methyltransf 91.3 0.45 9.7E-06 39.5 5.0 72 6-84 1-72 (287)
127 PF13428 TPR_14: Tetratricopep 91.1 0.7 1.5E-05 27.0 4.6 41 2-49 3-43 (44)
128 PF04733 Coatomer_E: Coatomer 91.1 1.7 3.7E-05 35.7 8.4 73 4-83 205-278 (290)
129 COG3063 PilF Tfp pilus assembl 90.9 0.95 2.1E-05 37.2 6.6 68 5-78 108-176 (250)
130 PF12569 NARP1: NMDA receptor- 90.4 0.79 1.7E-05 40.9 6.2 57 16-76 17-73 (517)
131 COG3063 PilF Tfp pilus assembl 89.9 0.99 2.1E-05 37.1 5.9 47 39-85 33-79 (250)
132 PF07721 TPR_4: Tetratricopept 89.7 0.43 9.3E-06 25.3 2.5 23 43-65 3-25 (26)
133 PRK10153 DNA-binding transcrip 89.1 3.5 7.6E-05 36.7 9.3 29 6-35 345-373 (517)
134 KOG1840|consensus 88.6 0.92 2E-05 40.6 5.3 49 20-71 265-313 (508)
135 KOG1174|consensus 88.5 1.8 3.8E-05 38.8 6.8 65 20-89 455-519 (564)
136 KOG2581|consensus 88.4 1.6 3.4E-05 38.8 6.4 48 39-87 245-292 (493)
137 COG3763 Uncharacterized protei 88.3 0.64 1.4E-05 31.5 3.1 26 95-120 4-31 (71)
138 KOG4162|consensus 88.1 2 4.3E-05 40.4 7.2 66 16-85 457-523 (799)
139 KOG3785|consensus 88.1 1.2 2.6E-05 39.3 5.5 54 16-72 35-88 (557)
140 COG4455 ImpE Protein of avirul 87.5 3.8 8.3E-05 33.9 7.8 80 8-94 9-89 (273)
141 PRK00523 hypothetical protein; 87.4 0.78 1.7E-05 31.1 3.2 28 94-121 4-33 (72)
142 PF13281 DUF4071: Domain of un 87.3 4.8 0.0001 34.8 8.7 80 2-84 181-269 (374)
143 PF11846 DUF3366: Domain of un 87.3 7.3 0.00016 29.3 8.9 64 12-80 120-183 (193)
144 KOG0548|consensus 87.2 2.1 4.6E-05 38.7 6.6 45 40-84 69-113 (539)
145 PRK14574 hmsH outer membrane p 86.9 2.5 5.4E-05 39.8 7.3 60 19-82 466-525 (822)
146 KOG1128|consensus 86.9 1.8 3.8E-05 40.6 6.1 58 16-77 498-555 (777)
147 KOG1155|consensus 86.9 3.3 7.2E-05 37.4 7.6 71 17-89 480-555 (559)
148 COG5010 TadD Flp pilus assembl 86.5 5.1 0.00011 33.1 8.0 65 22-90 85-149 (257)
149 KOG1173|consensus 86.2 4.6 9.9E-05 37.0 8.2 71 18-92 470-541 (611)
150 KOG4234|consensus 86.0 5 0.00011 33.0 7.6 37 45-81 172-208 (271)
151 KOG0624|consensus 86.0 5.9 0.00013 34.9 8.6 66 16-85 320-385 (504)
152 KOG4340|consensus 85.2 2.1 4.6E-05 37.1 5.4 61 9-76 19-79 (459)
153 KOG1125|consensus 85.1 2.5 5.3E-05 38.6 6.0 58 16-75 407-464 (579)
154 KOG3060|consensus 84.8 3.6 7.9E-05 34.5 6.5 51 21-75 138-188 (289)
155 KOG0624|consensus 84.6 3.7 8.1E-05 36.2 6.7 63 7-76 196-258 (504)
156 COG3071 HemY Uncharacterized e 84.4 6.3 0.00014 34.5 8.0 72 10-85 301-372 (400)
157 PRK10153 DNA-binding transcrip 84.3 3.7 8E-05 36.5 6.7 39 42-81 421-459 (517)
158 PF13281 DUF4071: Domain of un 83.9 9.3 0.0002 33.1 8.8 78 14-91 152-233 (374)
159 KOG2076|consensus 83.7 8.2 0.00018 37.0 8.9 55 17-75 153-207 (895)
160 PF09295 ChAPs: ChAPs (Chs5p-A 83.6 11 0.00024 32.7 9.2 57 18-81 184-240 (395)
161 KOG1129|consensus 83.5 2 4.4E-05 37.6 4.6 40 43-82 6-45 (478)
162 KOG1174|consensus 83.3 12 0.00026 33.7 9.3 80 39-120 298-386 (564)
163 PF13512 TPR_18: Tetratricopep 83.2 11 0.00025 28.3 8.1 74 2-79 49-137 (142)
164 KOG3785|consensus 83.2 5.8 0.00013 35.2 7.3 76 16-95 164-244 (557)
165 PF14938 SNAP: Soluble NSF att 82.8 21 0.00046 28.5 10.1 79 5-86 160-244 (282)
166 PF05843 Suf: Suppressor of fo 82.8 3.6 7.8E-05 33.2 5.6 68 7-78 40-107 (280)
167 KOG4626|consensus 82.6 4.1 8.9E-05 38.3 6.4 66 5-77 325-390 (966)
168 PRK01844 hypothetical protein; 82.6 1.8 3.9E-05 29.4 3.1 18 104-121 15-32 (72)
169 KOG0376|consensus 82.5 3.7 7.9E-05 36.7 5.9 71 19-93 54-124 (476)
170 KOG1126|consensus 82.3 6.3 0.00014 36.4 7.5 64 21-88 541-604 (638)
171 COG4783 Putative Zn-dependent 82.1 7.8 0.00017 34.7 7.8 65 19-87 356-420 (484)
172 PRK15331 chaperone protein Sic 82.0 9.9 0.00021 29.4 7.5 65 6-76 75-139 (165)
173 TIGR03504 FimV_Cterm FimV C-te 81.8 4.2 9.2E-05 24.7 4.4 30 2-34 1-30 (44)
174 PF13830 DUF4192: Domain of un 81.4 11 0.00025 31.0 8.2 66 20-87 255-320 (324)
175 PF10579 Rapsyn_N: Rapsyn N-te 81.1 7.7 0.00017 26.8 5.9 53 15-68 18-70 (80)
176 KOG2002|consensus 80.2 3.8 8.3E-05 39.5 5.5 68 15-85 176-243 (1018)
177 COG2956 Predicted N-acetylgluc 80.2 3.2 7E-05 35.9 4.6 54 17-74 194-247 (389)
178 PF04212 MIT: MIT (microtubule 78.8 5.2 0.00011 25.5 4.4 34 57-90 28-61 (69)
179 PF06957 COPI_C: Coatomer (COP 78.7 3.1 6.7E-05 36.6 4.2 36 50-85 309-344 (422)
180 KOG0547|consensus 77.8 4.2 9E-05 37.0 4.8 56 16-75 373-428 (606)
181 PRK10941 hypothetical protein; 77.5 2.4 5.1E-05 34.8 3.0 36 41-76 181-216 (269)
182 PRK14720 transcript cleavage f 77.3 11 0.00024 36.2 7.6 75 4-86 120-194 (906)
183 PF04211 MtrC: Tetrahydrometha 76.1 9.9 0.00022 31.6 6.2 49 71-119 194-242 (262)
184 cd02683 MIT_1 MIT: domain cont 76.0 6.8 0.00015 26.2 4.4 60 19-90 3-62 (77)
185 PRK12793 flaF flagellar biosyn 75.7 16 0.00035 26.6 6.6 66 1-84 1-78 (115)
186 COG4785 NlpI Lipoprotein NlpI, 75.2 5.5 0.00012 33.2 4.5 39 39-77 97-135 (297)
187 PF12862 Apc5: Anaphase-promot 74.7 11 0.00024 25.4 5.3 58 17-74 12-74 (94)
188 PF10300 DUF3808: Protein of u 73.4 10 0.00022 33.2 6.0 53 16-70 280-334 (468)
189 PF09613 HrpB1_HrpK: Bacterial 73.2 19 0.00041 27.7 6.8 52 39-90 42-93 (160)
190 cd02682 MIT_AAA_Arch MIT: doma 72.6 10 0.00023 25.6 4.7 34 57-90 29-62 (75)
191 PLN03081 pentatricopeptide (PP 72.4 10 0.00022 34.0 5.9 27 48-74 501-527 (697)
192 KOG1156|consensus 72.4 7.3 0.00016 36.3 5.0 45 38-82 368-412 (700)
193 COG4649 Uncharacterized protei 71.8 39 0.00085 27.2 8.4 62 2-69 134-195 (221)
194 PF14938 SNAP: Soluble NSF att 71.4 12 0.00026 30.0 5.6 58 17-75 129-189 (282)
195 KOG0545|consensus 71.2 17 0.00036 30.8 6.5 64 17-80 199-269 (329)
196 PF11588 DUF3243: Protein of u 71.1 4.2 9.2E-05 28.1 2.5 31 47-85 31-61 (81)
197 KOG4626|consensus 70.6 12 0.00026 35.4 6.0 56 15-74 230-285 (966)
198 PF10602 RPN7: 26S proteasome 70.4 28 0.0006 26.4 7.2 64 17-80 10-75 (177)
199 KOG0551|consensus 70.4 14 0.0003 32.2 6.0 28 47-74 125-152 (390)
200 COG0457 NrfG FOG: TPR repeat [ 70.1 21 0.00046 23.5 5.7 53 17-73 181-234 (291)
201 COG3629 DnrI DNA-binding trans 70.0 22 0.00048 29.6 7.0 79 18-100 168-252 (280)
202 TIGR01148 mtrC N5-methyltetrah 69.6 15 0.00033 30.5 5.8 50 71-120 193-245 (265)
203 KOG2471|consensus 69.5 11 0.00025 34.5 5.5 49 40-88 618-668 (696)
204 cd02678 MIT_VPS4 MIT: domain c 69.2 13 0.00027 24.4 4.4 60 19-90 3-62 (75)
205 PF02184 HAT: HAT (Half-A-TPR) 68.9 5.3 0.00012 23.0 2.2 22 55-76 1-22 (32)
206 PF09797 NatB_MDM20: N-acetylt 68.9 21 0.00044 29.7 6.6 72 12-90 192-263 (365)
207 KOG0551|consensus 68.1 21 0.00046 31.1 6.6 47 39-85 151-197 (390)
208 PF01239 PPTA: Protein prenylt 67.7 7.2 0.00016 21.1 2.6 26 59-84 1-26 (31)
209 PRK01030 tetrahydromethanopter 67.5 20 0.00043 29.8 6.1 50 71-120 187-243 (264)
210 PF04781 DUF627: Protein of un 67.4 9.2 0.0002 27.8 3.7 31 47-77 2-32 (111)
211 smart00745 MIT Microtubule Int 67.3 15 0.00033 23.6 4.5 62 17-90 3-64 (77)
212 KOG3060|consensus 66.8 19 0.00041 30.3 5.9 57 16-77 167-227 (289)
213 PF04184 ST7: ST7 protein; In 66.1 25 0.00055 31.9 6.9 56 3-63 262-317 (539)
214 COG2956 Predicted N-acetylgluc 65.7 42 0.00091 29.3 7.9 69 16-87 227-295 (389)
215 KOG1130|consensus 64.8 8.2 0.00018 34.8 3.6 53 9-64 26-78 (639)
216 smart00386 HAT HAT (Half-A-TPR 64.0 10 0.00022 19.3 2.7 22 55-76 1-22 (33)
217 PF11817 Foie-gras_1: Foie gra 63.7 34 0.00073 27.1 6.7 56 14-70 149-207 (247)
218 PF11087 DUF2881: Protein of u 63.5 4 8.6E-05 25.9 1.0 23 102-126 14-36 (54)
219 KOG1127|consensus 63.2 15 0.00033 36.0 5.3 57 17-77 16-73 (1238)
220 cd00197 VHS_ENTH_ANTH VHS, ENT 63.1 15 0.00032 25.5 4.1 25 10-34 45-69 (115)
221 COG4700 Uncharacterized protei 62.4 45 0.00097 27.3 7.1 51 21-73 142-192 (251)
222 cd02656 MIT MIT: domain contai 62.4 20 0.00044 23.1 4.4 35 56-90 28-62 (75)
223 cd02684 MIT_2 MIT: domain cont 61.6 22 0.00047 23.6 4.5 60 19-90 3-62 (75)
224 COG2023 RPR2 RNase P subunit R 61.2 13 0.00027 27.0 3.4 33 38-71 11-43 (105)
225 PF04910 Tcf25: Transcriptiona 60.9 15 0.00032 31.2 4.4 34 44-77 106-140 (360)
226 KOG1840|consensus 60.9 47 0.001 29.9 7.7 47 21-70 182-228 (508)
227 PF09613 HrpB1_HrpK: Bacterial 60.8 74 0.0016 24.5 7.8 73 7-88 51-123 (160)
228 PF10363 DUF2435: Protein of u 60.7 52 0.0011 22.6 6.9 69 3-73 4-76 (92)
229 PF14689 SPOB_a: Sensor_kinase 60.3 10 0.00022 24.2 2.6 31 39-69 19-51 (62)
230 TIGR01716 RGG_Cterm transcript 59.9 31 0.00066 26.1 5.6 83 3-86 131-216 (220)
231 PRK04841 transcriptional regul 59.3 50 0.0011 30.1 7.7 72 4-75 413-486 (903)
232 PLN03081 pentatricopeptide (PP 59.3 45 0.00097 30.0 7.4 56 16-74 439-494 (697)
233 KOG4648|consensus 59.2 6 0.00013 35.0 1.8 43 25-75 89-131 (536)
234 PF10661 EssA: WXG100 protein 58.9 11 0.00024 28.3 3.0 26 92-118 119-144 (145)
235 KOG1129|consensus 58.3 6.7 0.00014 34.4 1.9 51 21-75 308-358 (478)
236 PF08238 Sel1: Sel1 repeat; I 57.8 27 0.00059 18.8 3.9 32 2-33 3-38 (39)
237 cd02680 MIT_calpain7_2 MIT: do 57.6 22 0.00047 24.0 3.9 16 18-33 2-17 (75)
238 KOG0550|consensus 57.4 81 0.0018 28.4 8.4 76 19-98 303-381 (486)
239 PF13646 HEAT_2: HEAT repeats; 57.3 46 0.001 20.9 6.9 61 3-74 17-77 (88)
240 PF10883 DUF2681: Protein of u 56.2 13 0.00029 25.9 2.8 25 95-119 3-27 (87)
241 PRK11901 hypothetical protein; 55.4 11 0.00024 32.2 2.7 25 91-115 37-61 (327)
242 KOG4507|consensus 54.9 71 0.0015 30.2 7.8 76 3-87 645-722 (886)
243 TIGR00756 PPR pentatricopeptid 54.9 16 0.00034 18.7 2.4 23 47-69 6-28 (35)
244 PF06552 TOM20_plant: Plant sp 54.4 65 0.0014 25.5 6.7 54 22-79 10-73 (186)
245 KOG1173|consensus 54.2 30 0.00066 31.9 5.4 61 16-79 427-493 (611)
246 PF05844 YopD: YopD protein; 53.9 4.3 9.3E-05 34.2 0.0 32 90-121 119-151 (298)
247 KOG4648|consensus 53.9 28 0.0006 30.9 4.9 37 48-84 172-208 (536)
248 KOG3824|consensus 53.7 18 0.0004 31.6 3.8 31 48-78 123-153 (472)
249 PF05283 MGC-24: Multi-glycosy 53.5 13 0.00028 29.3 2.6 27 93-119 158-185 (186)
250 KOG0547|consensus 53.3 52 0.0011 30.2 6.7 62 5-73 433-494 (606)
251 COG5442 FlaF Flagellar biosynt 52.9 52 0.0011 24.1 5.4 70 1-84 1-78 (115)
252 KOG1127|consensus 52.8 48 0.001 32.8 6.6 71 15-89 574-644 (1238)
253 PF11207 DUF2989: Protein of u 52.8 28 0.00061 27.8 4.5 44 17-60 154-197 (203)
254 PF01102 Glycophorin_A: Glycop 52.4 23 0.00049 26.1 3.6 24 93-116 68-91 (122)
255 PF05957 DUF883: Bacterial pro 52.3 52 0.0011 22.2 5.2 39 75-116 56-94 (94)
256 TIGR02561 HrpB1_HrpK type III 52.0 84 0.0018 24.2 6.8 78 12-96 22-99 (153)
257 PF13041 PPR_2: PPR repeat fam 52.0 17 0.00037 21.2 2.5 27 48-74 10-38 (50)
258 PF12862 Apc5: Anaphase-promot 51.8 37 0.0008 22.8 4.4 34 45-78 2-35 (94)
259 KOG4340|consensus 51.0 56 0.0012 28.6 6.3 65 17-85 58-122 (459)
260 KOG1155|consensus 50.8 85 0.0019 28.6 7.5 67 18-88 447-520 (559)
261 KOG2053|consensus 50.4 96 0.0021 30.1 8.1 65 16-84 22-86 (932)
262 PF01535 PPR: PPR repeat; Int 50.1 18 0.00038 18.4 2.1 23 48-70 7-29 (31)
263 PF12301 CD99L2: CD99 antigen 49.9 12 0.00027 28.9 2.0 17 104-120 124-140 (169)
264 COG3947 Response regulator con 49.8 42 0.00091 28.9 5.3 38 47-84 285-322 (361)
265 PF10255 Paf67: RNA polymerase 49.6 27 0.00058 30.6 4.2 49 18-69 137-192 (404)
266 COG3071 HemY Uncharacterized e 49.0 23 0.0005 31.1 3.7 35 50-84 162-196 (400)
267 PRK10404 hypothetical protein; 47.8 60 0.0013 22.9 5.1 28 85-115 73-100 (101)
268 PF12968 DUF3856: Domain of Un 47.7 32 0.0007 26.1 3.8 30 41-70 55-84 (144)
269 KOG0550|consensus 46.7 55 0.0012 29.4 5.7 60 22-85 188-247 (486)
270 PF12854 PPR_1: PPR repeat 46.6 17 0.00036 20.2 1.7 22 46-67 12-33 (34)
271 PF10300 DUF3808: Protein of u 46.4 57 0.0012 28.5 5.8 62 15-80 245-306 (468)
272 COG4575 ElaB Uncharacterized c 46.1 1.1E+02 0.0024 22.1 8.2 79 17-116 15-104 (104)
273 COG4218 MtrF Tetrahydromethano 45.9 65 0.0014 21.9 4.7 25 71-95 13-37 (73)
274 PF12732 YtxH: YtxH-like prote 45.7 28 0.00061 22.6 3.0 28 97-125 4-31 (74)
275 PF12652 CotJB: CotJB protein; 44.9 67 0.0015 21.8 4.8 36 54-89 8-43 (78)
276 PRK04841 transcriptional regul 44.9 1.2E+02 0.0025 27.7 7.8 70 5-77 696-767 (903)
277 PLN03218 maturation of RBCL 1; 44.3 63 0.0014 31.5 6.2 27 43-69 616-642 (1060)
278 PF07332 DUF1469: Protein of u 44.1 44 0.00095 23.3 4.0 26 94-119 73-98 (121)
279 PF12768 Rax2: Cortical protei 43.9 15 0.00032 30.4 1.7 27 93-119 231-257 (281)
280 PF02259 FAT: FAT domain; Int 43.7 1.5E+02 0.0034 23.3 7.5 73 3-78 187-295 (352)
281 COG0457 NrfG FOG: TPR repeat [ 43.3 93 0.002 20.3 7.8 23 50-72 139-161 (291)
282 PF11349 DUF3151: Protein of u 43.1 41 0.00088 25.2 3.7 37 40-76 91-127 (129)
283 KOG0292|consensus 43.1 38 0.00082 33.2 4.4 37 49-85 1092-1128(1202)
284 COG4062 MtrB Tetrahydromethano 42.8 36 0.00078 24.7 3.3 23 67-89 30-52 (108)
285 KOG2003|consensus 42.7 90 0.0019 28.9 6.5 58 17-78 504-561 (840)
286 PLN03218 maturation of RBCL 1; 42.3 2.1E+02 0.0046 28.0 9.3 55 17-74 698-754 (1060)
287 cd02677 MIT_SNX15 MIT: domain 41.6 68 0.0015 21.3 4.4 60 19-90 3-62 (75)
288 PF05131 Pep3_Vps18: Pep3/Vps1 41.6 14 0.0003 27.7 1.1 24 39-66 105-128 (147)
289 PF04910 Tcf25: Transcriptiona 41.1 61 0.0013 27.5 5.0 37 28-68 31-67 (360)
290 KOG4234|consensus 40.7 95 0.0021 25.7 5.8 44 41-84 134-177 (271)
291 PLN03077 Protein ECB2; Provisi 40.7 1.1E+02 0.0023 28.2 6.9 26 42-67 692-717 (857)
292 KOG0545|consensus 38.9 2.5E+02 0.0054 24.0 9.3 67 4-77 234-300 (329)
293 PF13908 Shisa: Wnt and FGF in 38.7 13 0.00028 28.1 0.5 24 94-117 80-103 (179)
294 KOG3026|consensus 38.3 71 0.0015 26.5 4.8 36 54-89 5-40 (262)
295 PF09205 DUF1955: Domain of un 38.0 1.9E+02 0.0041 22.4 6.8 37 24-64 107-143 (161)
296 TIGR00823 EIIA-LAC phosphotran 37.9 43 0.00093 23.6 3.1 28 40-67 16-43 (99)
297 cd00215 PTS_IIA_lac PTS_IIA, P 37.5 52 0.0011 23.1 3.4 27 41-67 15-41 (97)
298 COG2015 Alkyl sulfatase and re 37.0 62 0.0013 29.8 4.6 47 41-87 452-498 (655)
299 PF03672 UPF0154: Uncharacteri 36.7 37 0.0008 22.5 2.4 13 108-120 12-24 (64)
300 PF02255 PTS_IIA: PTS system, 36.5 56 0.0012 22.8 3.5 29 40-68 13-41 (96)
301 PRK01209 cobD cobalamin biosyn 36.5 1.4E+02 0.0031 24.7 6.5 45 52-96 109-156 (312)
302 PF02479 Herpes_IE68: Herpesvi 36.3 40 0.00086 25.3 2.8 44 43-91 64-107 (132)
303 KOG0546|consensus 36.1 95 0.0021 27.1 5.4 47 39-85 307-353 (372)
304 PF06072 Herpes_US9: Alphaherp 35.6 1.3E+02 0.0028 19.7 5.7 14 71-84 5-18 (60)
305 COG4061 MtrC Tetrahydromethano 35.1 64 0.0014 26.6 4.0 48 71-118 193-241 (262)
306 smart00567 EZ_HEAT E-Z type HE 34.5 70 0.0015 16.7 3.0 27 2-32 3-29 (30)
307 PRK09591 celC cellobiose phosp 34.0 58 0.0013 23.1 3.3 28 40-67 19-46 (104)
308 smart00671 SEL1 Sel1-like repe 33.9 75 0.0016 16.6 3.5 31 2-32 3-34 (36)
309 PF06957 COPI_C: Coatomer (COP 33.7 1.8E+02 0.0038 25.8 6.8 48 48-97 211-263 (422)
310 KOG2376|consensus 33.7 1.8E+02 0.0039 27.2 7.0 62 4-75 83-144 (652)
311 PF06812 ImpA-rel_N: ImpA-rela 33.3 68 0.0015 20.1 3.2 27 6-32 26-52 (62)
312 KOG0495|consensus 33.2 2.5E+02 0.0054 27.0 7.9 39 52-90 662-700 (913)
313 PRK07630 CobD/CbiB family prot 33.1 1.6E+02 0.0036 24.4 6.3 44 53-96 106-152 (312)
314 KOG2796|consensus 33.0 35 0.00075 29.3 2.2 39 47-85 258-296 (366)
315 PF02259 FAT: FAT domain; Int 32.0 1.5E+02 0.0033 23.3 5.7 41 26-69 17-57 (352)
316 PF03186 CobD_Cbib: CobD/Cbib 31.9 1.8E+02 0.004 23.9 6.3 46 51-96 105-153 (295)
317 PF03392 OS-D: Insect pheromon 31.6 94 0.002 21.6 4.0 32 59-90 19-50 (95)
318 PF07439 DUF1515: Protein of u 31.6 46 0.001 24.4 2.4 38 75-114 54-97 (112)
319 KOG1310|consensus 31.5 1.6E+02 0.0035 27.6 6.3 48 43-90 447-494 (758)
320 KOG1941|consensus 31.2 79 0.0017 28.3 4.2 46 38-83 203-252 (518)
321 KOG3824|consensus 30.9 92 0.002 27.4 4.5 66 19-88 132-204 (472)
322 PLN03077 Protein ECB2; Provisi 30.8 1.1E+02 0.0025 28.1 5.4 33 43-75 356-390 (857)
323 PRK15180 Vi polysaccharide bio 30.6 1.2E+02 0.0025 28.4 5.2 37 42-78 777-817 (831)
324 KOG2062|consensus 30.5 2.3E+02 0.005 27.5 7.3 58 18-78 38-99 (929)
325 PF07079 DUF1347: Protein of u 30.5 3.2E+02 0.007 25.0 7.9 77 1-83 423-503 (549)
326 KOG0615|consensus 30.4 32 0.00069 30.8 1.7 19 58-76 411-429 (475)
327 PF05808 Podoplanin: Podoplani 30.3 17 0.00037 28.2 0.0 23 88-110 121-146 (162)
328 PRK10454 PTS system N,N'-diace 30.3 84 0.0018 22.8 3.6 29 40-68 30-58 (115)
329 PF12729 4HB_MCP_1: Four helix 30.2 95 0.0021 21.4 3.9 31 93-123 9-39 (181)
330 PF11981 DUF3482: Domain of un 30.0 73 0.0016 26.5 3.7 34 87-120 151-193 (292)
331 KOG0495|consensus 29.8 2.7E+02 0.0058 26.9 7.5 69 16-88 664-732 (913)
332 COG3629 DnrI DNA-binding trans 29.7 1.7E+02 0.0037 24.4 5.8 41 39-79 151-191 (280)
333 KOG2300|consensus 29.3 1.2E+02 0.0026 28.0 5.1 69 3-71 407-475 (629)
334 TIGR01987 HI0074 nucleotidyltr 29.3 79 0.0017 23.0 3.4 24 53-76 1-24 (123)
335 PF04406 TP6A_N: Type IIB DNA 29.0 46 0.001 21.3 1.9 52 21-74 13-64 (68)
336 KOG2376|consensus 28.9 2.8E+02 0.006 26.0 7.4 67 19-97 357-423 (652)
337 PRK08878 adenosylcobinamide-ph 28.8 2.3E+02 0.005 23.7 6.5 44 53-96 117-163 (317)
338 TIGR00380 cobD cobalamin biosy 28.6 3.3E+02 0.0071 22.8 7.4 44 53-96 114-160 (305)
339 KOG1839|consensus 28.5 73 0.0016 31.9 3.9 54 19-75 954-1014(1236)
340 KOG1550|consensus 28.5 4.3E+02 0.0092 23.6 9.2 31 41-71 360-394 (552)
341 PF05843 Suf: Suppressor of fo 28.4 2E+02 0.0044 23.0 6.0 56 18-77 16-72 (280)
342 PLN03094 Substrate binding sub 28.1 84 0.0018 27.2 3.8 44 72-116 62-109 (370)
343 KOG2003|consensus 27.9 3.6E+02 0.0077 25.1 7.8 61 16-80 639-699 (840)
344 PRK03954 ribonuclease P protei 27.8 94 0.002 22.8 3.6 32 38-70 19-50 (121)
345 PF13812 PPR_3: Pentatricopept 27.2 98 0.0021 15.8 3.5 25 45-69 5-29 (34)
346 KOG1070|consensus 27.1 2.5E+02 0.0054 29.2 7.1 60 18-82 1545-1604(1710)
347 TIGR03017 EpsF chain length de 27.1 97 0.0021 26.1 4.0 32 94-125 400-432 (444)
348 COG1270 CbiB Cobalamin biosynt 27.0 2.7E+02 0.0057 23.8 6.6 46 51-96 112-160 (320)
349 COG4872 Predicted membrane pro 27.0 1.4E+02 0.0031 26.1 4.9 40 73-112 22-65 (394)
350 PRK10132 hypothetical protein; 26.9 2.4E+02 0.0052 20.1 11.0 35 79-116 73-107 (108)
351 PF11691 DUF3288: Protein of u 26.5 1.2E+02 0.0027 21.3 3.8 28 57-84 8-35 (90)
352 KOG1380|consensus 26.5 48 0.001 29.0 2.0 34 40-73 328-361 (409)
353 KOG1915|consensus 26.4 52 0.0011 30.3 2.4 25 53-77 449-473 (677)
354 PF00244 14-3-3: 14-3-3 protei 26.4 3E+02 0.0066 21.8 6.6 71 18-91 16-89 (236)
355 COG5499 Predicted transcriptio 26.3 87 0.0019 23.1 3.1 28 54-81 11-38 (120)
356 KOG2396|consensus 25.9 1.4E+02 0.003 27.5 4.9 51 22-76 124-175 (568)
357 TIGR02230 ATPase_gene1 F0F1-AT 25.7 2.2E+02 0.0047 20.3 5.0 23 92-115 46-68 (100)
358 PF09670 Cas_Cas02710: CRISPR- 25.6 3.2E+02 0.007 23.2 7.0 68 15-85 143-214 (379)
359 PRK09956 hypothetical protein; 25.6 3.9E+02 0.0084 22.1 7.8 81 7-94 176-256 (308)
360 PF12911 OppC_N: N-terminal TM 25.4 1.1E+02 0.0023 18.4 3.0 28 90-118 16-43 (56)
361 PF02439 Adeno_E3_CR2: Adenovi 25.2 96 0.0021 18.6 2.6 13 97-109 7-19 (38)
362 PF07849 DUF1641: Protein of u 24.5 80 0.0017 18.7 2.2 25 8-32 14-38 (42)
363 COG1447 CelC Phosphotransferas 24.5 1.2E+02 0.0026 21.9 3.5 26 41-66 19-44 (105)
364 PF06041 DUF924: Bacterial pro 24.4 82 0.0018 24.4 2.9 25 9-33 112-136 (188)
365 KOG3364|consensus 24.2 77 0.0017 24.3 2.6 38 39-76 30-70 (149)
366 PF08424 NRDE-2: NRDE-2, neces 24.2 3.3E+02 0.0072 22.3 6.6 56 16-75 44-99 (321)
367 PF14007 YtpI: YtpI-like prote 24.1 82 0.0018 22.0 2.5 24 98-121 61-84 (89)
368 COG3118 Thioredoxin domain-con 24.0 2.7E+02 0.0059 23.7 6.1 68 26-96 122-206 (304)
369 PRK12482 flagellar motor prote 23.5 2.3E+02 0.0051 23.6 5.6 18 16-33 51-68 (287)
370 PF12304 BCLP: Beta-casein lik 23.3 83 0.0018 25.0 2.7 42 66-115 24-66 (188)
371 PRK12794 flaF flagellar biosyn 23.3 3E+02 0.0065 20.0 6.7 67 1-84 3-80 (122)
372 COG1849 Uncharacterized protei 23.2 45 0.00099 23.5 1.1 13 2-14 62-74 (90)
373 cd00280 TRFH Telomeric Repeat 23.0 1.8E+02 0.0039 23.3 4.6 68 20-88 86-158 (200)
374 PF11640 TAN: Telomere-length 22.9 1.6E+02 0.0035 21.6 4.1 33 1-33 1-35 (155)
375 KOG1086|consensus 22.4 2.2E+02 0.0048 26.0 5.4 51 8-68 179-229 (594)
376 KOG4459|consensus 22.2 71 0.0015 28.7 2.4 39 39-77 131-169 (471)
377 COG1963 Uncharacterized protei 22.1 2.5E+02 0.0054 21.7 5.0 19 73-92 96-114 (153)
378 PF12323 HTH_OrfB_IS605: Helix 22.1 62 0.0013 19.0 1.4 17 69-85 8-24 (46)
379 PRK13807 maltose phosphorylase 21.7 52 0.0011 30.9 1.5 63 38-111 609-687 (756)
380 PF03241 HpaB: 4-hydroxyphenyl 21.6 2E+02 0.0044 22.4 4.6 54 3-70 152-205 (205)
381 PHA02537 M terminase endonucle 21.2 4.5E+02 0.0097 21.3 7.1 50 39-89 165-225 (230)
382 COG3014 Uncharacterized protei 21.2 1.7E+02 0.0037 25.9 4.4 31 41-71 125-155 (449)
383 COG3671 Predicted membrane pro 21.2 1.2E+02 0.0026 22.6 3.0 29 91-119 28-56 (125)
384 KOG4279|consensus 21.0 46 0.001 32.3 1.0 80 2-84 242-330 (1226)
385 COG4700 Uncharacterized protei 21.0 4.4E+02 0.0095 21.7 6.4 26 44-69 92-117 (251)
386 PF07980 SusD: SusD family; I 20.9 3.7E+02 0.0079 20.1 6.5 32 38-69 130-161 (266)
387 PF10579 Rapsyn_N: Rapsyn N-te 20.7 2.9E+02 0.0064 19.0 5.6 33 49-81 14-46 (80)
388 PF09813 Coiled-coil_56: Coile 20.6 2.9E+02 0.0063 19.8 4.8 22 79-100 39-60 (100)
389 KOG2058|consensus 20.6 1.2E+02 0.0027 26.8 3.5 47 24-75 147-193 (436)
390 PRK13184 pknD serine/threonine 20.5 6.6E+02 0.014 24.6 8.5 83 2-89 514-600 (932)
391 PRK14750 kdpF potassium-transp 20.4 1.6E+02 0.0034 16.6 2.7 17 96-112 3-19 (29)
392 PF13179 DUF4006: Family of un 20.4 1.6E+02 0.0034 19.7 3.2 13 91-103 11-23 (66)
393 PF12921 ATP13: Mitochondrial 20.3 2E+02 0.0044 20.7 4.1 32 41-72 52-83 (126)
394 PF03564 DUF1759: Protein of u 20.2 3.2E+02 0.0069 19.2 6.2 68 9-86 17-101 (145)
395 COG2976 Uncharacterized protei 20.1 2.1E+02 0.0046 23.1 4.4 36 50-85 168-203 (207)
396 PF06195 DUF996: Protein of un 20.1 1.8E+02 0.004 21.4 3.9 32 94-125 74-106 (139)
397 KOG2053|consensus 20.1 1.6E+02 0.0034 28.8 4.2 40 51-90 19-58 (932)
No 1
>KOG3364|consensus
Probab=100.00 E-value=5.6e-45 Score=272.85 Aligned_cols=109 Identities=43% Similarity=0.748 Sum_probs=102.3
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
++||||||||||++.+|+++||.+|+++++++.| +++|||+||||+||||+|+|++|++|++++|++||+|+||..||
T Consensus 34 s~f~lAwaLV~S~~~~dv~~GI~iLe~l~~~~~~--~~rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~Lk 111 (149)
T KOG3364|consen 34 SQFNLAWALVRSRDTEDVQEGIVILEDLLKSAHP--ERRRECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQALELK 111 (149)
T ss_pred HHHHHHHHHHcccchHHHHHhHHHHHHHhhhcCc--ccchhhhhhhHHHHHHHhhHHHHHHHHHHHHhhCCCcHHHHHHH
Confidence 6999999999999999999999999999985436 89999999999999999999999999999999999999999999
Q ss_pred HHHHHHHhhhHHHHHHHHHHHHHHHHhhHHH
Q psy16295 82 SIVKKRMETEGLKGMALAGGAVLAQGQGTME 112 (128)
Q Consensus 82 ~~Ie~~i~kdGliG~~i~gga~~~~g~~~~~ 112 (128)
+.|+++|+|||+|||+|+||+++++|++..+
T Consensus 112 ~~ied~itkegliGm~v~gGa~~~v~gl~g~ 142 (149)
T KOG3364|consen 112 ETIEDKITKEGLIGMVVVGGAALAVGGLAGI 142 (149)
T ss_pred HHHHHHHhhcceeeeeehhhHHHHHHHHHHH
Confidence 9999999999999999999999777755433
No 2
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=99.93 E-value=7.4e-26 Score=144.53 Aligned_cols=53 Identities=45% Similarity=0.913 Sum_probs=50.4
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhhhHH
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMETEGL 93 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~kdGl 93 (128)
|||+||||+||||+|||++|++|++.+|++||+|+||++|+++|+++|++||+
T Consensus 1 Rd~lY~lAig~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~~L~~~i~~~i~kdgl 53 (53)
T PF14853_consen 1 RDCLYYLAIGHYKLGEYEKARRYCDALLEIEPDNRQAQSLKELIEDKIQKDGL 53 (53)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHTTS-HHHHHHHHHHHHHHHHTTT
T ss_pred ChhHHHHHHHHHHhhhHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHHhccCC
Confidence 89999999999999999999999999999999999999999999999999996
No 3
>PF14852 Fis1_TPR_N: Fis1 N-terminal tetratricopeptide repeat; PDB: 1IYG_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A 1PC2_A 1NZN_A.
Probab=99.57 E-value=2.3e-15 Score=88.88 Aligned_cols=32 Identities=53% Similarity=0.832 Sum_probs=31.1
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhcc
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTE 33 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~ 33 (128)
|+||||||||||++++|+++||.||+++++++
T Consensus 3 t~FnyAw~Lv~S~~~~d~~~Gi~lLe~l~~~~ 34 (35)
T PF14852_consen 3 TQFNYAWGLVKSNNREDQQEGIALLEELYRDE 34 (35)
T ss_dssp HHHHHHHHHHHSSSHHHHHHHHHHHHHHCCCS
T ss_pred chhHHHHHHhcCCCHHHHHHHHHHHHHHHhcc
Confidence 79999999999999999999999999999987
No 4
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=98.79 E-value=4.2e-08 Score=61.76 Aligned_cols=65 Identities=18% Similarity=0.316 Sum_probs=56.3
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
|..|..+..+. +..++++.++++++.+ | ...+.+|.+|..++++|+|++|+.+++.+++..|+|+
T Consensus 1 ~~~a~~~~~~g---~~~~A~~~~~~~l~~~-P---~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~~P~~p 65 (65)
T PF13432_consen 1 YALARALYQQG---DYDEAIAAFEQALKQD-P---DNPEAWYLLGRILYQQGRYDEALAYYERALELDPDNP 65 (65)
T ss_dssp HHHHHHHHHCT---HHHHHHHHHHHHHCCS-T---THHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-H
T ss_pred ChHHHHHHHcC---CHHHHHHHHHHHHHHC-C---CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCC
Confidence 45667777766 4568999999999999 7 6999999999999999999999999999999999985
No 5
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=98.74 E-value=3.7e-08 Score=62.14 Aligned_cols=64 Identities=19% Similarity=0.263 Sum_probs=58.5
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
.+..++++++++++..+ | ...+.++.+|..+++.|+|++|+..++.++..+|+|+....|+..|
T Consensus 5 ~~~~~A~~~~~~~l~~~-p---~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~~~l~a~i 68 (68)
T PF14559_consen 5 GDYDEAIELLEKALQRN-P---DNPEARLLLAQCYLKQGQYDEAEELLERLLKQDPDNPEYQQLLAQI 68 (68)
T ss_dssp THHHHHHHHHHHHHHHT-T---TSHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHHHHHHHHH
T ss_pred cCHHHHHHHHHHHHHHC-C---CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHHHHHHhcC
Confidence 56779999999999999 7 6899999999999999999999999999999999999999988654
No 6
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.64 E-value=1.3e-07 Score=60.42 Aligned_cols=66 Identities=12% Similarity=0.222 Sum_probs=60.1
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
.++..++++.++..++-+ | .....++..|..++++|+|++|++.++..|+..|+++++..++..++
T Consensus 8 ~~~~~~A~~~~~~~l~~~-p---~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~~~a~l~ 73 (73)
T PF13371_consen 8 QEDYEEALEVLERALELD-P---DDPELWLQRARCLFQLGRYEEALEDLERALELSPDDPDARALRAMLN 73 (73)
T ss_pred CCCHHHHHHHHHHHHHhC-c---ccchhhHHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHHHHHhcC
Confidence 346678999999999998 7 68999999999999999999999999999999999999999988763
No 7
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=98.55 E-value=4.4e-07 Score=70.12 Aligned_cols=79 Identities=10% Similarity=0.084 Sum_probs=67.4
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.++||-+|..+.+..+..+++++|++.++.+ | ...+.++.+|+.++++|||++|..+|+.+|+.+|.+.+...+-+
T Consensus 110 ~~~lA~aL~~~~g~~~~~~A~~~l~~al~~d-P---~~~~al~~LA~~~~~~g~~~~Ai~~~~~aL~l~~~~~~r~~~i~ 185 (198)
T PRK10370 110 YAALATVLYYQAGQHMTPQTREMIDKALALD-A---NEVTALMLLASDAFMQADYAQAIELWQKVLDLNSPRVNRTQLVE 185 (198)
T ss_pred HHHHHHHHHHhcCCCCcHHHHHHHHHHHHhC-C---CChhHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCccHHHHHH
Confidence 4678888866665555678999999999999 6 67899999999999999999999999999999999988776555
Q ss_pred HHH
Q psy16295 83 IVK 85 (128)
Q Consensus 83 ~Ie 85 (128)
-|+
T Consensus 186 ~i~ 188 (198)
T PRK10370 186 SIN 188 (198)
T ss_pred HHH
Confidence 444
No 8
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=98.52 E-value=3.5e-07 Score=75.81 Aligned_cols=77 Identities=13% Similarity=0.073 Sum_probs=71.4
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
-.||.+|++|.++.+..++..+|++.++.+ | .....+||||++++..|||.+|..+|+.+|+..|.|.-.+++-+.
T Consensus 194 ~g~aeaL~~~a~~~~ta~a~~ll~~al~~D-~---~~iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~~~~rr~~ie~ 269 (287)
T COG4235 194 LGLAEALYYQAGQQMTAKARALLRQALALD-P---ANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRRSLIER 269 (287)
T ss_pred HHHHHHHHHhcCCcccHHHHHHHHHHHhcC-C---ccHHHHHHHHHHHHHcccHHHHHHHHHHHHhcCCCCCchHHHHHH
Confidence 469999999999999999999999999999 6 689999999999999999999999999999999999988877554
Q ss_pred H
Q psy16295 84 V 84 (128)
Q Consensus 84 I 84 (128)
+
T Consensus 270 ~ 270 (287)
T COG4235 270 S 270 (287)
T ss_pred H
Confidence 4
No 9
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=98.34 E-value=1e-06 Score=48.73 Aligned_cols=33 Identities=21% Similarity=0.529 Sum_probs=30.8
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
+.+|.+|..++++|+|++|+++++..++++|+|
T Consensus 2 ~~~~~lg~~~~~~~~~~~A~~~~~~al~l~p~~ 34 (34)
T PF07719_consen 2 EAWYYLGQAYYQLGNYEEAIEYFEKALELDPNN 34 (34)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTS
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHHHHHCcCC
Confidence 578999999999999999999999999999997
No 10
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=98.28 E-value=5.4e-06 Score=52.35 Aligned_cols=62 Identities=13% Similarity=0.319 Sum_probs=53.8
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhh-chHHHHHHHHHHHhcCC
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIK-EYTKSLKYCRAFLAIES 72 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklg-dY~~A~~~~~~lL~~eP 72 (128)
++.|..+.+ ..+..+++..++..++-+ | ...+.+|.+|++|.++| +|++|+++++..|+++|
T Consensus 7 ~~~g~~~~~---~~~~~~A~~~~~~ai~~~-p---~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~l~P 69 (69)
T PF13414_consen 7 YNLGQIYFQ---QGDYEEAIEYFEKAIELD-P---NNAEAYYNLGLAYMKLGKDYEEAIEDFEKALKLDP 69 (69)
T ss_dssp HHHHHHHHH---TTHHHHHHHHHHHHHHHS-T---THHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHST
T ss_pred HHHHHHHHH---cCCHHHHHHHHHHHHHcC-C---CCHHHHHHHHHHHHHhCccHHHHHHHHHHHHHcCc
Confidence 455666665 446778999999999998 6 68999999999999999 89999999999999999
No 11
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=98.27 E-value=1.1e-05 Score=56.38 Aligned_cols=76 Identities=13% Similarity=0.169 Sum_probs=42.6
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.+.+|+++....+ ..+++++++.+...+ | ...+.++.+|..++++|+|++|..+++.+++.+|+|.+..-...
T Consensus 20 ~~~~a~~~~~~~~---~~~A~~~~~~~~~~~-p---~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la 92 (135)
T TIGR02552 20 IYALAYNLYQQGR---YDEALKLFQLLAAYD-P---YNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAA 92 (135)
T ss_pred HHHHHHHHHHccc---HHHHHHHHHHHHHhC-C---CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHH
Confidence 3455555555443 334556666655555 3 34566666666666666666666666666666666655554444
Q ss_pred HHH
Q psy16295 83 IVK 85 (128)
Q Consensus 83 ~Ie 85 (128)
.+.
T Consensus 93 ~~~ 95 (135)
T TIGR02552 93 ECL 95 (135)
T ss_pred HHH
Confidence 443
No 12
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=98.18 E-value=1.7e-05 Score=66.25 Aligned_cols=82 Identities=6% Similarity=0.067 Sum_probs=71.1
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.+++|.++.+.. +..+++..++..++-+ | ...+.++.+|.+++++|+|++|+.+++.+++++|+|.++..+..
T Consensus 39 ~~~~a~~~~~~g---~~~eAl~~~~~Al~l~-P---~~~~a~~~lg~~~~~lg~~~eA~~~~~~al~l~P~~~~~~~~l~ 111 (356)
T PLN03088 39 YADRAQANIKLG---NFTEAVADANKAIELD-P---SLAKAYLRKGTACMKLEEYQTAKAALEKGASLAPGDSRFTKLIK 111 (356)
T ss_pred HHHHHHHHHHcC---CHHHHHHHHHHHHHhC-c---CCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHHH
Confidence 356677777654 5568999999999988 6 57889999999999999999999999999999999999999999
Q ss_pred HHHHHHhhh
Q psy16295 83 IVKKRMETE 91 (128)
Q Consensus 83 ~Ie~~i~kd 91 (128)
.++.++..+
T Consensus 112 ~~~~kl~~~ 120 (356)
T PLN03088 112 ECDEKIAEE 120 (356)
T ss_pred HHHHHHHhh
Confidence 999888654
No 13
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=98.15 E-value=2.3e-05 Score=57.34 Aligned_cols=66 Identities=5% Similarity=0.036 Sum_probs=45.7
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKR 87 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~ 87 (128)
+..++++.++..+.-+ | ...+.++.+++.+.++|++++|+.+++..++..|+|.+...++..+...
T Consensus 73 ~~~~A~~~y~~Al~l~-p---~~~~a~~~lg~~l~~~g~~~eAi~~~~~Al~~~p~~~~~~~~~~~~~~~ 138 (144)
T PRK15359 73 EYTTAINFYGHALMLD-A---SHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYADASWSEIRQNAQIM 138 (144)
T ss_pred hHHHHHHHHHHHHhcC-C---CCcHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHH
Confidence 3445666666666666 4 4667777777777777777777777777777777777777776666544
No 14
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=98.13 E-value=3.9e-06 Score=46.86 Aligned_cols=33 Identities=21% Similarity=0.494 Sum_probs=30.6
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
+.+|.+|..++.+|+|++|..++++.|+++|+|
T Consensus 2 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 34 (34)
T PF00515_consen 2 EAYYNLGNAYFQLGDYEEALEYYQRALELDPDN 34 (34)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTH
T ss_pred HHHHHHHHHHHHhCCchHHHHHHHHHHHHCcCC
Confidence 578999999999999999999999999999986
No 15
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=98.08 E-value=5e-05 Score=53.00 Aligned_cols=79 Identities=14% Similarity=0.078 Sum_probs=66.3
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
++.|+++.+. .+..+++..++..++.+ | ...+.+|++|..++..|+|++|.++++..++.+|++.....+++.
T Consensus 55 ~~la~~~~~~---~~~~~A~~~~~~~~~~~-p---~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~ 127 (135)
T TIGR02552 55 LGLAACCQML---KEYEEAIDAYALAAALD-P---DDPRPYFHAAECLLALGEPESALKALDLAIEICGENPEYSELKER 127 (135)
T ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHhcC-C---CChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccccchHHHHHHH
Confidence 4556666655 34568999999988888 6 579999999999999999999999999999999999998888887
Q ss_pred HHHHHh
Q psy16295 84 VKKRME 89 (128)
Q Consensus 84 Ie~~i~ 89 (128)
+.+.+.
T Consensus 128 ~~~~~~ 133 (135)
T TIGR02552 128 AEAMLE 133 (135)
T ss_pred HHHHHh
Confidence 776553
No 16
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=98.05 E-value=1e-05 Score=53.48 Aligned_cols=65 Identities=18% Similarity=0.334 Sum_probs=53.6
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
..+..++|.+++.++..+ |. ....+++|.+|..+|++|+|++|..+++. ++.+|.|.+..-+...
T Consensus 2 ~~~y~~Ai~~~~k~~~~~-~~-~~~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~~~~~~~~~~l~a~ 66 (84)
T PF12895_consen 2 QGNYENAIKYYEKLLELD-PT-NPNSAYLYNLAQCYFQQGKYEEAIELLQK-LKLDPSNPDIHYLLAR 66 (84)
T ss_dssp TT-HHHHHHHHHHHHHHH-CG-THHHHHHHHHHHHHHHTTHHHHHHHHHHC-HTHHHCHHHHHHHHHH
T ss_pred CccHHHHHHHHHHHHHHC-CC-ChhHHHHHHHHHHHHHCCCHHHHHHHHHH-hCCCCCCHHHHHHHHH
Confidence 457789999999999998 51 12567888899999999999999999999 9999988777666543
No 17
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=98.00 E-value=5.2e-05 Score=50.91 Aligned_cols=69 Identities=12% Similarity=0.174 Sum_probs=44.6
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.|+.+-.+.... +..++++.++.+.... |+.....+..|.++..+++.|+|++|..+++.+++..|+++
T Consensus 5 ~~~~~~~~~~~~---~~~~A~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~ 73 (119)
T TIGR02795 5 YYDAALLVLKAG---DYADAIQAFQAFLKKY-PKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKSP 73 (119)
T ss_pred HHHHHHHHHHcC---CHHHHHHHHHHHHHHC-CCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCCC
Confidence 455555555543 4456777777776665 32123356777777777777777777777777777777763
No 18
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=97.91 E-value=0.00039 Score=42.03 Aligned_cols=69 Identities=16% Similarity=0.271 Sum_probs=57.7
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
.++.|..+.... +..+++..++..++.. | ...+.++.+|..++..++|++|.++++..++..|.+.++.
T Consensus 3 ~~~~a~~~~~~~---~~~~A~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~ 71 (100)
T cd00189 3 LLNLGNLYYKLG---DYDEALEYYEKALELD-P---DNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAY 71 (100)
T ss_pred HHHHHHHHHHHh---cHHHHHHHHHHHHhcC-C---ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhHH
Confidence 466777777644 5567999999999887 5 3458999999999999999999999999999999998444
No 19
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=97.77 E-value=0.00031 Score=47.08 Aligned_cols=75 Identities=8% Similarity=0.008 Sum_probs=60.7
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
.|.++.++.+..+ ..+++.+++.++... |+.....+.++.++..+.+++++++|.++++.+++..|++.+++.-.
T Consensus 42 ~~~l~~~~~~~~~---~~~A~~~~~~~~~~~-p~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~ 116 (119)
T TIGR02795 42 HYWLGEAYYAQGK---YADAAKAFLAVVKKY-PKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKRYPGSSAAKLAQ 116 (119)
T ss_pred HHHHHHHHHhhcc---HHHHHHHHHHHHHHC-CCCCcccHHHHHHHHHHHHhCChHHHHHHHHHHHHHCcCChhHHHHH
Confidence 4566666666554 457999999999876 52123478999999999999999999999999999999999887653
No 20
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=97.75 E-value=6.2e-05 Score=41.03 Aligned_cols=33 Identities=21% Similarity=0.438 Sum_probs=31.3
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
|.+|.+|..++++|++++|+++++.+++.-|+.
T Consensus 1 ~a~~~~a~~~~~~g~~~~A~~~~~~~~~~~P~s 33 (33)
T PF13174_consen 1 DALYRLARCYYKLGDYDEAIEYFQRLIKRYPDS 33 (33)
T ss_dssp HHHHHHHHHHHHHCHHHHHHHHHHHHHHHSTTS
T ss_pred CHHHHHHHHHHHccCHHHHHHHHHHHHHHCcCC
Confidence 679999999999999999999999999999974
No 21
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=97.72 E-value=0.00021 Score=58.08 Aligned_cols=75 Identities=13% Similarity=0.196 Sum_probs=62.0
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
.|++|.+|+... .+..++|..|+.+++.. |+.....+.+|++|..+|..|+|++|+.+++.+++..|+++-+-.-
T Consensus 145 ~Y~~A~~l~~~~--~~y~~Ai~af~~fl~~y-P~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dA 219 (263)
T PRK10803 145 DYNAAIALVQDK--SRQDDAIVAFQNFVKKY-PDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADA 219 (263)
T ss_pred HHHHHHHHHHhc--CCHHHHHHHHHHHHHHC-cCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHH
Confidence 578899986543 45568999999999988 7434457999999999999999999999999999999997655433
No 22
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=97.69 E-value=0.00013 Score=43.46 Aligned_cols=40 Identities=15% Similarity=0.151 Sum_probs=36.4
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
+-++.+|-.+.++|++++|++.++.+|+.+|+|..+....
T Consensus 2 ~~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~L 41 (44)
T PF13428_consen 2 AAWLALARAYRRLGQPDEAERLLRRALALDPDDPEAWRAL 41 (44)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHh
Confidence 5688999999999999999999999999999999887543
No 23
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=97.69 E-value=0.0003 Score=51.46 Aligned_cols=77 Identities=13% Similarity=0.154 Sum_probs=63.0
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
++.++++.... +..++++.++.++..+ | ...+.++.+|..+.++|+|++|..+++..++.+|+|..+..-.-.
T Consensus 28 ~~~g~~~~~~g---~~~~A~~~~~~al~~~-P---~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~~~a~~~lg~ 100 (144)
T PRK15359 28 YASGYASWQEG---DYSRAVIDFSWLVMAQ-P---WSWRAHIALAGTWMMLKEYTTAINFYGHALMLDASHPEPVYQTGV 100 (144)
T ss_pred HHHHHHHHHcC---CHHHHHHHHHHHHHcC-C---CcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCcHHHHHHHH
Confidence 45566665544 4557999999999998 6 689999999999999999999999999999999999888766555
Q ss_pred HHHH
Q psy16295 84 VKKR 87 (128)
Q Consensus 84 Ie~~ 87 (128)
+-.+
T Consensus 101 ~l~~ 104 (144)
T PRK15359 101 CLKM 104 (144)
T ss_pred HHHH
Confidence 4433
No 24
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=97.67 E-value=0.00017 Score=57.54 Aligned_cols=61 Identities=10% Similarity=0.061 Sum_probs=52.8
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
.+..++++.++++.... |.+.....-.|.+|.+||+++||++|+.+.+.+++..|+|+.+-
T Consensus 46 g~y~~Ai~~f~~l~~~y-P~s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~~~~ 106 (243)
T PRK10866 46 GNWKQAITQLEALDNRY-PFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRLNPTHPNID 106 (243)
T ss_pred CCHHHHHHHHHHHHHhC-CCChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCCCchH
Confidence 45568999999998876 65456667789999999999999999999999999999998764
No 25
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=97.65 E-value=0.00048 Score=41.63 Aligned_cols=63 Identities=11% Similarity=0.262 Sum_probs=52.6
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE 73 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~ 73 (128)
+.+|.++... .+..++++.++..++.. | ...+.++.++..+..+|++++|..+++.+++..|+
T Consensus 38 ~~~~~~~~~~---~~~~~a~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 100 (100)
T cd00189 38 YNLAAAYYKL---GKYEEALEDYEKALELD-P---DNAKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100 (100)
T ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHhCC-C---cchhHHHHHHHHHHHHHhHHHHHHHHHHHHccCCC
Confidence 4556666554 55678999999999887 5 45689999999999999999999999999999884
No 26
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=97.64 E-value=0.00069 Score=48.84 Aligned_cols=70 Identities=13% Similarity=0.124 Sum_probs=59.2
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.++.|+.+....+ ..++++.+++.++.+ | ...+.++.++..+.++|+|++|+++++..++..|++..+..
T Consensus 34 ~~~la~~~~~~~~---~~~A~~~~~~~l~~~-p---~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~ 103 (234)
T TIGR02521 34 RVQLALGYLEQGD---LEVAKENLDKALEHD-P---DDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLN 103 (234)
T ss_pred HHHHHHHHHHCCC---HHHHHHHHHHHHHhC-c---ccHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHH
Confidence 4567788877654 558999999998887 6 56789999999999999999999999999999999976543
No 27
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=97.60 E-value=0.00078 Score=49.97 Aligned_cols=80 Identities=19% Similarity=0.242 Sum_probs=61.4
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.|+.+..+... .+..+++..+++.++.. |+.......++.+|..++++|+|++|+.+++..++..|++.++.....
T Consensus 38 ~~~lg~~~~~~---g~~~~A~~~~~~al~~~-~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~lg 113 (172)
T PRK02603 38 YYRDGMSAQAD---GEYAEALENYEEALKLE-EDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALNNIA 113 (172)
T ss_pred HHHHHHHHHHc---CCHHHHHHHHHHHHHHh-hccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHHHHH
Confidence 35555555544 45568999999998765 311234579999999999999999999999999999999998876555
Q ss_pred HHHH
Q psy16295 83 IVKK 86 (128)
Q Consensus 83 ~Ie~ 86 (128)
.+-.
T Consensus 114 ~~~~ 117 (172)
T PRK02603 114 VIYH 117 (172)
T ss_pred HHHH
Confidence 5443
No 28
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=97.59 E-value=0.00028 Score=46.50 Aligned_cols=56 Identities=21% Similarity=0.358 Sum_probs=45.2
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRA 66 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~ 66 (128)
.|++|.++.++.+ ..+++++++. .+.+ | ...++.|++|-+++++|+|++|++..+.
T Consensus 28 ~~~la~~~~~~~~---y~~A~~~~~~-~~~~-~---~~~~~~~l~a~~~~~l~~y~eAi~~l~~ 83 (84)
T PF12895_consen 28 LYNLAQCYFQQGK---YEEAIELLQK-LKLD-P---SNPDIHYLLARCLLKLGKYEEAIKALEK 83 (84)
T ss_dssp HHHHHHHHHHTTH---HHHHHHHHHC-HTHH-H---CHHHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHHHHCCC---HHHHHHHHHH-hCCC-C---CCHHHHHHHHHHHHHhCCHHHHHHHHhc
Confidence 3566888888874 4689999999 5555 3 5689999999999999999999998875
No 29
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=97.58 E-value=0.00016 Score=57.04 Aligned_cols=71 Identities=15% Similarity=0.194 Sum_probs=54.2
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
...++|.|+...+..+ ..++|+.+.+.. | ...+.+..+|.++.++|+|++|+.+++.+++..|+|+.....
T Consensus 183 ~~~l~~~li~~~~~~~---~~~~l~~~~~~~-~---~~~~~~~~la~~~~~lg~~~~Al~~~~~~~~~~p~d~~~~~~ 253 (280)
T PF13429_consen 183 RNALAWLLIDMGDYDE---AREALKRLLKAA-P---DDPDLWDALAAAYLQLGRYEEALEYLEKALKLNPDDPLWLLA 253 (280)
T ss_dssp HHHHHHHHCTTCHHHH---HHHHHHHHHHH--H---TSCCHCHHHHHHHHHHT-HHHHHHHHHHHHHHSTT-HHHHHH
T ss_pred HHHHHHHHHHCCChHH---HHHHHHHHHHHC-c---CHHHHHHHHHHHhccccccccccccccccccccccccccccc
Confidence 4678999998876665 445555555554 3 356688899999999999999999999999999999987654
No 30
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=97.55 E-value=0.00013 Score=40.35 Aligned_cols=33 Identities=21% Similarity=0.443 Sum_probs=30.2
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
+.++.+|..+..+|+|++|.++++..++++|+|
T Consensus 2 ~~~~~lg~~y~~~~~~~~A~~~~~~a~~~~~~n 34 (34)
T PF13181_consen 2 EAYYNLGKIYEQLGDYEEALEYFEKALELNPDN 34 (34)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 468899999999999999999999999999965
No 31
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=97.55 E-value=0.00039 Score=53.49 Aligned_cols=72 Identities=13% Similarity=0.220 Sum_probs=56.5
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
.|+-|..+.... +..++|+.|+++...- |.+....+..+.+|.++|+.|||++|+..++.+++.-|+++.+-
T Consensus 8 lY~~a~~~~~~g---~y~~Ai~~f~~l~~~~-P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~~~~~ 79 (203)
T PF13525_consen 8 LYQKALEALQQG---DYEEAIKLFEKLIDRY-PNSPYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKLYPNSPKAD 79 (203)
T ss_dssp HHHHHHHHHHCT----HHHHHHHHHHHHHH--TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-TTHH
T ss_pred HHHHHHHHHHCC---CHHHHHHHHHHHHHHC-CCChHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchh
Confidence 355666666655 4568999999999885 55568899999999999999999999999999999999988643
No 32
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=97.50 E-value=0.0005 Score=57.47 Aligned_cols=68 Identities=7% Similarity=0.131 Sum_probs=59.6
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRM 88 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i 88 (128)
.+..++++++++.++.+ | ...++++.+|.++.++|+|++|+.+++.+++++|++.++.-.+-.+--.+
T Consensus 16 ~~~~~Ai~~~~~Al~~~-P---~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a~~~lg~~~~~l 83 (356)
T PLN03088 16 DDFALAVDLYTQAIDLD-P---NNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLRKGTACMKL 83 (356)
T ss_pred CCHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHHHHHHHHHHHHh
Confidence 35668999999999998 6 57899999999999999999999999999999999999887776654433
No 33
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=97.50 E-value=0.00082 Score=51.25 Aligned_cols=72 Identities=11% Similarity=0.078 Sum_probs=59.8
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
..|+.+.++....+. ..++..+++++... |+.....+.+|.+|..+++.|+|++|+..++.+++..|++..+
T Consensus 35 ~~~~~g~~~~~~~~~---~~A~~~~~~~~~~~-p~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~l~~~p~~~~~ 106 (235)
T TIGR03302 35 ELYEEAKEALDSGDY---TEAIKYFEALESRY-PFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNHPDA 106 (235)
T ss_pred HHHHHHHHHHHcCCH---HHHHHHHHHHHHhC-CCchhHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCCCch
Confidence 357788888876654 47899999998887 5322456889999999999999999999999999999998874
No 34
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=97.50 E-value=0.00031 Score=43.86 Aligned_cols=45 Identities=16% Similarity=0.149 Sum_probs=40.1
Q ss_pred HHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 45 YYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 45 YyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
|-+|..+++.|+|++|++.++.+++.+|+|.++..+.-.+.-...
T Consensus 1 ~~~a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~~lg~~~~~~g 45 (65)
T PF13432_consen 1 YALARALYQQGDYDEAIAAFEQALKQDPDNPEAWYLLGRILYQQG 45 (65)
T ss_dssp HHHHHHHHHCTHHHHHHHHHHHHHCCSTTHHHHHHHHHHHHHHTT
T ss_pred ChHHHHHHHcCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcC
Confidence 567999999999999999999999999999999998888765443
No 35
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=97.48 E-value=0.00074 Score=52.14 Aligned_cols=64 Identities=13% Similarity=0.128 Sum_probs=57.7
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
.++..+.+..|+..++.+ | ...+.|+.|+..+..+|+|++|..+++..++++|+|.++..-...
T Consensus 52 ~~~~~~~i~~l~~~L~~~-P---~~~~~w~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~ 115 (198)
T PRK10370 52 QQTPEAQLQALQDKIRAN-P---QNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALAT 115 (198)
T ss_pred chhHHHHHHHHHHHHHHC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence 567788999999999999 7 689999999999999999999999999999999999998766554
No 36
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=97.40 E-value=0.0029 Score=45.57 Aligned_cols=77 Identities=13% Similarity=0.141 Sum_probs=49.0
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.+++++.+.... +..++++.++..+..... ....+.++.++..+++.|+|++|..+++..++.+|++..+.....
T Consensus 102 ~~~~~~~~~~~g---~~~~A~~~~~~~~~~~~~--~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~la 176 (234)
T TIGR02521 102 LNNYGTFLCQQG---KYEQAMQQFEQAIEDPLY--PQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELA 176 (234)
T ss_pred HHHHHHHHHHcc---cHHHHHHHHHHHHhcccc--ccchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCChHHHHHHH
Confidence 356666666544 445677777776654211 234566777777777777777777777777777777766554433
Q ss_pred HH
Q psy16295 83 IV 84 (128)
Q Consensus 83 ~I 84 (128)
.+
T Consensus 177 ~~ 178 (234)
T TIGR02521 177 EL 178 (234)
T ss_pred HH
Confidence 33
No 37
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=97.38 E-value=0.0007 Score=42.50 Aligned_cols=50 Identities=10% Similarity=0.214 Sum_probs=43.9
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
+-+.++.+|..+++.|+|++|+.+++..++++|+|..+.--...+..+..
T Consensus 2 ~a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~~~ 51 (69)
T PF13414_consen 2 NAEAWYNLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGLAYMKLG 51 (69)
T ss_dssp SHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhC
Confidence 35789999999999999999999999999999999998777777766554
No 38
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=97.25 E-value=0.0025 Score=56.08 Aligned_cols=69 Identities=12% Similarity=0.019 Sum_probs=56.2
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
+++|+++.... +..+++..++..+..+ | ...+++|.++..++.+|+|++|+.+++..|+++|+|..+..
T Consensus 369 ~~la~~~~~~g---~~~eA~~~~~~al~~~-p---~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l~P~~~~~~~ 437 (615)
T TIGR00990 369 IKRASMNLELG---DPDKAEEDFDKALKLN-S---EDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHI 437 (615)
T ss_pred HHHHHHHHHCC---CHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCccCHHHHH
Confidence 45666666544 4567888899888887 6 56899999999999999999999999999999999876654
No 39
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=97.23 E-value=0.00089 Score=33.67 Aligned_cols=33 Identities=24% Similarity=0.531 Sum_probs=30.6
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
+.++.+|..+..+++|++|..+++..++++|++
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~~~~~~~~ 34 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHHccCCCC
Confidence 568899999999999999999999999999974
No 40
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=97.20 E-value=0.0021 Score=60.82 Aligned_cols=73 Identities=19% Similarity=0.202 Sum_probs=63.2
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.++++++|....+ ..+++++++..++.+ | ...+.++.+|..+.++|+|++|+.+++.+++.+|++-+.....-
T Consensus 646 ~~nLG~aL~~~G~---~eeAi~~l~~AL~l~-P---~~~~a~~nLA~al~~lGd~~eA~~~l~~Al~l~P~~a~i~~~~g 718 (987)
T PRK09782 646 QAALGYALWDSGD---IAQSREMLERAHKGL-P---DDPALIRQLAYVNQRLDDMAATQHYARLVIDDIDNQALITPLTP 718 (987)
T ss_pred HHHHHHHHHHCCC---HHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCCchhhhhhh
Confidence 5678888888654 468999999999998 7 67899999999999999999999999999999999977764433
No 41
>PRK11189 lipoprotein NlpI; Provisional
Probab=97.14 E-value=0.0043 Score=50.22 Aligned_cols=60 Identities=8% Similarity=0.030 Sum_probs=41.3
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
..+++..+++.++.+ | ...+.++.++..+.++|+|++|...++..|+++|++..+...+-
T Consensus 80 ~~~A~~~~~~Al~l~-P---~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg 139 (296)
T PRK11189 80 RALARNDFSQALALR-P---DMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRG 139 (296)
T ss_pred HHHHHHHHHHHHHcC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHH
Confidence 345666677777666 5 45677777777777777777777777777777777776644433
No 42
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=97.10 E-value=0.0066 Score=46.24 Aligned_cols=74 Identities=7% Similarity=0.143 Sum_probs=57.9
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhh--------hchHHHHHHHHHHHhcCCCCH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARI--------KEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~ykl--------gdY~~A~~~~~~lL~~eP~N~ 75 (128)
|..|-++.... +..++++.+++.++.. |+.....+.+|.++..+++. |++++|++.++.+++..|++.
T Consensus 74 ~~la~~~~~~~---~~~~A~~~~~~~l~~~-p~~~~~~~a~~~~g~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~~ 149 (235)
T TIGR03302 74 LDLAYAYYKSG---DYAEAIAAADRFIRLH-PNHPDADYAYYLRGLSNYNQIDRVDRDQTAAREAFEAFQELIRRYPNSE 149 (235)
T ss_pred HHHHHHHHhcC---CHHHHHHHHHHHHHHC-cCCCchHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHCCCCh
Confidence 45555555544 5668999999999887 53234456899999999987 999999999999999999998
Q ss_pred HHHHHH
Q psy16295 76 QVQHLE 81 (128)
Q Consensus 76 QA~~Lk 81 (128)
.+....
T Consensus 150 ~~~~a~ 155 (235)
T TIGR03302 150 YAPDAK 155 (235)
T ss_pred hHHHHH
Confidence 765443
No 43
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=97.09 E-value=0.0036 Score=56.97 Aligned_cols=63 Identities=17% Similarity=0.068 Sum_probs=43.3
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
..++++.|+..+..+ | +..+.+|.+|..+.++++|++|....+.+++.+|+|.+++.|...-+
T Consensus 409 ~~~A~~~l~~al~l~-P---d~~~l~~~~a~~al~~~~~~~A~~~~~~ll~~~Pd~~~~~~~~~~~~ 471 (765)
T PRK10049 409 PRAAENELKKAEVLE-P---RNINLEVEQAWTALDLQEWRQMDVLTDDVVAREPQDPGVQRLARARD 471 (765)
T ss_pred HHHHHHHHHHHHhhC-C---CChHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Confidence 346666666666666 5 45557777777777777777777777777777777777777766654
No 44
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=97.07 E-value=0.0082 Score=43.64 Aligned_cols=79 Identities=13% Similarity=0.114 Sum_probs=65.3
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC---CHHHH
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE---NSQVQ 78 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~---N~QA~ 78 (128)
..|+.||++=--.. -.++|.+.+.-+..+ .+...+++++-.+|-.+-.+|+|++|...++..++-.|+ +...+
T Consensus 3 ~~~~~A~a~d~~G~---~~~Ai~~Y~~Al~~g-L~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~p~~~~~~~l~ 78 (120)
T PF12688_consen 3 ALYELAWAHDSLGR---EEEAIPLYRRALAAG-LSGADRRRALIQLASTLRNLGRYDEALALLEEALEEFPDDELNAALR 78 (120)
T ss_pred hHHHHHHHHHhcCC---HHHHHHHHHHHHHcC-CCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccccHHHH
Confidence 46999999866554 458999999998875 222577889999999999999999999999999999899 77777
Q ss_pred HHHHHH
Q psy16295 79 HLESIV 84 (128)
Q Consensus 79 ~Lk~~I 84 (128)
....+.
T Consensus 79 ~f~Al~ 84 (120)
T PF12688_consen 79 VFLALA 84 (120)
T ss_pred HHHHHH
Confidence 665554
No 45
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=97.05 E-value=0.0023 Score=50.45 Aligned_cols=69 Identities=17% Similarity=0.170 Sum_probs=46.3
Q ss_pred HHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 6 YSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 6 YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
+++.+.+.. +..+..++|+.+...... ....++++.+|..+.+.|++++|+++++..|+.+|+|+++..
T Consensus 116 ~l~~~~~~~---~~~~~~~~l~~~~~~~~~--~~~~~~~~~~a~~~~~~G~~~~A~~~~~~al~~~P~~~~~~~ 184 (280)
T PF13429_consen 116 ALQLYYRLG---DYDEAEELLEKLEELPAA--PDSARFWLALAEIYEQLGDPDKALRDYRKALELDPDDPDARN 184 (280)
T ss_dssp --H-HHHTT----HHHHHHHHHHHHH-T-----T-HHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-TT-HHHHH
T ss_pred HHHHHHHHh---HHHHHHHHHHHHHhccCC--CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHH
Confidence 344445544 445566666665544311 356788889999999999999999999999999999999875
No 46
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=97.03 E-value=0.0065 Score=49.37 Aligned_cols=78 Identities=4% Similarity=0.015 Sum_probs=63.8
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.|-+|.++.... +...++..++.+++.. |+.....|.+|.++..+.++|++++|+++++.+++..|+...+..=++
T Consensus 183 ~y~LG~~y~~~g---~~~~A~~~f~~vv~~y-P~s~~~~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~s~~a~~A~~ 258 (263)
T PRK10803 183 NYWLGQLNYNKG---KKDDAAYYFASVVKNY-PKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPGTDGAKQAQK 258 (263)
T ss_pred HHHHHHHHHHcC---CHHHHHHHHHHHHHHC-CCCcchhHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHHH
Confidence 466677776654 5558999999998775 544578999999999999999999999999999999999998776554
Q ss_pred HH
Q psy16295 83 IV 84 (128)
Q Consensus 83 ~I 84 (128)
.+
T Consensus 259 rL 260 (263)
T PRK10803 259 RL 260 (263)
T ss_pred HH
Confidence 43
No 47
>PRK12370 invasion protein regulator; Provisional
Probab=97.02 E-value=0.0048 Score=54.10 Aligned_cols=57 Identities=14% Similarity=0.166 Sum_probs=48.9
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
+..+++..+++.++.+ | ...+.+|++|..+...|+|++|+..++..++.+|++..+.
T Consensus 353 ~~~~A~~~~~~Al~l~-P---~~~~a~~~lg~~l~~~G~~~eAi~~~~~Al~l~P~~~~~~ 409 (553)
T PRK12370 353 EYIVGSLLFKQANLLS-P---ISADIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAG 409 (553)
T ss_pred CHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCChhhH
Confidence 4567899999988888 6 5778899999999999999999999999999999987653
No 48
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=97.01 E-value=0.0045 Score=47.42 Aligned_cols=68 Identities=15% Similarity=0.201 Sum_probs=57.4
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
+.||.-|-... +...+..+++-++.-+ | ...+++|-|+..+-.+|+|++|+.++...+.++|+|+.+-
T Consensus 39 Y~~A~~ly~~G---~l~~A~~~f~~L~~~D-p---~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~ddp~~~ 106 (157)
T PRK15363 39 YRYAMQLMEVK---EFAGAARLFQLLTIYD-A---WSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAP 106 (157)
T ss_pred HHHHHHHHHCC---CHHHHHHHHHHHHHhC-c---ccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCchHH
Confidence 45666666655 4557888999999888 6 7899999999999999999999999999999999998753
No 49
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=96.99 E-value=0.012 Score=43.18 Aligned_cols=79 Identities=15% Similarity=0.119 Sum_probs=57.2
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
++.+.++.... +..+++..++..+... |+.......++-+|+.+.+.|+|++|+.+++..|+++|++..+......
T Consensus 39 ~~~g~~~~~~g---~~~~A~~~~~~al~l~-~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~~~~~~~~la~ 114 (168)
T CHL00033 39 YRDGMSAQSEG---EYAEALQNYYEAMRLE-IDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMAV 114 (168)
T ss_pred HHHHHHHHHcC---CHHHHHHHHHHHHhcc-ccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCcHHHHHHHHH
Confidence 34444444333 3558888888887665 3212345689999999999999999999999999999999876555444
Q ss_pred HHH
Q psy16295 84 VKK 86 (128)
Q Consensus 84 Ie~ 86 (128)
|..
T Consensus 115 i~~ 117 (168)
T CHL00033 115 ICH 117 (168)
T ss_pred HHH
Confidence 444
No 50
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=96.99 E-value=0.0055 Score=52.72 Aligned_cols=69 Identities=14% Similarity=0.083 Sum_probs=47.3
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
.+++|.+.+..+ .+++++++..+... | ...+.++.++..++++|++++|..+++.+++..|+|+++...
T Consensus 808 ~~l~~~~~~~~~----~~A~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~~~~~~~~ 876 (899)
T TIGR02917 808 NNLAWLYLELKD----PRALEYAEKALKLA-P---NIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYH 876 (899)
T ss_pred HHHHHHHHhcCc----HHHHHHHHHHHhhC-C---CCcHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCChHHHHH
Confidence 456666666554 23777777777765 4 456677777777777777777777777777777777665443
No 51
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=96.93 E-value=0.0071 Score=52.04 Aligned_cols=70 Identities=13% Similarity=0.134 Sum_probs=49.2
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.+..|+.+.... +..++++++++.+..+ | ...+.+++++..+++.|+|++|..+++.+++.+|+|.++..
T Consensus 162 ~~~la~~~~~~~---~~~~A~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~ 231 (899)
T TIGR02917 162 KLGLAQLALAEN---RFDEARALIDEVLTAD-P---GNVDALLLKGDLLLSLGNIELALAAYRKAIALRPNNPAVLL 231 (899)
T ss_pred HHHHHHHHHHCC---CHHHHHHHHHHHHHhC-C---CChHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCCCCHHHHH
Confidence 345566665544 4456777777777766 5 45677788888888888888888888888888887776543
No 52
>PRK12370 invasion protein regulator; Provisional
Probab=96.89 E-value=0.0086 Score=52.51 Aligned_cols=65 Identities=9% Similarity=-0.040 Sum_probs=57.5
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
...+..+++..++..++-+ | ...+.+..+|..+...|+|++|..+++..|+.+|+|.++....-.
T Consensus 316 ~~~~~~~A~~~~~~Al~ld-P---~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~ 380 (553)
T PRK12370 316 KQNAMIKAKEHAIKATELD-H---NNPQALGLLGLINTIHSEYIVGSLLFKQANLLSPISADIKYYYGW 380 (553)
T ss_pred cchHHHHHHHHHHHHHhcC-C---CCHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence 4567889999999999998 7 578899999999999999999999999999999999998655433
No 53
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=96.86 E-value=0.0017 Score=37.19 Aligned_cols=32 Identities=25% Similarity=0.313 Sum_probs=25.4
Q ss_pred HHHHHHHHHhhhhchHHHHHHHHHHH--hcCCCC
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCRAFL--AIESEN 74 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~~lL--~~eP~N 74 (128)
++..||-.|.++|+|++|..++++.| ..+|++
T Consensus 1 al~~Lg~~~~~~g~~~~Ai~~y~~aL~l~~~~~~ 34 (36)
T PF13176_consen 1 ALNNLGRIYRQQGDYEKAIEYYEQALALARDPED 34 (36)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHHHHHHHHHHHCT-
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHHHHhcccccC
Confidence 47789999999999999999999955 555554
No 54
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=96.83 E-value=0.0053 Score=51.64 Aligned_cols=66 Identities=12% Similarity=0.025 Sum_probs=57.9
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKK 86 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~ 86 (128)
++..+.++.+++.++.+ | +..+.++.+|-.+++.++|++|+++.+.+++.+|++.....|-.+-+.
T Consensus 308 ~~~~~al~~~e~~lk~~-P---~~~~l~l~lgrl~~~~~~~~~A~~~le~al~~~P~~~~~~~La~~~~~ 373 (398)
T PRK10747 308 NNPEQLEKVLRQQIKQH-G---DTPLLWSTLGQLLMKHGEWQEASLAFRAALKQRPDAYDYAWLADALDR 373 (398)
T ss_pred CChHHHHHHHHHHHhhC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHHHHHHHHHHHH
Confidence 45667899999999998 6 688999999999999999999999999999999999987777666653
No 55
>KOG0543|consensus
Probab=96.81 E-value=0.01 Score=51.38 Aligned_cols=81 Identities=15% Similarity=0.229 Sum_probs=66.5
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH----
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ---- 78 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~---- 78 (128)
-.|.|.|.++...+. .+|......+.-+ | .+.-.||=-+-++.-+++|+.|+..++.+++++|+|.-++
T Consensus 260 ~lNlA~c~lKl~~~~---~Ai~~c~kvLe~~-~---~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el~ 332 (397)
T KOG0543|consen 260 HLNLAACYLKLKEYK---EAIESCNKVLELD-P---NNVKALYRRGQALLALGEYDLARDDFQKALKLEPSNKAARAELI 332 (397)
T ss_pred hhHHHHHHHhhhhHH---HHHHHHHHHHhcC-C---CchhHHHHHHHHHHhhccHHHHHHHHHHHHHhCCCcHHHHHHHH
Confidence 358888999887555 8899999999888 5 6899999999999999999999999999999999997665
Q ss_pred HHHHHHHHHHhh
Q psy16295 79 HLESIVKKRMET 90 (128)
Q Consensus 79 ~Lk~~Ie~~i~k 90 (128)
.|++.|.+.-.+
T Consensus 333 ~l~~k~~~~~~k 344 (397)
T KOG0543|consen 333 KLKQKIREYEEK 344 (397)
T ss_pred HHHHHHHHHHHH
Confidence 455555444444
No 56
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=96.79 E-value=0.0031 Score=47.63 Aligned_cols=71 Identities=13% Similarity=0.129 Sum_probs=56.4
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
|+=|....++. +..++|+.|+.|-..- |-..-....-.-|+-+||+.++|++|+..+++++++.|.++++-
T Consensus 14 y~~a~~~l~~~---~Y~~A~~~le~L~~ry-P~g~ya~qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~hp~vd 84 (142)
T PF13512_consen 14 YQEAQEALQKG---NYEEAIKQLEALDTRY-PFGEYAEQAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTHPNVD 84 (142)
T ss_pred HHHHHHHHHhC---CHHHHHHHHHHHHhcC-CCCcccHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCCCcc
Confidence 34455555554 4567899999988875 43346677888999999999999999999999999999998753
No 57
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=96.78 E-value=0.011 Score=53.12 Aligned_cols=78 Identities=9% Similarity=0.019 Sum_probs=57.6
Q ss_pred HHHHHHHHhccCChhh-HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCAD-IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d-~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
.+++|..+....+.++ ..+++..++..+..+ | ...+.+..+|..+.++|+|++|+.+++.+++.+|+|.++....
T Consensus 249 ~~~Lg~~l~~~G~~~eA~~~A~~~~~~Al~l~-P---~~~~a~~~lg~~l~~~g~~~eA~~~l~~al~l~P~~~~a~~~L 324 (656)
T PRK15174 249 RRSLGLAYYQSGRSREAKLQAAEHWRHALQFN-S---DNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPYVRAMY 324 (656)
T ss_pred HHHHHHHHHHcCCchhhHHHHHHHHHHHHhhC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHH
Confidence 3556666666655443 235788888888877 6 4677888888888888888888888888888888888776654
Q ss_pred HHH
Q psy16295 82 SIV 84 (128)
Q Consensus 82 ~~I 84 (128)
..+
T Consensus 325 a~~ 327 (656)
T PRK15174 325 ARA 327 (656)
T ss_pred HHH
Confidence 443
No 58
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=96.63 E-value=0.0088 Score=47.10 Aligned_cols=81 Identities=21% Similarity=0.240 Sum_probs=59.3
Q ss_pred HHHHHHHhcc-------CChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhc-----------hHHHHHHHH
Q psy16295 4 FNYSFCLVRS-------EFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKE-----------YTKSLKYCR 65 (128)
Q Consensus 4 F~YAwaLv~S-------~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgd-----------Y~~A~~~~~ 65 (128)
.+.+.+|... ..+.-++++|+=|++-++-+ | ...|.+|.++.+|+-++. |++|..|.+
T Consensus 29 ~~WG~ALLELAqfk~g~es~~miedAisK~eeAL~I~-P---~~hdAlw~lGnA~ts~A~l~~d~~~A~~~F~kA~~~Fq 104 (186)
T PF06552_consen 29 TNWGGALLELAQFKQGPESKKMIEDAISKFEEALKIN-P---NKHDALWCLGNAYTSLAFLTPDTAEAEEYFEKATEYFQ 104 (186)
T ss_dssp HHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHH--T---T-HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhcC-C---chHHHHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHHH
Confidence 3445555544 23466788999999999999 7 789999999999998876 899999999
Q ss_pred HHHhcCCCCHH----------HHHHHHHHHHHH
Q psy16295 66 AFLAIESENSQ----------VQHLESIVKKRM 88 (128)
Q Consensus 66 ~lL~~eP~N~Q----------A~~Lk~~Ie~~i 88 (128)
...+.||+|.- |-.|+..|.+.+
T Consensus 105 kAv~~~P~ne~Y~ksLe~~~kap~lh~e~~~~~ 137 (186)
T PF06552_consen 105 KAVDEDPNNELYRKSLEMAAKAPELHMEIHKQG 137 (186)
T ss_dssp HHHHH-TT-HHHHHHHHHHHTHHHHHHHHHHSS
T ss_pred HHHhcCCCcHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 99999999964 455666665553
No 59
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=96.60 E-value=0.0049 Score=50.59 Aligned_cols=63 Identities=16% Similarity=0.140 Sum_probs=55.3
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
..+..++++-++.+.+.. |-+.-.+..+.-++.+|||.++|+.|+.++|+.++.-|.++.+-=
T Consensus 47 ~gn~~~A~~~fe~l~~~~-p~s~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY 109 (254)
T COG4105 47 KGNYEEAIKYFEALDSRH-PFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADY 109 (254)
T ss_pred cCCHHHHHHHHHHHHHcC-CCCcccHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCCChhH
Confidence 346678999999999886 655678999999999999999999999999999999999987643
No 60
>PRK11189 lipoprotein NlpI; Provisional
Probab=96.56 E-value=0.026 Score=45.70 Aligned_cols=56 Identities=7% Similarity=0.010 Sum_probs=50.0
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
.+..++++.++..++-+ | ...+.++.+++.++..|+|++|.+.++..++.+|+|+.
T Consensus 112 g~~~~A~~~~~~Al~l~-P---~~~~a~~~lg~~l~~~g~~~eA~~~~~~al~~~P~~~~ 167 (296)
T PRK11189 112 GNFDAAYEAFDSVLELD-P---TYNYAYLNRGIALYYGGRYELAQDDLLAFYQDDPNDPY 167 (296)
T ss_pred CCHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHH
Confidence 35568899999999888 6 56889999999999999999999999999999999974
No 61
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=96.52 E-value=0.02 Score=52.25 Aligned_cols=71 Identities=11% Similarity=-0.091 Sum_probs=59.0
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
+.+|..+..+ .+..+++++|++++... | ...+.++.+|..+.+.|++++|++.++.+|+.+|+|.++.--.
T Consensus 363 ~~~a~~l~~~---g~~~eA~~~l~~al~~~-P---~n~~l~~~lA~l~~~~g~~~~A~~~l~~al~l~Pd~~~l~~~~ 433 (765)
T PRK10049 363 SLLSQVAKYS---NDLPQAEMRARELAYNA-P---GNQGLRIDYASVLQARGWPRAAENELKKAEVLEPRNINLEVEQ 433 (765)
T ss_pred HHHHHHHHHc---CCHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCCCChHHHHHH
Confidence 4456666655 35568999999999998 7 4589999999999999999999999999999999996654433
No 62
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=96.52 E-value=0.012 Score=49.59 Aligned_cols=66 Identities=14% Similarity=-0.005 Sum_probs=55.2
Q ss_pred hhhHHHHHHHHHHHhhccCccccchH--HHHHHHHHHHhhhhchHHHHHHHH--HHHhcCCCCHHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKR--DYLYYLAIGNARIKEYTKSLKYCR--AFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~r--d~LYyLAvg~yklgdY~~A~~~~~--~lL~~eP~N~QA~~Lk~~Ie 85 (128)
+++....++.++.-++.+ | +.. ..+..++--+++.|+|++|++|.+ ..++.+|++.....|-++.+
T Consensus 312 ~~~~~~~~~~~e~~lk~~-p---~~~~~~ll~sLg~l~~~~~~~~~A~~~le~a~a~~~~p~~~~~~~La~ll~ 381 (409)
T TIGR00540 312 PEDNEKLEKLIEKQAKNV-D---DKPKCCINRALGQLLMKHGEFIEAADAFKNVAACKEQLDANDLAMAADAFD 381 (409)
T ss_pred CCChHHHHHHHHHHHHhC-C---CChhHHHHHHHHHHHHHcccHHHHHHHHHHhHHhhcCCCHHHHHHHHHHHH
Confidence 466677888999888888 6 456 566688888999999999999999 79999999998888877665
No 63
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=96.50 E-value=0.0062 Score=35.73 Aligned_cols=34 Identities=24% Similarity=0.418 Sum_probs=28.7
Q ss_pred HHHHHHHHHHHhhhhchHHHHHH--HHHHHhcCCCC
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKY--CRAFLAIESEN 74 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~--~~~lL~~eP~N 74 (128)
.|++|-+|...+..|+|++|+++ ..-+...+|.|
T Consensus 1 ~e~~y~~a~~~y~~~ky~~A~~~~~y~~l~~ld~~n 36 (36)
T PF07720_consen 1 PEYLYGLAYNFYQKGKYDEAIHFFQYAFLCALDKYN 36 (36)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHTTT-
T ss_pred CcHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhcccC
Confidence 37899999999999999999999 55888888876
No 64
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=96.45 E-value=0.034 Score=44.92 Aligned_cols=52 Identities=10% Similarity=0.052 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
..+++..+++.++.+ | ...+.++.++..+.+.|++++|.++++.+++.+|++
T Consensus 196 ~~~A~~~~~~al~~~-p---~~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~p~~ 247 (389)
T PRK11788 196 LDAARALLKKALAAD-P---QCVRASILLGDLALAQGDYAAAIEALERVEEQDPEY 247 (389)
T ss_pred HHHHHHHHHHHHhHC-c---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHChhh
Confidence 344555555555444 3 234455555555555555555555555555555544
No 65
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=96.42 E-value=0.021 Score=42.28 Aligned_cols=66 Identities=12% Similarity=0.236 Sum_probs=54.0
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhc--------------hHHHHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKE--------------YTKSLKYCRAFL 68 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgd--------------Y~~A~~~~~~lL 68 (128)
.+++|.++.+.. +..+++..+++.++.. | ...+.++.++..+..+|+ |++|+++++..+
T Consensus 75 ~~~la~~~~~~g---~~~~A~~~~~~al~~~-p---~~~~~~~~lg~~~~~~g~~~~a~~~~~~A~~~~~~A~~~~~~a~ 147 (172)
T PRK02603 75 LYNMGIIYASNG---EHDKALEYYHQALELN-P---KQPSALNNIAVIYHKRGEKAEEAGDQDEAEALFDKAAEYWKQAI 147 (172)
T ss_pred HHHHHHHHHHcC---CHHHHHHHHHHHHHhC-c---ccHHHHHHHHHHHHHcCChHhHhhCHHHHHHHHHHHHHHHHHHH
Confidence 355666666654 4567899999999887 6 568889999999999988 789999999999
Q ss_pred hcCCCCH
Q psy16295 69 AIESENS 75 (128)
Q Consensus 69 ~~eP~N~ 75 (128)
+.+|+|-
T Consensus 148 ~~~p~~~ 154 (172)
T PRK02603 148 RLAPNNY 154 (172)
T ss_pred hhCchhH
Confidence 9999994
No 66
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=96.42 E-value=0.025 Score=49.79 Aligned_cols=59 Identities=8% Similarity=-0.059 Sum_probs=41.1
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
+..+++..++..+..+ | ...+.++.+|..++++|+|++|+.+++..++..|++..+...
T Consensus 414 ~~~~A~~~~~kal~l~-P---~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~P~~~~~~~~ 472 (615)
T TIGR00990 414 EFAQAGKDYQKSIDLD-P---DFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNY 472 (615)
T ss_pred CHHHHHHHHHHHHHcC-c---cCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHH
Confidence 4456777777777776 5 456667777777777777777777777777777777655433
No 67
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=96.37 E-value=0.031 Score=50.33 Aligned_cols=70 Identities=17% Similarity=0.107 Sum_probs=58.2
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.+++|+.+.+.. +..+++..+++.+..+ | ...+.++.++..+.++|+|++|+..++.+++.+|++..+..
T Consensus 287 ~~~lg~~l~~~g---~~~eA~~~l~~al~l~-P---~~~~a~~~La~~l~~~G~~~eA~~~l~~al~~~P~~~~~~~ 356 (656)
T PRK15174 287 VTLYADALIRTG---QNEKAIPLLQQSLATH-P---DLPYVRAMYARALRQVGQYTAASDEFVQLAREKGVTSKWNR 356 (656)
T ss_pred HHHHHHHHHHCC---CHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccchHHHH
Confidence 467788887765 4457899999999887 6 56788999999999999999999999999999999865443
No 68
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=96.34 E-value=0.025 Score=50.11 Aligned_cols=82 Identities=13% Similarity=0.082 Sum_probs=74.6
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
+.+||.+|++-.+++ ++|.+|....+++ | ...+-|+|||-+|-++|+-.+|....-..+...-...||+....
T Consensus 377 ~~~~a~all~~g~~~---eai~~L~~~~~~~-p---~dp~~w~~LAqay~~~g~~~~a~~A~AE~~~~~G~~~~A~~~l~ 449 (484)
T COG4783 377 QLNLAQALLKGGKPQ---EAIRILNRYLFND-P---EDPNGWDLLAQAYAELGNRAEALLARAEGYALAGRLEQAIIFLM 449 (484)
T ss_pred HHHHHHHHHhcCChH---HHHHHHHHHhhcC-C---CCchHHHHHHHHHHHhCchHHHHHHHHHHHHhCCCHHHHHHHHH
Confidence 579999999999998 8999999999999 6 68999999999999999999999999999999999999998888
Q ss_pred HHHHHHhhh
Q psy16295 83 IVKKRMETE 91 (128)
Q Consensus 83 ~Ie~~i~kd 91 (128)
.-.++++.+
T Consensus 450 ~A~~~~~~~ 458 (484)
T COG4783 450 RASQQVKLG 458 (484)
T ss_pred HHHHhccCC
Confidence 877777553
No 69
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=96.31 E-value=0.012 Score=37.66 Aligned_cols=65 Identities=17% Similarity=0.155 Sum_probs=44.7
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhc---cCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKT---ETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~---~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
-+++|.++.+..+ ..++++.+++.+.- ..++....-.+++-+|..++++|+|++|+++.+..+++
T Consensus 8 ~~~la~~~~~~~~---~~~A~~~~~~al~~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i 75 (78)
T PF13424_consen 8 YNNLARVYRELGR---YDEALDYYEKALDIEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKALDI 75 (78)
T ss_dssp HHHHHHHHHHTT----HHHHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCC---HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhh
Confidence 4677777776543 34555555554422 10100245789999999999999999999999998875
No 70
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=96.28 E-value=0.008 Score=37.88 Aligned_cols=43 Identities=7% Similarity=0.108 Sum_probs=37.4
Q ss_pred HHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 48 AIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 48 Avg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
+-.|.+.++|++|+++++.+++++|++......+-.+.-+..+
T Consensus 2 ~~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~~g~ 44 (73)
T PF13371_consen 2 KQIYLQQEDYEEALEVLERALELDPDDPELWLQRARCLFQLGR 44 (73)
T ss_pred HHHHHhCCCHHHHHHHHHHHHHhCcccchhhHHHHHHHHHhcc
Confidence 4568899999999999999999999999999888887766543
No 71
>KOG1126|consensus
Probab=96.15 E-value=0.012 Score=53.51 Aligned_cols=78 Identities=14% Similarity=0.180 Sum_probs=66.0
Q ss_pred HHHHHHHHhccC--ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 3 QFNYSFCLVRSE--FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 3 ~F~YAwaLv~S~--~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
.|.||++|.... -.++..++..-++.-++-+ | +.-..||-|++.|.|++.|+.|.-+.+..+++.|.|.+-.-=
T Consensus 453 ~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~-~---rhYnAwYGlG~vy~Kqek~e~Ae~~fqkA~~INP~nsvi~~~ 528 (638)
T KOG1126|consen 453 RFAYAYTLLGHESIATEEFDKAMKSFRKALGVD-P---RHYNAWYGLGTVYLKQEKLEFAEFHFQKAVEINPSNSVILCH 528 (638)
T ss_pred ccchhhhhcCChhhhhHHHHhHHHHHHhhhcCC-c---hhhHHHHhhhhheeccchhhHHHHHHHhhhcCCccchhHHhh
Confidence 489999999864 3566778888888888888 6 789999999999999999999999999999999999986543
Q ss_pred HHHH
Q psy16295 81 ESIV 84 (128)
Q Consensus 81 k~~I 84 (128)
--.+
T Consensus 529 ~g~~ 532 (638)
T KOG1126|consen 529 IGRI 532 (638)
T ss_pred hhHH
Confidence 3333
No 72
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=96.13 E-value=0.027 Score=51.74 Aligned_cols=58 Identities=10% Similarity=0.126 Sum_probs=29.9
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
++..+|+.++.-. | +.-..+..+|....+++.+++|+..++++|+.+|+|..++.++.
T Consensus 104 ea~~~l~~~~~~~-P---d~~~a~~~~a~~L~~~~~~eeA~~~~~~~l~~~p~~~~~~~~~a 161 (694)
T PRK15179 104 EGLAVWRGIHQRF-P---DSSEAFILMLRGVKRQQGIEAGRAEIELYFSGGSSSAREILLEA 161 (694)
T ss_pred HHHHHHHHHHhhC-C---CcHHHHHHHHHHHHHhccHHHHHHHHHHHhhcCCCCHHHHHHHH
Confidence 3445555555554 4 34455555555555555555555555555555555555544443
No 73
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=96.12 E-value=0.016 Score=35.51 Aligned_cols=40 Identities=13% Similarity=0.160 Sum_probs=33.2
Q ss_pred HHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 45 YYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 45 YyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
|-||-+|.++||++.||..++.++ -+++.+|-..=+++++
T Consensus 3 LdLA~ayie~Gd~e~Ar~lL~evl-~~~~~~q~~eA~~LL~ 42 (44)
T TIGR03504 3 LDLARAYIEMGDLEGARELLEEVI-EEGDEAQRQEARALLA 42 (44)
T ss_pred hHHHHHHHHcCChHHHHHHHHHHH-HcCCHHHHHHHHHHHh
Confidence 568999999999999999999999 4777777666555544
No 74
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=96.11 E-value=0.018 Score=54.76 Aligned_cols=57 Identities=14% Similarity=0.163 Sum_probs=51.5
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
.+..+++..+++.++.+ | ...+.++.|+..++++|+|++|+.+++.+++.+|++...
T Consensus 283 g~~~~A~~~l~~aL~~~-P---~~~~a~~~Lg~~~~~~g~~~eA~~~l~~Al~~~p~~~~~ 339 (1157)
T PRK11447 283 GQGGKAIPELQQAVRAN-P---KDSEALGALGQAYSQQGDRARAVAQFEKALALDPHSSNR 339 (1157)
T ss_pred CCHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCccch
Confidence 45668999999999988 6 568999999999999999999999999999999998754
No 75
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=96.10 E-value=0.06 Score=41.28 Aligned_cols=81 Identities=12% Similarity=0.030 Sum_probs=69.3
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
.|=|..++++.. ..+..++|+.....+.-+ | +..+..+++|+++.++|+-++|++..+..+..--+++|=..|++
T Consensus 70 ~y~~gLG~~~Q~-~g~~~~AI~aY~~A~~L~-~---ddp~~~~~ag~c~L~lG~~~~A~~aF~~Ai~~~~~~~~~~~l~~ 144 (157)
T PRK15363 70 DYWFRLGECCQA-QKHWGEAIYAYGRAAQIK-I---DAPQAPWAAAECYLACDNVCYAIKALKAVVRICGEVSEHQILRQ 144 (157)
T ss_pred HHHHHHHHHHHH-HhhHHHHHHHHHHHHhcC-C---CCchHHHHHHHHHHHcCCHHHHHHHHHHHHHHhccChhHHHHHH
Confidence 455777788776 778899999999988888 6 67899999999999999999999999999998888888888888
Q ss_pred HHHHHH
Q psy16295 83 IVKKRM 88 (128)
Q Consensus 83 ~Ie~~i 88 (128)
..+.-.
T Consensus 145 ~A~~~L 150 (157)
T PRK15363 145 RAEKML 150 (157)
T ss_pred HHHHHH
Confidence 776543
No 76
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=96.07 E-value=0.032 Score=53.04 Aligned_cols=76 Identities=11% Similarity=0.043 Sum_probs=62.3
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
|.|..+..... +.|..++++.|+..++.+ | ...+..+.||-.|.++|++++|+.+++..++.+|+|.....+...
T Consensus 46 ~~f~~a~~~~~-~Gd~~~A~~~l~~Al~~d-P---~n~~~~~~LA~~yl~~g~~~~A~~~~~kAv~ldP~n~~~~~~La~ 120 (987)
T PRK09782 46 PRLDKALKAQK-NNDEATAIREFEYIHQQV-P---DNIPLTLYLAEAYRHFGHDDRARLLLEDQLKRHPGDARLERSLAA 120 (987)
T ss_pred HHHHHHHHHHh-CCCHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCcccHHHHHHHHH
Confidence 45566665554 455569999999999999 7 348888999999999999999999999999999999888776555
Q ss_pred H
Q psy16295 84 V 84 (128)
Q Consensus 84 I 84 (128)
|
T Consensus 121 i 121 (987)
T PRK09782 121 I 121 (987)
T ss_pred h
Confidence 5
No 77
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=96.07 E-value=0.044 Score=43.84 Aligned_cols=74 Identities=8% Similarity=-0.027 Sum_probs=57.2
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.-+..|..+.--.++.+-.++.....+....+ . ..+++.+..|+.++..||+++|..+++.+|+..|+|..+..
T Consensus 8 a~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~~~e~~~~~a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~ 81 (355)
T cd05804 8 GHAAAALLLLLGGERPAAAAKAAAAAQALAAR-A---TERERAHVEALSAWIAGDLPKALALLEQLLDDYPRDLLALK 81 (355)
T ss_pred HHHHHHHHHHhcCCcchHHHHHHHHHHHhccC-C---CHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCcHHHHH
Confidence 34556666655566666556666655555555 2 57999999999999999999999999999999999997755
No 78
>PLN02789 farnesyltranstransferase
Probab=96.02 E-value=0.031 Score=46.61 Aligned_cols=62 Identities=11% Similarity=0.183 Sum_probs=53.4
Q ss_pred HHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 20 RKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
.+.+++++..+..+ | ..-..+++.+....++|+|++|+.+++.+|+.+|.|..|-.-+..+-
T Consensus 125 ~~el~~~~kal~~d-p---kNy~AW~~R~w~l~~l~~~~eeL~~~~~~I~~d~~N~sAW~~R~~vl 186 (320)
T PLN02789 125 NKELEFTRKILSLD-A---KNYHAWSHRQWVLRTLGGWEDELEYCHQLLEEDVRNNSAWNQRYFVI 186 (320)
T ss_pred HHHHHHHHHHHHhC-c---ccHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHCCCchhHHHHHHHHH
Confidence 45688888888888 5 78999999999999999999999999999999999988776665553
No 79
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=96.00 E-value=0.035 Score=46.33 Aligned_cols=67 Identities=12% Similarity=0.050 Sum_probs=59.8
Q ss_pred cCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 13 SEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 13 S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
-.++++..+-|.-|+.-+..+ | .+.+-|..|+=.|.++++++.|...+...++++|+|++....--.
T Consensus 132 ~~~~~~~~~l~a~Le~~L~~n-P---~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~ae 198 (287)
T COG4235 132 PPAEQEMEALIARLETHLQQN-P---GDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAE 198 (287)
T ss_pred CCCcccHHHHHHHHHHHHHhC-C---CCchhHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence 356677888999999999999 7 789999999999999999999999999999999999998765433
No 80
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=95.90 E-value=0.059 Score=49.50 Aligned_cols=69 Identities=12% Similarity=-0.029 Sum_probs=56.0
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
..+||-.|.+- +...++....+.++..+ | .+.+.++.+|...-++|+|++|..+++.+|.-.|++.++.
T Consensus 123 ~~~~a~~L~~~---~~~eeA~~~~~~~l~~~-p---~~~~~~~~~a~~l~~~g~~~~A~~~y~~~~~~~p~~~~~~ 191 (694)
T PRK15179 123 FILMLRGVKRQ---QGIEAGRAEIELYFSGG-S---SSAREILLEAKSWDEIGQSEQADACFERLSRQHPEFENGY 191 (694)
T ss_pred HHHHHHHHHHh---ccHHHHHHHHHHHhhcC-C---CCHHHHHHHHHHHHHhcchHHHHHHHHHHHhcCCCcHHHH
Confidence 45677777664 45667888888888888 6 6899999999999999999999999999999888777664
No 81
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=95.89 E-value=0.016 Score=32.55 Aligned_cols=29 Identities=24% Similarity=0.211 Sum_probs=25.6
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
+++.-||..+..+|+|++|+.+++..+++
T Consensus 3 ~~~~~la~~~~~~g~~~~A~~~~~~al~~ 31 (42)
T PF13374_consen 3 SALNNLANAYRAQGRYEEALELLEEALEI 31 (42)
T ss_dssp HHHHHHHHHHHHCT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhhhcchhhHHHHHHHHH
Confidence 57889999999999999999999988875
No 82
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=95.88 E-value=0.082 Score=38.75 Aligned_cols=68 Identities=10% Similarity=0.064 Sum_probs=51.3
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHh-------hhhchH-------HHHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNA-------RIKEYT-------KSLKYCRAFL 68 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~y-------klgdY~-------~A~~~~~~lL 68 (128)
-+++|.++.+..+. .+++..++..++.+ | ...+.++.++..+. ++|+|+ +|+.+++..+
T Consensus 75 ~~~lg~~~~~~g~~---~eA~~~~~~Al~~~-~---~~~~~~~~la~i~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~a~ 147 (168)
T CHL00033 75 LYNIGLIHTSNGEH---TKALEYYFQALERN-P---FLPQALNNMAVICHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAI 147 (168)
T ss_pred HHHHHHHHHHcCCH---HHHHHHHHHHHHhC-c---CcHHHHHHHHHHHHHhhHHHHHcccHHHHHHHHHHHHHHHHHHH
Confidence 45666666665544 57999999988887 5 45788999999999 777766 6666677899
Q ss_pred hcCCCCHHH
Q psy16295 69 AIESENSQV 77 (128)
Q Consensus 69 ~~eP~N~QA 77 (128)
..+|++.+.
T Consensus 148 ~~~p~~~~~ 156 (168)
T CHL00033 148 ALAPGNYIE 156 (168)
T ss_pred HhCcccHHH
Confidence 999988754
No 83
>PLN02789 farnesyltranstransferase
Probab=95.81 E-value=0.077 Score=44.24 Aligned_cols=80 Identities=9% Similarity=0.159 Sum_probs=62.2
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhh-chHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIK-EYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklg-dY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
++|-.++..+... -.+++.+.++.+.-+ | ..-..++|.+....+++ +|++++.+++.+++.+|.|.|+=.=+.
T Consensus 40 ~~~~ra~l~~~e~--serAL~lt~~aI~ln-P---~~ytaW~~R~~iL~~L~~~l~eeL~~~~~~i~~npknyqaW~~R~ 113 (320)
T PLN02789 40 MDYFRAVYASDER--SPRALDLTADVIRLN-P---GNYTVWHFRRLCLEALDADLEEELDFAEDVAEDNPKNYQIWHHRR 113 (320)
T ss_pred HHHHHHHHHcCCC--CHHHHHHHHHHHHHC-c---hhHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHCCcchHHhHHHH
Confidence 4455556555443 348999999999888 6 56788899999989998 689999999999999999999877666
Q ss_pred HHHHHHh
Q psy16295 83 IVKKRME 89 (128)
Q Consensus 83 ~Ie~~i~ 89 (128)
.+-.++.
T Consensus 114 ~~l~~l~ 120 (320)
T PLN02789 114 WLAEKLG 120 (320)
T ss_pred HHHHHcC
Confidence 5555554
No 84
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=95.75 E-value=0.038 Score=52.54 Aligned_cols=61 Identities=8% Similarity=-0.006 Sum_probs=54.2
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
..+..++++.+++.++.+ | ...+.+|.+|..++++|+|++|+..++.+++.+|+|.++.-.
T Consensus 474 ~g~~~eA~~~~~~Al~~~-P---~~~~~~~~LA~~~~~~G~~~~A~~~l~~al~~~P~~~~~~~a 534 (1157)
T PRK11447 474 QGKWAQAAELQRQRLALD-P---GSVWLTYRLAQDLRQAGQRSQADALMRRLAQQKPNDPEQVYA 534 (1157)
T ss_pred CCCHHHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHH
Confidence 356678999999999998 7 567899999999999999999999999999999999876543
No 85
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=95.68 E-value=0.036 Score=44.79 Aligned_cols=32 Identities=13% Similarity=0.071 Sum_probs=15.4
Q ss_pred HHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 47 LAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 47 LAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
++..+++.|++++|.++++.+++.+|++.++.
T Consensus 186 la~~~~~~~~~~~A~~~~~~al~~~p~~~~~~ 217 (389)
T PRK11788 186 LAQQALARGDLDAARALLKKALAADPQCVRAS 217 (389)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHhHCcCCHHHH
Confidence 34444445555555555555555555444433
No 86
>KOG2002|consensus
Probab=95.61 E-value=0.059 Score=51.29 Aligned_cols=80 Identities=19% Similarity=0.221 Sum_probs=67.4
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
..|.|+|++-.. ..+.+|++.+..++.-++ .++++.+-|||=++|+-+.|.+|..+....++.-|.|.+++-=..
T Consensus 683 ~lNlah~~~e~~---qy~~AIqmYe~~lkkf~~--~~~~~vl~~Lara~y~~~~~~eak~~ll~a~~~~p~~~~v~FN~a 757 (1018)
T KOG2002|consen 683 WLNLAHCYVEQG---QYRLAIQMYENCLKKFYK--KNRSEVLHYLARAWYEAGKLQEAKEALLKARHLAPSNTSVKFNLA 757 (1018)
T ss_pred eeeHHHHHHHHH---HHHHHHHHHHHHHHHhcc--cCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCccchHHhHHH
Confidence 357888888654 677999999999999776 689999999999999999999999999999999999999664433
Q ss_pred HHHHH
Q psy16295 83 IVKKR 87 (128)
Q Consensus 83 ~Ie~~ 87 (128)
++-.+
T Consensus 758 ~v~kk 762 (1018)
T KOG2002|consen 758 LVLKK 762 (1018)
T ss_pred HHHHH
Confidence 33333
No 87
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=95.52 E-value=0.032 Score=45.77 Aligned_cols=70 Identities=13% Similarity=0.174 Sum_probs=53.4
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
+.--||--+.... +.++.+..+++++...- + ....-+.-+|+++..+|+|++|...++..|+.+|.|+.+
T Consensus 168 qLa~awv~l~~g~-e~~~~A~y~f~El~~~~-~---~t~~~lng~A~~~l~~~~~~eAe~~L~~al~~~~~~~d~ 237 (290)
T PF04733_consen 168 QLAEAWVNLATGG-EKYQDAFYIFEELSDKF-G---STPKLLNGLAVCHLQLGHYEEAEELLEEALEKDPNDPDT 237 (290)
T ss_dssp HHHHHHHHHHHTT-TCCCHHHHHHHHHHCCS------SHHHHHHHHHHHHHCT-HHHHHHHHHHHCCC-CCHHHH
T ss_pred HHHHHHHHHHhCc-hhHHHHHHHHHHHHhcc-C---CCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhccCCHHH
Confidence 3444665555543 66788999999977664 3 467889999999999999999999999999999998764
No 88
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=95.46 E-value=0.21 Score=38.29 Aligned_cols=81 Identities=11% Similarity=0.168 Sum_probs=61.8
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhc-----------hHHHHHHHHHHHhc
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKE-----------YTKSLKYCRAFLAI 70 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgd-----------Y~~A~~~~~~lL~~ 70 (128)
++|..|++.-+.. +...++..++..++.. |+.....+.+|.++++++++.+ ..+|...++.+++.
T Consensus 44 A~l~la~a~y~~~---~y~~A~~~~~~fi~~y-P~~~~~~~A~Y~~g~~~~~~~~~~~~~~~D~~~~~~A~~~~~~li~~ 119 (203)
T PF13525_consen 44 AQLMLAYAYYKQG---DYEEAIAAYERFIKLY-PNSPKADYALYMLGLSYYKQIPGILRSDRDQTSTRKAIEEFEELIKR 119 (203)
T ss_dssp HHHHHHHHHHHTT----HHHHHHHHHHHHHH--TT-TTHHHHHHHHHHHHHHHHHHHH-TT---HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHcC---CHHHHHHHHHHHHHHC-CCCcchhhHHHHHHHHHHHhCccchhcccChHHHHHHHHHHHHHHHH
Confidence 4678888888776 4557888888888776 6545778999999999998754 34899999999999
Q ss_pred CCCCHHHHHHHHHHHH
Q psy16295 71 ESENSQVQHLESIVKK 86 (128)
Q Consensus 71 eP~N~QA~~Lk~~Ie~ 86 (128)
-|+.+-+..-++.|..
T Consensus 120 yP~S~y~~~A~~~l~~ 135 (203)
T PF13525_consen 120 YPNSEYAEEAKKRLAE 135 (203)
T ss_dssp -TTSTTHHHHHHHHHH
T ss_pred CcCchHHHHHHHHHHH
Confidence 9999988777665543
No 89
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=95.44 E-value=0.038 Score=52.28 Aligned_cols=56 Identities=13% Similarity=0.057 Sum_probs=47.3
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
..|+-+...+.+. + .+++.||.||.+|-++|++++|...|+.+|+.+|+|..+..=
T Consensus 100 ~~ve~~~~~i~~~-~---~~k~Al~~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn 155 (906)
T PRK14720 100 AIVEHICDKILLY-G---ENKLALRTLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKK 155 (906)
T ss_pred hHHHHHHHHHHhh-h---hhhHHHHHHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHH
Confidence 5666666656555 3 577899999999999999999999999999999999998653
No 90
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=95.30 E-value=0.053 Score=47.82 Aligned_cols=39 Identities=8% Similarity=0.076 Sum_probs=32.9
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
+..+.++-++++++++|+|++|+..++..|+++|++.++
T Consensus 73 ~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~aeA 111 (453)
T PLN03098 73 KTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEA 111 (453)
T ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCchHH
Confidence 567788888888888888888888888888888888766
No 91
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=95.18 E-value=0.048 Score=33.76 Aligned_cols=32 Identities=13% Similarity=0.240 Sum_probs=28.0
Q ss_pred hhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 52 ARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 52 yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
.+.|+|++|++.++.+++.+|+|..+.-..-.
T Consensus 2 l~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~ 33 (68)
T PF14559_consen 2 LKQGDYDEAIELLEKALQRNPDNPEARLLLAQ 33 (68)
T ss_dssp HHTTHHHHHHHHHHHHHHHTTTSHHHHHHHHH
T ss_pred hhccCHHHHHHHHHHHHHHCCCCHHHHHHHHH
Confidence 57899999999999999999999998875433
No 92
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=95.15 E-value=0.17 Score=36.42 Aligned_cols=54 Identities=17% Similarity=0.001 Sum_probs=36.6
Q ss_pred HHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 20 RKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
....+.++.+.++. |++.-...-.+.+|-.++..|+|++|...++.++...|++
T Consensus 28 ~~~~~~~~~l~~~~-~~s~ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~ 81 (145)
T PF09976_consen 28 AKAEAAAEQLAKDY-PSSPYAALAALQLAKAAYEQGDYDEAKAALEKALANAPDP 81 (145)
T ss_pred HHHHHHHHHHHHHC-CCChHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCH
Confidence 33444566666665 4323456777778888888888888888888888877655
No 93
>KOG0553|consensus
Probab=95.03 E-value=0.14 Score=43.21 Aligned_cols=67 Identities=9% Similarity=0.184 Sum_probs=52.2
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
...+|+=.+.-+.-+ | .--..+-=|.++++-+|+|++|.+++...|++||+|.-.++=++..+.+..
T Consensus 131 ~~~AVkDce~Al~iD-p---~yskay~RLG~A~~~~gk~~~A~~aykKaLeldP~Ne~~K~nL~~Ae~~l~ 197 (304)
T KOG0553|consen 131 YEDAVKDCESALSID-P---HYSKAYGRLGLAYLALGKYEEAIEAYKKALELDPDNESYKSNLKIAEQKLN 197 (304)
T ss_pred hHHHHHHHHHHHhcC-h---HHHHHHHHHHHHHHccCcHHHHHHHHHhhhccCCCcHHHHHHHHHHHHHhc
Confidence 345666666666666 5 556667778999999999999999999999999999976666666666554
No 94
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=95.03 E-value=0.095 Score=46.25 Aligned_cols=62 Identities=16% Similarity=0.173 Sum_probs=52.5
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHH---HHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRD---YLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd---~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
..+|++.+|.+- .+..++|..++.-+..+ | +..+ .+|.+|.+|.++|++++|+.+++..++.
T Consensus 77 a~~NLG~AL~~l---GryeEAIa~f~rALeL~-P---d~aeA~~A~yNLAcaya~LGr~dEAla~LrrALel 141 (453)
T PLN03098 77 DAVNLGLSLFSK---GRVKDALAQFETALELN-P---NPDEAQAAYYNKACCHAYREEGKKAADCLRTALRD 141 (453)
T ss_pred HHHHHHHHHHHc---CCHHHHHHHHHHHHhhC-C---CchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 457888888775 46668999999999998 7 3444 4999999999999999999999998876
No 95
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=95.00 E-value=0.12 Score=42.62 Aligned_cols=71 Identities=10% Similarity=0.086 Sum_probs=60.6
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
-|+-|..+++|.+ ...+..-|.+-++.- |+.....+..|+|+-.+|.+|||+.|-.....+.+--|+++-|
T Consensus 144 ~Y~~A~~~~ksgd---y~~A~~~F~~fi~~Y-P~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KA 214 (262)
T COG1729 144 LYNAALDLYKSGD---YAEAEQAFQAFIKKY-PNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKA 214 (262)
T ss_pred HHHHHHHHHHcCC---HHHHHHHHHHHHHcC-CCCcccchhHHHHHHHHHhcccchHHHHHHHHHHHhCCCCCCC
Confidence 3788999999987 556888888877775 6556789999999999999999999999999999999988876
No 96
>PF14863 Alkyl_sulf_dimr: Alkyl sulfatase dimerisation; PDB: 2YHE_C 2CG2_A 2CG3_A 2CFU_A 2CFZ_A.
Probab=94.96 E-value=0.12 Score=38.78 Aligned_cols=51 Identities=10% Similarity=0.124 Sum_probs=42.0
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
-.|-+.-.|-.++..|||.-|...++.++..||+|.+|+.|+.-.-+++..
T Consensus 69 G~d~vl~~A~~~~~~gd~~wA~~L~d~l~~adp~n~~ar~l~A~al~~lg~ 119 (141)
T PF14863_consen 69 GADKVLERAQAALAAGDYQWAAELLDHLVFADPDNEEARQLKADALEQLGY 119 (141)
T ss_dssp CHHHHHHHHHHHHHCT-HHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHHH
Confidence 467778889999999999999999999999999999999999877666543
No 97
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=94.91 E-value=0.12 Score=41.29 Aligned_cols=51 Identities=2% Similarity=-0.090 Sum_probs=32.1
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE 73 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~ 73 (128)
..+++..+++.+... | .....++.+|..++..|++++|+.+++..++..|.
T Consensus 130 ~~~A~~~~~~al~~~-p---~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~~ 180 (355)
T cd05804 130 YDRAEEAARRALELN-P---DDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDC 180 (355)
T ss_pred HHHHHHHHHHHHhhC-C---CCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhccCC
Confidence 445666666666665 4 34555666666666666666666666666666654
No 98
>PRK14574 hmsH outer membrane protein; Provisional
Probab=94.83 E-value=0.08 Score=49.51 Aligned_cols=61 Identities=11% Similarity=0.079 Sum_probs=49.5
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
..+..++|++++++++.+ | ...+.++.++..+...+++++|+..++.+...+|+|...+.+
T Consensus 115 ~gdyd~Aiely~kaL~~d-P---~n~~~l~gLa~~y~~~~q~~eAl~~l~~l~~~dp~~~~~l~l 175 (822)
T PRK14574 115 EKRWDQALALWQSSLKKD-P---TNPDLISGMIMTQADAGRGGVVLKQATELAERDPTVQNYMTL 175 (822)
T ss_pred cCCHHHHHHHHHHHHhhC-C---CCHHHHHHHHHHHhhcCCHHHHHHHHHHhcccCcchHHHHHH
Confidence 345558888999888888 6 457888888888888899999999988888888888777444
No 99
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=94.80 E-value=0.05 Score=34.76 Aligned_cols=32 Identities=22% Similarity=0.261 Sum_probs=28.1
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
..-.+++.+|..+.++|+|++|+.+++..|++
T Consensus 3 ~~a~~~~~la~~~~~~~~~~~A~~~~~~al~~ 34 (78)
T PF13424_consen 3 DTANAYNNLARVYRELGRYDEALDYYEKALDI 34 (78)
T ss_dssp HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 45689999999999999999999999998865
No 100
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=94.75 E-value=0.027 Score=32.09 Aligned_cols=32 Identities=9% Similarity=0.103 Sum_probs=26.4
Q ss_pred HHHhhccCccccchHHHHHHHHHHHhhhhchHHHHH
Q psy16295 27 EELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLK 62 (128)
Q Consensus 27 ~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~ 62 (128)
+..++.+ | ...+.++.||..+++.|++++|++
T Consensus 3 ~kAie~~-P---~n~~a~~nla~~~~~~g~~~~A~~ 34 (34)
T PF13431_consen 3 KKAIELN-P---NNAEAYNNLANLYLNQGDYEEAIA 34 (34)
T ss_pred HHHHHHC-C---CCHHHHHHHHHHHHHCcCHHhhcC
Confidence 3444557 7 789999999999999999999973
No 101
>KOG0543|consensus
Probab=94.63 E-value=0.047 Score=47.42 Aligned_cols=39 Identities=23% Similarity=0.408 Sum_probs=34.7
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
+-=|+--||+++.|+++|.+|+..|+..|.++|+|.-|.
T Consensus 256 k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~KAL 294 (397)
T KOG0543|consen 256 KLACHLNLAACYLKLKEYKEAIESCNKVLELDPNNVKAL 294 (397)
T ss_pred HHHHhhHHHHHHHhhhhHHHHHHHHHHHHhcCCCchhHH
Confidence 455777899999999999999999999999999997653
No 102
>KOG1125|consensus
Probab=94.57 E-value=0.16 Score=45.97 Aligned_cols=68 Identities=16% Similarity=0.222 Sum_probs=58.7
Q ss_pred HHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 5 NYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 5 ~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.=++-|++-. ++.+++-.||.-++.+ | ..-|.|-+|++.|....+=..|..+...+|+++|+|.+|..
T Consensus 290 ~eG~~lm~nG---~L~~A~LafEAAVkqd-P---~haeAW~~LG~~qaENE~E~~ai~AL~rcl~LdP~NleaLm 357 (579)
T KOG1125|consen 290 KEGCNLMKNG---DLSEAALAFEAAVKQD-P---QHAEAWQKLGITQAENENEQNAISALRRCLELDPTNLEALM 357 (579)
T ss_pred HHHHHHHhcC---CchHHHHHHHHHHhhC-h---HHHHHHHHhhhHhhhccchHHHHHHHHHHHhcCCccHHHHH
Confidence 3344455443 3889999999999999 7 78999999999999999999999999999999999998753
No 103
>KOG2076|consensus
Probab=94.40 E-value=0.17 Score=47.75 Aligned_cols=70 Identities=16% Similarity=0.264 Sum_probs=57.8
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
++.|-+|.. ......++.+|..+.... + .+....||-+|..+--+|+|++|..+++.+|..+|+|--++-
T Consensus 418 ~d~a~al~~---~~~~~~Al~~l~~i~~~~-~--~~~~~vw~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~~D~Ri 487 (895)
T KOG2076|consen 418 LDLADALTN---IGKYKEALRLLSPITNRE-G--YQNAFVWYKLARCYMELGEYEEAIEFYEKVLILAPDNLDARI 487 (895)
T ss_pred HHHHHHHHh---cccHHHHHHHHHHHhcCc-c--ccchhhhHHHHHHHHHHhhHHHHHHHHHHHHhcCCCchhhhh
Confidence 344555544 445568999999999987 4 466889999999999999999999999999999999987753
No 104
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=94.36 E-value=0.44 Score=32.80 Aligned_cols=59 Identities=15% Similarity=0.127 Sum_probs=45.8
Q ss_pred HHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC--HHHHHHHHHH
Q psy16295 22 GILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN--SQVQHLESIV 84 (128)
Q Consensus 22 GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N--~QA~~Lk~~I 84 (128)
.|+-|+.-+..+ | ..-|..|-+|..+...|+|+.|+.-+-.+++.+|++ .+++...-.|
T Consensus 7 ~~~al~~~~a~~-P---~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~~~~~ar~~ll~~ 67 (90)
T PF14561_consen 7 DIAALEAALAAN-P---DDLDARYALADALLAAGDYEEALDQLLELVRRDRDYEDDAARKRLLDI 67 (90)
T ss_dssp HHHHHHHHHHHS-T---T-HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC-TTCCCCHHHHHHHHH
T ss_pred cHHHHHHHHHcC-C---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccccccHHHHHHHHH
Confidence 466677778888 6 577999999999999999999999999999999988 5554443333
No 105
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=94.20 E-value=0.17 Score=36.42 Aligned_cols=65 Identities=17% Similarity=0.056 Sum_probs=47.8
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
..+..++++.|+.++.+. |+...+.-..+.||-.++..|+|++|+..++.. .-+|-.+.+..|+-
T Consensus 61 ~g~~~~A~~~l~~~~~~~-~d~~l~~~a~l~LA~~~~~~~~~d~Al~~L~~~-~~~~~~~~~~~~~G 125 (145)
T PF09976_consen 61 QGDYDEAKAALEKALANA-PDPELKPLARLRLARILLQQGQYDEALATLQQI-PDEAFKALAAELLG 125 (145)
T ss_pred CCCHHHHHHHHHHHHhhC-CCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHhc-cCcchHHHHHHHHH
Confidence 467789999999999987 432455678888999999999999999999772 33333344444433
No 106
>KOG1156|consensus
Probab=94.16 E-value=0.24 Score=45.64 Aligned_cols=65 Identities=20% Similarity=0.233 Sum_probs=49.5
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
+|-+..+--++++ + .+.=||.-+++-+-.-++|++|.+++...|.+||+|.|-..=..+.+-.|.
T Consensus 59 ea~~~vr~glr~d-~---~S~vCwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmR 123 (700)
T KOG1156|consen 59 EAYELVRLGLRND-L---KSHVCWHVLGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMR 123 (700)
T ss_pred HHHHHHHHHhccC-c---ccchhHHHHHHHHhhhhhHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHHH
Confidence 3444444444444 3 677899999999999999999999999999999999997655555554443
No 107
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=94.12 E-value=0.13 Score=45.86 Aligned_cols=45 Identities=20% Similarity=0.302 Sum_probs=40.0
Q ss_pred HHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKR 87 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~ 87 (128)
++||||..|-++|+|++|+.|++..+++.|....---+|..|-+.
T Consensus 196 ~~~~lAqhyd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilKh 240 (517)
T PF12569_consen 196 TLYFLAQHYDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILKH 240 (517)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHH
Confidence 679999999999999999999999999999998877777666544
No 108
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=94.07 E-value=0.28 Score=38.80 Aligned_cols=72 Identities=14% Similarity=0.193 Sum_probs=51.3
Q ss_pred HHHHHHHhccCC----hhhHHHHHHHHHHHhhccCcc--ccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 4 FNYSFCLVRSEF----CADIRKGILLLEELFKTETED--IDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 4 F~YAwaLv~S~~----~~d~~~GI~LL~~l~~~~~~~--~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.+.||..=-..+ ..-.+++.+.+++.+..+..+ ....--.+|++|--++|+|+|++|++++..++..--.+.
T Consensus 122 LrlAWlyR~~~~~~~E~~fl~~Al~~y~~a~~~e~~~~~~~~~~~l~YLigeL~rrlg~~~eA~~~fs~vi~~~~~s~ 199 (214)
T PF09986_consen 122 LRLAWLYRDLGDEENEKRFLRKALEFYEEAYENEDFPIEGMDEATLLYLIGELNRRLGNYDEAKRWFSRVIGSKKASK 199 (214)
T ss_pred HHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHhCcCCCCCchHHHHHHHHHHHHHHhCCHHHHHHHHHHHHcCCCCCC
Confidence 345665443333 234568889999888775221 123456889999999999999999999999998764444
No 109
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=93.92 E-value=0.41 Score=40.29 Aligned_cols=65 Identities=11% Similarity=-0.004 Sum_probs=44.8
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHH-HHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLY-YLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LY-yLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
..+..++.+.|++..+.+ | +.+-... ..+--+...|+|+.|+.+.+.+++.+|+|+.+..+...+
T Consensus 131 ~g~~~~A~~~l~~A~~~~-~---~~~~~~~l~~a~l~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll~~~ 196 (398)
T PRK10747 131 RGDEARANQHLERAAELA-D---NDQLPVEITRVRIQLARNENHAARHGVDKLLEVAPRHPEVLRLAEQA 196 (398)
T ss_pred CCCHHHHHHHHHHHHhcC-C---cchHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHH
Confidence 344556777777777766 4 2222222 336677888999999999999999999998776665433
No 110
>KOG4162|consensus
Probab=93.67 E-value=0.2 Score=46.74 Aligned_cols=65 Identities=15% Similarity=0.198 Sum_probs=54.2
Q ss_pred HHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 6 YSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 6 YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.|.+|....++. +...-.+|.+-++-+ | ..+|.||||+-.+-++||.+.|-.+.+..++.||.||
T Consensus 724 la~~lle~G~~~-la~~~~~L~dalr~d-p---~n~eaW~~LG~v~k~~Gd~~~Aaecf~aa~qLe~S~P 788 (799)
T KOG4162|consen 724 LAELLLELGSPR-LAEKRSLLSDALRLD-P---LNHEAWYYLGEVFKKLGDSKQAAECFQAALQLEESNP 788 (799)
T ss_pred HHHHHHHhCCcc-hHHHHHHHHHHHhhC-C---CCHHHHHHHHHHHHHccchHHHHHHHHHHHhhccCCC
Confidence 566677766444 233334888999999 7 7899999999999999999999999999999999986
No 111
>PRK11906 transcriptional regulator; Provisional
Probab=93.56 E-value=0.55 Score=41.61 Aligned_cols=74 Identities=12% Similarity=0.063 Sum_probs=53.2
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
|..+.++..... ...|+.+|+.-..-+ | +.-+-+||.|+.++-.|+.++|++.++..|+.+|.-.-+--+|.-
T Consensus 342 ~~~g~~~~~~~~---~~~a~~~f~rA~~L~-P---n~A~~~~~~~~~~~~~G~~~~a~~~i~~alrLsP~~~~~~~~~~~ 414 (458)
T PRK11906 342 AIMGLITGLSGQ---AKVSHILFEQAKIHS-T---DIASLYYYRALVHFHNEKIEEARICIDKSLQLEPRRRKAVVIKEC 414 (458)
T ss_pred HHHHHHHHhhcc---hhhHHHHHHHHhhcC-C---ccHHHHHHHHHHHHHcCCHHHHHHHHHHHhccCchhhHHHHHHHH
Confidence 344444444443 556777777777666 6 577888888888888888888888888888888877777777766
Q ss_pred H
Q psy16295 84 V 84 (128)
Q Consensus 84 I 84 (128)
|
T Consensus 415 ~ 415 (458)
T PRK11906 415 V 415 (458)
T ss_pred H
Confidence 6
No 112
>KOG0548|consensus
Probab=92.85 E-value=0.65 Score=41.86 Aligned_cols=71 Identities=13% Similarity=0.157 Sum_probs=56.4
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
..|.+.+|..+.+-++.+ | ......--.|.+|.++++|.+|++-|+..++.+|+|.-+-.=|-.+...|++
T Consensus 371 ~gdy~~Av~~YteAIkr~-P---~Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~~~kgy~RKg~al~~mk~ 441 (539)
T KOG0548|consen 371 KGDYPEAVKHYTEAIKRD-P---EDARLYSNRAACYLKLGEYPEALKDAKKCIELDPNFIKAYLRKGAALRAMKE 441 (539)
T ss_pred ccCHHHHHHHHHHHHhcC-C---chhHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCchHHHHHHHHHHHHHHHHH
Confidence 356678999999988888 7 4555666789999999999999999999999999998776555555444443
No 113
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=92.78 E-value=1.8 Score=34.55 Aligned_cols=79 Identities=5% Similarity=0.079 Sum_probs=61.6
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhh---------------ch---HHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIK---------------EY---TKSLKYC 64 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklg---------------dY---~~A~~~~ 64 (128)
.+.+|.+..+..+ ...++..++..++.. |+.....+.+|.++++++.++ |- .+|.+.+
T Consensus 72 ~l~la~ayy~~~~---y~~A~~~~e~fi~~~-P~~~~~~~a~Y~~g~~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~~ 147 (243)
T PRK10866 72 QLDLIYAYYKNAD---LPLAQAAIDRFIRLN-PTHPNIDYVLYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFRDF 147 (243)
T ss_pred HHHHHHHHHhcCC---HHHHHHHHHHHHHhC-cCCCchHHHHHHHHHhhhhcchhhhhhccCCCccccCHHHHHHHHHHH
Confidence 5778888888765 558999999998887 654567888999999987775 11 4677899
Q ss_pred HHHHhcCCCCHHHHHHHHHHH
Q psy16295 65 RAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 65 ~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
+.+++.-|+-+=+..-+..+.
T Consensus 148 ~~li~~yP~S~ya~~A~~rl~ 168 (243)
T PRK10866 148 SKLVRGYPNSQYTTDATKRLV 168 (243)
T ss_pred HHHHHHCcCChhHHHHHHHHH
Confidence 999999999887777666443
No 114
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=92.77 E-value=0.34 Score=40.01 Aligned_cols=56 Identities=7% Similarity=0.175 Sum_probs=36.7
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
..+++..++...+-. | ..-+.+.-+++++-++|+.++|+..+.+.|++.|+++.+.
T Consensus 116 ~~~A~~~~rkA~~l~-p---~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~p~~~ 171 (257)
T COG5010 116 FGEAVSVLRKAARLA-P---TDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIA 171 (257)
T ss_pred hHHHHHHHHHHhccC-C---CChhhhhHHHHHHHHccChhHHHHHHHHHHHhccCCchhh
Confidence 345666666666666 4 3566666677777777777777777777777777666654
No 115
>PRK11906 transcriptional regulator; Provisional
Probab=92.54 E-value=1.1 Score=39.68 Aligned_cols=75 Identities=11% Similarity=0.102 Sum_probs=61.1
Q ss_pred HHHHhccCChhhHHHHHHHHHHHh---hccCccccchHHHHHHHHHHHhhh---------hchHHHHHHHHHHHhcCCCC
Q psy16295 7 SFCLVRSEFCADIRKGILLLEELF---KTETEDIDRKRDYLYYLAIGNARI---------KEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 7 AwaLv~S~~~~d~~~GI~LL~~l~---~~~~~~~~~~rd~LYyLAvg~ykl---------gdY~~A~~~~~~lL~~eP~N 74 (128)
|...++...++++.++..+|+..+ .-+ | .--.++-.+|.+|.-. .+-.+|++..+..++++|+|
T Consensus 262 g~~~~~~~t~~~~~~Al~lf~ra~~~~~ld-p---~~a~a~~~lA~~h~~~~~~g~~~~~~~~~~a~~~A~rAveld~~D 337 (458)
T PRK11906 262 GKKELYDFTPESIYRAMTIFDRLQNKSDIQ-T---LKTECYCLLAECHMSLALHGKSELELAAQKALELLDYVSDITTVD 337 (458)
T ss_pred HHHHhhccCHHHHHHHHHHHHHHhhcccCC-c---ccHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHHHHHHHhcCCCC
Confidence 566778889999999999999999 555 5 4566777777777654 46678999999999999999
Q ss_pred HHHHHHHHHHH
Q psy16295 75 SQVQHLESIVK 85 (128)
Q Consensus 75 ~QA~~Lk~~Ie 85 (128)
+||..+.-.+-
T Consensus 338 a~a~~~~g~~~ 348 (458)
T PRK11906 338 GKILAIMGLIT 348 (458)
T ss_pred HHHHHHHHHHH
Confidence 99998877753
No 116
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=92.12 E-value=0.25 Score=42.65 Aligned_cols=44 Identities=9% Similarity=0.126 Sum_probs=28.0
Q ss_pred HHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHH
Q psy16295 20 RKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAF 67 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~l 67 (128)
..++++.++...-. | ..-++||+||..|.++|||++|+-.++.+
T Consensus 251 ~lAL~iAk~av~ls-P---~~f~~W~~La~~Yi~~~d~e~ALlaLNs~ 294 (395)
T PF09295_consen 251 ELALEIAKKAVELS-P---SEFETWYQLAECYIQLGDFENALLALNSC 294 (395)
T ss_pred HHHHHHHHHHHHhC-c---hhHHHHHHHHHHHHhcCCHHHHHHHHhcC
Confidence 34555555555555 4 46677777777777777777777665543
No 117
>KOG0553|consensus
Probab=92.06 E-value=0.39 Score=40.51 Aligned_cols=56 Identities=20% Similarity=0.340 Sum_probs=42.8
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
.+.+++|..-.+-+.-+ | ..-=|+---|-+|.+||+|+.|.+-|+..|++||.-.-
T Consensus 95 ~~Y~eAv~kY~~AI~l~-P---~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~ysk 150 (304)
T KOG0553|consen 95 KDYQEAVDKYTEAIELD-P---TNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSK 150 (304)
T ss_pred hhHHHHHHHHHHHHhcC-C---CcchHHHHHHHHHHHhcchHHHHHHHHHHHhcChHHHH
Confidence 45667777777777776 5 33444444799999999999999999999999997543
No 118
>KOG3081|consensus
Probab=91.94 E-value=0.77 Score=38.57 Aligned_cols=68 Identities=13% Similarity=0.150 Sum_probs=55.7
Q ss_pred HHHHHhcc-CChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 6 YSFCLVRS-EFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 6 YAwaLv~S-~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
+|-+.|+. .....++.+-.+++++.... | ....-+--.|+.|.-+++|++|...++..|..+|.++.+
T Consensus 175 LA~awv~la~ggek~qdAfyifeE~s~k~-~---~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpet 243 (299)
T KOG3081|consen 175 LAQAWVKLATGGEKIQDAFYIFEELSEKT-P---PTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPET 243 (299)
T ss_pred HHHHHHHHhccchhhhhHHHHHHHHhccc-C---CChHHHccHHHHHHHhcCHHHHHHHHHHHHhccCCCHHH
Confidence 45555554 33455889999999998876 3 478899999999999999999999999999999999543
No 119
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=91.87 E-value=2.5 Score=35.60 Aligned_cols=50 Identities=8% Similarity=-0.000 Sum_probs=38.4
Q ss_pred HHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC
Q psy16295 20 RKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE 73 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~ 73 (128)
..+.+.++.+.+.+ | +..+.++.++..+.+.||+++|++..+.+++..+.
T Consensus 170 ~~Al~~l~~l~~~~-P---~~~~~l~ll~~~~~~~~d~~~a~~~l~~l~k~~~~ 219 (409)
T TIGR00540 170 HAARHGVDKLLEMA-P---RHKEVLKLAEEAYIRSGAWQALDDIIDNMAKAGLF 219 (409)
T ss_pred HHHHHHHHHHHHhC-C---CCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHcCCC
Confidence 35677777777777 6 56678888888888888888888888888876433
No 120
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=91.76 E-value=1.2 Score=32.16 Aligned_cols=63 Identities=17% Similarity=0.166 Sum_probs=47.7
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
.+++|.+|-.-. ...+++.+|++.+.+. |+.+....-.+++|++.+.+|++++|++-+-..|.
T Consensus 41 ~i~lastlr~LG---~~deA~~~L~~~~~~~-p~~~~~~~l~~f~Al~L~~~gr~~eAl~~~l~~la 103 (120)
T PF12688_consen 41 LIQLASTLRNLG---RYDEALALLEEALEEF-PDDELNAALRVFLALALYNLGRPKEALEWLLEALA 103 (120)
T ss_pred HHHHHHHHHHcC---CHHHHHHHHHHHHHHC-CCccccHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 345666665443 4558999999999885 52234677778999999999999999998877665
No 121
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=91.68 E-value=1.7 Score=30.86 Aligned_cols=39 Identities=21% Similarity=0.237 Sum_probs=22.9
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.+++--++-.+...|+|++|.+.++.++..+|-|.++-.
T Consensus 62 ~~~~~~l~~~~~~~~~~~~a~~~~~~~l~~dP~~E~~~~ 100 (146)
T PF03704_consen 62 LDALERLAEALLEAGDYEEALRLLQRALALDPYDEEAYR 100 (146)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-HHHHH
T ss_pred HHHHHHHHHHHHhccCHHHHHHHHHHHHhcCCCCHHHHH
Confidence 455555566666666666666666666666666665543
No 122
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=91.55 E-value=0.87 Score=37.62 Aligned_cols=79 Identities=10% Similarity=0.101 Sum_probs=63.6
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
|=+..++-. ..+...+...+....++- |++..-.|-|+=|++.-.++|+-++|++.++.+.+--|+.+.|+.=+..
T Consensus 182 yWLGe~~y~---qg~y~~Aa~~f~~~~k~~-P~s~KApdallKlg~~~~~l~~~d~A~atl~qv~k~YP~t~aA~~Ak~~ 257 (262)
T COG1729 182 YWLGESLYA---QGDYEDAAYIFARVVKDY-PKSPKAPDALLKLGVSLGRLGNTDEACATLQQVIKRYPGTDAAKLAKVA 257 (262)
T ss_pred HHHHHHHHh---cccchHHHHHHHHHHHhC-CCCCCChHHHHHHHHHHHHhcCHHHHHHHHHHHHHHCCCCHHHHHHHHH
Confidence 334444443 345567888898888875 6545778999999999999999999999999999999999999887766
Q ss_pred HHH
Q psy16295 84 VKK 86 (128)
Q Consensus 84 Ie~ 86 (128)
++.
T Consensus 258 ~~~ 260 (262)
T COG1729 258 LKA 260 (262)
T ss_pred Hhc
Confidence 643
No 123
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=91.41 E-value=0.61 Score=33.15 Aligned_cols=50 Identities=12% Similarity=0.132 Sum_probs=40.7
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHH
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFL 68 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL 68 (128)
...+..+++.+++.++..+ | ..-+.+..+--++++.|++.+|++.++.+-
T Consensus 74 ~~~~~~~a~~~~~~~l~~d-P---~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~~ 123 (146)
T PF03704_consen 74 EAGDYEEALRLLQRALALD-P---YDEEAYRLLMRALAAQGRRAEALRVYERYR 123 (146)
T ss_dssp HTT-HHHHHHHHHHHHHHS-T---T-HHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred hccCHHHHHHHHHHHHhcC-C---CCHHHHHHHHHHHHHCcCHHHHHHHHHHHH
Confidence 4567889999999999999 7 678888888999999999999999976653
No 124
>PRK15331 chaperone protein SicA; Provisional
Probab=91.38 E-value=1.3 Score=34.30 Aligned_cols=64 Identities=11% Similarity=0.079 Sum_probs=35.0
Q ss_pred HHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 5 NYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 5 ~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.||..+-.+. +..++..+++-|+.-+ | ..+||++-||..+=.+++|++|...+....-++|+|+
T Consensus 42 ~~Ay~~y~~G---k~~eA~~~F~~L~~~d-~---~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~~~dp 105 (165)
T PRK15331 42 AHAYEFYNQG---RLDEAETFFRFLCIYD-F---YNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDY 105 (165)
T ss_pred HHHHHHHHCC---CHHHHHHHHHHHHHhC-c---CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccCCC
Confidence 4444444443 2334555555555555 3 4466666666666666666666666655555555543
No 125
>PRK10941 hypothetical protein; Provisional
Probab=91.32 E-value=1.7 Score=35.73 Aligned_cols=74 Identities=11% Similarity=0.022 Sum_probs=54.6
Q ss_pred HHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy16295 7 SFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKK 86 (128)
Q Consensus 7 AwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~ 86 (128)
=+++++..+-..--+.++++-.+.-++ | ...|| -++.+++++.+..|+.=.+..++.-|+++.+..+|..|+.
T Consensus 188 K~~~~~~~~~~~AL~~~e~ll~l~P~d-p--~e~RD----RGll~~qL~c~~~A~~DL~~fl~~~P~dp~a~~ik~ql~~ 260 (269)
T PRK10941 188 KAALMEEKQMELALRASEALLQFDPED-P--YEIRD----RGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRAQIHS 260 (269)
T ss_pred HHHHHHcCcHHHHHHHHHHHHHhCCCC-H--HHHHH----HHHHHHHcCCcHHHHHHHHHHHHhCCCchhHHHHHHHHHH
Confidence 345555555554444445544444444 4 46677 4788999999999999999999999999999999998875
Q ss_pred H
Q psy16295 87 R 87 (128)
Q Consensus 87 ~ 87 (128)
-
T Consensus 261 l 261 (269)
T PRK10941 261 I 261 (269)
T ss_pred H
Confidence 3
No 126
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=91.26 E-value=0.45 Score=39.52 Aligned_cols=72 Identities=14% Similarity=0.159 Sum_probs=53.2
Q ss_pred HHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 6 YSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 6 YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
||..+..|.|. .-..+++.+.+.-- | +--+-|+=++.-+-|-|+.+.|-+.++..|+++|++++..+||-.+
T Consensus 1 ~a~~~~~~~D~---~aaaely~qal~la-p---~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~gaa~kLa~ 72 (287)
T COG4976 1 YAYMLAESGDA---EAAAELYNQALELA-P---EWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHGGAALKLAV 72 (287)
T ss_pred CcchhcccCCh---HHHHHHHHHHhhcC-c---hhhhhhhhcchhhhhcccHHHHHHHHHHHHcCCcccccchhhhHHh
Confidence 34555555544 35667777777666 5 4556666666666777777789999999999999999999998654
No 127
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=91.08 E-value=0.7 Score=26.97 Aligned_cols=41 Identities=12% Similarity=-0.011 Sum_probs=34.1
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHH
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAI 49 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAv 49 (128)
+.+.+|.++.++.+.+ +++.+++..++.+ | +..+.++.||.
T Consensus 3 ~~~~la~~~~~~G~~~---~A~~~~~~~l~~~-P---~~~~a~~~La~ 43 (44)
T PF13428_consen 3 AWLALARAYRRLGQPD---EAERLLRRALALD-P---DDPEAWRALAQ 43 (44)
T ss_pred HHHHHHHHHHHcCCHH---HHHHHHHHHHHHC-c---CCHHHHHHhhh
Confidence 3578899999988766 8999999999999 7 57888887763
No 128
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=91.07 E-value=1.7 Score=35.66 Aligned_cols=73 Identities=15% Similarity=0.211 Sum_probs=59.6
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhch-HHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEY-TKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY-~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
...|+|.++-. +..++.++|++-+..+ | ...|.+.-+++...-+|+- +.+.+|..++-+..|+++-...+.+
T Consensus 205 ng~A~~~l~~~---~~~eAe~~L~~al~~~-~---~~~d~LaNliv~~~~~gk~~~~~~~~l~qL~~~~p~h~~~~~~~~ 277 (290)
T PF04733_consen 205 NGLAVCHLQLG---HYEEAEELLEEALEKD-P---NDPDTLANLIVCSLHLGKPTEAAERYLSQLKQSNPNHPLVKDLAE 277 (290)
T ss_dssp HHHHHHHHHCT----HHHHHHHHHHHCCC--C---CHHHHHHHHHHHHHHTT-TCHHHHHHHHHCHHHTTTSHHHHHHHH
T ss_pred HHHHHHHHHhC---CHHHHHHHHHHHHHhc-c---CCHHHHHHHHHHHHHhCCChhHHHHHHHHHHHhCCCChHHHHHHH
Confidence 45677777765 4557888999988887 5 6899999999999999988 7899999999999999999998875
Q ss_pred H
Q psy16295 83 I 83 (128)
Q Consensus 83 ~ 83 (128)
.
T Consensus 278 ~ 278 (290)
T PF04733_consen 278 K 278 (290)
T ss_dssp H
T ss_pred H
Confidence 4
No 129
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=90.90 E-value=0.95 Score=37.23 Aligned_cols=68 Identities=13% Similarity=0.189 Sum_probs=54.1
Q ss_pred HHHHHHhccCChhhHHHHHHHHHHHhhc-cCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 5 NYSFCLVRSEFCADIRKGILLLEELFKT-ETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 5 ~YAwaLv~S~~~~d~~~GI~LL~~l~~~-~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
||.|=|..-..++ ++...|+.-+.+ .++ .+.|-+=-+.++.+|.|+.+.|+.+.++.|+++|+|+-+.
T Consensus 108 NYG~FLC~qg~~~---eA~q~F~~Al~~P~Y~---~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~ 176 (250)
T COG3063 108 NYGAFLCAQGRPE---EAMQQFERALADPAYG---EPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPAL 176 (250)
T ss_pred hhhHHHHhCCChH---HHHHHHHHHHhCCCCC---CcchhhhhhHHHHhhcCCchhHHHHHHHHHHhCcCCChHH
Confidence 6778777666443 677778776666 444 6788888899999999999999999999999999988653
No 130
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=90.39 E-value=0.79 Score=40.91 Aligned_cols=57 Identities=18% Similarity=0.060 Sum_probs=48.8
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
..+.+++++.|++.-+.- . ++.-.+=..|--+.++|++++|...|..||+..|+|..
T Consensus 17 ~g~~~~AL~~L~~~~~~I-~---Dk~~~~E~rA~ll~kLg~~~eA~~~y~~Li~rNPdn~~ 73 (517)
T PF12569_consen 17 AGDYEEALEHLEKNEKQI-L---DKLAVLEKRAELLLKLGRKEEAEKIYRELIDRNPDNYD 73 (517)
T ss_pred CCCHHHHHHHHHhhhhhC-C---CHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCcHH
Confidence 356779999998866555 3 57788888899999999999999999999999999984
No 131
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=89.92 E-value=0.99 Score=37.14 Aligned_cols=47 Identities=13% Similarity=0.092 Sum_probs=33.4
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
...+..-.||++|...||+..|+.-++..|++||++.=+...+..+-
T Consensus 33 ~aa~arlqLal~YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Y 79 (250)
T COG3063 33 EAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYY 79 (250)
T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHH
Confidence 45666667777777777777777777777777777777766666654
No 132
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=89.66 E-value=0.43 Score=25.31 Aligned_cols=23 Identities=13% Similarity=-0.068 Sum_probs=20.9
Q ss_pred HHHHHHHHHhhhhchHHHHHHHH
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCR 65 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~ 65 (128)
..+.||..+..+||+++|++.++
T Consensus 3 a~~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 3 ARLALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHh
Confidence 57899999999999999998876
No 133
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=89.14 E-value=3.5 Score=36.67 Aligned_cols=29 Identities=28% Similarity=0.305 Sum_probs=22.2
Q ss_pred HHHHHhccCChhhHHHHHHHHHHHhhccCc
Q psy16295 6 YSFCLVRSEFCADIRKGILLLEELFKTETE 35 (128)
Q Consensus 6 YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~ 35 (128)
-|+.+++..++++..+++++|++.++.+ |
T Consensus 345 rg~~~~~~~~~~~~~~A~~lle~Ai~ld-P 373 (517)
T PRK10153 345 QAHHYLNSGDAKSLNKASDLLEEILKSE-P 373 (517)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHHHHhC-C
Confidence 4566677777888888888888888877 6
No 134
>KOG1840|consensus
Probab=88.64 E-value=0.92 Score=40.58 Aligned_cols=49 Identities=20% Similarity=0.327 Sum_probs=39.8
Q ss_pred HHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcC
Q psy16295 20 RKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIE 71 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~e 71 (128)
.++..++++.+..+.| ..---+.=||+.|++.|+|.+|..||+..+++-
T Consensus 265 ~~AL~i~e~~~G~~h~---~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~ 313 (508)
T KOG1840|consen 265 EEALTIREEVFGEDHP---AVAATLNNLAVLYYKQGKFAEAEEYCERALEIY 313 (508)
T ss_pred HHHHHHHHHhcCCCCH---HHHHHHHHHHHHHhccCChHHHHHHHHHHHHHH
Confidence 4566666666666633 677888889999999999999999999998764
No 135
>KOG1174|consensus
Probab=88.54 E-value=1.8 Score=38.82 Aligned_cols=65 Identities=17% Similarity=0.092 Sum_probs=49.2
Q ss_pred HHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 20 RKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
..||.|||.-+.+- || -...-.|+--..-..+|.+|..++...|+++|+|.-+..=.+..|+.+.
T Consensus 455 ~D~i~LLe~~L~~~-~D----~~LH~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP~~~~sl~Gl~~lEK~~~ 519 (564)
T KOG1174|consen 455 KDIIKLLEKHLIIF-PD----VNLHNHLGDIMRAQNEPQKAMEYYYKALRQDPKSKRTLRGLRLLEKSDD 519 (564)
T ss_pred chHHHHHHHHHhhc-cc----cHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCccchHHHHHHHHHHhccC
Confidence 45888888888776 32 2334455656666788999999999999999999988877777776654
No 136
>KOG2581|consensus
Probab=88.43 E-value=1.6 Score=38.80 Aligned_cols=48 Identities=25% Similarity=0.310 Sum_probs=41.3
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKR 87 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~ 87 (128)
..-.|+||++.-..=+.||+.|++|.-+.++.-|+ .-|.--++.+.+-
T Consensus 245 e~ARY~yY~GrIkaiqldYssA~~~~~qa~rkapq-~~alGf~q~v~k~ 292 (493)
T KOG2581|consen 245 EWARYLYYLGRIKAIQLDYSSALEYFLQALRKAPQ-HAALGFRQQVNKL 292 (493)
T ss_pred HHHHHHHHHhhHHHhhcchhHHHHHHHHHHHhCcc-hhhhhHHHHHHHH
Confidence 55789999999999999999999999999999998 6677666666543
No 137
>COG3763 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=88.27 E-value=0.64 Score=31.49 Aligned_cols=26 Identities=23% Similarity=0.193 Sum_probs=17.7
Q ss_pred HHHHHHHHH-HHHH-hhHHHhhhhhhcc
Q psy16295 95 GMALAGGAV-LAQG-QGTMEMSCKSYKN 120 (128)
Q Consensus 95 G~~i~gga~-~~~g-~~~~~~~~~~~~~ 120 (128)
+++|++.++ +++| ++|+++|||-++.
T Consensus 4 ~lail~ivl~ll~G~~~G~fiark~~~k 31 (71)
T COG3763 4 WLAILLIVLALLAGLIGGFFIARKQMKK 31 (71)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345555444 6666 5699999998864
No 138
>KOG4162|consensus
Probab=88.11 E-value=2 Score=40.42 Aligned_cols=66 Identities=27% Similarity=0.383 Sum_probs=54.1
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC-HHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN-SQVQHLESIVK 85 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N-~QA~~Lk~~Ie 85 (128)
..-.++++..|++-+.-+ | ...+.+||+|+=|..+++-+.|+.++...|.+.|.. .-+..|..++-
T Consensus 457 ~~~h~kslqale~av~~d-~---~dp~~if~lalq~A~~R~l~sAl~~~~eaL~l~~~~~~~~whLLALvl 523 (799)
T KOG4162|consen 457 DALHKKSLQALEEAVQFD-P---TDPLVIFYLALQYAEQRQLTSALDYAREALALNRGDSAKAWHLLALVL 523 (799)
T ss_pred HHHHHHHHHHHHHHHhcC-C---CCchHHHHHHHHHHHHHhHHHHHHHHHHHHHhcCCccHHHHHHHHHHH
Confidence 345678999999999998 5 567999999999999999999999999999996554 44555555554
No 139
>KOG3785|consensus
Probab=88.05 E-value=1.2 Score=39.33 Aligned_cols=54 Identities=15% Similarity=0.236 Sum_probs=41.5
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCC
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIES 72 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP 72 (128)
.+|..-+|.+|+--...+. +.-.+.=-++|..+|.+|||++|+.-++.+.+.+-
T Consensus 35 ~rDytGAislLefk~~~~~---EEE~~~~lWia~C~fhLgdY~~Al~~Y~~~~~~~~ 88 (557)
T KOG3785|consen 35 NRDYTGAISLLEFKLNLDR---EEEDSLQLWIAHCYFHLGDYEEALNVYTFLMNKDD 88 (557)
T ss_pred cccchhHHHHHHHhhccch---hhhHHHHHHHHHHHHhhccHHHHHHHHHHHhccCC
Confidence 4677789999987665552 33345567899999999999999999888877553
No 140
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=87.51 E-value=3.8 Score=33.92 Aligned_cols=80 Identities=9% Similarity=0.070 Sum_probs=59.6
Q ss_pred HHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH-HHHHHHH
Q psy16295 8 FCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH-LESIVKK 86 (128)
Q Consensus 8 waLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~-Lk~~Ie~ 86 (128)
-.|+++. .++++|.+.++-+|.. | ...+|..+|=--++-.|+|++|..-++.+=+++|+..|..+ -+.+|..
T Consensus 9 seLL~~~---sL~dai~~a~~qVkak-P---tda~~RhflfqLlcvaGdw~kAl~Ql~l~a~l~p~~t~~a~lyr~lir~ 81 (273)
T COG4455 9 SELLDDN---SLQDAIGLARDQVKAK-P---TDAGGRHFLFQLLCVAGDWEKALAQLNLAATLSPQDTVGASLYRHLIRC 81 (273)
T ss_pred HHHHHhc---cHHHHHHHHHHHHhcC-C---ccccchhHHHHHHhhcchHHHHHHHHHHHhhcCcccchHHHHHHHHHHH
Confidence 3566655 3568999999999998 6 34444444444467789999999999999999999876554 5778887
Q ss_pred HHhhhHHH
Q psy16295 87 RMETEGLK 94 (128)
Q Consensus 87 ~i~kdGli 94 (128)
.+.|+-+.
T Consensus 82 ea~R~evf 89 (273)
T COG4455 82 EAARNEVF 89 (273)
T ss_pred HHHHHHHh
Confidence 77776544
No 141
>PRK00523 hypothetical protein; Provisional
Probab=87.42 E-value=0.78 Score=31.14 Aligned_cols=28 Identities=32% Similarity=0.268 Sum_probs=17.7
Q ss_pred HHHHHHHHHH-HHHH-hhHHHhhhhhhcch
Q psy16295 94 KGMALAGGAV-LAQG-QGTMEMSCKSYKNT 121 (128)
Q Consensus 94 iG~~i~gga~-~~~g-~~~~~~~~~~~~~~ 121 (128)
+|++|.-+++ +++| ++|++++||.|++-
T Consensus 4 ~~l~I~l~i~~li~G~~~Gffiark~~~k~ 33 (72)
T PRK00523 4 IGLALGLGIPLLIVGGIIGYFVSKKMFKKQ 33 (72)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4444443333 4444 67999999999864
No 142
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=87.34 E-value=4.8 Score=34.82 Aligned_cols=80 Identities=19% Similarity=0.272 Sum_probs=54.4
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhh---------hhchHHHHHHHHHHHhcCC
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNAR---------IKEYTKSLKYCRAFLAIES 72 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yk---------lgdY~~A~~~~~~lL~~eP 72 (128)
.+|.||.||.|-+.+.|..+++.++.+++....+ ...|-+-.++=.|.. ....++|...+....+++|
T Consensus 181 i~~~yafALnRrn~~gdre~Al~il~~~l~~~~~---~~~d~~gL~GRIyKD~~~~s~~~d~~~ldkAi~~Y~kgFe~~~ 257 (374)
T PF13281_consen 181 IKFQYAFALNRRNKPGDREKALQILLPVLESDEN---PDPDTLGLLGRIYKDLFLESNFTDRESLDKAIEWYRKGFEIEP 257 (374)
T ss_pred HHHHHHHHHhhcccCCCHHHHHHHHHHHHhccCC---CChHHHHHHHHHHHHHHHHcCccchHHHHHHHHHHHHHHcCCc
Confidence 5899999999988889999999999995555422 455644444433322 2335677777777777777
Q ss_pred CCHHHHHHHHHH
Q psy16295 73 ENSQVQHLESIV 84 (128)
Q Consensus 73 ~N~QA~~Lk~~I 84 (128)
+--.+..+-.+.
T Consensus 258 ~~Y~GIN~AtLL 269 (374)
T PF13281_consen 258 DYYSGINAATLL 269 (374)
T ss_pred cccchHHHHHHH
Confidence 665555554443
No 143
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=87.25 E-value=7.3 Score=29.35 Aligned_cols=64 Identities=8% Similarity=-0.026 Sum_probs=55.0
Q ss_pred ccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 12 RSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 12 ~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
...|++....-++..++.++.. | ..+.+.-++.....+||-++|++..++....-|.+.-.+.+
T Consensus 120 ~~~~~~~l~~~~~~a~~~l~~~-P----~~~~~~~~a~~l~~~G~~~eA~~~~~~~~~lyP~~~~~~~~ 183 (193)
T PF11846_consen 120 LPPDPEMLEAYIEWAERLLRRR-P----DPNVYQRYALALALLGDPEEARQWLARARRLYPADEFAAAQ 183 (193)
T ss_pred CCCCHHHHHHHHHHHHHHHHhC-C----CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCcHHHHHHH
Confidence 6788888999999999999988 6 45666667999999999999999999999999976666555
No 144
>KOG0548|consensus
Probab=87.22 E-value=2.1 Score=38.69 Aligned_cols=45 Identities=9% Similarity=-0.039 Sum_probs=37.9
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
=.+-+.=++.++.-+|+|++|+..+..-|++||+|.|..+=+.-.
T Consensus 69 w~kgy~r~Gaa~~~lg~~~eA~~ay~~GL~~d~~n~~L~~gl~~a 113 (539)
T KOG0548|consen 69 WAKGYSRKGAALFGLGDYEEAILAYSEGLEKDPSNKQLKTGLAQA 113 (539)
T ss_pred hhhHHHHhHHHHHhcccHHHHHHHHHHHhhcCCchHHHHHhHHHh
Confidence 456677889999999999999999999999999999976544333
No 145
>PRK14574 hmsH outer membrane protein; Provisional
Probab=86.92 E-value=2.5 Score=39.78 Aligned_cols=60 Identities=15% Similarity=0.046 Sum_probs=47.3
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
-+++.++++.....+ | +..+..+-.+..+-.++|+.+|....+.+++..|+|..++.|+.
T Consensus 466 p~~A~~~~k~a~~l~-P---~~~~~~~~~~~~al~l~e~~~A~~~~~~l~~~~Pe~~~~~~l~r 525 (822)
T PRK14574 466 PRKAEQELKAVESLA-P---RSLILERAQAETAMALQEWHQMELLTDDVISRSPEDIPSQELDR 525 (822)
T ss_pred HHHHHHHHHHHhhhC-C---ccHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhCCCchhHHHHHH
Confidence 345555555544444 4 56788888889999999999999999999999999999999886
No 146
>KOG1128|consensus
Probab=86.91 E-value=1.8 Score=40.62 Aligned_cols=58 Identities=12% Similarity=0.167 Sum_probs=54.2
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
..|..+....|+.-++-+ | .+-+-||.+.-+..+++++..|.++....+..||+|..+
T Consensus 498 ~~~fs~~~~hle~sl~~n-p---lq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~ea 555 (777)
T KOG1128|consen 498 NKDFSEADKHLERSLEIN-P---LQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEA 555 (777)
T ss_pred chhHHHHHHHHHHHhhcC-c---cchhHHHhccHHHHHHhhhHHHHHHHHHHhhcCCCchhh
Confidence 467889999999999999 6 899999999999999999999999999999999999876
No 147
>KOG1155|consensus
Probab=86.91 E-value=3.3 Score=37.35 Aligned_cols=71 Identities=17% Similarity=0.233 Sum_probs=56.7
Q ss_pred hhHHHHHHHHHHHhhcc-----CccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 17 ADIRKGILLLEELFKTE-----TEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~-----~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
+|..++....+..+... .. +..+...-+||.-..|.+||++|--|+..+|.-+|+...|++|.+-|.+.++
T Consensus 480 ~d~~eAa~~yek~v~~~~~eg~~~--~~t~ka~~fLA~~f~k~~~~~~As~Ya~~~~~~~~e~eeak~LlReir~~~~ 555 (559)
T KOG1155|consen 480 KDLNEAAQYYEKYVEVSELEGEID--DETIKARLFLAEYFKKMKDFDEASYYATLVLKGETECEEAKALLREIRKIQA 555 (559)
T ss_pred HhHHHHHHHHHHHHHHHHhhcccc--hHHHHHHHHHHHHHHhhcchHHHHHHHHHHhcCCchHHHHHHHHHHHHHhcC
Confidence 45566766666655521 11 2467777779999999999999999999999999999999999999987654
No 148
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=86.46 E-value=5.1 Score=33.14 Aligned_cols=65 Identities=9% Similarity=0.111 Sum_probs=52.1
Q ss_pred HHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 22 GILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 22 GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
+...+..+.... | ..++.+--++..+++.|+|.+|...+..+.+..|+|.-+..++-.+=++.-+
T Consensus 85 ~l~~~~~~~~~~-~---~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr 149 (257)
T COG5010 85 SLAVLQKSAIAY-P---KDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGR 149 (257)
T ss_pred hHHHHhhhhccC-c---ccHHHHHHHHHHHHHhcchHHHHHHHHHHhccCCCChhhhhHHHHHHHHccC
Confidence 334444444444 3 4677777789999999999999999999999999999999999999777644
No 149
>KOG1173|consensus
Probab=86.18 E-value=4.6 Score=37.00 Aligned_cols=71 Identities=10% Similarity=0.079 Sum_probs=57.3
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH-HHHHHHHHHHHHhhhH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ-VQHLESIVKKRMETEG 92 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q-A~~Lk~~Ie~~i~kdG 92 (128)
...++|.-++.-+.-. | ...+..--+|+.|..+|+.+.|..+....|-+.|+|.- ..-|+..|++-..+.|
T Consensus 470 ~~~eAI~~~q~aL~l~-~---k~~~~~asig~iy~llgnld~Aid~fhKaL~l~p~n~~~~~lL~~aie~~~~~~~ 541 (611)
T KOG1173|consen 470 KYEEAIDYYQKALLLS-P---KDASTHASIGYIYHLLGNLDKAIDHFHKALALKPDNIFISELLKLAIEDSECKSG 541 (611)
T ss_pred hHHHHHHHHHHHHHcC-C---CchhHHHHHHHHHHHhcChHHHHHHHHHHHhcCCccHHHHHHHHHHHHhhhhhcc
Confidence 4557888888877777 5 67888888999999999999999999999999999954 4556777777555544
No 150
>KOG4234|consensus
Probab=86.03 E-value=5 Score=33.03 Aligned_cols=37 Identities=11% Similarity=0.185 Sum_probs=32.7
Q ss_pred HHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 45 YYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 45 YyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
-=-|.+|-++-.|++|+.=+..+|+++|.++||+.=-
T Consensus 172 ~RRAeayek~ek~eealeDyKki~E~dPs~~ear~~i 208 (271)
T KOG4234|consen 172 ERRAEAYEKMEKYEEALEDYKKILESDPSRREAREAI 208 (271)
T ss_pred HHHHHHHHhhhhHHHHHHHHHHHHHhCcchHHHHHHH
Confidence 3448899999999999999999999999999997533
No 151
>KOG0624|consensus
Probab=86.02 E-value=5.9 Score=34.94 Aligned_cols=66 Identities=12% Similarity=0.083 Sum_probs=56.0
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
-.+.-++|.-..+.+..+ | ...+.|---|-+|.--.+|+.|+.-++..++.+|+|+|+++=++-.+
T Consensus 320 d~~~~eAiqqC~evL~~d-~---~dv~~l~dRAeA~l~dE~YD~AI~dye~A~e~n~sn~~~reGle~Ak 385 (504)
T KOG0624|consen 320 DEQFGEAIQQCKEVLDID-P---DDVQVLCDRAEAYLGDEMYDDAIHDYEKALELNESNTRAREGLERAK 385 (504)
T ss_pred cCCHHHHHHHHHHHHhcC-c---hHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCcccHHHHHHHHHHH
Confidence 345668999999999888 6 57888999999999999999999999999999999999876554443
No 152
>KOG4340|consensus
Probab=85.19 E-value=2.1 Score=37.08 Aligned_cols=61 Identities=21% Similarity=0.236 Sum_probs=50.5
Q ss_pred HHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 9 CLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 9 aLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
.|||..- ...+|++|..-.+.+ | +.|--|..|+-+||+..+|.+|-.+++++=+..|+-.|
T Consensus 19 ~lI~d~r---y~DaI~~l~s~~Er~-p---~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~q 79 (459)
T KOG4340|consen 19 RLIRDAR---YADAIQLLGSELERS-P---RSRAGLSLLGYCYYRLQEFALAAECYEQLGQLHPELEQ 79 (459)
T ss_pred HHHHHhh---HHHHHHHHHHHHhcC-c---cchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhChHHHH
Confidence 3555432 335788888877777 5 78999999999999999999999999999999998776
No 153
>KOG1125|consensus
Probab=85.11 E-value=2.5 Score=38.57 Aligned_cols=58 Identities=12% Similarity=0.070 Sum_probs=46.2
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
..+..+--+++-+..+.. |. ....|.---|+|-||-.|+|++|..+.+.+|+.+|+|-
T Consensus 407 ~~~l~~i~~~fLeaa~~~-~~-~~DpdvQ~~LGVLy~ls~efdraiDcf~~AL~v~Pnd~ 464 (579)
T KOG1125|consen 407 SSHLAHIQELFLEAARQL-PT-KIDPDVQSGLGVLYNLSGEFDRAVDCFEAALQVKPNDY 464 (579)
T ss_pred HHHHHHHHHHHHHHHHhC-CC-CCChhHHhhhHHHHhcchHHHHHHHHHHHHHhcCCchH
Confidence 344455556677777776 41 46788888889999999999999999999999999974
No 154
>KOG3060|consensus
Probab=84.82 E-value=3.6 Score=34.47 Aligned_cols=51 Identities=18% Similarity=0.080 Sum_probs=42.1
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.+|+-|.+.++.= ....|.|-+||=-|.-.|+|++|--+.+.++=+.|.|+
T Consensus 138 ~aIk~ln~YL~~F----~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~ 188 (289)
T KOG3060|consen 138 EAIKELNEYLDKF----MNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNP 188 (289)
T ss_pred HHHHHHHHHHHHh----cCcHHHHHHHHHHHHhHhHHHHHHHHHHHHHHcCCCcH
Confidence 3555555555554 25689999999999999999999999999999999997
No 155
>KOG0624|consensus
Probab=84.60 E-value=3.7 Score=36.16 Aligned_cols=63 Identities=17% Similarity=0.220 Sum_probs=49.3
Q ss_pred HHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 7 SFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 7 AwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
|-|++.-. +.+++|.=|+..-+-. .++.|-+|-++--+|..||-+++++-+..+|+.+|+..-
T Consensus 196 akc~i~~~---e~k~AI~Dlk~askLs----~DnTe~~ykis~L~Y~vgd~~~sL~~iRECLKldpdHK~ 258 (504)
T KOG0624|consen 196 AKCYIAEG---EPKKAIHDLKQASKLS----QDNTEGHYKISQLLYTVGDAENSLKEIRECLKLDPDHKL 258 (504)
T ss_pred HHHHHhcC---cHHHHHHHHHHHHhcc----ccchHHHHHHHHHHHhhhhHHHHHHHHHHHHccCcchhh
Confidence 34444433 3446777776666555 368999999999999999999999999999999999753
No 156
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=84.38 E-value=6.3 Score=34.50 Aligned_cols=72 Identities=18% Similarity=0.071 Sum_probs=58.5
Q ss_pred HhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 10 LVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 10 Lv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
++-+=.+.|..+=|..+|.-++.+ | .+.+.++.|+.-++|.+.|.+|..+.+..|...|+..--..|-...+
T Consensus 301 ~~~~l~~~d~~~l~k~~e~~l~~h-~---~~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~s~~~~~~la~~~~ 372 (400)
T COG3071 301 LIPRLRPGDPEPLIKAAEKWLKQH-P---EDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASDYAELADALD 372 (400)
T ss_pred HHhhcCCCCchHHHHHHHHHHHhC-C---CChhHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCChhhHHHHHHHHH
Confidence 333444566666778888888888 5 45699999999999999999999999999999999888777766665
No 157
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=84.32 E-value=3.7 Score=36.54 Aligned_cols=39 Identities=10% Similarity=-0.058 Sum_probs=21.5
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
+.+.-+|+.+...|++++|..++++.++.+|+ -.+-.+.
T Consensus 421 ~~~~ala~~~~~~g~~~~A~~~l~rAl~L~ps-~~a~~~l 459 (517)
T PRK10153 421 RIYEILAVQALVKGKTDEAYQAINKAIDLEMS-WLNYVLL 459 (517)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHHHcCCC-HHHHHHH
Confidence 44444555555566666666666666666663 3343333
No 158
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=83.90 E-value=9.3 Score=33.06 Aligned_cols=78 Identities=19% Similarity=0.126 Sum_probs=60.9
Q ss_pred CChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhh---hhchHHHHHHHHH-HHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 14 EFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNAR---IKEYTKSLKYCRA-FLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 14 ~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yk---lgdY~~A~~~~~~-lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
++-+|...=|.|.+.+=.-...+-..+...-+++|++..| -||.++|++.+.. +...++.++....|.-.|-+.+-
T Consensus 152 RdiqdydamI~Lve~l~~~p~~~~~~~~~i~~~yafALnRrn~~gdre~Al~il~~~l~~~~~~~~d~~gL~GRIyKD~~ 231 (374)
T PF13281_consen 152 RDIQDYDAMIKLVETLEALPTCDVANQHNIKFQYAFALNRRNKPGDREKALQILLPVLESDENPDPDTLGLLGRIYKDLF 231 (374)
T ss_pred hhhhhHHHHHHHHHHhhccCccchhcchHHHHHHHHHHhhcccCCCHHHHHHHHHHHHhccCCCChHHHHHHHHHHHHHH
Confidence 5667777778887776555111113567788899999999 9999999999988 77888899999999988887775
Q ss_pred hh
Q psy16295 90 TE 91 (128)
Q Consensus 90 kd 91 (128)
.+
T Consensus 232 ~~ 233 (374)
T PF13281_consen 232 LE 233 (374)
T ss_pred HH
Confidence 54
No 159
>KOG2076|consensus
Probab=83.71 E-value=8.2 Score=36.95 Aligned_cols=55 Identities=16% Similarity=0.199 Sum_probs=41.6
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.+..++.++|.++++.+ | ....-+|+||..|=.+||-++|+.+|-..=...|.|.
T Consensus 153 g~~eeA~~i~~EvIkqd-p---~~~~ay~tL~~IyEqrGd~eK~l~~~llAAHL~p~d~ 207 (895)
T KOG2076|consen 153 GDLEEAEEILMEVIKQD-P---RNPIAYYTLGEIYEQRGDIEKALNFWLLAAHLNPKDY 207 (895)
T ss_pred CCHHHHHHHHHHHHHhC-c---cchhhHHHHHHHHHHcccHHHHHHHHHHHHhcCCCCh
Confidence 67788999999999998 6 6677777777777777777777766666666666665
No 160
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=83.64 E-value=11 Score=32.68 Aligned_cols=57 Identities=19% Similarity=0.231 Sum_probs=40.6
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
...+++++|++|.+.+ | +..+.||-.+...++..+|.+..+..|+..|.+.....+.
T Consensus 184 ~~~~ai~lle~L~~~~-p------ev~~~LA~v~l~~~~E~~AI~ll~~aL~~~p~d~~LL~~Q 240 (395)
T PF09295_consen 184 RYDEAIELLEKLRERD-P------EVAVLLARVYLLMNEEVEAIRLLNEALKENPQDSELLNLQ 240 (395)
T ss_pred cHHHHHHHHHHHHhcC-C------cHHHHHHHHHHhcCcHHHHHHHHHHHHHhCCCCHHHHHHH
Confidence 3667888888877776 5 2456677777777888888888888888888775444333
No 161
>KOG1129|consensus
Probab=83.51 E-value=2 Score=37.56 Aligned_cols=40 Identities=23% Similarity=0.390 Sum_probs=36.0
Q ss_pred HHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
-=|++|+.+||.+.|+..-..|.++|+.-|+++-+-.||-
T Consensus 6 ~~~~~a~s~~rrr~~~~ca~lc~~~l~~~~~~q~~W~LKm 45 (478)
T KOG1129|consen 6 LDYFRAVSLYRRRSYERCAELCNALLQAGHDMEGVWQLKM 45 (478)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhCCchhHHHHHHH
Confidence 3489999999999999999999999999999887777764
No 162
>KOG1174|consensus
Probab=83.32 E-value=12 Score=33.71 Aligned_cols=80 Identities=14% Similarity=0.157 Sum_probs=56.2
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh--hHHHHHHHHHHHHH-------HHHhh
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET--EGLKGMALAGGAVL-------AQGQG 109 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k--dGliG~~i~gga~~-------~~g~~ 109 (128)
..+.-|+--+--.|-.++|++|+.|.+...+.||.|.|+.-||-..=..+.| +..|.. =...-+ =-|++
T Consensus 298 ~ta~~wfV~~~~l~~~K~~~rAL~~~eK~I~~~~r~~~alilKG~lL~~~~R~~~A~IaF--R~Aq~Lap~rL~~Y~GL~ 375 (564)
T KOG1174|consen 298 YTASHWFVHAQLLYDEKKFERALNFVEKCIDSEPRNHEALILKGRLLIALERHTQAVIAF--RTAQMLAPYRLEIYRGLF 375 (564)
T ss_pred cchhhhhhhhhhhhhhhhHHHHHHHHHHHhccCcccchHHHhccHHHHhccchHHHHHHH--HHHHhcchhhHHHHHHHH
Confidence 4556666666667888999999999999999999999999998766555544 333321 110112 23667
Q ss_pred HHHhhhhhhcc
Q psy16295 110 TMEMSCKSYKN 120 (128)
Q Consensus 110 ~~~~~~~~~~~ 120 (128)
--+|++++||.
T Consensus 376 hsYLA~~~~kE 386 (564)
T KOG1174|consen 376 HSYLAQKRFKE 386 (564)
T ss_pred HHHHhhchHHH
Confidence 77888888875
No 163
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=83.20 E-value=11 Score=28.33 Aligned_cols=74 Identities=5% Similarity=0.092 Sum_probs=56.1
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhc---------------hHHHHHHHHH
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKE---------------YTKSLKYCRA 66 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgd---------------Y~~A~~~~~~ 66 (128)
++...+.+.-++.+-. .++.-++..++-+ |....-.-.+|..+++++++.+ ..+|++..+.
T Consensus 49 AqL~l~yayy~~~~y~---~A~a~~~rFirLh-P~hp~vdYa~Y~~gL~~~~~~~~~~~~~~~~drD~~~~~~A~~~f~~ 124 (142)
T PF13512_consen 49 AQLDLAYAYYKQGDYE---EAIAAYDRFIRLH-PTHPNVDYAYYMRGLSYYEQDEGSLQSFFRSDRDPTPARQAFRDFEQ 124 (142)
T ss_pred HHHHHHHHHHHccCHH---HHHHHHHHHHHhC-CCCCCccHHHHHHHHHHHHHhhhHHhhhcccccCcHHHHHHHHHHHH
Confidence 3556677777776554 6777777777776 5323555667888999999998 8999999999
Q ss_pred HHhcCCCCHHHHH
Q psy16295 67 FLAIESENSQVQH 79 (128)
Q Consensus 67 lL~~eP~N~QA~~ 79 (128)
+++.-|+..=|..
T Consensus 125 lv~~yP~S~ya~d 137 (142)
T PF13512_consen 125 LVRRYPNSEYAAD 137 (142)
T ss_pred HHHHCcCChhHHH
Confidence 9999999876654
No 164
>KOG3785|consensus
Probab=83.17 E-value=5.8 Score=35.20 Aligned_cols=76 Identities=12% Similarity=0.218 Sum_probs=60.0
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH-----HHHHHHhh
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES-----IVKKRMET 90 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~-----~Ie~~i~k 90 (128)
+.|.+++|++....+-++ | .---.=-|+|+.+||+-=|+-+-..++--|+--|+..=|+.|+. +|..+.+.
T Consensus 164 R~HYQeAIdvYkrvL~dn-~---ey~alNVy~ALCyyKlDYydvsqevl~vYL~q~pdStiA~NLkacn~fRl~ngr~ae 239 (557)
T KOG3785|consen 164 RMHYQEAIDVYKRVLQDN-P---EYIALNVYMALCYYKLDYYDVSQEVLKVYLRQFPDSTIAKNLKACNLFRLINGRTAE 239 (557)
T ss_pred HHHHHHHHHHHHHHHhcC-h---hhhhhHHHHHHHHHhcchhhhHHHHHHHHHHhCCCcHHHHHHHHHHHhhhhccchhH
Confidence 567889999999998887 4 22333458999999999999999999999999999999999885 44445544
Q ss_pred hHHHH
Q psy16295 91 EGLKG 95 (128)
Q Consensus 91 dGliG 95 (128)
|-..-
T Consensus 240 ~E~k~ 244 (557)
T KOG3785|consen 240 DEKKE 244 (557)
T ss_pred HHHHH
Confidence 44443
No 165
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=82.85 E-value=21 Score=28.52 Aligned_cols=79 Identities=13% Similarity=0.196 Sum_probs=56.5
Q ss_pred HHHHHHhccCChhhHHHHHHHHHHHhhccC--ccc-cchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCC---CCHHHH
Q psy16295 5 NYSFCLVRSEFCADIRKGILLLEELFKTET--EDI-DRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIES---ENSQVQ 78 (128)
Q Consensus 5 ~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~--~~~-~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP---~N~QA~ 78 (128)
+.|..+++.. +..++|++++++.+..- +-+ ..-++++.-..+.|.-.||+..|++.++...+.+| +-+..+
T Consensus 160 ~~A~l~~~l~---~y~~A~~~~e~~~~~~l~~~l~~~~~~~~~l~a~l~~L~~~D~v~A~~~~~~~~~~~~~F~~s~E~~ 236 (282)
T PF14938_consen 160 KAADLYARLG---RYEEAIEIYEEVAKKCLENNLLKYSAKEYFLKAILCHLAMGDYVAARKALERYCSQDPSFASSREYK 236 (282)
T ss_dssp HHHHHHHHTT----HHHHHHHHHHHHHTCCCHCTTGHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHGTTSTTSTTSHHHH
T ss_pred HHHHHHHHhC---CHHHHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCCCcHHHH
Confidence 3444455544 77899999999887531 100 13456667778899999999999999999999999 567777
Q ss_pred HHHHHHHH
Q psy16295 79 HLESIVKK 86 (128)
Q Consensus 79 ~Lk~~Ie~ 86 (128)
-|..+|+.
T Consensus 237 ~~~~l~~A 244 (282)
T PF14938_consen 237 FLEDLLEA 244 (282)
T ss_dssp HHHHHHHH
T ss_pred HHHHHHHH
Confidence 77777763
No 166
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=82.79 E-value=3.6 Score=33.21 Aligned_cols=68 Identities=9% Similarity=0.079 Sum_probs=46.2
Q ss_pred HHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 7 SFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 7 AwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
++|++=-....|...|..++|..++.- | ...++|....-=..+++|.++||...++.+..-|...+..
T Consensus 40 ~~A~~E~~~~~d~~~A~~Ife~glk~f-~---~~~~~~~~Y~~~l~~~~d~~~aR~lfer~i~~l~~~~~~~ 107 (280)
T PF05843_consen 40 AYALMEYYCNKDPKRARKIFERGLKKF-P---SDPDFWLEYLDFLIKLNDINNARALFERAISSLPKEKQSK 107 (280)
T ss_dssp HHHHHHHHTCS-HHHHHHHHHHHHHHH-T---T-HHHHHHHHHHHHHTT-HHHHHHHHHHHCCTSSCHHHCH
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHC-C---CCHHHHHHHHHHHHHhCcHHHHHHHHHHHHHhcCchhHHH
Confidence 455552222344557899999988886 4 4566666555555688999999999999999988888633
No 167
>KOG4626|consensus
Probab=82.62 E-value=4.1 Score=38.34 Aligned_cols=66 Identities=9% Similarity=0.020 Sum_probs=49.8
Q ss_pred HHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 5 NYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 5 ~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
|+|.|| ++..++.+|+.....-++-. | ...|.+|-|+..+-.+|..++|.+.+...|++-|+...+
T Consensus 325 NlanAL---kd~G~V~ea~~cYnkaL~l~-p---~hadam~NLgni~~E~~~~e~A~~ly~~al~v~p~~aaa 390 (966)
T KOG4626|consen 325 NLANAL---KDKGSVTEAVDCYNKALRLC-P---NHADAMNNLGNIYREQGKIEEATRLYLKALEVFPEFAAA 390 (966)
T ss_pred HHHHHH---HhccchHHHHHHHHHHHHhC-C---ccHHHHHHHHHHHHHhccchHHHHHHHHHHhhChhhhhh
Confidence 455555 35577788888888877777 5 578888888888888888888888888888888876544
No 168
>PRK01844 hypothetical protein; Provisional
Probab=82.58 E-value=1.8 Score=29.41 Aligned_cols=18 Identities=17% Similarity=-0.092 Sum_probs=13.4
Q ss_pred HHHHhhHHHhhhhhhcch
Q psy16295 104 LAQGQGTMEMSCKSYKNT 121 (128)
Q Consensus 104 ~~~g~~~~~~~~~~~~~~ 121 (128)
++.+++|++++||.|++-
T Consensus 15 i~G~~~Gff~ark~~~k~ 32 (72)
T PRK01844 15 VAGVALGFFIARKYMMNY 32 (72)
T ss_pred HHHHHHHHHHHHHHHHHH
Confidence 333467999999999863
No 169
>KOG0376|consensus
Probab=82.46 E-value=3.7 Score=36.70 Aligned_cols=71 Identities=7% Similarity=0.002 Sum_probs=57.0
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhhhHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMETEGL 93 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~kdGl 93 (128)
...++.=+...++.+ | ..-.|.|.-|.++.++++|.+|+.-.+....+.|+.++++..-...+..+++++.
T Consensus 54 ~~~Al~Da~kaie~d-P---~~~K~Y~rrg~a~m~l~~~~~A~~~l~~~~~l~Pnd~~~~r~~~Ec~~~vs~~~f 124 (476)
T KOG0376|consen 54 FGGALHDALKAIELD-P---TYIKAYVRRGTAVMALGEFKKALLDLEKVKKLAPNDPDATRKIDECNKIVSEEKF 124 (476)
T ss_pred hhhHHHHHHhhhhcC-c---hhhheeeeccHHHHhHHHHHHHHHHHHHhhhcCcCcHHHHHHHHHHHHHHHHHhh
Confidence 334444444455556 6 5677888889999999999999999999999999999999999988888877543
No 170
>KOG1126|consensus
Probab=82.30 E-value=6.3 Score=36.39 Aligned_cols=64 Identities=19% Similarity=0.191 Sum_probs=30.0
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRM 88 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i 88 (128)
+++.+++.-+.-+ | ...=|.|+-|.-.+-+++|.+|+.-.|.|=++-|++.-+-.|.-.|-+++
T Consensus 541 ~AL~~~~~A~~ld-~---kn~l~~~~~~~il~~~~~~~eal~~LEeLk~~vP~es~v~~llgki~k~~ 604 (638)
T KOG1126|consen 541 KALQLYEKAIHLD-P---KNPLCKYHRASILFSLGRYVEALQELEELKELVPQESSVFALLGKIYKRL 604 (638)
T ss_pred HHHHHHHHHHhcC-C---CCchhHHHHHHHHHhhcchHHHHHHHHHHHHhCcchHHHHHHHHHHHHHH
Confidence 4444444444444 2 33444444444444444444444444444444444444444444444443
No 171
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=82.05 E-value=7.8 Score=34.75 Aligned_cols=65 Identities=11% Similarity=0.058 Sum_probs=51.7
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKR 87 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~ 87 (128)
..++++.++..+.-. | .++=..+-+|-++.+.|++.+|.+.++..+.-+|+|+-.-.+..-..+.
T Consensus 356 ~~~A~e~~~kal~l~-P---~~~~l~~~~a~all~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~ 420 (484)
T COG4783 356 AKEAIERLKKALALD-P---NSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAE 420 (484)
T ss_pred hHHHHHHHHHHHhcC-C---CccHHHHHHHHHHHhcCChHHHHHHHHHHhhcCCCCchHHHHHHHHHHH
Confidence 346777777777776 5 4577888899999999999999999999999999999887776655443
No 172
>PRK15331 chaperone protein SicA; Provisional
Probab=81.96 E-value=9.9 Score=29.42 Aligned_cols=65 Identities=9% Similarity=0.083 Sum_probs=45.1
Q ss_pred HHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 6 YSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 6 YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
++++.++-- .++.+++|.+..--.--+ + ....-.||.|..+..+|+-++|+.+.+...+ .|++.+
T Consensus 75 ~GLaa~~Q~-~k~y~~Ai~~Y~~A~~l~-~---~dp~p~f~agqC~l~l~~~~~A~~~f~~a~~-~~~~~~ 139 (165)
T PRK15331 75 MGLAAVCQL-KKQFQKACDLYAVAFTLL-K---NDYRPVFFTGQCQLLMRKAAKARQCFELVNE-RTEDES 139 (165)
T ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHcc-c---CCCCccchHHHHHHHhCCHHHHHHHHHHHHh-CcchHH
Confidence 344444433 455667777766544333 2 2344589999999999999999999999998 466554
No 173
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=81.76 E-value=4.2 Score=24.70 Aligned_cols=30 Identities=17% Similarity=0.162 Sum_probs=25.4
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccC
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTET 34 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~ 34 (128)
++|++|.+++...+.+ .+.++|++++.++.
T Consensus 1 ~kLdLA~ayie~Gd~e---~Ar~lL~evl~~~~ 30 (44)
T TIGR03504 1 TKLDLARAYIEMGDLE---GARELLEEVIEEGD 30 (44)
T ss_pred CchHHHHHHHHcCChH---HHHHHHHHHHHcCC
Confidence 5899999999998665 78899999997763
No 174
>PF13830 DUF4192: Domain of unknown function (DUF4192)
Probab=81.37 E-value=11 Score=30.97 Aligned_cols=66 Identities=14% Similarity=0.006 Sum_probs=58.2
Q ss_pred HHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy16295 20 RKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKR 87 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~ 87 (128)
..+.+|+.++.+.-.+ ..+-+-+-.+|+.+|..|+=..|.-+.+..|+.+|+++=+.-|.+.++.-
T Consensus 255 ~~a~~lw~~~~r~~~~--~~ra~~l~l~a~~a~~~g~g~~A~~al~~a~~~~p~~~la~ll~~~l~~g 320 (324)
T PF13830_consen 255 QAAERLWRALARRLPG--PWRAAALALLAWAAWLRGDGALAGVALDRALEADPDHSLAQLLDQALQAG 320 (324)
T ss_pred hHHHHHHHHHHHhcCC--ccchHHHHHHHHHHHhcCCchHHHHHHHHHHhhCCCCcHHHHHHHHHHcC
Confidence 3578899999998633 57899999999999999999999999999999999999999888887653
No 175
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=81.13 E-value=7.7 Score=26.78 Aligned_cols=53 Identities=13% Similarity=0.186 Sum_probs=39.8
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHH
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFL 68 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL 68 (128)
+..+.+++|...+..++.. ++...+=..|=||+-+|...|+|.+.++|...=+
T Consensus 18 ~~~~~~~Al~~W~~aL~k~-~~~~~rf~~lG~l~qA~~e~Gkyr~~L~fA~~Q~ 70 (80)
T PF10579_consen 18 HQNETQQALQKWRKALEKI-TDREDRFRVLGYLIQAHMEWGKYREMLAFALQQL 70 (80)
T ss_pred ccchHHHHHHHHHHHHhhc-CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4567778999999988886 3213444556788999999999999999965433
No 176
>KOG2002|consensus
Probab=80.22 E-value=3.8 Score=39.51 Aligned_cols=68 Identities=12% Similarity=0.001 Sum_probs=45.2
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
++.|.+.++.+....++.+ | .-..|-.--++....++++-+.|+..|++.|+++|.|--+.-....++
T Consensus 176 nkkdY~~al~yyk~al~in-p--~~~aD~rIgig~Cf~kl~~~~~a~~a~~ralqLdp~~v~alv~L~~~~ 243 (1018)
T KOG2002|consen 176 NKKDYRGALKYYKKALRIN-P--ACKADVRIGIGHCFWKLGMSEKALLAFERALQLDPTCVSALVALGEVD 243 (1018)
T ss_pred ccccHHHHHHHHHHHHhcC-c--ccCCCccchhhhHHHhccchhhHHHHHHHHHhcChhhHHHHHHHHHHH
Confidence 3556666777777777766 5 355565555566666777888888888888888886665555555444
No 177
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=80.16 E-value=3.2 Score=35.94 Aligned_cols=54 Identities=15% Similarity=0.187 Sum_probs=37.5
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
++..+++++|..-+..+ | .....--.++=-+...|+|..|.+.|+.+++.+|+-
T Consensus 194 ~~~d~A~~~l~kAlqa~-~---~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~eQn~~y 247 (389)
T COG2956 194 SDVDRARELLKKALQAD-K---KCVRASIILGRVELAKGDYQKAVEALERVLEQNPEY 247 (389)
T ss_pred hhHHHHHHHHHHHHhhC-c---cceehhhhhhHHHHhccchHHHHHHHHHHHHhChHH
Confidence 34556777777777777 4 344455566666777788888888888888877764
No 178
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=78.82 E-value=5.2 Score=25.52 Aligned_cols=34 Identities=26% Similarity=0.378 Sum_probs=26.9
Q ss_pred hHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 57 YTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 57 Y~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
|.+|..++..+++.+|+..+...|++.+..-+.+
T Consensus 28 Y~~ai~~l~~~~~~~~~~~~~~~l~~k~~~yl~R 61 (69)
T PF04212_consen 28 YKEAIEYLMQALKSESNPERRQALRQKMKEYLER 61 (69)
T ss_dssp HHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHH
Confidence 5677778889999998777777788888776655
No 179
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=78.69 E-value=3.1 Score=36.56 Aligned_cols=36 Identities=11% Similarity=0.353 Sum_probs=28.7
Q ss_pred HHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 50 GNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 50 g~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
-.||.+||..|-.++++||++.|...+++.-++.+.
T Consensus 309 ~~~K~KNf~tAa~FArRLLel~p~~~~a~qArKil~ 344 (422)
T PF06957_consen 309 QAFKLKNFITAASFARRLLELNPSPEVAEQARKILQ 344 (422)
T ss_dssp HCCCTTBHHHHHHHHHHHHCT--SCHHHHHHHHHHH
T ss_pred HHHHhccHHHHHHHHHHHHHcCCCHHHHHHHHHHHH
Confidence 468999999999999999999999987666665554
No 180
>KOG0547|consensus
Probab=77.80 E-value=4.2 Score=37.01 Aligned_cols=56 Identities=11% Similarity=0.112 Sum_probs=42.2
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.++-.+++.-+..-.+-+ | ...|..|+-+--.+-+++|++|..=.+...+++|+|-
T Consensus 373 ~~~~~~~~~~F~~A~~ld-p---~n~dvYyHRgQm~flL~q~e~A~aDF~Kai~L~pe~~ 428 (606)
T KOG0547|consen 373 ENQSEKMWKDFNKAEDLD-P---ENPDVYYHRGQMRFLLQQYEEAIADFQKAISLDPENA 428 (606)
T ss_pred hhccHHHHHHHHHHHhcC-C---CCCchhHhHHHHHHHHHHHHHHHHHHHHHhhcChhhh
Confidence 344456666666666666 5 6788888888888888888888888888888888874
No 181
>PRK10941 hypothetical protein; Provisional
Probab=77.47 E-value=2.4 Score=34.84 Aligned_cols=36 Identities=22% Similarity=0.254 Sum_probs=31.0
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
+.-+--|=.+|.+.++|++|+++++.+|.+.|+++-
T Consensus 181 ~Rml~nLK~~~~~~~~~~~AL~~~e~ll~l~P~dp~ 216 (269)
T PRK10941 181 RKLLDTLKAALMEEKQMELALRASEALLQFDPEDPY 216 (269)
T ss_pred HHHHHHHHHHHHHcCcHHHHHHHHHHHHHhCCCCHH
Confidence 444556677889999999999999999999999984
No 182
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=77.28 E-value=11 Score=36.23 Aligned_cols=75 Identities=16% Similarity=0.126 Sum_probs=55.2
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESI 83 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~ 83 (128)
|.+|.|+=+-. +..+++...+++++-+ | ....++-++|-.+... |.++|+.+....++..=++.|-...++.
T Consensus 120 ~~LA~~Ydk~g---~~~ka~~~yer~L~~D-~---~n~~aLNn~AY~~ae~-dL~KA~~m~~KAV~~~i~~kq~~~~~e~ 191 (906)
T PRK14720 120 RTLAEAYAKLN---ENKKLKGVWERLVKAD-R---DNPEIVKKLATSYEEE-DKEKAITYLKKAIYRFIKKKQYVGIEEI 191 (906)
T ss_pred HHHHHHHHHcC---ChHHHHHHHHHHHhcC-c---ccHHHHHHHHHHHHHh-hHHHHHHHHHHHHHHHHhhhcchHHHHH
Confidence 55666665544 4558999999999999 6 6889999999888888 8888888866666666656665555554
Q ss_pred HHH
Q psy16295 84 VKK 86 (128)
Q Consensus 84 Ie~ 86 (128)
.++
T Consensus 192 W~k 194 (906)
T PRK14720 192 WSK 194 (906)
T ss_pred HHH
Confidence 443
No 183
>PF04211 MtrC: Tetrahydromethanopterin S-methyltransferase, subunit C ; InterPro: IPR005865 This model describes the N5-methyltetrahydromethanopterin: coenzyme M methyltransferase subunit C in methanogenic archaea. This methyltranferase is a membrane-associated enzyme complex that uses methyl-transfer reaction to drive a sodium-ion pump. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of a methyl group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.; GO: 0030269 tetrahydromethanopterin S-methyltransferase activity, 0015948 methanogenesis, 0016021 integral to membrane
Probab=76.06 E-value=9.9 Score=31.56 Aligned_cols=49 Identities=18% Similarity=0.081 Sum_probs=33.4
Q ss_pred CCCCHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhhHHHhhhhhhc
Q psy16295 71 ESENSQVQHLESIVKKRMETEGLKGMALAGGAVLAQGQGTMEMSCKSYK 119 (128)
Q Consensus 71 eP~N~QA~~Lk~~Ie~~i~kdGliG~~i~gga~~~~g~~~~~~~~~~~~ 119 (128)
-||-+|-|+|.-.++.---.==..|+.-.+-..+++|+++.+.+++.|-
T Consensus 194 GPnE~q~RTL~la~~~G~ls~ii~~i~s~~~~sivv~~i~W~~~y~~y~ 242 (262)
T PF04211_consen 194 GPNESQDRTLTLAVECGFLSMIIFGIVSLDLISIVVGAIGWYISYKKYV 242 (262)
T ss_pred CCCcchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6999999999999986443322222222222247889999999998873
No 184
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=75.97 E-value=6.8 Score=26.25 Aligned_cols=60 Identities=13% Similarity=0.175 Sum_probs=38.4
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
...|++++..-...+.. .+-. =|+ .-|.+|+.++-.+++.+|+...-..+++.|.+-+.|
T Consensus 3 ~~~a~~l~~~Ave~D~~--g~y~-----eAl-----~~Y~~aie~l~~~lk~e~d~~~k~~~r~ki~eY~~R 62 (77)
T cd02683 3 ELAAKEVLKRAVELDQE--GRFQ-----EAL-----VCYQEGIDLLMQVLKGTKDEAKKKNLRQKISEYMDR 62 (77)
T ss_pred hHHHHHHHHHHHHHHHh--ccHH-----HHH-----HHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence 34677777665555421 1111 122 235677777888999999988888888888766655
No 185
>PRK12793 flaF flagellar biosynthesis regulatory protein FlaF; Reviewed
Probab=75.72 E-value=16 Score=26.58 Aligned_cols=66 Identities=23% Similarity=0.256 Sum_probs=44.9
Q ss_pred ChHHHHHHHHhccC-ChhhHH-----HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh-----
Q psy16295 1 MTQFNYSFCLVRSE-FCADIR-----KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA----- 69 (128)
Q Consensus 1 ~~~F~YAwaLv~S~-~~~d~~-----~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~----- 69 (128)
|.+|.||...-.|. ++.+.. +++.+|++.-..+ + ..++. -+|+.+...+-.
T Consensus 1 my~~aYa~~~~~s~~~~R~~E~~~l~r~~~~L~~a~~~~-~---~~~~~--------------~eAL~~NrrLWt~~~~D 62 (115)
T PRK12793 1 MYQFSYAEVMEDSVASARERERQAFDRSIDLLEAARAKG-A---YSREA--------------IEALYFTRRLWTVLIED 62 (115)
T ss_pred CchHHHHHHHHHcccChHHHHHHHHHHHHHHHHHHHhcC-C---ChHHH--------------HHHHHHHHHHHHHHHHH
Confidence 88999999999888 887765 4556666554443 2 22222 266666555544
Q ss_pred -cCCCCHHHHHHHHHH
Q psy16295 70 -IESENSQVQHLESIV 84 (128)
Q Consensus 70 -~eP~N~QA~~Lk~~I 84 (128)
.+|+|+.-.+||.-|
T Consensus 63 l~~p~N~LP~eLRa~l 78 (115)
T PRK12793 63 LGSPENALPEELRADL 78 (115)
T ss_pred hCCCCCCCCHHHHHHH
Confidence 489999999998755
No 186
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=75.20 E-value=5.5 Score=33.16 Aligned_cols=39 Identities=10% Similarity=0.195 Sum_probs=34.7
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
...+.+.||++=...-|||+-|....+.+|+.||.+.=|
T Consensus 97 ~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya 135 (297)
T COG4785 97 DMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYA 135 (297)
T ss_pred CcHHHHHHHHHHHHhcccchHHHHHhhhHhccCCcchHH
Confidence 568999999999999999999999999999999987543
No 187
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=74.66 E-value=11 Score=25.39 Aligned_cols=58 Identities=19% Similarity=0.212 Sum_probs=39.4
Q ss_pred hhHHHHHHHHHHHhhccCcccc-----chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 17 ADIRKGILLLEELFKTETEDID-----RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~-----~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
.|...+++-|...+.-...... .-.-.+..+|..|+..|++++|+..++..+++-.++
T Consensus 12 ~dy~~A~d~L~~~fD~~~~~~~~~~~~~~~~all~lA~~~~~~G~~~~A~~~l~eAi~~Are~ 74 (94)
T PF12862_consen 12 GDYSEALDALHRYFDYAKQSNNSSSNSGLAYALLNLAELHRRFGHYEEALQALEEAIRLAREN 74 (94)
T ss_pred CCHHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHH
Confidence 4556677777776654311001 113456778999999999999999999888775443
No 188
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=73.40 E-value=10 Score=33.16 Aligned_cols=53 Identities=13% Similarity=0.167 Sum_probs=42.9
Q ss_pred hhhHHHHHHHHHHHhhc--cCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 16 CADIRKGILLLEELFKT--ETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~--~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
..++.++|+.++..... +-+ .-+.=|+|-++..|.-+.||++|..++..|++.
T Consensus 280 ~g~~~~Ai~~~~~a~~~q~~~~--Ql~~l~~~El~w~~~~~~~w~~A~~~f~~L~~~ 334 (468)
T PF10300_consen 280 KGNLEEAIESFERAIESQSEWK--QLHHLCYFELAWCHMFQHDWEEAAEYFLRLLKE 334 (468)
T ss_pred hcCHHHHHHHHHHhccchhhHH--hHHHHHHHHHHHHHHHHchHHHHHHHHHHHHhc
Confidence 56788999999976643 222 345679999999999999999999999999884
No 189
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=73.24 E-value=19 Score=27.71 Aligned_cols=52 Identities=6% Similarity=-0.006 Sum_probs=37.2
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
...+.-.+-+.-|.+.|++.+|++..+.+-+-.|.++=+++|....=.....
T Consensus 42 ~~~e~~~~~~~l~i~r~~w~dA~rlLr~l~~~~~~~p~~kALlA~CL~~~~D 93 (160)
T PF09613_consen 42 EFPELDLFDGWLHIVRGDWDDALRLLRELEERAPGFPYAKALLALCLYALGD 93 (160)
T ss_pred CchHHHHHHHHHHHHhCCHHHHHHHHHHHhccCCCChHHHHHHHHHHHHcCC
Confidence 4566677777777788888888888777777777777777777666444433
No 190
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=72.58 E-value=10 Score=25.61 Aligned_cols=34 Identities=9% Similarity=0.170 Sum_probs=26.2
Q ss_pred hHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 57 YTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 57 Y~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
|.+|....-.+++..|++++....++.|.+=+.|
T Consensus 29 Y~~aIe~L~q~~~~~pD~~~k~~yr~ki~eY~~R 62 (75)
T cd02682 29 YKKAIEVLSQIVKNYPDSPTRLIYEQMINEYKRR 62 (75)
T ss_pred HHHHHHHHHHHHHhCCChHHHHHHHHHHHHHHHH
Confidence 4455666778899999999988888888765554
No 191
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=72.40 E-value=10 Score=34.04 Aligned_cols=27 Identities=7% Similarity=-0.034 Sum_probs=12.5
Q ss_pred HHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 48 AIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 48 Avg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
.-++.+.|+++.|++..+.+++++|+|
T Consensus 501 l~a~~~~g~~~~a~~~~~~l~~~~p~~ 527 (697)
T PLN03081 501 LTACRIHKNLELGRLAAEKLYGMGPEK 527 (697)
T ss_pred HHHHHHcCCcHHHHHHHHHHhCCCCCC
Confidence 334444444444444444444444443
No 192
>KOG1156|consensus
Probab=72.37 E-value=7.3 Score=36.28 Aligned_cols=45 Identities=16% Similarity=0.214 Sum_probs=36.1
Q ss_pred cchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 38 DRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 38 ~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
..--=++|++|..+.++|+|+.|..|+|..+.+=|.-..--.+|.
T Consensus 368 ttllWt~y~laqh~D~~g~~~~A~~yId~AIdHTPTliEly~~Ka 412 (700)
T KOG1156|consen 368 TTLLWTLYFLAQHYDKLGDYEVALEYIDLAIDHTPTLIELYLVKA 412 (700)
T ss_pred hHHHHHHHHHHHHHHHcccHHHHHHHHHHHhccCchHHHHHHHHH
Confidence 344568999999999999999999999999999986444333333
No 193
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=71.78 E-value=39 Score=27.24 Aligned_cols=62 Identities=11% Similarity=0.156 Sum_probs=48.4
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
++.+-||.||-..+-+|+. ..+|.|-.+..| .+.-.-=-|+++.||-|||..|++..+++-.
T Consensus 134 ARlraa~lLvD~gsy~dV~---srvepLa~d~n~---mR~sArEALglAa~kagd~a~A~~~F~qia~ 195 (221)
T COG4649 134 ARLRAAYLLVDNGSYDDVS---SRVEPLAGDGNP---MRHSAREALGLAAYKAGDFAKAKSWFVQIAN 195 (221)
T ss_pred HHHHHHHHHhccccHHHHH---HHhhhccCCCCh---hHHHHHHHHhHHHHhccchHHHHHHHHHHHc
Confidence 5677899999888777764 556677777644 5555555679999999999999999998876
No 194
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=71.40 E-value=12 Score=29.97 Aligned_cols=58 Identities=19% Similarity=0.205 Sum_probs=41.4
Q ss_pred hhHHHHHHHHHHH---hhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 17 ADIRKGILLLEEL---FKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 17 ~d~~~GI~LL~~l---~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.|..++|+..++- ++.+ .......+|+--+|--+.++|+|++|...++.+.+.--+|+
T Consensus 129 ~d~e~Ai~~Y~~A~~~y~~e-~~~~~a~~~~~~~A~l~~~l~~y~~A~~~~e~~~~~~l~~~ 189 (282)
T PF14938_consen 129 GDYEKAIEYYQKAAELYEQE-GSPHSAAECLLKAADLYARLGRYEEAIEIYEEVAKKCLENN 189 (282)
T ss_dssp --HHHHHHHHHHHHHHHHHT-T-HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHTCCCHC
T ss_pred CCHHHHHHHHHHHHHHHHHC-CChhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHhhccc
Confidence 5667788777763 3333 11256789999999999999999999999999988654443
No 195
>KOG0545|consensus
Probab=71.18 E-value=17 Score=30.79 Aligned_cols=64 Identities=19% Similarity=0.195 Sum_probs=46.7
Q ss_pred hhHHHHHHHHHHHhhccCccc-------cchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDI-------DRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~-------~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
.-.+++|-.|+.|.-.+.|.. ....-.+--++-++.+.+||-+++..|..+|+.+|.|-.|.--
T Consensus 199 ~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~e~yevleh~seiL~~~~~nvKA~fr 269 (329)
T KOG0545|consen 199 SKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKEEYYEVLEHCSEILRHHPGNVKAYFR 269 (329)
T ss_pred HHHHHHHHHHHHHHhccCCCChHHHHHHHhhhHHHHhHHHHHhhHHHHHHHHHHHHHHHhcCCchHHHHHH
Confidence 346789999999877665631 0111223345778889999999999999999999999766543
No 196
>PF11588 DUF3243: Protein of unknown function (DUF3243); InterPro: IPR021637 This family of proteins with unknown function includes uncharacterised proteins ymfJ and yflH. The family appears to be restricted to Firmicutes.; PDB: 3D0W_B.
Probab=71.06 E-value=4.2 Score=28.11 Aligned_cols=31 Identities=16% Similarity=0.253 Sum_probs=18.1
Q ss_pred HHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 47 LAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 47 LAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
++-..|++|||-. =+.||.|+|-+-|+++=+
T Consensus 31 i~~~A~~iGdyLA--------~~vdP~N~EerlLkELW~ 61 (81)
T PF11588_consen 31 IANLAYQIGDYLA--------KNVDPKNPEERLLKELWD 61 (81)
T ss_dssp ---HHHHHHHHHH--------T-----SHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHH--------hcCCCCCHHHHHHHHHHH
Confidence 3456788888822 267999999999999865
No 197
>KOG4626|consensus
Probab=70.63 E-value=12 Score=35.36 Aligned_cols=56 Identities=14% Similarity=0.095 Sum_probs=42.2
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
...++.+||.-.++-++-+ | .=-|..|-|+..+-..+.|++|..++...|...|++
T Consensus 230 ~~Gei~~aiq~y~eAvkld-P---~f~dAYiNLGnV~ke~~~~d~Avs~Y~rAl~lrpn~ 285 (966)
T KOG4626|consen 230 AQGEIWLAIQHYEEAVKLD-P---NFLDAYINLGNVYKEARIFDRAVSCYLRALNLRPNH 285 (966)
T ss_pred hcchHHHHHHHHHHhhcCC-C---cchHHHhhHHHHHHHHhcchHHHHHHHHHHhcCCcc
Confidence 4568899999999999998 7 567777777777777777777777755555555544
No 198
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=70.42 E-value=28 Score=26.45 Aligned_cols=64 Identities=17% Similarity=0.052 Sum_probs=46.4
Q ss_pred hhHHHHHHHHHHHhhccC--ccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 17 ADIRKGILLLEELFKTET--EDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~--~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
..+.+=++-|+.-+++-. ..-+..|+.+.-+|--|++.||+++|++.+..+.+.-....|...+
T Consensus 10 ~~~~~~~~~Le~elk~~~~n~~kesir~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~ 75 (177)
T PF10602_consen 10 AKNAEELEKLEAELKDAKSNLGKESIRMALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDM 75 (177)
T ss_pred HHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHH
Confidence 334455666666665410 1115679999999999999999999999999988877666666553
No 199
>KOG0551|consensus
Probab=70.36 E-value=14 Score=32.20 Aligned_cols=28 Identities=21% Similarity=0.376 Sum_probs=14.4
Q ss_pred HHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 47 LAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 47 LAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
-|-+++-+|||..|++=|.+.|+.+|.+
T Consensus 125 RAAa~~~l~NyRs~l~Dcs~al~~~P~h 152 (390)
T KOG0551|consen 125 RAAAQLYLGNYRSALNDCSAALKLKPTH 152 (390)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCcch
Confidence 4445555555555555555555555544
No 200
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=70.06 E-value=21 Score=23.49 Aligned_cols=53 Identities=19% Similarity=0.372 Sum_probs=33.9
Q ss_pred hhHHHHHHHHHHHhhccCccccc-hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC
Q psy16295 17 ADIRKGILLLEELFKTETEDIDR-KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE 73 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~-~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~ 73 (128)
.+...++..+...++.. + . ..+.++.++..+...+++..|..++...++..|.
T Consensus 181 ~~~~~a~~~~~~~~~~~-~---~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 234 (291)
T COG0457 181 GRYEEALELLEKALKLN-P---DDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPD 234 (291)
T ss_pred cCHHHHHHHHHHHHhhC-c---ccchHHHHHhhHHHHHcccHHHHHHHHHHHHhhCcc
Confidence 34556666666666665 3 2 4566666666666666777777777777776666
No 201
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=70.03 E-value=22 Score=29.58 Aligned_cols=79 Identities=13% Similarity=0.078 Sum_probs=55.3
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHH---HhcCCCCHHHHHHHHHHHHHHhh---h
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAF---LAIESENSQVQHLESIVKKRMET---E 91 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~l---L~~eP~N~QA~~Lk~~Ie~~i~k---d 91 (128)
.....++.|++|+..+ | -...-++.+=-++++.|+++.|++.++.+ +..|+.=.++..+....++.+.+ +
T Consensus 168 ~~~~~~~~l~~Li~~d-p---~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~~~~edlgi~P~~~~~~~y~~~~~~~~~~ 243 (280)
T COG3629 168 RADAVIEHLERLIELD-P---YDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKTLAEELGIDPAPELRALYEEILRQDPLD 243 (280)
T ss_pred cHHHHHHHHHHHHhcC-c---cchHHHHHHHHHHHHcCCchHHHHHHHHHHHHhhhhcCCCccHHHHHHHHHHhcccccc
Confidence 3456789999999998 6 56666677777899999999999998876 45566666666666665555444 4
Q ss_pred HHHHHHHHH
Q psy16295 92 GLKGMALAG 100 (128)
Q Consensus 92 GliG~~i~g 100 (128)
.-+++...+
T Consensus 244 ~~~~~~~~~ 252 (280)
T COG3629 244 NKVSVVTKE 252 (280)
T ss_pred cccceeccc
Confidence 444444443
No 202
>TIGR01148 mtrC N5-methyltetrahydromethanopterin:coenzyme M methyltransferase subunit C. coenzyme M methyltransferase subunit C in methanogenic archaea. This methyltranferase is membrane-associated enzyme complex that uses methyl-transfer reaction to drive sodium-ion pump. Archaea have evolved energy-yielding pathways marked by one-carbon biochemistry featuring novel cofactors and enzymes. This transferase is involved in the transfer of 'methyl' group from N5-methyltetrahydromethanopterin to coenzyme M. In an accompanying reaction, methane is produced by two-electron reduction of the methyl moiety in methyl-coenzyme M by another enzyme methyl-coenzyme M reductase.
Probab=69.62 E-value=15 Score=30.54 Aligned_cols=50 Identities=16% Similarity=0.088 Sum_probs=36.4
Q ss_pred CCCCHHHHHHHHHHHHHHhh---hHHHHHHHHHHHHHHHHhhHHHhhhhhhcc
Q psy16295 71 ESENSQVQHLESIVKKRMET---EGLKGMALAGGAVLAQGQGTMEMSCKSYKN 120 (128)
Q Consensus 71 eP~N~QA~~Lk~~Ie~~i~k---dGliG~~i~gga~~~~g~~~~~~~~~~~~~ 120 (128)
-|+-+|-++|.-.+|.-.-. -|++...++.|..+++|+++.+++++.|-+
T Consensus 193 GPnE~q~RTL~La~e~G~ls~ii~~i~s~~~~s~~~iiig~i~W~v~y~~fv~ 245 (265)
T TIGR01148 193 GPNESQDRTLWLAVECGFITGFVSSLHEGLMVAGLNILVGAIIWYVAFSKFVA 245 (265)
T ss_pred CCCcchhHHHHHHHHHhHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHH
Confidence 69999999999999875433 333334444455588999999999988743
No 203
>KOG2471|consensus
Probab=69.52 E-value=11 Score=34.51 Aligned_cols=49 Identities=12% Similarity=-0.016 Sum_probs=42.6
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCC--CCHHHHHHHHHHHHHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIES--ENSQVQHLESIVKKRM 88 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP--~N~QA~~Lk~~Ie~~i 88 (128)
+.--+|-||.++.-+++|+.|+.++.....+-| -|.||..|...|+=++
T Consensus 618 r~v~~~nLa~a~alq~~~dqAk~ll~~aatl~hs~v~~~A~~lavyidL~~ 668 (696)
T KOG2471|consen 618 RGVLFANLAAALALQGHHDQAKSLLTHAATLLHSLVNVQATVLAVYIDLML 668 (696)
T ss_pred hHHHHHHHHHHHHHhcccHHHHHHHHHHHHhhhccccHHHHHHHHHHHHhc
Confidence 344568899999999999999999988888877 8999999999998665
No 204
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=69.17 E-value=13 Score=24.41 Aligned_cols=60 Identities=15% Similarity=0.327 Sum_probs=37.6
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
+.+++.++.+-++.+.. .+..+ |+ .-|.+|..++-..++.+|+......+++.+..-+.+
T Consensus 3 ~~~A~~l~~~Av~~D~~--g~y~e-----A~-----~~Y~~aie~l~~~~k~e~~~~~k~~~~~k~~eyl~R 62 (75)
T cd02678 3 LQKAIELVKKAIEEDNA--GNYEE-----AL-----RLYQHALEYFMHALKYEKNPKSKESIRAKCTEYLDR 62 (75)
T ss_pred HHHHHHHHHHHHHHHHc--CCHHH-----HH-----HHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence 45677777776655421 12111 22 235667777888889999777777777777766655
No 205
>PF02184 HAT: HAT (Half-A-TPR) repeat; InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=68.92 E-value=5.3 Score=23.04 Aligned_cols=22 Identities=18% Similarity=0.394 Sum_probs=18.9
Q ss_pred hchHHHHHHHHHHHhcCCCCHH
Q psy16295 55 KEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 55 gdY~~A~~~~~~lL~~eP~N~Q 76 (128)
+++++||..+++++.+.|+-..
T Consensus 1 kE~dRAR~IyeR~v~~hp~~k~ 22 (32)
T PF02184_consen 1 KEFDRARSIYERFVLVHPEVKN 22 (32)
T ss_pred ChHHHHHHHHHHHHHhCCCchH
Confidence 4789999999999999997543
No 206
>PF09797 NatB_MDM20: N-acetyltransferase B complex (NatB) non catalytic subunit; InterPro: IPR019183 This is the non-catalytic subunit of the N-terminal acetyltransferase B complex (NatB). The NatB complex catalyses the acetylation of the amino-terminal methionine residue of all proteins beginning with Met-Asp or Met-Glu and of some proteins beginning with Met-Asn or Met-Met. In Saccharomyces cerevisiae (Baker's yeast) this subunit is called MDM20 and in Schizosaccharomyces pombe (Fission yeast) it is called Arm1. NatB acetylates the Tpm1 protein and regulates and tropomyocin-actin interactions. This subunit is required by the NatB complex for the N-terminal acetylation of Tpm1 [].
Probab=68.89 E-value=21 Score=29.74 Aligned_cols=72 Identities=17% Similarity=0.203 Sum_probs=55.1
Q ss_pred ccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 12 RSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 12 ~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
.+++...+-++|.+||..+..+ | .+.+....|.--|..+|-.+.|...++.| +=.|-|--+|--.+-+++..
T Consensus 192 ~~~~~~~l~~Ai~lLE~~l~~s-~---~n~~~~LlLvrlY~~LG~~~~A~~~~~~L---~iK~IQ~DTL~h~~~~r~~~ 263 (365)
T PF09797_consen 192 KTKDSEYLLQAIALLEHALKKS-P---HNYQLKLLLVRLYSLLGAGSLALEHYESL---DIKNIQLDTLGHLILDRLST 263 (365)
T ss_pred ccCCHHHHHHHHHHHHHHHHcC-C---CcHHHHHHHHHHHHHcCCHHHHHHHHHhc---ChHHHHHHHhHHHHHHHHhc
Confidence 5788999999999999999998 6 78999999999999999999998886432 22344555554444444443
No 207
>KOG0551|consensus
Probab=68.09 E-value=21 Score=31.09 Aligned_cols=47 Identities=13% Similarity=0.258 Sum_probs=42.6
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
..--|.|=-|-+++.+..+.+|.+.|+..|+++-++.-+..|..+|.
T Consensus 151 ~h~Ka~~R~Akc~~eLe~~~~a~nw~ee~~~~d~e~K~~~~l~~l~~ 197 (390)
T KOG0551|consen 151 THLKAYIRGAKCLLELERFAEAVNWCEEGLQIDDEAKKAIELRNLIH 197 (390)
T ss_pred chhhhhhhhhHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHHHHhhcC
Confidence 45678888999999999999999999999999999999999997663
No 208
>PF01239 PPTA: Protein prenyltransferase alpha subunit repeat; InterPro: IPR002088 Protein prenylation is the posttranslational attachment of either a farnesyl group or a geranylgeranyl group via a thioether linkage (-C-S-C-) to a cysteine at or near the carboxyl terminus of the protein. Farnesyl and geranylgeranyl groups are polyisoprenes, unsaturated hydrocarbons with a multiple of five carbons; the chain is 15 carbons long in the farnesyl moiety and 20 carbons long in the geranylgeranyl moiety. There are three different protein prenyltransferases in humans: farnesyltransferase (FT) and geranylgeranyltransferase 1 (GGT1) share the same motif (the CaaX box) around the cysteine in their substrates, and are thus called CaaX prenyltransferases, whereas geranylgeranyltransferase 2 (GGT2, also called Rab geranylgeranyltransferase) recognises a different motif and is thus called a non-CaaX prenyltransferase. Protein prenyltransferases are currently known only in eukaryotes, but they are widespread, being found in vertebrates, insects, nematodes, plants, fungi and protozoa, including several parasites. Each protein consists of two subunits, alpha and beta; the alpha subunit of FT and GGT1 is encoded by the same gene, FNTA. The alpha subunit is thought to participate in a stable complex with the isoprenyl substrate; the beta subunit binds the peptide substrate. In the alpha subunits of both types of protein prenyltransferases, seven tetratricopeptide repeats are formed by pairs of helices that are stabilised by conserved intercalating residues. The alpha subunits of GGT2 in mammals and plants also have an immunoglobulin-like domain between the fifth and sixth tetratricopeptide repeat, as well as leucine-rich repeats at the carboxyl terminus. The functions of these additional domains in GGT2 are as yet undefined, but they are apparently not directly involved in the interaction with substrates and Rab escort proteins. The tetratricopeptide repeats of the alpha subunit form a right-handed superhelix, which embraces the (alpha-alpha)6 barrel of the beta subunit []. ; GO: 0008318 protein prenyltransferase activity, 0018342 protein prenylation; PDB: 1S63_A 1LD7_A 1LD8_A 2H6G_A 1SA4_A 1MZC_A 1TN6_A 2F0Y_A 2H6H_A 2H6F_A ....
Probab=67.68 E-value=7.2 Score=21.14 Aligned_cols=26 Identities=12% Similarity=0.227 Sum_probs=20.1
Q ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 59 KSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 59 ~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
+.+.++..++..+|.|.++=.=+.-|
T Consensus 1 ~El~~~~~~l~~~pknys~W~yR~~l 26 (31)
T PF01239_consen 1 DELEFTKKALEKDPKNYSAWNYRRWL 26 (31)
T ss_dssp HHHHHHHHHHHHSTTCHHHHHHHHHH
T ss_pred CHHHHHHHHHHHCcccccHHHHHHHH
Confidence 35789999999999999885544443
No 209
>PRK01030 tetrahydromethanopterin S-methyltransferase subunit C; Provisional
Probab=67.53 E-value=20 Score=29.84 Aligned_cols=50 Identities=16% Similarity=0.133 Sum_probs=34.0
Q ss_pred CCCCHHHHHHHHHHHHHHhhh---HHHHHHHHH----HHHHHHHhhHHHhhhhhhcc
Q psy16295 71 ESENSQVQHLESIVKKRMETE---GLKGMALAG----GAVLAQGQGTMEMSCKSYKN 120 (128)
Q Consensus 71 eP~N~QA~~Lk~~Ie~~i~kd---GliG~~i~g----ga~~~~g~~~~~~~~~~~~~ 120 (128)
-|+-+|-|+|.-.+|.-.-.= |++..+..+ ...+++|+++.+.+++.|-+
T Consensus 187 GP~E~q~RTL~la~e~G~ls~ii~gi~s~~~~~~~~~~i~iivg~i~W~~~y~~y~~ 243 (264)
T PRK01030 187 GPNESQDRTLTLAVECGFLSMIIFGIASLAFLGLAAAIISIIVGLIGWYYAYVKFVK 243 (264)
T ss_pred CCCcchhHHHHHHHHHHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHHHHHH
Confidence 699999999999998754332 222222222 12478899999999988743
No 210
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=67.37 E-value=9.2 Score=27.83 Aligned_cols=31 Identities=13% Similarity=0.066 Sum_probs=23.9
Q ss_pred HHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 47 LAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 47 LAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
.|..+++.|||.+|++..+.+....+++.+.
T Consensus 2 ~A~~~~~rGnhiKAL~iied~i~~h~~~~~~ 32 (111)
T PF04781_consen 2 KAKDYFARGNHIKALEIIEDLISRHGEDESS 32 (111)
T ss_pred hHHHHHHccCHHHHHHHHHHHHHHccCCCch
Confidence 3667788888888888888888888877754
No 211
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=67.30 E-value=15 Score=23.64 Aligned_cols=62 Identities=16% Similarity=0.304 Sum_probs=38.7
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
....++++++..-++.+.. .. |--|+ .-|.+|...+...++.+|+......++..+.+-+.+
T Consensus 3 ~~~~~A~~li~~Av~~d~~--g~-----~~eAl-----~~Y~~a~e~l~~~~~~~~~~~~~~~~~~k~~eyl~r 64 (77)
T smart00745 3 DYLSKAKELISKALKADEA--GD-----YEEAL-----ELYKKAIEYLLEGIKVESDSKRREAVKAKAAEYLDR 64 (77)
T ss_pred cHHHHHHHHHHHHHHHHHc--CC-----HHHHH-----HHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHH
Confidence 3456777777665555411 11 11232 235677777888899999877777777777665554
No 212
>KOG3060|consensus
Probab=66.76 E-value=19 Score=30.26 Aligned_cols=57 Identities=23% Similarity=0.272 Sum_probs=42.3
Q ss_pred hhhHHHHHHHHHHHhhccCcc----ccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 16 CADIRKGILLLEELFKTETED----IDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~----~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
..+..+++-.+|+++-.+ |. ..+--|.+|++|- ..++.-||+|+.+.|++.|.|.-+
T Consensus 167 ~~~f~kA~fClEE~ll~~-P~n~l~f~rlae~~Yt~gg----~eN~~~arkyy~~alkl~~~~~ra 227 (289)
T KOG3060|consen 167 EGDFEKAAFCLEELLLIQ-PFNPLYFQRLAEVLYTQGG----AENLELARKYYERALKLNPKNLRA 227 (289)
T ss_pred HhHHHHHHHHHHHHHHcC-CCcHHHHHHHHHHHHHHhh----HHHHHHHHHHHHHHHHhChHhHHH
Confidence 467788999999998888 62 0122355555554 568999999999999999977544
No 213
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=66.10 E-value=25 Score=31.93 Aligned_cols=56 Identities=13% Similarity=0.192 Sum_probs=32.4
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKY 63 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~ 63 (128)
+.++|.|+.+-. ..+++|+++++++++. |. ...-...|-|.-+...++.|.++...
T Consensus 262 KrRLAmCarklG---r~~EAIk~~rdLlke~-p~-~~~l~IrenLie~LLelq~Yad~q~l 317 (539)
T PF04184_consen 262 KRRLAMCARKLG---RLREAIKMFRDLLKEF-PN-LDNLNIRENLIEALLELQAYADVQAL 317 (539)
T ss_pred HHHHHHHHHHhC---ChHHHHHHHHHHHhhC-Cc-cchhhHHHHHHHHHHhcCCHHHHHHH
Confidence 455666666443 3446777777777665 31 23344556666666666666665544
No 214
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=65.68 E-value=42 Score=29.30 Aligned_cols=69 Identities=14% Similarity=0.207 Sum_probs=61.4
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKR 87 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~ 87 (128)
..|...+|+-++.+...+ | +--.+.+--|..+|-.+|+-.+.+.+....-+.-|+-.-...|-++|+..
T Consensus 227 ~g~y~~AV~~~e~v~eQn-~--~yl~evl~~L~~~Y~~lg~~~~~~~fL~~~~~~~~g~~~~l~l~~lie~~ 295 (389)
T COG2956 227 KGDYQKAVEALERVLEQN-P--EYLSEVLEMLYECYAQLGKPAEGLNFLRRAMETNTGADAELMLADLIELQ 295 (389)
T ss_pred ccchHHHHHHHHHHHHhC-h--HHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHccCCccHHHHHHHHHHHh
Confidence 356778999999999999 6 68899999999999999999999999999999999988888888888754
No 215
>KOG1130|consensus
Probab=64.80 E-value=8.2 Score=34.84 Aligned_cols=53 Identities=26% Similarity=0.383 Sum_probs=39.0
Q ss_pred HHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHH
Q psy16295 9 CLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYC 64 (128)
Q Consensus 9 aLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~ 64 (128)
-|.|+. |-+.||.+|+.-+.-+.+|...-.-....|.-+|+=++||++|++|-
T Consensus 26 RLck~g---dcraGv~ff~aA~qvGTeDl~tLSAIYsQLGNAyfyL~DY~kAl~yH 78 (639)
T KOG1130|consen 26 RLCKMG---DCRAGVDFFKAALQVGTEDLSTLSAIYSQLGNAYFYLKDYEKALKYH 78 (639)
T ss_pred HHHhcc---chhhhHHHHHHHHHhcchHHHHHHHHHHHhcchhhhHhhHHHHHhhh
Confidence 355554 56789999999998875432222344557888999999999999994
No 216
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=64.03 E-value=10 Score=19.27 Aligned_cols=22 Identities=9% Similarity=0.049 Sum_probs=18.2
Q ss_pred hchHHHHHHHHHHHhcCCCCHH
Q psy16295 55 KEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 55 gdY~~A~~~~~~lL~~eP~N~Q 76 (128)
|+.++|+..++..++.-|.+..
T Consensus 1 ~~~~~~r~i~e~~l~~~~~~~~ 22 (33)
T smart00386 1 GDIERARKIYERALEKFPKSVE 22 (33)
T ss_pred CcHHHHHHHHHHHHHHCCCChH
Confidence 5788999999999999986654
No 217
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=63.74 E-value=34 Score=27.10 Aligned_cols=56 Identities=18% Similarity=0.096 Sum_probs=41.2
Q ss_pred CChhhHHHHHHHHHHHhhc---cCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 14 EFCADIRKGILLLEELFKT---ETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 14 ~~~~d~~~GI~LL~~l~~~---~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
+..++...-|++|+.-+.+ ... .-..+.+...+|--|++.|||++|+++++.++..
T Consensus 149 ~~~~hs~~iI~lL~~A~~~f~~~~~-~R~~~~l~~~~A~ey~~~g~~~~A~~~l~~~~~~ 207 (247)
T PF11817_consen 149 KGVDHSKLIIELLEKAYEQFKKYGQ-NRMASYLSLEMAEEYFRLGDYDKALKLLEPAASS 207 (247)
T ss_pred cccchHHHHHHHHHHHHHHHHHhcc-chHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 4456777889999876554 111 0244666778999999999999999999988543
No 218
>PF11087 DUF2881: Protein of unknown function (DUF2881); InterPro: IPR020085 This entry represents the DNA delivery proteins P32 and P34 that are found in the virion membrane. It is responsible for DNA delivery and viral infectivity []. It is also required for DNA injection in the membrane transformation event. In Enterobacteria phage PRD1 virions, which are composed of a tail-less icosahedral capsid, an inner protein-lipid membrane, and a dsDNA genome which is located inside the lipid vesicle; the DNA is packaged into a preformed procapsid. The internal membrane plays an active role in DNA delivery to the host cell by forming a tubular structure used for injecting the DNA into the host cytoplasm.
Probab=63.50 E-value=4 Score=25.92 Aligned_cols=23 Identities=30% Similarity=0.322 Sum_probs=18.4
Q ss_pred HHHHHHhhHHHhhhhhhcchhhhhh
Q psy16295 102 AVLAQGQGTMEMSCKSYKNTASVLK 126 (128)
Q Consensus 102 a~~~~g~~~~~~~~~~~~~~~~~~~ 126 (128)
+.++++|++..+|+.| |||-|.|
T Consensus 14 aiigvai~av~vs~~s--ntagvi~ 36 (54)
T PF11087_consen 14 AIIGVAIIAVIVSQNS--NTAGVIQ 36 (54)
T ss_pred HHHHHHHHhheeeccC--Cccceee
Confidence 5577888899999987 7887765
No 219
>KOG1127|consensus
Probab=63.23 E-value=15 Score=36.02 Aligned_cols=57 Identities=7% Similarity=0.122 Sum_probs=51.3
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhc-hHHHHHHHHHHHhcCCCCHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKE-YTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgd-Y~~A~~~~~~lL~~eP~N~QA 77 (128)
.+..++|+..+..++.+ | ++-....+|++++.-++. -++|-..+-..-++||+|-=|
T Consensus 16 k~YeealEqskkvLk~d-p---dNYnA~vFLGvAl~sl~q~le~A~ehYv~AaKldpdnlLA 73 (1238)
T KOG1127|consen 16 KEYEEALEQSKKVLKED-P---DNYNAQVFLGVALWSLGQDLEKAAEHYVLAAKLDPDNLLA 73 (1238)
T ss_pred ccHHHHHHHHHHHHhcC-C---CcchhhhHHHHHHHhccCCHHHHHHHHHHHHhcChhhhHH
Confidence 46778999999999999 6 689999999999999998 999999999999999998654
No 220
>cd00197 VHS_ENTH_ANTH VHS, ENTH and ANTH domain superfamily; composed of proteins containing a VHS, ENTH or ANTH domain. The VHS domain is present in Vps27 (Vacuolar Protein Sorting), Hrs (Hepatocyte growth factor-regulated tyrosine kinase substrate) and STAM (Signal Transducing Adaptor Molecule). It is located at the N-termini of proteins involved in intracellular membrane trafficking. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. VHS, ENTH and ANTH domains are structurally similar and are composed of a superhelix of eight alpha helices. ENTH adnd ANTH (E/ANTH) domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membra
Probab=63.14 E-value=15 Score=25.46 Aligned_cols=25 Identities=20% Similarity=0.090 Sum_probs=20.9
Q ss_pred HhccCChhhHHHHHHHHHHHhhccC
Q psy16295 10 LVRSEFCADIRKGILLLEELFKTET 34 (128)
Q Consensus 10 Lv~S~~~~d~~~GI~LL~~l~~~~~ 34 (128)
=+.++++..+.+++.+|+.+++++.
T Consensus 45 Rl~~~~~~~~lkaL~lLe~lvkN~g 69 (115)
T cd00197 45 RINNKNPHVVLKALTLLEYCVKNCG 69 (115)
T ss_pred HhcCCcHHHHHHHHHHHHHHHHHcc
Confidence 3456789999999999999999973
No 221
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=62.42 E-value=45 Score=27.28 Aligned_cols=51 Identities=16% Similarity=0.063 Sum_probs=44.3
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE 73 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~ 73 (128)
.+...|+++++-+ |+ .++.|-.-.+|-.+.-+|.|..|+...+.++..-|+
T Consensus 142 ~a~~tLe~l~e~~-pa-~r~pd~~Ll~aR~laa~g~~a~Aesafe~a~~~ypg 192 (251)
T COG4700 142 AAQQTLEDLMEYN-PA-FRSPDGHLLFARTLAAQGKYADAESAFEVAISYYPG 192 (251)
T ss_pred HHHHHHHHHhhcC-Cc-cCCCCchHHHHHHHHhcCCchhHHHHHHHHHHhCCC
Confidence 5677899999988 64 788999999999999999999999999988888875
No 222
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=62.38 E-value=20 Score=23.10 Aligned_cols=35 Identities=26% Similarity=0.311 Sum_probs=26.2
Q ss_pred chHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 56 EYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 56 dY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
-|.+|...+...++.+|+...-..|++.|+.-+.+
T Consensus 28 ~Y~~a~e~l~~~~~~~~~~~~k~~l~~k~~~yl~R 62 (75)
T cd02656 28 LYKEALDYLLQALKAEKEPKLRKLLRKKVKEYLDR 62 (75)
T ss_pred HHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHH
Confidence 35566777888888999877777788888776665
No 223
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=61.57 E-value=22 Score=23.63 Aligned_cols=60 Identities=20% Similarity=0.299 Sum_probs=40.6
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
..+||+++.+-+..+.. .+.+ -|+ .-|..|+.+|-..++-||+-.+-..|++.+..-+.+
T Consensus 3 l~~Ai~lv~~Av~~D~~---g~y~----eA~-----~lY~~ale~~~~~~k~e~~~~~k~~lr~k~~eyl~R 62 (75)
T cd02684 3 LEKAIALVVQAVKKDQR---GDAA----AAL-----SLYCSALQYFVPALHYETDAQRKEALRQKVLQYVSR 62 (75)
T ss_pred HHHHHHHHHHHHHHHHh---ccHH----HHH-----HHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence 45788888877666521 1111 111 235677778888899998888878888888877765
No 224
>COG2023 RPR2 RNase P subunit RPR2 [Translation, ribosomal structure and biogenesis]
Probab=61.24 E-value=13 Score=26.98 Aligned_cols=33 Identities=27% Similarity=0.322 Sum_probs=29.8
Q ss_pred cchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcC
Q psy16295 38 DRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIE 71 (128)
Q Consensus 38 ~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~e 71 (128)
..+-|+||-+|-..++.+ .+.|++|++.+-.+.
T Consensus 11 ~eRi~~L~~lA~~~~~~~-~~laRrYv~la~~Is 43 (105)
T COG2023 11 AERIDYLYSLAEETFRTG-PDLARRYVKLARRIS 43 (105)
T ss_pred HHHHHHHHHHHHHHHhcC-cHHHHHHHHHHHHHH
Confidence 468999999999999999 899999999887764
No 225
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=60.87 E-value=15 Score=31.18 Aligned_cols=34 Identities=15% Similarity=0.408 Sum_probs=26.9
Q ss_pred HHHHHHHHhhhhchHHHHHHHHHHHhcCCC-CHHH
Q psy16295 44 LYYLAIGNARIKEYTKSLKYCRAFLAIESE-NSQV 77 (128)
Q Consensus 44 LYyLAvg~yklgdY~~A~~~~~~lL~~eP~-N~QA 77 (128)
+|-.-...-+.|-|..|+++|+.||.+||+ |+-+
T Consensus 106 l~r~i~~L~~RG~~rTAlE~~KlLlsLdp~~DP~g 140 (360)
T PF04910_consen 106 LFRYIQSLGRRGCWRTALEWCKLLLSLDPDEDPLG 140 (360)
T ss_pred HHHHHHHHHhcCcHHHHHHHHHHHHhcCCCCCcch
Confidence 344445566889999999999999999999 5443
No 226
>KOG1840|consensus
Probab=60.87 E-value=47 Score=29.90 Aligned_cols=47 Identities=26% Similarity=0.239 Sum_probs=35.6
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
.+..+...-..++.| ..-+.|.| ||..|...|+|++|..-|+..|++
T Consensus 182 ~~~~~~~~~~~~~~P--~~~~~~~~-La~~y~~~g~~e~A~~l~k~Al~~ 228 (508)
T KOG1840|consen 182 KGLDIQAKGLGDEDP--ERLRTLRN-LAEMYAVQGRLEKAEPLCKQALRI 228 (508)
T ss_pred HHHHHHHHhcccCCc--hHHHHHHH-HHHHHHHhccHHHHHHHHHHHHHH
Confidence 344444444555545 56666666 999999999999999999999998
No 227
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=60.82 E-value=74 Score=24.47 Aligned_cols=73 Identities=15% Similarity=0.107 Sum_probs=55.9
Q ss_pred HHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy16295 7 SFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKK 86 (128)
Q Consensus 7 AwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~ 86 (128)
+|-.|+ ..+..+++.+|+++.... | ...-|--++|++.+-+||-+ =+.|.+.+|+-.| |+++..|-+....
T Consensus 51 ~~l~i~---r~~w~dA~rlLr~l~~~~-~---~~p~~kALlA~CL~~~~D~~-Wr~~A~evle~~~-d~~a~~Lv~~Ll~ 121 (160)
T PF09613_consen 51 GWLHIV---RGDWDDALRLLRELEERA-P---GFPYAKALLALCLYALGDPS-WRRYADEVLESGA-DPDARALVRALLA 121 (160)
T ss_pred HHHHHH---hCCHHHHHHHHHHHhccC-C---CChHHHHHHHHHHHHcCChH-HHHHHHHHHhcCC-ChHHHHHHHHHHH
Confidence 444444 346678999999987777 5 68889999999999999864 5777888888776 8888888777665
Q ss_pred HH
Q psy16295 87 RM 88 (128)
Q Consensus 87 ~i 88 (128)
+-
T Consensus 122 ~~ 123 (160)
T PF09613_consen 122 RA 123 (160)
T ss_pred hc
Confidence 43
No 228
>PF10363 DUF2435: Protein of unknown function (DUF2435)
Probab=60.69 E-value=52 Score=22.64 Aligned_cols=69 Identities=12% Similarity=0.173 Sum_probs=54.6
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccC-ccccchHHHHHHHHHHHhhhhc---hHHHHHHHHHHHhcCCC
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTET-EDIDRKRDYLYYLAIGNARIKE---YTKSLKYCRAFLAIESE 73 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~-~~~~~~rd~LYyLAvg~yklgd---Y~~A~~~~~~lL~~eP~ 73 (128)
.|+-|+..++++.+----.|+.+|..++++.. + ....+.+--+-+.+-+..| |-+|.+....+-...|+
T Consensus 4 ~~~~al~~L~dp~~PvRa~gL~~L~~Li~~~~~~--~~~~~~il~l~l~~L~d~DsyVYL~aI~~L~~La~~~p~ 76 (92)
T PF10363_consen 4 TLQEALSDLNDPLPPVRAHGLVLLRKLIESKSEP--VIDIPKILDLFLSQLKDEDSYVYLNAIKGLAALADRHPD 76 (92)
T ss_pred HHHHHHHHccCCCcchHHHHHHHHHHHHHcCCcc--hhhHHHHHHHHHHHcCCCCchHHHHHHHHHHHHHHHChH
Confidence 58889999999999988899999999999974 3 3455666666666555555 78888888888888886
No 229
>PF14689 SPOB_a: Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=60.33 E-value=10 Score=24.20 Aligned_cols=31 Identities=19% Similarity=0.503 Sum_probs=24.7
Q ss_pred chHHHHHHHHH--HHhhhhchHHHHHHHHHHHh
Q psy16295 39 RKRDYLYYLAI--GNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 39 ~~rd~LYyLAv--g~yklgdY~~A~~~~~~lL~ 69 (128)
.+-|++..|-+ |+..+|+|++|.+|++.+.+
T Consensus 19 ~RHD~~NhLqvI~gllqlg~~~~a~eYi~~~~~ 51 (62)
T PF14689_consen 19 QRHDFLNHLQVIYGLLQLGKYEEAKEYIKELSK 51 (62)
T ss_dssp HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HhHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 46788777765 78899999999999998765
No 230
>TIGR01716 RGG_Cterm transcriptional activator, Rgg/GadR/MutR family, C-terminal domain. This model describes the whole, except for a 60 residue N-terminal helix-turn-helix DNA-binding domain (PFAM pfam01381) of the family of proteins related to the transcriptional regulator Rgg, also called RopB. Rgg is required for secretion of several proteins, including a cysteine proteinase associated with virulence. GadR is a positive regulator of a glutamate-dependent acid resistance mechanism. MutR is a transcriptional activator for mutacin biosynthesis genes in Streptococcus mutans. This family appears restricted to the low-GC Gram-positive bacteria, including at least eight members in Lactococcus lactis.
Probab=59.91 E-value=31 Score=26.07 Aligned_cols=83 Identities=11% Similarity=0.081 Sum_probs=54.2
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhc---CCCCHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI---ESENSQVQH 79 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~---eP~N~QA~~ 79 (128)
-.|+...+++.++....++-+..++++...+ .+...+--+.||-++-.|+.|+-+++.+-++.++++ --....+..
T Consensus 131 l~N~~~~~i~~~~~~~a~~~l~~l~~l~~~~-~~~~~ki~~~f~~~l~~y~~g~~~~~~~~i~~~i~~l~~lg~~~~~~~ 209 (220)
T TIGR01716 131 LLNIAVLLIEKNEFSYAQYFLEKLEKILDPE-DDLYERILFNFLKGIILYKEGQKESGEEKIEQAIEIFDELGYPTLAAY 209 (220)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHHHHhchh-hhHHHHHHHHHHHHHHHHHcCCCcccHHHHHHHHHHHHHcCCHHHHHH
Confidence 3577878888888888888888888877433 222356677788888888988866555544444332 223555655
Q ss_pred HHHHHHH
Q psy16295 80 LESIVKK 86 (128)
Q Consensus 80 Lk~~Ie~ 86 (128)
+.+.+++
T Consensus 210 ~~~~~~~ 216 (220)
T TIGR01716 210 YQKLLEK 216 (220)
T ss_pred HHHHHHH
Confidence 5555443
No 231
>PRK04841 transcriptional regulator MalT; Provisional
Probab=59.34 E-value=50 Score=30.10 Aligned_cols=72 Identities=8% Similarity=-0.058 Sum_probs=46.5
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccC--ccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTET--EDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~--~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
...+|.+....+.++..+-+....+.+.... .+...+.++...++..+...|+|++|..+.+..|+..|.+.
T Consensus 413 ~~~a~~~~~~g~~~~a~~~l~~a~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~A~~~~~~al~~~~~~~ 486 (903)
T PRK04841 413 LLQAWLAQSQHRYSEVNTLLARAEQELKDRNIELDGTLQAEFNALRAQVAINDGDPEEAERLAELALAELPLTW 486 (903)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHhccccCcccchhHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHhcCCCcc
Confidence 4567777766655544333333333333210 00023667888889999999999999999999998666543
No 232
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=59.28 E-value=45 Score=29.96 Aligned_cols=56 Identities=5% Similarity=-0.046 Sum_probs=36.0
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
..++.+|.+++++..++..- .-..+.+.-+--++.|.|++++|.+.++.+ ...|+-
T Consensus 439 ~g~~~~a~~~f~~m~~~~g~--~p~~~~y~~li~~l~r~G~~~eA~~~~~~~-~~~p~~ 494 (697)
T PLN03081 439 SGLSEQGWEIFQSMSENHRI--KPRAMHYACMIELLGREGLLDEAYAMIRRA-PFKPTV 494 (697)
T ss_pred CCcHHHHHHHHHHHHHhcCC--CCCccchHhHHHHHHhcCCHHHHHHHHHHC-CCCCCH
Confidence 34567888888888764211 123344555667788899999998887764 344543
No 233
>KOG4648|consensus
Probab=59.19 E-value=6 Score=34.95 Aligned_cols=43 Identities=12% Similarity=0.227 Sum_probs=31.9
Q ss_pred HHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 25 LLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 25 LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
++++|+.... + +---+--|||+|.|++|..|+..-...+|.|+
T Consensus 89 I~~~LL~~~S---E-----iKE~GN~yFKQgKy~EAIDCYs~~ia~~P~Np 131 (536)
T KOG4648|consen 89 IAQQLLKKAS---E-----IKERGNTYFKQGKYEEAIDCYSTAIAVYPHNP 131 (536)
T ss_pred HHHHHHHhhH---H-----HHHhhhhhhhccchhHHHHHhhhhhccCCCCc
Confidence 5666666652 2 11124468999999999999999999999775
No 234
>PF10661 EssA: WXG100 protein secretion system (Wss), protein EssA; InterPro: IPR018920 The Wss (WXG100 protein secretion system) in Staphylococcus aureus seems to be encoded by a locus of eight ORFs, called ess (eSAT-6 secretion system) []. This locus encodes, amongst several other proteins, EssA, a protein predicted to possess one transmembrane domain. Due to its predicted membrane location and its absolute requirement for WXG100 protein secretion, it has been speculated that EssA could form a secretion apparatus in conjunction with YukC and YukAB. Proteins homologous to EssA, YukC, EsaA and YukD were absent from mycobacteria []. Members of this family are associated with type VII secretion of WXG100 family targets in the Firmicutes, but not in the Actinobacteria. This highly divergent protein family consists largely of a central region of highly polar low-complexity sequence containing occasional LF motifs in weak repeats about 17 residues in length, flanked by hydrophobic N- and C-terminal regions.
Probab=58.92 E-value=11 Score=28.33 Aligned_cols=26 Identities=23% Similarity=0.178 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHHHHHhhHHHhhhhhh
Q psy16295 92 GLKGMALAGGAVLAQGQGTMEMSCKSY 118 (128)
Q Consensus 92 GliG~~i~gga~~~~g~~~~~~~~~~~ 118 (128)
..||+ +++|++++++++...+.||.|
T Consensus 119 ~~i~~-~i~g~ll~i~~giy~~~r~~~ 144 (145)
T PF10661_consen 119 PTILL-SIGGILLAICGGIYVVLRKVW 144 (145)
T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHhh
Confidence 35655 455577777767777888866
No 235
>KOG1129|consensus
Probab=58.34 E-value=6.7 Score=34.44 Aligned_cols=51 Identities=16% Similarity=0.261 Sum_probs=22.4
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.++++.+..++.+ | .+.|.+--+|++||=.++-+-|++|+.++|++--.++
T Consensus 308 ~a~~lYk~vlk~~-~---~nvEaiAcia~~yfY~~~PE~AlryYRRiLqmG~~sp 358 (478)
T KOG1129|consen 308 DALQLYKLVLKLH-P---INVEAIACIAVGYFYDNNPEMALRYYRRILQMGAQSP 358 (478)
T ss_pred HHHHHHHHHHhcC-C---ccceeeeeeeeccccCCChHHHHHHHHHHHHhcCCCh
Confidence 3444444444444 2 3344444444444444444444444444444444333
No 236
>PF08238 Sel1: Sel1 repeat; InterPro: IPR006597 Sel1-like repeats are tetratricopeptide repeat sequences originally identified in a Caenorhabditis elegans receptor molecule which is a key negative regulator of the Notch pathway []. Mammalian homologues have since been identified although these mainly pancreatic proteins have yet to have a function assigned.; PDB: 2XM6_A 3RJV_A 1OUV_A 1KLX_A.
Probab=57.81 E-value=27 Score=18.83 Aligned_cols=32 Identities=16% Similarity=0.233 Sum_probs=23.0
Q ss_pred hHHHHH--HHHhccC--ChhhHHHHHHHHHHHhhcc
Q psy16295 2 TQFNYS--FCLVRSE--FCADIRKGILLLEELFKTE 33 (128)
Q Consensus 2 ~~F~YA--waLv~S~--~~~d~~~GI~LL~~l~~~~ 33 (128)
++|++| ..+.... .+.|..++++.++.-.+.+
T Consensus 3 A~~~lg~~~~~~~g~~g~~~d~~~A~~~~~~Aa~~g 38 (39)
T PF08238_consen 3 AQYNLGMYYMYYNGKGGVPKDYEKAFKWYEKAAEQG 38 (39)
T ss_dssp HHHHHHHHHHHHHTSTSSCHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHhhhhccCCccccccchHHHHHHHHHcc
Confidence 567888 6666665 6778888888888765543
No 237
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=57.55 E-value=22 Score=24.00 Aligned_cols=16 Identities=19% Similarity=0.567 Sum_probs=12.0
Q ss_pred hHHHHHHHHHHHhhcc
Q psy16295 18 DIRKGILLLEELFKTE 33 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~ 33 (128)
|..+||.|+.+-+..+
T Consensus 2 ~l~kai~Lv~~A~~eD 17 (75)
T cd02680 2 DLERAHFLVTQAFDED 17 (75)
T ss_pred CHHHHHHHHHHHHHhh
Confidence 4568888888876665
No 238
>KOG0550|consensus
Probab=57.43 E-value=81 Score=28.35 Aligned_cols=76 Identities=12% Similarity=0.106 Sum_probs=48.8
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH---hhhHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRM---ETEGLKG 95 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i---~kdGliG 95 (128)
.+++|.=.+.-++-+ | .--..+---|-+|.-++++++|.++++..++.+-+..-.+.|+++...-- .+|-+.=
T Consensus 303 l~eaisdc~~Al~iD-~---syikall~ra~c~l~le~~e~AV~d~~~a~q~~~s~e~r~~l~~A~~aLkkSkRkd~yki 378 (486)
T KOG0550|consen 303 LREAISDCNEALKID-S---SYIKALLRRANCHLALEKWEEAVEDYEKAMQLEKDCEIRRTLREAQLALKKSKRKDWYKI 378 (486)
T ss_pred chhhhhhhhhhhhcC-H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHhhhhhHHHH
Confidence 445555555555554 2 33445555677888888888888888888888877666666666654322 4566654
Q ss_pred HHH
Q psy16295 96 MAL 98 (128)
Q Consensus 96 ~~i 98 (128)
+||
T Consensus 379 lGi 381 (486)
T KOG0550|consen 379 LGI 381 (486)
T ss_pred hhh
Confidence 444
No 239
>PF13646 HEAT_2: HEAT repeats; PDB: 1OYZ_A 3FGA_A 2PF4_C 2IAE_A 3B2A_A.
Probab=57.27 E-value=46 Score=20.95 Aligned_cols=61 Identities=20% Similarity=0.152 Sum_probs=35.5
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
+-.-+++|.+..++ +.+..|.++++++.| .-++... .|+|. +|+ +++...+..+|+.++++
T Consensus 17 r~~a~~~L~~~~~~----~~~~~L~~~l~d~~~--~vr~~a~--~aL~~--i~~-~~~~~~L~~~l~~~~~~ 77 (88)
T PF13646_consen 17 RAEAARALGELGDP----EAIPALIELLKDEDP--MVRRAAA--RALGR--IGD-PEAIPALIKLLQDDDDE 77 (88)
T ss_dssp HHHHHHHHHCCTHH----HHHHHHHHHHTSSSH--HHHHHHH--HHHHC--CHH-HHTHHHHHHHHTC-SSH
T ss_pred HHHHHHHHHHcCCH----hHHHHHHHHHcCCCH--HHHHHHH--HHHHH--hCC-HHHHHHHHHHHcCCCcH
Confidence 34456677766544 578888888866533 3333333 34443 454 55777777777776543
No 240
>PF10883 DUF2681: Protein of unknown function (DUF2681); InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=56.18 E-value=13 Score=25.89 Aligned_cols=25 Identities=16% Similarity=0.066 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHHhhHHHhhhhhhc
Q psy16295 95 GMALAGGAVLAQGQGTMEMSCKSYK 119 (128)
Q Consensus 95 G~~i~gga~~~~g~~~~~~~~~~~~ 119 (128)
-+-|++|++.+++++++++.+|+-|
T Consensus 3 ~l~iv~~~~~v~~~i~~y~~~k~~k 27 (87)
T PF10883_consen 3 NLQIVGGVGAVVALILAYLWWKVKK 27 (87)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666666677777888887754
No 241
>PRK11901 hypothetical protein; Reviewed
Probab=55.36 E-value=11 Score=32.22 Aligned_cols=25 Identities=16% Similarity=0.054 Sum_probs=19.8
Q ss_pred hHHHHHHHHHHHHHHHHhhHHHhhh
Q psy16295 91 EGLKGMALAGGAVLAQGQGTMEMSC 115 (128)
Q Consensus 91 dGliG~~i~gga~~~~g~~~~~~~~ 115 (128)
-=|||+||+-=.+|++||++++=|-
T Consensus 37 h~MiGiGilVLlLLIi~IgSALksP 61 (327)
T PRK11901 37 HMMIGIGILVLLLLIIAIGSALKSP 61 (327)
T ss_pred HHHHHHHHHHHHHHHHHHhhhccCC
Confidence 5689999997667899998887553
No 242
>KOG4507|consensus
Probab=54.89 E-value=71 Score=30.24 Aligned_cols=76 Identities=16% Similarity=0.249 Sum_probs=52.8
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHH--HhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIG--NARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg--~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
-.|+|--|+||.-..| +-.+|.+-+.-.+ .|-|+++..| +.-++|-+.|++.....|+.+|+|.--+.=
T Consensus 645 ~v~la~~~~~~~~~~d---a~~~l~q~l~~~~------sepl~~~~~g~~~l~l~~i~~a~~~~~~a~~~~~~~~~~~~~ 715 (886)
T KOG4507|consen 645 LVNLANLLIHYGLHLD---ATKLLLQALAINS------SEPLTFLSLGNAYLALKNISGALEAFRQALKLTTKCPECENS 715 (886)
T ss_pred HHHHHHHHHHhhhhcc---HHHHHHHHHhhcc------cCchHHHhcchhHHHHhhhHHHHHHHHHHHhcCCCChhhHHH
Confidence 4578999999984443 4445555443332 2334555555 555799999999999999999999987776
Q ss_pred HHHHHHH
Q psy16295 81 ESIVKKR 87 (128)
Q Consensus 81 k~~Ie~~ 87 (128)
..+|..+
T Consensus 716 l~~i~c~ 722 (886)
T KOG4507|consen 716 LKLIRCM 722 (886)
T ss_pred HHHHHHh
Confidence 6666553
No 243
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=54.87 E-value=16 Score=18.73 Aligned_cols=23 Identities=17% Similarity=0.270 Sum_probs=19.1
Q ss_pred HHHHHhhhhchHHHHHHHHHHHh
Q psy16295 47 LAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 47 LAvg~yklgdY~~A~~~~~~lL~ 69 (128)
+=-++.+.|++++|.+.++.+.+
T Consensus 6 li~~~~~~~~~~~a~~~~~~M~~ 28 (35)
T TIGR00756 6 LIDGLCKAGRVEEALELFKEMLE 28 (35)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHH
Confidence 34578899999999999998865
No 244
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=54.43 E-value=65 Score=25.50 Aligned_cols=54 Identities=11% Similarity=0.039 Sum_probs=32.5
Q ss_pred HHHHHHHHhhccCccccchHHHHHHHHHHHhhh----------hchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 22 GILLLEELFKTETEDIDRKRDYLYYLAIGNARI----------KEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 22 GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~ykl----------gdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
+.+--+.-+..+ | ...|-|+.=+.+...+ .-|++|..-.+..|+++|+.+.|.=
T Consensus 10 ark~aea~y~~n-P---~DadnL~~WG~ALLELAqfk~g~es~~miedAisK~eeAL~I~P~~hdAlw 73 (186)
T PF06552_consen 10 ARKKAEAAYAKN-P---LDADNLTNWGGALLELAQFKQGPESKKMIEDAISKFEEALKINPNKHDALW 73 (186)
T ss_dssp HHHHHHHHHHH--T---T-HHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHH-TT-HHHHH
T ss_pred HHHHHHHHHHhC-c---HhHHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhcCCchHHHHH
Confidence 334445555556 5 4566666555554444 3467788889999999999988753
No 245
>KOG1173|consensus
Probab=54.17 E-value=30 Score=31.88 Aligned_cols=61 Identities=8% Similarity=0.254 Sum_probs=45.2
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHH------HHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRD------YLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd------~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.++..+++..|+.-+..- + +...+ -+--|+-.+-|++.|++|+.+.+..|.+.|.|.-.-+
T Consensus 427 ~~~y~~A~~~f~~~l~~i-k--~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~k~~~~~a 493 (611)
T KOG1173|consen 427 YEEYPEALKYFQKALEVI-K--SVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQKALLLSPKDASTHA 493 (611)
T ss_pred HhhhHHHHHHHHHHHHHh-h--hccccccchhHHHHhHHHHHHHHhhHHHHHHHHHHHHHcCCCchhHHH
Confidence 345667777777765221 1 12222 2677899999999999999999999999999987654
No 246
>PF05844 YopD: YopD protein; InterPro: IPR008898 This family consists of several bacterial YopD like proteins. Virulent Yersinia species harbour a common plasmid that encodes essential virulence determinants (Yersinia outer proteins [Yops]), which are regulated by the extracellular stimuli Ca2+ and temperature. YopD is thought to be a possible transmembrane protein and contains an amphipathic alpha-helix in its carboxy terminus [].; PDB: 1KDL_A.
Probab=53.88 E-value=4.3 Score=34.22 Aligned_cols=32 Identities=22% Similarity=0.078 Sum_probs=0.0
Q ss_pred hhHHHHHHHHHHHH-HHHHhhHHHhhhhhhcch
Q psy16295 90 TEGLKGMALAGGAV-LAQGQGTMEMSCKSYKNT 121 (128)
Q Consensus 90 kdGliG~~i~gga~-~~~g~~~~~~~~~~~~~~ 121 (128)
-.-+|.|+||.|+. .+.+++|+.-++|+-|..
T Consensus 119 A~~mIaMaVVSGvma~~sa~~g~~s~~Ka~K~i 151 (298)
T PF05844_consen 119 AKLMIAMAVVSGVMAGASAVMGAFSSFKAGKTI 151 (298)
T ss_dssp ---------------------------------
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHH
Confidence 47899999999987 677788999999887753
No 247
>KOG4648|consensus
Probab=53.87 E-value=28 Score=30.92 Aligned_cols=37 Identities=16% Similarity=0.278 Sum_probs=27.6
Q ss_pred HHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 48 AIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 48 Avg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
+-+-..+|+..+|.+=|+.+|++||+|..-+.-...|
T Consensus 172 ~~AR~~Lg~~~EAKkD~E~vL~LEP~~~ELkK~~a~i 208 (536)
T KOG4648|consen 172 MQARESLGNNMEAKKDCETVLALEPKNIELKKSLARI 208 (536)
T ss_pred HHHHHHHhhHHHHHHhHHHHHhhCcccHHHHHHHHHh
Confidence 3455668888899999999999999987554443333
No 248
>KOG3824|consensus
Probab=53.71 E-value=18 Score=31.59 Aligned_cols=31 Identities=19% Similarity=0.192 Sum_probs=23.8
Q ss_pred HHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 48 AIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 48 Avg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
|.-.+|.|+-++|.+..+..|.+.|.|+|+.
T Consensus 123 A~~~~~~Gk~ekA~~lfeHAlalaP~~p~~L 153 (472)
T KOG3824|consen 123 AGRSRKDGKLEKAMTLFEHALALAPTNPQIL 153 (472)
T ss_pred HHHHHhccchHHHHHHHHHHHhcCCCCHHHH
Confidence 4446777888888888888888888888875
No 249
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=53.51 E-value=13 Score=29.30 Aligned_cols=27 Identities=26% Similarity=0.300 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHHHHhh-HHHhhhhhhc
Q psy16295 93 LKGMALAGGAVLAQGQG-TMEMSCKSYK 119 (128)
Q Consensus 93 liG~~i~gga~~~~g~~-~~~~~~~~~~ 119 (128)
.=|.-.|||+||.+|+. .+++..|-||
T Consensus 158 FD~~SFiGGIVL~LGv~aI~ff~~KF~k 185 (186)
T PF05283_consen 158 FDAASFIGGIVLTLGVLAIIFFLYKFCK 185 (186)
T ss_pred CchhhhhhHHHHHHHHHHHHHHHhhhcc
Confidence 33567899999999966 6777888777
No 250
>KOG0547|consensus
Probab=53.25 E-value=52 Score=30.19 Aligned_cols=62 Identities=16% Similarity=0.156 Sum_probs=50.7
Q ss_pred HHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC
Q psy16295 5 NYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE 73 (128)
Q Consensus 5 ~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~ 73 (128)
+.++++-|-. -+......+++..+.= | +-.||.-|.|-...-+++|++|.+++|...++||.
T Consensus 433 Ql~~a~Yr~~---k~~~~m~~Fee~kkkF-P---~~~Evy~~fAeiLtDqqqFd~A~k~YD~ai~LE~~ 494 (606)
T KOG0547|consen 433 QLCCALYRQH---KIAESMKTFEEAKKKF-P---NCPEVYNLFAEILTDQQQFDKAVKQYDKAIELEPR 494 (606)
T ss_pred HHHHHHHHHH---HHHHHHHHHHHHHHhC-C---CCchHHHHHHHHHhhHHhHHHHHHHHHHHHhhccc
Confidence 4455555544 4556677777777776 6 68999999999999999999999999999999999
No 251
>COG5442 FlaF Flagellar biosynthesis regulator FlaF [Cell motility and secretion]
Probab=52.92 E-value=52 Score=24.06 Aligned_cols=70 Identities=19% Similarity=0.256 Sum_probs=46.5
Q ss_pred ChHHHHHHHH------hccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh--cCC
Q psy16295 1 MTQFNYSFCL------VRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA--IES 72 (128)
Q Consensus 1 ~~~F~YAwaL------v~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~--~eP 72 (128)
|-||.||.-+ .+-+..+-..+.|.||+.--..+ .++...-|.+||--- .|-.+.+ -.|
T Consensus 1 myqf~yadvm~~~va~akdRer~~ltRsiall~aa~a~~-~~sre~IeAl~ftrr-------------vW~~fieDl~~p 66 (115)
T COG5442 1 MYQFSYADVMEDGVASAKDRERQLLTRSIALLDAARAPG-DDSREAIEALYFTRR-------------VWTRFIEDLGSP 66 (115)
T ss_pred CcchhHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHhccc-hhHHHHHHHHHHHHH-------------HHHHHHHHhcCc
Confidence 6789998754 34455667789999998866655 333456777877643 3333332 368
Q ss_pred CCHHHHHHHHHH
Q psy16295 73 ENSQVQHLESIV 84 (128)
Q Consensus 73 ~N~QA~~Lk~~I 84 (128)
+|+-..+|+..|
T Consensus 67 dNqLp~ELRAnl 78 (115)
T COG5442 67 DNQLPMELRANL 78 (115)
T ss_pred cccccHHHHHHH
Confidence 998888887654
No 252
>KOG1127|consensus
Probab=52.77 E-value=48 Score=32.84 Aligned_cols=71 Identities=8% Similarity=0.098 Sum_probs=62.8
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
++.+-..+|.-++.-+|.+ | ...+||--|.-+|-+-|.|+.|++..+..-.+.|+.-=++--+..++..+-
T Consensus 574 ea~n~h~aV~~fQsALR~d-P---kD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~y~~fk~A~~ecd~G 644 (1238)
T KOG1127|consen 574 EAHNLHGAVCEFQSALRTD-P---KDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSKYGRFKEAVMECDNG 644 (1238)
T ss_pred CccchhhHHHHHHHHhcCC-c---hhHHHHHHHHHHHHhcCceehHHHhhhhhHhcCcHhHHHHHHHHHHHHHhh
Confidence 4667778999999999999 7 789999999999999999999999999999999999888877777766553
No 253
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=52.76 E-value=28 Score=27.83 Aligned_cols=44 Identities=25% Similarity=0.252 Sum_probs=27.1
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKS 60 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A 60 (128)
.|-.+++.+|...+.-..++..-+.|-+..||-.++++++|+.|
T Consensus 154 rD~~Kt~~ll~~~L~l~~~~~~~n~eil~sLas~~~~~~~~e~A 197 (203)
T PF11207_consen 154 RDPEKTIQLLLRALELSNPDDNFNPEILKSLASIYQKLKNYEQA 197 (203)
T ss_pred cCHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHhcchhhh
Confidence 34446777776655543221134577777777777777777766
No 254
>PF01102 Glycophorin_A: Glycophorin A; InterPro: IPR001195 Proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Glycophorin A (PAS-2) and glycophorin B (PAS-3) belong to the MNS blood group system and are associated with antigens that include M/N, S/s, U, He, Mi(a), M(c), Vw, Mur, M(g), Vr, M(e), Mt(a), St(a), Ri(a), Cl(a), Ny(a), Hut, Hil, M(v), Far, Mit, Dantu, Hop, Nob, En(a), ENKT, amongst others. Glycophorin A is the major sialoglycoprotein of the erythrocyte membrane []. Structurally, glycophorin A consists of an N-terminal extracellular domain, heavily glycosylated on serine and threonine residues, followed by a transmembrane region and a C-terminal cytoplasmic domain. Other glycophorins in this entry such as Glycophorin B and Glycophorin E represent minor sialoglycoproteins in the erythrocyte membrane.; GO: 0016021 integral to membrane; PDB: 2KPF_B 1AFO_B 2KPE_A.
Probab=52.45 E-value=23 Score=26.11 Aligned_cols=24 Identities=4% Similarity=-0.258 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHHHHhhHHHhhhh
Q psy16295 93 LKGMALAGGAVLAQGQGTMEMSCK 116 (128)
Q Consensus 93 liG~~i~gga~~~~g~~~~~~~~~ 116 (128)
+|-+|+++|+++++.++.+++-|.
T Consensus 68 ~Ii~gv~aGvIg~Illi~y~irR~ 91 (122)
T PF01102_consen 68 GIIFGVMAGVIGIILLISYCIRRL 91 (122)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred ehhHHHHHHHHHHHHHHHHHHHHH
Confidence 444666666665555555555443
No 255
>PF05957 DUF883: Bacterial protein of unknown function (DUF883); InterPro: IPR010279 This family consists of several bacterial proteins of unknown function that include the Escherichia coli genes for ElaB, YgaM and YqjD.
Probab=52.35 E-value=52 Score=22.24 Aligned_cols=39 Identities=8% Similarity=0.145 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHhhHHHhhhh
Q psy16295 75 SQVQHLESIVKKRMETEGLKGMALAGGAVLAQGQGTMEMSCK 116 (128)
Q Consensus 75 ~QA~~Lk~~Ie~~i~kdGliG~~i~gga~~~~g~~~~~~~~~ 116 (128)
.|+...-...++-|...=+..+||+.|+.+++ |++++||
T Consensus 56 ~~~~~~~~~~~~~V~e~P~~svgiAagvG~ll---G~Ll~RR 94 (94)
T PF05957_consen 56 EQAREAAEQTEDYVRENPWQSVGIAAGVGFLL---GLLLRRR 94 (94)
T ss_pred HHHHHHHHHHHHHHHHChHHHHHHHHHHHHHH---HHHHhCC
Confidence 45566666677777777777776665544433 5566654
No 256
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=52.00 E-value=84 Score=24.16 Aligned_cols=78 Identities=12% Similarity=-0.009 Sum_probs=55.9
Q ss_pred ccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhhh
Q psy16295 12 RSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMETE 91 (128)
Q Consensus 12 ~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~kd 91 (128)
.+.++.|+..=+.-|+-|- | +..+.--+-+.-|.+.|+|++|.+..+.+-+-.|.-+=+++|....=......
T Consensus 22 ~~~d~~D~e~lLdALrvLr----P---~~~e~d~~dg~l~i~rg~w~eA~rvlr~l~~~~~~~p~~kAL~A~CL~al~Dp 94 (153)
T TIGR02561 22 RSADPYDAQAMLDALRVLR----P---NLKELDMFDGWLLIARGNYDEAARILRELLSSAGAPPYGKALLALCLNAKGDA 94 (153)
T ss_pred hcCCHHHHHHHHHHHHHhC----C---CccccchhHHHHHHHcCCHHHHHHHHHhhhccCCCchHHHHHHHHHHHhcCCh
Confidence 3566666655444444332 4 45666777788888999999999999999998888888888888776666655
Q ss_pred HHHHH
Q psy16295 92 GLKGM 96 (128)
Q Consensus 92 GliG~ 96 (128)
+.-+.
T Consensus 95 ~Wr~~ 99 (153)
T TIGR02561 95 EWHVH 99 (153)
T ss_pred HHHHH
Confidence 55543
No 257
>PF13041 PPR_2: PPR repeat family
Probab=51.95 E-value=17 Score=21.25 Aligned_cols=27 Identities=11% Similarity=0.290 Sum_probs=21.8
Q ss_pred HHHHhhhhchHHHHHHHHHHHhc--CCCC
Q psy16295 48 AIGNARIKEYTKSLKYCRAFLAI--ESEN 74 (128)
Q Consensus 48 Avg~yklgdY~~A~~~~~~lL~~--eP~N 74 (128)
=-++.+.|++++|.+.++.+.+. .|+.
T Consensus 10 i~~~~~~~~~~~a~~l~~~M~~~g~~P~~ 38 (50)
T PF13041_consen 10 ISGYCKAGKFEEALKLFKEMKKRGIKPDS 38 (50)
T ss_pred HHHHHHCcCHHHHHHHHHHHHHcCCCCCH
Confidence 34789999999999999999874 4543
No 258
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=51.81 E-value=37 Score=22.77 Aligned_cols=34 Identities=12% Similarity=0.311 Sum_probs=24.9
Q ss_pred HHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 45 YYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 45 YyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
|..-+-+.+.|||.+|+..+...+...+.+.+-.
T Consensus 2 ~l~~~~~~~~~dy~~A~d~L~~~fD~~~~~~~~~ 35 (94)
T PF12862_consen 2 YLRYLNALRSGDYSEALDALHRYFDYAKQSNNSS 35 (94)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHHhhcccch
Confidence 3444567899999999888888887766655543
No 259
>KOG4340|consensus
Probab=51.00 E-value=56 Score=28.58 Aligned_cols=65 Identities=18% Similarity=0.189 Sum_probs=50.3
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
++...+-+..++|.... | .-.-|-.|-|-..|+-+.|..|++...+++..+---.|...|...|.
T Consensus 58 Q~f~~AA~CYeQL~ql~-P---~~~qYrlY~AQSLY~A~i~ADALrV~~~~~D~~~L~~~~lqLqaAIk 122 (459)
T KOG4340|consen 58 QEFALAAECYEQLGQLH-P---ELEQYRLYQAQSLYKACIYADALRVAFLLLDNPALHSRVLQLQAAIK 122 (459)
T ss_pred HHHHHHHHHHHHHHhhC-h---HHHHHHHHHHHHHHHhcccHHHHHHHHHhcCCHHHHHHHHHHHHHHh
Confidence 34445555666666666 5 45667788899999999999999999999988666778888888874
No 260
>KOG1155|consensus
Probab=50.77 E-value=85 Score=28.64 Aligned_cols=67 Identities=15% Similarity=0.217 Sum_probs=49.4
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh-------cCCCCHHHHHHHHHHHHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA-------IESENSQVQHLESIVKKRM 88 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~-------~eP~N~QA~~Lk~~Ie~~i 88 (128)
...++|.....-+.-+ + ...+.+++||-.|-+++|+++|-.|.+..++ ++|+-..|..-...-+.++
T Consensus 447 ~~~eAiKCykrai~~~-d---te~~~l~~LakLye~l~d~~eAa~~yek~v~~~~~eg~~~~~t~ka~~fLA~~f~k~ 520 (559)
T KOG1155|consen 447 RLEEAIKCYKRAILLG-D---TEGSALVRLAKLYEELKDLNEAAQYYEKYVEVSELEGEIDDETIKARLFLAEYFKKM 520 (559)
T ss_pred cHHHHHHHHHHHHhcc-c---cchHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhcccchHHHHHHHHHHHHHHhh
Confidence 3456787777777666 2 5789999999999999999999999999988 4455555554444444443
No 261
>KOG2053|consensus
Probab=50.36 E-value=96 Score=30.15 Aligned_cols=65 Identities=15% Similarity=0.167 Sum_probs=42.3
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
.++.+++...++.+++.. | ...=..-+=|+.+.|+|.+++|..+.+.+=..-|++.--..-.+.+
T Consensus 22 ~~qfkkal~~~~kllkk~-P---n~~~a~vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~tLq~l~~~ 86 (932)
T KOG2053|consen 22 SSQFKKALAKLGKLLKKH-P---NALYAKVLKALSLFRLGKGDEALKLLEALYGLKGTDDLTLQFLQNV 86 (932)
T ss_pred hHHHHHHHHHHHHHHHHC-C---CcHHHHHHHHHHHHHhcCchhHHHHHhhhccCCCCchHHHHHHHHH
Confidence 446678888888888888 6 3444455667788888888888866666555555544433333333
No 262
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=50.15 E-value=18 Score=18.35 Aligned_cols=23 Identities=9% Similarity=0.237 Sum_probs=18.5
Q ss_pred HHHHhhhhchHHHHHHHHHHHhc
Q psy16295 48 AIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 48 Avg~yklgdY~~A~~~~~~lL~~ 70 (128)
=-++.+.|+.++|.+.++.+.+.
T Consensus 7 i~~~~~~~~~~~a~~~~~~M~~~ 29 (31)
T PF01535_consen 7 ISGYCKMGQFEEALEVFDEMRER 29 (31)
T ss_pred HHHHHccchHHHHHHHHHHHhHC
Confidence 34788899999999998887653
No 263
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=49.93 E-value=12 Score=28.94 Aligned_cols=17 Identities=12% Similarity=0.036 Sum_probs=9.9
Q ss_pred HHHHhhHHHhhhhhhcc
Q psy16295 104 LAQGQGTMEMSCKSYKN 120 (128)
Q Consensus 104 ~~~g~~~~~~~~~~~~~ 120 (128)
++|+++||.-|+=+|.+
T Consensus 124 v~valvGAvsSyiaYqk 140 (169)
T PF12301_consen 124 VVVALVGAVSSYIAYQK 140 (169)
T ss_pred HHHHHHHHHHHHHHHHh
Confidence 45555666666666543
No 264
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=49.76 E-value=42 Score=28.95 Aligned_cols=38 Identities=8% Similarity=0.004 Sum_probs=31.9
Q ss_pred HHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 47 LAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 47 LAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
-|-.|.+.|.|.+|..+++.+|+.+|-|.|.-.+...+
T Consensus 285 va~~yle~g~~neAi~l~qr~ltldpL~e~~nk~lm~~ 322 (361)
T COG3947 285 VARAYLEAGKPNEAIQLHQRALTLDPLSEQDNKGLMAS 322 (361)
T ss_pred HHHHHHHcCChHHHHHHHHHHhhcChhhhHHHHHHHHH
Confidence 45677889999999999999999999999986655443
No 265
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=49.60 E-value=27 Score=30.56 Aligned_cols=49 Identities=14% Similarity=0.293 Sum_probs=36.7
Q ss_pred hHHHHHHHHHHHh-------hccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 18 DIRKGILLLEELF-------KTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 18 d~~~GI~LL~~l~-------~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
|...+++.|+.+= ... | .-..-.+||++++|.=+++|.+|.+....+|-
T Consensus 137 DY~~Alk~l~~idl~~~~l~~~V-~--~~~is~~YyvGFaylMlrRY~DAir~f~~iL~ 192 (404)
T PF10255_consen 137 DYYQALKVLENIDLNKKGLYTKV-P--ACHISTYYYVGFAYLMLRRYADAIRTFSQILL 192 (404)
T ss_pred CHHHHHHHhhccCcccchhhccC-c--chheehHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5566777776642 222 2 35567789999999999999999999887774
No 266
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=48.97 E-value=23 Score=31.11 Aligned_cols=35 Identities=11% Similarity=0.144 Sum_probs=29.8
Q ss_pred HHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 50 GNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 50 g~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
-....|||..|+.-++.+++..|.|+++..|..-+
T Consensus 162 lll~~~d~~aA~~~v~~ll~~~pr~~~vlrLa~r~ 196 (400)
T COG3071 162 LLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRA 196 (400)
T ss_pred HHHhCCCchhHHHHHHHHHHhCcCChHHHHHHHHH
Confidence 34567899999999999999999999998886544
No 267
>PRK10404 hypothetical protein; Provisional
Probab=47.78 E-value=60 Score=22.95 Aligned_cols=28 Identities=11% Similarity=0.264 Sum_probs=17.7
Q ss_pred HHHHhhhHHHHHHHHHHHHHHHHhhHHHhhh
Q psy16295 85 KKRMETEGLKGMALAGGAVLAQGQGTMEMSC 115 (128)
Q Consensus 85 e~~i~kdGliG~~i~gga~~~~g~~~~~~~~ 115 (128)
++-|...=+-++||++|+.+++| ++++|
T Consensus 73 d~yV~e~Pw~avGiaagvGlllG---~Ll~R 100 (101)
T PRK10404 73 DDYVHEKPWQGIGVGAAVGLVLG---LLLAR 100 (101)
T ss_pred HHHHHhCcHHHHHHHHHHHHHHH---HHHhc
Confidence 55566677777887766555555 44555
No 268
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=47.67 E-value=32 Score=26.06 Aligned_cols=30 Identities=17% Similarity=0.009 Sum_probs=24.2
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
-=|.--||-+..++|+|++++.-.+..|.-
T Consensus 55 A~chA~Ls~A~~~Lgry~e~L~sA~~aL~Y 84 (144)
T PF12968_consen 55 AFCHAGLSGALAGLGRYDECLQSADRALRY 84 (144)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHHHH
Confidence 347778889999999999999988888753
No 269
>KOG0550|consensus
Probab=46.68 E-value=55 Score=29.38 Aligned_cols=60 Identities=10% Similarity=0.125 Sum_probs=47.7
Q ss_pred HHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 22 GILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 22 GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
++..=-.+++-+ + .+-|.+|.-....|=..+-+.|.+..++.|+++|+..-+.......+
T Consensus 188 a~~ea~~ilkld-~---~n~~al~vrg~~~yy~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k 247 (486)
T KOG0550|consen 188 AQSEAIDILKLD-A---TNAEALYVRGLCLYYNDNADKAINHFQQALRLDPDHQKSKSASMMPK 247 (486)
T ss_pred HHHHHHHHHhcc-c---chhHHHHhcccccccccchHHHHHHHhhhhccChhhhhHHhHhhhHH
Confidence 333333445556 3 78999999999999999999999999999999999988877665554
No 270
>PF12854 PPR_1: PPR repeat
Probab=46.62 E-value=17 Score=20.21 Aligned_cols=22 Identities=9% Similarity=0.143 Sum_probs=17.8
Q ss_pred HHHHHHhhhhchHHHHHHHHHH
Q psy16295 46 YLAIGNARIKEYTKSLKYCRAF 67 (128)
Q Consensus 46 yLAvg~yklgdY~~A~~~~~~l 67 (128)
-|=-++.|.|+.++|++..+.+
T Consensus 12 ~lI~~~Ck~G~~~~A~~l~~~M 33 (34)
T PF12854_consen 12 TLIDGYCKAGRVDEAFELFDEM 33 (34)
T ss_pred HHHHHHHHCCCHHHHHHHHHhC
Confidence 3455899999999999988753
No 271
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=46.45 E-value=57 Score=28.51 Aligned_cols=62 Identities=18% Similarity=0.190 Sum_probs=52.8
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
...++..+.++|+...+.- | +..=.+++-|--+...||-++|.+..+..+...++.+|...|
T Consensus 245 ~~~~~~~a~~lL~~~~~~y-P---~s~lfl~~~gR~~~~~g~~~~Ai~~~~~a~~~q~~~~Ql~~l 306 (468)
T PF10300_consen 245 EDVPLEEAEELLEEMLKRY-P---NSALFLFFEGRLERLKGNLEEAIESFERAIESQSEWKQLHHL 306 (468)
T ss_pred cCCCHHHHHHHHHHHHHhC-C---CcHHHHHHHHHHHHHhcCHHHHHHHHHHhccchhhHHhHHHH
Confidence 4667788888898887776 7 678889999999999999999999999999888888887654
No 272
>COG4575 ElaB Uncharacterized conserved protein [Function unknown]
Probab=46.10 E-value=1.1e+02 Score=22.08 Aligned_cols=79 Identities=18% Similarity=0.169 Sum_probs=42.5
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh-----hh
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME-----TE 91 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~-----kd 91 (128)
.|++.=++=+|+++++... .--.+|+++|.-.+.+|+-- +.+...++..+-.+.+ .|
T Consensus 15 ~el~~L~d~lEevL~ssg~----------------~a~~e~~~lR~r~~~~Lk~~--r~rl~~~~d~v~~~sk~a~~~tD 76 (104)
T COG4575 15 AELQELLDTLEEVLKSSGS----------------LAGDEAEELRSKAESALKEA--RDRLGDTGDAVVQRSKAAADATD 76 (104)
T ss_pred HHHHHHHHHHHHHHHhccc----------------chhhHHHHHHHHHHHHHHHH--HHHHHhhhhHHHHHHHHHHHHHH
Confidence 4555566667777777521 12346778888777777643 3344444333332221 14
Q ss_pred HHH------HHHHHHHHHHHHHhhHHHhhhh
Q psy16295 92 GLK------GMALAGGAVLAQGQGTMEMSCK 116 (128)
Q Consensus 92 Gli------G~~i~gga~~~~g~~~~~~~~~ 116 (128)
-++ |+||.+|+-+. +|.+|||+
T Consensus 77 ~yV~e~PWq~VGvaAaVGll---lGlLlsRR 104 (104)
T COG4575 77 DYVRENPWQGVGVAAAVGLL---LGLLLSRR 104 (104)
T ss_pred HHHHcCCchHHHHHHHHHHH---HHHHHhcC
Confidence 443 46655543333 46778875
No 273
>COG4218 MtrF Tetrahydromethanopterin S-methyltransferase, subunit F [Coenzyme metabolism]
Probab=45.90 E-value=65 Score=21.87 Aligned_cols=25 Identities=16% Similarity=0.250 Sum_probs=20.5
Q ss_pred CCCCHHHHHHHHHHHHHHhhhHHHH
Q psy16295 71 ESENSQVQHLESIVKKRMETEGLKG 95 (128)
Q Consensus 71 eP~N~QA~~Lk~~Ie~~i~kdGliG 95 (128)
-|+++|....+..+||--.|.++||
T Consensus 13 ~snkpn~~~id~yVediryr~qlig 37 (73)
T COG4218 13 ASNKPNTDRIDTYVEDIRYRSQLIG 37 (73)
T ss_pred ccCCCChhHHHHHHHHHHHHHHhhc
Confidence 4788888999999998888777766
No 274
>PF12732 YtxH: YtxH-like protein; InterPro: IPR024623 This family of uncharacterised proteins is found in bacteria. Proteins in this family are typically between 100 and 143 amino acids in length. The N-terminal region is the most conserved.
Probab=45.72 E-value=28 Score=22.63 Aligned_cols=28 Identities=25% Similarity=0.233 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHhhHHHhhhhhhcchhhhh
Q psy16295 97 ALAGGAVLAQGQGTMEMSCKSYKNTASVL 125 (128)
Q Consensus 97 ~i~gga~~~~g~~~~~~~~~~~~~~~~~~ 125 (128)
|++.|++++++ +|.+++-+|=+.|-.-+
T Consensus 4 g~l~Ga~~Ga~-~glL~aP~sG~e~R~~l 31 (74)
T PF12732_consen 4 GFLAGAAAGAA-AGLLFAPKSGKETREKL 31 (74)
T ss_pred HHHHHHHHHHH-HHHHhCCCCcHHHHHHH
Confidence 44555554444 56666666666554433
No 275
>PF12652 CotJB: CotJB protein; InterPro: IPR024207 The cotJ operon proteins affect spore coat composition, and is controlled by sigma E. The genes, which include CotJB, are either required for the normal formation of the inner layers of the coat or are themselves structural components of the coat []. CotJB has been identified as a spore coat protein [].
Probab=44.95 E-value=67 Score=21.78 Aligned_cols=36 Identities=11% Similarity=0.146 Sum_probs=26.0
Q ss_pred hhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 54 IKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 54 lgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
+..++=|..=+...|...|+|.+|.+.-+.+.++..
T Consensus 8 I~~~~Fa~~dl~LyLDTHP~d~~Al~~y~~~~~~~~ 43 (78)
T PF12652_consen 8 IQEVSFAVVDLNLYLDTHPDDQEALEYYNEYSKQRK 43 (78)
T ss_pred HHHHhhHHHHHHHHhcCCCCcHHHHHHHHHHHHHHH
Confidence 334444555577789999999999988777765543
No 276
>PRK04841 transcriptional regulator MalT; Provisional
Probab=44.93 E-value=1.2e+02 Score=27.73 Aligned_cols=70 Identities=11% Similarity=0.046 Sum_probs=48.1
Q ss_pred HHHHHHhccCChhhHHHHHHHHHHHhhccC--ccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 5 NYSFCLVRSEFCADIRKGILLLEELFKTET--EDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 5 ~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~--~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
+.|.+++.-.+ ..+++.+|++.+.... .........+..+|.++.++|++++|+.++...|+.-..+.=.
T Consensus 696 ~~a~~~~~~g~---~~~A~~~l~~al~~~~~~g~~~~~a~~~~~la~a~~~~G~~~~A~~~L~~Al~la~~~g~~ 767 (903)
T PRK04841 696 NIARAQILLGQ---FDEAEIILEELNENARSLRLMSDLNRNLILLNQLYWQQGRKSEAQRVLLEALKLANRTGFI 767 (903)
T ss_pred HHHHHHHHcCC---HHHHHHHHHHHHHHHHHhCchHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhCccchh
Confidence 45555555544 4567888888766421 1001234578899999999999999999999999877655443
No 277
>PLN03218 maturation of RBCL 1; Provisional
Probab=44.32 E-value=63 Score=31.51 Aligned_cols=27 Identities=4% Similarity=-0.042 Sum_probs=12.6
Q ss_pred HHHHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
.+-.+--++.+.|++++|++.++.+.+
T Consensus 616 tynsLI~ay~k~G~~deAl~lf~eM~~ 642 (1060)
T PLN03218 616 VYTIAVNSCSQKGDWDFALSIYDDMKK 642 (1060)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHH
Confidence 333344444445555555555444444
No 278
>PF07332 DUF1469: Protein of unknown function (DUF1469); InterPro: IPR009937 This entry represents proteins found in hypothetical bacterial proteins where is is annotated as ycf49 or ycf49-like. The function is not known.
Probab=44.11 E-value=44 Score=23.27 Aligned_cols=26 Identities=8% Similarity=0.088 Sum_probs=16.8
Q ss_pred HHHHHHHHHHHHHHhhHHHhhhhhhc
Q psy16295 94 KGMALAGGAVLAQGQGTMEMSCKSYK 119 (128)
Q Consensus 94 iG~~i~gga~~~~g~~~~~~~~~~~~ 119 (128)
.+..+++|..++++++.++..++..+
T Consensus 73 ~a~liv~~~~l~la~i~~~~~~~~l~ 98 (121)
T PF07332_consen 73 LAFLIVAGLYLLLALILLLIGRRRLR 98 (121)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 56666776666666666666666554
No 279
>PF12768 Rax2: Cortical protein marker for cell polarity
Probab=43.94 E-value=15 Score=30.41 Aligned_cols=27 Identities=19% Similarity=0.118 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHHHhhHHHhhhhhhc
Q psy16295 93 LKGMALAGGAVLAQGQGTMEMSCKSYK 119 (128)
Q Consensus 93 liG~~i~gga~~~~g~~~~~~~~~~~~ 119 (128)
+||+||+-|.++.++++|..+.+...|
T Consensus 231 lIslAiALG~v~ll~l~Gii~~~~~r~ 257 (281)
T PF12768_consen 231 LISLAIALGTVFLLVLIGIILAYIRRR 257 (281)
T ss_pred EEehHHHHHHHHHHHHHHHHHHHHHhh
Confidence 578899889888888888777776655
No 280
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=43.67 E-value=1.5e+02 Score=23.27 Aligned_cols=73 Identities=19% Similarity=0.177 Sum_probs=48.6
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhc---cC---------------------------ccccchHHHHHHHHHHHh
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKT---ET---------------------------EDIDRKRDYLYYLAIGNA 52 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~---~~---------------------------~~~~~~rd~LYyLAvg~y 52 (128)
.+.+|--|....++ .++|..|++++.. .. .......++++.+|-=..
T Consensus 187 ~~e~akllw~~g~~---~~Ai~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~a~w~~ 263 (352)
T PF02259_consen 187 FLEYAKLLWAQGEQ---EEAIQKLRELLKCRLSKNIDSISNAELKSGLLESLEVISSTNLDKESKELKAKAFLLLAKWLD 263 (352)
T ss_pred HHHHHHHHHHcCCH---HHHHHHHHHHHHHHhhhccccccHHHHhhccccccccccccchhhhhHHHHHHHHHHHHHHHH
Confidence 34455555554433 5788888888772 10 001234566666666666
Q ss_pred hh------hchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 53 RI------KEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 53 kl------gdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
.+ ++++++.++.....+..|++.++-
T Consensus 264 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~k~~ 295 (352)
T PF02259_consen 264 ELYSKLSSESSDEILKYYKEATKLDPSWEKAW 295 (352)
T ss_pred hhccccccccHHHHHHHHHHHHHhChhHHHHH
Confidence 66 888889999999999999998843
No 281
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=43.28 E-value=93 Score=20.33 Aligned_cols=23 Identities=13% Similarity=0.326 Sum_probs=12.2
Q ss_pred HHhhhhchHHHHHHHHHHHhcCC
Q psy16295 50 GNARIKEYTKSLKYCRAFLAIES 72 (128)
Q Consensus 50 g~yklgdY~~A~~~~~~lL~~eP 72 (128)
.++..+++++|..+++..+...|
T Consensus 139 ~~~~~~~~~~a~~~~~~~~~~~~ 161 (291)
T COG0457 139 ALYELGDYEEALELYEKALELDP 161 (291)
T ss_pred HHHHcCCHHHHHHHHHHHHhcCC
Confidence 45555555555555555555444
No 282
>PF11349 DUF3151: Protein of unknown function (DUF3151); InterPro: IPR014487 This group represents an uncharacterised conserved protein.
Probab=43.12 E-value=41 Score=25.22 Aligned_cols=37 Identities=16% Similarity=0.004 Sum_probs=33.3
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
--.|++-||.+.-.+|+-+|+.++.+.|-..+|.-.+
T Consensus 91 fLRal~aLa~AA~~IGE~dE~~Rc~~~L~Dsdp~A~~ 127 (129)
T PF11349_consen 91 FLRALAALARAAQAIGETDEYDRCRQFLRDSDPEAAD 127 (129)
T ss_pred HHHHHHHHHHHHHHhCChhHHHHHHHHHHhCCHHhHh
Confidence 3589999999999999999999999999999997554
No 283
>KOG0292|consensus
Probab=43.10 E-value=38 Score=33.17 Aligned_cols=37 Identities=11% Similarity=0.205 Sum_probs=32.4
Q ss_pred HHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 49 IGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 49 vg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
-.++|+++|..|-.++..+|+.-|..++|..-++.+.
T Consensus 1092 n~ffK~kN~ktAs~fa~rLlel~~~~~~A~q~rki~~ 1128 (1202)
T KOG0292|consen 1092 NVFFKLKNLKTAAEFARRLLELAPSPPVAEQARKIKQ 1128 (1202)
T ss_pred HHHHHhccHHHHHHHHHHHHhhCCCChHHHHHHHHHH
Confidence 3589999999999999999999999998887776654
No 284
>COG4062 MtrB Tetrahydromethanopterin S-methyltransferase, subunit B [Coenzyme metabolism]
Probab=42.77 E-value=36 Score=24.69 Aligned_cols=23 Identities=26% Similarity=0.525 Sum_probs=20.4
Q ss_pred HHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 67 FLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 67 lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
+|.++|=+.|...|...+++-+.
T Consensus 30 ~ldv~pi~Eqi~kLe~~vddl~~ 52 (108)
T COG4062 30 FLDVDPIEEQIKKLETLVDDLEN 52 (108)
T ss_pred EEeccHHHHHHHHHHHHHHHHHh
Confidence 46789999999999999998875
No 285
>KOG2003|consensus
Probab=42.68 E-value=90 Score=28.86 Aligned_cols=58 Identities=16% Similarity=0.263 Sum_probs=48.6
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHH
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQ 78 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~ 78 (128)
.|..++.+..++-++++. .-.+.||-+++.+-++|+.++|+.+.-.|-.+.-+|-|+.
T Consensus 504 gd~dka~~~ykeal~nda----sc~ealfniglt~e~~~~ldeald~f~klh~il~nn~evl 561 (840)
T KOG2003|consen 504 GDLDKAAEFYKEALNNDA----SCTEALFNIGLTAEALGNLDEALDCFLKLHAILLNNAEVL 561 (840)
T ss_pred CcHHHHHHHHHHHHcCch----HHHHHHHHhcccHHHhcCHHHHHHHHHHHHHHHHhhHHHH
Confidence 578899999999999883 5789999999999999999999999777766666666654
No 286
>PLN03218 maturation of RBCL 1; Provisional
Probab=42.32 E-value=2.1e+02 Score=28.02 Aligned_cols=55 Identities=11% Similarity=0.103 Sum_probs=39.5
Q ss_pred hhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh--cCCCC
Q psy16295 17 ADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA--IESEN 74 (128)
Q Consensus 17 ~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~--~eP~N 74 (128)
.+..+++++++++...+- ......+..|--++.+.|++++|.++++.+.+ +.|+.
T Consensus 698 G~~eeA~~lf~eM~~~g~---~PdvvtyN~LI~gy~k~G~~eeAlelf~eM~~~Gi~Pd~ 754 (1060)
T PLN03218 698 KNWKKALELYEDIKSIKL---RPTVSTMNALITALCEGNQLPKALEVLSEMKRLGLCPNT 754 (1060)
T ss_pred CCHHHHHHHHHHHHHcCC---CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCCH
Confidence 456678888888766542 23456677788888888888888888887765 45653
No 287
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=41.64 E-value=68 Score=21.29 Aligned_cols=60 Identities=10% Similarity=0.113 Sum_probs=34.7
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
..+||+++..-+..+. . .-|-=|+ .-|.+|+.++-..++-||+..+-..++..|.+=+.+
T Consensus 3 l~~A~~l~~~Ave~d~---~----~~y~eA~-----~~Y~~~i~~~~~~~k~e~~~~~k~~ir~K~~eYl~R 62 (75)
T cd02677 3 LEQAAELIRLALEKEE---E----GDYEAAF-----EFYRAGVDLLLKGVQGDSSPERREAVKRKIAEYLKR 62 (75)
T ss_pred HHHHHHHHHHHHHHHH---H----hhHHHHH-----HHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHH
Confidence 4567777777665551 1 0011111 224455555667777898877777777777665554
No 288
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=41.61 E-value=14 Score=27.72 Aligned_cols=24 Identities=21% Similarity=0.526 Sum_probs=20.6
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRA 66 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~ 66 (128)
..||-|+. ++++|+|++|+++|+.
T Consensus 105 E~r~vWk~----yl~~~~fd~Al~~~~~ 128 (147)
T PF05131_consen 105 EDRDVWKI----YLDKGDFDEALQYCKT 128 (147)
T ss_pred chHHHHHH----HHhcCcHHHHHHHccC
Confidence 56888876 5899999999999975
No 289
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=41.09 E-value=61 Score=27.49 Aligned_cols=37 Identities=14% Similarity=0.067 Sum_probs=32.5
Q ss_pred HHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHH
Q psy16295 28 ELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFL 68 (128)
Q Consensus 28 ~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL 68 (128)
.+++.. | -..|.|-.+|.-+..+||++.|-..+++.|
T Consensus 31 ~ll~~~-P---yHidtLlqls~v~~~~gd~~~A~~lleRAL 67 (360)
T PF04910_consen 31 NLLQKN-P---YHIDTLLQLSEVYRQQGDHAQANDLLERAL 67 (360)
T ss_pred HHHHHC-C---CcHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 455566 6 789999999999999999999999998876
No 290
>KOG4234|consensus
Probab=40.74 E-value=95 Score=25.70 Aligned_cols=44 Identities=5% Similarity=0.123 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
.=|+---|.+..|++..+.|..-|-..+++.|.+..|..=+...
T Consensus 134 sIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAea 177 (271)
T KOG4234|consen 134 SILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEKALERRAEA 177 (271)
T ss_pred HHHHhhhHHHHHHhhhHHHHHHHHHhhHhcCchhHHHHHHHHHH
Confidence 33444568999999999999999999999999988776543333
No 291
>PLN03077 Protein ECB2; Provisional
Probab=40.71 E-value=1.1e+02 Score=28.25 Aligned_cols=26 Identities=12% Similarity=0.155 Sum_probs=12.5
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHH
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAF 67 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~l 67 (128)
.+...|+-.|.+.|++++|.+..+.+
T Consensus 692 ~~y~ll~n~ya~~g~~~~a~~vr~~M 717 (857)
T PLN03077 692 GYYILLCNLYADAGKWDEVARVRKTM 717 (857)
T ss_pred chHHHHHHHHHHCCChHHHHHHHHHH
Confidence 33344444455555555555554444
No 292
>KOG0545|consensus
Probab=38.87 E-value=2.5e+02 Score=23.96 Aligned_cols=67 Identities=13% Similarity=0.210 Sum_probs=58.4
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
.||+-|+.. ..|.-+.++--.++++.+ | .+.-.+|=-|-+|.--=+-.+|+.=...+|+.+|.-.-+
T Consensus 234 lNy~QC~L~---~~e~yevleh~seiL~~~-~---~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpslasv 300 (329)
T KOG0545|consen 234 LNYCQCLLK---KEEYYEVLEHCSEILRHH-P---GNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSLASV 300 (329)
T ss_pred HhHHHHHhh---HHHHHHHHHHHHHHHhcC-C---chHHHHHHHHHHHHhhcCHHHHHHHHHHHHhcChhhHHH
Confidence 578888775 567778899999999998 6 788999999999999999999999999999999986654
No 293
>PF13908 Shisa: Wnt and FGF inhibitory regulator
Probab=38.68 E-value=13 Score=28.10 Aligned_cols=24 Identities=8% Similarity=-0.072 Sum_probs=15.6
Q ss_pred HHHHHHHHHHHHHHhhHHHhhhhh
Q psy16295 94 KGMALAGGAVLAQGQGTMEMSCKS 117 (128)
Q Consensus 94 iG~~i~gga~~~~g~~~~~~~~~~ 117 (128)
|.++|+.|+++++++++..+.+|=
T Consensus 80 iivgvi~~Vi~Iv~~Iv~~~Cc~c 103 (179)
T PF13908_consen 80 IIVGVICGVIAIVVLIVCFCCCCC 103 (179)
T ss_pred eeeehhhHHHHHHHhHhhheeccc
Confidence 446667777777777777775443
No 294
>KOG3026|consensus
Probab=38.26 E-value=71 Score=26.53 Aligned_cols=36 Identities=25% Similarity=0.337 Sum_probs=29.4
Q ss_pred hhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 54 IKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 54 lgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
+..|..=+.-++++|+-+|+|..-+.|++-.++-|+
T Consensus 5 L~sYK~QLqqVeaaL~~dP~NeEllkLe~DLkEvIs 40 (262)
T KOG3026|consen 5 LASYKLQLQQVEAALQGDPENEELLKLEKDLKEVIS 40 (262)
T ss_pred HHHHHHHHHHHHHHHccCCccHHHHHHHHHHHHHHH
Confidence 456777788899999999999999888877766553
No 295
>PF09205 DUF1955: Domain of unknown function (DUF1955); InterPro: IPR015288 Members of this family are found in hypothetical proteins synthesised by the Archaeal organism Sulfolobus. Their exact function has not, as yet, been determined. ; PDB: 1WY6_A.
Probab=37.99 E-value=1.9e+02 Score=22.38 Aligned_cols=37 Identities=16% Similarity=0.230 Sum_probs=24.3
Q ss_pred HHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHH
Q psy16295 24 LLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYC 64 (128)
Q Consensus 24 ~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~ 64 (128)
+++.++++++ +...+.|+=+|.+|-|+|+-.+|-...
T Consensus 107 ki~~~l~kn~----~~~p~~L~kia~Ay~klg~~r~~~ell 143 (161)
T PF09205_consen 107 KIYNELKKNE----EINPEFLVKIANAYKKLGNTREANELL 143 (161)
T ss_dssp HHHHHH---------S-HHHHHHHHHHHHHTT-HHHHHHHH
T ss_pred HHHHHHhhcc----CCCHHHHHHHHHHHHHhcchhhHHHHH
Confidence 3456677655 258999999999999999998886543
No 296
>TIGR00823 EIIA-LAC phosphotransferase system enzyme II, lactose-specific, factor III. operon. While the Lac permeases consist of two polypeptide chains (IIA and IICB), the Chb permease of E. coli consists of three (IIA, IIB and IIC). In B. subtilis, a PTS permease similar to the Chb permease of E. coli is believed to transport lichenan (a b-1,3;1,4 glucan) degradation products, oligosaccharides of 2-4 glucose units. This model is specific for the IIA subunit of the Lac PTS family.
Probab=37.89 E-value=43 Score=23.56 Aligned_cols=28 Identities=11% Similarity=0.069 Sum_probs=23.1
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAF 67 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~l 67 (128)
.-...|+=|+-..|.|||++|++.++..
T Consensus 16 ~Ars~~~eAl~~a~~gdfe~A~~~l~eA 43 (99)
T TIGR00823 16 DARSKALEALKAAKAGDFAKARALVEQA 43 (99)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3456778899999999999999987654
No 297
>cd00215 PTS_IIA_lac PTS_IIA, PTS system, lactose/cellobiose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion. Separation into subunits is thought to occur after phosphorylation.
Probab=37.48 E-value=52 Score=23.07 Aligned_cols=27 Identities=7% Similarity=0.014 Sum_probs=22.6
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHH
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAF 67 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~l 67 (128)
-...|+=|+-..|.|||++|+..++..
T Consensus 15 Ars~~~eAl~~a~~g~fe~A~~~l~ea 41 (97)
T cd00215 15 ARSKALEALKAAKEGDFAEAEELLEEA 41 (97)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 456677899999999999999987654
No 298
>COG2015 Alkyl sulfatase and related hydrolases [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=37.01 E-value=62 Score=29.81 Aligned_cols=47 Identities=13% Similarity=0.095 Sum_probs=39.5
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKR 87 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~ 87 (128)
-|..--+|--.+..|||.=+-..++++.--||+|+||+.|..-.-++
T Consensus 452 adrVl~la~ea~~kGdyrW~a~lln~~VfAdp~n~~Ar~L~Ad~lEQ 498 (655)
T COG2015 452 ADRVLELAREAFDKGDYRWAAELLNQAVFADPGNKAARELQADALEQ 498 (655)
T ss_pred HHHHHHHHHHHHhcccchHHHHHHhhHHhcCCccHHHHHHHHhHHHH
Confidence 45566678888999999999999999999999999999987654443
No 299
>PF03672 UPF0154: Uncharacterised protein family (UPF0154); InterPro: IPR005359 The proteins in this entry are functionally uncharacterised.
Probab=36.72 E-value=37 Score=22.49 Aligned_cols=13 Identities=8% Similarity=-0.123 Sum_probs=11.1
Q ss_pred hhHHHhhhhhhcc
Q psy16295 108 QGTMEMSCKSYKN 120 (128)
Q Consensus 108 ~~~~~~~~~~~~~ 120 (128)
++|++++||.|++
T Consensus 12 ~~Gff~ar~~~~k 24 (64)
T PF03672_consen 12 VIGFFIARKYMEK 24 (64)
T ss_pred HHHHHHHHHHHHH
Confidence 5799999999876
No 300
>PF02255 PTS_IIA: PTS system, Lactose/Cellobiose specific IIA subunit; InterPro: IPR003188 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) [, ] is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC []. These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose [, , , ]. The lactose/cellobiose-specific family are one of four structurally and functionally distinct group IIA PTS system enzymes. This family of proteins normally function as a homotrimer, stabilised by a centrally located metal ion []. Separation into subunits is thought to occur after phosphorylation.; GO: 0005351 sugar:hydrogen symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane; PDB: 3L8R_D 2E2A_B 1E2A_C 3K1S_C 2LRK_C 2LRL_A 2WY2_A 1WCR_A 2WWV_C.
Probab=36.54 E-value=56 Score=22.75 Aligned_cols=29 Identities=14% Similarity=0.082 Sum_probs=22.8
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFL 68 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL 68 (128)
.-..+|+-|+-..|.|||++|...++..=
T Consensus 13 ~Ars~~~eAl~~a~~~~fe~A~~~l~~a~ 41 (96)
T PF02255_consen 13 DARSLAMEALKAAREGDFEEAEELLKEAD 41 (96)
T ss_dssp HHHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 34567888999999999999999876543
No 301
>PRK01209 cobD cobalamin biosynthesis protein; Provisional
Probab=36.54 E-value=1.4e+02 Score=24.69 Aligned_cols=45 Identities=4% Similarity=0.124 Sum_probs=32.4
Q ss_pred hhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh---hHHHHH
Q psy16295 52 ARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET---EGLKGM 96 (128)
Q Consensus 52 yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k---dGliG~ 96 (128)
.+.||-+.||+....+..-|+++-....+.+..-+.+.. ||+++.
T Consensus 109 l~~gd~~~AR~~l~~~v~Rdt~~Ls~~~i~~a~iEs~~en~~d~~~ap 156 (312)
T PRK01209 109 LRAGDLEEARRAVSMIVGRDTSQLDEEGIARAAIESVAENTSDGVIAP 156 (312)
T ss_pred HHcCCHHHHHHHHHHHhCCCcccCCHHHHHHHHHHHHHHhcccchHHH
Confidence 346899999999999999888886655555554444433 888774
No 302
>PF02479 Herpes_IE68: Herpesvirus immediate early protein; InterPro: IPR003403 This regulatory protein is expressed from an immediate early gene in the cell cycle of Herpesviridae. The protein is known by various names including IE-68, US1, ICP22 and IR4.
Probab=36.27 E-value=40 Score=25.34 Aligned_cols=44 Identities=16% Similarity=0.277 Sum_probs=31.0
Q ss_pred HHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhhh
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMETE 91 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~kd 91 (128)
++=+..+|+++.. --...|+.+||+-|+ |+.-|+..|.+...+.
T Consensus 64 l~D~YlmGy~~~R---l~~~~We~lLQlsp~--qs~~LR~tLr~~~~r~ 107 (132)
T PF02479_consen 64 LLDFYLMGYTRQR---LTPACWERLLQLSPE--QSRRLRRTLREADSRS 107 (132)
T ss_pred HHHHHHHhhccCC---CCHHHHHHHHhhCcc--cchHHHHHHHHHHccC
Confidence 3333344544432 235789999999997 5999999999877664
No 303
>KOG0546|consensus
Probab=36.11 E-value=95 Score=27.11 Aligned_cols=47 Identities=13% Similarity=0.130 Sum_probs=38.6
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
....-+|=.+.++.++.+|++|++-.+...+.+|++.+-..=...+.
T Consensus 307 s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~~i~~~~~~~~ 353 (372)
T KOG0546|consen 307 SKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDKAIEEELENVR 353 (372)
T ss_pred hhCcHHHHHHhHHHhhhchhhhHHHHHHhhccCcchHHHHHHHHHhh
Confidence 34566888999999999999999999999999999987655444433
No 304
>PF06072 Herpes_US9: Alphaherpesvirus tegument protein US9; InterPro: IPR009278 This family consists of several US9 and related proteins from the Alphaherpesviruses. The function of the US9 protein is unknown although in Bovine herpesvirus 5 Us9 is essential for the anterograde spread of the virus from the olfactory mucosa to the bulb [].; GO: 0019033 viral tegument
Probab=35.63 E-value=1.3e+02 Score=19.75 Aligned_cols=14 Identities=29% Similarity=0.311 Sum_probs=6.7
Q ss_pred CCCCHHHHHHHHHH
Q psy16295 71 ESENSQVQHLESIV 84 (128)
Q Consensus 71 eP~N~QA~~Lk~~I 84 (128)
|.||..|-+...-|
T Consensus 5 ESDnETA~~FL~Rv 18 (60)
T PF06072_consen 5 ESDNETATEFLRRV 18 (60)
T ss_pred ccccccHHHHHHHH
Confidence 44555555444333
No 305
>COG4061 MtrC Tetrahydromethanopterin S-methyltransferase, subunit C [Coenzyme metabolism]
Probab=35.13 E-value=64 Score=26.57 Aligned_cols=48 Identities=21% Similarity=0.142 Sum_probs=30.6
Q ss_pred CCCCHHHHHHHHHHHHHHhhhHHHHHHHHHHHH-HHHHhhHHHhhhhhh
Q psy16295 71 ESENSQVQHLESIVKKRMETEGLKGMALAGGAV-LAQGQGTMEMSCKSY 118 (128)
Q Consensus 71 eP~N~QA~~Lk~~Ie~~i~kdGliG~~i~gga~-~~~g~~~~~~~~~~~ 118 (128)
-|+..|-++|.-.++.---.--+-|++-..=++ +.+|++.....|++|
T Consensus 193 GPnE~q~RTL~la~e~g~i~~~vagl~~~~~~v~l~vgai~W~iaf~kf 241 (262)
T COG4061 193 GPNESQKRTLTLAVECGFITWFVAGLAKLNDIVSLIVGAILWAIAFVKF 241 (262)
T ss_pred CCCccchhhhhhhHhhhHHHHHHHHHhhcchhHHHHHHHHHHHHHHHHH
Confidence 688999999999988654433233322222223 557878777777755
No 306
>smart00567 EZ_HEAT E-Z type HEAT repeats. Present in subunits of cyanobacterial phycocyanin lyase, and other proteins. Probable scaffolding role.
Probab=34.55 E-value=70 Score=16.75 Aligned_cols=27 Identities=11% Similarity=0.065 Sum_probs=19.7
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhc
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKT 32 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~ 32 (128)
.+..-|++|-+..+++ ++..|.+.+.+
T Consensus 3 vR~~aa~aLg~~~~~~----a~~~L~~~l~d 29 (30)
T smart00567 3 VRHEAAFALGQLGDEE----AVPALIKALED 29 (30)
T ss_pred HHHHHHHHHHHcCCHh----HHHHHHHHhcC
Confidence 5677788998887766 77777776654
No 307
>PRK09591 celC cellobiose phosphotransferase system IIA component; Reviewed
Probab=34.00 E-value=58 Score=23.13 Aligned_cols=28 Identities=7% Similarity=-0.113 Sum_probs=23.3
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAF 67 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~l 67 (128)
.-...|+=|+-..|.|||++|+..++..
T Consensus 19 ~Ars~~~eAl~~ak~gdf~~A~~~l~eA 46 (104)
T PRK09591 19 NARTEVHEAFAAMREGNFDLAEQKLNQS 46 (104)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3556778899999999999999987654
No 308
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=33.92 E-value=75 Score=16.56 Aligned_cols=31 Identities=13% Similarity=0.134 Sum_probs=22.0
Q ss_pred hHHHHHHHHhcc-CChhhHHHHHHHHHHHhhc
Q psy16295 2 TQFNYSFCLVRS-EFCADIRKGILLLEELFKT 32 (128)
Q Consensus 2 ~~F~YAwaLv~S-~~~~d~~~GI~LL~~l~~~ 32 (128)
++|++|..+... ..+.|..+++..++.....
T Consensus 3 a~~~lg~~~~~G~g~~~d~~~A~~~~~~Aa~~ 34 (36)
T smart00671 3 AQYNLGQMYEYGLGVKKDLEKALEYYKKAAEL 34 (36)
T ss_pred HHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHc
Confidence 578888887765 3567778888888775544
No 309
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=33.70 E-value=1.8e+02 Score=25.75 Aligned_cols=48 Identities=8% Similarity=0.191 Sum_probs=37.4
Q ss_pred HHHHhhhhchHHHHHHHHHHHhcC-----CCCHHHHHHHHHHHHHHhhhHHHHHH
Q psy16295 48 AIGNARIKEYTKSLKYCRAFLAIE-----SENSQVQHLESIVKKRMETEGLKGMA 97 (128)
Q Consensus 48 Avg~yklgdY~~A~~~~~~lL~~e-----P~N~QA~~Lk~~Ie~~i~kdGliG~~ 97 (128)
|.-+++.|++++|+..+..+|..- -+.++...++++|+ +.++-++||-
T Consensus 211 gyk~~t~gKF~eA~~~Fr~iL~~i~l~vv~~~~E~~e~~eli~--icrEYilgl~ 263 (422)
T PF06957_consen 211 GYKLFTAGKFEEAIEIFRSILHSIPLLVVESREEEDEAKELIE--ICREYILGLS 263 (422)
T ss_dssp HHHHHHTT-HHHHHHHHHHHHHHHHC--BSSCHHHHHHHHHHH--HHHHHHHHHH
T ss_pred HHHHHhcCCHHHHHHHHHHHHHHhheeeecCHHHHHHHHHHHH--HHHHHHHHHH
Confidence 456789999999999999988863 35578999999986 6677777763
No 310
>KOG2376|consensus
Probab=33.66 E-value=1.8e+02 Score=27.24 Aligned_cols=62 Identities=16% Similarity=0.255 Sum_probs=41.5
Q ss_pred HHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 4 FNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 4 F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
|+=|+|.-|-+..+ +++..++. + + + ..+-.+-.-|--.||+++|++|...++.|.+..-++.
T Consensus 83 fEKAYc~Yrlnk~D---ealk~~~~-~--~-~---~~~~ll~L~AQvlYrl~~ydealdiY~~L~kn~~dd~ 144 (652)
T KOG2376|consen 83 FEKAYCEYRLNKLD---EALKTLKG-L--D-R---LDDKLLELRAQVLYRLERYDEALDIYQHLAKNNSDDQ 144 (652)
T ss_pred HHHHHHHHHcccHH---HHHHHHhc-c--c-c---cchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCchH
Confidence 45566666655444 45555541 1 1 1 2344777789999999999999999999988875543
No 311
>PF06812 ImpA-rel_N: ImpA-related N-terminal; InterPro: IPR010657 This entry represents a conserved region located towards the N-terminal end of ImpA and related proteins. ImpA is an inner membrane protein, which has been suggested to be involved with proteins that are exported and associated with colony variations in Actinobacillus actinomycetemcomitans []. Note that many members are hypothetical proteins.
Probab=33.28 E-value=68 Score=20.13 Aligned_cols=27 Identities=30% Similarity=0.424 Sum_probs=17.6
Q ss_pred HHHHHhccCChhhHHHHHHHHHHHhhc
Q psy16295 6 YSFCLVRSEFCADIRKGILLLEELFKT 32 (128)
Q Consensus 6 YAwaLv~S~~~~d~~~GI~LL~~l~~~ 32 (128)
|++|+++.....-...|+.+|..++..
T Consensus 26 l~~a~~~~~g~~gl~~gl~ll~~ll~~ 52 (62)
T PF06812_consen 26 LTEALLRLEGLAGLAEGLELLADLLEN 52 (62)
T ss_pred HHHHHHHccChhHHHHHHHHHHHHHHH
Confidence 456666666666666666666666655
No 312
>KOG0495|consensus
Probab=33.22 E-value=2.5e+02 Score=27.03 Aligned_cols=39 Identities=10% Similarity=0.143 Sum_probs=30.2
Q ss_pred hhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 52 ARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 52 yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
--+++.++|++.|+..|++-|+.+.-=-.+--|++.+..
T Consensus 662 r~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ 700 (913)
T KOG0495|consen 662 RYLDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMEN 700 (913)
T ss_pred HHhhhHHHHHHHHHHHHHhCCchHHHHHHHhHHHHHHHH
Confidence 347788888888999999999888877777777766543
No 313
>PRK07630 CobD/CbiB family protein; Provisional
Probab=33.07 E-value=1.6e+02 Score=24.38 Aligned_cols=44 Identities=7% Similarity=0.226 Sum_probs=30.9
Q ss_pred hhhchHHHHHHHHHHHhcCCCCHHHHHHHH-HHHHHHhh--hHHHHH
Q psy16295 53 RIKEYTKSLKYCRAFLAIESENSQVQHLES-IVKKRMET--EGLKGM 96 (128)
Q Consensus 53 klgdY~~A~~~~~~lL~~eP~N~QA~~Lk~-~Ie~~i~k--dGliG~ 96 (128)
+.||.+.||++...+...|+++--...+.+ .||.-... ||+++.
T Consensus 106 ~~~dl~~AR~~l~~~v~rdt~~l~~~~i~~a~iEs~~en~~d~~~ap 152 (312)
T PRK07630 106 RNDDLPRARELLGEWTGLDTVDMPVSEIVRHTIEHALVASHRHVFGV 152 (312)
T ss_pred HcCCHHHHHHHHHHHhCCCcccCChhHHHHHHHHHHHHHhcccchHH
Confidence 568999999999999888887654444444 45555443 887765
No 314
>KOG2796|consensus
Probab=33.01 E-value=35 Score=29.28 Aligned_cols=39 Identities=8% Similarity=0.210 Sum_probs=33.8
Q ss_pred HHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 47 LAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 47 LAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
.|+-|.-.+||.+|-++.+.+++.||.|.|+..=|.++.
T Consensus 258 ~a~i~lg~nn~a~a~r~~~~i~~~D~~~~~a~NnKALcl 296 (366)
T KOG2796|consen 258 SAFLHLGQNNFAEAHRFFTEILRMDPRNAVANNNKALCL 296 (366)
T ss_pred hhhheecccchHHHHHHHhhccccCCCchhhhchHHHHH
Confidence 466677788999999999999999999999988777765
No 315
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=31.97 E-value=1.5e+02 Score=23.32 Aligned_cols=41 Identities=24% Similarity=0.424 Sum_probs=30.6
Q ss_pred HHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 26 LEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 26 L~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
|++.+... + ....++.+|-|+...+.++|+++.++++..-+
T Consensus 17 l~~~~~~~-~--~~~~~~~~~~al~~l~~~~~~~~~~~i~~~r~ 57 (352)
T PF02259_consen 17 LEEYLSQS-N--EDSPEYSFYRALLALRQGDYDEAKKYIEKARQ 57 (352)
T ss_pred HHHHHhhc-c--CCChhHHHHHHHHHHhCccHHHHHHHHHHHHH
Confidence 44555554 2 23459999999999999999999999776544
No 316
>PF03186 CobD_Cbib: CobD/Cbib protein; InterPro: IPR004485 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase []. In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase []. There are at least two distinct cobalamin biosynthetic pathways in bacteria []: Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway []; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis []; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii subsp. shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) []. There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents the CbiB protein, which is involved in cobalamin biosynthesis and porphyrin biosynthesis. It converts cobyric acid to cobinamide by the addition of aminopropanol on the F carboxylic group. It is part of the cob operon [].; GO: 0009236 cobalamin biosynthetic process, 0016021 integral to membrane
Probab=31.92 E-value=1.8e+02 Score=23.87 Aligned_cols=46 Identities=4% Similarity=0.122 Sum_probs=34.2
Q ss_pred HhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH-h--hhHHHHH
Q psy16295 51 NARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRM-E--TEGLKGM 96 (128)
Q Consensus 51 ~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i-~--kdGliG~ 96 (128)
..+.||-++||++...+..-|.++.....+-+..-+-+ . .||+++.
T Consensus 105 aL~~gdl~~AR~~l~~lv~Rdt~~L~~~~i~ra~iEs~aen~~d~v~ap 153 (295)
T PF03186_consen 105 ALEAGDLEAARQALSPLVGRDTDQLSEEGIARAAIESLAENLSDGVFAP 153 (295)
T ss_pred HHhcCCHHHHHHHHHHhhCCCcccCCHHHHHHHHHHHHHHHhcchhHHH
Confidence 34678999999999999999988877766655444444 3 3888874
No 317
>PF03392 OS-D: Insect pheromone-binding family, A10/OS-D; InterPro: IPR005055 A class of small (14-20 Kd) water-soluble proteins, called odorant binding proteins (OBPs), first discovered in the insect sensillar lymph but also in the mucus of vertebrates, is postulated to mediate the solubilisation of hydrophobic odorant molecules, and thereby to facilitate their transport to the receptor neurons. The product of a gene expressed in the olfactory system of Drosophila melanogaster (Fruit fly), OS-D, shares features common to vertebrate odorant-binding proteins, but has a primary structure unlike odorant-binding proteins []. OS-D derivatives have subsequently been found in chemosensory organs of phylogenetically distinct insects, including cockroaches, phasmids and moths, suggesting that OS-D-like proteins seem to be conserved in the insect phylum.; PDB: 1KX9_A 1N8U_A 1KX8_A 1K19_A 1N8V_A 2GVS_A 2JNT_A.
Probab=31.60 E-value=94 Score=21.62 Aligned_cols=32 Identities=13% Similarity=0.211 Sum_probs=27.9
Q ss_pred HHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 59 KSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 59 ~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
.-.+|++.+|..-|=++.++.||..|-+.++.
T Consensus 19 l~~~y~~ClldkgpCt~~~~~lK~~ipeal~t 50 (95)
T PF03392_consen 19 LLKSYIDCLLDKGPCTPEGKELKKVIPEALKT 50 (95)
T ss_dssp HHHHHHHHHTSSSTSHHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHhcCCCCCHHHHHHHHHHHHHHHc
Confidence 34578999999999999999999999888853
No 318
>PF07439 DUF1515: Protein of unknown function (DUF1515); InterPro: IPR010889 This family consists of several hypothetical bacterial proteins of around 130 residues in length. Members of this family seem to be found exclusively in Rhizobium species. The function of this family is unknown.
Probab=31.57 E-value=46 Score=24.36 Aligned_cols=38 Identities=26% Similarity=0.386 Sum_probs=22.5
Q ss_pred HHHHHHHHHH------HHHHhhhHHHHHHHHHHHHHHHHhhHHHhh
Q psy16295 75 SQVQHLESIV------KKRMETEGLKGMALAGGAVLAQGQGTMEMS 114 (128)
Q Consensus 75 ~QA~~Lk~~I------e~~i~kdGliG~~i~gga~~~~g~~~~~~~ 114 (128)
.-+..|+.-| -+.++|-=+.|+|.+| +.++|+....++
T Consensus 54 s~~~~lk~dVsemKpVT~dV~rwklmG~GaLg--vigig~aalgvt 97 (112)
T PF07439_consen 54 SSVSTLKADVSEMKPVTDDVKRWKLMGMGALG--VIGIGGAALGVT 97 (112)
T ss_pred HHHHHHHhhHHhccchHHHHHHHHHhccchhh--hhhHhHHHHHhh
Confidence 3344555544 3566777888888877 444444444455
No 319
>KOG1310|consensus
Probab=31.51 E-value=1.6e+02 Score=27.58 Aligned_cols=48 Identities=13% Similarity=0.120 Sum_probs=36.5
Q ss_pred HHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
.+|+||-+...++.|.+|+.+..++-..-|.|.-.+.--..+.+.++.
T Consensus 447 ah~~la~aL~el~r~~eal~~~~alq~~~Ptd~a~~~~v~~l~rDi~a 494 (758)
T KOG1310|consen 447 AHFRLARALNELTRYLEALSCHWALQMSFPTDVARQNFVLCLPRDISA 494 (758)
T ss_pred HHHHHHHHHHHHhhHHHhhhhHHHHhhcCchhhhhhhhhhccccchHH
Confidence 469999999999999999999999999888665544444444444443
No 320
>KOG1941|consensus
Probab=31.24 E-value=79 Score=28.29 Aligned_cols=46 Identities=15% Similarity=0.293 Sum_probs=35.5
Q ss_pred cchHHHHHHHHHHHhhhhchHHHHHHHHHHH--hcCCCCH--HHHHHHHH
Q psy16295 38 DRKRDYLYYLAIGNARIKEYTKSLKYCRAFL--AIESENS--QVQHLESI 83 (128)
Q Consensus 38 ~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL--~~eP~N~--QA~~Lk~~ 83 (128)
..++=.+|.+||++-++|+...|..||+..- .++-.|+ |++.+...
T Consensus 203 kyr~~~lyhmaValR~~G~LgdA~e~C~Ea~klal~~Gdra~~arc~~~~ 252 (518)
T KOG1941|consen 203 KYRAMSLYHMAVALRLLGRLGDAMECCEEAMKLALQHGDRALQARCLLCF 252 (518)
T ss_pred HHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHhCChHHHHHHHHHH
Confidence 4678889999999999999999999999763 3455555 55555444
No 321
>KOG3824|consensus
Probab=30.90 E-value=92 Score=27.37 Aligned_cols=66 Identities=18% Similarity=0.175 Sum_probs=43.6
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH-------HHHHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLE-------SIVKKRM 88 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk-------~~Ie~~i 88 (128)
..++..|++--+.-. | +..|.|--++.-.--.+|--+|-.|+-+.|.+.|.|..|...+ +.|++++
T Consensus 132 ~ekA~~lfeHAlala-P---~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseALvnR~RT~plV~~iD~r~ 204 (472)
T KOG3824|consen 132 LEKAMTLFEHALALA-P---TNPQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEALVNRARTTPLVSAIDRRM 204 (472)
T ss_pred hHHHHHHHHHHHhcC-C---CCHHHHHHHhHHHHhhhhhHhhhhhhheeeeeCCCchHHHhhhhccchHHHHHHHHH
Confidence 345555666555555 4 4556555444433344666788999999999999999987654 5566555
No 322
>PLN03077 Protein ECB2; Provisional
Probab=30.78 E-value=1.1e+02 Score=28.11 Aligned_cols=33 Identities=9% Similarity=0.013 Sum_probs=18.9
Q ss_pred HHHHHHHHHhhhhchHHHHHHHHHHHh--cCCCCH
Q psy16295 43 YLYYLAIGNARIKEYTKSLKYCRAFLA--IESENS 75 (128)
Q Consensus 43 ~LYyLAvg~yklgdY~~A~~~~~~lL~--~eP~N~ 75 (128)
.+..+--++.+.|++++|++.++.+.+ +.|+..
T Consensus 356 s~n~li~~~~~~g~~~~A~~lf~~M~~~g~~Pd~~ 390 (857)
T PLN03077 356 SWTAMISGYEKNGLPDKALETYALMEQDNVSPDEI 390 (857)
T ss_pred eHHHHHHHHHhCCCHHHHHHHHHHHHHhCCCCCce
Confidence 344455566666666666666665543 346554
No 323
>PRK15180 Vi polysaccharide biosynthesis protein TviD; Provisional
Probab=30.55 E-value=1.2e+02 Score=28.36 Aligned_cols=37 Identities=24% Similarity=0.481 Sum_probs=29.0
Q ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHhc----CCCCHHHH
Q psy16295 42 DYLYYLAIGNARIKEYTKSLKYCRAFLAI----ESENSQVQ 78 (128)
Q Consensus 42 d~LYyLAvg~yklgdY~~A~~~~~~lL~~----eP~N~QA~ 78 (128)
-.+-|-|+...-++||..|++||+++=++ ||--+|.-
T Consensus 777 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 817 (831)
T PRK15180 777 YALKYAALNAMHLRDYTQALQYWQRLEKVNGPTEPVTRQIS 817 (831)
T ss_pred HHHHHHHhhHhHHHHHHHHHHHHHHHHhccCCCcchHHHHH
Confidence 34668899999999999999999999875 44444443
No 324
>KOG2062|consensus
Probab=30.51 E-value=2.3e+02 Score=27.46 Aligned_cols=58 Identities=19% Similarity=0.306 Sum_probs=38.8
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHH-HHHHHHHhhhhchHHHHHHHHH---HHhcCCCCHHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYL-YYLAIGNARIKEYTKSLKYCRA---FLAIESENSQVQ 78 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~L-YyLAvg~yklgdY~~A~~~~~~---lL~~eP~N~QA~ 78 (128)
+|...+.-.|.|+.++.+ ..|+.- -..+=-||-+|+|++|+.|.=. .+.++|+++-..
T Consensus 38 EIsd~l~~IE~lyed~~F---~er~~AaL~~SKVyy~Lgeye~Al~yAL~ag~~F~Vd~~S~y~e 99 (929)
T KOG2062|consen 38 EISDSLPKIESLYEDETF---PERQLAALLASKVYYYLGEYEDALEYALRAGDDFDVDENSDYVE 99 (929)
T ss_pred HhhhhHHHHHHHhccCCC---chhHHHHHHHHHHHHHHHHHHHHHHHHHcCCccccccCccchhh
Confidence 456677777888888754 344443 5566789999999999999433 345555544433
No 325
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=30.47 E-value=3.2e+02 Score=25.02 Aligned_cols=77 Identities=16% Similarity=0.155 Sum_probs=50.8
Q ss_pred ChHHHHHHHHhccCChhhHHHHHHHHHHHhhc-cC-ccccchHHHHHHHHHHH--hhhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 1 MTQFNYSFCLVRSEFCADIRKGILLLEELFKT-ET-EDIDRKRDYLYYLAIGN--ARIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 1 ~~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~-~~-~~~~~~rd~LYyLAvg~--yklgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
|.++.|+-+|..-.-|+-..- +.-+.+ +- |.+-..+|.-.+||=|- |-.|+|.++-.|..=+.++.| -+|
T Consensus 423 fvKq~Y~qaLs~~~~~rLlkL-----e~fi~e~gl~~i~i~e~eian~LaDAEyLysqgey~kc~~ys~WL~~iaP-S~~ 496 (549)
T PF07079_consen 423 FVKQAYKQALSMHAIPRLLKL-----EDFITEVGLTPITISEEEIANFLADAEYLYSQGEYHKCYLYSSWLTKIAP-SPQ 496 (549)
T ss_pred HHHHHHHHHHhhhhHHHHHHH-----HHHHHhcCCCcccccHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCC-cHH
Confidence 357788888876655553332 222222 11 11134566666676664 468999999999999999999 888
Q ss_pred HHHHHHH
Q psy16295 77 VQHLESI 83 (128)
Q Consensus 77 A~~Lk~~ 83 (128)
+..|.-+
T Consensus 497 ~~RLlGl 503 (549)
T PF07079_consen 497 AYRLLGL 503 (549)
T ss_pred HHHHHHH
Confidence 8887644
No 326
>KOG0615|consensus
Probab=30.43 E-value=32 Score=30.81 Aligned_cols=19 Identities=21% Similarity=0.373 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHhcCCCCHH
Q psy16295 58 TKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 58 ~~A~~~~~~lL~~eP~N~Q 76 (128)
++|+..++.+|.+||+|+-
T Consensus 411 eea~dlI~~mL~VdP~~R~ 429 (475)
T KOG0615|consen 411 EEALDLINWMLVVDPENRP 429 (475)
T ss_pred HHHHHHHHHhhEeCcccCc
Confidence 6899999999999999974
No 327
>PF05808 Podoplanin: Podoplanin; InterPro: IPR008783 This family consists of several mammalian podoplanin-like proteins which are thought to control specifically the unique shape of podocytes [].; GO: 0016021 integral to membrane; PDB: 3IET_X.
Probab=30.26 E-value=17 Score=28.19 Aligned_cols=23 Identities=39% Similarity=0.608 Sum_probs=0.0
Q ss_pred HhhhHHHHHHHHH---HHHHHHHhhH
Q psy16295 88 METEGLKGMALAG---GAVLAQGQGT 110 (128)
Q Consensus 88 i~kdGliG~~i~g---ga~~~~g~~~ 110 (128)
..||||-..++|| |+++++|++|
T Consensus 121 ~ek~GL~T~tLVGIIVGVLlaIG~ig 146 (162)
T PF05808_consen 121 VEKDGLSTVTLVGIIVGVLLAIGFIG 146 (162)
T ss_dssp --------------------------
T ss_pred cccCCcceeeeeeehhhHHHHHHHHh
Confidence 6789987755554 5555555443
No 328
>PRK10454 PTS system N,N'-diacetylchitobiose-specific transporter subunit IIA; Provisional
Probab=30.25 E-value=84 Score=22.83 Aligned_cols=29 Identities=14% Similarity=0.163 Sum_probs=23.9
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHH
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFL 68 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL 68 (128)
.-..+|+=|+-..|.|||++|...++..=
T Consensus 30 ~ArS~~~eAl~~Ak~gdfe~A~~~l~eA~ 58 (115)
T PRK10454 30 QARSLAYAALKQAKQGDFAAAKAMMDQSR 58 (115)
T ss_pred HHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 35567888999999999999999876543
No 329
>PF12729 4HB_MCP_1: Four helix bundle sensory module for signal transduction; InterPro: IPR024478 This entry represents a four-helix bundle that operates as a ubiquitous sensory module in prokaryotic signal-transduction, which is known as four-helix bundles methyl-accepting chemotaxis protein (4HB_MCP) domain. The 4HB_MCP is always found between two predicted transmembrane helices indicating that it detects only extracellular signals. In many cases the domain is associated with a cytoplasmic HAMP domain suggesting that most proteins carrying the bundle might share the mechanism of transmembrane signalling which is well-characterised in E coli chemoreceptors [].
Probab=30.20 E-value=95 Score=21.41 Aligned_cols=31 Identities=16% Similarity=0.075 Sum_probs=17.9
Q ss_pred HHHHHHHHHHHHHHHhhHHHhhhhhhcchhh
Q psy16295 93 LKGMALAGGAVLAQGQGTMEMSCKSYKNTAS 123 (128)
Q Consensus 93 liG~~i~gga~~~~g~~~~~~~~~~~~~~~~ 123 (128)
++|++++....+++|+.|.+-..+...++..
T Consensus 9 ~~~f~~~~~l~~~~~~~~~~~l~~~~~~~~~ 39 (181)
T PF12729_consen 9 ILGFGLIILLLLIVGIVGLYSLSQINQNVEE 39 (181)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4555666555566776666655555554443
No 330
>PF11981 DUF3482: Domain of unknown function (DUF3482); InterPro: IPR021871 This presumed domain is functionally uncharacterised. This domain is found in bacteria and eukaryotes. This domain is typically between 289 to 301 amino acids in length. This domain is found associated with PF01926 from PFAM.
Probab=30.03 E-value=73 Score=26.53 Aligned_cols=34 Identities=18% Similarity=0.251 Sum_probs=16.4
Q ss_pred HHhhhHHHHHHH---HHHHHHHH------HhhHHHhhhhhhcc
Q psy16295 87 RMETEGLKGMAL---AGGAVLAQ------GQGTMEMSCKSYKN 120 (128)
Q Consensus 87 ~i~kdGliG~~i---~gga~~~~------g~~~~~~~~~~~~~ 120 (128)
-...-+++|+|| +||..+++ .++|.|-..+.|.+
T Consensus 151 GaaaGAa~GagiDl~tgG~SLG~gaaiGal~Gg~~~~~~~~~~ 193 (292)
T PF11981_consen 151 GAAAGAAAGAGIDLATGGLSLGAGAAIGALAGGAWQGGRRYGK 193 (292)
T ss_pred hHHHHHHHhHHHHHHhhhhhhHHHHHHHHHHHHHHhhhhhhHH
Confidence 444555666665 34433222 23355555555554
No 331
>KOG0495|consensus
Probab=29.77 E-value=2.7e+02 Score=26.87 Aligned_cols=69 Identities=10% Similarity=0.034 Sum_probs=56.0
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRM 88 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i 88 (128)
.+.+.+++.||++.++.= | .=.-.+..++.-+=..++-+.||..+.+-+++=|...--=-|..-+|++.
T Consensus 664 ld~~eeA~rllEe~lk~f-p---~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~cP~~ipLWllLakleEk~ 732 (913)
T KOG0495|consen 664 LDNVEEALRLLEEALKSF-P---DFHKLWLMLGQIEEQMENIEMAREAYLQGTKKCPNSIPLWLLLAKLEEKD 732 (913)
T ss_pred hhhHHHHHHHHHHHHHhC-C---chHHHHHHHhHHHHHHHHHHHHHHHHHhccccCCCCchHHHHHHHHHHHh
Confidence 357789999999999986 6 34455566777888899999999999999999999988777777777664
No 332
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=29.70 E-value=1.7e+02 Score=24.37 Aligned_cols=41 Identities=7% Similarity=-0.066 Sum_probs=35.7
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQH 79 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~ 79 (128)
.-..+++-++=.+.-.++++.+...++.++..||-|.++..
T Consensus 151 ~~~~~l~~lae~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~ 191 (280)
T COG3629 151 LFIKALTKLAEALIACGRADAVIEHLERLIELDPYDEPAYL 191 (280)
T ss_pred HHHHHHHHHHHHHHhcccHHHHHHHHHHHHhcCccchHHHH
Confidence 34667788888999999999999999999999999988754
No 333
>KOG2300|consensus
Probab=29.31 E-value=1.2e+02 Score=27.96 Aligned_cols=69 Identities=14% Similarity=0.222 Sum_probs=47.5
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcC
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIE 71 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~e 71 (128)
..|.|..-++-.+.+|..+..+.+...=......-...-..+|--++-.|+.++|.||+++...-|++-
T Consensus 407 nlnlAi~YL~~~~~ed~y~~ld~i~p~nt~s~ssq~l~a~~~~v~glfaf~qn~lnEaK~~l~e~Lkma 475 (629)
T KOG2300|consen 407 NLNLAISYLRIGDAEDLYKALDLIGPLNTNSLSSQRLEASILYVYGLFAFKQNDLNEAKRFLRETLKMA 475 (629)
T ss_pred HHhHHHHHHHhccHHHHHHHHHhcCCCCCCcchHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHhhc
Confidence 346777777777777777766665543222110001334567888999999999999999999999876
No 334
>TIGR01987 HI0074 nucleotidyltransferase substrate binding protein, HI0074 family. The member of this family from Haemophilus influenzae, HI0074, has been shown by crystal structure to resemble nucleotidyltransferase substrate binding proteins. It forms a complex with HI0073, encoded by the adjacent gene and containing a nucleotidyltransferase nucleotide binding domain (pfam01909).
Probab=29.29 E-value=79 Score=23.01 Aligned_cols=24 Identities=8% Similarity=0.094 Sum_probs=19.8
Q ss_pred hhhchHHHHHHHHHHHhcCCCCHH
Q psy16295 53 RIKEYTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 53 klgdY~~A~~~~~~lL~~eP~N~Q 76 (128)
|..+|.+|+.-++...+.+|.|.-
T Consensus 1 r~~nf~kAl~~L~~a~~~~~~~~~ 24 (123)
T TIGR01987 1 KFESFEQALMQLSDANWFDLTNDI 24 (123)
T ss_pred CHHHHHHHHHHHHHHHhcCccchH
Confidence 467899999999999988887764
No 335
>PF04406 TP6A_N: Type IIB DNA topoisomerase; InterPro: IPR013049 This entry represents the N-terminal domain found in Spo11, a meiotic recombination protein found in eukaryotes, and in subunit A of topoisomerase VI, a type IIB topoisomerase found predominantly in archaea [, ]. These two types of proteins share structural homology. Spo11 is a meiosis-specific protein that is responsible for the initiation of recombination through the formation of DNA double-strand breaks by a type II DNA topoisomerase-like activity. Spo11 acts in conjunction with several other proteins, including Rec102 in yeast, to bring about meiotic recombination []. DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. They can be divided into two classes: type I enzymes (5.99.1.2 from EC, topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC, topoisomerases II, IV and VI) break double-strand DNA []. Topoisomerase VI is a type IIB enzymes that assembles as a heterotetramer, consisting of two A subunits required for DNA cleavage and two B subunits required for ATP hydrolysis. The B subunit is structurally similar to the ATPase domain of type IIA topoisomerases, but the A subunit is distinct, and instead shares homology with the Spo11 protein. More information about this protein can be found at Protein of the Month: DNA Topoisomerase [].; GO: 0003677 DNA binding, 0003824 catalytic activity, 0005524 ATP binding, 0006259 DNA metabolic process, 0005694 chromosome; PDB: 1D3Y_B 2ZBK_A 2Q2E_A.
Probab=29.02 E-value=46 Score=21.33 Aligned_cols=52 Identities=17% Similarity=0.291 Sum_probs=32.9
Q ss_pred HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC
Q psy16295 21 KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN 74 (128)
Q Consensus 21 ~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N 74 (128)
.-++...+++.++.. ...||.+|-----+-.+.+-+++..-+..+|.+.+++
T Consensus 13 ~vl~~i~~~l~~~~~--~T~RdiYY~~~~lF~~Q~~vd~~i~di~~~l~~~R~~ 64 (68)
T PF04406_consen 13 RVLAIIHELLQNNKT--STKRDIYYRNVSLFKSQREVDRAIDDICCLLGVSRED 64 (68)
T ss_dssp HHHHHHHHHHHTT----EEHHHHHHHCSCS-SSHHHHHHHHHHHHHHHTS-GGG
T ss_pred HHHHHHHHHHHcCCc--cchHHHHHhchhhccCHHHHHHHHHHHHHHHCCCHHH
Confidence 445666777777754 5777766654444455667778888888888886554
No 336
>KOG2376|consensus
Probab=28.95 E-value=2.8e+02 Score=26.03 Aligned_cols=67 Identities=18% Similarity=0.150 Sum_probs=53.1
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhhhHHHHHH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMETEGLKGMA 97 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~kdGliG~~ 97 (128)
-.+.+++|.+....+ | +..-+.+..+|--+...|+.+.|.+....++ .+=+.-|.+.-..-|++|..
T Consensus 357 ~~ka~e~L~~~~~~~-p--~~s~~v~L~~aQl~is~gn~~~A~~il~~~~---------~~~~ss~~~~~~~P~~V~ai 423 (652)
T KOG2376|consen 357 HKKAIELLLQFADGH-P--EKSKVVLLLRAQLKISQGNPEVALEILSLFL---------ESWKSSILEAKHLPGTVGAI 423 (652)
T ss_pred HhhhHHHHHHHhccC-C--chhHHHHHHHHHHHHhcCCHHHHHHHHHHHh---------hhhhhhhhhhccChhHHHHH
Confidence 457899999998888 6 5779999999999999999999999998777 23345566666667777653
No 337
>PRK08878 adenosylcobinamide-phosphate synthase; Provisional
Probab=28.76 E-value=2.3e+02 Score=23.75 Aligned_cols=44 Identities=7% Similarity=0.162 Sum_probs=30.4
Q ss_pred hhhchHHHHHHHHHHHhcCCCCHHHHHHHHH-HHHHH--hhhHHHHH
Q psy16295 53 RIKEYTKSLKYCRAFLAIESENSQVQHLESI-VKKRM--ETEGLKGM 96 (128)
Q Consensus 53 klgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~-Ie~~i--~kdGliG~ 96 (128)
+.||-+.||++...+..-|.++.....+.+. ||.-. ..||+++.
T Consensus 117 ~~gdl~~AR~~l~~iV~Rdt~~L~~~~i~rA~iEs~aen~~d~v~Ap 163 (317)
T PRK08878 117 AKEDKAQARELLKPYVNRDTETLSLLGLGKAGAETIIMGYGRNVIGV 163 (317)
T ss_pred HcCCHHHHHHHHHHHhCCCcccCCHHHHHHHHHHHHHHHccccchHH
Confidence 4678999999999998888866655555433 33333 23887774
No 338
>TIGR00380 cobD cobalamin biosynthesis protein CobD. This protein is involved in cobalamin (vitamin B12) biosynthesis and porphyrin biosynthesis. It converts cobyric acid to cobinamide by the addition of aminopropanol on the F carboxylic group. It is part of the cob operon.
Probab=28.63 E-value=3.3e+02 Score=22.76 Aligned_cols=44 Identities=9% Similarity=0.183 Sum_probs=31.5
Q ss_pred hhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh---hHHHHH
Q psy16295 53 RIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET---EGLKGM 96 (128)
Q Consensus 53 klgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k---dGliG~ 96 (128)
+.||-++||+....+..-|.++--...+.+..-+-+.. ||+++.
T Consensus 114 ~~gdl~~AR~~l~~iv~Rdt~~Ls~~~i~ra~iEs~aen~~d~~~ap 160 (305)
T TIGR00380 114 KAGDLETARKKLQMIVSRDTNNLSEEQILSAAVESVAENIVDGVTAP 160 (305)
T ss_pred HcCCHHHHHHHHHHHhCCCcccCCHHHHHHHHHHHHHHHhcchhHHH
Confidence 45789999999999998888876665555444443333 888874
No 339
>KOG1839|consensus
Probab=28.52 E-value=73 Score=31.89 Aligned_cols=54 Identities=15% Similarity=0.174 Sum_probs=42.9
Q ss_pred HHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHH-------HHHHHhcCCCCH
Q psy16295 19 IRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKY-------CRAFLAIESENS 75 (128)
Q Consensus 19 ~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~-------~~~lL~~eP~N~ 75 (128)
+-+.+.+++..+..--| ....|+-+||.-+++++||.+|..+ +++++.+++.|.
T Consensus 954 ~~~slnl~~~v~~~~h~---~~~~~~~~La~l~~~~~d~~~Ai~~~~ka~ii~eR~~g~ds~~t 1014 (1236)
T KOG1839|consen 954 LPESLNLLNNVMGVLHP---EVASKYRSLAKLSNRLGDNQEAIAQQRKACIISERVLGKDSPNT 1014 (1236)
T ss_pred hhhhhhHHHHhhhhcch---hHHHHHHHHHHHHhhhcchHHHHHhcccceeeechhccCCCHHH
Confidence 45566788887777424 6799999999999999999999998 567777776554
No 340
>KOG1550|consensus
Probab=28.52 E-value=4.3e+02 Score=23.57 Aligned_cols=31 Identities=13% Similarity=0.073 Sum_probs=13.5
Q ss_pred HHHHHHHHHHHhhh----hchHHHHHHHHHHHhcC
Q psy16295 41 RDYLYYLAIGNARI----KEYTKSLKYCRAFLAIE 71 (128)
Q Consensus 41 rd~LYyLAvg~ykl----gdY~~A~~~~~~lL~~e 71 (128)
.+..|++|+.+..- .+.++|..+..+.=+..
T Consensus 360 ~~A~~~la~~y~~G~gv~r~~~~A~~~~k~aA~~g 394 (552)
T KOG1550|consen 360 ILAIYRLALCYELGLGVERNLELAFAYYKKAAEKG 394 (552)
T ss_pred hHHHHHHHHHHHhCCCcCCCHHHHHHHHHHHHHcc
Confidence 34444444443322 34444444444444444
No 341
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=28.41 E-value=2e+02 Score=23.03 Aligned_cols=56 Identities=11% Similarity=-0.019 Sum_probs=35.6
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhh-hhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNAR-IKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yk-lgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
.+..+-.+++...+.. + ..-+.+...|.--|+ .+|.+.|++.++..|+.-|++...
T Consensus 16 g~~~aR~vF~~a~~~~-~---~~~~vy~~~A~~E~~~~~d~~~A~~Ife~glk~f~~~~~~ 72 (280)
T PF05843_consen 16 GIEAARKVFKRARKDK-R---CTYHVYVAYALMEYYCNKDPKRARKIFERGLKKFPSDPDF 72 (280)
T ss_dssp HHHHHHHHHHHHHCCC-C---S-THHHHHHHHHHHHTCS-HHHHHHHHHHHHHHHTT-HHH
T ss_pred ChHHHHHHHHHHHcCC-C---CCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHCCCCHHH
Confidence 3555555666665443 2 345666666666556 677777999999999999988654
No 342
>PLN03094 Substrate binding subunit of ER-derived-lipid transporter; Provisional
Probab=28.09 E-value=84 Score=27.20 Aligned_cols=44 Identities=27% Similarity=0.162 Sum_probs=29.0
Q ss_pred CCCHHHHHHHHHHH----HHHhhhHHHHHHHHHHHHHHHHhhHHHhhhh
Q psy16295 72 SENSQVQHLESIVK----KRMETEGLKGMALAGGAVLAQGQGTMEMSCK 116 (128)
Q Consensus 72 P~N~QA~~Lk~~Ie----~~i~kdGliG~~i~gga~~~~g~~~~~~~~~ 116 (128)
|.|-=.+.||-+-+ ++.-+||.+|+-+++|+++.++ +..|+...
T Consensus 62 ~~~~~~~~~~~~~~~~~~rrsvrEg~VGlfvL~gi~ll~~-~~~~L~g~ 109 (370)
T PLN03094 62 PRNIWRQTLKPLSDFGFGKRSVWEGGVGLFLLSGAALLAL-TLAWLRGF 109 (370)
T ss_pred hHHHHHHhhcchhhcCCcchhHHHHHHHHHHHHHHHHHHH-HHHHHcCC
Confidence 44455555665544 7778899999999987775554 44555433
No 343
>KOG2003|consensus
Probab=27.91 E-value=3.6e+02 Score=25.12 Aligned_cols=61 Identities=13% Similarity=0.110 Sum_probs=47.5
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHL 80 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~L 80 (128)
.+--.++|..||.-.--. | .+...-..+|-+.-|.|+|.+|...+..+-+.-|+|-.-...
T Consensus 639 tqf~ekai~y~ekaaliq-p---~~~kwqlmiasc~rrsgnyqka~d~yk~~hrkfpedldclkf 699 (840)
T KOG2003|consen 639 TQFSEKAINYFEKAALIQ-P---NQSKWQLMIASCFRRSGNYQKAFDLYKDIHRKFPEDLDCLKF 699 (840)
T ss_pred hHHHHHHHHHHHHHHhcC-c---cHHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCccchHHHHH
Confidence 445567888888766666 5 566666778888899999999999999999999998765443
No 344
>PRK03954 ribonuclease P protein component 4; Validated
Probab=27.83 E-value=94 Score=22.84 Aligned_cols=32 Identities=22% Similarity=0.146 Sum_probs=26.4
Q ss_pred cchHHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 38 DRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 38 ~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
..+-++||.+|...+ ..+++-|++|+..+-++
T Consensus 19 ~eRi~~L~~~A~~~~-~~~pelar~Yv~lar~I 50 (121)
T PRK03954 19 RERIDTLFTLAERVF-PYSPELANRYVELALAV 50 (121)
T ss_pred HHHHHHHHHHHHHHh-hcCHHHHHHHHHHHHHH
Confidence 367899999998888 45899999999887654
No 345
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=27.17 E-value=98 Score=15.78 Aligned_cols=25 Identities=12% Similarity=0.041 Sum_probs=19.6
Q ss_pred HHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 45 YYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 45 YyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
-.+--++.+.|+.+.|.+.++.+.+
T Consensus 5 ~~ll~a~~~~g~~~~a~~~~~~M~~ 29 (34)
T PF13812_consen 5 NALLRACAKAGDPDAALQLFDEMKE 29 (34)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 3455678889999999998888765
No 346
>KOG1070|consensus
Probab=27.12 E-value=2.5e+02 Score=29.15 Aligned_cols=60 Identities=15% Similarity=-0.007 Sum_probs=49.8
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHH
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLES 82 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~ 82 (128)
-+.+++++|+..++.-. +.+..|-.++-...++.+=+.|+..+.+.|+.-|. +|-..+-.
T Consensus 1545 k~~~A~ell~~m~KKF~----q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk-~eHv~~Is 1604 (1710)
T KOG1070|consen 1545 KNDEADELLRLMLKKFG----QTRKVWIMYADFLLRQNEAEAARELLKRALKSLPK-QEHVEFIS 1604 (1710)
T ss_pred cchhHHHHHHHHHHHhc----chhhHHHHHHHHHhcccHHHHHHHHHHHHHhhcch-hhhHHHHH
Confidence 45679999999999862 68999999999999999999999999999999999 44444433
No 347
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=27.05 E-value=97 Score=26.14 Aligned_cols=32 Identities=13% Similarity=0.220 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHHHhh-HHHhhhhhhcchhhhh
Q psy16295 94 KGMALAGGAVLAQGQG-TMEMSCKSYKNTASVL 125 (128)
Q Consensus 94 iG~~i~gga~~~~g~~-~~~~~~~~~~~~~~~~ 125 (128)
+.+|+++|.++++|++ ...+-..++++...+-
T Consensus 400 l~~~~~~Gl~lg~~~~~l~e~ld~~i~s~~~v~ 432 (444)
T TIGR03017 400 LVLSIFLGMLLGIGFAFLAELMDRRVRSADDII 432 (444)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCHHHHH
Confidence 4466677777766633 6667777787776653
No 348
>COG1270 CbiB Cobalamin biosynthesis protein CobD/CbiB [Coenzyme metabolism]
Probab=27.00 E-value=2.7e+02 Score=23.79 Aligned_cols=46 Identities=9% Similarity=0.129 Sum_probs=35.2
Q ss_pred HhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHH-HHHHHh--hhHHHHH
Q psy16295 51 NARIKEYTKSLKYCRAFLAIESENSQVQHLESI-VKKRME--TEGLKGM 96 (128)
Q Consensus 51 ~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~-Ie~~i~--kdGliG~ 96 (128)
.-+.||-++||+.+..+-.-||++-...++-+. ||.-.+ -||+++-
T Consensus 112 ~L~~gdl~~aR~~ls~iV~RDts~L~~~~i~~AaIES~aEN~~DgvvaP 160 (320)
T COG1270 112 ALRRGDLEGARRALSMIVGRDTSKLSEAEIASAAIESLAENLVDGVVAP 160 (320)
T ss_pred HHHhCCHHHHHHHHHHHhcCCcccCCHHHHHHHHHHHHHHhcccchHHH
Confidence 457789999999999999999999877766554 444443 3888873
No 349
>COG4872 Predicted membrane protein [Function unknown]
Probab=26.98 E-value=1.4e+02 Score=26.12 Aligned_cols=40 Identities=23% Similarity=0.248 Sum_probs=28.1
Q ss_pred CCHHHHHHHHHHHHH----HhhhHHHHHHHHHHHHHHHHhhHHH
Q psy16295 73 ENSQVQHLESIVKKR----METEGLKGMALAGGAVLAQGQGTME 112 (128)
Q Consensus 73 ~N~QA~~Lk~~Ie~~----i~kdGliG~~i~gga~~~~g~~~~~ 112 (128)
+..|++.+++-.+.+ .++|-+.-++.+||..+++|++.+.
T Consensus 22 daaqa~~~~~d~~~~~re~~s~~~~~i~atigglLlgagvi~fV 65 (394)
T COG4872 22 DAAQAQQIADDYKFKNREIASRDRMMIAATIGGLLLGAGVITFV 65 (394)
T ss_pred cHHHHHHHHHHHHHHHHHhhccchhHHHHHHHHHHHHHHHHHHH
Confidence 567888888777633 3456566688888888888876543
No 350
>PRK10132 hypothetical protein; Provisional
Probab=26.89 E-value=2.4e+02 Score=20.14 Aligned_cols=35 Identities=9% Similarity=0.008 Sum_probs=21.5
Q ss_pred HHHHHHHHHHhhhHHHHHHHHHHHHHHHHhhHHHhhhh
Q psy16295 79 HLESIVKKRMETEGLKGMALAGGAVLAQGQGTMEMSCK 116 (128)
Q Consensus 79 ~Lk~~Ie~~i~kdGliG~~i~gga~~~~g~~~~~~~~~ 116 (128)
.--...++-|...=+.++||++|+.+++| ++++|+
T Consensus 73 ~a~~~~~~~V~~~Pw~svgiaagvG~llG---~Ll~RR 107 (108)
T PRK10132 73 DAVGCADTFVRERPWCSVGTAAAVGIFIG---ALLSLR 107 (108)
T ss_pred HHHHHHHHHHHhCcHHHHHHHHHHHHHHH---HHHhcc
Confidence 33334455666677888888776655555 446654
No 351
>PF11691 DUF3288: Protein of unknown function (DUF3288); InterPro: IPR021705 This family of proteins with unknown function appears to be restricted to Cyanobacteria.
Probab=26.55 E-value=1.2e+02 Score=21.33 Aligned_cols=28 Identities=18% Similarity=0.255 Sum_probs=23.4
Q ss_pred hHHHHHHHHHHHhcCCCCHHHHHHHHHH
Q psy16295 57 YTKSLKYCRAFLAIESENSQVQHLESIV 84 (128)
Q Consensus 57 Y~~A~~~~~~lL~~eP~N~QA~~Lk~~I 84 (128)
|..=|..++.||..+|+..+-.+|-.+.
T Consensus 8 ~~~DR~~vd~Ll~~~p~d~~L~eLARL~ 35 (90)
T PF11691_consen 8 YKTDREIVDRLLAGEPTDYNLAELARLR 35 (90)
T ss_pred hhhhHHHHHHHHcCCCCchhHHHHHHHH
Confidence 5666888999999999999888777664
No 352
>KOG1380|consensus
Probab=26.48 E-value=48 Score=28.99 Aligned_cols=34 Identities=29% Similarity=0.237 Sum_probs=29.7
Q ss_pred hHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCC
Q psy16295 40 KRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESE 73 (128)
Q Consensus 40 ~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~ 73 (128)
-.-++-|+|.-.|+..||.+||+....-|=+.|-
T Consensus 328 ~na~l~~~afkFY~~~d~~~aRklF~aSLlhLP~ 361 (409)
T KOG1380|consen 328 LNAYLAYLAFKFYRDRDYHNARKLFFASLLHLPL 361 (409)
T ss_pred HhHHHHHHHHHHhhccchhhHHHHHHHHHHHHHH
Confidence 4568889999999999999999998888877774
No 353
>KOG1915|consensus
Probab=26.45 E-value=52 Score=30.30 Aligned_cols=25 Identities=24% Similarity=0.501 Sum_probs=0.0
Q ss_pred hhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 53 RIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 53 klgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
++++++++|+.++..|+..|+|-|+
T Consensus 449 qL~efDRcRkLYEkfle~~Pe~c~~ 473 (677)
T KOG1915|consen 449 QLREFDRCRKLYEKFLEFSPENCYA 473 (677)
T ss_pred HHhhHHHHHHHHHHHHhcChHhhHH
No 354
>PF00244 14-3-3: 14-3-3 protein; InterPro: IPR023410 The 14-3-3 proteins are a large family of approximately 30kDa acidic proteins which exist primarily as homo- and heterodimeric within all eukaryotic cells [, ]. There is a high degree of sequence identity and conservation between all the 14-3-3 isotypes, particularly in the regions which form the dimer interface or line the central ligand binding channel of the dimeric molecule. Each 14-3-3 protein sequence can be roughly divided into three sections: a divergent amino terminus, the conserved core region and a divergent carboxyl terminus. The conserved middle core region of the 14-3-3s encodes an amphipathic groove that forms the main functional domain, a cradle for interacting with client proteins. The monomer consists of nine helices organised in an antiparallel manner, forming an L-shaped structure. The interior of the L-structure is composed of four helices: H3 and H5, which contain many charged and polar amino acids, and H7 and H9, which contain hydrophobic amino acids. These four helices form the concave amphipathic groove that interacts with target peptides. 14-3-3 proteins mainly bind proteins containing phosphothreonine or phosphoserine motifs however exceptions to this rule do exist. Extensive investigation of the 14-3-3 binding site of the mammalian serine/threonine kinase Raf-1 has produced a consensus sequence for 14-3-3-binding, RSxpSxP (in the single-letter amino-acid code, where x denotes any amino acid and p indicates that the next residue is phosphorylated). 14-3-3 proteins appear to effect intracellular signalling in one of three ways - by direct regulation of the catalytic activity of the bound protein, by regulating interactions between the bound protein and other molecules in the cell by sequestration or modification or by controlling the subcellular localisation of the bound ligand. Proteins appear to initially bind to a single dominant site and then subsequently to many, much weaker secondary interaction sites. The 14-3-3 dimer is capable of changing the conformation of its bound ligand whilst itself undergoing minimal structural alteration. This entry represents the structural domain found in 14-3-3 proteins.; PDB: 2O8P_A 3AXY_D 2C74_A 2C63_A 4DX0_A 1YWT_A 3P1O_A 3P1N_A 4DAU_A 3U9X_A ....
Probab=26.44 E-value=3e+02 Score=21.80 Aligned_cols=71 Identities=14% Similarity=0.142 Sum_probs=48.6
Q ss_pred hHHHHHHHHHHHhhccCccccchHHHHHHHHHHH-hhhhchHHHHHHHHHHHhcCCCC--HHHHHHHHHHHHHHhhh
Q psy16295 18 DIRKGILLLEELFKTETEDIDRKRDYLYYLAIGN-ARIKEYTKSLKYCRAFLAIESEN--SQVQHLESIVKKRMETE 91 (128)
Q Consensus 18 d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~-yklgdY~~A~~~~~~lL~~eP~N--~QA~~Lk~~Ie~~i~kd 91 (128)
-..+-++.+.++...+ | +-..|=...|+++| --.+.+..+.+-+..+.+.++.+ .....+-+-...++.+|
T Consensus 16 Ry~dmv~~mk~~~~~~-~--eLt~eERnLlsvayKn~i~~~R~s~R~l~~~e~~~~~~~~~~~~~~i~~yk~kie~E 89 (236)
T PF00244_consen 16 RYDDMVEYMKQLIEMN-P--ELTEEERNLLSVAYKNVIGSRRASWRILSSIEQKEENKGNEKQVKLIKDYKKKIEDE 89 (236)
T ss_dssp HHHHHHHHHHHHHHTS-S-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTTHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHccC-C--CCCHHHHHHHHHHHHhccccchHHHHhhhhHhhhhcccchhHHHHHHHHHHHHHHHH
Confidence 3456788888888887 5 57788888888888 45788888888888888888775 44444444444444444
No 355
>COG5499 Predicted transcription regulator containing HTH domain [Transcription]
Probab=26.29 E-value=87 Score=23.14 Aligned_cols=28 Identities=14% Similarity=0.159 Sum_probs=21.9
Q ss_pred hhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 54 IKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 54 lgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
-.||..|+.-++.++..+|+|+--...+
T Consensus 11 e~dy~~Al~eV~~lfd~dpE~~~~e~dr 38 (120)
T COG5499 11 EADYRWALAEVDKLFDNDPENPLLELDR 38 (120)
T ss_pred HHHHHHHHHHHHHHhcCCcccCchHHHH
Confidence 3578999999999999999997654433
No 356
>KOG2396|consensus
Probab=25.92 E-value=1.4e+02 Score=27.51 Aligned_cols=51 Identities=10% Similarity=0.063 Sum_probs=44.8
Q ss_pred HHHHHHHHhhccCccccchHHHHHHHHHHHhhhhc-hHHHHHHHHHHHhcCCCCHH
Q psy16295 22 GILLLEELFKTETEDIDRKRDYLYYLAIGNARIKE-YTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 22 GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgd-Y~~A~~~~~~lL~~eP~N~Q 76 (128)
--.++.+.+.-+ | ...|.|-|-|.=-|..+. .+.||.....-|+..|+++-
T Consensus 124 v~ki~~~~l~~H-p---~~~dLWI~aA~wefe~n~ni~saRalflrgLR~npdsp~ 175 (568)
T KOG2396|consen 124 VKKIFAAMLAKH-P---NNPDLWIYAAKWEFEINLNIESARALFLRGLRFNPDSPK 175 (568)
T ss_pred HHHHHHHHHHhC-C---CCchhHHhhhhhHHhhccchHHHHHHHHHHhhcCCCChH
Confidence 345678888888 6 789999999999999888 89999999999999999985
No 357
>TIGR02230 ATPase_gene1 F0F1-ATPase subunit, putative. This model represents a protein found encoded in F1F0-ATPase operons in several genomes, including Methanosarcina barkeri (archaeal) and Chlorobium tepidum (bacterial). It is a small protein (about 100 amino acids) with long hydrophic stretches and is presumed to be a subunit of the enzyme.
Probab=25.71 E-value=2.2e+02 Score=20.25 Aligned_cols=23 Identities=22% Similarity=0.196 Sum_probs=9.4
Q ss_pred HHHHHHHHHHHHHHHHhhHHHhhh
Q psy16295 92 GLKGMALAGGAVLAQGQGTMEMSC 115 (128)
Q Consensus 92 GliG~~i~gga~~~~g~~~~~~~~ 115 (128)
|.||..++..+++++ .+|.||=.
T Consensus 46 g~IG~~~v~pil~G~-~lG~WLD~ 68 (100)
T TIGR02230 46 GLIGWSVAIPTLLGV-AVGIWLDR 68 (100)
T ss_pred HHHHHHHHHHHHHHH-HHHHHHHh
Confidence 344444443332222 24556543
No 358
>PF09670 Cas_Cas02710: CRISPR-associated protein (Cas_Cas02710)
Probab=25.61 E-value=3.2e+02 Score=23.21 Aligned_cols=68 Identities=21% Similarity=0.254 Sum_probs=0.0
Q ss_pred ChhhHHHHHHHHHHHhhccCccccchH-HHHHHHHHHHhh--hhchHHHHHHHHHHHhcCCC-CHHHHHHHHHHH
Q psy16295 15 FCADIRKGILLLEELFKTETEDIDRKR-DYLYYLAIGNAR--IKEYTKSLKYCRAFLAIESE-NSQVQHLESIVK 85 (128)
Q Consensus 15 ~~~d~~~GI~LL~~l~~~~~~~~~~~r-d~LYyLAvg~yk--lgdY~~A~~~~~~lL~~eP~-N~QA~~Lk~~Ie 85 (128)
|..+...+..+|+++... -| ...+ +.+..|+-||-. .-||.+|.++.+.++...=. +.|...|...++
T Consensus 143 n~~~y~aA~~~l~~l~~r-l~--~~~~~~~~~~l~~~y~~WD~fd~~~A~~~l~~~~~~~~~l~~~~~~l~~~~~ 214 (379)
T PF09670_consen 143 NRYDYGAAARILEELLRR-LP--GREEYQRYKDLCEGYDAWDRFDHKEALEYLEKLLKRDKALNQEREGLKELVE 214 (379)
T ss_pred hcCCHHHHHHHHHHHHHh-CC--chhhHHHHHHHHHHHHHHHccCHHHHHHHHHHHHHHhhhhHhHHHHHHHHHH
No 359
>PRK09956 hypothetical protein; Provisional
Probab=25.60 E-value=3.9e+02 Score=22.14 Aligned_cols=81 Identities=10% Similarity=0.110 Sum_probs=50.2
Q ss_pred HHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Q psy16295 7 SFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKK 86 (128)
Q Consensus 7 AwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~ 86 (128)
.+.+.+...+.|+.+.+.-+.++++...++...-.-+++|+ +.-++.++-.+.++.+.+..|+..-. + -.|.+
T Consensus 176 l~ll~k~i~~~dl~~~~~~l~~~l~~~~~~~~~~~~ll~Yi----l~~~~~~~~~~~i~~l~~~~~~~~e~--i-MTiAe 248 (308)
T PRK09956 176 LELIQKHIRDRDLIGMVDRITTLLVRGFTNDSQLQTLFNYL----LQCGDTSRFTRFIQEIAERSPLQKER--L-MTIAE 248 (308)
T ss_pred HHHHHHHhcchhHHHHHHHHHHHHHhccCcHHHHHHHHHHH----hhccCcchHHHHHHHHHHhCcccchH--H-HHHHH
Confidence 34566666677888888888888877522101223466666 24467777778888888888864433 2 24555
Q ss_pred HHhhhHHH
Q psy16295 87 RMETEGLK 94 (128)
Q Consensus 87 ~i~kdGli 94 (128)
++.++|.-
T Consensus 249 ~l~qeG~e 256 (308)
T PRK09956 249 RLRQEGHQ 256 (308)
T ss_pred HHHHHHHH
Confidence 55655543
No 360
>PF12911 OppC_N: N-terminal TM domain of oligopeptide transport permease C
Probab=25.43 E-value=1.1e+02 Score=18.39 Aligned_cols=28 Identities=7% Similarity=-0.141 Sum_probs=17.1
Q ss_pred hhHHHHHHHHHHHHHHHHhhHHHhhhhhh
Q psy16295 90 TEGLKGMALAGGAVLAQGQGTMEMSCKSY 118 (128)
Q Consensus 90 kdGliG~~i~gga~~~~g~~~~~~~~~~~ 118 (128)
|=+++|+.++. ..+++++++-+++--++
T Consensus 16 k~a~~gl~il~-~~vl~ai~~p~~~p~~~ 43 (56)
T PF12911_consen 16 KLAVIGLIILL-ILVLLAIFAPFISPYDP 43 (56)
T ss_pred chHHHHHHHHH-HHHHHHHHHHHcCCCCc
Confidence 45777777774 44566666666655433
No 361
>PF02439 Adeno_E3_CR2: Adenovirus E3 region protein CR2; InterPro: IPR003470 Early region 3 (E3) of human adenoviruses (Ads) codes for proteins that appear to control viral interactions with the host []. This region called CR1 (conserved region 1) [] is found three times in Human adenovirus 19 (a subgroup D adenovirus) 49 kDa protein in the E3 region. CR1 is also found in the 20.1 Kd protein of subgroup B adenoviruses. The function of this 80 amino acid region is unknown. This region is probably a divergent immunoglobulin domain.
Probab=25.15 E-value=96 Score=18.57 Aligned_cols=13 Identities=15% Similarity=0.015 Sum_probs=6.4
Q ss_pred HHHHHHHHHHHhh
Q psy16295 97 ALAGGAVLAQGQG 109 (128)
Q Consensus 97 ~i~gga~~~~g~~ 109 (128)
||++|+++++.++
T Consensus 7 aIIv~V~vg~~ii 19 (38)
T PF02439_consen 7 AIIVAVVVGMAII 19 (38)
T ss_pred hHHHHHHHHHHHH
Confidence 4455555544443
No 362
>PF07849 DUF1641: Protein of unknown function (DUF1641); InterPro: IPR012440 Archaeal and bacterial hypothetical proteins are found in this family, with the region in question being approximately 40 residues long.
Probab=24.50 E-value=80 Score=18.72 Aligned_cols=25 Identities=28% Similarity=0.477 Sum_probs=18.4
Q ss_pred HHHhccCChhhHHHHHHHHHHHhhc
Q psy16295 8 FCLVRSEFCADIRKGILLLEELFKT 32 (128)
Q Consensus 8 waLv~S~~~~d~~~GI~LL~~l~~~ 32 (128)
|.|++.=.-.|+++|+.++-.+++.
T Consensus 14 ~gl~~~l~DpdvqrgL~~ll~~lk~ 38 (42)
T PF07849_consen 14 FGLLRALRDPDVQRGLGFLLAFLKA 38 (42)
T ss_pred HHHHHHHcCHHHHHHHHHHHHHHHH
Confidence 5667776667788888888777764
No 363
>COG1447 CelC Phosphotransferase system cellobiose-specific component IIA [Carbohydrate transport and metabolism]
Probab=24.48 E-value=1.2e+02 Score=21.88 Aligned_cols=26 Identities=12% Similarity=0.153 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHH
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRA 66 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~ 66 (128)
-...+|-|+-++|-|+|++|...++.
T Consensus 19 ArS~~~eAl~~Ak~g~f~~A~~~i~e 44 (105)
T COG1447 19 ARSKAYEALKAAKEGDFEEAEELIQE 44 (105)
T ss_pred HhHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 44567789999999999999987653
No 364
>PF06041 DUF924: Bacterial protein of unknown function (DUF924); InterPro: IPR010323 This entry consists of several hypothetical bacterial proteins of unknown function.; PDB: 2I6H_B.
Probab=24.41 E-value=82 Score=24.37 Aligned_cols=25 Identities=28% Similarity=0.309 Sum_probs=19.9
Q ss_pred HHhccCChhhHHHHHHHHHHHhhcc
Q psy16295 9 CLVRSEFCADIRKGILLLEELFKTE 33 (128)
Q Consensus 9 aLv~S~~~~d~~~GI~LL~~l~~~~ 33 (128)
-|.||.+..++..++.+++.+..+.
T Consensus 112 Pl~HSE~l~~q~~~v~l~~~l~~~~ 136 (188)
T PF06041_consen 112 PLMHSEDLEDQERAVRLFERLAEDA 136 (188)
T ss_dssp HHHCSSSHHHHHHHHHHHTTT----
T ss_pred ccccCCCHHHHHHHHHHHHHHhccc
Confidence 5889999999999999999998775
No 365
>KOG3364|consensus
Probab=24.20 E-value=77 Score=24.30 Aligned_cols=38 Identities=11% Similarity=0.153 Sum_probs=22.5
Q ss_pred chHHHHHHHHHHHhhhhc---hHHHHHHHHHHHhcCCCCHH
Q psy16295 39 RKRDYLYYLAIGNARIKE---YTKSLKYCRAFLAIESENSQ 76 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgd---Y~~A~~~~~~lL~~eP~N~Q 76 (128)
-.+.|.+-+|.+.-+..+ -.+.+...+.+++.+|..++
T Consensus 30 ~s~~s~f~lAwaLV~S~~~~dv~~GI~iLe~l~~~~~~~~r 70 (149)
T KOG3364|consen 30 VSKQSQFNLAWALVRSRDTEDVQEGIVILEDLLKSAHPERR 70 (149)
T ss_pred chHHHHHHHHHHHHcccchHHHHHhHHHHHHHhhhcCcccc
Confidence 356777777766666544 34555667777764444443
No 366
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=24.15 E-value=3.3e+02 Score=22.34 Aligned_cols=56 Identities=23% Similarity=0.256 Sum_probs=43.3
Q ss_pred hhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 16 CADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
..-..+-+.+|+.-++.+ | ...+=-+.||-.+. ++-+=++-.+-|+.+|...|++.
T Consensus 44 ~a~~E~klsilerAL~~n-p--~~~~L~l~~l~~~~-~~~~~~~l~~~we~~l~~~~~~~ 99 (321)
T PF08424_consen 44 RALAERKLSILERALKHN-P--DSERLLLGYLEEGE-KVWDSEKLAKKWEELLFKNPGSP 99 (321)
T ss_pred HHHHHHHHHHHHHHHHhC-C--CCHHHHHHHHHHHH-HhCCHHHHHHHHHHHHHHCCCCh
Confidence 344457789999999998 7 46666677777765 66688888899999999999865
No 367
>PF14007 YtpI: YtpI-like protein
Probab=24.05 E-value=82 Score=21.97 Aligned_cols=24 Identities=17% Similarity=0.324 Sum_probs=19.3
Q ss_pred HHHHHHHHHHhhHHHhhhhhhcch
Q psy16295 98 LAGGAVLAQGQGTMEMSCKSYKNT 121 (128)
Q Consensus 98 i~gga~~~~g~~~~~~~~~~~~~~ 121 (128)
|+|++.+++|++..+...|.||--
T Consensus 61 iV~~ifl~lG~~n~~~G~r~y~hy 84 (89)
T PF14007_consen 61 IVGAIFLVLGLFNLFAGIRAYRHY 84 (89)
T ss_pred HHHHHHHHHhHHHHHHHHHHHHhh
Confidence 456566999999999999988754
No 368
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=24.01 E-value=2.7e+02 Score=23.68 Aligned_cols=68 Identities=10% Similarity=0.146 Sum_probs=0.0
Q ss_pred HHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCC-----------------HHHHHHHHHHHHHH
Q psy16295 26 LEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESEN-----------------SQVQHLESIVKKRM 88 (128)
Q Consensus 26 L~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N-----------------~QA~~Lk~~Ie~~i 88 (128)
+++.+...-| ...|-.+.-+......+++.+|...++..|+.+|+| ..|+++...+-...
T Consensus 122 lr~~ld~~~~---~~~e~~~~~~~~~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~ 198 (304)
T COG3118 122 LRQFLDKVLP---AEEEEALAEAKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQA 198 (304)
T ss_pred HHHHHHHhcC---hHHHHHHHHhhhhhhccchhhHHHHHHHHHHhCcccchHHHHHHHHHHHcCChHHHHHHHHhCcccc
Q ss_pred hhhHHHHH
Q psy16295 89 ETEGLKGM 96 (128)
Q Consensus 89 ~kdGliG~ 96 (128)
..+-.+|+
T Consensus 199 ~~~~~~~l 206 (304)
T COG3118 199 QDKAAHGL 206 (304)
T ss_pred hhhHHHHH
No 369
>PRK12482 flagellar motor protein MotA; Provisional
Probab=23.54 E-value=2.3e+02 Score=23.61 Aligned_cols=18 Identities=0% Similarity=0.058 Sum_probs=8.8
Q ss_pred hhhHHHHHHHHHHHhhcc
Q psy16295 16 CADIRKGILLLEELFKTE 33 (128)
Q Consensus 16 ~~d~~~GI~LL~~l~~~~ 33 (128)
.+++.+....+...++..
T Consensus 51 ~~~~~~~~k~~~~~f~~~ 68 (287)
T PRK12482 51 KSVLKEMWHQIKGVIRRK 68 (287)
T ss_pred HHHHHHHHHHHHHHhcCC
Confidence 444555555555555443
No 370
>PF12304 BCLP: Beta-casein like protein; InterPro: IPR020977 This entry represents eukaryotic proteins that are typically between 216 to 240 amino acids in length which have two conserved sequence motifs: VLR and TRIY. Beta-casein-like protein is associated with cell morphology and a regulation of growth pattern of tumours. It is found in adenocarcinomas of uterine cervical tissues[].
Probab=23.31 E-value=83 Score=24.99 Aligned_cols=42 Identities=19% Similarity=0.215 Sum_probs=29.1
Q ss_pred HHHhcCCCCHHHHHHHHHHHHHHhhhHHHHHHHHHHHH-HHHHhhHHHhhh
Q psy16295 66 AFLAIESENSQVQHLESIVKKRMETEGLKGMALAGGAV-LAQGQGTMEMSC 115 (128)
Q Consensus 66 ~lL~~eP~N~QA~~Lk~~Ie~~i~kdGliG~~i~gga~-~~~g~~~~~~~~ 115 (128)
..|++--..+|+.++...|-+-+ +++.|.+ ++.||.+..+||
T Consensus 24 ~VLRhv~~~~~av~~eY~vsNii--------sv~Sgll~I~~GI~AIvlSr 66 (188)
T PF12304_consen 24 TVLRHVSNPKDAVTLEYAVSNII--------SVTSGLLSIICGIVAIVLSR 66 (188)
T ss_pred HHHHhhccccccceehhhHHHHH--------HHHHHHHHHHHhHHHHhhhc
Confidence 46777778889999999888765 4555555 455555666665
No 371
>PRK12794 flaF flagellar biosynthesis regulatory protein FlaF; Reviewed
Probab=23.31 E-value=3e+02 Score=20.01 Aligned_cols=67 Identities=15% Similarity=0.195 Sum_probs=43.6
Q ss_pred ChHHHHHHHHhccCChhhHH-----HHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHh------
Q psy16295 1 MTQFNYSFCLVRSEFCADIR-----KGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA------ 69 (128)
Q Consensus 1 ~~~F~YAwaLv~S~~~~d~~-----~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~------ 69 (128)
|+++-|+...-.+.++++.. +++..|++.-..+ ++ ..++. .+|+.+..++..
T Consensus 3 ~a~~AY~~~~~~~~~~Re~E~~~l~~~~~~L~~a~~~~-~~--~~~~~--------------~~AL~~NrrLWt~~~~dl 65 (122)
T PRK12794 3 MAAQAYARAAQPTRTPRETEYQLLAKATRQLKDAQTNG-PD--RFAAL--------------AEALHFNRKLWSIFASDV 65 (122)
T ss_pred hHHHHHHHHHhhcCChHHHHHHHHHHHHHHHHHHHhcC-Cc--cHHHH--------------HHHHHHHHHHHHHHHHHh
Confidence 67888999988888888765 4566666655554 21 11222 256655444433
Q ss_pred cCCCCHHHHHHHHHH
Q psy16295 70 IESENSQVQHLESIV 84 (128)
Q Consensus 70 ~eP~N~QA~~Lk~~I 84 (128)
.+|+|+.-.+||.-|
T Consensus 66 ~~~~N~LP~eLRa~l 80 (122)
T PRK12794 66 ADPDNPLPIELRARI 80 (122)
T ss_pred cCCCCCCCHHHHHHH
Confidence 589999999998765
No 372
>COG1849 Uncharacterized protein conserved in archaea [Function unknown]
Probab=23.25 E-value=45 Score=23.51 Aligned_cols=13 Identities=8% Similarity=0.289 Sum_probs=10.8
Q ss_pred hHHHHHHHHhccC
Q psy16295 2 TQFNYSFCLVRSE 14 (128)
Q Consensus 2 ~~F~YAwaLv~S~ 14 (128)
+.++|||+++.+.
T Consensus 62 a~~sYa~g~lDag 74 (90)
T COG1849 62 AALSYAHGWLDAG 74 (90)
T ss_pred HHHHHHHHHHHHH
Confidence 5789999998764
No 373
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=23.03 E-value=1.8e+02 Score=23.34 Aligned_cols=68 Identities=15% Similarity=0.240 Sum_probs=44.8
Q ss_pred HHHHHHHHHHhhccCcc---ccchHHHHHHHHH-HHhhhhchHHHHHHHHHHHhcCCCCHHHH-HHHHHHHHHH
Q psy16295 20 RKGILLLEELFKTETED---IDRKRDYLYYLAI-GNARIKEYTKSLKYCRAFLAIESENSQVQ-HLESIVKKRM 88 (128)
Q Consensus 20 ~~GI~LL~~l~~~~~~~---~~~~rd~LYyLAv-g~yklgdY~~A~~~~~~lL~~eP~N~QA~-~Lk~~Ie~~i 88 (128)
.-++.+++.+-++..++ -+..+-++---|| .+.+.|+|++|.+..+.+.. +|+++-=+ .|...|.+|=
T Consensus 86 ESAl~v~~~I~~E~~~~~~lhe~i~~lik~~aV~VCm~~g~Fk~A~eiLkr~~~-d~~~~~~r~kL~~II~~Kd 158 (200)
T cd00280 86 ESALMVLESIEKEFSLPETLHEEIRKLIKEQAVAVCMENGEFKKAEEVLKRLFS-DPESQKLRMKLLMIIREKD 158 (200)
T ss_pred HHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhc-CCCchhHHHHHHHHHHccc
Confidence 45677777776665221 0223444444444 57899999999999999999 88887654 4555555543
No 374
>PF11640 TAN: Telomere-length maintenance and DNA damage repair; InterPro: IPR021668 ATM is a large protein kinase, in humans, critical for responding to DNA double-strand breaks (DSBs). Tel1, the orthologue from budding yeast, also regulates responses to DSBs. Tel1 is important for maintaining viability and for phosphorylation of the DNA damage signal transducer kinase Rad53 (an orthologue of mammalian CHK2). In addition to functioning in the response to DSBs, numerous findings indicate that Tel1/ATM regulates telomeres. The overall domain structure of Tel1/ATM is shared by proteins of the phosphatidylinositol 3-kinase (PI3K)-related kinase (PIKK) family, but this family carries a unique and functionally important TAN sequence motif, near its N-terminal, LxxxKxxE/DRxxxL. which is conserved specifically in the Tel1/ATM subclass of the PIKKs. The TAN motif is essential for both telomere length maintenance and Tel1 action in response to DNA damage []. It is classified as an 2.7.11.1 from EC. ; GO: 0004674 protein serine/threonine kinase activity
Probab=22.88 E-value=1.6e+02 Score=21.59 Aligned_cols=33 Identities=15% Similarity=0.244 Sum_probs=29.2
Q ss_pred ChHHHH--HHHHhccCChhhHHHHHHHHHHHhhcc
Q psy16295 1 MTQFNY--SFCLVRSEFCADIRKGILLLEELFKTE 33 (128)
Q Consensus 1 ~~~F~Y--AwaLv~S~~~~d~~~GI~LL~~l~~~~ 33 (128)
|..++. +..++.|..-.|-.+|++-|+.++++.
T Consensus 1 M~~~~i~~~~~~L~S~k~keR~~al~~L~~il~~~ 35 (155)
T PF11640_consen 1 MSTLDINSILRLLSSDKIKERNKALEDLRHILSSP 35 (155)
T ss_pred CccchHHHHHHHHhccccchHHHHHHHHHHHHcCc
Confidence 556666 888999999999999999999999986
No 375
>KOG1086|consensus
Probab=22.39 E-value=2.2e+02 Score=25.97 Aligned_cols=51 Identities=22% Similarity=0.242 Sum_probs=38.5
Q ss_pred HHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHH
Q psy16295 8 FCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFL 68 (128)
Q Consensus 8 waLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL 68 (128)
-.|.+|+.|+|.+.+=.|...+++++ . .+-+. -+-|.+-.+++++-|+.+=
T Consensus 179 ~rLLkSn~PeDLqaANkLIK~lVkee-e---~k~eK------iskR~~aleev~n~vk~l~ 229 (594)
T KOG1086|consen 179 ARLLKSNHPEDLQAANKLIKTLVKEE-E---HKLEK------ISKRVKALEEVNNNVKLLE 229 (594)
T ss_pred HHHHhcCChHHHHHHHHHHHHHHHHH-H---HHHHH------HHHHHHHHHHHHHHHHHHH
Confidence 36899999999999999999999997 3 34333 3566777777777765543
No 376
>KOG4459|consensus
Probab=22.21 E-value=71 Score=28.69 Aligned_cols=39 Identities=21% Similarity=0.373 Sum_probs=34.7
Q ss_pred chHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 39 RKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 39 ~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
+.|.-+-||=++|||.|++.+|.......|-..|++.-.
T Consensus 131 ~~r~py~YL~~ay~k~~~l~kAv~aa~tflv~~Pdde~i 169 (471)
T KOG4459|consen 131 RPRLPYQYLQFAYFKVGELEKAVAAAHTFLVANPDDEDI 169 (471)
T ss_pred ccchHHHHHHHHHHHhhhHHHHHHhcceeeecCCcHHHH
Confidence 446778899999999999999999999999999998644
No 377
>COG1963 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.13 E-value=2.5e+02 Score=21.65 Aligned_cols=19 Identities=16% Similarity=0.331 Sum_probs=13.5
Q ss_pred CCHHHHHHHHHHHHHHhhhH
Q psy16295 73 ENSQVQHLESIVKKRMETEG 92 (128)
Q Consensus 73 ~N~QA~~Lk~~Ie~~i~kdG 92 (128)
.-.||+-|.++|++ +.+|+
T Consensus 96 aG~QA~iLN~l~~~-~~~e~ 114 (153)
T COG1963 96 AGVQARILNQLIEE-LVNEK 114 (153)
T ss_pred ccchHHHHHHHHHH-HHHhh
Confidence 45589999999887 44444
No 378
>PF12323 HTH_OrfB_IS605: Helix-turn-helix domain; InterPro: IPR021027 This entry represents an N-terminal helix-turn-helix domain found in a variety of putative transposases [, , ]. It is usually associated with PF01385 from PFAM and PF07282 from PFAM.
Probab=22.09 E-value=62 Score=18.98 Aligned_cols=17 Identities=18% Similarity=0.169 Sum_probs=14.7
Q ss_pred hcCCCCHHHHHHHHHHH
Q psy16295 69 AIESENSQVQHLESIVK 85 (128)
Q Consensus 69 ~~eP~N~QA~~Lk~~Ie 85 (128)
++.|++.|...|.+.+.
T Consensus 8 rl~Pt~~Q~~~L~~~~~ 24 (46)
T PF12323_consen 8 RLYPTKEQEEKLERWFG 24 (46)
T ss_pred EEecCHHHHHHHHHHHH
Confidence 57899999999998875
No 379
>PRK13807 maltose phosphorylase; Provisional
Probab=21.71 E-value=52 Score=30.89 Aligned_cols=63 Identities=19% Similarity=0.351 Sum_probs=45.4
Q ss_pred cchHHHHHHH--------------HHHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhhhHHHHHHHHHHHH
Q psy16295 38 DRKRDYLYYL--------------AIGNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMETEGLKGMALAGGAV 103 (128)
Q Consensus 38 ~~~rd~LYyL--------------Avg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~kdGliG~~i~gga~ 103 (128)
..+++.-||. |+...++|+.++|-+++...+.+|.+|.|. -+.+|+= ++..||+.
T Consensus 609 ~~~~n~~yYe~rT~HdSsLS~~i~ai~aa~~g~~~~A~~~f~~a~~~Dl~~~~~----------~t~~GiH-~~~~gG~w 677 (756)
T PRK13807 609 EKRRNFDFYEPLTVHESSLSPCVHSILAAELGKEDKAVELYLRTARLDLDNYNN----------DTEDGLH-ITSMAGSW 677 (756)
T ss_pred HHHHHHHHhcCcccCCCCCCHHHHHHHHHhcCCHHHHHHHHHHHHHhhhhhccC----------CCCcchh-hhcchHHH
Confidence 3677777876 557899999999999999999998776541 2446764 66666654
Q ss_pred --HHHHhhHH
Q psy16295 104 --LAQGQGTM 111 (128)
Q Consensus 104 --~~~g~~~~ 111 (128)
++-|.+|.
T Consensus 678 ~~vv~Gf~Gl 687 (756)
T PRK13807 678 LAIVQGFAGM 687 (756)
T ss_pred HHHHheecce
Confidence 45555554
No 380
>PF03241 HpaB: 4-hydroxyphenylacetate 3-hydroxylase C terminal; InterPro: IPR024719 This C-terminal domain is found in HpaB, which encodes part of the 4-hydroxyphenylacetate 3-hydroxylase from Escherichia coli []. The enzyme is NADH-dependent and uses FAD as the redox chromophore. This domain is also found in pyoverdin chromophore biosynthetic protein PvcC, which may play a role in one of the proposed hydroxylation steps of pyoverdine chromophore biosynthesis [] and in 4-hydroxybutyryl-CoA dehydratase (4-BUDH), a key enzyme in the metabolism of gamma-aminobutyrate [].; PDB: 3HWC_D 1U8V_D 2YYM_A 2YYI_A 2YYJ_A 2YYL_A 2YYG_A 2YYK_A.
Probab=21.56 E-value=2e+02 Score=22.36 Aligned_cols=54 Identities=26% Similarity=0.260 Sum_probs=36.0
Q ss_pred HHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhc
Q psy16295 3 QFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAI 70 (128)
Q Consensus 3 ~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~ 70 (128)
-|+++|-|+-|+--. -..+-+..-.++| ... .+..|+.-|++.+.+.++.+|.+
T Consensus 152 l~rLawDl~~s~fg~-----r~~~ye~~~~G~p--~~~-------~~~~~~~yd~~~~~~~v~~~l~~ 205 (205)
T PF03241_consen 152 LFRLAWDLTGSEFGG-----RQELYERFYAGSP--VRQ-------RIMLYRSYDLEEAKALVKRFLGI 205 (205)
T ss_dssp HHHHHHHHHTSHHHH-----HHHHHHHHTTTHH--HHH-------HHHHHHHS-HHHHHHHHHHHCC-
T ss_pred HHHHHHHHhcCcchh-----hhHhhhhhcCCCH--HHH-------HHHHHHHcChHHHHHHHHHHhcC
Confidence 489999999987533 3344455555535 232 35667888999999999988753
No 381
>PHA02537 M terminase endonuclease subunit; Provisional
Probab=21.21 E-value=4.5e+02 Score=21.26 Aligned_cols=50 Identities=12% Similarity=0.161 Sum_probs=39.1
Q ss_pred chHHHHHHHHHHHhhh-----------hchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHh
Q psy16295 39 RKRDYLYYLAIGNARI-----------KEYTKSLKYCRAFLAIESENSQVQHLESIVKKRME 89 (128)
Q Consensus 39 ~~rd~LYyLAvg~ykl-----------gdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~ 89 (128)
++...-+|-+.|+.-+ ++...|+.++++.++++| +-.++...+.|++++.
T Consensus 165 d~vrAKl~K~~G~~llr~~~g~~~~d~~~l~~Al~~L~rA~~l~~-k~GVK~~i~~l~~~lr 225 (230)
T PHA02537 165 DEVRAKLYKAAGYLLLRNEKGEPIGDAETLQLALALLQRAFQLND-KCGVKKDIERLERRLK 225 (230)
T ss_pred hHHHHHHHHHHHHHHhhcccCCCccCcccHHHHHHHHHHHHHhCC-CCChHHHHHHHHHHHh
Confidence 5667777777787653 467899999999999996 4567777788888775
No 382
>COG3014 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.19 E-value=1.7e+02 Score=25.94 Aligned_cols=31 Identities=13% Similarity=0.010 Sum_probs=26.4
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhcC
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAIE 71 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~e 71 (128)
.=.-||+|+.|.+..||+.||--.++.++.+
T Consensus 125 ~~~n~YkaLNYm~~nD~~~ArVEfnRan~rQ 155 (449)
T COG3014 125 VLINYYKALNYMLLNDSAKARVEFNRANERQ 155 (449)
T ss_pred HHHHHHHHhhHHHhcchhhhHHHHHHHHHHH
Confidence 3356899999999999999999888887654
No 383
>COG3671 Predicted membrane protein [Function unknown]
Probab=21.16 E-value=1.2e+02 Score=22.59 Aligned_cols=29 Identities=10% Similarity=-0.153 Sum_probs=24.2
Q ss_pred hHHHHHHHHHHHHHHHHhhHHHhhhhhhc
Q psy16295 91 EGLKGMALAGGAVLAQGQGTMEMSCKSYK 119 (128)
Q Consensus 91 dGliG~~i~gga~~~~g~~~~~~~~~~~~ 119 (128)
=+|-+.|++.|..-.+|++-++++|++-.
T Consensus 28 Y~Ly~~G~v~git~lvgvi~AYv~rd~~~ 56 (125)
T COG3671 28 YILYLLGAVTGITPLVGVIFAYVNRDKAD 56 (125)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhccccccc
Confidence 46778899999889999999999998543
No 384
>KOG4279|consensus
Probab=21.01 E-value=46 Score=32.30 Aligned_cols=80 Identities=18% Similarity=0.294 Sum_probs=62.3
Q ss_pred hHHHHHHHHhccCChhhHHHHHHHHHHHhhccCccccchHHHH---------HHHHHHHhhhhchHHHHHHHHHHHhcCC
Q psy16295 2 TQFNYSFCLVRSEFCADIRKGILLLEELFKTETEDIDRKRDYL---------YYLAIGNARIKEYTKSLKYCRAFLAIES 72 (128)
Q Consensus 2 ~~F~YAwaLv~S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~L---------YyLAvg~yklgdY~~A~~~~~~lL~~eP 72 (128)
.+|.||+||-|-+-+.|-.++++.+-.++..+.| ...|-+ -+++-++---+.-+.|.+.+...+++||
T Consensus 242 v~f~YaFALNRRNr~GDRakAL~~~l~lve~eg~---vapDm~Cl~GRIYKDmF~~S~ytDa~s~~~a~~WyrkaFeveP 318 (1226)
T KOG4279|consen 242 VRFHYAFALNRRNRPGDRAKALNTVLPLVEKEGP---VAPDMYCLCGRIYKDMFIASNYTDAESLNHAIEWYRKAFEVEP 318 (1226)
T ss_pred eEEEeeehhcccCCCccHHHHHHHHHHHHHhcCC---CCCceeeeechhhhhhhhccCCcchhhHHHHHHHHHHHhccCc
Confidence 4799999999999999999999999888887533 233321 3566666667777888899999999999
Q ss_pred CCHHHHHHHHHH
Q psy16295 73 ENSQVQHLESIV 84 (128)
Q Consensus 73 ~N~QA~~Lk~~I 84 (128)
.-.-...|-.+.
T Consensus 319 ~~~sGIN~atLL 330 (1226)
T KOG4279|consen 319 LEYSGINLATLL 330 (1226)
T ss_pred hhhccccHHHHH
Confidence 887776666554
No 385
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=21.01 E-value=4.4e+02 Score=21.66 Aligned_cols=26 Identities=23% Similarity=0.286 Sum_probs=16.2
Q ss_pred HHHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 44 LYYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 44 LYyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
.|=||.+..++|+|.||+..+++.|.
T Consensus 92 r~rLa~al~elGr~~EA~~hy~qals 117 (251)
T COG4700 92 RYRLANALAELGRYHEAVPHYQQALS 117 (251)
T ss_pred HHHHHHHHHHhhhhhhhHHHHHHHhc
Confidence 35566666666666666666555554
No 386
>PF07980 SusD: SusD family; InterPro: IPR012944 This domain occurs in several hypothetical proteins. It also occurs in RagB, Q9ZA59 from SWISSPROT, a protein involved in signalling [] and SusD, Q8A1G2 from SWISSPROT, an outer membrane protein involved in nutrient binding [].; PDB: 3IHV_A 3LEW_A 3JQ1_A 3JQ0_A 3NQP_B 3SNX_A 3L22_A 3OTN_A 3IV0_A 3QNK_C ....
Probab=20.92 E-value=3.7e+02 Score=20.14 Aligned_cols=32 Identities=19% Similarity=0.178 Sum_probs=28.5
Q ss_pred cchHHHHHHHHHHHhhhhchHHHHHHHHHHHh
Q psy16295 38 DRKRDYLYYLAIGNARIKEYTKSLKYCRAFLA 69 (128)
Q Consensus 38 ~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~ 69 (128)
.+--|-+...|=+.+++|+.+.|+.+++.+-+
T Consensus 130 ~R~aEvyL~~AEA~~~~g~~~~A~~~lN~vR~ 161 (266)
T PF07980_consen 130 MRLAEVYLIYAEALARLGNTAEALEYLNQVRK 161 (266)
T ss_dssp EEHHHHHHHHHHHHHHTTSHHHHHHHHHHHHH
T ss_pred eEHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 45678899999999999999999999999874
No 387
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=20.73 E-value=2.9e+02 Score=18.96 Aligned_cols=33 Identities=12% Similarity=0.048 Sum_probs=25.9
Q ss_pred HHHhhhhchHHHHHHHHHHHhcCCCCHHHHHHH
Q psy16295 49 IGNARIKEYTKSLKYCRAFLAIESENSQVQHLE 81 (128)
Q Consensus 49 vg~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk 81 (128)
+=.|...+..+|++.|...|+.-|+.++.=.+.
T Consensus 14 lkLY~~~~~~~Al~~W~~aL~k~~~~~~rf~~l 46 (80)
T PF10579_consen 14 LKLYHQNETQQALQKWRKALEKITDREDRFRVL 46 (80)
T ss_pred HHHhccchHHHHHHHHHHHHhhcCChHHHHHHH
Confidence 335667788999999999999999988754443
No 388
>PF09813 Coiled-coil_56: Coiled-coil domain-containing protein 56; InterPro: IPR018628 Members of this family of proteins have no known function.
Probab=20.62 E-value=2.9e+02 Score=19.85 Aligned_cols=22 Identities=27% Similarity=0.466 Sum_probs=15.9
Q ss_pred HHHHHHHHHHhhhHHHHHHHHH
Q psy16295 79 HLESIVKKRMETEGLKGMALAG 100 (128)
Q Consensus 79 ~Lk~~Ie~~i~kdGliG~~i~g 100 (128)
.++........+..+.|+|+.|
T Consensus 39 ~~kr~~~~~R~rN~~Tgl~L~~ 60 (100)
T PF09813_consen 39 QLKRKLQRRRRRNLLTGLALGA 60 (100)
T ss_pred HHHHHHHHHhhhhHHHHHHHHH
Confidence 3445666667789999998776
No 389
>KOG2058|consensus
Probab=20.61 E-value=1.2e+02 Score=26.85 Aligned_cols=47 Identities=17% Similarity=0.229 Sum_probs=31.5
Q ss_pred HHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCH
Q psy16295 24 LLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENS 75 (128)
Q Consensus 24 ~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~ 75 (128)
.-|+.++|.+.|+ +.+.+.|-+++= .++..+| ..+++.+|+..|+..
T Consensus 147 ~elk~liRkGiP~-~~R~~VW~~~~g-~~~~~~~---~~~yq~ll~~~~~~~ 193 (436)
T KOG2058|consen 147 DELKRLIRKGIPP-ELRGEVWWVLSG-ARRQLNY---PGYYQELLRKGDEKK 193 (436)
T ss_pred HHHHHHHHcCCCh-hhhhHHHHHHhc-chhhccC---chhHHHHHhcCCCcc
Confidence 4477788887774 677788777765 5565555 566666666655544
No 390
>PRK13184 pknD serine/threonine-protein kinase; Reviewed
Probab=20.48 E-value=6.6e+02 Score=24.56 Aligned_cols=83 Identities=19% Similarity=0.218 Sum_probs=59.9
Q ss_pred hHHHHHHHHhc----cCChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHHHHHHHhcCCCCHHH
Q psy16295 2 TQFNYSFCLVR----SEFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKYCRAFLAIESENSQV 77 (128)
Q Consensus 2 ~~F~YAwaLv~----S~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP~N~QA 77 (128)
++|+...+|.- ..|+++..+++.-++.|-.. | ..+-|||= =|+.|-+++||+|=.+++...|+--|+.+--
T Consensus 514 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 588 (932)
T PRK13184 514 AQFRLGITLLEKASEQGDPRDFTQALSEFSYLHGG--V--GAPLEYLG-KALVYQRLGEYNEEIKSLLLALKRYSQHPEI 588 (932)
T ss_pred HHHHhhHHHHHHHHhcCChHHHHHHHHHHHHhcCC--C--CCchHHHh-HHHHHHHhhhHHHHHHHHHHHHHhcCCCCcc
Confidence 35666666543 24455666666666654333 2 24455543 3889999999999999999999999999999
Q ss_pred HHHHHHHHHHHh
Q psy16295 78 QHLESIVKKRME 89 (128)
Q Consensus 78 ~~Lk~~Ie~~i~ 89 (128)
-.|++-|-.++-
T Consensus 589 ~~~~~~~~~~~~ 600 (932)
T PRK13184 589 SRLRDHLVYRLH 600 (932)
T ss_pred HHHHHHHHHHHH
Confidence 999988877764
No 391
>PRK14750 kdpF potassium-transporting ATPase subunit F; Provisional
Probab=20.44 E-value=1.6e+02 Score=16.63 Aligned_cols=17 Identities=18% Similarity=0.049 Sum_probs=9.9
Q ss_pred HHHHHHHHHHHHhhHHH
Q psy16295 96 MALAGGAVLAQGQGTME 112 (128)
Q Consensus 96 ~~i~gga~~~~g~~~~~ 112 (128)
+++++|++++.-..|.+
T Consensus 3 ~~vi~g~llv~lLl~YL 19 (29)
T PRK14750 3 FSIVCGALLVLLLLGYL 19 (29)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 45677777655555443
No 392
>PF13179 DUF4006: Family of unknown function (DUF4006)
Probab=20.43 E-value=1.6e+02 Score=19.68 Aligned_cols=13 Identities=31% Similarity=0.496 Sum_probs=9.0
Q ss_pred hHHHHHHHHHHHH
Q psy16295 91 EGLKGMALAGGAV 103 (128)
Q Consensus 91 dGliG~~i~gga~ 103 (128)
+|+.|+-|+.-..
T Consensus 11 nGi~G~LIAvvLL 23 (66)
T PF13179_consen 11 NGITGMLIAVVLL 23 (66)
T ss_pred cchHhHHHHHHHH
Confidence 6899987765443
No 393
>PF12921 ATP13: Mitochondrial ATPase expression; InterPro: IPR024319 ATPase expression protein 2 (also known as ATP13 in some species) is necessary for the expression of subunit 9 of mitochondrial ATPase. The protein has a basic amino terminal signal sequence that is cleaved upon import into mitochondria [].
Probab=20.28 E-value=2e+02 Score=20.68 Aligned_cols=32 Identities=13% Similarity=-0.009 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHhhhhchHHHHHHHHHHHhcCC
Q psy16295 41 RDYLYYLAIGNARIKEYTKSLKYCRAFLAIES 72 (128)
Q Consensus 41 rd~LYyLAvg~yklgdY~~A~~~~~~lL~~eP 72 (128)
.+-|.=++.++...+++..|+++++.+.+.-|
T Consensus 52 ~~lL~AIv~sf~~n~~i~~al~~vd~fs~~Y~ 83 (126)
T PF12921_consen 52 SRLLIAIVHSFGYNGDIFSALKLVDFFSRKYP 83 (126)
T ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHHHHHcC
Confidence 78899999999999999999999999888766
No 394
>PF03564 DUF1759: Protein of unknown function (DUF1759); InterPro: IPR005312 This is a small family of proteins of unknown function.
Probab=20.21 E-value=3.2e+02 Score=19.18 Aligned_cols=68 Identities=22% Similarity=0.296 Sum_probs=36.4
Q ss_pred HHhcc-CChhhHHHHHHHHHHHhhccCccccchHHHHHHHHHHHhhhhchHHHHHH---------------HHHHHhcCC
Q psy16295 9 CLVRS-EFCADIRKGILLLEELFKTETEDIDRKRDYLYYLAIGNARIKEYTKSLKY---------------CRAFLAIES 72 (128)
Q Consensus 9 aLv~S-~~~~d~~~GI~LL~~l~~~~~~~~~~~rd~LYyLAvg~yklgdY~~A~~~---------------~~~lL~~eP 72 (128)
.+|+. ++-.+..+-. .|+..+++. + .+++-.+.+ --.+|++|++. .+.+.++.|
T Consensus 17 ~~v~~n~~~~d~~K~~-~L~~~L~G~-A-----~~~i~~~~~---~~~~Y~~a~~~L~~~yg~~~~i~~~~~~~l~~l~~ 86 (145)
T PF03564_consen 17 SLVHENPDLSDIEKLN-YLRSCLKGE-A-----KELIRGLPL---SEENYEEAWELLEERYGNPRRIIQALLEELRNLPP 86 (145)
T ss_pred HHHhcccCCCHHHHHH-HHHHHhcch-H-----HHHHHcccc---cchhhHHHHHHHHHHhCCchHHHHHHHHHHhcccc
Confidence 35566 5555555543 355566555 2 333333322 34555555544 666777774
Q ss_pred -CCHHHHHHHHHHHH
Q psy16295 73 -ENSQVQHLESIVKK 86 (128)
Q Consensus 73 -~N~QA~~Lk~~Ie~ 86 (128)
.+....+|+..+++
T Consensus 87 ~~~~d~~~L~~~~~~ 101 (145)
T PF03564_consen 87 ISNDDPEALRSLVDK 101 (145)
T ss_pred ccchhHHHHHHHHHH
Confidence 56666666666553
No 395
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.11 E-value=2.1e+02 Score=23.07 Aligned_cols=36 Identities=6% Similarity=-0.078 Sum_probs=28.9
Q ss_pred HHhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHH
Q psy16295 50 GNARIKEYTKSLKYCRAFLAIESENSQVQHLESIVK 85 (128)
Q Consensus 50 g~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie 85 (128)
.....||-.+||..|+..|+.+|++.+.+.|.=.++
T Consensus 168 ill~kg~k~~Ar~ay~kAl~~~~s~~~~~~lqmKLn 203 (207)
T COG2976 168 ILLAKGDKQEARAAYEKALESDASPAAREILQMKLN 203 (207)
T ss_pred HHHHcCchHHHHHHHHHHHHccCChHHHHHHHhHHH
Confidence 346789999999999999999988887776654443
No 396
>PF06195 DUF996: Protein of unknown function (DUF996); InterPro: IPR010397 This is a family of uncharacterised bacterial and archaeal proteins.
Probab=20.08 E-value=1.8e+02 Score=21.45 Aligned_cols=32 Identities=22% Similarity=-0.028 Sum_probs=23.1
Q ss_pred HHHHHHHHHH-HHHHhhHHHhhhhhhcchhhhh
Q psy16295 94 KGMALAGGAV-LAQGQGTMEMSCKSYKNTASVL 125 (128)
Q Consensus 94 iG~~i~gga~-~~~g~~~~~~~~~~~~~~~~~~ 125 (128)
++..+++-.+ .++.+++++.=+|||.+.++..
T Consensus 74 ~~~~l~~~vi~~v~~Iisa~f~kks~~~l~~~t 106 (139)
T PF06195_consen 74 IASFLAGLVILWVLFIISAYFLKKSYDLLAEYT 106 (139)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 5555555444 5667889999999999977643
No 397
>KOG2053|consensus
Probab=20.06 E-value=1.6e+02 Score=28.78 Aligned_cols=40 Identities=13% Similarity=0.117 Sum_probs=35.8
Q ss_pred HhhhhchHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhh
Q psy16295 51 NARIKEYTKSLKYCRAFLAIESENSQVQHLESIVKKRMET 90 (128)
Q Consensus 51 ~yklgdY~~A~~~~~~lL~~eP~N~QA~~Lk~~Ie~~i~k 90 (128)
..-.+++.+|+.-++.+|+..|+-.=|+.|+.++--++.|
T Consensus 19 ~ld~~qfkkal~~~~kllkk~Pn~~~a~vLkaLsl~r~gk 58 (932)
T KOG2053|consen 19 LLDSSQFKKALAKLGKLLKKHPNALYAKVLKALSLFRLGK 58 (932)
T ss_pred HhhhHHHHHHHHHHHHHHHHCCCcHHHHHHHHHHHHHhcC
Confidence 3445788899999999999999999999999999988876
Done!