Query psy17695
Match_columns 222
No_of_seqs 204 out of 1678
Neff 8.7
Searched_HMMs 46136
Date Fri Aug 16 20:36:40 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17695.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17695hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4626|consensus 100.0 3.7E-52 8E-57 363.8 17.1 211 2-222 450-690 (966)
2 COG3914 Spy Predicted O-linked 100.0 2.1E-36 4.6E-41 264.2 14.3 213 1-222 94-367 (620)
3 PF13844 Glyco_transf_41: Glyc 100.0 1.4E-31 3.1E-36 234.4 6.2 82 138-222 1-109 (468)
4 PF13414 TPR_11: TPR repeat; P 99.4 2.5E-12 5.3E-17 84.8 6.7 68 6-73 1-69 (69)
5 PRK15359 type III secretion sy 99.3 1.3E-11 2.7E-16 93.6 7.1 73 2-74 52-124 (144)
6 PRK15363 pathogenicity island 99.2 3.5E-11 7.7E-16 91.3 7.4 72 2-73 63-134 (157)
7 PLN03098 LPA1 LOW PSII ACCUMUL 99.1 1.9E-10 4E-15 100.3 9.1 69 3-71 70-141 (453)
8 PF13432 TPR_16: Tetratricopep 99.1 3.6E-10 7.8E-15 73.5 5.9 63 12-74 1-63 (65)
9 KOG4626|consensus 99.0 4.8E-10 1E-14 100.3 7.5 75 1-75 313-387 (966)
10 PRK11189 lipoprotein NlpI; Pro 99.0 1.1E-09 2.5E-14 92.5 8.3 73 2-74 92-164 (296)
11 PRK10370 formate-dependent nit 99.0 1.6E-09 3.4E-14 86.5 6.9 75 1-75 66-143 (198)
12 KOG0553|consensus 98.9 1.2E-09 2.6E-14 90.3 5.3 74 1-74 108-181 (304)
13 PRK15359 type III secretion sy 98.9 3.9E-09 8.3E-14 80.0 6.8 79 8-94 24-102 (144)
14 TIGR02552 LcrH_SycD type III s 98.9 5.7E-09 1.2E-13 77.4 6.6 71 3-73 12-82 (135)
15 PLN03088 SGT1, suppressor of 98.9 7.1E-09 1.5E-13 89.9 8.0 74 1-74 29-102 (356)
16 PRK12370 invasion protein regu 98.8 8.3E-09 1.8E-13 94.4 8.2 74 1-74 331-404 (553)
17 PRK10370 formate-dependent nit 98.8 1.8E-08 3.9E-13 80.3 8.1 73 2-74 101-176 (198)
18 KOG1125|consensus 98.8 9.1E-09 2E-13 91.3 5.6 74 1-74 457-530 (579)
19 PF13424 TPR_12: Tetratricopep 98.8 8.5E-09 1.8E-13 69.4 4.1 69 5-73 2-77 (78)
20 PRK15363 pathogenicity island 98.8 1.4E-08 3.1E-13 77.2 5.7 70 5-74 32-101 (157)
21 COG3063 PilF Tfp pilus assembl 98.8 1.9E-08 4.1E-13 80.4 6.2 100 1-106 62-161 (250)
22 PF13428 TPR_14: Tetratricopep 98.7 2.4E-08 5.3E-13 59.9 4.3 43 8-50 1-43 (44)
23 PF13371 TPR_9: Tetratricopept 98.7 4.1E-08 8.9E-13 65.1 5.8 61 14-74 1-61 (73)
24 PRK11189 lipoprotein NlpI; Pro 98.7 6.5E-08 1.4E-12 81.8 8.1 70 6-75 62-131 (296)
25 TIGR02552 LcrH_SycD type III s 98.7 5.7E-08 1.2E-12 72.0 6.8 73 2-74 45-117 (135)
26 PRK09782 bacteriophage N4 rece 98.7 5E-08 1.1E-12 94.2 8.0 73 2-74 637-709 (987)
27 COG5010 TadD Flp pilus assembl 98.6 1.2E-07 2.7E-12 77.0 6.8 84 3-94 95-178 (257)
28 PRK15179 Vi polysaccharide bio 98.6 1.8E-07 3.9E-12 87.3 8.8 70 2-71 148-217 (694)
29 COG5010 TadD Flp pilus assembl 98.6 2.2E-07 4.8E-12 75.5 7.8 69 2-70 128-196 (257)
30 PF14559 TPR_19: Tetratricopep 98.6 7.6E-08 1.6E-12 62.9 4.1 57 18-74 1-57 (68)
31 PF13431 TPR_17: Tetratricopep 98.6 7.4E-08 1.6E-12 54.5 3.2 33 30-62 1-33 (34)
32 PRK09782 bacteriophage N4 rece 98.5 1.5E-07 3.2E-12 91.0 7.0 73 2-75 604-676 (987)
33 cd00189 TPR Tetratricopeptide 98.5 2.1E-07 4.5E-12 62.4 5.6 64 10-73 2-65 (100)
34 TIGR00990 3a0801s09 mitochondr 98.5 2.9E-07 6.4E-12 85.2 8.2 72 3-74 360-431 (615)
35 TIGR00990 3a0801s09 mitochondr 98.5 3E-07 6.5E-12 85.2 8.0 73 2-74 393-465 (615)
36 PF00515 TPR_1: Tetratricopept 98.5 1.3E-07 2.8E-12 53.3 3.4 34 8-41 1-34 (34)
37 KOG1126|consensus 98.5 6E-08 1.3E-12 87.3 2.8 74 2-75 415-488 (638)
38 PRK12370 invasion protein regu 98.5 2.8E-07 6.1E-12 84.4 7.0 75 1-75 288-371 (553)
39 TIGR02795 tol_pal_ybgF tol-pal 98.5 6.7E-07 1.5E-11 64.2 7.4 66 8-73 39-107 (119)
40 PRK15174 Vi polysaccharide exp 98.5 5.2E-07 1.1E-11 84.3 8.4 73 2-74 278-350 (656)
41 PRK15179 Vi polysaccharide bio 98.5 3.9E-07 8.4E-12 85.1 7.4 73 2-74 114-186 (694)
42 CHL00033 ycf3 photosystem I as 98.4 1.4E-06 3.1E-11 67.3 8.9 67 8-74 35-104 (168)
43 COG3063 PilF Tfp pilus assembl 98.4 4.4E-07 9.6E-12 72.7 5.9 75 1-75 96-172 (250)
44 PF07719 TPR_2: Tetratricopept 98.4 3.8E-07 8.3E-12 51.1 4.1 34 8-41 1-34 (34)
45 COG4235 Cytochrome c biogenesi 98.4 6.1E-07 1.3E-11 74.5 6.6 74 2-75 150-226 (287)
46 TIGR02521 type_IV_pilW type IV 98.4 1.2E-06 2.5E-11 69.4 8.1 66 4-69 61-126 (234)
47 PF12895 Apc3: Anaphase-promot 98.4 2.4E-07 5.1E-12 63.4 3.5 61 7-68 24-84 (84)
48 PRK02603 photosystem I assembl 98.4 1E-06 2.2E-11 68.5 7.5 72 3-74 30-104 (172)
49 KOG1126|consensus 98.4 5.3E-07 1.2E-11 81.3 5.5 74 1-74 482-555 (638)
50 PRK15174 Vi polysaccharide exp 98.4 1.4E-06 3E-11 81.4 8.4 73 2-74 240-316 (656)
51 PF13432 TPR_16: Tetratricopep 98.3 5.4E-07 1.2E-11 58.3 3.6 41 2-42 25-65 (65)
52 cd00189 TPR Tetratricopeptide 98.3 1.6E-06 3.4E-11 58.0 5.9 72 2-73 28-99 (100)
53 KOG1125|consensus 98.3 9.4E-07 2E-11 78.7 5.9 70 6-75 428-497 (579)
54 KOG4162|consensus 98.3 2.1E-06 4.6E-11 78.7 8.0 83 1-90 711-795 (799)
55 PRK10803 tol-pal system protei 98.3 2.3E-06 5E-11 71.2 7.6 72 3-74 172-249 (263)
56 PRK15331 chaperone protein Sic 98.3 1.2E-06 2.6E-11 67.1 5.2 69 2-70 65-133 (165)
57 TIGR02795 tol_pal_ybgF tol-pal 98.3 4.2E-06 9.2E-11 59.9 7.6 67 8-74 2-71 (119)
58 CHL00033 ycf3 photosystem I as 98.3 4.1E-06 8.8E-11 64.8 7.9 68 7-74 71-138 (168)
59 PRK11906 transcriptional regul 98.3 2.9E-06 6.3E-11 74.6 7.3 74 2-75 332-405 (458)
60 PRK02603 photosystem I assembl 98.2 5.2E-06 1.1E-10 64.5 8.0 56 7-62 71-126 (172)
61 KOG1173|consensus 98.2 2.5E-06 5.4E-11 76.0 6.6 70 6-75 453-522 (611)
62 KOG1155|consensus 98.2 3.6E-06 7.7E-11 73.6 7.2 66 4-69 394-459 (559)
63 PRK10049 pgaA outer membrane p 98.2 3.6E-06 7.8E-11 80.0 7.8 73 2-74 387-459 (765)
64 TIGR03302 OM_YfiO outer membra 98.2 7.9E-06 1.7E-10 66.3 8.8 71 4-74 29-102 (235)
65 PF06552 TOM20_plant: Plant sp 98.2 2.8E-06 6E-11 65.8 5.3 70 3-72 20-103 (186)
66 TIGR02521 type_IV_pilW type IV 98.2 7.9E-06 1.7E-10 64.6 8.2 69 6-74 29-97 (234)
67 PF12688 TPR_5: Tetratrico pep 98.2 7.4E-06 1.6E-10 60.0 7.3 67 8-74 1-70 (120)
68 PLN03088 SGT1, suppressor of 98.2 4.3E-06 9.3E-11 72.6 7.0 64 11-74 5-68 (356)
69 KOG0543|consensus 98.2 4.1E-06 8.9E-11 72.1 6.7 67 9-75 258-324 (397)
70 KOG0553|consensus 98.2 2.2E-06 4.8E-11 71.1 4.7 68 8-75 81-148 (304)
71 KOG1155|consensus 98.2 3.4E-06 7.3E-11 73.7 6.0 74 1-74 357-430 (559)
72 cd05804 StaR_like StaR_like; a 98.1 8.3E-06 1.8E-10 70.0 7.8 71 3-73 109-179 (355)
73 PRK10153 DNA-binding transcrip 98.1 9.3E-06 2E-10 73.7 8.3 71 3-74 415-485 (517)
74 KOG0548|consensus 98.1 4E-06 8.8E-11 74.2 5.7 73 2-74 386-458 (539)
75 PLN02789 farnesyltranstransfer 98.1 8.1E-06 1.8E-10 69.8 7.0 73 2-74 100-174 (320)
76 TIGR03302 OM_YfiO outer membra 98.1 5.8E-06 1.3E-10 67.1 5.9 73 2-74 61-147 (235)
77 PF13429 TPR_15: Tetratricopep 98.1 9.7E-06 2.1E-10 67.7 7.0 70 5-74 143-212 (280)
78 KOG0547|consensus 98.1 6.1E-06 1.3E-10 72.6 5.7 73 2-74 388-460 (606)
79 PRK10049 pgaA outer membrane p 98.1 1.1E-05 2.5E-10 76.6 8.0 72 2-74 77-148 (765)
80 PRK11447 cellulose synthase su 98.1 8.2E-06 1.8E-10 80.9 7.0 71 3-73 598-668 (1157)
81 PF13414 TPR_11: TPR repeat; P 98.1 1.7E-06 3.7E-11 56.6 1.4 38 2-39 31-69 (69)
82 PF13371 TPR_9: Tetratricopept 98.0 7E-06 1.5E-10 54.1 4.3 49 1-49 22-70 (73)
83 TIGR02917 PEP_TPR_lipo putativ 98.0 1.7E-05 3.7E-10 74.9 8.0 72 2-74 798-869 (899)
84 KOG0547|consensus 98.0 1.3E-05 2.8E-10 70.6 6.0 68 8-75 115-182 (606)
85 PF13181 TPR_8: Tetratricopept 98.0 8.3E-06 1.8E-10 45.6 3.2 34 8-41 1-34 (34)
86 PLN02789 farnesyltranstransfer 98.0 1.9E-05 4.2E-10 67.5 6.7 74 1-74 135-215 (320)
87 PF13429 TPR_15: Tetratricopep 98.0 3.8E-06 8.2E-11 70.2 2.3 73 2-74 174-246 (280)
88 PRK11447 cellulose synthase su 97.9 2.2E-05 4.8E-10 77.9 7.6 73 2-74 631-703 (1157)
89 PRK14720 transcript cleavage f 97.9 2.6E-05 5.6E-10 74.3 7.6 68 4-72 112-179 (906)
90 PRK10747 putative protoheme IX 97.9 3.8E-05 8.2E-10 67.6 8.0 72 1-73 321-392 (398)
91 TIGR00540 hemY_coli hemY prote 97.9 4E-05 8.8E-10 67.7 7.9 70 2-72 327-400 (409)
92 TIGR02917 PEP_TPR_lipo putativ 97.9 3.5E-05 7.6E-10 72.7 8.0 72 3-74 596-667 (899)
93 PRK11788 tetratricopeptide rep 97.9 4.3E-05 9.3E-10 66.4 7.4 66 9-74 181-246 (389)
94 KOG0548|consensus 97.9 4.3E-05 9.4E-10 67.8 7.0 73 3-75 353-425 (539)
95 KOG2076|consensus 97.9 4E-05 8.6E-10 71.6 7.0 73 2-74 167-239 (895)
96 KOG1129|consensus 97.9 3.3E-05 7.2E-10 65.2 5.9 70 6-75 393-462 (478)
97 COG4783 Putative Zn-dependent 97.8 5.4E-05 1.2E-09 66.6 7.2 73 2-74 334-406 (484)
98 PF14559 TPR_19: Tetratricopep 97.8 2.4E-05 5.2E-10 50.8 3.8 49 2-50 19-67 (68)
99 KOG0624|consensus 97.8 2.6E-05 5.6E-10 66.2 4.7 73 2-74 66-138 (504)
100 PF13431 TPR_17: Tetratricopep 97.8 7.9E-06 1.7E-10 46.1 1.1 29 1-29 6-34 (34)
101 COG4785 NlpI Lipoprotein NlpI, 97.8 1.9E-05 4.2E-10 63.1 3.4 73 1-73 92-164 (297)
102 COG4235 Cytochrome c biogenesi 97.8 9.2E-05 2E-09 61.7 7.5 73 2-74 184-259 (287)
103 KOG1173|consensus 97.8 5E-05 1.1E-09 67.9 5.7 72 3-74 307-378 (611)
104 PRK15331 chaperone protein Sic 97.7 7.7E-05 1.7E-09 57.2 5.9 70 5-74 34-103 (165)
105 PRK11788 tetratricopeptide rep 97.7 0.00015 3.2E-09 63.0 8.6 72 3-74 136-212 (389)
106 cd05804 StaR_like StaR_like; a 97.7 8.1E-05 1.8E-09 63.9 6.8 69 2-70 142-214 (355)
107 PF13176 TPR_7: Tetratricopept 97.7 1.5E-05 3.3E-10 45.5 1.5 29 10-38 1-29 (36)
108 KOG4642|consensus 97.7 7.6E-05 1.7E-09 60.4 6.0 74 2-75 38-111 (284)
109 KOG1156|consensus 97.7 8.7E-05 1.9E-09 67.3 6.8 75 1-75 68-142 (700)
110 KOG4648|consensus 97.7 3.8E-05 8.3E-10 65.2 4.2 63 12-74 101-163 (536)
111 KOG0550|consensus 97.7 7.9E-05 1.7E-09 64.5 6.1 72 2-73 277-352 (486)
112 KOG2076|consensus 97.7 0.00014 2.9E-09 68.1 8.0 71 3-73 202-272 (895)
113 PF12688 TPR_5: Tetratrico pep 97.7 0.00029 6.3E-09 51.6 7.9 64 6-69 36-102 (120)
114 KOG4555|consensus 97.7 0.00016 3.4E-09 53.5 6.3 65 10-74 45-109 (175)
115 PF03704 BTAD: Bacterial trans 97.6 0.00046 1E-08 51.8 9.0 69 8-76 62-130 (146)
116 KOG1128|consensus 97.6 0.0001 2.2E-09 67.7 5.5 65 1-65 512-576 (777)
117 PRK10803 tol-pal system protei 97.6 0.00043 9.3E-09 57.6 8.8 68 7-74 141-212 (263)
118 PF09295 ChAPs: ChAPs (Chs5p-A 97.5 0.0003 6.5E-09 61.7 7.4 65 2-66 228-292 (395)
119 PF04733 Coatomer_E: Coatomer 97.5 0.00019 4.2E-09 60.6 6.0 71 4-74 197-268 (290)
120 KOG4234|consensus 97.5 0.00027 5.8E-09 56.0 6.1 67 8-74 134-200 (271)
121 COG4783 Putative Zn-dependent 97.5 0.00046 1E-08 60.9 7.9 72 1-72 367-438 (484)
122 PRK14574 hmsH outer membrane p 97.5 0.00028 6E-09 67.5 7.1 73 2-74 444-516 (822)
123 COG1729 Uncharacterized protei 97.5 0.00065 1.4E-08 56.0 8.0 68 7-74 177-247 (262)
124 smart00028 TPR Tetratricopepti 97.5 0.00012 2.5E-09 39.0 2.5 33 9-41 2-34 (34)
125 PF12895 Apc3: Anaphase-promot 97.4 0.00021 4.5E-09 48.6 4.3 53 20-73 1-55 (84)
126 KOG4555|consensus 97.4 0.00088 1.9E-08 49.6 7.5 69 3-71 72-144 (175)
127 PRK10866 outer membrane biogen 97.4 0.00097 2.1E-08 54.9 8.4 70 6-75 30-102 (243)
128 KOG4642|consensus 97.4 0.00012 2.7E-09 59.2 2.9 68 8-75 10-77 (284)
129 COG3071 HemY Uncharacterized e 97.4 0.00066 1.4E-08 58.5 7.3 71 2-73 322-392 (400)
130 PF09976 TPR_21: Tetratricopep 97.4 0.00088 1.9E-08 50.4 7.4 64 4-67 41-110 (145)
131 KOG2002|consensus 97.4 0.00012 2.6E-09 69.1 3.0 71 1-71 639-709 (1018)
132 PF13512 TPR_18: Tetratricopep 97.3 0.0013 2.8E-08 49.4 7.5 69 7-75 9-80 (142)
133 KOG3060|consensus 97.3 0.0011 2.3E-08 54.3 7.4 70 5-74 151-223 (289)
134 PRK11906 transcriptional regul 97.3 0.00042 9E-09 61.2 5.3 73 2-74 289-370 (458)
135 KOG1127|consensus 97.3 0.00064 1.4E-08 64.6 6.8 75 1-75 29-107 (1238)
136 PF00515 TPR_1: Tetratricopept 97.3 0.00062 1.3E-08 37.8 4.3 33 42-74 1-33 (34)
137 PRK14574 hmsH outer membrane p 97.3 0.00082 1.8E-08 64.3 7.6 70 5-74 99-168 (822)
138 PF13174 TPR_6: Tetratricopept 97.3 0.00026 5.7E-09 38.8 2.6 33 9-41 1-33 (33)
139 PF09976 TPR_21: Tetratricopep 97.3 0.0008 1.7E-08 50.6 6.2 61 8-69 85-145 (145)
140 PF13525 YfiO: Outer membrane 97.3 0.0016 3.4E-08 52.0 8.1 70 6-75 3-75 (203)
141 PRK10747 putative protoheme IX 97.3 0.0012 2.7E-08 58.1 8.1 69 3-71 147-216 (398)
142 KOG0543|consensus 97.2 0.00095 2.1E-08 57.8 6.8 73 1-73 284-357 (397)
143 KOG0624|consensus 97.2 0.00036 7.7E-09 59.4 3.8 73 4-76 34-106 (504)
144 PF14853 Fis1_TPR_C: Fis1 C-te 97.2 0.002 4.3E-08 40.1 5.9 43 9-51 2-44 (53)
145 KOG1840|consensus 97.1 0.00099 2.1E-08 60.2 6.3 70 5-74 238-315 (508)
146 PF07719 TPR_2: Tetratricopept 97.1 0.0011 2.3E-08 36.6 4.3 33 42-74 1-33 (34)
147 KOG1128|consensus 97.1 0.00055 1.2E-08 63.1 4.5 73 2-74 547-619 (777)
148 KOG1840|consensus 97.1 0.0011 2.4E-08 59.9 6.0 72 4-75 195-274 (508)
149 PF12569 NARP1: NMDA receptor- 97.1 0.0019 4.2E-08 58.7 7.6 65 10-74 196-260 (517)
150 KOG2003|consensus 97.0 0.001 2.3E-08 58.5 5.1 71 4-74 486-556 (840)
151 TIGR00540 hemY_coli hemY prote 97.0 0.0026 5.5E-08 56.3 7.7 68 4-71 148-216 (409)
152 KOG2003|consensus 97.0 0.00098 2.1E-08 58.7 4.5 74 2-75 518-591 (840)
153 KOG1308|consensus 97.0 0.00052 1.1E-08 58.2 2.8 72 1-72 141-212 (377)
154 PF13176 TPR_7: Tetratricopept 96.9 0.0027 5.8E-08 36.0 4.8 31 44-74 1-31 (36)
155 KOG2002|consensus 96.9 0.0016 3.4E-08 61.8 5.8 70 6-75 305-375 (1018)
156 KOG1156|consensus 96.9 0.0011 2.5E-08 60.3 4.6 75 1-75 34-108 (700)
157 KOG1174|consensus 96.9 0.0033 7.1E-08 54.9 7.1 73 2-74 328-400 (564)
158 PRK10941 hypothetical protein; 96.9 0.0047 1E-07 51.5 7.9 66 9-74 182-247 (269)
159 KOG3060|consensus 96.9 0.0045 9.7E-08 50.7 7.3 73 1-73 113-185 (289)
160 PLN03098 LPA1 LOW PSII ACCUMUL 96.9 0.0012 2.7E-08 58.2 4.3 37 1-37 102-141 (453)
161 COG4785 NlpI Lipoprotein NlpI, 96.9 0.0021 4.6E-08 51.6 5.2 68 7-74 64-131 (297)
162 PF13374 TPR_10: Tetratricopep 96.9 0.0018 3.8E-08 37.4 3.6 29 9-37 3-31 (42)
163 KOG0550|consensus 96.9 0.0013 2.7E-08 57.3 4.1 75 1-75 196-282 (486)
164 KOG4162|consensus 96.8 0.0022 4.8E-08 59.5 5.8 72 3-74 679-752 (799)
165 PF13512 TPR_18: Tetratricopep 96.8 0.0078 1.7E-07 45.2 7.7 49 7-55 46-97 (142)
166 KOG1127|consensus 96.8 0.0026 5.7E-08 60.6 6.3 72 1-72 589-660 (1238)
167 KOG4648|consensus 96.8 0.0013 2.8E-08 56.1 3.6 75 1-75 124-198 (536)
168 KOG1174|consensus 96.7 0.0054 1.2E-07 53.6 7.1 74 1-75 431-504 (564)
169 KOG1129|consensus 96.7 0.00065 1.4E-08 57.6 0.9 72 6-77 254-325 (478)
170 KOG3081|consensus 96.5 0.0093 2E-07 49.2 6.8 61 3-63 202-262 (299)
171 PF06552 TOM20_plant: Plant sp 96.5 0.0016 3.5E-08 50.7 2.2 51 1-51 62-123 (186)
172 PRK10866 outer membrane biogen 96.5 0.015 3.3E-07 47.8 8.1 53 3-55 61-119 (243)
173 PF13428 TPR_14: Tetratricopep 96.4 0.0056 1.2E-07 36.2 3.8 34 42-75 1-34 (44)
174 COG2956 Predicted N-acetylgluc 96.4 0.012 2.6E-07 49.9 6.8 68 8-75 180-247 (389)
175 PF14561 TPR_20: Tetratricopep 96.4 0.027 5.8E-07 39.0 7.4 44 28-71 8-51 (90)
176 COG2956 Predicted N-acetylgluc 96.4 0.0087 1.9E-07 50.7 5.7 72 2-73 208-280 (389)
177 PRK14720 transcript cleavage f 96.3 0.0057 1.2E-07 58.7 5.0 47 4-50 27-73 (906)
178 PF13181 TPR_8: Tetratricopept 96.3 0.013 2.9E-07 32.1 4.7 32 43-74 2-33 (34)
179 PRK10153 DNA-binding transcrip 96.3 0.0082 1.8E-07 54.7 5.7 74 1-74 369-452 (517)
180 KOG4234|consensus 96.3 0.0099 2.2E-07 47.3 5.4 63 12-74 99-166 (271)
181 KOG3824|consensus 96.3 0.0078 1.7E-07 50.8 5.0 65 10-74 118-182 (472)
182 PF13174 TPR_6: Tetratricopept 96.2 0.0087 1.9E-07 32.5 3.4 32 43-74 1-32 (33)
183 KOG0376|consensus 96.1 0.0021 4.5E-08 56.9 1.0 73 2-74 32-104 (476)
184 PF13525 YfiO: Outer membrane 96.0 0.021 4.5E-07 45.5 6.3 70 5-74 39-122 (203)
185 PLN03081 pentatricopeptide (PP 96.0 0.023 5.1E-07 53.6 7.5 64 7-70 493-556 (697)
186 KOG0545|consensus 95.9 0.027 5.8E-07 46.2 6.4 66 9-74 231-296 (329)
187 COG1729 Uncharacterized protei 95.9 0.028 6E-07 46.5 6.6 65 10-74 143-210 (262)
188 PF09295 ChAPs: ChAPs (Chs5p-A 95.9 0.038 8.3E-07 48.6 7.7 69 7-75 199-267 (395)
189 PF13424 TPR_12: Tetratricopep 95.7 0.011 2.3E-07 39.2 3.0 31 7-37 45-75 (78)
190 PLN03077 Protein ECB2; Provisi 95.7 0.053 1.1E-06 52.5 8.7 63 8-70 657-719 (857)
191 PF14938 SNAP: Soluble NSF att 95.7 0.062 1.4E-06 45.0 8.0 64 8-71 114-184 (282)
192 COG4976 Predicted methyltransf 95.7 0.015 3.2E-07 47.2 3.9 60 16-75 3-62 (287)
193 PF12569 NARP1: NMDA receptor- 95.6 0.16 3.4E-06 46.4 10.8 69 6-74 36-109 (517)
194 KOG3824|consensus 95.6 0.0086 1.9E-07 50.6 2.4 52 1-52 143-194 (472)
195 PF14561 TPR_20: Tetratricopep 95.5 0.037 8E-07 38.3 5.0 65 3-67 17-83 (90)
196 PF13374 TPR_10: Tetratricopep 95.5 0.05 1.1E-06 31.0 4.9 33 42-74 2-34 (42)
197 KOG2796|consensus 95.4 0.025 5.4E-07 46.9 4.5 68 7-74 251-318 (366)
198 COG4105 ComL DNA uptake lipopr 95.4 0.098 2.1E-06 43.0 7.9 70 6-75 32-104 (254)
199 PF07720 TPR_3: Tetratricopept 95.3 0.043 9.3E-07 31.1 4.0 34 8-41 1-36 (36)
200 PF04184 ST7: ST7 protein; In 95.2 0.071 1.5E-06 47.8 7.1 61 8-68 259-321 (539)
201 PF09613 HrpB1_HrpK: Bacterial 95.2 0.15 3.2E-06 39.1 7.8 57 3-59 39-95 (160)
202 PF07721 TPR_4: Tetratricopept 95.2 0.016 3.5E-07 30.2 1.8 24 9-32 2-25 (26)
203 PF10300 DUF3808: Protein of u 95.1 0.048 1E-06 49.2 5.8 70 2-71 261-334 (468)
204 KOG1130|consensus 95.0 0.017 3.6E-07 50.7 2.5 65 9-73 196-266 (639)
205 PF12968 DUF3856: Domain of Un 94.8 0.25 5.5E-06 36.1 7.8 68 7-74 54-132 (144)
206 COG4700 Uncharacterized protei 94.7 0.18 3.9E-06 39.9 7.3 69 6-74 122-192 (251)
207 smart00028 TPR Tetratricopepti 94.7 0.038 8.3E-07 28.5 2.6 31 43-73 2-32 (34)
208 PF12862 Apc5: Anaphase-promot 94.6 0.17 3.6E-06 35.1 6.4 57 19-75 9-74 (94)
209 KOG0495|consensus 94.5 0.051 1.1E-06 50.2 4.4 62 9-70 652-713 (913)
210 KOG4340|consensus 94.5 0.11 2.4E-06 43.9 5.9 66 6-71 142-207 (459)
211 KOG2376|consensus 94.4 0.27 5.8E-06 45.0 8.6 75 4-78 106-211 (652)
212 PF10602 RPN7: 26S proteasome 94.4 0.24 5.2E-06 38.7 7.6 67 8-74 36-105 (177)
213 KOG2053|consensus 94.4 0.092 2E-06 49.9 5.9 74 1-74 36-109 (932)
214 KOG3785|consensus 94.3 0.07 1.5E-06 46.1 4.6 59 9-67 58-116 (557)
215 PF10579 Rapsyn_N: Rapsyn N-te 94.2 0.49 1.1E-05 31.8 7.5 62 9-70 7-68 (80)
216 COG4700 Uncharacterized protei 94.1 0.19 4.2E-06 39.7 6.3 66 9-74 90-156 (251)
217 TIGR02561 HrpB1_HrpK type III 94.0 0.29 6.2E-06 37.1 6.9 58 3-60 39-96 (153)
218 KOG0545|consensus 93.8 0.12 2.7E-06 42.4 4.8 67 8-74 178-262 (329)
219 PF10516 SHNi-TPR: SHNi-TPR; 93.6 0.083 1.8E-06 30.3 2.6 29 9-37 2-30 (38)
220 COG3629 DnrI DNA-binding trans 93.6 0.48 1E-05 39.8 8.1 67 8-74 153-219 (280)
221 KOG1130|consensus 93.5 0.11 2.4E-06 45.7 4.4 65 9-73 236-306 (639)
222 KOG0376|consensus 93.4 0.069 1.5E-06 47.5 3.0 65 11-75 7-71 (476)
223 KOG2471|consensus 93.2 0.13 2.7E-06 46.2 4.3 67 8-74 283-367 (696)
224 COG2912 Uncharacterized conser 93.0 0.27 5.9E-06 40.8 5.7 66 9-74 182-247 (269)
225 KOG0551|consensus 92.9 0.18 3.8E-06 43.1 4.5 64 7-70 118-181 (390)
226 COG4976 Predicted methyltransf 92.8 0.088 1.9E-06 42.8 2.5 44 1-44 22-65 (287)
227 COG3118 Thioredoxin domain-con 92.6 0.45 9.7E-06 40.0 6.5 57 11-67 137-193 (304)
228 KOG2610|consensus 92.6 0.36 7.8E-06 41.5 6.0 60 8-67 175-234 (491)
229 PF14938 SNAP: Soluble NSF att 92.5 0.43 9.2E-06 40.0 6.5 66 8-74 35-106 (282)
230 PF04733 Coatomer_E: Coatomer 92.3 0.21 4.5E-06 42.2 4.4 52 22-73 181-232 (290)
231 COG0457 NrfG FOG: TPR repeat [ 92.2 0.86 1.9E-05 34.2 7.5 68 7-74 166-234 (291)
232 PRK04841 transcriptional regul 92.2 0.64 1.4E-05 45.1 8.2 66 9-74 492-563 (903)
233 COG0457 NrfG FOG: TPR repeat [ 92.2 0.75 1.6E-05 34.5 7.1 67 4-70 91-158 (291)
234 PF09986 DUF2225: Uncharacteri 92.1 0.95 2.1E-05 36.5 7.8 62 8-69 118-192 (214)
235 PF08424 NRDE-2: NRDE-2, neces 92.1 0.78 1.7E-05 39.3 7.7 73 2-74 13-97 (321)
236 KOG1308|consensus 92.0 0.024 5.2E-07 48.4 -1.7 60 16-75 122-181 (377)
237 KOG4340|consensus 91.9 0.38 8.2E-06 40.8 5.2 62 3-64 39-100 (459)
238 PRK04841 transcriptional regul 91.6 0.63 1.4E-05 45.1 7.4 64 11-74 694-763 (903)
239 PLN03218 maturation of RBCL 1; 91.6 0.93 2E-05 45.1 8.5 62 8-69 579-641 (1060)
240 PF07721 TPR_4: Tetratricopept 91.6 0.26 5.7E-06 25.4 2.7 24 43-66 2-25 (26)
241 PF05843 Suf: Suppressor of fo 91.4 0.89 1.9E-05 38.1 7.2 67 6-72 33-100 (280)
242 smart00386 HAT HAT (Half-A-TPR 91.4 0.65 1.4E-05 24.4 4.4 30 22-51 1-30 (33)
243 PLN03218 maturation of RBCL 1; 91.3 1.2 2.6E-05 44.3 8.9 62 7-69 683-746 (1060)
244 PF09613 HrpB1_HrpK: Bacterial 91.2 1.9 4.1E-05 33.1 8.1 61 9-69 11-71 (160)
245 COG3071 HemY Uncharacterized e 91.0 1.7 3.7E-05 38.0 8.5 64 7-70 152-215 (400)
246 PLN03081 pentatricopeptide (PP 91.0 1.3 2.7E-05 42.0 8.6 65 5-70 287-353 (697)
247 KOG0495|consensus 90.8 1.2 2.7E-05 41.5 7.8 73 1-73 678-750 (913)
248 KOG0551|consensus 90.5 0.6 1.3E-05 40.0 5.2 70 7-76 80-153 (390)
249 KOG2376|consensus 89.2 1.2 2.6E-05 40.9 6.4 68 7-74 174-256 (652)
250 COG3118 Thioredoxin domain-con 88.7 3.1 6.7E-05 35.2 8.1 69 3-71 163-265 (304)
251 COG4455 ImpE Protein of avirul 88.5 0.89 1.9E-05 36.9 4.6 59 16-74 9-67 (273)
252 KOG1941|consensus 88.2 1.4 2.9E-05 38.5 5.8 65 8-72 206-276 (518)
253 COG3947 Response regulator con 88.2 2.8 6E-05 35.5 7.4 64 11-74 282-345 (361)
254 KOG2396|consensus 88.0 1.9 4.1E-05 39.0 6.7 71 4-74 101-172 (568)
255 PLN03077 Protein ECB2; Provisi 87.9 2.1 4.5E-05 41.6 7.6 65 5-70 551-617 (857)
256 PF04910 Tcf25: Transcriptiona 87.7 1.3 2.8E-05 38.7 5.6 73 2-74 34-135 (360)
257 KOG1914|consensus 87.4 3.1 6.8E-05 38.0 7.8 69 1-70 13-81 (656)
258 PF05843 Suf: Suppressor of fo 87.3 2.3 5.1E-05 35.6 6.7 72 3-74 65-139 (280)
259 PF10373 EST1_DNA_bind: Est1 D 87.2 1.5 3.3E-05 36.0 5.6 51 4-54 12-62 (278)
260 KOG3364|consensus 86.8 1.4 3E-05 32.9 4.4 66 9-74 33-103 (149)
261 PRK10941 hypothetical protein; 86.3 2 4.3E-05 35.9 5.7 48 2-49 209-256 (269)
262 KOG2053|consensus 86.2 2.7 5.8E-05 40.4 7.0 58 4-62 73-130 (932)
263 PF12862 Apc5: Anaphase-promot 86.1 1.1 2.4E-05 30.9 3.6 32 7-38 40-71 (94)
264 TIGR02561 HrpB1_HrpK type III 85.3 4.6 0.0001 30.6 6.7 54 16-69 18-71 (153)
265 PF11207 DUF2989: Protein of u 85.0 4.6 0.0001 32.2 6.9 55 6-61 139-197 (203)
266 TIGR03504 FimV_Cterm FimV C-te 84.2 1.6 3.5E-05 25.8 3.1 25 12-36 3-27 (44)
267 KOG4507|consensus 83.8 1.4 3E-05 40.7 3.8 65 11-75 645-709 (886)
268 PF14853 Fis1_TPR_C: Fis1 C-te 83.7 2.9 6.2E-05 25.8 4.2 32 43-74 2-33 (53)
269 KOG2610|consensus 83.6 1.9 4.1E-05 37.3 4.4 73 2-74 131-207 (491)
270 cd02682 MIT_AAA_Arch MIT: doma 83.4 1.6 3.4E-05 29.1 3.1 28 9-36 7-34 (75)
271 PF10516 SHNi-TPR: SHNi-TPR; 83.3 3.5 7.7E-05 23.5 4.2 33 43-75 2-34 (38)
272 PF12854 PPR_1: PPR repeat 83.2 2.3 4.9E-05 23.4 3.3 27 7-33 6-32 (34)
273 PF11846 DUF3366: Domain of un 82.8 5 0.00011 31.4 6.4 53 20-73 123-175 (193)
274 KOG4507|consensus 82.5 2 4.4E-05 39.7 4.3 46 6-51 674-719 (886)
275 KOG3785|consensus 82.1 2.8 6.1E-05 36.5 4.9 65 10-74 153-217 (557)
276 COG4105 ComL DNA uptake lipopr 82.1 6.6 0.00014 32.5 6.9 50 5-54 68-120 (254)
277 KOG1585|consensus 81.6 9.5 0.00021 31.6 7.5 65 10-74 73-142 (308)
278 PF13281 DUF4071: Domain of un 81.4 5.9 0.00013 34.7 6.7 69 5-73 176-257 (374)
279 COG3914 Spy Predicted O-linked 81.3 7.4 0.00016 35.9 7.5 71 3-73 60-133 (620)
280 PF04184 ST7: ST7 protein; In 81.0 3.8 8.3E-05 37.1 5.5 72 1-74 195-291 (539)
281 KOG1310|consensus 81.0 2.6 5.6E-05 38.5 4.4 71 3-73 403-476 (758)
282 PF14863 Alkyl_sulf_dimr: Alky 79.7 6.5 0.00014 29.5 5.7 50 8-57 70-119 (141)
283 PF01535 PPR: PPR repeat; Int 79.3 3.7 8E-05 21.2 3.2 25 45-69 3-27 (31)
284 PF07079 DUF1347: Protein of u 78.1 5.8 0.00013 35.6 5.6 50 16-66 470-519 (549)
285 PF02259 FAT: FAT domain; Int 77.7 7.1 0.00015 33.1 6.2 48 8-55 252-305 (352)
286 PF13041 PPR_2: PPR repeat fam 77.2 12 0.00025 22.1 5.4 42 7-49 2-45 (50)
287 COG5191 Uncharacterized conser 77.2 2 4.4E-05 36.7 2.5 69 5-73 104-173 (435)
288 PF08424 NRDE-2: NRDE-2, neces 76.1 23 0.00049 30.3 8.8 74 2-75 59-135 (321)
289 KOG2471|consensus 76.1 5.1 0.00011 36.4 4.8 48 7-54 334-381 (696)
290 KOG1915|consensus 75.0 8 0.00017 35.1 5.7 67 8-74 73-139 (677)
291 COG3898 Uncharacterized membra 75.0 11 0.00023 33.5 6.3 52 18-69 164-215 (531)
292 PF04212 MIT: MIT (microtubule 74.9 5.6 0.00012 25.6 3.7 25 12-36 9-33 (69)
293 KOG1941|consensus 74.8 4.9 0.00011 35.2 4.2 64 9-72 163-236 (518)
294 PF04781 DUF627: Protein of un 74.5 5.3 0.00012 28.7 3.7 55 9-63 34-99 (111)
295 KOG2758|consensus 74.5 12 0.00027 32.2 6.4 71 4-74 125-199 (432)
296 KOG2796|consensus 74.2 18 0.0004 30.4 7.2 62 9-70 178-240 (366)
297 PF10300 DUF3808: Protein of u 73.8 13 0.00028 33.6 7.0 65 8-72 305-377 (468)
298 KOG3081|consensus 73.2 16 0.00035 30.6 6.7 53 22-74 187-239 (299)
299 PF03704 BTAD: Bacterial trans 72.6 6.2 0.00013 29.1 4.0 35 2-36 90-124 (146)
300 COG2912 Uncharacterized conser 72.1 7.2 0.00016 32.6 4.5 49 1-49 208-256 (269)
301 cd02681 MIT_calpain7_1 MIT: do 71.2 7.5 0.00016 25.9 3.7 27 10-36 8-34 (76)
302 KOG2396|consensus 71.1 8.3 0.00018 35.0 4.9 49 1-49 132-181 (568)
303 KOG1915|consensus 71.1 7.8 0.00017 35.1 4.7 51 20-70 449-499 (677)
304 KOG1585|consensus 71.0 24 0.00052 29.4 7.1 67 8-74 31-103 (308)
305 KOG1464|consensus 70.5 20 0.00043 30.3 6.7 55 22-76 41-99 (440)
306 PF15015 NYD-SP12_N: Spermatog 70.4 8.1 0.00018 34.5 4.6 53 12-64 232-284 (569)
307 PF11817 Foie-gras_1: Foie gra 70.3 15 0.00032 30.2 6.0 58 10-67 180-243 (247)
308 KOG0985|consensus 70.1 18 0.0004 35.9 7.2 62 6-72 1102-1163(1666)
309 KOG0546|consensus 70.0 7 0.00015 33.9 4.1 52 4-55 305-356 (372)
310 TIGR00756 PPR pentatricopeptid 69.7 12 0.00026 19.4 3.8 25 45-69 3-27 (35)
311 PF07720 TPR_3: Tetratricopept 69.6 17 0.00038 20.3 4.4 30 43-72 2-33 (36)
312 PF12854 PPR_1: PPR repeat 69.5 13 0.00029 20.2 3.9 27 41-67 6-32 (34)
313 PF13041 PPR_2: PPR repeat fam 69.2 14 0.00031 21.7 4.4 28 42-69 3-30 (50)
314 PF10373 EST1_DNA_bind: Est1 D 68.8 9.9 0.00022 31.1 4.8 43 27-69 1-43 (278)
315 PF08238 Sel1: Sel1 repeat; I 68.4 8.7 0.00019 21.0 3.1 30 8-37 1-37 (39)
316 KOG4814|consensus 68.4 16 0.00035 34.3 6.2 67 6-72 392-458 (872)
317 cd02683 MIT_1 MIT: domain cont 68.0 8.6 0.00019 25.6 3.5 27 10-36 8-34 (77)
318 cd02679 MIT_spastin MIT: domai 67.1 11 0.00023 25.4 3.8 25 51-75 17-41 (79)
319 smart00671 SEL1 Sel1-like repe 66.8 11 0.00024 20.1 3.3 29 9-37 2-34 (36)
320 COG2976 Uncharacterized protei 66.7 19 0.00042 28.7 5.6 59 9-67 90-151 (207)
321 KOG1070|consensus 66.6 17 0.00036 37.3 6.3 67 8-74 1530-1596(1710)
322 TIGR03504 FimV_Cterm FimV C-te 66.6 14 0.00029 21.8 3.7 25 45-69 2-26 (44)
323 COG4259 Uncharacterized protei 65.5 21 0.00046 25.3 5.0 34 41-74 71-104 (121)
324 PF08311 Mad3_BUB1_I: Mad3/BUB 63.8 35 0.00077 24.9 6.4 61 3-69 64-126 (126)
325 COG5107 RNA14 Pre-mRNA 3'-end 63.6 22 0.00048 32.1 6.0 69 1-69 35-103 (660)
326 KOG0530|consensus 63.3 35 0.00076 28.7 6.7 69 3-71 107-176 (318)
327 PF10952 DUF2753: Protein of u 63.0 47 0.001 24.5 6.6 60 10-69 3-77 (140)
328 PF12968 DUF3856: Domain of Un 62.9 59 0.0013 24.0 7.4 66 9-74 8-87 (144)
329 PF13812 PPR_3: Pentatricopept 62.9 20 0.00044 18.6 4.1 26 44-69 3-28 (34)
330 COG5191 Uncharacterized conser 62.9 3.3 7.3E-05 35.4 0.8 49 1-49 134-183 (435)
331 PF13226 DUF4034: Domain of un 62.8 21 0.00046 30.0 5.5 58 4-61 73-152 (277)
332 KOG3783|consensus 62.2 38 0.00081 31.1 7.3 65 9-73 450-522 (546)
333 KOG1586|consensus 61.5 36 0.00079 28.1 6.4 63 9-72 75-144 (288)
334 COG0790 FOG: TPR repeat, SEL1 61.3 60 0.0013 26.8 8.2 47 7-56 186-236 (292)
335 COG3898 Uncharacterized membra 60.9 39 0.00085 30.1 6.9 44 1-44 256-299 (531)
336 smart00745 MIT Microtubule Int 60.4 16 0.00034 23.9 3.7 21 54-74 20-40 (77)
337 PF07219 HemY_N: HemY protein 60.4 41 0.00089 23.7 6.1 47 10-56 61-107 (108)
338 PF10255 Paf67: RNA polymerase 60.3 21 0.00045 31.8 5.3 64 11-74 125-196 (404)
339 PF08631 SPO22: Meiosis protei 60.2 62 0.0013 26.9 8.0 56 8-63 35-105 (278)
340 PF02259 FAT: FAT domain; Int 60.1 38 0.00083 28.5 7.0 66 5-70 143-212 (352)
341 KOG2997|consensus 59.0 11 0.00024 32.3 3.2 39 8-46 19-57 (366)
342 PF02064 MAS20: MAS20 protein 58.9 16 0.00034 26.7 3.7 33 11-43 66-98 (121)
343 COG2976 Uncharacterized protei 58.5 44 0.00095 26.7 6.3 44 26-69 70-116 (207)
344 KOG2047|consensus 58.3 36 0.00078 32.2 6.5 63 7-69 386-452 (835)
345 KOG3364|consensus 57.9 32 0.0007 25.8 5.1 42 7-48 70-111 (149)
346 TIGR02996 rpt_mate_G_obs repea 57.5 32 0.0007 20.1 4.1 33 29-61 3-35 (42)
347 cd02656 MIT MIT: domain contai 56.2 21 0.00045 23.3 3.7 18 56-73 20-37 (75)
348 PF09797 NatB_MDM20: N-acetylt 56.1 33 0.00072 29.7 5.9 48 21-68 196-243 (365)
349 PF12583 TPPII_N: Tripeptidyl 56.0 25 0.00055 26.1 4.3 44 9-52 77-120 (139)
350 cd02678 MIT_VPS4 MIT: domain c 55.9 21 0.00046 23.4 3.7 22 14-35 12-33 (75)
351 PF09205 DUF1955: Domain of un 55.6 70 0.0015 24.1 6.5 54 16-69 94-147 (161)
352 PRK15180 Vi polysaccharide bio 55.6 24 0.00052 32.2 4.8 62 2-63 317-378 (831)
353 PF05053 Menin: Menin; InterP 55.2 76 0.0016 29.5 8.0 68 6-73 275-349 (618)
354 KOG0686|consensus 54.0 27 0.00059 31.0 4.8 62 9-70 151-215 (466)
355 cd08440 PBP2_LTTR_like_4 TThe 53.5 23 0.00051 26.0 4.1 56 137-197 1-56 (197)
356 COG0790 FOG: TPR repeat, SEL1 52.7 94 0.002 25.6 8.0 52 4-55 105-161 (292)
357 PF04053 Coatomer_WDAD: Coatom 52.3 37 0.00081 30.6 5.7 30 5-34 344-373 (443)
358 PF01239 PPTA: Protein prenylt 52.3 34 0.00074 17.9 4.6 28 27-54 2-29 (31)
359 KOG3325|consensus 52.0 27 0.00059 26.6 4.0 25 176-202 93-117 (183)
360 cd02680 MIT_calpain7_2 MIT: do 51.5 24 0.00051 23.5 3.3 25 13-37 11-35 (75)
361 KOG1586|consensus 51.3 69 0.0015 26.5 6.5 66 8-74 34-105 (288)
362 KOG3617|consensus 51.0 71 0.0015 31.4 7.3 65 6-70 910-995 (1416)
363 PHA02537 M terminase endonucle 50.2 26 0.00057 28.6 4.0 33 8-40 169-210 (230)
364 COG2099 CobK Precorrin-6x redu 49.8 20 0.00043 29.7 3.3 24 174-198 51-74 (257)
365 cd02684 MIT_2 MIT: domain cont 49.0 30 0.00064 22.9 3.5 21 15-35 13-33 (75)
366 cd08441 PBP2_MetR The C-termin 47.7 35 0.00076 25.4 4.3 55 137-197 1-56 (198)
367 KOG4814|consensus 47.5 48 0.001 31.3 5.6 62 13-74 359-426 (872)
368 TIGR00160 MGSA methylglyoxal s 47.4 23 0.00049 26.7 3.0 30 168-198 52-81 (143)
369 KOG1070|consensus 46.5 69 0.0015 33.1 6.8 70 6-75 1562-1633(1710)
370 PF04053 Coatomer_WDAD: Coatom 46.4 96 0.0021 28.0 7.3 63 11-73 298-378 (443)
371 KOG1258|consensus 45.1 75 0.0016 29.5 6.5 73 2-74 73-146 (577)
372 cd08431 PBP2_HupR The C-termin 44.4 32 0.00068 25.6 3.6 57 137-198 1-57 (195)
373 cd02677 MIT_SNX15 MIT: domain 44.3 32 0.00069 22.7 3.1 19 56-74 20-38 (75)
374 PF04910 Tcf25: Transcriptiona 44.3 72 0.0016 27.8 6.1 41 33-73 31-71 (360)
375 PF08631 SPO22: Meiosis protei 43.3 1.1E+02 0.0024 25.4 6.9 56 19-74 4-68 (278)
376 cd08415 PBP2_LysR_opines_like 42.9 54 0.0012 24.1 4.7 55 137-196 1-55 (196)
377 cd08436 PBP2_LTTR_like_3 The C 42.6 36 0.00078 25.0 3.6 54 138-196 2-55 (194)
378 PF02184 HAT: HAT (Half-A-TPR) 42.3 53 0.0012 18.0 3.2 26 23-49 2-27 (32)
379 cd07405 MPP_UshA_N Escherichia 42.1 43 0.00093 28.0 4.3 41 161-203 167-220 (285)
380 cd07409 MPP_CD73_N CD73 ecto-5 41.6 43 0.00094 27.9 4.2 38 161-202 176-216 (281)
381 cd08458 PBP2_NocR The C-termin 41.1 55 0.0012 24.4 4.5 56 137-197 1-56 (196)
382 PRK15490 Vi polysaccharide bio 40.8 98 0.0021 28.9 6.6 54 7-62 41-94 (578)
383 KOG1310|consensus 40.6 20 0.00044 33.0 2.1 42 1-42 438-479 (758)
384 PRK15326 type III secretion sy 39.8 1.1E+02 0.0025 20.5 5.1 40 21-63 20-59 (80)
385 KOG2047|consensus 39.6 51 0.0011 31.2 4.5 58 8-65 477-534 (835)
386 cd08419 PBP2_CbbR_RubisCO_like 39.5 43 0.00094 24.6 3.7 25 171-196 30-54 (197)
387 smart00854 PGA_cap Bacterial c 39.3 66 0.0014 26.0 4.9 35 161-195 167-210 (239)
388 COG4455 ImpE Protein of avirul 39.1 49 0.0011 27.1 3.9 50 2-51 29-81 (273)
389 KOG0529|consensus 39.0 1E+02 0.0022 27.5 6.0 73 2-74 57-143 (421)
390 KOG2422|consensus 38.8 2.9E+02 0.0063 26.0 9.1 73 2-74 278-374 (665)
391 cd08438 PBP2_CidR The C-termin 38.2 61 0.0013 23.8 4.3 55 137-196 1-55 (197)
392 TIGR00985 3a0801s04tom mitocho 38.1 58 0.0013 24.7 4.0 31 13-43 95-126 (148)
393 PF04343 DUF488: Protein of un 38.0 30 0.00066 24.9 2.4 17 180-196 3-19 (122)
394 KOG2709|consensus 38.0 33 0.00071 30.6 2.9 30 8-37 22-51 (560)
395 cd02679 MIT_spastin MIT: domai 37.8 96 0.0021 20.8 4.6 29 9-37 9-37 (79)
396 KOG1258|consensus 36.3 1.7E+02 0.0037 27.3 7.3 68 7-74 330-398 (577)
397 PF05387 Chorion_3: Chorion fa 35.7 14 0.0003 30.2 0.3 42 174-219 104-159 (277)
398 COG1803 MgsA Methylglyoxal syn 35.4 65 0.0014 23.8 3.7 57 137-197 4-82 (142)
399 KOG4056|consensus 34.9 73 0.0016 23.8 3.9 34 12-45 85-118 (143)
400 KOG0546|consensus 34.2 17 0.00037 31.5 0.7 65 10-74 277-341 (372)
401 PF10345 Cohesin_load: Cohesin 34.0 1.6E+02 0.0035 27.5 7.1 61 10-70 363-432 (608)
402 PF04781 DUF627: Protein of un 34.0 1E+02 0.0022 22.1 4.5 40 14-53 2-44 (111)
403 cd07381 MPP_CapA CapA and rela 33.6 96 0.0021 24.9 5.0 35 161-195 169-212 (239)
404 PF03466 LysR_substrate: LysR 33.5 26 0.00057 26.3 1.6 61 134-200 4-65 (209)
405 cd08427 PBP2_LTTR_like_2 The C 33.0 65 0.0014 23.6 3.7 26 171-197 31-56 (195)
406 PF13281 DUF4071: Domain of un 32.9 2.1E+02 0.0046 25.2 7.1 62 8-69 141-209 (374)
407 cd05466 PBP2_LTTR_substrate Th 32.9 57 0.0012 23.6 3.3 55 138-197 2-56 (197)
408 cd08446 PBP2_Chlorocatechol Th 32.8 60 0.0013 24.0 3.5 55 136-196 1-56 (198)
409 cd08414 PBP2_LTTR_aromatics_li 32.5 65 0.0014 23.6 3.6 27 169-196 29-55 (197)
410 PRK10076 pyruvate formate lyas 32.2 52 0.0011 26.4 3.1 26 180-205 57-82 (213)
411 cd08442 PBP2_YofA_SoxR_like Th 32.0 80 0.0017 23.1 4.1 25 171-196 31-55 (193)
412 KOG2709|consensus 32.0 63 0.0014 28.9 3.7 33 42-74 22-54 (560)
413 PRK11716 DNA-binding transcrip 32.0 61 0.0013 26.0 3.6 59 134-197 65-123 (269)
414 PF00448 SRP54: SRP54-type pro 31.9 2.2E+02 0.0047 22.4 6.6 62 137-205 29-99 (196)
415 cd08456 PBP2_LysR The C-termin 31.8 58 0.0013 24.0 3.3 55 137-196 1-55 (196)
416 PRK13184 pknD serine/threonine 31.5 93 0.002 30.9 5.2 65 9-74 513-584 (932)
417 KOG2581|consensus 31.3 46 0.001 29.7 2.8 36 6-41 245-280 (493)
418 cd07410 MPP_CpdB_N Escherichia 31.0 75 0.0016 26.3 4.0 40 161-202 177-228 (277)
419 cd08449 PBP2_XapR The C-termin 31.0 76 0.0016 23.3 3.8 54 138-196 2-55 (197)
420 smart00777 Mad3_BUB1_I Mad3/BU 30.9 1.7E+02 0.0037 21.4 5.4 59 2-66 63-123 (125)
421 KOG2300|consensus 30.5 2.7E+02 0.0058 25.7 7.4 67 7-73 6-78 (629)
422 cd08485 PBP2_ClcR The C-termin 30.4 86 0.0019 23.4 4.0 54 137-196 2-56 (198)
423 cd08461 PBP2_DntR_like_3 The C 30.4 57 0.0012 24.2 3.0 55 137-196 1-55 (198)
424 PF03129 HGTP_anticodon: Antic 30.2 42 0.0009 22.5 2.0 38 180-217 19-56 (94)
425 KOG3617|consensus 30.0 1.5E+02 0.0033 29.3 6.0 64 8-71 858-941 (1416)
426 cd07412 MPP_YhcR_N Bacillus su 29.8 95 0.0021 26.0 4.4 39 161-202 185-239 (288)
427 cd00861 ProRS_anticodon_short 29.8 77 0.0017 21.0 3.3 39 180-218 21-59 (94)
428 PF04190 DUF410: Protein of un 29.7 1.3E+02 0.0027 24.9 5.1 66 6-71 88-170 (260)
429 COG4259 Uncharacterized protei 29.6 1.5E+02 0.0032 21.2 4.5 40 5-44 69-108 (121)
430 KOG1550|consensus 29.5 3E+02 0.0065 25.5 8.0 63 6-70 323-392 (552)
431 cd08420 PBP2_CysL_like C-termi 29.3 1E+02 0.0022 22.5 4.3 26 171-197 31-56 (201)
432 cd08412 PBP2_PAO1_like The C-t 28.9 88 0.0019 23.0 3.9 55 138-197 2-56 (198)
433 PRK13184 pknD serine/threonine 28.9 55 0.0012 32.4 3.2 42 3-44 547-588 (932)
434 COG2909 MalT ATP-dependent tra 28.7 2.9E+02 0.0063 27.3 7.7 67 8-74 458-529 (894)
435 PRK11074 putative DNA-binding 28.7 84 0.0018 26.0 4.0 58 134-196 90-147 (300)
436 cd08425 PBP2_CynR The C-termin 28.7 81 0.0017 23.3 3.6 57 136-197 1-57 (197)
437 cd00280 TRFH Telomeric Repeat 28.2 1.4E+02 0.0031 23.7 4.7 35 16-51 119-153 (200)
438 KOG0530|consensus 28.1 2.7E+02 0.0058 23.6 6.5 66 8-73 43-109 (318)
439 cd08429 PBP2_NhaR The C-termin 27.9 76 0.0016 24.2 3.4 55 138-197 2-56 (204)
440 PF10602 RPN7: 26S proteasome 27.0 94 0.002 24.0 3.7 29 43-71 37-65 (177)
441 PF10345 Cohesin_load: Cohesin 26.8 3.7E+02 0.0081 25.1 8.2 66 8-74 59-131 (608)
442 KOG1550|consensus 26.6 1.7E+02 0.0036 27.1 5.8 29 24-54 309-337 (552)
443 PF04552 Sigma54_DBD: Sigma-54 26.3 40 0.00086 25.9 1.4 33 160-193 106-138 (160)
444 KOG0890|consensus 26.1 75 0.0016 34.5 3.6 50 17-66 1458-1507(2382)
445 cd05568 PTS_IIB_bgl_like PTS_I 26.1 63 0.0014 21.0 2.3 54 139-199 2-55 (85)
446 cd08445 PBP2_BenM_CatM_CatR Th 25.7 1.1E+02 0.0023 22.9 3.8 54 137-196 2-56 (203)
447 TIGR03298 argP transcriptional 25.6 86 0.0019 25.7 3.5 57 135-197 89-145 (292)
448 cd01422 MGS Methylglyoxal synt 25.3 90 0.0019 22.3 3.1 33 180-212 60-94 (115)
449 cd08162 MPP_PhoA_N Synechococc 25.2 72 0.0016 27.2 2.9 37 162-202 202-242 (313)
450 cd08434 PBP2_GltC_like The sub 25.0 1.2E+02 0.0027 21.9 4.0 25 171-196 31-55 (195)
451 KOG0889|consensus 24.5 1E+02 0.0022 35.0 4.2 46 8-53 2812-2857(3550)
452 COG0029 NadB Aspartate oxidase 24.4 1.2E+02 0.0027 27.8 4.3 41 161-202 287-353 (518)
453 PF02571 CbiJ: Precorrin-6x re 24.3 90 0.0019 25.7 3.3 27 171-198 47-74 (249)
454 cd08430 PBP2_IlvY The C-termin 24.0 1.4E+02 0.003 21.9 4.2 55 138-197 2-56 (199)
455 cd08466 PBP2_LeuO The C-termin 24.0 89 0.0019 23.1 3.1 25 170-195 30-54 (200)
456 cd08450 PBP2_HcaR The C-termin 23.7 1.3E+02 0.0028 22.1 3.9 28 168-196 28-55 (196)
457 PF10255 Paf67: RNA polymerase 23.7 49 0.0011 29.4 1.7 30 7-36 163-192 (404)
458 COG1654 BirA Biotin operon rep 23.6 92 0.002 20.9 2.6 29 181-209 38-67 (79)
459 PF11817 Foie-gras_1: Foie gra 23.4 3.6E+02 0.0078 21.9 6.7 55 20-74 150-210 (247)
460 COG0300 DltE Short-chain dehyd 23.2 82 0.0018 26.3 2.8 20 185-204 77-98 (265)
461 smart00540 LEM in nuclear memb 23.2 53 0.0012 19.4 1.2 21 172-193 2-22 (44)
462 PRK11233 nitrogen assimilation 23.1 1.1E+02 0.0024 25.4 3.7 60 134-197 89-148 (305)
463 KOG2300|consensus 22.9 3.1E+02 0.0066 25.4 6.4 67 8-74 445-517 (629)
464 cd08437 PBP2_MleR The substrat 22.9 98 0.0021 22.9 3.1 25 171-196 31-55 (198)
465 cd08418 PBP2_TdcA The C-termin 22.9 1.1E+02 0.0024 22.5 3.4 25 171-196 31-55 (201)
466 PRK08057 cobalt-precorrin-6x r 22.8 1E+02 0.0022 25.4 3.3 25 173-198 48-73 (248)
467 cd08451 PBP2_BudR The C-termin 22.5 2.2E+02 0.0049 20.7 5.1 55 138-198 2-58 (199)
468 cd08411 PBP2_OxyR The C-termin 22.3 1.3E+02 0.0028 22.2 3.6 55 137-197 2-57 (200)
469 cd06822 PLPDE_III_YBL036c_euk 22.2 64 0.0014 26.2 2.0 60 146-208 66-135 (227)
470 PRK09508 leuO leucine transcri 22.0 1.1E+02 0.0023 25.6 3.4 58 134-196 110-167 (314)
471 KOG4563|consensus 22.0 1.5E+02 0.0032 26.1 4.1 53 9-61 42-102 (400)
472 KOG0529|consensus 21.8 1.5E+02 0.0033 26.4 4.3 70 1-70 102-177 (421)
473 KOG4079|consensus 21.7 61 0.0013 24.2 1.5 40 162-202 57-96 (169)
474 COG2909 MalT ATP-dependent tra 21.4 4.7E+02 0.01 25.9 7.6 69 7-75 414-491 (894)
475 PRK12469 RNA polymerase factor 21.4 1.2E+02 0.0025 27.8 3.6 34 160-194 426-459 (481)
476 PRK05234 mgsA methylglyoxal sy 21.3 1E+02 0.0022 23.0 2.8 33 180-212 65-99 (142)
477 TIGR00715 precor6x_red precorr 21.1 1.1E+02 0.0025 25.2 3.3 26 173-199 49-74 (256)
478 cd08417 PBP2_Nitroaromatics_li 20.7 1.1E+02 0.0023 22.6 2.9 28 168-196 28-55 (200)
479 PRK09558 ushA bifunctional UDP 20.6 1.1E+02 0.0025 28.1 3.5 40 161-202 202-255 (551)
480 cd08433 PBP2_Nac The C-teminal 20.5 1.9E+02 0.0042 21.1 4.3 25 172-197 32-56 (198)
481 COG4649 Uncharacterized protei 20.3 4.5E+02 0.0097 20.9 6.9 63 8-70 94-195 (221)
482 cd00860 ThrRS_anticodon ThrRS 20.3 1E+02 0.0023 20.1 2.5 38 180-217 18-55 (91)
483 KOG3807|consensus 20.2 3.9E+02 0.0085 23.5 6.3 53 19-73 195-247 (556)
484 cd08416 PBP2_MdcR The C-termin 20.2 1.4E+02 0.0031 21.8 3.5 26 171-197 31-56 (199)
485 cd08421 PBP2_LTTR_like_1 The C 20.1 1.8E+02 0.0039 21.3 4.1 25 171-196 31-55 (198)
No 1
>KOG4626|consensus
Probab=100.00 E-value=3.7e-52 Score=363.81 Aligned_cols=211 Identities=64% Similarity=0.990 Sum_probs=198.0
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhhhcCCC
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNRL 81 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l~~~~~ 81 (222)
..+|.++++|.|||.+|.+.|+.++||+.|++||+++|++++++.|++++++-.|+|.+....++++.+++.++++..++
T Consensus 450 ~~nPt~AeAhsNLasi~kDsGni~~AI~sY~~aLklkPDfpdA~cNllh~lq~vcdw~D~d~~~~kl~sivrdql~~~rl 529 (966)
T KOG4626|consen 450 QINPTFAEAHSNLASIYKDSGNIPEAIQSYRTALKLKPDFPDAYCNLLHCLQIVCDWTDYDKRMKKLVSIVRDQLEKNRL 529 (966)
T ss_pred hcCcHHHHHHhhHHHHhhccCCcHHHHHHHHHHHccCCCCchhhhHHHHHHHHHhcccchHHHHHHHHHHHHHHHhhhcC
Confidence 46899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCccccccccCCHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCCCCcccc---hhccccceeeEeeccccccCCCchhh
Q psy17695 82 PSVHPHHSMLYPLTHEYRKAIAARHANLCLLKAMRAKVWQARTESPLFCVQ---RYAQRLESLYKVMWDRYSQNLPVTHI 158 (222)
Q Consensus 82 ~~~~p~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~rLRIGYvS~~d~~f~~H~v~~l 158 (222)
|++.||+.+++|+++..++.+++.|+..|..++.. ...+|+.++. .+++|||||||| +| |.+|||++|
T Consensus 530 psvhP~hsm~ypl~~~~~~aia~k~a~~c~~~~~~------~~k~pyth~~~l~~~~~rlrIGYvS-sD--FgnHp~Shl 600 (966)
T KOG4626|consen 530 PSVHPHHSMLYPLSHILRKAIAAKHANLCLDKVHV------LGKPPYTHPDNLKVKEGRLRIGYVS-SD--FGNHPTSHL 600 (966)
T ss_pred CccCcccccccccchHHHHHHHHHHhhhhHHHHHh------ccCCCCCChhhCCCCcCceEEEeec-cc--ccCCchHHH
Confidence 99999999999999999999999999999887652 3445665553 356999999999 99 999999999
Q ss_pred H---------------------------HHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeCCCCCCCcchhhhhh
Q psy17695 159 T---------------------------QAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMNGYTKGARNEIFAL 211 (222)
Q Consensus 159 ~---------------------------~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL~G~t~~~r~~i~a~ 211 (222)
| |.++.+..+||+|++++++ ..+|+.|++|+|||||+|+|||+|+|.+|||+
T Consensus 601 mqsv~gmHdr~kveVfcYals~~d~t~fR~kv~~e~ehf~Dls~i~~-~kiA~~I~qD~I~ILvnlnGyTkgarneifAl 679 (966)
T KOG4626|consen 601 MQSVPGMHDRSKVEVFCYALSVNDGTNFRDKVMKEAEHFVDLSQIPC-NKIADKIRQDKIHILVNLNGYTKGARNEIFAL 679 (966)
T ss_pred hccCcCcCCccceEEEEEEeecCCCchHHHHHHhhccceeehhcCCh-HHHHHHHhhcCceEEEeccccccccccceeec
Confidence 9 7899999999999999998 79999999999999999999999999999999
Q ss_pred CCchhhhcccC
Q psy17695 212 RPAPIQSSFTA 222 (222)
Q Consensus 212 rpAPvQv~~lG 222 (222)
|||||||+|+|
T Consensus 680 rPAPIQv~wlG 690 (966)
T KOG4626|consen 680 RPAPIQVMWLG 690 (966)
T ss_pred cCCceeEEeec
Confidence 99999999998
No 2
>COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=2.1e-36 Score=264.22 Aligned_cols=213 Identities=22% Similarity=0.238 Sum_probs=161.3
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHH-HHHhCCCCHHHHHHH------HHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRT-ALKLKPDFPDAYCNL------AHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~-Al~l~P~~~~a~~~l------~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
+..+|+++.++.+||.++...|....++..+.. +.++.|++.+....+ +..+..+++..++.....+++.+.|
T Consensus 94 l~~~~~~~~~~~~L~~ale~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~l~~~~d~~p 173 (620)
T COG3914 94 LSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHLIRFYQLGRYLKLLGRTAEAELALERAVDLLP 173 (620)
T ss_pred HhcCcccchHHHHHHHHHHHhhhHHHHHHHHHHHHHhcCcchHHHHhhHHHHHHHHHHHHHhccHHHHHHHHHHHHHhhh
Confidence 356889999999999999988887777766655 899999998887777 6666677777777777788777777
Q ss_pred hhhhc-----------CCCC---------------CCCccccccccCCHHHHHHHHHHHHHHHHHHHHHhhhhhhhcCCC
Q psy17695 74 EQLDK-----------NRLP---------------SVHPHHSMLYPLTHEYRKAIAARHANLCLLKAMRAKVWQARTESP 127 (222)
Q Consensus 74 ~~l~~-----------~~~~---------------~~~p~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 127 (222)
+...- .+-+ ...+|+...+.++|..++.++..+.+.+....... .+. +.
T Consensus 174 ~~~~~~~~~~~~r~~~cs~~~~~~tnl~~~~~~~~~~~~~~~~~~~d~p~~~l~~~~q~~~i~~~~~~~~----~~~-~~ 248 (620)
T COG3914 174 KYPRVLGALMTARQEQCSWPEEAPTNLLSQLALFLGIYGFSLAYLADDPLKNLAIANQLLKILSQNAPEL----VRF-PI 248 (620)
T ss_pred hhhhhHhHHHHHHHHhccchhccchHHHHHHHHHhhccccchhhhccchhhhHHHHHHHHHHhcccCCCc----ccc-cc
Confidence 64111 0000 01234444455667777777776655443111100 000 00
Q ss_pred C-cccchhccccceeeEeeccccccCCCchhhH---------------------------HHHHHhhhcceeecCCCCCh
Q psy17695 128 L-FCVQRYAQRLESLYKVMWDRYSQNLPVTHIT---------------------------QAKIAREAEHFLDLSQVPCN 179 (222)
Q Consensus 128 ~-~~~~~~~~rLRIGYvS~~d~~f~~H~v~~l~---------------------------~~~~~~~~~~~~~~~~~~~~ 179 (222)
+ .+-+..++||||||+| +| |++|+||+++ +.||+..+++|+++.+|+|
T Consensus 249 ~~~~~~~~~~rlRvGylS-~d--lr~Havg~l~~~v~e~hDRdkfEvfay~~g~~~~dal~~rI~a~~~~~~~~~~~dd- 324 (620)
T COG3914 249 RDENIKRNGKRLRVGYLS-SD--LRSHAVGFLLRWVFEYHDRDKFEVFAYSLGPPHTDALQERISAAVEKWYPIGRMDD- 324 (620)
T ss_pred chhhccccccceeEEEec-cc--cccchHHHHHHHHHHHhchhheEEEEEecCCCCchhHHHHHHHhhhheeccCCcCH-
Confidence 1 1123458899999999 99 9999999999 6899999999999999998
Q ss_pred HHHHHHHHhCCCcEEEeCCCCCCCcchhhhhhCCchhhhcccC
Q psy17695 180 GKAADRIHKDGIHILVNMNGYTKGARNEIFALRPAPIQSSFTA 222 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvDL~G~t~~~r~~i~a~rpAPvQv~~lG 222 (222)
.++|++|+.|||||||||+|||..+|++|||+|||||||||||
T Consensus 325 ~e~a~~I~~d~IdILvDl~g~T~d~r~~v~A~RpAPiqvswlG 367 (620)
T COG3914 325 AEIANAIRTDGIDILVDLDGHTVDTRCQVFAHRPAPIQVSWLG 367 (620)
T ss_pred HHHHHHHHhcCCeEEEeccCceeccchhhhhcCCCceEEeecc
Confidence 8999999999999999999999999999999999999999998
No 3
>PF13844 Glyco_transf_41: Glycosyl transferase family 41; PDB: 3PE4_C 3PE3_D 3TAX_C 2XGO_A 2JLB_B 2XGM_A 2VSY_B 2XGS_B 2VSN_A.
Probab=99.97 E-value=1.4e-31 Score=234.38 Aligned_cols=82 Identities=49% Similarity=0.736 Sum_probs=74.4
Q ss_pred cceeeEeeccccccCCCchhhH---------------------------HHHHHhhhcceeecCCCCChHHHHHHHHhCC
Q psy17695 138 LESLYKVMWDRYSQNLPVTHIT---------------------------QAKIAREAEHFLDLSQVPCNGKAADRIHKDG 190 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H~v~~l~---------------------------~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~ 190 (222)
||||||| +| |++|||++|+ |+++++.+++|+++++++++.++|++|++|+
T Consensus 1 LRIGyvS-~D--f~~Hpv~~l~~~ll~~hDr~rfev~~Ys~~~~d~~~~~~~~~~~~~~~~dl~~~~~~~~~A~~Ir~D~ 77 (468)
T PF13844_consen 1 LRIGYVS-SD--FRNHPVGHLLQPLLENHDRSRFEVFCYSLGPDDGDTFRQRIRAEADHFVDLSGLSDDAEAAQRIRADG 77 (468)
T ss_dssp EEEEEEE-S---SSSSHHHHHHHHHHHHHHHHTEEEEEEESS---SCHHHHHHHHHSEEEEEEEE-TTSHHHHHHHHHTT
T ss_pred CEEEEEC-Cc--chhCHHHHHHHHHHHhcCcccEEEEEEECCCCCCcHHHHHHHHhCCeEEECccCCcHHHHHHHHHHCC
Confidence 7999999 99 9999999999 6889999999999999873279999999999
Q ss_pred CcEEEeCCCCCCCcchhhhhhCCchhhhcccC
Q psy17695 191 IHILVNMNGYTKGARNEIFALRPAPIQSSFTA 222 (222)
Q Consensus 191 iDILvDL~G~t~~~r~~i~a~rpAPvQv~~lG 222 (222)
|||||||+|||.|+|++|||+|||||||+|||
T Consensus 78 IDILVDL~GhT~~~R~~ifA~RpAPVQvswlG 109 (468)
T PF13844_consen 78 IDILVDLSGHTAGNRLEIFALRPAPVQVSWLG 109 (468)
T ss_dssp -SEEEESSTSSTSSSHHHHHT-SSSEEEEESS
T ss_pred CCEEEeCCCcCCCCHhHHHhcCCcceEEEecC
Confidence 99999999999999999999999999999998
No 4
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=99.36 E-value=2.5e-12 Score=84.83 Aligned_cols=68 Identities=19% Similarity=0.268 Sum_probs=64.9
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC-CHHHHHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVC-DWTDYEARMKKLVSIVA 73 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~-~~~~a~~~~~~~~~~~~ 73 (222)
+++..|..+|.++...|++++|+.+|+++++++|+++.+++++|.++..+| ++++|.+.+++++++.|
T Consensus 1 e~a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~l~P 69 (69)
T PF13414_consen 1 ENAEAWYNLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGLAYMKLGKDYEEAIEDFEKALKLDP 69 (69)
T ss_dssp TSHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHST
T ss_pred CHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhCccHHHHHHHHHHHHHcCc
Confidence 478999999999999999999999999999999999999999999999999 79999999999998765
No 5
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=99.27 E-value=1.3e-11 Score=93.63 Aligned_cols=73 Identities=15% Similarity=0.200 Sum_probs=68.8
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.+|+++|.++..+|++++|+.+|+++++++|+++.+++++|.++..+|++++|.+.|++++++.|+
T Consensus 52 ~~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~~~a~~~lg~~l~~~g~~~eAi~~~~~Al~~~p~ 124 (144)
T PRK15359 52 MAQPWSWRAHIALAGTWMMLKEYTTAINFYGHALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYA 124 (144)
T ss_pred HcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCC
Confidence 5789999999999999999999999999999999999999999999999999999999999999999988876
No 6
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=99.22 E-value=3.5e-11 Score=91.33 Aligned_cols=72 Identities=19% Similarity=0.129 Sum_probs=68.4
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
..+|.++..|++||.++..+|++++|+.+|.+|+.++|+++.+++|+|.|+...|+.+.|.+.|+.++....
T Consensus 63 ~~Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~ddp~~~~~ag~c~L~lG~~~~A~~aF~~Ai~~~~ 134 (157)
T PRK15363 63 IYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAECYLACDNVCYAIKALKAVVRICG 134 (157)
T ss_pred HhCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHcCCHHHHHHHHHHHHHHhc
Confidence 468999999999999999999999999999999999999999999999999999999999999999998774
No 7
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=99.14 E-value=1.9e-10 Score=100.34 Aligned_cols=69 Identities=20% Similarity=0.219 Sum_probs=66.4
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHH---HHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDA---YCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a---~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
.+|+++++|+|+|.+|..+|++++|+.+|++|++++|++.++ |+|+|.+|..+|+.++|++++++++++
T Consensus 70 ~dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrALel 141 (453)
T PLN03098 70 ADVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTALRD 141 (453)
T ss_pred CCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 479999999999999999999999999999999999999965 999999999999999999999999986
No 8
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=99.07 E-value=3.6e-10 Score=73.54 Aligned_cols=63 Identities=22% Similarity=0.320 Sum_probs=59.3
Q ss_pred HHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 12 LFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 12 ~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+|..+.+.|++++|+.+|+++++.+|+++.+++.+|.++..+|++++|...|+++++..|+
T Consensus 1 ~~~a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~~P~ 63 (65)
T PF13432_consen 1 YALARALYQQGDYDEAIAAFEQALKQDPDNPEAWYLLGRILYQQGRYDEALAYYERALELDPD 63 (65)
T ss_dssp HHHHHHHHHCTHHHHHHHHHHHHHCCSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred ChHHHHHHHcCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence 468999999999999999999999999999999999999999999999999999999988875
No 9
>KOG4626|consensus
Probab=99.05 E-value=4.8e-10 Score=100.26 Aligned_cols=75 Identities=21% Similarity=0.276 Sum_probs=63.8
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
|..+|+++++++|||+++.+.|+..||++||.+||.+.|+++++.+|||++++++|..++|...|.++++..|+.
T Consensus 313 l~~~P~F~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~hadam~NLgni~~E~~~~e~A~~ly~~al~v~p~~ 387 (966)
T KOG4626|consen 313 LELQPNFPDAYNNLANALKDKGSVTEAVDCYNKALRLCPNHADAMNNLGNIYREQGKIEEATRLYLKALEVFPEF 387 (966)
T ss_pred HhcCCCchHHHhHHHHHHHhccchHHHHHHHHHHHHhCCccHHHHHHHHHHHHHhccchHHHHHHHHHHhhChhh
Confidence 456788888888888888888888888888888888888888888888888888888888888888888877763
No 10
>PRK11189 lipoprotein NlpI; Provisional
Probab=99.01 E-value=1.1e-09 Score=92.48 Aligned_cols=73 Identities=15% Similarity=0.174 Sum_probs=61.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.++|+++.+|+++|.++..+|++++|+.+|+++++++|++..++.++|.++...|++++|.+.++++++..|+
T Consensus 92 ~l~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~~~eA~~~~~~al~~~P~ 164 (296)
T PRK11189 92 ALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGRYELAQDDLLAFYQDDPN 164 (296)
T ss_pred HcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCC
Confidence 4678888888888888888888888888888888888888888888888888888888888888888877664
No 11
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=98.96 E-value=1.6e-09 Score=86.46 Aligned_cols=75 Identities=17% Similarity=0.217 Sum_probs=68.9
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHH-HHcCC--HHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCL-QIVCD--WTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~-~~~~~--~~~a~~~~~~~~~~~~~~ 75 (222)
|..+|+++++|..||.++...|++++|+.+|++|++++|+++..+.++|.++ ...|+ .++|.+.++++++..|+.
T Consensus 66 L~~~P~~~~~w~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~ 143 (198)
T PRK10370 66 IRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANE 143 (198)
T ss_pred HHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCC
Confidence 3568999999999999999999999999999999999999999999999986 66677 499999999999998873
No 12
>KOG0553|consensus
Probab=98.94 E-value=1.2e-09 Score=90.26 Aligned_cols=74 Identities=16% Similarity=0.280 Sum_probs=71.5
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|.++|.++..|.|.+.+|.++|.++.|++..++||++||.+..+|..||.++..+|++++|++.|++++.+.|+
T Consensus 108 I~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~~gk~~~A~~aykKaLeldP~ 181 (304)
T KOG0553|consen 108 IELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLALGKYEEAIEAYKKALELDPD 181 (304)
T ss_pred HhcCCCcchHHHHHHHHHHHhcchHHHHHHHHHHHhcChHHHHHHHHHHHHHHccCcHHHHHHHHHhhhccCCC
Confidence 46899999999999999999999999999999999999999999999999999999999999999999998886
No 13
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=98.91 E-value=3.9e-09 Score=79.97 Aligned_cols=79 Identities=14% Similarity=0.012 Sum_probs=69.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhhhcCCCCCCCcc
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNRLPSVHPH 87 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l~~~~~~~~~p~ 87 (222)
|..+.++|.++...|++++|+.+|++++.++|++..++.++|.++...|++++|+..|++++.+.|+. ...+
T Consensus 24 p~~~~~~g~~~~~~g~~~~A~~~~~~al~~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~--------~~a~ 95 (144)
T PRK15359 24 PETVYASGYASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYTTAINFYGHALMLDASH--------PEPV 95 (144)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCC--------cHHH
Confidence 44577899999999999999999999999999999999999999999999999999999999888763 3445
Q ss_pred ccccccC
Q psy17695 88 HSMLYPL 94 (222)
Q Consensus 88 ~~l~~~~ 94 (222)
.+++..+
T Consensus 96 ~~lg~~l 102 (144)
T PRK15359 96 YQTGVCL 102 (144)
T ss_pred HHHHHHH
Confidence 5555544
No 14
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=98.87 E-value=5.7e-09 Score=77.43 Aligned_cols=71 Identities=13% Similarity=0.116 Sum_probs=39.9
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.+|++..+...+|.++...|++++|+..|+++++++|+++.++.++|.++...|++++|...+++++...|
T Consensus 12 ~~p~~~~~~~~~a~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p 82 (135)
T TIGR02552 12 LDSEQLEQIYALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDP 82 (135)
T ss_pred CChhhHHHHHHHHHHHHHcccHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Confidence 44555555555555555555555555555555555555555555555555555555555555555554444
No 15
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=98.87 E-value=7.1e-09 Score=89.89 Aligned_cols=74 Identities=18% Similarity=0.166 Sum_probs=70.8
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|..+|+++.+++++|.+|..+|++++|+..+++|++++|+++.+++++|.++..+|++++|...|++++.+.|+
T Consensus 29 l~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a~~~lg~~~~~lg~~~eA~~~~~~al~l~P~ 102 (356)
T PLN03088 29 IDLDPNNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLRKGTACMKLEEYQTAKAALEKGASLAPG 102 (356)
T ss_pred HHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCCC
Confidence 35789999999999999999999999999999999999999999999999999999999999999999988875
No 16
>PRK12370 invasion protein regulator; Provisional
Probab=98.85 E-value=8.3e-09 Score=94.43 Aligned_cols=74 Identities=7% Similarity=-0.091 Sum_probs=71.3
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.++|+++.+|..+|.++...|++++|+.+|++|++++|+++.+++++|.++...|++++|...+++++++.|.
T Consensus 331 l~ldP~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~G~~~eAi~~~~~Al~l~P~ 404 (553)
T PRK12370 331 TELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINECLKLDPT 404 (553)
T ss_pred HhcCCCCHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCC
Confidence 45799999999999999999999999999999999999999999999999999999999999999999998886
No 17
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=98.81 E-value=1.8e-08 Score=80.34 Aligned_cols=73 Identities=14% Similarity=0.034 Sum_probs=67.9
Q ss_pred CCCCCCHHHHHHHHHhc-ccCCC--cHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSF-PSPGN--IPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l-~~~g~--~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.++|+++..+.++|.++ ...|+ .++|++.++++++++|+++.+++++|..+...|++++|+..++++++..|.
T Consensus 101 ~l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~~~al~~LA~~~~~~g~~~~Ai~~~~~aL~l~~~ 176 (198)
T PRK10370 101 QLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALMLLASDAFMQADYAQAIELWQKVLDLNSP 176 (198)
T ss_pred HhCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCChhHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCC
Confidence 57899999999999975 67787 599999999999999999999999999999999999999999999988774
No 18
>KOG1125|consensus
Probab=98.77 E-value=9.1e-09 Score=91.26 Aligned_cols=74 Identities=16% Similarity=0.178 Sum_probs=67.3
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|..+|++...|+.||.+|..-.+.+||+.+|++|++|.|+++.+++|||.+++.+|.+++|...+-.++.+.++
T Consensus 457 L~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNlG~ykEA~~hlL~AL~mq~k 530 (579)
T KOG1125|consen 457 LQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCMNLGAYKEAVKHLLEALSMQRK 530 (579)
T ss_pred HhcCCchHHHHHHhhHHhcCCcccHHHHHHHHHHHhcCCCeeeeehhhhhhhhhhhhHHHHHHHHHHHHHhhhc
Confidence 45689999999999999999999999999999999999999999999999999999999999999888888876
No 19
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=98.76 E-value=8.5e-09 Score=69.43 Aligned_cols=69 Identities=22% Similarity=0.313 Sum_probs=59.7
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhC-------CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLK-------PDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~-------P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
|+-+.+++++|.+|..+|++++|+++|++++++. |..+.++.++|.++...|++++|++.+++++++..
T Consensus 2 ~~~a~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~~ 77 (78)
T PF13424_consen 2 PDTANAYNNLARVYRELGRYDEALDYYEKALDIEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKALDIFE 77 (78)
T ss_dssp HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhhc
Confidence 4567889999999999999999999999999772 22356799999999999999999999999998764
No 20
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=98.76 E-value=1.4e-08 Score=77.17 Aligned_cols=70 Identities=10% Similarity=0.028 Sum_probs=67.3
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++.-+..+.+|..+...|++++|+..|+....++|.+...|++||.|++.+|+|++|+.+|..+..+.|+
T Consensus 32 ~~~l~~lY~~A~~ly~~G~l~~A~~~f~~L~~~Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~d 101 (157)
T PRK15363 32 TQPLNTLYRYAMQLMEVKEFAGAARLFQLLTIYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKID 101 (157)
T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCC
Confidence 6677899999999999999999999999999999999999999999999999999999999999998886
No 21
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=98.75 E-value=1.9e-08 Score=80.44 Aligned_cols=100 Identities=15% Similarity=0.088 Sum_probs=73.9
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhhhcCC
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNR 80 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l~~~~ 80 (222)
|..+|++..+|..++.+|..+|..+-|.+.|++|++++|++.+.++|.|..+..+|.+++|...|++++. .|..
T Consensus 62 L~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdVLNNYG~FLC~qg~~~eA~q~F~~Al~-~P~Y----- 135 (250)
T COG3063 62 LEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERALA-DPAY----- 135 (250)
T ss_pred HHhCcccHHHHHHHHHHHHHcCChhhHHHHHHHHHhcCCCccchhhhhhHHHHhCCChHHHHHHHHHHHh-CCCC-----
Confidence 4568888899999999999999999999999999999999999999999888888888888888887762 2221
Q ss_pred CCCCCccccccccCCHHHHHHHHHHH
Q psy17695 81 LPSVHPHHSMLYPLTHEYRKAIAARH 106 (222)
Q Consensus 81 ~~~~~p~~~l~~~~~~~~~~~~a~~~ 106 (222)
.....+|.|++++..-..+...++.+
T Consensus 136 ~~~s~t~eN~G~Cal~~gq~~~A~~~ 161 (250)
T COG3063 136 GEPSDTLENLGLCALKAGQFDQAEEY 161 (250)
T ss_pred CCcchhhhhhHHHHhhcCCchhHHHH
Confidence 11134566777765543333444444
No 22
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=98.70 E-value=2.4e-08 Score=59.90 Aligned_cols=43 Identities=33% Similarity=0.400 Sum_probs=40.9
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAH 50 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~ 50 (222)
|.+|..+|.+|..+|++++|+++|+++++.+|+++.+|..|+.
T Consensus 1 p~~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~La~ 43 (44)
T PF13428_consen 1 PAAWLALARAYRRLGQPDEAERLLRRALALDPDDPEAWRALAQ 43 (44)
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHhhh
Confidence 4689999999999999999999999999999999999999985
No 23
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.70 E-value=4.1e-08 Score=65.07 Aligned_cols=61 Identities=18% Similarity=0.368 Sum_probs=57.6
Q ss_pred HHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 14 SNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 14 lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|..+|...+++++|+.+++++++++|+++..+..+|.++..+|++++|.+.++++++..|+
T Consensus 1 l~~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~p~ 61 (73)
T PF13371_consen 1 LKQIYLQQEDYEEALEVLERALELDPDDPELWLQRARCLFQLGRYEEALEDLERALELSPD 61 (73)
T ss_pred CHHHHHhCCCHHHHHHHHHHHHHhCcccchhhHHHHHHHHHhccHHHHHHHHHHHHHHCCC
Confidence 4578899999999999999999999999999999999999999999999999999988775
No 24
>PRK11189 lipoprotein NlpI; Provisional
Probab=98.69 E-value=6.5e-08 Score=81.82 Aligned_cols=70 Identities=19% Similarity=0.153 Sum_probs=66.6
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..+..|+++|.++...|++++|+..|+++++++|+++.+++++|.++...|++++|.+.|++++++.|+.
T Consensus 62 ~~a~~~~~~g~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~ 131 (296)
T PRK11189 62 ERAQLHYERGVLYDSLGLRALARNDFSQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTY 131 (296)
T ss_pred hhHHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCC
Confidence 3488999999999999999999999999999999999999999999999999999999999999988863
No 25
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=98.69 E-value=5.7e-08 Score=71.99 Aligned_cols=73 Identities=23% Similarity=0.255 Sum_probs=69.8
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.+|.++|.++..+|++++|+..|+++++++|+++..++++|.++...|++++|...++++++..|+
T Consensus 45 ~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~ 117 (135)
T TIGR02552 45 AYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALKALDLAIEICGE 117 (135)
T ss_pred HhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccc
Confidence 3589999999999999999999999999999999999999999999999999999999999999999988875
No 26
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=98.68 E-value=5e-08 Score=94.20 Aligned_cols=73 Identities=16% Similarity=0.050 Sum_probs=57.5
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.++|+++.+++++|.++...|++++|+++|++|++++|+++.++.++|.++...|++++|+..+++++++.|+
T Consensus 637 ~l~Pd~~~a~~nLG~aL~~~G~~eeAi~~l~~AL~l~P~~~~a~~nLA~al~~lGd~~eA~~~l~~Al~l~P~ 709 (987)
T PRK09782 637 ELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKGLPDDPALIRQLAYVNQRLDDMAATQHYARLVIDDIDN 709 (987)
T ss_pred HhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCC
Confidence 4577888888888888888888888888888888888888888888888888888888888888777776664
No 27
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=98.60 E-value=1.2e-07 Score=76.98 Aligned_cols=84 Identities=15% Similarity=0.166 Sum_probs=72.7
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhhhcCCCC
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDKNRLP 82 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l~~~~~~ 82 (222)
.+|.+.+.+..+|..+...|++.+|+..++++.+++|+++++|+.+|.+|.+.|+.++|...|.+++++.++.
T Consensus 95 ~~~~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~------- 167 (257)
T COG5010 95 AYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNE------- 167 (257)
T ss_pred cCcccHHHHHHHHHHHHHhcchHHHHHHHHHHhccCCCChhhhhHHHHHHHHccChhHHHHHHHHHHHhccCC-------
Confidence 3577888888899999999999999999999999999999999999999999999999999999999998862
Q ss_pred CCCccccccccC
Q psy17695 83 SVHPHHSMLYPL 94 (222)
Q Consensus 83 ~~~p~~~l~~~~ 94 (222)
..+..|+.+.+
T Consensus 168 -p~~~nNlgms~ 178 (257)
T COG5010 168 -PSIANNLGMSL 178 (257)
T ss_pred -chhhhhHHHHH
Confidence 34566666543
No 28
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=98.59 E-value=1.8e-07 Score=87.34 Aligned_cols=70 Identities=14% Similarity=0.145 Sum_probs=54.7
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
..+|+++.+++.+|.++.++|++++|+++|+++++-+|+++.++.++|++++..|+.++|...|++++..
T Consensus 148 ~~~p~~~~~~~~~a~~l~~~g~~~~A~~~y~~~~~~~p~~~~~~~~~a~~l~~~G~~~~A~~~~~~a~~~ 217 (694)
T PRK15179 148 SGGSSSAREILLEAKSWDEIGQSEQADACFERLSRQHPEFENGYVGWAQSLTRRGALWRARDVLQAGLDA 217 (694)
T ss_pred hcCCCCHHHHHHHHHHHHHhcchHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 4577788888888888888888888888888888777777888888888888888888887777777654
No 29
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=98.58 E-value=2.2e-07 Score=75.52 Aligned_cols=69 Identities=25% Similarity=0.236 Sum_probs=65.5
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
.++|+++++|+.+|.+|.+.|++++|...|.+|+++.|+++....|+|..+.-.|+.+.|+..+..+..
T Consensus 128 ~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~p~~~nNlgms~~L~gd~~~A~~lll~a~l 196 (257)
T COG5010 128 RLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAETLLLPAYL 196 (257)
T ss_pred ccCCCChhhhhHHHHHHHHccChhHHHHHHHHHHHhccCCchhhhhHHHHHHHcCCHHHHHHHHHHHHh
Confidence 578999999999999999999999999999999999999999999999999999999999998877654
No 30
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=98.57 E-value=7.6e-08 Score=62.86 Aligned_cols=57 Identities=26% Similarity=0.404 Sum_probs=50.9
Q ss_pred cccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 18 FPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 18 l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+.|++++|+..|+++++.+|++..++..++.++...|++++|...++++....|+
T Consensus 1 ll~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~ 57 (68)
T PF14559_consen 1 LLKQGDYDEAIELLEKALQRNPDNPEARLLLAQCYLKQGQYDEAEELLERLLKQDPD 57 (68)
T ss_dssp HHHTTHHHHHHHHHHHHHHHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTT
T ss_pred ChhccCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence 357899999999999999999999999999999999999999999988887766654
No 31
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=98.55 E-value=7.4e-08 Score=54.50 Aligned_cols=33 Identities=30% Similarity=0.491 Sum_probs=31.3
Q ss_pred HHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHH
Q psy17695 30 SYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYE 62 (222)
Q Consensus 30 ~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~ 62 (222)
+|++||+++|+++.+|++||.++...|++++|+
T Consensus 1 ~y~kAie~~P~n~~a~~nla~~~~~~g~~~~A~ 33 (34)
T PF13431_consen 1 CYKKAIELNPNNAEAYNNLANLYLNQGDYEEAI 33 (34)
T ss_pred ChHHHHHHCCCCHHHHHHHHHHHHHCcCHHhhc
Confidence 589999999999999999999999999999886
No 32
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=98.54 E-value=1.5e-07 Score=91.00 Aligned_cols=73 Identities=16% Similarity=0.216 Sum_probs=69.3
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..+|+ +.++.++|.++.++|++++|+.+|+++++++|+++.++.++|.++...|++++|++.++++++..|+.
T Consensus 604 ~l~P~-~~a~~~LA~~l~~lG~~deA~~~l~~AL~l~Pd~~~a~~nLG~aL~~~G~~eeAi~~l~~AL~l~P~~ 676 (987)
T PRK09782 604 NIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKGLPDD 676 (987)
T ss_pred HhCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCC
Confidence 46786 99999999999999999999999999999999999999999999999999999999999999988864
No 33
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=98.53 E-value=2.1e-07 Score=62.41 Aligned_cols=64 Identities=27% Similarity=0.389 Sum_probs=40.3
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
+++++|.++...|++++|+..++++++..|++..++..+|.++...+++++|...+++++...+
T Consensus 2 ~~~~~a~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 65 (100)
T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDP 65 (100)
T ss_pred HHHHHHHHHHHHhcHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 3556666666666666666666666666666666666666666666666666666666555443
No 34
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=98.52 E-value=2.9e-07 Score=85.25 Aligned_cols=72 Identities=19% Similarity=0.225 Sum_probs=48.8
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++|+++.+|.++|.++..+|++++|+..|+++++++|+++.+++++|.++...|++++|..+|++++.+.|+
T Consensus 360 l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~p~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l~P~ 431 (615)
T TIGR00990 360 LDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPD 431 (615)
T ss_pred cCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCcc
Confidence 466666666666666666666666666666666666666666666666666666666666666666665554
No 35
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=98.51 E-value=3e-07 Score=85.21 Aligned_cols=73 Identities=14% Similarity=0.033 Sum_probs=63.3
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.+++++|.++..+|++++|+.+|+++++++|++..++.++|.++..+|++++|...+++++...|+
T Consensus 393 ~~~p~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~P~ 465 (615)
T TIGR00990 393 KLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPE 465 (615)
T ss_pred HhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCccCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCC
Confidence 4578888899999999999999999999999999999998888888998888888888888888888876664
No 36
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=98.51 E-value=1.3e-07 Score=53.26 Aligned_cols=34 Identities=26% Similarity=0.398 Sum_probs=31.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF 41 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~ 41 (222)
+.+|+++|.++..+|++++|+.+|++|++++|++
T Consensus 1 a~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 34 (34)
T PF00515_consen 1 AEAYYNLGNAYFQLGDYEEALEYYQRALELDPDN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTH
T ss_pred CHHHHHHHHHHHHhCCchHHHHHHHHHHHHCcCC
Confidence 5789999999999999999999999999999974
No 37
>KOG1126|consensus
Probab=98.50 E-value=6e-08 Score=87.28 Aligned_cols=74 Identities=18% Similarity=0.232 Sum_probs=69.3
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.-+|+.|+.|..+|++|.-+++++.|+++|+||+++||+++.++..+|+-+....+++.|..+|+.++...|++
T Consensus 415 ~~~~~sPesWca~GNcfSLQkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~~rh 488 (638)
T KOG1126|consen 415 DTDPNSPESWCALGNCFSLQKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFDKAMKSFRKALGVDPRH 488 (638)
T ss_pred hhCCCCcHHHHHhcchhhhhhHHHHHHHHHHHhhccCCccchhhhhcCChhhhhHHHHhHHHHHHhhhcCCchh
Confidence 45799999999999999999999999999999999999999999999999999999999999999999888875
No 38
>PRK12370 invasion protein regulator; Provisional
Probab=98.49 E-value=2.8e-07 Score=84.43 Aligned_cols=75 Identities=13% Similarity=0.002 Sum_probs=68.2
Q ss_pred CCCCCCCHHHHHHHHHhcccC---------CCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSP---------GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~---------g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
+.++|+++.+|.++|.++..+ +++++|+.++++|++++|+++.++..+|.++...|++++|++.+++++++
T Consensus 288 l~ldP~~a~a~~~La~~~~~~~~~g~~~~~~~~~~A~~~~~~Al~ldP~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l 367 (553)
T PRK12370 288 VNMSPNSIAPYCALAECYLSMAQMGIFDKQNAMIKAKEHAIKATELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANLL 367 (553)
T ss_pred HhcCCccHHHHHHHHHHHHHHHHcCCcccchHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHccCHHHHHHHHHHHHHh
Confidence 467999999999999877633 45899999999999999999999999999999999999999999999999
Q ss_pred HHhh
Q psy17695 72 VAEQ 75 (222)
Q Consensus 72 ~~~~ 75 (222)
.|+.
T Consensus 368 ~P~~ 371 (553)
T PRK12370 368 SPIS 371 (553)
T ss_pred CCCC
Confidence 8863
No 39
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=98.48 E-value=6.7e-07 Score=64.20 Aligned_cols=66 Identities=18% Similarity=0.288 Sum_probs=34.9
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
+.+++.+|.++...|++++|+..|+++++..|++ +.++.++|.++...+++++|...+++++...|
T Consensus 39 ~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p 107 (119)
T TIGR02795 39 PNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKRYP 107 (119)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCCCcccHHHHHHHHHHHHhCChHHHHHHHHHHHHHCc
Confidence 3455555555555555555555555555555543 34455555555555555555555555554444
No 40
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=98.47 E-value=5.2e-07 Score=84.26 Aligned_cols=73 Identities=16% Similarity=0.126 Sum_probs=48.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.++.++|.++..+|++++|+..|+++++++|+++.++.+++.++...|++++|.+.+++++...|+
T Consensus 278 ~l~P~~~~a~~~lg~~l~~~g~~~eA~~~l~~al~l~P~~~~a~~~La~~l~~~G~~~eA~~~l~~al~~~P~ 350 (656)
T PRK15174 278 QFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPYVRAMYARALRQVGQYTAASDEFVQLAREKGV 350 (656)
T ss_pred hhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcc
Confidence 3456666667777777777777777777777777777777666666666666666776666666666655443
No 41
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=98.46 E-value=3.9e-07 Score=85.12 Aligned_cols=73 Identities=8% Similarity=-0.033 Sum_probs=65.3
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...|++..++.+++.++.++++++||+.+++++++.+|+++.+++++|.++.++|++++|++.|++++...|+
T Consensus 114 ~~~Pd~~~a~~~~a~~L~~~~~~eeA~~~~~~~l~~~p~~~~~~~~~a~~l~~~g~~~~A~~~y~~~~~~~p~ 186 (694)
T PRK15179 114 QRFPDSSEAFILMLRGVKRQQGIEAGRAEIELYFSGGSSSAREILLEAKSWDEIGQSEQADACFERLSRQHPE 186 (694)
T ss_pred hhCCCcHHHHHHHHHHHHHhccHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHhcchHHHHHHHHHHHhcCCC
Confidence 4679999999999999999999999999999999999999999999999999999999999999999875543
No 42
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=98.44 E-value=1.4e-06 Score=67.33 Aligned_cols=67 Identities=18% Similarity=0.070 Sum_probs=47.7
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+..++++|.++...|++++|+..|++++++.|+. +.++.++|.++...|++++|...+++++++.|.
T Consensus 35 a~~~~~~g~~~~~~g~~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~ 104 (168)
T CHL00033 35 AFTYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPF 104 (168)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcC
Confidence 5666777777777777777777777777776653 346777777777777777777777777766554
No 43
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=98.43 E-value=4.4e-07 Score=72.70 Aligned_cols=75 Identities=17% Similarity=0.147 Sum_probs=68.7
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL--KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
|+++|++.++++|.|..|..+|++++|.+.|++|+.. =|..+.++-|+|.|..+.|+.+.|++.++++++..|+.
T Consensus 96 lsl~p~~GdVLNNYG~FLC~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~ 172 (250)
T COG3063 96 LSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQF 172 (250)
T ss_pred HhcCCCccchhhhhhHHHHhCCChHHHHHHHHHHHhCCCCCCcchhhhhhHHHHhhcCCchhHHHHHHHHHHhCcCC
Confidence 5689999999999999999999999999999999983 24456899999999999999999999999999998874
No 44
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=98.43 E-value=3.8e-07 Score=51.09 Aligned_cols=34 Identities=26% Similarity=0.436 Sum_probs=31.3
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF 41 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~ 41 (222)
|++|+.+|.++..+|++++|+.+|+++++++|++
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~al~l~p~~ 34 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKALELDPNN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTS
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHHHHHCcCC
Confidence 5789999999999999999999999999999986
No 45
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=98.42 E-value=6.1e-07 Score=74.52 Aligned_cols=74 Identities=20% Similarity=0.279 Sum_probs=66.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCC---HHHHHHHHHHHHHHHHhh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCD---WTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~---~~~a~~~~~~~~~~~~~~ 75 (222)
..||++++.|..||.+|..+|+++.|..+|++|+++.|++++.+..++.++....+ -.++...++++++..|..
T Consensus 150 ~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll~~al~~D~~~ 226 (287)
T COG4235 150 QQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALALDPAN 226 (287)
T ss_pred HhCCCCchhHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCcccHHHHHHHHHHHhcCCcc
Confidence 46999999999999999999999999999999999999999999999999887654 566788888888887764
No 46
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=98.41 E-value=1.2e-06 Score=69.37 Aligned_cols=66 Identities=20% Similarity=0.240 Sum_probs=36.8
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
+|++..++..+|.++..+|++++|+..|+++++++|++..++.+++.++...|++++|...+++++
T Consensus 61 ~p~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~g~~~~A~~~~~~~~ 126 (234)
T TIGR02521 61 DPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAI 126 (234)
T ss_pred CcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcccHHHHHHHHHHHH
Confidence 455555555555555555555555555555555555555555555555555555555555555544
No 47
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=98.41 E-value=2.4e-07 Score=63.36 Aligned_cols=61 Identities=25% Similarity=0.317 Sum_probs=55.5
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKL 68 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~ 68 (222)
+...++.+|.++.++|++++|+..+++ ++.+|.+...++.+|.|+..+|+|++|+..++++
T Consensus 24 ~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~~~~~~~~~~l~a~~~~~l~~y~eAi~~l~~~ 84 (84)
T PF12895_consen 24 NSAYLYNLAQCYFQQGKYEEAIELLQK-LKLDPSNPDIHYLLARCLLKLGKYEEAIKALEKA 84 (84)
T ss_dssp HHHHHHHHHHHHHHTTHHHHHHHHHHC-HTHHHCHHHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHCCCHHHHHHHHHH-hCCCCCCHHHHHHHHHHHHHhCCHHHHHHHHhcC
Confidence 577888999999999999999999999 9999999999999999999999999999988764
No 48
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=98.41 E-value=1e-06 Score=68.45 Aligned_cols=72 Identities=17% Similarity=0.116 Sum_probs=65.3
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+|..+.+++++|..+...|++++|+.+|++++++.|+. ..++.++|.++...|++++|...+++++...|+
T Consensus 30 ~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~ 104 (172)
T PRK02603 30 KKAKEAFVYYRDGMSAQADGEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPK 104 (172)
T ss_pred cHhhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcc
Confidence 345778899999999999999999999999999998774 468999999999999999999999999988775
No 49
>KOG1126|consensus
Probab=98.37 E-value=5.3e-07 Score=81.33 Aligned_cols=74 Identities=8% Similarity=0.066 Sum_probs=66.4
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|..+|++-.||+.||.+|.++++++.|+-.|++|++++|.+....+.+|.++.+.|+.++|...+++++.+.|.
T Consensus 482 l~~~~rhYnAwYGlG~vy~Kqek~e~Ae~~fqkA~~INP~nsvi~~~~g~~~~~~k~~d~AL~~~~~A~~ld~k 555 (638)
T KOG1126|consen 482 LGVDPRHYNAWYGLGTVYLKQEKLEFAEFHFQKAVEINPSNSVILCHIGRIQHQLKRKDKALQLYEKAIHLDPK 555 (638)
T ss_pred hcCCchhhHHHHhhhhheeccchhhHHHHHHHhhhcCCccchhHHhhhhHHHHHhhhhhHHHHHHHHHHhcCCC
Confidence 45688899999999999999999999999999999999999999999999999999999999999999888775
No 50
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=98.36 E-value=1.4e-06 Score=81.42 Aligned_cols=73 Identities=12% Similarity=0.003 Sum_probs=69.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHH----HHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPE----AIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~e----A~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.+++++|.++..+|++++ |+..|+++++++|+++.++.++|.++...|++++|...+++++++.|+
T Consensus 240 ~~~p~~~~~~~~Lg~~l~~~G~~~eA~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~~~eA~~~l~~al~l~P~ 316 (656)
T PRK15174 240 ARGLDGAALRRSLGLAYYQSGRSREAKLQAAEHWRHALQFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPD 316 (656)
T ss_pred hcCCCCHHHHHHHHHHHHHcCCchhhHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCC
Confidence 4679999999999999999999996 899999999999999999999999999999999999999999988775
No 51
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=98.33 E-value=5.4e-07 Score=58.34 Aligned_cols=41 Identities=27% Similarity=0.348 Sum_probs=37.4
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP 42 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~ 42 (222)
..+|+++.+|+.+|.++..+|++++|+..|+++++++|+++
T Consensus 25 ~~~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~~P~~p 65 (65)
T PF13432_consen 25 KQDPDNPEAWYLLGRILYQQGRYDEALAYYERALELDPDNP 65 (65)
T ss_dssp CCSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-H
T ss_pred HHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCC
Confidence 57899999999999999999999999999999999999985
No 52
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=98.32 E-value=1.6e-06 Score=57.98 Aligned_cols=72 Identities=21% Similarity=0.260 Sum_probs=66.0
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
...|.+..++..+|.++...|++++|+.+|++++++.|.+..++..++.++...+++++|...+.+++...|
T Consensus 28 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 99 (100)
T cd00189 28 ELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEKALELDP 99 (100)
T ss_pred hcCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhHHHHHHHHHHHHHhHHHHHHHHHHHHccCC
Confidence 457888899999999999999999999999999999999999999999999999999999998888775443
No 53
>KOG1125|consensus
Probab=98.32 E-value=9.4e-07 Score=78.72 Aligned_cols=70 Identities=11% Similarity=0.042 Sum_probs=67.8
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.+++.+..||++|...|+|+.|++||+.||..+|++...|+.||..+..-.+.++|+..|++++.+.|..
T Consensus 428 ~DpdvQ~~LGVLy~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~y 497 (579)
T KOG1125|consen 428 IDPDVQSGLGVLYNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGY 497 (579)
T ss_pred CChhHHhhhHHHHhcchHHHHHHHHHHHHHhcCCchHHHHHHhhHHhcCCcccHHHHHHHHHHHhcCCCe
Confidence 7899999999999999999999999999999999999999999999999999999999999999999975
No 54
>KOG4162|consensus
Probab=98.31 E-value=2.1e-06 Score=78.71 Aligned_cols=83 Identities=14% Similarity=0.141 Sum_probs=74.9
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHH--HHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhhhc
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQ--SYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLDK 78 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~--~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l~~ 78 (222)
+.+||+++.....+|.+|.+.|+-.-|.. .+..|++++|.++++|+++|.+++..|+.++|.++|+.++++.+..
T Consensus 711 l~ldP~hv~s~~Ala~~lle~G~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k~~Gd~~~Aaecf~aa~qLe~S~--- 787 (799)
T KOG4162|consen 711 LALDPDHVPSMTALAELLLELGSPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFKKLGDSKQAAECFQAALQLEESN--- 787 (799)
T ss_pred HhcCCCCcHHHHHHHHHHHHhCCcchHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHccchHHHHHHHHHHHhhccCC---
Confidence 45799999999999999999999888888 9999999999999999999999999999999999999999887752
Q ss_pred CCCCCCCccccc
Q psy17695 79 NRLPSVHPHHSM 90 (222)
Q Consensus 79 ~~~~~~~p~~~l 90 (222)
.+.||.++
T Consensus 788 ----PV~pFs~i 795 (799)
T KOG4162|consen 788 ----PVLPFSNI 795 (799)
T ss_pred ----Cccccccc
Confidence 36788664
No 55
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=98.31 E-value=2.3e-06 Score=71.16 Aligned_cols=72 Identities=15% Similarity=0.085 Sum_probs=54.2
Q ss_pred CCCCC---HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC---CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFF---KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD---FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~---~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~---~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..|+. +.+++.+|.+|...|++++|+..|+++++..|+ .+++++.+|.++..+|++++|...|+++++..|+
T Consensus 172 ~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~ 249 (263)
T PRK10803 172 KYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPG 249 (263)
T ss_pred HCcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcC
Confidence 35655 577888888888888888888888888877666 4677777788887888888888888777776665
No 56
>PRK15331 chaperone protein SicA; Provisional
Probab=98.29 E-value=1.2e-06 Score=67.09 Aligned_cols=69 Identities=7% Similarity=-0.038 Sum_probs=55.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
+.+|.++..|+.||.++..++++++|+.+|..|..++++++...+..|.|+..+++.+.|..+|..++.
T Consensus 65 ~~d~~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~~~dp~p~f~agqC~l~l~~~~~A~~~f~~a~~ 133 (165)
T PRK15331 65 IYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMRKAAKARQCFELVNE 133 (165)
T ss_pred HhCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccCCCCccchHHHHHHHhCCHHHHHHHHHHHHh
Confidence 357778888888888888888888888888888888888888888888888888888888887776664
No 57
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=98.28 E-value=4.2e-06 Score=59.95 Aligned_cols=67 Identities=18% Similarity=0.194 Sum_probs=62.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+..++.+|..+...|++++|+..|+++++.+|++ +.+++.+|.++...+++++|...+++++...|+
T Consensus 2 ~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~ 71 (119)
T TIGR02795 2 EEAYYDAALLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPK 71 (119)
T ss_pred cHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCC
Confidence 4678999999999999999999999999999987 578999999999999999999999999987765
No 58
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=98.27 E-value=4.1e-06 Score=64.77 Aligned_cols=68 Identities=16% Similarity=0.080 Sum_probs=58.2
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+.+|.++|.++...|++++|+.+|+++++++|.+..++.+++.++...|........+++++....+
T Consensus 71 ~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~~~~~~~~la~i~~~~~~~~~~~g~~~~A~~~~~~ 138 (168)
T CHL00033 71 RSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMAVICHYRGEQAIEQGDSEIAEAWFDQ 138 (168)
T ss_pred hHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHhhHHHHHcccHHHHHHHHHH
Confidence 45799999999999999999999999999999999999999999999998765555556666555443
No 59
>PRK11906 transcriptional regulator; Provisional
Probab=98.25 E-value=2.9e-06 Score=74.57 Aligned_cols=74 Identities=9% Similarity=-0.077 Sum_probs=71.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.++|+|+.++..+|.++.-.|+++.|+..|++|+.++|+++.+++..|.++...|+.++|.+.+++++++.|..
T Consensus 332 eld~~Da~a~~~~g~~~~~~~~~~~a~~~f~rA~~L~Pn~A~~~~~~~~~~~~~G~~~~a~~~i~~alrLsP~~ 405 (458)
T PRK11906 332 DITTVDGKILAIMGLITGLSGQAKVSHILFEQAKIHSTDIASLYYYRALVHFHNEKIEEARICIDKSLQLEPRR 405 (458)
T ss_pred hcCCCCHHHHHHHHHHHHhhcchhhHHHHHHHHhhcCCccHHHHHHHHHHHHHcCCHHHHHHHHHHHhccCchh
Confidence 57999999999999999999999999999999999999999999999999999999999999999999988864
No 60
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=98.24 E-value=5.2e-06 Score=64.46 Aligned_cols=56 Identities=21% Similarity=0.238 Sum_probs=50.2
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYE 62 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~ 62 (222)
.+.++.++|.++..+|++++|+.+|+++++++|++..++.++|.++...++...+.
T Consensus 71 ~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~lg~~~~~~g~~~~a~ 126 (172)
T PRK02603 71 RSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALNNIAVIYHKRGEKAEEA 126 (172)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCChHhHh
Confidence 36799999999999999999999999999999999999999999998877643333
No 61
>KOG1173|consensus
Probab=98.24 E-value=2.5e-06 Score=75.97 Aligned_cols=70 Identities=14% Similarity=0.244 Sum_probs=65.7
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.....+.|||.++++++++++|+.+|+++|.+.|.++.++..+|.++..+|+.+.|.+.|.+++.+.|+.
T Consensus 453 ~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~k~~~~~asig~iy~llgnld~Aid~fhKaL~l~p~n 522 (611)
T KOG1173|consen 453 FWEPTLNNLGHAYRKLNKYEEAIDYYQKALLLSPKDASTHASIGYIYHLLGNLDKAIDHFHKALALKPDN 522 (611)
T ss_pred chhHHHHhHHHHHHHHhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHHHhcChHHHHHHHHHHHhcCCcc
Confidence 3556799999999999999999999999999999999999999999999999999999999999887763
No 62
>KOG1155|consensus
Probab=98.23 E-value=3.6e-06 Score=73.57 Aligned_cols=66 Identities=15% Similarity=0.136 Sum_probs=33.3
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
+|.+-.+|+.||++|.-++-..=|+-+|++|+++.|++...|..||.||.++++.++|+.+|..++
T Consensus 394 ~p~DyRAWYGLGQaYeim~Mh~YaLyYfqkA~~~kPnDsRlw~aLG~CY~kl~~~~eAiKCykrai 459 (559)
T KOG1155|consen 394 NPRDYRAWYGLGQAYEIMKMHFYALYYFQKALELKPNDSRLWVALGECYEKLNRLEEAIKCYKRAI 459 (559)
T ss_pred CchhHHHHhhhhHHHHHhcchHHHHHHHHHHHhcCCCchHHHHHHHHHHHHhccHHHHHHHHHHHH
Confidence 344444444444444444444444444444444444444445555555555666666666665554
No 63
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=98.22 E-value=3.6e-06 Score=80.03 Aligned_cols=73 Identities=14% Similarity=0.060 Sum_probs=62.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...|+++.++.++|.++...|++++|++.++++++++|++..+++.++.++...++|++|++.++++++..|+
T Consensus 387 ~~~P~n~~l~~~lA~l~~~~g~~~~A~~~l~~al~l~Pd~~~l~~~~a~~al~~~~~~~A~~~~~~ll~~~Pd 459 (765)
T PRK10049 387 YNAPGNQGLRIDYASVLQARGWPRAAENELKKAEVLEPRNINLEVEQAWTALDLQEWRQMDVLTDDVVAREPQ 459 (765)
T ss_pred HhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCCCChHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCCC
Confidence 3568888888888888888888888888888888888888888888888888888888888888888887765
No 64
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=98.22 E-value=7.9e-06 Score=66.30 Aligned_cols=71 Identities=15% Similarity=0.160 Sum_probs=66.5
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH---HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP---DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~---~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++..+..++.+|..+...|++++|+..|+++++.+|+++ .+++++|.++...++|++|...++++++..|+
T Consensus 29 ~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~l~~~p~ 102 (235)
T TIGR03302 29 EEWPAEELYEEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPN 102 (235)
T ss_pred ccCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCchhHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcC
Confidence 456789999999999999999999999999999999986 68899999999999999999999999998886
No 65
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=98.19 E-value=2.8e-06 Score=65.83 Aligned_cols=70 Identities=21% Similarity=0.278 Sum_probs=50.6
Q ss_pred CCCCCHHHHHHHHHhcccCCC----------cHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC----CHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGN----------IPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVC----DWTDYEARMKKL 68 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~----------~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~----~~~~a~~~~~~~ 68 (222)
++|.+++.+++.|.+|..+.+ +++|+.-|+.||.|+|+..++++++|+++...+ +-.+|...|+++
T Consensus 20 ~nP~DadnL~~WG~ALLELAqfk~g~es~~miedAisK~eeAL~I~P~~hdAlw~lGnA~ts~A~l~~d~~~A~~~F~kA 99 (186)
T PF06552_consen 20 KNPLDADNLTNWGGALLELAQFKQGPESKKMIEDAISKFEEALKINPNKHDALWCLGNAYTSLAFLTPDTAEAEEYFEKA 99 (186)
T ss_dssp H-TT-HHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHHH---HHHHHHHHHHH
T ss_pred hCcHhHHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHhhcCChHHHHHHHHHH
Confidence 589999999999999987744 577899999999999999999999999987643 333444444444
Q ss_pred HHHH
Q psy17695 69 VSIV 72 (222)
Q Consensus 69 ~~~~ 72 (222)
....
T Consensus 100 ~~~F 103 (186)
T PF06552_consen 100 TEYF 103 (186)
T ss_dssp HHHH
T ss_pred HHHH
Confidence 4433
No 66
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=98.19 E-value=7.9e-06 Score=64.55 Aligned_cols=69 Identities=17% Similarity=0.144 Sum_probs=64.8
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+..+.++|.++...|++++|+..|+++++.+|++..++..+|.++...|++++|...+++++...|+
T Consensus 29 ~~~~~~~~la~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~ 97 (234)
T TIGR02521 29 KAAKIRVQLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPN 97 (234)
T ss_pred cHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCC
Confidence 347899999999999999999999999999999999999999999999999999999999999887664
No 67
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=98.19 E-value=7.4e-06 Score=60.01 Aligned_cols=67 Identities=21% Similarity=0.145 Sum_probs=60.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|.+++++|.++..+|+.++|+..|+++++..++. ..++..+|.++..+|+.++|...+++.+...|+
T Consensus 1 ~~~~~~~A~a~d~~G~~~~Ai~~Y~~Al~~gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~p~ 70 (120)
T PF12688_consen 1 PRALYELAWAHDSLGREEEAIPLYRRALAAGLSGADRRRALIQLASTLRNLGRYDEALALLEEALEEFPD 70 (120)
T ss_pred CchHHHHHHHHHhcCCHHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCC
Confidence 4688999999999999999999999999986555 568999999999999999999999998876664
No 68
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=98.19 E-value=4.3e-06 Score=72.59 Aligned_cols=64 Identities=16% Similarity=0.206 Sum_probs=61.0
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+...|..+...|++++|+.+|++|++++|+++.+++++|.++..+|++++|...+++++.+.|+
T Consensus 5 l~~~a~~a~~~~~~~~Ai~~~~~Al~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~ 68 (356)
T PLN03088 5 LEDKAKEAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANKAIELDPS 68 (356)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcC
Confidence 5677899999999999999999999999999999999999999999999999999999998886
No 69
>KOG0543|consensus
Probab=98.18 E-value=4.1e-06 Score=72.10 Aligned_cols=67 Identities=16% Similarity=0.185 Sum_probs=64.4
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..|+||+.++.+++++.+|+....++|+++|+|+.+++..|.++..+++++.|...|++++++.|++
T Consensus 258 ~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~N 324 (397)
T KOG0543|consen 258 ACHLNLAACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQALLALGEYDLARDDFQKALKLEPSN 324 (397)
T ss_pred HHhhHHHHHHHhhhhHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhhccHHHHHHHHHHHHHhCCCc
Confidence 5789999999999999999999999999999999999999999999999999999999999988863
No 70
>KOG0553|consensus
Probab=98.18 E-value=2.2e-06 Score=71.12 Aligned_cols=68 Identities=19% Similarity=0.273 Sum_probs=63.3
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
++-+-+-|+-+.+.++|.+|+..|.+||+++|.++-.+.|.+.+|.++|.++.|++.++.++.+.|..
T Consensus 81 AE~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~y 148 (304)
T KOG0553|consen 81 AESLKNEGNKLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHY 148 (304)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcchHHHHHHHHHHHhcChHH
Confidence 45566778889999999999999999999999999999999999999999999999999999998874
No 71
>KOG1155|consensus
Probab=98.17 E-value=3.4e-06 Score=73.71 Aligned_cols=74 Identities=18% Similarity=0.207 Sum_probs=63.5
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|++||....+|..+|.=|..+.+...|+++|++|++++|.+..+|+.||..|-.++-..=|.-.|+++....|+
T Consensus 357 LkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYeim~Mh~YaLyYfqkA~~~kPn 430 (559)
T KOG1155|consen 357 LKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEIMKMHFYALYYFQKALELKPN 430 (559)
T ss_pred HhcCcchhHHHHHhhHHHHHhcccHHHHHHHHHHHhcCchhHHHHhhhhHHHHHhcchHHHHHHHHHHHhcCCC
Confidence 46788888999999999999999999999999999999999999999998888888777788888888877665
No 72
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=98.14 E-value=8.3e-06 Score=70.02 Aligned_cols=71 Identities=21% Similarity=0.179 Sum_probs=65.6
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.+|+...++..+|.++..+|++++|+..++++++++|+++.++..++.++...|++++|++.+++++...+
T Consensus 109 ~~~~~~~~~~~~a~~~~~~G~~~~A~~~~~~al~~~p~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~ 179 (355)
T cd05804 109 ENPDYWYLLGMLAFGLEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWD 179 (355)
T ss_pred CCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhccC
Confidence 46777888889999999999999999999999999999999999999999999999999999998887655
No 73
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=98.13 E-value=9.3e-06 Score=73.73 Aligned_cols=71 Identities=13% Similarity=-0.007 Sum_probs=67.0
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+|.++.+|..+|..+...|++++|...|++|++++| +..++..+|.++...|+.++|.+.|+++.++.|.
T Consensus 415 ~~~~~~~~~~ala~~~~~~g~~~~A~~~l~rAl~L~p-s~~a~~~lG~~~~~~G~~~eA~~~~~~A~~L~P~ 485 (517)
T PRK10153 415 ELNVLPRIYEILAVQALVKGKTDEAYQAINKAIDLEM-SWLNYVLLGKVYELKGDNRLAADAYSTAFNLRPG 485 (517)
T ss_pred cCcCChHHHHHHHHHHHhcCCHHHHHHHHHHHHHcCC-CHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCC
Confidence 4778889999999999999999999999999999999 5899999999999999999999999999988885
No 74
>KOG0548|consensus
Probab=98.13 E-value=4e-06 Score=74.20 Aligned_cols=73 Identities=15% Similarity=0.251 Sum_probs=39.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+|+++.++.|.+.+|.++|.+.+|+...+++++++|++..++..-|.++..+.+|+.|.+.|++++...|+
T Consensus 386 kr~P~Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~~~kgy~RKg~al~~mk~ydkAleay~eale~dp~ 458 (539)
T KOG0548|consen 386 KRDPEDARLYSNRAACYLKLGEYPEALKDAKKCIELDPNFIKAYLRKGAALRAMKEYDKALEAYQEALELDPS 458 (539)
T ss_pred hcCCchhHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCch
Confidence 3455555555555555555555555555555555555555555555555555555555555555555544443
No 75
>PLN02789 farnesyltranstransferase
Probab=98.11 E-value=8.1e-06 Score=69.78 Aligned_cols=73 Identities=14% Similarity=0.051 Sum_probs=56.7
Q ss_pred CCCCCCHHHHHHHHHhcccCCCc--HHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNI--PEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~--~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|.+..+|++.+.++..+|+. ++++.+++++++++|++..+|.+++.++..+++|+++.+.+.++++..+.
T Consensus 100 ~~npknyqaW~~R~~~l~~l~~~~~~~el~~~~kal~~dpkNy~AW~~R~w~l~~l~~~~eeL~~~~~~I~~d~~ 174 (320)
T PLN02789 100 EDNPKNYQIWHHRRWLAEKLGPDAANKELEFTRKILSLDAKNYHAWSHRQWVLRTLGGWEDELEYCHQLLEEDVR 174 (320)
T ss_pred HHCCcchHHhHHHHHHHHHcCchhhHHHHHHHHHHHHhCcccHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHCCC
Confidence 35777888888888777777763 66777778888888888888888888888888888888888888777664
No 76
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=98.11 E-value=5.8e-06 Score=67.06 Aligned_cols=73 Identities=19% Similarity=0.218 Sum_probs=63.9
Q ss_pred CCCCCCH---HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHH---HHHHHHHHHHHc--------CCHHHHHHHHHH
Q psy17695 2 MKTPFFK---VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPD---AYCNLAHCLQIV--------CDWTDYEARMKK 67 (222)
Q Consensus 2 ~~~P~~~---~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~---a~~~l~~~~~~~--------~~~~~a~~~~~~ 67 (222)
..+|+++ .+++.+|.++...|++++|+..|+++++..|+++. +++.+|.++... +++++|...+++
T Consensus 61 ~~~p~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~a~~~~g~~~~~~~~~~~~~~~~~~~A~~~~~~ 140 (235)
T TIGR03302 61 SRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYNQIDRVDRDQTAAREAFEAFQE 140 (235)
T ss_pred HhCCCchhHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCCCchHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHH
Confidence 3456655 68899999999999999999999999999998886 799999999876 788899999999
Q ss_pred HHHHHHh
Q psy17695 68 LVSIVAE 74 (222)
Q Consensus 68 ~~~~~~~ 74 (222)
++...|+
T Consensus 141 ~~~~~p~ 147 (235)
T TIGR03302 141 LIRRYPN 147 (235)
T ss_pred HHHHCCC
Confidence 9887775
No 77
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=98.09 E-value=9.7e-06 Score=67.71 Aligned_cols=70 Identities=29% Similarity=0.365 Sum_probs=60.5
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|+++..|..+|.++.+.|+.++|+.+|++|++++|+++.++..++.++...|+.+++.+.+.......|.
T Consensus 143 ~~~~~~~~~~a~~~~~~G~~~~A~~~~~~al~~~P~~~~~~~~l~~~li~~~~~~~~~~~l~~~~~~~~~ 212 (280)
T PF13429_consen 143 PDSARFWLALAEIYEQLGDPDKALRDYRKALELDPDDPDARNALAWLLIDMGDYDEAREALKRLLKAAPD 212 (280)
T ss_dssp -T-HHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHCTTCHHHHHHHHHHHHHHH-HT
T ss_pred CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCChHHHHHHHHHHHHHCcC
Confidence 6789999999999999999999999999999999999999999999999999999988888777776654
No 78
>KOG0547|consensus
Probab=98.08 E-value=6.1e-06 Score=72.58 Aligned_cols=73 Identities=15% Similarity=0.066 Sum_probs=56.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+||+++..++..|.++.-++++++|+.-|++|++|+|+++.++..++.++.+++.+++.+..|+++....|+
T Consensus 388 ~ldp~n~dvYyHRgQm~flL~q~e~A~aDF~Kai~L~pe~~~~~iQl~~a~Yr~~k~~~~m~~Fee~kkkFP~ 460 (606)
T KOG0547|consen 388 DLDPENPDVYYHRGQMRFLLQQYEEAIADFQKAISLDPENAYAYIQLCCALYRQHKIAESMKTFEEAKKKFPN 460 (606)
T ss_pred hcCCCCCchhHhHHHHHHHHHHHHHHHHHHHHHhhcChhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC
Confidence 3677788888888888888888888888888888888888888888877777777777777777777776665
No 79
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=98.07 E-value=1.1e-05 Score=76.64 Aligned_cols=72 Identities=14% Similarity=0.007 Sum_probs=57.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.++..+|.++...|++++|+..++++++.+|+++. +..++.++...|+.++|+..++++++..|+
T Consensus 77 ~~~P~~~~a~~~la~~l~~~g~~~eA~~~l~~~l~~~P~~~~-~~~la~~l~~~g~~~~Al~~l~~al~~~P~ 148 (765)
T PRK10049 77 SLEPQNDDYQRGLILTLADAGQYDEALVKAKQLVSGAPDKAN-LLALAYVYKRAGRHWDELRAMTQALPRAPQ 148 (765)
T ss_pred HhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHH-HHHHHHHHHHCCCHHHHHHHHHHHHHhCCC
Confidence 456778888888888888888888888888888888888888 888888888888888888888888777665
No 80
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=98.06 E-value=8.2e-06 Score=80.90 Aligned_cols=71 Identities=20% Similarity=0.116 Sum_probs=64.0
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.+|+++..+.++|.++.+.|++++|+.+|+++++++|+++.++.+++.++...|++++|++.+++++...|
T Consensus 598 ~~p~~~~~~~~La~~~~~~g~~~~A~~~y~~al~~~P~~~~a~~~la~~~~~~g~~~eA~~~l~~ll~~~p 668 (1157)
T PRK11447 598 QQPPSTRIDLTLADWAQQRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARAQLAKLPATAN 668 (1157)
T ss_pred hCCCCchHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHhccCC
Confidence 46888899999999999999999999999999999999999999999999999999999999988776544
No 81
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=98.05 E-value=1.7e-06 Score=56.55 Aligned_cols=38 Identities=26% Similarity=0.322 Sum_probs=36.2
Q ss_pred CCCCCCHHHHHHHHHhcccCC-CcHHHHHHHHHHHHhCC
Q psy17695 2 MKTPFFKVGILFSNSSFPSPG-NIPEAIQSYRTALKLKP 39 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g-~~~eA~~~~~~Al~l~P 39 (222)
..+|+++.+|+++|.++..+| ++++|+.+|+++++++|
T Consensus 31 ~~~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~l~P 69 (69)
T PF13414_consen 31 ELDPNNAEAYYNLGLAYMKLGKDYEEAIEDFEKALKLDP 69 (69)
T ss_dssp HHSTTHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHST
T ss_pred HcCCCCHHHHHHHHHHHHHhCccHHHHHHHHHHHHHcCc
Confidence 468999999999999999999 79999999999999998
No 82
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.05 E-value=7e-06 Score=54.14 Aligned_cols=49 Identities=29% Similarity=0.223 Sum_probs=44.8
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLA 49 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~ 49 (222)
+..+|+++..|..+|.++..+|++++|+..|+++++++|+++.+...+.
T Consensus 22 l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~~~a 70 (73)
T PF13371_consen 22 LELDPDDPELWLQRARCLFQLGRYEEALEDLERALELSPDDPDARALRA 70 (73)
T ss_pred HHhCcccchhhHHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHHHHH
Confidence 3578999999999999999999999999999999999999998876554
No 83
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=98.02 E-value=1.7e-05 Score=74.87 Aligned_cols=72 Identities=24% Similarity=0.196 Sum_probs=59.3
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.++.++|.++...|+ ++|+..+++++++.|+++..+.++|.++...|++++|...++++++..|.
T Consensus 798 ~~~p~~~~~~~~l~~~~~~~~~-~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~ 869 (899)
T TIGR02917 798 KKAPDNAVVLNNLAWLYLELKD-PRALEYAEKALKLAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPE 869 (899)
T ss_pred HhCCCCHHHHHHHHHHHHhcCc-HHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCC
Confidence 3568888888888888888888 77888888888888888888888888888888888888888888877664
No 84
>KOG0547|consensus
Probab=97.99 E-value=1.3e-05 Score=70.58 Aligned_cols=68 Identities=25% Similarity=0.383 Sum_probs=64.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
+..+-+.|+-+...|+|++||++|..|+++.|+.+..+.|++.||-.+|+|++..+...+++++.|+.
T Consensus 115 A~~lK~~GN~~f~~kkY~eAIkyY~~AI~l~p~epiFYsNraAcY~~lgd~~~Vied~TkALEl~P~Y 182 (606)
T KOG0547|consen 115 AAALKTKGNKFFRNKKYDEAIKYYTQAIELCPDEPIFYSNRAACYESLGDWEKVIEDCTKALELNPDY 182 (606)
T ss_pred HHHHHhhhhhhhhcccHHHHHHHHHHHHhcCCCCchhhhhHHHHHHHHhhHHHHHHHHHHHhhcCcHH
Confidence 56677899999999999999999999999999999999999999999999999999999999988864
No 85
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=97.98 E-value=8.3e-06 Score=45.59 Aligned_cols=34 Identities=24% Similarity=0.363 Sum_probs=30.3
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF 41 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~ 41 (222)
+++|+.+|.++..+|++++|+.+|+++++++|++
T Consensus 1 a~~~~~lg~~y~~~~~~~~A~~~~~~a~~~~~~n 34 (34)
T PF13181_consen 1 AEAYYNLGKIYEQLGDYEEALEYFEKALELNPDN 34 (34)
T ss_dssp -HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHTT-
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 4689999999999999999999999999999964
No 86
>PLN02789 farnesyltranstransferase
Probab=97.97 E-value=1.9e-05 Score=67.46 Aligned_cols=74 Identities=4% Similarity=-0.114 Sum_probs=63.9
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHc---CCH----HHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIV---CDW----TDYEARMKKLVSIVA 73 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~---~~~----~~a~~~~~~~~~~~~ 73 (222)
+..+|.|..+|.+.|.++..+|++++|++++.++|+++|++..+|++++.++... +.. ++..++..+++...|
T Consensus 135 l~~dpkNy~AW~~R~w~l~~l~~~~eeL~~~~~~I~~d~~N~sAW~~R~~vl~~~~~l~~~~~~~e~el~y~~~aI~~~P 214 (320)
T PLN02789 135 LSLDAKNYHAWSHRQWVLRTLGGWEDELEYCHQLLEEDVRNNSAWNQRYFVITRSPLLGGLEAMRDSELKYTIDAILANP 214 (320)
T ss_pred HHhCcccHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHCCCchhHHHHHHHHHHhccccccccccHHHHHHHHHHHHHhCC
Confidence 3578999999999999999999999999999999999999999999999998776 333 345666667777777
Q ss_pred h
Q psy17695 74 E 74 (222)
Q Consensus 74 ~ 74 (222)
+
T Consensus 215 ~ 215 (320)
T PLN02789 215 R 215 (320)
T ss_pred C
Confidence 6
No 87
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=97.97 E-value=3.8e-06 Score=70.18 Aligned_cols=73 Identities=18% Similarity=0.126 Sum_probs=51.8
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.++..++.++.+.|+.+++...++...+..|+++..+..+|.++..+|+.++|...++++.+..|+
T Consensus 174 ~~~P~~~~~~~~l~~~li~~~~~~~~~~~l~~~~~~~~~~~~~~~~la~~~~~lg~~~~Al~~~~~~~~~~p~ 246 (280)
T PF13429_consen 174 ELDPDDPDARNALAWLLIDMGDYDEAREALKRLLKAAPDDPDLWDALAAAYLQLGRYEEALEYLEKALKLNPD 246 (280)
T ss_dssp HH-TT-HHHHHHHHHHHCTTCHHHHHHHHHHHHHHH-HTSCCHCHHHHHHHHHHT-HHHHHHHHHHHHHHSTT
T ss_pred HcCCCCHHHHHHHHHHHHHCCChHHHHHHHHHHHHHCcCHHHHHHHHHHHhcccccccccccccccccccccc
Confidence 3567777777777777777777777777777777777777777777777777777777777777777776664
No 88
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=97.95 E-value=2.2e-05 Score=77.88 Aligned_cols=73 Identities=11% Similarity=-0.005 Sum_probs=68.7
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|+++.++.++|.+|...|++++|++.|+++++.+|+++.++..++.++...|++++|.+.+++++...++
T Consensus 631 ~~~P~~~~a~~~la~~~~~~g~~~eA~~~l~~ll~~~p~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~~~ 703 (1157)
T PRK11447 631 TREPGNADARLGLIEVDIAQGDLAAARAQLAKLPATANDSLNTQRRVALAWAALGDTAAAQRTFNRLIPQAKS 703 (1157)
T ss_pred HhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHhccCCCChHHHHHHHHHHHhCCCHHHHHHHHHHHhhhCcc
Confidence 4689999999999999999999999999999999999999999999999999999999999999999876553
No 89
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=97.94 E-value=2.6e-05 Score=74.33 Aligned_cols=68 Identities=13% Similarity=0.049 Sum_probs=63.4
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
.+++-.+++.||.+|.++|+.++|.+.|+++++++|+++.++.|+|..+... +.++|++++.+++...
T Consensus 112 ~~~~k~Al~~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~AY~~ae~-dL~KA~~m~~KAV~~~ 179 (906)
T PRK14720 112 YGENKLALRTLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKKLATSYEEE-DKEKAITYLKKAIYRF 179 (906)
T ss_pred hhhhhHHHHHHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHHh-hHHHHHHHHHHHHHHH
Confidence 3566689999999999999999999999999999999999999999999999 9999999999998754
No 90
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=97.93 E-value=3.8e-05 Score=67.65 Aligned_cols=72 Identities=21% Similarity=0.152 Sum_probs=63.8
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
++..|+++..+..+|.++...|++++|..+|+++++++|++.. +..++.++...|+.++|..+|++.+.+..
T Consensus 321 lk~~P~~~~l~l~lgrl~~~~~~~~~A~~~le~al~~~P~~~~-~~~La~~~~~~g~~~~A~~~~~~~l~~~~ 392 (398)
T PRK10747 321 IKQHGDTPLLWSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD-YAWLADALDRLHKPEEAAAMRRDGLMLTL 392 (398)
T ss_pred HhhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHH-HHHHHHHHHHcCCHHHHHHHHHHHHhhhc
Confidence 3578999999999999999999999999999999999999655 56799999999999999999999887654
No 91
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=97.91 E-value=4e-05 Score=67.66 Aligned_cols=70 Identities=14% Similarity=0.045 Sum_probs=57.7
Q ss_pred CCCCCCH--HHHHHHHHhcccCCCcHHHHHHHH--HHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFK--VGILFSNSSFPSPGNIPEAIQSYR--TALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 2 ~~~P~~~--~~~~~lg~~l~~~g~~~eA~~~~~--~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
+.+|+++ ..+..+|.++.+.|++++|.++|+ ++++.+|++.. +..+|.++...|+.++|.+.|++++...
T Consensus 327 k~~p~~~~~~ll~sLg~l~~~~~~~~~A~~~le~a~a~~~~p~~~~-~~~La~ll~~~g~~~~A~~~~~~~l~~~ 400 (409)
T TIGR00540 327 KNVDDKPKCCINRALGQLLMKHGEFIEAADAFKNVAACKEQLDAND-LAMAADAFDQAGDKAEAAAMRQDSLGLM 400 (409)
T ss_pred HhCCCChhHHHHHHHHHHHHHcccHHHHHHHHHHhHHhhcCCCHHH-HHHHHHHHHHcCCHHHHHHHHHHHHHHH
Confidence 5678888 888889999999999999999998 57778888766 4488888899999999988888876543
No 92
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=97.91 E-value=3.5e-05 Score=72.73 Aligned_cols=72 Identities=18% Similarity=0.292 Sum_probs=61.5
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..|.++..|..+|.++...|++++|+.+|+++++.+|+++.++..++.++...|++++|...+++++...|+
T Consensus 596 ~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~ 667 (899)
T TIGR02917 596 AAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLALQPDSALALLLLADAYAVMKNYAKAITSLKRALELKPD 667 (899)
T ss_pred cCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCC
Confidence 467888888888888888888888888888888888888888888888888888888888888888776654
No 93
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=97.88 E-value=4.3e-05 Score=66.36 Aligned_cols=66 Identities=14% Similarity=0.027 Sum_probs=52.6
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+.++|.++.+.|++++|+..|+++++.+|++..++..+|.++...|++++|.+.++++....|.
T Consensus 181 ~~~~~la~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~p~ 246 (389)
T PRK11788 181 HFYCELAQQALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQDPE 246 (389)
T ss_pred HHHHHHHHHHHhCCCHHHHHHHHHHHHhHCcCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHChh
Confidence 356678888888888888888888888888888888888888888888888888888887765543
No 94
>KOG0548|consensus
Probab=97.86 E-value=4.3e-05 Score=67.81 Aligned_cols=73 Identities=21% Similarity=0.255 Sum_probs=67.6
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.+|+-+.--..-|+.++..|+|.+|+..|.+|++.+|+++.+++|++.|+..++++..|....++++++.|+.
T Consensus 353 ~~pe~A~e~r~kGne~Fk~gdy~~Av~~YteAIkr~P~Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~~ 425 (539)
T KOG0548|consen 353 INPEKAEEEREKGNEAFKKGDYPEAVKHYTEAIKRDPEDARLYSNRAACYLKLGEYPEALKDAKKCIELDPNF 425 (539)
T ss_pred hChhHHHHHHHHHHHHHhccCHHHHHHHHHHHHhcCCchhHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCchH
Confidence 4667777778889999999999999999999999999999999999999999999999999999999987764
No 95
>KOG2076|consensus
Probab=97.85 E-value=4e-05 Score=71.60 Aligned_cols=73 Identities=12% Similarity=0.139 Sum_probs=52.8
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+|.++.+|..||.+|.++|+.+.+..++-.|--++|++.+.|..++....++|+|++|.-+|.++++..|.
T Consensus 167 kqdp~~~~ay~tL~~IyEqrGd~eK~l~~~llAAHL~p~d~e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~ 239 (895)
T KOG2076|consen 167 KQDPRNPIAYYTLGEIYEQRGDIEKALNFWLLAAHLNPKDYELWKRLADLSEQLGNINQARYCYSRAIQANPS 239 (895)
T ss_pred HhCccchhhHHHHHHHHHHcccHHHHHHHHHHHHhcCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHhcCCc
Confidence 4567777777777777777777777777777777777777777777777777777777777777777766654
No 96
>KOG1129|consensus
Probab=97.85 E-value=3.3e-05 Score=65.17 Aligned_cols=70 Identities=16% Similarity=0.026 Sum_probs=65.5
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.-++.|+|||.+....|++.-|..||+-||..||++.++++||+......|+.++|...+..+....|..
T Consensus 393 ~aaDvWYNlg~vaV~iGD~nlA~rcfrlaL~~d~~h~ealnNLavL~~r~G~i~~Arsll~~A~s~~P~m 462 (478)
T KOG1129|consen 393 QAADVWYNLGFVAVTIGDFNLAKRCFRLALTSDAQHGEALNNLAVLAARSGDILGARSLLNAAKSVMPDM 462 (478)
T ss_pred hhhhhhhccceeEEeccchHHHHHHHHHHhccCcchHHHHHhHHHHHhhcCchHHHHHHHHHhhhhCccc
Confidence 3478999999999999999999999999999999999999999999999999999999999998888764
No 97
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=97.83 E-value=5.4e-05 Score=66.61 Aligned_cols=73 Identities=19% Similarity=0.126 Sum_probs=66.7
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+..|+|+-.+...|.++...|+.++|++.+++++.++|+.+-.+.|+|..+++.|+..+|+..++......|+
T Consensus 334 ~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~all~~g~~~eai~~L~~~~~~~p~ 406 (484)
T COG4783 334 AAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPE 406 (484)
T ss_pred HhCCCCHHHHHHHHHHHHHcCChHHHHHHHHHHHhcCCCccHHHHHHHHHHHhcCChHHHHHHHHHHhhcCCC
Confidence 4679999999999999999999999999999999999999999999999999999999888877777766665
No 98
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=97.82 E-value=2.4e-05 Score=50.77 Aligned_cols=49 Identities=24% Similarity=0.247 Sum_probs=43.8
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAH 50 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~ 50 (222)
..+|++..+++.+|.++.+.|++++|...+++++..+|+++..+..++.
T Consensus 19 ~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~~~l~a~ 67 (68)
T PF14559_consen 19 QRNPDNPEARLLLAQCYLKQGQYDEAEELLERLLKQDPDNPEYQQLLAQ 67 (68)
T ss_dssp HHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHHHHHHHH
T ss_pred HHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHHHHHHhc
Confidence 3579999999999999999999999999999999999999888776654
No 99
>KOG0624|consensus
Probab=97.81 E-value=2.6e-05 Score=66.19 Aligned_cols=73 Identities=21% Similarity=0.207 Sum_probs=66.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|++-.+++..|.+|..+|+-..|+.-+.+.|++.|++..+....|++++++|.+++|++.|++++.-.|+
T Consensus 66 e~dp~~Y~aifrRaT~yLAmGksk~al~Dl~rVlelKpDF~~ARiQRg~vllK~Gele~A~~DF~~vl~~~~s 138 (504)
T KOG0624|consen 66 EGDPNNYQAIFRRATVYLAMGKSKAALQDLSRVLELKPDFMAARIQRGVVLLKQGELEQAEADFDQVLQHEPS 138 (504)
T ss_pred cCCchhHHHHHHHHHHHhhhcCCccchhhHHHHHhcCccHHHHHHHhchhhhhcccHHHHHHHHHHHHhcCCC
Confidence 4689999999999999999999999999999999999999999999999999999999999999998876553
No 100
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=97.81 E-value=7.9e-06 Score=46.08 Aligned_cols=29 Identities=17% Similarity=0.106 Sum_probs=26.6
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQ 29 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~ 29 (222)
|.++|+++.+|++||.+|...|++++|++
T Consensus 6 ie~~P~n~~a~~nla~~~~~~g~~~~A~~ 34 (34)
T PF13431_consen 6 IELNPNNAEAYNNLANLYLNQGDYEEAIA 34 (34)
T ss_pred HHHCCCCHHHHHHHHHHHHHCcCHHhhcC
Confidence 35799999999999999999999999973
No 101
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=97.79 E-value=1.9e-05 Score=63.09 Aligned_cols=73 Identities=14% Similarity=0.132 Sum_probs=58.9
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
|++.|+.|++++.||.-+...|+++.|.+.|...+++||.+.-++.|.|..+...|++.-|...+.+.-+..|
T Consensus 92 Lai~P~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya~lNRgi~~YY~gR~~LAq~d~~~fYQ~D~ 164 (297)
T COG4785 92 LAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRYKLAQDDLLAFYQDDP 164 (297)
T ss_pred hhcCCCcHHHHHHHHHHHHhcccchHHHHHhhhHhccCCcchHHHhccceeeeecCchHhhHHHHHHHHhcCC
Confidence 4678888888888888888888888888888888888888888888888888888888888776665554444
No 102
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=97.78 E-value=9.2e-05 Score=61.69 Aligned_cols=73 Identities=18% Similarity=0.046 Sum_probs=66.7
Q ss_pred CCCCCCHHHHHHHHHhcccCC---CcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPG---NIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g---~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+.|++++.+..+|.++.-+. ...++.+.|++++++||++..+++.|+..+.+.|++.+|...++.++...|.
T Consensus 184 rL~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll~~al~~D~~~iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~ 259 (287)
T COG4235 184 RLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPA 259 (287)
T ss_pred HhCCCCHHHHHHHHHHHHHhcCCcccHHHHHHHHHHHhcCCccHHHHHHHHHHHHHcccHHHHHHHHHHHHhcCCC
Confidence 478999999999999998544 4789999999999999999999999999999999999999999999877774
No 103
>KOG1173|consensus
Probab=97.75 E-value=5e-05 Score=67.85 Aligned_cols=72 Identities=15% Similarity=0.156 Sum_probs=68.6
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..|+.+..|+..|.-|...|++++|..+|-+|..+||.+..+|...|+.+...+.-++|..+|..+-++.+.
T Consensus 307 ~yP~~a~sW~aVg~YYl~i~k~seARry~SKat~lD~~fgpaWl~fghsfa~e~EhdQAmaaY~tAarl~~G 378 (611)
T KOG1173|consen 307 LYPSKALSWFAVGCYYLMIGKYSEARRYFSKATTLDPTFGPAWLAFGHSFAGEGEHDQAMAAYFTAARLMPG 378 (611)
T ss_pred hCCCCCcchhhHHHHHHHhcCcHHHHHHHHHHhhcCccccHHHHHHhHHhhhcchHHHHHHHHHHHHHhccC
Confidence 579999999999999999999999999999999999999999999999999999999999999999988775
No 104
>PRK15331 chaperone protein SicA; Provisional
Probab=97.74 E-value=7.7e-05 Score=57.21 Aligned_cols=70 Identities=16% Similarity=0.090 Sum_probs=64.1
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++.-+..+..|.-+...|++++|...|+-...++|.+++.+..||.|++..++|++|+..|..+..+.++
T Consensus 34 ~~~le~iY~~Ay~~y~~Gk~~eA~~~F~~L~~~d~~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~~~ 103 (165)
T PRK15331 34 QDMMDGLYAHAYEFYNQGRLDEAETFFRFLCIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKN 103 (165)
T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccC
Confidence 4456778899999999999999999999999999999999999999999999999999999888877665
No 105
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=97.74 E-value=0.00015 Score=62.96 Aligned_cols=72 Identities=19% Similarity=0.145 Sum_probs=57.2
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH-----HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP-----DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~-----~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+|.+..++..++.++...|++++|++.|+++++.+|.+. ..+.+++.++...+++++|...++++++..|+
T Consensus 136 ~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~p~ 212 (389)
T PRK11788 136 EGDFAEGALQQLLEIYQQEKDWQKAIDVAERLEKLGGDSLRVEIAHFYCELAQQALARGDLDAARALLKKALAADPQ 212 (389)
T ss_pred CCcchHHHHHHHHHHHHHhchHHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHhHCcC
Confidence 4677788888888888888888888888888888887753 25667888888888888888888888776654
No 106
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=97.74 E-value=8.1e-05 Score=63.87 Aligned_cols=69 Identities=12% Similarity=-0.034 Sum_probs=61.4
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH----HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP----DAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~----~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
..+|+++.++..+|.++...|++++|+.+++++++..|.++ ..+..++.++...|++++|...+++++.
T Consensus 142 ~~~p~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~~~~~~~~~~~~~la~~~~~~G~~~~A~~~~~~~~~ 214 (355)
T cd05804 142 ELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLERGDYEAALAIYDTHIA 214 (355)
T ss_pred hhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhccCCCcchhHHHHHHHHHHHHHCCCHHHHHHHHHHHhc
Confidence 56899999999999999999999999999999999987543 3466899999999999999999988853
No 107
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=97.74 E-value=1.5e-05 Score=45.45 Aligned_cols=29 Identities=24% Similarity=0.352 Sum_probs=22.9
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhC
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLK 38 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~ 38 (222)
++.+||.++.++|++++|+.+|+++|.+.
T Consensus 1 al~~Lg~~~~~~g~~~~Ai~~y~~aL~l~ 29 (36)
T PF13176_consen 1 ALNNLGRIYRQQGDYEKAIEYYEQALALA 29 (36)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHHHHhc
Confidence 57788999999999999999999966553
No 108
>KOG4642|consensus
Probab=97.74 E-value=7.6e-05 Score=60.36 Aligned_cols=74 Identities=16% Similarity=0.110 Sum_probs=61.1
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.++|..+..|.|-+.++.++.+|+.+....++|++++|+.+.+++.+|.+++....+.+++..++++..+..++
T Consensus 38 ~~nP~~~~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk~h~flg~~~l~s~~~~eaI~~Lqra~sl~r~~ 111 (284)
T KOG4642|consen 38 CINPTVASYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVKAHYFLGQWLLQSKGYDEAIKVLQRAYSLLREQ 111 (284)
T ss_pred hcCCCcchhhhhHHHHHHHhhhhhhhhhhHHHHHhcChHHHHHHHHHHHHHHhhccccHHHHHHHHHHHHHhcC
Confidence 46788888888888888888888888888888888888888888888888888888888888888887777653
No 109
>KOG1156|consensus
Probab=97.72 E-value=8.7e-05 Score=67.31 Aligned_cols=75 Identities=21% Similarity=0.176 Sum_probs=66.1
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
+.+++.....|..+|.+++...+|+||+.||+.|++++|+|-..+..++....++++++.....-.+.+++.|.+
T Consensus 68 lr~d~~S~vCwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmRd~~~~~~tr~~LLql~~~~ 142 (700)
T KOG1156|consen 68 LRNDLKSHVCWHVLGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMRDYEGYLETRNQLLQLRPSQ 142 (700)
T ss_pred hccCcccchhHHHHHHHHhhhhhHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhh
Confidence 457888999999999999999999999999999999999999999999999999999888777666666665543
No 110
>KOG4648|consensus
Probab=97.71 E-value=3.8e-05 Score=65.19 Aligned_cols=63 Identities=19% Similarity=0.185 Sum_probs=57.2
Q ss_pred HHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 12 LFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 12 ~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-..|+-|+++|+|+|||+||.+++.++|.++-.+.|.+.+|..+..+..|+..+..++.+...
T Consensus 101 KE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~ 163 (536)
T KOG4648|consen 101 KERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKL 163 (536)
T ss_pred HHhhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHhhHH
Confidence 456899999999999999999999999999999999999999999999999888888877654
No 111
>KOG0550|consensus
Probab=97.70 E-value=7.9e-05 Score=64.50 Aligned_cols=72 Identities=18% Similarity=0.158 Sum_probs=63.6
Q ss_pred CCCCC----CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 2 MKTPF----FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 2 ~~~P~----~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.++|+ ++..+.|.+.+..++|+..||+...++|++|+|....++...|.|+..+++|++|.+.|+++.+...
T Consensus 277 ~idP~n~~~naklY~nra~v~~rLgrl~eaisdc~~Al~iD~syikall~ra~c~l~le~~e~AV~d~~~a~q~~~ 352 (486)
T KOG0550|consen 277 NIDPSNKKTNAKLYGNRALVNIRLGRLREAISDCNEALKIDSSYIKALLRRANCHLALEKWEEAVEDYEKAMQLEK 352 (486)
T ss_pred cCCccccchhHHHHHHhHhhhcccCCchhhhhhhhhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 45555 4677889999999999999999999999999999999999999999999999999999999886543
No 112
>KOG2076|consensus
Probab=97.70 E-value=0.00014 Score=68.15 Aligned_cols=71 Identities=15% Similarity=0.022 Sum_probs=69.0
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
++|.+.+.|..++....++|++.+|+-||.+|++.+|.+....++.+..+.++|+...|...|.++..+.|
T Consensus 202 L~p~d~e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~~ers~L~~~~G~~~~Am~~f~~l~~~~p 272 (895)
T KOG2076|consen 202 LNPKDYELWKRLADLSEQLGNINQARYCYSRAIQANPSNWELIYERSSLYQKTGDLKRAMETFLQLLQLDP 272 (895)
T ss_pred cCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHhChHHHHHHHHHHHHhhCC
Confidence 68999999999999999999999999999999999999999999999999999999999999999998887
No 113
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=97.68 E-value=0.00029 Score=51.59 Aligned_cols=64 Identities=17% Similarity=0.065 Sum_probs=57.0
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC---CHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD---FPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~---~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
.-..++..+|.++..+|++++|+..+++++.-.|+ +......++.++...|++++|.+.+-.++
T Consensus 36 ~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~p~~~~~~~l~~f~Al~L~~~gr~~eAl~~~l~~l 102 (120)
T PF12688_consen 36 DRRRALIQLASTLRNLGRYDEALALLEEALEEFPDDELNAALRVFLALALYNLGRPKEALEWLLEAL 102 (120)
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 34678999999999999999999999999999899 88899999999999999999987664443
No 114
>KOG4555|consensus
Probab=97.66 E-value=0.00016 Score=53.46 Aligned_cols=65 Identities=20% Similarity=0.341 Sum_probs=59.6
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+-.-|+++...|+.++|++.|.+++.+-|..+.+++|.+..++.+|+.++|...+.+++++...
T Consensus 45 ~LEl~~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~ 109 (175)
T KOG4555|consen 45 ELELKAIALAEAGDLDGALELFGQALCLAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGD 109 (175)
T ss_pred HHHHHHHHHHhccchHHHHHHHHHHHHhcccchHhhccHHHHHHHcCChHHHHHHHHHHHHhcCc
Confidence 44567899999999999999999999999999999999999999999999999999999987654
No 115
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=97.65 E-value=0.00046 Score=51.82 Aligned_cols=69 Identities=19% Similarity=0.165 Sum_probs=60.6
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQL 76 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l 76 (222)
..+...++..+...|++++|+..+++++.++|-+..++..+..++...|+..+|...|+++.+...+.+
T Consensus 62 ~~~~~~l~~~~~~~~~~~~a~~~~~~~l~~dP~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~~~~l~~el 130 (146)
T PF03704_consen 62 LDALERLAEALLEAGDYEEALRLLQRALALDPYDEEAYRLLMRALAAQGRRAEALRVYERYRRRLREEL 130 (146)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhccCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHCcCHHHHHHHHHHHHHHHHHHh
Confidence 356777888899999999999999999999999999999999999999999999999999998888643
No 116
>KOG1128|consensus
Probab=97.60 E-value=0.0001 Score=67.71 Aligned_cols=65 Identities=12% Similarity=0.115 Sum_probs=51.9
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARM 65 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~ 65 (222)
|..+|-....|+++|.+..+++++..|.++|.+++.++|++.++|+|++..+...++-.+|-..+
T Consensus 512 l~~nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~eaWnNls~ayi~~~~k~ra~~~l 576 (777)
T KOG1128|consen 512 LEINPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEAWNNLSTAYIRLKKKKRAFRKL 576 (777)
T ss_pred hhcCccchhHHHhccHHHHHHhhhHHHHHHHHHHhhcCCCchhhhhhhhHHHHHHhhhHHHHHHH
Confidence 45788888999999999999999999999999999999999999999988776544443333333
No 117
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=97.59 E-value=0.00043 Score=57.61 Aligned_cols=68 Identities=15% Similarity=0.042 Sum_probs=61.6
Q ss_pred CHHHHHHHHHhc-ccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSF-PSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l-~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
....++..+..+ .+.|++++|+..|++.++..|++ +.+++.+|.++...|++++|...|++++...|+
T Consensus 141 ~e~~~Y~~A~~l~~~~~~y~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~yP~ 212 (263)
T PRK10803 141 DANTDYNAAIALVQDKSRQDDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPK 212 (263)
T ss_pred CHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCC
Confidence 457788888876 66799999999999999999998 589999999999999999999999999988886
No 118
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=97.53 E-value=0.0003 Score=61.72 Aligned_cols=65 Identities=15% Similarity=0.183 Sum_probs=59.6
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMK 66 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~ 66 (222)
..+|.+.+.+...+..|.+.++++.|+++.++|+++.|++-..|+.|+.+|..+|+|++|...+.
T Consensus 228 ~~~p~d~~LL~~Qa~fLl~k~~~~lAL~iAk~av~lsP~~f~~W~~La~~Yi~~~d~e~ALlaLN 292 (395)
T PF09295_consen 228 KENPQDSELLNLQAEFLLSKKKYELALEIAKKAVELSPSEFETWYQLAECYIQLGDFENALLALN 292 (395)
T ss_pred HhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCchhHHHHHHHHHHHHhcCCHHHHHHHHh
Confidence 45788999999999999999999999999999999999999999999999999999999886443
No 119
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=97.53 E-value=0.00019 Score=60.55 Aligned_cols=71 Identities=18% Similarity=0.070 Sum_probs=57.1
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCH-HHHHHHHHHHHHHHHh
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDW-TDYEARMKKLVSIVAE 74 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~-~~a~~~~~~~~~~~~~ 74 (222)
.+..+..++.++.++..+|+++||...+++|++.+|++++++.|+..+...+|+. +.+.+.+.++....|+
T Consensus 197 ~~~t~~~lng~A~~~l~~~~~~eAe~~L~~al~~~~~~~d~LaNliv~~~~~gk~~~~~~~~l~qL~~~~p~ 268 (290)
T PF04733_consen 197 FGSTPKLLNGLAVCHLQLGHYEEAEELLEEALEKDPNDPDTLANLIVCSLHLGKPTEAAERYLSQLKQSNPN 268 (290)
T ss_dssp S--SHHHHHHHHHHHHHCT-HHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHTT-TCHHHHHHHHHCHHHTTT
T ss_pred cCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHhCCChhHHHHHHHHHHHhCCC
Confidence 3567889999999999999999999999999999999999999999999999998 5566666666555553
No 120
>KOG4234|consensus
Probab=97.51 E-value=0.00027 Score=56.00 Aligned_cols=67 Identities=13% Similarity=0.147 Sum_probs=55.9
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...+.|.|.++.++++++.|+....+|++|+|.+..++...+.+|-.+..+++|++.|.+++...|.
T Consensus 134 sIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAeayek~ek~eealeDyKki~E~dPs 200 (271)
T KOG4234|consen 134 SILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEKALERRAEAYEKMEKYEEALEDYKKILESDPS 200 (271)
T ss_pred HHHHhhhHHHHHHhhhHHHHHHHHHhhHhcCchhHHHHHHHHHHHHhhhhHHHHHHHHHHHHHhCcc
Confidence 3467788888888888888888888888888888888888888888888888888888888877775
No 121
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=97.49 E-value=0.00046 Score=60.90 Aligned_cols=72 Identities=15% Similarity=0.089 Sum_probs=65.1
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
+.++|+.+..+.++|.+|.+.|++.+|+..+.+.+.-+|+++..|..|+..+..+|+-.++...+.+...+.
T Consensus 367 l~l~P~~~~l~~~~a~all~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~~g~~~~a~~A~AE~~~~~ 438 (484)
T COG4783 367 LALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARAEGYALA 438 (484)
T ss_pred HhcCCCccHHHHHHHHHHHhcCChHHHHHHHHHHhhcCCCCchHHHHHHHHHHHhCchHHHHHHHHHHHHhC
Confidence 457899999999999999999999999999999999999999999999999999999888887766655443
No 122
>PRK14574 hmsH outer membrane protein; Provisional
Probab=97.48 E-value=0.00028 Score=67.48 Aligned_cols=73 Identities=15% Similarity=0.056 Sum_probs=61.1
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...|.|+..+..+|.++...|+..+|+..+++++.++|++..+...++.+++.+++|.+|+...+++++..|+
T Consensus 444 ~~aP~n~~l~~~~A~v~~~Rg~p~~A~~~~k~a~~l~P~~~~~~~~~~~~al~l~e~~~A~~~~~~l~~~~Pe 516 (822)
T PRK14574 444 STAPANQNLRIALASIYLARDLPRKAEQELKAVESLAPRSLILERAQAETAMALQEWHQMELLTDDVISRSPE 516 (822)
T ss_pred HhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHhhhCCccHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhCCC
Confidence 3568888888888888888888888888888888888888888888888888888888888888777777765
No 123
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.46 E-value=0.00065 Score=56.03 Aligned_cols=68 Identities=16% Similarity=0.189 Sum_probs=61.1
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-+++++.||.++..+|++++|...|..+++--|++ +++++-+|.++..+++-++|...++++.+..|+
T Consensus 177 ~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~~~~~l~~~d~A~atl~qv~k~YP~ 247 (262)
T COG1729 177 TPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTDEACATLQQVIKRYPG 247 (262)
T ss_pred cchhHHHHHHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHHHHHHHHHHhcCHHHHHHHHHHHHHHCCC
Confidence 47899999999999999999999999999987665 588999999999999999999999998887775
No 124
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=97.45 E-value=0.00012 Score=38.97 Aligned_cols=33 Identities=24% Similarity=0.428 Sum_probs=30.0
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF 41 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~ 41 (222)
.+++++|.++..+|++++|+.+|+++++++|++
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~~~~~~~~ 34 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHHccCCCC
Confidence 578899999999999999999999999998864
No 125
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=97.45 E-value=0.00021 Score=48.60 Aligned_cols=53 Identities=25% Similarity=0.377 Sum_probs=44.7
Q ss_pred cCCCcHHHHHHHHHHHHhCCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 20 SPGNIPEAIQSYRTALKLKPD--FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 20 ~~g~~~eA~~~~~~Al~l~P~--~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
++|++++|+..|+++++.+|. +...++++|.|+...|++++|...+++ ....+
T Consensus 1 ~~~~y~~Ai~~~~k~~~~~~~~~~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~~~ 55 (84)
T PF12895_consen 1 DQGNYENAIKYYEKLLELDPTNPNSAYLYNLAQCYFQQGKYEEAIELLQK-LKLDP 55 (84)
T ss_dssp HTT-HHHHHHHHHHHHHHHCGTHHHHHHHHHHHHHHHTTHHHHHHHHHHC-HTHHH
T ss_pred CCccHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHCCCHHHHHHHHHH-hCCCC
Confidence 368999999999999999995 466788899999999999999998887 44444
No 126
>KOG4555|consensus
Probab=97.43 E-value=0.00088 Score=49.61 Aligned_cols=69 Identities=14% Similarity=0.134 Sum_probs=62.0
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH----HHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP----DAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~----~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
+-|..+.+++|.+.+++-+|+.++|++-+++|+++..+.. .+++..|.+|+..|+-+.|...|+.+.++
T Consensus 72 l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trtacqa~vQRg~lyRl~g~dd~AR~DFe~AA~L 144 (175)
T KOG4555|consen 72 LAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRTACQAFVQRGLLYRLLGNDDAARADFEAAAQL 144 (175)
T ss_pred hcccchHhhccHHHHHHHcCChHHHHHHHHHHHHhcCccchHHHHHHHHHHHHHHHhCchHHHHHhHHHHHHh
Confidence 4589999999999999999999999999999999976654 56899999999999999999999887654
No 127
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=97.39 E-value=0.00097 Score=54.86 Aligned_cols=70 Identities=14% Similarity=0.047 Sum_probs=64.7
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHH---HHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAY---CNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~---~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..+..++..|..+.+.|++++|+..|++++...|+.+.+. .++|.++...+++++|...+++.++..|+.
T Consensus 30 ~~~~~~Y~~A~~~~~~g~y~~Ai~~f~~l~~~yP~s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~ 102 (243)
T PRK10866 30 NPPSEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRLNPTH 102 (243)
T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCC
Confidence 3678889999999999999999999999999999987664 899999999999999999999999999974
No 128
>KOG4642|consensus
Probab=97.39 E-value=0.00012 Score=59.17 Aligned_cols=68 Identities=18% Similarity=0.291 Sum_probs=61.8
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
++-+..-|+-+..-.+|..|+.+|-+|+.++|..+..+.|.+.|++++.+|+...+..++++++.|+.
T Consensus 10 a~qlkE~gnk~f~~k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk~~~~~~v~~dcrralql~~N~ 77 (284)
T KOG4642|consen 10 AEQLKEQGNKCFIPKRYDDAIDCYSRAICINPTVASYYTNRALCHLKLKHWEPVEEDCRRALQLDPNL 77 (284)
T ss_pred HHHHHhccccccchhhhchHHHHHHHHHhcCCCcchhhhhHHHHHHHhhhhhhhhhhHHHHHhcChHH
Confidence 34455667888888999999999999999999999999999999999999999999999999999874
No 129
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=97.38 E-value=0.00066 Score=58.50 Aligned_cols=71 Identities=17% Similarity=0.141 Sum_probs=64.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
+..|++|..+..||.++.+.+.|.+|..+|+.|++..|+ ...+..+|+++.++|+..+|.+.+++++....
T Consensus 322 ~~h~~~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~s-~~~~~~la~~~~~~g~~~~A~~~r~e~L~~~~ 392 (400)
T COG3071 322 KQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPS-ASDYAELADALDQLGEPEEAEQVRREALLLTR 392 (400)
T ss_pred HhCCCChhHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCC-hhhHHHHHHHHHHcCChHHHHHHHHHHHHHhc
Confidence 567999999999999999999999999999999999998 67778999999999999999999999985544
No 130
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=97.37 E-value=0.00088 Score=50.40 Aligned_cols=64 Identities=23% Similarity=0.191 Sum_probs=55.5
Q ss_pred CCCC---HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 4 TPFF---KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 4 ~P~~---~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
.|+. ..+.+.+|.++...|++++|+..|+.++...|+. ..+...|+.++...|++++|...++.
T Consensus 41 ~~~s~ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~~~~~~d~Al~~L~~ 110 (145)
T PF09976_consen 41 YPSSPYAALAALQLAKAAYEQGDYDEAKAALEKALANAPDPELKPLARLRLARILLQQGQYDEALATLQQ 110 (145)
T ss_pred CCCChHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHh
Confidence 4555 6788899999999999999999999999988776 46889999999999999999887644
No 131
>KOG2002|consensus
Probab=97.37 E-value=0.00012 Score=69.06 Aligned_cols=71 Identities=17% Similarity=0.175 Sum_probs=64.2
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
|..+|.|.-+-+.+|.+|...|++.+|++.|.+..+---++.++|.|+|+|+..+|+|..|++.|+.+++-
T Consensus 639 L~~dpkN~yAANGIgiVLA~kg~~~~A~dIFsqVrEa~~~~~dv~lNlah~~~e~~qy~~AIqmYe~~lkk 709 (1018)
T KOG2002|consen 639 LRNDPKNMYAANGIGIVLAEKGRFSEARDIFSQVREATSDFEDVWLNLAHCYVEQGQYRLAIQMYENCLKK 709 (1018)
T ss_pred HhcCcchhhhccchhhhhhhccCchHHHHHHHHHHHHHhhCCceeeeHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45789999999999999999999999999999999887788999999999999999999999999888754
No 132
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=97.32 E-value=0.0013 Score=49.39 Aligned_cols=69 Identities=16% Similarity=0.088 Sum_probs=47.5
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.+..++.-|....+.|+|++|++.|+....--|-. ..+...|+.++...+++++|.+.+++.+++.|.+
T Consensus 9 ~~~~ly~~a~~~l~~~~Y~~A~~~le~L~~ryP~g~ya~qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~h 80 (142)
T PF13512_consen 9 SPQELYQEAQEALQKGNYEEAIKQLEALDTRYPFGEYAEQAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTH 80 (142)
T ss_pred CHHHHHHHHHHHHHhCCHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCC
Confidence 35566677777777777777777777777665553 3566777777777777777777777777776653
No 133
>KOG3060|consensus
Probab=97.31 E-value=0.0011 Score=54.30 Aligned_cols=70 Identities=13% Similarity=0.027 Sum_probs=56.1
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCC---HHHHHHHHHHHHHHHHh
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCD---WTDYEARMKKLVSIVAE 74 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~---~~~a~~~~~~~~~~~~~ 74 (222)
+.++++|..|+.+|...|+|+.|.-||+..+-++|-++..+..++.++.-+|. .+-+..+|.+++.+.+.
T Consensus 151 ~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~gg~eN~~~arkyy~~alkl~~~ 223 (289)
T KOG3060|consen 151 MNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQGGAENLELARKYYERALKLNPK 223 (289)
T ss_pred cCcHHHHHHHHHHHHhHhHHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHhChH
Confidence 67888888888888888888888888888888888888888888888777665 34466777777777774
No 134
>PRK11906 transcriptional regulator; Provisional
Probab=97.29 E-value=0.00042 Score=61.23 Aligned_cols=73 Identities=3% Similarity=-0.177 Sum_probs=66.2
Q ss_pred CCCCCCHHHHHHHHHhcccC---------CCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSP---------GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~---------g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
.++|+++.++..++.++... ....+|...-++|++++|+++.++..+|.++...++.+.+...++++..+.
T Consensus 289 ~ldp~~a~a~~~lA~~h~~~~~~g~~~~~~~~~~a~~~A~rAveld~~Da~a~~~~g~~~~~~~~~~~a~~~f~rA~~L~ 368 (458)
T PRK11906 289 DIQTLKTECYCLLAECHMSLALHGKSELELAAQKALELLDYVSDITTVDGKILAIMGLITGLSGQAKVSHILFEQAKIHS 368 (458)
T ss_pred cCCcccHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHhhcchhhHHHHHHHHhhcC
Confidence 47899999999999888754 346788999999999999999999999999999999999999999999999
Q ss_pred Hh
Q psy17695 73 AE 74 (222)
Q Consensus 73 ~~ 74 (222)
|+
T Consensus 369 Pn 370 (458)
T PRK11906 369 TD 370 (458)
T ss_pred Cc
Confidence 87
No 135
>KOG1127|consensus
Probab=97.29 E-value=0.00064 Score=64.60 Aligned_cols=75 Identities=23% Similarity=0.216 Sum_probs=65.6
Q ss_pred CCCCCCCHHHHHHHHHhcccCCC-cHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC---CHHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGN-IPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVC---DWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~-~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~---~~~~a~~~~~~~~~~~~~~ 75 (222)
|+.+|+|-.+|.-||.++..+|. .++|..+|..|.+++|++.-||-.|++.|.... +.+++-.+|++++-+++++
T Consensus 29 Lk~dpdNYnA~vFLGvAl~sl~q~le~A~ehYv~AaKldpdnlLAWkGL~nLye~~~dIl~ld~~~~~yq~~~l~le~q 107 (1238)
T KOG1127|consen 29 LKEDPDNYNAQVFLGVALWSLGQDLEKAAEHYVLAAKLDPDNLLAWKGLGNLYERYNDILDLDRAAKCYQRAVLILENQ 107 (1238)
T ss_pred HhcCCCcchhhhHHHHHHHhccCCHHHHHHHHHHHHhcChhhhHHHHHHHHHHHccchhhhhhHhHHHHHHHHHhhhhh
Confidence 56899999999999999999999 999999999999999999999999999998744 4666777787777766653
No 136
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=97.29 E-value=0.00062 Score=37.81 Aligned_cols=33 Identities=15% Similarity=0.308 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 42 PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 42 ~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+|+++|.++..++++++|+.+|++++++.|+
T Consensus 1 a~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 33 (34)
T PF00515_consen 1 AEAYYNLGNAYFQLGDYEEALEYYQRALELDPD 33 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred CHHHHHHHHHHHHhCCchHHHHHHHHHHHHCcC
Confidence 468999999999999999999999999998875
No 137
>PRK14574 hmsH outer membrane protein; Provisional
Probab=97.28 E-value=0.00082 Score=64.32 Aligned_cols=70 Identities=11% Similarity=0.093 Sum_probs=54.1
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|........+|.++..+|++++|++.|+++++.+|+++.++..++..+...++.++|.+.++++....|.
T Consensus 99 n~~~~~llalA~ly~~~gdyd~Aiely~kaL~~dP~n~~~l~gLa~~y~~~~q~~eAl~~l~~l~~~dp~ 168 (822)
T PRK14574 99 NISSRGLASAARAYRNEKRWDQALALWQSSLKKDPTNPDLISGMIMTQADAGRGGVVLKQATELAERDPT 168 (822)
T ss_pred CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHhcccCcc
Confidence 3344445555778888888888888888888888888888888888888888888888777777665554
No 138
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=97.28 E-value=0.00026 Score=38.85 Aligned_cols=33 Identities=21% Similarity=0.285 Sum_probs=27.0
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF 41 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~ 41 (222)
++++++|.++.++|++++|+..|++.++..|++
T Consensus 1 ~a~~~~a~~~~~~g~~~~A~~~~~~~~~~~P~s 33 (33)
T PF13174_consen 1 DALYRLARCYYKLGDYDEAIEYFQRLIKRYPDS 33 (33)
T ss_dssp HHHHHHHHHHHHHCHHHHHHHHHHHHHHHSTTS
T ss_pred CHHHHHHHHHHHccCHHHHHHHHHHHHHHCcCC
Confidence 467788888888888888888888888888874
No 139
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=97.28 E-value=0.0008 Score=50.63 Aligned_cols=61 Identities=15% Similarity=0.056 Sum_probs=53.5
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
+.+.+.|+.++..+|++++|+..++. +.-.+-.+.++..+|.++...|++++|...|++++
T Consensus 85 ~~a~l~LA~~~~~~~~~d~Al~~L~~-~~~~~~~~~~~~~~Gdi~~~~g~~~~A~~~y~~Al 145 (145)
T PF09976_consen 85 PLARLRLARILLQQGQYDEALATLQQ-IPDEAFKALAAELLGDIYLAQGDYDEARAAYQKAL 145 (145)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHh-ccCcchHHHHHHHHHHHHHHCCCHHHHHHHHHHhC
Confidence 56889999999999999999999976 44455667889999999999999999999998864
No 140
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=97.27 E-value=0.0016 Score=52.02 Aligned_cols=70 Identities=16% Similarity=0.171 Sum_probs=59.3
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..+..++..|..+.+.|++++|+..|++.+...|.+ +.+.+.+|.++...+++++|...+++.++..|+.
T Consensus 3 ~~~~~lY~~a~~~~~~g~y~~Ai~~f~~l~~~~P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~ 75 (203)
T PF13525_consen 3 DTAEALYQKALEALQQGDYEEAIKLFEKLIDRYPNSPYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKLYPNS 75 (203)
T ss_dssp --HHHHHHHHHHHHHCT-HHHHHHHHHHHHHH-TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-
T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCC
Confidence 357889999999999999999999999999998875 4789999999999999999999999999998873
No 141
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=97.26 E-value=0.0012 Score=58.05 Aligned_cols=69 Identities=7% Similarity=0.056 Sum_probs=58.2
Q ss_pred CCCCCHHHH-HHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGI-LFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 3 ~~P~~~~~~-~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
.+|++..+. ...+.++...|++++|++.+++.++.+|+++.++..++.++...|+|+++.+.+.++.+.
T Consensus 147 ~~~~~~~~~~l~~a~l~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll~~~~~~~gdw~~a~~~l~~l~k~ 216 (398)
T PRK10747 147 LADNDQLPVEITRVRIQLARNENHAARHGVDKLLEVAPRHPEVLRLAEQAYIRTGAWSSLLDILPSMAKA 216 (398)
T ss_pred cCCcchHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhHHHHHHHHHHHHHc
Confidence 456664433 344889999999999999999999999999999999999999999999999777776643
No 142
>KOG0543|consensus
Probab=97.23 E-value=0.00095 Score=57.78 Aligned_cols=73 Identities=16% Similarity=0.200 Sum_probs=61.0
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHH-HHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDY-EARMKKLVSIVA 73 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a-~~~~~~~~~~~~ 73 (222)
|..+|+|..+++..|.++..+|+++.|+..|++|++++|+|-.+...+..+......+++. .+.|..+.....
T Consensus 284 Le~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el~~l~~k~~~~~~kekk~y~~mF~k~~ 357 (397)
T KOG0543|consen 284 LELDPNNVKALYRRGQALLALGEYDLARDDFQKALKLEPSNKAARAELIKLKQKIREYEEKEKKMYANMFAKLA 357 (397)
T ss_pred HhcCCCchhHHHHHHHHHHhhccHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence 4578999999999999999999999999999999999999999999999888777665554 345555544433
No 143
>KOG0624|consensus
Probab=97.20 E-value=0.00036 Score=59.45 Aligned_cols=73 Identities=12% Similarity=0.142 Sum_probs=68.4
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhh
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQL 76 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l 76 (222)
+|.+.+-|+.+|..+...|++..|+..|..|++++|++..+++..+.+|..+|.-..|...+.+++++.|+..
T Consensus 34 ~~advekhlElGk~lla~~Q~sDALt~yHaAve~dp~~Y~aifrRaT~yLAmGksk~al~Dl~rVlelKpDF~ 106 (504)
T KOG0624|consen 34 SPADVEKHLELGKELLARGQLSDALTHYHAAVEGDPNNYQAIFRRATVYLAMGKSKAALQDLSRVLELKPDFM 106 (504)
T ss_pred CHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHcCCchhHHHHHHHHHHHhhhcCCccchhhHHHHHhcCccHH
Confidence 5667888999999999999999999999999999999999999999999999999999999999999888753
No 144
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=97.16 E-value=0.002 Score=40.06 Aligned_cols=43 Identities=14% Similarity=0.084 Sum_probs=36.1
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHC 51 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~ 51 (222)
+.++.++..+.++|++++|..+.+++|+++|++.++....-.+
T Consensus 2 d~lY~lAig~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~~L~~~i 44 (53)
T PF14853_consen 2 DCLYYLAIGHYKLGEYEKARRYCDALLEIEPDNRQAQSLKELI 44 (53)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHTTS-HHHHHHHHHH
T ss_pred hhHHHHHHHHHHhhhHHHHHHHHHHHHhhCCCcHHHHHHHHHH
Confidence 4578899999999999999999999999999999987666544
No 145
>KOG1840|consensus
Probab=97.15 E-value=0.00099 Score=60.22 Aligned_cols=70 Identities=19% Similarity=0.214 Sum_probs=61.4
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--------CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL--------KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--------~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|.-.....++|.+|..++++.+|+..|++|+.+ +|.-+.++.||+..|...|++++|..+++.++.+..+
T Consensus 238 l~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~ 315 (508)
T KOG1840|consen 238 LVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIYEK 315 (508)
T ss_pred HHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhccCChHHHHHHHHHHHHHHHH
Confidence 444556667999999999999999999999987 3555678999999999999999999999999998776
No 146
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=97.15 E-value=0.0011 Score=36.62 Aligned_cols=33 Identities=9% Similarity=0.205 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 42 PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 42 ~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+++.+|.++...|++++|.+.+++++++.|+
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~al~l~p~ 33 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKALELDPN 33 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHHHHHCcC
Confidence 468999999999999999999999999998875
No 147
>KOG1128|consensus
Probab=97.13 E-value=0.00055 Score=63.06 Aligned_cols=73 Identities=12% Similarity=0.038 Sum_probs=69.1
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.++|++.++|+|++.+|...|+-.+|-.++.+|++-+-++...|-|...+....|.|++|...|.+.+.+-.+
T Consensus 547 tL~Pd~~eaWnNls~ayi~~~~k~ra~~~l~EAlKcn~~~w~iWENymlvsvdvge~eda~~A~~rll~~~~~ 619 (777)
T KOG1128|consen 547 TLEPDNAEAWNNLSTAYIRLKKKKRAFRKLKEALKCNYQHWQIWENYMLVSVDVGEFEDAIKAYHRLLDLRKK 619 (777)
T ss_pred hcCCCchhhhhhhhHHHHHHhhhHHHHHHHHHHhhcCCCCCeeeechhhhhhhcccHHHHHHHHHHHHHhhhh
Confidence 5789999999999999999999999999999999999999999999999999999999999999998876554
No 148
>KOG1840|consensus
Probab=97.08 E-value=0.0011 Score=59.95 Aligned_cols=72 Identities=13% Similarity=0.092 Sum_probs=65.7
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--------CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL--------KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--------~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.|....+..+||..|..+|+++.|+..+++|+++ .|.-.....++|.+|+.++++.+|+..|++++.+..+.
T Consensus 195 ~P~~~~~~~~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~ 274 (508)
T KOG1840|consen 195 DPERLRTLRNLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEV 274 (508)
T ss_pred CchHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHh
Confidence 5777788889999999999999999999999999 78877788889999999999999999999999887753
No 149
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=97.08 E-value=0.0019 Score=58.67 Aligned_cols=65 Identities=15% Similarity=0.246 Sum_probs=60.5
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+++.++..|...|++++|+.+.++||+..|..++.++..|.++...|++.+|.+.++++..+...
T Consensus 196 ~~~~lAqhyd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilKh~G~~~~Aa~~~~~Ar~LD~~ 260 (517)
T PF12569_consen 196 TLYFLAQHYDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILKHAGDLKEAAEAMDEARELDLA 260 (517)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHhCChh
Confidence 45788999999999999999999999999999999999999999999999999999999886553
No 150
>KOG2003|consensus
Probab=97.03 E-value=0.001 Score=58.55 Aligned_cols=71 Identities=14% Similarity=0.179 Sum_probs=56.5
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+--++.+..|.|++-...|+++.|.+.|+.||.-+....++++|.|..+..+|+.++|..+|-++..++.+
T Consensus 486 dryn~~a~~nkgn~~f~ngd~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~klh~il~n 556 (840)
T KOG2003|consen 486 DRYNAAALTNKGNIAFANGDLDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKLHAILLN 556 (840)
T ss_pred cccCHHHhhcCCceeeecCcHHHHHHHHHHHHcCchHHHHHHHHhcccHHHhcCHHHHHHHHHHHHHHHHh
Confidence 44567778888888888888888888888888888888888888888888888888888888777665554
No 151
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=97.02 E-value=0.0026 Score=56.25 Aligned_cols=68 Identities=6% Similarity=0.065 Sum_probs=57.2
Q ss_pred CCCCH-HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 4 TPFFK-VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 4 ~P~~~-~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
.|++. .+....+.++...|++++|...+++.++.+|+++.++..++.++...|+|+++.+.+.++.+.
T Consensus 148 ~p~~~l~~~~~~a~l~l~~~~~~~Al~~l~~l~~~~P~~~~~l~ll~~~~~~~~d~~~a~~~l~~l~k~ 216 (409)
T TIGR00540 148 AGNDNILVEIARTRILLAQNELHAARHGVDKLLEMAPRHKEVLKLAEEAYIRSGAWQALDDIIDNMAKA 216 (409)
T ss_pred CCcCchHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHHHc
Confidence 45554 455566888899999999999999999999999999999999999999999998888777754
No 152
>KOG2003|consensus
Probab=96.98 E-value=0.00098 Score=58.70 Aligned_cols=74 Identities=9% Similarity=0.116 Sum_probs=67.6
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.++....++++|+|..+..+|++++|++||-+.-.+--++++.++.++++|-.+.+..+|++.+-++..++|+.
T Consensus 518 ~ndasc~ealfniglt~e~~~~ldeald~f~klh~il~nn~evl~qianiye~led~aqaie~~~q~~slip~d 591 (840)
T KOG2003|consen 518 NNDASCTEALFNIGLTAEALGNLDEALDCFLKLHAILLNNAEVLVQIANIYELLEDPAQAIELLMQANSLIPND 591 (840)
T ss_pred cCchHHHHHHHHhcccHHHhcCHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhhCHHHHHHHHHHhcccCCCC
Confidence 34556789999999999999999999999999999989999999999999999999999999999998888763
No 153
>KOG1308|consensus
Probab=96.98 E-value=0.00052 Score=58.22 Aligned_cols=72 Identities=14% Similarity=0.178 Sum_probs=66.7
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
+.++|..+..|...+.++.++++..+|+.-|..|++++|+.+.-+-..+.+...+|+|+++...++.+.++.
T Consensus 141 i~lnp~~a~l~~kr~sv~lkl~kp~~airD~d~A~ein~Dsa~~ykfrg~A~rllg~~e~aa~dl~~a~kld 212 (377)
T KOG1308|consen 141 IELNPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEINPDSAKGYKFRGYAERLLGNWEEAAHDLALACKLD 212 (377)
T ss_pred cccCCchhhhcccccceeeeccCCchhhhhhhhhhccCcccccccchhhHHHHHhhchHHHHHHHHHHHhcc
Confidence 468899999999999999999999999999999999999999999999999999999999999888887653
No 154
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=96.95 E-value=0.0027 Score=35.96 Aligned_cols=31 Identities=13% Similarity=0.176 Sum_probs=26.7
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 44 AYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 44 a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++.+||.++..+|+|++|+++|++++.+..+
T Consensus 1 al~~Lg~~~~~~g~~~~Ai~~y~~aL~l~~~ 31 (36)
T PF13176_consen 1 ALNNLGRIYRQQGDYEKAIEYYEQALALARD 31 (36)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHHHHhccc
Confidence 5789999999999999999999998866654
No 155
>KOG2002|consensus
Probab=96.94 E-value=0.0016 Score=61.76 Aligned_cols=70 Identities=14% Similarity=0.153 Sum_probs=63.7
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC-HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF-PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~-~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
--++.++.+|.++..+|++++|-.+|.++++.+|++ ...++.+|..+...|+.+.+..+|+++....|+.
T Consensus 305 ~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~GlgQm~i~~~dle~s~~~fEkv~k~~p~~ 375 (1018)
T KOG2002|consen 305 IKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYIKRGDLEESKFCFEKVLKQLPNN 375 (1018)
T ss_pred HHHHHHHHHHHHHHhhccHHHHHHHHHHHHccCCCCccccccchhHHHHHhchHHHHHHHHHHHHHhCcch
Confidence 346779999999999999999999999999999998 8889999999999999999999999999888764
No 156
>KOG1156|consensus
Probab=96.93 E-value=0.0011 Score=60.30 Aligned_cols=75 Identities=12% Similarity=0.078 Sum_probs=71.1
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
|++.|++.+.+.-.|..+..+|+-+||..+.+.+++.++.+.-.|.-+|.+++...++++|+.+|+.++.+.+++
T Consensus 34 L~k~~eHgeslAmkGL~L~~lg~~~ea~~~vr~glr~d~~S~vCwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN 108 (700)
T KOG1156|consen 34 LKKFPEHGESLAMKGLTLNCLGKKEEAYELVRLGLRNDLKSHVCWHVLGLLQRSDKKYDEAIKCYRNALKIEKDN 108 (700)
T ss_pred HHhCCccchhHHhccchhhcccchHHHHHHHHHHhccCcccchhHHHHHHHHhhhhhHHHHHHHHHHHHhcCCCc
Confidence 467899999999999999999999999999999999999999999999999999999999999999999887764
No 157
>KOG1174|consensus
Probab=96.92 E-value=0.0033 Score=54.87 Aligned_cols=73 Identities=15% Similarity=0.165 Sum_probs=66.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+|++.+++...|.++..+|+.++|+-+|+.|..+.|..-+.+-.|.++|...+...||....+.+.+..++
T Consensus 328 ~~~~r~~~alilKG~lL~~~~R~~~A~IaFR~Aq~Lap~rL~~Y~GL~hsYLA~~~~kEA~~~An~~~~~~~~ 400 (564)
T KOG1174|consen 328 DSEPRNHEALILKGRLLIALERHTQAVIAFRTAQMLAPYRLEIYRGLFHSYLAQKRFKEANALANWTIRLFQN 400 (564)
T ss_pred ccCcccchHHHhccHHHHhccchHHHHHHHHHHHhcchhhHHHHHHHHHHHHhhchHHHHHHHHHHHHHHhhc
Confidence 4688999999999999999999999999999999999999999999999999999999999888888888776
No 158
>PRK10941 hypothetical protein; Provisional
Probab=96.91 E-value=0.0047 Score=51.55 Aligned_cols=66 Identities=9% Similarity=-0.006 Sum_probs=62.0
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
....||=.+|.+.++++.|+.+.++.+.++|+++.-+-..|.++.+++.+..|...++..++..|+
T Consensus 182 Rml~nLK~~~~~~~~~~~AL~~~e~ll~l~P~dp~e~RDRGll~~qL~c~~~A~~DL~~fl~~~P~ 247 (269)
T PRK10941 182 KLLDTLKAALMEEKQMELALRASEALLQFDPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPE 247 (269)
T ss_pred HHHHHHHHHHHHcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHHHhCCC
Confidence 456788899999999999999999999999999999999999999999999999999999988876
No 159
>KOG3060|consensus
Probab=96.89 E-value=0.0045 Score=50.74 Aligned_cols=73 Identities=18% Similarity=0.090 Sum_probs=62.2
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
|..||.+...+-..-.++..+|+.-+||+.+..-++.-+++.++|..++.+|...++++.|.-|+++++-..|
T Consensus 113 L~ddpt~~v~~KRKlAilka~GK~l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P 185 (289)
T KOG3060|consen 113 LEDDPTDTVIRKRKLAILKAQGKNLEAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQP 185 (289)
T ss_pred hccCcchhHHHHHHHHHHHHcCCcHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHhHHHHHHHHHHHHHHcCC
Confidence 3457888888877778888888888999999889999999999999999999999999999999998875444
No 160
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=96.87 E-value=0.0012 Score=58.23 Aligned_cols=37 Identities=16% Similarity=0.036 Sum_probs=34.6
Q ss_pred CCCCCCCHHH---HHHHHHhcccCCCcHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVG---ILFSNSSFPSPGNIPEAIQSYRTALKL 37 (222)
Q Consensus 1 l~~~P~~~~~---~~~lg~~l~~~g~~~eA~~~~~~Al~l 37 (222)
|.++|+++++ |+|+|.+|..+|++++|+++|++|+++
T Consensus 102 LeL~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrALel 141 (453)
T PLN03098 102 LELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTALRD 141 (453)
T ss_pred HhhCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 4678999965 999999999999999999999999998
No 161
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=96.87 E-value=0.0021 Score=51.63 Aligned_cols=68 Identities=15% Similarity=0.215 Sum_probs=64.4
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-+..++..|+.|-.+|-+.-|.--|.+++.+.|+.+++++.||..+...|+++.|.+.|....++.|.
T Consensus 64 RA~l~fERGvlYDSlGL~~LAR~DftQaLai~P~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~ 131 (297)
T COG4785 64 RAQLLFERGVLYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPT 131 (297)
T ss_pred HHHHHHHhcchhhhhhHHHHHhhhhhhhhhcCCCcHHHHHHHHHHHHhcccchHHHHHhhhHhccCCc
Confidence 46788999999999999999999999999999999999999999999999999999999999988875
No 162
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=96.86 E-value=0.0018 Score=37.37 Aligned_cols=29 Identities=17% Similarity=0.297 Sum_probs=17.6
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKL 37 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l 37 (222)
.++++||.+|..+|++++|+.++++++++
T Consensus 3 ~~~~~la~~~~~~g~~~~A~~~~~~al~~ 31 (42)
T PF13374_consen 3 SALNNLANAYRAQGRYEEALELLEEALEI 31 (42)
T ss_dssp HHHHHHHHHHHHCT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhhhcchhhHHHHHHHHH
Confidence 45566666666666666666666666654
No 163
>KOG0550|consensus
Probab=96.85 E-value=0.0013 Score=57.26 Aligned_cols=75 Identities=12% Similarity=0.094 Sum_probs=67.1
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHH------------HHHHHHHHHHcCCHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDA------------YCNLAHCLQIVCDWTDYEARMKKL 68 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a------------~~~l~~~~~~~~~~~~a~~~~~~~ 68 (222)
|++++.+.++...-|.++.-.++.+.|+..|+++|.++|++..+ +-+-|+-..+.|++..|.++|.++
T Consensus 196 lkld~~n~~al~vrg~~~yy~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le~~k~~gN~~fk~G~y~~A~E~Ytea 275 (486)
T KOG0550|consen 196 LKLDATNAEALYVRGLCLYYNDNADKAINHFQQALRLDPDHQKSKSASMMPKKLEVKKERGNDAFKNGNYRKAYECYTEA 275 (486)
T ss_pred HhcccchhHHHHhcccccccccchHHHHHHHhhhhccChhhhhHHhHhhhHHHHHHHHhhhhhHhhccchhHHHHHHHHh
Confidence 57899999999999999999999999999999999999998754 445566677899999999999999
Q ss_pred HHHHHhh
Q psy17695 69 VSIVAEQ 75 (222)
Q Consensus 69 ~~~~~~~ 75 (222)
+.+.|+.
T Consensus 276 l~idP~n 282 (486)
T KOG0550|consen 276 LNIDPSN 282 (486)
T ss_pred hcCCccc
Confidence 9998875
No 164
>KOG4162|consensus
Probab=96.84 E-value=0.0022 Score=59.47 Aligned_cols=72 Identities=18% Similarity=0.154 Sum_probs=63.9
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHH--HHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEA--RMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~--~~~~~~~~~~~ 74 (222)
++|..+..|+..|.++...|+++||.++|..|+.+||+++.....+|.++.+.|+..-++. ....++++.|.
T Consensus 679 ~~~l~~~~~~~~G~~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Ala~~lle~G~~~la~~~~~L~dalr~dp~ 752 (799)
T KOG4162|consen 679 IDPLSASVYYLRGLLLEVKGQLEEAKEAFLVALALDPDHVPSMTALAELLLELGSPRLAEKRSLLSDALRLDPL 752 (799)
T ss_pred cchhhHHHHHHhhHHHHHHHhhHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCcchHHHHHHHHHHHhhCCC
Confidence 4567778889999999999999999999999999999999999999999999998877777 77888877775
No 165
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=96.83 E-value=0.0078 Score=45.21 Aligned_cols=49 Identities=22% Similarity=0.295 Sum_probs=40.9
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHH---HHHHHHHHHHHc
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPD---AYCNLAHCLQIV 55 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~---a~~~l~~~~~~~ 55 (222)
-..+.+.||.++.+.|++++|+..+++-++++|+++. +++..|.+...+
T Consensus 46 a~qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~hp~vdYa~Y~~gL~~~~~ 97 (142)
T PF13512_consen 46 AEQAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTHPNVDYAYYMRGLSYYEQ 97 (142)
T ss_pred cHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCCCccHHHHHHHHHHHHH
Confidence 4578899999999999999999999999999999975 456666555443
No 166
>KOG1127|consensus
Probab=96.83 E-value=0.0026 Score=60.60 Aligned_cols=72 Identities=13% Similarity=0.043 Sum_probs=65.7
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
|..+|.+.+.|..+|.+|...|++..|++.|.+|..++|.+..+.+-.+.....+|.+.++...+...+...
T Consensus 589 LR~dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~y~~fk~A~~ecd~GkYkeald~l~~ii~~~ 660 (1238)
T KOG1127|consen 589 LRTDPKDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSKYGRFKEAVMECDNGKYKEALDALGLIIYAF 660 (1238)
T ss_pred hcCCchhHHHHHHHHHHHHhcCceehHHHhhhhhHhcCcHhHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH
Confidence 356899999999999999999999999999999999999999999999999999999999998887776543
No 167
>KOG4648|consensus
Probab=96.79 E-value=0.0013 Score=56.11 Aligned_cols=75 Identities=9% Similarity=0.126 Sum_probs=70.9
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
|..+|-|+..+.|.+.+|.++.+|..|..-++.|+.|+-.+..++...+.+...+|...+|.+.++.++++.|+.
T Consensus 124 ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~KAYSRR~~AR~~Lg~~~EAKkD~E~vL~LEP~~ 198 (536)
T KOG4648|consen 124 IAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVKAYSRRMQARESLGNNMEAKKDCETVLALEPKN 198 (536)
T ss_pred hccCCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHHHHHHHHHHhhHHHHHHhHHHHHhhCccc
Confidence 457899999999999999999999999999999999999999999999999999999999999999999988763
No 168
>KOG1174|consensus
Probab=96.75 E-value=0.0054 Score=53.56 Aligned_cols=74 Identities=11% Similarity=0.101 Sum_probs=67.4
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
|+.+|+...+...++.++..-|++++++..++++|+..|+ ...+..||.++..+..+.++.+.|..++++.|+.
T Consensus 431 L~~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~~D-~~LH~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP~~ 504 (564)
T KOG1174|consen 431 LKINPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLIIFPD-VNLHNHLGDIMRAQNEPQKAMEYYYKALRQDPKS 504 (564)
T ss_pred hccCCccHHHHHHHHHHHHhhCccchHHHHHHHHHhhccc-cHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCccc
Confidence 5688999999999999999999999999999999998887 5678899999999999999999999999998863
No 169
>KOG1129|consensus
Probab=96.65 E-value=0.00065 Score=57.58 Aligned_cols=72 Identities=10% Similarity=-0.034 Sum_probs=54.3
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQLD 77 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l~ 77 (222)
..++.+..|+.+|.+..+.+.|+..|...+..-|.++..+...+.++..+++.++|.+.|+.+++..+.+++
T Consensus 254 ~~~dTfllLskvY~ridQP~~AL~~~~~gld~fP~~VT~l~g~ARi~eam~~~~~a~~lYk~vlk~~~~nvE 325 (478)
T KOG1129|consen 254 PHPDTFLLLSKVYQRIDQPERALLVIGEGLDSFPFDVTYLLGQARIHEAMEQQEDALQLYKLVLKLHPINVE 325 (478)
T ss_pred CchhHHHHHHHHHHHhccHHHHHHHHhhhhhcCCchhhhhhhhHHHHHHHHhHHHHHHHHHHHHhcCCccce
Confidence 456677777777777777777777777777777777777777777777777777777777777777665544
No 170
>KOG3081|consensus
Probab=96.52 E-value=0.0093 Score=49.22 Aligned_cols=61 Identities=18% Similarity=0.053 Sum_probs=55.7
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEA 63 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~ 63 (222)
+.|..+...+.+++++..+|+++||...++.||.-++++++++.|+..+-..+|...++.+
T Consensus 202 k~~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL~Nliv~a~~~Gkd~~~~~ 262 (299)
T KOG3081|consen 202 KTPPTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPETLANLIVLALHLGKDAEVTE 262 (299)
T ss_pred ccCCChHHHccHHHHHHHhcCHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhCCChHHHH
Confidence 3577888999999999999999999999999999999999999999999999998866654
No 171
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=96.52 E-value=0.0016 Score=50.67 Aligned_cols=51 Identities=12% Similarity=0.101 Sum_probs=40.7
Q ss_pred CCCCCCCHHHHHHHHHhcccCCC-----------cHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGN-----------IPEAIQSYRTALKLKPDFPDAYCNLAHC 51 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~-----------~~eA~~~~~~Al~l~P~~~~a~~~l~~~ 51 (222)
|.++|+...++.++|++|..++. |++|..+|++|...+|++...+-.|-.+
T Consensus 62 L~I~P~~hdAlw~lGnA~ts~A~l~~d~~~A~~~F~kA~~~FqkAv~~~P~ne~Y~ksLe~~ 123 (186)
T PF06552_consen 62 LKINPNKHDALWCLGNAYTSLAFLTPDTAEAEEYFEKATEYFQKAVDEDPNNELYRKSLEMA 123 (186)
T ss_dssp HHH-TT-HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH-TT-HHHHHHHHHH
T ss_pred HhcCCchHHHHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHHHHHHhcCCCcHHHHHHHHHH
Confidence 35789999999999999998875 8899999999999999998887777544
No 172
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=96.50 E-value=0.015 Score=47.77 Aligned_cols=53 Identities=15% Similarity=0.147 Sum_probs=43.1
Q ss_pred CCCCCHHH---HHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH---HHHHHHHHHHHHc
Q psy17695 3 KTPFFKVG---ILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP---DAYCNLAHCLQIV 55 (222)
Q Consensus 3 ~~P~~~~~---~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~---~a~~~l~~~~~~~ 55 (222)
..|..+.+ .+.+|.++.+++++++|+..|++.+++.|+++ .+++.+|.+....
T Consensus 61 ~yP~s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~~~~~a~Y~~g~~~~~~ 119 (243)
T PRK10866 61 RYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRLNPTHPNIDYVLYMRGLTNMAL 119 (243)
T ss_pred hCCCChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCCCchHHHHHHHHHhhhhc
Confidence 34555444 48999999999999999999999999998875 5688888875444
No 173
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=96.42 E-value=0.0056 Score=36.21 Aligned_cols=34 Identities=15% Similarity=0.289 Sum_probs=31.3
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 42 PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 42 ~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
+.++..++.++...|++++|++.|+++++..|+.
T Consensus 1 p~~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~~ 34 (44)
T PF13428_consen 1 PAAWLALARAYRRLGQPDEAERLLRRALALDPDD 34 (44)
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCC
Confidence 4678999999999999999999999999999874
No 174
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=96.39 E-value=0.012 Score=49.89 Aligned_cols=68 Identities=10% Similarity=0.001 Sum_probs=54.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
+..+..|+..+....+.+.|+..+.+|++.+|+.+.+-..+|.+....|+++.|.+.++.+.+..|+.
T Consensus 180 AqfyCELAq~~~~~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~eQn~~y 247 (389)
T COG2956 180 AQFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNPEY 247 (389)
T ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHhhCccceehhhhhhHHHHhccchHHHHHHHHHHHHhChHH
Confidence 45566777777777888888888888888888888888888888888888888888888777766653
No 175
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=96.37 E-value=0.027 Score=39.01 Aligned_cols=44 Identities=30% Similarity=0.326 Sum_probs=24.0
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 28 IQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 28 ~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
+..++++++.+|++.++.+.++..+...|++++|.+.+-++++.
T Consensus 8 ~~al~~~~a~~P~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~ 51 (90)
T PF14561_consen 8 IAALEAALAANPDDLDARYALADALLAAGDYEEALDQLLELVRR 51 (90)
T ss_dssp HHHHHHHHHHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC
T ss_pred HHHHHHHHHcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 44555666666666666666666666666666555554444443
No 176
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=96.35 E-value=0.0087 Score=50.74 Aligned_cols=72 Identities=8% Similarity=0.110 Sum_probs=63.6
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH-HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP-DAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~-~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.-+|....+-.-+|.+....|+++.|++.++++++-||++. ++.-.|..||..+|+.++....+.++.+..+
T Consensus 208 qa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~eQn~~yl~evl~~L~~~Y~~lg~~~~~~~fL~~~~~~~~ 280 (389)
T COG2956 208 QADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQLGKPAEGLNFLRRAMETNT 280 (389)
T ss_pred hhCccceehhhhhhHHHHhccchHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHccC
Confidence 45788899999999999999999999999999999999985 6778888999999999999888877776544
No 177
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=96.30 E-value=0.0057 Score=58.75 Aligned_cols=47 Identities=13% Similarity=-0.154 Sum_probs=29.6
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAH 50 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~ 50 (222)
+|.+..+|..|..++...|++++|++..+.+++.+|+...+++.+|.
T Consensus 27 ~p~n~~a~~~Li~~~~~~~~~deai~i~~~~l~~~P~~i~~yy~~G~ 73 (906)
T PRK14720 27 SLSKFKELDDLIDAYKSENLTDEAKDICEEHLKEHKKSISALYISGI 73 (906)
T ss_pred CcchHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCCcceehHHHHHH
Confidence 56666666666666666666666666666666666655544444444
No 178
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=96.30 E-value=0.013 Score=32.11 Aligned_cols=32 Identities=19% Similarity=0.364 Sum_probs=29.6
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+++.+|.++..+|++++|.+.|++++++.|+
T Consensus 2 ~~~~~lg~~y~~~~~~~~A~~~~~~a~~~~~~ 33 (34)
T PF13181_consen 2 EAYYNLGKIYEQLGDYEEALEYFEKALELNPD 33 (34)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCC
Confidence 57899999999999999999999999998774
No 179
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=96.28 E-value=0.0082 Score=54.75 Aligned_cols=74 Identities=11% Similarity=0.025 Sum_probs=61.1
Q ss_pred CCCCCCCHHHHHHHHHhcccC--------CCcHHHHHHHHHHHHh--CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSP--------GNIPEAIQSYRTALKL--KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~--------g~~~eA~~~~~~Al~l--~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
+.++|+++.+|..++.++... ++..++....++++.+ +|.++.++.-++..+...|++++|...+++++.
T Consensus 369 i~ldP~~a~a~A~la~~~~~~~~~~~~~~~~l~~a~~~~~~a~al~~~~~~~~~~~ala~~~~~~g~~~~A~~~l~rAl~ 448 (517)
T PRK10153 369 LKSEPDFTYAQAEKALADIVRHSQQPLDEKQLAALSTELDNIVALPELNVLPRIYEILAVQALVKGKTDEAYQAINKAID 448 (517)
T ss_pred HHhCCCcHHHHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHhhhcccCcCChHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Confidence 357999999999998877553 2355677777887775 888889999999999999999999999999998
Q ss_pred HHHh
Q psy17695 71 IVAE 74 (222)
Q Consensus 71 ~~~~ 74 (222)
+.|.
T Consensus 449 L~ps 452 (517)
T PRK10153 449 LEMS 452 (517)
T ss_pred cCCC
Confidence 8764
No 180
>KOG4234|consensus
Probab=96.28 E-value=0.0099 Score=47.28 Aligned_cols=63 Identities=21% Similarity=0.237 Sum_probs=55.8
Q ss_pred HHHHHhcccCCCcHHHHHHHHHHHHhCCCCH-----HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 12 LFSNSSFPSPGNIPEAIQSYRTALKLKPDFP-----DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 12 ~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~-----~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-.=|+-++..|+|++|..-|..||++-|.-. ..+.|.+.++.+++.|+.|+..+.+++.+.|.
T Consensus 99 K~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~pt 166 (271)
T KOG4234|consen 99 KKEGNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELNPT 166 (271)
T ss_pred HHHHHHhhhcccHHHHHHHHHHHHHhCccccHHHHHHHHhhhHHHHHHhhhHHHHHHHHHhhHhcCch
Confidence 3457888999999999999999999999865 46899999999999999999999999988773
No 181
>KOG3824|consensus
Probab=96.27 E-value=0.0078 Score=50.80 Aligned_cols=65 Identities=15% Similarity=0.218 Sum_probs=54.7
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+-.+.+.-..+.|+.+.|...|+.|++++|++++++..+|...-...+.-+|..+|-+++.+.|.
T Consensus 118 ~Al~~A~~~~~~Gk~ekA~~lfeHAlalaP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~ 182 (472)
T KOG3824|consen 118 LALKAAGRSRKDGKLEKAMTLFEHALALAPTNPQILIEMGQFREMHNEIVEADQCYVKALTISPG 182 (472)
T ss_pred HHHHHHHHHHhccchHHHHHHHHHHHhcCCCCHHHHHHHhHHHHhhhhhHhhhhhhheeeeeCCC
Confidence 34455556678899999999999999999999999999999988888888899999888877664
No 182
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=96.17 E-value=0.0087 Score=32.50 Aligned_cols=32 Identities=22% Similarity=0.404 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++++++|.++...|++++|...++++++..|+
T Consensus 1 ~a~~~~a~~~~~~g~~~~A~~~~~~~~~~~P~ 32 (33)
T PF13174_consen 1 DALYRLARCYYKLGDYDEAIEYFQRLIKRYPD 32 (33)
T ss_dssp HHHHHHHHHHHHHCHHHHHHHHHHHHHHHSTT
T ss_pred CHHHHHHHHHHHccCHHHHHHHHHHHHHHCcC
Confidence 47899999999999999999999999988775
No 183
>KOG0376|consensus
Probab=96.12 E-value=0.0021 Score=56.86 Aligned_cols=73 Identities=11% Similarity=0.152 Sum_probs=47.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.++|+++..+.+.+.++.+.+++.+|+.-+.+|++++|....+++..|.+.+..+.+.+|...++....+.|+
T Consensus 32 ~ldpnca~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m~l~~~~~A~~~l~~~~~l~Pn 104 (476)
T KOG0376|consen 32 ELDPNCAIYFANRALAHLKVESFGGALHDALKAIELDPTYIKAYVRRGTAVMALGEFKKALLDLEKVKKLAPN 104 (476)
T ss_pred hcCCcceeeechhhhhheeechhhhHHHHHHhhhhcCchhhheeeeccHHHHhHHHHHHHHHHHHHhhhcCcC
Confidence 3566666666666666666666666666666666666666666666666666666666666666665555543
No 184
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=96.02 E-value=0.021 Score=45.54 Aligned_cols=70 Identities=20% Similarity=0.253 Sum_probs=51.7
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH---HHHHHHHHHHHHcC-----------CHHHHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP---DAYCNLAHCLQIVC-----------DWTDYEARMKKLVS 70 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~---~a~~~l~~~~~~~~-----------~~~~a~~~~~~~~~ 70 (222)
|--+.+.+.+|.++.+.|++++|+..|++.++..|+++ .+++.+|.++.... ...+|...++.+++
T Consensus 39 ~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~~~~~~A~Y~~g~~~~~~~~~~~~~~~D~~~~~~A~~~~~~li~ 118 (203)
T PF13525_consen 39 PYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKLYPNSPKADYALYMLGLSYYKQIPGILRSDRDQTSTRKAIEEFEELIK 118 (203)
T ss_dssp TTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-TTHHHHHHHHHHHHHHHHHHHH-TT---HHHHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhhHHHHHHHHHHHhCccchhcccChHHHHHHHHHHHHHHH
Confidence 45578999999999999999999999999999999976 57888888865542 12356666666666
Q ss_pred HHHh
Q psy17695 71 IVAE 74 (222)
Q Consensus 71 ~~~~ 74 (222)
..|+
T Consensus 119 ~yP~ 122 (203)
T PF13525_consen 119 RYPN 122 (203)
T ss_dssp H-TT
T ss_pred HCcC
Confidence 6665
No 185
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=96.00 E-value=0.023 Score=53.60 Aligned_cols=64 Identities=8% Similarity=-0.066 Sum_probs=58.7
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
+...|..|..++...|+++.|...+++.++++|++...+..++++|...|+|++|.+.+++..+
T Consensus 493 ~~~~~~~Ll~a~~~~g~~~~a~~~~~~l~~~~p~~~~~y~~L~~~y~~~G~~~~A~~v~~~m~~ 556 (697)
T PLN03081 493 TVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLKR 556 (697)
T ss_pred CHHHHHHHHHHHHHcCCcHHHHHHHHHHhCCCCCCCcchHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 4667999999999999999999999999999999999999999999999999999998877654
No 186
>KOG0545|consensus
Probab=95.92 E-value=0.027 Score=46.18 Aligned_cols=66 Identities=15% Similarity=0.150 Sum_probs=61.3
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+.|++.++...|++-|+++.....|..+|+++.|++..+.+....-+..+|.+.+.+++++.|.
T Consensus 231 pLllNy~QC~L~~~e~yevleh~seiL~~~~~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldps 296 (329)
T KOG0545|consen 231 PLLLNYCQCLLKKEEYYEVLEHCSEILRHHPGNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPS 296 (329)
T ss_pred HHHHhHHHHHhhHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHhhcCHHHHHHHHHHHHhcChh
Confidence 467899999999999999999999999999999999999999999888889999999999988774
No 187
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=95.91 E-value=0.028 Score=46.53 Aligned_cols=65 Identities=22% Similarity=0.297 Sum_probs=60.0
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-.++.+.-+.+.|+|.+|++.|..-++.=|++ +.+++=||.++..+|++++|...|..+.+-.|+
T Consensus 143 ~~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~ 210 (262)
T COG1729 143 KLYNAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPK 210 (262)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCcccchhHHHHHHHHHhcccchHHHHHHHHHHHhCCC
Confidence 37889999999999999999999999998886 578999999999999999999999999887776
No 188
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=95.86 E-value=0.038 Score=48.63 Aligned_cols=69 Identities=17% Similarity=0.023 Sum_probs=63.9
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
++++...++.++...++-.+|+..+.++++.+|.+...+...+..+...++++.|....++++...|..
T Consensus 199 ~pev~~~LA~v~l~~~~E~~AI~ll~~aL~~~p~d~~LL~~Qa~fLl~k~~~~lAL~iAk~av~lsP~~ 267 (395)
T PF09295_consen 199 DPEVAVLLARVYLLMNEEVEAIRLLNEALKENPQDSELLNLQAEFLLSKKKYELALEIAKKAVELSPSE 267 (395)
T ss_pred CCcHHHHHHHHHHhcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCchh
Confidence 456778899999999999999999999999999999999999999999999999999999999988873
No 189
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=95.74 E-value=0.011 Score=39.19 Aligned_cols=31 Identities=19% Similarity=0.262 Sum_probs=29.0
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKL 37 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l 37 (222)
.+.++.++|.++..+|++++|+++|++|+++
T Consensus 45 ~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i 75 (78)
T PF13424_consen 45 TANTLNNLGECYYRLGDYEEALEYYQKALDI 75 (78)
T ss_dssp HHHHHHHHHHHHHHTTHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHhh
Confidence 4678999999999999999999999999986
No 190
>PLN03077 Protein ECB2; Provisional
Probab=95.69 E-value=0.053 Score=52.46 Aligned_cols=63 Identities=10% Similarity=0.027 Sum_probs=54.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
+..|..|-.++...|+.+.+....+++++++|+++..+..|+++|...|+|+++....+...+
T Consensus 657 ~~~~~aLl~ac~~~~~~e~~e~~a~~l~~l~p~~~~~y~ll~n~ya~~g~~~~a~~vr~~M~~ 719 (857)
T PLN03077 657 PAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADAGKWDEVARVRKTMRE 719 (857)
T ss_pred HHHHHHHHHHHHHcCChHHHHHHHHHHHhhCCCCcchHHHHHHHHHHCCChHHHHHHHHHHHH
Confidence 667777777788888888888888899999999999999999999999999999876665543
No 191
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=95.66 E-value=0.062 Score=45.03 Aligned_cols=64 Identities=11% Similarity=0.126 Sum_probs=50.9
Q ss_pred HHHHHHHHHhcccC-CCcHHHHHHHHHHHHh--CCCC----HHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSP-GNIPEAIQSYRTALKL--KPDF----PDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 8 ~~~~~~lg~~l~~~-g~~~eA~~~~~~Al~l--~P~~----~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
+..+.++|.+|... |++++|+++|++|+++ .-+. ...+.+++.++...++|++|.+.|+++...
T Consensus 114 A~~~~~lA~~ye~~~~d~e~Ai~~Y~~A~~~y~~e~~~~~a~~~~~~~A~l~~~l~~y~~A~~~~e~~~~~ 184 (282)
T PF14938_consen 114 AKCLKELAEIYEEQLGDYEKAIEYYQKAAELYEQEGSPHSAAECLLKAADLYARLGRYEEAIEIYEEVAKK 184 (282)
T ss_dssp HHHHHHHHHHHCCTT--HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHCCChhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHH
Confidence 56789999999999 9999999999999998 2222 245788899999999999999999887653
No 192
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=95.66 E-value=0.015 Score=47.15 Aligned_cols=60 Identities=17% Similarity=0.350 Sum_probs=54.6
Q ss_pred HhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 16 SSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 16 ~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..+.+.|+.+.|.+.|.+|+++.|+....|+.+|....+.|+.+.|.+.|++.+++.|+.
T Consensus 3 ~~~~~~~D~~aaaely~qal~lap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D 62 (287)
T COG4976 3 YMLAESGDAEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPED 62 (287)
T ss_pred chhcccCChHHHHHHHHHHhhcCchhhhhhhhcchhhhhcccHHHHHHHHHHHHcCCccc
Confidence 456678999999999999999999999999999999999999999999999999988863
No 193
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=95.59 E-value=0.16 Score=46.41 Aligned_cols=69 Identities=19% Similarity=0.109 Sum_probs=45.6
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC-----CHHHHHHHHHHHHHHHHh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVC-----DWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~-----~~~~a~~~~~~~~~~~~~ 74 (222)
+.....-..|.+|.++|++++|...|+..|+.+|++...+..+..++.... +.+...+.|+++....|.
T Consensus 36 Dk~~~~E~rA~ll~kLg~~~eA~~~y~~Li~rNPdn~~Yy~~L~~~~g~~~~~~~~~~~~~~~~y~~l~~~yp~ 109 (517)
T PF12569_consen 36 DKLAVLEKRAELLLKLGRKEEAEKIYRELIDRNPDNYDYYRGLEEALGLQLQLSDEDVEKLLELYDELAEKYPR 109 (517)
T ss_pred CHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCcHHHHHHHHHHHhhhcccccccHHHHHHHHHHHHHhCcc
Confidence 444556677778888888888888888888888888877777777763332 234445555555544443
No 194
>KOG3824|consensus
Probab=95.56 E-value=0.0086 Score=50.55 Aligned_cols=52 Identities=23% Similarity=0.162 Sum_probs=48.6
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCL 52 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~ 52 (222)
|+++|++++++..+|......+++-+|-+||-+||.++|.+.+++.|.....
T Consensus 143 lalaP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseALvnR~RT~ 194 (472)
T KOG3824|consen 143 LALAPTNPQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEALVNRARTT 194 (472)
T ss_pred HhcCCCCHHHHHHHhHHHHhhhhhHhhhhhhheeeeeCCCchHHHhhhhccc
Confidence 5789999999999999999999999999999999999999999999987653
No 195
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=95.47 E-value=0.037 Score=38.29 Aligned_cols=65 Identities=11% Similarity=0.107 Sum_probs=49.8
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC--HHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF--PDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~--~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
.+|++..+.+.++..+...|++++|++.+-..++.++++ ..+.-.+..++..+|.-+.....|++
T Consensus 17 ~~P~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~~~~~ar~~ll~~f~~lg~~~plv~~~RR 83 (90)
T PF14561_consen 17 ANPDDLDARYALADALLAAGDYEEALDQLLELVRRDRDYEDDAARKRLLDIFELLGPGDPLVSEYRR 83 (90)
T ss_dssp HSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC-TTCCCCHHHHHHHHHHHHH-TT-HHHHHHHH
T ss_pred cCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccccccHHHHHHHHHHHHcCCCChHHHHHHH
Confidence 589999999999999999999999999999999999887 66777777777777775544444443
No 196
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=95.47 E-value=0.05 Score=31.01 Aligned_cols=33 Identities=15% Similarity=0.185 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 42 PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 42 ~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..++.++|.++..+|++++|+..+++++.+..+
T Consensus 2 a~~~~~la~~~~~~g~~~~A~~~~~~al~~~~~ 34 (42)
T PF13374_consen 2 ASALNNLANAYRAQGRYEEALELLEEALEIRER 34 (42)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHHH--
T ss_pred HHHHHHHHHHHHhhhhcchhhHHHHHHHHHHHH
Confidence 457899999999999999999999999988765
No 197
>KOG2796|consensus
Probab=95.41 E-value=0.025 Score=46.89 Aligned_cols=68 Identities=19% Similarity=0.173 Sum_probs=61.1
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+-.++.|.+.++.-.+++.+|...|.+.++-||.++.+-+|.+.|++.+|+..+|.+..+.++..+|.
T Consensus 251 ~~~V~~n~a~i~lg~nn~a~a~r~~~~i~~~D~~~~~a~NnKALcllYlg~l~DAiK~~e~~~~~~P~ 318 (366)
T KOG2796|consen 251 KIMVLMNSAFLHLGQNNFAEAHRFFTEILRMDPRNAVANNNKALCLLYLGKLKDALKQLEAMVQQDPR 318 (366)
T ss_pred hHHHHhhhhhheecccchHHHHHHHhhccccCCCchhhhchHHHHHHHHHHHHHHHHHHHHHhccCCc
Confidence 34567788889999999999999999999999999999999999999999999999988888877775
No 198
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=95.40 E-value=0.098 Score=43.04 Aligned_cols=70 Identities=16% Similarity=0.127 Sum_probs=64.0
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH---HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP---DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~---~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..+..|++-|....+.|++++|+..|+......|..+ .+...++.++.+.+++++|+...++.+++.|.+
T Consensus 32 ~p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~s~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~ 104 (254)
T COG4105 32 LPASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTH 104 (254)
T ss_pred CCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHcCCCCcccHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Confidence 3578899999999999999999999999999988765 688999999999999999999999999999874
No 199
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=95.30 E-value=0.043 Score=31.12 Aligned_cols=34 Identities=21% Similarity=0.056 Sum_probs=27.4
Q ss_pred HHHHHHHHHhcccCCCcHHHHHH--HHHHHHhCCCC
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQS--YRTALKLKPDF 41 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~--~~~Al~l~P~~ 41 (222)
++.+..+|..+..+|++++|+.. |+-+..++|.|
T Consensus 1 ~e~~y~~a~~~y~~~ky~~A~~~~~y~~l~~ld~~n 36 (36)
T PF07720_consen 1 PEYLYGLAYNFYQKGKYDEAIHFFQYAFLCALDKYN 36 (36)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHTTT-
T ss_pred CcHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhcccC
Confidence 46788999999999999999999 66888888764
No 200
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=95.24 E-value=0.071 Score=47.77 Aligned_cols=61 Identities=23% Similarity=0.183 Sum_probs=52.3
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD--FPDAYCNLAHCLQIVCDWTDYEARMKKL 68 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~--~~~a~~~l~~~~~~~~~~~~a~~~~~~~ 68 (222)
..+-..||.+..++|+.+||++.|+..++..|. +-..+.||..++..++.+.++.+.+.+.
T Consensus 259 ~y~KrRLAmCarklGr~~EAIk~~rdLlke~p~~~~l~IrenLie~LLelq~Yad~q~lL~kY 321 (539)
T PF04184_consen 259 VYAKRRLAMCARKLGRLREAIKMFRDLLKEFPNLDNLNIRENLIEALLELQAYADVQALLAKY 321 (539)
T ss_pred hhhHHHHHHHHHHhCChHHHHHHHHHHHhhCCccchhhHHHHHHHHHHhcCCHHHHHHHHHHh
Confidence 445678999999999999999999999998876 4568999999999999999887655543
No 201
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=95.17 E-value=0.15 Score=39.15 Aligned_cols=57 Identities=28% Similarity=0.283 Sum_probs=43.5
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWT 59 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~ 59 (222)
+.|++++.-..-|.++...|+|.+|+..|+...+-.|..+.+...++.|+..++|.+
T Consensus 39 LRP~~~e~~~~~~~l~i~r~~w~dA~rlLr~l~~~~~~~p~~kALlA~CL~~~~D~~ 95 (160)
T PF09613_consen 39 LRPEFPELDLFDGWLHIVRGDWDDALRLLRELEERAPGFPYAKALLALCLYALGDPS 95 (160)
T ss_pred hCCCchHHHHHHHHHHHHhCCHHHHHHHHHHHhccCCCChHHHHHHHHHHHHcCChH
Confidence 457777777777778888888888888888877777777777777777777777743
No 202
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=95.16 E-value=0.016 Score=30.19 Aligned_cols=24 Identities=21% Similarity=0.138 Sum_probs=17.0
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYR 32 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~ 32 (222)
.++.++|.++..+|++++|...++
T Consensus 2 ~a~~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 2 RARLALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHh
Confidence 456677777777777777777665
No 203
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=95.08 E-value=0.048 Score=49.19 Aligned_cols=70 Identities=23% Similarity=0.398 Sum_probs=59.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH----HHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP----DAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~----~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
...|+.+...+..|.++...|+.++|++.|++++....+.. ..++.++.++..+.+|++|..++.++.+.
T Consensus 261 ~~yP~s~lfl~~~gR~~~~~g~~~~Ai~~~~~a~~~q~~~~Ql~~l~~~El~w~~~~~~~w~~A~~~f~~L~~~ 334 (468)
T PF10300_consen 261 KRYPNSALFLFFEGRLERLKGNLEEAIESFERAIESQSEWKQLHHLCYFELAWCHMFQHDWEEAAEYFLRLLKE 334 (468)
T ss_pred HhCCCcHHHHHHHHHHHHHhcCHHHHHHHHHHhccchhhHHhHHHHHHHHHHHHHHHHchHHHHHHHHHHHHhc
Confidence 35699999999999999999999999999999996554433 45788999999999999999998887753
No 204
>KOG1130|consensus
Probab=95.02 E-value=0.017 Score=50.66 Aligned_cols=65 Identities=15% Similarity=0.201 Sum_probs=55.7
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF------PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~------~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.++-+||+.|.-+|+|++|+.+-+.=|++.-.+ -.++.|+|+|+..+|+++.|.+.|+..+.+.-
T Consensus 196 Ra~GnLGNTyYlLGdf~~ai~~H~~RL~ia~efGDrAaeRRA~sNlgN~hiflg~fe~A~ehYK~tl~LAi 266 (639)
T KOG1130|consen 196 RAYGNLGNTYYLLGDFDQAIHFHKLRLEIAQEFGDRAAERRAHSNLGNCHIFLGNFELAIEHYKLTLNLAI 266 (639)
T ss_pred chhcccCceeeeeccHHHHHHHHHHHHHHHHHhhhHHHHHHhhcccchhhhhhcccHhHHHHHHHHHHHHH
Confidence 467799999999999999999998888886443 35899999999999999999999998876544
No 205
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=94.84 E-value=0.25 Score=36.06 Aligned_cols=68 Identities=19% Similarity=0.080 Sum_probs=54.9
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHH-------HhCCCCH----HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTAL-------KLKPDFP----DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al-------~l~P~~~----~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++..|..|+.++..+|+|++++..-+++| +|+-+.. .+.++.+..+..+|..++|...|+.+.+.+.+
T Consensus 54 DA~chA~Ls~A~~~Lgry~e~L~sA~~aL~YFNRRGEL~qdeGklWIaaVfsra~Al~~~Gr~~eA~~~fr~agEMiaE 132 (144)
T PF12968_consen 54 DAFCHAGLSGALAGLGRYDECLQSADRALRYFNRRGELHQDEGKLWIAAVFSRAVALEGLGRKEEALKEFRMAGEMIAE 132 (144)
T ss_dssp HHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHH--TTSTHHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHhhccccccccchhHHHHHHHHHHHHHhcCChHHHHHHHHHHHHHHHH
Confidence 46678899999999999999998888888 4555544 35688899999999999999999999988775
No 206
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=94.73 E-value=0.18 Score=39.86 Aligned_cols=69 Identities=16% Similarity=0.115 Sum_probs=48.9
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD--FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~--~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+++..+..++++....+++.+|.+.+++..+.+|. .++....++.++...|.+++|+..++.++...|.
T Consensus 122 ~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~pd~~Ll~aR~laa~g~~a~Aesafe~a~~~ypg 192 (251)
T COG4700 122 HDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRSPDGHLLFARTLAAQGKYADAESAFEVAISYYPG 192 (251)
T ss_pred CCHHHHHHHHHHHHhhccHHHHHHHHHHHhhcCCccCCCCchHHHHHHHHhcCCchhHHHHHHHHHHhCCC
Confidence 45666777777777777777777777777777765 3556666677777777777777777777766654
No 207
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=94.65 E-value=0.038 Score=28.51 Aligned_cols=31 Identities=16% Similarity=0.223 Sum_probs=27.2
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.++.++|.++...+++++|...++++++..|
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~~~~~~ 32 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALELDP 32 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHHccCC
Confidence 5788999999999999999999998886544
No 208
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=94.64 E-value=0.17 Score=35.12 Aligned_cols=57 Identities=14% Similarity=0.220 Sum_probs=47.2
Q ss_pred ccCCCcHHHHHHHHHHHHhCCC---------CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 19 PSPGNIPEAIQSYRTALKLKPD---------FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 19 ~~~g~~~eA~~~~~~Al~l~P~---------~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
...|++.+|++.+.+.+..... ...+..+++.+....|++++|...+++++++..+.
T Consensus 9 ~~~~dy~~A~d~L~~~fD~~~~~~~~~~~~~~~~all~lA~~~~~~G~~~~A~~~l~eAi~~Are~ 74 (94)
T PF12862_consen 9 LRSGDYSEALDALHRYFDYAKQSNNSSSNSGLAYALLNLAELHRRFGHYEEALQALEEAIRLAREN 74 (94)
T ss_pred HHcCCHHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHH
Confidence 4678999998888888766322 24678899999999999999999999999988863
No 209
>KOG0495|consensus
Probab=94.54 E-value=0.051 Score=50.24 Aligned_cols=62 Identities=18% Similarity=0.063 Sum_probs=31.3
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
..|+.-+.+...+++.++|+..++++|+.-|++...|..+|.++-.+++.+.|.+.|..-.+
T Consensus 652 Rv~mKs~~~er~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k 713 (913)
T KOG0495|consen 652 RVWMKSANLERYLDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQGTK 713 (913)
T ss_pred hhhHHHhHHHHHhhhHHHHHHHHHHHHHhCCchHHHHHHHhHHHHHHHHHHHHHHHHHhccc
Confidence 34444444455555555555555555555555555555555555555555555544444433
No 210
>KOG4340|consensus
Probab=94.48 E-value=0.11 Score=43.94 Aligned_cols=66 Identities=15% Similarity=0.113 Sum_probs=61.0
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
+.+....+.|-++.+.|++++|++-|..|++...-++..-+|++.+....+++..|.....++++.
T Consensus 142 n~Ad~~in~gCllykegqyEaAvqkFqaAlqvsGyqpllAYniALaHy~~~qyasALk~iSEIieR 207 (459)
T KOG4340|consen 142 NEADGQINLGCLLYKEGQYEAAVQKFQAALQVSGYQPLLAYNLALAHYSSRQYASALKHISEIIER 207 (459)
T ss_pred CccchhccchheeeccccHHHHHHHHHHHHhhcCCCchhHHHHHHHHHhhhhHHHHHHHHHHHHHh
Confidence 578889999999999999999999999999999999999999999999999999999888777654
No 211
>KOG2376|consensus
Probab=94.43 E-value=0.27 Score=44.96 Aligned_cols=75 Identities=16% Similarity=0.122 Sum_probs=61.0
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhC------------------------------CC-CHHHHHHHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLK------------------------------PD-FPDAYCNLAHCL 52 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~------------------------------P~-~~~a~~~l~~~~ 52 (222)
++.+.......|.++.++|+|++|.+.|+..++-+ |. ..+..+|.+.++
T Consensus 106 ~~~~~~ll~L~AQvlYrl~~ydealdiY~~L~kn~~dd~d~~~r~nl~a~~a~l~~~~~q~v~~v~e~syel~yN~Ac~~ 185 (652)
T KOG2376|consen 106 DRLDDKLLELRAQVLYRLERYDEALDIYQHLAKNNSDDQDEERRANLLAVAAALQVQLLQSVPEVPEDSYELLYNTACIL 185 (652)
T ss_pred cccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHhhhHHHHHhccCCCcchHHHHHHHHHHH
Confidence 45666678888999999999999999999986443 22 445688999999
Q ss_pred HHcCCHHHHHHHHHHHHHHHHhhhhc
Q psy17695 53 QIVCDWTDYEARMKKLVSIVAEQLDK 78 (222)
Q Consensus 53 ~~~~~~~~a~~~~~~~~~~~~~~l~~ 78 (222)
...|+|.+|++.+++++++..+.|..
T Consensus 186 i~~gky~qA~elL~kA~~~~~e~l~~ 211 (652)
T KOG2376|consen 186 IENGKYNQAIELLEKALRICREKLED 211 (652)
T ss_pred HhcccHHHHHHHHHHHHHHHHHhhcc
Confidence 99999999999999998877665443
No 212
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=94.42 E-value=0.24 Score=38.68 Aligned_cols=67 Identities=18% Similarity=0.287 Sum_probs=55.9
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-.++..+|..|.+.|++++|+++|.++....-.. .+...++..+....++|........++......
T Consensus 36 r~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~~irv~i~~~d~~~v~~~i~ka~~~~~~ 105 (177)
T PF10602_consen 36 RMALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLNVIRVAIFFGDWSHVEKYIEKAESLIEK 105 (177)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHhc
Confidence 3688999999999999999999999988865332 356788888888999999999888888776654
No 213
>KOG2053|consensus
Probab=94.41 E-value=0.092 Score=49.90 Aligned_cols=74 Identities=12% Similarity=-0.009 Sum_probs=62.0
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|++.|+.+-+...-|.++.++|+.+||..+++..-.+.+++...+.-+-+||..+++.+++..+|+++..-.|+
T Consensus 36 lkk~Pn~~~a~vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~tLq~l~~~y~d~~~~d~~~~~Ye~~~~~~P~ 109 (932)
T KOG2053|consen 36 LKKHPNALYAKVLKALSLFRLGKGDEALKLLEALYGLKGTDDLTLQFLQNVYRDLGKLDEAVHLYERANQKYPS 109 (932)
T ss_pred HHHCCCcHHHHHHHHHHHHHhcCchhHHHHHhhhccCCCCchHHHHHHHHHHHHHhhhhHHHHHHHHHHhhCCc
Confidence 35678888888888888888999999998888888888888888888888888888888888888888877665
No 214
>KOG3785|consensus
Probab=94.35 E-value=0.07 Score=46.06 Aligned_cols=59 Identities=17% Similarity=0.202 Sum_probs=52.0
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
..-..+|.++..+|++++|+..|+-+..-+--+.+.+.||+.|+..+|.+.+|.....+
T Consensus 58 ~~~lWia~C~fhLgdY~~Al~~Y~~~~~~~~~~~el~vnLAcc~FyLg~Y~eA~~~~~k 116 (557)
T KOG3785|consen 58 SLQLWIAHCYFHLGDYEEALNVYTFLMNKDDAPAELGVNLACCKFYLGQYIEAKSIAEK 116 (557)
T ss_pred HHHHHHHHHHHhhccHHHHHHHHHHHhccCCCCcccchhHHHHHHHHHHHHHHHHHHhh
Confidence 34566899999999999999999999998877899999999999999999999875544
No 215
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=94.18 E-value=0.49 Score=31.78 Aligned_cols=62 Identities=18% Similarity=0.244 Sum_probs=48.3
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
..+..-|.-+....+.++|+..++++++..++.++-+.-+|......++|....+.+.-++.
T Consensus 7 k~~ie~GlkLY~~~~~~~Al~~W~~aL~k~~~~~~rf~~lG~l~qA~~e~Gkyr~~L~fA~~ 68 (80)
T PF10579_consen 7 KQQIEKGLKLYHQNETQQALQKWRKALEKITDREDRFRVLGYLIQAHMEWGKYREMLAFALQ 68 (80)
T ss_pred HHHHHHHHHHhccchHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34556677788999999999999999999999998888888777777777665555544443
No 216
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=94.10 E-value=0.19 Score=39.69 Aligned_cols=66 Identities=14% Similarity=0.136 Sum_probs=58.1
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKL-KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l-~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.-.+.||+.+.++|++.||...|++++.- --+++..+..+++.+...++..++...++++.+..|.
T Consensus 90 qnr~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa 156 (251)
T COG4700 90 QNRYRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPA 156 (251)
T ss_pred HHHHHHHHHHHHhhhhhhhHHHHHHHhccccCCCHHHHHHHHHHHHhhccHHHHHHHHHHHhhcCCc
Confidence 34678999999999999999999999974 6778999999999999999999999988888876654
No 217
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=94.04 E-value=0.29 Score=37.10 Aligned_cols=58 Identities=26% Similarity=0.164 Sum_probs=48.2
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTD 60 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~ 60 (222)
+.|+.++...--|.++...|+|.||+..|+...+-.|..+.+...++.|+..++|.+.
T Consensus 39 LrP~~~e~d~~dg~l~i~rg~w~eA~rvlr~l~~~~~~~p~~kAL~A~CL~al~Dp~W 96 (153)
T TIGR02561 39 LRPNLKELDMFDGWLLIARGNYDEAARILRELLSSAGAPPYGKALLALCLNAKGDAEW 96 (153)
T ss_pred hCCCccccchhHHHHHHHcCCHHHHHHHHHhhhccCCCchHHHHHHHHHHHhcCChHH
Confidence 4588888888888888888888888888888888888888888888888888888443
No 218
>KOG0545|consensus
Probab=93.77 E-value=0.12 Score=42.40 Aligned_cols=67 Identities=18% Similarity=0.223 Sum_probs=55.6
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHh--------CCCCHH----------HHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKL--------KPDFPD----------AYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--------~P~~~~----------a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
..++..-|+-+.++|++.||..+|+.|+.. .|..++ .+.|++.|+...+++-++++...+++
T Consensus 178 v~~l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~e~yevleh~seiL 257 (329)
T KOG0545|consen 178 VPVLHQEGNRLFKLGRYKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKEEYYEVLEHCSEIL 257 (329)
T ss_pred hHHHHHhhhhhhhhccHHHHHHHHHHHHHHHHHHHhccCCCChHHHHHHHhhhHHHHhHHHHHhhHHHHHHHHHHHHHHH
Confidence 356778899999999999999999999742 466653 68899999999999999998888888
Q ss_pred HHHHh
Q psy17695 70 SIVAE 74 (222)
Q Consensus 70 ~~~~~ 74 (222)
+..|.
T Consensus 258 ~~~~~ 262 (329)
T KOG0545|consen 258 RHHPG 262 (329)
T ss_pred hcCCc
Confidence 76664
No 219
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=93.58 E-value=0.083 Score=30.33 Aligned_cols=29 Identities=21% Similarity=0.169 Sum_probs=17.9
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKL 37 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l 37 (222)
+.+..||.+-...++|++|+.-|+++|++
T Consensus 2 dv~~~Lgeisle~e~f~qA~~D~~~aL~i 30 (38)
T PF10516_consen 2 DVYDLLGEISLENENFEQAIEDYEKALEI 30 (38)
T ss_pred cHHHHHHHHHHHhccHHHHHHHHHHHHHH
Confidence 34556666666666666666666666655
No 220
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=93.58 E-value=0.48 Score=39.78 Aligned_cols=67 Identities=15% Similarity=0.126 Sum_probs=61.9
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..++..+...+...|+++.++..+++.+..+|-+..+|..+...+...|+...|+..|+++.....+
T Consensus 153 ~~~l~~lae~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~~~~e 219 (280)
T COG3629 153 IKALTKLAEALIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKTLAE 219 (280)
T ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHHHhcCccchHHHHHHHHHHHHcCCchHHHHHHHHHHHHhhh
Confidence 4678889999999999999999999999999999999999999999999999999999999876443
No 221
>KOG1130|consensus
Probab=93.52 E-value=0.11 Score=45.69 Aligned_cols=65 Identities=14% Similarity=0.126 Sum_probs=51.9
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHh----CCCC--HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKL----KPDF--PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l----~P~~--~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.++.|||+++.-+|+++.|+++|++++.+ .-.. ++.-+.||+.|-...+++.|+.+.++-+.+..
T Consensus 236 RA~sNlgN~hiflg~fe~A~ehYK~tl~LAielg~r~vEAQscYSLgNtytll~e~~kAI~Yh~rHLaIAq 306 (639)
T KOG1130|consen 236 RAHSNLGNCHIFLGNFELAIEHYKLTLNLAIELGNRTVEAQSCYSLGNTYTLLKEVQKAITYHQRHLAIAQ 306 (639)
T ss_pred HhhcccchhhhhhcccHhHHHHHHHHHHHHHHhcchhHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 57899999999999999999999987755 3333 34567888998888889999888877666544
No 222
>KOG0376|consensus
Probab=93.38 E-value=0.069 Score=47.47 Aligned_cols=65 Identities=14% Similarity=0.082 Sum_probs=59.0
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.-+-+.-....+.++.|+..|-+|++++|+.+..+.|.+..+.+.+++.+|+....++++..|..
T Consensus 7 ~k~ean~~l~~~~fd~avdlysKaI~ldpnca~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~ 71 (476)
T KOG0376|consen 7 LKNEANEALKDKVFDVAVDLYSKAIELDPNCAIYFANRALAHLKVESFGGALHDALKAIELDPTY 71 (476)
T ss_pred hhhHHhhhcccchHHHHHHHHHHHHhcCCcceeeechhhhhheeechhhhHHHHHHhhhhcCchh
Confidence 34556778888999999999999999999999999999999999999999999999999988864
No 223
>KOG2471|consensus
Probab=93.17 E-value=0.13 Score=46.19 Aligned_cols=67 Identities=13% Similarity=-0.070 Sum_probs=57.2
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHH---------hCC---------CCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALK---------LKP---------DFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~---------l~P---------~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
-..|+|||.++.++|.+..++.+|.+|++ +.| ...+.++|.|..+...|+.-.|-++|.+++
T Consensus 283 cif~NNlGcIh~~~~~y~~~~~~F~kAL~N~c~qL~~g~~~~~~~tls~nks~eilYNcG~~~Lh~grPl~AfqCf~~av 362 (696)
T KOG2471|consen 283 CIFNNNLGCIHYQLGCYQASSVLFLKALRNSCSQLRNGLKPAKTFTLSQNKSMEILYNCGLLYLHSGRPLLAFQCFQKAV 362 (696)
T ss_pred heeecCcceEeeehhhHHHHHHHHHHHHHHHHHHHhccCCCCcceehhcccchhhHHhhhHHHHhcCCcHHHHHHHHHHH
Confidence 35689999999999999999999999996 122 245678999999999999999999999998
Q ss_pred HHHHh
Q psy17695 70 SIVAE 74 (222)
Q Consensus 70 ~~~~~ 74 (222)
.....
T Consensus 363 ~vfh~ 367 (696)
T KOG2471|consen 363 HVFHR 367 (696)
T ss_pred HHHhc
Confidence 86554
No 224
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=92.97 E-value=0.27 Score=40.83 Aligned_cols=66 Identities=9% Similarity=-0.015 Sum_probs=59.8
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
....++=.++...++++.|..+-++.+.++|+++.-+-..|.+|.+++....|.+.+...++.+|+
T Consensus 182 rll~~lk~~~~~e~~~~~al~~~~r~l~l~P~dp~eirDrGliY~ql~c~~vAl~dl~~~~~~~P~ 247 (269)
T COG2912 182 RLLRNLKAALLRELQWELALRVAERLLDLNPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHCPD 247 (269)
T ss_pred HHHHHHHHHHHHhhchHHHHHHHHHHHhhCCCChhhccCcHHHHHhcCCchhhHHHHHHHHHhCCC
Confidence 345677788889999999999999999999999999999999999999999999999888877775
No 225
>KOG0551|consensus
Probab=92.85 E-value=0.18 Score=43.15 Aligned_cols=64 Identities=23% Similarity=0.202 Sum_probs=41.8
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
++..++|.+.+..-+|+|..|+.-..+|++++|.+..+++.=+.|+..+..+.+|...+++.+.
T Consensus 118 navLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka~~R~Akc~~eLe~~~~a~nw~ee~~~ 181 (390)
T KOG0551|consen 118 NAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKAYIRGAKCLLELERFAEAVNWCEEGLQ 181 (390)
T ss_pred HHHHHhhHHHHHHHHHHHHHHHHHHHHHHhcCcchhhhhhhhhHHHHHHHHHHHHHHHHhhhhh
Confidence 3455666667777777777777777777777777777776666666666666555555554443
No 226
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=92.78 E-value=0.088 Score=42.78 Aligned_cols=44 Identities=18% Similarity=0.287 Sum_probs=39.8
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDA 44 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a 44 (222)
|.+.|+....|+.+|....+.|+++.|.+.|++.++++|.+...
T Consensus 22 l~lap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~g 65 (287)
T COG4976 22 LELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHGG 65 (287)
T ss_pred hhcCchhhhhhhhcchhhhhcccHHHHHHHHHHHHcCCcccccc
Confidence 34678999999999999999999999999999999999987654
No 227
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=92.60 E-value=0.45 Score=40.05 Aligned_cols=57 Identities=23% Similarity=0.242 Sum_probs=50.3
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
-+.-+.-+...|++.+|...|..++...|++.++...|+.|+...|+.++|.+.+..
T Consensus 137 ~~~~~~~~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~ 193 (304)
T COG3118 137 ALAEAKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAA 193 (304)
T ss_pred HHHHhhhhhhccchhhHHHHHHHHHHhCcccchHHHHHHHHHHHcCChHHHHHHHHh
Confidence 445566778999999999999999999999999999999999999999988876543
No 228
>KOG2610|consensus
Probab=92.59 E-value=0.36 Score=41.53 Aligned_cols=60 Identities=17% Similarity=-0.041 Sum_probs=50.4
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
.-.+--++..+.+.|-+++|++.-++|+++||.+.-+...+++++...|+..++.+...+
T Consensus 175 sYv~GmyaFgL~E~g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVlem~~r~Keg~eFM~~ 234 (491)
T KOG2610|consen 175 SYVHGMYAFGLEECGIYDDAEKQADRALQINRFDCWASHAKAHVLEMNGRHKEGKEFMYK 234 (491)
T ss_pred HHHHHHHHhhHHHhccchhHHHHHHhhccCCCcchHHHHHHHHHHHhcchhhhHHHHHHh
Confidence 444555677788999999999999999999999999999999999999988887765443
No 229
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=92.49 E-value=0.43 Score=39.98 Aligned_cols=66 Identities=18% Similarity=0.126 Sum_probs=53.2
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhC--CC----CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLK--PD----FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~--P~----~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++.+...|+.|...|++++|..+|.+|..+. -+ -..++.+.+.++... ++++|+.++++++.+..+
T Consensus 35 a~~y~~Aa~~fk~~~~~~~A~~ay~kAa~~~~~~~~~~~Aa~~~~~Aa~~~k~~-~~~~Ai~~~~~A~~~y~~ 106 (282)
T PF14938_consen 35 ADLYEKAANCFKLAKDWEKAAEAYEKAADCYEKLGDKFEAAKAYEEAANCYKKG-DPDEAIECYEKAIEIYRE 106 (282)
T ss_dssp HHHHHHHHHHHHHTT-CHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHT-THHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh-CHHHHHHHHHHHHHHHHh
Confidence 4677888999999999999999999998774 12 234677788888777 999999999999987654
No 230
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=92.32 E-value=0.21 Score=42.24 Aligned_cols=52 Identities=12% Similarity=0.124 Sum_probs=39.6
Q ss_pred CCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 22 g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
+++.+|.-.|+.....-|.++..++.++.+.+.+|+|++|++.+++++...+
T Consensus 181 e~~~~A~y~f~El~~~~~~t~~~lng~A~~~l~~~~~~eAe~~L~~al~~~~ 232 (290)
T PF04733_consen 181 EKYQDAFYIFEELSDKFGSTPKLLNGLAVCHLQLGHYEEAEELLEEALEKDP 232 (290)
T ss_dssp TCCCHHHHHHHHHHCCS--SHHHHHHHHHHHHHCT-HHHHHHHHHHHCCC-C
T ss_pred hhHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhcc
Confidence 3688888888887776678888889999999999999999998888764443
No 231
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=92.23 E-value=0.86 Score=34.20 Aligned_cols=68 Identities=21% Similarity=0.258 Sum_probs=46.9
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC-CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD-FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~-~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.......++..+...+++++|+..+.++++..|. ....+.+++.++...++++++...+..+....+.
T Consensus 166 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 234 (291)
T COG0457 166 LAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPD 234 (291)
T ss_pred hHHHHHHhhhHHHHhcCHHHHHHHHHHHHhhCcccchHHHHHhhHHHHHcccHHHHHHHHHHHHhhCcc
Confidence 4455555666666777777777777777777777 5777777777777777777777777766666553
No 232
>PRK04841 transcriptional regulator MalT; Provisional
Probab=92.22 E-value=0.64 Score=45.09 Aligned_cols=66 Identities=9% Similarity=0.026 Sum_probs=54.0
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF------PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~------~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.++..+|.++...|++++|...+++++++.... ..++.+++.++...|++++|.+.+++++.+...
T Consensus 492 ~a~~~lg~~~~~~G~~~~A~~~~~~al~~~~~~g~~~~~~~~~~~la~~~~~~G~~~~A~~~~~~al~~~~~ 563 (903)
T PRK04841 492 VATSVLGEVHHCKGELARALAMMQQTEQMARQHDVYHYALWSLLQQSEILFAQGFLQAAYETQEKAFQLIEE 563 (903)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH
Confidence 466789999999999999999999999764321 246678899999999999999999888876554
No 233
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=92.17 E-value=0.75 Score=34.53 Aligned_cols=67 Identities=24% Similarity=0.209 Sum_probs=50.3
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHH-HHHHcCCHHHHHHHHHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAH-CLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~-~~~~~~~~~~a~~~~~~~~~ 70 (222)
.+.....+..+|..+...+++.+++..+..++..++........... ++...++++.+...+.+++.
T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 158 (291)
T COG0457 91 LPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKALE 158 (291)
T ss_pred ccchHHHHHHHHHHHHHHhhHHHHHHHHHHHHcCCCCcchHHHHHHHHHHHHcCCHHHHHHHHHHHHh
Confidence 45666777788888888888888888888888877777555555655 77788888888887777754
No 234
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=92.14 E-value=0.95 Score=36.48 Aligned_cols=62 Identities=8% Similarity=0.036 Sum_probs=48.7
Q ss_pred HHHHHHHHHhcccCCCc-------HHHHHHHHHHHHhCCC--C----HHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNI-------PEAIQSYRTALKLKPD--F----PDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~-------~eA~~~~~~Al~l~P~--~----~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
+..+..++.+|+.+|+- ..|.+.|++|++.... . ....+.+|.+....|++++|...+.+++
T Consensus 118 A~l~LrlAWlyR~~~~~~~E~~fl~~Al~~y~~a~~~e~~~~~~~~~~~l~YLigeL~rrlg~~~eA~~~fs~vi 192 (214)
T PF09986_consen 118 AGLCLRLAWLYRDLGDEENEKRFLRKALEFYEEAYENEDFPIEGMDEATLLYLIGELNRRLGNYDEAKRWFSRVI 192 (214)
T ss_pred HHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHhCcCCCCCchHHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 67889999999999994 4566666666655432 2 4567889999999999999999998876
No 235
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=92.10 E-value=0.78 Score=39.30 Aligned_cols=73 Identities=14% Similarity=-0.032 Sum_probs=54.8
Q ss_pred CCCCCCHHHHHHHHHhcccCCC------------cHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGN------------IPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~------------~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
..+|.+.++|..+.......-. .+.-+..|++||+.+|++...+..+..+..+..+-++....+++++
T Consensus 13 ~~~P~di~~Wl~li~~Qd~~~~~~~~~~~~~~a~~E~klsilerAL~~np~~~~L~l~~l~~~~~~~~~~~l~~~we~~l 92 (321)
T PF08424_consen 13 RENPHDIEAWLELIEFQDELFRLQSSSKAERRALAERKLSILERALKHNPDSERLLLGYLEEGEKVWDSEKLAKKWEELL 92 (321)
T ss_pred HhCcccHHHHHHHHHHHHHhccccccchhhHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 4688999999888766655533 4566778999999999999988888888877777666666666666
Q ss_pred HHHHh
Q psy17695 70 SIVAE 74 (222)
Q Consensus 70 ~~~~~ 74 (222)
...|+
T Consensus 93 ~~~~~ 97 (321)
T PF08424_consen 93 FKNPG 97 (321)
T ss_pred HHCCC
Confidence 65543
No 236
>KOG1308|consensus
Probab=91.97 E-value=0.024 Score=48.40 Aligned_cols=60 Identities=17% Similarity=0.186 Sum_probs=54.8
Q ss_pred HhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 16 SSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 16 ~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.-....|++++|++.|.+|++++|..+..+...+.++.++..|..++..+..++.+.++.
T Consensus 122 ~eAln~G~~~~ai~~~t~ai~lnp~~a~l~~kr~sv~lkl~kp~~airD~d~A~ein~Ds 181 (377)
T KOG1308|consen 122 SEALNDGEFDTAIELFTSAIELNPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEINPDS 181 (377)
T ss_pred HHHhcCcchhhhhcccccccccCCchhhhcccccceeeeccCCchhhhhhhhhhccCccc
Confidence 334567899999999999999999999999999999999999999999999999888863
No 237
>KOG4340|consensus
Probab=91.85 E-value=0.38 Score=40.80 Aligned_cols=62 Identities=16% Similarity=0.228 Sum_probs=51.9
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEAR 64 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~ 64 (222)
.+|.+-.++..||.+|....++.+|.+||++.-.+.|.........+..+...|.+.+|...
T Consensus 39 r~p~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~qYrlY~AQSLY~A~i~ADALrV 100 (459)
T KOG4340|consen 39 RSPRSRAGLSLLGYCYYRLQEFALAAECYEQLGQLHPELEQYRLYQAQSLYKACIYADALRV 100 (459)
T ss_pred cCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhChHHHHHHHHHHHHHHHhcccHHHHHH
Confidence 46777888888999999999999999999999999999888888888888888877776643
No 238
>PRK04841 transcriptional regulator MalT; Provisional
Probab=91.62 E-value=0.63 Score=45.13 Aligned_cols=64 Identities=8% Similarity=-0.086 Sum_probs=55.5
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHHHHhC------CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTALKLK------PDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~Al~l~------P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.++|.++..+|++++|+..|+++++.. +....++..+|.++...|+.++|.+.+++++.+...
T Consensus 694 ~~~~a~~~~~~g~~~~A~~~l~~al~~~~~~g~~~~~a~~~~~la~a~~~~G~~~~A~~~L~~Al~la~~ 763 (903)
T PRK04841 694 WRNIARAQILLGQFDEAEIILEELNENARSLRLMSDLNRNLILLNQLYWQQGRKSEAQRVLLEALKLANR 763 (903)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHHHHHhCchHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhCc
Confidence 5689999999999999999999999873 233467889999999999999999999999987754
No 239
>PLN03218 maturation of RBCL 1; Provisional
Probab=91.61 E-value=0.93 Score=45.12 Aligned_cols=62 Identities=11% Similarity=0.170 Sum_probs=35.9
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhC-CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLK-PDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~-P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
...++.+-.+|.+.|++++|.+.|++..+.+ +.+...|..+...+.+.|++++|...|.+..
T Consensus 579 ~vTynaLI~ay~k~G~ldeA~elf~~M~e~gi~p~~~tynsLI~ay~k~G~~deAl~lf~eM~ 641 (1060)
T PLN03218 579 HITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDMK 641 (1060)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCChHHHHHHHHHHHhcCCHHHHHHHHHHHH
Confidence 3455555556666666666666666666554 3345555666666666666666665555544
No 240
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=91.60 E-value=0.26 Score=25.44 Aligned_cols=24 Identities=25% Similarity=0.149 Sum_probs=21.2
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHH
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMK 66 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~ 66 (222)
.++.+++.++...|++++|+..++
T Consensus 2 ~a~~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 2 RARLALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHh
Confidence 578899999999999999998765
No 241
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=91.45 E-value=0.89 Score=38.11 Aligned_cols=67 Identities=15% Similarity=0.040 Sum_probs=46.7
Q ss_pred CCHHHHHHHHHhccc-CCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPS-PGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~-~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
-....|...|.+-.. .++.+.|...|+++++.-|++...|......+...++.+.+...|++++..+
T Consensus 33 ~~~~vy~~~A~~E~~~~~d~~~A~~Ife~glk~f~~~~~~~~~Y~~~l~~~~d~~~aR~lfer~i~~l 100 (280)
T PF05843_consen 33 CTYHVYVAYALMEYYCNKDPKRARKIFERGLKKFPSDPDFWLEYLDFLIKLNDINNARALFERAISSL 100 (280)
T ss_dssp S-THHHHHHHHHHHHTCS-HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHCCTS
T ss_pred CCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHhCcHHHHHHHHHHHHHhc
Confidence 345667777777444 4555558888888888888888888888888888888888777777766443
No 242
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=91.40 E-value=0.65 Score=24.43 Aligned_cols=30 Identities=27% Similarity=0.368 Sum_probs=26.6
Q ss_pred CCcHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy17695 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHC 51 (222)
Q Consensus 22 g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~ 51 (222)
|+.+.|...|+++++..|.++..|...+..
T Consensus 1 ~~~~~~r~i~e~~l~~~~~~~~~W~~y~~~ 30 (33)
T smart00386 1 GDIERARKIYERALEKFPKSVELWLKYAEF 30 (33)
T ss_pred CcHHHHHHHHHHHHHHCCCChHHHHHHHHH
Confidence 578899999999999999999999887654
No 243
>PLN03218 maturation of RBCL 1; Provisional
Probab=91.26 E-value=1.2 Score=44.30 Aligned_cols=62 Identities=11% Similarity=0.051 Sum_probs=41.8
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKL--KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
+...++.|..+|.+.|++++|+..|++..+. .|+ ...|..+...+.+.|++++|.+.+++..
T Consensus 683 d~~tynsLI~ay~k~G~~eeA~~lf~eM~~~g~~Pd-vvtyN~LI~gy~k~G~~eeAlelf~eM~ 746 (1060)
T PLN03218 683 GTVSYSSLMGACSNAKNWKKALELYEDIKSIKLRPT-VSTMNALITALCEGNQLPKALEVLSEMK 746 (1060)
T ss_pred CHHHHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 4566677777777777777777777766543 443 5567777777777777777777666654
No 244
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=91.21 E-value=1.9 Score=33.11 Aligned_cols=61 Identities=20% Similarity=0.197 Sum_probs=53.7
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
.++..+..+=...++.+++...+...-.+.|+.++....-|.++...|+|.+|...++.+.
T Consensus 11 ~gLie~~~~al~~~~~~D~e~lL~ALrvLRP~~~e~~~~~~~l~i~r~~w~dA~rlLr~l~ 71 (160)
T PF09613_consen 11 GGLIEVLSVALRLGDPDDAEALLDALRVLRPEFPELDLFDGWLHIVRGDWDDALRLLRELE 71 (160)
T ss_pred HHHHHHHHHHHccCChHHHHHHHHHHHHhCCCchHHHHHHHHHHHHhCCHHHHHHHHHHHh
Confidence 4555666666788899999999999999999999999999999999999999998887764
No 245
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=91.04 E-value=1.7 Score=37.97 Aligned_cols=64 Identities=16% Similarity=0.156 Sum_probs=55.5
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
........+.++...|+++.|..-..++++..|.++.++-....+|...|+|.+......++.+
T Consensus 152 ~l~v~ltrarlll~~~d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~~~g~~~~ll~~l~~L~k 215 (400)
T COG3071 152 TLAVELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRK 215 (400)
T ss_pred hHHHHHHHHHHHHhCCCchhHHHHHHHHHHhCcCChHHHHHHHHHHHHhccHHHHHHHHHHHHH
Confidence 3455667788999999999999999999999999999999999999999999988776655543
No 246
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=91.03 E-value=1.3 Score=42.02 Aligned_cols=65 Identities=11% Similarity=0.004 Sum_probs=46.9
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL--KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
+.+...|+.+...|.+.|++++|+..|++..+. .|+ ..++..+..++...|+++++.+.+.++++
T Consensus 287 ~~~~vt~n~li~~y~~~g~~~eA~~lf~~M~~~g~~pd-~~t~~~ll~a~~~~g~~~~a~~i~~~m~~ 353 (697)
T PLN03081 287 EKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSID-QFTFSIMIRIFSRLALLEHAKQAHAGLIR 353 (697)
T ss_pred CCChhHHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCCC-HHHHHHHHHHHHhccchHHHHHHHHHHHH
Confidence 346778888888888888888888888887664 454 45677777777777777777766655544
No 247
>KOG0495|consensus
Probab=90.81 E-value=1.2 Score=41.53 Aligned_cols=73 Identities=15% Similarity=0.008 Sum_probs=54.5
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
|+.-|++...|+.+|.++.++++.+.|..+|...++.-|+....|..|+.+--..|+.-.|...+.++.--.|
T Consensus 678 lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~cP~~ipLWllLakleEk~~~~~rAR~ildrarlkNP 750 (913)
T KOG0495|consen 678 LKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQGTKKCPNSIPLWLLLAKLEEKDGQLVRARSILDRARLKNP 750 (913)
T ss_pred HHhCCchHHHHHHHhHHHHHHHHHHHHHHHHHhccccCCCCchHHHHHHHHHHHhcchhhHHHHHHHHHhcCC
Confidence 3456888888888888888888888888888888888888888888887776555555555555555444333
No 248
>KOG0551|consensus
Probab=90.51 E-value=0.6 Score=39.99 Aligned_cols=70 Identities=16% Similarity=0.132 Sum_probs=59.8
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH----HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP----DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQL 76 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~----~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l 76 (222)
.++-+---|+-|++-.+|..|+.+|.++|+-+-.++ -.++|.+.|...+|++..++....+++.+.|.++
T Consensus 80 ~Aen~KeeGN~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~ 153 (390)
T KOG0551|consen 80 QAENYKEEGNEYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHL 153 (390)
T ss_pred HHHHHHHHhHHHHHhhhHHHHHHHHHHHHhhcCCCccHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhcCcchh
Confidence 456666779999999999999999999999965543 4689999999999999999999999998887653
No 249
>KOG2376|consensus
Probab=89.21 E-value=1.2 Score=40.88 Aligned_cols=68 Identities=13% Similarity=0.053 Sum_probs=55.6
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhC-------CCC--------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLK-------PDF--------PDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~-------P~~--------~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
.-+..+|.+.++...|+|.+|++.+++|+++- -.+ ......|+++++.+|+-++|.+.|...++.
T Consensus 174 syel~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~~i~~ 253 (652)
T KOG2376|consen 174 SYELLYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVDIIKR 253 (652)
T ss_pred hHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHhhcccccchhhHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHh
Confidence 56789999999999999999999999994431 111 125788999999999999999999888876
Q ss_pred HHh
Q psy17695 72 VAE 74 (222)
Q Consensus 72 ~~~ 74 (222)
.+.
T Consensus 254 ~~~ 256 (652)
T KOG2376|consen 254 NPA 256 (652)
T ss_pred cCC
Confidence 653
No 250
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=88.71 E-value=3.1 Score=35.15 Aligned_cols=69 Identities=19% Similarity=0.148 Sum_probs=54.6
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHH----------------------------------HHHhCCCCHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRT----------------------------------ALKLKPDFPDAYCNL 48 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~----------------------------------Al~l~P~~~~a~~~l 48 (222)
..|++.++...|+.+|...|+.++|...+.. .+..+|++.++.+.+
T Consensus 163 ~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~~~~~~~~l~a~i~ll~qaa~~~~~~~l~~~~aadPdd~~aa~~l 242 (304)
T COG3118 163 AAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHGLQAQIELLEQAAATPEIQDLQRRLAADPDDVEAALAL 242 (304)
T ss_pred hCcccchHHHHHHHHHHHcCChHHHHHHHHhCcccchhhHHHHHHHHHHHHHHHhcCCCHHHHHHHHHhCCCCHHHHHHH
Confidence 4678888999999999999998777665543 234579999999999
Q ss_pred HHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 49 AHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 49 ~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
+..+...|+.++|.+.+-.+++.
T Consensus 243 A~~~~~~g~~e~Ale~Ll~~l~~ 265 (304)
T COG3118 243 ADQLHLVGRNEAALEHLLALLRR 265 (304)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHh
Confidence 99999999999888766554443
No 251
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=88.47 E-value=0.89 Score=36.87 Aligned_cols=59 Identities=20% Similarity=0.267 Sum_probs=51.5
Q ss_pred HhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 16 SSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 16 ~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.-+.+.+.+.+|+...+.-++-+|.+......|-..+.-.|+|+.|...++-+-++.|+
T Consensus 9 seLL~~~sL~dai~~a~~qVkakPtda~~RhflfqLlcvaGdw~kAl~Ql~l~a~l~p~ 67 (273)
T COG4455 9 SELLDDNSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKALAQLNLAATLSPQ 67 (273)
T ss_pred HHHHHhccHHHHHHHHHHHHhcCCccccchhHHHHHHhhcchHHHHHHHHHHHhhcCcc
Confidence 35667889999999999999999999999999999999999999999877766666554
No 252
>KOG1941|consensus
Probab=88.25 E-value=1.4 Score=38.49 Aligned_cols=65 Identities=12% Similarity=0.130 Sum_probs=53.8
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhC------CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLK------PDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~------P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
....+.++++|+.+|+...|.++.+.|.++. |-++....-++.+|+..|+.+.|-..|+.+...-
T Consensus 206 ~~~lyhmaValR~~G~LgdA~e~C~Ea~klal~~Gdra~~arc~~~~aDIyR~~gd~e~af~rYe~Am~~m 276 (518)
T KOG1941|consen 206 AMSLYHMAVALRLLGRLGDAMECCEEAMKLALQHGDRALQARCLLCFADIYRSRGDLERAFRRYEQAMGTM 276 (518)
T ss_pred HHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHhCChHHHHHHHHHHHHHHHhcccHhHHHHHHHHHHHHH
Confidence 3567899999999999999999999998884 4455667788999999999888877777776543
No 253
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=88.19 E-value=2.8 Score=35.53 Aligned_cols=64 Identities=19% Similarity=0.157 Sum_probs=55.0
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..-.+..+...|.+.+|++..+++++++|-+...+..+..++..+||--.+...|++..+.+..
T Consensus 282 lgkva~~yle~g~~neAi~l~qr~ltldpL~e~~nk~lm~~la~~gD~is~~khyerya~vlea 345 (361)
T COG3947 282 LGKVARAYLEAGKPNEAIQLHQRALTLDPLSEQDNKGLMASLATLGDEISAIKHYERYAEVLEA 345 (361)
T ss_pred HHHHHHHHHHcCChHHHHHHHHHHhhcChhhhHHHHHHHHHHHHhccchhhhhHHHHHHHHHHH
Confidence 3445667889999999999999999999999999999999999999988888878777665544
No 254
>KOG2396|consensus
Probab=88.04 E-value=1.9 Score=38.97 Aligned_cols=71 Identities=11% Similarity=-0.077 Sum_probs=58.3
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCC-HHHHHHHHHHHHHHHHh
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCD-WTDYEARMKKLVSIVAE 74 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~-~~~a~~~~~~~~~~~~~ 74 (222)
-+.+...|.+......+.+.+.+--..|.++++.+|++++.|..-+......+. .+.+.+.+.+.++..|+
T Consensus 101 f~~D~~lW~~yi~f~kk~~~~~~v~ki~~~~l~~Hp~~~dLWI~aA~wefe~n~ni~saRalflrgLR~npd 172 (568)
T KOG2396|consen 101 FNGDVKLWLSYIAFCKKKKTYGEVKKIFAAMLAKHPNNPDLWIYAAKWEFEINLNIESARALFLRGLRFNPD 172 (568)
T ss_pred cCCCHHHHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCchhHHhhhhhHHhhccchHHHHHHHHHHhhcCCC
Confidence 356788888887777777779999999999999999999999988888887776 77788888888877765
No 255
>PLN03077 Protein ECB2; Provisional
Probab=87.85 E-value=2.1 Score=41.56 Aligned_cols=65 Identities=11% Similarity=0.011 Sum_probs=51.6
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL--KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
+.+...|+.+...|.+.|+.++|+..|++..+. .|+. .++..+...+...|.++++.+.|+...+
T Consensus 551 ~~d~~s~n~lI~~~~~~G~~~~A~~lf~~M~~~g~~Pd~-~T~~~ll~a~~~~g~v~ea~~~f~~M~~ 617 (857)
T PLN03077 551 EKDVVSWNILLTGYVAHGKGSMAVELFNRMVESGVNPDE-VTFISLLCACSRSGMVTQGLEYFHSMEE 617 (857)
T ss_pred CCChhhHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCCc-ccHHHHHHHHhhcChHHHHHHHHHHHHH
Confidence 456778999999999999999999999988774 5664 4456666778888899998888887763
No 256
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=87.74 E-value=1.3 Score=38.68 Aligned_cols=73 Identities=21% Similarity=0.229 Sum_probs=58.6
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh-----CCCC---------------------H---HHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL-----KPDF---------------------P---DAYCNLAHCL 52 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l-----~P~~---------------------~---~a~~~l~~~~ 52 (222)
..+|-+.+++..++.++..+|+.+.|.+..+|||=. .|.. - .+++.....+
T Consensus 34 ~~~PyHidtLlqls~v~~~~gd~~~A~~lleRALf~~e~~~~~~F~~~~~~~~~g~~rL~~~~~eNR~fflal~r~i~~L 113 (360)
T PF04910_consen 34 QKNPYHIDTLLQLSEVYRQQGDHAQANDLLERALFAFERAFHPSFSPFRSNLTSGNCRLDYRRPENRQFFLALFRYIQSL 113 (360)
T ss_pred HHCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccCccccCCccccchHHHHHHHHHHHHH
Confidence 467899999999999999999999999999999721 1222 1 2455666677
Q ss_pred HHcCCHHHHHHHHHHHHHHHHh
Q psy17695 53 QIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 53 ~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+.|.|.-|.+..+-++.+.|.
T Consensus 114 ~~RG~~rTAlE~~KlLlsLdp~ 135 (360)
T PF04910_consen 114 GRRGCWRTALEWCKLLLSLDPD 135 (360)
T ss_pred HhcCcHHHHHHHHHHHHhcCCC
Confidence 8889999999999888888875
No 257
>KOG1914|consensus
Probab=87.39 E-value=3.1 Score=38.01 Aligned_cols=69 Identities=12% Similarity=0.098 Sum_probs=61.5
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
+..+|.+..+|+.|-.-+..+ -+++....|++.+..-|..+.+|...........+++..+..|.+++.
T Consensus 13 ie~nP~di~sw~~lire~qt~-~~~~~R~~YEq~~~~FP~s~r~W~~yi~~El~skdfe~VEkLF~RCLv 81 (656)
T KOG1914|consen 13 IEENPYDIDSWSQLIREAQTQ-PIDKVRETYEQLVNVFPSSPRAWKLYIERELASKDFESVEKLFSRCLV 81 (656)
T ss_pred HhcCCccHHHHHHHHHHHccC-CHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence 356899999999997777766 999999999999999999999999999999999999998888877764
No 258
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=87.28 E-value=2.3 Score=35.58 Aligned_cols=72 Identities=14% Similarity=-0.002 Sum_probs=60.0
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH---HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP---DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~---~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.-|.+...|......+...|+.+.|...|++++..-|... ..|-.+...-...|+++.....++++.+..++
T Consensus 65 ~f~~~~~~~~~Y~~~l~~~~d~~~aR~lfer~i~~l~~~~~~~~iw~~~i~fE~~~Gdl~~v~~v~~R~~~~~~~ 139 (280)
T PF05843_consen 65 KFPSDPDFWLEYLDFLIKLNDINNARALFERAISSLPKEKQSKKIWKKFIEFESKYGDLESVRKVEKRAEELFPE 139 (280)
T ss_dssp HHTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHCCTSSCHHHCHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHTTT
T ss_pred HCCCCHHHHHHHHHHHHHhCcHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhhh
Confidence 3478899999999999999999999999999999877765 57777788888899999999988888888775
No 259
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=87.20 E-value=1.5 Score=36.04 Aligned_cols=51 Identities=20% Similarity=0.127 Sum_probs=28.1
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQI 54 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~ 54 (222)
.|++...|+.||.+....|+.-+|+=+|-|++...--++.+..||...+.+
T Consensus 12 ~P~~G~p~nQLAvl~~~~~~~l~avy~y~Rsl~~~~Pf~~A~~NL~~lf~~ 62 (278)
T PF10373_consen 12 LPSNGNPYNQLAVLASYQGDDLDAVYYYIRSLAVRIPFPSARENLQKLFEK 62 (278)
T ss_dssp -TTBSHHHHHHHHHHHHTT-HHHHHHHHHHHHSSSB--HHHHHHHHHHHHH
T ss_pred CCCCCCcccchhhhhccccchHHHHHHHHHHHhcCCCcHHHHHHHHHHHHH
Confidence 455566666666666666666666666666665544445566666555544
No 260
>KOG3364|consensus
Probab=86.77 E-value=1.4 Score=32.93 Aligned_cols=66 Identities=12% Similarity=0.091 Sum_probs=45.1
Q ss_pred HHHHHHHHhcccC---CCcHHHHHHHHHHHH-hCCCC-HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSP---GNIPEAIQSYRTALK-LKPDF-PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~---g~~~eA~~~~~~Al~-l~P~~-~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+..++++.++... .+..+.+.+++..++ -.|.. -+..+.|+..+.++++|+.+..+.+.+++..|+
T Consensus 33 ~s~f~lAwaLV~S~~~~dv~~GI~iLe~l~~~~~~~~rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~ 103 (149)
T KOG3364|consen 33 QSQFNLAWALVRSRDTEDVQEGIVILEDLLKSAHPERRRECLYYLAVGHYRLKEYSKSLRYVDALLETEPN 103 (149)
T ss_pred HHHHHHHHHHHcccchHHHHHhHHHHHHHhhhcCcccchhhhhhhHHHHHHHhhHHHHHHHHHHHHhhCCC
Confidence 4556667666643 346677788888886 44443 356677777777888888888887777776654
No 261
>PRK10941 hypothetical protein; Provisional
Probab=86.32 E-value=2 Score=35.93 Aligned_cols=48 Identities=13% Similarity=-0.091 Sum_probs=43.8
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLA 49 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~ 49 (222)
...|+++.-+-..|.+|.++|.+..|..-++..++..|+++.+-.-..
T Consensus 209 ~l~P~dp~e~RDRGll~~qL~c~~~A~~DL~~fl~~~P~dp~a~~ik~ 256 (269)
T PRK10941 209 QFDPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRA 256 (269)
T ss_pred HhCCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHHHhCCCchhHHHHHH
Confidence 568999999999999999999999999999999999999998865443
No 262
>KOG2053|consensus
Probab=86.23 E-value=2.7 Score=40.43 Aligned_cols=58 Identities=14% Similarity=0.185 Sum_probs=50.2
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYE 62 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~ 62 (222)
.+++-..+..+-.+|.++|++++|..+|++++..+|+ .+.++.+-.++.+.+.+.+-.
T Consensus 73 ~~~D~~tLq~l~~~y~d~~~~d~~~~~Ye~~~~~~P~-eell~~lFmayvR~~~yk~qQ 130 (932)
T KOG2053|consen 73 KGTDDLTLQFLQNVYRDLGKLDEAVHLYERANQKYPS-EELLYHLFMAYVREKSYKKQQ 130 (932)
T ss_pred CCCchHHHHHHHHHHHHHhhhhHHHHHHHHHHhhCCc-HHHHHHHHHHHHHHHHHHHHH
Confidence 3567788889999999999999999999999999999 888888888888887776533
No 263
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=86.14 E-value=1.1 Score=30.88 Aligned_cols=32 Identities=22% Similarity=0.277 Sum_probs=29.2
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhC
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLK 38 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~ 38 (222)
...+..++|.++...|++++|+..+++|+++.
T Consensus 40 ~~~all~lA~~~~~~G~~~~A~~~l~eAi~~A 71 (94)
T PF12862_consen 40 LAYALLNLAELHRRFGHYEEALQALEEAIRLA 71 (94)
T ss_pred HHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 35678999999999999999999999999984
No 264
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=85.30 E-value=4.6 Score=30.64 Aligned_cols=54 Identities=9% Similarity=0.043 Sum_probs=47.3
Q ss_pred HhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 16 SSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 16 ~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
..-...++.+++...+...--+.|+.++....-|.++...|+|.+|...++.+.
T Consensus 18 ~~aL~~~d~~D~e~lLdALrvLrP~~~e~d~~dg~l~i~rg~w~eA~rvlr~l~ 71 (153)
T TIGR02561 18 MYALRSADPYDAQAMLDALRVLRPNLKELDMFDGWLLIARGNYDEAARILRELL 71 (153)
T ss_pred HHHHhcCCHHHHHHHHHHHHHhCCCccccchhHHHHHHHcCCHHHHHHHHHhhh
Confidence 333448999999999999999999999999999999999999999998776654
No 265
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=84.98 E-value=4.6 Score=32.22 Aligned_cols=55 Identities=16% Similarity=0.070 Sum_probs=47.4
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC----CHHHHHHHHHHHHHcCCHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD----FPDAYCNLAHCLQIVCDWTDY 61 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~----~~~a~~~l~~~~~~~~~~~~a 61 (222)
+.++..+.||..|. .-+.+.|+..|-++|++.+. +++.+..|+.++...++++.|
T Consensus 139 ~t~elq~aLAtyY~-krD~~Kt~~ll~~~L~l~~~~~~~n~eil~sLas~~~~~~~~e~A 197 (203)
T PF11207_consen 139 ETAELQYALATYYT-KRDPEKTIQLLLRALELSNPDDNFNPEILKSLASIYQKLKNYEQA 197 (203)
T ss_pred CCHHHHHHHHHHHH-ccCHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHhcchhhh
Confidence 57889999998888 45899999999999999544 489999999999999998765
No 266
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=84.23 E-value=1.6 Score=25.82 Aligned_cols=25 Identities=12% Similarity=0.170 Sum_probs=22.1
Q ss_pred HHHHHhcccCCCcHHHHHHHHHHHH
Q psy17695 12 LFSNSSFPSPGNIPEAIQSYRTALK 36 (222)
Q Consensus 12 ~~lg~~l~~~g~~~eA~~~~~~Al~ 36 (222)
++|+.+|.++|+.+.|...++..++
T Consensus 3 LdLA~ayie~Gd~e~Ar~lL~evl~ 27 (44)
T TIGR03504 3 LDLARAYIEMGDLEGARELLEEVIE 27 (44)
T ss_pred hHHHHHHHHcCChHHHHHHHHHHHH
Confidence 5788999999999999999999885
No 267
>KOG4507|consensus
Probab=83.81 E-value=1.4 Score=40.73 Aligned_cols=65 Identities=9% Similarity=0.095 Sum_probs=45.7
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
..+|++++..-|-.-+|-..+.++|.++-..+..++.+|+.+..+.+.++|++.+++++...++.
T Consensus 645 ~v~la~~~~~~~~~~da~~~l~q~l~~~~sepl~~~~~g~~~l~l~~i~~a~~~~~~a~~~~~~~ 709 (886)
T KOG4507|consen 645 LVNLANLLIHYGLHLDATKLLLQALAINSSEPLTFLSLGNAYLALKNISGALEAFRQALKLTTKC 709 (886)
T ss_pred HHHHHHHHHHhhhhccHHHHHHHHHhhcccCchHHHhcchhHHHHhhhHHHHHHHHHHHhcCCCC
Confidence 45666777777777777777777777776667777777777777777777777777777665543
No 268
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=83.68 E-value=2.9 Score=25.79 Aligned_cols=32 Identities=16% Similarity=0.136 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.++.++..+.++|++++|....+.+++..|+
T Consensus 2 d~lY~lAig~ykl~~Y~~A~~~~~~lL~~eP~ 33 (53)
T PF14853_consen 2 DCLYYLAIGHYKLGEYEKARRYCDALLEIEPD 33 (53)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHTTS
T ss_pred hhHHHHHHHHHHhhhHHHHHHHHHHHHhhCCC
Confidence 45788899999999999999999999988886
No 269
>KOG2610|consensus
Probab=83.59 E-value=1.9 Score=37.28 Aligned_cols=73 Identities=15% Similarity=0.071 Sum_probs=62.4
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh-CCCC---HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL-KPDF---PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l-~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...|.+..++..--.++.-+|+...-...+++.+-. +|+. ...+..++.++.+.|-+++|++..++++++++.
T Consensus 131 ~d~PtDlla~kfsh~a~fy~G~~~~~k~ai~kIip~wn~dlp~~sYv~GmyaFgL~E~g~y~dAEk~A~ralqiN~~ 207 (491)
T KOG2610|consen 131 DDYPTDLLAVKFSHDAHFYNGNQIGKKNAIEKIIPKWNADLPCYSYVHGMYAFGLEECGIYDDAEKQADRALQINRF 207 (491)
T ss_pred HhCchhhhhhhhhhhHHHhccchhhhhhHHHHhccccCCCCcHHHHHHHHHHhhHHHhccchhHHHHHHhhccCCCc
Confidence 345788888888888899999999999999999988 7776 667788888999999999999999999887764
No 270
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=83.43 E-value=1.6 Score=29.08 Aligned_cols=28 Identities=21% Similarity=0.342 Sum_probs=17.2
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALK 36 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~ 36 (222)
.-+...++-+-+.|++++|+.+|+.+++
T Consensus 7 ~~~a~~AVe~D~~gr~~eAi~~Y~~aIe 34 (75)
T cd02682 7 RKYAINAVKAEKEGNAEDAITNYKKAIE 34 (75)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHHH
Confidence 3445556666666776666666666554
No 271
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=83.31 E-value=3.5 Score=23.53 Aligned_cols=33 Identities=18% Similarity=0.098 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
+.+..||.+-+...+|++|...|++++.+..+.
T Consensus 2 dv~~~Lgeisle~e~f~qA~~D~~~aL~i~~~l 34 (38)
T PF10516_consen 2 DVYDLLGEISLENENFEQAIEDYEKALEIQEEL 34 (38)
T ss_pred cHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHh
Confidence 467889999999999999999999999887763
No 272
>PF12854 PPR_1: PPR repeat
Probab=83.24 E-value=2.3 Score=23.39 Aligned_cols=27 Identities=15% Similarity=0.116 Sum_probs=17.2
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRT 33 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~ 33 (222)
+...|+.|-..|.+.|+.++|.+.|++
T Consensus 6 d~~ty~~lI~~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 6 DVVTYNTLIDGYCKAGRVDEAFELFDE 32 (34)
T ss_pred cHhHHHHHHHHHHHCCCHHHHHHHHHh
Confidence 445566666666667777777666654
No 273
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=82.84 E-value=5 Score=31.41 Aligned_cols=53 Identities=19% Similarity=0.312 Sum_probs=44.6
Q ss_pred cCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 20 SPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 20 ~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
+.+..+..++..++.++-.|+ +..+.+++.++...|+.++|....+++..+.|
T Consensus 123 ~~~~l~~~~~~a~~~l~~~P~-~~~~~~~a~~l~~~G~~~eA~~~~~~~~~lyP 175 (193)
T PF11846_consen 123 DPEMLEAYIEWAERLLRRRPD-PNVYQRYALALALLGDPEEARQWLARARRLYP 175 (193)
T ss_pred CHHHHHHHHHHHHHHHHhCCC-HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence 344466777788888888895 88999999999999999999998888887777
No 274
>KOG4507|consensus
Probab=82.48 E-value=2 Score=39.68 Aligned_cols=46 Identities=24% Similarity=0.312 Sum_probs=30.7
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHC 51 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~ 51 (222)
.-|..++.+|.++..+.+.+.|+++|+.|++++|+++..-..|..+
T Consensus 674 sepl~~~~~g~~~l~l~~i~~a~~~~~~a~~~~~~~~~~~~~l~~i 719 (886)
T KOG4507|consen 674 SEPLTFLSLGNAYLALKNISGALEAFRQALKLTTKCPECENSLKLI 719 (886)
T ss_pred cCchHHHhcchhHHHHhhhHHHHHHHHHHHhcCCCChhhHHHHHHH
Confidence 3455666777777777777777777777777777776666555444
No 275
>KOG3785|consensus
Probab=82.13 E-value=2.8 Score=36.51 Aligned_cols=65 Identities=17% Similarity=0.174 Sum_probs=52.1
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-...|+.++...-.+.|||+.|.+.+.-+|+....-++++.|+.++.-++-+.+.+.-.++..|+
T Consensus 153 dqLSLAsvhYmR~HYQeAIdvYkrvL~dn~ey~alNVy~ALCyyKlDYydvsqevl~vYL~q~pd 217 (557)
T KOG3785|consen 153 DQLSLASVHYMRMHYQEAIDVYKRVLQDNPEYIALNVYMALCYYKLDYYDVSQEVLKVYLRQFPD 217 (557)
T ss_pred HHHhHHHHHHHHHHHHHHHHHHHHHHhcChhhhhhHHHHHHHHHhcchhhhHHHHHHHHHHhCCC
Confidence 34455666666668999999999999999999999999999999998888777766666665554
No 276
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=82.08 E-value=6.6 Score=32.46 Aligned_cols=50 Identities=18% Similarity=0.235 Sum_probs=41.2
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHH---HHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPD---AYCNLAHCLQI 54 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~---a~~~l~~~~~~ 54 (222)
|-...+...++.++.+.+++++|+...++-+++.|+++. +++..|.++..
T Consensus 68 ~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~YlkgLs~~~ 120 (254)
T COG4105 68 PYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADYAYYLKGLSYFF 120 (254)
T ss_pred cccHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCCChhHHHHHHHHHHhc
Confidence 444678899999999999999999999999999998875 45666666543
No 277
>KOG1585|consensus
Probab=81.65 E-value=9.5 Score=31.64 Aligned_cols=65 Identities=15% Similarity=0.078 Sum_probs=37.1
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHh-----CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKL-----KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l-----~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++-..|.++.++..+.|+.+.|++|..+ .|+-+.+-...+.-..+.-+.++|+..|++++.++.+
T Consensus 73 ayEqaamLake~~klsEvvdl~eKAs~lY~E~GspdtAAmaleKAak~lenv~Pd~AlqlYqralavve~ 142 (308)
T KOG1585|consen 73 AYEQAAMLAKELSKLSEVVDLYEKASELYVECGSPDTAAMALEKAAKALENVKPDDALQLYQRALAVVEE 142 (308)
T ss_pred HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhCCcchHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHhc
Confidence 3444555666666667777777766655 2444433333333344555666777777766666543
No 278
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=81.36 E-value=5.9 Score=34.74 Aligned_cols=69 Identities=13% Similarity=-0.022 Sum_probs=53.1
Q ss_pred CCCHHHHHHHHHhccc---CCCcHHHHHHHHH-HHHhCCCCHHHHHHHHHHHHHc---------CCHHHHHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPS---PGNIPEAIQSYRT-ALKLKPDFPDAYCNLAHCLQIV---------CDWTDYEARMKKLVSI 71 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~---~g~~~eA~~~~~~-Al~l~P~~~~a~~~l~~~~~~~---------~~~~~a~~~~~~~~~~ 71 (222)
++.+..-+.+|.++-+ .|+.++|+..+.. ..+-++.+++++..+|.++... ...+.|.+.|+++.++
T Consensus 176 ~~~~~i~~~yafALnRrn~~gdre~Al~il~~~l~~~~~~~~d~~gL~GRIyKD~~~~s~~~d~~~ldkAi~~Y~kgFe~ 255 (374)
T PF13281_consen 176 ANQHNIKFQYAFALNRRNKPGDREKALQILLPVLESDENPDPDTLGLLGRIYKDLFLESNFTDRESLDKAIEWYRKGFEI 255 (374)
T ss_pred hcchHHHHHHHHHHhhcccCCCHHHHHHHHHHHHhccCCCChHHHHHHHHHHHHHHHHcCccchHHHHHHHHHHHHHHcC
Confidence 4567778889999999 9999999999999 5555788999999999998753 1245566666666655
Q ss_pred HH
Q psy17695 72 VA 73 (222)
Q Consensus 72 ~~ 73 (222)
.+
T Consensus 256 ~~ 257 (374)
T PF13281_consen 256 EP 257 (374)
T ss_pred Cc
Confidence 54
No 279
>COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]
Probab=81.33 E-value=7.4 Score=35.92 Aligned_cols=71 Identities=15% Similarity=-0.011 Sum_probs=55.6
Q ss_pred CCCCCHHHHHH--HHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH-HHHHHH
Q psy17695 3 KTPFFKVGILF--SNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKK-LVSIVA 73 (222)
Q Consensus 3 ~~P~~~~~~~~--lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~-~~~~~~ 73 (222)
.+|.+++.+.. +...+...++...+...++.++..+|.+..+..||+..+...+....+...+.+ +....+
T Consensus 60 ~~~~~~~llla~~lsi~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~L~~ale~~~~~~~~~~~~~~~a~~~~~ 133 (620)
T COG3914 60 INDVNPELLLAAFLSILLAPLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSP 133 (620)
T ss_pred cCCCCHHHHHHHHHHhhccccccchhHHHHHhhhHhcCcccchHHHHHHHHHHHhhhHHHHHHHHHHHHHhcCc
Confidence 45666666433 588899999999999999999999999999999999999887776666555544 333333
No 280
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=80.99 E-value=3.8 Score=37.08 Aligned_cols=72 Identities=17% Similarity=0.140 Sum_probs=52.7
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC-------------------------CHHHHHHHHHHHHHc
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD-------------------------FPDAYCNLAHCLQIV 55 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~-------------------------~~~a~~~l~~~~~~~ 55 (222)
|.++|+.+.++..|+. ....-..||+..|++|++.... ...+...|+.|+.++
T Consensus 195 Lei~pdCAdAYILLAE--EeA~Ti~Eae~l~rqAvkAgE~~lg~s~~~~~~g~~~e~~~~Rdt~~~~y~KrRLAmCarkl 272 (539)
T PF04184_consen 195 LEINPDCADAYILLAE--EEASTIVEAEELLRQAVKAGEASLGKSQFLQHHGHFWEAWHRRDTNVLVYAKRRLAMCARKL 272 (539)
T ss_pred HHhhhhhhHHHhhccc--ccccCHHHHHHHHHHHHHHHHHhhchhhhhhcccchhhhhhccccchhhhhHHHHHHHHHHh
Confidence 3567888888866543 2344578888888888865310 123567789999999
Q ss_pred CCHHHHHHHHHHHHHHHHh
Q psy17695 56 CDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 56 ~~~~~a~~~~~~~~~~~~~ 74 (222)
|+.+||++.++++++..|.
T Consensus 273 Gr~~EAIk~~rdLlke~p~ 291 (539)
T PF04184_consen 273 GRLREAIKMFRDLLKEFPN 291 (539)
T ss_pred CChHHHHHHHHHHHhhCCc
Confidence 9999999999999887664
No 281
>KOG1310|consensus
Probab=80.99 E-value=2.6 Score=38.55 Aligned_cols=71 Identities=17% Similarity=0.116 Sum_probs=59.3
Q ss_pred CCCCCHHHHHHHHHhccc---CCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPS---PGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~---~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
..|.....+.|.+.++.+ .|+.-.|+.-...|++++|....+|+.|+.++.+++.+.+|+.+...+....|
T Consensus 403 ~~~~~~~~l~nraa~lmkRkW~~d~~~AlrDch~Alrln~s~~kah~~la~aL~el~r~~eal~~~~alq~~~P 476 (758)
T KOG1310|consen 403 YVPDAIYLLENRAAALMKRKWRGDSYLALRDCHVALRLNPSIQKAHFRLARALNELTRYLEALSCHWALQMSFP 476 (758)
T ss_pred hccchhHHHHhHHHHHHhhhccccHHHHHHhHHhhccCChHHHHHHHHHHHHHHHHhhHHHhhhhHHHHhhcCc
Confidence 346677777788877774 46677778888899999999999999999999999999999998887776666
No 282
>PF14863 Alkyl_sulf_dimr: Alkyl sulfatase dimerisation; PDB: 2YHE_C 2CG2_A 2CG3_A 2CFU_A 2CFZ_A.
Probab=79.69 E-value=6.5 Score=29.51 Aligned_cols=50 Identities=16% Similarity=0.041 Sum_probs=40.6
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCC
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCD 57 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~ 57 (222)
.+.....+......|++.-|....+.++..+|++.++...++.++..++.
T Consensus 70 ~d~vl~~A~~~~~~gd~~wA~~L~d~l~~adp~n~~ar~l~A~al~~lg~ 119 (141)
T PF14863_consen 70 ADKVLERAQAALAAGDYQWAAELLDHLVFADPDNEEARQLKADALEQLGY 119 (141)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHHH
Confidence 45667778888899999999999999999999999999999988877654
No 283
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=79.26 E-value=3.7 Score=21.16 Aligned_cols=25 Identities=8% Similarity=0.098 Sum_probs=15.2
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 45 YCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 45 ~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
|..+...+.+.|++++|.+.+++..
T Consensus 3 y~~li~~~~~~~~~~~a~~~~~~M~ 27 (31)
T PF01535_consen 3 YNSLISGYCKMGQFEEALEVFDEMR 27 (31)
T ss_pred HHHHHHHHHccchHHHHHHHHHHHh
Confidence 4555566666666666666665543
No 284
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=78.14 E-value=5.8 Score=35.63 Aligned_cols=50 Identities=20% Similarity=0.202 Sum_probs=44.0
Q ss_pred HhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHH
Q psy17695 16 SSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMK 66 (222)
Q Consensus 16 ~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~ 66 (222)
.-+..+|++.++.-.-.=..+++| ++.++-.+|.|+....++++|-+++.
T Consensus 470 EyLysqgey~kc~~ys~WL~~iaP-S~~~~RLlGl~l~e~k~Y~eA~~~l~ 519 (549)
T PF07079_consen 470 EYLYSQGEYHKCYLYSSWLTKIAP-SPQAYRLLGLCLMENKRYQEAWEYLQ 519 (549)
T ss_pred HHHHhcccHHHHHHHHHHHHHhCC-cHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 446789999999999999999999 89999999999999988888876553
No 285
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=77.73 E-value=7.1 Score=33.06 Aligned_cols=48 Identities=19% Similarity=0.247 Sum_probs=42.8
Q ss_pred HHHHHHHHHhcccC------CCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHc
Q psy17695 8 KVGILFSNSSFPSP------GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIV 55 (222)
Q Consensus 8 ~~~~~~lg~~l~~~------g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~ 55 (222)
+.++..+|.-...+ +..++++..|++|++++|+...+|+.+|..+...
T Consensus 252 a~~~l~~a~w~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~k~~~~~a~~~~~~ 305 (352)
T PF02259_consen 252 AKAFLLLAKWLDELYSKLSSESSDEILKYYKEATKLDPSWEKAWHSWALFNDKL 305 (352)
T ss_pred HHHHHHHHHHHHhhccccccccHHHHHHHHHHHHHhChhHHHHHHHHHHHHHHH
Confidence 56788888888888 9999999999999999999999999999987654
No 286
>PF13041 PPR_2: PPR repeat family
Probab=77.20 E-value=12 Score=22.09 Aligned_cols=42 Identities=24% Similarity=0.225 Sum_probs=29.2
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhC--CCCHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLK--PDFPDAYCNLA 49 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~--P~~~~a~~~l~ 49 (222)
+...|+.+=..+.+.|++++|.+.|++..+.. |+ ...+..+.
T Consensus 2 ~~~~yn~li~~~~~~~~~~~a~~l~~~M~~~g~~P~-~~Ty~~li 45 (50)
T PF13041_consen 2 DVVTYNTLISGYCKAGKFEEALKLFKEMKKRGIKPD-SYTYNILI 45 (50)
T ss_pred chHHHHHHHHHHHHCcCHHHHHHHHHHHHHcCCCCC-HHHHHHHH
Confidence 34567777788888888888888888888763 44 34444433
No 287
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=77.18 E-value=2 Score=36.68 Aligned_cols=69 Identities=12% Similarity=-0.113 Sum_probs=42.7
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHH-HHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCN-LAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~-l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
|+++..|.....-..+.|-+.+--..|.++++.+|.+++.|.. -.+-+...++.+.+.+.+.+.+++.+
T Consensus 104 f~D~k~w~~y~~Y~~k~k~y~~~~nI~~~~l~khP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~ 173 (435)
T COG5191 104 FNDPKIWSQYAAYVIKKKMYGEMKNIFAECLTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNS 173 (435)
T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeeccchhhhhccHHHHHHHHHhhhccCC
Confidence 5566666666666666666666666777777777777766654 33334455666666666666665544
No 288
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=76.13 E-value=23 Score=30.29 Aligned_cols=74 Identities=11% Similarity=0.024 Sum_probs=62.3
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH---cCCHHHHHHHHHHHHHHHHhh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQI---VCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~---~~~~~~a~~~~~~~~~~~~~~ 75 (222)
+.+|++...+..+=.+..+..+-++..+-+++++..+|++...|..+....+. .+.++.....|.++++.+...
T Consensus 59 ~~np~~~~L~l~~l~~~~~~~~~~~l~~~we~~l~~~~~~~~LW~~yL~~~q~~~~~f~v~~~~~~y~~~l~~L~~~ 135 (321)
T PF08424_consen 59 KHNPDSERLLLGYLEEGEKVWDSEKLAKKWEELLFKNPGSPELWREYLDFRQSNFASFTVSDVRDVYEKCLRALSRR 135 (321)
T ss_pred HhCCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHh
Confidence 45888888888888888888899999999999999999999999888777655 456888888898888877754
No 289
>KOG2471|consensus
Probab=76.11 E-value=5.1 Score=36.38 Aligned_cols=48 Identities=17% Similarity=0.180 Sum_probs=44.1
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQI 54 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~ 54 (222)
.-+..+|.|..|.-.|+.-+|.+||.+++..--.+|..|..|+.|...
T Consensus 334 s~eilYNcG~~~Lh~grPl~AfqCf~~av~vfh~nPrlWLRlAEcCim 381 (696)
T KOG2471|consen 334 SMEILYNCGLLYLHSGRPLLAFQCFQKAVHVFHRNPRLWLRLAECCIM 381 (696)
T ss_pred chhhHHhhhHHHHhcCCcHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH
Confidence 357789999999999999999999999999999999999999998754
No 290
>KOG1915|consensus
Probab=75.00 E-value=8 Score=35.05 Aligned_cols=67 Identities=12% Similarity=-0.031 Sum_probs=57.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...|...|.-=.+++++..|...|++||..+-.+...|...+.+-++....+.|...+.+++.++|.
T Consensus 73 ~~~WikYaqwEesq~e~~RARSv~ERALdvd~r~itLWlkYae~Emknk~vNhARNv~dRAvt~lPR 139 (677)
T KOG1915|consen 73 MQVWIKYAQWEESQKEIQRARSVFERALDVDYRNITLWLKYAEFEMKNKQVNHARNVWDRAVTILPR 139 (677)
T ss_pred HHHHHHHHHHHHhHHHHHHHHHHHHHHHhcccccchHHHHHHHHHHhhhhHhHHHHHHHHHHHhcch
Confidence 4567777777778888999999999999999889999999999888888888888888888888875
No 291
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=74.97 E-value=11 Score=33.48 Aligned_cols=52 Identities=15% Similarity=0.046 Sum_probs=45.9
Q ss_pred cccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 18 FPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 18 l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
-..+|..+.|+.+-++|.+..|.-+-++......+...|+|++|+...+...
T Consensus 164 Aqr~GareaAr~yAe~Aa~~Ap~l~WA~~AtLe~r~~~gdWd~AlkLvd~~~ 215 (531)
T COG3898 164 AQRLGAREAARHYAERAAEKAPQLPWAARATLEARCAAGDWDGALKLVDAQR 215 (531)
T ss_pred HHhcccHHHHHHHHHHHHhhccCCchHHHHHHHHHHhcCChHHHHHHHHHHH
Confidence 3478999999999999999999999999988899999999999998766544
No 292
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=74.86 E-value=5.6 Score=25.58 Aligned_cols=25 Identities=24% Similarity=0.407 Sum_probs=12.0
Q ss_pred HHHHHhcccCCCcHHHHHHHHHHHH
Q psy17695 12 LFSNSSFPSPGNIPEAIQSYRTALK 36 (222)
Q Consensus 12 ~~lg~~l~~~g~~~eA~~~~~~Al~ 36 (222)
...|.-.-+.|++++|+.+|.+|+.
T Consensus 9 ~~~Av~~D~~g~~~~A~~~Y~~ai~ 33 (69)
T PF04212_consen 9 IKKAVEADEAGNYEEALELYKEAIE 33 (69)
T ss_dssp HHHHHHHHHTTSHHHHHHHHHHHHH
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 3344444445555555555554443
No 293
>KOG1941|consensus
Probab=74.82 E-value=4.9 Score=35.17 Aligned_cols=64 Identities=13% Similarity=0.074 Sum_probs=50.7
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC----------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF----------PDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~----------~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
.....||.++..+.++++|+.+..+|+++--+. ..+++.++..++.+|..-+|.++.+++.++.
T Consensus 163 qvcv~Lgslf~~l~D~~Kal~f~~kA~~lv~s~~l~d~~~kyr~~~lyhmaValR~~G~LgdA~e~C~Ea~kla 236 (518)
T KOG1941|consen 163 QVCVSLGSLFAQLKDYEKALFFPCKAAELVNSYGLKDWSLKYRAMSLYHMAVALRLLGRLGDAMECCEEAMKLA 236 (518)
T ss_pred ehhhhHHHHHHHHHhhhHHhhhhHhHHHHHHhcCcCchhHHHHHHHHHHHHHHHHHhcccccHHHHHHHHHHHH
Confidence 345678888888889999988888888875332 2457888899999999999999998887654
No 294
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=74.51 E-value=5.3 Score=28.68 Aligned_cols=55 Identities=16% Similarity=0.199 Sum_probs=39.6
Q ss_pred HHHHHHHHhcccCCC-----------cHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHH
Q psy17695 9 VGILFSNSSFPSPGN-----------IPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEA 63 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~-----------~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~ 63 (222)
..|..-|.++.+++. +-.++++|.++..+.|..+..++.+|.-+...-.|+++..
T Consensus 34 ~lh~~QG~if~~lA~~ten~d~k~~yLl~sve~~s~a~~Lsp~~A~~L~~la~~l~s~~~Ykk~v~ 99 (111)
T PF04781_consen 34 LLHRLQGTIFYKLAKKTENPDVKFRYLLGSVECFSRAVELSPDSAHSLFELASQLGSVKYYKKAVK 99 (111)
T ss_pred HHHHHHhHHHHHHHHhccCchHHHHHHHHhHHHHHHHhccChhHHHHHHHHHHHhhhHHHHHHHHH
Confidence 667777777765543 4578999999999999998888888865444434444433
No 295
>KOG2758|consensus
Probab=74.46 E-value=12 Score=32.15 Aligned_cols=71 Identities=15% Similarity=0.091 Sum_probs=52.3
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--CCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL--KPD--FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--~P~--~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.|+..++.+..+......|+|.+|-.++-....+ +|+ +..+++.....-.-+-+|+.|.+.+.++.+.+.+
T Consensus 125 ~~e~i~~lykyakfqyeCGNY~gAs~yLY~~r~l~~~~d~n~lsalwGKlASEIL~qnWd~A~edL~rLre~IDs 199 (432)
T KOG2758|consen 125 TPERIETLYKYAKFQYECGNYSGASDYLYFYRALVSDPDRNYLSALWGKLASEILTQNWDGALEDLTRLREYIDS 199 (432)
T ss_pred CHHHHHHHHHHHHHHHhccCcccHHHHHHHHHHhcCCcchhhHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHcc
Confidence 4677889999999999999999999976444443 333 3444444444445567999999999888887664
No 296
>KOG2796|consensus
Probab=74.21 E-value=18 Score=30.40 Aligned_cols=62 Identities=10% Similarity=0.089 Sum_probs=52.6
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhC-CCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLK-PDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~-P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
.+.+-+..++.-+|.|.-..+.+.+.++.+ |..+.....||.+.++.||.+.|...++...+
T Consensus 178 ~Vmy~~~~~llG~kEy~iS~d~~~~vi~~~~e~~p~L~s~Lgr~~MQ~GD~k~a~~yf~~vek 240 (366)
T KOG2796|consen 178 RVMYSMANCLLGMKEYVLSVDAYHSVIKYYPEQEPQLLSGLGRISMQIGDIKTAEKYFQDVEK 240 (366)
T ss_pred HHHHHHHHHHhcchhhhhhHHHHHHHHHhCCcccHHHHHHHHHHHHhcccHHHHHHHHHHHHH
Confidence 345667788888999999999999999998 77888889999999999999999988876553
No 297
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=73.81 E-value=13 Score=33.62 Aligned_cols=65 Identities=14% Similarity=-0.007 Sum_probs=48.2
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC-HHHHHHHHHHHHHcCCH-------HHHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF-PDAYCNLAHCLQIVCDW-------TDYEARMKKLVSIV 72 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~-~~a~~~l~~~~~~~~~~-------~~a~~~~~~~~~~~ 72 (222)
...++.+|.++.-+++|++|..+|.+.++.+.-+ +...+..|.|+...++- ++|.+.++++-.+.
T Consensus 305 ~l~~~El~w~~~~~~~w~~A~~~f~~L~~~s~WSka~Y~Y~~a~c~~~l~~~~~~~~~~~~a~~l~~~vp~l~ 377 (468)
T PF10300_consen 305 HLCYFELAWCHMFQHDWEEAAEYFLRLLKESKWSKAFYAYLAAACLLMLGREEEAKEHKKEAEELFRKVPKLK 377 (468)
T ss_pred HHHHHHHHHHHHHHchHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccchhhhhhHHHHHHHHHHHHHHH
Confidence 3567789999999999999999999999976553 34456667788888888 45555554444433
No 298
>KOG3081|consensus
Probab=73.20 E-value=16 Score=30.61 Aligned_cols=53 Identities=15% Similarity=0.189 Sum_probs=43.0
Q ss_pred CCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 22 GNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 22 g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+++.+|--.|+..-+.-|-.+..+..++.|.+.+++|++|+..++.++.-.++
T Consensus 187 ek~qdAfyifeE~s~k~~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~ 239 (299)
T KOG3081|consen 187 EKIQDAFYIFEELSEKTPPTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAK 239 (299)
T ss_pred hhhhhHHHHHHHHhcccCCChHHHccHHHHHHHhcCHHHHHHHHHHHHhccCC
Confidence 45777777787777766777888889999999999999999999888765443
No 299
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=72.59 E-value=6.2 Score=29.09 Aligned_cols=35 Identities=14% Similarity=0.235 Sum_probs=29.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALK 36 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~ 36 (222)
..+|.+-.+|..+-.+|..+|+..+|+..|++..+
T Consensus 90 ~~dP~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~~~ 124 (146)
T PF03704_consen 90 ALDPYDEEAYRLLMRALAAQGRRAEALRVYERYRR 124 (146)
T ss_dssp HHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred hcCCCCHHHHHHHHHHHHHCcCHHHHHHHHHHHHH
Confidence 35899999999999999999999999999998754
No 300
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=72.14 E-value=7.2 Score=32.56 Aligned_cols=49 Identities=18% Similarity=0.016 Sum_probs=44.2
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLA 49 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~ 49 (222)
|..+|+++.-+.--|.+|.++|-+..|++.++..++.-|+.+.+-.-..
T Consensus 208 l~l~P~dp~eirDrGliY~ql~c~~vAl~dl~~~~~~~P~~~~a~~ir~ 256 (269)
T COG2912 208 LDLNPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHCPDDPIAEMIRA 256 (269)
T ss_pred HhhCCCChhhccCcHHHHHhcCCchhhHHHHHHHHHhCCCchHHHHHHH
Confidence 3578999999999999999999999999999999999999998765544
No 301
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=71.17 E-value=7.5 Score=25.89 Aligned_cols=27 Identities=19% Similarity=0.156 Sum_probs=16.0
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHH
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALK 36 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~ 36 (222)
-+...++-+-+.|++++|+.+|..|++
T Consensus 8 ~~a~~Ave~D~~g~y~eA~~~Y~~aie 34 (76)
T cd02681 8 QFARLAVQRDQEGRYSEAVFYYKEAAQ 34 (76)
T ss_pred HHHHHHHHHHHccCHHHHHHHHHHHHH
Confidence 344455555566666666666666655
No 302
>KOG2396|consensus
Probab=71.14 E-value=8.3 Score=35.02 Aligned_cols=49 Identities=16% Similarity=0.137 Sum_probs=42.4
Q ss_pred CCCCCCCHHHHHHHHHhcccCCC-cHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGN-IPEAIQSYRTALKLKPDFPDAYCNLA 49 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~-~~eA~~~~~~Al~l~P~~~~a~~~l~ 49 (222)
|.+.|+++..|-.-+.=....+. .+.|.+.|.++|+.+|+++..|....
T Consensus 132 l~~Hp~~~dLWI~aA~wefe~n~ni~saRalflrgLR~npdsp~Lw~eyf 181 (568)
T KOG2396|consen 132 LAKHPNNPDLWIYAAKWEFEINLNIESARALFLRGLRFNPDSPKLWKEYF 181 (568)
T ss_pred HHhCCCCchhHHhhhhhHHhhccchHHHHHHHHHHhhcCCCChHHHHHHH
Confidence 35789999999999888888777 99999999999999999998775543
No 303
>KOG1915|consensus
Probab=71.14 E-value=7.8 Score=35.14 Aligned_cols=51 Identities=16% Similarity=0.122 Sum_probs=40.7
Q ss_pred cCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 20 SPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 20 ~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
++++++.-...|++-|+-.|.+..+|...|..-..+|+++.+.+.|.-++.
T Consensus 449 qL~efDRcRkLYEkfle~~Pe~c~~W~kyaElE~~LgdtdRaRaifelAi~ 499 (677)
T KOG1915|consen 449 QLREFDRCRKLYEKFLEFSPENCYAWSKYAELETSLGDTDRARAIFELAIS 499 (677)
T ss_pred HHhhHHHHHHHHHHHHhcChHhhHHHHHHHHHHHHhhhHHHHHHHHHHHhc
Confidence 456777788888888888888888888888888888888888877766653
No 304
>KOG1585|consensus
Probab=71.00 E-value=24 Score=29.36 Aligned_cols=67 Identities=13% Similarity=0.019 Sum_probs=53.2
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH------HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP------DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~------~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+..+..-+++|+...++++|..++++|.+-.-++. .++-..+..+.++..|.|++..|+++..+.-+
T Consensus 31 as~yekAAvafRnAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~eKAs~lY~E 103 (308)
T KOG1585|consen 31 ASLYEKAAVAFRNAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVE 103 (308)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence 46677788999999999999999999996543332 34555667778889999999999999887654
No 305
>KOG1464|consensus
Probab=70.46 E-value=20 Score=30.32 Aligned_cols=55 Identities=16% Similarity=0.280 Sum_probs=46.0
Q ss_pred CCcHHHHHHHHHHHHhCCCCHH----HHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhh
Q psy17695 22 GNIPEAIQSYRTALKLKPDFPD----AYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQL 76 (222)
Q Consensus 22 g~~~eA~~~~~~Al~l~P~~~~----a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~l 76 (222)
.+.++|+..|++.+++.|...+ |+-.+..+...+|++++..+.|++++..+...+
T Consensus 41 ~~p~~Al~sF~kVlelEgEKgeWGFKALKQmiKI~f~l~~~~eMm~~Y~qlLTYIkSAV 99 (440)
T KOG1464|consen 41 DEPKEALSSFQKVLELEGEKGEWGFKALKQMIKINFRLGNYKEMMERYKQLLTYIKSAV 99 (440)
T ss_pred cCHHHHHHHHHHHHhcccccchhHHHHHHHHHHHHhccccHHHHHHHHHHHHHHHHHHH
Confidence 3899999999999999988653 566666778899999999999999998776543
No 306
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=70.42 E-value=8.1 Score=34.46 Aligned_cols=53 Identities=8% Similarity=0.069 Sum_probs=48.1
Q ss_pred HHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHH
Q psy17695 12 LFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEAR 64 (222)
Q Consensus 12 ~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~ 64 (222)
..|..+|..+++.+-|+..-.+.+-++|.+..-+..-+.|.+.+.+|.+|...
T Consensus 232 tklv~CYL~~rkpdlALnh~hrsI~lnP~~frnHLrqAavfR~LeRy~eAarS 284 (569)
T PF15015_consen 232 TKLVTCYLRMRKPDLALNHSHRSINLNPSYFRNHLRQAAVFRRLERYSEAARS 284 (569)
T ss_pred HHHHHhhhhcCCCchHHHHHhhhhhcCcchhhHHHHHHHHHHHHHHHHHHHHH
Confidence 46788999999999999999999999999999999999999999999887653
No 307
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=70.30 E-value=15 Score=30.17 Aligned_cols=58 Identities=16% Similarity=0.138 Sum_probs=42.0
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCC------CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKP------DFPDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P------~~~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
.-..+|.-|...|++++|++.|+.+...-- =....+..+..|...+|+.++.....-+
T Consensus 180 l~~~~A~ey~~~g~~~~A~~~l~~~~~~yr~egW~~l~~~~l~~l~~Ca~~~~~~~~~l~~~le 243 (247)
T PF11817_consen 180 LSLEMAEEYFRLGDYDKALKLLEPAASSYRREGWWSLLTEVLWRLLECAKRLGDVEDYLTTSLE 243 (247)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHhCCHHHHHHHHHH
Confidence 345788899999999999999999965521 1235667777888888888776654433
No 308
>KOG0985|consensus
Probab=70.14 E-value=18 Score=35.92 Aligned_cols=62 Identities=18% Similarity=0.088 Sum_probs=52.7
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
+.+..|..+|.+..+.|...+|++.|-+| +++..+.+...+....|.|++...++.-+.+-.
T Consensus 1102 n~p~vWsqlakAQL~~~~v~dAieSyika-----dDps~y~eVi~~a~~~~~~edLv~yL~MaRkk~ 1163 (1666)
T KOG0985|consen 1102 NEPAVWSQLAKAQLQGGLVKDAIESYIKA-----DDPSNYLEVIDVASRTGKYEDLVKYLLMARKKV 1163 (1666)
T ss_pred CChHHHHHHHHHHHhcCchHHHHHHHHhc-----CCcHHHHHHHHHHHhcCcHHHHHHHHHHHHHhh
Confidence 46889999999999999999999999654 678888888888899999999888776665543
No 309
>KOG0546|consensus
Probab=70.01 E-value=7 Score=33.86 Aligned_cols=52 Identities=13% Similarity=0.022 Sum_probs=33.2
Q ss_pred CCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHc
Q psy17695 4 TPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIV 55 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~ 55 (222)
++....+|+..|..+..+.++++|++.++.+...+|++....-.+..+.+..
T Consensus 305 ~~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~~i~~~~~~~~~~~ 356 (372)
T KOG0546|consen 305 ERSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDKAIEEELENVRQKK 356 (372)
T ss_pred ChhhCcHHHHHHhHHHhhhchhhhHHHHHHhhccCcchHHHHHHHHHhhhHH
Confidence 4555666667777777777777777777777777777666555555544443
No 310
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=69.66 E-value=12 Score=19.42 Aligned_cols=25 Identities=16% Similarity=0.120 Sum_probs=18.4
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 45 YCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 45 ~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
|..+...+.+.|++++|.+.|++..
T Consensus 3 ~n~li~~~~~~~~~~~a~~~~~~M~ 27 (35)
T TIGR00756 3 YNTLIDGLCKAGRVEEALELFKEML 27 (35)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 5566677778888888887777654
No 311
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=69.61 E-value=17 Score=20.28 Aligned_cols=30 Identities=10% Similarity=0.010 Sum_probs=22.7
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHH--HHHHHH
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMK--KLVSIV 72 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~--~~~~~~ 72 (222)
+.+..+|..+...|++++|+..++ -+..+.
T Consensus 2 e~~y~~a~~~y~~~ky~~A~~~~~y~~l~~ld 33 (36)
T PF07720_consen 2 EYLYGLAYNFYQKGKYDEAIHFFQYAFLCALD 33 (36)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHT
T ss_pred cHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhc
Confidence 567889999999999999999944 554433
No 312
>PF12854 PPR_1: PPR repeat
Probab=69.55 E-value=13 Score=20.19 Aligned_cols=27 Identities=7% Similarity=-0.146 Sum_probs=23.3
Q ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 41 FPDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 41 ~~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
+...+..+...+.+.|+.++|.+.+++
T Consensus 6 d~~ty~~lI~~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 6 DVVTYNTLIDGYCKAGRVDEAFELFDE 32 (34)
T ss_pred cHhHHHHHHHHHHHCCCHHHHHHHHHh
Confidence 367788999999999999999988765
No 313
>PF13041 PPR_2: PPR repeat family
Probab=69.22 E-value=14 Score=21.69 Aligned_cols=28 Identities=11% Similarity=0.041 Sum_probs=24.6
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 42 PDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 42 ~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
...+..+...+.+.|++++|.+.|++..
T Consensus 3 ~~~yn~li~~~~~~~~~~~a~~l~~~M~ 30 (50)
T PF13041_consen 3 VVTYNTLISGYCKAGKFEEALKLFKEMK 30 (50)
T ss_pred hHHHHHHHHHHHHCcCHHHHHHHHHHHH
Confidence 5678888999999999999999998776
No 314
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=68.76 E-value=9.9 Score=31.12 Aligned_cols=43 Identities=26% Similarity=0.210 Sum_probs=35.2
Q ss_pred HHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 27 AIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 27 A~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
|+.+|.+|+.+.|++...++.||.+....++.-+|.-.|-+++
T Consensus 1 A~~~Y~~A~~l~P~~G~p~nQLAvl~~~~~~~l~avy~y~Rsl 43 (278)
T PF10373_consen 1 AERYYRKAIRLLPSNGNPYNQLAVLASYQGDDLDAVYYYIRSL 43 (278)
T ss_dssp HHHHHHHHHHH-TTBSHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred CHHHHHHHHHhCCCCCCcccchhhhhccccchHHHHHHHHHHH
Confidence 7899999999999999999999999999888877775554443
No 315
>PF08238 Sel1: Sel1 repeat; InterPro: IPR006597 Sel1-like repeats are tetratricopeptide repeat sequences originally identified in a Caenorhabditis elegans receptor molecule which is a key negative regulator of the Notch pathway []. Mammalian homologues have since been identified although these mainly pancreatic proteins have yet to have a function assigned.; PDB: 2XM6_A 3RJV_A 1OUV_A 1KLX_A.
Probab=68.42 E-value=8.7 Score=21.04 Aligned_cols=30 Identities=10% Similarity=0.017 Sum_probs=18.8
Q ss_pred HHHHHHHH--HhcccCC-----CcHHHHHHHHHHHHh
Q psy17695 8 KVGILFSN--SSFPSPG-----NIPEAIQSYRTALKL 37 (222)
Q Consensus 8 ~~~~~~lg--~~l~~~g-----~~~eA~~~~~~Al~l 37 (222)
|++.+.|| .++..-. +.++|+.+|++|.+.
T Consensus 1 a~A~~~lg~~~~~~~g~~g~~~d~~~A~~~~~~Aa~~ 37 (39)
T PF08238_consen 1 AEAQYNLGMYYMYYNGKGGVPKDYEKAFKWYEKAAEQ 37 (39)
T ss_dssp HHHHHHHHHHHHHHHTSTSSCHHHHHHHHHHHHHHHT
T ss_pred ChHHHHHHHHHhhhhccCCccccccchHHHHHHHHHc
Confidence 45667777 4333322 467888888887664
No 316
>KOG4814|consensus
Probab=68.40 E-value=16 Score=34.30 Aligned_cols=67 Identities=10% Similarity=0.019 Sum_probs=52.8
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
+++....+|..+|..+.+.+.|.++++.|-+.+|.++-.......+...-+.-++|..+..+.....
T Consensus 392 ~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~~d~~~~l~q~~~~~~~~~E~~Se~AL~~~~~~~s~~ 458 (872)
T KOG4814|consen 392 RFAKIQRALQVCYLKLEQLDNAVEVYQEAEEVDRQSPLCQLLMLQSFLAEDKSEEALTCLQKIKSSE 458 (872)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhhccccHHHHHHHHHHHHHhcchHHHHHHHHHHHhhh
Confidence 4567778888888888888889999988888888888887777777777777777777766665543
No 317
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=68.02 E-value=8.6 Score=25.60 Aligned_cols=27 Identities=19% Similarity=0.381 Sum_probs=16.5
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHH
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALK 36 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~ 36 (222)
.+...|+-+-..|++++|+.+|..+++
T Consensus 8 ~l~~~Ave~D~~g~y~eAl~~Y~~aie 34 (77)
T cd02683 8 EVLKRAVELDQEGRFQEALVCYQEGID 34 (77)
T ss_pred HHHHHHHHHHHhccHHHHHHHHHHHHH
Confidence 344555666666676666666666654
No 318
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=67.13 E-value=11 Score=25.36 Aligned_cols=25 Identities=16% Similarity=-0.066 Sum_probs=19.0
Q ss_pred HHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 51 CLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 51 ~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.+-+.|++++|...|+++++.+-+.
T Consensus 17 ~~dE~g~~e~Al~~Y~~gi~~l~eg 41 (79)
T cd02679 17 RADEWGDKEQALAHYRKGLRELEEG 41 (79)
T ss_pred hhhhcCCHHHHHHHHHHHHHHHHHH
Confidence 3345588899999999998877763
No 319
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=66.77 E-value=11 Score=20.15 Aligned_cols=29 Identities=14% Similarity=0.213 Sum_probs=20.0
Q ss_pred HHHHHHHHhcccC----CCcHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSP----GNIPEAIQSYRTALKL 37 (222)
Q Consensus 9 ~~~~~lg~~l~~~----g~~~eA~~~~~~Al~l 37 (222)
.+.+.||..|..- .+.++|...|++|.+.
T Consensus 2 ~a~~~lg~~~~~G~g~~~d~~~A~~~~~~Aa~~ 34 (36)
T smart00671 2 EAQYNLGQMYEYGLGVKKDLEKALEYYKKAAEL 34 (36)
T ss_pred HHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHc
Confidence 4667777776632 3778888888887654
No 320
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=66.73 E-value=19 Score=28.70 Aligned_cols=59 Identities=15% Similarity=-0.014 Sum_probs=40.7
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---HHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---PDAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---~~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
.+-+-++..+.+.|++++|+..++.++...-+. .-+-.+|+.++..++.+++|...+..
T Consensus 90 laaL~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~t 151 (207)
T COG2976 90 LAALELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAALKTLDT 151 (207)
T ss_pred HHHHHHHHHHHhhccHHHHHHHHHHHHccchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHhc
Confidence 345567788888888888888888888643332 23456777777777777777765543
No 321
>KOG1070|consensus
Probab=66.64 E-value=17 Score=37.26 Aligned_cols=67 Identities=13% Similarity=-0.015 Sum_probs=54.4
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...|..|.-+|...+++++|.+.|+..++---+....|..++..++.+.+-++|...+.+++..+|+
T Consensus 1530 ~~V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk 1596 (1710)
T KOG1070|consen 1530 YTVHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLPK 1596 (1710)
T ss_pred HHHHHHHHHHHHHhhcchhHHHHHHHHHHHhcchhhHHHHHHHHHhcccHHHHHHHHHHHHHhhcch
Confidence 3467778888888888888888888888877778888888888888887777777888888877776
No 322
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=66.58 E-value=14 Score=21.83 Aligned_cols=25 Identities=12% Similarity=0.151 Sum_probs=22.1
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 45 YCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 45 ~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
..+|+.+|..+|+.+.|.+.+++++
T Consensus 2 kLdLA~ayie~Gd~e~Ar~lL~evl 26 (44)
T TIGR03504 2 KLDLARAYIEMGDLEGARELLEEVI 26 (44)
T ss_pred chHHHHHHHHcCChHHHHHHHHHHH
Confidence 4689999999999999998888777
No 323
>COG4259 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=65.46 E-value=21 Score=25.28 Aligned_cols=34 Identities=9% Similarity=-0.048 Sum_probs=20.4
Q ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 41 FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 41 ~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
-+.++..||..|...|+-+.+...|+.-.++.|+
T Consensus 71 pPG~HAhLGlLys~~G~~e~a~~eFetEKalFPE 104 (121)
T COG4259 71 PPGYHAHLGLLYSNSGKDEQAVREFETEKALFPE 104 (121)
T ss_pred CCcHHHHHHHHHhhcCChHHHHHHHHHhhhhCcc
Confidence 3455566666666666666666666655555554
No 324
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=63.84 E-value=35 Score=24.85 Aligned_cols=61 Identities=5% Similarity=-0.037 Sum_probs=45.0
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHh--CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKL--KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l--~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
+|+.....|...+.... ++.+.|...... --..+..+...+..+-..|++++|.+.|+..+
T Consensus 64 nD~RylkiWi~ya~~~~------~~~~if~~l~~~~IG~~~A~fY~~wA~~le~~~~~~~A~~I~~~Gi 126 (126)
T PF08311_consen 64 NDERYLKIWIKYADLSS------DPREIFKFLYSKGIGTKLALFYEEWAEFLEKRGNFKKADEIYQLGI 126 (126)
T ss_dssp T-HHHHHHHHHHHTTBS------HHHHHHHHHHHHTTSTTBHHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHcc------CHHHHHHHHHHcCccHHHHHHHHHHHHHHHHcCCHHHHHHHHHhhC
Confidence 45566677777766444 888888877765 45677888888999999999999999887653
No 325
>COG5107 RNA14 Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor [RNA processing and modification]
Probab=63.59 E-value=22 Score=32.14 Aligned_cols=69 Identities=9% Similarity=0.030 Sum_probs=55.8
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
++-+|++...|+.|-.-|..+|.+++-.+.|++...--|=.+.+|.....--....++...+..+.+++
T Consensus 35 IkdNPtnI~S~fqLiq~~~tq~s~~~~re~yeq~~~pfp~~~~aw~ly~s~ELA~~df~svE~lf~rCL 103 (660)
T COG5107 35 IKDNPTNILSYFQLIQYLETQESMDAEREMYEQLSSPFPIMEHAWRLYMSGELARKDFRSVESLFGRCL 103 (660)
T ss_pred hhcCchhHHHHHHHHHHHhhhhhHHHHHHHHHHhcCCCccccHHHHHHhcchhhhhhHHHHHHHHHHHH
Confidence 356899999999999999999999999999999999889888888776655555566666665555544
No 326
>KOG0530|consensus
Probab=63.30 E-value=35 Score=28.66 Aligned_cols=69 Identities=13% Similarity=0.044 Sum_probs=53.8
Q ss_pred CCCCCHHHHHHHHHhcccCCCcH-HHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIP-EAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~-eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
.+|.|-..|...-.+....|+.. .-++..+..+..+.++..+|...--+++..++|+.-.....++++.
T Consensus 107 ~npKNYQvWHHRr~ive~l~d~s~rELef~~~~l~~DaKNYHaWshRqW~~r~F~~~~~EL~y~~~Lle~ 176 (318)
T KOG0530|consen 107 DNPKNYQVWHHRRVIVELLGDPSFRELEFTKLMLDDDAKNYHAWSHRQWVLRFFKDYEDELAYADELLEE 176 (318)
T ss_pred hCccchhHHHHHHHHHHHhcCcccchHHHHHHHHhccccchhhhHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 46777888888888888888877 7788888888888888888888888888888887766665555543
No 327
>PF10952 DUF2753: Protein of unknown function (DUF2753); InterPro: IPR020206 This entry represents a group of uncharacterised proteins.
Probab=62.95 E-value=47 Score=24.54 Aligned_cols=60 Identities=15% Similarity=0.023 Sum_probs=45.8
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC---------------HHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDF---------------PDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~---------------~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
-|.++|..-.+.+++-.++-+|++|+.+.-+- +..-.||+...+.+|+-+-...+++-+-
T Consensus 3 ~htllAd~a~~~~~~l~si~hYQqAls~se~~~~~~~~el~dll~i~VisCHNLA~FWR~~gd~~yELkYLqlAS 77 (140)
T PF10952_consen 3 KHTLLADQAFKEADPLRSILHYQQALSLSEEIDESNEIELEDLLTISVISCHNLADFWRSQGDSDYELKYLQLAS 77 (140)
T ss_pred hHHHHHHHHhhcccHHHHHHHHHHHHHHHHHhcccccccHHHHHHHHHHHHhhHHHHHHHcCChHHHHHHHHHHH
Confidence 47789999999999999999999999874221 1235788999999999776666554443
No 328
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=62.94 E-value=59 Score=23.98 Aligned_cols=66 Identities=12% Similarity=0.103 Sum_probs=47.7
Q ss_pred HHHHHHHHhcc--cCCCcHHHHHHHHHHHHhCCC------------CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFP--SPGNIPEAIQSYRTALKLKPD------------FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~--~~g~~~eA~~~~~~Al~l~P~------------~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
.++..|+..-. .-|-|++|...+++|....-. ++..+..|+.++..+|+++++......++.....
T Consensus 8 ~aY~aLs~ae~ql~~g~~~eAa~s~r~AM~~srtiP~eEaFDh~GFDA~chA~Ls~A~~~Lgry~e~L~sA~~aL~YFNR 87 (144)
T PF12968_consen 8 MAYMALSDAERQLQDGAYEEAAASCRKAMEVSRTIPAEEAFDHDGFDAFCHAGLSGALAGLGRYDECLQSADRALRYFNR 87 (144)
T ss_dssp HHHHHHHHHHHHHHHT-HHHHHHHHHHHHHHHTTS-TTS---HHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhccCChHhhcccccHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHhh
Confidence 45566655544 457899999999999987522 1245677888889999999999888888876654
No 329
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=62.93 E-value=20 Score=18.64 Aligned_cols=26 Identities=12% Similarity=0.014 Sum_probs=18.6
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 44 AYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 44 a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
.+..+..++.+.|+++.+...+++..
T Consensus 3 ty~~ll~a~~~~g~~~~a~~~~~~M~ 28 (34)
T PF13812_consen 3 TYNALLRACAKAGDPDAALQLFDEMK 28 (34)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 45667777778888888777776654
No 330
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=62.90 E-value=3.3 Score=35.40 Aligned_cols=49 Identities=12% Similarity=0.179 Sum_probs=41.4
Q ss_pred CCCCCCCHHHHHH-HHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy17695 1 MMKTPFFKVGILF-SNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLA 49 (222)
Q Consensus 1 l~~~P~~~~~~~~-lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~ 49 (222)
|.+.|.|++.|.. -..=+...++++.+.+.|.++++.+|+++..|...-
T Consensus 134 l~khP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~~p~iw~eyf 183 (435)
T COG5191 134 LTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWIEYF 183 (435)
T ss_pred HhcCCCCceeeeeeccchhhhhccHHHHHHHHHhhhccCCCCchHHHHHH
Confidence 3578999998876 555677889999999999999999999998886654
No 331
>PF13226 DUF4034: Domain of unknown function (DUF4034)
Probab=62.78 E-value=21 Score=29.99 Aligned_cols=58 Identities=12% Similarity=-0.004 Sum_probs=45.2
Q ss_pred CCCCHHHHHHHHHhcccCC----------------------CcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHH
Q psy17695 4 TPFFKVGILFSNSSFPSPG----------------------NIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDY 61 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~~g----------------------~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a 61 (222)
.|+...+|..+|..+...+ -.+.|...+.+|++++|..+.+...+.++-...|+..-.
T Consensus 73 ~P~Sy~A~La~g~~~~~~Aw~~RG~~~A~~V~~~~W~~~~~~~d~A~~~ll~A~~l~pr~~~A~~~m~~~s~~fgeP~WL 152 (277)
T PF13226_consen 73 CPKSYHAHLAMGMYWVHRAWDIRGSGYASTVTEAQWLGAHQACDQAVAALLKAIELSPRPVAAAIGMINISAYFGEPDWL 152 (277)
T ss_pred CCCChHHHHHHHHHHHHHHHHHHccchhcccCHHHHHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHHHhhcCCchHH
Confidence 5677777777776665442 357788999999999999999999999988888875543
No 332
>KOG3783|consensus
Probab=62.17 E-value=38 Score=31.13 Aligned_cols=65 Identities=14% Similarity=0.141 Sum_probs=52.1
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHh---CCC----CHHHHHHHHHHHHHcCC-HHHHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKL---KPD----FPDAYCNLAHCLQIVCD-WTDYEARMKKLVSIVA 73 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l---~P~----~~~a~~~l~~~~~~~~~-~~~a~~~~~~~~~~~~ 73 (222)
.-.+.+|.+++.+|+.+.|..+|..+++- .-. .|.|++.||..+...+. ..++.+.+.++.....
T Consensus 450 lk~lL~g~~lR~Lg~~~~a~~~f~i~~~~e~~~~~d~w~~PfA~YElA~l~~~~~g~~~e~~~~L~kAr~~~~ 522 (546)
T KOG3783|consen 450 LKYLLKGVILRNLGDSEVAPKCFKIQVEKESKRTEDLWAVPFALYELALLYWDLGGGLKEARALLLKAREYAS 522 (546)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhhccccccccHHHHHHHHHHHhcccChHHHHHHHHHHHhhcc
Confidence 34678899999999999999999999833 222 36799999999999988 8888887777765443
No 333
>KOG1586|consensus
Probab=61.50 E-value=36 Score=28.13 Aligned_cols=63 Identities=11% Similarity=0.145 Sum_probs=29.4
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHH------HHHHHHHHHH-HcCCHHHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPD------AYCNLAHCLQ-IVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~------a~~~l~~~~~-~~~~~~~a~~~~~~~~~~~ 72 (222)
..+.-.+++|.+. +.++|+.|++++++|--+-.. .+..+|.+|- .+.+++.|+.+|+++-...
T Consensus 75 t~YveA~~cykk~-~~~eAv~cL~~aieIyt~~Grf~~aAk~~~~iaEiyEsdl~d~ekaI~~YE~Aae~y 144 (288)
T KOG1586|consen 75 TTYVEAANCYKKV-DPEEAVNCLEKAIEIYTDMGRFTMAAKHHIEIAEIYESDLQDFEKAIAHYEQAAEYY 144 (288)
T ss_pred HHHHHHHHHhhcc-ChHHHHHHHHHHHHHHHhhhHHHHHHhhhhhHHHHHhhhHHHHHHHHHHHHHHHHHH
Confidence 3344444444444 666666666666665332211 1223343332 2245555555555555443
No 334
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=61.29 E-value=60 Score=26.79 Aligned_cols=47 Identities=17% Similarity=0.206 Sum_probs=32.2
Q ss_pred CHHHHHHHHHhccc----CCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC
Q psy17695 7 FKVGILFSNSSFPS----PGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVC 56 (222)
Q Consensus 7 ~~~~~~~lg~~l~~----~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~ 56 (222)
++.+.+++|.+|.. ..+.++|...|.+|-+... ..+.++++ ++...|
T Consensus 186 ~~~a~~~lg~~y~~G~Gv~~d~~~A~~wy~~Aa~~g~--~~a~~~~~-~~~~~g 236 (292)
T COG0790 186 NPDAQLLLGRMYEKGLGVPRDLKKAFRWYKKAAEQGD--GAACYNLG-LMYLNG 236 (292)
T ss_pred CHHHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHCCC--HHHHHHHH-HHHhcC
Confidence 45677777777743 3477888888888888766 77777777 555444
No 335
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=60.95 E-value=39 Score=30.07 Aligned_cols=44 Identities=18% Similarity=0.145 Sum_probs=38.8
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDA 44 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a 44 (222)
+++.|++..+-..-+.+|.+.|+..++-..++.+++.+|.-..+
T Consensus 256 ~KL~pdlvPaav~AAralf~d~~~rKg~~ilE~aWK~ePHP~ia 299 (531)
T COG3898 256 NKLAPDLVPAAVVAARALFRDGNLRKGSKILETAWKAEPHPDIA 299 (531)
T ss_pred hhcCCccchHHHHHHHHHHhccchhhhhhHHHHHHhcCCChHHH
Confidence 46889999999999999999999999999999999999975443
No 336
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=60.45 E-value=16 Score=23.89 Aligned_cols=21 Identities=19% Similarity=0.183 Sum_probs=12.8
Q ss_pred HcCCHHHHHHHHHHHHHHHHh
Q psy17695 54 IVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 54 ~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..|++++|..+|..++..+-.
T Consensus 20 ~~g~~~eAl~~Y~~a~e~l~~ 40 (77)
T smart00745 20 EAGDYEEALELYKKAIEYLLE 40 (77)
T ss_pred HcCCHHHHHHHHHHHHHHHHH
Confidence 356666666666666665544
No 337
>PF07219 HemY_N: HemY protein N-terminus; InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=60.41 E-value=41 Score=23.70 Aligned_cols=47 Identities=19% Similarity=0.002 Sum_probs=30.5
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVC 56 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~ 56 (222)
....-|.+-...||++.|.+...++-+..+...-.+..-+.+-..+|
T Consensus 61 ~al~~Gl~al~~G~~~~A~k~~~~a~~~~~~~~l~~L~AA~AA~~~g 107 (108)
T PF07219_consen 61 RALSRGLIALAEGDWQRAEKLLAKAAKLSDNPLLNYLLAARAAQAQG 107 (108)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHcC
Confidence 34556677777888888888888887765554444444444444444
No 338
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=60.30 E-value=21 Score=31.76 Aligned_cols=64 Identities=8% Similarity=0.098 Sum_probs=49.4
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHH--------HHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTA--------LKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~A--------l~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...|-.++.-+|+|..|++.++-. .+.-+-+...++..|.+++.++++.+|...+..++-...+
T Consensus 125 ligLlRvh~LLGDY~~Alk~l~~idl~~~~l~~~V~~~~is~~YyvGFaylMlrRY~DAir~f~~iL~yi~r 196 (404)
T PF10255_consen 125 LIGLLRVHCLLGDYYQALKVLENIDLNKKGLYTKVPACHISTYYYVGFAYLMLRRYADAIRTFSQILLYIQR 196 (404)
T ss_pred HHHHHHHHHhccCHHHHHHHhhccCcccchhhccCcchheehHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345667888999999999987642 1112345677899999999999999999999988876554
No 339
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=60.16 E-value=62 Score=26.90 Aligned_cols=56 Identities=16% Similarity=0.022 Sum_probs=42.4
Q ss_pred HHHHHHHHHhcccCC-CcHHHHHHHHHHHHh----C---CCCH-------HHHHHHHHHHHHcCCHHHHHH
Q psy17695 8 KVGILFSNSSFPSPG-NIPEAIQSYRTALKL----K---PDFP-------DAYCNLAHCLQIVCDWTDYEA 63 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g-~~~eA~~~~~~Al~l----~---P~~~-------~a~~~l~~~~~~~~~~~~a~~ 63 (222)
+...++.|..+.+.+ ++++|...+++|+++ . ..++ ..+..++.++...+..+..+.
T Consensus 35 a~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~l~~~~~~~~~~~~~~elr~~iL~~La~~~l~~~~~~~~~k 105 (278)
T PF08631_consen 35 ARVCYNIGKSLLSKKDKYEEAVKWLQRAYDILEKPGKMDKLSPDGSELRLSILRLLANAYLEWDTYESVEK 105 (278)
T ss_pred HHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHhhhhccccCCcHHHHHHHHHHHHHHHHHcCCChHHHHH
Confidence 577899999999999 999999999999998 2 2222 245667777777777655444
No 340
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=60.15 E-value=38 Score=28.52 Aligned_cols=66 Identities=15% Similarity=0.207 Sum_probs=56.0
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC----CHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD----FPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~----~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
......|...+.+.++.|+++.|..++.++...++. .+...+..+..+...|+-.+|...++..+.
T Consensus 143 ~~~~~~~l~~a~~aRk~g~~~~A~~~l~~~~~~~~~~~~~~~~v~~e~akllw~~g~~~~Ai~~L~~~~~ 212 (352)
T PF02259_consen 143 EELAETWLKFAKLARKAGNFQLALSALNRLFQLNPSSESLLPRVFLEYAKLLWAQGEQEEAIQKLRELLK 212 (352)
T ss_pred hHHHHHHHHHHHHHHHCCCcHHHHHHHHHHhccCCcccCCCcchHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 456788999999999999999999999999987632 467777788888999999999888877776
No 341
>KOG2997|consensus
Probab=58.98 E-value=11 Score=32.28 Aligned_cols=39 Identities=26% Similarity=0.417 Sum_probs=34.0
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYC 46 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~ 46 (222)
+.+++.-|+...+.|...+|+..|+.|+++.|+-...+.
T Consensus 19 A~~l~~~av~~Eq~G~l~dai~fYR~AlqI~~diEs~~r 57 (366)
T KOG2997|consen 19 AIALYEKAVLKEQDGSLYDAINFYRDALQIVPDIESKYR 57 (366)
T ss_pred HHHHHHHHHHHhhcCcHHHHHHHHHhhhcCCchHHHHHH
Confidence 677888999999999999999999999999998555444
No 342
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=58.93 E-value=16 Score=26.70 Aligned_cols=33 Identities=21% Similarity=0.315 Sum_probs=26.3
Q ss_pred HHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHH
Q psy17695 11 ILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPD 43 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~ 43 (222)
...+|..+...|++++|..+|-+|+..-|+-.+
T Consensus 66 qV~lGE~L~~~G~~~~aa~hf~nAl~V~~qP~~ 98 (121)
T PF02064_consen 66 QVQLGEQLLAQGDYEEAAEHFYNALKVCPQPAE 98 (121)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHTSSSHHH
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHHhCCCHHH
Confidence 346888888889999999999999999887543
No 343
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=58.49 E-value=44 Score=26.70 Aligned_cols=44 Identities=16% Similarity=0.087 Sum_probs=20.7
Q ss_pred HHHHHHHHHHHhCCCCHHH---HHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 26 EAIQSYRTALKLKPDFPDA---YCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 26 eA~~~~~~Al~l~P~~~~a---~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
+.+...++...-+|....+ -..++..+...+++++|+..++.++
T Consensus 70 ~~~~~~ekf~~~n~~t~Ya~laaL~lAk~~ve~~~~d~A~aqL~~~l 116 (207)
T COG2976 70 KSIAAAEKFVQANGKTIYAVLAALELAKAEVEANNLDKAEAQLKQAL 116 (207)
T ss_pred hhHHHHHHHHhhccccHHHHHHHHHHHHHHHhhccHHHHHHHHHHHH
Confidence 4444555555555444322 2333444445555555555544433
No 344
>KOG2047|consensus
Probab=58.28 E-value=36 Score=32.18 Aligned_cols=63 Identities=11% Similarity=0.092 Sum_probs=50.1
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC----HHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF----PDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~----~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
....|..+|..|...|+.+.|...|++|++.+=.. +..|.+.|..-+...+++.|....+.+.
T Consensus 386 ~~~Lw~~faklYe~~~~l~~aRvifeka~~V~y~~v~dLa~vw~~waemElrh~~~~~Al~lm~~A~ 452 (835)
T KOG2047|consen 386 PGTLWVEFAKLYENNGDLDDARVIFEKATKVPYKTVEDLAEVWCAWAEMELRHENFEAALKLMRRAT 452 (835)
T ss_pred hhhHHHHHHHHHHhcCcHHHHHHHHHHhhcCCccchHHHHHHHHHHHHHHHhhhhHHHHHHHHHhhh
Confidence 34678899999999999999999999999987544 4677888877777777777777666655
No 345
>KOG3364|consensus
Probab=57.93 E-value=32 Score=25.84 Aligned_cols=42 Identities=7% Similarity=0.027 Sum_probs=36.0
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNL 48 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l 48 (222)
.-+..+.|+..+.++++|+.++.+.+..++.+|++.++....
T Consensus 70 rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~Lk 111 (149)
T KOG3364|consen 70 RRECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQALELK 111 (149)
T ss_pred chhhhhhhHHHHHHHhhHHHHHHHHHHHHhhCCCcHHHHHHH
Confidence 356778889999999999999999999999999998886544
No 346
>TIGR02996 rpt_mate_G_obs repeat-companion domain TIGR02996. This model describes an abundant paralogous domain of Gemmata obscuriglobus UQM 2246, a member of the Planctomycetes. The domain also occurs, although rarely, in Myxococcus xanthus DK 1622 and related species. Most member proteins have extensive repeats similar to the leucine-rich repeat, or another repeat class or region of low-complexity sequence. This domain is not repeated, and in Gemmata is usually found at the protein N-terminus.
Probab=57.52 E-value=32 Score=20.09 Aligned_cols=33 Identities=18% Similarity=0.029 Sum_probs=29.2
Q ss_pred HHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHH
Q psy17695 29 QSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDY 61 (222)
Q Consensus 29 ~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a 61 (222)
..|.+|+--+|++...+.-++..+...|+...+
T Consensus 3 ~all~AI~~~P~ddt~RLvYADWL~e~gdp~ra 35 (42)
T TIGR02996 3 EALLRAILAHPDDDTPRLVYADWLDEHGDPARA 35 (42)
T ss_pred HHHHHHHHhCCCCcchHHHHHHHHHHcCCHHHH
Confidence 468899999999999999999999999998544
No 347
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=56.25 E-value=21 Score=23.26 Aligned_cols=18 Identities=6% Similarity=0.093 Sum_probs=9.4
Q ss_pred CCHHHHHHHHHHHHHHHH
Q psy17695 56 CDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 56 ~~~~~a~~~~~~~~~~~~ 73 (222)
|++++|..+|..++..+-
T Consensus 20 g~~~~Al~~Y~~a~e~l~ 37 (75)
T cd02656 20 GNYEEALELYKEALDYLL 37 (75)
T ss_pred CCHHHHHHHHHHHHHHHH
Confidence 555555555555554444
No 348
>PF09797 NatB_MDM20: N-acetyltransferase B complex (NatB) non catalytic subunit; InterPro: IPR019183 This is the non-catalytic subunit of the N-terminal acetyltransferase B complex (NatB). The NatB complex catalyses the acetylation of the amino-terminal methionine residue of all proteins beginning with Met-Asp or Met-Glu and of some proteins beginning with Met-Asn or Met-Met. In Saccharomyces cerevisiae (Baker's yeast) this subunit is called MDM20 and in Schizosaccharomyces pombe (Fission yeast) it is called Arm1. NatB acetylates the Tpm1 protein and regulates and tropomyocin-actin interactions. This subunit is required by the NatB complex for the N-terminal acetylation of Tpm1 [].
Probab=56.08 E-value=33 Score=29.74 Aligned_cols=48 Identities=17% Similarity=0.051 Sum_probs=43.4
Q ss_pred CCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Q psy17695 21 PGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKL 68 (222)
Q Consensus 21 ~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~ 68 (222)
....-+|+..++.+++.+|.+.....-+..+|..+|..+.|...|..+
T Consensus 196 ~~~l~~Ai~lLE~~l~~s~~n~~~~LlLvrlY~~LG~~~~A~~~~~~L 243 (365)
T PF09797_consen 196 SEYLLQAIALLEHALKKSPHNYQLKLLLVRLYSLLGAGSLALEHYESL 243 (365)
T ss_pred HHHHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHcCCHHHHHHHHHhc
Confidence 445778999999999999999999999999999999999999888665
No 349
>PF12583 TPPII_N: Tripeptidyl peptidase II N terminal; InterPro: IPR022232 This entry represents a region of approximately 190 amino acids in length and is found in association with PF00082 from PFAM. The members are serine peptidases belonging to MEROPS peptidase family S8A, tripeptidyl peptidase II (TPPII), clan SB. They are a crucial component of the proteolytic cascade acting downstream of the 26S proteasome in the ubiquitin-proteasome pathway. It is an amino peptidase belonging to the subtilase family removing tripeptides from the free N terminus of oligopeptides. ; PDB: 3LXU_X.
Probab=56.03 E-value=25 Score=26.08 Aligned_cols=44 Identities=20% Similarity=0.128 Sum_probs=27.7
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCL 52 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~ 52 (222)
+++..+=..+...-+.+.|.+.|+..+++.|++..++..|...+
T Consensus 77 EaLRDfq~~~iaKle~e~Ae~vY~el~~~~P~HLpaHla~i~~l 120 (139)
T PF12583_consen 77 EALRDFQCSWIAKLEPENAEQVYEELLEAHPDHLPAHLAMIQNL 120 (139)
T ss_dssp HHHHHHHHHHHTTS-HHHHHHHHHHHHHH-TT-THHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhCHHHHHHHHHHHHHHCcchHHHHHHHHHcc
Confidence 34444444444455668888888888888888888877776554
No 350
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=55.89 E-value=21 Score=23.38 Aligned_cols=22 Identities=32% Similarity=0.419 Sum_probs=10.2
Q ss_pred HHHhcccCCCcHHHHHHHHHHH
Q psy17695 14 SNSSFPSPGNIPEAIQSYRTAL 35 (222)
Q Consensus 14 lg~~l~~~g~~~eA~~~~~~Al 35 (222)
.|+-.-..|++++|+.+|..|+
T Consensus 12 ~Av~~D~~g~y~eA~~~Y~~ai 33 (75)
T cd02678 12 KAIEEDNAGNYEEALRLYQHAL 33 (75)
T ss_pred HHHHHHHcCCHHHHHHHHHHHH
Confidence 3344444455555544444444
No 351
>PF09205 DUF1955: Domain of unknown function (DUF1955); InterPro: IPR015288 Members of this family are found in hypothetical proteins synthesised by the Archaeal organism Sulfolobus. Their exact function has not, as yet, been determined. ; PDB: 1WY6_A.
Probab=55.59 E-value=70 Score=24.12 Aligned_cols=54 Identities=11% Similarity=0.108 Sum_probs=38.2
Q ss_pred HhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Q psy17695 16 SSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLV 69 (222)
Q Consensus 16 ~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~ 69 (222)
.++..+|+-++=.+.+....+-+-.++..+.-++.+|.++|+..++.+..+++.
T Consensus 94 d~lv~~~kkDqLdki~~~l~kn~~~~p~~L~kia~Ay~klg~~r~~~ell~~AC 147 (161)
T PF09205_consen 94 DILVKQGKKDQLDKIYNELKKNEEINPEFLVKIANAYKKLGNTREANELLKEAC 147 (161)
T ss_dssp HHHHHTT-HHHHHHHHHHH-----S-HHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHhccHHHHHHHHHHHhhccCCCHHHHHHHHHHHHHhcchhhHHHHHHHHH
Confidence 456677887777778887777666779999999999999999999988777765
No 352
>PRK15180 Vi polysaccharide biosynthesis protein TviD; Provisional
Probab=55.56 E-value=24 Score=32.24 Aligned_cols=62 Identities=15% Similarity=0.076 Sum_probs=41.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEA 63 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~ 63 (222)
...|.+|......+.++..+|.|++|.+.+.-+=++=..-..+...+..-+..+++|++|..
T Consensus 317 r~~~~~p~~i~l~~~i~~~lg~ye~~~~~~s~~~~~~~s~~~~~~~~~r~~~~l~r~~~a~s 378 (831)
T PRK15180 317 RNQQQDPVLIQLRSVIFSHLGYYEQAYQDISDVEKIIGTTDSTLRCRLRSLHGLARWREALS 378 (831)
T ss_pred HhCCCCchhhHHHHHHHHHhhhHHHHHHHhhchhhhhcCCchHHHHHHHhhhchhhHHHHHH
Confidence 45677777777788888888888888877755554444444445555555666777766554
No 353
>PF05053 Menin: Menin; InterPro: IPR007747 MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23 [].; GO: 0005634 nucleus; PDB: 3RE2_A 3U84_B 3U86_A 3U88_B 3U85_A.
Probab=55.16 E-value=76 Score=29.52 Aligned_cols=68 Identities=6% Similarity=0.035 Sum_probs=47.1
Q ss_pred CCHHHHHHHHHhcc--cCCCcHHHHHHHHHHHHh-----CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFP--SPGNIPEAIQSYRTALKL-----KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 6 ~~~~~~~~lg~~l~--~~g~~~eA~~~~~~Al~l-----~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.+|.++.+||.+-. ....-..+++.|.+|+.. +-.+..-+..+|..+.+.+++.+|...+.++-..+.
T Consensus 275 ~YPmALg~LadLeEi~pt~~r~~~~~l~~~AI~sa~~~Y~n~HvYPYty~gg~~yR~~~~~eA~~~Wa~aa~Vi~ 349 (618)
T PF05053_consen 275 RYPMALGNLADLEEIDPTPGRPTPLELFNEAISSARTYYNNHHVYPYTYLGGYYYRHKRYREALRSWAEAADVIR 349 (618)
T ss_dssp T-HHHHHHHHHHHHHS--TTS--HHHHHHHHHHHHHHHCTT--SHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHT
T ss_pred hCchhhhhhHhHHhhccCCCCCCHHHHHHHHHHHHHHHhcCCccccceehhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 46788888888765 344567788889988876 455667788999999999999999887777765544
No 354
>KOG0686|consensus
Probab=54.02 E-value=27 Score=31.04 Aligned_cols=62 Identities=15% Similarity=0.157 Sum_probs=48.2
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC---CHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKPD---FPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~---~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
.++..+|.-|.+.|+++.|+.+|-|+-..--+ -..++.|+..+-..+++|........++.+
T Consensus 151 ra~~Dl~dhy~~cG~l~~Alr~YsR~RdYCTs~khvInm~ln~i~VSI~~~nw~hv~sy~~~A~s 215 (466)
T KOG0686|consen 151 RALEDLGDHYLDCGQLDNALRCYSRARDYCTSAKHVINMCLNLILVSIYMGNWGHVLSYISKAES 215 (466)
T ss_pred HHHHHHHHHHHHhccHHHHHhhhhhhhhhhcchHHHHHHHHHHHHHHHhhcchhhhhhHHHHHHh
Confidence 57888999999999999999999996555333 346788888888889999876655555443
No 355
>cd08440 PBP2_LTTR_like_4 TThe C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold. LysR-transcriptional regulators comprise the largest family of prokaryotic transcription factor. Homologs of some of LTTRs with similar domain organizations are also found in the archaea and eukaryotic organisms. The LTTRs are composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA binding affinity of the repressor. The genes controlled by the LTTRs have diverse funct
Probab=53.53 E-value=23 Score=26.04 Aligned_cols=56 Identities=9% Similarity=0.053 Sum_probs=32.5
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+||||... . +...-+..++.+-.+..-+-=+.+...+. .++.+.+.+.++||.|-.
T Consensus 1 ~l~Ig~~~--~--~~~~~l~~~l~~~~~~~p~v~i~i~~~~~-~~~~~~l~~g~~D~~i~~ 56 (197)
T cd08440 1 RVRVAALP--S--LAATLLPPVLAAFRRRHPGIRVRLRDVSA-EQVIEAVRSGEVDFGIGS 56 (197)
T ss_pred CeEEEecc--c--hhhhHHHHHHHHHHHhCCCcEEEEEeCCh-HHHHHHHHcCCccEEEEe
Confidence 57888755 3 33333334443322222233344444454 689999999999998853
No 356
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=52.75 E-value=94 Score=25.60 Aligned_cols=52 Identities=15% Similarity=0.084 Sum_probs=37.6
Q ss_pred CCCCHHHHHHHHHhccc----CCCcHHHHHHHHHHHHhCCCC-HHHHHHHHHHHHHc
Q psy17695 4 TPFFKVGILFSNSSFPS----PGNIPEAIQSYRTALKLKPDF-PDAYCNLAHCLQIV 55 (222)
Q Consensus 4 ~P~~~~~~~~lg~~l~~----~g~~~eA~~~~~~Al~l~P~~-~~a~~~l~~~~~~~ 55 (222)
+..++.+.++||.+|.. ..+..+|...|++|.+..-.. ..+..+++.++..-
T Consensus 105 ~~g~~~a~~~lg~~~~~G~gv~~d~~~A~~~~~~Aa~~g~~~a~~~~~~l~~~~~~g 161 (292)
T COG0790 105 ADGLAEALFNLGLMYANGRGVPLDLVKALKYYEKAAKLGNVEAALAMYRLGLAYLSG 161 (292)
T ss_pred hcccHHHHHhHHHHHhcCCCcccCHHHHHHHHHHHHHcCChhHHHHHHHHHHHHHcC
Confidence 45667888888888887 448889999999988875443 35567777766553
No 357
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=52.31 E-value=37 Score=30.55 Aligned_cols=30 Identities=20% Similarity=-0.031 Sum_probs=17.2
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTA 34 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~A 34 (222)
.+++.-|-.||..-..+|+++-|+.||.++
T Consensus 344 ~~~~~~W~~Lg~~AL~~g~~~lAe~c~~k~ 373 (443)
T PF04053_consen 344 LDDPEKWKQLGDEALRQGNIELAEECYQKA 373 (443)
T ss_dssp CSTHHHHHHHHHHHHHTTBHHHHHHHHHHC
T ss_pred cCcHHHHHHHHHHHHHcCCHHHHHHHHHhh
Confidence 345555666666666666666666655554
No 358
>PF01239 PPTA: Protein prenyltransferase alpha subunit repeat; InterPro: IPR002088 Protein prenylation is the posttranslational attachment of either a farnesyl group or a geranylgeranyl group via a thioether linkage (-C-S-C-) to a cysteine at or near the carboxyl terminus of the protein. Farnesyl and geranylgeranyl groups are polyisoprenes, unsaturated hydrocarbons with a multiple of five carbons; the chain is 15 carbons long in the farnesyl moiety and 20 carbons long in the geranylgeranyl moiety. There are three different protein prenyltransferases in humans: farnesyltransferase (FT) and geranylgeranyltransferase 1 (GGT1) share the same motif (the CaaX box) around the cysteine in their substrates, and are thus called CaaX prenyltransferases, whereas geranylgeranyltransferase 2 (GGT2, also called Rab geranylgeranyltransferase) recognises a different motif and is thus called a non-CaaX prenyltransferase. Protein prenyltransferases are currently known only in eukaryotes, but they are widespread, being found in vertebrates, insects, nematodes, plants, fungi and protozoa, including several parasites. Each protein consists of two subunits, alpha and beta; the alpha subunit of FT and GGT1 is encoded by the same gene, FNTA. The alpha subunit is thought to participate in a stable complex with the isoprenyl substrate; the beta subunit binds the peptide substrate. In the alpha subunits of both types of protein prenyltransferases, seven tetratricopeptide repeats are formed by pairs of helices that are stabilised by conserved intercalating residues. The alpha subunits of GGT2 in mammals and plants also have an immunoglobulin-like domain between the fifth and sixth tetratricopeptide repeat, as well as leucine-rich repeats at the carboxyl terminus. The functions of these additional domains in GGT2 are as yet undefined, but they are apparently not directly involved in the interaction with substrates and Rab escort proteins. The tetratricopeptide repeats of the alpha subunit form a right-handed superhelix, which embraces the (alpha-alpha)6 barrel of the beta subunit []. ; GO: 0008318 protein prenyltransferase activity, 0018342 protein prenylation; PDB: 1S63_A 1LD7_A 1LD8_A 2H6G_A 1SA4_A 1MZC_A 1TN6_A 2F0Y_A 2H6H_A 2H6F_A ....
Probab=52.28 E-value=34 Score=17.94 Aligned_cols=28 Identities=18% Similarity=0.154 Sum_probs=23.1
Q ss_pred HHHHHHHHHHhCCCCHHHHHHHHHHHHH
Q psy17695 27 AIQSYRTALKLKPDFPDAYCNLAHCLQI 54 (222)
Q Consensus 27 A~~~~~~Al~l~P~~~~a~~~l~~~~~~ 54 (222)
.++.-.+++..+|.+..+|..+-.++..
T Consensus 2 El~~~~~~l~~~pknys~W~yR~~ll~~ 29 (31)
T PF01239_consen 2 ELEFTKKALEKDPKNYSAWNYRRWLLKQ 29 (31)
T ss_dssp HHHHHHHHHHHSTTCHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHCcccccHHHHHHHHHHH
Confidence 4667789999999999999988776654
No 359
>KOG3325|consensus
Probab=52.01 E-value=27 Score=26.58 Aligned_cols=25 Identities=20% Similarity=0.065 Sum_probs=18.3
Q ss_pred CCChHHHHHHHHhCCCcEEEeCCCCCC
Q psy17695 176 VPCNGKAADRIHKDGIHILVNMNGYTK 202 (222)
Q Consensus 176 ~~~~~~~a~~i~~d~iDILvDL~G~t~ 202 (222)
..|-+.++-+-|+..+|||| .|||-
T Consensus 93 ~gd~~sL~~LaRqldvDILl--~G~Th 117 (183)
T KOG3325|consen 93 WGDPESLALLARQLDVDILL--TGHTH 117 (183)
T ss_pred CCCHHHHHHHHHhcCCcEEE--eCCce
Confidence 34426777788899999997 67774
No 360
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=51.52 E-value=24 Score=23.47 Aligned_cols=25 Identities=32% Similarity=0.348 Sum_probs=13.7
Q ss_pred HHHHhcccCCCcHHHHHHHHHHHHh
Q psy17695 13 FSNSSFPSPGNIPEAIQSYRTALKL 37 (222)
Q Consensus 13 ~lg~~l~~~g~~~eA~~~~~~Al~l 37 (222)
.-|.---..|++++|+..|..|++.
T Consensus 11 ~~A~~eD~~gny~eA~~lY~~ale~ 35 (75)
T cd02680 11 TQAFDEDEKGNAEEAIELYTEAVEL 35 (75)
T ss_pred HHHHHhhHhhhHHHHHHHHHHHHHH
Confidence 3344444556666666666666553
No 361
>KOG1586|consensus
Probab=51.30 E-value=69 Score=26.53 Aligned_cols=66 Identities=18% Similarity=0.206 Sum_probs=48.5
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhC----C--CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLK----P--DFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~----P--~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++.+..-|+.|....+|..|=.+|.+|-.+. . +-+..++..++|+.+. +.++|..++++++.+..+
T Consensus 34 adl~~~Aan~yklaK~w~~AG~aflkaA~~h~k~~skhDaat~YveA~~cykk~-~~~eAv~cL~~aieIyt~ 105 (288)
T KOG1586|consen 34 AELYERAANMYKLAKNWSAAGDAFLKAADLHLKAGSKHDAATTYVEAANCYKKV-DPEEAVNCLEKAIEIYTD 105 (288)
T ss_pred HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhcCCchhHHHHHHHHHHHhhcc-ChHHHHHHHHHHHHHHHh
Confidence 4556667777888888888888888776653 1 1245677778887776 889999999988887664
No 362
>KOG3617|consensus
Probab=50.96 E-value=71 Score=31.41 Aligned_cols=65 Identities=14% Similarity=0.008 Sum_probs=52.2
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHH---------------------HhCCCCHHHHHHHHHHHHHcCCHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTAL---------------------KLKPDFPDAYCNLAHCLQIVCDWTDYEAR 64 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al---------------------~l~P~~~~a~~~l~~~~~~~~~~~~a~~~ 64 (222)
.++..|...|.-+...|+.+.|+.+|..|- +-+..+-.+-+.|+.-|-..|+..+|...
T Consensus 910 ~d~~L~~WWgqYlES~GemdaAl~~Y~~A~D~fs~VrI~C~qGk~~kAa~iA~esgd~AAcYhlaR~YEn~g~v~~Av~F 989 (1416)
T KOG3617|consen 910 RDESLYSWWGQYLESVGEMDAALSFYSSAKDYFSMVRIKCIQGKTDKAARIAEESGDKAACYHLARMYENDGDVVKAVKF 989 (1416)
T ss_pred cchHHHHHHHHHHhcccchHHHHHHHHHhhhhhhheeeEeeccCchHHHHHHHhcccHHHHHHHHHHhhhhHHHHHHHHH
Confidence 355778889999999999999999998774 22355667778888888899999999887
Q ss_pred HHHHHH
Q psy17695 65 MKKLVS 70 (222)
Q Consensus 65 ~~~~~~ 70 (222)
|.++..
T Consensus 990 fTrAqa 995 (1416)
T KOG3617|consen 990 FTRAQA 995 (1416)
T ss_pred HHHHHH
Confidence 776654
No 363
>PHA02537 M terminase endonuclease subunit; Provisional
Probab=50.22 E-value=26 Score=28.57 Aligned_cols=33 Identities=9% Similarity=0.016 Sum_probs=24.8
Q ss_pred HHHHHHHHHhc---------ccCCCcHHHHHHHHHHHHhCCC
Q psy17695 8 KVGILFSNSSF---------PSPGNIPEAIQSYRTALKLKPD 40 (222)
Q Consensus 8 ~~~~~~lg~~l---------~~~g~~~eA~~~~~~Al~l~P~ 40 (222)
+..|-..|..+ .+.++..+|+.++++|++++|+
T Consensus 169 AKl~K~~G~~llr~~~g~~~~d~~~l~~Al~~L~rA~~l~~k 210 (230)
T PHA02537 169 AKLYKAAGYLLLRNEKGEPIGDAETLQLALALLQRAFQLNDK 210 (230)
T ss_pred HHHHHHHHHHHhhcccCCCccCcccHHHHHHHHHHHHHhCCC
Confidence 34455667767 4567888899999999999886
No 364
>COG2099 CobK Precorrin-6x reductase [Coenzyme metabolism]
Probab=49.79 E-value=20 Score=29.68 Aligned_cols=24 Identities=17% Similarity=0.342 Sum_probs=21.0
Q ss_pred CCCCChHHHHHHHHhCCCcEEEeCC
Q psy17695 174 SQVPCNGKAADRIHKDGIHILVNMN 198 (222)
Q Consensus 174 ~~~~~~~~~a~~i~~d~iDILvDL~ 198 (222)
..+.. +++++.|++.+||+|||-+
T Consensus 51 G~l~~-e~l~~~l~e~~i~llIDAT 74 (257)
T COG2099 51 GFLGA-EGLAAFLREEGIDLLIDAT 74 (257)
T ss_pred CcCCH-HHHHHHHHHcCCCEEEECC
Confidence 44676 8999999999999999976
No 365
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=48.99 E-value=30 Score=22.85 Aligned_cols=21 Identities=24% Similarity=0.306 Sum_probs=9.2
Q ss_pred HHhcccCCCcHHHHHHHHHHH
Q psy17695 15 NSSFPSPGNIPEAIQSYRTAL 35 (222)
Q Consensus 15 g~~l~~~g~~~eA~~~~~~Al 35 (222)
|+-.-..|++++|+.+|..++
T Consensus 13 Av~~D~~g~y~eA~~lY~~al 33 (75)
T cd02684 13 AVKKDQRGDAAAALSLYCSAL 33 (75)
T ss_pred HHHHHHhccHHHHHHHHHHHH
Confidence 333444444444444444444
No 366
>cd08441 PBP2_MetR The C-terminal substrate binding domain of LysR-type transcriptional regulator metR, which regulates the expression of methionine biosynthetic genes, contains type 2 periplasmic binding fold. MetR, a member of the LysR family, is a positive regulator for the metA, metE, metF, and metH genes. The sulfur-containing amino acid methionine is the universal initiator of protein synthesis in all known organisms and its derivative S-adenosylmethionine (SAM) and autoinducer-2 (AI-2) are involved in various cellular processes. SAM plays a central role as methyl donor in methylation reactions, which are essential for the biosynthesis of phospholipids, proteins, DNA and RNA. The interspecies signaling molecule AI-2 is involved in cell-cell communication process (quorum sensing) and gene regulation in bacteria. Although methionine biosynthetic enzymes and metabolic pathways are well conserved in bacteria, the regulation of methionine biosynthesis involves various regulatory mecha
Probab=47.67 E-value=35 Score=25.42 Aligned_cols=55 Identities=13% Similarity=0.030 Sum_probs=32.6
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhh-cceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREA-EHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~-~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+||||+.. . +...-+..++ .++.+.. +.=+++...+. .++.+.+.+.++|+.|-.
T Consensus 1 ~l~Ig~~~--~--~~~~~~~~~l-~~~~~~~P~i~i~i~~~~~-~~~~~~l~~g~~Dl~i~~ 56 (198)
T cd08441 1 RLRIAVEC--H--SCFDWLMPVL-DQFRERWPDVELDLSSGFH-FDPLPALLRGELDLVITS 56 (198)
T ss_pred CEEEEeec--c--chhhhhHHHH-HHHHHhCCCeEEEEEeCCc-hhHHHHHHcCCceEEEec
Confidence 47899866 2 3322233333 2333332 33344444454 689999999999999853
No 367
>KOG4814|consensus
Probab=47.53 E-value=48 Score=31.35 Aligned_cols=62 Identities=10% Similarity=-0.005 Sum_probs=51.6
Q ss_pred HHHHhcccCCCcHHHHHHHHHHHHhCCCCH------HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 13 FSNSSFPSPGNIPEAIQSYRTALKLKPDFP------DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 13 ~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~------~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
|-+.-+.+..+|..+++.|+..++-=|.+. ...-+|..||..+.+.+.|.+.++++.+..|+
T Consensus 359 n~A~~~F~~~~Y~~s~~~y~~Sl~~i~~D~~~~~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~~d~~ 426 (872)
T KOG4814|consen 359 NTAKKLFKMEKYVVSIRFYKLSLKDIISDNYSDRFAKIQRALQVCYLKLEQLDNAVEVYQEAEEVDRQ 426 (872)
T ss_pred HhhHHHHHHHHHHHHHHHHHHHHHhccchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhhccc
Confidence 445667788899999999999998876653 45678899999999999999999999887654
No 368
>TIGR00160 MGSA methylglyoxal synthase. Methylglyoxal synthase (MGS) generates methylglyoxal (MG), a toxic metabolite (that may also be a regulatory metabolite and) that is detoxified, prinicipally, through a pathway involving glutathione and glyoxylase I. Totemeyer, et al. (MUID:98149311) propose that, during a loss of control over carbon flux, with accumulation of phosphorylated sugars and depletion of phosphate, as might happen during a rapid shift to a richer medium, MGS aids the cell by converting some dihydroxyacetone phosphate (DHAP) to MG and phosphate. This is therefore an alternative to triosephosphate isomerase and the remainder of the glycolytic pathway for the disposal of DHAP during the stress of a sudden increase in available sugars.
Probab=47.38 E-value=23 Score=26.66 Aligned_cols=30 Identities=10% Similarity=-0.024 Sum_probs=21.8
Q ss_pred cceeecCCCCChHHHHHHHHhCCCcEEEeCC
Q psy17695 168 EHFLDLSQVPCNGKAADRIHKDGIHILVNMN 198 (222)
Q Consensus 168 ~~~~~~~~~~~~~~~a~~i~~d~iDILvDL~ 198 (222)
..+..-.--.| +|++.+|.+.+||.+|.+-
T Consensus 52 ~~~~SGplGGD-qQIga~Ia~g~id~vIFf~ 81 (143)
T TIGR00160 52 NAMLSGPMGGD-QQIGALIAEGKIDAVIFFW 81 (143)
T ss_pred EEeccCCccHH-HHHHHHHHhCCCCEEEEec
Confidence 33333333445 8999999999999999653
No 369
>KOG1070|consensus
Probab=46.45 E-value=69 Score=33.13 Aligned_cols=70 Identities=13% Similarity=0.023 Sum_probs=61.8
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC--CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD--FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~--~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
.....|..+|..+..+++-++|...+.+|++--|. +.+...--+..-.+.|+-+.+...|+..+...|+.
T Consensus 1562 q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~eHv~~IskfAqLEFk~GDaeRGRtlfEgll~ayPKR 1633 (1710)
T KOG1070|consen 1562 QTRKVWIMYADFLLRQNEAEAARELLKRALKSLPKQEHVEFISKFAQLEFKYGDAERGRTLFEGLLSAYPKR 1633 (1710)
T ss_pred chhhHHHHHHHHHhcccHHHHHHHHHHHHHhhcchhhhHHHHHHHHHHHhhcCCchhhHHHHHHHHhhCccc
Confidence 35678999999999999999999999999999999 78888888888889999999888998888877763
No 370
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=46.35 E-value=96 Score=27.97 Aligned_cols=63 Identities=17% Similarity=0.054 Sum_probs=44.1
Q ss_pred HHHHHHhcccCCCcHHHHHH-------HHHHHHh-----------CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHH
Q psy17695 11 ILFSNSSFPSPGNIPEAIQS-------YRTALKL-----------KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIV 72 (222)
Q Consensus 11 ~~~lg~~l~~~g~~~eA~~~-------~~~Al~l-----------~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~ 72 (222)
...+...|.++|-.+.|+.. |+-|+++ ..+++..|-.||...+..|+.+-|+.+|+++....
T Consensus 298 ~~~i~~fL~~~G~~e~AL~~~~D~~~rFeLAl~lg~L~~A~~~a~~~~~~~~W~~Lg~~AL~~g~~~lAe~c~~k~~d~~ 377 (443)
T PF04053_consen 298 GQSIARFLEKKGYPELALQFVTDPDHRFELALQLGNLDIALEIAKELDDPEKWKQLGDEALRQGNIELAEECYQKAKDFS 377 (443)
T ss_dssp HHHHHHHHHHTT-HHHHHHHSS-HHHHHHHHHHCT-HHHHHHHCCCCSTHHHHHHHHHHHHHTTBHHHHHHHHHHCT-HH
T ss_pred HHHHHHHHHHCCCHHHHHhhcCChHHHhHHHHhcCCHHHHHHHHHhcCcHHHHHHHHHHHHHcCCHHHHHHHHHhhcCcc
Confidence 45556666677776666643 3333333 25678899999999999999999999998876544
Q ss_pred H
Q psy17695 73 A 73 (222)
Q Consensus 73 ~ 73 (222)
.
T Consensus 378 ~ 378 (443)
T PF04053_consen 378 G 378 (443)
T ss_dssp H
T ss_pred c
Confidence 3
No 371
>KOG1258|consensus
Probab=45.15 E-value=75 Score=29.53 Aligned_cols=73 Identities=14% Similarity=0.017 Sum_probs=56.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH-cCCHHHHHHHHHHHHHHHHh
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQI-VCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~-~~~~~~a~~~~~~~~~~~~~ 74 (222)
.+.|-.-..|-..+..-.++|..+.+.+.|+++++-=|-+.+.|..+...+.. .++.+.....|+.++..+..
T Consensus 73 ~kyPl~~gyW~kfA~~E~klg~~~~s~~Vfergv~aip~SvdlW~~Y~~f~~n~~~d~~~lr~~fe~A~~~vG~ 146 (577)
T KOG1258|consen 73 SKYPLCYGYWKKFADYEYKLGNAENSVKVFERGVQAIPLSVDLWLSYLAFLKNNNGDPETLRDLFERAKSYVGL 146 (577)
T ss_pred hhCccHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhccc
Confidence 45677778888999999999999999999999999999999999887766544 35555555666666665543
No 372
>cd08431 PBP2_HupR The C-terminal substrate binding domain of LysR-type transcriptional regulator, HupR, which regulates expression of the heme uptake receptor HupA; contains the type 2 periplasmic binding fold. HupR, a member of the LysR family, activates hupA transcription under low-iron conditions in the presence of hemin. The expression of many iron-uptake genes, such as hupA, is regulated at the transcriptional level by iron and an iron-binding repressor protein called Fur (ferric uptake regulation). Under iron-abundant conditions with heme, the active Fur repressor protein represses transcription of the iron-uptake gene hupA, and prevents transcriptional activation via HupR. Under low-iron conditions with heme, the Fur repressor is inactive and transcription of the hupA is allowed. This substrate-binding domain shows significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, p
Probab=44.43 E-value=32 Score=25.59 Aligned_cols=57 Identities=11% Similarity=0.068 Sum_probs=32.3
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeCC
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMN 198 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL~ 198 (222)
.||||..+ - +..+-+..++.+-.+..=+-=+++...+. .++.+.+.+.++|+.|-..
T Consensus 1 ~l~Ig~~~--~--~~~~~l~~~l~~~~~~~P~v~i~i~~~~~-~~~~~~l~~g~~D~~i~~~ 57 (195)
T cd08431 1 ELRIAIDT--V--LPLQPLYPLIAEFYQLNKATRIRLSEEVL-GGTWDALASGRADLVIGAT 57 (195)
T ss_pred CEEEEEcc--c--cChHHHHHHHHHHHHHCCCCceEEEEecc-chHHHHHhCCCCCEEEEec
Confidence 36778544 3 34433444442222222233344444454 5888999999999998643
No 373
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=44.33 E-value=32 Score=22.73 Aligned_cols=19 Identities=11% Similarity=0.060 Sum_probs=13.2
Q ss_pred CCHHHHHHHHHHHHHHHHh
Q psy17695 56 CDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 56 ~~~~~a~~~~~~~~~~~~~ 74 (222)
|++++|..+|..++...-.
T Consensus 20 ~~y~eA~~~Y~~~i~~~~~ 38 (75)
T cd02677 20 GDYEAAFEFYRAGVDLLLK 38 (75)
T ss_pred hhHHHHHHHHHHHHHHHHH
Confidence 6777777777777766554
No 374
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=44.27 E-value=72 Score=27.83 Aligned_cols=41 Identities=12% Similarity=0.141 Sum_probs=34.9
Q ss_pred HHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 33 TALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 33 ~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
..++.+|-+.+++..++.++..+|+.+.|.+..++++-...
T Consensus 31 ~ll~~~PyHidtLlqls~v~~~~gd~~~A~~lleRALf~~e 71 (360)
T PF04910_consen 31 NLLQKNPYHIDTLLQLSEVYRQQGDHAQANDLLERALFAFE 71 (360)
T ss_pred HHHHHCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence 33566899999999999999999999999999998876543
No 375
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=43.31 E-value=1.1e+02 Score=25.38 Aligned_cols=56 Identities=18% Similarity=0.258 Sum_probs=45.0
Q ss_pred ccCCCcHHHHHHHHHHHHhC----CCCH----HHHHHHHHHHHHcC-CHHHHHHHHHHHHHHHHh
Q psy17695 19 PSPGNIPEAIQSYRTALKLK----PDFP----DAYCNLAHCLQIVC-DWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 19 ~~~g~~~eA~~~~~~Al~l~----P~~~----~a~~~l~~~~~~~~-~~~~a~~~~~~~~~~~~~ 74 (222)
.++|+++.|..+|.|+-.+. |+.. ...+|.|..+...+ ++++|...++++..++..
T Consensus 4 ~~~~~~~~A~~~~~K~~~~~~~~~~~~~~~La~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~l~~ 68 (278)
T PF08631_consen 4 WKQGDLDLAEHMYSKAKDLLNSLDPDMAEELARVCYNIGKSLLSKKDKYEEAVKWLQRAYDILEK 68 (278)
T ss_pred hhhCCHHHHHHHHHHhhhHHhcCCcHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHh
Confidence 46899999999999998765 4433 34677777778888 999999999999988754
No 376
>cd08415 PBP2_LysR_opines_like The C-terminal substrate-domain of LysR-type transcriptional regulators involved in the catabolism of opines and that of related regulators, contains the type 2 periplasmic binding fold. This CD includes the C-terminal substrate-domain of LysR-type transcriptional regulators, OccR and NocR, involved in the catabolism of opines and that of LysR for lysine biosynthesis which clustered together in phylogenetic trees. Opines, such as octopine and nopaline, are low molecular weight compounds found in plant crown gall tumors that are produced by the parasitic bacterium Agrobacterium. There are at least 30 different opines identified so far. Opines are utilized by tumor-colonizing bacteria as a source of carbon, nitrogen, and energy. NocR and OccR belong to the family of LysR-type transcriptional regulators that positively regulates the catabolism of nopaline and octopine, respectively. Both nopaline and octopalin are arginine derivatives. In Agrobacterium tumefa
Probab=42.90 E-value=54 Score=24.11 Aligned_cols=55 Identities=7% Similarity=-0.015 Sum_probs=32.4
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+||||... . +..+-+..++.+-.+..-+.=+++...+. .++.+.+.+.++||.|=
T Consensus 1 ~l~Ig~~~--~--~~~~~l~~~l~~~~~~~P~i~l~i~~~~~-~~~~~~l~~~~~Dl~i~ 55 (196)
T cd08415 1 TLRIAALP--A--LALSLLPRAIARFRARHPDVRISLHTLSS-STVVEAVLSGQADLGLA 55 (196)
T ss_pred CeEEEecc--c--ccccccHHHHHHHHHHCCCcEEEEEecch-HHHHHHHHcCCccEEEE
Confidence 47788766 2 34444444443333332233344444454 58889999999999884
No 377
>cd08436 PBP2_LTTR_like_3 The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold. LysR-transcriptional regulators comprise the largest family of prokaryotic transcription factor. Homologs of some of LTTRs with similar domain organizations are also found in the archaea and eukaryotic organisms. The LTTRs are composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA binding affinity of the repressor. The genes controlled by the LTTRs have diverse functi
Probab=42.63 E-value=36 Score=25.02 Aligned_cols=54 Identities=7% Similarity=-0.001 Sum_probs=32.2
Q ss_pred cceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 138 LESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
||||..+ . +..+-+..++.+-.+..=+-=+.+...+. .++.+.+.+.++|+.|-
T Consensus 2 l~Ig~~~--~--~~~~~l~~~l~~~~~~~P~v~i~i~~~~~-~~~~~~l~~~~~Dl~i~ 55 (194)
T cd08436 2 LAIGTIT--S--LAAVDLPELLARFHRRHPGVDIRLRQAGS-DDLLAAVREGRLDLAFV 55 (194)
T ss_pred EEEEeeh--H--HHHHHHHHHHHHHHHHCCCcEEEEecCCH-HHHHHHHHcCCccEEEE
Confidence 6777666 3 34444444443333333233444544454 68899999999999884
No 378
>PF02184 HAT: HAT (Half-A-TPR) repeat; InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=42.32 E-value=53 Score=17.98 Aligned_cols=26 Identities=15% Similarity=0.293 Sum_probs=20.9
Q ss_pred CcHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy17695 23 NIPEAIQSYRTALKLKPDFPDAYCNLA 49 (222)
Q Consensus 23 ~~~eA~~~~~~Al~l~P~~~~a~~~l~ 49 (222)
.++.|...|++.+...|+ +..|...+
T Consensus 2 E~dRAR~IyeR~v~~hp~-~k~WikyA 27 (32)
T PF02184_consen 2 EFDRARSIYERFVLVHPE-VKNWIKYA 27 (32)
T ss_pred hHHHHHHHHHHHHHhCCC-chHHHHHH
Confidence 578899999999999987 66666655
No 379
>cd07405 MPP_UshA_N Escherichia coli UshA and related proteins, N-terminal metallophosphatase domain. UshA is a bacterial periplasmic enzyme with UDP-sugar hydrolase and dinucleoside-polyphosphate hydrolase activities associated with its N-terminal metallophosphatase domain, and 5'-nucleotidase activity associated with its C-terminal domain. UshA has been studied in Escherichia coli where it is expressed from the ushA gene as an immature precursor and proteolytically cleaved to form a mature product upon export to the periplasm. UshA hydrolyzes many different nucleotides and nucleotide derivitives and has been shown to degrade external UDP-glucose to uridine, glucose 1-phosphate and phosphate for utilization by the cell. The N-terminal metallophosphatase domain belongs to a large superfamily of distantly related metallophosphatases (MPPs) that includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs
Probab=42.07 E-value=43 Score=28.04 Aligned_cols=41 Identities=10% Similarity=0.320 Sum_probs=26.8
Q ss_pred HHHHh-hhcceeecCCCCC------------hHHHHHHHHhCCCcEEEeCCCCCCC
Q psy17695 161 AKIAR-EAEHFLDLSQVPC------------NGKAADRIHKDGIHILVNMNGYTKG 203 (222)
Q Consensus 161 ~~~~~-~~~~~~~~~~~~~------------~~~~a~~i~~d~iDILvDL~G~t~~ 203 (222)
.+++. .+|-.+-++.+.. +.++|+.+...+||++|. |||--
T Consensus 167 ~~lk~~~~D~VI~lsH~G~~~~~~~~~~~~~~~~lA~~~~~~giD~Iig--GHsH~ 220 (285)
T cd07405 167 PELKQEKPDIVIAATHMGHYDNGEHGSNAPGDVEMARALPAGGLDLIVG--GHSQD 220 (285)
T ss_pred HHHHHcCCCEEEEEecccccCCccccccCchHHHHHHhcCCCCCCEEEe--CCCCc
Confidence 34564 6777777766542 025666666679999984 88743
No 380
>cd07409 MPP_CD73_N CD73 ecto-5'-nucleotidase and related proteins, N-terminal metallophosphatase domain. CD73 is a mammalian ecto-5'-nucleotidase expressed in endothelial cells and lymphocytes that catalyzes the conversion of 5'-AMP to adenosine in the final step of a pathway that generates adenosine from ATP. This pathway also includes a CD39 nucleoside triphosphate dephosphorylase that mediates the dephosphorylation of ATP to ADP and then to 5'-AMP. These enzymes all have an N-terminal metallophosphatase domain and a C-terminal 5'nucleotidase domain. The N-terminal metallophosphatase domain belongs to a large superfamily of distantly related metallophosphatases (MPPs) that includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). MPPs are functionally diverse, but all share a conserved domain with an active si
Probab=41.59 E-value=43 Score=27.91 Aligned_cols=38 Identities=18% Similarity=0.413 Sum_probs=24.8
Q ss_pred HHHHh-hhcceeecCCCCCh--HHHHHHHHhCCCcEEEeCCCCCC
Q psy17695 161 AKIAR-EAEHFLDLSQVPCN--GKAADRIHKDGIHILVNMNGYTK 202 (222)
Q Consensus 161 ~~~~~-~~~~~~~~~~~~~~--~~~a~~i~~d~iDILvDL~G~t~ 202 (222)
+++|. .+|-.+-++.+..+ .++|+++ .+||++| .|||-
T Consensus 176 ~~lr~~~~D~II~l~H~G~~~d~~la~~~--~giD~Ii--ggH~H 216 (281)
T cd07409 176 DKLKAQGVNKIIALSHSGYEVDKEIARKV--PGVDVIV--GGHSH 216 (281)
T ss_pred HHHHhcCCCEEEEEeccCchhHHHHHHcC--CCCcEEE--eCCcC
Confidence 34554 38888888877651 2455555 6899998 66643
No 381
>cd08458 PBP2_NocR The C-terminal substrate-domain of LysR-type transcriptional regulator, NocR, involved in the catabolism of nopaline, contains the type 2 periplasmic binding fold. This CD includes the C-terminal substrate-domain of LysR-type transcriptional regulator NocR, which is involved in the catabolism of nopaline. Opines are low molecular weight compounds found in plant crown gall tumors produced by the parasitic bacterium Agrobacterium. There are at least 30 different opines identified so far. Opines are utilized by tumor-colonizing bacteria as a source of carbon, nitrogen, and energy. In Agrobacterium tumefaciens, NocR regulates expression of the divergently transcribed nocB and nocR genes of the nopaline catabolism (noc) region. This substrate-binding domain shows significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, an
Probab=41.13 E-value=55 Score=24.41 Aligned_cols=56 Identities=13% Similarity=0.086 Sum_probs=32.3
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+||||+.+ + +..+-+..++.+-.+..=+.=+++...+. .++.+.+.+.+|||.|..
T Consensus 1 ~l~ig~~~-~---~~~~~l~~~l~~f~~~~P~v~i~~~~~~~-~~~~~~l~~g~~Dl~i~~ 56 (196)
T cd08458 1 SLRVACYT-A---PALSFMSGVIQTFIADRPDVSVYLDTVPS-QTVLELVSLQHYDLGISI 56 (196)
T ss_pred CeEEEEcc-h---hhhhhhHHHHHHHHHHCCCcEEEEeccCh-HHHHHHHHcCCCCEEEEe
Confidence 36788776 2 22222333332222222233445555554 689999999999998863
No 382
>PRK15490 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=40.76 E-value=98 Score=28.94 Aligned_cols=54 Identities=17% Similarity=0.087 Sum_probs=41.0
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYE 62 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~ 62 (222)
.+...+.-+..+.+.|..++|-++|++.+..+|+ ++++..+.-+...|-...|.
T Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~ 94 (578)
T PRK15490 41 TSLAMLKKAEFLHDVNETERAYALYETLIAQNND--EARYEYARRLYNTGLAKDAQ 94 (578)
T ss_pred hHHHHHHHhhhhhhhhhhHhHHHHHHHHHHhCCc--chHHHHHHHHHhhhhhhHHH
Confidence 3455667778888999999999999999999999 66666666666665544443
No 383
>KOG1310|consensus
Probab=40.63 E-value=20 Score=32.99 Aligned_cols=42 Identities=10% Similarity=-0.049 Sum_probs=36.6
Q ss_pred CCCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP 42 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~ 42 (222)
+.++|....+|+.|+.++.+++++.+|+.+...+....|.+.
T Consensus 438 lrln~s~~kah~~la~aL~el~r~~eal~~~~alq~~~Ptd~ 479 (758)
T KOG1310|consen 438 LRLNPSIQKAHFRLARALNELTRYLEALSCHWALQMSFPTDV 479 (758)
T ss_pred ccCChHHHHHHHHHHHHHHHHhhHHHhhhhHHHHhhcCchhh
Confidence 467888899999999999999999999999888777778543
No 384
>PRK15326 type III secretion system needle complex protein PrgI; Provisional
Probab=39.81 E-value=1.1e+02 Score=20.53 Aligned_cols=40 Identities=15% Similarity=0.235 Sum_probs=26.5
Q ss_pred CCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHH
Q psy17695 21 PGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEA 63 (222)
Q Consensus 21 ~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~ 63 (222)
.++..+++..-..+++.+|++|.++..+ ....++|.-+..
T Consensus 20 a~~~~~~l~~Al~~l~~~pdnP~~LA~~---Qa~l~eyn~~RN 59 (80)
T PRK15326 20 VDNLQTQVTEALDKLAAKPSDPALLAAY---QSKLSEYNLYRN 59 (80)
T ss_pred HHHHHHHHHHHHHHhhcCCCCHHHHHHH---HHHHHHHHHHHH
Confidence 3455666666677788999999986555 445556654433
No 385
>KOG2047|consensus
Probab=39.58 E-value=51 Score=31.22 Aligned_cols=58 Identities=14% Similarity=0.015 Sum_probs=31.7
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARM 65 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~ 65 (222)
...|.-++......|-++.-...|++.+.|.--.|+.-.|.+..+....-++++-..|
T Consensus 477 lkiWs~y~DleEs~gtfestk~vYdriidLriaTPqii~NyAmfLEeh~yfeesFk~Y 534 (835)
T KOG2047|consen 477 LKIWSMYADLEESLGTFESTKAVYDRIIDLRIATPQIIINYAMFLEEHKYFEESFKAY 534 (835)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhHHHHHHHHHH
Confidence 3445555555555666666666666666666555666666655554444444433333
No 386
>cd08419 PBP2_CbbR_RubisCO_like The C-terminal substrate binding of LysR-type transcriptional regulator (CbbR) of RubisCO operon, which is involved in the carbon dioxide fixation, contains the type 2 periplasmic binding fold. CbbR, a LysR-type transcriptional regulator, is required to activate expression of RubisCO, one of two unique enzymes in the Calvin-Benson-Bassham (CBB) cycle pathway. All plants, cyanobacteria, and many autotrophic bacteria use the CBB cycle to fix carbon dioxide. Thus, this cycle plays an essential role in assimilating CO2 into organic carbon on earth. The key CBB cycle enzyme is ribulose 1,5-bisphosphate carboxylase/oxygenase (RubisCO), which catalyzes the actual CO2 fixation reaction. The CO2 concentration affects the expression of RubisCO genes. It has also shown that NADPH enhances the DNA-binding ability of the CbbR. RubisCO is composed of eight large (CbbL) and eight small subunits (CbbS). The topology of this substrate-binding domain is most similar to t
Probab=39.52 E-value=43 Score=24.64 Aligned_cols=25 Identities=8% Similarity=0.112 Sum_probs=19.0
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+++...+. .++.+.+.+.++|+.|-
T Consensus 30 l~i~~~~~-~~~~~~l~~g~~Dl~i~ 54 (197)
T cd08419 30 VSLRVGNR-EQVLERLADNEDDLAIM 54 (197)
T ss_pred EEEEECCH-HHHHHHHhcCCccEEEe
Confidence 44444454 68899999999999994
No 387
>smart00854 PGA_cap Bacterial capsule synthesis protein PGA_cap. This protein is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein.
Probab=39.29 E-value=66 Score=26.01 Aligned_cols=35 Identities=11% Similarity=0.196 Sum_probs=22.3
Q ss_pred HHHHhhhcceeecCCCCC---------hHHHHHHHHhCCCcEEE
Q psy17695 161 AKIAREAEHFLDLSQVPC---------NGKAADRIHKDGIHILV 195 (222)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~---------~~~~a~~i~~d~iDILv 195 (222)
+++|+.+|..+-+..+.. ..++|+.+.+.++|+.|
T Consensus 167 ~~lr~~~D~vIv~~H~G~e~~~~p~~~~~~~A~~l~~~G~DvIi 210 (239)
T smart00854 167 ARARKKADVVIVSLHWGVEYQYEPTDEQRELAHALIDAGADVVI 210 (239)
T ss_pred HHHhccCCEEEEEecCccccCCCCCHHHHHHHHHHHHcCCCEEE
Confidence 355555665555444432 13678888888999998
No 388
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=39.07 E-value=49 Score=27.11 Aligned_cols=50 Identities=22% Similarity=0.237 Sum_probs=41.2
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHH---HHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPD---AYCNLAHC 51 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~---a~~~l~~~ 51 (222)
+.+|.++..-.-|-.+|.-.|+|+.|..-++-+-++.|++.. .+-++..|
T Consensus 29 kakPtda~~RhflfqLlcvaGdw~kAl~Ql~l~a~l~p~~t~~a~lyr~lir~ 81 (273)
T COG4455 29 KAKPTDAGGRHFLFQLLCVAGDWEKALAQLNLAATLSPQDTVGASLYRHLIRC 81 (273)
T ss_pred hcCCccccchhHHHHHHhhcchHHHHHHHHHHHhhcCcccchHHHHHHHHHHH
Confidence 567888888888889999999999999999999999999753 34444444
No 389
>KOG0529|consensus
Probab=38.95 E-value=1e+02 Score=27.50 Aligned_cols=73 Identities=18% Similarity=0.160 Sum_probs=48.2
Q ss_pred CCCCCCHHHHHHHHHhccc------------CCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCC--HHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPS------------PGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCD--WTDYEARMKK 67 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~------------~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~--~~~a~~~~~~ 67 (222)
.++|+...+|+.-=.++.. +.-+++-+.....+++.+|+...+|+.+..++.+... |..=.+..++
T Consensus 57 ~~npe~~t~wN~Rr~~~~~r~~~~~~~~~ek~~~ld~eL~~~~~~L~~npksY~aW~hR~w~L~~~p~~~~~~EL~lcek 136 (421)
T KOG0529|consen 57 EKNPEFYTVWNYRRLIIEERLTRAQLEPLEKQALLDEELKYVESALKVNPKSYGAWHHRKWVLQKNPHSDWNTELQLCEK 136 (421)
T ss_pred hhCchhhhhhhhHHHHHHHhhhhhcCCHHHHHHhhHHHHHHHHHHHHhCchhHHHHHHHHHHHHhCCCchHHHHHHHHHH
Confidence 3566666666543333221 2246677778888888889988888888888876653 6666666666
Q ss_pred HHHHHHh
Q psy17695 68 LVSIVAE 74 (222)
Q Consensus 68 ~~~~~~~ 74 (222)
+++..+.
T Consensus 137 ~L~~D~R 143 (421)
T KOG0529|consen 137 ALKQDPR 143 (421)
T ss_pred HHhcCcc
Confidence 6666554
No 390
>KOG2422|consensus
Probab=38.85 E-value=2.9e+02 Score=25.98 Aligned_cols=73 Identities=21% Similarity=0.244 Sum_probs=53.9
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHH-----HhCCCCH-------------------HHHHHHHHHHHHcCC
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTAL-----KLKPDFP-------------------DAYCNLAHCLQIVCD 57 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al-----~l~P~~~-------------------~a~~~l~~~~~~~~~ 57 (222)
...|-+.+.+..++.+...+|+.+-|.+..+|+| .+.|... .+++....-+...|.
T Consensus 278 ~ssPYHvdsLLqva~~~r~qgD~e~aadLieR~Ly~~d~a~hp~F~~~sg~cRL~y~~~eNR~FyL~l~r~m~~l~~RGC 357 (665)
T KOG2422|consen 278 ISSPYHVDSLLQVADIFRFQGDREMAADLIERGLYVFDRALHPNFIPFSGNCRLPYIYPENRQFYLALFRYMQSLAQRGC 357 (665)
T ss_pred ccCCcchhHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHhccccccccccccCcccchhhHHHHHHHHHHHHHHHhcCC
Confidence 3468888999999999999999999999888887 2233221 123333344557799
Q ss_pred HHHHHHHHHHHHHHHHh
Q psy17695 58 WTDYEARMKKLVSIVAE 74 (222)
Q Consensus 58 ~~~a~~~~~~~~~~~~~ 74 (222)
|.-|.+.++-++.+.|.
T Consensus 358 ~rTA~E~cKlllsLdp~ 374 (665)
T KOG2422|consen 358 WRTALEWCKLLLSLDPS 374 (665)
T ss_pred hHHHHHHHHHHhhcCCc
Confidence 99999988888877764
No 391
>cd08438 PBP2_CidR The C-terminal substrate binding domain of LysR-like transcriptional regulator CidR, contains the type 2 periplasmic binding fold. This CD includes the substrate binding domain of CidR which positively up-regulates the expression of cidABC operon in the presence of acetic acid produced by the metabolism of excess glucose. The CidR affects the control of murein hydrolase activity by enhancing cidABC expression in the presence of acetic acid. Thus, up-regulation of cidABC expression results in increased murein hydrolase activity. This substrate binding domain has significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they can interact with a cognate
Probab=38.15 E-value=61 Score=23.78 Aligned_cols=55 Identities=13% Similarity=0.030 Sum_probs=31.9
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+||||... . +..+-+..++.+-.++.-+-=+.+...+. .++.+.+.+.++|+.|-
T Consensus 1 ~l~Ig~~~--~--~~~~~l~~~l~~~~~~~p~v~i~i~~~~~-~~~~~~L~~~~~Dl~i~ 55 (197)
T cd08438 1 HLRLGLPP--L--GGSLLFAPLLAAFRQRYPNIELELVEYGG-KKVEQAVLNGELDVGIT 55 (197)
T ss_pred CeEEEecc--h--hhhhhcHHHHHHHHHHCcCeEEEEEEcCc-HHHHHHHHcCCCCEEEE
Confidence 36777765 2 33333334443222222233344444455 68899999999999885
No 392
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=38.13 E-value=58 Score=24.66 Aligned_cols=31 Identities=23% Similarity=0.319 Sum_probs=21.8
Q ss_pred HHHHhcccCC-CcHHHHHHHHHHHHhCCCCHH
Q psy17695 13 FSNSSFPSPG-NIPEAIQSYRTALKLKPDFPD 43 (222)
Q Consensus 13 ~lg~~l~~~g-~~~eA~~~~~~Al~l~P~~~~ 43 (222)
.+|..|...| +.++|..+|-+||..-|+-.+
T Consensus 95 ~~GE~L~~~g~~~~ega~hf~nAl~Vc~qP~~ 126 (148)
T TIGR00985 95 QLGEELMAQGTNVDEGAVHFYNALKVYPQPQQ 126 (148)
T ss_pred HHHHHHHhCCCchHHHHHHHHHHHHhCCCHHH
Confidence 4677777777 777777777777777666443
No 393
>PF04343 DUF488: Protein of unknown function, DUF488; InterPro: IPR007438 This family includes several proteins of uncharacterised function.
Probab=37.97 E-value=30 Score=24.86 Aligned_cols=17 Identities=35% Similarity=0.640 Sum_probs=15.4
Q ss_pred HHHHHHHHhCCCcEEEe
Q psy17695 180 GKAADRIHKDGIHILVN 196 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvD 196 (222)
+++.+.+..++|++|||
T Consensus 3 e~f~~~l~~~~i~~lVD 19 (122)
T PF04343_consen 3 ERFYDLLKKNGIRVLVD 19 (122)
T ss_pred HHHHHHHHHCCCeEEEE
Confidence 47778999999999999
No 394
>KOG2709|consensus
Probab=37.97 E-value=33 Score=30.64 Aligned_cols=30 Identities=13% Similarity=0.212 Sum_probs=26.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKL 37 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l 37 (222)
+.+.-++|.++.+++++++|+.+|+++|.+
T Consensus 22 A~~~V~~gl~~dE~~~~e~a~~~Ye~gl~~ 51 (560)
T KOG2709|consen 22 AYASVEQGLCYDEVNDWENALAMYEKGLNL 51 (560)
T ss_pred HHHHHHhhcchhhhcCHHHHHHHHHHHHHH
Confidence 345678999999999999999999999865
No 395
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=37.80 E-value=96 Score=20.77 Aligned_cols=29 Identities=17% Similarity=0.186 Sum_probs=20.5
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKL 37 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l 37 (222)
....+.|..+...|+.++|+.+|++++..
T Consensus 9 ~~~I~kaL~~dE~g~~e~Al~~Y~~gi~~ 37 (79)
T cd02679 9 FEEISKALRADEWGDKEQALAHYRKGLRE 37 (79)
T ss_pred HHHHHHHhhhhhcCCHHHHHHHHHHHHHH
Confidence 34556677777778888888888877763
No 396
>KOG1258|consensus
Probab=36.32 E-value=1.7e+02 Score=27.31 Aligned_cols=68 Identities=13% Similarity=0.035 Sum_probs=50.2
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKL-KPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l-~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
..+.|.....-+...|+.+-|-..+.++.++ .|+.+..+...+.....+|++..|...++.+..-.|+
T Consensus 330 Y~efWiky~~~m~~~~~~~~~~~~~~~~~~i~~k~~~~i~L~~a~f~e~~~n~~~A~~~lq~i~~e~pg 398 (577)
T KOG1258|consen 330 YDEFWIKYARWMESSGDVSLANNVLARACKIHVKKTPIIHLLEARFEESNGNFDDAKVILQRIESEYPG 398 (577)
T ss_pred hHHHHHHHHHHHHHcCchhHHHHHHHhhhhhcCCCCcHHHHHHHHHHHhhccHHHHHHHHHHHHhhCCc
Confidence 3567777777777778888887777777777 4677777777777777778888888887777766554
No 397
>PF05387 Chorion_3: Chorion family 3; InterPro: IPR008449 This family consists Drosophila chorion proteins S36 and S38, which are involved in chorion membrane formation. The proteins consist of a central domain and two flanking 'arms'. The central domain contains tandemly repetitive peptides, which apparently generate a secondary structure of beta-sheet strands alternating with beta-turns, most probably, forming a twisted beta-pleated sheet or beta-barrel [].
Probab=35.75 E-value=14 Score=30.15 Aligned_cols=42 Identities=21% Similarity=0.374 Sum_probs=30.7
Q ss_pred CCCCChHHHHHHHHhCCCcEE-------------EeCCCCCCCcchhhhhhCC-chhhhc
Q psy17695 174 SQVPCNGKAADRIHKDGIHIL-------------VNMNGYTKGARNEIFALRP-APIQSS 219 (222)
Q Consensus 174 ~~~~~~~~~a~~i~~d~iDIL-------------vDL~G~t~~~r~~i~a~rp-APvQv~ 219 (222)
+.++- .++|.-||.-+|.|. .=.+||..+ +|++.|| +||-|.
T Consensus 104 sSIty-pqlas~irqg~i~~v~p~~~dq~~~rsllvpsg~N~~---qii~~~Pl~PIIV~ 159 (277)
T PF05387_consen 104 SSITY-PQLASNIRQGKINVVSPQFVDQHLFRSLLVPSGQNNH---QIIIHQPLPPIIVH 159 (277)
T ss_pred hhhcc-cchhhhhhcccccccccchhhhhhhhhhccccCCCCc---cEEEecCCCCeEec
Confidence 34565 688888998888873 335666555 7899999 998764
No 398
>COG1803 MgsA Methylglyoxal synthase [Carbohydrate transport and metabolism]
Probab=35.39 E-value=65 Score=23.84 Aligned_cols=57 Identities=12% Similarity=0.028 Sum_probs=35.4
Q ss_pred ccceeeEeeccccccCCCchhhH-----------------H-HHHHhhh--cceeecCC--CCChHHHHHHHHhCCCcEE
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHIT-----------------Q-AKIAREA--EHFLDLSQ--VPCNGKAADRIHKDGIHIL 194 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~-----------------~-~~~~~~~--~~~~~~~~--~~~~~~~a~~i~~d~iDIL 194 (222)
+-||..+. .| =+--....|. | .++++.+ +..+-.|+ -.| +|+..+|.+-+||++
T Consensus 4 ~k~IALIA-HD--~kK~~lv~f~~~~~~~L~~h~L~ATGTTG~~i~~atgL~v~~~~SGPmGGD-QQiGa~Iaeg~id~l 79 (142)
T COG1803 4 RKHIALIA-HD--HKKDDLVNFVQAHKELLSRHDLYATGTTGGLIQEATGLNVHRLKSGPMGGD-QQIGALIAEGKIDVL 79 (142)
T ss_pred cceEEEEe-cc--hhHHHHHHHHHHHHHHhhhceEEEecCchHHHHHHhCCceEEeecCCCCcc-HHHHHHHhcCcceEE
Confidence 45777777 66 4444433333 1 3455443 23333333 455 899999999999999
Q ss_pred EeC
Q psy17695 195 VNM 197 (222)
Q Consensus 195 vDL 197 (222)
|.+
T Consensus 80 IFf 82 (142)
T COG1803 80 IFF 82 (142)
T ss_pred EEE
Confidence 965
No 399
>KOG4056|consensus
Probab=34.87 E-value=73 Score=23.85 Aligned_cols=34 Identities=12% Similarity=0.085 Sum_probs=27.7
Q ss_pred HHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHH
Q psy17695 12 LFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAY 45 (222)
Q Consensus 12 ~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~ 45 (222)
..+|..|..+|+.+++..++-.|+.+-|...+.+
T Consensus 85 v~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqLL 118 (143)
T KOG4056|consen 85 VQLGEELLAQGNEEEGAEHLANAIVVCGQPAQLL 118 (143)
T ss_pred HHhHHHHHHccCHHHHHHHHHHHHhhcCCHHHHH
Confidence 4688888888999999999999998888766543
No 400
>KOG0546|consensus
Probab=34.24 E-value=17 Score=31.53 Aligned_cols=65 Identities=9% Similarity=-0.088 Sum_probs=56.7
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
...+++.+-.+.+.+..|+..-.-+++.+++...+++..+..++.+.+++++.+.+..+....|+
T Consensus 277 ~~~n~~~~~lk~~~~~~a~~~~~~~~~~~~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~ 341 (372)
T KOG0546|consen 277 IRRNLAAVGLKVKGRGGARFRTNEALRDERSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPN 341 (372)
T ss_pred cccchHHhcccccCCCcceeccccccccChhhCcHHHHHHhHHHhhhchhhhHHHHHHhhccCcc
Confidence 34567888889999999999989999999999999999999999999999999888877766554
No 401
>PF10345 Cohesin_load: Cohesin loading factor; InterPro: IPR019440 Cohesin loading factor is a conserved protein that has been characterised in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes, but is dispensable in G2 when cohesion has been established. It is often referred to as Ssl3 in Schizosaccharomyces pombe (Fission yeast), and Scc4 in Saccharomyces cerevisiae (Baker's yeast). It complexes with Mis4 [].
Probab=34.01 E-value=1.6e+02 Score=27.54 Aligned_cols=61 Identities=16% Similarity=0.044 Sum_probs=49.3
Q ss_pred HHHHHHHhcccCCCcHHHHHHHHHHHHhC---CC------CHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Q psy17695 10 GILFSNSSFPSPGNIPEAIQSYRTALKLK---PD------FPDAYCNLAHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 10 ~~~~lg~~l~~~g~~~eA~~~~~~Al~l~---P~------~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
....++....-.|++..|....+.+.+.. |. .+..++..|..++..|+.+.|+..|.+..-
T Consensus 363 ~~~y~~~~~~~~~~~~~a~~~l~~~~~~~~~~~~~~~~~~~~~~~yL~gl~~q~~g~l~~A~~~y~~~~~ 432 (608)
T PF10345_consen 363 LLFYQIWCNFIRGDWSKATQELEFMRQLCQRSPSKLYESLYPLLHYLLGLYYQSTGDLEAALYQYQKPRF 432 (608)
T ss_pred HHHHHHHHHHHCcCHHHHHHHHHHHHHHHhcCccchhhhhhHHHHHHHHHHHHHcCCHHHHHHHHhhhHH
Confidence 45667777888999999999999888763 22 477889999999999999999999985443
No 402
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=33.98 E-value=1e+02 Score=22.15 Aligned_cols=40 Identities=8% Similarity=0.017 Sum_probs=30.4
Q ss_pred HHHhcccCCCcHHHHHHHHHHHHhCCCCHHH---HHHHHHHHH
Q psy17695 14 SNSSFPSPGNIPEAIQSYRTALKLKPDFPDA---YCNLAHCLQ 53 (222)
Q Consensus 14 lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a---~~~l~~~~~ 53 (222)
++.-+...|++-+|++..+..+..++++... +..-|.++.
T Consensus 2 ~A~~~~~rGnhiKAL~iied~i~~h~~~~~~~~lh~~QG~if~ 44 (111)
T PF04781_consen 2 KAKDYFARGNHIKALEIIEDLISRHGEDESSWLLHRLQGTIFY 44 (111)
T ss_pred hHHHHHHccCHHHHHHHHHHHHHHccCCCchHHHHHHHhHHHH
Confidence 3456778999999999999999999888743 444455543
No 403
>cd07381 MPP_CapA CapA and related proteins, metallophosphatase domain. CapA is one of three membrane-associated enzymes in Bacillus anthracis that is required for synthesis of gamma-polyglutamic acid (PGA), a major component of the bacterial capsule. The YwtB and PgsA proteins of Bacillus subtilis are closely related to CapA and are also included in this alignment model. CapA belongs to the metallophosphatase (MPP) superfamily. MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated with octahedral geometry by a cage of histidine, aspartate, and asparagine residues. The MPP superfamily includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). The conserved domain is a double beta-sheet sandwich with a di-metal
Probab=33.61 E-value=96 Score=24.94 Aligned_cols=35 Identities=11% Similarity=0.173 Sum_probs=21.6
Q ss_pred HHHHhhhcceeecCCCCC---------hHHHHHHHHhCCCcEEE
Q psy17695 161 AKIAREAEHFLDLSQVPC---------NGKAADRIHKDGIHILV 195 (222)
Q Consensus 161 ~~~~~~~~~~~~~~~~~~---------~~~~a~~i~~d~iDILv 195 (222)
+++|+.+|-.+-...+.. ..++|+.+.+.++|++|
T Consensus 169 ~~lr~~~D~vIv~~H~G~e~~~~p~~~~~~la~~l~~~G~D~Ii 212 (239)
T cd07381 169 AEAKKKADIVIVSLHWGVEYSYYPTPEQRELARALIDAGADLVI 212 (239)
T ss_pred HHHhhcCCEEEEEecCcccCCCCCCHHHHHHHHHHHHCCCCEEE
Confidence 345555665444444422 13677777778999999
No 404
>PF03466 LysR_substrate: LysR substrate binding domain; InterPro: IPR005119 The structure of this domain is known and is similar to the periplasmic binding proteins []. This domain is found in members of the LysR family of prokaryotic transcriptional regulatory proteins IPR000847 from INTERPRO which share sequence similarities over approximately 280 residues including a putative helix-turn-helix DNA-binding motif at their N terminus.; PDB: 3ONM_B 3FZJ_J 3FXR_B 3N6T_A 3FXQ_A 3FXU_A 3N6U_A 2QSX_B 3HO7_B 1IZ1_B ....
Probab=33.50 E-value=26 Score=26.34 Aligned_cols=61 Identities=3% Similarity=-0.014 Sum_probs=36.7
Q ss_pred hccccceeeEeeccccccCCCchhhHHHHHHhh-hcceeecCCCCChHHHHHHHHhCCCcEEEeCCCC
Q psy17695 134 YAQRLESLYKVMWDRYSQNLPVTHITQAKIARE-AEHFLDLSQVPCNGKAADRIHKDGIHILVNMNGY 200 (222)
Q Consensus 134 ~~~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~-~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL~G~ 200 (222)
..++||||... .- ..--...++ .++++. -+-=+++...+. .++.+.+.+.++||.|-..-.
T Consensus 4 ~~~~l~Ig~~~-~~---~~~~l~~~l-~~~~~~~P~i~i~~~~~~~-~~~~~~l~~g~~Dl~i~~~~~ 65 (209)
T PF03466_consen 4 ERGTLRIGASP-SF---ASSLLPPLL-AEFRERHPNIRIEIREGDS-DELIEALRSGELDLAITFGPP 65 (209)
T ss_dssp SEEEEEEEEEH-HH---HHHTHHHHH-HHHHHHSTTEEEEEEEESH-HHHHHHHHTTSSSEEEESSSS
T ss_pred cceEEEEEEEh-HH---HHHHHHHHH-HHHHHHCCCcEEEEEeccc-hhhhHHHhcccccEEEEEeec
Confidence 36789999987 32 211112222 233332 233344554454 699999999999999876654
No 405
>cd08427 PBP2_LTTR_like_2 The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold. LysR-transcriptional regulators comprise the largest family of prokaryotic transcription factor. Homologs of some of LTTRs with similar domain organizations are also found in the archaea and eukaryotic organisms. The LTTRs are composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA binding affinity of the repressor. The genes controlled by the LTTRs have diverse functi
Probab=32.98 E-value=65 Score=23.63 Aligned_cols=26 Identities=4% Similarity=0.106 Sum_probs=19.6
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+++...+. .++.+.+++.++||.|-.
T Consensus 31 l~~~~~~~-~~~~~~l~~g~~Dl~i~~ 56 (195)
T cd08427 31 VHIVPGLS-AELLARVDAGELDAAIVV 56 (195)
T ss_pred EEEEeCCc-HHHHHHHHCCCCCEEEEc
Confidence 44444454 689999999999998864
No 406
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=32.89 E-value=2.1e+02 Score=25.22 Aligned_cols=62 Identities=11% Similarity=-0.035 Sum_probs=51.5
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHh----CCCCHHHHHHHHHHHHH---cCCHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKL----KPDFPDAYCNLAHCLQI---VCDWTDYEARMKKLV 69 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l----~P~~~~a~~~l~~~~~~---~~~~~~a~~~~~~~~ 69 (222)
+....++=.+|++..+|+.-+...+..-.+ -++.+...+.++.++.. .|+-++|...+..++
T Consensus 141 ~div~~lllSyRdiqdydamI~Lve~l~~~p~~~~~~~~~i~~~yafALnRrn~~gdre~Al~il~~~l 209 (374)
T PF13281_consen 141 PDIVINLLLSYRDIQDYDAMIKLVETLEALPTCDVANQHNIKFQYAFALNRRNKPGDREKALQILLPVL 209 (374)
T ss_pred hhHHHHHHHHhhhhhhHHHHHHHHHHhhccCccchhcchHHHHHHHHHHhhcccCCCHHHHHHHHHHHH
Confidence 455667778899999999999999988888 57778888899999988 899998888776653
No 407
>cd05466 PBP2_LTTR_substrate The substrate binding domain of LysR-type transcriptional regulators (LTTRs), a member of the type 2 periplasmic binding fold protein superfamily. This model and hierarchy represent the the substrate-binding domain of the LysR-type transcriptional regulators that form the largest family of prokaryotic transcription factor. Homologs of some of LTTRs with similar domain organizations are also found in the archaea and eukaryotic organisms. The LTTRs are composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA bin
Probab=32.88 E-value=57 Score=23.55 Aligned_cols=55 Identities=9% Similarity=-0.019 Sum_probs=30.4
Q ss_pred cceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 138 LESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
||||.-. . +..+-+..++.+-.++.-+.=+++...+. .++.+.+.+.++||.|-.
T Consensus 2 l~I~~~~--~--~~~~~l~~~i~~~~~~~p~i~i~~~~~~~-~~~~~~l~~g~~D~~i~~ 56 (197)
T cd05466 2 LRIGASP--S--IAAYLLPPLLAAFRQRYPGVELSLVEGGS-SELLEALLEGELDLAIVA 56 (197)
T ss_pred eEEEech--h--hHHHHhHHHHHHHHHHCCCCEEEEEECCh-HHHHHHHHcCCceEEEEc
Confidence 6676643 3 33333444443322222222233443444 588899999999999954
No 408
>cd08446 PBP2_Chlorocatechol The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the chlorocatechol catabolism, contains the type 2 periplasmic binding fold. This CD includes the substrate binding domain of LysR-type regulators CbnR, ClcR and TfdR, which are involved in the regulation of chlorocatechol breakdown. The chlorocatechol-degradative pathway is often found in bacteria that can use chlorinated aromatic compounds as carbon and energy sources. CbnR is found in the 3-chlorobenzoate degradative bacterium Ralstonia eutropha NH9 and forms a tetramer. CbnR activates the expression of the cbnABCD genes, which are responsible for the degradation of chlorocatechol converted from 3-chlorobenzoate and are transcribed divergently from cbnR. In soil bacterium Pseudomonas putida, the 3-chlorocatechol-degradative pathway is encoded by clcABD operon, which requires the divergently transcribed clcR for activation. TfdR is involved in the activation of tf
Probab=32.83 E-value=60 Score=24.04 Aligned_cols=55 Identities=13% Similarity=0.106 Sum_probs=31.8
Q ss_pred cccceeeEeeccccccCCCchhhHHHHHHhhh-cceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 136 QRLESLYKVMWDRYSQNLPVTHITQAKIAREA-EHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 136 ~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~-~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+.||||+.+ . +...-+..++ .++++.. +.=+++...+. .++.+.+.+.++|+.|-
T Consensus 1 g~l~ig~~~--~--~~~~~l~~~i-~~~~~~~P~v~l~i~~~~~-~~~~~~l~~~~~Dl~i~ 56 (198)
T cd08446 1 GELDVGYFG--S--AILDTVPRLL-RAFLTARPDVTVSLHNMTK-DEQIEALRAGRIHIGFG 56 (198)
T ss_pred CeEEEEech--H--HHHHHHHHHH-HHHHHHCCCeEEEEeeCCH-HHHHHHHHCCCccEEEE
Confidence 358888866 3 3333333333 2333332 22244444454 57889999999999984
No 409
>cd08414 PBP2_LTTR_aromatics_like The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the catabolism of aromatic compounds and that of other related regulators, contains type 2 periplasmic binding fold. This CD includes the C-terminal substrate binding domain of LTTRs involved in degradation of aromatic compounds, such as CbnR, BenM, CatM, ClcR and TfdR, as well as that of other transcriptional regulators clustered together in phylogenetic trees, including XapR, HcaR, MprR, IlvR, BudR, AlsR, LysR, and OccR. The structural topology of this substrate-binding domain is most similar to that of the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they ca
Probab=32.54 E-value=65 Score=23.65 Aligned_cols=27 Identities=7% Similarity=0.148 Sum_probs=20.2
Q ss_pred ceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 169 HFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 169 ~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
-=+++...+. .++.+.+.+.++||.|-
T Consensus 29 i~i~i~~~~~-~~~~~~l~~~~~Dl~i~ 55 (197)
T cd08414 29 VELELREMTT-AEQLEALRAGRLDVGFV 55 (197)
T ss_pred cEEEEecCCh-HHHHHHHHcCCccEEEE
Confidence 3445554454 68999999999999884
No 410
>PRK10076 pyruvate formate lyase II activase; Provisional
Probab=32.16 E-value=52 Score=26.41 Aligned_cols=26 Identities=12% Similarity=0.235 Sum_probs=21.6
Q ss_pred HHHHHHHHhCCCcEEEeCCCCCCCcc
Q psy17695 180 GKAADRIHKDGIHILVNMNGYTKGAR 205 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvDL~G~t~~~r 205 (222)
.++.+.+++.+|++.||++|+.....
T Consensus 57 ~~l~~~~k~~gi~~~leTnG~~~~~~ 82 (213)
T PRK10076 57 TRFLQRLRLWGVSCAIETAGDAPASK 82 (213)
T ss_pred HHHHHHHHHcCCCEEEECCCCCCHHH
Confidence 37788889999999999999986543
No 411
>cd08442 PBP2_YofA_SoxR_like The C-terminal substrate binding domain of LysR-type transcriptional regulators, YofA and SoxR, contains the type 2 periplasmic binding fold. YofA is a LysR-like transcriptional regulator of cell growth in Bacillus subtillis. YofA controls cell viability and the formation of constrictions during cell division. YofaA positively regulates expression of the cell division gene ftsW, and thus is essential for cell viability during stationary-phase growth of Bacillus substilis. YofA shows significant homology to SoxR from Arthrobacter sp. TE1826. SoxR is a negative regulator for the sarcosine oxidase gene soxA. Sarcosine oxidase catalyzes the oxidative demethylation of sarcosine, which is involved in the metabolism of creatine and choline. The topology of this substrate-binding domain is most similar to that of the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides
Probab=32.03 E-value=80 Score=23.09 Aligned_cols=25 Identities=16% Similarity=0.114 Sum_probs=19.0
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+++...+. .++.+.+.+.++|+.|-
T Consensus 31 l~i~~~~~-~~~~~~l~~g~~Dl~i~ 55 (193)
T cd08442 31 LSLSTGTT-GALIQAVLEGRLDGAFV 55 (193)
T ss_pred EEEEeCCc-HHHHHHHHCCCccEEEE
Confidence 44444454 68999999999999884
No 412
>KOG2709|consensus
Probab=32.02 E-value=63 Score=28.92 Aligned_cols=33 Identities=24% Similarity=0.365 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 42 PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 42 ~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+..++|.++-++++|++|..+|++.+..+-+
T Consensus 22 A~~~V~~gl~~dE~~~~e~a~~~Ye~gl~~i~~ 54 (560)
T KOG2709|consen 22 AYASVEQGLCYDEVNDWENALAMYEKGLNLIVE 54 (560)
T ss_pred HHHHHHhhcchhhhcCHHHHHHHHHHHHHHHHh
Confidence 456789999999999999999999999987765
No 413
>PRK11716 DNA-binding transcriptional regulator IlvY; Provisional
Probab=31.96 E-value=61 Score=25.97 Aligned_cols=59 Identities=10% Similarity=0.104 Sum_probs=36.0
Q ss_pred hccccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 134 YAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 134 ~~~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
..+.||||..+ - +..+-+..++..-.+..-+-=+++...+. .++.+.+.+.++||.|-.
T Consensus 65 ~~~~l~I~~~~--~--~~~~~~~~~l~~~~~~~p~i~l~i~~~~~-~~~~~~l~~~~~D~~i~~ 123 (269)
T PRK11716 65 LSGELSLFCSV--T--AAYSHLPPILDRFRAEHPLVEIKLTTGDA-ADAVEKVQSGEADLAIAA 123 (269)
T ss_pred cCceEEEEecc--h--HHHHHHHHHHHHHHHHCCCeEEEEEECCH-HHHHHHHHCCCccEEEEe
Confidence 46789999655 3 33333333333333333333355554454 689999999999999954
No 414
>PF00448 SRP54: SRP54-type protein, GTPase domain; InterPro: IPR000897 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes [, ]. SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. Eukaryotic SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor []. In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. In archaea, the SRP complex contains 7S RNA like its eukaryotic counterpart, yet only includes two of the six protein subunits found in the eukarytic complex: SRP19 and SRP54 []. This entry represents the GTPase domain of the 54 kDa SRP54 component, a GTP-binding protein that interacts with the signal sequence when it emerges from the ribosome. SRP54 of the signal recognition particle has a three-domain structure: an N-terminal helical bundle domain, a GTPase domain, and the M-domain that binds the 7s RNA and also binds the signal sequence. The extreme C-terminal region is glycine-rich and lower in complexity and poorly conserved between species. The GTPase domain is evolutionary related to P-loop NTPase domains found in a variety of other proteins []. These proteins include Escherichia coli and Bacillus subtilis ffh protein (P48), which seems to be the prokaryotic counterpart of SRP54; signal recognition particle receptor alpha subunit (docking protein), an integral membrane GTP-binding protein which ensures, in conjunction with SRP, the correct targeting of nascent secretory proteins to the endoplasmic reticulum membrane; bacterial FtsY protein, which is believed to play a similar role to that of the docking protein in eukaryotes; the pilA protein from Neisseria gonorrhoeae, the homologue of ftsY; and bacterial flagellar biosynthesis protein flhF.; GO: 0005525 GTP binding, 0006614 SRP-dependent cotranslational protein targeting to membrane; PDB: 2OG2_A 3B9Q_A 3DM9_B 3DMD_B 3E70_C 3DM5_B 2XXA_C 2J28_9 1ZU5_B 1ZU4_A ....
Probab=31.92 E-value=2.2e+02 Score=22.39 Aligned_cols=62 Identities=15% Similarity=0.111 Sum_probs=30.2
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhh---c-ceeecCCCCChHHHHH----HHHhCCCc-EEEeCCCCCCCcc
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREA---E-HFLDLSQVPCNGKAAD----RIHKDGIH-ILVNMNGYTKGAR 205 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~---~-~~~~~~~~~~~~~~a~----~i~~d~iD-ILvDL~G~t~~~r 205 (222)
.-+|+.+| .| .||-=++ ++++..+ + .++.....++..++++ ..+.++.| ||||..|....+.
T Consensus 29 ~~~v~lis-~D-~~R~ga~-----eQL~~~a~~l~vp~~~~~~~~~~~~~~~~~l~~~~~~~~D~vlIDT~Gr~~~d~ 99 (196)
T PF00448_consen 29 GKKVALIS-AD-TYRIGAV-----EQLKTYAEILGVPFYVARTESDPAEIAREALEKFRKKGYDLVLIDTAGRSPRDE 99 (196)
T ss_dssp T--EEEEE-ES-TSSTHHH-----HHHHHHHHHHTEEEEESSTTSCHHHHHHHHHHHHHHTTSSEEEEEE-SSSSTHH
T ss_pred cccceeec-CC-CCCccHH-----HHHHHHHHHhccccchhhcchhhHHHHHHHHHHHhhcCCCEEEEecCCcchhhH
Confidence 57899999 88 2232221 2233222 2 3333443332134443 23446666 7999999776653
No 415
>cd08456 PBP2_LysR The C-terminal substrate binding domain of LysR, transcriptional regulator for lysine biosynthesis, contains the type 2 periplasmic binding fold. LysR, the transcriptional activator of lysA encoding diaminopimelate decarboxylase, catalyses the decarboxylation of diaminopimelate to produce lysine. The LysR-transcriptional regulators comprise the largest family of prokaryotic transcription factor. Homologs of some of LTTRs with similar domain organizations are also found in the archaea and eukaryotic organisms. The LTTRs are composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational
Probab=31.84 E-value=58 Score=23.99 Aligned_cols=55 Identities=5% Similarity=-0.028 Sum_probs=30.3
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+||||..+ + +...-+..++.+-.+..-+-=+++...+. .++.+.+.+.++||.|-
T Consensus 1 ~l~Ig~~~-~---~~~~~l~~~l~~~~~~~P~i~~~i~~~~~-~~~~~~l~~g~~Dl~i~ 55 (196)
T cd08456 1 ELRIAVLP-A---LSQSFLPRAIKAFLQRHPDVTISIHTRDS-PTVEQWLSAQQCDLGLV 55 (196)
T ss_pred CeEEEecH-H---HHHhhHHHHHHHHHHHCCCcEEEEEeCCH-HHHHHHHHcCCccEEEE
Confidence 36777765 2 22222233332222222233344444454 57889999999999985
No 416
>PRK13184 pknD serine/threonine-protein kinase; Reviewed
Probab=31.53 E-value=93 Score=30.93 Aligned_cols=65 Identities=9% Similarity=0.080 Sum_probs=38.0
Q ss_pred HHHHHHHHhcccC----C---CcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 9 VGILFSNSSFPSP----G---NIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 9 ~~~~~lg~~l~~~----g---~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++.+.+|.++..+ | .+++|+..|++.-. .|.-|--+...+.+|+.+++++|-..++.-+++..++
T Consensus 513 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 584 (932)
T PRK13184 513 EAQFRLGITLLEKASEQGDPRDFTQALSEFSYLHG-GVGAPLEYLGKALVYQRLGEYNEEIKSLLLALKRYSQ 584 (932)
T ss_pred HHHHHhhHHHHHHHHhcCChHHHHHHHHHHHHhcC-CCCCchHHHhHHHHHHHhhhHHHHHHHHHHHHHhcCC
Confidence 5556666555432 1 35555555555332 4555555666666677777777766666666665554
No 417
>KOG2581|consensus
Probab=31.29 E-value=46 Score=29.68 Aligned_cols=36 Identities=11% Similarity=0.123 Sum_probs=30.5
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCC
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDF 41 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~ 41 (222)
+.+...+.+|.+..-+++|..|.++|-+|+...|++
T Consensus 245 e~ARY~yY~GrIkaiqldYssA~~~~~qa~rkapq~ 280 (493)
T KOG2581|consen 245 EWARYLYYLGRIKAIQLDYSSALEYFLQALRKAPQH 280 (493)
T ss_pred HHHHHHHHHhhHHHhhcchhHHHHHHHHHHHhCcch
Confidence 456778888889888999999999999999998874
No 418
>cd07410 MPP_CpdB_N Escherichia coli CpdB and related proteins, N-terminal metallophosphatase domain. CpdB is a bacterial periplasmic protein with an N-terminal metallophosphatase domain and a C-terminal 3'-nucleotidase domain. This alignment model represents the N-terminal metallophosphatase domain, which has 2',3'-cyclic phosphodiesterase activity, hydrolyzing the 2',3'-cyclic phosphates of adenosine, guanosine, cytosine and uridine to yield nucleoside and phosphate. CpdB also hydrolyzes the chromogenic substrates p-nitrophenyl phosphate (PNPP), bis(PNPP) and p-nitrophenyl phosphorylcholine (NPPC). CpdB is thought to play a scavenging role during RNA hydrolysis by converting the non-transportable nucleotides produced by RNaseI to nucleosides which can easily enter a cell for use as a carbon source. This family also includes YfkN, a Bacillus subtilis nucleotide phosphoesterase with two copies of each of the metallophosphatase and 3'-nucleotidase domains. The N-terminal metallophos
Probab=31.04 E-value=75 Score=26.25 Aligned_cols=40 Identities=20% Similarity=0.280 Sum_probs=23.4
Q ss_pred HHHHh-hhcceeecCCCCChH---------HHH-HHHHh-CCCcEEEeCCCCCC
Q psy17695 161 AKIAR-EAEHFLDLSQVPCNG---------KAA-DRIHK-DGIHILVNMNGYTK 202 (222)
Q Consensus 161 ~~~~~-~~~~~~~~~~~~~~~---------~~a-~~i~~-d~iDILvDL~G~t~ 202 (222)
.++|. .+|-.+-+..+..+. ..+ +..++ .+|||.| .||+-
T Consensus 177 ~~lr~~~~D~IIvl~H~g~~~~~~~~~~~~~~~~~la~~~~~vD~Il--gGHsH 228 (277)
T cd07410 177 PKLRAEGADVVVVLAHGGFERDLEESLTGENAAYELAEEVPGIDAIL--TGHQH 228 (277)
T ss_pred HHHHHcCCCEEEEEecCCcCCCcccccCCccHHHHHHhcCCCCcEEE--eCCCc
Confidence 45655 678787777654311 112 23333 6999986 77774
No 419
>cd08449 PBP2_XapR The C-terminal substrate binding domain of LysR-type transcriptional regulator XapR involved in xanthosine catabolism, contains the type 2 periplasmic binding fold. In Escherichia coli, XapR is a positive regulator for the expression of xapA gene, encoding xanthosine phosphorylase, and xapB gene, encoding a polypeptide similar to the nucleotide transport protein NupG. As an operon, the expression of both xapA and xapB is fully dependent on the presence of both XapR and the inducer xanthosine. Expression of the xapR is constitutive but not auto-regulated, unlike many other LysR family proteins. This substrate-binding domain shows significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their
Probab=30.95 E-value=76 Score=23.31 Aligned_cols=54 Identities=4% Similarity=-0.189 Sum_probs=31.8
Q ss_pred cceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 138 LESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
||||..+ + +...-+..++.+-.+..-+.-+.+...+. .+..+.+.+.++|+.|-
T Consensus 2 l~ig~~~-~---~~~~~l~~~l~~~~~~~P~i~i~~~~~~~-~~~~~~l~~~~~Dl~i~ 55 (197)
T cd08449 2 LNIGMVG-S---VLWGGLGPALRRFKRQYPNVTVRFHELSP-EAQKAALLSKRIDLGFV 55 (197)
T ss_pred eEEEEec-h---HhhhhHHHHHHHHHHHCCCeEEEEEECCH-HHHHHHHhCCCccEEEe
Confidence 6777765 2 33333444443322233333444444454 68889999999999984
No 420
>smart00777 Mad3_BUB1_I Mad3/BUB1 hoMad3/BUB1 homology region 1. Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of the binding of BUB1 and MAD3 to CDC20p.
Probab=30.93 E-value=1.7e+02 Score=21.37 Aligned_cols=59 Identities=10% Similarity=0.037 Sum_probs=41.6
Q ss_pred CCCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC--CHHHHHHHHHHHHHcCCHHHHHHHHH
Q psy17695 2 MKTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD--FPDAYCNLAHCLQIVCDWTDYEARMK 66 (222)
Q Consensus 2 ~~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~--~~~a~~~l~~~~~~~~~~~~a~~~~~ 66 (222)
++||.....|...+... ++..+.|....+..-. .+..+...+..+-..|+|.+|.+.|+
T Consensus 63 knD~RyLkiWi~ya~~~------~dp~~if~~L~~~~IG~~~AlfYe~~A~~lE~~g~~~~A~~iy~ 123 (125)
T smart00777 63 KNDPRYLKIWLKYADNC------DEPRELFQFLYSKGIGTKLALFYEEWAQLLEAAGRYKKADEVYQ 123 (125)
T ss_pred cCCHHHHHHHHHHHHhc------CCHHHHHHHHHHCCcchhhHHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 45677777777777553 4466677777666544 45556667788888999999998775
No 421
>KOG2300|consensus
Probab=30.46 E-value=2.7e+02 Score=25.73 Aligned_cols=67 Identities=16% Similarity=0.208 Sum_probs=53.1
Q ss_pred CHHHHHHHHHhcccCC--CcHHHHHHHHHHHHhCCCC-HHH--HHHHHHHH-HHcCCHHHHHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPG--NIPEAIQSYRTALKLKPDF-PDA--YCNLAHCL-QIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g--~~~eA~~~~~~Al~l~P~~-~~a--~~~l~~~~-~~~~~~~~a~~~~~~~~~~~~ 73 (222)
-++++..++..++..| ++..+++|++..+...|.+ .++ +..||.++ ....+.+.|...++++..+..
T Consensus 6 va~aLlGlAe~~rt~~PPkIkk~IkClqA~~~~~is~~veart~LqLg~lL~~yT~N~elAksHLekA~~i~~ 78 (629)
T KOG2300|consen 6 VAEALLGLAEHFRTSGPPKIKKCIKCLQAIFQFQISFLVEARTHLQLGALLLRYTKNVELAKSHLEKAWLISK 78 (629)
T ss_pred HHHHHHHHHHHHhhcCChhHHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHc
Confidence 3577888999999999 9999999999999998875 344 45556554 467889999999988876544
No 422
>cd08485 PBP2_ClcR The C-terminal substrate binding domain of LysR-type transcriptional regulator ClcR involved in the chlorocatechol catabolism, contains type 2 periplasmic binding fold. In soil bacterium Pseudomonas putida, the ortho-pathways of catechol and 3-chlorocatechol are central catabolic pathways that convert aromatic and chloroaromaric compounds to tricarboxylic acid (TCA) cycle intermediates. The 3-chlorocatechol-degradative pathway is encoded by clcABD operon, which requires the divergently transcribed clcR and an intermediate of the pathway, 2-chloromuconate, as an inducer for activation. The topology of this substrate-binding domain is most similar to that of the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding th
Probab=30.40 E-value=86 Score=23.45 Aligned_cols=54 Identities=13% Similarity=0.093 Sum_probs=30.0
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhh-cceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREA-EHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~-~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
.||||... + +...-+..++ .++++.. +-=+++...+. .++.+.+++.++||.|-
T Consensus 2 ~l~ig~~~-~---~~~~~l~~~l-~~~~~~~P~i~l~~~~~~~-~~~~~~l~~~~~D~~i~ 56 (198)
T cd08485 2 ELRVAYFG-T---VVLHTLPLLL-RQLLSVAPSATVSLTQMSK-NRQIEALDAGTIDIGFG 56 (198)
T ss_pred eEEEEEec-c---chhHHHHHHH-HHHHHhCCCcEEEEEECCH-HHHHHHHHcCCccEEEe
Confidence 57888776 2 2221122222 2333332 22234444454 68999999999999774
No 423
>cd08461 PBP2_DntR_like_3 The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator similar to DntR, which is involved in the catabolism of dinitrotoluene; contains the type 2 periplasmic binding fold. This CD includes an uncharacterized LysR-type transcriptional regulator similar to DntR, NahR, and LinR, which are involved in the degradation of aromatic compounds. The transcription of the genes encoding enzymes involved in such degradation is regulated and expression of these enzymes is enhanced by inducers, which are either an intermediate in the metabolic pathway or compounds to be degraded. This substrate-binding domain shows significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytra
Probab=30.38 E-value=57 Score=24.17 Aligned_cols=55 Identities=5% Similarity=0.041 Sum_probs=32.0
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+||||... . +..+-+..++.+-.++.=+.-+++...+. .++.+.+.+.++||.|-
T Consensus 1 ~l~I~~~~--~--~~~~~l~~~l~~f~~~~P~v~i~i~~~~~-~~~~~~l~~~~~Di~i~ 55 (198)
T cd08461 1 TLVIAATD--Y--AQKAILPPLLAALRQEAPGVRVAIRDLES-DNLEAQLERGEVDLALT 55 (198)
T ss_pred CEEEEech--H--HHHHHhHHHHHHHHHHCCCcEEEEeeCCc-ccHHHHHhcCCCcEEEe
Confidence 36777543 3 33333333443322333344566654454 58889999999999884
No 424
>PF03129 HGTP_anticodon: Anticodon binding domain; InterPro: IPR004154 tRNA synthetases, or tRNA ligases are involved in protein synthesis. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases [] it is probably the anticodon binding domain [].; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding; PDB: 1KOG_B 1EVL_D 1EVK_B 1QF6_A 1FYF_B 2I4O_A 2I4M_A 2I4N_A 2I4L_A 1HC7_D ....
Probab=30.22 E-value=42 Score=22.51 Aligned_cols=38 Identities=16% Similarity=0.232 Sum_probs=30.7
Q ss_pred HHHHHHHHhCCCcEEEeCCCCCCCcchhhhhhCCchhh
Q psy17695 180 GKAADRIHKDGIHILVNMNGYTKGARNEIFALRPAPIQ 217 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvDL~G~t~~~r~~i~a~rpAPvQ 217 (222)
.++++.+++.+|.+.+|..+-+-+.++.-....=+|..
T Consensus 19 ~~l~~~L~~~gi~v~~d~~~~~~~k~~~~a~~~g~p~~ 56 (94)
T PF03129_consen 19 QELANKLRKAGIRVELDDSDKSLGKQIKYADKLGIPFI 56 (94)
T ss_dssp HHHHHHHHHTTSEEEEESSSSTHHHHHHHHHHTTESEE
T ss_pred HHHHHHHHHCCCEEEEECCCCchhHHHHHHhhcCCeEE
Confidence 47889999999999999988888877776666666654
No 425
>KOG3617|consensus
Probab=30.04 E-value=1.5e+02 Score=29.29 Aligned_cols=64 Identities=16% Similarity=0.104 Sum_probs=48.8
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHH----------HHhCCCCH----------HHHHHHHHHHHHcCCHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTA----------LKLKPDFP----------DAYCNLAHCLQIVCDWTDYEARMKK 67 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~A----------l~l~P~~~----------~a~~~l~~~~~~~~~~~~a~~~~~~ 67 (222)
-..|++.+.-+...++.+.|+++|+++ |+-+|... ..|.-+|..+-..|+.+.|...|..
T Consensus 858 r~Tyy~yA~~Lear~Di~~AleyyEK~~~hafev~rmL~e~p~~~e~Yv~~~~d~~L~~WWgqYlES~GemdaAl~~Y~~ 937 (1416)
T KOG3617|consen 858 RNTYYNYAKYLEARRDIEAALEYYEKAGVHAFEVFRMLKEYPKQIEQYVRRKRDESLYSWWGQYLESVGEMDAALSFYSS 937 (1416)
T ss_pred hhhHHHHHHHHHhhccHHHHHHHHHhcCChHHHHHHHHHhChHHHHHHHHhccchHHHHHHHHHHhcccchHHHHHHHHH
Confidence 357889999999999999999999975 33345443 3445567777788999999988887
Q ss_pred HHHH
Q psy17695 68 LVSI 71 (222)
Q Consensus 68 ~~~~ 71 (222)
+...
T Consensus 938 A~D~ 941 (1416)
T KOG3617|consen 938 AKDY 941 (1416)
T ss_pred hhhh
Confidence 7654
No 426
>cd07412 MPP_YhcR_N Bacillus subtilis YhcR endonuclease and related proteins, N-terminal metallophosphatase domain. YhcR is a Bacillus subtilis sugar-nonspecific endonuclease. It cleaves endonucleolytically to yield nucleotide 3'-monophosphate products, similar to Staphylococcus aureus micrococcal nuclease. YhcR appears to be located in the cell wall, and is thought to be a substrate for a Bacillus subtilis sortase. YhcR is the major calcium-activated nuclease of B. subtilis. The N-terminal metallophosphatase domain belongs to a large superfamily of distantly related metallophosphatases (MPPs) that includes: Mre11/SbcD-like exonucleases, Dbr1-like RNA lariat debranching enzymes, YfcE-like phosphodiesterases, purple acid phosphatases (PAPs), YbbF-like UDP-2,3-diacylglucosamine hydrolases, and acid sphingomyelinases (ASMases). MPPs are functionally diverse, but all share a conserved domain with an active site consisting of two metal ions (usually manganese, iron, or zinc) coordinated wi
Probab=29.81 E-value=95 Score=25.98 Aligned_cols=39 Identities=13% Similarity=0.263 Sum_probs=24.9
Q ss_pred HHHH-hhhcceeecCCCCChH-------------HHHHHHHh--CCCcEEEeCCCCCC
Q psy17695 161 AKIA-REAEHFLDLSQVPCNG-------------KAADRIHK--DGIHILVNMNGYTK 202 (222)
Q Consensus 161 ~~~~-~~~~~~~~~~~~~~~~-------------~~a~~i~~--d~iDILvDL~G~t~ 202 (222)
++++ +.+|..+-++.+.. + +..+.+.+ .+||++| .|||-
T Consensus 185 ~~lr~~~~D~IIvL~H~G~-~~~~~~~~~~~~~~~~~~l~~~~~~~iD~Il--gGHsH 239 (288)
T cd07412 185 PELKAGGVDAIVVLAHEGG-STKGGDDTCSAASGPIADIVNRLDPDVDVVF--AGHTH 239 (288)
T ss_pred HHHHHCCCCEEEEEeCCCC-CCCCCCccccccChhHHHHHhhcCCCCCEEE--eCccC
Confidence 3455 35888887777665 3 24555554 4899988 66663
No 427
>cd00861 ProRS_anticodon_short ProRS Prolyl-anticodon binding domain, short version found predominantly in bacteria. ProRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=29.76 E-value=77 Score=21.03 Aligned_cols=39 Identities=15% Similarity=0.203 Sum_probs=30.9
Q ss_pred HHHHHHHHhCCCcEEEeCCCCCCCcchhhhhhCCchhhh
Q psy17695 180 GKAADRIHKDGIHILVNMNGYTKGARNEIFALRPAPIQS 218 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvDL~G~t~~~r~~i~a~rpAPvQv 218 (222)
.++++..|+-|+.+.+|..+-+-+.++.-...+=+|..+
T Consensus 21 ~~la~~Lr~~g~~v~~d~~~~~l~k~i~~a~~~g~~~~i 59 (94)
T cd00861 21 EKLYAELQAAGVDVLLDDRNERPGVKFADADLIGIPYRI 59 (94)
T ss_pred HHHHHHHHHCCCEEEEECCCCCcccchhHHHhcCCCEEE
Confidence 467788889999999999888888888777777677543
No 428
>PF04190 DUF410: Protein of unknown function (DUF410) ; InterPro: IPR007317 This is a family of conserved eukaryotic proteins with undetermined function.; PDB: 3LKU_E 2WPV_G.
Probab=29.70 E-value=1.3e+02 Score=24.95 Aligned_cols=66 Identities=15% Similarity=-0.014 Sum_probs=40.6
Q ss_pred CCHHHHHHHHHhcccCCCcHHHHHHHHHH----------------HHhCCCCHHHHHHHHHH-HHHcCCHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPGNIPEAIQSYRTA----------------LKLKPDFPDAYCNLAHC-LQIVCDWTDYEARMKKL 68 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g~~~eA~~~~~~A----------------l~l~P~~~~a~~~l~~~-~~~~~~~~~a~~~~~~~ 68 (222)
.+++.|..+|..|.+.|++.+|..+|-.. .+-.|...+.+...+.. +..+++...|...+...
T Consensus 88 Gdp~LH~~~a~~~~~e~~~~~A~~Hfl~~~~~~~~~~~~ll~~~~~~~~~~e~dlfi~RaVL~yL~l~n~~~A~~~~~~f 167 (260)
T PF04190_consen 88 GDPELHHLLAEKLWKEGNYYEAERHFLLGTDPSAFAYVMLLEEWSTKGYPSEADLFIARAVLQYLCLGNLRDANELFDTF 167 (260)
T ss_dssp --HHHHHHHHHHHHHTT-HHHHHHHHHTS-HHHHHHHHHHHHHHHHHTSS--HHHHHHHHHHHHHHTTBHHHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHhhccHHHHHHHHHhcCChhHHHHHHHHHHHHHhcCCcchhHHHHHHHHHHHHhcCHHHHHHHHHHH
Confidence 57899999999999999999998877322 12234445444444433 34556777777666666
Q ss_pred HHH
Q psy17695 69 VSI 71 (222)
Q Consensus 69 ~~~ 71 (222)
.+.
T Consensus 168 ~~~ 170 (260)
T PF04190_consen 168 TSK 170 (260)
T ss_dssp HHH
T ss_pred HHH
Confidence 554
No 429
>COG4259 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=29.63 E-value=1.5e+02 Score=21.15 Aligned_cols=40 Identities=13% Similarity=0.044 Sum_probs=35.1
Q ss_pred CCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHH
Q psy17695 5 PFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDA 44 (222)
Q Consensus 5 P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a 44 (222)
+--|.+|..||.+|...|+-+.|...|+.=-++-|.+...
T Consensus 69 ~vpPG~HAhLGlLys~~G~~e~a~~eFetEKalFPES~~f 108 (121)
T COG4259 69 AVPPGYHAHLGLLYSNSGKDEQAVREFETEKALFPESGVF 108 (121)
T ss_pred CCCCcHHHHHHHHHhhcCChHHHHHHHHHhhhhCccchhH
Confidence 4457789999999999999999999999999999987643
No 430
>KOG1550|consensus
Probab=29.50 E-value=3e+02 Score=25.50 Aligned_cols=63 Identities=17% Similarity=0.046 Sum_probs=32.2
Q ss_pred CCHHHHHHHHHhcccCC---CcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHc----CCHHHHHHHHHHHHH
Q psy17695 6 FFKVGILFSNSSFPSPG---NIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIV----CDWTDYEARMKKLVS 70 (222)
Q Consensus 6 ~~~~~~~~lg~~l~~~g---~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~----~~~~~a~~~~~~~~~ 70 (222)
+++.+.+.+|.++..-. +...|..+|..|.+. .+..+.++++.|+..= -+.+.|..++.++.+
T Consensus 323 g~~~a~~~lg~~~~~g~~~~d~~~A~~yy~~Aa~~--G~~~A~~~la~~y~~G~gv~r~~~~A~~~~k~aA~ 392 (552)
T KOG1550|consen 323 GNPDAQYLLGVLYETGTKERDYRRAFEYYSLAAKA--GHILAIYRLALCYELGLGVERNLELAFAYYKKAAE 392 (552)
T ss_pred CCchHHHHHHHHHHcCCccccHHHHHHHHHHHHHc--CChHHHHHHHHHHHhCCCcCCCHHHHHHHHHHHHH
Confidence 45556666666665444 345566666555543 3455555555554431 234444445544443
No 431
>cd08420 PBP2_CysL_like C-terminal substrate binding domain of LysR-type transcriptional regulator CysL, which activates the transcription of the cysJI operon encoding sulfite reductase, contains the type 2 periplasmic binding fold. CysL, also known as YwfK, is a regular of sulfur metabolism in Bacillus subtilis. Sulfur is required for the synthesis of proteins and essential cofactors in all living organism. Sulfur can be assimilated either from inorganic sources (sulfate and thiosulfate), or from organic sources (sulfate esters, sulfamates, and sulfonates). CysL activates the transcription of the cysJI operon encoding sulfite reductase, which reduces sulfite to sulfide. Both cysL mutant and cysJI mutant are unable to grow using sulfate or sulfite as the sulfur source. Like other LysR-type regulators, CysL also negatively regulates its own transcription. In Escherichia coli, three LysR-type activators are involved in the regulation of sulfur metabolism: CysB, Cbl and MetR. The topology
Probab=29.27 E-value=1e+02 Score=22.51 Aligned_cols=26 Identities=15% Similarity=0.156 Sum_probs=18.9
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+.+...+. .++.+.+.+.++||.|-.
T Consensus 31 l~~~~~~~-~~~~~~l~~g~~D~~i~~ 56 (201)
T cd08420 31 VSLTIGNT-EEIAERVLDGEIDLGLVE 56 (201)
T ss_pred EEEEeCCc-HHHHHHHHCCCccEEEec
Confidence 33333454 689999999999998853
No 432
>cd08412 PBP2_PAO1_like The C-terminal substrate-binding domain of putative LysR-type transcriptional regulator PAO1-like, a member of the type 2 periplasmic binding fold protein superfamily. This family includes the C-terminal substrate domain of a putative LysR-type transcriptional regulator from the plant pathogen Pseudomonas aeruginosa PAO1and its closely related homologs. The LysR-type transcriptional regulators (LTTRs) are composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA binding affinity of the repressor. The genes controll
Probab=28.94 E-value=88 Score=22.96 Aligned_cols=55 Identities=2% Similarity=-0.124 Sum_probs=31.1
Q ss_pred cceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 138 LESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
||||..+ + +..+-+..++.+-.+..=+-=+++...+. .++.+.+.+.++|+.|-.
T Consensus 2 l~I~~~~-~---~~~~~l~~~l~~~~~~~P~i~l~i~~~~~-~~~~~~l~~~~~D~~i~~ 56 (198)
T cd08412 2 LRIGCFS-T---LAPYYLPGLLRRFREAYPGVEVRVVEGNQ-EELEEGLRSGELDLALTY 56 (198)
T ss_pred EEEeccc-c---cchhhhHHHHHHHHHHCCCcEEEEEECCH-HHHHHHHHcCCCcEEEEc
Confidence 6677655 2 22222333332222222233355555554 689999999999998863
No 433
>PRK13184 pknD serine/threonine-protein kinase; Reviewed
Probab=28.87 E-value=55 Score=32.45 Aligned_cols=42 Identities=21% Similarity=0.127 Sum_probs=37.2
Q ss_pred CCCCCHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHH
Q psy17695 3 KTPFFKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDA 44 (222)
Q Consensus 3 ~~P~~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a 44 (222)
..|.-|.-+...+.+|.++|+++|-+++|.-|++--|++|..
T Consensus 547 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 588 (932)
T PRK13184 547 GGVGAPLEYLGKALVYQRLGEYNEEIKSLLLALKRYSQHPEI 588 (932)
T ss_pred CCCCCchHHHhHHHHHHHhhhHHHHHHHHHHHHHhcCCCCcc
Confidence 356677888899999999999999999999999999998864
No 434
>COG2909 MalT ATP-dependent transcriptional regulator [Transcription]
Probab=28.70 E-value=2.9e+02 Score=27.26 Aligned_cols=67 Identities=19% Similarity=0.078 Sum_probs=54.7
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCH-----HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFP-----DAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~-----~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
++.-...|.+....|+.++|++..+.++..=|.+. .+...+|.+..-.|++++|..+.+.+.+....
T Consensus 458 ae~~aL~a~val~~~~~e~a~~lar~al~~L~~~~~~~r~~~~sv~~~a~~~~G~~~~Al~~~~~a~~~a~~ 529 (894)
T COG2909 458 AEFQALRAQVALNRGDPEEAEDLARLALVQLPEAAYRSRIVALSVLGEAAHIRGELTQALALMQQAEQMARQ 529 (894)
T ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHHHhcccccchhhhhhhhhhhHHHHHhchHHHHHHHHHHHHHHHHH
Confidence 34455667888899999999999999999887754 46788888888999999999888888776554
No 435
>PRK11074 putative DNA-binding transcriptional regulator; Provisional
Probab=28.68 E-value=84 Score=26.00 Aligned_cols=58 Identities=9% Similarity=-0.023 Sum_probs=34.7
Q ss_pred hccccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 134 YAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 134 ~~~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
..+.||||... . +...-...++..-.+..=+-=+++...+. .++.+.+.+.++||.|-
T Consensus 90 ~~g~l~Ig~~~--~--~~~~~l~~~l~~~~~~~p~i~i~i~~~~~-~~~~~~l~~g~~Dl~i~ 147 (300)
T PRK11074 90 WRGQLSIAVDN--I--VRPDRTRQLIVDFYRHFDDVELIIRQEVF-NGVWDALADGRVDIAIG 147 (300)
T ss_pred CCceEEEEEcC--c--cchhHHHHHHHHHHHhCCCceEEEEehhh-hHHHHHHHCCCCCEEEe
Confidence 46789999644 3 33222222222222222233355655565 68899999999999995
No 436
>cd08425 PBP2_CynR The C-terminal substrate-binding domain of the LysR-type transcriptional regulator CynR, contains the type 2 periplasmic binding fold. CynR is a LysR-like transcriptional regulator of the cyn operon, which encodes genes that allow cyanate to be used as a sole source of nitrogen. The operon includes three genes in the following order: cynT (cyanate permease), cynS (cyanase), and cynX (a protein of unknown function). CynR negatively regulates its own expression independently of cyanate. CynR binds to DNA and induces bending of DNA in the presence or absence of cyanate, but the amount of bending is decreased by cyanate. The CynR of LysR-type transcriptional regulator family is composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding
Probab=28.66 E-value=81 Score=23.27 Aligned_cols=57 Identities=12% Similarity=0.114 Sum_probs=31.2
Q ss_pred cccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 136 QRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 136 ~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+.||||... . +..+-+..++.+-.+..=+-=+.+...+. .++.+.+.+.++||.|..
T Consensus 1 g~l~Ig~~~--~--~~~~~l~~~l~~~~~~~P~v~i~i~~~~~-~~~~~~l~~g~~Dl~i~~ 57 (197)
T cd08425 1 GSLRLAMTP--T--FTAYLIGPLIDRFHARYPGIALSLREMPQ-ERIEAALADDRLDLGIAF 57 (197)
T ss_pred CeEEEEech--h--hhhhhhHHHHHHHHHHCCCcEEEEEECcH-HHHHHHHHcCCccEEEEe
Confidence 357888755 2 23222333332222222222244444454 688899999999999953
No 437
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=28.18 E-value=1.4e+02 Score=23.67 Aligned_cols=35 Identities=11% Similarity=0.142 Sum_probs=24.7
Q ss_pred HhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy17695 16 SSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHC 51 (222)
Q Consensus 16 ~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~ 51 (222)
.+..+.|.+++|++.+++... +|++..-...|..+
T Consensus 119 ~VCm~~g~Fk~A~eiLkr~~~-d~~~~~~r~kL~~I 153 (200)
T cd00280 119 AVCMENGEFKKAEEVLKRLFS-DPESQKLRMKLLMI 153 (200)
T ss_pred HHHHhcCchHHHHHHHHHHhc-CCCchhHHHHHHHH
Confidence 455677888888888888887 77776665555544
No 438
>KOG0530|consensus
Probab=28.15 E-value=2.7e+02 Score=23.61 Aligned_cols=66 Identities=11% Similarity=0.097 Sum_probs=43.6
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCC-HHHHHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCD-WTDYEARMKKLVSIVA 73 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~-~~~a~~~~~~~~~~~~ 73 (222)
-++.-..-.++.+..+-+.|++.-+.++.++|.+...|...-.++..++. ..+-.+.+.++..-.|
T Consensus 43 r~~m~YfRAI~~~~E~S~RAl~LT~d~i~lNpAnYTVW~yRr~iL~~l~~dL~~El~~l~eI~e~np 109 (318)
T KOG0530|consen 43 RDVMDYFRAIIAKNEKSPRALQLTEDAIRLNPANYTVWQYRRVILRHLMSDLNKELEYLDEIIEDNP 109 (318)
T ss_pred HHHHHHHHHHHhccccCHHHHHHHHHHHHhCcccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHhCc
Confidence 34444444556667778888888899999999998888888888776643 3333344444444433
No 439
>cd08429 PBP2_NhaR The C-terminal substrate binding domain of LysR-type transcriptional activator of the nhaA gene, encoding Na+/H+ antiporter, contains the type 2 periplasmic binding fold. NhaR is a positive regulator of the LysR family and is known to be an activator of the nhaA gene encoding a Na(+)/H(+) antiporter. In Escherichia coli, NhaA is the vital antiporter that protects against high sodium stress, and it is essential for growth in high sodium levels, while NhaB becomes essential only if NhaA is not available. The nhaA gene of nhaAR operon is induced by monovalent cations. The nhaR of the operon activates nhaAR, as well as the osmC transcription which is induced at elevated osmolarity. OsmC is transcribed from the two overlapping promoters (osmCp1 and osmP2) and that NhaR is shown to activate only the expression of osmCp1. NhaR also activates the transcription of the pgaABCD operon which is required for production of the biofilm adhesion, poly-beta-1,6-N-acetyl-d-glucosamine
Probab=27.89 E-value=76 Score=24.19 Aligned_cols=55 Identities=2% Similarity=-0.124 Sum_probs=32.4
Q ss_pred cceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 138 LESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
||||..+ + +..+-...++.+-.+..-+.=+++...+. .++.+.+.+.+||+.|-.
T Consensus 2 l~ig~~~-~---~~~~~l~~~l~~f~~~~P~v~l~i~~~~~-~~~~~~L~~~~~D~~i~~ 56 (204)
T cd08429 2 FRVGVAD-A---VPKSIAYRLLEPAMDLHEPIRLVCREGKL-EQLLADLALHRLDMVLAD 56 (204)
T ss_pred eeEEcch-h---hhHHHHHHHHHHHHHhCCCcEEEEEeCCH-HHHHHHHHcCCccEEEec
Confidence 5666655 2 23333333333333333344455665565 699999999999999943
No 440
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=27.03 E-value=94 Score=24.04 Aligned_cols=29 Identities=14% Similarity=0.008 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Q psy17695 43 DAYCNLAHCLQIVCDWTDYEARMKKLVSI 71 (222)
Q Consensus 43 ~a~~~l~~~~~~~~~~~~a~~~~~~~~~~ 71 (222)
.++..+|..+...|+.++|.++|.++...
T Consensus 37 ~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~ 65 (177)
T PF10602_consen 37 MALEDLADHYCKIGDLEEALKAYSRARDY 65 (177)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHhhh
Confidence 57889999999999999999999887653
No 441
>PF10345 Cohesin_load: Cohesin loading factor; InterPro: IPR019440 Cohesin loading factor is a conserved protein that has been characterised in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes, but is dispensable in G2 when cohesion has been established. It is often referred to as Ssl3 in Schizosaccharomyces pombe (Fission yeast), and Scc4 in Saccharomyces cerevisiae (Baker's yeast). It complexes with Mis4 [].
Probab=26.83 E-value=3.7e+02 Score=25.13 Aligned_cols=66 Identities=15% Similarity=-0.054 Sum_probs=47.0
Q ss_pred HHHHHHHHHhcc-cCCCcHHHHHHHHHHHHhC--CCCHHH----HHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFP-SPGNIPEAIQSYRTALKLK--PDFPDA----YCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~-~~g~~~eA~~~~~~Al~l~--P~~~~a----~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+.+.+|.+|. ...++++|+.++++++.+. ++..+. .+.++.++.+.+.-. |....++.++....
T Consensus 59 a~~~l~la~iL~~eT~n~~~Ae~~L~k~~~l~~~~~~~d~k~~~~~ll~~i~~~~~~~~-a~~~l~~~I~~~~~ 131 (608)
T PF10345_consen 59 ARVRLRLASILLEETENLDLAETYLEKAILLCERHRLTDLKFRCQFLLARIYFKTNPKA-ALKNLDKAIEDSET 131 (608)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccccchHHHHHHHHHHHHHHHHhcCHHH-HHHHHHHHHHHHhc
Confidence 567888998888 8999999999999999886 554443 344555555555544 77666666665543
No 442
>KOG1550|consensus
Probab=26.62 E-value=1.7e+02 Score=27.13 Aligned_cols=29 Identities=28% Similarity=0.301 Sum_probs=12.7
Q ss_pred cHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Q psy17695 24 IPEAIQSYRTALKLKPDFPDAYCNLAHCLQI 54 (222)
Q Consensus 24 ~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~ 54 (222)
.+.|...|.+|-++. ++++.+++|.++..
T Consensus 309 ~~~A~~~~~~aA~~g--~~~a~~~lg~~~~~ 337 (552)
T KOG1550|consen 309 YEKALKLYTKAAELG--NPDAQYLLGVLYET 337 (552)
T ss_pred HHHHHHHHHHHHhcC--CchHHHHHHHHHHc
Confidence 334444444444432 23344444444443
No 443
>PF04552 Sigma54_DBD: Sigma-54, DNA binding domain; InterPro: IPR007634 This DNA-binding domain is based on peptide fragmentation data. This domain is proximal to DNA in the promoter/holoenzyme complex. Furthermore, this region contains a putative helix-turn-helix motif. At the C terminus, there is a highly conserved region known as the RpoN box and is the signature of the sigma-54 proteins [].; PDB: 2AHQ_A 2O9L_A 2O8K_A.
Probab=26.35 E-value=40 Score=25.87 Aligned_cols=33 Identities=21% Similarity=0.256 Sum_probs=20.5
Q ss_pred HHHHHhhhcceeecCCCCChHHHHHHHHhCCCcE
Q psy17695 160 QAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHI 193 (222)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDI 193 (222)
..+++...+.--.-..+|| +++++.+.+.||.|
T Consensus 106 k~~i~~lI~~Ed~~~PlSD-~~i~~~L~~~gi~i 138 (160)
T PF04552_consen 106 KARIKELIEEEDKKKPLSD-QEIAELLKEEGIKI 138 (160)
T ss_dssp -HHHHHHHTTS-TTS---H-HHHHHHHTTTTS--
T ss_pred HHHHHHHHHhcCCCCCCCH-HHHHHHHHHcCCCc
Confidence 5566666665545567998 89999999999987
No 444
>KOG0890|consensus
Probab=26.08 E-value=75 Score=34.51 Aligned_cols=50 Identities=12% Similarity=0.209 Sum_probs=0.0
Q ss_pred hcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHH
Q psy17695 17 SFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMK 66 (222)
Q Consensus 17 ~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~ 66 (222)
.+...|++..|..||+++++.+|+....+..........|.++..+....
T Consensus 1458 ~~e~~g~~~da~~Cye~~~q~~p~~~~~~~g~l~sml~~~~l~t~i~~~d 1507 (2382)
T KOG0890|consen 1458 EHEASGNWADAAACYERLIQKDPDKEKHHSGVLKSMLAIQHLSTEILHLD 1507 (2382)
T ss_pred HHHhhccHHHHHHHHHHhhcCCCccccchhhHHHhhhcccchhHHHhhhc
No 445
>cd05568 PTS_IIB_bgl_like PTS_IIB_bgl_like: the PTS (phosphotransferase system) IIB domain of a family of sensory systems composed of a membrane-bound sugar-sensor (similar to BglF) and a transcription antiterminator (similar to BglG) which regulate expression of genes involved in sugar utilization. The domain architecture of the IIB-containing protein includes a region N-terminal to the IIB domain which is homologous to the BglG transcription antiterminator with an RNA-binding domain followed by two homologous domains, PRD1 and PRD2 (PTS Regulation Domains). C-terminal to the IIB domain is a domain similar to the PTS IIA domain. In this system, the BglG-like region and the IIB and IIA-like domains are all expressed together as a single multidomain protein. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include this sensory system with similarity to the bacterial
Probab=26.06 E-value=63 Score=21.03 Aligned_cols=54 Identities=6% Similarity=0.162 Sum_probs=36.9
Q ss_pred ceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeCCC
Q psy17695 139 ESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMNG 199 (222)
Q Consensus 139 RIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL~G 199 (222)
||..|+ =..+.++.++..++++.+.....+...+. .+..+ ...++.|++|=..-
T Consensus 2 kilivC-----~~G~~~s~~l~~~l~~~~~~~~~v~~~~~-~~~~~-~~~~~~DlIitT~~ 55 (85)
T cd05568 2 KALVVC-----PSGIGTSRLLKSKLKKLFPEIEIIDVISL-RELEE-VDLDDYDLIISTVP 55 (85)
T ss_pred eEEEEC-----CCCHHHHHHHHHHHHHHCCCceEEEEEeH-HHHhh-CcccCCCEEEEccc
Confidence 456666 34677888889999999876655554553 44443 35678999987664
No 446
>cd08445 PBP2_BenM_CatM_CatR The C-terminal substrate binding domain of LysR-type transcriptional regulators involved in benzoate catabolism; contains the type 2 periplasmic binding fold. This CD includes the C-terminal of LysR-type transcription regulators, BenM, CatM, and CatR, which are involved in the benzoate catabolism. The BenM and CatM are paralogs with overlapping functions. BenM responds synergistically to two effectors, benzoate and cis,cis-muconate, to activate expression of the benABCDE operon which is involved in benzoate catabolism, while CatM responses only to muconate. BenM and CatM share high protein sequence identity and bind to the operator-promoter regions that have similar DNA sequences. In Pseudomonas species, phenolic compounds are converted by different enzymes to central intermediates, such as protocatechuate and catechols. Generally, unsubstituted compounds, such as benzoate, are metabolized by an ortho-cleavage pathway. The catBCA operon encodes three enzymes
Probab=25.73 E-value=1.1e+02 Score=22.91 Aligned_cols=54 Identities=6% Similarity=0.060 Sum_probs=30.5
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhh-cceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREA-EHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~-~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
.||||..+ +- ...-+..++ .++.+.. +-=+++...+. .++.+.+.+.++|+.|-
T Consensus 2 ~l~Ig~~~-~~---~~~~l~~~l-~~~~~~~P~i~l~i~~~~~-~~~~~~l~~~~~Dl~i~ 56 (203)
T cd08445 2 TFSIGFVP-ST---LYGLLPELI-RRFRQAAPDVEIELIEMTT-VQQIEALKEGRIDVGFG 56 (203)
T ss_pred eEEEEEeh-HH---HHhHHHHHH-HHHHHHCCCeEEEEEeCCh-HHHHHHHHcCCCcEEEe
Confidence 57888776 32 222222222 2233222 22244444444 68999999999999984
No 447
>TIGR03298 argP transcriptional regulator, ArgP family. ArgP used to be known as IciA. ArgP is a positive regulator of argK. It is a negative autoregulator in presence of arginine. It competes with DnaA for oriC iteron (13-mer) binding. It activates dnaA and nrd transcription. It has been demonstrated to be part of the pho regulon (PubMed:10589831). ArgP mutants convey canavanine (an L-arginine structural homolog) sensitivity (PubMed: 15150242).
Probab=25.58 E-value=86 Score=25.72 Aligned_cols=57 Identities=18% Similarity=0.056 Sum_probs=35.5
Q ss_pred ccccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 135 AQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 135 ~~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
.+.|+||.. .+ +...-+..++.+-.++. +.-+++...+. .++.+.+.+.++||.|..
T Consensus 89 ~g~l~i~~~--~~--~~~~~l~~~l~~~~~~~-~i~i~l~~~~~-~~~~~~l~~g~~d~~i~~ 145 (292)
T TIGR03298 89 PVTLSIAVN--AD--SLATWFLPALAPVLAQE-GVLLDLVVEDQ-DHTAELLRSGEVLGAVTT 145 (292)
T ss_pred CceEEEecc--cc--hhhhhhHHHHHHHHhCC-CceEEEEeCcc-hhHHHHHhCCCceEEEec
Confidence 568999954 34 33333444443333333 43455544555 689999999999999864
No 448
>cd01422 MGS Methylglyoxal synthase catalyzes the enolization of dihydroxyacetone phosphate (DHAP) to produce methylglyoxal. The first part of the catalytic mechanism is believed to be similar to TIM (triosephosphate isomerase) in that both enzymes utilize DHAP to form an ene-diolate phosphate intermediate. In MGS, the second catalytic step is characterized by the elimination of phosphate and collapse of the enediolate to form methylglyoxal instead of reprotonation to form the isomer glyceraldehyde 3-phosphate, as in TIM. This is the first reaction in the methylglyoxal bypass of the Embden-Myerhoff glycolytic pathway and is believed to provide physiological benefits under non-ideal growth conditions in bacteria.
Probab=25.31 E-value=90 Score=22.28 Aligned_cols=33 Identities=15% Similarity=0.136 Sum_probs=24.3
Q ss_pred HHHHHHHHhCCCcEEEeCCCC-CCCc-chhhhhhC
Q psy17695 180 GKAADRIHKDGIHILVNMNGY-TKGA-RNEIFALR 212 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvDL~G~-t~~~-r~~i~a~r 212 (222)
.++.++|++.+||..|++... .... +.+.|..|
T Consensus 60 ~~i~~~i~~g~i~~VInt~~~~~~~~~~~dg~~iR 94 (115)
T cd01422 60 QQIGALIAEGEIDAVIFFRDPLTAQPHEPDVKALL 94 (115)
T ss_pred hHHHHHHHcCceeEEEEcCCCCCCCcccccHHHHH
Confidence 589999999999999999753 3333 44555544
No 449
>cd08162 MPP_PhoA_N Synechococcus sp. strain PCC 7942 PhoA and related proteins, N-terminal metallophosphatase domain. Synechococcus sp. strain PCC 7942 PhoA is a large atypical alkaline phosphatase. It is known to be transported across the inner cytoplasmic membrane and into the periplasmic space. In vivo inactivation of the gene encoding PhoA leads to a loss of extracellular, phosphate-regulated phosphatase activity, but does not appear to affect the cells capacity for phosphate uptake. PhoA may play a role in scavenging phosphate during growth of Synechococcus sp. strain PCC 7942 in its natural environment. PhoA belongs to a domain family which includes the bacterial enzyme UshA and several other related enzymes including SoxB, CpdB, YhcR, and CD73. All members have a similar domain architecture which includes an N-terminal metallophosphatase domain and a C-terminal nucleotidase domain. The N-terminal metallophosphatase domain belongs to a large superfamily of distantly relat
Probab=25.17 E-value=72 Score=27.21 Aligned_cols=37 Identities=14% Similarity=0.319 Sum_probs=22.9
Q ss_pred HHHh-hhcceeecCCCC-C--hHHHHHHHHhCCCcEEEeCCCCCC
Q psy17695 162 KIAR-EAEHFLDLSQVP-C--NGKAADRIHKDGIHILVNMNGYTK 202 (222)
Q Consensus 162 ~~~~-~~~~~~~~~~~~-~--~~~~a~~i~~d~iDILvDL~G~t~ 202 (222)
++++ .+|..+-++.+. . +.++|+.+ .+|||+| .|||-
T Consensus 202 ~Lr~~gvD~II~LsH~g~~~~d~~lA~~v--~gIDvIi--gGHsH 242 (313)
T cd08162 202 ALTAQGINKIILLSHLQQISIEQALAALL--SGVDVII--AGGSN 242 (313)
T ss_pred HHHHCCCCEEEEEecccccchHHHHHhcC--CCCCEEE--eCCCC
Confidence 3443 388888888873 1 13444443 4899998 56554
No 450
>cd08434 PBP2_GltC_like The substrate binding domain of LysR-type transcriptional regulator GltC, which activates gltA expression of glutamate synthase operon, contains type 2 periplasmic binding fold. GltC, a member of the LysR family of bacterial transcriptional factors, activates the expression of gltA gene of glutamate synthase operon and is essential for cell growth in the absence of glutamate. Glutamate synthase is a heterodimeric protein that encoded by gltA and gltB, whose expression is subject to nutritional regulation. GltC also negatively auto-regulates its own expression. This substrate-binding domain has strong homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity,
Probab=24.99 E-value=1.2e+02 Score=21.94 Aligned_cols=25 Identities=12% Similarity=0.260 Sum_probs=18.9
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+++...+. .++.+.+.+.++||.|-
T Consensus 31 i~i~~~~~-~~~~~~l~~~~~Dl~i~ 55 (195)
T cd08434 31 FELHQGST-DELLDDLKNGELDLALC 55 (195)
T ss_pred EEEecCcH-HHHHHHHHcCCccEEEE
Confidence 44444454 68899999999999985
No 451
>KOG0889|consensus
Probab=24.46 E-value=1e+02 Score=34.96 Aligned_cols=46 Identities=13% Similarity=0.226 Sum_probs=42.2
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQ 53 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~ 53 (222)
++.....|..+.++|+.++|-++|..|++++-..+.+|.+.|....
T Consensus 2812 aeff~lkG~f~~kL~~~eeAn~~fs~AvQi~~~l~KaW~~Wg~y~~ 2857 (3550)
T KOG0889|consen 2812 AEFFTLKGMFLEKLGKFEEANKAFSAAVQIDDGLGKAWAEWGKYLD 2857 (3550)
T ss_pred HHHHHhhhHHHHHhcCcchhHHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 5677888999999999999999999999999999999999998754
No 452
>COG0029 NadB Aspartate oxidase [Coenzyme metabolism]
Probab=24.39 E-value=1.2e+02 Score=27.78 Aligned_cols=41 Identities=39% Similarity=0.565 Sum_probs=30.8
Q ss_pred HHHHh-hhcceeecCCCCChH-------HHHHHHHhCCCc------------------EEEeCCCCCC
Q psy17695 161 AKIAR-EAEHFLDLSQVPCNG-------KAADRIHKDGIH------------------ILVNMNGYTK 202 (222)
Q Consensus 161 ~~~~~-~~~~~~~~~~~~~~~-------~~a~~i~~d~iD------------------ILvDL~G~t~ 202 (222)
.++++ ..|.|.|++...+ . .+.+..++.||| |.||..|-|.
T Consensus 287 ~e~~~~g~~V~LD~s~~~~-~~~~~rFP~I~~~c~~~GiD~~r~~IPV~PaaHY~mGGI~vD~~GrTs 353 (518)
T COG0029 287 AEMKRGGADVFLDISHIPG-DFFERRFPTIYAACLKAGIDPTREPIPVVPAAHYTMGGIAVDANGRTS 353 (518)
T ss_pred HHHHhcCCeEEEeccCCCc-hhhhhhCcHHHHHHHHcCCCcccCccCccchhheecccEEECCCCccc
Confidence 35555 3459999999886 4 467788999999 6778877775
No 453
>PF02571 CbiJ: Precorrin-6x reductase CbiJ/CobK; InterPro: IPR003723 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase []. In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase []. There are at least two distinct cobalamin biosynthetic pathways in bacteria []: Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway []; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis []; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii subsp. shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) []. There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents CobK and CbiJ precorrin-6x reductase (1.3.1.54 from EC). In the aerobic pathway, CobK catalyses the reduction of the macrocycle of precorrin-6X to produce precorrin-6Y; while in the anaerobic pathway CbiJ catalyses the reduction of the macrocycle of cobalt-precorrin-6X into cobalt-precorrin-6Y [, ].; GO: 0016994 precorrin-6A reductase activity, 0009236 cobalamin biosynthetic process, 0055114 oxidation-reduction process
Probab=24.32 E-value=90 Score=25.73 Aligned_cols=27 Identities=11% Similarity=0.322 Sum_probs=22.0
Q ss_pred eecCCC-CChHHHHHHHHhCCCcEEEeCC
Q psy17695 171 LDLSQV-PCNGKAADRIHKDGIHILVNMN 198 (222)
Q Consensus 171 ~~~~~~-~~~~~~a~~i~~d~iDILvDL~ 198 (222)
+.+.++ +. +++++.|++.+||.+||-+
T Consensus 47 v~~G~lg~~-~~l~~~l~~~~i~~vIDAT 74 (249)
T PF02571_consen 47 VRVGRLGDE-EGLAEFLRENGIDAVIDAT 74 (249)
T ss_pred EEECCCCCH-HHHHHHHHhCCCcEEEECC
Confidence 344556 54 7999999999999999986
No 454
>cd08430 PBP2_IlvY The C-terminal substrate binding of LysR-type transcriptional regulator IlvY, which activates the expression of ilvC gene that encoding acetohydroxy acid isomeroreductase for the biosynthesis of branched amino acids; contains the type 2 periplasmic binding fold. In Escherichia coli, IlvY is required for the regulation of ilvC gene expression that encodes acetohydroxy acid isomeroreductase (AHIR), a key enzyme in the biosynthesis of branched-chain amino acids (isoleucine, valine, and leucine). The ilvGMEDA operon genes encode remaining enzyme activities required for the biosynthesis of these amino acids. Activation of ilvC transcription by IlvY requires the additional binding of a co-inducer molecule (either alpha-acetolactate or alpha-acetohydoxybutyrate, the substrates for AHIR) to a preformed complex of IlvY protein-DNA. Like many other LysR-family members, IlvY negatively auto-regulates the transcription of its own divergently transcribed ilvY gene in an inducer-i
Probab=24.03 E-value=1.4e+02 Score=21.87 Aligned_cols=55 Identities=15% Similarity=0.067 Sum_probs=30.8
Q ss_pred cceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 138 LESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
||||..+ . +..+-...++.+-.+..=+.=+++...+. .++.+.+.+.++||.|--
T Consensus 2 l~ig~~~-~---~~~~~l~~~l~~~~~~~P~v~l~~~~~~~-~~~~~~l~~g~~Dl~i~~ 56 (199)
T cd08430 2 LSLYCSV-T---ASYSFLPPILERFRAQHPQVEIKLHTGDP-ADAIDKVLNGEADIAIAA 56 (199)
T ss_pred eEEEecc-c---cceeeccHHHHHHHHHCCCceEEEEeCCH-HHHHHHHHCCCCCEEEEe
Confidence 5677665 2 23333334442222222233344444454 688899999999998854
No 455
>cd08466 PBP2_LeuO The C-terminal substrate binding domain of LysR-type transcriptional regulator LeuO, an activator of leucine synthesis operon, contains the type 2 periplasmic binding fold. LeuO, a LysR-type transcriptional regulator, was originally identified as an activator of the leucine synthesis operon (leuABCD). Subsequently, LeuO was found to be not a specific regulator of the leu gene but a global regulator of unrelated various genes. LeuO activates bglGFB (utilization of beta-D-glucoside) and represses cadCBA (lysine decarboxylation) and dsrA (encoding a regulatory small RNA for translational control of rpoS and hns). LeuO also regulates the yjjQ-bglJ operon which coding for a LuxR-type transcription factor. In Salmonella enterica serovar Typhi, LeuO is a positive regulator of ompS1 (encoding an outer membrane), ompS2 (encoding a pathogenicity determinant), and assT, while LeuO represses the expression of OmpX and Tpx. Both osmS1 and osmS2 influence virulence in the mouse mo
Probab=23.97 E-value=89 Score=23.09 Aligned_cols=25 Identities=8% Similarity=0.105 Sum_probs=19.1
Q ss_pred eeecCCCCChHHHHHHHHhCCCcEEE
Q psy17695 170 FLDLSQVPCNGKAADRIHKDGIHILV 195 (222)
Q Consensus 170 ~~~~~~~~~~~~~a~~i~~d~iDILv 195 (222)
=+++...+. .++.+.+.+.++|+.|
T Consensus 30 ~l~~~~~~~-~~~~~~l~~g~~Dl~i 54 (200)
T cd08466 30 SLRESPSSE-EDLFEDLRLQEVDLVI 54 (200)
T ss_pred EEEEecCch-HhHHHHHHcCCccEEE
Confidence 344544554 6889999999999988
No 456
>cd08450 PBP2_HcaR The C-terminal substrate binding domain of LysR-type transcriptional regulator HcaR in involved in 3-phenylpropionic acid catabolism, contains the type2 periplasmic binding fold. HcaR, a member of the LysR family of transcriptional regulators, controls the expression of the hcA1, A2, B, C, and D operon, encoding for the 3-phenylpropionate dioxygenase complex and 3-phenylpropionate-2',3'-dihydrodiol dehydrogenase, that oxidizes 3-phenylpropionate to 3-(2,3-dihydroxyphenyl) propionate. Dioxygenases play an important role in protecting the cell against the toxic effects of dioxygen. The expression of hcaR is negatively auto-regulated, as for other members of the LysR family, and is strongly repressed in the presence of glucose. This substrate-binding domain shows significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, an
Probab=23.72 E-value=1.3e+02 Score=22.06 Aligned_cols=28 Identities=11% Similarity=0.177 Sum_probs=20.6
Q ss_pred cceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 168 EHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 168 ~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+.-+++...+. .++.+.+.+.++|+.|-
T Consensus 28 ~i~l~i~~~~~-~~~~~~l~~~~~Dl~i~ 55 (196)
T cd08450 28 DLDVELSSLFS-PQLAEALMRGKLDVAFM 55 (196)
T ss_pred CcEEEEEecCh-HHHHHHHhcCCccEEEE
Confidence 33455554454 68999999999999884
No 457
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=23.72 E-value=49 Score=29.42 Aligned_cols=30 Identities=10% Similarity=0.156 Sum_probs=26.7
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHH
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALK 36 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~ 36 (222)
+...++.+|.+|..++||.+|+..|...|-
T Consensus 163 ~is~~YyvGFaylMlrRY~DAir~f~~iL~ 192 (404)
T PF10255_consen 163 HISTYYYVGFAYLMLRRYADAIRTFSQILL 192 (404)
T ss_pred heehHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 456788999999999999999999998874
No 458
>COG1654 BirA Biotin operon repressor [Transcription]
Probab=23.58 E-value=92 Score=20.87 Aligned_cols=29 Identities=17% Similarity=0.366 Sum_probs=23.3
Q ss_pred HHHHHHHhCCCcEEEeCC-CCCCCcchhhh
Q psy17695 181 KAADRIHKDGIHILVNMN-GYTKGARNEIF 209 (222)
Q Consensus 181 ~~a~~i~~d~iDILvDL~-G~t~~~r~~i~ 209 (222)
...+.+|++|+||...-. ||+-.....+.
T Consensus 38 K~Iq~Lr~~G~~I~s~~~kGY~L~~~~~ll 67 (79)
T COG1654 38 KHIQQLREEGVDIESVRGKGYLLPQLPDLL 67 (79)
T ss_pred HHHHHHHHhCCceEecCCCceeccCccccC
Confidence 456788999999999999 99877666543
No 459
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=23.45 E-value=3.6e+02 Score=21.88 Aligned_cols=55 Identities=9% Similarity=-0.021 Sum_probs=38.9
Q ss_pred cCCCcHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 20 SPGNIPEAIQSYRTALKLKPDF------PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 20 ~~g~~~eA~~~~~~Al~l~P~~------~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
........++.+++|++.--.. ...-..+|.-+...|+|++|...++.+.....+
T Consensus 150 ~~~hs~~iI~lL~~A~~~f~~~~~~R~~~~l~~~~A~ey~~~g~~~~A~~~l~~~~~~yr~ 210 (247)
T PF11817_consen 150 GVDHSKLIIELLEKAYEQFKKYGQNRMASYLSLEMAEEYFRLGDYDKALKLLEPAASSYRR 210 (247)
T ss_pred ccchHHHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHh
Confidence 3445556677888887663221 123467788899999999999999998666554
No 460
>COG0300 DltE Short-chain dehydrogenases of various substrate specificities [General function prediction only]
Probab=23.19 E-value=82 Score=26.30 Aligned_cols=20 Identities=25% Similarity=0.423 Sum_probs=15.2
Q ss_pred HHHhC--CCcEEEeCCCCCCCc
Q psy17695 185 RIHKD--GIHILVNMNGYTKGA 204 (222)
Q Consensus 185 ~i~~d--~iDILvDL~G~t~~~ 204 (222)
.++++ .|||||+=.|+....
T Consensus 77 ~l~~~~~~IdvLVNNAG~g~~g 98 (265)
T COG0300 77 ELKERGGPIDVLVNNAGFGTFG 98 (265)
T ss_pred HHHhcCCcccEEEECCCcCCcc
Confidence 34445 899999999986654
No 461
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=23.17 E-value=53 Score=19.38 Aligned_cols=21 Identities=10% Similarity=0.245 Sum_probs=17.7
Q ss_pred ecCCCCChHHHHHHHHhCCCcE
Q psy17695 172 DLSQVPCNGKAADRIHKDGIHI 193 (222)
Q Consensus 172 ~~~~~~~~~~~a~~i~~d~iDI 193 (222)
|+..+|| +|+.+...+.|+..
T Consensus 2 d~~~LSd-~eL~~~L~~~G~~~ 22 (44)
T smart00540 2 DVDRLSD-AELRAELKQYGLPP 22 (44)
T ss_pred chhHcCH-HHHHHHHHHcCCCC
Confidence 5677998 89999999998864
No 462
>PRK11233 nitrogen assimilation transcriptional regulator; Provisional
Probab=23.13 E-value=1.1e+02 Score=25.37 Aligned_cols=60 Identities=10% Similarity=-0.001 Sum_probs=35.3
Q ss_pred hccccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 134 YAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 134 ~~~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
..++|+||... .- +...-...++..-.+..-+.-+++...+. .++.+.+...++||.|..
T Consensus 89 ~~g~l~I~~~~-~~--~~~~~~~~~l~~~~~~~p~i~l~~~~~~~-~~~~~~l~~g~~Di~i~~ 148 (305)
T PRK11233 89 LSGQVSIGLAP-GT--AASSLTMPLLQAVRAEFPGIVLYLHENSG-ATLNEKLMNGQLDMAVIY 148 (305)
T ss_pred cCceEEEEccc-ch--hhHHHHHHHHHHHHHHCCCcEEEEEECCc-HHHHHHHHCCCCCEEEEc
Confidence 35789999865 33 22222222332222222243345554555 688899999999999964
No 463
>KOG2300|consensus
Probab=22.92 E-value=3.1e+02 Score=25.39 Aligned_cols=67 Identities=10% Similarity=0.069 Sum_probs=53.1
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC------CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHh
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD------FPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAE 74 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~------~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~ 74 (222)
+.+++..|.....++++.||...+++.++..-. ..-.+..|+++....|+-.++.+..+-++.+..+
T Consensus 445 a~~~~v~glfaf~qn~lnEaK~~l~e~Lkmanaed~~rL~a~~LvLLs~v~lslgn~~es~nmvrpamqlAkK 517 (629)
T KOG2300|consen 445 ASILYVYGLFAFKQNDLNEAKRFLRETLKMANAEDLNRLTACSLVLLSHVFLSLGNTVESRNMVRPAMQLAKK 517 (629)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHhcchHHHHhccchHHHHHhc
Confidence 467788899999999999999999999998611 1224667788888899999988887777776654
No 464
>cd08437 PBP2_MleR The substrate binding domain of LysR-type transcriptional regulator MleR which required for malolactic fermentation, contains type 2 periplasmic binidning fold. MleR, a transcription activator of malolactic fermentation system, is found in gram-positive bacteria and belongs to the lysR family of bacterial transcriptional regulators. The mleR gene is required for the expression and induction of malolactic fermentation. This substrate binding domain has significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these domains in a manner resembling a Venus flytrap. After binding their specific ligand with high affinity, they can interact with a cognate membrane transport complex comprised of two integral membrane domains and two cytoplasmically located ATPase dom
Probab=22.87 E-value=98 Score=22.87 Aligned_cols=25 Identities=8% Similarity=0.163 Sum_probs=19.1
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+++...+. .++.+.+.+.++|+.|-
T Consensus 31 i~~~~~~~-~~~~~~l~~g~~Dl~i~ 55 (198)
T cd08437 31 IDTYEGGS-AELLEQLLQGDLDIALL 55 (198)
T ss_pred EEEEEcCH-HHHHHHHHcCCCCEEEe
Confidence 44444454 68999999999999884
No 465
>cd08418 PBP2_TdcA The C-terminal substrate binding domain of LysR-type transcriptional regulator TdcA, which is involved in the degradation of L-serine and L-threonine, contains the type 2 periplasmic binding fold. TdcA, a member of the LysR family, activates the expression of the anaerobically-regulated tdcABCDEFG operon which is involved in the degradation of L-serine and L-threonine to acetate and propionate, respectively. The tdc operon is comprised of one regulatory gene tdcA and six structural genes, tdcB to tdcG. The expression of the tdc operon is affected by several transcription factors including the cAMP receptor protein (CRP), integration host factor (IHF), histone-like protein (HU), and the operon specific regulators TdcA and TcdR. TcdR is divergently transcribed from the operon and encodes a small protein that is required for efficient expression of the Escherichia coli tdc operon. This substrate-binding domain shows significant homology to the type 2 periplasmic binding
Probab=22.86 E-value=1.1e+02 Score=22.47 Aligned_cols=25 Identities=0% Similarity=0.112 Sum_probs=19.3
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+++...+. .++.+.+.+.++|+.|-
T Consensus 31 l~i~~~~~-~~~~~~l~~g~~Dl~i~ 55 (201)
T cd08418 31 ISIYEGQL-SSLLPELRDGRLDFAIG 55 (201)
T ss_pred EEEEeCcH-HHHHHHHHcCCCcEEEE
Confidence 44444454 68999999999999995
No 466
>PRK08057 cobalt-precorrin-6x reductase; Reviewed
Probab=22.83 E-value=1e+02 Score=25.39 Aligned_cols=25 Identities=12% Similarity=0.340 Sum_probs=21.2
Q ss_pred cCCC-CChHHHHHHHHhCCCcEEEeCC
Q psy17695 173 LSQV-PCNGKAADRIHKDGIHILVNMN 198 (222)
Q Consensus 173 ~~~~-~~~~~~a~~i~~d~iDILvDL~ 198 (222)
+..+ +. +++.+.|++.+|+++||-+
T Consensus 48 ~G~l~~~-~~l~~~l~~~~i~~VIDAT 73 (248)
T PRK08057 48 VGGFGGA-EGLAAYLREEGIDLVIDAT 73 (248)
T ss_pred ECCCCCH-HHHHHHHHHCCCCEEEECC
Confidence 4557 54 7999999999999999986
No 467
>cd08451 PBP2_BudR The C-terminal substrate binding domain of LysR-type transcrptional regulator BudR, which is responsible for activation of the expression of the butanediol operon genes; contains the type 2 periplasmic binding fold. This CD represents the substrate binding domain of BudR regulator, which is responsible for induction of the butanediol formation pathway under fermentative growth conditions. Three enzymes are involved in the production of 1 mol of 2,3 butanediol from the condensation of 2 mol of pyruvate with acetolactate and acetoin as intermediates: acetolactate synthetase, acetolactate decarboxylase, and acetoin reductase. In Klebsiella terrigena, BudR regulates the expression of the budABC operon genes, encoding these three enzymes of the butanediol pathway. In many bacterial species, the use of this pathway can prevent intracellular acidification by diverting metabolism from acid production to the formation of neutral compounds (acetoin and butanediol). This substra
Probab=22.53 E-value=2.2e+02 Score=20.68 Aligned_cols=55 Identities=7% Similarity=0.036 Sum_probs=31.8
Q ss_pred cceeeEeeccccccCC-CchhhHHHHHHhhh-cceeecCCCCChHHHHHHHHhCCCcEEEeCC
Q psy17695 138 LESLYKVMWDRYSQNL-PVTHITQAKIAREA-EHFLDLSQVPCNGKAADRIHKDGIHILVNMN 198 (222)
Q Consensus 138 LRIGYvS~~d~~f~~H-~v~~l~~~~~~~~~-~~~~~~~~~~~~~~~a~~i~~d~iDILvDL~ 198 (222)
||||..+ + +..+ -...++ +++++.. +-=+++...+. .++.+.+.+.++||.|-..
T Consensus 2 l~Ig~~~-~---~~~~~~l~~~l-~~~~~~~P~i~l~i~~~~~-~~~~~~l~~g~~Dl~i~~~ 58 (199)
T cd08451 2 LRVGFTS-S---AAFHPLVPGLI-RRFREAYPDVELTLEEANT-AELLEALREGRLDAAFVRP 58 (199)
T ss_pred eeEEeee-e---eeeecccHHHH-HHHHHHCCCcEEEEecCCh-HHHHHHHHCCCccEEEEec
Confidence 6788765 3 2222 122222 3333333 33455555554 6888999999999998543
No 468
>cd08411 PBP2_OxyR The C-terminal substrate-binding domain of the LysR-type transcriptional regulator OxyR, a member of the type 2 periplasmic binding fold protein superfamily. OxyR senses hydrogen peroxide and is activated through the formation of an intramolecular disulfide bond. The OxyR activation induces the transcription of genes necessary for the bacterial defense against oxidative stress. The OxyR of LysR-type transcriptional regulator family is composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA binding affinity of the repre
Probab=22.33 E-value=1.3e+02 Score=22.22 Aligned_cols=55 Identities=5% Similarity=-0.010 Sum_probs=30.7
Q ss_pred ccceeeEeeccccccCCCchhhHHHHHHhhh-cceeecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 137 RLESLYKVMWDRYSQNLPVTHITQAKIAREA-EHFLDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 137 rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~-~~~~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+||||... . +...-+..++. ++++.. +-=+.+...+. .++.+.+.+.++|+.|-.
T Consensus 2 ~l~I~~~~--~--~~~~~l~~~l~-~~~~~~P~i~i~i~~~~~-~~~~~~l~~~~~Dl~i~~ 57 (200)
T cd08411 2 PLRLGVIP--T--IAPYLLPRLLP-ALRQAYPKLRLYLREDQT-ERLLEKLRSGELDAALLA 57 (200)
T ss_pred eEEEEecH--H--HHhhhhHHHHH-HHHHHCCCcEEEEEeCcH-HHHHHHHHcCCccEEEEe
Confidence 57777755 2 33322223332 333332 22334443454 688999999999999853
No 469
>cd06822 PLPDE_III_YBL036c_euk Pyridoxal 5-phosphate (PLP)-binding TIM barrel domain of Type III PLP-Dependent Enzymes, Eukaryotic YBL036c-like proteins. This subfamily contains mostly uncharacterized eukaryotic proteins with similarity to the yeast hypothetical protein YBL036c, which is homologous to a Pseudomonas aeruginosa gene that is co-transcribed with a known proline biosynthetic gene. YBL036c is a single domain monomeric protein with a typical TIM barrel fold. It binds the PLP cofactor and has been shown to exhibit amino acid racemase activity. The YBL036c structure is similar to the N-terminal domain of the fold type III PLP-dependent enzymes, bacterial alanine racemase and eukaryotic ornithine decarboxylase, which are two-domain dimeric proteins. The lack of a second domain in YBL036c may explain limited D- to L-alanine racemase or non-specific racemase activity. Some members of this subfamily are also referred to as PROSC (Proline synthetase co-transcribed bacterial homolog)
Probab=22.20 E-value=64 Score=26.21 Aligned_cols=60 Identities=17% Similarity=0.312 Sum_probs=44.0
Q ss_pred ccccccCCCchhhHHHHHHhhhc--ceeecCCCCChHHHHHHHHh--------CCCcEEEeCCCCCCCcchhh
Q psy17695 146 WDRYSQNLPVTHITQAKIAREAE--HFLDLSQVPCNGKAADRIHK--------DGIHILVNMNGYTKGARNEI 208 (222)
Q Consensus 146 ~d~~f~~H~v~~l~~~~~~~~~~--~~~~~~~~~~~~~~a~~i~~--------d~iDILvDL~G~t~~~r~~i 208 (222)
.| .+.|=+|+|=+.+.+..+. +|.-|+.++. .++++.|-+ -.++|||+.+--...+|.++
T Consensus 66 ~~--i~wHfIG~LQ~NK~k~i~~~~~~~~ihsvDs-~~la~~L~~~a~~~~~~~~~~VlIqVn~g~e~~K~Gv 135 (227)
T cd06822 66 ID--IKWHFIGHLQSNKVKKLLKVPNLYMVETVDS-EKLADKLNKAWEKLGEREPLKVMVQVNTSGEESKSGL 135 (227)
T ss_pred CC--ceEEEECCCchhhHHHHhccccccEEEecCC-HHHHHHHHHHHHHhcCCCCCcEEEEEeCCCCCCCCCC
Confidence 46 6667777777889998854 6666777887 777776654 25999999986665677776
No 470
>PRK09508 leuO leucine transcriptional activator; Reviewed
Probab=22.00 E-value=1.1e+02 Score=25.57 Aligned_cols=58 Identities=2% Similarity=-0.086 Sum_probs=34.6
Q ss_pred hccccceeeEeeccccccCCCchhhHHHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 134 YAQRLESLYKVMWDRYSQNLPVTHITQAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 134 ~~~rLRIGYvS~~d~~f~~H~v~~l~~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
..+.++||..+ . +..+-+..++..-.+..-+--+++...+. .++.+.+.+.++||.|-
T Consensus 110 ~~~~i~i~~~~--~--~~~~~l~~~l~~f~~~~P~i~l~i~~~~~-~~~~~~l~~g~~Di~i~ 167 (314)
T PRK09508 110 SERVFNLCICS--P--LDIRLTSQIYNRIEQIAPNIHVVFKSSLN-QNIEHQLRYQETEFVIS 167 (314)
T ss_pred cccEEEEEech--h--HHHHHHHHHHHHHHHhCCCcEEEEEeCcc-hhHHHHHhcCCccEEEe
Confidence 35689999544 3 22222222223222333344466655555 68999999999999994
No 471
>KOG4563|consensus
Probab=22.00 E-value=1.5e+02 Score=26.13 Aligned_cols=53 Identities=11% Similarity=0.072 Sum_probs=41.4
Q ss_pred HHHHHHHHhcccCCCcHHHHHHHHHHHHhCC--------CCHHHHHHHHHHHHHcCCHHHH
Q psy17695 9 VGILFSNSSFPSPGNIPEAIQSYRTALKLKP--------DFPDAYCNLAHCLQIVCDWTDY 61 (222)
Q Consensus 9 ~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P--------~~~~a~~~l~~~~~~~~~~~~a 61 (222)
.-+...|.-...++++++|.+.|..|..+-- ++..+++.+|..++.+++.+..
T Consensus 42 e~lv~~G~~~~~~~d~~~Avda~s~A~~l~~ei~Ge~~~e~~eal~~YGkslLela~~e~~ 102 (400)
T KOG4563|consen 42 EELVQAGRRALCNNDIDKAVDALSEATELSDEIYGEKHLETFEALFLYGKSLLELAKEESQ 102 (400)
T ss_pred HHHHHhhhHHHhcccHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456789999999999999999999998742 3456788888888877665543
No 472
>KOG0529|consensus
Probab=21.79 E-value=1.5e+02 Score=26.38 Aligned_cols=70 Identities=13% Similarity=-0.017 Sum_probs=51.4
Q ss_pred CCCCCCCHHHHHHHHHhcccCCC--cHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC----CHHHHHHHHHHHHH
Q psy17695 1 MMKTPFFKVGILFSNSSFPSPGN--IPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVC----DWTDYEARMKKLVS 70 (222)
Q Consensus 1 l~~~P~~~~~~~~lg~~l~~~g~--~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~----~~~~a~~~~~~~~~ 70 (222)
|+.+|+.-.+|+....++.+.+. +..-++..++++++||.+-.+|...=.+..... .+.+-++...+++.
T Consensus 102 L~~npksY~aW~hR~w~L~~~p~~~~~~EL~lcek~L~~D~RNfh~W~YRRfV~~~~~~~~~~~~~El~ftt~~I~ 177 (421)
T KOG0529|consen 102 LKVNPKSYGAWHHRKWVLQKNPHSDWNTELQLCEKALKQDPRNFHAWHYRRFVVEQAERSRNLEKEELEFTTKLIN 177 (421)
T ss_pred HHhCchhHHHHHHHHHHHHhCCCchHHHHHHHHHHHHhcCcccccchHHHHHHHHHHhcccccchhHHHHHHHHHh
Confidence 45789999999999999997764 688899999999999999999877655543322 23443444444443
No 473
>KOG4079|consensus
Probab=21.66 E-value=61 Score=24.21 Aligned_cols=40 Identities=13% Similarity=0.257 Sum_probs=30.2
Q ss_pred HHHhhhcceeecCCCCChHHHHHHHHhCCCcEEEeCCCCCC
Q psy17695 162 KIAREAEHFLDLSQVPCNGKAADRIHKDGIHILVNMNGYTK 202 (222)
Q Consensus 162 ~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDILvDL~G~t~ 202 (222)
++++.--+.+-...|+. .-.++.--+||=|+||||+|-..
T Consensus 57 QykNP~VQ~~~~knmtp-sPF~R~YlddGr~vL~Dld~~~r 96 (169)
T KOG4079|consen 57 QYKNPKVQLVKHKNMTP-SPFARAYLDDGREVLFDLDGMKR 96 (169)
T ss_pred cccCCceEEEeeccCCC-ChHHHheecCcceEEEEcccccH
Confidence 44555455566677887 68888999999999999998543
No 474
>COG2909 MalT ATP-dependent transcriptional regulator [Transcription]
Probab=21.43 E-value=4.7e+02 Score=25.90 Aligned_cols=69 Identities=12% Similarity=-0.053 Sum_probs=51.9
Q ss_pred CHHHHHHHHHhcccCCCcHHHHHHHHHHHHhCCC--C-------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHhh
Q psy17695 7 FKVGILFSNSSFPSPGNIPEAIQSYRTALKLKPD--F-------PDAYCNLAHCLQIVCDWTDYEARMKKLVSIVAEQ 75 (222)
Q Consensus 7 ~~~~~~~lg~~l~~~g~~~eA~~~~~~Al~l~P~--~-------~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~~~ 75 (222)
+|......+..+....+++||..+..++..--|. . ++.....+.+....++.+++++..+.++..+|+.
T Consensus 414 ~P~Lvll~aW~~~s~~r~~ea~~li~~l~~~l~~~~~~~~~~l~ae~~aL~a~val~~~~~e~a~~lar~al~~L~~~ 491 (894)
T COG2909 414 TPRLVLLQAWLLASQHRLAEAETLIARLEHFLKAPMHSRQGDLLAEFQALRAQVALNRGDPEEAEDLARLALVQLPEA 491 (894)
T ss_pred CchHHHHHHHHHHHccChHHHHHHHHHHHHHhCcCcccchhhHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhcccc
Confidence 4556666778888899999999999888766544 1 2334445666778899999999999998887763
No 475
>PRK12469 RNA polymerase factor sigma-54; Provisional
Probab=21.41 E-value=1.2e+02 Score=27.82 Aligned_cols=34 Identities=15% Similarity=0.012 Sum_probs=27.5
Q ss_pred HHHHHhhhcceeecCCCCChHHHHHHHHhCCCcEE
Q psy17695 160 QAKIAREAEHFLDLSQVPCNGKAADRIHKDGIHIL 194 (222)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~a~~i~~d~iDIL 194 (222)
+.+|+...+.-=.-..+|| +++++++.+.||+|.
T Consensus 426 k~~Ik~lI~~Ed~~kPLSD-~~I~~~L~~~GI~IA 459 (481)
T PRK12469 426 RALIKEMIAAEQAGDPLSD-VALAEMLAGRGVLIA 459 (481)
T ss_pred HHHHHHHHHhcCCCCCCCH-HHHHHHHHhcCCCee
Confidence 5667776666555677998 899999999999985
No 476
>PRK05234 mgsA methylglyoxal synthase; Validated
Probab=21.31 E-value=1e+02 Score=23.00 Aligned_cols=33 Identities=18% Similarity=0.213 Sum_probs=23.1
Q ss_pred HHHHHHHHhCCCcEEEeCCC-CCCCc-chhhhhhC
Q psy17695 180 GKAADRIHKDGIHILVNMNG-YTKGA-RNEIFALR 212 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvDL~G-~t~~~-r~~i~a~r 212 (222)
.+++++|++.+||.+|.+.- .+... +.+.|..|
T Consensus 65 ~~i~~~I~~g~i~lVInt~dp~~~~~~~~D~~~IR 99 (142)
T PRK05234 65 QQIGALIAEGKIDMLIFFRDPLTAQPHDPDVKALL 99 (142)
T ss_pred hhHHHHHHcCceeEEEEecCCCCCCcccchHHHHH
Confidence 58999999999999999861 23333 44444443
No 477
>TIGR00715 precor6x_red precorrin-6x reductase. This enzyme was found to be a monomer by gel filtration.
Probab=21.10 E-value=1.1e+02 Score=25.24 Aligned_cols=26 Identities=15% Similarity=0.234 Sum_probs=21.6
Q ss_pred cCCCCChHHHHHHHHhCCCcEEEeCCC
Q psy17695 173 LSQVPCNGKAADRIHKDGIHILVNMNG 199 (222)
Q Consensus 173 ~~~~~~~~~~a~~i~~d~iDILvDL~G 199 (222)
...++. +++.+.|++.+||++||-+.
T Consensus 49 ~g~l~~-~~l~~~l~~~~i~~VIDAtH 74 (256)
T TIGR00715 49 TGALDP-QELREFLKRHSIDILVDATH 74 (256)
T ss_pred ECCCCH-HHHHHHHHhcCCCEEEEcCC
Confidence 445675 78999999999999999874
No 478
>cd08417 PBP2_Nitroaromatics_like The C-terminal substrate binding domain of LysR-type transcriptional regulators that involved in the catabolism of nitroaromatic/naphthalene compounds and that of related regulators; contains the type 2 periplasmic binding fold. This CD includes the C-terminal substrate binding domain of LysR-type transcriptional regulators involved in the catabolism of dinitrotoluene and similar compounds, such as DntR, NahR, and LinR. The transcription of the genes encoding enzymes involved in such degradation is regulated and expression of these enzymes is enhanced by inducers, which are either an intermediate in the metabolic pathway or compounds to be degraded. Also included are related LysR-type regulators clustered together in phylogenetic trees, including NodD, ToxR, LeuO, SyrM, TdcA, and PnbR. This substrate-binding domain shows significant homology to the type 2 periplasmic binding proteins (PBP2), which are responsible for the uptake of a variety of substrate
Probab=20.74 E-value=1.1e+02 Score=22.62 Aligned_cols=28 Identities=7% Similarity=0.133 Sum_probs=21.6
Q ss_pred cceeecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 168 EHFLDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 168 ~~~~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+--+++...+. .++.+.+.+.++|+.|-
T Consensus 28 ~i~l~~~~~~~-~~~~~~l~~g~~D~~i~ 55 (200)
T cd08417 28 GVRLRFVPLDR-DDLEEALESGEIDLAIG 55 (200)
T ss_pred CeEEEeccCCH-HHHHHHHHcCCCCEEEe
Confidence 34466665565 69999999999999984
No 479
>PRK09558 ushA bifunctional UDP-sugar hydrolase/5'-nucleotidase periplasmic precursor; Reviewed
Probab=20.57 E-value=1.1e+02 Score=28.15 Aligned_cols=40 Identities=13% Similarity=0.371 Sum_probs=26.7
Q ss_pred HHHH--hhhcceeecCCCCC------------hHHHHHHHHhCCCcEEEeCCCCCC
Q psy17695 161 AKIA--REAEHFLDLSQVPC------------NGKAADRIHKDGIHILVNMNGYTK 202 (222)
Q Consensus 161 ~~~~--~~~~~~~~~~~~~~------------~~~~a~~i~~d~iDILvDL~G~t~ 202 (222)
.++| ..+|..+-++.+.- +.++|+.+-..+||++| .||+-
T Consensus 202 ~~Lk~~~~~D~IV~LsH~G~~~~~~~~~~~~~d~~la~~~~~~~IDvIl--gGHsH 255 (551)
T PRK09558 202 PELKQTEKPDVIIALTHMGHYDDGEHGSNAPGDVEMARSLPAGGLDMIV--GGHSQ 255 (551)
T ss_pred HHHHhccCCCEEEEEeccccccCCccCCCCccHHHHHHhCCccCceEEE--eCCCC
Confidence 4566 45787777766442 13566666556999998 58775
No 480
>cd08433 PBP2_Nac The C-teminal substrate binding domain of LysR-like nitrogen assimilation control (NAC) protein, contains the type 2 periplasmic binding fold. The NAC is a LysR-type transcription regulator that activates expression of operons such as hut (histidine utilization) and ure (urea utilization), allowing use of non-preferred (poor) nitrogen sources, and represses expression of operons, such as glutamate dehydrogenase (gdh), allowing assimilation of the preferred nitrogen source. The expression of the nac gene is fully dependent on the nitrogen regulatory system (NTR) and the sigma54-containing RNA polymerase (sigma54-RNAP). In response to nitrogen starvation, NTR system activates the expression of nac, and NAC activates the expression of hut, ure, and put (proline utilization). NAC is not involved in the transcription of Sigma70-RNAP operons such as glnA, which directly respond by the NTR system, but activates the transcription of sigma70-RNAP dependent operons such as hut.
Probab=20.51 E-value=1.9e+02 Score=21.12 Aligned_cols=25 Identities=4% Similarity=0.106 Sum_probs=18.6
Q ss_pred ecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 172 DLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 172 ~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
.+...+. .++.+.+.+.++||.|-.
T Consensus 32 ~~~~~~~-~~~~~~l~~~~~D~~i~~ 56 (198)
T cd08433 32 RIVEGLS-GHLLEWLLNGRLDLALLY 56 (198)
T ss_pred EEEecCc-HHHHHHHhCCCCcEEEEe
Confidence 3333354 689999999999999863
No 481
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.30 E-value=4.5e+02 Score=20.92 Aligned_cols=63 Identities=10% Similarity=0.117 Sum_probs=0.0
Q ss_pred HHHHHHHHHhcccCCCcHHHHHHHHHHH---------------------------------------HhCCCCHHHHHHH
Q psy17695 8 KVGILFSNSSFPSPGNIPEAIQSYRTAL---------------------------------------KLKPDFPDAYCNL 48 (222)
Q Consensus 8 ~~~~~~lg~~l~~~g~~~eA~~~~~~Al---------------------------------------~l~P~~~~a~~~l 48 (222)
..+.+..|.++.+.|+-.+|+.+|..+- .-+|=-..+.-.|
T Consensus 94 vLA~mr~at~~a~kgdta~AV~aFdeia~dt~~P~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d~n~mR~sArEAL 173 (221)
T COG4649 94 VLARMRAATLLAQKGDTAAAVAAFDEIAADTSIPQIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGDGNPMRHSAREAL 173 (221)
T ss_pred HHHHHHHHHHHhhcccHHHHHHHHHHHhccCCCcchhhHHHHHHHHHHHhccccHHHHHHHhhhccCCCChhHHHHHHHH
Q ss_pred HHHHHHcCCHHHHHHHHHHHHH
Q psy17695 49 AHCLQIVCDWTDYEARMKKLVS 70 (222)
Q Consensus 49 ~~~~~~~~~~~~a~~~~~~~~~ 70 (222)
|....+.|++.+|...|.++..
T Consensus 174 glAa~kagd~a~A~~~F~qia~ 195 (221)
T COG4649 174 GLAAYKAGDFAKAKSWFVQIAN 195 (221)
T ss_pred hHHHHhccchHHHHHHHHHHHc
No 482
>cd00860 ThrRS_anticodon ThrRS Threonyl-anticodon binding domain. ThrRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=20.28 E-value=1e+02 Score=20.08 Aligned_cols=38 Identities=18% Similarity=0.308 Sum_probs=26.8
Q ss_pred HHHHHHHHhCCCcEEEeCCCCCCCcchhhhhhCCchhh
Q psy17695 180 GKAADRIHKDGIHILVNMNGYTKGARNEIFALRPAPIQ 217 (222)
Q Consensus 180 ~~~a~~i~~d~iDILvDL~G~t~~~r~~i~a~rpAPvQ 217 (222)
.++++.+|+.|+.+.+|+.+-+-+.++.-.-.+=+|+.
T Consensus 18 ~~~~~~Lr~~g~~v~~d~~~~~~~~~~~~a~~~g~~~~ 55 (91)
T cd00860 18 KEVAKKLSDAGIRVEVDLRNEKLGKKIREAQLQKIPYI 55 (91)
T ss_pred HHHHHHHHHCCCEEEEECCCCCHHHHHHHHHHcCCCEE
Confidence 45678888889999999887666666655555555543
No 483
>KOG3807|consensus
Probab=20.24 E-value=3.9e+02 Score=23.51 Aligned_cols=53 Identities=23% Similarity=0.184 Sum_probs=38.4
Q ss_pred ccCCCcHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Q psy17695 19 PSPGNIPEAIQSYRTALKLKPDFPDAYCNLAHCLQIVCDWTDYEARMKKLVSIVA 73 (222)
Q Consensus 19 ~~~g~~~eA~~~~~~Al~l~P~~~~a~~~l~~~~~~~~~~~~a~~~~~~~~~~~~ 73 (222)
.+-.+..+-++.-..|+++||..+.++..|+.- +.--..+|+..++++++...
T Consensus 195 WRERnp~~RI~~A~~ALeIN~eCA~AyvLLAEE--Ea~Ti~~AE~l~k~ALka~e 247 (556)
T KOG3807|consen 195 WRERNPPARIKAAYQALEINNECATAYVLLAEE--EATTIVDAERLFKQALKAGE 247 (556)
T ss_pred HHhcCcHHHHHHHHHHHhcCchhhhHHHhhhhh--hhhhHHHHHHHHHHHHHHHH
Confidence 344567788899999999999999999888753 22234567777777766443
No 484
>cd08416 PBP2_MdcR The C-terminal substrate-binding domian of LysR-type transcriptional regulator MdcR, which involved in the malonate catabolism contains the type 2 periplasmic binding fold. This family includes the C-terminal substrate binding domain of LysR-type transcriptional regulator (LTTR) MdcR that controls the expression of the malonate decarboxylase (mdc) genes. Like other members of the LTTRs, MdcR is a positive regulatory protein for its target promoter and composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins (PBP2). The PBP2 are responsible for the uptake of a variety of substrates such as phosphate, sulfate, polysaccharides, lysine/arginine/ornithine, and histidine. The PBP2 bind their ligand in the cleft between these dom
Probab=20.21 E-value=1.4e+02 Score=21.85 Aligned_cols=26 Identities=4% Similarity=0.159 Sum_probs=19.3
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEeC
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVNM 197 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvDL 197 (222)
+++...+. .++.+.+.+.++||.|-.
T Consensus 31 l~i~~~~~-~~~~~~l~~~~~Dl~i~~ 56 (199)
T cd08416 31 IELTLGSN-KDLLKKLKDGELDAILVA 56 (199)
T ss_pred EEEEEcCc-HHHHHHHhCCCCCEEEEe
Confidence 44443354 588899999999999964
No 485
>cd08421 PBP2_LTTR_like_1 The C-terminal substrate binding domain of an uncharacterized LysR-type transcriptional regulator, contains the type 2 periplasmic binding fold. LysR-transcriptional regulators comprise the largest family of prokaryotic transcription factor. Homologs of some of LTTRs with similar domain organizations are also found in the archaea and eukaryotic organisms. The LTTRs are composed of two functional domains joined by a linker helix involved in oligomerization: an N-terminal HTH (helix-turn-helix) domain, which is responsible for the DNA-binding specificity, and a C-terminal substrate-binding domain, which is structurally homologous to the type 2 periplasmic binding proteins. As also observed in the periplasmic binding proteins, the C-terminal domain of the bacterial transcriptional repressor undergoes a conformational change upon substrate binding which in turn changes the DNA binding affinity of the repressor. The genes controlled by the LTTRs have diverse functi
Probab=20.06 E-value=1.8e+02 Score=21.27 Aligned_cols=25 Identities=8% Similarity=0.137 Sum_probs=19.0
Q ss_pred eecCCCCChHHHHHHHHhCCCcEEEe
Q psy17695 171 LDLSQVPCNGKAADRIHKDGIHILVN 196 (222)
Q Consensus 171 ~~~~~~~~~~~~a~~i~~d~iDILvD 196 (222)
+.+...+. .++.+.+.+.++|+.|-
T Consensus 31 i~~~~~~~-~~~~~~l~~~~~D~~i~ 55 (198)
T cd08421 31 IDLEERLS-ADIVRAVAEGRADLGIV 55 (198)
T ss_pred EEEEecCc-HHHHHHHhcCCceEEEE
Confidence 44444454 68999999999999885
Done!