Query psy3020
Match_columns 150
No_of_seqs 103 out of 1639
Neff 10.0
Searched_HMMs 46136
Date Fri Aug 16 23:29:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3020.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3020hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK15359 type III secretion sy 99.7 8.6E-17 1.9E-21 105.3 12.5 109 2-148 29-138 (144)
2 KOG0553|consensus 99.7 1.5E-16 3.2E-21 112.5 10.2 108 4-149 88-196 (304)
3 COG3063 PilF Tfp pilus assembl 99.6 3.1E-15 6.8E-20 102.4 10.2 134 2-148 40-185 (250)
4 KOG4626|consensus 99.6 5.3E-15 1.1E-19 113.5 12.4 132 2-149 223-367 (966)
5 KOG4626|consensus 99.6 2.2E-15 4.8E-20 115.5 9.6 129 2-146 359-500 (966)
6 KOG0548|consensus 99.6 3.7E-14 8E-19 106.8 12.7 88 3-98 76-164 (539)
7 TIGR00990 3a0801s09 mitochondr 99.6 5.7E-14 1.2E-18 111.1 12.5 132 2-149 336-480 (615)
8 PLN03088 SGT1, suppressor of 99.6 8.6E-14 1.9E-18 103.5 12.2 108 3-148 8-116 (356)
9 KOG1126|consensus 99.6 6E-15 1.3E-19 113.2 6.1 132 2-149 426-570 (638)
10 TIGR00990 3a0801s09 mitochondr 99.5 3.1E-13 6.7E-18 107.0 12.2 125 2-142 370-507 (615)
11 PRK11189 lipoprotein NlpI; Pro 99.5 5.2E-13 1.1E-17 97.1 12.2 97 2-136 69-166 (296)
12 PRK10370 formate-dependent nit 99.5 1E-12 2.2E-17 90.3 11.0 36 102-137 144-179 (198)
13 TIGR02552 LcrH_SycD type III s 99.5 2.7E-12 5.9E-17 82.7 12.0 100 2-139 22-122 (135)
14 TIGR02521 type_IV_pilW type IV 99.5 1.7E-12 3.7E-17 89.5 11.2 138 2-149 36-182 (234)
15 PRK15359 type III secretion sy 99.4 8.9E-13 1.9E-17 86.3 9.1 94 16-150 12-106 (144)
16 PRK11189 lipoprotein NlpI; Pro 99.4 3.2E-12 6.8E-17 93.0 11.9 129 2-140 103-275 (296)
17 PRK12370 invasion protein regu 99.4 2.8E-12 6.1E-17 100.5 12.1 123 11-149 318-454 (553)
18 PRK10370 formate-dependent nit 99.4 3.9E-12 8.4E-17 87.4 10.5 102 10-149 52-157 (198)
19 PRK12370 invasion protein regu 99.4 4.5E-12 9.7E-17 99.3 12.2 132 2-149 343-488 (553)
20 PRK15179 Vi polysaccharide bio 99.4 4.5E-12 9.7E-17 101.0 12.2 110 2-149 91-201 (694)
21 PRK09782 bacteriophage N4 rece 99.4 5.8E-12 1.3E-16 103.7 12.6 130 3-149 548-690 (987)
22 PRK11447 cellulose synthase su 99.4 1.4E-11 3.1E-16 103.6 12.7 131 3-149 467-650 (1157)
23 PLN02789 farnesyltranstransfer 99.4 2.4E-11 5.3E-16 89.1 12.4 129 5-148 45-188 (320)
24 PF13429 TPR_15: Tetratricopep 99.4 3.5E-12 7.6E-17 91.9 7.8 48 102-149 214-261 (280)
25 KOG0548|consensus 99.3 1.1E-11 2.4E-16 93.6 10.5 107 3-147 364-471 (539)
26 PRK09782 bacteriophage N4 rece 99.3 1.5E-11 3.3E-16 101.3 12.0 127 5-148 584-723 (987)
27 PRK11447 cellulose synthase su 99.3 2.1E-11 4.6E-16 102.6 13.1 129 3-147 275-430 (1157)
28 PRK15174 Vi polysaccharide exp 99.3 2.5E-11 5.3E-16 96.9 12.6 135 3-149 218-365 (656)
29 KOG0624|consensus 99.3 3.5E-12 7.6E-17 92.4 6.7 132 4-148 196-353 (504)
30 PRK15363 pathogenicity island 99.3 3.4E-11 7.3E-16 78.9 10.5 95 2-134 40-135 (157)
31 KOG1155|consensus 99.3 2E-11 4.3E-16 91.2 10.2 131 3-149 336-479 (559)
32 KOG1125|consensus 99.3 1.4E-11 3E-16 93.8 9.5 48 102-149 464-511 (579)
33 TIGR02552 LcrH_SycD type III s 99.3 3.4E-11 7.4E-16 77.5 9.7 94 18-149 4-98 (135)
34 TIGR03302 OM_YfiO outer membra 99.3 4.4E-11 9.5E-16 84.0 11.0 134 3-149 76-216 (235)
35 TIGR02917 PEP_TPR_lipo putativ 99.3 7.9E-11 1.7E-15 95.3 13.6 132 2-149 741-884 (899)
36 TIGR02795 tol_pal_ybgF tol-pal 99.3 8.8E-11 1.9E-15 73.6 10.8 103 2-139 7-113 (119)
37 PRK15174 Vi polysaccharide exp 99.3 5.8E-11 1.3E-15 94.8 12.1 124 2-141 251-391 (656)
38 KOG1126|consensus 99.3 1.6E-11 3.4E-16 94.7 8.2 132 2-149 460-604 (638)
39 KOG1155|consensus 99.3 7.5E-11 1.6E-15 88.2 11.2 121 2-132 369-496 (559)
40 KOG0550|consensus 99.3 1.9E-11 4.2E-16 90.1 6.9 121 3-149 175-300 (486)
41 TIGR02917 PEP_TPR_lipo putativ 99.2 1.4E-10 3E-15 93.8 12.3 132 2-149 130-274 (899)
42 PRK10049 pgaA outer membrane p 99.2 1.9E-10 4.1E-15 93.4 12.3 107 3-148 55-162 (765)
43 KOG2002|consensus 99.2 3.2E-10 6.9E-15 90.8 13.2 138 2-149 204-355 (1018)
44 KOG3060|consensus 99.2 2.3E-10 4.9E-15 79.8 10.9 109 3-149 92-201 (289)
45 cd00189 TPR Tetratricopeptide 99.2 4.7E-10 1E-14 66.2 10.4 95 2-134 5-100 (100)
46 PRK15363 pathogenicity island 99.2 1.9E-10 4.2E-15 75.4 9.2 91 22-150 25-117 (157)
47 TIGR03302 OM_YfiO outer membra 99.2 3.8E-10 8.3E-15 79.2 11.4 103 2-139 38-152 (235)
48 KOG0547|consensus 99.2 1.9E-10 4E-15 86.6 9.9 129 3-147 366-513 (606)
49 PRK11788 tetratricopeptide rep 99.2 4.8E-10 1E-14 84.0 11.8 132 2-149 112-262 (389)
50 KOG2076|consensus 99.2 4.1E-10 8.9E-15 89.5 11.7 110 2-149 144-254 (895)
51 TIGR02521 type_IV_pilW type IV 99.2 2.8E-09 6.2E-14 73.3 14.3 131 2-148 70-215 (234)
52 KOG1125|consensus 99.2 8.8E-11 1.9E-15 89.5 7.0 95 2-134 435-530 (579)
53 KOG0547|consensus 99.2 3.1E-10 6.7E-15 85.5 9.6 63 4-76 401-464 (606)
54 COG4235 Cytochrome c biogenesi 99.2 7.1E-10 1.5E-14 79.1 10.9 101 2-137 161-262 (287)
55 KOG0624|consensus 99.1 7.5E-10 1.6E-14 80.5 10.1 138 2-149 43-202 (504)
56 PRK11788 tetratricopeptide rep 99.1 1.4E-09 3.1E-14 81.4 11.8 130 3-149 41-193 (389)
57 PF13414 TPR_11: TPR repeat; P 99.1 1.5E-10 3.2E-15 66.1 5.0 49 102-150 3-51 (69)
58 PRK02603 photosystem I assembl 99.1 1.4E-09 3E-14 73.1 10.0 96 2-135 40-153 (172)
59 KOG4162|consensus 99.1 1E-09 2.2E-14 86.2 10.4 101 3-139 690-791 (799)
60 PF13414 TPR_11: TPR repeat; P 99.1 2.9E-10 6.2E-15 64.9 5.5 61 2-71 8-69 (69)
61 PF13432 TPR_16: Tetratricopep 99.1 2.4E-10 5.3E-15 64.5 5.1 63 1-73 1-64 (65)
62 COG5010 TadD Flp pilus assembl 99.1 4.6E-09 1E-13 73.5 11.6 108 3-148 106-214 (257)
63 PRK14720 transcript cleavage f 99.1 2.8E-09 6.1E-14 86.6 11.6 136 2-150 121-271 (906)
64 CHL00033 ycf3 photosystem I as 99.1 3.9E-09 8.5E-14 70.6 10.7 104 2-136 40-154 (168)
65 PF13429 TPR_15: Tetratricopep 99.0 2.9E-10 6.3E-15 81.9 4.3 135 2-149 13-193 (280)
66 COG4783 Putative Zn-dependent 99.0 6.8E-09 1.5E-13 78.2 11.5 110 3-150 312-422 (484)
67 PRK15179 Vi polysaccharide bio 99.0 5.9E-09 1.3E-13 83.4 11.8 104 2-143 125-229 (694)
68 KOG1173|consensus 99.0 5.3E-09 1.1E-13 80.0 10.5 146 2-149 317-502 (611)
69 PLN02789 farnesyltranstransfer 99.0 7.6E-09 1.6E-13 76.0 11.0 132 3-148 77-229 (320)
70 PF12895 Apc3: Anaphase-promot 99.0 2.5E-09 5.5E-14 63.5 6.5 80 10-128 2-84 (84)
71 KOG3785|consensus 99.0 7.1E-09 1.5E-13 75.9 9.8 126 1-149 61-198 (557)
72 PRK10803 tol-pal system protei 99.0 1.3E-08 2.7E-13 72.9 11.0 100 4-138 149-253 (263)
73 PRK15331 chaperone protein Sic 99.0 9.6E-09 2.1E-13 67.8 9.4 107 2-148 42-149 (165)
74 KOG0550|consensus 99.0 1.9E-09 4E-14 79.9 6.5 117 4-148 210-333 (486)
75 KOG0543|consensus 98.9 1.1E-08 2.4E-13 75.6 10.0 106 3-146 214-335 (397)
76 PRK10049 pgaA outer membrane p 98.9 1.1E-08 2.4E-13 83.2 11.0 101 2-140 364-465 (765)
77 KOG4648|consensus 98.9 5E-09 1.1E-13 76.4 7.3 100 3-140 103-203 (536)
78 COG5010 TadD Flp pilus assembl 98.9 2.5E-08 5.4E-13 69.9 10.5 108 3-148 72-180 (257)
79 KOG1129|consensus 98.9 9.8E-09 2.1E-13 74.4 8.6 137 2-148 261-441 (478)
80 PRK14574 hmsH outer membrane p 98.9 2.8E-08 6.1E-13 80.9 11.9 108 3-148 40-148 (822)
81 KOG2002|consensus 98.9 5.1E-09 1.1E-13 84.1 7.4 134 9-149 624-763 (1018)
82 PRK14574 hmsH outer membrane p 98.9 3.4E-08 7.3E-13 80.5 12.0 106 4-148 109-215 (822)
83 cd05804 StaR_like StaR_like; a 98.9 5.5E-08 1.2E-12 72.1 11.6 99 2-134 119-218 (355)
84 PF14559 TPR_19: Tetratricopep 98.9 7.1E-09 1.5E-13 58.9 5.4 56 7-72 1-57 (68)
85 PF13432 TPR_16: Tetratricopep 98.8 2.2E-08 4.8E-13 56.4 7.0 63 36-136 2-65 (65)
86 PRK10153 DNA-binding transcrip 98.8 8.2E-08 1.8E-12 74.8 12.0 127 4-139 346-490 (517)
87 PF09976 TPR_21: Tetratricopep 98.8 4.6E-08 9.9E-13 64.0 8.8 89 2-129 53-145 (145)
88 PF13428 TPR_14: Tetratricopep 98.8 9.7E-09 2.1E-13 53.5 4.2 44 102-145 1-44 (44)
89 cd05804 StaR_like StaR_like; a 98.8 1.2E-07 2.7E-12 70.2 11.7 133 4-149 50-199 (355)
90 PF13428 TPR_14: Tetratricopep 98.8 7.4E-09 1.6E-13 53.9 3.6 39 2-40 6-44 (44)
91 KOG1156|consensus 98.8 1.1E-07 2.3E-12 74.0 10.9 134 3-146 81-263 (700)
92 PF00515 TPR_1: Tetratricopept 98.8 1.9E-08 4.1E-13 49.3 4.5 34 102-135 1-34 (34)
93 KOG1128|consensus 98.7 9.9E-08 2.1E-12 75.0 9.7 129 3-149 430-566 (777)
94 PF07719 TPR_2: Tetratricopept 98.7 4.5E-08 9.7E-13 47.8 4.9 34 102-135 1-34 (34)
95 KOG1156|consensus 98.7 5.8E-08 1.3E-12 75.4 7.9 105 7-149 17-122 (700)
96 PF06552 TOM20_plant: Plant sp 98.7 1.1E-07 2.3E-12 63.6 7.4 101 12-142 6-120 (186)
97 PF12569 NARP1: NMDA receptor- 98.7 1.5E-06 3.3E-11 67.6 14.9 143 2-149 43-241 (517)
98 KOG4162|consensus 98.7 2.2E-07 4.7E-12 73.5 10.2 111 2-150 655-768 (799)
99 PF13371 TPR_9: Tetratricopept 98.7 7.4E-08 1.6E-12 55.4 5.5 63 3-75 1-64 (73)
100 KOG4340|consensus 98.7 6.1E-08 1.3E-12 69.7 6.0 130 2-149 49-191 (459)
101 KOG4642|consensus 98.6 1.1E-07 2.3E-12 66.2 6.6 94 4-135 17-111 (284)
102 KOG2076|consensus 98.6 7.7E-07 1.7E-11 71.4 12.2 49 102-150 449-497 (895)
103 PRK10747 putative protoheme IX 98.6 2E-07 4.4E-12 70.6 8.8 45 3-47 159-204 (398)
104 CHL00033 ycf3 photosystem I as 98.6 3.7E-07 8E-12 61.1 9.1 102 11-147 13-117 (168)
105 KOG0553|consensus 98.6 7.5E-08 1.6E-12 68.8 5.9 49 102-150 81-129 (304)
106 PRK10866 outer membrane biogen 98.6 6E-07 1.3E-11 63.7 10.4 136 1-149 73-225 (243)
107 TIGR00540 hemY_coli hemY prote 98.6 4.8E-07 1E-11 68.8 10.5 46 103-149 336-383 (409)
108 KOG4555|consensus 98.6 5.1E-07 1.1E-11 57.5 8.6 97 3-133 49-146 (175)
109 PF13424 TPR_12: Tetratricopep 98.6 4.5E-07 9.7E-12 52.8 7.7 31 102-132 46-76 (78)
110 PRK02603 photosystem I assembl 98.6 3.8E-07 8.2E-12 61.3 8.3 97 18-149 20-119 (172)
111 PRK14720 transcript cleavage f 98.6 9E-07 1.9E-11 72.4 11.3 126 2-148 36-162 (906)
112 PRK11906 transcriptional regul 98.6 1.1E-06 2.4E-11 66.6 10.8 112 13-133 274-403 (458)
113 KOG1173|consensus 98.6 2.1E-07 4.6E-12 71.4 7.0 131 3-143 386-530 (611)
114 KOG4234|consensus 98.6 8.9E-07 1.9E-11 60.4 9.1 97 4-138 102-204 (271)
115 PLN03098 LPA1 LOW PSII ACCUMUL 98.6 1.6E-07 3.4E-12 71.0 6.1 49 102-150 75-126 (453)
116 PF00515 TPR_1: Tetratricopept 98.5 7.6E-08 1.7E-12 47.1 2.8 29 2-30 6-34 (34)
117 PLN03098 LPA1 LOW PSII ACCUMUL 98.5 2E-07 4.4E-12 70.4 5.8 88 2-90 80-172 (453)
118 PF09295 ChAPs: ChAPs (Chs5p-A 98.5 2.9E-06 6.3E-11 64.0 11.7 105 4-149 176-281 (395)
119 PF13525 YfiO: Outer membrane 98.5 2E-06 4.4E-11 59.4 9.9 135 2-149 47-191 (203)
120 KOG0543|consensus 98.5 1.8E-06 4E-11 64.1 9.9 67 1-77 261-328 (397)
121 PLN03088 SGT1, suppressor of 98.5 1E-06 2.2E-11 65.8 8.7 67 53-149 17-83 (356)
122 PF04733 Coatomer_E: Coatomer 98.5 2.8E-06 6.1E-11 61.8 10.6 48 102-149 201-248 (290)
123 KOG1127|consensus 98.5 1.4E-06 3E-11 71.0 9.3 129 3-147 498-641 (1238)
124 PF07719 TPR_2: Tetratricopept 98.5 2E-07 4.4E-12 45.4 3.0 29 2-30 6-34 (34)
125 KOG1174|consensus 98.4 4.1E-06 8.9E-11 62.7 10.5 128 2-139 339-508 (564)
126 COG4235 Cytochrome c biogenesi 98.4 4.4E-06 9.6E-11 60.0 10.1 100 12-149 137-240 (287)
127 PRK10153 DNA-binding transcrip 98.4 4.2E-06 9.1E-11 65.4 10.6 62 3-75 426-488 (517)
128 PF13371 TPR_9: Tetratricopept 98.4 1.3E-06 2.8E-11 50.1 6.0 43 102-144 29-71 (73)
129 TIGR02795 tol_pal_ybgF tol-pal 98.4 4.5E-06 9.8E-11 51.9 9.0 83 32-149 3-89 (119)
130 PF13181 TPR_8: Tetratricopept 98.4 9.1E-07 2E-11 43.1 4.5 33 103-135 2-34 (34)
131 PRK11906 transcriptional regul 98.4 4E-06 8.7E-11 63.6 9.8 106 12-133 319-438 (458)
132 KOG4648|consensus 98.4 3.1E-07 6.7E-12 67.3 3.7 48 102-149 97-144 (536)
133 PF12688 TPR_5: Tetratrico pep 98.4 1.5E-05 3.2E-10 50.5 10.9 107 2-147 6-116 (120)
134 KOG1129|consensus 98.4 1.7E-06 3.6E-11 63.1 7.0 135 2-147 228-369 (478)
135 PF13512 TPR_18: Tetratricopep 98.4 1.4E-05 3E-10 51.8 10.5 101 2-137 15-134 (142)
136 COG4783 Putative Zn-dependent 98.4 2E-05 4.4E-10 59.9 12.6 110 2-132 345-455 (484)
137 PRK15331 chaperone protein Sic 98.3 6.7E-06 1.5E-10 54.4 8.8 49 102-150 71-119 (165)
138 KOG1840|consensus 98.3 6.7E-06 1.5E-10 63.8 10.1 143 2-149 246-422 (508)
139 cd00189 TPR Tetratricopeptide 98.3 7.6E-06 1.7E-10 47.7 8.3 48 102-149 34-81 (100)
140 KOG1174|consensus 98.3 8.9E-06 1.9E-10 61.0 10.0 132 2-149 237-381 (564)
141 COG1729 Uncharacterized protei 98.3 1.2E-05 2.5E-10 57.2 10.3 104 2-140 146-253 (262)
142 PRK10866 outer membrane biogen 98.3 1.4E-05 3E-10 56.8 10.4 99 3-136 38-158 (243)
143 TIGR00540 hemY_coli hemY prote 98.3 1.7E-05 3.7E-10 60.3 11.6 115 3-133 90-218 (409)
144 COG3063 PilF Tfp pilus assembl 98.3 8.4E-06 1.8E-10 56.6 8.9 89 53-149 50-152 (250)
145 KOG0376|consensus 98.3 9.7E-07 2.1E-11 66.8 4.2 104 4-145 11-115 (476)
146 COG4785 NlpI Lipoprotein NlpI, 98.3 3.2E-06 6.9E-11 58.4 6.0 95 4-136 72-167 (297)
147 KOG1127|consensus 98.2 6.1E-06 1.3E-10 67.4 8.3 125 11-149 472-609 (1238)
148 KOG1128|consensus 98.2 7.1E-06 1.5E-10 64.9 8.4 100 5-142 493-593 (777)
149 PF14559 TPR_19: Tetratricopep 98.2 2E-06 4.3E-11 48.6 3.9 50 89-144 18-67 (68)
150 PF13525 YfiO: Outer membrane 98.2 1.6E-05 3.5E-10 54.9 9.0 102 2-138 10-126 (203)
151 COG4700 Uncharacterized protei 98.2 5.9E-05 1.3E-09 51.1 10.9 97 2-134 94-192 (251)
152 PF13431 TPR_17: Tetratricopep 98.2 1E-06 2.2E-11 43.1 1.8 26 124-149 1-26 (34)
153 KOG0495|consensus 98.2 5.6E-05 1.2E-09 59.7 11.6 43 102-144 817-859 (913)
154 KOG2003|consensus 98.2 1.5E-05 3.3E-10 60.6 8.2 47 102-148 558-604 (840)
155 KOG1840|consensus 98.1 1.3E-05 2.8E-10 62.2 7.7 102 2-134 204-315 (508)
156 PF09976 TPR_21: Tetratricopep 98.1 6E-05 1.3E-09 49.2 9.8 107 8-150 22-132 (145)
157 PF13181 TPR_8: Tetratricopept 98.1 2.7E-06 5.8E-11 41.4 2.3 29 2-30 6-34 (34)
158 PF13431 TPR_17: Tetratricopep 98.1 3E-06 6.5E-11 41.4 2.3 28 19-46 1-29 (34)
159 PRK10747 putative protoheme IX 98.1 6.3E-05 1.4E-09 57.1 10.1 111 3-132 269-391 (398)
160 PF12895 Apc3: Anaphase-promot 98.0 6.4E-06 1.4E-10 48.7 3.6 42 2-44 30-72 (84)
161 KOG2003|consensus 98.0 0.00013 2.8E-09 55.7 10.8 43 2-44 529-572 (840)
162 PF09295 ChAPs: ChAPs (Chs5p-A 98.0 7.9E-05 1.7E-09 56.4 9.6 90 2-129 205-295 (395)
163 COG2956 Predicted N-acetylgluc 98.0 0.00022 4.8E-09 52.1 11.3 125 6-148 44-185 (389)
164 COG4785 NlpI Lipoprotein NlpI, 98.0 0.00013 2.8E-09 50.7 9.6 44 3-46 105-149 (297)
165 PF13176 TPR_7: Tetratricopept 98.0 2.3E-05 5E-10 38.7 4.2 30 104-133 1-30 (36)
166 KOG0551|consensus 97.9 6.2E-05 1.4E-09 55.0 7.7 94 4-135 88-186 (390)
167 PF13174 TPR_6: Tetratricopept 97.9 3.5E-05 7.7E-10 36.9 4.3 33 103-135 1-33 (33)
168 PF14853 Fis1_TPR_C: Fis1 C-te 97.9 4.9E-05 1.1E-09 40.9 4.9 39 2-40 6-44 (53)
169 PF13424 TPR_12: Tetratricopep 97.9 3.1E-05 6.7E-10 44.9 4.5 49 102-150 5-60 (78)
170 PF12569 NARP1: NMDA receptor- 97.9 0.00018 4E-09 56.3 9.9 122 3-134 200-337 (517)
171 KOG3060|consensus 97.8 0.0007 1.5E-08 48.0 11.0 109 3-149 58-167 (289)
172 smart00028 TPR Tetratricopepti 97.8 6E-05 1.3E-09 35.0 4.1 33 103-135 2-34 (34)
173 PLN03081 pentatricopeptide (PP 97.8 0.00058 1.3E-08 55.5 11.6 48 102-149 494-541 (697)
174 KOG0495|consensus 97.7 0.00029 6.3E-09 55.9 9.0 127 3-145 590-728 (913)
175 COG3071 HemY Uncharacterized e 97.7 0.00028 6E-09 52.6 8.5 47 102-149 328-374 (400)
176 COG2956 Predicted N-acetylgluc 97.7 0.00036 7.8E-09 51.0 8.9 36 9-44 192-228 (389)
177 KOG2796|consensus 97.7 0.00056 1.2E-08 48.9 9.3 132 2-146 182-333 (366)
178 KOG4234|consensus 97.7 0.00013 2.7E-09 50.1 5.6 63 2-74 139-202 (271)
179 PF13512 TPR_18: Tetratricopep 97.7 0.0012 2.6E-08 42.8 9.8 49 102-150 47-98 (142)
180 KOG4642|consensus 97.7 3.4E-05 7.3E-10 54.0 2.7 48 102-149 10-57 (284)
181 KOG0545|consensus 97.7 0.00049 1.1E-08 48.6 8.3 96 3-136 184-298 (329)
182 PRK10803 tol-pal system protei 97.7 0.00015 3.2E-09 52.1 6.0 64 1-74 184-251 (263)
183 PF12688 TPR_5: Tetratrico pep 97.6 0.00054 1.2E-08 43.4 7.7 83 32-149 2-88 (120)
184 PF13176 TPR_7: Tetratricopept 97.6 5E-05 1.1E-09 37.5 2.4 26 2-27 4-29 (36)
185 COG3118 Thioredoxin domain-con 97.6 0.0015 3.4E-08 47.2 10.4 114 4-134 141-268 (304)
186 smart00028 TPR Tetratricopepti 97.6 9.6E-05 2.1E-09 34.3 3.1 29 2-30 6-34 (34)
187 KOG1308|consensus 97.5 5.6E-05 1.2E-09 55.4 2.5 92 5-134 122-214 (377)
188 KOG1130|consensus 97.5 0.00024 5.2E-09 53.6 5.5 114 4-149 24-148 (639)
189 PF13174 TPR_6: Tetratricopept 97.5 7E-05 1.5E-09 35.8 1.8 29 2-30 5-33 (33)
190 PF14853 Fis1_TPR_C: Fis1 C-te 97.5 0.00054 1.2E-08 36.9 5.2 41 104-144 3-43 (53)
191 PLN03077 Protein ECB2; Provisi 97.4 0.0041 8.8E-08 51.8 12.1 134 2-149 559-704 (857)
192 KOG2376|consensus 97.4 0.0033 7.1E-08 49.4 10.5 109 4-130 86-203 (652)
193 COG0457 NrfG FOG: TPR repeat [ 97.3 0.0088 1.9E-07 39.8 11.1 46 102-147 202-247 (291)
194 KOG4555|consensus 97.3 0.00076 1.6E-08 43.3 5.3 47 102-148 43-89 (175)
195 COG4105 ComL DNA uptake lipopr 97.3 0.0076 1.6E-07 42.8 10.7 120 2-135 76-200 (254)
196 KOG2376|consensus 97.3 0.0024 5.1E-08 50.1 8.6 110 7-134 22-142 (652)
197 PF14938 SNAP: Soluble NSF att 97.3 0.005 1.1E-07 44.6 10.0 47 102-148 155-208 (282)
198 PF10300 DUF3808: Protein of u 97.2 0.0037 8.1E-08 48.6 9.1 104 10-147 246-350 (468)
199 PF06552 TOM20_plant: Plant sp 97.2 0.0015 3.3E-08 44.0 6.0 48 102-149 25-82 (186)
200 COG2976 Uncharacterized protei 97.2 0.0057 1.2E-07 41.8 8.7 96 2-136 94-193 (207)
201 PF04733 Coatomer_E: Coatomer 97.2 0.0034 7.4E-08 45.8 8.1 122 4-139 138-273 (290)
202 COG0457 NrfG FOG: TPR repeat [ 97.2 0.019 4.2E-07 38.1 11.4 134 3-149 65-215 (291)
203 PRK10941 hypothetical protein; 97.1 0.0037 8E-08 45.1 7.2 63 3-75 187-250 (269)
204 COG4105 ComL DNA uptake lipopr 97.0 0.015 3.3E-07 41.3 9.9 99 3-136 40-150 (254)
205 KOG2471|consensus 97.0 0.0018 4E-08 49.9 5.6 47 102-148 335-381 (696)
206 PF13374 TPR_10: Tetratricopep 96.9 0.0029 6.3E-08 31.6 4.4 31 102-132 2-32 (42)
207 KOG1130|consensus 96.9 0.0017 3.7E-08 49.2 4.8 44 2-45 200-250 (639)
208 KOG0545|consensus 96.9 0.0035 7.5E-08 44.5 5.6 46 102-147 230-275 (329)
209 PF04184 ST7: ST7 protein; In 96.9 0.012 2.6E-07 45.6 8.9 124 10-149 181-308 (539)
210 KOG3081|consensus 96.8 0.027 5.9E-07 40.4 9.4 45 102-146 207-251 (299)
211 PF03704 BTAD: Bacterial trans 96.8 0.0021 4.6E-08 41.7 3.8 44 3-46 68-112 (146)
212 KOG3824|consensus 96.7 0.0058 1.3E-07 44.8 6.0 43 102-144 150-192 (472)
213 PF07720 TPR_3: Tetratricopept 96.7 0.0085 1.8E-07 29.5 4.8 33 103-135 2-36 (36)
214 PLN03218 maturation of RBCL 1; 96.7 0.06 1.3E-06 46.1 12.7 29 103-131 615-643 (1060)
215 PLN03218 maturation of RBCL 1; 96.6 0.069 1.5E-06 45.7 12.5 46 102-148 684-731 (1060)
216 PF14938 SNAP: Soluble NSF att 96.6 0.026 5.6E-07 41.0 8.8 104 2-136 119-230 (282)
217 KOG2053|consensus 96.6 0.04 8.7E-07 45.4 10.2 101 8-147 20-121 (932)
218 KOG2796|consensus 96.6 0.033 7.1E-07 40.1 8.6 41 4-44 259-300 (366)
219 PF02259 FAT: FAT domain; Int 96.5 0.053 1.1E-06 40.1 10.4 138 2-149 151-305 (352)
220 PF13374 TPR_10: Tetratricopep 96.5 0.0031 6.7E-08 31.5 2.5 26 2-27 7-32 (42)
221 PF14561 TPR_20: Tetratricopep 96.5 0.029 6.3E-07 33.6 6.9 67 15-91 6-73 (90)
222 KOG0376|consensus 96.4 0.0031 6.8E-08 48.3 3.1 46 102-147 38-83 (476)
223 COG1729 Uncharacterized protei 96.4 0.01 2.2E-07 42.5 5.3 64 1-74 182-249 (262)
224 PF10516 SHNi-TPR: SHNi-TPR; 96.3 0.011 2.4E-07 29.4 3.9 30 103-132 2-31 (38)
225 PLN03081 pentatricopeptide (PP 96.3 0.12 2.6E-06 42.2 11.9 127 3-149 265-404 (697)
226 KOG4507|consensus 96.3 0.019 4.2E-07 45.5 6.8 100 8-145 618-719 (886)
227 KOG3824|consensus 96.3 0.0087 1.9E-07 43.9 4.5 63 4-76 123-186 (472)
228 PRK04841 transcriptional regul 96.2 0.074 1.6E-06 44.5 10.6 98 3-132 458-561 (903)
229 KOG3785|consensus 96.1 0.062 1.3E-06 40.4 8.4 136 1-149 155-329 (557)
230 KOG4340|consensus 96.1 0.016 3.4E-07 42.5 5.3 112 2-126 149-265 (459)
231 PF03704 BTAD: Bacterial trans 96.1 0.024 5.2E-07 36.7 5.9 48 102-149 62-109 (146)
232 PF04184 ST7: ST7 protein; In 96.1 0.14 2.9E-06 40.0 10.3 45 2-46 264-311 (539)
233 PLN03077 Protein ECB2; Provisi 96.0 0.099 2.1E-06 43.7 10.2 113 3-134 530-656 (857)
234 KOG0551|consensus 96.0 0.025 5.4E-07 41.9 5.8 48 102-149 119-166 (390)
235 KOG1941|consensus 96.0 0.047 1E-06 41.1 7.2 43 2-44 127-176 (518)
236 KOG2610|consensus 95.8 0.055 1.2E-06 40.3 7.0 45 2-46 180-225 (491)
237 COG3071 HemY Uncharacterized e 95.7 0.17 3.6E-06 38.3 9.2 114 3-131 269-390 (400)
238 PF14561 TPR_20: Tetratricopep 95.7 0.074 1.6E-06 31.8 6.1 46 84-135 10-55 (90)
239 PF13281 DUF4071: Domain of un 95.6 0.62 1.4E-05 35.3 12.0 132 3-136 185-339 (374)
240 PF10516 SHNi-TPR: SHNi-TPR; 95.5 0.017 3.8E-07 28.7 2.5 26 2-27 6-31 (38)
241 PF05843 Suf: Suppressor of fo 95.5 0.23 5E-06 36.1 9.4 121 5-137 9-142 (280)
242 PF09986 DUF2225: Uncharacteri 95.5 0.3 6.5E-06 34.1 9.4 48 102-149 165-213 (214)
243 KOG3081|consensus 95.4 0.087 1.9E-06 37.9 6.7 43 2-44 212-255 (299)
244 COG4649 Uncharacterized protei 95.4 0.46 9.9E-06 32.3 9.7 49 101-150 166-214 (221)
245 PF07720 TPR_3: Tetratricopept 95.3 0.021 4.5E-07 28.1 2.4 29 2-30 6-36 (36)
246 PRK10941 hypothetical protein; 95.3 0.054 1.2E-06 39.2 5.5 48 102-149 181-228 (269)
247 KOG1915|consensus 95.2 0.29 6.4E-06 38.2 9.1 100 8-144 448-548 (677)
248 KOG2610|consensus 95.1 0.25 5.5E-06 37.0 8.4 106 6-146 112-219 (491)
249 COG4976 Predicted methyltransf 94.9 0.047 1E-06 38.5 4.0 61 5-75 3-64 (287)
250 KOG1310|consensus 94.8 0.14 3E-06 40.4 6.7 94 7-135 384-478 (758)
251 PRK04841 transcriptional regul 94.6 0.38 8.2E-06 40.4 9.3 100 2-134 536-644 (903)
252 PF07721 TPR_4: Tetratricopept 94.5 0.078 1.7E-06 23.8 3.0 24 103-126 2-25 (26)
253 KOG2396|consensus 94.5 0.45 9.7E-06 37.3 8.6 87 16-140 90-178 (568)
254 KOG3364|consensus 94.4 0.12 2.5E-06 33.5 4.6 38 3-40 77-114 (149)
255 PF05843 Suf: Suppressor of fo 94.4 0.4 8.6E-06 34.9 8.0 34 102-135 70-103 (280)
256 smart00386 HAT HAT (Half-A-TPR 94.2 0.13 2.8E-06 23.7 3.7 30 11-40 1-30 (33)
257 KOG1915|consensus 94.2 0.6 1.3E-05 36.6 8.7 56 7-72 83-139 (677)
258 PRK13184 pknD serine/threonine 94.1 0.61 1.3E-05 39.6 9.4 107 2-141 480-591 (932)
259 KOG1586|consensus 93.8 0.8 1.7E-05 32.6 8.1 35 104-138 156-190 (288)
260 TIGR03504 FimV_Cterm FimV C-te 93.8 0.11 2.5E-06 26.7 3.1 28 1-29 3-30 (44)
261 PF08424 NRDE-2: NRDE-2, neces 93.2 1.5 3.2E-05 32.6 9.3 24 17-40 5-28 (321)
262 PF04781 DUF627: Protein of un 93.0 1.4 3E-05 27.5 7.9 103 3-132 2-108 (111)
263 smart00386 HAT HAT (Half-A-TPR 92.7 0.26 5.6E-06 22.6 3.3 29 116-144 1-29 (33)
264 KOG1308|consensus 92.4 0.071 1.5E-06 39.6 1.4 44 4-47 155-199 (377)
265 KOG0530|consensus 92.4 3.2 7E-05 30.2 10.2 120 7-139 53-184 (318)
266 PF07079 DUF1347: Protein of u 92.1 4.7 0.0001 31.6 12.1 27 8-34 390-417 (549)
267 PF14863 Alkyl_sulf_dimr: Alky 92.0 0.36 7.8E-06 31.5 4.2 43 3-45 76-119 (141)
268 KOG3364|consensus 91.9 0.29 6.2E-06 31.7 3.5 41 102-142 71-111 (149)
269 COG4700 Uncharacterized protei 91.8 3.1 6.8E-05 28.8 9.3 45 1-45 128-175 (251)
270 COG2912 Uncharacterized conser 91.5 0.5 1.1E-05 34.1 4.8 63 3-75 187-250 (269)
271 PHA02537 M terminase endonucle 91.3 1.3 2.8E-05 31.4 6.6 36 100-135 167-211 (230)
272 PF12968 DUF3856: Domain of Un 91.2 2.6 5.7E-05 26.8 10.0 100 5-131 17-129 (144)
273 COG3914 Spy Predicted O-linked 91.1 5.8 0.00012 32.0 10.4 31 108-138 148-178 (620)
274 KOG0529|consensus 90.7 3.6 7.8E-05 31.6 8.8 105 11-147 89-194 (421)
275 PF01535 PPR: PPR repeat; Int 90.6 0.59 1.3E-05 21.1 3.2 28 104-131 2-29 (31)
276 KOG2471|consensus 90.5 0.31 6.7E-06 38.2 3.2 40 1-40 339-378 (696)
277 PF14863 Alkyl_sulf_dimr: Alky 90.4 1.2 2.6E-05 29.0 5.5 48 102-149 70-117 (141)
278 PF10579 Rapsyn_N: Rapsyn N-te 90.3 1.6 3.5E-05 25.5 5.3 41 4-44 13-57 (80)
279 PF10602 RPN7: 26S proteasome 90.2 4.2 9.1E-05 27.5 10.9 100 2-133 41-144 (177)
280 PF09613 HrpB1_HrpK: Bacterial 90.2 1.5 3.2E-05 29.3 5.7 48 102-149 44-91 (160)
281 PF04781 DUF627: Protein of un 90.1 0.97 2.1E-05 28.2 4.6 41 108-148 2-45 (111)
282 PF04910 Tcf25: Transcriptiona 90.1 6.2 0.00013 29.9 9.8 38 102-139 103-141 (360)
283 KOG2053|consensus 90.1 10 0.00022 32.2 11.3 59 4-72 50-109 (932)
284 COG4976 Predicted methyltransf 89.9 0.7 1.5E-05 32.8 4.3 36 102-137 29-64 (287)
285 COG3947 Response regulator con 89.9 1.2 2.5E-05 32.9 5.5 43 3-45 285-328 (361)
286 cd02682 MIT_AAA_Arch MIT: doma 89.8 1.3 2.8E-05 25.6 4.6 38 102-139 6-50 (75)
287 PF12854 PPR_1: PPR repeat 89.6 0.84 1.8E-05 21.8 3.3 27 102-128 7-33 (34)
288 PF04212 MIT: MIT (microtubule 89.0 1.9 4E-05 24.2 5.0 30 102-131 5-34 (69)
289 PF08424 NRDE-2: NRDE-2, neces 88.8 8 0.00017 28.7 10.8 116 13-138 47-190 (321)
290 PF12862 Apc5: Anaphase-promot 88.7 1.3 2.8E-05 26.5 4.4 34 102-135 41-74 (94)
291 PF10300 DUF3808: Protein of u 88.5 7.8 0.00017 30.5 9.6 94 3-133 273-378 (468)
292 PF01239 PPTA: Protein prenylt 88.5 1.1 2.4E-05 20.8 3.2 29 121-149 2-30 (31)
293 PF13041 PPR_2: PPR repeat fam 88.4 1.2 2.7E-05 23.0 3.7 30 102-131 3-32 (50)
294 KOG1550|consensus 88.2 6.2 0.00013 31.7 9.0 38 3-40 250-297 (552)
295 PF12862 Apc5: Anaphase-promot 88.1 0.48 1E-05 28.4 2.2 27 2-28 46-72 (94)
296 TIGR00756 PPR pentatricopeptid 87.6 1.7 3.8E-05 19.8 3.8 28 104-131 2-29 (35)
297 COG5191 Uncharacterized conser 87.5 1.1 2.4E-05 33.3 4.1 84 19-140 95-180 (435)
298 COG2912 Uncharacterized conser 87.5 1.8 4E-05 31.3 5.1 44 106-149 185-228 (269)
299 PF10579 Rapsyn_N: Rapsyn N-te 87.3 2.7 5.8E-05 24.6 4.8 42 102-143 6-47 (80)
300 cd02681 MIT_calpain7_1 MIT: do 87.1 2.4 5.3E-05 24.5 4.6 31 102-132 6-36 (76)
301 KOG4814|consensus 86.9 4.9 0.00011 32.9 7.5 39 102-140 394-432 (872)
302 PF02064 MAS20: MAS20 protein 86.6 1.4 3E-05 28.0 3.7 36 2-37 68-103 (121)
303 cd02683 MIT_1 MIT: domain cont 85.8 3 6.5E-05 24.1 4.6 29 102-130 6-34 (77)
304 KOG1070|consensus 85.6 14 0.0003 33.2 9.9 31 14-44 1441-1472(1710)
305 KOG1941|consensus 85.6 3.5 7.5E-05 31.5 5.9 33 102-134 162-194 (518)
306 PF15015 NYD-SP12_N: Spermatog 85.5 1.7 3.8E-05 33.6 4.4 46 2-47 233-279 (569)
307 KOG1070|consensus 85.5 12 0.00027 33.5 9.6 124 3-148 1536-1682(1710)
308 KOG4507|consensus 84.5 4.2 9.1E-05 33.0 6.1 104 18-137 200-318 (886)
309 PF10602 RPN7: 26S proteasome 84.3 1.9 4.1E-05 29.1 3.8 32 102-133 36-67 (177)
310 KOG1585|consensus 84.3 6.8 0.00015 28.3 6.6 29 4-32 38-66 (308)
311 PF09613 HrpB1_HrpK: Bacterial 84.1 3.8 8.3E-05 27.3 5.0 43 3-45 50-93 (160)
312 KOG3616|consensus 83.4 3.6 7.8E-05 34.4 5.5 20 3-22 667-686 (1636)
313 PF04053 Coatomer_WDAD: Coatom 83.4 1.1 2.5E-05 34.8 2.7 29 102-130 347-375 (443)
314 KOG0985|consensus 83.0 19 0.00041 31.6 9.5 41 2-47 1109-1150(1666)
315 PF04212 MIT: MIT (microtubule 82.8 1.9 4.1E-05 24.1 2.9 23 3-25 11-33 (69)
316 PF02064 MAS20: MAS20 protein 82.7 4.8 0.0001 25.5 4.9 33 102-134 63-95 (121)
317 cd02680 MIT_calpain7_2 MIT: do 82.6 4.2 9.1E-05 23.5 4.2 30 102-131 6-35 (75)
318 KOG0546|consensus 82.4 2.7 5.9E-05 31.6 4.2 37 102-138 309-345 (372)
319 TIGR02561 HrpB1_HrpK type III 82.0 5 0.00011 26.5 4.9 43 3-45 50-93 (153)
320 cd02683 MIT_1 MIT: domain cont 81.9 1.8 3.9E-05 25.0 2.6 21 4-24 13-33 (77)
321 KOG2300|consensus 81.4 26 0.00055 28.0 10.0 104 2-136 12-123 (629)
322 TIGR02561 HrpB1_HrpK type III 81.3 3.8 8.2E-05 27.0 4.2 47 102-148 44-90 (153)
323 COG3629 DnrI DNA-binding trans 81.2 9.7 0.00021 27.9 6.6 48 102-149 153-200 (280)
324 COG3118 Thioredoxin domain-con 81.1 13 0.00028 27.5 7.2 21 2-22 173-193 (304)
325 smart00745 MIT Microtubule Int 80.9 6.3 0.00014 22.4 4.7 28 103-130 9-36 (77)
326 cd02681 MIT_calpain7_1 MIT: do 80.7 1.9 4.2E-05 24.9 2.4 22 4-25 13-34 (76)
327 cd02678 MIT_VPS4 MIT: domain c 80.5 6.6 0.00014 22.4 4.7 29 102-130 6-34 (75)
328 PF13812 PPR_3: Pentatricopept 80.1 4.5 9.7E-05 18.5 4.2 28 104-131 3-30 (34)
329 KOG1464|consensus 79.6 7.2 0.00016 28.7 5.5 53 53-131 42-94 (440)
330 COG3629 DnrI DNA-binding trans 79.1 21 0.00046 26.1 7.8 44 3-46 159-203 (280)
331 KOG2396|consensus 79.0 8.8 0.00019 30.5 6.1 43 102-144 105-147 (568)
332 cd02682 MIT_AAA_Arch MIT: doma 78.7 4.9 0.00011 23.2 3.6 37 4-40 13-56 (75)
333 PF11207 DUF2989: Protein of u 78.6 7.7 0.00017 26.9 5.2 43 3-46 147-194 (203)
334 cd02684 MIT_2 MIT: domain cont 77.6 9.2 0.0002 22.0 4.6 30 102-131 6-35 (75)
335 PF10255 Paf67: RNA polymerase 77.3 8 0.00017 29.9 5.4 29 102-130 164-192 (404)
336 cd02656 MIT MIT: domain contai 77.3 9.6 0.00021 21.6 4.7 26 105-130 9-34 (75)
337 PF10373 EST1_DNA_bind: Est1 D 76.8 4.4 9.5E-05 28.9 3.8 29 16-44 1-30 (278)
338 KOG2300|consensus 76.7 18 0.0004 28.8 7.2 110 2-145 372-486 (629)
339 cd02677 MIT_SNX15 MIT: domain 76.3 9.6 0.00021 21.9 4.4 29 103-131 7-35 (75)
340 PF13281 DUF4071: Domain of un 76.1 33 0.00072 26.3 9.1 29 117-146 241-269 (374)
341 smart00745 MIT Microtubule Int 76.0 3.6 7.7E-05 23.5 2.6 21 4-24 15-35 (77)
342 TIGR00985 3a0801s04tom mitocho 75.6 5.5 0.00012 26.2 3.6 36 2-37 95-131 (148)
343 COG3947 Response regulator con 75.1 9 0.00019 28.4 4.9 42 107-148 284-325 (361)
344 cd02679 MIT_spastin MIT: domai 74.8 2.9 6.4E-05 24.4 2.0 19 11-29 3-21 (79)
345 KOG1914|consensus 73.9 35 0.00076 27.7 8.1 72 21-130 10-81 (656)
346 PF07079 DUF1347: Protein of u 73.8 11 0.00023 29.7 5.3 40 4-44 469-509 (549)
347 COG1747 Uncharacterized N-term 73.6 47 0.001 26.9 8.8 131 2-144 104-247 (711)
348 COG0790 FOG: TPR repeat, SEL1 73.2 32 0.00069 24.8 10.6 40 102-143 187-230 (292)
349 PF04910 Tcf25: Transcriptiona 72.1 41 0.00089 25.6 8.1 36 5-40 111-147 (360)
350 KOG4056|consensus 71.5 8.9 0.00019 24.9 3.7 36 2-37 86-121 (143)
351 PF12583 TPPII_N: Tripeptidyl 70.6 20 0.00044 23.2 5.2 44 102-145 76-119 (139)
352 cd00280 TRFH Telomeric Repeat 70.4 13 0.00027 25.7 4.5 40 4-44 118-158 (200)
353 KOG2047|consensus 70.2 63 0.0014 26.9 9.3 31 102-132 248-278 (835)
354 KOG3617|consensus 69.8 6.6 0.00014 33.3 3.6 29 102-130 967-995 (1416)
355 smart00671 SEL1 Sel1-like repe 69.3 10 0.00022 17.5 4.0 30 103-132 2-35 (36)
356 PF12583 TPPII_N: Tripeptidyl 68.9 10 0.00022 24.5 3.6 33 8-40 87-119 (139)
357 KOG2997|consensus 68.4 8.1 0.00018 28.9 3.5 41 100-140 17-57 (366)
358 KOG2581|consensus 68.2 13 0.00029 28.8 4.7 35 102-136 247-281 (493)
359 KOG3783|consensus 68.0 21 0.00047 28.5 5.9 46 102-147 303-348 (546)
360 KOG0276|consensus 67.8 8.9 0.00019 31.3 3.8 30 102-131 666-695 (794)
361 KOG4151|consensus 67.8 23 0.0005 29.6 6.2 61 4-72 60-125 (748)
362 PF02184 HAT: HAT (Half-A-TPR) 67.7 6.9 0.00015 18.6 2.0 26 12-38 2-27 (32)
363 COG3898 Uncharacterized membra 67.3 59 0.0013 25.5 9.4 38 9-46 166-204 (531)
364 KOG2047|consensus 67.0 74 0.0016 26.6 11.4 122 2-134 392-543 (835)
365 TIGR00985 3a0801s04tom mitocho 66.3 18 0.0004 23.8 4.5 33 102-134 90-123 (148)
366 KOG1258|consensus 65.8 73 0.0016 26.0 10.3 105 9-145 378-484 (577)
367 KOG1585|consensus 65.4 25 0.00054 25.6 5.3 24 3-26 77-100 (308)
368 KOG3617|consensus 64.5 89 0.0019 27.1 8.9 32 97-128 1140-1171(1416)
369 PRK15180 Vi polysaccharide bio 64.2 13 0.00028 29.7 4.1 118 9-140 301-429 (831)
370 PF02259 FAT: FAT domain; Int 64.1 13 0.00028 27.3 4.1 30 11-40 272-301 (352)
371 KOG1310|consensus 63.5 11 0.00023 30.4 3.5 47 103-149 409-458 (758)
372 KOG4056|consensus 63.1 15 0.00033 23.8 3.6 37 102-138 81-117 (143)
373 PF11817 Foie-gras_1: Foie gra 62.7 22 0.00047 25.3 4.8 43 2-44 183-232 (247)
374 PF08238 Sel1: Sel1 repeat; I 62.6 16 0.00034 17.2 4.1 30 103-132 2-38 (39)
375 PF06957 COPI_C: Coatomer (COP 62.4 53 0.0012 25.7 7.0 36 102-137 300-335 (422)
376 cd02679 MIT_spastin MIT: domai 62.2 26 0.00057 20.4 4.2 30 102-131 8-37 (79)
377 KOG3807|consensus 62.0 28 0.00061 26.5 5.3 28 106-133 279-306 (556)
378 KOG1464|consensus 61.2 21 0.00046 26.4 4.5 37 10-46 40-81 (440)
379 PRK15180 Vi polysaccharide bio 61.0 88 0.0019 25.3 9.8 94 9-133 710-807 (831)
380 COG3914 Spy Predicted O-linked 60.9 92 0.002 25.5 9.0 41 102-142 101-142 (620)
381 KOG0546|consensus 60.7 19 0.00041 27.3 4.2 42 3-44 315-357 (372)
382 COG5191 Uncharacterized conser 59.5 16 0.00035 27.5 3.7 42 102-143 107-148 (435)
383 KOG2422|consensus 59.3 97 0.0021 25.4 8.0 33 8-40 353-386 (665)
384 cd00280 TRFH Telomeric Repeat 59.3 57 0.0012 22.6 7.2 34 106-140 115-148 (200)
385 PF07219 HemY_N: HemY protein 57.3 43 0.00092 20.5 5.7 39 102-140 59-97 (108)
386 KOG2041|consensus 57.1 73 0.0016 27.0 7.1 26 102-127 852-877 (1189)
387 KOG2422|consensus 55.6 1.2E+02 0.0025 25.0 10.1 120 8-143 249-384 (665)
388 PF07219 HemY_N: HemY protein 55.1 47 0.001 20.3 4.9 42 3-44 65-107 (108)
389 KOG4814|consensus 54.9 38 0.00082 28.1 5.2 37 2-38 399-435 (872)
390 KOG1586|consensus 54.9 80 0.0017 22.9 9.2 103 2-136 118-229 (288)
391 KOG0985|consensus 54.2 33 0.00071 30.2 5.0 29 100-128 1102-1130(1666)
392 PF12968 DUF3856: Domain of Un 53.0 59 0.0013 20.9 6.1 43 2-44 60-114 (144)
393 COG4259 Uncharacterized protei 52.7 32 0.00069 21.3 3.6 37 102-138 72-108 (121)
394 PF10373 EST1_DNA_bind: Est1 D 52.0 70 0.0015 22.7 6.0 47 102-148 16-62 (278)
395 PF09670 Cas_Cas02710: CRISPR- 51.9 1.1E+02 0.0023 23.5 8.7 30 3-32 137-166 (379)
396 PF08631 SPO22: Meiosis protei 51.9 77 0.0017 23.0 6.2 37 8-44 4-42 (278)
397 COG4455 ImpE Protein of avirul 51.3 59 0.0013 23.3 5.2 61 5-75 9-70 (273)
398 PF05053 Menin: Menin; InterP 51.2 1.4E+02 0.003 24.5 7.7 30 102-131 318-347 (618)
399 COG5187 RPN7 26S proteasome re 50.5 1.1E+02 0.0023 23.1 8.8 33 100-132 113-145 (412)
400 KOG4459|consensus 50.4 31 0.00066 27.1 4.0 41 4-44 140-181 (471)
401 COG2976 Uncharacterized protei 50.1 61 0.0013 22.6 5.0 58 2-71 131-190 (207)
402 PF14689 SPOB_a: Sensor_kinase 49.2 44 0.00095 18.3 5.0 30 102-131 23-52 (62)
403 KOG0890|consensus 49.1 2.6E+02 0.0057 27.2 11.8 120 4-147 1677-1800(2382)
404 PF05131 Pep3_Vps18: Pep3/Vps1 48.3 19 0.00041 23.7 2.3 22 106-127 107-128 (147)
405 COG5159 RPN6 26S proteasome re 45.8 1.3E+02 0.0028 22.7 9.9 45 3-47 9-62 (421)
406 PF15469 Sec5: Exocyst complex 44.0 97 0.0021 20.8 6.7 21 7-27 96-116 (182)
407 PF08626 TRAPPC9-Trs120: Trans 43.2 33 0.00072 30.5 3.7 33 102-134 242-274 (1185)
408 PF04053 Coatomer_WDAD: Coatom 42.5 1.7E+02 0.0037 23.1 9.1 33 9-46 330-363 (443)
409 PF11846 DUF3366: Domain of un 42.5 1.1E+02 0.0023 20.7 6.1 33 102-134 144-176 (193)
410 PF10255 Paf67: RNA polymerase 41.2 24 0.00053 27.3 2.3 22 3-24 170-191 (404)
411 PF13226 DUF4034: Domain of un 40.7 51 0.0011 24.2 3.8 33 13-45 115-148 (277)
412 COG3898 Uncharacterized membra 40.2 1.8E+02 0.004 22.9 11.0 110 12-137 244-364 (531)
413 PF11817 Foie-gras_1: Foie gra 40.2 50 0.0011 23.5 3.7 32 102-133 178-209 (247)
414 KOG1550|consensus 40.1 2E+02 0.0044 23.3 11.0 105 2-146 293-406 (552)
415 KOG2709|consensus 40.1 18 0.00039 28.2 1.4 24 2-25 27-50 (560)
416 PF10366 Vps39_1: Vacuolar sor 39.8 51 0.0011 20.3 3.2 29 102-130 39-67 (108)
417 KOG1538|consensus 39.1 2E+02 0.0042 24.4 7.0 26 103-128 805-830 (1081)
418 KOG2114|consensus 39.1 48 0.001 28.3 3.7 29 102-130 368-396 (933)
419 KOG2114|consensus 37.6 2.7E+02 0.0059 24.1 7.9 29 3-31 374-403 (933)
420 COG0790 FOG: TPR repeat, SEL1 36.8 1.6E+02 0.0034 21.1 8.9 46 102-147 109-159 (292)
421 PF10952 DUF2753: Protein of u 36.6 82 0.0018 20.3 3.7 30 104-133 3-32 (140)
422 PF04097 Nic96: Nup93/Nic96; 36.2 2.2E+02 0.0049 23.4 7.2 34 102-136 503-538 (613)
423 KOG0530|consensus 35.3 1.9E+02 0.004 21.5 6.8 42 102-143 112-154 (318)
424 PF04762 IKI3: IKI3 family; I 34.9 1.8E+02 0.0038 25.5 6.6 99 3-128 818-927 (928)
425 PF12753 Nro1: Nuclear pore co 34.9 46 0.001 25.7 2.9 31 117-149 333-363 (404)
426 KOG0529|consensus 34.7 2.2E+02 0.0049 22.3 8.5 45 102-146 109-155 (421)
427 KOG1463|consensus 33.0 1.7E+02 0.0037 22.5 5.5 46 102-147 285-332 (411)
428 PF08631 SPO22: Meiosis protei 31.7 2E+02 0.0043 20.8 9.4 25 2-26 40-65 (278)
429 KOG2041|consensus 31.0 2.4E+02 0.0052 24.2 6.3 30 100-129 794-823 (1189)
430 KOG2997|consensus 29.7 93 0.002 23.5 3.6 32 4-35 26-57 (366)
431 smart00299 CLH Clathrin heavy 28.8 1.5E+02 0.0033 18.5 11.2 36 5-40 15-50 (140)
432 COG4455 ImpE Protein of avirul 28.3 1.9E+02 0.0042 20.9 4.8 43 103-145 36-81 (273)
433 PF08969 USP8_dimer: USP8 dime 28.2 1.5E+02 0.0033 18.2 4.4 31 102-132 38-68 (115)
434 COG4649 Uncharacterized protei 28.2 1.7E+02 0.0036 20.4 4.3 38 2-40 172-209 (221)
435 PF09797 NatB_MDM20: N-acetylt 27.6 1.5E+02 0.0032 22.4 4.6 34 11-44 197-231 (365)
436 COG2015 Alkyl sulfatase and re 27.3 1.6E+02 0.0035 23.8 4.7 48 102-149 452-499 (655)
437 PF04505 Dispanin: Interferon- 27.3 97 0.0021 18.0 2.9 27 106-132 41-67 (82)
438 COG2909 MalT ATP-dependent tra 26.6 4.4E+02 0.0094 23.1 9.4 37 102-138 497-533 (894)
439 PF12739 TRAPPC-Trs85: ER-Golg 26.2 1.3E+02 0.0027 23.4 4.1 44 102-146 208-251 (414)
440 KOG4563|consensus 25.9 1.3E+02 0.0028 23.2 3.9 33 102-134 41-73 (400)
441 KOG3616|consensus 25.8 55 0.0012 27.9 2.1 34 2-35 1000-1033(1636)
442 PF11207 DUF2989: Protein of u 25.3 2.5E+02 0.0053 19.7 11.2 64 81-149 124-191 (203)
443 PF15015 NYD-SP12_N: Spermatog 25.1 97 0.0021 24.5 3.1 43 106-148 232-274 (569)
444 KOG0687|consensus 24.6 3.3E+02 0.0071 20.9 6.0 30 102-131 104-133 (393)
445 PF00244 14-3-3: 14-3-3 protei 24.6 78 0.0017 22.5 2.5 29 3-31 7-35 (236)
446 KOG0890|consensus 24.3 4E+02 0.0087 26.1 7.1 32 102-133 1702-1733(2382)
447 PF09670 Cas_Cas02710: CRISPR- 23.1 2.2E+02 0.0047 21.9 4.8 42 105-146 134-177 (379)
448 PF06957 COPI_C: Coatomer (COP 23.0 2.1E+02 0.0046 22.5 4.7 29 105-133 207-235 (422)
449 TIGR02757 conserved hypothetic 23.0 53 0.0012 23.4 1.4 36 103-138 182-217 (229)
450 KOG2518|consensus 22.6 4.3E+02 0.0093 21.6 6.8 33 102-134 96-128 (556)
451 PF04010 DUF357: Protein of un 22.3 1.7E+02 0.0037 16.8 3.8 27 102-128 35-61 (75)
452 PHA01810 hypothetical protein 22.0 1E+02 0.0022 17.7 2.2 18 117-134 14-31 (100)
453 PF09674 DUF2400: Protein of u 21.9 55 0.0012 23.3 1.3 37 102-138 184-220 (232)
454 KOG4151|consensus 21.9 1.1E+02 0.0024 25.9 3.1 61 6-76 102-163 (748)
455 PF09311 Rab5-bind: Rabaptin-l 21.9 2.7E+02 0.0058 18.9 4.8 32 100-131 138-169 (181)
456 KOG4279|consensus 21.8 4E+02 0.0087 23.2 6.1 22 113-134 377-398 (1226)
457 TIGR02710 CRISPR-associated pr 21.8 1.9E+02 0.0041 22.4 4.1 41 107-147 135-180 (380)
458 cd00215 PTS_IIA_lac PTS_IIA, P 21.8 2E+02 0.0043 17.4 4.1 28 103-130 16-43 (97)
459 PF02255 PTS_IIA: PTS system, 21.7 2E+02 0.0043 17.3 4.2 28 102-129 14-41 (96)
460 COG4941 Predicted RNA polymera 21.7 3.8E+02 0.0083 20.7 7.7 39 102-140 365-403 (415)
461 PF08626 TRAPPC9-Trs120: Trans 21.6 77 0.0017 28.3 2.3 27 2-28 247-273 (1185)
462 KOG2518|consensus 21.6 79 0.0017 25.5 2.2 27 3-29 102-128 (556)
463 PRK09591 celC cellobiose phosp 21.5 2.1E+02 0.0045 17.6 4.1 28 103-130 21-48 (104)
464 KOG0686|consensus 21.4 2.2E+02 0.0047 22.5 4.3 30 102-131 150-179 (466)
465 COG5600 Transcription-associat 21.4 85 0.0018 24.3 2.2 29 2-30 225-253 (413)
466 PRK15326 type III secretion sy 21.3 1.9E+02 0.0041 17.0 4.4 25 12-36 22-46 (80)
467 TIGR00823 EIIA-LAC phosphotran 21.2 2.1E+02 0.0045 17.4 4.1 28 103-130 18-45 (99)
468 KOG2460|consensus 20.7 76 0.0017 25.6 1.9 23 3-25 428-450 (593)
469 KOG3807|consensus 20.5 4.1E+02 0.0089 20.6 9.1 26 3-28 281-306 (556)
470 PF10345 Cohesin_load: Cohesin 20.4 4.9E+02 0.011 21.4 7.8 113 4-146 308-459 (608)
471 PF08311 Mad3_BUB1_I: Mad3/BUB 20.3 2.4E+02 0.0052 17.8 10.7 28 102-129 99-126 (126)
472 COG1747 Uncharacterized N-term 20.2 5E+02 0.011 21.4 6.9 34 7-40 215-248 (711)
No 1
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=99.74 E-value=8.6e-17 Score=105.31 Aligned_cols=109 Identities=15% Similarity=0.037 Sum_probs=100.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
..|.++...|++++|+..|++++.++|++..++..+|.+ ..+| .+.+|+ ..+.+++.++|++
T Consensus 29 ~~g~~~~~~g~~~~A~~~~~~al~~~P~~~~a~~~lg~~~~~~g--------~~~~A~--~~y~~Al~l~p~~------- 91 (144)
T PRK15359 29 ASGYASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLK--------EYTTAI--NFYGHALMLDASH------- 91 (144)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHh--------hHHHHH--HHHHHHHhcCCCC-------
Confidence 368899999999999999999999999999999999999 9999 667788 7888888888865
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+.++.++|.++...|++++|+..|.++++++|+++..+.+++.+...
T Consensus 92 ---------------------~~a~~~lg~~l~~~g~~~eAi~~~~~Al~~~p~~~~~~~~~~~~~~~ 138 (144)
T PRK15359 92 ---------------------PEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYADASWSEIRQNAQIM 138 (144)
T ss_pred ---------------------cHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHH
Confidence 77899999999999999999999999999999999999999988754
No 2
>KOG0553|consensus
Probab=99.70 E-value=1.5e-16 Score=112.52 Aligned_cols=108 Identities=27% Similarity=0.310 Sum_probs=99.0
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
|.-+.+.++|++|+..|.+||+++|+|+-.|-+.+.+ ..+| .+..|+ .++..++.+||.+
T Consensus 88 GN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg--------~~~~AV--kDce~Al~iDp~y--------- 148 (304)
T KOG0553|consen 88 GNKLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLG--------EYEDAV--KDCESALSIDPHY--------- 148 (304)
T ss_pred HHHHHHhhhHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhc--------chHHHH--HHHHHHHhcChHH---------
Confidence 6778899999999999999999999999999999999 9999 677888 8999999999853
Q ss_pred HHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 83 VQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+.+|..+|.+++.+|++++|++.|.++|+++|++...+.|+.++-.++
T Consensus 149 -------------------skay~RLG~A~~~~gk~~~A~~aykKaLeldP~Ne~~K~nL~~Ae~~l 196 (304)
T KOG0553|consen 149 -------------------SKAYGRLGLAYLALGKYEEAIEAYKKALELDPDNESYKSNLKIAEQKL 196 (304)
T ss_pred -------------------HHHHHHHHHHHHccCcHHHHHHHHHhhhccCCCcHHHHHHHHHHHHHh
Confidence 788999999999999999999999999999999999999998876543
No 3
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=99.64 E-value=3.1e-15 Score=102.44 Aligned_cols=134 Identities=19% Similarity=0.140 Sum_probs=111.9
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.||.-|++.|++..|.+-+++||+.||++..+|..++.+ ...| +.+.|. ..+++++.++|+.-.++++.
T Consensus 40 qLal~YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~~G--------e~~~A~--e~YrkAlsl~p~~GdVLNNY 109 (250)
T COG3063 40 QLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLG--------ENDLAD--ESYRKALSLAPNNGDVLNNY 109 (250)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcC--------ChhhHH--HHHHHHHhcCCCccchhhhh
Confidence 588999999999999999999999999999999999999 9999 556666 89999999999999877661
Q ss_pred --------H---HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 81 --------S---YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 81 --------~---~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
. .+..+.....+|... ..+.+|.++|.+.++.|+++.|.++|+++|+++|+++.....++..+++
T Consensus 110 G~FLC~qg~~~eA~q~F~~Al~~P~Y~---~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~l~~a~~~~~ 185 (250)
T COG3063 110 GAFLCAQGRPEEAMQQFERALADPAYG---EPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYK 185 (250)
T ss_pred hHHHHhCCChHHHHHHHHHHHhCCCCC---CcchhhhhhHHHHhhcCCchhHHHHHHHHHHhCcCCChHHHHHHHHHHh
Confidence 1 222222222344332 3389999999999999999999999999999999999998888877765
No 4
>KOG4626|consensus
Probab=99.64 E-value=5.3e-15 Score=113.49 Aligned_cols=132 Identities=19% Similarity=0.260 Sum_probs=106.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
+||..+...|+...|+.+|++|+++||++.+++..||.+ ..++ .+.+|+ .-+.+++.+.|++...+.+
T Consensus 223 nLg~~f~~~Gei~~aiq~y~eAvkldP~f~dAYiNLGnV~ke~~--------~~d~Av--s~Y~rAl~lrpn~A~a~gNl 292 (966)
T KOG4626|consen 223 NLGCVFNAQGEIWLAIQHYEEAVKLDPNFLDAYINLGNVYKEAR--------IFDRAV--SCYLRALNLRPNHAVAHGNL 292 (966)
T ss_pred hcchHHhhcchHHHHHHHHHHhhcCCCcchHHHhhHHHHHHHHh--------cchHHH--HHHHHHHhcCCcchhhccce
Confidence 578888889999999999999999999999999999999 8888 566777 7788888888888765332
Q ss_pred -------hHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 80 -------PSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 80 -------~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+...-+| +.++.+|.. +.++.++++++-.+|+..+|+.+|++||.+.|+++.+.+|+|.+|.+
T Consensus 293 a~iYyeqG~ldlAI~~Ykral~~~P~F------~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~hadam~NLgni~~E 366 (966)
T KOG4626|consen 293 ACIYYEQGLLDLAIDTYKRALELQPNF------PDAYNNLANALKDKGSVTEAVDCYNKALRLCPNHADAMNNLGNIYRE 366 (966)
T ss_pred EEEEeccccHHHHHHHHHHHHhcCCCc------hHHHhHHHHHHHhccchHHHHHHHHHHHHhCCccHHHHHHHHHHHHH
Confidence 2222222 336667766 88899999999888999999999999999999999999999988877
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
+
T Consensus 367 ~ 367 (966)
T KOG4626|consen 367 Q 367 (966)
T ss_pred h
Confidence 5
No 5
>KOG4626|consensus
Probab=99.63 E-value=2.2e-15 Score=115.52 Aligned_cols=129 Identities=18% Similarity=0.232 Sum_probs=109.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
+||.+|.+.|++++|...|.+||++.|..+.++..|+.+ ...| .+.+|+ ..+.+++++.|.....+++
T Consensus 359 NLgni~~E~~~~e~A~~ly~~al~v~p~~aaa~nNLa~i~kqqg--------nl~~Ai--~~YkealrI~P~fAda~~Nm 428 (966)
T KOG4626|consen 359 NLGNIYREQGKIEEATRLYLKALEVFPEFAAAHNNLASIYKQQG--------NLDDAI--MCYKEALRIKPTFADALSNM 428 (966)
T ss_pred HHHHHHHHhccchHHHHHHHHHHhhChhhhhhhhhHHHHHHhcc--------cHHHHH--HHHHHHHhcCchHHHHHHhc
Confidence 689999999999999999999999999999999999999 8888 777888 8888999999987765433
Q ss_pred h-------HHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 80 P-------SYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 80 ~-------~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
+ ++..++ +.+...|.. ++++.++|.++...|+.++||..|+.||.++|+.+.++.|++.|+
T Consensus 429 Gnt~ke~g~v~~A~q~y~rAI~~nPt~------AeAhsNLasi~kDsGni~~AI~sY~~aLklkPDfpdA~cNllh~l 500 (966)
T KOG4626|consen 429 GNTYKEMGDVSAAIQCYTRAIQINPTF------AEAHSNLASIYKDSGNIPEAIQSYRTALKLKPDFPDAYCNLLHCL 500 (966)
T ss_pred chHHHHhhhHHHHHHHHHHHHhcCcHH------HHHHhhHHHHhhccCCcHHHHHHHHHHHccCCCCchhhhHHHHHH
Confidence 1 122222 235556655 999999999999999999999999999999999999999999886
No 6
>KOG0548|consensus
Probab=99.59 E-value=3.7e-14 Score=106.80 Aligned_cols=88 Identities=36% Similarity=0.569 Sum_probs=66.1
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
+|.+++.+|+|++|+..|.+.|+.||+|...+.+++.+ .... .....+.++.+...+..+|.....+.++.
T Consensus 76 ~Gaa~~~lg~~~eA~~ay~~GL~~d~~n~~L~~gl~~a~~~~~--------~~~~~~~~p~~~~~l~~~p~t~~~~~~~~ 147 (539)
T KOG0548|consen 76 KGAALFGLGDYEEAILAYSEGLEKDPSNKQLKTGLAQAYLEDY--------AADQLFTKPYFHEKLANLPLTNYSLSDPA 147 (539)
T ss_pred hHHHHHhcccHHHHHHHHHHHhhcCCchHHHHHhHHHhhhHHH--------HhhhhccCcHHHHHhhcChhhhhhhccHH
Confidence 68999999999999999999999999999999999887 3322 22344555777777777777666666666
Q ss_pred HHHHHHHHhcCcchhhh
Q psy3020 82 YVQMIKEIQKDPSLMTL 98 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~ 98 (150)
|...+..+.+.|+.+..
T Consensus 148 ~~~~l~~~~~~p~~l~~ 164 (539)
T KOG0548|consen 148 YVKILEIIQKNPTSLKL 164 (539)
T ss_pred HHHHHHHhhcCcHhhhc
Confidence 66666666666655444
No 7
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=99.56 E-value=5.7e-14 Score=111.13 Aligned_cols=132 Identities=12% Similarity=0.105 Sum_probs=106.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
.+|.++...|++++|+.+|+++++++|++..++..++.+ ..+| ++.+++ ..+.+++..+|++...+..
T Consensus 336 ~lg~~~~~~g~~~eA~~~~~kal~l~P~~~~~~~~la~~~~~~g--------~~~eA~--~~~~~al~~~p~~~~~~~~l 405 (615)
T TIGR00990 336 LRGTFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELG--------DPDKAE--EDFDKALKLNSEDPDIYYHR 405 (615)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCCCHHHHHHH
Confidence 468899999999999999999999999999999999999 8888 667777 7888888888887653211
Q ss_pred -------hHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 80 -------PSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 80 -------~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
..+...+ ..+..+|.. ...+..+|.++++.|++++|+..|.+++..+|+++.+++++|.++..
T Consensus 406 g~~~~~~g~~~~A~~~~~kal~l~P~~------~~~~~~la~~~~~~g~~~eA~~~~~~al~~~P~~~~~~~~lg~~~~~ 479 (615)
T TIGR00990 406 AQLHFIKGEFAQAGKDYQKSIDLDPDF------IFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLD 479 (615)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHcCccC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHH
Confidence 1111221 224445544 77889999999999999999999999999999999999999999876
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
+
T Consensus 480 ~ 480 (615)
T TIGR00990 480 Q 480 (615)
T ss_pred c
Confidence 5
No 8
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=99.56 E-value=8.6e-14 Score=103.52 Aligned_cols=108 Identities=16% Similarity=0.204 Sum_probs=98.3
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
.|...+..|+|++|++.|++||+++|+++.++..++.+ ..+| .+.+|+ ..+.+++.++|++
T Consensus 8 ~a~~a~~~~~~~~Ai~~~~~Al~~~P~~~~a~~~~a~~~~~~g--------~~~eAl--~~~~~Al~l~P~~-------- 69 (356)
T PLN03088 8 KAKEAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLG--------NFTEAV--ADANKAIELDPSL-------- 69 (356)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHhCcCC--------
Confidence 57889999999999999999999999999999999999 8888 778888 8888888888854
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+.++..+|.+++..|+|++|+..|+++++++|+++.+...++.|.-+
T Consensus 70 --------------------~~a~~~lg~~~~~lg~~~eA~~~~~~al~l~P~~~~~~~~l~~~~~k 116 (356)
T PLN03088 70 --------------------AKAYLRKGTACMKLEEYQTAKAALEKGASLAPGDSRFTKLIKECDEK 116 (356)
T ss_pred --------------------HHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Confidence 67899999999999999999999999999999999999999888543
No 9
>KOG1126|consensus
Probab=99.56 E-value=6e-15 Score=113.18 Aligned_cols=132 Identities=20% Similarity=0.261 Sum_probs=106.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC----
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF---- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~---- 76 (150)
.+|.||.-+++++.|+++|++|+++||+++++|.-+|-= .... +++.|+ ..++.++..+|+++.+
T Consensus 426 a~GNcfSLQkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~e--------e~d~a~--~~fr~Al~~~~rhYnAwYGl 495 (638)
T KOG1126|consen 426 ALGNCFSLQKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATE--------EFDKAM--KSFRKALGVDPRHYNAWYGL 495 (638)
T ss_pred HhcchhhhhhHHHHHHHHHHHhhccCCccchhhhhcCChhhhhH--------HHHhHH--HHHHhhhcCCchhhHHHHhh
Confidence 479999999999999999999999999999999988765 5555 677888 8999999999998852
Q ss_pred ----CCChHHHHH----HHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 77 ----LSDPSYVQM----IKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 77 ----~~~~~~~~~----~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+.+..+..+ -..++.+|.. ...+-..|.++.+.|+.++|+..|++|+.+||.|+...+++|.+++.
T Consensus 496 G~vy~Kqek~e~Ae~~fqkA~~INP~n------svi~~~~g~~~~~~k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il~~ 569 (638)
T KOG1126|consen 496 GTVYLKQEKLEFAEFHFQKAVEINPSN------SVILCHIGRIQHQLKRKDKALQLYEKAIHLDPKNPLCKYHRASILFS 569 (638)
T ss_pred hhheeccchhhHHHHHHHhhhcCCccc------hhHHhhhhHHHHHhhhhhHHHHHHHHHHhcCCCCchhHHHHHHHHHh
Confidence 222222211 1225556655 77788889999999999999999999999999999999999988875
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
+
T Consensus 570 ~ 570 (638)
T KOG1126|consen 570 L 570 (638)
T ss_pred h
Confidence 4
No 10
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=99.51 E-value=3.1e-13 Score=107.00 Aligned_cols=125 Identities=20% Similarity=0.213 Sum_probs=100.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
.+|.++...|++++|+..|+++++++|+++.++..+|.+ ..+| ++.+|+ ..+.+++.++|++...+..
T Consensus 370 ~la~~~~~~g~~~eA~~~~~~al~~~p~~~~~~~~lg~~~~~~g--------~~~~A~--~~~~kal~l~P~~~~~~~~l 439 (615)
T TIGR00990 370 KRASMNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKG--------EFAQAG--KDYQKSIDLDPDFIFSHIQL 439 (615)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHcCccCHHHHHHH
Confidence 578999999999999999999999999999999999999 9999 777888 8899999999986543211
Q ss_pred -------hHHHHHHH----HHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcH
Q psy3020 80 -------PSYVQMIK----EIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNR 142 (150)
Q Consensus 80 -------~~~~~~~~----~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~ 142 (150)
+.+...+. .+...|.. +..+..+|.++...|++++|++.|.++++++|++...+.+.
T Consensus 440 a~~~~~~g~~~eA~~~~~~al~~~P~~------~~~~~~lg~~~~~~g~~~~A~~~~~~Al~l~p~~~~~~~~~ 507 (615)
T TIGR00990 440 GVTQYKEGSIASSMATFRRCKKNFPEA------PDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETKPMYMNV 507 (615)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCCCC------hHHHHHHHHHHHHccCHHHHHHHHHHHHhcCCccccccccH
Confidence 11111211 13344544 88999999999999999999999999999999876655544
No 11
>PRK11189 lipoprotein NlpI; Provisional
Probab=99.50 E-value=5.2e-13 Score=97.10 Aligned_cols=97 Identities=14% Similarity=0.132 Sum_probs=88.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.+|.+|...|++++|+..|+++++++|+++.++..+|.+ ..+| ++.+|+ ..+.+++.++|++
T Consensus 69 ~~g~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~~~~~g--------~~~~A~--~~~~~Al~l~P~~------- 131 (296)
T PRK11189 69 ERGVLYDSLGLRALARNDFSQALALRPDMADAYNYLGIYLTQAG--------NFDAAY--EAFDSVLELDPTY------- 131 (296)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCCC-------
Confidence 468899999999999999999999999999999999999 9999 677888 7888888888864
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
..+|.++|.+++..|++++|++.|+++++++|+++
T Consensus 132 ---------------------~~a~~~lg~~l~~~g~~~eA~~~~~~al~~~P~~~ 166 (296)
T PRK11189 132 ---------------------NYAYLNRGIALYYGGRYELAQDDLLAFYQDDPNDP 166 (296)
T ss_pred ---------------------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCH
Confidence 66789999999999999999999999999999987
No 12
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=99.47 E-value=1e-12 Score=90.32 Aligned_cols=36 Identities=17% Similarity=0.301 Sum_probs=32.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
+.++..+|..+++.|+|++|+.+|+++++++|.+..
T Consensus 144 ~~al~~LA~~~~~~g~~~~Ai~~~~~aL~l~~~~~~ 179 (198)
T PRK10370 144 VTALMLLASDAFMQADYAQAIELWQKVLDLNSPRVN 179 (198)
T ss_pred hhHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCcc
Confidence 788999999999999999999999999999997654
No 13
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=99.46 E-value=2.7e-12 Score=82.68 Aligned_cols=100 Identities=18% Similarity=0.189 Sum_probs=87.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.+|.++...|++++|++.|++++.++|+++.++..++.+ ..+| ++.++. ..+..++..+|++
T Consensus 22 ~~a~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~--------~~~~A~--~~~~~~~~~~p~~------- 84 (135)
T TIGR02552 22 ALAYNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLK--------EYEEAI--DAYALAAALDPDD------- 84 (135)
T ss_pred HHHHHHHHcccHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHH--------HHHHHH--HHHHHHHhcCCCC-------
Confidence 578999999999999999999999999999999999999 8888 556666 5666666666643
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchh
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~ 139 (150)
+..+..+|.++...|++++|+..|+++++++|++....
T Consensus 85 ---------------------~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~ 122 (135)
T TIGR02552 85 ---------------------PRPYFHAAECLLALGEPESALKALDLAIEICGENPEYS 122 (135)
T ss_pred ---------------------hHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccccchHH
Confidence 67789999999999999999999999999999988743
No 14
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=99.45 E-value=1.7e-12 Score=89.53 Aligned_cols=138 Identities=14% Similarity=0.151 Sum_probs=100.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC--
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS-- 78 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~-- 78 (150)
.+|.++...|++++|++.|+++++.+|++..++..++.+ ..+| ++.++. ..+.+.+...|++.....
T Consensus 36 ~la~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~~--------~~~~A~--~~~~~al~~~~~~~~~~~~~ 105 (234)
T TIGR02521 36 QLALGYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLG--------ELEKAE--DSFRRALTLNPNNGDVLNNY 105 (234)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHhhCCCCHHHHHHH
Confidence 578999999999999999999999999999999999999 8888 666777 777777877777543210
Q ss_pred ------ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 79 ------DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 79 ------~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.+.+...+..+.........+.....+..+|.+++..|++++|+..|.+++..+|+++..+..+|.++..+
T Consensus 106 ~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~~~ 182 (234)
T TIGR02521 106 GTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLR 182 (234)
T ss_pred HHHHHHcccHHHHHHHHHHHHhccccccchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCChHHHHHHHHHHHHc
Confidence 01111111111110000000012567788899999999999999999999999999988888898888764
No 15
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=99.45 E-value=8.9e-13 Score=86.30 Aligned_cols=94 Identities=12% Similarity=0.102 Sum_probs=81.1
Q ss_pred HHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcc
Q psy3020 16 SISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPS 94 (150)
Q Consensus 16 A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~ 94 (150)
-...|+++++++|++ +..+|.+ ...| .+.++. ..+..++..+|.+
T Consensus 12 ~~~~~~~al~~~p~~---~~~~g~~~~~~g--------~~~~A~--~~~~~al~~~P~~--------------------- 57 (144)
T PRK15359 12 PEDILKQLLSVDPET---VYASGYASWQEG--------DYSRAV--IDFSWLVMAQPWS--------------------- 57 (144)
T ss_pred HHHHHHHHHHcCHHH---HHHHHHHHHHcC--------CHHHHH--HHHHHHHHcCCCc---------------------
Confidence 347899999999997 4457777 8888 667777 7777777777754
Q ss_pred hhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 95 LMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 95 ~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
+.+|..+|.++...|++++|+..|.++++++|+++..++++|.|+..++
T Consensus 58 -------~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~~~a~~~lg~~l~~~g 106 (144)
T PRK15359 58 -------WRAHIALAGTWMMLKEYTTAINFYGHALMLDASHPEPVYQTGVCLKMMG 106 (144)
T ss_pred -------HHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHcC
Confidence 7889999999999999999999999999999999999999999998753
No 16
>PRK11189 lipoprotein NlpI; Provisional
Probab=99.43 E-value=3.2e-12 Score=93.01 Aligned_cols=129 Identities=13% Similarity=0.029 Sum_probs=92.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC----
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF---- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~---- 76 (150)
.+|.++...|++++|+..|+++++++|++..++..+|.+ ...| .+.+|+ ..+.+.+..+|++...
T Consensus 103 ~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g--------~~~eA~--~~~~~al~~~P~~~~~~~~~ 172 (296)
T PRK11189 103 YLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGG--------RYELAQ--DDLLAFYQDDPNDPYRALWL 172 (296)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCCCHHHHHHH
Confidence 579999999999999999999999999999999999999 8888 566777 7777777778876410
Q ss_pred -----CCC-hHHHHHH-HHHhc-Ccchhh------------------h------------HHHhHHHHHHHHHHHHhcCH
Q psy3020 77 -----LSD-PSYVQMI-KEIQK-DPSLMT------------------L------------EAKISLLKDKGNAALQANNF 118 (150)
Q Consensus 77 -----~~~-~~~~~~~-~~l~~-~~~~~~------------------~------------~~~~~~~~~~g~~~~~~~~~ 118 (150)
..+ ......+ ..+.. ++.... . +....+|..+|.++.+.|++
T Consensus 173 ~l~~~~~~~~~A~~~l~~~~~~~~~~~~~~~~~~~~lg~~~~~~~~~~~~~~~~~~~~l~~~~~ea~~~Lg~~~~~~g~~ 252 (296)
T PRK11189 173 YLAESKLDPKQAKENLKQRYEKLDKEQWGWNIVEFYLGKISEETLMERLKAGATDNTELAERLCETYFYLAKYYLSLGDL 252 (296)
T ss_pred HHHHccCCHHHHHHHHHHHHhhCCccccHHHHHHHHccCCCHHHHHHHHHhcCCCcHHHHHHHHHHHHHHHHHHHHCCCH
Confidence 011 0111111 00111 111000 0 11257899999999999999
Q ss_pred HHHHHHHHHHHhhcC-CCcchhh
Q psy3020 119 KEAIEAYSEAIKLDG-TNHILFS 140 (150)
Q Consensus 119 ~~A~~~~~~al~~~p-~~~~~~~ 140 (150)
++|+.+|+++++++| +.....+
T Consensus 253 ~~A~~~~~~Al~~~~~~~~e~~~ 275 (296)
T PRK11189 253 DEAAALFKLALANNVYNFVEHRY 275 (296)
T ss_pred HHHHHHHHHHHHhCCchHHHHHH
Confidence 999999999999997 5444444
No 17
>PRK12370 invasion protein regulator; Provisional
Probab=99.43 E-value=2.8e-12 Score=100.47 Aligned_cols=123 Identities=14% Similarity=0.014 Sum_probs=91.3
Q ss_pred CCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC--------hH
Q psy3020 11 GRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD--------PS 81 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~--------~~ 81 (150)
+++++|+..+++|++++|+++.++..+|.+ ...| ++.+|. ..+.+++.++|++...+.. +.
T Consensus 318 ~~~~~A~~~~~~Al~ldP~~~~a~~~lg~~~~~~g--------~~~~A~--~~~~~Al~l~P~~~~a~~~lg~~l~~~G~ 387 (553)
T PRK12370 318 NAMIKAKEHAIKATELDHNNPQALGLLGLINTIHS--------EYIVGS--LLFKQANLLSPISADIKYYYGWNLFMAGQ 387 (553)
T ss_pred hHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHcc--------CHHHHH--HHHHHHHHhCCCCHHHHHHHHHHHHHCCC
Confidence 348899999999999999999999999998 8888 667777 8888889999987653210 11
Q ss_pred HHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc-CCCcchhhcHHHHHHhc
Q psy3020 82 YVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLD-GTNHILFSNRSAAFAKA 149 (150)
Q Consensus 82 ~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~-p~~~~~~~n~a~~~~~~ 149 (150)
+.+.+ ..+..+|.. +..+..++.+++..|++++|+..+.+++..+ |+++.++.++|.+|..+
T Consensus 388 ~~eAi~~~~~Al~l~P~~------~~~~~~~~~~~~~~g~~eeA~~~~~~~l~~~~p~~~~~~~~la~~l~~~ 454 (553)
T PRK12370 388 LEEALQTINECLKLDPTR------AAAGITKLWITYYHTGIDDAIRLGDELRSQHLQDNPILLSMQVMFLSLK 454 (553)
T ss_pred HHHHHHHHHHHHhcCCCC------hhhHHHHHHHHHhccCHHHHHHHHHHHHHhccccCHHHHHHHHHHHHhC
Confidence 11111 123344544 4455566777788899999999999998875 78888889999888764
No 18
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=99.41 E-value=3.9e-12 Score=87.45 Aligned_cols=102 Identities=13% Similarity=0.125 Sum_probs=90.4
Q ss_pred hCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHH
Q psy3020 10 LGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKE 88 (150)
Q Consensus 10 ~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 88 (150)
.++.++++..++++|+.+|+|+++|..+|.+ ..+| ++..|+ ..+.+++.++|++
T Consensus 52 ~~~~~~~i~~l~~~L~~~P~~~~~w~~Lg~~~~~~g--------~~~~A~--~a~~~Al~l~P~~--------------- 106 (198)
T PRK10370 52 QQTPEAQLQALQDKIRANPQNSEQWALLGEYYLWRN--------DYDNAL--LAYRQALQLRGEN--------------- 106 (198)
T ss_pred chhHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCCC---------------
Confidence 5667899999999999999999999999999 8999 677888 7888888888864
Q ss_pred HhcCcchhhhHHHhHHHHHHHHHH-HHhcC--HHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 89 IQKDPSLMTLEAKISLLKDKGNAA-LQANN--FKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 89 l~~~~~~~~~~~~~~~~~~~g~~~-~~~~~--~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..+..+|.++ +..|+ +++|.+.++++++++|+++.+++++|.+++++
T Consensus 107 -------------~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~~~al~~LA~~~~~~ 157 (198)
T PRK10370 107 -------------AELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALMLLASDAFMQ 157 (198)
T ss_pred -------------HHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCChhHHHHHHHHHHHc
Confidence 67788889875 57777 59999999999999999999999999999875
No 19
>PRK12370 invasion protein regulator; Provisional
Probab=99.41 E-value=4.5e-12 Score=99.34 Aligned_cols=132 Identities=14% Similarity=-0.027 Sum_probs=105.9
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC--
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS-- 78 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~-- 78 (150)
.+|.++...|++++|+..|++|++++|+++.++..+|.+ ...| ++.+|+ ..+.+++.++|.+..+..
T Consensus 343 ~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~G--------~~~eAi--~~~~~Al~l~P~~~~~~~~~ 412 (553)
T PRK12370 343 LLGLINTIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAG--------QLEEAL--QTINECLKLDPTRAAAGITK 412 (553)
T ss_pred HHHHHHHHccCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHhcCCCChhhHHHH
Confidence 478899999999999999999999999999999999999 8889 677888 889999999998764310
Q ss_pred ------ChHHHHHH---HH-Hhc-CcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 79 ------DPSYVQMI---KE-IQK-DPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 79 ------~~~~~~~~---~~-l~~-~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
...+..++ +. +.. .|.. +..+..+|.++...|++++|...+.+.+...|++..++..++..|+
T Consensus 413 ~~~~~~~g~~eeA~~~~~~~l~~~~p~~------~~~~~~la~~l~~~G~~~eA~~~~~~~~~~~~~~~~~~~~l~~~~~ 486 (553)
T PRK12370 413 LWITYYHTGIDDAIRLGDELRSQHLQDN------PILLSMQVMFLSLKGKHELARKLTKEISTQEITGLIAVNLLYAEYC 486 (553)
T ss_pred HHHHHhccCHHHHHHHHHHHHHhccccC------HHHHHHHHHHHHhCCCHHHHHHHHHHhhhccchhHHHHHHHHHHHh
Confidence 01111111 11 111 1323 6778999999999999999999999999999999999999988887
Q ss_pred hc
Q psy3020 148 KA 149 (150)
Q Consensus 148 ~~ 149 (150)
.+
T Consensus 487 ~~ 488 (553)
T PRK12370 487 QN 488 (553)
T ss_pred cc
Confidence 54
No 20
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=99.41 E-value=4.5e-12 Score=100.96 Aligned_cols=110 Identities=6% Similarity=-0.018 Sum_probs=95.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
+||.+..+.|++++|...++.++++.|++..++..++.+ ...+ .+++|+ ..+.+.+..+|++
T Consensus 91 ~La~i~~~~g~~~ea~~~l~~~~~~~Pd~~~a~~~~a~~L~~~~--------~~eeA~--~~~~~~l~~~p~~------- 153 (694)
T PRK15179 91 LVARALEAAHRSDEGLAVWRGIHQRFPDSSEAFILMLRGVKRQQ--------GIEAGR--AEIELYFSGGSSS------- 153 (694)
T ss_pred HHHHHHHHcCCcHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHhc--------cHHHHH--HHHHHHhhcCCCC-------
Confidence 589999999999999999999999999999999999999 9988 778888 8888888888865
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..+..+|.++.+.|+|++|+++|++++.-+|+++.++.++|.++.+.
T Consensus 154 ---------------------~~~~~~~a~~l~~~g~~~~A~~~y~~~~~~~p~~~~~~~~~a~~l~~~ 201 (694)
T PRK15179 154 ---------------------AREILLEAKSWDEIGQSEQADACFERLSRQHPEFENGYVGWAQSLTRR 201 (694)
T ss_pred ---------------------HHHHHHHHHHHHHhcchHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHc
Confidence 667778888888888888888888888888888888888888887754
No 21
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=99.40 E-value=5.8e-12 Score=103.67 Aligned_cols=130 Identities=8% Similarity=0.005 Sum_probs=69.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL---- 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~---- 77 (150)
+|.++...|++++|+..|+++++++|++...+..++.. ..+| .+.+|. ..+.+++..+|+ ...+
T Consensus 548 la~all~~Gd~~eA~~~l~qAL~l~P~~~~l~~~La~~l~~~G--------r~~eAl--~~~~~AL~l~P~-~~a~~~LA 616 (987)
T PRK09782 548 AANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLHAQRYIPG--------QPELAL--NDLTRSLNIAPS-ANAYVARA 616 (987)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHhcCCccHHHHHHHHHHHHhCC--------CHHHHH--HHHHHHHHhCCC-HHHHHHHH
Confidence 45555666666666666666666666655554444444 4445 444555 555555555553 2110
Q ss_pred ------CCh-HHHHHH-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 78 ------SDP-SYVQMI-KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 78 ------~~~-~~~~~~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.+. .....+ ..+..+|+. +..+.++|.++...|++++|+..|.++++++|+++.+++++|.++..+
T Consensus 617 ~~l~~lG~~deA~~~l~~AL~l~Pd~------~~a~~nLG~aL~~~G~~eeAi~~l~~AL~l~P~~~~a~~nLA~al~~l 690 (987)
T PRK09782 617 TIYRQRHNVPAAVSDLRAALELEPNN------SNYQAALGYALWDSGDIAQSREMLERAHKGLPDDPALIRQLAYVNQRL 690 (987)
T ss_pred HHHHHCCCHHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHC
Confidence 000 000111 113334433 566666666666666666666666666666666666666666666543
No 22
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=99.35 E-value=1.4e-11 Score=103.60 Aligned_cols=131 Identities=12% Similarity=0.078 Sum_probs=100.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL---- 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~---- 77 (150)
+|.++...|++++|++.|+++++++|+++.++..++.+ ...| ++.+|. ..+.+.+..+|++....
T Consensus 467 ~a~~~~~~g~~~eA~~~~~~Al~~~P~~~~~~~~LA~~~~~~G--------~~~~A~--~~l~~al~~~P~~~~~~~a~a 536 (1157)
T PRK11447 467 QAEALENQGKWAQAAELQRQRLALDPGSVWLTYRLAQDLRQAG--------QRSQAD--ALMRRLAQQKPNDPEQVYAYG 536 (1157)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHcCCCCHHHHHHHH
Confidence 57788899999999999999999999999999999999 8999 566776 67777777777665421
Q ss_pred ------CC-hHHHH-----------------------------------------HHHHHhcCcchhhhHHHhHHHHHHH
Q psy3020 78 ------SD-PSYVQ-----------------------------------------MIKEIQKDPSLMTLEAKISLLKDKG 109 (150)
Q Consensus 78 ------~~-~~~~~-----------------------------------------~~~~l~~~~~~~~~~~~~~~~~~~g 109 (150)
.+ ..... ++..+...|.. +..+..+|
T Consensus 537 l~l~~~~~~~~Al~~l~~l~~~~~~~~~~~l~~~l~~~~~l~~a~~l~~~G~~~eA~~~l~~~p~~------~~~~~~La 610 (1157)
T PRK11447 537 LYLSGSDRDRAALAHLNTLPRAQWNSNIQELAQRLQSDQVLETANRLRDSGKEAEAEALLRQQPPS------TRIDLTLA 610 (1157)
T ss_pred HHHHhCCCHHHHHHHHHhCCchhcChhHHHHHHHHhhhHHHHHHHHHHHCCCHHHHHHHHHhCCCC------chHHHHHH
Confidence 00 00000 11111222222 56778899
Q ss_pred HHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 110 NAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 110 ~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.++.+.|++++|++.|+++++++|+++.+++++|.+|..+
T Consensus 611 ~~~~~~g~~~~A~~~y~~al~~~P~~~~a~~~la~~~~~~ 650 (1157)
T PRK11447 611 DWAQQRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQ 650 (1157)
T ss_pred HHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHC
Confidence 9999999999999999999999999999999999998764
No 23
>PLN02789 farnesyltranstransferase
Probab=99.35 E-value=2.4e-11 Score=89.06 Aligned_cols=129 Identities=9% Similarity=0.081 Sum_probs=101.9
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC-Ch--
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS-DP-- 80 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~-~~-- 80 (150)
.++...+++++|+..+.++|+++|++..+|..++.+ ..+|. .+.+++ ..+.+.+..+|++..... +.
T Consensus 45 a~l~~~e~serAL~lt~~aI~lnP~~ytaW~~R~~iL~~L~~-------~l~eeL--~~~~~~i~~npknyqaW~~R~~~ 115 (320)
T PLN02789 45 AVYASDERSPRALDLTADVIRLNPGNYTVWHFRRLCLEALDA-------DLEEEL--DFAEDVAEDNPKNYQIWHHRRWL 115 (320)
T ss_pred HHHHcCCCCHHHHHHHHHHHHHCchhHHHHHHHHHHHHHcch-------hHHHHH--HHHHHHHHHCCcchHHhHHHHHH
Confidence 456778899999999999999999999999999998 77762 356667 677778888888765311 00
Q ss_pred ----------HHHHHH-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 81 ----------SYVQMI-KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 81 ----------~~~~~~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
.-.... ..++.+|.. ..+|..+|.++...|+|++++++++++|++||.|..+|++|+.++..
T Consensus 116 l~~l~~~~~~~el~~~~kal~~dpkN------y~AW~~R~w~l~~l~~~~eeL~~~~~~I~~d~~N~sAW~~R~~vl~~ 188 (320)
T PLN02789 116 AEKLGPDAANKELEFTRKILSLDAKN------YHAWSHRQWVLRTLGGWEDELEYCHQLLEEDVRNNSAWNQRYFVITR 188 (320)
T ss_pred HHHcCchhhHHHHHHHHHHHHhCccc------HHHHHHHHHHHHHhhhHHHHHHHHHHHHHHCCCchhHHHHHHHHHHh
Confidence 011111 224445544 89999999999999999999999999999999999999999988754
No 24
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=99.35 E-value=3.5e-12 Score=91.91 Aligned_cols=48 Identities=25% Similarity=0.327 Sum_probs=41.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..|..+|.++...|++++|+.+|++++..+|+|+..+.+.|.++.+.
T Consensus 214 ~~~~~~la~~~~~lg~~~~Al~~~~~~~~~~p~d~~~~~~~a~~l~~~ 261 (280)
T PF13429_consen 214 PDLWDALAAAYLQLGRYEEALEYLEKALKLNPDDPLWLLAYADALEQA 261 (280)
T ss_dssp CCHCHHHHHHHHHHT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHT--
T ss_pred HHHHHHHHHHhccccccccccccccccccccccccccccccccccccc
Confidence 566788999999999999999999999999999999999999998764
No 25
>KOG0548|consensus
Probab=99.35 E-value=1.1e-11 Score=93.64 Aligned_cols=107 Identities=22% Similarity=0.330 Sum_probs=95.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
-|.+++..|+|..|+.+|.+||+.+|+++..+-+++.| -.++ .+..++ .++..++.++|+.
T Consensus 364 kGne~Fk~gdy~~Av~~YteAIkr~P~Da~lYsNRAac~~kL~--------~~~~aL--~Da~~~ieL~p~~-------- 425 (539)
T KOG0548|consen 364 KGNEAFKKGDYPEAVKHYTEAIKRDPEDARLYSNRAACYLKLG--------EYPEAL--KDAKKCIELDPNF-------- 425 (539)
T ss_pred HHHHHHhccCHHHHHHHHHHHHhcCCchhHHHHHHHHHHHHHh--------hHHHHH--HHHHHHHhcCchH--------
Confidence 48899999999999999999999999999999999999 8999 677888 8888888888843
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
..+|...|.+++...+|+.|+++|.+++++||++..+.-....|+.
T Consensus 426 --------------------~kgy~RKg~al~~mk~ydkAleay~eale~dp~~~e~~~~~~rc~~ 471 (539)
T KOG0548|consen 426 --------------------IKAYLRKGAALRAMKEYDKALEAYQEALELDPSNAEAIDGYRRCVE 471 (539)
T ss_pred --------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhHHHHHHHHHHHH
Confidence 7788999999999999999999999999999999877666666654
No 26
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=99.35 E-value=1.5e-11 Score=101.28 Aligned_cols=127 Identities=9% Similarity=0.129 Sum_probs=99.4
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC----
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD---- 79 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~---- 79 (150)
..+...|++++|+..|+++++++|+ +.++..++.+ ..+| .+.+|+ ..+.+++.++|++..++..
T Consensus 584 ~~l~~~Gr~~eAl~~~~~AL~l~P~-~~a~~~LA~~l~~lG--------~~deA~--~~l~~AL~l~Pd~~~a~~nLG~a 652 (987)
T PRK09782 584 AQRYIPGQPELALNDLTRSLNIAPS-ANAYVARATIYRQRH--------NVPAAV--SDLRAALELEPNNSNYQAALGYA 652 (987)
T ss_pred HHHHhCCCHHHHHHHHHHHHHhCCC-HHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCCCHHHHHHHHHH
Confidence 3444559999999999999999996 8889999988 8888 667777 8888889999988754211
Q ss_pred ----hHHHHH---H-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 80 ----PSYVQM---I-KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 80 ----~~~~~~---~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
..+... + ..+...|.. +.++.++|.++...|++++|+.+|+++++++|+++....-.|.+...
T Consensus 653 L~~~G~~eeAi~~l~~AL~l~P~~------~~a~~nLA~al~~lGd~~eA~~~l~~Al~l~P~~a~i~~~~g~~~~~ 723 (987)
T PRK09782 653 LWDSGDIAQSREMLERAHKGLPDD------PALIRQLAYVNQRLDDMAATQHYARLVIDDIDNQALITPLTPEQNQQ 723 (987)
T ss_pred HHHCCCHHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCCchhhhhhhHHHHH
Confidence 111121 1 224456655 89999999999999999999999999999999998888777766543
No 27
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=99.34 E-value=2.1e-11 Score=102.55 Aligned_cols=129 Identities=16% Similarity=0.176 Sum_probs=103.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL---- 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~---- 77 (150)
.|.++...|++++|+..|+++++++|++++++..++.+ ...| ++.+|+ ..+.+++..+|+.....
T Consensus 275 ~G~~~~~~g~~~~A~~~l~~aL~~~P~~~~a~~~Lg~~~~~~g--------~~~eA~--~~l~~Al~~~p~~~~~~~~~~ 344 (1157)
T PRK11447 275 QGLAAVDSGQGGKAIPELQQAVRANPKDSEALGALGQAYSQQG--------DRARAV--AQFEKALALDPHSSNRDKWES 344 (1157)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHhCCCccchhHHHH
Confidence 48889999999999999999999999999999999999 8889 677888 88888888888765310
Q ss_pred ------------------CChHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 78 ------------------SDPSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 78 ------------------~~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
....+..++ +.+..+|.. +.++..+|.++...|++++|+++|+++++++|++
T Consensus 345 ll~~~~~~~~~~~g~~~~~~g~~~eA~~~~~~Al~~~P~~------~~a~~~Lg~~~~~~g~~~eA~~~y~~aL~~~p~~ 418 (1157)
T PRK11447 345 LLKVNRYWLLIQQGDAALKANNLAQAERLYQQARQVDNTD------SYAVLGLGDVAMARKDYAAAERYYQQALRMDPGN 418 (1157)
T ss_pred HHHhhhHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCC
Confidence 000011111 113334444 7888999999999999999999999999999999
Q ss_pred cchhhcHHHHHH
Q psy3020 136 HILFSNRSAAFA 147 (150)
Q Consensus 136 ~~~~~n~a~~~~ 147 (150)
+.++.+++.+|.
T Consensus 419 ~~a~~~L~~l~~ 430 (1157)
T PRK11447 419 TNAVRGLANLYR 430 (1157)
T ss_pred HHHHHHHHHHHH
Confidence 999988888774
No 28
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=99.34 E-value=2.5e-11 Score=96.88 Aligned_cols=135 Identities=15% Similarity=0.179 Sum_probs=99.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC---
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS--- 78 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~--- 78 (150)
+|.++...|++++|+..|+++++++|+++.++..+|.+ ...|+.+ ....+|. ..+.+++..+|++.....
T Consensus 218 l~~~l~~~g~~~eA~~~~~~al~~~p~~~~~~~~Lg~~l~~~G~~~----eA~~~A~--~~~~~Al~l~P~~~~a~~~lg 291 (656)
T PRK15174 218 AVDTLCAVGKYQEAIQTGESALARGLDGAALRRSLGLAYYQSGRSR----EAKLQAA--EHWRHALQFNSDNVRIVTLYA 291 (656)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHcCCch----hhHHHHH--HHHHHHHhhCCCCHHHHHHHH
Confidence 46788899999999999999999999999999999999 8888432 0112356 778888888998664211
Q ss_pred -----ChHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 79 -----DPSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 79 -----~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...+...+ ..+..+|.. +..+..+|.++...|++++|+..|.++++.+|+++..+..+|.++..+
T Consensus 292 ~~l~~~g~~~eA~~~l~~al~l~P~~------~~a~~~La~~l~~~G~~~eA~~~l~~al~~~P~~~~~~~~~a~al~~~ 365 (656)
T PRK15174 292 DALIRTGQNEKAIPLLQQSLATHPDL------PYVRAMYARALRQVGQYTAASDEFVQLAREKGVTSKWNRYAAAALLQA 365 (656)
T ss_pred HHHHHCCCHHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccchHHHHHHHHHHHHC
Confidence 11111111 113345544 778888999999999999999999999999998887777778777654
No 29
>KOG0624|consensus
Probab=99.33 E-value=3.5e-12 Score=92.42 Aligned_cols=132 Identities=19% Similarity=0.253 Sum_probs=108.2
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
+.||...|.+..||.+...|-++..||.+.++.++.+ +..| +.+.++ ..++++++++|+++.++ .+
T Consensus 196 akc~i~~~e~k~AI~Dlk~askLs~DnTe~~ykis~L~Y~vg--------d~~~sL--~~iRECLKldpdHK~Cf---~~ 262 (504)
T KOG0624|consen 196 AKCYIAEGEPKKAIHDLKQASKLSQDNTEGHYKISQLLYTVG--------DAENSL--KEIRECLKLDPDHKLCF---PF 262 (504)
T ss_pred HHHHHhcCcHHHHHHHHHHHHhccccchHHHHHHHHHHHhhh--------hHHHHH--HHHHHHHccCcchhhHH---HH
Confidence 6799999999999999999999999999999999999 9999 778888 99999999999999987 67
Q ss_pred HHHHHHHhcCcchhhhHHH-------------------------hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 83 VQMIKEIQKDPSLMTLEAK-------------------------ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~-------------------------~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
++.++.+.+...++....+ ...+..+..++..-+++.+||+.++++|+++|++..
T Consensus 263 YKklkKv~K~les~e~~ie~~~~t~cle~ge~vlk~ep~~~~ir~~~~r~~c~C~~~d~~~~eAiqqC~evL~~d~~dv~ 342 (504)
T KOG0624|consen 263 YKKLKKVVKSLESAEQAIEEKHWTECLEAGEKVLKNEPEETMIRYNGFRVLCTCYREDEQFGEAIQQCKEVLDIDPDDVQ 342 (504)
T ss_pred HHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCCcccceeeeeeheeeecccccCCHHHHHHHHHHHHhcCchHHH
Confidence 7777776666555544111 112222333445678999999999999999999999
Q ss_pred hhhcHHHHHHh
Q psy3020 138 LFSNRSAAFAK 148 (150)
Q Consensus 138 ~~~n~a~~~~~ 148 (150)
.+..||.+|+.
T Consensus 343 ~l~dRAeA~l~ 353 (504)
T KOG0624|consen 343 VLCDRAEAYLG 353 (504)
T ss_pred HHHHHHHHHhh
Confidence 99999998863
No 30
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=99.33 E-value=3.4e-11 Score=78.88 Aligned_cols=95 Identities=11% Similarity=0.016 Sum_probs=86.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.+|..+...|++++|...|+.+..+||.++..|++||.+ ..+| .+.+|+ .-+..++.++|++
T Consensus 40 ~~A~~ly~~G~l~~A~~~f~~L~~~Dp~~~~y~~gLG~~~Q~~g--------~~~~AI--~aY~~A~~L~~dd------- 102 (157)
T PRK15363 40 RYAMQLMEVKEFAGAARLFQLLTIYDAWSFDYWFRLGECCQAQK--------HWGEAI--YAYGRAAQIKIDA------- 102 (157)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHHh--------hHHHHH--HHHHHHHhcCCCC-------
Confidence 468889999999999999999999999999999999999 9999 778888 7888888888865
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
+.++.+.|.+++..|+.+.|.++|+.++.+--.
T Consensus 103 ---------------------p~~~~~ag~c~L~lG~~~~A~~aF~~Ai~~~~~ 135 (157)
T PRK15363 103 ---------------------PQAPWAAAECYLACDNVCYAIKALKAVVRICGE 135 (157)
T ss_pred ---------------------chHHHHHHHHHHHcCCHHHHHHHHHHHHHHhcc
Confidence 678899999999999999999999999998733
No 31
>KOG1155|consensus
Probab=99.32 E-value=2e-11 Score=91.20 Aligned_cols=131 Identities=15% Similarity=0.170 Sum_probs=101.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC-CCCh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF-LSDP 80 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~-~~~~ 80 (150)
+|+.|.-.++.++|+..|++||++||....+|.-+|-= ..|. -...|+ ..+++++.++|.+... +.=+
T Consensus 336 IaNYYSlr~eHEKAv~YFkRALkLNp~~~~aWTLmGHEyvEmK--------Nt~AAi--~sYRrAvdi~p~DyRAWYGLG 405 (559)
T KOG1155|consen 336 IANYYSLRSEHEKAVMYFKRALKLNPKYLSAWTLMGHEYVEMK--------NTHAAI--ESYRRAVDINPRDYRAWYGLG 405 (559)
T ss_pred ehhHHHHHHhHHHHHHHHHHHHhcCcchhHHHHHhhHHHHHhc--------ccHHHH--HHHHHHHhcCchhHHHHhhhh
Confidence 57788888999999999999999999999999888876 6666 445677 8899999999987652 2111
Q ss_pred HHHHH-----------HHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 81 SYVQM-----------IKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 81 ~~~~~-----------~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+.+.. -+...-.|.+ ...|..+|.++.+.++.++|++||.+++.....+-.++..+|.+|-++
T Consensus 406 QaYeim~Mh~YaLyYfqkA~~~kPnD------sRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~~~l~~LakLye~l 479 (559)
T KOG1155|consen 406 QAYEIMKMHFYALYYFQKALELKPND------SRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEGSALVRLAKLYEEL 479 (559)
T ss_pred HHHHHhcchHHHHHHHHHHHhcCCCc------hHHHHHHHHHHHHhccHHHHHHHHHHHHhccccchHHHHHHHHHHHHH
Confidence 11111 0112233433 889999999999999999999999999999999999999999988654
No 32
>KOG1125|consensus
Probab=99.32 E-value=1.4e-11 Score=93.82 Aligned_cols=48 Identities=23% Similarity=0.203 Sum_probs=35.9
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...|..+|-.+.-..+.++||..|++|+++.|+...++||+|++++.+
T Consensus 464 ~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNl 511 (579)
T KOG1125|consen 464 YLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCMNL 511 (579)
T ss_pred HHHHHHhhHHhcCCcccHHHHHHHHHHHhcCCCeeeeehhhhhhhhhh
Confidence 666777777777777777777777777777777777777777777654
No 33
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=99.31 E-value=3.4e-11 Score=77.52 Aligned_cols=94 Identities=17% Similarity=0.175 Sum_probs=80.8
Q ss_pred HHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchh
Q psy3020 18 STYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLM 96 (150)
Q Consensus 18 ~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 96 (150)
+.|++++.++|++..+...++.+ ...| ++.++. ..+...+..+|++
T Consensus 4 ~~~~~~l~~~p~~~~~~~~~a~~~~~~~--------~~~~A~--~~~~~~~~~~p~~----------------------- 50 (135)
T TIGR02552 4 ATLKDLLGLDSEQLEQIYALAYNLYQQG--------RYDEAL--KLFQLLAAYDPYN----------------------- 50 (135)
T ss_pred hhHHHHHcCChhhHHHHHHHHHHHHHcc--------cHHHHH--HHHHHHHHhCCCc-----------------------
Confidence 57899999999999999999998 8888 556666 5555555555542
Q ss_pred hhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 97 TLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 97 ~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..|..+|.+++..|++++|+.+|.+++.++|+++..++++|.+|..+
T Consensus 51 -----~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~ 98 (135)
T TIGR02552 51 -----SRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLAL 98 (135)
T ss_pred -----HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHc
Confidence 778999999999999999999999999999999999999999999865
No 34
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=99.31 E-value=4.4e-11 Score=84.00 Aligned_cols=134 Identities=12% Similarity=0.038 Sum_probs=77.2
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHH---HHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQ---MKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~---a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~ 78 (150)
+|.+|...|++++|+..|+++++.+|+++. ++..++.+ .............+..+. ..+.+.+..+|++....
T Consensus 76 la~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~a~~~~g~~~~~~~~~~~~~~~~~~~A~--~~~~~~~~~~p~~~~~~- 152 (235)
T TIGR03302 76 LAYAYYKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYNQIDRVDRDQTAAREAF--EAFQELIRRYPNSEYAP- 152 (235)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHHCcCCCchHHHHHHHHHHHHHhcccccCCHHHHHHHH--HHHHHHHHHCCCChhHH-
Confidence 556666666666666666666666665554 34455554 332000000000344455 55555566666543321
Q ss_pred ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc---chhhcHHHHHHhc
Q psy3020 79 DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH---ILFSNRSAAFAKA 149 (150)
Q Consensus 79 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~---~~~~n~a~~~~~~ 149 (150)
.....+..+... .......+|..++..|++.+|+..|++++...|+++ .+++++|.++..+
T Consensus 153 --~a~~~~~~~~~~--------~~~~~~~~a~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~a~~~l~~~~~~l 216 (235)
T TIGR03302 153 --DAKKRMDYLRNR--------LAGKELYVARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLKL 216 (235)
T ss_pred --HHHHHHHHHHHH--------HHHHHHHHHHHHHHcCChHHHHHHHHHHHHHCCCCcchHHHHHHHHHHHHHc
Confidence 111111111000 023345788889999999999999999999988764 7899999999865
No 35
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=99.30 E-value=7.9e-11 Score=95.27 Aligned_cols=132 Identities=16% Similarity=0.251 Sum_probs=100.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC--
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS-- 78 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~-- 78 (150)
.+|.++...|++++|++.++++++.+|+++.++..++.+ ..+| +..+|+ ..+.+.+...|++.....
T Consensus 741 ~l~~~~~~~g~~~~A~~~~~~~l~~~~~~~~~~~~la~~~~~~g--------~~~~A~--~~~~~~~~~~p~~~~~~~~l 810 (899)
T TIGR02917 741 KLHRALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQK--------DYDKAI--KHYRTVVKKAPDNAVVLNNL 810 (899)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCc--------CHHHHH--HHHHHHHHhCCCCHHHHHHH
Confidence 467788888888888888888888888888888888888 7788 556677 777777777887654211
Q ss_pred --------ChHHHHHHH-HHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 79 --------DPSYVQMIK-EIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 79 --------~~~~~~~~~-~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
++.....+. .+...|+. +..+..+|.+++..|++++|+++|+++++.+|.++..+++++.++.+.
T Consensus 811 ~~~~~~~~~~~A~~~~~~~~~~~~~~------~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~~~~~~~~l~~~~~~~ 884 (899)
T TIGR02917 811 AWLYLELKDPRALEYAEKALKLAPNI------PAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALALLAT 884 (899)
T ss_pred HHHHHhcCcHHHHHHHHHHHhhCCCC------cHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCChHHHHHHHHHHHHc
Confidence 111111111 12333433 778889999999999999999999999999999999999999999865
No 36
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=99.29 E-value=8.8e-11 Score=73.59 Aligned_cols=103 Identities=14% Similarity=0.123 Sum_probs=86.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc---HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN---EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~---~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.+|..+...|++++|++.|+++++.+|++ ..++..++.+ ...| ++..|+ ..+...+...|+....
T Consensus 7 ~~~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~--------~~~~A~--~~~~~~~~~~p~~~~~- 75 (119)
T TIGR02795 7 DAALLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQG--------KYADAA--KAFLAVVKKYPKSPKA- 75 (119)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHhhc--------cHHHHH--HHHHHHHHHCCCCCcc-
Confidence 57899999999999999999999999987 5688889999 8888 667777 6666666667754321
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchh
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~ 139 (150)
+.++..+|.++...|++++|++.|.++++..|+++...
T Consensus 76 ------------------------~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~ 113 (119)
T TIGR02795 76 ------------------------PDALLKLGMSLQELGDKEKAKATLQQVIKRYPGSSAAK 113 (119)
T ss_pred ------------------------cHHHHHHHHHHHHhCChHHHHHHHHHHHHHCcCChhHH
Confidence 56789999999999999999999999999999987643
No 37
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=99.29 E-value=5.8e-11 Score=94.78 Aligned_cols=124 Identities=15% Similarity=0.115 Sum_probs=96.0
Q ss_pred hhhHHHHhhCCHHH----HHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKE----SISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~----A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
.+|.++...|++++ |+..|+++++++|+++.++..++.+ ...| .+.+|+ ..+.+++.++|++...
T Consensus 251 ~Lg~~l~~~G~~~eA~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g--------~~~eA~--~~l~~al~l~P~~~~a 320 (656)
T PRK15174 251 SLGLAYYQSGRSREAKLQAAEHWRHALQFNSDNVRIVTLYADALIRTG--------QNEKAI--PLLQQSLATHPDLPYV 320 (656)
T ss_pred HHHHHHHHcCCchhhHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCCCHHH
Confidence 57999999999996 8999999999999999999999999 8889 667777 7888888999987653
Q ss_pred CC--------ChHHHHHHHH----HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhc
Q psy3020 77 LS--------DPSYVQMIKE----IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSN 141 (150)
Q Consensus 77 ~~--------~~~~~~~~~~----l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n 141 (150)
.. ...+...+.. +...|.. +..+..+|.++...|++++|+..|.++++++|++....+.
T Consensus 321 ~~~La~~l~~~G~~~eA~~~l~~al~~~P~~------~~~~~~~a~al~~~G~~deA~~~l~~al~~~P~~~~~~~~ 391 (656)
T PRK15174 321 RAMYARALRQVGQYTAASDEFVQLAREKGVT------SKWNRYAAAALLQAGKTSEAESVFEHYIQARASHLPQSFE 391 (656)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHhCccc------hHHHHHHHHHHHHCCCHHHHHHHHHHHHHhChhhchhhHH
Confidence 21 0111122211 2234433 5556667899999999999999999999999998755444
No 38
>KOG1126|consensus
Probab=99.28 E-value=1.6e-11 Score=94.73 Aligned_cols=132 Identities=12% Similarity=0.150 Sum_probs=106.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
.+|.-+.....++.|..+|++||.+||.+..||+++|.+ ..++ +++.|. -.+.+++.++|.....+--
T Consensus 460 LlGhE~~~~ee~d~a~~~fr~Al~~~~rhYnAwYGlG~vy~Kqe--------k~e~Ae--~~fqkA~~INP~nsvi~~~~ 529 (638)
T KOG1126|consen 460 LLGHESIATEEFDKAMKSFRKALGVDPRHYNAWYGLGTVYLKQE--------KLEFAE--FHFQKAVEINPSNSVILCHI 529 (638)
T ss_pred hcCChhhhhHHHHhHHHHHHhhhcCCchhhHHHHhhhhheeccc--------hhhHHH--HHHHhhhcCCccchhHHhhh
Confidence 467778888999999999999999999999999999999 8888 667777 7899999999987753210
Q ss_pred -------hHHHHHHH----HHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 80 -------PSYVQMIK----EIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 80 -------~~~~~~~~----~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+...+.+. .+--+|.. +-.-+.+|.+++-.+++++|+..+++.-++-|++...++.+|.+|.+
T Consensus 530 g~~~~~~k~~d~AL~~~~~A~~ld~kn------~l~~~~~~~il~~~~~~~eal~~LEeLk~~vP~es~v~~llgki~k~ 603 (638)
T KOG1126|consen 530 GRIQHQLKRKDKALQLYEKAIHLDPKN------PLCKYHRASILFSLGRYVEALQELEELKELVPQESSVFALLGKIYKR 603 (638)
T ss_pred hHHHHHhhhhhHHHHHHHHHHhcCCCC------chhHHHHHHHHHhhcchHHHHHHHHHHHHhCcchHHHHHHHHHHHHH
Confidence 01111111 12223333 77889999999999999999999999999999999999999999987
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
+
T Consensus 604 ~ 604 (638)
T KOG1126|consen 604 L 604 (638)
T ss_pred H
Confidence 5
No 39
>KOG1155|consensus
Probab=99.28 E-value=7.5e-11 Score=88.18 Aligned_cols=121 Identities=10% Similarity=0.145 Sum_probs=83.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
.+|.-|.++++-..|++.|++|++++|.+..+|.+||+. .-|+.. .+ ++ =-+.++....|.+......
T Consensus 369 LmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYeim~Mh------~Y--aL--yYfqkA~~~kPnDsRlw~aL 438 (559)
T KOG1155|consen 369 LMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEIMKMH------FY--AL--YYFQKALELKPNDSRLWVAL 438 (559)
T ss_pred HhhHHHHHhcccHHHHHHHHHHHhcCchhHHHHhhhhHHHHHhcch------HH--HH--HHHHHHHhcCCCchHHHHHH
Confidence 579999999999999999999999999999999999999 777742 11 22 3566777778877642100
Q ss_pred hHHHHHHHHHhcCcchhhh-----HHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 80 PSYVQMIKEIQKDPSLMTL-----EAKISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 80 ~~~~~~~~~l~~~~~~~~~-----~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
+.-+..+..++........ +.+..++..+|+.+-+.+++++|..+|.+.++.-
T Consensus 439 G~CY~kl~~~~eAiKCykrai~~~dte~~~l~~LakLye~l~d~~eAa~~yek~v~~~ 496 (559)
T KOG1155|consen 439 GECYEKLNRLEEAIKCYKRAILLGDTEGSALVRLAKLYEELKDLNEAAQYYEKYVEVS 496 (559)
T ss_pred HHHHHHhccHHHHHHHHHHHHhccccchHHHHHHHHHHHHHHhHHHHHHHHHHHHHHH
Confidence 0001111111000000000 1225789999999999999999999999999854
No 40
>KOG0550|consensus
Probab=99.25 E-value=1.9e-11 Score=90.12 Aligned_cols=121 Identities=28% Similarity=0.425 Sum_probs=100.0
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
-+.|+.-.+++++|.+.--..+++|+.|.++..-.+.+ +... ....+. ..+.+.++++|++...-.
T Consensus 175 ka~cl~~~~~~~~a~~ea~~ilkld~~n~~al~vrg~~~yy~~--------~~~ka~--~hf~qal~ldpdh~~sk~--- 241 (486)
T KOG0550|consen 175 KAECLAFLGDYDEAQSEAIDILKLDATNAEALYVRGLCLYYND--------NADKAI--NHFQQALRLDPDHQKSKS--- 241 (486)
T ss_pred hhhhhhhcccchhHHHHHHHHHhcccchhHHHHhccccccccc--------chHHHH--HHHhhhhccChhhhhHHh---
Confidence 36789999999999999999999999999999999998 7766 556666 889999999998765321
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC----cchhhcHHHHHHhc
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN----HILFSNRSAAFAKA 149 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~----~~~~~n~a~~~~~~ 149 (150)
....+ .+...|...|+..++.|+|.+|.++|+++|.++|++ +.+|.|||.+.+.+
T Consensus 242 -------~~~~~------k~le~~k~~gN~~fk~G~y~~A~E~Yteal~idP~n~~~naklY~nra~v~~rL 300 (486)
T KOG0550|consen 242 -------ASMMP------KKLEVKKERGNDAFKNGNYRKAYECYTEALNIDPSNKKTNAKLYGNRALVNIRL 300 (486)
T ss_pred -------HhhhH------HHHHHHHhhhhhHhhccchhHHHHHHHHhhcCCccccchhHHHHHHhHhhhccc
Confidence 11111 337889999999999999999999999999999986 57789999888765
No 41
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=99.24 E-value=1.4e-10 Score=93.82 Aligned_cols=132 Identities=18% Similarity=0.152 Sum_probs=93.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC---
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL--- 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~--- 77 (150)
.+|.++...|++++|+..|+++++.+|+++.++..++.+ ...| ++.++. ..+...+..+|.+....
T Consensus 130 ~~~~~~~~~~~~~~A~~~~~~a~~~~~~~~~~~~~la~~~~~~~--------~~~~A~--~~~~~~~~~~~~~~~~~~~~ 199 (899)
T TIGR02917 130 LRGLAYLGLGQLELAQKSYEQALAIDPRSLYAKLGLAQLALAEN--------RFDEAR--ALIDEVLTADPGNVDALLLK 199 (899)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhcCCCChhhHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCCChHHHHHH
Confidence 468889999999999999999999999999999999988 8888 566666 66666667777654321
Q ss_pred -----CChHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 78 -----SDPSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 78 -----~~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
....+...+ ..+..+|.. +..+..++.++...|++++|...+.++++..|+++..++.+|.+++.
T Consensus 200 ~~~~~~~g~~~~A~~~~~~a~~~~p~~------~~~~~~~~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~ 273 (899)
T TIGR02917 200 GDLLLSLGNIELALAAYRKAIALRPNN------PAVLLALATILIEAGEFEEAEKHADALLKKAPNSPLAHYLKALVDFQ 273 (899)
T ss_pred HHHHHhcCCHHHHHHHHHHHHhhCCCC------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCchHHHHHHHHHHH
Confidence 000111111 113334444 66777778888888888888888888888888887777777777664
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
+
T Consensus 274 ~ 274 (899)
T TIGR02917 274 K 274 (899)
T ss_pred h
Confidence 3
No 42
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=99.23 E-value=1.9e-10 Score=93.36 Aligned_cols=107 Identities=11% Similarity=0.168 Sum_probs=78.0
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
+|.++...|++++|+..|+++|+++|+++.++..++.+ ...| .+.+|+ ..+.+.+...|+
T Consensus 55 lA~~~~~~g~~~~A~~~~~~al~~~P~~~~a~~~la~~l~~~g--------~~~eA~--~~l~~~l~~~P~--------- 115 (765)
T PRK10049 55 VAVAYRNLKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAG--------QYDEAL--VKAKQLVSGAPD--------- 115 (765)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCC---------
Confidence 45566666666666666666666666666666666666 5555 444444 444444444443
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
. +. +..+|.++...|++++|+..|+++++++|+++.++..+|.++..
T Consensus 116 -------------~------~~-~~~la~~l~~~g~~~~Al~~l~~al~~~P~~~~~~~~la~~l~~ 162 (765)
T PRK10049 116 -------------K------AN-LLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYPTEYVQALRN 162 (765)
T ss_pred -------------C------HH-HHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Confidence 2 66 88999999999999999999999999999999999999988754
No 43
>KOG2002|consensus
Probab=99.23 E-value=3.2e-10 Score=90.82 Aligned_cols=138 Identities=18% Similarity=0.262 Sum_probs=106.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
.+|.|+.++++.+.|+..|++||++||+++.+..+|+.+ ..-.+.+ .+..++ ..+..+...++.++.+++-
T Consensus 204 gig~Cf~kl~~~~~a~~a~~ralqLdp~~v~alv~L~~~~l~~~d~~-----s~~~~~--~ll~~ay~~n~~nP~~l~~L 276 (1018)
T KOG2002|consen 204 GIGHCFWKLGMSEKALLAFERALQLDPTCVSALVALGEVDLNFNDSD-----SYKKGV--QLLQRAYKENNENPVALNHL 276 (1018)
T ss_pred hhhhHHHhccchhhHHHHHHHHHhcChhhHHHHHHHHHHHHHccchH-----HHHHHH--HHHHHHHhhcCCCcHHHHHH
Confidence 579999999999999999999999999999999999998 5555333 677777 7788888888888864211
Q ss_pred -------hHH---HHHHHH-HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC-cchhhcHHHHHH
Q psy3020 80 -------PSY---VQMIKE-IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN-HILFSNRSAAFA 147 (150)
Q Consensus 80 -------~~~---~~~~~~-l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~-~~~~~n~a~~~~ 147 (150)
+.+ ...... +.... .....+..++.+|..+..+|+|++|..+|-+++..+|++ ...++-+|..|+
T Consensus 277 An~fyfK~dy~~v~~la~~ai~~t~---~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~GlgQm~i 353 (1018)
T KOG2002|consen 277 ANHFYFKKDYERVWHLAEHAIKNTE---NKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYI 353 (1018)
T ss_pred HHHHhhcccHHHHHHHHHHHHHhhh---hhHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHccCCCCccccccchhHHHH
Confidence 111 111111 11110 001337889999999999999999999999999999999 788888999888
Q ss_pred hc
Q psy3020 148 KA 149 (150)
Q Consensus 148 ~~ 149 (150)
..
T Consensus 354 ~~ 355 (1018)
T KOG2002|consen 354 KR 355 (1018)
T ss_pred Hh
Confidence 65
No 44
>KOG3060|consensus
Probab=99.23 E-value=2.3e-10 Score=79.82 Aligned_cols=109 Identities=17% Similarity=0.200 Sum_probs=87.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
-|+.+...|+|++|++.|+..|+-||+|.-.+.....+ ..+| +-..++ ..+...+...+.+
T Consensus 92 kam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAilka~G--------K~l~aI--k~ln~YL~~F~~D-------- 153 (289)
T KOG3060|consen 92 KAMLLEATGNYKEAIEYYESLLEDDPTDTVIRKRKLAILKAQG--------KNLEAI--KELNEYLDKFMND-------- 153 (289)
T ss_pred HHHHHHHhhchhhHHHHHHHHhccCcchhHHHHHHHHHHHHcC--------CcHHHH--HHHHHHHHHhcCc--------
Confidence 47778888999999999999999999988888766666 7777 334455 4554444444322
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.++|..++.+|+..|+|++|.-||++.+-+.|.++..+..+|.+++-+
T Consensus 154 --------------------~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~ 201 (289)
T KOG3060|consen 154 --------------------QEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQ 201 (289)
T ss_pred --------------------HHHHHHHHHHHHhHhHHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHH
Confidence 789999999999999999999999999999999999998888887643
No 45
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=99.21 E-value=4.7e-10 Score=66.22 Aligned_cols=95 Identities=26% Similarity=0.418 Sum_probs=80.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.+|.++...|++++|+..++++++..|++..++..++.+ ...+ .+..++ ..+...+...|..
T Consensus 5 ~~a~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~a~--~~~~~~~~~~~~~------- 67 (100)
T cd00189 5 NLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLG--------KYEEAL--EDYEKALELDPDN------- 67 (100)
T ss_pred HHHHHHHHHhcHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHH--------HHHHHH--HHHHHHHhCCCcc-------
Confidence 578999999999999999999999999999999999999 7777 555666 5555555555532
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
...+..+|.++...|++++|...+.+++..+|+
T Consensus 68 ---------------------~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 100 (100)
T cd00189 68 ---------------------AKAYYNLGLAYYKLGKYEEALEAYEKALELDPN 100 (100)
T ss_pred ---------------------hhHHHHHHHHHHHHHhHHHHHHHHHHHHccCCC
Confidence 456888999999999999999999999999884
No 46
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=99.21 E-value=1.9e-10 Score=75.35 Aligned_cols=91 Identities=13% Similarity=0.009 Sum_probs=73.8
Q ss_pred HhhccC-CCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhH
Q psy3020 22 EGLKLD-PNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLE 99 (150)
Q Consensus 22 ~aL~~~-p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~ 99 (150)
....++ |+.-+....++.. ...| ++.+|. .-+......+|.+
T Consensus 25 ~l~~~~~~~~l~~lY~~A~~ly~~G--------~l~~A~--~~f~~L~~~Dp~~-------------------------- 68 (157)
T PRK15363 25 MLLDDDVTQPLNTLYRYAMQLMEVK--------EFAGAA--RLFQLLTIYDAWS-------------------------- 68 (157)
T ss_pred HHHCCChHHHHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCccc--------------------------
Confidence 345677 7777888888887 8888 556665 4444444555532
Q ss_pred HHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 100 AKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 100 ~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
+..|.++|.++..+|+|++|+.+|.+|+.++|++|..++|.|.|++.++
T Consensus 69 --~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~ddp~~~~~ag~c~L~lG 117 (157)
T PRK15363 69 --FDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAECYLACD 117 (157)
T ss_pred --HHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCchHHHHHHHHHHHcC
Confidence 8899999999999999999999999999999999999999999998753
No 47
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=99.20 E-value=3.8e-10 Score=79.21 Aligned_cols=103 Identities=17% Similarity=0.099 Sum_probs=86.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.+|.++...|++++|+..|+++++.+|+++ .++..+|.+ ...| ++.+|+ ..+.+.++..|++...
T Consensus 38 ~~g~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~a~~~la~~~~~~~--------~~~~A~--~~~~~~l~~~p~~~~~- 106 (235)
T TIGR03302 38 EEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSG--------DYAEAI--AAADRFIRLHPNHPDA- 106 (235)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHhCCCchhHHHHHHHHHHHHHhcC--------CHHHHH--HHHHHHHHHCcCCCch-
Confidence 578999999999999999999999999876 577889999 8888 667788 7888888888865441
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHh--------cCHHHHHHHHHHHHhhcCCCcchh
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQA--------NNFKEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~--------~~~~~A~~~~~~al~~~p~~~~~~ 139 (150)
..++..+|.+++.. |++++|++.|++++..+|++...+
T Consensus 107 ------------------------~~a~~~~g~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~ 152 (235)
T TIGR03302 107 ------------------------DYAYYLRGLSNYNQIDRVDRDQTAAREAFEAFQELIRRYPNSEYAP 152 (235)
T ss_pred ------------------------HHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHCCCChhHH
Confidence 34677788888765 899999999999999999987553
No 48
>KOG0547|consensus
Probab=99.20 E-value=1.9e-10 Score=86.63 Aligned_cols=129 Identities=12% Similarity=0.172 Sum_probs=96.0
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh-
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP- 80 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~- 80 (150)
+|..|....+-++-..+|++|.++||+|++++..+|++ ..++ ++++|. .++.++++++|++.-.+-+.
T Consensus 366 ~a~~y~d~~~~~~~~~~F~~A~~ldp~n~dvYyHRgQm~flL~--------q~e~A~--aDF~Kai~L~pe~~~~~iQl~ 435 (606)
T KOG0547|consen 366 RAAAYADENQSEKMWKDFNKAEDLDPENPDVYYHRGQMRFLLQ--------QYEEAI--ADFQKAISLDPENAYAYIQLC 435 (606)
T ss_pred HHHHHhhhhccHHHHHHHHHHHhcCCCCCchhHhHHHHHHHHH--------HHHHHH--HHHHHHhhcChhhhHHHHHHH
Confidence 57788888888899999999999999999999999998 8888 777888 88999999999876422110
Q ss_pred -------HHH---HHH-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC------CcchhhcHH
Q psy3020 81 -------SYV---QMI-KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT------NHILFSNRS 143 (150)
Q Consensus 81 -------~~~---~~~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~------~~~~~~n~a 143 (150)
.+. ... +...+=|.. ++.+..-+.++..+++|+.|++.|.+|+++.|. ++..+.+.|
T Consensus 436 ~a~Yr~~k~~~~m~~Fee~kkkFP~~------~Evy~~fAeiLtDqqqFd~A~k~YD~ai~LE~~~~~~~v~~~plV~Ka 509 (606)
T KOG0547|consen 436 CALYRQHKIAESMKTFEEAKKKFPNC------PEVYNLFAEILTDQQQFDKAVKQYDKAIELEPREHLIIVNAAPLVHKA 509 (606)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhCCCC------chHHHHHHHHHhhHHhHHHHHHHHHHHHhhccccccccccchhhhhhh
Confidence 000 000 112222333 788888889999999999999999999999999 667776666
Q ss_pred HHHH
Q psy3020 144 AAFA 147 (150)
Q Consensus 144 ~~~~ 147 (150)
.+.+
T Consensus 510 ~l~~ 513 (606)
T KOG0547|consen 510 LLVL 513 (606)
T ss_pred Hhhh
Confidence 6554
No 49
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=99.18 E-value=4.8e-10 Score=83.95 Aligned_cols=132 Identities=11% Similarity=0.090 Sum_probs=85.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC----
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF---- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~---- 76 (150)
.+|.+|...|++++|+..|+++++.+|.+..++..++.+ ...| ++.+|+ ..+...+...|.....
T Consensus 112 ~La~~~~~~g~~~~A~~~~~~~l~~~~~~~~~~~~la~~~~~~g--------~~~~A~--~~~~~~~~~~~~~~~~~~~~ 181 (389)
T PRK11788 112 ELGQDYLKAGLLDRAEELFLQLVDEGDFAEGALQQLLEIYQQEK--------DWQKAI--DVAERLEKLGGDSLRVEIAH 181 (389)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHcCCcchHHHHHHHHHHHHHhc--------hHHHHH--HHHHHHHHhcCCcchHHHHH
Confidence 467888888888888888888888888888888888888 7777 445555 4444445444443210
Q ss_pred ---------CCChHHHHH---HHH-HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC-cchhhcH
Q psy3020 77 ---------LSDPSYVQM---IKE-IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN-HILFSNR 142 (150)
Q Consensus 77 ---------~~~~~~~~~---~~~-l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~-~~~~~n~ 142 (150)
.....+... +.. +...|.. ..++..+|.++.+.|++++|++.|+++++.+|.+ ...+..+
T Consensus 182 ~~~~la~~~~~~~~~~~A~~~~~~al~~~p~~------~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~p~~~~~~~~~l 255 (389)
T PRK11788 182 FYCELAQQALARGDLDAARALLKKALAADPQC------VRASILLGDLALAQGDYAAAIEALERVEEQDPEYLSEVLPKL 255 (389)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHhHCcCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHChhhHHHHHHHH
Confidence 000011111 111 2223332 5677788888888888888888888888888876 3456667
Q ss_pred HHHHHhc
Q psy3020 143 SAAFAKA 149 (150)
Q Consensus 143 a~~~~~~ 149 (150)
+.+|..+
T Consensus 256 ~~~~~~~ 262 (389)
T PRK11788 256 MECYQAL 262 (389)
T ss_pred HHHHHHc
Confidence 7777654
No 50
>KOG2076|consensus
Probab=99.18 E-value=4.1e-10 Score=89.54 Aligned_cols=110 Identities=22% Similarity=0.269 Sum_probs=93.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
..|..++..|++++|...+.++++++|.++.++..||.+ ...| +..++. ....-+-+++|.+
T Consensus 144 ~eAN~lfarg~~eeA~~i~~EvIkqdp~~~~ay~tL~~IyEqrG--------d~eK~l--~~~llAAHL~p~d------- 206 (895)
T KOG2076|consen 144 GEANNLFARGDLEEAEEILMEVIKQDPRNPIAYYTLGEIYEQRG--------DIEKAL--NFWLLAAHLNPKD------- 206 (895)
T ss_pred HHHHHHHHhCCHHHHHHHHHHHHHhCccchhhHHHHHHHHHHcc--------cHHHHH--HHHHHHHhcCCCC-------
Confidence 456778888999999999999999999999999999999 9999 666666 5555556777754
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..-|..++....++|.+.+|.-||++||+.+|.+....++++..|-++
T Consensus 207 ---------------------~e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~~ers~L~~~~ 254 (895)
T KOG2076|consen 207 ---------------------YELWKRLADLSEQLGNINQARYCYSRAIQANPSNWELIYERSSLYQKT 254 (895)
T ss_pred ---------------------hHHHHHHHHHHHhcccHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHh
Confidence 567888888888999999999999999999999999999998888764
No 51
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=99.17 E-value=2.8e-09 Score=73.35 Aligned_cols=131 Identities=11% Similarity=0.083 Sum_probs=95.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCC--CCCCC--
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDP--RTKPF-- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~--~~~~~-- 76 (150)
.+|.++...|++++|++.|+++++++|++..++..++.+ ...| .+.++. ..+.+.+...+ .....
T Consensus 70 ~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~g--------~~~~A~--~~~~~~~~~~~~~~~~~~~~ 139 (234)
T TIGR02521 70 ALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQG--------KYEQAM--QQFEQAIEDPLYPQPARSLE 139 (234)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcc--------cHHHHH--HHHHHHHhccccccchHHHH
Confidence 478999999999999999999999999999999999999 8888 555665 55555554321 11110
Q ss_pred --------CCCh-HHHHHH-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 77 --------LSDP-SYVQMI-KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 77 --------~~~~-~~~~~~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
..+. .....+ ..+...|.. +..+..+|.+++..|++++|+..+++++++.|+++..+..++.++
T Consensus 140 ~l~~~~~~~g~~~~A~~~~~~~~~~~~~~------~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~ 213 (234)
T TIGR02521 140 NAGLCALKAGDFDKAEKYLTRALQIDPQR------PESLLELAELYYLRGQYKDARAYLERYQQTYNQTAESLWLGIRIA 213 (234)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHhCcCC------hHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Confidence 0000 011111 112233433 678899999999999999999999999999999888888777776
Q ss_pred Hh
Q psy3020 147 AK 148 (150)
Q Consensus 147 ~~ 148 (150)
..
T Consensus 214 ~~ 215 (234)
T TIGR02521 214 RA 215 (234)
T ss_pred HH
Confidence 54
No 52
>KOG1125|consensus
Probab=99.17 E-value=8.8e-11 Score=89.54 Aligned_cols=95 Identities=17% Similarity=0.284 Sum_probs=85.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.||..|.-.|+|++|++||+.||+.+|+|.-.|..||.. .... ..++|+ .-|.+++.+.|.+
T Consensus 435 ~LGVLy~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~--------~s~EAI--sAY~rALqLqP~y------- 497 (579)
T KOG1125|consen 435 GLGVLYNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATLANGN--------RSEEAI--SAYNRALQLQPGY------- 497 (579)
T ss_pred hhHHHHhcchHHHHHHHHHHHHHhcCCchHHHHHHhhHHhcCCc--------ccHHHH--HHHHHHHhcCCCe-------
Confidence 589999999999999999999999999999999999887 4433 557788 8889999988865
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
..+++++|..++.+|.|.+|+++|-.||.+.+.
T Consensus 498 ---------------------VR~RyNlgIS~mNlG~ykEA~~hlL~AL~mq~k 530 (579)
T KOG1125|consen 498 ---------------------VRVRYNLGISCMNLGAYKEAVKHLLEALSMQRK 530 (579)
T ss_pred ---------------------eeeehhhhhhhhhhhhHHHHHHHHHHHHHhhhc
Confidence 667899999999999999999999999999887
No 53
>KOG0547|consensus
Probab=99.16 E-value=3.1e-10 Score=85.46 Aligned_cols=63 Identities=17% Similarity=0.184 Sum_probs=55.4
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
|..++-+++|++|+.+|++++.++|+|+.++..++-. +.++ .+.+++ ..+..+.+..|+.++.
T Consensus 401 gQm~flL~q~e~A~aDF~Kai~L~pe~~~~~iQl~~a~Yr~~--------k~~~~m--~~Fee~kkkFP~~~Ev 464 (606)
T KOG0547|consen 401 GQMRFLLQQYEEAIADFQKAISLDPENAYAYIQLCCALYRQH--------KIAESM--KTFEEAKKKFPNCPEV 464 (606)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhcChhhhHHHHHHHHHHHHHH--------HHHHHH--HHHHHHHHhCCCCchH
Confidence 5667778999999999999999999999999999988 8888 677777 8888888888888774
No 54
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=99.16 E-value=7.1e-10 Score=79.10 Aligned_cols=101 Identities=21% Similarity=0.229 Sum_probs=82.1
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.||.+|+.+|++..|...|.+|+++.|+|++.+.+++.+ ..+.... ...++. .-+.+++..||.+
T Consensus 161 ~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~a~~~-----~ta~a~--~ll~~al~~D~~~------- 226 (287)
T COG4235 161 LLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQ-----MTAKAR--ALLRQALALDPAN------- 226 (287)
T ss_pred HHHHHHHHhcchhHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCc-----ccHHHH--HHHHHHHhcCCcc-------
Confidence 589999999999999999999999999999999999998 5444221 222233 4445555656543
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
..+...+|..+++.|+|.+|+..+...++..|.+..
T Consensus 227 ---------------------iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~~~~ 262 (287)
T COG4235 227 ---------------------IRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDP 262 (287)
T ss_pred ---------------------HHHHHHHHHHHHHcccHHHHHHHHHHHHhcCCCCCc
Confidence 788899999999999999999999999999997754
No 55
>KOG0624|consensus
Probab=99.13 E-value=7.5e-10 Score=80.55 Aligned_cols=138 Identities=18% Similarity=0.184 Sum_probs=103.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC----
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF---- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~---- 76 (150)
-+|..++..|++.+|+.+|-.|++.||+|..+.+..+.+ ..+| .-+.++ +++.+.+.+.|+...+
T Consensus 43 ElGk~lla~~Q~sDALt~yHaAve~dp~~Y~aifrRaT~yLAmG--------ksk~al--~Dl~rVlelKpDF~~ARiQR 112 (504)
T KOG0624|consen 43 ELGKELLARGQLSDALTHYHAAVEGDPNNYQAIFRRATVYLAMG--------KSKAAL--QDLSRVLELKPDFMAARIQR 112 (504)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHcCCchhHHHHHHHHHHHhhhc--------CCccch--hhHHHHHhcCccHHHHHHHh
Confidence 478999999999999999999999999999999999999 9999 445667 8899999999987753
Q ss_pred ----CCChHHHHHHHH----HhcCcchhhh---HHH------hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchh
Q psy3020 77 ----LSDPSYVQMIKE----IQKDPSLMTL---EAK------ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 77 ----~~~~~~~~~~~~----l~~~~~~~~~---~~~------~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~ 139 (150)
+.++.+.++... +..+|+.-.. ..+ -..+.......+..|++..|++..++.|++-|+++.++
T Consensus 113 g~vllK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~ql~s~~~~GD~~~ai~~i~~llEi~~Wda~l~ 192 (504)
T KOG0624|consen 113 GVVLLKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQQLKSASGSGDCQNAIEMITHLLEIQPWDASLR 192 (504)
T ss_pred chhhhhcccHHHHHHHHHHHHhcCCCcchhHHHHHHHHhHHHHHHHHHHHHHHhcCCchhhHHHHHHHHHhcCcchhHHH
Confidence 222223333222 3333322111 111 22333344455678999999999999999999999999
Q ss_pred hcHHHHHHhc
Q psy3020 140 SNRSAAFAKA 149 (150)
Q Consensus 140 ~n~a~~~~~~ 149 (150)
..|+.||+..
T Consensus 193 ~~Rakc~i~~ 202 (504)
T KOG0624|consen 193 QARAKCYIAE 202 (504)
T ss_pred HHHHHHHHhc
Confidence 9999999864
No 56
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=99.12 E-value=1.4e-09 Score=81.41 Aligned_cols=130 Identities=14% Similarity=0.117 Sum_probs=89.2
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC-----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF----- 76 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~----- 76 (150)
.|..+...|++++|+..|+++++.+|+++.++..++.+ ...| ++.+++ ..+...+. .|.....
T Consensus 41 ~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~~la~~~~~~g--------~~~~A~--~~~~~~l~-~~~~~~~~~~~~ 109 (389)
T PRK11788 41 KGLNFLLNEQPDKAIDLFIEMLKVDPETVELHLALGNLFRRRG--------EVDRAI--RIHQNLLS-RPDLTREQRLLA 109 (389)
T ss_pred HHHHHHhcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHHcC--------cHHHHH--HHHHHHhc-CCCCCHHHHHHH
Confidence 57888899999999999999999999999999999999 8888 555665 44444443 2221100
Q ss_pred --------CCChHHHHH---HHH-HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc-----hh
Q psy3020 77 --------LSDPSYVQM---IKE-IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI-----LF 139 (150)
Q Consensus 77 --------~~~~~~~~~---~~~-l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~-----~~ 139 (150)
.....+... +.. +...|.. ..++..++.++.+.|++++|++.+.+++..+|.+.. .+
T Consensus 110 ~~~La~~~~~~g~~~~A~~~~~~~l~~~~~~------~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~~ 183 (389)
T PRK11788 110 LQELGQDYLKAGLLDRAEELFLQLVDEGDFA------EGALQQLLEIYQQEKDWQKAIDVAERLEKLGGDSLRVEIAHFY 183 (389)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHcCCcch------HHHHHHHHHHHHHhchHHHHHHHHHHHHHhcCCcchHHHHHHH
Confidence 000011111 111 2222322 677888898999999999999999999998887643 45
Q ss_pred hcHHHHHHhc
Q psy3020 140 SNRSAAFAKA 149 (150)
Q Consensus 140 ~n~a~~~~~~ 149 (150)
.++|.++.++
T Consensus 184 ~~la~~~~~~ 193 (389)
T PRK11788 184 CELAQQALAR 193 (389)
T ss_pred HHHHHHHHhC
Confidence 6677777654
No 57
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=99.12 E-value=1.5e-10 Score=66.15 Aligned_cols=49 Identities=29% Similarity=0.367 Sum_probs=46.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
+..|..+|..++..|+|++|+.+|+++++++|+++.+|+++|.||.+++
T Consensus 3 a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~~~ 51 (69)
T PF13414_consen 3 AEAWYNLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGLAYMKLG 51 (69)
T ss_dssp HHHHHHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhC
Confidence 6789999999999999999999999999999999999999999998763
No 58
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=99.10 E-value=1.4e-09 Score=73.12 Aligned_cols=96 Identities=22% Similarity=0.305 Sum_probs=76.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc---HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN---EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~---~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.+|.++...|++++|+.+|++++++.|+. ..++..+|.+ ..+| .+.+++ ..+.+++...|+.
T Consensus 40 ~lg~~~~~~g~~~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~~g--------~~~~A~--~~~~~al~~~p~~---- 105 (172)
T PRK02603 40 RDGMSAQADGEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNG--------EHDKAL--EYYHQALELNPKQ---- 105 (172)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHhCccc----
Confidence 57889999999999999999999988764 4688999999 8889 667777 6777777777754
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcC--------------HHHHHHHHHHHHhhcCCC
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANN--------------FKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~--------------~~~A~~~~~~al~~~p~~ 135 (150)
...+..+|.++...|+ +.+|++++++++.++|++
T Consensus 106 ------------------------~~~~~~lg~~~~~~g~~~~a~~~~~~A~~~~~~A~~~~~~a~~~~p~~ 153 (172)
T PRK02603 106 ------------------------PSALNNIAVIYHKRGEKAEEAGDQDEAEALFDKAAEYWKQAIRLAPNN 153 (172)
T ss_pred ------------------------HHHHHHHHHHHHHcCChHhHhhCHHHHHHHHHHHHHHHHHHHhhCchh
Confidence 3445555666655555 788999999999999987
No 59
>KOG4162|consensus
Probab=99.10 E-value=1e-09 Score=86.20 Aligned_cols=101 Identities=21% Similarity=0.222 Sum_probs=75.2
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
.|.++...|++++|.+.|.-|+.+||+++.....+|.+ ...|+. .+...- ..+..+++.+|.
T Consensus 690 ~G~~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Ala~~lle~G~~------~la~~~--~~L~dalr~dp~--------- 752 (799)
T KOG4162|consen 690 RGLLLEVKGQLEEAKEAFLVALALDPDHVPSMTALAELLLELGSP------RLAEKR--SLLSDALRLDPL--------- 752 (799)
T ss_pred hhHHHHHHHhhHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCc------chHHHH--HHHHHHHhhCCC---------
Confidence 46677777777788888888888888877777777777 666633 122211 133444555553
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchh
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~ 139 (150)
. +++|+.+|.++.+.|+.++|.+||.-|+++++.+|..-
T Consensus 753 -------------n------~eaW~~LG~v~k~~Gd~~~Aaecf~aa~qLe~S~PV~p 791 (799)
T KOG4162|consen 753 -------------N------HEAWYYLGEVFKKLGDSKQAAECFQAALQLEESNPVLP 791 (799)
T ss_pred -------------C------HHHHHHHHHHHHHccchHHHHHHHHHHHhhccCCCccc
Confidence 3 88999999999999999999999999999999998653
No 60
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=99.09 E-value=2.9e-10 Score=64.95 Aligned_cols=61 Identities=23% Similarity=0.338 Sum_probs=54.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDP 71 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~ 71 (150)
.+|.+++..|+|++|+.+|+++++++|+++.++..+|.+ ..+|+ .+.+++ ..+.+++.++|
T Consensus 8 ~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~~~~-------~~~~A~--~~~~~al~l~P 69 (69)
T PF13414_consen 8 NLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGLAYMKLGK-------DYEEAI--EDFEKALKLDP 69 (69)
T ss_dssp HHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHTTT-------HHHHHH--HHHHHHHHHST
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhCc-------cHHHHH--HHHHHHHHcCc
Confidence 479999999999999999999999999999999999999 88882 256777 78888888877
No 61
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=99.09 E-value=2.4e-10 Score=64.52 Aligned_cols=63 Identities=25% Similarity=0.393 Sum_probs=56.0
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCC
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRT 73 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~ 73 (150)
+.+|..++..|++++|+.+|+++++.+|++++++..+|.+ ..+| ++.++. ..+.+.+..+|++
T Consensus 1 ~~~a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~~lg~~~~~~g--------~~~~A~--~~~~~a~~~~P~~ 64 (65)
T PF13432_consen 1 YALARALYQQGDYDEAIAAFEQALKQDPDNPEAWYLLGRILYQQG--------RYDEAL--AYYERALELDPDN 64 (65)
T ss_dssp HHHHHHHHHCTHHHHHHHHHHHHHCCSTTHHHHHHHHHHHHHHTT---------HHHHH--HHHHHHHHHSTT-
T ss_pred ChHHHHHHHcCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHCcCC
Confidence 3689999999999999999999999999999999999999 9999 667787 7788888888875
No 62
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=99.07 E-value=4.6e-09 Score=73.52 Aligned_cols=108 Identities=13% Similarity=0.088 Sum_probs=91.8
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
.|..++..|+|..|+..+.++..++|+|.++|+.+|.+ ...|++ ..|- ..+.+++.+.|..
T Consensus 106 ~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~--------~~Ar--~ay~qAl~L~~~~-------- 167 (257)
T COG5010 106 QGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRF--------DEAR--RAYRQALELAPNE-------- 167 (257)
T ss_pred HHHHHHHhcchHHHHHHHHHHhccCCCChhhhhHHHHHHHHccCh--------hHHH--HHHHHHHHhccCC--------
Confidence 68889999999999999999999999999999999999 888954 4444 5566666666643
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+....++|..++-.|+++.|...+.++...-|.++..-.|++.+...
T Consensus 168 --------------------p~~~nNlgms~~L~gd~~~A~~lll~a~l~~~ad~~v~~NLAl~~~~ 214 (257)
T COG5010 168 --------------------PSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVGL 214 (257)
T ss_pred --------------------chhhhhHHHHHHHcCCHHHHHHHHHHHHhCCCCchHHHHHHHHHHhh
Confidence 66788999999999999999999999999999899999999887654
No 63
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=99.06 E-value=2.8e-09 Score=86.57 Aligned_cols=136 Identities=10% Similarity=0.141 Sum_probs=99.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCC-------------CchhhhhCChHHHHhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRG-------------DPFANLFSDPNIFVQL 67 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~-------------~~~~~a~~~~~~~~~l 67 (150)
++|.||.++|++++|...|+++|++||+|+.+...+|.. ... +.+ +|. .++.+++ .-+.+.+
T Consensus 121 ~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~AY~~ae~-dL~-KA~~m~~KAV~~~i~~kq~~~~~--e~W~k~~ 196 (906)
T PRK14720 121 TLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKKLATSYEEE-DKE-KAITYLKKAIYRFIKKKQYVGIE--EIWSKLV 196 (906)
T ss_pred HHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHHh-hHH-HHHHHHHHHHHHHHhhhcchHHH--HHHHHHH
Confidence 689999999999999999999999999999999999987 544 333 111 1223333 2233334
Q ss_pred ccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHH-hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 68 QLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAK-ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 68 ~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~-~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
..+|++.. .|...++.+.+.... .+ ...+..+-..+...++|++++..+..+|+++|+|..+.+.++.||
T Consensus 197 ~~~~~d~d-----~f~~i~~ki~~~~~~----~~~~~~~~~l~~~y~~~~~~~~~i~iLK~iL~~~~~n~~a~~~l~~~y 267 (906)
T PRK14720 197 HYNSDDFD-----FFLRIERKVLGHREF----TRLVGLLEDLYEPYKALEDWDEVIYILKKILEHDNKNNKAREELIRFY 267 (906)
T ss_pred hcCcccch-----HHHHHHHHHHhhhcc----chhHHHHHHHHHHHhhhhhhhHHHHHHHHHHhcCCcchhhHHHHHHHH
Confidence 55555444 234444444333221 22 666777778888999999999999999999999999999999999
Q ss_pred HhcC
Q psy3020 147 AKAI 150 (150)
Q Consensus 147 ~~~~ 150 (150)
.+.+
T Consensus 268 ~~kY 271 (906)
T PRK14720 268 KEKY 271 (906)
T ss_pred HHHc
Confidence 8653
No 64
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=99.06 E-value=3.9e-09 Score=70.64 Aligned_cols=104 Identities=19% Similarity=0.261 Sum_probs=77.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc---HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN---EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~---~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.+|.++...|++++|+..|++++.+.|+. +.++..+|.+ ...| .+.+|+ ..+.+++.++|.....+
T Consensus 40 ~~g~~~~~~g~~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g--------~~~eA~--~~~~~Al~~~~~~~~~~ 109 (168)
T CHL00033 40 RDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNG--------EHTKAL--EYYFQALERNPFLPQAL 109 (168)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHhCcCcHHHH
Confidence 57899999999999999999999998763 4588999999 8888 667788 77888888777543211
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHH-------HHHHHHHHHHhhcCCCc
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFK-------EAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~-------~A~~~~~~al~~~p~~~ 136 (150)
... ...+..+|..+...|+++ +|+.+|.+++..+|++.
T Consensus 110 -----~~l----------------a~i~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~a~~~~p~~~ 154 (168)
T CHL00033 110 -----NNM----------------AVICHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAIALAPGNY 154 (168)
T ss_pred -----HHH----------------HHHHHHhhHHHHHcccHHHHHHHHHHHHHHHHHHHHhCcccH
Confidence 000 344555555555777766 77777788899999653
No 65
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=99.02 E-value=2.9e-10 Score=81.95 Aligned_cols=135 Identities=13% Similarity=0.201 Sum_probs=44.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhh-cc-CCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC---
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGL-KL-DPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP--- 75 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL-~~-~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~--- 75 (150)
.+|.++...|++++|++.+++.+ .. +|++++.|..++.+ ..+++.+ .+...+.+ .+..++....
T Consensus 13 ~~A~~~~~~~~~~~Al~~L~~~~~~~~~~~~~~~~~~~a~La~~~~~~~-~A~~ay~~---------l~~~~~~~~~~~~ 82 (280)
T PF13429_consen 13 RLARLLYQRGDYEKALEVLKKAAQKIAPPDDPEYWRLLADLAWSLGDYD-EAIEAYEK---------LLASDKANPQDYE 82 (280)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccc-cccccccc---------ccccccccccccc
Confidence 46788899999999999996655 44 48899999989988 8888554 22211211 1111111110
Q ss_pred -----------------------------CC----------CC-hHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHh
Q psy3020 76 -----------------------------FL----------SD-PSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQA 115 (150)
Q Consensus 76 -----------------------------~~----------~~-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~ 115 (150)
++ .+ ......+..+...+.. +..+..|..+|.++.+.
T Consensus 83 ~l~~l~~~~~~~~A~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~l~~~~~~~~~---~~~~~~~~~~a~~~~~~ 159 (280)
T PF13429_consen 83 RLIQLLQDGDPEEALKLAEKAYERDGDPRYLLSALQLYYRLGDYDEAEELLEKLEELPAA---PDSARFWLALAEIYEQL 159 (280)
T ss_dssp -----------------------------------H-HHHTT-HHHHHHHHHHHHH-T------T-HHHHHHHHHHHHHC
T ss_pred ccccccccccccccccccccccccccccchhhHHHHHHHHHhHHHHHHHHHHHHHhccCC---CCCHHHHHHHHHHHHHc
Confidence 00 00 0111111211111100 12277889999999999
Q ss_pred cCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 116 NNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 116 ~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
|++++|+++|+++++++|+++.+...++.+++..
T Consensus 160 G~~~~A~~~~~~al~~~P~~~~~~~~l~~~li~~ 193 (280)
T PF13429_consen 160 GDPDKALRDYRKALELDPDDPDARNALAWLLIDM 193 (280)
T ss_dssp CHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHCTT
T ss_pred CCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHC
Confidence 9999999999999999999999999998887654
No 66
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=99.02 E-value=6.8e-09 Score=78.16 Aligned_cols=110 Identities=18% Similarity=0.166 Sum_probs=97.8
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
.+..++..|.+++|+..++..++..|+|+..+...+++ ...| +..+|. .-+.+++.++|+.
T Consensus 312 ~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~n--------k~~~A~--e~~~kal~l~P~~-------- 373 (484)
T COG4783 312 RALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEAN--------KAKEAI--ERLKKALALDPNS-------- 373 (484)
T ss_pred HHHHHHHhcccchHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC--------ChHHHH--HHHHHHHhcCCCc--------
Confidence 46678889999999999999999999999999999999 8888 666777 7777778888853
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
+..+.++|+.+++.|++.+|+..++..+.-+|+++..|..+|.+|-+++
T Consensus 374 --------------------~~l~~~~a~all~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~~g 422 (484)
T COG4783 374 --------------------PLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELG 422 (484)
T ss_pred --------------------cHHHHHHHHHHHhcCChHHHHHHHHHHhhcCCCCchHHHHHHHHHHHhC
Confidence 5678899999999999999999999999999999999999999997763
No 67
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=99.01 E-value=5.9e-09 Score=83.42 Aligned_cols=104 Identities=5% Similarity=-0.119 Sum_probs=90.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.++.++.+.+++++|+..++++|+.+|+++.++..+|.+ ..+| .+.+|+ .-+.+.+..+|+.
T Consensus 125 ~~a~~L~~~~~~eeA~~~~~~~l~~~p~~~~~~~~~a~~l~~~g--------~~~~A~--~~y~~~~~~~p~~------- 187 (694)
T PRK15179 125 LMLRGVKRQQGIEAGRAEIELYFSGGSSSAREILLEAKSWDEIG--------QSEQAD--ACFERLSRQHPEF------- 187 (694)
T ss_pred HHHHHHHHhccHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHhc--------chHHHH--HHHHHHHhcCCCc-------
Confidence 578999999999999999999999999999999999999 9999 667777 6666666655532
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHH
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRS 143 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a 143 (150)
+.+|..+|..+.+.|+.++|+..|.++++...+-...|.++.
T Consensus 188 ---------------------~~~~~~~a~~l~~~G~~~~A~~~~~~a~~~~~~~~~~~~~~~ 229 (694)
T PRK15179 188 ---------------------ENGYVGWAQSLTRRGALWRARDVLQAGLDAIGDGARKLTRRL 229 (694)
T ss_pred ---------------------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhhCcchHHHHHHH
Confidence 789999999999999999999999999999888766655543
No 68
>KOG1173|consensus
Probab=99.00 E-value=5.3e-09 Score=79.96 Aligned_cols=146 Identities=17% Similarity=0.198 Sum_probs=99.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccC----------------CCC----------Cch
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDM----------------NRG----------DPF 54 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~----------------~~~----------~~~ 54 (150)
.+|..|.-.|++++|..+|-||..+||.+.++|.+.|-. ..+++.++ .|- +-+
T Consensus 317 aVg~YYl~i~k~seARry~SKat~lD~~fgpaWl~fghsfa~e~EhdQAmaaY~tAarl~~G~hlP~LYlgmey~~t~n~ 396 (611)
T KOG1173|consen 317 AVGCYYLMIGKYSEARRYFSKATTLDPTFGPAWLAFGHSFAGEGEHDQAMAAYFTAARLMPGCHLPSLYLGMEYMRTNNL 396 (611)
T ss_pred hHHHHHHHhcCcHHHHHHHHHHhhcCccccHHHHHHhHHhhhcchHHHHHHHHHHHHHhccCCcchHHHHHHHHHHhccH
Confidence 478889999999999999999999999999999999988 77774430 000 011
Q ss_pred hhhhCChHHHHhhccCCCCCCCCCC--------hHHHHHHHHHhcCcchhhh----HHH-hHHHHHHHHHHHHhcCHHHH
Q psy3020 55 ANLFSDPNIFVQLQLDPRTKPFLSD--------PSYVQMIKEIQKDPSLMTL----EAK-ISLLKDKGNAALQANNFKEA 121 (150)
Q Consensus 55 ~~a~~~~~~~~~l~~~~~~~~~~~~--------~~~~~~~~~l~~~~~~~~~----~~~-~~~~~~~g~~~~~~~~~~~A 121 (150)
+-|- .-+..++.+.|.++....+ ..+..+..-+.+....+.. -.+ ...+.++|-++.+.+.+++|
T Consensus 397 kLAe--~Ff~~A~ai~P~Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eA 474 (611)
T KOG1173|consen 397 KLAE--KFFKQALAIAPSDPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEA 474 (611)
T ss_pred HHHH--HHHHHHHhcCCCcchhhhhhhheeehHhhhHHHHHHHHHHHHHhhhccccccchhHHHHhHHHHHHHHhhHHHH
Confidence 1111 3334456677776654322 1122222111111100000 011 34578899999999999999
Q ss_pred HHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 122 IEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 122 ~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
|.+|+++|.+.|.++..+...|.+|.-+
T Consensus 475 I~~~q~aL~l~~k~~~~~asig~iy~ll 502 (611)
T KOG1173|consen 475 IDYYQKALLLSPKDASTHASIGYIYHLL 502 (611)
T ss_pred HHHHHHHHHcCCCchhHHHHHHHHHHHh
Confidence 9999999999999999999999999765
No 69
>PLN02789 farnesyltranstransferase
Probab=99.00 E-value=7.6e-09 Score=76.01 Aligned_cols=132 Identities=14% Similarity=0.104 Sum_probs=98.2
Q ss_pred hhHHHHhhC-CHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-
Q psy3020 3 TYSCLSYLG-RYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD- 79 (150)
Q Consensus 3 lg~~~~~~~-~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~- 79 (150)
-|.++..+| ++++++..++++++.+|++..+|..++.+ ..+|.. ...+.+ .-+.+++..+|++..+...
T Consensus 77 R~~iL~~L~~~l~eeL~~~~~~i~~npknyqaW~~R~~~l~~l~~~------~~~~el--~~~~kal~~dpkNy~AW~~R 148 (320)
T PLN02789 77 RRLCLEALDADLEEELDFAEDVAEDNPKNYQIWHHRRWLAEKLGPD------AANKEL--EFTRKILSLDAKNYHAWSHR 148 (320)
T ss_pred HHHHHHHcchhHHHHHHHHHHHHHHCCcchHHhHHHHHHHHHcCch------hhHHHH--HHHHHHHHhCcccHHHHHHH
Confidence 367888888 68999999999999999999999999988 777731 123445 5666888999987653110
Q ss_pred -------hHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHh---cCH----HHHHHHHHHHHhhcCCCcchhhc
Q psy3020 80 -------PSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQA---NNF----KEAIEAYSEAIKLDGTNHILFSN 141 (150)
Q Consensus 80 -------~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~---~~~----~~A~~~~~~al~~~p~~~~~~~n 141 (150)
..+...+ ..++.++.. ..+|..+|.++... |.+ ++++++..++|.++|+|..+|++
T Consensus 149 ~w~l~~l~~~~eeL~~~~~~I~~d~~N------~sAW~~R~~vl~~~~~l~~~~~~~e~el~y~~~aI~~~P~N~SaW~Y 222 (320)
T PLN02789 149 QWVLRTLGGWEDELEYCHQLLEEDVRN------NSAWNQRYFVITRSPLLGGLEAMRDSELKYTIDAILANPRNESPWRY 222 (320)
T ss_pred HHHHHHhhhHHHHHHHHHHHHHHCCCc------hhHHHHHHHHHHhccccccccccHHHHHHHHHHHHHhCCCCcCHHHH
Confidence 0111111 224555655 89999999988765 333 57888889999999999999999
Q ss_pred HHHHHHh
Q psy3020 142 RSAAFAK 148 (150)
Q Consensus 142 ~a~~~~~ 148 (150)
++.++..
T Consensus 223 l~~ll~~ 229 (320)
T PLN02789 223 LRGLFKD 229 (320)
T ss_pred HHHHHhc
Confidence 9998865
No 70
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=98.98 E-value=2.5e-09 Score=63.47 Aligned_cols=80 Identities=26% Similarity=0.370 Sum_probs=60.3
Q ss_pred hCCHHHHHHHHHHhhccCCC--cHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHH
Q psy3020 10 LGRYKESISTYEEGLKLDPN--NEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMI 86 (150)
Q Consensus 10 ~~~~~~A~~~~~~aL~~~p~--~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 86 (150)
.|+|+.|+..|+++++.+|+ +..++..+|.+ ..+| ++.+|+ .-+.+ ...+|.
T Consensus 2 ~~~y~~Ai~~~~k~~~~~~~~~~~~~~~~la~~~~~~~--------~y~~A~--~~~~~-~~~~~~-------------- 56 (84)
T PF12895_consen 2 QGNYENAIKYYEKLLELDPTNPNSAYLYNLAQCYFQQG--------KYEEAI--ELLQK-LKLDPS-------------- 56 (84)
T ss_dssp TT-HHHHHHHHHHHHHHHCGTHHHHHHHHHHHHHHHTT--------HHHHHH--HHHHC-HTHHHC--------------
T ss_pred CccHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHCC--------CHHHHH--HHHHH-hCCCCC--------------
Confidence 68999999999999999995 56678889999 9999 666666 33322 222221
Q ss_pred HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHH
Q psy3020 87 KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEA 128 (150)
Q Consensus 87 ~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~a 128 (150)
. +.....+|.++++.|+|++|+++|.++
T Consensus 57 --------~------~~~~~l~a~~~~~l~~y~eAi~~l~~~ 84 (84)
T PF12895_consen 57 --------N------PDIHYLLARCLLKLGKYEEAIKALEKA 84 (84)
T ss_dssp --------H------HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred --------C------HHHHHHHHHHHHHhCCHHHHHHHHhcC
Confidence 1 566677799999999999999999875
No 71
>KOG3785|consensus
Probab=98.97 E-value=7.1e-09 Score=75.92 Aligned_cols=126 Identities=14% Similarity=0.163 Sum_probs=91.7
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC---
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF--- 76 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~--- 76 (150)
+|+|.|++++|+|++|+..|.-+...+..+++.+..++-+ ..+|.+ .+|. . .....|+++..
T Consensus 61 lWia~C~fhLgdY~~Al~~Y~~~~~~~~~~~el~vnLAcc~FyLg~Y--------~eA~---~---~~~ka~k~pL~~RL 126 (557)
T KOG3785|consen 61 LWIAHCYFHLGDYEEALNVYTFLMNKDDAPAELGVNLACCKFYLGQY--------IEAK---S---IAEKAPKTPLCIRL 126 (557)
T ss_pred HHHHHHHHhhccHHHHHHHHHHHhccCCCCcccchhHHHHHHHHHHH--------HHHH---H---HHhhCCCChHHHHH
Confidence 4899999999999999999999999888889999999988 888944 3332 1 11122332221
Q ss_pred -------CCC-hHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 77 -------LSD-PSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 77 -------~~~-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+++ ......-..++.. .+.-..++.+.+.+-.|.+||+.|++.+..+|+-..+..++|.||.+
T Consensus 127 lfhlahklndEk~~~~fh~~LqD~---------~EdqLSLAsvhYmR~HYQeAIdvYkrvL~dn~ey~alNVy~ALCyyK 197 (557)
T KOG3785|consen 127 LFHLAHKLNDEKRILTFHSSLQDT---------LEDQLSLASVHYMRMHYQEAIDVYKRVLQDNPEYIALNVYMALCYYK 197 (557)
T ss_pred HHHHHHHhCcHHHHHHHHHHHhhh---------HHHHHhHHHHHHHHHHHHHHHHHHHHHHhcChhhhhhHHHHHHHHHh
Confidence 111 1222222222211 34456677888889999999999999999999999999999999988
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
+
T Consensus 198 l 198 (557)
T KOG3785|consen 198 L 198 (557)
T ss_pred c
Confidence 6
No 72
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=98.97 E-value=1.3e-08 Score=72.89 Aligned_cols=100 Identities=12% Similarity=0.067 Sum_probs=84.3
Q ss_pred hHHH-HhhCCHHHHHHHHHHhhccCCCc---HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC
Q psy3020 4 YSCL-SYLGRYKESISTYEEGLKLDPNN---EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 4 g~~~-~~~~~~~~A~~~~~~aL~~~p~~---~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~ 78 (150)
|..+ +..|+|++|+..|++.++..|++ +.++..+|.+ ...| ++.+|. ..+...+...|++..
T Consensus 149 A~~l~~~~~~y~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g--------~~~~A~--~~f~~vv~~yP~s~~--- 215 (263)
T PRK10803 149 AIALVQDKSRQDDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKG--------KKDDAA--YYFASVVKNYPKSPK--- 215 (263)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHCCCCcc---
Confidence 3444 56799999999999999999998 4799999999 8888 667777 777777777776433
Q ss_pred ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 79 DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 79 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
. +.++..+|.++...|++++|++.|++.++..|++..+
T Consensus 216 ----------------~------~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~s~~a 253 (263)
T PRK10803 216 ----------------A------ADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPGTDGA 253 (263)
T ss_pred ----------------h------hHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHH
Confidence 1 7889999999999999999999999999999987743
No 73
>PRK15331 chaperone protein SicA; Provisional
Probab=98.97 E-value=9.6e-09 Score=67.77 Aligned_cols=107 Identities=12% Similarity=0.074 Sum_probs=88.1
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
..|.-++..|++++|...|.-..-.||.|++.+++||.+ ..++ .+.+|+ .-+..+..+++++
T Consensus 42 ~~Ay~~y~~Gk~~eA~~~F~~L~~~d~~n~~Y~~GLaa~~Q~~k--------~y~~Ai--~~Y~~A~~l~~~d------- 104 (165)
T PRK15331 42 AHAYEFYNQGRLDEAETFFRFLCIYDFYNPDYTMGLAAVCQLKK--------QFQKAC--DLYAVAFTLLKND------- 104 (165)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHHH--------HHHHHH--HHHHHHHHcccCC-------
Confidence 457778999999999999999999999999999999999 9999 667777 6665555555544
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+...+..|.+++..|+.++|..+|..+++ .|.++.+ ..+|..++.
T Consensus 105 ---------------------p~p~f~agqC~l~l~~~~~A~~~f~~a~~-~~~~~~l-~~~A~~~L~ 149 (165)
T PRK15331 105 ---------------------YRPVFFTGQCQLLMRKAAKARQCFELVNE-RTEDESL-RAKALVYLE 149 (165)
T ss_pred ---------------------CCccchHHHHHHHhCCHHHHHHHHHHHHh-CcchHHH-HHHHHHHHH
Confidence 45578889999999999999999999999 6776555 456776654
No 74
>KOG0550|consensus
Probab=98.96 E-value=1.9e-09 Score=79.86 Aligned_cols=117 Identities=20% Similarity=0.335 Sum_probs=77.3
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHH---H---HhhhhccCCCC-CchhhhhCChHHHHhhccCCCCCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKD---V---RNQEMNDMNRG-DPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~---~---~~~g~~~~~~~-~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
|.|+...++.+.|+.+|+++|.++|+...+....-. + ...|.. ... ..+..+. ..+..++.++|++...
T Consensus 210 g~~~yy~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le~~k~~gN~--~fk~G~y~~A~--E~Yteal~idP~n~~~ 285 (486)
T KOG0550|consen 210 GLCLYYNDNADKAINHFQQALRLDPDHQKSKSASMMPKKLEVKKERGND--AFKNGNYRKAY--ECYTEALNIDPSNKKT 285 (486)
T ss_pred ccccccccchHHHHHHHhhhhccChhhhhHHhHhhhHHHHHHHHhhhhh--HhhccchhHHH--HHHHHhhcCCccccch
Confidence 778999999999999999999999999886543322 2 111110 111 1456666 7788889999986552
Q ss_pred CCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 77 LSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 77 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
. +..|.+++.+..+.|+..+|+..+++|+.+||....++..||.||+-
T Consensus 286 n------------------------aklY~nra~v~~rLgrl~eaisdc~~Al~iD~syikall~ra~c~l~ 333 (486)
T KOG0550|consen 286 N------------------------AKLYGNRALVNIRLGRLREAISDCNEALKIDSSYIKALLRRANCHLA 333 (486)
T ss_pred h------------------------HHHHHHhHhhhcccCCchhhhhhhhhhhhcCHHHHHHHHHHHHHHHH
Confidence 1 44555566666666666666666666666666666666666666653
No 75
>KOG0543|consensus
Probab=98.94 E-value=1.1e-08 Score=75.62 Aligned_cols=106 Identities=18% Similarity=0.241 Sum_probs=88.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCC----CcH-----------HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDP----NNE-----------QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQ 66 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p----~~~-----------~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~ 66 (150)
-|+.|++.|+|..|+..|++|++.=+ .+. .+|++++.| ..++ .+..|+ ..+.++
T Consensus 214 ~Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl~--------~~~~Ai--~~c~kv 283 (397)
T KOG0543|consen 214 RGNVLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKLK--------EYKEAI--ESCNKV 283 (397)
T ss_pred hhhHHHhhchHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHhhHHHHHHHhhh--------hHHHHH--HHHHHH
Confidence 48899999999999999999887433 111 146788888 7888 667888 888888
Q ss_pred hccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 67 LQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 67 l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
|.++|++ +.+++.+|.++...|+|+.|...|.++++++|+|..+..-+..|.
T Consensus 284 Le~~~~N----------------------------~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el~~l~ 335 (397)
T KOG0543|consen 284 LELDPNN----------------------------VKALYRRGQALLALGEYDLARDDFQKALKLEPSNKAARAELIKLK 335 (397)
T ss_pred HhcCCCc----------------------------hhHHHHHHHHHHhhccHHHHHHHHHHHHHhCCCcHHHHHHHHHHH
Confidence 8888865 788999999999999999999999999999999987776666554
No 76
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=98.94 E-value=1.1e-08 Score=83.20 Aligned_cols=101 Identities=12% Similarity=0.067 Sum_probs=90.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.+|.++...|++++|++.+++++...|+++.++..++.+ ...| ...+|+ ..+.+++.++|++
T Consensus 364 ~~a~~l~~~g~~~eA~~~l~~al~~~P~n~~l~~~lA~l~~~~g--------~~~~A~--~~l~~al~l~Pd~------- 426 (765)
T PRK10049 364 LLSQVAKYSNDLPQAEMRARELAYNAPGNQGLRIDYASVLQARG--------WPRAAE--NELKKAEVLEPRN------- 426 (765)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcC--------CHHHHH--HHHHHHHhhCCCC-------
Confidence 467899999999999999999999999999999999999 8889 677888 8888888888865
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
...+..+|..+...|++++|...++++++.+|+++.+..
T Consensus 427 ---------------------~~l~~~~a~~al~~~~~~~A~~~~~~ll~~~Pd~~~~~~ 465 (765)
T PRK10049 427 ---------------------INLEVEQAWTALDLQEWRQMDVLTDDVVAREPQDPGVQR 465 (765)
T ss_pred ---------------------hHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHH
Confidence 556788888999999999999999999999999987654
No 77
>KOG4648|consensus
Probab=98.92 E-value=5e-09 Score=76.41 Aligned_cols=100 Identities=19% Similarity=0.169 Sum_probs=81.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
-|.-|++.|+|++|++||.+++.++|-|+-.+...+.. ..+. .+..|. .++..++.++..+
T Consensus 103 ~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K--------~FA~AE--~DC~~AiaLd~~Y-------- 164 (536)
T KOG4648|consen 103 RGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQK--------SFAQAE--EDCEAAIALDKLY-------- 164 (536)
T ss_pred hhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHHH--------HHHHHH--HhHHHHHHhhHHH--------
Confidence 37889999999999999999999999999888777766 5666 444455 6776666655421
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
..+|..+|.+....|...+|.+.|+.+|++.|++..+.-
T Consensus 165 --------------------~KAYSRR~~AR~~Lg~~~EAKkD~E~vL~LEP~~~ELkK 203 (536)
T KOG4648|consen 165 --------------------VKAYSRRMQARESLGNNMEAKKDCETVLALEPKNIELKK 203 (536)
T ss_pred --------------------HHHHHHHHHHHHHHhhHHHHHHhHHHHHhhCcccHHHHH
Confidence 677888899999999999999999999999999765543
No 78
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=98.92 E-value=2.5e-08 Score=69.88 Aligned_cols=108 Identities=18% Similarity=0.179 Sum_probs=81.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
++..+.-.|+-+.++....+++.-+|.+.+++...+.. ...| .+..|. ..+.++..++|++
T Consensus 72 ~a~a~~~~G~a~~~l~~~~~~~~~~~~d~~ll~~~gk~~~~~g--------~~~~A~--~~~rkA~~l~p~d-------- 133 (257)
T COG5010 72 LATALYLRGDADSSLAVLQKSAIAYPKDRELLAAQGKNQIRNG--------NFGEAV--SVLRKAARLAPTD-------- 133 (257)
T ss_pred HHHHHHhcccccchHHHHhhhhccCcccHHHHHHHHHHHHHhc--------chHHHH--HHHHHHhccCCCC--------
Confidence 44555566666666666666666666666666555555 5555 445555 5666666666643
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+..|..+|.++.+.|++++|-..|.+++++.|+++....|+|..|+-
T Consensus 134 --------------------~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~p~~~nNlgms~~L 180 (257)
T COG5010 134 --------------------WEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLL 180 (257)
T ss_pred --------------------hhhhhHHHHHHHHccChhHHHHHHHHHHHhccCCchhhhhHHHHHHH
Confidence 88899999999999999999999999999999999999999998864
No 79
>KOG1129|consensus
Probab=98.91 E-value=9.8e-09 Score=74.44 Aligned_cols=137 Identities=15% Similarity=0.178 Sum_probs=97.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC----
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF---- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~---- 76 (150)
.|..+|...+++..|+..|.+.|+.-|.++..+.+++.+ ..|+ .+..+. .-+..+++.+|.+.+.
T Consensus 261 lLskvY~ridQP~~AL~~~~~gld~fP~~VT~l~g~ARi~eam~--------~~~~a~--~lYk~vlk~~~~nvEaiAci 330 (478)
T KOG1129|consen 261 LLSKVYQRIDQPERALLVIGEGLDSFPFDVTYLLGQARIHEAME--------QQEDAL--QLYKLVLKLHPINVEAIACI 330 (478)
T ss_pred HHHHHHHHhccHHHHHHHHhhhhhcCCchhhhhhhhHHHHHHHH--------hHHHHH--HHHHHHHhcCCccceeeeee
Confidence 356677777777888888888888888888888888888 8888 556666 6677777888876653
Q ss_pred -----C-CChH----HHHHHHHHhc-Ccchhhh----------------------------HHHhHHHHHHHHHHHHhcC
Q psy3020 77 -----L-SDPS----YVQMIKEIQK-DPSLMTL----------------------------EAKISLLKDKGNAALQANN 117 (150)
Q Consensus 77 -----~-~~~~----~~~~~~~l~~-~~~~~~~----------------------------~~~~~~~~~~g~~~~~~~~ 117 (150)
+ ++|. |++.|-.+.. .|+-..- ...++.|+++|.+....|+
T Consensus 331 a~~yfY~~~PE~AlryYRRiLqmG~~speLf~NigLCC~yaqQ~D~~L~sf~RAlstat~~~~aaDvWYNlg~vaV~iGD 410 (478)
T KOG1129|consen 331 AVGYFYDNNPEMALRYYRRILQMGAQSPELFCNIGLCCLYAQQIDLVLPSFQRALSTATQPGQAADVWYNLGFVAVTIGD 410 (478)
T ss_pred eeccccCCChHHHHHHHHHHHHhcCCChHHHhhHHHHHHhhcchhhhHHHHHHHHhhccCcchhhhhhhccceeEEeccc
Confidence 1 1121 2222211111 1110000 1338999999999999999
Q ss_pred HHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 118 FKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 118 ~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+.-|.++|.-|+.-||++..++.|+|..-.+
T Consensus 411 ~nlA~rcfrlaL~~d~~h~ealnNLavL~~r 441 (478)
T KOG1129|consen 411 FNLAKRCFRLALTSDAQHGEALNNLAVLAAR 441 (478)
T ss_pred hHHHHHHHHHHhccCcchHHHHHhHHHHHhh
Confidence 9999999999999999999999999987554
No 80
>PRK14574 hmsH outer membrane protein; Provisional
Probab=98.90 E-value=2.8e-08 Score=80.93 Aligned_cols=108 Identities=13% Similarity=0.036 Sum_probs=72.7
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
-+...+..|+++.|++.|+++++.+|+++.+...++.+ ...| ...+|+ ..+.+++ +|+...
T Consensus 40 ~aii~~r~Gd~~~Al~~L~qaL~~~P~~~~av~dll~l~~~~G--------~~~~A~--~~~eka~--~p~n~~------ 101 (822)
T PRK14574 40 SLIIRARAGDTAPVLDYLQEESKAGPLQSGQVDDWLQIAGWAG--------RDQEVI--DVYERYQ--SSMNIS------ 101 (822)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHhhCccchhhHHHHHHHHHHcC--------CcHHHH--HHHHHhc--cCCCCC------
Confidence 46778899999999999999999999996544477776 6666 556666 6666666 443322
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
...+..+|.++..+|+|++|++.|+++++.+|+++.++.-++.+|.+
T Consensus 102 --------------------~~~llalA~ly~~~gdyd~Aiely~kaL~~dP~n~~~l~gLa~~y~~ 148 (822)
T PRK14574 102 --------------------SRGLASAARAYRNEKRWDQALALWQSSLKKDPTNPDLISGMIMTQAD 148 (822)
T ss_pred --------------------HHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHhh
Confidence 33344445566666666666666666666666666666555554443
No 81
>KOG2002|consensus
Probab=98.90 E-value=5.1e-09 Score=84.15 Aligned_cols=134 Identities=16% Similarity=0.091 Sum_probs=98.2
Q ss_pred hhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC-----CCChHH
Q psy3020 9 YLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF-----LSDPSY 82 (150)
Q Consensus 9 ~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~-----~~~~~~ 82 (150)
..+.+++|++.|.++|+.||.|..+-+++|.| ...|++. .....| ..+.++..-.++...- ..+..|
T Consensus 624 ~kk~~~KAlq~y~kvL~~dpkN~yAANGIgiVLA~kg~~~-----~A~dIF--sqVrEa~~~~~dv~lNlah~~~e~~qy 696 (1018)
T KOG2002|consen 624 EKKHQEKALQLYGKVLRNDPKNMYAANGIGIVLAEKGRFS-----EARDIF--SQVREATSDFEDVWLNLAHCYVEQGQY 696 (1018)
T ss_pred HHHHHHHHHHHHHHHHhcCcchhhhccchhhhhhhccCch-----HHHHHH--HHHHHHHhhCCceeeeHHHHHHHHHHH
Confidence 44678899999999999999999999999999 8888654 444455 5666665544443321 112345
Q ss_pred HHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 83 VQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..+++..+.-....-+.+.+..+..+|.++++.|.+.+|.+....|+.+.|.++.+.+|+|.++.++
T Consensus 697 ~~AIqmYe~~lkkf~~~~~~~vl~~Lara~y~~~~~~eak~~ll~a~~~~p~~~~v~FN~a~v~kkl 763 (1018)
T KOG2002|consen 697 RLAIQMYENCLKKFYKKNRSEVLHYLARAWYEAGKLQEAKEALLKARHLAPSNTSVKFNLALVLKKL 763 (1018)
T ss_pred HHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCccchHHhHHHHHHHHH
Confidence 5555443322111111123788899999999999999999999999999999999999999998764
No 82
>PRK14574 hmsH outer membrane protein; Provisional
Probab=98.89 E-value=3.4e-08 Score=80.48 Aligned_cols=106 Identities=15% Similarity=0.188 Sum_probs=69.7
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
|.++...|+|++|++.|+++++.+|+|+.++.+++.+ ...+ +..+|+ ..+.+....+|+.
T Consensus 109 A~ly~~~gdyd~Aiely~kaL~~dP~n~~~l~gLa~~y~~~~--------q~~eAl--~~l~~l~~~dp~~--------- 169 (822)
T PRK14574 109 ARAYRNEKRWDQALALWQSSLKKDPTNPDLISGMIMTQADAG--------RGGVVL--KQATELAERDPTV--------- 169 (822)
T ss_pred HHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHhhcC--------CHHHHH--HHHHHhcccCcch---------
Confidence 5566666777777777777777777777666666555 4444 445555 4444444545431
Q ss_pred HHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 83 VQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
..+..++.++...+++.+|++.|+++++.+|++...+..+..++..
T Consensus 170 --------------------~~~l~layL~~~~~~~~~AL~~~ekll~~~P~n~e~~~~~~~~l~~ 215 (822)
T PRK14574 170 --------------------QNYMTLSYLNRATDRNYDALQASSEAVRLAPTSEEVLKNHLEILQR 215 (822)
T ss_pred --------------------HHHHHHHHHHHhcchHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Confidence 1123334444446777779999999999999999998888776654
No 83
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=98.87 E-value=5.5e-08 Score=72.08 Aligned_cols=99 Identities=14% Similarity=0.104 Sum_probs=81.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.+|.++...|++++|+..++++++++|+++.++..++.+ ...| .+.++. ..+.+.+...|.....
T Consensus 119 ~~a~~~~~~G~~~~A~~~~~~al~~~p~~~~~~~~la~i~~~~g--------~~~eA~--~~l~~~l~~~~~~~~~---- 184 (355)
T cd05804 119 MLAFGLEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQG--------RFKEGI--AFMESWRDTWDCSSML---- 184 (355)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcC--------CHHHHH--HHHHhhhhccCCCcch----
Confidence 367899999999999999999999999999999999999 8889 556666 6666666666542111
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
....|..++.++...|++++|+..|.+++...|.
T Consensus 185 --------------------~~~~~~~la~~~~~~G~~~~A~~~~~~~~~~~~~ 218 (355)
T cd05804 185 --------------------RGHNWWHLALFYLERGDYEAALAIYDTHIAPSAE 218 (355)
T ss_pred --------------------hHHHHHHHHHHHHHCCCHHHHHHHHHHHhccccC
Confidence 0345778899999999999999999999888773
No 84
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=98.87 E-value=7.1e-09 Score=58.87 Aligned_cols=56 Identities=23% Similarity=0.334 Sum_probs=40.7
Q ss_pred HHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCC
Q psy3020 7 LSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPR 72 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~ 72 (150)
++..|+|++|++.|+++++.+|+|.+++..++.+ ...| ++.+|. ..+.+.+..+|+
T Consensus 1 ll~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~g--------~~~~A~--~~l~~~~~~~~~ 57 (68)
T PF14559_consen 1 LLKQGDYDEAIELLEKALQRNPDNPEARLLLAQCYLKQG--------QYDEAE--ELLERLLKQDPD 57 (68)
T ss_dssp HHHTTHHHHHHHHHHHHHHHTTTSHHHHHHHHHHHHHTT---------HHHHH--HHHHCCHGGGTT
T ss_pred ChhccCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHCcC
Confidence 3578899999999999999999998888888887 6666 444555 444444555554
No 85
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=98.85 E-value=2.2e-08 Score=56.38 Aligned_cols=63 Identities=22% Similarity=0.289 Sum_probs=49.3
Q ss_pred HHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHH
Q psy3020 36 AIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQ 114 (150)
Q Consensus 36 ~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~ 114 (150)
.+|.. ...| ++.+|. ..+...+..+|++ +.+|..+|.+++.
T Consensus 2 ~~a~~~~~~g--------~~~~A~--~~~~~~l~~~P~~----------------------------~~a~~~lg~~~~~ 43 (65)
T PF13432_consen 2 ALARALYQQG--------DYDEAI--AAFEQALKQDPDN----------------------------PEAWYLLGRILYQ 43 (65)
T ss_dssp HHHHHHHHCT--------HHHHHH--HHHHHHHCCSTTH----------------------------HHHHHHHHHHHHH
T ss_pred hHHHHHHHcC--------CHHHHH--HHHHHHHHHCCCC----------------------------HHHHHHHHHHHHH
Confidence 34555 5666 666777 6777677766642 8899999999999
Q ss_pred hcCHHHHHHHHHHHHhhcCCCc
Q psy3020 115 ANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 115 ~~~~~~A~~~~~~al~~~p~~~ 136 (150)
+|++++|+..|+++++++|++|
T Consensus 44 ~g~~~~A~~~~~~a~~~~P~~p 65 (65)
T PF13432_consen 44 QGRYDEALAYYERALELDPDNP 65 (65)
T ss_dssp TT-HHHHHHHHHHHHHHSTT-H
T ss_pred cCCHHHHHHHHHHHHHHCcCCC
Confidence 9999999999999999999985
No 86
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=98.84 E-value=8.2e-08 Score=74.79 Aligned_cols=127 Identities=14% Similarity=0.014 Sum_probs=77.9
Q ss_pred hHHHHhhCC---HHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhc--cCCCCCCCC
Q psy3020 4 YSCLSYLGR---YKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQ--LDPRTKPFL 77 (150)
Q Consensus 4 g~~~~~~~~---~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~--~~~~~~~~~ 77 (150)
|..|...+. ...|+.+|++|+++||+++.++-.++.+ .....+..........+. ....+++. .+|.....+
T Consensus 346 g~~~~~~~~~~~~~~A~~lle~Ai~ldP~~a~a~A~la~~~~~~~~~~~~~~~~l~~a~--~~~~~a~al~~~~~~~~~~ 423 (517)
T PRK10153 346 AHHYLNSGDAKSLNKASDLLEEILKSEPDFTYAQAEKALADIVRHSQQPLDEKQLAALS--TELDNIVALPELNVLPRIY 423 (517)
T ss_pred HHHHHhcCCHHHHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHhcCCccHHHHHHHH--HHHHHhhhcccCcCChHHH
Confidence 444444433 7799999999999999999999888776 443322100011223333 33333333 244333211
Q ss_pred --------CChHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchh
Q psy3020 78 --------SDPSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 78 --------~~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~ 139 (150)
.+..+..+. +.+..+| . +.+|..+|.++...|++++|++.|.+|+.++|.++..+
T Consensus 424 ~ala~~~~~~g~~~~A~~~l~rAl~L~p-s------~~a~~~lG~~~~~~G~~~eA~~~~~~A~~L~P~~pt~~ 490 (517)
T PRK10153 424 EILAVQALVKGKTDEAYQAINKAIDLEM-S------WLNYVLLGKVYELKGDNRLAADAYSTAFNLRPGENTLY 490 (517)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHcCC-C------HHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCCCchHH
Confidence 001111111 1122222 1 67899999999999999999999999999999988644
No 87
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=98.82 E-value=4.6e-08 Score=63.97 Aligned_cols=89 Identities=19% Similarity=0.290 Sum_probs=65.1
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.+|.+++..|++++|+..|++++...|+.. .+...++.+ ...| ++.+|+ ..+.. + ++
T Consensus 53 ~lA~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~~~~--------~~d~Al--~~L~~-~---~~----- 113 (145)
T PF09976_consen 53 QLAKAAYEQGDYDEAKAALEKALANAPDPELKPLARLRLARILLQQG--------QYDEAL--ATLQQ-I---PD----- 113 (145)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHcC--------CHHHHH--HHHHh-c---cC-----
Confidence 477888888999999999999988877653 366777888 7777 555555 33311 1 10
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHH
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAI 129 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al 129 (150)
.+ -.+..+..+|.++...|++++|+..|++||
T Consensus 114 --------------~~------~~~~~~~~~Gdi~~~~g~~~~A~~~y~~Al 145 (145)
T PF09976_consen 114 --------------EA------FKALAAELLGDIYLAQGDYDEARAAYQKAL 145 (145)
T ss_pred --------------cc------hHHHHHHHHHHHHHHCCCHHHHHHHHHHhC
Confidence 00 115677889999999999999999999985
No 88
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=98.80 E-value=9.7e-09 Score=53.48 Aligned_cols=44 Identities=14% Similarity=0.013 Sum_probs=40.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAA 145 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~ 145 (150)
+.+|..+|..+...|++++|++.|+++++.+|+++.+|..+|.+
T Consensus 1 p~~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~La~l 44 (44)
T PF13428_consen 1 PAAWLALARAYRRLGQPDEAERLLRRALALDPDDPEAWRALAQL 44 (44)
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHhhhC
Confidence 35789999999999999999999999999999999999988753
No 89
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=98.80 E-value=1.2e-07 Score=70.20 Aligned_cols=133 Identities=11% Similarity=0.004 Sum_probs=68.7
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC-----
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL----- 77 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~----- 77 (150)
|.++...|++++|++.++++++.+|++..++.. +.. ..+|... .......+++ ......+|......
T Consensus 50 a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~~-~~~~~~~~~~~-~~~~~~~~~l-----~~~~~~~~~~~~~~~~~a~ 122 (355)
T cd05804 50 ALSAWIAGDLPKALALLEQLLDDYPRDLLALKL-HLGAFGLGDFS-GMRDHVARVL-----PLWAPENPDYWYLLGMLAF 122 (355)
T ss_pred HHHHHHcCCHHHHHHHHHHHHHHCCCcHHHHHH-hHHHHHhcccc-cCchhHHHHH-----hccCcCCCCcHHHHHHHHH
Confidence 566777888888888888888888888876653 333 3333211 0000111111 11112222221110
Q ss_pred ---CChHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc----hhhcHHHHH
Q psy3020 78 ---SDPSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI----LFSNRSAAF 146 (150)
Q Consensus 78 ---~~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~----~~~n~a~~~ 146 (150)
..+.+...+ ..+...|.. +..+..+|.+++..|++++|+..+.+++...|.++. .|..+|.++
T Consensus 123 ~~~~~G~~~~A~~~~~~al~~~p~~------~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~~~~~~~~~~~~~la~~~ 196 (355)
T cd05804 123 GLEEAGQYDRAEEAARRALELNPDD------AWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFY 196 (355)
T ss_pred HHHHcCCHHHHHHHHHHHHhhCCCC------cHHHHHHHHHHHHcCCHHHHHHHHHhhhhccCCCcchhHHHHHHHHHHH
Confidence 000111111 113333333 566677777777777777777777777777765433 244566666
Q ss_pred Hhc
Q psy3020 147 AKA 149 (150)
Q Consensus 147 ~~~ 149 (150)
..+
T Consensus 197 ~~~ 199 (355)
T cd05804 197 LER 199 (355)
T ss_pred HHC
Confidence 554
No 90
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=98.80 E-value=7.4e-09 Score=53.92 Aligned_cols=39 Identities=21% Similarity=0.308 Sum_probs=36.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
.+|.+|..+|++++|++.|+++++.+|+|++++..++.+
T Consensus 6 ~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~La~l 44 (44)
T PF13428_consen 6 ALARAYRRLGQPDEAERLLRRALALDPDDPEAWRALAQL 44 (44)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHhhhC
Confidence 589999999999999999999999999999999998853
No 91
>KOG1156|consensus
Probab=98.78 E-value=1.1e-07 Score=73.98 Aligned_cols=134 Identities=17% Similarity=0.245 Sum_probs=90.1
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC-----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF----- 76 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~----- 76 (150)
+|..+...++|++|+++|..||+++|||...+..++.+ ..|+ ++...+ ..-...+.+.|.....
T Consensus 81 ~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmR--------d~~~~~--~tr~~LLql~~~~ra~w~~~A 150 (700)
T KOG1156|consen 81 LGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMR--------DYEGYL--ETRNQLLQLRPSQRASWIGFA 150 (700)
T ss_pred HHHHHhhhhhHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHHH--------hhhhHH--HHHHHHHHhhhhhHHHHHHHH
Confidence 68899999999999999999999999999999999999 9999 444444 3444445555543321
Q ss_pred -----CCC----hH-HHHHHHHHhcCcchhhh---------------------------------HHHhHHHHHHHHHHH
Q psy3020 77 -----LSD----PS-YVQMIKEIQKDPSLMTL---------------------------------EAKISLLKDKGNAAL 113 (150)
Q Consensus 77 -----~~~----~~-~~~~~~~l~~~~~~~~~---------------------------------~~~~~~~~~~g~~~~ 113 (150)
..+ .. +..........|+.... -++.......+.+++
T Consensus 151 vs~~L~g~y~~A~~il~ef~~t~~~~~s~~~~e~se~~Ly~n~i~~E~g~~q~ale~L~~~e~~i~Dkla~~e~ka~l~~ 230 (700)
T KOG1156|consen 151 VAQHLLGEYKMALEILEEFEKTQNTSPSKEDYEHSELLLYQNQILIEAGSLQKALEHLLDNEKQIVDKLAFEETKADLLM 230 (700)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHhhhhHHHHHHHHhhhHHHHHH
Confidence 000 00 00000111111111111 022344455788999
Q ss_pred HhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 114 QANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 114 ~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
+.+++++|+..|...+..+|++...|..+=.|+
T Consensus 231 kl~~lEeA~~~y~~Ll~rnPdn~~Yy~~l~~~l 263 (700)
T KOG1156|consen 231 KLGQLEEAVKVYRRLLERNPDNLDYYEGLEKAL 263 (700)
T ss_pred HHhhHHhHHHHHHHHHhhCchhHHHHHHHHHHH
Confidence 999999999999999999999988777665555
No 92
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=98.77 E-value=1.9e-08 Score=49.28 Aligned_cols=34 Identities=35% Similarity=0.541 Sum_probs=30.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
+.+|..+|.+++.+|++++|+.+|+++++++|++
T Consensus 1 a~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 34 (34)
T PF00515_consen 1 AEAYYNLGNAYFQLGDYEEALEYYQRALELDPDN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTH
T ss_pred CHHHHHHHHHHHHhCCchHHHHHHHHHHHHCcCC
Confidence 4689999999999999999999999999999974
No 93
>KOG1128|consensus
Probab=98.74 E-value=9.9e-08 Score=75.03 Aligned_cols=129 Identities=19% Similarity=0.274 Sum_probs=90.0
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhh-CChHHHHh--hccCCCCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLF-SDPNIFVQ--LQLDPRTKPFLS 78 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~-~~~~~~~~--l~~~~~~~~~~~ 78 (150)
+..||...|+-.+|-....+-++ .|+++..|-.+|++ .... .++++. -+.....+ ....+ ..++
T Consensus 430 vi~CY~~lg~~~kaeei~~q~le-k~~d~~lyc~LGDv~~d~s--------~yEkawElsn~~sarA~r~~~~---~~~~ 497 (777)
T KOG1128|consen 430 VILCYLLLGQHGKAEEINRQELE-KDPDPRLYCLLGDVLHDPS--------LYEKAWELSNYISARAQRSLAL---LILS 497 (777)
T ss_pred HHHHHHHhcccchHHHHHHHHhc-CCCcchhHHHhhhhccChH--------HHHHHHHHhhhhhHHHHHhhcc---cccc
Confidence 34688888888888888888887 67778888888888 5555 344443 11111111 11111 1122
Q ss_pred ChHHHHHHHH----HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 79 DPSYVQMIKE----IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 79 ~~~~~~~~~~----l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+++|.+.... +...|-. ...|+.+|.+..+.++++.|+++|..++.++|++..+|.|++.+|+++
T Consensus 498 ~~~fs~~~~hle~sl~~nplq------~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~eaWnNls~ayi~~ 566 (777)
T KOG1128|consen 498 NKDFSEADKHLERSLEINPLQ------LGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEAWNNLSTAYIRL 566 (777)
T ss_pred chhHHHHHHHHHHHhhcCccc------hhHHHhccHHHHHHhhhHHHHHHHHHHhhcCCCchhhhhhhhHHHHHH
Confidence 2344443333 4445544 899999999999999999999999999999999999999999999875
No 94
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=98.73 E-value=4.5e-08 Score=47.79 Aligned_cols=34 Identities=35% Similarity=0.449 Sum_probs=30.9
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
+..|..+|.+++..|++++|+++|+++++++|++
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~al~l~p~~ 34 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKALELDPNN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTS
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHHHHHCcCC
Confidence 4678999999999999999999999999999986
No 95
>KOG1156|consensus
Probab=98.72 E-value=5.8e-08 Score=75.43 Aligned_cols=105 Identities=15% Similarity=0.129 Sum_probs=90.7
Q ss_pred HHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHH
Q psy3020 7 LSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQM 85 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 85 (150)
.++.++|+.+++..+..|+-.|++++..-+.|.. ..+| .-.+|. .-+...++.|+..
T Consensus 17 ~yE~kQYkkgLK~~~~iL~k~~eHgeslAmkGL~L~~lg--------~~~ea~--~~vr~glr~d~~S------------ 74 (700)
T KOG1156|consen 17 CYETKQYKKGLKLIKQILKKFPEHGESLAMKGLTLNCLG--------KKEEAY--ELVRLGLRNDLKS------------ 74 (700)
T ss_pred HHHHHHHHhHHHHHHHHHHhCCccchhHHhccchhhccc--------chHHHH--HHHHHHhccCccc------------
Confidence 4578999999999999999999999999888888 8999 556666 6666677766643
Q ss_pred HHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 86 IKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 86 ~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...|.-+|.+...-.+|++|++||+.|+.++|+|...|..++....++
T Consensus 75 ----------------~vCwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~Qm 122 (700)
T KOG1156|consen 75 ----------------HVCWHVLGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQM 122 (700)
T ss_pred ----------------chhHHHHHHHHhhhhhHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHH
Confidence 567999999999999999999999999999999999999999887764
No 96
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=98.70 E-value=1.1e-07 Score=63.59 Aligned_cols=101 Identities=14% Similarity=0.137 Sum_probs=67.2
Q ss_pred CHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccC--CCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHH
Q psy3020 12 RYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDM--NRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKE 88 (150)
Q Consensus 12 ~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~--~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 88 (150)
-|+.|.+.++.....||.++++++.=|.+ ..+.+++. ++...+.+|+ ..+..++.++|+.
T Consensus 6 ~FE~ark~aea~y~~nP~DadnL~~WG~ALLELAqfk~g~es~~miedAi--sK~eeAL~I~P~~--------------- 68 (186)
T PF06552_consen 6 FFEHARKKAEAAYAKNPLDADNLTNWGGALLELAQFKQGPESKKMIEDAI--SKFEEALKINPNK--------------- 68 (186)
T ss_dssp HHHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHHHS-HHHHHHHHHHHH--HHHHHHHHH-TT----------------
T ss_pred HHHHHHHHHHHHHHhCcHhHHHHHHHHHHHHHHHhccCcchHHHHHHHHH--HHHHHHHhcCCch---------------
Confidence 37889999999999999999988766655 44443321 1112456677 7777788888854
Q ss_pred HhcCcchhhhHHHhHHHHHHHHHHHHh-----------cCHHHHHHHHHHHHhhcCCCcchhhcH
Q psy3020 89 IQKDPSLMTLEAKISLLKDKGNAALQA-----------NNFKEAIEAYSEAIKLDGTNHILFSNR 142 (150)
Q Consensus 89 l~~~~~~~~~~~~~~~~~~~g~~~~~~-----------~~~~~A~~~~~~al~~~p~~~~~~~n~ 142 (150)
..++..+|+++... .-|++|.++|.+|+..+|+|......+
T Consensus 69 -------------hdAlw~lGnA~ts~A~l~~d~~~A~~~F~kA~~~FqkAv~~~P~ne~Y~ksL 120 (186)
T PF06552_consen 69 -------------HDALWCLGNAYTSLAFLTPDTAEAEEYFEKATEYFQKAVDEDPNNELYRKSL 120 (186)
T ss_dssp -------------HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH-TT-HHHHHHH
T ss_pred -------------HHHHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHHHHHHhcCCCcHHHHHHH
Confidence 55666666666543 448899999999999999887654443
No 97
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=98.69 E-value=1.5e-06 Score=67.62 Aligned_cols=143 Identities=20% Similarity=0.197 Sum_probs=88.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC----
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF---- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~---- 76 (150)
..|.++.++|++++|...|...|+.+|+|...+.++..+ -...... ........ .-+.......|.....
T Consensus 43 ~rA~ll~kLg~~~eA~~~y~~Li~rNPdn~~Yy~~L~~~~g~~~~~~---~~~~~~~~--~~y~~l~~~yp~s~~~~rl~ 117 (517)
T PF12569_consen 43 KRAELLLKLGRKEEAEKIYRELIDRNPDNYDYYRGLEEALGLQLQLS---DEDVEKLL--ELYDELAEKYPRSDAPRRLP 117 (517)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHHCCCcHHHHHHHHHHHhhhcccc---cccHHHHH--HHHHHHHHhCccccchhHhh
Confidence 368999999999999999999999999999999998777 2222110 00111111 1111111111211100
Q ss_pred --------------------CC--ChHHHHHHHHHhcCcchhhh---------------------------HHH--hHHH
Q psy3020 77 --------------------LS--DPSYVQMIKEIQKDPSLMTL---------------------------EAK--ISLL 105 (150)
Q Consensus 77 --------------------~~--~~~~~~~~~~l~~~~~~~~~---------------------------~~~--~~~~ 105 (150)
+. -|.+..-+..+..++..... ++. ..++
T Consensus 118 L~~~~g~~F~~~~~~yl~~~l~KgvPslF~~lk~Ly~d~~K~~~i~~l~~~~~~~l~~~~~~~~~~~~~~~~p~~~lw~~ 197 (517)
T PF12569_consen 118 LDFLEGDEFKERLDEYLRPQLRKGVPSLFSNLKPLYKDPEKAAIIESLVEEYVNSLESNGSFSNGDDEEKEPPSTLLWTL 197 (517)
T ss_pred cccCCHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHcChhHHHHHHHHHHHHHHhhcccCCCCCccccccCCchHHHHHH
Confidence 00 01111111222111111000 111 3566
Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 106 KDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 106 ~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..+++.+-..|++++|+++.++||+..|+.+.+|+..|.+|-..
T Consensus 198 ~~lAqhyd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilKh~ 241 (517)
T PF12569_consen 198 YFLAQHYDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILKHA 241 (517)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHC
Confidence 88899999999999999999999999999999999999998654
No 98
>KOG4162|consensus
Probab=98.69 E-value=2.2e-07 Score=73.50 Aligned_cols=111 Identities=20% Similarity=0.181 Sum_probs=97.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
..|..+...++.++|..|..+|-+++|-.+..++..|.+ ...| ...+|. ..+..++.++|++
T Consensus 655 laa~~~~~~~~~~~a~~CL~Ea~~~~~l~~~~~~~~G~~~~~~~--------~~~EA~--~af~~Al~ldP~h------- 717 (799)
T KOG4162|consen 655 LAADLFLLSGNDDEARSCLLEASKIDPLSASVYYLRGLLLEVKG--------QLEEAK--EAFLVALALDPDH------- 717 (799)
T ss_pred HHHHHHHhcCCchHHHHHHHHHHhcchhhHHHHHHhhHHHHHHH--------hhHHHH--HHHHHHHhcCCCC-------
Confidence 456778888999999999999999999999999999999 8888 556677 7777788888876
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHH--HHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIE--AYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~--~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
+.....+|.++.+.|+-.-|.. ....|+++||.++.+|+++|.++.+++
T Consensus 718 ---------------------v~s~~Ala~~lle~G~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k~~G 768 (799)
T KOG4162|consen 718 ---------------------VPSMTALAELLLELGSPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFKKLG 768 (799)
T ss_pred ---------------------cHHHHHHHHHHHHhCCcchHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcc
Confidence 5677888999999999888888 999999999999999999999988764
No 99
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.67 E-value=7.4e-08 Score=55.39 Aligned_cols=63 Identities=16% Similarity=0.247 Sum_probs=56.8
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP 75 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~ 75 (150)
|..+|...++|++|+++++++++++|+++.++..+|.+ ..+| ++.+++ ..+.+.+...|+...
T Consensus 1 l~~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~~g--------~~~~A~--~~l~~~l~~~p~~~~ 64 (73)
T PF13371_consen 1 LKQIYLQQEDYEEALEVLERALELDPDDPELWLQRARCLFQLG--------RYEEAL--EDLERALELSPDDPD 64 (73)
T ss_pred CHHHHHhCCCHHHHHHHHHHHHHhCcccchhhHHHHHHHHHhc--------cHHHHH--HHHHHHHHHCCCcHH
Confidence 46789999999999999999999999999999999999 9999 778888 888889988886543
No 100
>KOG4340|consensus
Probab=98.67 E-value=6.1e-08 Score=69.70 Aligned_cols=130 Identities=17% Similarity=0.174 Sum_probs=88.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.+|.||...++|..|.+||++.-.+.|.......--++. +..+. +..|+ +.+.+--+....+++.
T Consensus 49 lLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~qYrlY~AQSLY~A~i--------~ADAL------rV~~~~~D~~~L~~~~ 114 (459)
T KOG4340|consen 49 LLGYCYYRLQEFALAAECYEQLGQLHPELEQYRLYQAQSLYKACI--------YADAL------RVAFLLLDNPALHSRV 114 (459)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhChHHHHHHHHHHHHHHHhcc--------cHHHH------HHHHHhcCCHHHHHHH
Confidence 589999999999999999999999999988877666666 55552 22222 1111111111111000
Q ss_pred -HHHHHHH-----------HHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 81 -SYVQMIK-----------EIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 81 -~~~~~~~-----------~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
....+|. -++.-|+. ..+....+.|++.++.|+|++|++-|..|++...-+|.+-+|.|.|+++
T Consensus 115 lqLqaAIkYse~Dl~g~rsLveQlp~e----n~Ad~~in~gCllykegqyEaAvqkFqaAlqvsGyqpllAYniALaHy~ 190 (459)
T KOG4340|consen 115 LQLQAAIKYSEGDLPGSRSLVEQLPSE----NEADGQINLGCLLYKEGQYEAAVQKFQAALQVSGYQPLLAYNLALAHYS 190 (459)
T ss_pred HHHHHHHhcccccCcchHHHHHhccCC----CccchhccchheeeccccHHHHHHHHHHHHhhcCCCchhHHHHHHHHHh
Confidence 0111110 01111211 2278889999999999999999999999999999999999999999986
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
.
T Consensus 191 ~ 191 (459)
T KOG4340|consen 191 S 191 (459)
T ss_pred h
Confidence 3
No 101
>KOG4642|consensus
Probab=98.65 E-value=1.1e-07 Score=66.21 Aligned_cols=94 Identities=19% Similarity=0.166 Sum_probs=80.1
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
|..++--.+|..|+.+|-+||.++|..+..++..+.+ ..+.++ +... .+..+++.++|+.
T Consensus 17 gnk~f~~k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk~~~~--------~~v~--~dcrralql~~N~--------- 77 (284)
T KOG4642|consen 17 GNKCFIPKRYDDAIDCYSRAICINPTVASYYTNRALCHLKLKHW--------EPVE--EDCRRALQLDPNL--------- 77 (284)
T ss_pred cccccchhhhchHHHHHHHHHhcCCCcchhhhhHHHHHHHhhhh--------hhhh--hhHHHHHhcChHH---------
Confidence 6678888999999999999999999999999999999 777744 4444 6777888877742
Q ss_pred HHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 83 VQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
+.+++.+|..+.+...|++||.++.+|.++-..+
T Consensus 78 -------------------vk~h~flg~~~l~s~~~~eaI~~Lqra~sl~r~~ 111 (284)
T KOG4642|consen 78 -------------------VKAHYFLGQWLLQSKGYDEAIKVLQRAYSLLREQ 111 (284)
T ss_pred -------------------HHHHHHHHHHHHhhccccHHHHHHHHHHHHHhcC
Confidence 7889999999999999999999999997765543
No 102
>KOG2076|consensus
Probab=98.65 E-value=7.7e-07 Score=71.44 Aligned_cols=49 Identities=14% Similarity=0.132 Sum_probs=43.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
...|..+|.++..+|.+++|+++|.++|.++|++..+...++.+|.+++
T Consensus 449 ~~vw~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~~D~Ri~Lasl~~~~g 497 (895)
T KOG2076|consen 449 AFVWYKLARCYMELGEYEEAIEFYEKVLILAPDNLDARITLASLYQQLG 497 (895)
T ss_pred hhhhHHHHHHHHHHhhHHHHHHHHHHHHhcCCCchhhhhhHHHHHHhcC
Confidence 5678889999999999999999999999999999999999998887763
No 103
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=98.65 E-value=2e-07 Score=70.58 Aligned_cols=45 Identities=4% Similarity=-0.031 Sum_probs=40.7
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhcc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMND 47 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~ 47 (150)
.+..+...|+++.|+..++++++.+|+++.++..++.+ ...|+++
T Consensus 159 ~a~l~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll~~~~~~~gdw~ 204 (398)
T PRK10747 159 RVRIQLARNENHAARHGVDKLLEVAPRHPEVLRLAEQAYIRTGAWS 204 (398)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhHH
Confidence 36788899999999999999999999999999999988 8888765
No 104
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=98.64 E-value=3.7e-07 Score=61.06 Aligned_cols=102 Identities=11% Similarity=0.072 Sum_probs=81.5
Q ss_pred CCHHHHHHHHHHhhccCCCc--HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHH
Q psy3020 11 GRYKESISTYEEGLKLDPNN--EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIK 87 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~~--~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 87 (150)
..|..+...+...++.++.+ ...+..++.+ ..+| .+.+|+ ..+.+++.+.|+...
T Consensus 13 ~~~~~~~~~l~~~~~~~~~~~~a~~~~~~g~~~~~~g--------~~~~A~--~~~~~al~l~~~~~~------------ 70 (168)
T CHL00033 13 KTFTIVADILLRILPTTSGEKEAFTYYRDGMSAQSEG--------EYAEAL--QNYYEAMRLEIDPYD------------ 70 (168)
T ss_pred cccccchhhhhHhccCCchhHHHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHhccccchh------------
Confidence 45667777787777777766 4566788888 8888 667777 677777766553211
Q ss_pred HHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 88 EIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 88 ~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
.+.+|.++|.++...|++++|+++|.+++.++|.....+.++|.++.
T Consensus 71 -------------~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~~~~~~~~la~i~~ 117 (168)
T CHL00033 71 -------------RSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMAVICH 117 (168)
T ss_pred -------------hHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHH
Confidence 15689999999999999999999999999999999999999999987
No 105
>KOG0553|consensus
Probab=98.64 E-value=7.5e-08 Score=68.81 Aligned_cols=49 Identities=39% Similarity=0.689 Sum_probs=46.4
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
++.++.-|+-+++.++|.+|+..|++||+++|+|+..|.|||.+|.++.
T Consensus 81 AE~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg 129 (304)
T KOG0553|consen 81 AESLKNEGNKLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLG 129 (304)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhc
Confidence 8889999999999999999999999999999999999999999999863
No 106
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=98.64 E-value=6e-07 Score=63.68 Aligned_cols=136 Identities=4% Similarity=-0.062 Sum_probs=89.8
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCcHH---HHHHHHHH-Hhhhh-----cc--CCCCC---chhhhhCChHHHHh
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNNEQ---MKEAIKDV-RNQEM-----ND--MNRGD---PFANLFSDPNIFVQ 66 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~---a~~~l~~~-~~~g~-----~~--~~~~~---~~~~a~~~~~~~~~ 66 (150)
+++|.+|++.++|++|+..|++.++.+|+++. ++..+|.+ ..++. +. ...++ ....|+ ..+.+.
T Consensus 73 l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~~~~~a~Y~~g~~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~--~~~~~l 150 (243)
T PRK10866 73 LDLIYAYYKNADLPLAQAAIDRFIRLNPTHPNIDYVLYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAF--RDFSKL 150 (243)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHhCcCCCchHHHHHHHHHhhhhcchhhhhhccCCCccccCHHHHHHHH--HHHHHH
Confidence 36899999999999999999999999998764 55666655 33331 10 01111 234566 677777
Q ss_pred hccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc---chhhcHH
Q psy3020 67 LQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH---ILFSNRS 143 (150)
Q Consensus 67 l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~---~~~~n~a 143 (150)
++.-|++.-.- .....+..+.... +.--...|..+++.|.|..|+.-++..++.-|+.+ .+.+.++
T Consensus 151 i~~yP~S~ya~---~A~~rl~~l~~~l--------a~~e~~ia~~Y~~~~~y~AA~~r~~~v~~~Yp~t~~~~eal~~l~ 219 (243)
T PRK10866 151 VRGYPNSQYTT---DATKRLVFLKDRL--------AKYELSVAEYYTKRGAYVAVVNRVEQMLRDYPDTQATRDALPLME 219 (243)
T ss_pred HHHCcCChhHH---HHHHHHHHHHHHH--------HHHHHHHHHHHHHcCchHHHHHHHHHHHHHCCCCchHHHHHHHHH
Confidence 78888754321 1122222221111 44445788899999999999999999999999864 5556666
Q ss_pred HHHHhc
Q psy3020 144 AAFAKA 149 (150)
Q Consensus 144 ~~~~~~ 149 (150)
.+|..+
T Consensus 220 ~ay~~l 225 (243)
T PRK10866 220 NAYRQL 225 (243)
T ss_pred HHHHHc
Confidence 666543
No 107
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=98.63 E-value=4.8e-07 Score=68.76 Aligned_cols=46 Identities=24% Similarity=0.304 Sum_probs=39.1
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHH--HHHhhcCCCcchhhcHHHHHHhc
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYS--EAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~--~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..+..+|+++++.|+|++|.++|+ .+++.+|++... ..+|.++.++
T Consensus 336 ~ll~sLg~l~~~~~~~~~A~~~le~a~a~~~~p~~~~~-~~La~ll~~~ 383 (409)
T TIGR00540 336 CINRALGQLLMKHGEFIEAADAFKNVAACKEQLDANDL-AMAADAFDQA 383 (409)
T ss_pred HHHHHHHHHHHHcccHHHHHHHHHHhHHhhcCCCHHHH-HHHHHHHHHc
Confidence 677889999999999999999999 688899987764 4788888754
No 108
>KOG4555|consensus
Probab=98.62 E-value=5.1e-07 Score=57.53 Aligned_cols=97 Identities=14% Similarity=0.136 Sum_probs=81.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
-|.++.+.|+.+.|++-|.++|.+-|.++.++.+.++. +..| +.++++ .++.+++.+..+-..
T Consensus 49 ~~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~--------~~e~AL--dDLn~AleLag~~tr------ 112 (175)
T KOG4555|consen 49 KAIALAEAGDLDGALELFGQALCLAPERASAYNNRAQALRLQG--------DDEEAL--DDLNKALELAGDQTR------ 112 (175)
T ss_pred HHHHHHhccchHHHHHHHHHHHHhcccchHhhccHHHHHHHcC--------ChHHHH--HHHHHHHHhcCccch------
Confidence 47788999999999999999999999999999999999 8888 667888 888888876653211
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
....++..+|.++..+|+-+.|-..|..|-++-.
T Consensus 113 ------------------tacqa~vQRg~lyRl~g~dd~AR~DFe~AA~LGS 146 (175)
T KOG4555|consen 113 ------------------TACQAFVQRGLLYRLLGNDDAARADFEAAAQLGS 146 (175)
T ss_pred ------------------HHHHHHHHHHHHHHHhCchHHHHHhHHHHHHhCC
Confidence 1267889999999999999999999988766544
No 109
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=98.61 E-value=4.5e-07 Score=52.82 Aligned_cols=31 Identities=23% Similarity=0.454 Sum_probs=28.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
+..+.++|.++...|++++|+++|++++++.
T Consensus 46 a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~ 76 (78)
T PF13424_consen 46 ANTLNNLGECYYRLGDYEEALEYYQKALDIF 76 (78)
T ss_dssp HHHHHHHHHHHHHTTHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHhhh
Confidence 6788999999999999999999999999874
No 110
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=98.60 E-value=3.8e-07 Score=61.29 Aligned_cols=97 Identities=14% Similarity=0.163 Sum_probs=73.7
Q ss_pred HHHHHhhccCCC--cHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcc
Q psy3020 18 STYEEGLKLDPN--NEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPS 94 (150)
Q Consensus 18 ~~~~~aL~~~p~--~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~ 94 (150)
+.+.+.+.++++ ...++..+|.+ ...| ++.+|+ ..+.+++...|+...
T Consensus 20 ~~~~~~~~~~~~~~~a~~~~~lg~~~~~~g--------~~~~A~--~~~~~al~~~~~~~~------------------- 70 (172)
T PRK02603 20 DLILKILPINKKAKEAFVYYRDGMSAQADG--------EYAEAL--ENYEEALKLEEDPND------------------- 70 (172)
T ss_pred HHHHHHcccccHhhhHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHhhccch-------------------
Confidence 344555555554 44566788888 7888 667777 666666655553211
Q ss_pred hhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 95 LMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 95 ~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
....+..+|.++...|++++|+.+|.+++.++|+++..+.++|.+|..+
T Consensus 71 ------~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~lg~~~~~~ 119 (172)
T PRK02603 71 ------RSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALNNIAVIYHKR 119 (172)
T ss_pred ------HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHc
Confidence 1568899999999999999999999999999999999999999998754
No 111
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=98.59 E-value=9e-07 Score=72.38 Aligned_cols=126 Identities=10% Similarity=-0.009 Sum_probs=94.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.|...+...+++++|++.++.+++..|+...++..+|.+ ...++. ..+. ....+...+....+.--.
T Consensus 36 ~Li~~~~~~~~~deai~i~~~~l~~~P~~i~~yy~~G~l~~q~~~~--------~~~~----lv~~l~~~~~~~~~~~ve 103 (906)
T PRK14720 36 DLIDAYKSENLTDEAKDICEEHLKEHKKSISALYISGILSLSRRPL--------NDSN----LLNLIDSFSQNLKWAIVE 103 (906)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHhCCcceehHHHHHHHHHhhcch--------hhhh----hhhhhhhcccccchhHHH
Confidence 367788899999999999999999999999999999888 777732 2222 114444444443331101
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
.+...+... ++. -.+++.+|.+|-+.|++++|+..|++++++||+|+.+..|.|-.|.+
T Consensus 104 ~~~~~i~~~---~~~------k~Al~~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~AY~~ae 162 (906)
T PRK14720 104 HICDKILLY---GEN------KLALRTLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKKLATSYEE 162 (906)
T ss_pred HHHHHHHhh---hhh------hHHHHHHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHH
Confidence 233333332 222 56899999999999999999999999999999999999999988764
No 112
>PRK11906 transcriptional regulator; Provisional
Probab=98.58 E-value=1.1e-06 Score=66.58 Aligned_cols=112 Identities=6% Similarity=0.026 Sum_probs=63.0
Q ss_pred HHHHHHHHHHhh---ccCCCcHHHHHHHHHH-Hhh--hhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-------
Q psy3020 13 YKESISTYEEGL---KLDPNNEQMKEAIKDV-RNQ--EMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD------- 79 (150)
Q Consensus 13 ~~~A~~~~~~aL---~~~p~~~~a~~~l~~~-~~~--g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~------- 79 (150)
.+.|+..|.+|+ .+||+++.++-.++.+ ... ..+- .......++. ....+++.++|.+..++.-
T Consensus 274 ~~~Al~lf~ra~~~~~ldp~~a~a~~~lA~~h~~~~~~g~~-~~~~~~~~a~--~~A~rAveld~~Da~a~~~~g~~~~~ 350 (458)
T PRK11906 274 IYRAMTIFDRLQNKSDIQTLKTECYCLLAECHMSLALHGKS-ELELAAQKAL--ELLDYVSDITTVDGKILAIMGLITGL 350 (458)
T ss_pred HHHHHHHHHHHhhcccCCcccHHHHHHHHHHHHHHHHhcCC-CchHHHHHHH--HHHHHHHhcCCCCHHHHHHHHHHHHh
Confidence 357899999999 9999999999999998 332 1111 1222444455 5566677777766432100
Q ss_pred -----hHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 80 -----PSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 80 -----~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
.....+-+.+.-+|.. +.+|...|.+++..|+.++|++..++|++++|
T Consensus 351 ~~~~~~a~~~f~rA~~L~Pn~------A~~~~~~~~~~~~~G~~~~a~~~i~~alrLsP 403 (458)
T PRK11906 351 SGQAKVSHILFEQAKIHSTDI------ASLYYYRALVHFHNEKIEEARICIDKSLQLEP 403 (458)
T ss_pred hcchhhHHHHHHHHhhcCCcc------HHHHHHHHHHHHHcCCHHHHHHHHHHHhccCc
Confidence 0000111112233333 55555555555555555555555555555555
No 113
>KOG1173|consensus
Probab=98.57 E-value=2.1e-07 Score=71.43 Aligned_cols=131 Identities=11% Similarity=0.071 Sum_probs=87.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhh----ccCCCCCCC-
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQL----QLDPRTKPF- 76 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l----~~~~~~~~~- 76 (150)
+|.-|..+++++-|-+.|.+|+.+.|+++-+...+|-+ +..+.+ .+|. .-+..++ ...++...|
T Consensus 386 lgmey~~t~n~kLAe~Ff~~A~ai~P~Dplv~~Elgvvay~~~~y--------~~A~--~~f~~~l~~ik~~~~e~~~w~ 455 (611)
T KOG1173|consen 386 LGMEYMRTNNLKLAEKFFKQALAIAPSDPLVLHELGVVAYTYEEY--------PEAL--KYFQKALEVIKSVLNEKIFWE 455 (611)
T ss_pred HHHHHHHhccHHHHHHHHHHHHhcCCCcchhhhhhhheeehHhhh--------HHHH--HHHHHHHHHhhhccccccchh
Confidence 78889999999999999999999999999999999998 877744 3443 3333333 112221111
Q ss_pred --CCC-hHHHHHHHHHhcCcchhhh-----HHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHH
Q psy3020 77 --LSD-PSYVQMIKEIQKDPSLMTL-----EAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRS 143 (150)
Q Consensus 77 --~~~-~~~~~~~~~l~~~~~~~~~-----~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a 143 (150)
... +...+.++.........+. ++.+.++...|-++..+|+++.|+++|.++|.++|++...---++
T Consensus 456 p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~k~~~~~asig~iy~llgnld~Aid~fhKaL~l~p~n~~~~~lL~ 530 (611)
T KOG1173|consen 456 PTLNNLGHAYRKLNKYEEAIDYYQKALLLSPKDASTHASIGYIYHLLGNLDKAIDHFHKALALKPDNIFISELLK 530 (611)
T ss_pred HHHHhHHHHHHHHhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHHHhcChHHHHHHHHHHHhcCCccHHHHHHHH
Confidence 000 1222222221111111111 122899999999999999999999999999999999965544333
No 114
>KOG4234|consensus
Probab=98.57 E-value=8.9e-07 Score=60.37 Aligned_cols=97 Identities=19% Similarity=0.270 Sum_probs=75.2
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHH-----HHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQ-----MKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~-----a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
|.-++..|.|++|..-|..||.+-|.-+. .+...+.+ ..++ .-+.++ .++-+++.++|.+
T Consensus 102 GN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~--------k~e~aI--~dcsKaiel~pty---- 167 (271)
T KOG4234|consen 102 GNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLR--------KWESAI--EDCSKAIELNPTY---- 167 (271)
T ss_pred HHHhhhcccHHHHHHHHHHHHHhCccccHHHHHHHHhhhHHHHHHhh--------hHHHHH--HHHHhhHhcCchh----
Confidence 67789999999999999999999997654 23344545 5555 445666 7787888888853
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
..++..++.+|-+..+|++|++.|.+.++++|....+
T Consensus 168 ------------------------~kAl~RRAeayek~ek~eealeDyKki~E~dPs~~ea 204 (271)
T KOG4234|consen 168 ------------------------EKALERRAEAYEKMEKYEEALEDYKKILESDPSRREA 204 (271)
T ss_pred ------------------------HHHHHHHHHHHHhhhhHHHHHHHHHHHHHhCcchHHH
Confidence 4455566777888899999999999999999987644
No 115
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=98.57 E-value=1.6e-07 Score=70.95 Aligned_cols=49 Identities=14% Similarity=0.210 Sum_probs=46.1
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc---hhhcHHHHHHhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI---LFSNRSAAFAKAI 150 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~---~~~n~a~~~~~~~ 150 (150)
+.+|.++|.+++..|+|++|+.+|+++|+++|+++. +|+|+|.||..+.
T Consensus 75 a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~aeA~~A~yNLAcaya~LG 126 (453)
T PLN03098 75 AEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYRE 126 (453)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHHHHHcC
Confidence 999999999999999999999999999999999985 4999999998763
No 116
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=98.55 E-value=7.6e-08 Score=47.06 Aligned_cols=29 Identities=34% Similarity=0.634 Sum_probs=26.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN 30 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~ 30 (150)
.+|.+|..+|++++|+.+|++||+++|+|
T Consensus 6 ~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 34 (34)
T PF00515_consen 6 NLGNAYFQLGDYEEALEYYQRALELDPDN 34 (34)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHHSTTH
T ss_pred HHHHHHHHhCCchHHHHHHHHHHHHCcCC
Confidence 58999999999999999999999999986
No 117
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=98.53 E-value=2e-07 Score=70.40 Aligned_cols=88 Identities=20% Similarity=0.309 Sum_probs=58.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHH---HHHHHHH-HhhhhccCCCCCchhhhhC-ChHHHHhhccCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQM---KEAIKDV-RNQEMNDMNRGDPFANLFS-DPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a---~~~l~~~-~~~g~~~~~~~~~~~~a~~-~~~~~~~l~~~~~~~~~ 76 (150)
++|.+|+..|+|++|+.+|++||+++|++.++ |..+|-+ ..+|+.+ +|..++.+|+. +......+..+|+...+
T Consensus 80 NLG~AL~~lGryeEAIa~f~rALeL~Pd~aeA~~A~yNLAcaya~LGr~d-EAla~LrrALelsn~~f~~i~~DpdL~pl 158 (453)
T PLN03098 80 NLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGK-KAADCLRTALRDYNLKFSTILNDPDLAPF 158 (453)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHHHHHcCCHH-HHHHHHHHHHHhcchhHHHHHhCcchhhh
Confidence 68999999999999999999999999999965 9999999 8889544 33333444441 11122233444444334
Q ss_pred CCChHHHHHHHHHh
Q psy3020 77 LSDPSYVQMIKEIQ 90 (150)
Q Consensus 77 ~~~~~~~~~~~~l~ 90 (150)
.+.+.+...+....
T Consensus 159 R~~pef~eLlee~r 172 (453)
T PLN03098 159 RASPEFKELQEEAR 172 (453)
T ss_pred cccHHHHHHHHHHH
Confidence 44445555555443
No 118
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=98.52 E-value=2.9e-06 Score=63.99 Aligned_cols=105 Identities=18% Similarity=0.219 Sum_probs=80.5
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
-..+...++++.|++.|++..+.+|+ +...++.+ ..+++ -.+|+ .-+.+.+...|.+
T Consensus 176 l~~l~~t~~~~~ai~lle~L~~~~pe---v~~~LA~v~l~~~~--------E~~AI--~ll~~aL~~~p~d--------- 233 (395)
T PF09295_consen 176 LKYLSLTQRYDEAIELLEKLRERDPE---VAVLLARVYLLMNE--------EVEAI--RLLNEALKENPQD--------- 233 (395)
T ss_pred HHHHhhcccHHHHHHHHHHHHhcCCc---HHHHHHHHHHhcCc--------HHHHH--HHHHHHHHhCCCC---------
Confidence 34556678999999999999999987 34446776 55552 23444 4555555555533
Q ss_pred HHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 83 VQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...+...+..+..+++++.|+++.++++.+.|++-..|+.+|.||+++
T Consensus 234 -------------------~~LL~~Qa~fLl~k~~~~lAL~iAk~av~lsP~~f~~W~~La~~Yi~~ 281 (395)
T PF09295_consen 234 -------------------SELLNLQAEFLLSKKKYELALEIAKKAVELSPSEFETWYQLAECYIQL 281 (395)
T ss_pred -------------------HHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCchhHHHHHHHHHHHHhc
Confidence 566777888899999999999999999999999999999999999875
No 119
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=98.50 E-value=2e-06 Score=59.37 Aligned_cols=135 Identities=14% Similarity=0.051 Sum_probs=85.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-Hhhhhcc---CCCCCchhhhhCChHHHHhhccCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMND---MNRGDPFANLFSDPNIFVQLQLDPRTK 74 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~---~~~~~~~~~a~~~~~~~~~l~~~~~~~ 74 (150)
.+|.+++..|+|++|+..|++.++..|+++ +++..+|.+ ..+.... ..-.....+|+ ..+...+..-|++.
T Consensus 47 ~la~a~y~~~~y~~A~~~~~~fi~~yP~~~~~~~A~Y~~g~~~~~~~~~~~~~~~D~~~~~~A~--~~~~~li~~yP~S~ 124 (203)
T PF13525_consen 47 MLAYAYYKQGDYEEAIAAYERFIKLYPNSPKADYALYMLGLSYYKQIPGILRSDRDQTSTRKAI--EEFEELIKRYPNSE 124 (203)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHH-TT-TTHHHHHHHHHHHHHHHHHHHH-TT---HHHHHHH--HHHHHHHHH-TTST
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhhHHHHHHHHHHHhCccchhcccChHHHHHHH--HHHHHHHHHCcCch
Confidence 588999999999999999999999999865 466666666 3332110 00001345566 67777777788765
Q ss_pred CCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc---chhhcHHHHHHhc
Q psy3020 75 PFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH---ILFSNRSAAFAKA 149 (150)
Q Consensus 75 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~---~~~~n~a~~~~~~ 149 (150)
-.- .....+..+.... ..--...|..+++.|.|..|+.-|+..++..|+.+ .++..++.+|.++
T Consensus 125 y~~---~A~~~l~~l~~~l--------a~~e~~ia~~Y~~~~~y~aA~~r~~~v~~~yp~t~~~~~al~~l~~~y~~l 191 (203)
T PF13525_consen 125 YAE---EAKKRLAELRNRL--------AEHELYIARFYYKRGKYKAAIIRFQYVIENYPDTPAAEEALARLAEAYYKL 191 (203)
T ss_dssp THH---HHHHHHHHHHHHH--------HHHHHHHHHHHHCTT-HHHHHHHHHHHHHHSTTSHHHHHHHHHHHHHHHHT
T ss_pred HHH---HHHHHHHHHHHHH--------HHHHHHHHHHHHHcccHHHHHHHHHHHHHHCCCCchHHHHHHHHHHHHHHh
Confidence 422 1222222221111 45556689999999999999999999999999975 4455566666554
No 120
>KOG0543|consensus
Probab=98.50 E-value=1.8e-06 Score=64.11 Aligned_cols=67 Identities=24% Similarity=0.302 Sum_probs=61.7
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
++++.||.++++|.+|+..++++|.++|+|.-+++..|.+ ..+| ++..|. .++.+++.+.|+++.+.
T Consensus 261 lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~--------e~~~A~--~df~ka~k~~P~Nka~~ 328 (397)
T KOG0543|consen 261 LNLAACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQALLALG--------EYDLAR--DDFQKALKLEPSNKAAR 328 (397)
T ss_pred hHHHHHHHhhhhHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhhc--------cHHHHH--HHHHHHHHhCCCcHHHH
Confidence 4799999999999999999999999999999999999999 9999 777888 99999999999986643
No 121
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=98.49 E-value=1e-06 Score=65.84 Aligned_cols=67 Identities=27% Similarity=0.364 Sum_probs=57.9
Q ss_pred chhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 53 PFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 53 ~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
++..|+ ..+.+++.++|++ +.+|..+|.++...|++++|+..+.++++++
T Consensus 17 ~~~~Ai--~~~~~Al~~~P~~----------------------------~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~ 66 (356)
T PLN03088 17 DFALAV--DLYTQAIDLDPNN----------------------------AELYADRAQANIKLGNFTEAVADANKAIELD 66 (356)
T ss_pred CHHHHH--HHHHHHHHhCCCC----------------------------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhC
Confidence 556666 6666666666653 6788999999999999999999999999999
Q ss_pred CCCcchhhcHHHHHHhc
Q psy3020 133 GTNHILFSNRSAAFAKA 149 (150)
Q Consensus 133 p~~~~~~~n~a~~~~~~ 149 (150)
|+++.+|+++|.+|+.+
T Consensus 67 P~~~~a~~~lg~~~~~l 83 (356)
T PLN03088 67 PSLAKAYLRKGTACMKL 83 (356)
T ss_pred cCCHHHHHHHHHHHHHh
Confidence 99999999999999875
No 122
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=98.48 E-value=2.8e-06 Score=61.77 Aligned_cols=48 Identities=23% Similarity=0.196 Sum_probs=41.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..+..++.+.+.+|+|++|.+.+.++++.+|+++....|++.|..-+
T Consensus 201 ~~~lng~A~~~l~~~~~~eAe~~L~~al~~~~~~~d~LaNliv~~~~~ 248 (290)
T PF04733_consen 201 PKLLNGLAVCHLQLGHYEEAEELLEEALEKDPNDPDTLANLIVCSLHL 248 (290)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHh
Confidence 677888999999999999999999999999999999999999886543
No 123
>KOG1127|consensus
Probab=98.47 E-value=1.4e-06 Score=70.99 Aligned_cols=129 Identities=13% Similarity=0.105 Sum_probs=79.8
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCC----------
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDP---------- 71 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~---------- 71 (150)
||..|....+...|..+|++|.++||+++++.-+.++. .....+ ..++ ..+.++-..+|
T Consensus 498 LG~iYrd~~Dm~RA~kCf~KAFeLDatdaeaaaa~adtyae~~~w--------e~a~--~I~l~~~qka~a~~~k~nW~~ 567 (1238)
T KOG1127|consen 498 LGQIYRDSDDMKRAKKCFDKAFELDATDAEAAAASADTYAEESTW--------EEAF--EICLRAAQKAPAFACKENWVQ 567 (1238)
T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCchhhhhHHHHHHHhhccccH--------HHHH--HHHHHHhhhchHHHHHhhhhh
Confidence 56666666677778888888888888888777777776 444422 2222 11111111111
Q ss_pred ---CCCCCCCChHHHHHH-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 72 ---RTKPFLSDPSYVQMI-KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 72 ---~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
-+.+..+-.....-+ -.+..+|.+ ...|..+|.+|...|+|+-|++.|++|..++|.+.-..|-.|....
T Consensus 568 rG~yyLea~n~h~aV~~fQsALR~dPkD------~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~y~~fk~A~~ec 641 (1238)
T KOG1127|consen 568 RGPYYLEAHNLHGAVCEFQSALRTDPKD------YNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSKYGRFKEAVMEC 641 (1238)
T ss_pred ccccccCccchhhHHHHHHHHhcCCchh------HHHHHHHHHHHHhcCceehHHHhhhhhHhcCcHhHHHHHHHHHHHH
Confidence 111111111111111 225566655 8999999999999999999999999999999998877776666543
No 124
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=98.46 E-value=2e-07 Score=45.40 Aligned_cols=29 Identities=41% Similarity=0.647 Sum_probs=26.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN 30 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~ 30 (150)
.+|.+++.+|++++|+++|+++++++|+|
T Consensus 6 ~lg~~~~~~~~~~~A~~~~~~al~l~p~~ 34 (34)
T PF07719_consen 6 YLGQAYYQLGNYEEAIEYFEKALELDPNN 34 (34)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHHSTTS
T ss_pred HHHHHHHHhCCHHHHHHHHHHHHHHCcCC
Confidence 58999999999999999999999999986
No 125
>KOG1174|consensus
Probab=98.44 E-value=4.1e-06 Score=62.74 Aligned_cols=128 Identities=16% Similarity=0.193 Sum_probs=77.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC-----
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP----- 75 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~----- 75 (150)
.-|.++.+.++.++|+=+|..|..+-|..-+.+.+|..+ ...|. +++|+ -.....++.-|....
T Consensus 339 lKG~lL~~~~R~~~A~IaFR~Aq~Lap~rL~~Y~GL~hsYLA~~~--------~kEA~--~~An~~~~~~~~sA~~LtL~ 408 (564)
T KOG1174|consen 339 LKGRLLIALERHTQAVIAFRTAQMLAPYRLEIYRGLFHSYLAQKR--------FKEAN--ALANWTIRLFQNSARSLTLF 408 (564)
T ss_pred hccHHHHhccchHHHHHHHHHHHhcchhhHHHHHHHHHHHHhhch--------HHHHH--HHHHHHHHHhhcchhhhhhh
Confidence 347777888888888888888888888888888777777 66663 33333 222222222222111
Q ss_pred ----CCCChHHHHHHHH-----HhcCcchhhh---------------------------HHHhHHHHHHHHHHHHhcCHH
Q psy3020 76 ----FLSDPSYVQMIKE-----IQKDPSLMTL---------------------------EAKISLLKDKGNAALQANNFK 119 (150)
Q Consensus 76 ----~~~~~~~~~~~~~-----l~~~~~~~~~---------------------------~~~~~~~~~~g~~~~~~~~~~ 119 (150)
.+.++....+..+ +...|..... ......+..+|.++...+.+.
T Consensus 409 g~~V~~~dp~~rEKAKkf~ek~L~~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~~D~~LH~~Lgd~~~A~Ne~Q 488 (564)
T KOG1174|consen 409 GTLVLFPDPRMREKAKKFAEKSLKINPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLIIFPDVNLHNHLGDIMRAQNEPQ 488 (564)
T ss_pred cceeeccCchhHHHHHHHHHhhhccCCccHHHHHHHHHHHHhhCccchHHHHHHHHHhhccccHHHHHHHHHHHHhhhHH
Confidence 0111111111111 2222222222 022566778999999999999
Q ss_pred HHHHHHHHHHhhcCCCcchh
Q psy3020 120 EAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 120 ~A~~~~~~al~~~p~~~~~~ 139 (150)
+|.++|..|+.+||++....
T Consensus 489 ~am~~y~~ALr~dP~~~~sl 508 (564)
T KOG1174|consen 489 KAMEYYYKALRQDPKSKRTL 508 (564)
T ss_pred HHHHHHHHHHhcCccchHHH
Confidence 99999999999999987654
No 126
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=98.42 E-value=4.4e-06 Score=59.97 Aligned_cols=100 Identities=15% Similarity=0.073 Sum_probs=83.6
Q ss_pred CHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHh
Q psy3020 12 RYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQ 90 (150)
Q Consensus 12 ~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~ 90 (150)
..+.-+.-.+.-|+.||+|++-|..||.+ ...| .+..|. .-|.+++++.|++
T Consensus 137 ~~~~l~a~Le~~L~~nP~d~egW~~Lg~~ym~~~--------~~~~A~--~AY~~A~rL~g~n----------------- 189 (287)
T COG4235 137 EMEALIARLETHLQQNPGDAEGWDLLGRAYMALG--------RASDAL--LAYRNALRLAGDN----------------- 189 (287)
T ss_pred cHHHHHHHHHHHHHhCCCCchhHHHHHHHHHHhc--------chhHHH--HHHHHHHHhCCCC-----------------
Confidence 45667788899999999999999999999 8989 666777 7888888888865
Q ss_pred cCcchhhhHHHhHHHHHHHHHHHHh---cCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 91 KDPSLMTLEAKISLLKDKGNAALQA---NNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 91 ~~~~~~~~~~~~~~~~~~g~~~~~~---~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
++.+..+|.+++.+ ....++...+++++.+||+|+.+.+.+|..++++
T Consensus 190 -----------~~~~~g~aeaL~~~a~~~~ta~a~~ll~~al~~D~~~iral~lLA~~afe~ 240 (287)
T COG4235 190 -----------PEILLGLAEALYYQAGQQMTAKARALLRQALALDPANIRALSLLAFAAFEQ 240 (287)
T ss_pred -----------HHHHHHHHHHHHHhcCCcccHHHHHHHHHHHhcCCccHHHHHHHHHHHHHc
Confidence 55666677777643 4456999999999999999999999999999875
No 127
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=98.41 E-value=4.2e-06 Score=65.39 Aligned_cols=62 Identities=10% Similarity=-0.007 Sum_probs=53.2
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP 75 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~ 75 (150)
+|..+...|++++|...|++|+.++| +..+|..+|.+ ...| ...+|. ..+.+++.++|.++.
T Consensus 426 la~~~~~~g~~~~A~~~l~rAl~L~p-s~~a~~~lG~~~~~~G--------~~~eA~--~~~~~A~~L~P~~pt 488 (517)
T PRK10153 426 LAVQALVKGKTDEAYQAINKAIDLEM-SWLNYVLLGKVYELKG--------DNRLAA--DAYSTAFNLRPGENT 488 (517)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHcCC-CHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHhcCCCCch
Confidence 56667778999999999999999999 47899999999 8888 667888 888899999997654
No 128
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.41 E-value=1.3e-06 Score=50.11 Aligned_cols=43 Identities=19% Similarity=0.243 Sum_probs=38.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
+..|..+|.+++..|++++|++.|+++++..|+++....-++.
T Consensus 29 ~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~~~a~ 71 (73)
T PF13371_consen 29 PELWLQRARCLFQLGRYEEALEDLERALELSPDDPDARALRAM 71 (73)
T ss_pred chhhHHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHHHHHh
Confidence 7789999999999999999999999999999999887665543
No 129
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=98.41 E-value=4.5e-06 Score=51.93 Aligned_cols=83 Identities=10% Similarity=0.036 Sum_probs=63.5
Q ss_pred HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHH
Q psy3020 32 QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGN 110 (150)
Q Consensus 32 ~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~ 110 (150)
..+..++.. ...| ++.++. ..+...+...|++.. . +.++..+|.
T Consensus 3 ~~~~~~~~~~~~~~--------~~~~A~--~~~~~~~~~~~~~~~-------------------~------~~~~~~l~~ 47 (119)
T TIGR02795 3 EAYYDAALLVLKAG--------DYADAI--QAFQAFLKKYPKSTY-------------------A------PNAHYWLGE 47 (119)
T ss_pred HHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHCCCccc-------------------c------HHHHHHHHH
Confidence 455666666 6666 566666 666666666664321 0 567888999
Q ss_pred HHHHhcCHHHHHHHHHHHHhhcCCC---cchhhcHHHHHHhc
Q psy3020 111 AALQANNFKEAIEAYSEAIKLDGTN---HILFSNRSAAFAKA 149 (150)
Q Consensus 111 ~~~~~~~~~~A~~~~~~al~~~p~~---~~~~~n~a~~~~~~ 149 (150)
++++.|++++|+..|.+++..+|++ +.+++++|.++.++
T Consensus 48 ~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~ 89 (119)
T TIGR02795 48 AYYAQGKYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQEL 89 (119)
T ss_pred HHHhhccHHHHHHHHHHHHHHCCCCCcccHHHHHHHHHHHHh
Confidence 9999999999999999999999986 56799999998765
No 130
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=98.40 E-value=9.1e-07 Score=43.07 Aligned_cols=33 Identities=24% Similarity=0.410 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
..|..+|.++..+|++++|+++|+++++++|++
T Consensus 2 ~~~~~lg~~y~~~~~~~~A~~~~~~a~~~~~~n 34 (34)
T PF13181_consen 2 EAYYNLGKIYEQLGDYEEALEYFEKALELNPDN 34 (34)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 578999999999999999999999999999964
No 131
>PRK11906 transcriptional regulator; Provisional
Probab=98.39 E-value=4e-06 Score=63.61 Aligned_cols=106 Identities=8% Similarity=-0.002 Sum_probs=70.4
Q ss_pred CHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC----------CC-
Q psy3020 12 RYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL----------SD- 79 (150)
Q Consensus 12 ~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~----------~~- 79 (150)
...+|.+.-++|+++||+|+.++..+|.+ ...+ ++..++ ..+.+++.++|+....+ .+
T Consensus 319 ~~~~a~~~A~rAveld~~Da~a~~~~g~~~~~~~--------~~~~a~--~~f~rA~~L~Pn~A~~~~~~~~~~~~~G~~ 388 (458)
T PRK11906 319 AAQKALELLDYVSDITTVDGKILAIMGLITGLSG--------QAKVSH--ILFEQAKIHSTDIASLYYYRALVHFHNEKI 388 (458)
T ss_pred HHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHhhc--------chhhHH--HHHHHHhhcCCccHHHHHHHHHHHHHcCCH
Confidence 45579999999999999999999999998 7777 556677 78889999999987632 11
Q ss_pred hHHHHHHHH-HhcCcchhhhHHHhHHHHHHHHH-HHHhcCHHHHHHHHHHHHhhcC
Q psy3020 80 PSYVQMIKE-IQKDPSLMTLEAKISLLKDKGNA-ALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 80 ~~~~~~~~~-l~~~~~~~~~~~~~~~~~~~g~~-~~~~~~~~~A~~~~~~al~~~p 133 (150)
......+.. +..+|.. ..+-..+-++ .|-....++|++.|-+-.+-..
T Consensus 389 ~~a~~~i~~alrLsP~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 438 (458)
T PRK11906 389 EEARICIDKSLQLEPRR------RKAVVIKECVDMYVPNPLKNNIKLYYKETESES 438 (458)
T ss_pred HHHHHHHHHHhccCchh------hHHHHHHHHHHHHcCCchhhhHHHHhhcccccc
Confidence 011112222 3344422 2333333333 5566778899998877555433
No 132
>KOG4648|consensus
Probab=98.39 E-value=3.1e-07 Score=67.27 Aligned_cols=48 Identities=33% Similarity=0.538 Sum_probs=44.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..-+...|+.++++|+|++||.||++++..+|.|+..+.|||.+|+++
T Consensus 97 ~SEiKE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~ 144 (536)
T KOG4648|consen 97 ASEIKERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQ 144 (536)
T ss_pred hHHHHHhhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHH
Confidence 444788999999999999999999999999999999999999999875
No 133
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=98.39 E-value=1.5e-05 Score=50.48 Aligned_cols=107 Identities=20% Similarity=0.106 Sum_probs=78.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc---HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN---EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~---~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.+|.++-..|+.++|+..|++++...++. ..++..++.. ..+| .+.+++ ..+.+.+...|+...
T Consensus 6 ~~A~a~d~~G~~~~Ai~~Y~~Al~~gL~~~~~~~a~i~lastlr~LG--------~~deA~--~~L~~~~~~~p~~~~-- 73 (120)
T PF12688_consen 6 ELAWAHDSLGREEEAIPLYRRALAAGLSGADRRRALIQLASTLRNLG--------RYDEAL--ALLEEALEEFPDDEL-- 73 (120)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHCCCccc--
Confidence 46889999999999999999999986654 4578889988 9999 666776 556556655554321
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
. ......++.+++..|++++|++.+-.++.-... .|.+++.+.
T Consensus 74 -----------------~------~~l~~f~Al~L~~~gr~~eAl~~~l~~la~~~~----~y~ra~~~y 116 (120)
T PF12688_consen 74 -----------------N------AALRVFLALALYNLGRPKEALEWLLEALAETLP----RYRRAIRFY 116 (120)
T ss_pred -----------------c------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH----HHHHHHHHH
Confidence 0 233445667889999999999999988863332 566666543
No 134
>KOG1129|consensus
Probab=98.37 E-value=1.7e-06 Score=63.13 Aligned_cols=135 Identities=14% Similarity=0.138 Sum_probs=84.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC-CC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL-SD 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~-~~ 79 (150)
.+|.||..+|-+.+|-..++.+|+..|- ++.+.-|+.+ .... +-..|+ ..+...+...|.+...+ .+
T Consensus 228 Q~gkCylrLgm~r~AekqlqssL~q~~~-~dTfllLskvY~rid--------QP~~AL--~~~~~gld~fP~~VT~l~g~ 296 (478)
T KOG1129|consen 228 QMGKCYLRLGMPRRAEKQLQSSLTQFPH-PDTFLLLSKVYQRID--------QPERAL--LVIGEGLDSFPFDVTYLLGQ 296 (478)
T ss_pred HHHHHHHHhcChhhhHHHHHHHhhcCCc-hhHHHHHHHHHHHhc--------cHHHHH--HHHhhhhhcCCchhhhhhhh
Confidence 4899999999999999999999998875 4455556666 5555 445566 66777777778665432 11
Q ss_pred hHHHHHHHHHhcCcchhhh-----HHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 80 PSYVQMIKEIQKDPSLMTL-----EAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 80 ~~~~~~~~~l~~~~~~~~~-----~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
....+.++..+........ +.+.++..-.|..++.-++.+-|+++|++.+++--.++.++.|+|.|.+
T Consensus 297 ARi~eam~~~~~a~~lYk~vlk~~~~nvEaiAcia~~yfY~~~PE~AlryYRRiLqmG~~speLf~NigLCC~ 369 (478)
T KOG1129|consen 297 ARIHEAMEQQEDALQLYKLVLKLHPINVEAIACIAVGYFYDNNPEMALRYYRRILQMGAQSPELFCNIGLCCL 369 (478)
T ss_pred HHHHHHHHhHHHHHHHHHHHHhcCCccceeeeeeeeccccCCChHHHHHHHHHHHHhcCCChHHHhhHHHHHH
Confidence 1111111111100000000 0113444445556677788888888888888888888888888887765
No 135
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=98.36 E-value=1.4e-05 Score=51.77 Aligned_cols=101 Identities=12% Similarity=0.092 Sum_probs=81.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHH---HHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQ---MKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~---a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.-|...+..|+|++|++.|+....--|.... +.+.++.+ ...+ ++..|+ ..+.+-++++|+++..
T Consensus 15 ~~a~~~l~~~~Y~~A~~~le~L~~ryP~g~ya~qAqL~l~yayy~~~--------~y~~A~--a~~~rFirLhP~hp~v- 83 (142)
T PF13512_consen 15 QEAQEALQKGNYEEAIKQLEALDTRYPFGEYAEQAQLDLAYAYYKQG--------DYEEAI--AAYDRFIRLHPTHPNV- 83 (142)
T ss_pred HHHHHHHHhCCHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHcc--------CHHHHH--HHHHHHHHhCCCCCCc-
Confidence 3467788999999999999999999997664 56788888 8888 677888 8888889999988763
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcC---------------HHHHHHHHHHHHhhcCCCcc
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANN---------------FKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~---------------~~~A~~~~~~al~~~p~~~~ 137 (150)
..++..+|.+.+.+.. ...|...|++.|...|++.-
T Consensus 84 ------------------------dYa~Y~~gL~~~~~~~~~~~~~~~~drD~~~~~~A~~~f~~lv~~yP~S~y 134 (142)
T PF13512_consen 84 ------------------------DYAYYMRGLSYYEQDEGSLQSFFRSDRDPTPARQAFRDFEQLVRRYPNSEY 134 (142)
T ss_pred ------------------------cHHHHHHHHHHHHHhhhHHhhhcccccCcHHHHHHHHHHHHHHHHCcCChh
Confidence 4455666666665554 78999999999999998763
No 136
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=98.35 E-value=2e-05 Score=59.85 Aligned_cols=110 Identities=15% Similarity=0.083 Sum_probs=88.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
..|..+.+.++.++|++.+++++.++|+.......++.. ...| ...+++ ..+.+.+..+|+++.+.
T Consensus 345 ~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~all~~g--------~~~eai--~~L~~~~~~~p~dp~~w--- 411 (484)
T COG4783 345 LAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGG--------KPQEAI--RILNRYLFNDPEDPNGW--- 411 (484)
T ss_pred HHHHHHHHcCChHHHHHHHHHHHhcCCCccHHHHHHHHHHHhcC--------ChHHHH--HHHHHHhhcCCCCchHH---
Confidence 357889999999999999999999999999999999999 8888 556777 78888888899887743
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
.+.. +........ ..+....+..++..|++++|+..+..+.+..
T Consensus 412 ~~LA--qay~~~g~~------~~a~~A~AE~~~~~G~~~~A~~~l~~A~~~~ 455 (484)
T COG4783 412 DLLA--QAYAELGNR------AEALLARAEGYALAGRLEQAIIFLMRASQQV 455 (484)
T ss_pred HHHH--HHHHHhCch------HHHHHHHHHHHHhCCCHHHHHHHHHHHHHhc
Confidence 2222 222222211 7788888999999999999999999998877
No 137
>PRK15331 chaperone protein SicA; Provisional
Probab=98.34 E-value=6.7e-06 Score=54.43 Aligned_cols=49 Identities=10% Similarity=-0.029 Sum_probs=46.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
+.-|..+|-++..+++|++|+..|..|..+++++|...+..|.||+.+.
T Consensus 71 ~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~~~dp~p~f~agqC~l~l~ 119 (165)
T PRK15331 71 PDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMR 119 (165)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccCCCCccchHHHHHHHhC
Confidence 6779999999999999999999999999999999999999999999763
No 138
>KOG1840|consensus
Probab=98.34 E-value=6.7e-06 Score=63.79 Aligned_cols=143 Identities=18% Similarity=0.248 Sum_probs=84.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhcc--------CCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhh-c-cC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKL--------DPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQL-Q-LD 70 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~--------~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l-~-~~ 70 (150)
++|..|..+++|++|+..|++||.+ +|.-+.++..|+.+ ...|.+. ++...++.|+ .+.+.+ . ..
T Consensus 246 ~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~-EA~~~~e~Al---~I~~~~~~~~~ 321 (508)
T KOG1840|consen 246 ILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFA-EAEEYCERAL---EIYEKLLGASH 321 (508)
T ss_pred HHHHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhccCChH-HHHHHHHHHH---HHHHHhhccCh
Confidence 4899999999999999999999975 44555677888888 8888443 3332333333 221111 1 11
Q ss_pred CCCCCCC----------CC----hHHHHH-HHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC-
Q psy3020 71 PRTKPFL----------SD----PSYVQM-IKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT- 134 (150)
Q Consensus 71 ~~~~~~~----------~~----~~~~~~-~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~- 134 (150)
|+....+ ++ ..+++. +.-+.+.+.... ..-+....++|..+++.|+|++|.+.|++||++...
T Consensus 322 ~~v~~~l~~~~~~~~~~~~~Eea~~l~q~al~i~~~~~g~~~-~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~~~~ 400 (508)
T KOG1840|consen 322 PEVAAQLSELAAILQSMNEYEEAKKLLQKALKIYLDAPGEDN-VNLAKIYANLAELYLKMGKYKEAEELYKKAIQILREL 400 (508)
T ss_pred HHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHhhccccc-hHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHhc
Confidence 1111000 00 011111 111111111000 011678889999999999999999999999998732
Q ss_pred -------CcchhhcHHHHHHhc
Q psy3020 135 -------NHILFSNRSAAFAKA 149 (150)
Q Consensus 135 -------~~~~~~n~a~~~~~~ 149 (150)
......++|..|.++
T Consensus 401 ~~~~~~~~~~~l~~la~~~~~~ 422 (508)
T KOG1840|consen 401 LGKKDYGVGKPLNQLAEAYEEL 422 (508)
T ss_pred ccCcChhhhHHHHHHHHHHHHh
Confidence 244566677776553
No 139
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=98.33 E-value=7.6e-06 Score=47.67 Aligned_cols=48 Identities=25% Similarity=0.352 Sum_probs=43.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...+..+|.++...+++++|++.|++++...|.+...++.+|.++..+
T Consensus 34 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (100)
T cd00189 34 ADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKL 81 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhHHHHHHHHHHHH
Confidence 467888999999999999999999999999999999999999988754
No 140
>KOG1174|consensus
Probab=98.33 E-value=8.9e-06 Score=61.00 Aligned_cols=132 Identities=15% Similarity=0.191 Sum_probs=91.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC-C---
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP-F--- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~-~--- 76 (150)
.+|.|++..|++++|+.-|+++..+||.+.++.--.|-+ ...|+.+ +..... ...+..+..+.. |
T Consensus 237 ~lak~~~~~Gdn~~a~~~Fe~~~~~dpy~i~~MD~Ya~LL~~eg~~e-----~~~~L~-----~~Lf~~~~~ta~~wfV~ 306 (564)
T KOG1174|consen 237 ALGKCLYYNGDYFQAEDIFSSTLCANPDNVEAMDLYAVLLGQEGGCE-----QDSALM-----DYLFAKVKYTASHWFVH 306 (564)
T ss_pred HHhhhhhhhcCchHHHHHHHHHhhCChhhhhhHHHHHHHHHhccCHh-----hHHHHH-----HHHHhhhhcchhhhhhh
Confidence 589999999999999999999999999999876655555 4555332 222222 112222221111 1
Q ss_pred ----CCChHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 77 ----LSDPSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 77 ----~~~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
.....+..++ ..++.++.. ..++..+|+.+...++.++|+-.|+.|..+-|-.-..|--+-.||+.
T Consensus 307 ~~~l~~~K~~~rAL~~~eK~I~~~~r~------~~alilKG~lL~~~~R~~~A~IaFR~Aq~Lap~rL~~Y~GL~hsYLA 380 (564)
T KOG1174|consen 307 AQLLYDEKKFERALNFVEKCIDSEPRN------HEALILKGRLLIALERHTQAVIAFRTAQMLAPYRLEIYRGLFHSYLA 380 (564)
T ss_pred hhhhhhhhhHHHHHHHHHHHhccCccc------chHHHhccHHHHhccchHHHHHHHHHHHhcchhhHHHHHHHHHHHHh
Confidence 1112333333 224445544 88999999999999999999999999999999888888777777764
Q ss_pred c
Q psy3020 149 A 149 (150)
Q Consensus 149 ~ 149 (150)
.
T Consensus 381 ~ 381 (564)
T KOG1174|consen 381 Q 381 (564)
T ss_pred h
Confidence 3
No 141
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=98.33 E-value=1.2e-05 Score=57.16 Aligned_cols=104 Identities=16% Similarity=0.083 Sum_probs=85.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
..|.-++..|+|.+|...|..-++.-|+.. .+++=||.+ +.+| ++..|. ..+....+..|++
T Consensus 146 ~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg--------~y~~Aa--~~f~~~~k~~P~s---- 211 (262)
T COG1729 146 NAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQG--------DYEDAA--YIFARVVKDYPKS---- 211 (262)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCCCCcccchhHHHHHHHHHhcc--------cchHHH--HHHHHHHHhCCCC----
Confidence 356778899999999999999999999866 578889999 9999 555555 5555555555543
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
|.. ++++..+|.+...+|+.++|...|.+.++.-|+.+.+-.
T Consensus 212 ---------------~KA------pdallKlg~~~~~l~~~d~A~atl~qv~k~YP~t~aA~~ 253 (262)
T COG1729 212 ---------------PKA------PDALLKLGVSLGRLGNTDEACATLQQVIKRYPGTDAAKL 253 (262)
T ss_pred ---------------CCC------hHHHHHHHHHHHHhcCHHHHHHHHHHHHHHCCCCHHHHH
Confidence 322 899999999999999999999999999999999876644
No 142
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=98.30 E-value=1.4e-05 Score=56.80 Aligned_cols=99 Identities=13% Similarity=0.080 Sum_probs=77.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHH---HHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMK---EAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~---~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~ 78 (150)
.|..++..|+|++|++.|++++...|+.+.+. ..+|.+ ..++ ++..|. ..+.+.++..|+++..
T Consensus 38 ~A~~~~~~g~y~~Ai~~f~~l~~~yP~s~~a~~a~l~la~ayy~~~--------~y~~A~--~~~e~fi~~~P~~~~~-- 105 (243)
T PRK10866 38 TAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNA--------DLPLAQ--AAIDRFIRLNPTHPNI-- 105 (243)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHHHhcC--------CHHHHH--HHHHHHHHhCcCCCch--
Confidence 46778889999999999999999999988765 677888 8888 777888 7888888889987652
Q ss_pred ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhc------------------CHHHHHHHHHHHHhhcCCCc
Q psy3020 79 DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQAN------------------NFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 79 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~------------------~~~~A~~~~~~al~~~p~~~ 136 (150)
+.++..+|.+.+..+ ...+|+..|++.++..|++.
T Consensus 106 -----------------------~~a~Y~~g~~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~~~~li~~yP~S~ 158 (243)
T PRK10866 106 -----------------------DYVLYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFRDFSKLVRGYPNSQ 158 (243)
T ss_pred -----------------------HHHHHHHHHhhhhcchhhhhhccCCCccccCHHHHHHHHHHHHHHHHHCcCCh
Confidence 455666665543322 23579999999999999874
No 143
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=98.30 E-value=1.7e-05 Score=60.35 Aligned_cols=115 Identities=8% Similarity=-0.002 Sum_probs=70.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC-----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF----- 76 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~----- 76 (150)
-|......|+++.|.+...++.+..|+....+...+.+ ..+| ++..+. ..+.+.....|+....
T Consensus 90 ~glla~~~g~~~~A~~~l~~~~~~~~~~~~~~llaA~aa~~~g--------~~~~A~--~~l~~a~~~~p~~~l~~~~~~ 159 (409)
T TIGR00540 90 EALLKLAEGDYAKAEKLIAKNADHAAEPVLNLIKAAEAAQQRG--------DEARAN--QHLEEAAELAGNDNILVEIAR 159 (409)
T ss_pred HHHHHHhCCCHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCCcCchHHHHHH
Confidence 46777889999999999999999999988888888888 8888 455555 4555555555554310
Q ss_pred ------CCCh-HHHHHHHH-HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 77 ------LSDP-SYVQMIKE-IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 77 ------~~~~-~~~~~~~~-l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
..+. .....+.. ++..|.. +.++..++.++...|++++|++.+.+.++..+
T Consensus 160 a~l~l~~~~~~~Al~~l~~l~~~~P~~------~~~l~ll~~~~~~~~d~~~a~~~l~~l~k~~~ 218 (409)
T TIGR00540 160 TRILLAQNELHAARHGVDKLLEMAPRH------KEVLKLAEEAYIRSGAWQALDDIIDNMAKAGL 218 (409)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCCCC------HHHHHHHHHHHHHHhhHHHHHHHHHHHHHcCC
Confidence 0000 01111111 2233433 55566666666666666666666666665533
No 144
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=98.30 E-value=8.4e-06 Score=56.63 Aligned_cols=89 Identities=20% Similarity=0.191 Sum_probs=60.7
Q ss_pred chhhhhCChHHHHhhccCCCCCCCCC-ChHHHHHH-----------HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHH
Q psy3020 53 PFANLFSDPNIFVQLQLDPRTKPFLS-DPSYVQMI-----------KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKE 120 (150)
Q Consensus 53 ~~~~a~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~-----------~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~ 120 (150)
++..|. ..+.+++..||+....+- ...++..+ ..+..+|.. .+.+++.|..++.+|+|++
T Consensus 50 d~~~A~--~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~------GdVLNNYG~FLC~qg~~~e 121 (250)
T COG3063 50 DYAQAK--KNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNN------GDVLNNYGAFLCAQGRPEE 121 (250)
T ss_pred CHHHHH--HHHHHHHHhCcccHHHHHHHHHHHHHcCChhhHHHHHHHHHhcCCCc------cchhhhhhHHHHhCCChHH
Confidence 677777 899999999998775320 00111111 113334433 7888888888888888888
Q ss_pred HHHHHHHHHh--hcCCCcchhhcHHHHHHhc
Q psy3020 121 AIEAYSEAIK--LDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 121 A~~~~~~al~--~~p~~~~~~~n~a~~~~~~ 149 (150)
|...|++|+. .-|..+..|-|+|.|-+++
T Consensus 122 A~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~ 152 (250)
T COG3063 122 AMQQFERALADPAYGEPSDTLENLGLCALKA 152 (250)
T ss_pred HHHHHHHHHhCCCCCCcchhhhhhHHHHhhc
Confidence 8888888886 3444577888888887765
No 145
>KOG0376|consensus
Probab=98.28 E-value=9.7e-07 Score=66.78 Aligned_cols=104 Identities=17% Similarity=0.215 Sum_probs=85.0
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
+.-.+.-+.|+.|+..|.+||+++|+++..+...+.. ...+ .+..|+ .+..+++.++|..
T Consensus 11 an~~l~~~~fd~avdlysKaI~ldpnca~~~anRa~a~lK~e--------~~~~Al--~Da~kaie~dP~~--------- 71 (476)
T KOG0376|consen 11 ANEALKDKVFDVAVDLYSKAIELDPNCAIYFANRALAHLKVE--------SFGGAL--HDALKAIELDPTY--------- 71 (476)
T ss_pred HhhhcccchHHHHHHHHHHHHhcCCcceeeechhhhhheeec--------hhhhHH--HHHHhhhhcCchh---------
Confidence 4556778999999999999999999999877766655 5556 666777 7777777777743
Q ss_pred HHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHH
Q psy3020 83 VQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAA 145 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~ 145 (150)
..+|..+|...+..+.+.+|+..|+....+.|+++.+..-...|
T Consensus 72 -------------------~K~Y~rrg~a~m~l~~~~~A~~~l~~~~~l~Pnd~~~~r~~~Ec 115 (476)
T KOG0376|consen 72 -------------------IKAYVRRGTAVMALGEFKKALLDLEKVKKLAPNDPDATRKIDEC 115 (476)
T ss_pred -------------------hheeeeccHHHHhHHHHHHHHHHHHHhhhcCcCcHHHHHHHHHH
Confidence 66788889999999999999999999999999998876655554
No 146
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=98.26 E-value=3.2e-06 Score=58.43 Aligned_cols=95 Identities=15% Similarity=0.136 Sum_probs=79.1
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
|..|-.+|-+.-|.-+|.++|.+.|+-++++.-+|.. -..| ++..+. ..+...+.+||..
T Consensus 72 GvlYDSlGL~~LAR~DftQaLai~P~m~~vfNyLG~Yl~~a~--------~fdaa~--eaFds~~ELDp~y--------- 132 (297)
T COG4785 72 GVLYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAG--------NFDAAY--EAFDSVLELDPTY--------- 132 (297)
T ss_pred cchhhhhhHHHHHhhhhhhhhhcCCCcHHHHHHHHHHHHhcc--------cchHHH--HHhhhHhccCCcc---------
Confidence 5667778888889999999999999999999999988 7778 566666 6667777888854
Q ss_pred HHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 83 VQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 83 ~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
--+..++|..++.-|+|.-|.+.+.+-.+-||++|
T Consensus 133 -------------------~Ya~lNRgi~~YY~gR~~LAq~d~~~fYQ~D~~DP 167 (297)
T COG4785 133 -------------------NYAHLNRGIALYYGGRYKLAQDDLLAFYQDDPNDP 167 (297)
T ss_pred -------------------hHHHhccceeeeecCchHhhHHHHHHHHhcCCCCh
Confidence 45667788888888999999999988888888887
No 147
>KOG1127|consensus
Probab=98.25 E-value=6.1e-06 Score=67.37 Aligned_cols=125 Identities=15% Similarity=0.218 Sum_probs=95.6
Q ss_pred CCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC--------CChH
Q psy3020 11 GRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL--------SDPS 81 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~--------~~~~ 81 (150)
++...|+.+|-+++++||+.+.++..||.+ +..- ++..|. .=+.++..+||.+.+.- ..+.
T Consensus 472 K~~~~al~ali~alrld~~~apaf~~LG~iYrd~~--------Dm~RA~--kCf~KAFeLDatdaeaaaa~adtyae~~~ 541 (1238)
T KOG1127|consen 472 KNSALALHALIRALRLDVSLAPAFAFLGQIYRDSD--------DMKRAK--KCFDKAFELDATDAEAAAASADTYAEEST 541 (1238)
T ss_pred hhHHHHHHHHHHHHhcccchhHHHHHHHHHHHHHH--------HHHHHH--HHHHHHhcCCchhhhhHHHHHHHhhcccc
Confidence 457889999999999999999999999999 7666 556666 67777788888766521 0011
Q ss_pred ----HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 82 ----YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 82 ----~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+.-.++.-+++|.. .....|..+|..+.+.+++.+|+..|..|+..+|.+..+|.-+|.+|...
T Consensus 542 we~a~~I~l~~~qka~a~----~~k~nW~~rG~yyLea~n~h~aV~~fQsALR~dPkD~n~W~gLGeAY~~s 609 (1238)
T KOG1127|consen 542 WEEAFEICLRAAQKAPAF----ACKENWVQRGPYYLEAHNLHGAVCEFQSALRTDPKDYNLWLGLGEAYPES 609 (1238)
T ss_pred HHHHHHHHHHHhhhchHH----HHHhhhhhccccccCccchhhHHHHHHHHhcCCchhHHHHHHHHHHHHhc
Confidence 11122223333322 33667788999999999999999999999999999999999999999764
No 148
>KOG1128|consensus
Probab=98.24 E-value=7.1e-06 Score=64.94 Aligned_cols=100 Identities=14% Similarity=0.115 Sum_probs=68.5
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHH
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYV 83 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~ 83 (150)
......++|+++.++++.+++++|-..+.|+++|.+ ...+ ++..+. ..+..++.++|++
T Consensus 493 ~~~~~~~~fs~~~~hle~sl~~nplq~~~wf~~G~~ALqle--------k~q~av--~aF~rcvtL~Pd~---------- 552 (777)
T KOG1128|consen 493 LLILSNKDFSEADKHLERSLEINPLQLGTWFGLGCAALQLE--------KEQAAV--KAFHRCVTLEPDN---------- 552 (777)
T ss_pred cccccchhHHHHHHHHHHHhhcCccchhHHHhccHHHHHHh--------hhHHHH--HHHHHHhhcCCCc----------
Confidence 334456889999999999999999999999999888 7777 444444 5677788888876
Q ss_pred HHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcH
Q psy3020 84 QMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNR 142 (150)
Q Consensus 84 ~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~ 142 (150)
+.+|.++...+.+.++-.+|...+++|++-+-+++..|-|-
T Consensus 553 ------------------~eaWnNls~ayi~~~~k~ra~~~l~EAlKcn~~~w~iWENy 593 (777)
T KOG1128|consen 553 ------------------AEAWNNLSTAYIRLKKKKRAFRKLKEALKCNYQHWQIWENY 593 (777)
T ss_pred ------------------hhhhhhhhHHHHHHhhhHHHHHHHHHHhhcCCCCCeeeech
Confidence 44555555555555555555555555555555555554443
No 149
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=98.22 E-value=2e-06 Score=48.59 Aligned_cols=50 Identities=18% Similarity=0.128 Sum_probs=40.1
Q ss_pred HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 89 IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 89 l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
+...|.. ...+..++.++++.|++++|.+.+.+++..+|+++..+.-++.
T Consensus 18 l~~~p~~------~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~~~l~a~ 67 (68)
T PF14559_consen 18 LQRNPDN------PEARLLLAQCYLKQGQYDEAEELLERLLKQDPDNPEYQQLLAQ 67 (68)
T ss_dssp HHHTTTS------HHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHHHHHHHH
T ss_pred HHHCCCC------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHHHHHHhc
Confidence 5566766 8888889999999999999999999999999988777665554
No 150
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=98.22 E-value=1.6e-05 Score=54.90 Aligned_cols=102 Identities=19% Similarity=0.157 Sum_probs=74.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
..|..++..|+|.+|+..|++++...|+.. .+...++.+ ...| ++..|. ..+..-+...|++...
T Consensus 10 ~~a~~~~~~g~y~~Ai~~f~~l~~~~P~s~~a~~A~l~la~a~y~~~--------~y~~A~--~~~~~fi~~yP~~~~~- 78 (203)
T PF13525_consen 10 QKALEALQQGDYEEAIKLFEKLIDRYPNSPYAPQAQLMLAYAYYKQG--------DYEEAI--AAYERFIKLYPNSPKA- 78 (203)
T ss_dssp HHHHHHHHCT-HHHHHHHHHHHHHH-TTSTTHHHHHHHHHHHHHHTT---------HHHHH--HHHHHHHHH-TT-TTH-
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHCCCCcch-
Confidence 468889999999999999999999988754 567888888 8888 667777 6777777777876431
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHh-----------cCHHHHHHHHHHHHhhcCCCcch
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQA-----------NNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~-----------~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
+.++..+|.+.+.. +...+|+..|+..+...|+++-+
T Consensus 79 ------------------------~~A~Y~~g~~~~~~~~~~~~~~~D~~~~~~A~~~~~~li~~yP~S~y~ 126 (203)
T PF13525_consen 79 ------------------------DYALYMLGLSYYKQIPGILRSDRDQTSTRKAIEEFEELIKRYPNSEYA 126 (203)
T ss_dssp ------------------------HHHHHHHHHHHHHHHHHHH-TT---HHHHHHHHHHHHHHHH-TTSTTH
T ss_pred ------------------------hhHHHHHHHHHHHhCccchhcccChHHHHHHHHHHHHHHHHCcCchHH
Confidence 45566666665443 44569999999999999998643
No 151
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=98.20 E-value=5.9e-05 Score=51.14 Aligned_cols=97 Identities=16% Similarity=0.205 Sum_probs=75.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhc-cCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLK-LDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD 79 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~-~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~ 79 (150)
-||.++.++|++.+|..+|++++. +--+++....+++.. ...+ ++..+. +.+.+..+-+|....
T Consensus 94 rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~--------~~A~a~--~tLe~l~e~~pa~r~---- 159 (251)
T COG4700 94 RLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQ--------EFAAAQ--QTLEDLMEYNPAFRS---- 159 (251)
T ss_pred HHHHHHHHhhhhhhhHHHHHHHhccccCCCHHHHHHHHHHHHhhc--------cHHHHH--HHHHHHhhcCCccCC----
Confidence 478999999999999999999997 455788888999998 8888 556666 666666666654322
Q ss_pred hHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 80 PSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 80 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
+......|..+...|++.+|...|+.++..-|+
T Consensus 160 ----------------------pd~~Ll~aR~laa~g~~a~Aesafe~a~~~ypg 192 (251)
T COG4700 160 ----------------------PDGHLLFARTLAAQGKYADAESAFEVAISYYPG 192 (251)
T ss_pred ----------------------CCchHHHHHHHHhcCCchhHHHHHHHHHHhCCC
Confidence 555667777888888888888888888888875
No 152
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=98.19 E-value=1e-06 Score=43.12 Aligned_cols=26 Identities=23% Similarity=0.425 Sum_probs=24.0
Q ss_pred HHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 124 AYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 124 ~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+|++||+++|+++.+|+|+|.+|...
T Consensus 1 ~y~kAie~~P~n~~a~~nla~~~~~~ 26 (34)
T PF13431_consen 1 CYKKAIELNPNNAEAYNNLANLYLNQ 26 (34)
T ss_pred ChHHHHHHCCCCHHHHHHHHHHHHHC
Confidence 48999999999999999999999865
No 153
>KOG0495|consensus
Probab=98.16 E-value=5.6e-05 Score=59.73 Aligned_cols=43 Identities=16% Similarity=0.074 Sum_probs=36.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
+-.+...|..++...+++.|.+.|.+|+.++|++-.+|.+.=.
T Consensus 817 phVllaia~lfw~e~k~~kar~Wf~Ravk~d~d~GD~wa~fyk 859 (913)
T KOG0495|consen 817 PHVLLAIAKLFWSEKKIEKAREWFERAVKKDPDNGDAWAWFYK 859 (913)
T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHHHHccCCccchHHHHHHH
Confidence 5556778999999999999999999999999998777665433
No 154
>KOG2003|consensus
Probab=98.15 E-value=1.5e-05 Score=60.60 Aligned_cols=47 Identities=17% Similarity=0.132 Sum_probs=32.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+..+..+++++-...+..+||+.+.++..+-|++|..+..+|..|-+
T Consensus 558 ~evl~qianiye~led~aqaie~~~q~~slip~dp~ilskl~dlydq 604 (840)
T KOG2003|consen 558 AEVLVQIANIYELLEDPAQAIELLMQANSLIPNDPAILSKLADLYDQ 604 (840)
T ss_pred HHHHHHHHHHHHHhhCHHHHHHHHHHhcccCCCCHHHHHHHHHHhhc
Confidence 55666666676666777777777777777777777777777666644
No 155
>KOG1840|consensus
Probab=98.13 E-value=1.3e-05 Score=62.25 Aligned_cols=102 Identities=14% Similarity=0.162 Sum_probs=77.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhcc--------CCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKL--------DPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPR 72 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~--------~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~ 72 (150)
.+|..|...|+|+.|...|++|+++ +|.-......+|.+ ..++ .+.+|. ..+.+++.+--.
T Consensus 204 ~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~--------k~~eAv--~ly~~AL~i~e~ 273 (508)
T KOG1840|consen 204 NLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLG--------KYDEAV--NLYEEALTIREE 273 (508)
T ss_pred HHHHHHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhc--------cHHHHH--HHHHHHHHHHHH
Confidence 4789999999999999999999999 77777777778888 8888 556665 555444422111
Q ss_pred CCCCCCChHHHHHHHHHh-cCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 73 TKPFLSDPSYVQMIKEIQ-KDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 73 ~~~~~~~~~~~~~~~~l~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
. .. ..|.. +.++.+++..++..|+|.+|..++++|+++--.
T Consensus 274 ~---------------~G~~h~~v------a~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~ 315 (508)
T KOG1840|consen 274 V---------------FGEDHPAV------AATLNNLAVLYYKQGKFAEAEEYCERALEIYEK 315 (508)
T ss_pred h---------------cCCCCHHH------HHHHHHHHHHHhccCChHHHHHHHHHHHHHHHH
Confidence 0 11 12222 788999999999999999999999999987543
No 156
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=98.12 E-value=6e-05 Score=49.18 Aligned_cols=107 Identities=12% Similarity=0.035 Sum_probs=81.9
Q ss_pred HhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHH
Q psy3020 8 SYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYV 83 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~ 83 (150)
...+++..+...+++.++-.|+.+ .+.+.++.+ ...| ++.+|. ..+...+...|+. .
T Consensus 22 ~~~~~~~~~~~~~~~l~~~~~~s~ya~~A~l~lA~~~~~~g--------~~~~A~--~~l~~~~~~~~d~-~-------- 82 (145)
T PF09976_consen 22 LQAGDPAKAEAAAEQLAKDYPSSPYAALAALQLAKAAYEQG--------DYDEAK--AALEKALANAPDP-E-------- 82 (145)
T ss_pred HHCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHhhCCCH-H--------
Confidence 357889999899999999999984 456678888 8888 677777 6666665544331 0
Q ss_pred HHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 84 QMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 84 ~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
-+..+...++.+++..|+|++|+..+.. +.-.|..+..+..+|.+|+.++
T Consensus 83 ----------------l~~~a~l~LA~~~~~~~~~d~Al~~L~~-~~~~~~~~~~~~~~Gdi~~~~g 132 (145)
T PF09976_consen 83 ----------------LKPLARLRLARILLQQGQYDEALATLQQ-IPDEAFKALAAELLGDIYLAQG 132 (145)
T ss_pred ----------------HHHHHHHHHHHHHHHcCCHHHHHHHHHh-ccCcchHHHHHHHHHHHHHHCC
Confidence 1155677899999999999999999966 5566667888889999998753
No 157
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=98.10 E-value=2.7e-06 Score=41.35 Aligned_cols=29 Identities=31% Similarity=0.612 Sum_probs=26.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN 30 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~ 30 (150)
.+|.+|..+|++++|++.|+++++++|+|
T Consensus 6 ~lg~~y~~~~~~~~A~~~~~~a~~~~~~n 34 (34)
T PF13181_consen 6 NLGKIYEQLGDYEEALEYFEKALELNPDN 34 (34)
T ss_dssp HHHHHHHHTTSHHHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 58999999999999999999999999965
No 158
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=98.09 E-value=3e-06 Score=41.43 Aligned_cols=28 Identities=25% Similarity=0.484 Sum_probs=25.4
Q ss_pred HHHHhhccCCCcHHHHHHHHHH-Hhhhhc
Q psy3020 19 TYEEGLKLDPNNEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 19 ~~~~aL~~~p~~~~a~~~l~~~-~~~g~~ 46 (150)
+|++||+++|+|+++|..+|.+ ...|+.
T Consensus 1 ~y~kAie~~P~n~~a~~nla~~~~~~g~~ 29 (34)
T PF13431_consen 1 CYKKAIELNPNNAEAYNNLANLYLNQGDY 29 (34)
T ss_pred ChHHHHHHCCCCHHHHHHHHHHHHHCcCH
Confidence 4899999999999999999999 888844
No 159
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=98.05 E-value=6.3e-05 Score=57.13 Aligned_cols=111 Identities=13% Similarity=0.050 Sum_probs=66.1
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHHHhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC-----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDVRNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL----- 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~----- 77 (150)
+|..+...|+.++|.+..+++++. |.++......+.+ ..+ +..+++ ..+...++..|++...+
T Consensus 269 ~A~~l~~~g~~~~A~~~L~~~l~~-~~~~~l~~l~~~l-~~~--------~~~~al--~~~e~~lk~~P~~~~l~l~lgr 336 (398)
T PRK10747 269 MAEHLIECDDHDTAQQIILDGLKR-QYDERLVLLIPRL-KTN--------NPEQLE--KVLRQQIKQHGDTPLLWSTLGQ 336 (398)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHhc-CCCHHHHHHHhhc-cCC--------ChHHHH--HHHHHHHhhCCCCHHHHHHHHH
Confidence 566777777777777777777774 3344444444444 112 334445 55555666666665421
Q ss_pred ---CChHHHHHHHH----HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 78 ---SDPSYVQMIKE----IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 78 ---~~~~~~~~~~~----l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
....+.++... +...|+ ...+..++.++.+.|+.++|.++|++++.+-
T Consensus 337 l~~~~~~~~~A~~~le~al~~~P~-------~~~~~~La~~~~~~g~~~~A~~~~~~~l~~~ 391 (398)
T PRK10747 337 LLMKHGEWQEASLAFRAALKQRPD-------AYDYAWLADALDRLHKPEEAAAMRRDGLMLT 391 (398)
T ss_pred HHHHCCCHHHHHHHHHHHHhcCCC-------HHHHHHHHHHHHHcCCHHHHHHHHHHHHhhh
Confidence 11111111111 222232 3556789999999999999999999998864
No 160
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=98.04 E-value=6.4e-06 Score=48.68 Aligned_cols=42 Identities=26% Similarity=0.457 Sum_probs=40.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
.+|.||+..|+|++|+..+++ ++.+|.++..+..+|.+ ..+|
T Consensus 30 ~la~~~~~~~~y~~A~~~~~~-~~~~~~~~~~~~l~a~~~~~l~ 72 (84)
T PF12895_consen 30 NLAQCYFQQGKYEEAIELLQK-LKLDPSNPDIHYLLARCLLKLG 72 (84)
T ss_dssp HHHHHHHHTTHHHHHHHHHHC-HTHHHCHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHCCCHHHHHHHHHH-hCCCCCCHHHHHHHHHHHHHhC
Confidence 479999999999999999999 99999999999999999 9999
No 161
>KOG2003|consensus
Probab=98.01 E-value=0.00013 Score=55.68 Aligned_cols=43 Identities=16% Similarity=0.136 Sum_probs=23.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
++|..+..+|+.++|+++|-+.-.+=-++++++..++.+ ..+.
T Consensus 529 niglt~e~~~~ldeald~f~klh~il~nn~evl~qianiye~le 572 (840)
T KOG2003|consen 529 NIGLTAEALGNLDEALDCFLKLHAILLNNAEVLVQIANIYELLE 572 (840)
T ss_pred HhcccHHHhcCHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHhh
Confidence 345555555555555555555555555555555555555 4444
No 162
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=98.00 E-value=7.9e-05 Score=56.38 Aligned_cols=90 Identities=13% Similarity=0.185 Sum_probs=74.1
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCCh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDP 80 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~ 80 (150)
.++.++...++..+|++...++|+..|.+.+.+...+.. ...+ ++..|+ .-..++....|+.
T Consensus 205 ~LA~v~l~~~~E~~AI~ll~~aL~~~p~d~~LL~~Qa~fLl~k~--------~~~lAL--~iAk~av~lsP~~------- 267 (395)
T PF09295_consen 205 LLARVYLLMNEEVEAIRLLNEALKENPQDSELLNLQAEFLLSKK--------KYELAL--EIAKKAVELSPSE------- 267 (395)
T ss_pred HHHHHHHhcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcC--------CHHHHH--HHHHHHHHhCchh-------
Confidence 478888999999999999999999999998888877877 6666 555666 5565566666632
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHH
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAI 129 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al 129 (150)
...|..++.+|...|+|++|+-..+-+=
T Consensus 268 ---------------------f~~W~~La~~Yi~~~d~e~ALlaLNs~P 295 (395)
T PF09295_consen 268 ---------------------FETWYQLAECYIQLGDFENALLALNSCP 295 (395)
T ss_pred ---------------------HHHHHHHHHHHHhcCCHHHHHHHHhcCc
Confidence 8899999999999999999998877553
No 163
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=98.00 E-value=0.00022 Score=52.10 Aligned_cols=125 Identities=12% Similarity=0.163 Sum_probs=86.1
Q ss_pred HHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCC-----
Q psy3020 6 CLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSD----- 79 (150)
Q Consensus 6 ~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~----- 79 (150)
=+.=..+.++|++.|...++.||..++++..||.+ +.-| ....|+ .+...+-..|+...-...
T Consensus 44 NfLLs~Q~dKAvdlF~e~l~~d~~t~e~~ltLGnLfRsRG--------EvDRAI---RiHQ~L~~spdlT~~qr~lAl~q 112 (389)
T COG2956 44 NFLLSNQPDKAVDLFLEMLQEDPETFEAHLTLGNLFRSRG--------EVDRAI---RIHQTLLESPDLTFEQRLLALQQ 112 (389)
T ss_pred HHHhhcCcchHHHHHHHHHhcCchhhHHHHHHHHHHHhcc--------hHHHHH---HHHHHHhcCCCCchHHHHHHHHH
Confidence 34456788999999999999999999999999999 9999 455666 555566666665431100
Q ss_pred -------hHHH----HHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 80 -------PSYV----QMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 80 -------~~~~----~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
..+. .....+...++. ...++..+..+|....+|++||++-++.+.+.|+... ..+|.+|++
T Consensus 113 L~~Dym~aGl~DRAE~~f~~L~de~ef-----a~~AlqqLl~IYQ~treW~KAId~A~~L~k~~~q~~~--~eIAqfyCE 185 (389)
T COG2956 113 LGRDYMAAGLLDRAEDIFNQLVDEGEF-----AEGALQQLLNIYQATREWEKAIDVAERLVKLGGQTYR--VEIAQFYCE 185 (389)
T ss_pred HHHHHHHhhhhhHHHHHHHHHhcchhh-----hHHHHHHHHHHHHHhhHHHHHHHHHHHHHHcCCccch--hHHHHHHHH
Confidence 0011 111112111211 1678888999999999999999999999999997643 345555554
No 164
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=97.99 E-value=0.00013 Score=50.71 Aligned_cols=44 Identities=14% Similarity=0.114 Sum_probs=39.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~ 46 (150)
+|..+...|+|+.|++.|+..+++||.+-.++.+.|.. +.-|+.
T Consensus 105 LG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya~lNRgi~~YY~gR~ 149 (297)
T COG4785 105 LGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRY 149 (297)
T ss_pred HHHHHHhcccchHHHHHhhhHhccCCcchHHHhccceeeeecCch
Confidence 78889999999999999999999999999999988877 666654
No 165
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=97.95 E-value=2.3e-05 Score=38.72 Aligned_cols=30 Identities=30% Similarity=0.479 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 104 LLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 104 ~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
+|.++|.++.+.|+|++|+++|++++.+.+
T Consensus 1 al~~Lg~~~~~~g~~~~Ai~~y~~aL~l~~ 30 (36)
T PF13176_consen 1 ALNNLGRIYRQQGDYEKAIEYYEQALALAR 30 (36)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHHHHhcc
Confidence 478999999999999999999999665443
No 166
>KOG0551|consensus
Probab=97.94 E-value=6.2e-05 Score=55.00 Aligned_cols=94 Identities=18% Similarity=0.271 Sum_probs=74.7
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHH----HHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQ----MKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~----a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~ 78 (150)
|+-|++.++|..|..+|.++|+....|+. .|.+.+.+ ..+| -+..++ .++..++..+|.+
T Consensus 88 GN~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~--------NyRs~l--~Dcs~al~~~P~h----- 152 (390)
T KOG0551|consen 88 GNEYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLG--------NYRSAL--NDCSAALKLKPTH----- 152 (390)
T ss_pred hHHHHHhhhHHHHHHHHHHHHhhcCCCccHHHHHHhhHHHHHHHHH--------HHHHHH--HHHHHHHhcCcch-----
Confidence 78899999999999999999998776655 45677888 8888 678888 8999999999976
Q ss_pred ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 79 DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 79 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
..+++.-+.+++...++.+|+..+++.++++-+.
T Consensus 153 -----------------------~Ka~~R~Akc~~eLe~~~~a~nw~ee~~~~d~e~ 186 (390)
T KOG0551|consen 153 -----------------------LKAYIRGAKCLLELERFAEAVNWCEEGLQIDDEA 186 (390)
T ss_pred -----------------------hhhhhhhhHHHHHHHHHHHHHHHHhhhhhhhHHH
Confidence 4455555666777888888888887777766554
No 167
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=97.91 E-value=3.5e-05 Score=36.86 Aligned_cols=33 Identities=21% Similarity=0.242 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
+++..+|.++.+.|++++|++.|++.++..|++
T Consensus 1 ~a~~~~a~~~~~~g~~~~A~~~~~~~~~~~P~s 33 (33)
T PF13174_consen 1 DALYRLARCYYKLGDYDEAIEYFQRLIKRYPDS 33 (33)
T ss_dssp HHHHHHHHHHHHHCHHHHHHHHHHHHHHHSTTS
T ss_pred CHHHHHHHHHHHccCHHHHHHHHHHHHHHCcCC
Confidence 367889999999999999999999999999975
No 168
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=97.88 E-value=4.9e-05 Score=40.92 Aligned_cols=39 Identities=18% Similarity=0.223 Sum_probs=32.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
.++.++.++|+|++|.+..+.+|+++|+|..+..-...+
T Consensus 6 ~lAig~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~~L~~~i 44 (53)
T PF14853_consen 6 YLAIGHYKLGEYEKARRYCDALLEIEPDNRQAQSLKELI 44 (53)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHHTTS-HHHHHHHHHH
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHhhCCCcHHHHHHHHHH
Confidence 478899999999999999999999999999987655555
No 169
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=97.88 E-value=3.1e-05 Score=44.90 Aligned_cols=49 Identities=10% Similarity=0.220 Sum_probs=40.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc-------CCCcchhhcHHHHHHhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLD-------GTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~-------p~~~~~~~n~a~~~~~~~ 150 (150)
+..+.++|.+++..|+|++|+++|++++++. |.-+..++++|.||..++
T Consensus 5 a~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~a~~~~~lg~~~~~~g 60 (78)
T PF13424_consen 5 ANAYNNLARVYRELGRYDEALDYYEKALDIEEQLGDDHPDTANTLNNLGECYYRLG 60 (78)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcC
Confidence 7789999999999999999999999999762 223678899999987653
No 170
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=97.88 E-value=0.00018 Score=56.27 Aligned_cols=122 Identities=9% Similarity=0.130 Sum_probs=83.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC-----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF----- 76 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~----- 76 (150)
++..|...|++++|++..++||+.+|+.++.+...|.+ ...| ++.+|. ..+..+..+|+.+.-.
T Consensus 200 lAqhyd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilKh~G--------~~~~Aa--~~~~~Ar~LD~~DRyiNsK~a 269 (517)
T PF12569_consen 200 LAQHYDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILKHAG--------DLKEAA--EAMDEARELDLADRYINSKCA 269 (517)
T ss_pred HHHHHHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHhCChhhHHHHHHHH
Confidence 57788899999999999999999999999999999999 9999 666776 6666666667654321
Q ss_pred -----CCCh-HHHHHHHHHhcCcchhhh---HHH-hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 77 -----LSDP-SYVQMIKEIQKDPSLMTL---EAK-ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 77 -----~~~~-~~~~~~~~l~~~~~~~~~---~~~-~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
.++. .....+...-+....... ..+ ..-...-|.++.++|++..|++.|....++.-+
T Consensus 270 Ky~LRa~~~e~A~~~~~~Ftr~~~~~~~~L~~mQc~Wf~~e~a~a~~r~~~~~~ALk~~~~v~k~f~~ 337 (517)
T PF12569_consen 270 KYLLRAGRIEEAEKTASLFTREDVDPLSNLNDMQCMWFETECAEAYLRQGDYGLALKRFHAVLKHFDD 337 (517)
T ss_pred HHHHHCCCHHHHHHHHHhhcCCCCCcccCHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHH
Confidence 0110 111111212121111000 111 444466799999999999999999988876544
No 171
>KOG3060|consensus
Probab=97.82 E-value=0.0007 Score=47.98 Aligned_cols=109 Identities=11% Similarity=0.040 Sum_probs=81.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
+..+....|+.+.|..|+++.-..=|....+-..-|-. ...| .+.+|+ .-+...+.-||.+
T Consensus 58 V~IAAld~~~~~lAq~C~~~L~~~fp~S~RV~~lkam~lEa~~--------~~~~A~--e~y~~lL~ddpt~-------- 119 (289)
T KOG3060|consen 58 VFIAALDTGRDDLAQKCINQLRDRFPGSKRVGKLKAMLLEATG--------NYKEAI--EYYESLLEDDPTD-------- 119 (289)
T ss_pred HHHHHHHhcchHHHHHHHHHHHHhCCCChhHHHHHHHHHHHhh--------chhhHH--HHHHHHhccCcch--------
Confidence 34556677888889999988776668888887777766 7777 445565 5555555656654
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...+..+-.+...+|+-.+||+...+-++..|.++.+|..++.+|+.+
T Consensus 120 --------------------~v~~KRKlAilka~GK~l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~ 167 (289)
T KOG3060|consen 120 --------------------TVIRKRKLAILKAQGKNLEAIKELNEYLDKFMNDQEAWHELAEIYLSE 167 (289)
T ss_pred --------------------hHHHHHHHHHHHHcCCcHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhH
Confidence 334444455556778888999999999999999999999999999864
No 172
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=97.80 E-value=6e-05 Score=35.01 Aligned_cols=33 Identities=39% Similarity=0.620 Sum_probs=30.1
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
..|..+|.+++..|++++|+.+|+++++++|++
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~~~~~~~~ 34 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHHccCCCC
Confidence 467889999999999999999999999999863
No 173
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=97.77 E-value=0.00058 Score=55.46 Aligned_cols=48 Identities=6% Similarity=-0.048 Sum_probs=43.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...|..+...+...|+++.|.+.+++.++++|++...|..++.+|.+.
T Consensus 494 ~~~~~~Ll~a~~~~g~~~~a~~~~~~l~~~~p~~~~~y~~L~~~y~~~ 541 (697)
T PLN03081 494 VNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLYNSS 541 (697)
T ss_pred HHHHHHHHHHHHHcCCcHHHHHHHHHHhCCCCCCCcchHHHHHHHHhC
Confidence 567999999999999999999999999999999999999888888754
No 174
>KOG0495|consensus
Probab=97.75 E-value=0.00029 Score=55.88 Aligned_cols=127 Identities=13% Similarity=0.111 Sum_probs=79.1
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPS 81 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~ 81 (150)
.+..+...|+...|...+.+|++.+|++.+.|+.-..+ .... +++.+- .-+.++....|....+.....
T Consensus 590 ~ake~w~agdv~~ar~il~~af~~~pnseeiwlaavKle~en~--------e~eraR--~llakar~~sgTeRv~mKs~~ 659 (913)
T KOG0495|consen 590 YAKEKWKAGDVPAARVILDQAFEANPNSEEIWLAAVKLEFEND--------ELERAR--DLLAKARSISGTERVWMKSAN 659 (913)
T ss_pred HHHHHHhcCCcHHHHHHHHHHHHhCCCcHHHHHHHHHHhhccc--------cHHHHH--HHHHHHhccCCcchhhHHHhH
Confidence 34455666888888888888888888888887766665 3333 333333 333333333443333221111
Q ss_pred ----------HHHHH-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHH
Q psy3020 82 ----------YVQMI-KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAA 145 (150)
Q Consensus 82 ----------~~~~~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~ 145 (150)
..+.+ +.+..-|.. ...|..+|+++-++++.+.|-+.|...+..-|+.+.+|..++.+
T Consensus 660 ~er~ld~~eeA~rllEe~lk~fp~f------~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~cP~~ipLWllLakl 728 (913)
T KOG0495|consen 660 LERYLDNVEEALRLLEEALKSFPDF------HKLWLMLGQIEEQMENIEMAREAYLQGTKKCPNSIPLWLLLAKL 728 (913)
T ss_pred HHHHhhhHHHHHHHHHHHHHhCCch------HHHHHHHhHHHHHHHHHHHHHHHHHhccccCCCCchHHHHHHHH
Confidence 11111 113344444 77888888888888888888888888888888888888777654
No 175
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=97.74 E-value=0.00028 Score=52.61 Aligned_cols=47 Identities=26% Similarity=0.270 Sum_probs=40.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..+..+|..+++.+.|.+|-++|+.|+..-| +...|..+|.++-++
T Consensus 328 p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~-s~~~~~~la~~~~~~ 374 (400)
T COG3071 328 PLLLSTLGRLALKNKLWGKASEALEAALKLRP-SASDYAELADALDQL 374 (400)
T ss_pred hhHHHHHHHHHHHhhHHHHHHHHHHHHHhcCC-ChhhHHHHHHHHHHc
Confidence 78888899999999999999999999999999 555667788877665
No 176
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=97.74 E-value=0.00036 Score=51.02 Aligned_cols=36 Identities=8% Similarity=0.021 Sum_probs=17.5
Q ss_pred hhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 9 YLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 9 ~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
...+++.|+..+.+|++.||+.+.+-..+|++ ...|
T Consensus 192 ~~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g 228 (389)
T COG2956 192 ASSDVDRARELLKKALQADKKCVRASIILGRVELAKG 228 (389)
T ss_pred hhhhHHHHHHHHHHHHhhCccceehhhhhhHHHHhcc
Confidence 33444445555555555555555554444544 4444
No 177
>KOG2796|consensus
Probab=97.71 E-value=0.00056 Score=48.89 Aligned_cols=132 Identities=14% Similarity=0.185 Sum_probs=78.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCC-CcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCC------
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDP-NNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRT------ 73 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p-~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~------ 73 (150)
.+..|+...|.|.-.+..+.++++-+| ..+....+++.+ ...|+.+ ..+.-+ .++.+.-..-.+-
T Consensus 182 ~~~~~llG~kEy~iS~d~~~~vi~~~~e~~p~L~s~Lgr~~MQ~GD~k-----~a~~yf--~~vek~~~kL~~~q~~~~V 254 (366)
T KOG2796|consen 182 SMANCLLGMKEYVLSVDAYHSVIKYYPEQEPQLLSGLGRISMQIGDIK-----TAEKYF--QDVEKVTQKLDGLQGKIMV 254 (366)
T ss_pred HHHHHHhcchhhhhhHHHHHHHHHhCCcccHHHHHHHHHHHHhcccHH-----HHHHHH--HHHHHHHhhhhccchhHHH
Confidence 357789999999999999999999995 577777899999 8888553 111112 1111110000000
Q ss_pred -----CCCCCChHHHHHH----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc---chhhc
Q psy3020 74 -----KPFLSDPSYVQMI----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH---ILFSN 141 (150)
Q Consensus 74 -----~~~~~~~~~~~~~----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~---~~~~n 141 (150)
..++....+.... +-+..+++. +.+-.+++-+++..|+..+|++..+.+++++|... .+.+|
T Consensus 255 ~~n~a~i~lg~nn~a~a~r~~~~i~~~D~~~------~~a~NnKALcllYlg~l~DAiK~~e~~~~~~P~~~l~es~~~n 328 (366)
T KOG2796|consen 255 LMNSAFLHLGQNNFAEAHRFFTEILRMDPRN------AVANNNKALCLLYLGKLKDALKQLEAMVQQDPRHYLHESVLFN 328 (366)
T ss_pred HhhhhhheecccchHHHHHHHhhccccCCCc------hhhhchHHHHHHHHHHHHHHHHHHHHHhccCCccchhhhHHHH
Confidence 0011111111111 123445544 66677777777778888888888888888888643 34455
Q ss_pred HHHHH
Q psy3020 142 RSAAF 146 (150)
Q Consensus 142 ~a~~~ 146 (150)
+..+|
T Consensus 329 L~tmy 333 (366)
T KOG2796|consen 329 LTTMY 333 (366)
T ss_pred HHHHH
Confidence 54444
No 178
>KOG4234|consensus
Probab=97.69 E-value=0.00013 Score=50.10 Aligned_cols=63 Identities=17% Similarity=0.331 Sum_probs=56.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTK 74 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~ 74 (150)
+-|.|+.++++++.|++.+.+||+++|.+..++...+.+ ..+. ++..++ .++.+.+..+|...
T Consensus 139 Nraaa~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAeayek~e--------k~eeal--eDyKki~E~dPs~~ 202 (271)
T KOG4234|consen 139 NRAAALIKLRKWESAIEDCSKAIELNPTYEKALERRAEAYEKME--------KYEEAL--EDYKKILESDPSRR 202 (271)
T ss_pred hhHHHHHHhhhHHHHHHHHHhhHhcCchhHHHHHHHHHHHHhhh--------hHHHHH--HHHHHHHHhCcchH
Confidence 347899999999999999999999999999999999998 7776 788899 99999999999653
No 179
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=97.68 E-value=0.0012 Score=42.84 Aligned_cols=49 Identities=22% Similarity=0.272 Sum_probs=43.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc---chhhcHHHHHHhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH---ILFSNRSAAFAKAI 150 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~---~~~~n~a~~~~~~~ 150 (150)
..+...+|.++++.++|++|+..+.+-|+++|+++ -+++.+|.++++++
T Consensus 47 ~qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~hp~vdYa~Y~~gL~~~~~~ 98 (142)
T PF13512_consen 47 EQAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTHPNVDYAYYMRGLSYYEQD 98 (142)
T ss_pred HHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCCCccHHHHHHHHHHHHHh
Confidence 56778899999999999999999999999999986 57888999998763
No 180
>KOG4642|consensus
Probab=97.67 E-value=3.4e-05 Score=54.02 Aligned_cols=48 Identities=29% Similarity=0.490 Sum_probs=41.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+.-+...|+.++....|..|+.+|.+||-++|+.+..|.||+.||+++
T Consensus 10 a~qlkE~gnk~f~~k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk~ 57 (284)
T KOG4642|consen 10 AEQLKEQGNKCFIPKRYDDAIDCYSRAICINPTVASYYTNRALCHLKL 57 (284)
T ss_pred HHHHHhccccccchhhhchHHHHHHHHHhcCCCcchhhhhHHHHHHHh
Confidence 455677788888899999999999999999999999999999999875
No 181
>KOG0545|consensus
Probab=97.66 E-value=0.00049 Score=48.63 Aligned_cols=96 Identities=21% Similarity=0.214 Sum_probs=66.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhc--------cCCCcHHH----------HHHHHHH-HhhhhccCCCCCchhhhhCChHH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLK--------LDPNNEQM----------KEAIKDV-RNQEMNDMNRGDPFANLFSDPNI 63 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~--------~~p~~~~a----------~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~ 63 (150)
-|+-++..|+|++|...|..|+- -.|..++. +.+..+| ...| ++-+++ ..+
T Consensus 184 ~GN~lfk~~~ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~--------e~yevl--eh~ 253 (329)
T KOG0545|consen 184 EGNRLFKLGRYKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKE--------EYYEVL--EHC 253 (329)
T ss_pred hhhhhhhhccHHHHHHHHHHHHHHHHHHHhccCCCChHHHHHHHhhhHHHHhHHHHHhhHH--------HHHHHH--HHH
Confidence 37889999999999999998863 34554442 2334444 3334 333344 444
Q ss_pred HHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 64 FVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 64 ~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
...++..|+ . ..+++.+|.+....=+..+|...|.++|+++|.-.
T Consensus 254 seiL~~~~~----------------------n------vKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpsla 298 (329)
T KOG0545|consen 254 SEILRHHPG----------------------N------VKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSLA 298 (329)
T ss_pred HHHHhcCCc----------------------h------HHHHHHHHHHHHhhcCHHHHHHHHHHHHhcChhhH
Confidence 444444443 2 78888888888888888999999999999999643
No 182
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=97.66 E-value=0.00015 Score=52.15 Aligned_cols=64 Identities=9% Similarity=-0.016 Sum_probs=53.4
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCc---HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCC
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNN---EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTK 74 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~---~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~ 74 (150)
+++|.+|+..|++++|+..|+++++..|++ ++++..+|.+ ..+| +..+|. ..+...+...|+..
T Consensus 184 y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g--------~~~~A~--~~~~~vi~~yP~s~ 251 (263)
T PRK10803 184 YWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKG--------DTAKAK--AVYQQVIKKYPGTD 251 (263)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHCcCCH
Confidence 479999999999999999999999998884 6678888888 8888 666777 77777777778653
No 183
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=97.65 E-value=0.00054 Score=43.40 Aligned_cols=83 Identities=12% Similarity=-0.012 Sum_probs=62.5
Q ss_pred HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHH
Q psy3020 32 QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGN 110 (150)
Q Consensus 32 ~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~ 110 (150)
.+++.++.+ ..+| ...+|+ +.+.+++...... + ....++..+|.
T Consensus 2 ~~~~~~A~a~d~~G--------~~~~Ai--~~Y~~Al~~gL~~-------------------~------~~~~a~i~las 46 (120)
T PF12688_consen 2 RALYELAWAHDSLG--------REEEAI--PLYRRALAAGLSG-------------------A------DRRRALIQLAS 46 (120)
T ss_pred chHHHHHHHHHhcC--------CHHHHH--HHHHHHHHcCCCc-------------------h------HHHHHHHHHHH
Confidence 456677777 7778 556677 7776666543321 1 12678999999
Q ss_pred HHHHhcCHHHHHHHHHHHHhhcCC---CcchhhcHHHHHHhc
Q psy3020 111 AALQANNFKEAIEAYSEAIKLDGT---NHILFSNRSAAFAKA 149 (150)
Q Consensus 111 ~~~~~~~~~~A~~~~~~al~~~p~---~~~~~~n~a~~~~~~ 149 (150)
.+...|++++|+..+++++...|+ +..+...++.++..+
T Consensus 47 tlr~LG~~deA~~~L~~~~~~~p~~~~~~~l~~f~Al~L~~~ 88 (120)
T PF12688_consen 47 TLRNLGRYDEALALLEEALEEFPDDELNAALRVFLALALYNL 88 (120)
T ss_pred HHHHcCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHHC
Confidence 999999999999999999999998 777777777776643
No 184
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=97.63 E-value=5e-05 Score=37.48 Aligned_cols=26 Identities=27% Similarity=0.401 Sum_probs=22.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLD 27 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~ 27 (150)
.||.+|...|+|++|+++|+++|.+.
T Consensus 4 ~Lg~~~~~~g~~~~Ai~~y~~aL~l~ 29 (36)
T PF13176_consen 4 NLGRIYRQQGDYEKAIEYYEQALALA 29 (36)
T ss_dssp HHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHhc
Confidence 58999999999999999999966543
No 185
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=97.61 E-value=0.0015 Score=47.16 Aligned_cols=114 Identities=14% Similarity=0.158 Sum_probs=80.0
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC---C
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS---D 79 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~---~ 79 (150)
|......|++.+|...|..+++.+|++.++...++.+ ...|+. +.+. .+...+..+.....++. +
T Consensus 141 ~~~~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~--------e~A~---~iL~~lP~~~~~~~~~~l~a~ 209 (304)
T COG3118 141 AKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDV--------EAAQ---AILAALPLQAQDKAAHGLQAQ 209 (304)
T ss_pred hhhhhhccchhhHHHHHHHHHHhCcccchHHHHHHHHHHHcCCh--------HHHH---HHHHhCcccchhhHHHHHHHH
Confidence 4567788999999999999999999999999999999 888844 2222 22222222222222211 0
Q ss_pred ----------hHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 80 ----------PSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 80 ----------~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
+........+..+|++ ..+-..++..+...|+.++|.+.+-..+..|.+
T Consensus 210 i~ll~qaa~~~~~~~l~~~~aadPdd------~~aa~~lA~~~~~~g~~e~Ale~Ll~~l~~d~~ 268 (304)
T COG3118 210 IELLEQAAATPEIQDLQRRLAADPDD------VEAALALADQLHLVGRNEAALEHLLALLRRDRG 268 (304)
T ss_pred HHHHHHHhcCCCHHHHHHHHHhCCCC------HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccc
Confidence 1122333445556655 888899999999999999999999888887544
No 186
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=97.60 E-value=9.6e-05 Score=34.27 Aligned_cols=29 Identities=41% Similarity=0.678 Sum_probs=27.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN 30 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~ 30 (150)
.+|.++...+++++|+.+|+++++++|++
T Consensus 6 ~~a~~~~~~~~~~~a~~~~~~~~~~~~~~ 34 (34)
T smart00028 6 NLGNAYLKLGDYDEALEYYEKALELDPNN 34 (34)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHccCCCC
Confidence 57999999999999999999999999974
No 187
>KOG1308|consensus
Probab=97.54 E-value=5.6e-05 Score=55.40 Aligned_cols=92 Identities=18% Similarity=0.196 Sum_probs=72.6
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHH
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYV 83 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~ 83 (150)
.-.+..|.+++|+++|..++.++|.....|-..+.+ ..++ +-..++ .++..++.++|+.
T Consensus 122 ~eAln~G~~~~ai~~~t~ai~lnp~~a~l~~kr~sv~lkl~--------kp~~ai--rD~d~A~ein~Ds---------- 181 (377)
T KOG1308|consen 122 SEALNDGEFDTAIELFTSAIELNPPLAILYAKRASVFLKLK--------KPNAAI--RDCDFAIEINPDS---------- 181 (377)
T ss_pred HHHhcCcchhhhhcccccccccCCchhhhcccccceeeecc--------CCchhh--hhhhhhhccCccc----------
Confidence 345677999999999999999999999999999988 7777 334555 7777788888875
Q ss_pred HHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 84 QMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 84 ~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
...+..+|.....+|+|++|-..+..+.+++-+
T Consensus 182 ------------------a~~ykfrg~A~rllg~~e~aa~dl~~a~kld~d 214 (377)
T KOG1308|consen 182 ------------------AKGYKFRGYAERLLGNWEEAAHDLALACKLDYD 214 (377)
T ss_pred ------------------ccccchhhHHHHHhhchHHHHHHHHHHHhcccc
Confidence 344555666677778888888888888777655
No 188
>KOG1130|consensus
Probab=97.51 E-value=0.00024 Score=53.64 Aligned_cols=114 Identities=8% Similarity=0.148 Sum_probs=78.7
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHH----HHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMK----EAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~----~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~ 78 (150)
|.-+++.|++..++..|+.||++--++...+ ..|+.. ..++ ++.+|+ .....+
T Consensus 24 GERLck~gdcraGv~ff~aA~qvGTeDl~tLSAIYsQLGNAyfyL~--------DY~kAl---------~yH~hD----- 81 (639)
T KOG1130|consen 24 GERLCKMGDCRAGVDFFKAALQVGTEDLSTLSAIYSQLGNAYFYLK--------DYEKAL---------KYHTHD----- 81 (639)
T ss_pred HHHHHhccchhhhHHHHHHHHHhcchHHHHHHHHHHHhcchhhhHh--------hHHHHH---------hhhhhh-----
Confidence 6678899999999999999999988776543 334444 4444 444444 222211
Q ss_pred ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC------CcchhhcHHHHHHhc
Q psy3020 79 DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT------NHILFSNRSAAFAKA 149 (150)
Q Consensus 79 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~------~~~~~~n~a~~~~~~ 149 (150)
+-..+.+...... +.+--++|+++-.+|.|++|+.|+.+-+.+-.. ...++||+|.+|+..
T Consensus 82 ----ltlar~lgdklGE------AKssgNLGNtlKv~G~fdeA~~cc~rhLd~areLgDrv~e~RAlYNlgnvYhak 148 (639)
T KOG1130|consen 82 ----LTLARLLGDKLGE------AKSSGNLGNTLKVKGAFDEALTCCFRHLDFARELGDRVLESRALYNLGNVYHAK 148 (639)
T ss_pred ----HHHHHHhcchhcc------ccccccccchhhhhcccchHHHHHHHHhHHHHHHhHHHhhhHHHhhhhhhhhhc
Confidence 1112223222222 667789999999999999999999988776544 468899999999754
No 189
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=97.50 E-value=7e-05 Score=35.79 Aligned_cols=29 Identities=28% Similarity=0.503 Sum_probs=27.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN 30 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~ 30 (150)
.+|.|+...|++++|++.|+++++..|++
T Consensus 5 ~~a~~~~~~g~~~~A~~~~~~~~~~~P~s 33 (33)
T PF13174_consen 5 RLARCYYKLGDYDEAIEYFQRLIKRYPDS 33 (33)
T ss_dssp HHHHHHHHHCHHHHHHHHHHHHHHHSTTS
T ss_pred HHHHHHHHccCHHHHHHHHHHHHHHCcCC
Confidence 68999999999999999999999999974
No 190
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=97.47 E-value=0.00054 Score=36.86 Aligned_cols=41 Identities=10% Similarity=0.055 Sum_probs=32.9
Q ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 104 LLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 104 ~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
.++.++..+++.|+|++|.++...+|+++|+|..+......
T Consensus 3 ~lY~lAig~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~~L~~~ 43 (53)
T PF14853_consen 3 CLYYLAIGHYKLGEYEKARRYCDALLEIEPDNRQAQSLKEL 43 (53)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHHHHHTTS-HHHHHHHHH
T ss_pred hHHHHHHHHHHhhhHHHHHHHHHHHHhhCCCcHHHHHHHHH
Confidence 46778888999999999999999999999999877654433
No 191
>PLN03077 Protein ECB2; Provisional
Probab=97.42 E-value=0.0041 Score=51.78 Aligned_cols=134 Identities=11% Similarity=0.107 Sum_probs=86.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhc--cCCCcHHHHHHHHHHHhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC---
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLK--LDPNNEQMKEAIKDVRNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF--- 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~--~~p~~~~a~~~l~~~~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~--- 76 (150)
.+..+|...|+.++|++.|++..+ +.||.......+..+...|..+ +..+.+ ....+.....|+...+
T Consensus 559 ~lI~~~~~~G~~~~A~~lf~~M~~~g~~Pd~~T~~~ll~a~~~~g~v~-----ea~~~f--~~M~~~~gi~P~~~~y~~l 631 (857)
T PLN03077 559 ILLTGYVAHGKGSMAVELFNRMVESGVNPDEVTFISLLCACSRSGMVT-----QGLEYF--HSMEEKYSITPNLKHYACV 631 (857)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCCCCCcccHHHHHHHHhhcChHH-----HHHHHH--HHHHHHhCCCCchHHHHHH
Confidence 356788999999999999998776 6777666544443334455332 222233 3333334455653211
Q ss_pred ---CC-Ch---HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 77 ---LS-DP---SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 77 ---~~-~~---~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+. .+ .....++..... +.+..|..+-..+...|+.+.+....+++++++|+++..|..++.+|...
T Consensus 632 v~~l~r~G~~~eA~~~~~~m~~~-------pd~~~~~aLl~ac~~~~~~e~~e~~a~~l~~l~p~~~~~y~ll~n~ya~~ 704 (857)
T PLN03077 632 VDLLGRAGKLTEAYNFINKMPIT-------PDPAVWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLYADA 704 (857)
T ss_pred HHHHHhCCCHHHHHHHHHHCCCC-------CCHHHHHHHHHHHHHcCChHHHHHHHHHHHhhCCCCcchHHHHHHHHHHC
Confidence 00 01 122223332222 22577888888888899999999999999999999999999999888653
No 192
>KOG2376|consensus
Probab=97.40 E-value=0.0033 Score=49.37 Aligned_cols=109 Identities=15% Similarity=0.240 Sum_probs=75.4
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSY 82 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~ 82 (150)
+.|++++++.++|+.+++ -+|+.+.....-.+++ +.+| .+.+++ ..|....+.+.++.+...+..+
T Consensus 86 AYc~Yrlnk~Dealk~~~---~~~~~~~~ll~L~AQvlYrl~--------~ydeal--diY~~L~kn~~dd~d~~~r~nl 152 (652)
T KOG2376|consen 86 AYCEYRLNKLDEALKTLK---GLDRLDDKLLELRAQVLYRLE--------RYDEAL--DIYQHLAKNNSDDQDEERRANL 152 (652)
T ss_pred HHHHHHcccHHHHHHHHh---cccccchHHHHHHHHHHHHHh--------hHHHHH--HHHHHHHhcCCchHHHHHHHHH
Confidence 679999999999999999 5677767677777899 9999 667887 6666555666655442111011
Q ss_pred HH--------HHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 83 VQ--------MIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 83 ~~--------~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
.. ..+.+..-|+ ...+.+++.++++...|+|.+|++.+++|+.
T Consensus 153 ~a~~a~l~~~~~q~v~~v~e-----~syel~yN~Ac~~i~~gky~qA~elL~kA~~ 203 (652)
T KOG2376|consen 153 LAVAAALQVQLLQSVPEVPE-----DSYELLYNTACILIENGKYNQAIELLEKALR 203 (652)
T ss_pred HHHHHhhhHHHHHhccCCCc-----chHHHHHHHHHHHHhcccHHHHHHHHHHHHH
Confidence 11 1111111111 1278889999999999999999999999943
No 193
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=97.33 E-value=0.0088 Score=39.83 Aligned_cols=46 Identities=26% Similarity=0.330 Sum_probs=31.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
...+..++..+...+++.+|+..+.+++...|.....+..++..+.
T Consensus 202 ~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (291)
T COG0457 202 AEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLL 247 (291)
T ss_pred hHHHHHhhHHHHHcccHHHHHHHHHHHHhhCcccHHHHhhHHHHHH
Confidence 4556677777777777788888888888877775555555555543
No 194
>KOG4555|consensus
Probab=97.33 E-value=0.00076 Score=43.30 Aligned_cols=47 Identities=23% Similarity=0.275 Sum_probs=41.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
...+...|..+...|+.++|++-|.++|.+-|..+.+|.||+.++--
T Consensus 43 S~~LEl~~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RL 89 (175)
T KOG4555|consen 43 SRELELKAIALAEAGDLDGALELFGQALCLAPERASAYNNRAQALRL 89 (175)
T ss_pred HHHHHHHHHHHHhccchHHHHHHHHHHHHhcccchHhhccHHHHHHH
Confidence 45566678888899999999999999999999999999999998753
No 195
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=97.31 E-value=0.0076 Score=42.83 Aligned_cols=120 Identities=10% Similarity=0.022 Sum_probs=68.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHH---HHHHHHHH--HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQ---MKEAIKDV--RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~---a~~~l~~~--~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
.++.++++.++|++|+...++-+++.|+++. +....+.+ ....+..... .....++ ..+.+.+..-|++.-.
T Consensus 76 ~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~YlkgLs~~~~i~~~~rDq-~~~~~A~--~~f~~~i~ryPnS~Ya 152 (254)
T COG4105 76 DLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADYAYYLKGLSYFFQIDDVTRDQ-SAARAAF--AAFKELVQRYPNSRYA 152 (254)
T ss_pred HHHHHHHhcccHHHHHHHHHHHHHhCCCCCChhHHHHHHHHHHhccCCccccCH-HHHHHHH--HHHHHHHHHCCCCcch
Confidence 4677888888888888888888888887554 33333333 2222110000 1345566 6666677767765432
Q ss_pred CCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 77 LSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 77 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
- +....+..+.... +.--...|+.+.+.|.+..|+.-+++.++--|+-
T Consensus 153 ~---dA~~~i~~~~d~L--------A~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~t 200 (254)
T COG4105 153 P---DAKARIVKLNDAL--------AGHEMAIARYYLKRGAYVAAINRFEEVLENYPDT 200 (254)
T ss_pred h---hHHHHHHHHHHHH--------HHHHHHHHHHHHHhcChHHHHHHHHHHHhccccc
Confidence 1 1222222221110 2233446777778888888888888888877764
No 196
>KOG2376|consensus
Probab=97.29 E-value=0.0024 Score=50.13 Aligned_cols=110 Identities=16% Similarity=0.205 Sum_probs=69.3
Q ss_pred HHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC---------
Q psy3020 7 LSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF--------- 76 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~--------- 76 (150)
+-+.++|++|.....+.|.+.|++..+...-.-+ ...+ .|.+|+ ..+...........-+
T Consensus 22 ~~~~~e~e~a~k~~~Kil~~~pdd~~a~~cKvValIq~~--------ky~~AL--k~ikk~~~~~~~~~~~fEKAYc~Yr 91 (652)
T KOG2376|consen 22 HGKNGEYEEAVKTANKILSIVPDDEDAIRCKVVALIQLD--------KYEDAL--KLIKKNGALLVINSFFFEKAYCEYR 91 (652)
T ss_pred hccchHHHHHHHHHHHHHhcCCCcHhhHhhhHhhhhhhh--------HHHHHH--HHHHhcchhhhcchhhHHHHHHHHH
Confidence 4567899999999999999999999998876666 6666 455554 2222211111111110
Q ss_pred CCCh-HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 77 LSDP-SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 77 ~~~~-~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
++.. .+...+..++..+ ...+.-.++++|+.|+|++|+..|...++-+.+
T Consensus 92 lnk~Dealk~~~~~~~~~--------~~ll~L~AQvlYrl~~ydealdiY~~L~kn~~d 142 (652)
T KOG2376|consen 92 LNKLDEALKTLKGLDRLD--------DKLLELRAQVLYRLERYDEALDIYQHLAKNNSD 142 (652)
T ss_pred cccHHHHHHHHhcccccc--------hHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCc
Confidence 1111 1222222222222 557788899999999999999999998665544
No 197
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=97.28 E-value=0.005 Score=44.64 Aligned_cols=47 Identities=21% Similarity=0.222 Sum_probs=34.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc-------hhhcHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI-------LFSNRSAAFAK 148 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~-------~~~n~a~~~~~ 148 (150)
...+..++.++.+.|+|++|++.|++.....-+++. .++..++|++-
T Consensus 155 ~~~~~~~A~l~~~l~~y~~A~~~~e~~~~~~l~~~l~~~~~~~~~l~a~l~~L~ 208 (282)
T PF14938_consen 155 AECLLKAADLYARLGRYEEAIEIYEEVAKKCLENNLLKYSAKEYFLKAILCHLA 208 (282)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHTCCCHCTTGHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHHH
Confidence 677788999999999999999999999886544432 33445555543
No 198
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=97.20 E-value=0.0037 Score=48.61 Aligned_cols=104 Identities=13% Similarity=0.074 Sum_probs=80.3
Q ss_pred hCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHH
Q psy3020 10 LGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKE 88 (150)
Q Consensus 10 ~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 88 (150)
....+.|.+.++..++.-|+.+-.....|.+ ...| ++.+|+ ..+.+++....+ + ..
T Consensus 246 ~~~~~~a~~lL~~~~~~yP~s~lfl~~~gR~~~~~g--------~~~~Ai--~~~~~a~~~q~~---------~-~Q--- 302 (468)
T PF10300_consen 246 DVPLEEAEELLEEMLKRYPNSALFLFFEGRLERLKG--------NLEEAI--ESFERAIESQSE---------W-KQ--- 302 (468)
T ss_pred CCCHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHhc--------CHHHHH--HHHHHhccchhh---------H-Hh---
Confidence 4567889999999999999999999999999 8888 666776 555544421111 1 11
Q ss_pred HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 89 IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 89 l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
-+.-.+..+|+.+..+++|++|.++|.+.++.+.++...|.+.+.|.+
T Consensus 303 -----------l~~l~~~El~w~~~~~~~w~~A~~~f~~L~~~s~WSka~Y~Y~~a~c~ 350 (468)
T PF10300_consen 303 -----------LHHLCYFELAWCHMFQHDWEEAAEYFLRLLKESKWSKAFYAYLAAACL 350 (468)
T ss_pred -----------HHHHHHHHHHHHHHHHchHHHHHHHHHHHHhccccHHHHHHHHHHHHH
Confidence 115678999999999999999999999999999998877776655543
No 199
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=97.20 E-value=0.0015 Score=43.97 Aligned_cols=48 Identities=23% Similarity=0.326 Sum_probs=35.3
Q ss_pred hHHHHHHHHHHHHh----------cCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQA----------NNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~----------~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
++.+++=|.++..+ .-+++|+.-|++||.++|+...+++++|.+|..+
T Consensus 25 adnL~~WG~ALLELAqfk~g~es~~miedAisK~eeAL~I~P~~hdAlw~lGnA~ts~ 82 (186)
T PF06552_consen 25 ADNLTNWGGALLELAQFKQGPESKKMIEDAISKFEEALKINPNKHDALWCLGNAYTSL 82 (186)
T ss_dssp HHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHH
Confidence 45555555555432 4567899999999999999999999999999754
No 200
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.19 E-value=0.0057 Score=41.78 Aligned_cols=96 Identities=14% Similarity=0.143 Sum_probs=65.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCC-cHH--HHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPN-NEQ--MKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~-~~~--a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.++..+...+++++|...++.+|....| +.. +...|+.+ ...| .+.+++ ..+.. +...
T Consensus 94 ~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~--------k~D~AL--~~L~t-----~~~~--- 155 (207)
T COG2976 94 ELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQK--------KADAAL--KTLDT-----IKEE--- 155 (207)
T ss_pred HHHHHHHhhccHHHHHHHHHHHHccchhHHHHHHHHHHHHHHHHHhh--------hHHHHH--HHHhc-----cccc---
Confidence 4677888999999999999999865544 222 34667777 7777 455555 22210 0000
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
.+. +.....+|.++..+|+-.+|...|+++++.+++.+
T Consensus 156 ---~w~------------------~~~~elrGDill~kg~k~~Ar~ay~kAl~~~~s~~ 193 (207)
T COG2976 156 ---SWA------------------AIVAELRGDILLAKGDKQEARAAYEKALESDASPA 193 (207)
T ss_pred ---cHH------------------HHHHHHhhhHHHHcCchHHHHHHHHHHHHccCChH
Confidence 010 23345589999999999999999999999986543
No 201
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=97.17 E-value=0.0034 Score=45.80 Aligned_cols=122 Identities=13% Similarity=0.041 Sum_probs=77.1
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC-----
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL----- 77 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~----- 77 (150)
-.+|...+|++.|.+.++..-+++.|..-....-+-+ ...|.. .+.+|+ -.+.+.....+.+...+
T Consensus 138 Vqi~L~~~R~dlA~k~l~~~~~~~eD~~l~qLa~awv~l~~g~e------~~~~A~--y~f~El~~~~~~t~~~lng~A~ 209 (290)
T PF04733_consen 138 VQILLKMNRPDLAEKELKNMQQIDEDSILTQLAEAWVNLATGGE------KYQDAF--YIFEELSDKFGSTPKLLNGLAV 209 (290)
T ss_dssp HHHHHHTT-HHHHHHHHHHHHCCSCCHHHHHHHHHHHHHHHTTT------CCCHHH--HHHHHHHCCS--SHHHHHHHHH
T ss_pred HHHHHHcCCHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHhCch------hHHHHH--HHHHHHHhccCCCHHHHHHHHH
Confidence 4678889999999999999999988877777666667 666632 455555 44444333223322210
Q ss_pred ---CChHHH---HHH-HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCH-HHHHHHHHHHHhhcCCCcchh
Q psy3020 78 ---SDPSYV---QMI-KEIQKDPSLMTLEAKISLLKDKGNAALQANNF-KEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 78 ---~~~~~~---~~~-~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~~A~~~~~~al~~~p~~~~~~ 139 (150)
....+. ..+ +.+..+|.. +.++.++..+....|+. +.+-++..+.-..+|+++..-
T Consensus 210 ~~l~~~~~~eAe~~L~~al~~~~~~------~d~LaNliv~~~~~gk~~~~~~~~l~qL~~~~p~h~~~~ 273 (290)
T PF04733_consen 210 CHLQLGHYEEAEELLEEALEKDPND------PDTLANLIVCSLHLGKPTEAAERYLSQLKQSNPNHPLVK 273 (290)
T ss_dssp HHHHCT-HHHHHHHHHHHCCC-CCH------HHHHHHHHHHHHHTT-TCHHHHHHHHHCHHHTTTSHHHH
T ss_pred HHHHhCCHHHHHHHHHHHHHhccCC------HHHHHHHHHHHHHhCCChhHHHHHHHHHHHhCCCChHHH
Confidence 111122 222 225555655 88999999998888988 667778888888999987653
No 202
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=97.15 E-value=0.019 Score=38.15 Aligned_cols=134 Identities=21% Similarity=0.247 Sum_probs=78.1
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhc--cCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC---
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLK--LDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF--- 76 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~--~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~--- 76 (150)
.+..+...+.+..+...+..++. ..+.....+...+.+ ...+ .+..++ ..+...+...+.....
T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~--~~~~~~~~~~~~~~~~~~~ 134 (291)
T COG0457 65 LALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALG--------KYEEAL--ELLEKALALDPDPDLAEAL 134 (291)
T ss_pred HHHHHHHcccHHHHHHHHHHHHhhhhccchHHHHHHHHHHHHHHh--------hHHHHH--HHHHHHHcCCCCcchHHHH
Confidence 45566777777777777777776 667777777777666 6655 333344 3444444433332110
Q ss_pred ------CCChHHHHHHHH----HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC-CcchhhcHHHH
Q psy3020 77 ------LSDPSYVQMIKE----IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT-NHILFSNRSAA 145 (150)
Q Consensus 77 ------~~~~~~~~~~~~----l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~-~~~~~~n~a~~ 145 (150)
.....+...+.. +...+.. ......+...+..+...+++++|+..+.+++...|. ....+.+++.+
T Consensus 135 ~~~~~~~~~~~~~~a~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (291)
T COG0457 135 LALGALYELGDYEEALELYEKALELDPEL---NELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLL 211 (291)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHhcCCCc---cchHHHHHHhhhHHHHhcCHHHHHHHHHHHHhhCcccchHHHHHhhHH
Confidence 000011111111 1111100 011445555666677889999999999999999999 68888888888
Q ss_pred HHhc
Q psy3020 146 FAKA 149 (150)
Q Consensus 146 ~~~~ 149 (150)
+...
T Consensus 212 ~~~~ 215 (291)
T COG0457 212 YLKL 215 (291)
T ss_pred HHHc
Confidence 7653
No 203
>PRK10941 hypothetical protein; Provisional
Probab=97.05 E-value=0.0037 Score=45.09 Aligned_cols=63 Identities=10% Similarity=0.014 Sum_probs=56.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP 75 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~ 75 (150)
+=.+|.+.++++.|+.+.+..|.++|+++.-+...|.+ ..+| +...|. .++..-+..-|+++.
T Consensus 187 LK~~~~~~~~~~~AL~~~e~ll~l~P~dp~e~RDRGll~~qL~--------c~~~A~--~DL~~fl~~~P~dp~ 250 (269)
T PRK10941 187 LKAALMEEKQMELALRASEALLQFDPEDPYEIRDRGLIYAQLD--------CEHVAL--SDLSYFVEQCPEDPI 250 (269)
T ss_pred HHHHHHHcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcC--------CcHHHH--HHHHHHHHhCCCchh
Confidence 45689999999999999999999999999999999999 9999 777788 888888888887765
No 204
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=97.02 E-value=0.015 Score=41.32 Aligned_cols=99 Identities=14% Similarity=0.055 Sum_probs=71.8
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHH---HHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMK---EAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~---~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~ 78 (150)
-|......|+|++|++.|+.+....|..+.+- .+++.. ...+ ++..|. ..+.+-+++.|+++..
T Consensus 40 ~g~~~L~~gn~~~A~~~fe~l~~~~p~s~~~~qa~l~l~yA~Yk~~--------~y~~A~--~~~drFi~lyP~~~n~-- 107 (254)
T COG4105 40 EGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNG--------EYDLAL--AYIDRFIRLYPTHPNA-- 107 (254)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHcCCCCcccHHHHHHHHHHHHhcc--------cHHHHH--HHHHHHHHhCCCCCCh--
Confidence 46777889999999999999999999887654 455666 6666 677777 6777777888877652
Q ss_pred ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHH--------hcCHHHHHHHHHHHHhhcCCCc
Q psy3020 79 DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQ--------ANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 79 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~--------~~~~~~A~~~~~~al~~~p~~~ 136 (150)
..+++.+|.+.+. ..--.+|+..|.+.|+.-|++.
T Consensus 108 -----------------------dY~~YlkgLs~~~~i~~~~rDq~~~~~A~~~f~~~i~ryPnS~ 150 (254)
T COG4105 108 -----------------------DYAYYLKGLSYFFQIDDVTRDQSAARAAFAAFKELVQRYPNSR 150 (254)
T ss_pred -----------------------hHHHHHHHHHHhccCCccccCHHHHHHHHHHHHHHHHHCCCCc
Confidence 3334444444432 2233588999999999999863
No 205
>KOG2471|consensus
Probab=97.02 E-value=0.0018 Score=49.94 Aligned_cols=47 Identities=15% Similarity=0.156 Sum_probs=44.4
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
.+.+++.|..+...|+.-.|.+||.+++.....+|.+|..+|.|.+.
T Consensus 335 ~eilYNcG~~~Lh~grPl~AfqCf~~av~vfh~nPrlWLRlAEcCim 381 (696)
T KOG2471|consen 335 MEILYNCGLLYLHSGRPLLAFQCFQKAVHVFHRNPRLWLRLAECCIM 381 (696)
T ss_pred hhhHHhhhHHHHhcCCcHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH
Confidence 67889999999999999999999999999999999999999999874
No 206
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=96.94 E-value=0.0029 Score=31.60 Aligned_cols=31 Identities=29% Similarity=0.409 Sum_probs=26.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
..++.++|.++...|++++|...+.+++++-
T Consensus 2 a~~~~~la~~~~~~g~~~~A~~~~~~al~~~ 32 (42)
T PF13374_consen 2 ASALNNLANAYRAQGRYEEALELLEEALEIR 32 (42)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhhhcchhhHHHHHHHHHH
Confidence 3578899999999999999999999999864
No 207
>KOG1130|consensus
Probab=96.93 E-value=0.0017 Score=49.20 Aligned_cols=44 Identities=9% Similarity=0.065 Sum_probs=33.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHH------HHHHHHHH-Hhhhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQ------MKEAIKDV-RNQEM 45 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~------a~~~l~~~-~~~g~ 45 (150)
+||..|+-+|+|+.|+.+-+.-|.|-..+.+ ++..++.+ .-+|+
T Consensus 200 nLGNTyYlLGdf~~ai~~H~~RL~ia~efGDrAaeRRA~sNlgN~hiflg~ 250 (639)
T KOG1130|consen 200 NLGNTYYLLGDFDQAIHFHKLRLEIAQEFGDRAAERRAHSNLGNCHIFLGN 250 (639)
T ss_pred ccCceeeeeccHHHHHHHHHHHHHHHHHhhhHHHHHHhhcccchhhhhhcc
Confidence 5788888899999999988888887765543 56677777 67773
No 208
>KOG0545|consensus
Probab=96.88 E-value=0.0035 Score=44.48 Aligned_cols=46 Identities=24% Similarity=0.291 Sum_probs=41.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
...+.+..+++...|+|-++++.+++.|..+|+|..+|+.||.++.
T Consensus 230 tpLllNy~QC~L~~~e~yevleh~seiL~~~~~nvKA~frRakAha 275 (329)
T KOG0545|consen 230 TPLLLNYCQCLLKKEEYYEVLEHCSEILRHHPGNVKAYFRRAKAHA 275 (329)
T ss_pred hHHHHhHHHHHhhHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHH
Confidence 4456778889999999999999999999999999999999998875
No 209
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=96.87 E-value=0.012 Score=45.60 Aligned_cols=124 Identities=10% Similarity=0.058 Sum_probs=72.1
Q ss_pred hCCHHHHHHHHHHhhccCCCcHHHHHHHHHH--HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHH
Q psy3020 10 LGRYKESISTYEEGLKLDPNNEQMKEAIKDV--RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIK 87 (150)
Q Consensus 10 ~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~--~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 87 (150)
..+++.-++.-.+||+++|+-++++.-|+.= .-.. ++..-+.++. .--...++.+.... .+ ..+.....
T Consensus 181 ERnp~aRIkaA~eALei~pdCAdAYILLAEEeA~Ti~----Eae~l~rqAv--kAgE~~lg~s~~~~-~~--g~~~e~~~ 251 (539)
T PF04184_consen 181 ERNPQARIKAAKEALEINPDCADAYILLAEEEASTIV----EAEELLRQAV--KAGEASLGKSQFLQ-HH--GHFWEAWH 251 (539)
T ss_pred cCCHHHHHHHHHHHHHhhhhhhHHHhhcccccccCHH----HHHHHHHHHH--HHHHHhhchhhhhh-cc--cchhhhhh
Confidence 4677888899999999999999998877652 1111 0000222232 11111111111000 00 01111100
Q ss_pred HHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC--CcchhhcHHHHHHhc
Q psy3020 88 EIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT--NHILFSNRSAAFAKA 149 (150)
Q Consensus 88 ~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~--~~~~~~n~a~~~~~~ 149 (150)
..+.. ...-+-..++.++.+.|+..|||+.|.+.++.+|. +-..+.|+..|++++
T Consensus 252 ---~Rdt~----~~~y~KrRLAmCarklGr~~EAIk~~rdLlke~p~~~~l~IrenLie~LLel 308 (539)
T PF04184_consen 252 ---RRDTN----VLVYAKRRLAMCARKLGRLREAIKMFRDLLKEFPNLDNLNIRENLIEALLEL 308 (539)
T ss_pred ---ccccc----hhhhhHHHHHHHHHHhCChHHHHHHHHHHHhhCCccchhhHHHHHHHHHHhc
Confidence 00000 11334466899999999999999999999998886 456889999998875
No 210
>KOG3081|consensus
Probab=96.77 E-value=0.027 Score=40.42 Aligned_cols=45 Identities=22% Similarity=0.208 Sum_probs=39.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
+..+..++.+.+.+|+|++|....+.+|..+|++|....|+-.+-
T Consensus 207 ~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL~Nliv~a 251 (299)
T KOG3081|consen 207 PLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPETLANLIVLA 251 (299)
T ss_pred hHHHccHHHHHHHhcCHHHHHHHHHHHHhccCCCHHHHHHHHHHH
Confidence 566777788889999999999999999999999999999887653
No 211
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=96.76 E-value=0.0021 Score=41.72 Aligned_cols=44 Identities=23% Similarity=0.322 Sum_probs=37.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~ 46 (150)
++..+...|++++|+..+++++.+||.+..++..+..+ ..+|+.
T Consensus 68 l~~~~~~~~~~~~a~~~~~~~l~~dP~~E~~~~~lm~~~~~~g~~ 112 (146)
T PF03704_consen 68 LAEALLEAGDYEEALRLLQRALALDPYDEEAYRLLMRALAAQGRR 112 (146)
T ss_dssp HHHHHHHTT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHHHTT-H
T ss_pred HHHHHHhccCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHCcCH
Confidence 56778899999999999999999999999999999999 999944
No 212
>KOG3824|consensus
Probab=96.73 E-value=0.0058 Score=44.81 Aligned_cols=43 Identities=23% Similarity=0.276 Sum_probs=39.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
++.+...|...-..++.-+|-.+|.+|+.++|.|..+..||+.
T Consensus 150 p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseALvnR~R 192 (472)
T KOG3824|consen 150 PQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEALVNRAR 192 (472)
T ss_pred HHHHHHHhHHHHhhhhhHhhhhhhheeeeeCCCchHHHhhhhc
Confidence 7788889998888899999999999999999999999999875
No 213
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=96.71 E-value=0.0085 Score=29.47 Aligned_cols=33 Identities=27% Similarity=0.260 Sum_probs=27.1
Q ss_pred HHHHHHHHHHHHhcCHHHHHHH--HHHHHhhcCCC
Q psy3020 103 SLLKDKGNAALQANNFKEAIEA--YSEAIKLDGTN 135 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~--~~~al~~~p~~ 135 (150)
+.|..+|-.++.+|++++|+.. |.-+..++|.|
T Consensus 2 e~~y~~a~~~y~~~ky~~A~~~~~y~~l~~ld~~n 36 (36)
T PF07720_consen 2 EYLYGLAYNFYQKGKYDEAIHFFQYAFLCALDKYN 36 (36)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHTTT-
T ss_pred cHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhcccC
Confidence 4678889999999999999999 55888888865
No 214
>PLN03218 maturation of RBCL 1; Provisional
Probab=96.71 E-value=0.06 Score=46.08 Aligned_cols=29 Identities=17% Similarity=0.196 Sum_probs=17.1
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
..|..+...+.+.|++++|++.|.+..+.
T Consensus 615 ~tynsLI~ay~k~G~~deAl~lf~eM~~~ 643 (1060)
T PLN03218 615 EVYTIAVNSCSQKGDWDFALSIYDDMKKK 643 (1060)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHHHc
Confidence 45555555566666666666666665554
No 215
>PLN03218 maturation of RBCL 1; Provisional
Probab=96.62 E-value=0.069 Score=45.73 Aligned_cols=46 Identities=20% Similarity=0.191 Sum_probs=33.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh--hcCCCcchhhcHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK--LDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~--~~p~~~~~~~n~a~~~~~ 148 (150)
...|..+...+.+.|++++|++.|.+..+ +.| +...|..+..+|.+
T Consensus 684 ~~tynsLI~ay~k~G~~eeA~~lf~eM~~~g~~P-dvvtyN~LI~gy~k 731 (1060)
T PLN03218 684 TVSYSSLMGACSNAKNWKKALELYEDIKSIKLRP-TVSTMNALITALCE 731 (1060)
T ss_pred HHHHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCC-CHHHHHHHHHHHHH
Confidence 56777888888888888888888888765 455 34556666666654
No 216
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=96.61 E-value=0.026 Score=40.96 Aligned_cols=104 Identities=11% Similarity=0.061 Sum_probs=65.4
Q ss_pred hhhHHHHhh-CCHHHHHHHHHHhhccCC--Cc----HHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCC
Q psy3020 2 TTYSCLSYL-GRYKESISTYEEGLKLDP--NN----EQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRT 73 (150)
Q Consensus 2 ~lg~~~~~~-~~~~~A~~~~~~aL~~~p--~~----~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~ 73 (150)
.+|..|... |++++|++.|++|+++-. +. ...+..++.+ ..+| .+.+|+ ..+.+.....-+.
T Consensus 119 ~lA~~ye~~~~d~e~Ai~~Y~~A~~~y~~e~~~~~a~~~~~~~A~l~~~l~--------~y~~A~--~~~e~~~~~~l~~ 188 (282)
T PF14938_consen 119 ELAEIYEEQLGDYEKAIEYYQKAAELYEQEGSPHSAAECLLKAADLYARLG--------RYEEAI--EIYEEVAKKCLEN 188 (282)
T ss_dssp HHHHHHCCTT--HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTT---------HHHHH--HHHHHHHHTCCCH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHHHHHCCChhhHHHHHHHHHHHHHHhC--------CHHHHH--HHHHHHHHHhhcc
Confidence 478888888 999999999999998622 21 1234566777 7777 666776 4444433322111
Q ss_pred CCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 74 KPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 74 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
... .+ .-...+...+.++...|++..|-+.+++....+|...
T Consensus 189 ~l~----~~-----------------~~~~~~l~a~l~~L~~~D~v~A~~~~~~~~~~~~~F~ 230 (282)
T PF14938_consen 189 NLL----KY-----------------SAKEYFLKAILCHLAMGDYVAARKALERYCSQDPSFA 230 (282)
T ss_dssp CTT----GH-----------------HHHHHHHHHHHHHHHTT-HHHHHHHHHHHGTTSTTST
T ss_pred ccc----ch-----------------hHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCC
Confidence 110 00 0023455667777889999999999999999999654
No 217
>KOG2053|consensus
Probab=96.57 E-value=0.04 Score=45.43 Aligned_cols=101 Identities=16% Similarity=0.036 Sum_probs=69.6
Q ss_pred HhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHH
Q psy3020 8 SYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMI 86 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 86 (150)
...++++.|+....+.++..|+-..+.---|.+ ..+|.. .++. .+.+.+..-+.+
T Consensus 20 ld~~qfkkal~~~~kllkk~Pn~~~a~vLkaLsl~r~gk~--------~ea~---~~Le~~~~~~~~------------- 75 (932)
T KOG2053|consen 20 LDSSQFKKALAKLGKLLKKHPNALYAKVLKALSLFRLGKG--------DEAL---KLLEALYGLKGT------------- 75 (932)
T ss_pred hhhHHHHHHHHHHHHHHHHCCCcHHHHHHHHHHHHHhcCc--------hhHH---HHHhhhccCCCC-------------
Confidence 356899999999999999999988776555666 677733 3333 222222211211
Q ss_pred HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 87 KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 87 ~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
+ -..+..+-.++...+++++|+.+|++++..+|. ....+.+=.+|+
T Consensus 76 --------D------~~tLq~l~~~y~d~~~~d~~~~~Ye~~~~~~P~-eell~~lFmayv 121 (932)
T KOG2053|consen 76 --------D------DLTLQFLQNVYRDLGKLDEAVHLYERANQKYPS-EELLYHLFMAYV 121 (932)
T ss_pred --------c------hHHHHHHHHHHHHHhhhhHHHHHHHHHHhhCCc-HHHHHHHHHHHH
Confidence 1 455666778888999999999999999999998 444444444444
No 218
>KOG2796|consensus
Probab=96.55 E-value=0.033 Score=40.14 Aligned_cols=41 Identities=15% Similarity=0.052 Sum_probs=23.4
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
+..|.-.++|..|...|++.+..||.++-+....+.| ...|
T Consensus 259 a~i~lg~nn~a~a~r~~~~i~~~D~~~~~a~NnKALcllYlg 300 (366)
T KOG2796|consen 259 AFLHLGQNNFAEAHRFFTEILRMDPRNAVANNNKALCLLYLG 300 (366)
T ss_pred hhheecccchHHHHHHHhhccccCCCchhhhchHHHHHHHHH
Confidence 3444455555566666666666666666555555555 5555
No 219
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=96.54 E-value=0.053 Score=40.09 Aligned_cols=138 Identities=13% Similarity=0.067 Sum_probs=86.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCC----cHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCC-C
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPN----NEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTK-P 75 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~----~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~-~ 75 (150)
.++.+..+.|+++-|...+.++...++. .+.+....+.+ ...| +-..|+ ..+...+....... .
T Consensus 151 ~~a~~aRk~g~~~~A~~~l~~~~~~~~~~~~~~~~v~~e~akllw~~g--------~~~~Ai--~~L~~~~~~~~~~~~~ 220 (352)
T PF02259_consen 151 KFAKLARKAGNFQLALSALNRLFQLNPSSESLLPRVFLEYAKLLWAQG--------EQEEAI--QKLRELLKCRLSKNID 220 (352)
T ss_pred HHHHHHHHCCCcHHHHHHHHHHhccCCcccCCCcchHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHHhhhccc
Confidence 3567788999999999999999987632 45677777888 8888 334455 33333333111100 0
Q ss_pred CCCChHHHHHH----HHHh-cCcchhhhHHHhHHHHHHHHHHHHh------cCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 76 FLSDPSYVQMI----KEIQ-KDPSLMTLEAKISLLKDKGNAALQA------NNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 76 ~~~~~~~~~~~----~~l~-~~~~~~~~~~~~~~~~~~g~~~~~~------~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
........... .... ...........+..+..+|...... +.+++++..|.++++++|+....|+..|.
T Consensus 221 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~a~w~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~k~~~~~a~ 300 (352)
T PF02259_consen 221 SISNAELKSGLLESLEVISSTNLDKESKELKAKAFLLLAKWLDELYSKLSSESSDEILKYYKEATKLDPSWEKAWHSWAL 300 (352)
T ss_pred cccHHHHhhccccccccccccchhhhhHHHHHHHHHHHHHHHHhhccccccccHHHHHHHHHHHHHhChhHHHHHHHHHH
Confidence 00000000000 0000 0000111113378888888888777 99999999999999999999999999998
Q ss_pred HHHhc
Q psy3020 145 AFAKA 149 (150)
Q Consensus 145 ~~~~~ 149 (150)
.+.+.
T Consensus 301 ~~~~~ 305 (352)
T PF02259_consen 301 FNDKL 305 (352)
T ss_pred HHHHH
Confidence 87654
No 220
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=96.48 E-value=0.0031 Score=31.49 Aligned_cols=26 Identities=27% Similarity=0.469 Sum_probs=22.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLD 27 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~ 27 (150)
++|.+|...|++++|+..+++++.+.
T Consensus 7 ~la~~~~~~g~~~~A~~~~~~al~~~ 32 (42)
T PF13374_consen 7 NLANAYRAQGRYEEALELLEEALEIR 32 (42)
T ss_dssp HHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHhhhhcchhhHHHHHHHHHH
Confidence 68999999999999999999999753
No 221
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=96.45 E-value=0.029 Score=33.62 Aligned_cols=67 Identities=10% Similarity=0.204 Sum_probs=46.7
Q ss_pred HHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhc
Q psy3020 15 ESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQK 91 (150)
Q Consensus 15 ~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~ 91 (150)
..+..++++++.+|+|.++.+.++.. ...| +++.++ ..+...+..+++...-..+..+...+..+..
T Consensus 6 ~~~~al~~~~a~~P~D~~ar~~lA~~~~~~g--------~~e~Al--~~Ll~~v~~dr~~~~~~ar~~ll~~f~~lg~ 73 (90)
T PF14561_consen 6 PDIAALEAALAANPDDLDARYALADALLAAG--------DYEEAL--DQLLELVRRDRDYEDDAARKRLLDIFELLGP 73 (90)
T ss_dssp HHHHHHHHHHHHSTT-HHHHHHHHHHHHHTT---------HHHHH--HHHHHHHCC-TTCCCCHHHHHHHHHHHHH-T
T ss_pred ccHHHHHHHHHcCCCCHHHHHHHHHHHHHCC--------CHHHHH--HHHHHHHHhCccccccHHHHHHHHHHHHcCC
Confidence 35788999999999999999999999 8889 778888 8888888888776432111234444444443
No 222
>KOG0376|consensus
Probab=96.40 E-value=0.0031 Score=48.29 Aligned_cols=46 Identities=24% Similarity=0.291 Sum_probs=23.1
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
+..+.+++....+.++|.+|+..+.+|++++|....+|+.+|.+.+
T Consensus 38 a~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m 83 (476)
T KOG0376|consen 38 AIYFANRALAHLKVESFGGALHDALKAIELDPTYIKAYVRRGTAVM 83 (476)
T ss_pred eeeechhhhhheeechhhhHHHHHHhhhhcCchhhheeeeccHHHH
Confidence 4444444444445555555555555555555555555555544443
No 223
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=96.36 E-value=0.01 Score=42.48 Aligned_cols=64 Identities=11% Similarity=0.030 Sum_probs=50.5
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCC
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTK 74 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~ 74 (150)
+|||.+++..|+|++|...|..+.+-.|+.+ ++++.+|.+ ..+|. ..+|- ..+...+.--|+..
T Consensus 182 yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~~~~~l~~--------~d~A~--atl~qv~k~YP~t~ 249 (262)
T COG1729 182 YWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGN--------TDEAC--ATLQQVIKRYPGTD 249 (262)
T ss_pred HHHHHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHHHHHHHHHHhcC--------HHHHH--HHHHHHHHHCCCCH
Confidence 4899999999999999999999999888754 678899999 88884 44444 55666666666543
No 224
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=96.33 E-value=0.011 Score=29.42 Aligned_cols=30 Identities=30% Similarity=0.595 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
..+..+|.+-...++|++|++.|.+|+++-
T Consensus 2 dv~~~Lgeisle~e~f~qA~~D~~~aL~i~ 31 (38)
T PF10516_consen 2 DVYDLLGEISLENENFEQAIEDYEKALEIQ 31 (38)
T ss_pred cHHHHHHHHHHHhccHHHHHHHHHHHHHHH
Confidence 457789999999999999999999999874
No 225
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=96.32 E-value=0.12 Score=42.23 Aligned_cols=127 Identities=7% Similarity=-0.033 Sum_probs=76.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhh--ccCCCCCCCC--
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQL--QLDPRTKPFL-- 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l--~~~~~~~~~~-- 77 (150)
+...|.+.|++++|.+.|++.. +.|...|..+... ...| ...+|+ .-+.+.. ...|+...+.
T Consensus 265 Li~~y~k~g~~~~A~~vf~~m~---~~~~vt~n~li~~y~~~g--------~~~eA~--~lf~~M~~~g~~pd~~t~~~l 331 (697)
T PLN03081 265 LIDMYSKCGDIEDARCVFDGMP---EKTTVAWNSMLAGYALHG--------YSEEAL--CLYYEMRDSGVSIDQFTFSIM 331 (697)
T ss_pred HHHHHHHCCCHHHHHHHHHhCC---CCChhHHHHHHHHHHhCC--------CHHHHH--HHHHHHHHcCCCCCHHHHHHH
Confidence 5578899999999999999753 4577888888777 6777 444444 2222221 2334322210
Q ss_pred -------CC-hHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 78 -------SD-PSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 78 -------~~-~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.. ....+....+.+..- ......+..+...+.+.|++++|.+.|.+..+ | +...|..+...|.+.
T Consensus 332 l~a~~~~g~~~~a~~i~~~m~~~g~----~~d~~~~~~Li~~y~k~G~~~~A~~vf~~m~~--~-d~~t~n~lI~~y~~~ 404 (697)
T PLN03081 332 IRIFSRLALLEHAKQAHAGLIRTGF----PLDIVANTALVDLYSKWGRMEDARNVFDRMPR--K-NLISWNALIAGYGNH 404 (697)
T ss_pred HHHHHhccchHHHHHHHHHHHHhCC----CCCeeehHHHHHHHHHCCCHHHHHHHHHhCCC--C-CeeeHHHHHHHHHHc
Confidence 00 000011111100000 01145677888999999999999999998864 4 567788888777653
No 226
>KOG4507|consensus
Probab=96.29 E-value=0.019 Score=45.51 Aligned_cols=100 Identities=17% Similarity=0.198 Sum_probs=70.5
Q ss_pred HhhCCHHHHHHHHHHhhccCCCcHHH-HHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHH
Q psy3020 8 SYLGRYKESISTYEEGLKLDPNNEQM-KEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQM 85 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~~p~~~~a-~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 85 (150)
...|+...|+.|+..|+-..|.-..+ ...|+.+ ...|.. ..|- .-+..++.++..
T Consensus 618 r~~gn~~~a~~cl~~a~~~~p~~~~v~~v~la~~~~~~~~~--------~da~--~~l~q~l~~~~s------------- 674 (886)
T KOG4507|consen 618 RAVGNSTFAIACLQRALNLAPLQQDVPLVNLANLLIHYGLH--------LDAT--KLLLQALAINSS------------- 674 (886)
T ss_pred eecCCcHHHHHHHHHHhccChhhhcccHHHHHHHHHHhhhh--------ccHH--HHHHHHHhhccc-------------
Confidence 45688999999999999999965543 4567776 555522 1122 233333333321
Q ss_pred HHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHH
Q psy3020 86 IKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAA 145 (150)
Q Consensus 86 ~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~ 145 (150)
. +..+..+|+.+....+.++|++.|+.|+.++|+++..-..+-.+
T Consensus 675 ---------e------pl~~~~~g~~~l~l~~i~~a~~~~~~a~~~~~~~~~~~~~l~~i 719 (886)
T KOG4507|consen 675 ---------E------PLTFLSLGNAYLALKNISGALEAFRQALKLTTKCPECENSLKLI 719 (886)
T ss_pred ---------C------chHHHhcchhHHHHhhhHHHHHHHHHHHhcCCCChhhHHHHHHH
Confidence 1 66788899999999999999999999999999998766554433
No 227
>KOG3824|consensus
Probab=96.27 E-value=0.0087 Score=43.93 Aligned_cols=63 Identities=14% Similarity=0.200 Sum_probs=48.6
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
+.-....|+.++|...|+.||.++|+|++++..+|.. ..-+ ++.+|- .-+.+++.++|.+.++
T Consensus 123 A~~~~~~Gk~ekA~~lfeHAlalaP~~p~~L~e~G~f~E~~~--------~iv~AD--q~Y~~ALtisP~nseA 186 (472)
T KOG3824|consen 123 AGRSRKDGKLEKAMTLFEHALALAPTNPQILIEMGQFREMHN--------EIVEAD--QCYVKALTISPGNSEA 186 (472)
T ss_pred HHHHHhccchHHHHHHHHHHHhcCCCCHHHHHHHhHHHHhhh--------hhHhhh--hhhheeeeeCCCchHH
Confidence 4455678999999999999999999999999999988 3323 344443 5666777788877764
No 228
>PRK04841 transcriptional regulator MalT; Provisional
Probab=96.25 E-value=0.074 Score=44.51 Aligned_cols=98 Identities=13% Similarity=0.026 Sum_probs=66.3
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHH-----HHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQ-----MKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~-----a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
+|.++...|++++|...++++++..|.... +...++.+ ...| ++.++. ..+..++........
T Consensus 458 ~a~~~~~~g~~~~A~~~~~~al~~~~~~~~~~~~~a~~~lg~~~~~~G--------~~~~A~--~~~~~al~~~~~~g~- 526 (903)
T PRK04841 458 RAQVAINDGDPEEAERLAELALAELPLTWYYSRIVATSVLGEVHHCKG--------ELARAL--AMMQQTEQMARQHDV- 526 (903)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHcC--------CHHHHH--HHHHHHHHHHhhhcc-
Confidence 467788999999999999999986554322 33455666 6677 555555 444433332221100
Q ss_pred CCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 77 LSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 77 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
.. .....+..+|.+++..|++++|...+.+++++-
T Consensus 527 ---------------~~------~~~~~~~~la~~~~~~G~~~~A~~~~~~al~~~ 561 (903)
T PRK04841 527 ---------------YH------YALWSLLQQSEILFAQGFLQAAYETQEKAFQLI 561 (903)
T ss_pred ---------------hH------HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH
Confidence 00 114567788999999999999999999999873
No 229
>KOG3785|consensus
Probab=96.14 E-value=0.062 Score=40.39 Aligned_cols=136 Identities=17% Similarity=0.107 Sum_probs=83.5
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC-
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS- 78 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~- 78 (150)
+.|+..+.-.-.|++|++.|.++|.-+|+.......++.| ..+. -+.-+. .-+...++..|+...+.+
T Consensus 155 LSLAsvhYmR~HYQeAIdvYkrvL~dn~ey~alNVy~ALCyyKlD--------Yydvsq--evl~vYL~q~pdStiA~NL 224 (557)
T KOG3785|consen 155 LSLASVHYMRMHYQEAIDVYKRVLQDNPEYIALNVYMALCYYKLD--------YYDVSQ--EVLKVYLRQFPDSTIAKNL 224 (557)
T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHhcChhhhhhHHHHHHHHHhcc--------hhhhHH--HHHHHHHHhCCCcHHHHHH
Confidence 3577778888899999999999999999988887788877 6666 222222 222233445555443210
Q ss_pred ----------ChHHHHHHHHHhcCcc-hhhh----------------------H--HH--hHHHHHHHHHHHHhcCHHHH
Q psy3020 79 ----------DPSYVQMIKEIQKDPS-LMTL----------------------E--AK--ISLLKDKGNAALQANNFKEA 121 (150)
Q Consensus 79 ----------~~~~~~~~~~l~~~~~-~~~~----------------------~--~~--~~~~~~~g~~~~~~~~~~~A 121 (150)
......-+..+..+.+ .... | .+ +++-.++..-+..+++..+|
T Consensus 225 kacn~fRl~ngr~ae~E~k~ladN~~~~~~f~~~l~rHNLVvFrngEgALqVLP~L~~~IPEARlNL~iYyL~q~dVqeA 304 (557)
T KOG3785|consen 225 KACNLFRLINGRTAEDEKKELADNIDQEYPFIEYLCRHNLVVFRNGEGALQVLPSLMKHIPEARLNLIIYYLNQNDVQEA 304 (557)
T ss_pred HHHHHhhhhccchhHHHHHHHHhcccccchhHHHHHHcCeEEEeCCccHHHhchHHHhhChHhhhhheeeecccccHHHH
Confidence 0000111111111111 0001 1 11 66677777777899999998
Q ss_pred HHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 122 IEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 122 ~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..+. +++|..|--|.-.|.++..+
T Consensus 305 ~~L~K---dl~PttP~EyilKgvv~aal 329 (557)
T KOG3785|consen 305 ISLCK---DLDPTTPYEYILKGVVFAAL 329 (557)
T ss_pred HHHHh---hcCCCChHHHHHHHHHHHHh
Confidence 87654 68999998888888877653
No 230
>KOG4340|consensus
Probab=96.13 E-value=0.016 Score=42.47 Aligned_cols=112 Identities=13% Similarity=0.161 Sum_probs=68.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhh--CChHHHHhhccCCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLF--SDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~--~~~~~~~~l~~~~~~~~~~~ 78 (150)
++|-..++.|+|++|++-|+.|++...-++..-.+++.+ +..+ ++..|+ -+.-+.+.++..|+.-.-..
T Consensus 149 n~gCllykegqyEaAvqkFqaAlqvsGyqpllAYniALaHy~~~--------qyasALk~iSEIieRG~r~HPElgIGm~ 220 (459)
T KOG4340|consen 149 NLGCLLYKEGQYEAAVQKFQAALQVSGYQPLLAYNLALAHYSSR--------QYASALKHISEIIERGIRQHPELGIGMT 220 (459)
T ss_pred cchheeeccccHHHHHHHHHHHHhhcCCCchhHHHHHHHHHhhh--------hHHHHHHHHHHHHHhhhhcCCccCccce
Confidence 356678899999999999999999999999988899988 8888 445554 22344556677775443110
Q ss_pred C--hHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHH
Q psy3020 79 D--PSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYS 126 (150)
Q Consensus 79 ~--~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~ 126 (150)
. +++ +. ++ ++..+-.-.-.+++..+.-+.++.|+++.|.+.++
T Consensus 221 tegiDv-rs---vg-Nt~~lh~Sal~eAfNLKaAIeyq~~n~eAA~eaLt 265 (459)
T KOG4340|consen 221 TEGIDV-RS---VG-NTLVLHQSALVEAFNLKAAIEYQLRNYEAAQEALT 265 (459)
T ss_pred eccCch-hc---cc-chHHHHHHHHHHHhhhhhhhhhhcccHHHHHHHhh
Confidence 0 010 00 00 00000000004566666777777777777766544
No 231
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=96.13 E-value=0.024 Score=36.71 Aligned_cols=48 Identities=23% Similarity=0.319 Sum_probs=41.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..++..++..+...|++++|++.+.+++.++|.+..+|..+-.+|..+
T Consensus 62 ~~~~~~l~~~~~~~~~~~~a~~~~~~~l~~dP~~E~~~~~lm~~~~~~ 109 (146)
T PF03704_consen 62 LDALERLAEALLEAGDYEEALRLLQRALALDPYDEEAYRLLMRALAAQ 109 (146)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHHHT
T ss_pred HHHHHHHHHHHHhccCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHC
Confidence 567777888889999999999999999999999999999988888765
No 232
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=96.09 E-value=0.14 Score=40.03 Aligned_cols=45 Identities=27% Similarity=0.406 Sum_probs=37.9
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCC--cHHHHHHHHHH-Hhhhhc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPN--NEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~--~~~a~~~l~~~-~~~g~~ 46 (150)
.+|+|..++|+.++|++.|...++..|. +..++..|..+ ..++.+
T Consensus 264 RLAmCarklGr~~EAIk~~rdLlke~p~~~~l~IrenLie~LLelq~Y 311 (539)
T PF04184_consen 264 RLAMCARKLGRLREAIKMFRDLLKEFPNLDNLNIRENLIEALLELQAY 311 (539)
T ss_pred HHHHHHHHhCChHHHHHHHHHHHhhCCccchhhHHHHHHHHHHhcCCH
Confidence 3789999999999999999999998875 56788888888 777733
No 233
>PLN03077 Protein ECB2; Provisional
Probab=96.01 E-value=0.099 Score=43.73 Aligned_cols=113 Identities=7% Similarity=0.003 Sum_probs=69.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHh--hccCCCCCCCC--
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQ--LQLDPRTKPFL-- 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~--l~~~~~~~~~~-- 77 (150)
+-..|.+.|++++|.+.|+.. +.|...|..+... -..| ...+|+ .-+.+- ....|+...+.
T Consensus 530 Li~~y~k~G~~~~A~~~f~~~----~~d~~s~n~lI~~~~~~G--------~~~~A~--~lf~~M~~~g~~Pd~~T~~~l 595 (857)
T PLN03077 530 LLDLYVRCGRMNYAWNQFNSH----EKDVVSWNILLTGYVAHG--------KGSMAV--ELFNRMVESGVNPDEVTFISL 595 (857)
T ss_pred HHHHHHHcCCHHHHHHHHHhc----CCChhhHHHHHHHHHHcC--------CHHHHH--HHHHHHHHcCCCCCcccHHHH
Confidence 456789999999999999986 5567788877766 6666 334444 333322 24566654421
Q ss_pred -----CCh---HHHHHHHHHh-cCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 78 -----SDP---SYVQMIKEIQ-KDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 78 -----~~~---~~~~~~~~l~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
..+ .....+.... ..+-. +....|..+...+.+.|++++|.+.+++. .+.|+
T Consensus 596 l~a~~~~g~v~ea~~~f~~M~~~~gi~----P~~~~y~~lv~~l~r~G~~~eA~~~~~~m-~~~pd 656 (857)
T PLN03077 596 LCACSRSGMVTQGLEYFHSMEEKYSIT----PNLKHYACVVDLLGRAGKLTEAYNFINKM-PITPD 656 (857)
T ss_pred HHHHhhcChHHHHHHHHHHHHHHhCCC----CchHHHHHHHHHHHhCCCHHHHHHHHHHC-CCCCC
Confidence 000 1111111111 11000 22567888999999999999999999886 46675
No 234
>KOG0551|consensus
Probab=96.01 E-value=0.025 Score=41.85 Aligned_cols=48 Identities=21% Similarity=0.313 Sum_probs=44.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..|.+++-+.+..|+|..|+..+++++.++|++..+++.-|.|++++
T Consensus 119 avLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka~~R~Akc~~eL 166 (390)
T KOG0551|consen 119 AVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKAYIRGAKCLLEL 166 (390)
T ss_pred HHHHhhHHHHHHHHHHHHHHHHHHHHHHhcCcchhhhhhhhhHHHHHH
Confidence 677888999999999999999999999999999999999999998875
No 235
>KOG1941|consensus
Probab=95.97 E-value=0.047 Score=41.05 Aligned_cols=43 Identities=12% Similarity=0.253 Sum_probs=32.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHH------HHHHHHH-Hhhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQM------KEAIKDV-RNQE 44 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a------~~~l~~~-~~~g 44 (150)
.+|.++..++.++++++.|++|+++--++.+. .-+|+.+ -.+.
T Consensus 127 ~~~~Ahlgls~fq~~Lesfe~A~~~A~~~~D~~LElqvcv~Lgslf~~l~ 176 (518)
T KOG1941|consen 127 SMGNAHLGLSVFQKALESFEKALRYAHNNDDAMLELQVCVSLGSLFAQLK 176 (518)
T ss_pred hHHHHhhhHHHHHHHHHHHHHHHHHhhccCCceeeeehhhhHHHHHHHHH
Confidence 57899999999999999999999987666554 3455555 4444
No 236
>KOG2610|consensus
Probab=95.84 E-value=0.055 Score=40.35 Aligned_cols=45 Identities=13% Similarity=0.074 Sum_probs=38.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~ 46 (150)
+.+.++.+.|-|++|.+.-+++|+++|.+..+...++-+ .-.++.
T Consensus 180 myaFgL~E~g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVlem~~r~ 225 (491)
T KOG2610|consen 180 MYAFGLEECGIYDDAEKQADRALQINRFDCWASHAKAHVLEMNGRH 225 (491)
T ss_pred HHHhhHHHhccchhHHHHHHhhccCCCcchHHHHHHHHHHHhcchh
Confidence 356788899999999999999999999999999999888 555533
No 237
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=95.73 E-value=0.17 Score=38.28 Aligned_cols=114 Identities=18% Similarity=0.122 Sum_probs=62.3
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHHHhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC-----
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDVRNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL----- 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~----- 77 (150)
++.-+...|..++|.+....+|+-.-|.. ...-+..+ ..+ +....+ ..+.+.++..|+.+..+
T Consensus 269 ~a~~li~l~~~~~A~~~i~~~Lk~~~D~~-L~~~~~~l-~~~--------d~~~l~--k~~e~~l~~h~~~p~L~~tLG~ 336 (400)
T COG3071 269 YAERLIRLGDHDEAQEIIEDALKRQWDPR-LCRLIPRL-RPG--------DPEPLI--KAAEKWLKQHPEDPLLLSTLGR 336 (400)
T ss_pred HHHHHHHcCChHHHHHHHHHHHHhccChh-HHHHHhhc-CCC--------CchHHH--HHHHHHHHhCCCChhHHHHHHH
Confidence 44556677888888888888877665543 22222222 111 222222 34444555556555321
Q ss_pred ---CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 78 ---SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 78 ---~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
.+..+.++-..++... ...+....+..+|.++-+.|+..+|.+++++++.+
T Consensus 337 L~~k~~~w~kA~~~leaAl---~~~~s~~~~~~la~~~~~~g~~~~A~~~r~e~L~~ 390 (400)
T COG3071 337 LALKNKLWGKASEALEAAL---KLRPSASDYAELADALDQLGEPEEAEQVRREALLL 390 (400)
T ss_pred HHHHhhHHHHHHHHHHHHH---hcCCChhhHHHHHHHHHHcCChHHHHHHHHHHHHH
Confidence 1111111111111100 00012678899999999999999999999999854
No 238
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=95.68 E-value=0.074 Score=31.84 Aligned_cols=46 Identities=17% Similarity=0.291 Sum_probs=36.5
Q ss_pred HHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 84 QMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 84 ~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
.....+..+|.+ ..+...++..+...|++++|++.+-+.+..+|+.
T Consensus 10 al~~~~a~~P~D------~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~ 55 (90)
T PF14561_consen 10 ALEAALAANPDD------LDARYALADALLAAGDYEEALDQLLELVRRDRDY 55 (90)
T ss_dssp HHHHHHHHSTT-------HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC-TTC
T ss_pred HHHHHHHcCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccc
Confidence 334445666666 8899999999999999999999999999999876
No 239
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=95.62 E-value=0.62 Score=35.35 Aligned_cols=132 Identities=8% Similarity=0.052 Sum_probs=79.9
Q ss_pred hhHHHHh---hCCHHHHHHHHHH-hhccCCCcHHHHHHHHHH-HhhhhccC-CCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 3 TYSCLSY---LGRYKESISTYEE-GLKLDPNNEQMKEAIKDV-RNQEMNDM-NRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 3 lg~~~~~---~~~~~~A~~~~~~-aL~~~p~~~~a~~~l~~~-~~~g~~~~-~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
.|.++-. .|+.++|+..+.. ...-++.+++.+--+|.+ ..+-.... .-.....+|+ .-|.++...+|+.+.-
T Consensus 185 yafALnRrn~~gdre~Al~il~~~l~~~~~~~~d~~gL~GRIyKD~~~~s~~~d~~~ldkAi--~~Y~kgFe~~~~~Y~G 262 (374)
T PF13281_consen 185 YAFALNRRNKPGDREKALQILLPVLESDENPDPDTLGLLGRIYKDLFLESNFTDRESLDKAI--EWYRKGFEIEPDYYSG 262 (374)
T ss_pred HHHHHhhcccCCCHHHHHHHHHHHHhccCCCChHHHHHHHHHHHHHHHHcCccchHHHHHHH--HHHHHHHcCCccccch
Confidence 4667777 8999999999999 555666788888777887 44332110 1112467777 8888888888876652
Q ss_pred CCC--------------hHHHH---HHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 77 LSD--------------PSYVQ---MIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 77 ~~~--------------~~~~~---~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
.+- ....+ .+..+-.................+..+..-.|++++|++.+++++.++|...
T Consensus 263 IN~AtLL~~~g~~~~~~~el~~i~~~l~~llg~kg~~~~~~dYWd~ATl~Ea~vL~~d~~ka~~a~e~~~~l~~~~W 339 (374)
T PF13281_consen 263 INAATLLMLAGHDFETSEELRKIGVKLSSLLGRKGSLEKMQDYWDVATLLEASVLAGDYEKAIQAAEKAFKLKPPAW 339 (374)
T ss_pred HHHHHHHHHcCCcccchHHHHHHHHHHHHHHHhhccccccccHHHHHHHHHHHHHcCCHHHHHHHHHHHhhcCCcch
Confidence 110 00000 0001000000000012244445566677789999999999999999987554
No 240
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=95.55 E-value=0.017 Score=28.73 Aligned_cols=26 Identities=15% Similarity=0.225 Sum_probs=23.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLD 27 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~ 27 (150)
.||..-...++|+.|+++|+++|++.
T Consensus 6 ~Lgeisle~e~f~qA~~D~~~aL~i~ 31 (38)
T PF10516_consen 6 LLGEISLENENFEQAIEDYEKALEIQ 31 (38)
T ss_pred HHHHHHHHhccHHHHHHHHHHHHHHH
Confidence 47889999999999999999999864
No 241
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=95.54 E-value=0.23 Score=36.08 Aligned_cols=121 Identities=9% Similarity=-0.019 Sum_probs=64.4
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHH
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYV 83 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~ 83 (150)
......+..+.|...|.+|++-.+....+|...|.+ +..+. +...|. .-+...++..|....+.. .|.
T Consensus 9 ~~~~r~~g~~~aR~vF~~a~~~~~~~~~vy~~~A~~E~~~~~-------d~~~A~--~Ife~glk~f~~~~~~~~--~Y~ 77 (280)
T PF05843_consen 9 RFMRRTEGIEAARKVFKRARKDKRCTYHVYVAYALMEYYCNK-------DPKRAR--KIFERGLKKFPSDPDFWL--EYL 77 (280)
T ss_dssp HHHHHHHHHHHHHHHHHHHHCCCCS-THHHHHHHHHHHHTCS--------HHHHH--HHHHHHHHHHTT-HHHHH--HHH
T ss_pred HHHHHhCChHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHhCC-------CHHHHH--HHHHHHHHHCCCCHHHHH--HHH
Confidence 344455567777888888876666667777777777 55432 223244 444455555555443210 111
Q ss_pred HHHHHHhcCcchhhh-----------HH-HhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 84 QMIKEIQKDPSLMTL-----------EA-KISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 84 ~~~~~l~~~~~~~~~-----------~~-~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
..+..+. +...... .. -...|......-.+-|+.+...+.+.++.+..|++..
T Consensus 78 ~~l~~~~-d~~~aR~lfer~i~~l~~~~~~~~iw~~~i~fE~~~Gdl~~v~~v~~R~~~~~~~~~~ 142 (280)
T PF05843_consen 78 DFLIKLN-DINNARALFERAISSLPKEKQSKKIWKKFIEFESKYGDLESVRKVEKRAEELFPEDNS 142 (280)
T ss_dssp HHHHHTT--HHHHHHHHHHHCCTSSCHHHCHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHTTTS-H
T ss_pred HHHHHhC-cHHHHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhhhhhH
Confidence 1111110 0000000 01 1345666666667889999999999999999998543
No 242
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=95.46 E-value=0.3 Score=34.14 Aligned_cols=48 Identities=17% Similarity=0.123 Sum_probs=37.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc-chhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH-ILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~-~~~~n~a~~~~~~ 149 (150)
...++.+|.+.++.|++++|++.|.+.+..-..+. ....++|.=.-++
T Consensus 165 ~~l~YLigeL~rrlg~~~eA~~~fs~vi~~~~~s~~~~l~~~AR~~w~~ 213 (214)
T PF09986_consen 165 ATLLYLIGELNRRLGNYDEAKRWFSRVIGSKKASKEPKLKDMARDQWQL 213 (214)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHcCCCCCCcHHHHHHHHHHHHh
Confidence 67788899999999999999999999998655443 4666666655444
No 243
>KOG3081|consensus
Probab=95.45 E-value=0.087 Score=37.91 Aligned_cols=43 Identities=26% Similarity=0.321 Sum_probs=39.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
..+.|+..+|||++|....+.||.-+|++++.+-++..+ ...|
T Consensus 212 G~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL~Nliv~a~~~G 255 (299)
T KOG3081|consen 212 GQAVCHLQLGRYEEAESLLEEALDKDAKDPETLANLIVLALHLG 255 (299)
T ss_pred cHHHHHHHhcCHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhC
Confidence 568899999999999999999999999999999988877 6667
No 244
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=95.44 E-value=0.46 Score=32.31 Aligned_cols=49 Identities=20% Similarity=0.213 Sum_probs=42.4
Q ss_pred HhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhcC
Q psy3020 101 KISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKAI 150 (150)
Q Consensus 101 ~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~~ 150 (150)
+..+-..+|-..++.|+|.+|...|..... |.+.|..-.+|+.+.+.+|
T Consensus 166 R~sArEALglAa~kagd~a~A~~~F~qia~-Da~aprnirqRAq~mldlI 214 (221)
T COG4649 166 RHSAREALGLAAYKAGDFAKAKSWFVQIAN-DAQAPRNIRQRAQIMLDLI 214 (221)
T ss_pred HHHHHHHHhHHHHhccchHHHHHHHHHHHc-cccCcHHHHHHHHHHHHHH
Confidence 366667799999999999999999998887 7888888899999988765
No 245
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=95.33 E-value=0.021 Score=28.09 Aligned_cols=29 Identities=28% Similarity=0.232 Sum_probs=23.4
Q ss_pred hhhHHHHhhCCHHHHHHH--HHHhhccCCCc
Q psy3020 2 TTYSCLSYLGRYKESIST--YEEGLKLDPNN 30 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~--~~~aL~~~p~~ 30 (150)
.+|..+...|++++|++. |.-+..++|.|
T Consensus 6 ~~a~~~y~~~ky~~A~~~~~y~~l~~ld~~n 36 (36)
T PF07720_consen 6 GLAYNFYQKGKYDEAIHFFQYAFLCALDKYN 36 (36)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHHHHTTT-
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHHHHhcccC
Confidence 578899999999999999 55888888865
No 246
>PRK10941 hypothetical protein; Provisional
Probab=95.32 E-value=0.054 Score=39.16 Aligned_cols=48 Identities=17% Similarity=0.250 Sum_probs=42.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..-+.++-.++.+.++++.|+++.+..+.++|+++.-+..||.+|.++
T Consensus 181 ~Rml~nLK~~~~~~~~~~~AL~~~e~ll~l~P~dp~e~RDRGll~~qL 228 (269)
T PRK10941 181 RKLLDTLKAALMEEKQMELALRASEALLQFDPEDPYEIRDRGLIYAQL 228 (269)
T ss_pred HHHHHHHHHHHHHcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHc
Confidence 344556667778999999999999999999999999999999999876
No 247
>KOG1915|consensus
Probab=95.17 E-value=0.29 Score=38.23 Aligned_cols=100 Identities=15% Similarity=0.195 Sum_probs=69.6
Q ss_pred HhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHH
Q psy3020 8 SYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMI 86 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 86 (150)
.++++++.+...|++-|+-.|.|-.+|...|.+ ..+|+++ .....+ .-++. .|. ++
T Consensus 448 lqL~efDRcRkLYEkfle~~Pe~c~~W~kyaElE~~Lgdtd-----RaRaif-----elAi~-qp~----ld-------- 504 (677)
T KOG1915|consen 448 LQLREFDRCRKLYEKFLEFSPENCYAWSKYAELETSLGDTD-----RARAIF-----ELAIS-QPA----LD-------- 504 (677)
T ss_pred HHHhhHHHHHHHHHHHHhcChHhhHHHHHHHHHHHHhhhHH-----HHHHHH-----HHHhc-Ccc----cc--------
Confidence 467899999999999999999999999999999 9999653 222111 11111 111 10
Q ss_pred HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 87 KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 87 ~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
.....|......-...|.++.|-..|++.|+..+... +|...|.
T Consensus 505 -------------mpellwkaYIdFEi~~~E~ekaR~LYerlL~rt~h~k-vWisFA~ 548 (677)
T KOG1915|consen 505 -------------MPELLWKAYIDFEIEEGEFEKARALYERLLDRTQHVK-VWISFAK 548 (677)
T ss_pred -------------cHHHHHHHhhhhhhhcchHHHHHHHHHHHHHhcccch-HHHhHHH
Confidence 0033455555566788999999999999999998554 6655543
No 248
>KOG2610|consensus
Probab=95.14 E-value=0.25 Score=36.98 Aligned_cols=106 Identities=8% Similarity=-0.020 Sum_probs=65.9
Q ss_pred HHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhcc-CCCCCCCCCChHHH
Q psy3020 6 CLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQL-DPRTKPFLSDPSYV 83 (150)
Q Consensus 6 ~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~-~~~~~~~~~~~~~~ 83 (150)
.+...|++.+|...+++.|+--|.+.-++...-+. ..+|... ... +.+.+.+.. +|+.+-+
T Consensus 112 i~~~~g~~h~a~~~wdklL~d~PtDlla~kfsh~a~fy~G~~~-----~~k-----~ai~kIip~wn~dlp~~------- 174 (491)
T KOG2610|consen 112 ILWGRGKHHEAAIEWDKLLDDYPTDLLAVKFSHDAHFYNGNQI-----GKK-----NAIEKIIPKWNADLPCY------- 174 (491)
T ss_pred HhhccccccHHHHHHHHHHHhCchhhhhhhhhhhHHHhccchh-----hhh-----hHHHHhccccCCCCcHH-------
Confidence 34556788888888899999888887776655555 5555321 111 222222222 3332210
Q ss_pred HHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 84 QMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 84 ~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
....-..+-.+..-|-|++|.+.-.+++++||.+..+.+-.+.++
T Consensus 175 ------------------sYv~GmyaFgL~E~g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVl 219 (491)
T KOG2610|consen 175 ------------------SYVHGMYAFGLEECGIYDDAEKQADRALQINRFDCWASHAKAHVL 219 (491)
T ss_pred ------------------HHHHHHHHhhHHHhccchhHHHHHHhhccCCCcchHHHHHHHHHH
Confidence 111122333456789999999999999999999887776665544
No 249
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=94.94 E-value=0.047 Score=38.51 Aligned_cols=61 Identities=18% Similarity=0.189 Sum_probs=51.2
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP 75 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~ 75 (150)
....+.++.+.|.+.|.+||.+-|+....|..++.. ...| ++..|. ..+.+.++++|.+..
T Consensus 3 ~~~~~~~D~~aaaely~qal~lap~w~~gwfR~g~~~ekag--------~~daAa--~a~~~~L~ldp~D~~ 64 (287)
T COG4976 3 YMLAESGDAEAAAELYNQALELAPEWAAGWFRLGEYTEKAG--------EFDAAA--AAYEEVLELDPEDHG 64 (287)
T ss_pred chhcccCChHHHHHHHHHHhhcCchhhhhhhhcchhhhhcc--------cHHHHH--HHHHHHHcCCccccc
Confidence 445678899999999999999999999999999999 8888 555555 677888888987653
No 250
>KOG1310|consensus
Probab=94.84 E-value=0.14 Score=40.44 Aligned_cols=94 Identities=15% Similarity=0.065 Sum_probs=69.6
Q ss_pred HHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHH
Q psy3020 7 LSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQM 85 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 85 (150)
-+..+....|+.+|.++++--|+..+.+...+.+ ..-+ +. . +.--|+ -++..+++++|..
T Consensus 384 ~ly~~~~~~~i~~~s~a~q~~~~~~~~l~nraa~lmkRk-W~--~--d~~~Al--rDch~Alrln~s~------------ 444 (758)
T KOG1310|consen 384 GLYESIVSGAISHYSRAIQYVPDAIYLLENRAAALMKRK-WR--G--DSYLAL--RDCHVALRLNPSI------------ 444 (758)
T ss_pred hhhhHHHHHHHHHHHHHhhhccchhHHHHhHHHHHHhhh-cc--c--cHHHHH--HhHHhhccCChHH------------
Confidence 3445667789999999999999999998888776 3222 11 0 222233 4666666666632
Q ss_pred HHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 86 IKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 86 ~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
..+|+.++.++...+++.+|+.+...+...+|.+
T Consensus 445 ----------------~kah~~la~aL~el~r~~eal~~~~alq~~~Ptd 478 (758)
T KOG1310|consen 445 ----------------QKAHFRLARALNELTRYLEALSCHWALQMSFPTD 478 (758)
T ss_pred ----------------HHHHHHHHHHHHHHhhHHHhhhhHHHHhhcCchh
Confidence 7889999999999999999999999998899954
No 251
>PRK04841 transcriptional regulator MalT; Provisional
Probab=94.59 E-value=0.38 Score=40.36 Aligned_cols=100 Identities=13% Similarity=0.082 Sum_probs=64.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCC--------cHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPN--------NEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPR 72 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~--------~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~ 72 (150)
.+|.++...|++++|...+++++.+-.. ....+..++.+ ...| ++.++. ..+...+.....
T Consensus 536 ~la~~~~~~G~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~la~~~~~~G--------~~~~A~--~~~~~al~~~~~ 605 (903)
T PRK04841 536 QQSEILFAQGFLQAAYETQEKAFQLIEEQHLEQLPMHEFLLRIRAQLLWEWA--------RLDEAE--QCARKGLEVLSN 605 (903)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHhc--------CHHHHH--HHHHHhHHhhhc
Confidence 3577888889999999999988885221 22334456777 6777 444444 333333322211
Q ss_pred CCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 73 TKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 73 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
... . .....+..+|.+....|++++|.+.+.+++.+.+.
T Consensus 606 ~~~-----------------~------~~~~~~~~la~~~~~~G~~~~A~~~l~~a~~~~~~ 644 (903)
T PRK04841 606 YQP-----------------Q------QQLQCLAMLAKISLARGDLDNARRYLNRLENLLGN 644 (903)
T ss_pred cCc-----------------h------HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHhc
Confidence 000 0 01456677888999999999999999999887543
No 252
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=94.48 E-value=0.078 Score=23.77 Aligned_cols=24 Identities=17% Similarity=-0.010 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHH
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYS 126 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~ 126 (150)
.+...+|.++...|++++|...++
T Consensus 2 ~a~~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 2 RARLALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHh
Confidence 456789999999999999998875
No 253
>KOG2396|consensus
Probab=94.46 E-value=0.45 Score=37.30 Aligned_cols=87 Identities=8% Similarity=0.103 Sum_probs=60.0
Q ss_pred HHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcc
Q psy3020 16 SISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPS 94 (150)
Q Consensus 16 A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~ 94 (150)
=...|++|+.-=|.++..|...... ...+. +.+.- .-+...+..+|+.
T Consensus 90 Iv~lyr~at~rf~~D~~lW~~yi~f~kk~~~--------~~~v~--ki~~~~l~~Hp~~--------------------- 138 (568)
T KOG2396|consen 90 IVFLYRRATNRFNGDVKLWLSYIAFCKKKKT--------YGEVK--KIFAAMLAKHPNN--------------------- 138 (568)
T ss_pred HHHHHHHHHHhcCCCHHHHHHHHHHHHHhcc--------hhHHH--HHHHHHHHhCCCC---------------------
Confidence 3467788888777778777766555 44441 11111 3334445555543
Q ss_pred hhhhHHHhHHHHHHHHHHHHhcC-HHHHHHHHHHHHhhcCCCcchhh
Q psy3020 95 LMTLEAKISLLKDKGNAALQANN-FKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 95 ~~~~~~~~~~~~~~g~~~~~~~~-~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
+..|..-+...+..+. .+.|-..|.++|..+|++|.+|.
T Consensus 139 -------~dLWI~aA~wefe~n~ni~saRalflrgLR~npdsp~Lw~ 178 (568)
T KOG2396|consen 139 -------PDLWIYAAKWEFEINLNIESARALFLRGLRFNPDSPKLWK 178 (568)
T ss_pred -------chhHHhhhhhHHhhccchHHHHHHHHHHhhcCCCChHHHH
Confidence 6778888877887666 88999999999999999998875
No 254
>KOG3364|consensus
Probab=94.44 E-value=0.12 Score=33.50 Aligned_cols=38 Identities=24% Similarity=0.239 Sum_probs=32.7
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
|+..+.++++|+.|+...+..|+.+|+|..+..-.-.+
T Consensus 77 LAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~Lk~~i 114 (149)
T KOG3364|consen 77 LAVGHYRLKEYSKSLRYVDALLETEPNNRQALELKETI 114 (149)
T ss_pred hHHHHHHHhhHHHHHHHHHHHHhhCCCcHHHHHHHHHH
Confidence 67788999999999999999999999999986544444
No 255
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=94.38 E-value=0.4 Score=34.85 Aligned_cols=34 Identities=12% Similarity=0.082 Sum_probs=17.9
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
...|......+...|+.+.|-..|++++..-|..
T Consensus 70 ~~~~~~Y~~~l~~~~d~~~aR~lfer~i~~l~~~ 103 (280)
T PF05843_consen 70 PDFWLEYLDFLIKLNDINNARALFERAISSLPKE 103 (280)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHCCTSSCH
T ss_pred HHHHHHHHHHHHHhCcHHHHHHHHHHHHHhcCch
Confidence 4445555555555566666666666665554443
No 256
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=94.24 E-value=0.13 Score=23.70 Aligned_cols=30 Identities=17% Similarity=0.259 Sum_probs=26.4
Q ss_pred CCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 11 GRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
|+++.|...|++++...|.++..|...+..
T Consensus 1 ~~~~~~r~i~e~~l~~~~~~~~~W~~y~~~ 30 (33)
T smart00386 1 GDIERARKIYERALEKFPKSVELWLKYAEF 30 (33)
T ss_pred CcHHHHHHHHHHHHHHCCCChHHHHHHHHH
Confidence 568899999999999999999999887654
No 257
>KOG1915|consensus
Probab=94.21 E-value=0.6 Score=36.60 Aligned_cols=56 Identities=13% Similarity=0.066 Sum_probs=40.3
Q ss_pred HHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCC
Q psy3020 7 LSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPR 72 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~ 72 (150)
=..++++..|.+.|++||..|..|...|...+.+ ..-. ++..|- .-+.+++.+-|.
T Consensus 83 Eesq~e~~RARSv~ERALdvd~r~itLWlkYae~Emknk--------~vNhAR--Nv~dRAvt~lPR 139 (677)
T KOG1915|consen 83 EESQKEIQRARSVFERALDVDYRNITLWLKYAEFEMKNK--------QVNHAR--NVWDRAVTILPR 139 (677)
T ss_pred HHhHHHHHHHHHHHHHHHhcccccchHHHHHHHHHHhhh--------hHhHHH--HHHHHHHHhcch
Confidence 3456788899999999999999999999999888 4444 333333 334445555564
No 258
>PRK13184 pknD serine/threonine-protein kinase; Reviewed
Probab=94.12 E-value=0.61 Score=39.56 Aligned_cols=107 Identities=9% Similarity=0.035 Sum_probs=68.4
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHH---HHHHH--HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKE---AIKDV--RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~---~l~~~--~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
.+..++...+.|+.|+..|.+.-..=|+-.+.+. ..|-. ....... .+ ..+.+|+ ..+
T Consensus 480 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~--~~~------------- 542 (932)
T PRK13184 480 AVPDAFLAEKLYDQALIFYRRIRESFPGRKEGYEAQFRLGITLLEKASEQG-DP-RDFTQAL--SEF------------- 542 (932)
T ss_pred cCcHHHHhhHHHHHHHHHHHHHhhcCCCcccchHHHHHhhHHHHHHHHhcC-Ch-HHHHHHH--HHH-------------
Confidence 3567888999999999999999998888766543 33322 1111000 00 1222232 111
Q ss_pred CCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhc
Q psy3020 77 LSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSN 141 (150)
Q Consensus 77 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n 141 (150)
..+-..+.. +-.|..++-+|...|+|+|=+++|.-|+..-|++|..-.-
T Consensus 543 ----------~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 591 (932)
T PRK13184 543 ----------SYLHGGVGA------PLEYLGKALVYQRLGEYNEEIKSLLLALKRYSQHPEISRL 591 (932)
T ss_pred ----------HHhcCCCCC------chHHHhHHHHHHHhhhHHHHHHHHHHHHHhcCCCCccHHH
Confidence 122233433 5556666667789999999999999999999998865443
No 259
>KOG1586|consensus
Probab=93.80 E-value=0.8 Score=32.64 Aligned_cols=35 Identities=23% Similarity=0.332 Sum_probs=25.8
Q ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 104 LLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 104 ~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
.+...+......++|++|++.|++...-.-+|+.+
T Consensus 156 C~lKvA~yaa~leqY~~Ai~iyeqva~~s~~n~LL 190 (288)
T KOG1586|consen 156 CLLKVAQYAAQLEQYSKAIDIYEQVARSSLDNNLL 190 (288)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchHH
Confidence 34444555568899999999999998877666544
No 260
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=93.79 E-value=0.11 Score=26.65 Aligned_cols=28 Identities=11% Similarity=0.162 Sum_probs=24.4
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCC
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPN 29 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~ 29 (150)
+.++.+|..+|+++.|.+.+++++. +++
T Consensus 3 LdLA~ayie~Gd~e~Ar~lL~evl~-~~~ 30 (44)
T TIGR03504 3 LDLARAYIEMGDLEGARELLEEVIE-EGD 30 (44)
T ss_pred hHHHHHHHHcCChHHHHHHHHHHHH-cCC
Confidence 4689999999999999999999994 444
No 261
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=93.21 E-value=1.5 Score=32.57 Aligned_cols=24 Identities=4% Similarity=0.156 Sum_probs=21.5
Q ss_pred HHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 17 ISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 17 ~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
...|++.++-+|.|.++|..+...
T Consensus 5 ~~el~~~v~~~P~di~~Wl~li~~ 28 (321)
T PF08424_consen 5 TAELNRRVRENPHDIEAWLELIEF 28 (321)
T ss_pred HHHHHHHHHhCcccHHHHHHHHHH
Confidence 356899999999999999999887
No 262
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=92.96 E-value=1.4 Score=27.46 Aligned_cols=103 Identities=10% Similarity=0.049 Sum_probs=58.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHH---HHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQM---KEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLS 78 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a---~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~ 78 (150)
.+..++..|++-+|++..+..+..++++... +..-|.+ ..+...- -||+.+..+-
T Consensus 2 ~A~~~~~rGnhiKAL~iied~i~~h~~~~~~~~lh~~QG~if~~lA~~t---------------------en~d~k~~yL 60 (111)
T PF04781_consen 2 KAKDYFARGNHIKALEIIEDLISRHGEDESSWLLHRLQGTIFYKLAKKT---------------------ENPDVKFRYL 60 (111)
T ss_pred hHHHHHHccCHHHHHHHHHHHHHHccCCCchHHHHHHHhHHHHHHHHhc---------------------cCchHHHHHH
Confidence 3567899999999999999999999987743 3333555 4433210 0222111000
Q ss_pred ChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 79 DPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 79 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
...+...-+...-.|.. +..++.+|.-+--..-|++++...+++|.+.
T Consensus 61 l~sve~~s~a~~Lsp~~------A~~L~~la~~l~s~~~Ykk~v~kak~~Lsv~ 108 (111)
T PF04781_consen 61 LGSVECFSRAVELSPDS------AHSLFELASQLGSVKYYKKAVKKAKRGLSVT 108 (111)
T ss_pred HHhHHHHHHHhccChhH------HHHHHHHHHHhhhHHHHHHHHHHHHHHhccc
Confidence 00111111112223322 6667777766655666778888888877653
No 263
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=92.75 E-value=0.26 Score=22.58 Aligned_cols=29 Identities=14% Similarity=0.164 Sum_probs=24.6
Q ss_pred cCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 116 NNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 116 ~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
|+++.|...|++++...|.++..|...+.
T Consensus 1 ~~~~~~r~i~e~~l~~~~~~~~~W~~y~~ 29 (33)
T smart00386 1 GDIERARKIYERALEKFPKSVELWLKYAE 29 (33)
T ss_pred CcHHHHHHHHHHHHHHCCCChHHHHHHHH
Confidence 56788999999999999999988876544
No 264
>KOG1308|consensus
Probab=92.42 E-value=0.071 Score=39.61 Aligned_cols=44 Identities=11% Similarity=0.073 Sum_probs=38.6
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhcc
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMND 47 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~ 47 (150)
|.++.+++++..|+.+|..|+.++||...-+.-.+.. +.+|.+.
T Consensus 155 ~sv~lkl~kp~~airD~d~A~ein~Dsa~~ykfrg~A~rllg~~e 199 (377)
T KOG1308|consen 155 ASVFLKLKKPNAAIRDCDFAIEINPDSAKGYKFRGYAERLLGNWE 199 (377)
T ss_pred cceeeeccCCchhhhhhhhhhccCcccccccchhhHHHHHhhchH
Confidence 5678899999999999999999999999888888888 8888553
No 265
>KOG0530|consensus
Probab=92.37 E-value=3.2 Score=30.18 Aligned_cols=120 Identities=12% Similarity=0.165 Sum_probs=63.5
Q ss_pred HHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHH
Q psy3020 7 LSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQM 85 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 85 (150)
+....+-..|+..-..+|.++|.|..+|.-.-.+ ..++. ++.+-+ .-+.+...-+|.+...- +..+.
T Consensus 53 ~~~~E~S~RAl~LT~d~i~lNpAnYTVW~yRr~iL~~l~~-------dL~~El--~~l~eI~e~npKNYQvW---HHRr~ 120 (318)
T KOG0530|consen 53 IAKNEKSPRALQLTEDAIRLNPANYTVWQYRRVILRHLMS-------DLNKEL--EYLDEIIEDNPKNYQVW---HHRRV 120 (318)
T ss_pred HhccccCHHHHHHHHHHHHhCcccchHHHHHHHHHHHhHH-------HHHHHH--HHHHHHHHhCccchhHH---HHHHH
Confidence 3444566789999999999999999999877766 65553 222222 23333345556555421 12222
Q ss_pred HHHHhcCcchhhh----------H-HHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchh
Q psy3020 86 IKEIQKDPSLMTL----------E-AKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILF 139 (150)
Q Consensus 86 ~~~l~~~~~~~~~----------~-~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~ 139 (150)
+-.+-.+|+. .+ + +..-+|..+-+++..-+.|+.=+.+..+.|+.|--|-.+|
T Consensus 121 ive~l~d~s~-rELef~~~~l~~DaKNYHaWshRqW~~r~F~~~~~EL~y~~~Lle~Di~NNSAW 184 (318)
T KOG0530|consen 121 IVELLGDPSF-RELEFTKLMLDDDAKNYHAWSHRQWVLRFFKDYEDELAYADELLEEDIRNNSAW 184 (318)
T ss_pred HHHHhcCccc-chHHHHHHHHhccccchhhhHHHHHHHHHHhhHHHHHHHHHHHHHHhhhccchh
Confidence 2222223331 11 0 1245555555555555555555555555555544444443
No 266
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=92.15 E-value=4.7 Score=31.60 Aligned_cols=27 Identities=7% Similarity=0.108 Sum_probs=20.6
Q ss_pred HhhCC-HHHHHHHHHHhhccCCCcHHHH
Q psy3020 8 SYLGR-YKESISTYEEGLKLDPNNEQMK 34 (150)
Q Consensus 8 ~~~~~-~~~A~~~~~~aL~~~p~~~~a~ 34 (150)
-+.|. -++|+..++.+++..|.+.+..
T Consensus 390 W~~g~~dekalnLLk~il~ft~yD~ec~ 417 (549)
T PF07079_consen 390 WEIGQCDEKALNLLKLILQFTNYDIECE 417 (549)
T ss_pred HhcCCccHHHHHHHHHHHHhccccHHHH
Confidence 34444 7789999999999998888654
No 267
>PF14863 Alkyl_sulf_dimr: Alkyl sulfatase dimerisation; PDB: 2YHE_C 2CG2_A 2CG3_A 2CFU_A 2CFZ_A.
Probab=92.04 E-value=0.36 Score=31.47 Aligned_cols=43 Identities=16% Similarity=0.196 Sum_probs=35.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEM 45 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~ 45 (150)
.+...+..|+|.-|++..+.++..||+|.++..-.+++ ..+|.
T Consensus 76 ~A~~~~~~gd~~wA~~L~d~l~~adp~n~~ar~l~A~al~~lg~ 119 (141)
T PF14863_consen 76 RAQAALAAGDYQWAAELLDHLVFADPDNEEARQLKADALEQLGY 119 (141)
T ss_dssp HHHHHHHCT-HHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHHH
Confidence 45667789999999999999999999999999999998 88884
No 268
>KOG3364|consensus
Probab=91.87 E-value=0.29 Score=31.73 Aligned_cols=41 Identities=5% Similarity=0.063 Sum_probs=35.4
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNR 142 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~ 142 (150)
-+..+.++-.+++.++|+.++++.+..++..|+|..+.-..
T Consensus 71 Re~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~Lk 111 (149)
T KOG3364|consen 71 RECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQALELK 111 (149)
T ss_pred hhhhhhhHHHHHHHhhHHHHHHHHHHHHhhCCCcHHHHHHH
Confidence 56677788889999999999999999999999998775443
No 269
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=91.81 E-value=3.1 Score=28.82 Aligned_cols=45 Identities=9% Similarity=0.053 Sum_probs=37.1
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCc--HHHHHHHHHH-Hhhhh
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNN--EQMKEAIKDV-RNQEM 45 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~--~~a~~~l~~~-~~~g~ 45 (150)
+.++.+.+..+++.+|...++...+-+|.. +..+..++.+ ...|.
T Consensus 128 LglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~pd~~Ll~aR~laa~g~ 175 (251)
T COG4700 128 LGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRSPDGHLLFARTLAAQGK 175 (251)
T ss_pred HHHHHHHHhhccHHHHHHHHHHHhhcCCccCCCCchHHHHHHHHhcCC
Confidence 357889999999999999999999999974 4556667777 77773
No 270
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=91.47 E-value=0.5 Score=34.13 Aligned_cols=63 Identities=10% Similarity=0.029 Sum_probs=51.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP 75 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~ 75 (150)
+=.++...++++.|+.+-++.|.++|++++-+...|-+ ..+| +...|+ .++...+..-|+...
T Consensus 187 lk~~~~~e~~~~~al~~~~r~l~l~P~dp~eirDrGliY~ql~--------c~~vAl--~dl~~~~~~~P~~~~ 250 (269)
T COG2912 187 LKAALLRELQWELALRVAERLLDLNPEDPYEIRDRGLIYAQLG--------CYHVAL--EDLSYFVEHCPDDPI 250 (269)
T ss_pred HHHHHHHhhchHHHHHHHHHHHhhCCCChhhccCcHHHHHhcC--------CchhhH--HHHHHHHHhCCCchH
Confidence 34578889999999999999999999999999999999 8999 666677 677666666776543
No 271
>PHA02537 M terminase endonuclease subunit; Provisional
Probab=91.33 E-value=1.3 Score=31.40 Aligned_cols=36 Identities=22% Similarity=0.325 Sum_probs=29.7
Q ss_pred HHhHHHHHHHHHHH---------HhcCHHHHHHHHHHHHhhcCCC
Q psy3020 100 AKISLLKDKGNAAL---------QANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 100 ~~~~~~~~~g~~~~---------~~~~~~~A~~~~~~al~~~p~~ 135 (150)
.++.-++..|..+. ..++...|+..+.+|+++||+.
T Consensus 167 vrAKl~K~~G~~llr~~~g~~~~d~~~l~~Al~~L~rA~~l~~k~ 211 (230)
T PHA02537 167 VRAKLYKAAGYLLLRNEKGEPIGDAETLQLALALLQRAFQLNDKC 211 (230)
T ss_pred HHHHHHHHHHHHHhhcccCCCccCcccHHHHHHHHHHHHHhCCCC
Confidence 44778888888884 4568889999999999999964
No 272
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=91.21 E-value=2.6 Score=26.84 Aligned_cols=100 Identities=12% Similarity=0.136 Sum_probs=58.3
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcH------------HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCC
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNE------------QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDP 71 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~------------~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~ 71 (150)
.-....|-|++|...|.+|.++..+-+ -.|-.|+.. ..+| .+.+++ .+...+|.
T Consensus 17 e~ql~~g~~~eAa~s~r~AM~~srtiP~eEaFDh~GFDA~chA~Ls~A~~~Lg--------ry~e~L--~sA~~aL~--- 83 (144)
T PF12968_consen 17 ERQLQDGAYEEAAASCRKAMEVSRTIPAEEAFDHDGFDAFCHAGLSGALAGLG--------RYDECL--QSADRALR--- 83 (144)
T ss_dssp HHHHHHT-HHHHHHHHHHHHHHHTTS-TTS---HHHHHHHHHHHHHHHHHHTT---------HHHHH--HHHHHHHH---
T ss_pred HHHHHhhhHHHHHHHHHHHHHHhccCChHhhcccccHHHHHHHHHHHHHHhhc--------cHHHHH--HHHHHHHH---
Confidence 334566889999999999998654322 245566666 7778 555555 34333332
Q ss_pred CCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 72 RTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 72 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+...-..+.++... .=..+-.+++..+-..|+.++|+..|+.+-+.
T Consensus 84 ----------YFNRRGEL~qdeGk----lWIaaVfsra~Al~~~Gr~~eA~~~fr~agEM 129 (144)
T PF12968_consen 84 ----------YFNRRGELHQDEGK----LWIAAVFSRAVALEGLGRKEEALKEFRMAGEM 129 (144)
T ss_dssp ----------HHHHH--TTSTHHH----HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred ----------HHhhccccccccch----hHHHHHHHHHHHHHhcCChHHHHHHHHHHHHH
Confidence 22222223333211 00233456788888999999999999998653
No 273
>COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]
Probab=91.06 E-value=5.8 Score=31.97 Aligned_cols=31 Identities=19% Similarity=-0.003 Sum_probs=26.9
Q ss_pred HHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 108 KGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 108 ~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
+|+.+...|+..++.....+++++.|.++..
T Consensus 148 ~~~~~~~l~~~~~~~~~l~~~~d~~p~~~~~ 178 (620)
T COG3914 148 LGRYLKLLGRTAEAELALERAVDLLPKYPRV 178 (620)
T ss_pred HHHHHHHhccHHHHHHHHHHHHHhhhhhhhh
Confidence 6888888899999999999999999988544
No 274
>KOG0529|consensus
Probab=90.65 E-value=3.6 Score=31.59 Aligned_cols=105 Identities=20% Similarity=0.164 Sum_probs=69.3
Q ss_pred CCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHH
Q psy3020 11 GRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEI 89 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l 89 (150)
.-+++-+..-..+|+++|+...+|..+.-+ ...+.. ....-+ .-+.+.++.||.+.-.- .+.+.+-
T Consensus 89 ~~ld~eL~~~~~~L~~npksY~aW~hR~w~L~~~p~~------~~~~EL--~lcek~L~~D~RNfh~W---~YRRfV~-- 155 (421)
T KOG0529|consen 89 ALLDEELKYVESALKVNPKSYGAWHHRKWVLQKNPHS------DWNTEL--QLCEKALKQDPRNFHAW---HYRRFVV-- 155 (421)
T ss_pred HhhHHHHHHHHHHHHhCchhHHHHHHHHHHHHhCCCc------hHHHHH--HHHHHHHhcCcccccch---HHHHHHH--
Confidence 345677888899999999999999988877 544421 233333 45666777777654321 1222211
Q ss_pred hcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 90 QKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 90 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
+.+-.......+=+++.+++|..|+.|-.+|++|..++-
T Consensus 156 -------------------~~~~~~~~~~~~El~ftt~~I~~nfSNYsaWhyRs~lL~ 194 (421)
T KOG0529|consen 156 -------------------EQAERSRNLEKEELEFTTKLINDNFSNYSAWHYRSLLLS 194 (421)
T ss_pred -------------------HHHhcccccchhHHHHHHHHHhccchhhhHHHHHHHHHH
Confidence 111112234667788999999999999999999988764
No 275
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=90.57 E-value=0.59 Score=21.08 Aligned_cols=28 Identities=18% Similarity=0.226 Sum_probs=23.9
Q ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 104 LLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 104 ~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
.|..+-..+.+.|++++|.+.|.+..+.
T Consensus 2 ~y~~li~~~~~~~~~~~a~~~~~~M~~~ 29 (31)
T PF01535_consen 2 TYNSLISGYCKMGQFEEALEVFDEMRER 29 (31)
T ss_pred cHHHHHHHHHccchHHHHHHHHHHHhHC
Confidence 4677788899999999999999988764
No 276
>KOG2471|consensus
Probab=90.54 E-value=0.31 Score=38.24 Aligned_cols=40 Identities=8% Similarity=0.037 Sum_probs=38.2
Q ss_pred ChhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 1 MTTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 1 ~~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
+++|..|.+.|++..|.++|.++.+.--.|+..|..++.+
T Consensus 339 YNcG~~~Lh~grPl~AfqCf~~av~vfh~nPrlWLRlAEc 378 (696)
T KOG2471|consen 339 YNCGLLYLHSGRPLLAFQCFQKAVHVFHRNPRLWLRLAEC 378 (696)
T ss_pred HhhhHHHHhcCCcHHHHHHHHHHHHHHhcCcHHHHHHHHH
Confidence 4789999999999999999999999999999999999997
No 277
>PF14863 Alkyl_sulf_dimr: Alkyl sulfatase dimerisation; PDB: 2YHE_C 2CG2_A 2CG3_A 2CFU_A 2CFZ_A.
Probab=90.40 E-value=1.2 Score=29.02 Aligned_cols=48 Identities=19% Similarity=0.194 Sum_probs=39.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.+.....+...+..|+|.-|.+..+.++..+|+|..+..-++.+|-++
T Consensus 70 ~d~vl~~A~~~~~~gd~~wA~~L~d~l~~adp~n~~ar~l~A~al~~l 117 (141)
T PF14863_consen 70 ADKVLERAQAALAAGDYQWAAELLDHLVFADPDNEEARQLKADALEQL 117 (141)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHH
Confidence 566777888889999999999999999999999999999888888654
No 278
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=90.32 E-value=1.6 Score=25.48 Aligned_cols=41 Identities=7% Similarity=0.004 Sum_probs=33.1
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH----Hhhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV----RNQE 44 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~----~~~g 44 (150)
|.-+++..+.++|+..+.++|+..++..+-+..+|.+ ..+|
T Consensus 13 GlkLY~~~~~~~Al~~W~~aL~k~~~~~~rf~~lG~l~qA~~e~G 57 (80)
T PF10579_consen 13 GLKLYHQNETQQALQKWRKALEKITDREDRFRVLGYLIQAHMEWG 57 (80)
T ss_pred HHHHhccchHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHHHHHH
Confidence 4446788899999999999999999988877766654 4555
No 279
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=90.24 E-value=4.2 Score=27.50 Aligned_cols=100 Identities=15% Similarity=0.126 Sum_probs=56.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcH---HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNE---QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~---~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
.+|.-|.+.|++++|+++|.++.+-..+.. +..+.+..+ ...+++ ..+. .
T Consensus 41 ~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~~irv~i~~~d~--------~~v~--~---------------- 94 (177)
T PF10602_consen 41 DLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLNVIRVAIFFGDW--------SHVE--K---------------- 94 (177)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHHHHHHHHHhCCH--------HHHH--H----------------
Confidence 478899999999999999999877543322 344455555 555533 2221 1
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
+...+..+...+.+.... ...-...|......++|.+|-+.|..++.-..
T Consensus 95 ----~i~ka~~~~~~~~d~~~~--nrlk~~~gL~~l~~r~f~~AA~~fl~~~~t~~ 144 (177)
T PF10602_consen 95 ----YIEKAESLIEKGGDWERR--NRLKVYEGLANLAQRDFKEAAELFLDSLSTFT 144 (177)
T ss_pred ----HHHHHHHHHhccchHHHH--HHHHHHHHHHHHHhchHHHHHHHHHccCcCCC
Confidence 111111111111111111 22223345556678999999999988866554
No 280
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=90.16 E-value=1.5 Score=29.27 Aligned_cols=48 Identities=8% Similarity=-0.083 Sum_probs=41.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
+..-..-|+++...|+|.+|+..++...+-.|..+.+--.++.|+..+
T Consensus 44 ~e~~~~~~~l~i~r~~w~dA~rlLr~l~~~~~~~p~~kALlA~CL~~~ 91 (160)
T PF09613_consen 44 PELDLFDGWLHIVRGDWDDALRLLRELEERAPGFPYAKALLALCLYAL 91 (160)
T ss_pred hHHHHHHHHHHHHhCCHHHHHHHHHHHhccCCCChHHHHHHHHHHHHc
Confidence 556677788999999999999999999999999998888888888654
No 281
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=90.13 E-value=0.97 Score=28.16 Aligned_cols=41 Identities=22% Similarity=0.327 Sum_probs=32.7
Q ss_pred HHHHHHHhcCHHHHHHHHHHHHhhcCCCc---chhhcHHHHHHh
Q psy3020 108 KGNAALQANNFKEAIEAYSEAIKLDGTNH---ILFSNRSAAFAK 148 (150)
Q Consensus 108 ~g~~~~~~~~~~~A~~~~~~al~~~p~~~---~~~~n~a~~~~~ 148 (150)
++..++.+|++-+|++..+..+..++++. .++.--|.++.+
T Consensus 2 ~A~~~~~rGnhiKAL~iied~i~~h~~~~~~~~lh~~QG~if~~ 45 (111)
T PF04781_consen 2 KAKDYFARGNHIKALEIIEDLISRHGEDESSWLLHRLQGTIFYK 45 (111)
T ss_pred hHHHHHHccCHHHHHHHHHHHHHHccCCCchHHHHHHHhHHHHH
Confidence 35667899999999999999999999988 455555666554
No 282
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=90.09 E-value=6.2 Score=29.94 Aligned_cols=38 Identities=13% Similarity=0.036 Sum_probs=32.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC-Ccchh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT-NHILF 139 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~-~~~~~ 139 (150)
..++......+.++|-+.-|.+.++-.+.+||. ||..-
T Consensus 103 flal~r~i~~L~~RG~~rTAlE~~KlLlsLdp~~DP~g~ 141 (360)
T PF04910_consen 103 FLALFRYIQSLGRRGCWRTALEWCKLLLSLDPDEDPLGV 141 (360)
T ss_pred HHHHHHHHHHHHhcCcHHHHHHHHHHHHhcCCCCCcchh
Confidence 667777778888999999999999999999999 66543
No 283
>KOG2053|consensus
Probab=90.06 E-value=10 Score=32.17 Aligned_cols=59 Identities=17% Similarity=0.112 Sum_probs=43.9
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPR 72 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~ 72 (150)
|-.++++|+.++|..+++..=..-|++...+..+..+ ..++ +..+++ .-+.+....+|.
T Consensus 50 aLsl~r~gk~~ea~~~Le~~~~~~~~D~~tLq~l~~~y~d~~--------~~d~~~--~~Ye~~~~~~P~ 109 (932)
T KOG2053|consen 50 ALSLFRLGKGDEALKLLEALYGLKGTDDLTLQFLQNVYRDLG--------KLDEAV--HLYERANQKYPS 109 (932)
T ss_pred HHHHHHhcCchhHHHHHhhhccCCCCchHHHHHHHHHHHHHh--------hhhHHH--HHHHHHHhhCCc
Confidence 5677888999999977776666666777777788888 8888 666777 666666666775
No 284
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=89.94 E-value=0.7 Score=32.82 Aligned_cols=36 Identities=28% Similarity=0.332 Sum_probs=33.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
...|+.+|...-+.|+++.|.+.|++.+++||.+..
T Consensus 29 ~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~ 64 (287)
T COG4976 29 AAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHG 64 (287)
T ss_pred hhhhhhcchhhhhcccHHHHHHHHHHHHcCCccccc
Confidence 778999999999999999999999999999998753
No 285
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=89.92 E-value=1.2 Score=32.86 Aligned_cols=43 Identities=21% Similarity=0.202 Sum_probs=39.1
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEM 45 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~ 45 (150)
.+..|...|.+.+|++..++++.+||=+...+..+-.+ ..+|+
T Consensus 285 va~~yle~g~~neAi~l~qr~ltldpL~e~~nk~lm~~la~~gD 328 (361)
T COG3947 285 VARAYLEAGKPNEAIQLHQRALTLDPLSEQDNKGLMASLATLGD 328 (361)
T ss_pred HHHHHHHcCChHHHHHHHHHHhhcChhhhHHHHHHHHHHHHhcc
Confidence 46789999999999999999999999999999999888 88884
No 286
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=89.77 E-value=1.3 Score=25.58 Aligned_cols=38 Identities=18% Similarity=0.113 Sum_probs=24.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh-------hcCCCcchh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK-------LDGTNHILF 139 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~-------~~p~~~~~~ 139 (150)
+..+..++.-+=+.|++++|+.+|+++++ ..|+++...
T Consensus 6 A~~~a~~AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~ 50 (75)
T cd02682 6 ARKYAINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRL 50 (75)
T ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHH
Confidence 33444455555577888888877777765 567765443
No 287
>PF12854 PPR_1: PPR repeat
Probab=89.60 E-value=0.84 Score=21.76 Aligned_cols=27 Identities=19% Similarity=0.213 Sum_probs=23.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEA 128 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~a 128 (150)
...|..+-..+.+.|+.++|++.|.+.
T Consensus 7 ~~ty~~lI~~~Ck~G~~~~A~~l~~~M 33 (34)
T PF12854_consen 7 VVTYNTLIDGYCKAGRVDEAFELFDEM 33 (34)
T ss_pred HhHHHHHHHHHHHCCCHHHHHHHHHhC
Confidence 567888899999999999999998763
No 288
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=88.99 E-value=1.9 Score=24.17 Aligned_cols=30 Identities=40% Similarity=0.465 Sum_probs=21.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+..+...|..+=..|++++|+++|+++++.
T Consensus 5 A~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~ 34 (69)
T PF04212_consen 5 AIELIKKAVEADEAGNYEEALELYKEAIEY 34 (69)
T ss_dssp HHHHHHHHHHHHHTTSHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 334455555556789999999998888763
No 289
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=88.82 E-value=8 Score=28.75 Aligned_cols=116 Identities=14% Similarity=0.110 Sum_probs=72.2
Q ss_pred HHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC-----------CCC-
Q psy3020 13 YKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF-----------LSD- 79 (150)
Q Consensus 13 ~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~-----------~~~- 79 (150)
.+.-+..|++||+.+|++...+..+-.+ .... .-.... ..+.+.+..+|+.... ...
T Consensus 47 ~E~klsilerAL~~np~~~~L~l~~l~~~~~~~--------~~~~l~--~~we~~l~~~~~~~~LW~~yL~~~q~~~~~f 116 (321)
T PF08424_consen 47 AERKLSILERALKHNPDSERLLLGYLEEGEKVW--------DSEKLA--KKWEELLFKNPGSPELWREYLDFRQSNFASF 116 (321)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhC--------CHHHHH--HHHHHHHHHCCCChHHHHHHHHHHHHHhccC
Confidence 4567899999999999999988877666 4444 223333 5666677777765531 000
Q ss_pred ------hHHHHHHHHHhcCcchh-hh-------H-HHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 80 ------PSYVQMIKEIQKDPSLM-TL-------E-AKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 80 ------~~~~~~~~~l~~~~~~~-~~-------~-~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
..|.+.+..+....... .. . .-...+..+...+.+.|-.+.|+..+.-.++++=..|..
T Consensus 117 ~v~~~~~~y~~~l~~L~~~~~~~~~~~~~~~~~e~~~l~v~~r~~~fl~~aG~~E~Ava~~Qa~lE~n~~~P~~ 190 (321)
T PF08424_consen 117 TVSDVRDVYEKCLRALSRRRSGRMTSHPDLPELEEFMLYVFLRLCRFLRQAGYTERAVALWQALLEFNFFRPES 190 (321)
T ss_pred cHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHHHHHCCchHHHHHHHHHHHHHHcCCccc
Confidence 12334444443322221 00 0 114555556666678999999999999999988654443
No 290
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=88.66 E-value=1.3 Score=26.47 Aligned_cols=34 Identities=26% Similarity=0.341 Sum_probs=29.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN 135 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 135 (150)
..++.+++.+....|++++|+..+++|+.+-...
T Consensus 41 ~~all~lA~~~~~~G~~~~A~~~l~eAi~~Are~ 74 (94)
T PF12862_consen 41 AYALLNLAELHRRFGHYEEALQALEEAIRLAREN 74 (94)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHH
Confidence 5567889999999999999999999999876654
No 291
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=88.53 E-value=7.8 Score=30.49 Aligned_cols=94 Identities=16% Similarity=0.093 Sum_probs=55.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHH----HHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMK----EAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFL 77 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~----~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~ 77 (150)
-|+.+...|+.++|++.|++++...+.-...+ ..++-+ ..+. ++.+|. ..+......+.-
T Consensus 273 ~gR~~~~~g~~~~Ai~~~~~a~~~q~~~~Ql~~l~~~El~w~~~~~~--------~w~~A~--~~f~~L~~~s~W----- 337 (468)
T PF10300_consen 273 EGRLERLKGNLEEAIESFERAIESQSEWKQLHHLCYFELAWCHMFQH--------DWEEAA--EYFLRLLKESKW----- 337 (468)
T ss_pred HHHHHHHhcCHHHHHHHHHHhccchhhHHhHHHHHHHHHHHHHHHHc--------hHHHHH--HHHHHHHhcccc-----
Confidence 47888999999999999999996555544432 344444 4444 334443 222222221110
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCH-------HHHHHHHHHHHhhcC
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNF-------KEAIEAYSEAIKLDG 133 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-------~~A~~~~~~al~~~p 133 (150)
.+.-..+..|.++...|+. ++|.+.|.++-.+-.
T Consensus 338 ----------------------Ska~Y~Y~~a~c~~~l~~~~~~~~~~~~a~~l~~~vp~l~~ 378 (468)
T PF10300_consen 338 ----------------------SKAFYAYLAAACLLMLGREEEAKEHKKEAEELFRKVPKLKQ 378 (468)
T ss_pred ----------------------HHHHHHHHHHHHHHhhccchhhhhhHHHHHHHHHHHHHHHh
Confidence 1144455667777788887 777777776655433
No 292
>PF01239 PPTA: Protein prenyltransferase alpha subunit repeat; InterPro: IPR002088 Protein prenylation is the posttranslational attachment of either a farnesyl group or a geranylgeranyl group via a thioether linkage (-C-S-C-) to a cysteine at or near the carboxyl terminus of the protein. Farnesyl and geranylgeranyl groups are polyisoprenes, unsaturated hydrocarbons with a multiple of five carbons; the chain is 15 carbons long in the farnesyl moiety and 20 carbons long in the geranylgeranyl moiety. There are three different protein prenyltransferases in humans: farnesyltransferase (FT) and geranylgeranyltransferase 1 (GGT1) share the same motif (the CaaX box) around the cysteine in their substrates, and are thus called CaaX prenyltransferases, whereas geranylgeranyltransferase 2 (GGT2, also called Rab geranylgeranyltransferase) recognises a different motif and is thus called a non-CaaX prenyltransferase. Protein prenyltransferases are currently known only in eukaryotes, but they are widespread, being found in vertebrates, insects, nematodes, plants, fungi and protozoa, including several parasites. Each protein consists of two subunits, alpha and beta; the alpha subunit of FT and GGT1 is encoded by the same gene, FNTA. The alpha subunit is thought to participate in a stable complex with the isoprenyl substrate; the beta subunit binds the peptide substrate. In the alpha subunits of both types of protein prenyltransferases, seven tetratricopeptide repeats are formed by pairs of helices that are stabilised by conserved intercalating residues. The alpha subunits of GGT2 in mammals and plants also have an immunoglobulin-like domain between the fifth and sixth tetratricopeptide repeat, as well as leucine-rich repeats at the carboxyl terminus. The functions of these additional domains in GGT2 are as yet undefined, but they are apparently not directly involved in the interaction with substrates and Rab escort proteins. The tetratricopeptide repeats of the alpha subunit form a right-handed superhelix, which embraces the (alpha-alpha)6 barrel of the beta subunit []. ; GO: 0008318 protein prenyltransferase activity, 0018342 protein prenylation; PDB: 1S63_A 1LD7_A 1LD8_A 2H6G_A 1SA4_A 1MZC_A 1TN6_A 2F0Y_A 2H6H_A 2H6F_A ....
Probab=88.45 E-value=1.1 Score=20.77 Aligned_cols=29 Identities=17% Similarity=0.166 Sum_probs=24.0
Q ss_pred HHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 121 AIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 121 A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.++...+++..+|.|..+|..|-.+.-.+
T Consensus 2 El~~~~~~l~~~pknys~W~yR~~ll~~l 30 (31)
T PF01239_consen 2 ELEFTKKALEKDPKNYSAWNYRRWLLKQL 30 (31)
T ss_dssp HHHHHHHHHHHSTTCHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHCcccccHHHHHHHHHHHc
Confidence 45678899999999999999998776543
No 293
>PF13041 PPR_2: PPR repeat family
Probab=88.42 E-value=1.2 Score=22.97 Aligned_cols=30 Identities=20% Similarity=0.244 Sum_probs=26.4
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
...|..+-..+.+.|++++|.+.|++..+.
T Consensus 3 ~~~yn~li~~~~~~~~~~~a~~l~~~M~~~ 32 (50)
T PF13041_consen 3 VVTYNTLISGYCKAGKFEEALKLFKEMKKR 32 (50)
T ss_pred hHHHHHHHHHHHHCcCHHHHHHHHHHHHHc
Confidence 456788888999999999999999999874
No 294
>KOG1550|consensus
Probab=88.19 E-value=6.2 Score=31.70 Aligned_cols=38 Identities=5% Similarity=-0.051 Sum_probs=25.1
Q ss_pred hhHHHHhh-----CCHHHHHHHHHHhhcc-----CCCcHHHHHHHHHH
Q psy3020 3 TYSCLSYL-----GRYKESISTYEEGLKL-----DPNNEQMKEAIKDV 40 (150)
Q Consensus 3 lg~~~~~~-----~~~~~A~~~~~~aL~~-----~p~~~~a~~~l~~~ 40 (150)
+|.||..- ++.+.|+.+|..+.+. .-.++.+...+|.+
T Consensus 250 ~g~~y~~G~~g~~~d~e~a~~~l~~aa~~~~~~a~~~~~~a~~~lg~~ 297 (552)
T KOG1550|consen 250 LGICYLAGTYGVTQDLESAIEYLKLAAESFKKAATKGLPPAQYGLGRL 297 (552)
T ss_pred HHHHHhhccccccccHHHHHHHHHHHHHHHHHHHhhcCCccccHHHHH
Confidence 45555544 7889999999999771 11144566667766
No 295
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=88.07 E-value=0.48 Score=28.40 Aligned_cols=27 Identities=22% Similarity=0.425 Sum_probs=24.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDP 28 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p 28 (150)
++|..+...|++++|+..+++|+++-.
T Consensus 46 ~lA~~~~~~G~~~~A~~~l~eAi~~Ar 72 (94)
T PF12862_consen 46 NLAELHRRFGHYEEALQALEEAIRLAR 72 (94)
T ss_pred HHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 578899999999999999999997654
No 296
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=87.61 E-value=1.7 Score=19.85 Aligned_cols=28 Identities=18% Similarity=0.239 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 104 LLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 104 ~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
.|..+-..+.+.|++++|.+.|.+..+.
T Consensus 2 ~~n~li~~~~~~~~~~~a~~~~~~M~~~ 29 (35)
T TIGR00756 2 TYNTLIDGLCKAGRVEEALELFKEMLER 29 (35)
T ss_pred cHHHHHHHHHHCCCHHHHHHHHHHHHHc
Confidence 3566777889999999999999998653
No 297
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=87.52 E-value=1.1 Score=33.33 Aligned_cols=84 Identities=7% Similarity=0.050 Sum_probs=54.6
Q ss_pred HHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhh
Q psy3020 19 TYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMT 97 (150)
Q Consensus 19 ~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~ 97 (150)
.|.++..--|+++..|.+.+.. ...+. +.+.- .-+..++..+|.+.+
T Consensus 95 ~~~R~tnkff~D~k~w~~y~~Y~~k~k~--------y~~~~--nI~~~~l~khP~nvd---------------------- 142 (435)
T COG5191 95 ELYRSTNKFFNDPKIWSQYAAYVIKKKM--------YGEMK--NIFAECLTKHPLNVD---------------------- 142 (435)
T ss_pred eeehhhhcCCCCcHHHHHHHHHHHHHHH--------HHHHH--HHHHHHHhcCCCCce----------------------
Confidence 3445555667778877777666 55552 22222 344556666776543
Q ss_pred hHHHhHHHHHHH-HHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 98 LEAKISLLKDKG-NAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 98 ~~~~~~~~~~~g-~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
.|..-. --+...++++.+-..|.+++..||++|..|.
T Consensus 143 ------lWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~~p~iw~ 180 (435)
T COG5191 143 ------LWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWI 180 (435)
T ss_pred ------eeeeeccchhhhhccHHHHHHHHHhhhccCCCCchHHH
Confidence 333311 1234678999999999999999999998875
No 298
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=87.46 E-value=1.8 Score=31.30 Aligned_cols=44 Identities=14% Similarity=0.229 Sum_probs=38.1
Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 106 KDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 106 ~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
.++=..+...++++.|..+-.+.+.++|+++.-+.-||.+|.++
T Consensus 185 ~~lk~~~~~e~~~~~al~~~~r~l~l~P~dp~eirDrGliY~ql 228 (269)
T COG2912 185 RNLKAALLRELQWELALRVAERLLDLNPEDPYEIRDRGLIYAQL 228 (269)
T ss_pred HHHHHHHHHhhchHHHHHHHHHHHhhCCCChhhccCcHHHHHhc
Confidence 33444567889999999999999999999999999999999876
No 299
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=87.32 E-value=2.7 Score=24.55 Aligned_cols=42 Identities=14% Similarity=0.138 Sum_probs=30.4
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRS 143 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a 143 (150)
.......|--++..++..+|+..++++++..++.+.-+.-+|
T Consensus 6 ak~~ie~GlkLY~~~~~~~Al~~W~~aL~k~~~~~~rf~~lG 47 (80)
T PF10579_consen 6 AKQQIEKGLKLYHQNETQQALQKWRKALEKITDREDRFRVLG 47 (80)
T ss_pred HHHHHHHHHHHhccchHHHHHHHHHHHHhhcCChHHHHHHHH
Confidence 344556677778888888888888888888888766554444
No 300
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=87.07 E-value=2.4 Score=24.50 Aligned_cols=31 Identities=23% Similarity=0.169 Sum_probs=22.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
+..+...+.-+=+.|+|++|+.+|.++++..
T Consensus 6 Ai~~a~~Ave~D~~g~y~eA~~~Y~~aie~l 36 (76)
T cd02681 6 AVQFARLAVQRDQEGRYSEAVFYYKEAAQLL 36 (76)
T ss_pred HHHHHHHHHHHHHccCHHHHHHHHHHHHHHH
Confidence 3444455555557899999999999988743
No 301
>KOG4814|consensus
Probab=86.93 E-value=4.9 Score=32.92 Aligned_cols=39 Identities=18% Similarity=0.248 Sum_probs=21.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
+...+.+..+|....+.+.|++++.+|-+.+|.++...+
T Consensus 394 aK~qR~l~~CYL~L~QLD~A~E~~~EAE~~d~~~~l~q~ 432 (872)
T KOG4814|consen 394 AKIQRALQVCYLKLEQLDNAVEVYQEAEEVDRQSPLCQL 432 (872)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHHHHHHHhhccccHHHHH
Confidence 334445555555556666666666666666665554433
No 302
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=86.59 E-value=1.4 Score=27.98 Aligned_cols=36 Identities=19% Similarity=0.273 Sum_probs=30.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAI 37 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l 37 (150)
.+|..+...|++++|..+|-+||.+-|+-.+....+
T Consensus 68 ~lGE~L~~~G~~~~aa~hf~nAl~V~~qP~~LL~i~ 103 (121)
T PF02064_consen 68 QLGEQLLAQGDYEEAAEHFYNALKVCPQPAELLQIY 103 (121)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHTSSSHHHHHHHH
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhCCCHHHHHHHH
Confidence 578899999999999999999999999977765544
No 303
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=85.81 E-value=3 Score=24.14 Aligned_cols=29 Identities=24% Similarity=0.328 Sum_probs=18.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
+..+...+.-+=+.|+|++|+.+|.++|+
T Consensus 6 a~~l~~~Ave~D~~g~y~eAl~~Y~~aie 34 (77)
T cd02683 6 AKEVLKRAVELDQEGRFQEALVCYQEGID 34 (77)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHH
Confidence 33444455555567788777777777665
No 304
>KOG1070|consensus
Probab=85.64 E-value=14 Score=33.21 Aligned_cols=31 Identities=16% Similarity=0.150 Sum_probs=23.6
Q ss_pred HHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 14 KESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 14 ~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
.+..++|++.+..+|+.+-.|...-.. ..++
T Consensus 1441 pesaeDferlvrssPNSSi~WI~YMaf~Lels 1472 (1710)
T KOG1070|consen 1441 PESAEDFERLVRSSPNSSILWIRYMAFHLELS 1472 (1710)
T ss_pred CcCHHHHHHHHhcCCCcchHHHHHHHHHhhhh
Confidence 346789999999999999888665544 4555
No 305
>KOG1941|consensus
Probab=85.63 E-value=3.5 Score=31.51 Aligned_cols=33 Identities=15% Similarity=0.182 Sum_probs=27.9
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
...+..+|..+.+..++++|.-+-.+|.++-..
T Consensus 162 lqvcv~Lgslf~~l~D~~Kal~f~~kA~~lv~s 194 (518)
T KOG1941|consen 162 LQVCVSLGSLFAQLKDYEKALFFPCKAAELVNS 194 (518)
T ss_pred eehhhhHHHHHHHHHhhhHHhhhhHhHHHHHHh
Confidence 466778999999999999999999999886543
No 306
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=85.48 E-value=1.7 Score=33.59 Aligned_cols=46 Identities=13% Similarity=0.179 Sum_probs=42.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhcc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMND 47 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~ 47 (150)
.|..||..+++.+-|+.+--+.+-++|.++.-|+.-+.| +.+.++.
T Consensus 233 klv~CYL~~rkpdlALnh~hrsI~lnP~~frnHLrqAavfR~LeRy~ 279 (569)
T PF15015_consen 233 KLVTCYLRMRKPDLALNHSHRSINLNPSYFRNHLRQAAVFRRLERYS 279 (569)
T ss_pred HHHHhhhhcCCCchHHHHHhhhhhcCcchhhHHHHHHHHHHHHHHHH
Confidence 467899999999999999999999999999999999988 8888664
No 307
>KOG1070|consensus
Probab=85.48 E-value=12 Score=33.49 Aligned_cols=124 Identities=11% Similarity=0.077 Sum_probs=80.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCC--CCCCCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPR--TKPFLSD 79 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~--~~~~~~~ 79 (150)
|...|...+++++|.+.|+.-++-=.+-...|.+.++. ...+ +.++|- .-+.+++...|. |..
T Consensus 1536 L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~n--------e~~aa~--~lL~rAL~~lPk~eHv~---- 1601 (1710)
T KOG1070|consen 1536 LLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQN--------EAEAAR--ELLKRALKSLPKQEHVE---- 1601 (1710)
T ss_pred HHHHHHHhhcchhHHHHHHHHHHHhcchhhHHHHHHHHHhccc--------HHHHHH--HHHHHHHhhcchhhhHH----
Confidence 45667777888888888888887777677788888776 4444 223333 445556666665 222
Q ss_pred hHHHHHHHH------------------HhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHh--hcCCCcchh
Q psy3020 80 PSYVQMIKE------------------IQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIK--LDGTNHILF 139 (150)
Q Consensus 80 ~~~~~~~~~------------------l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~--~~p~~~~~~ 139 (150)
+...... +...| .+-+.|.-....-.+.|+-.-+-..|++++. +-|.....+
T Consensus 1602 --~IskfAqLEFk~GDaeRGRtlfEgll~ayP------KRtDlW~VYid~eik~~~~~~vR~lfeRvi~l~l~~kkmKff 1673 (1710)
T KOG1070|consen 1602 --FISKFAQLEFKYGDAERGRTLFEGLLSAYP------KRTDLWSVYIDMEIKHGDIKYVRDLFERVIELKLSIKKMKFF 1673 (1710)
T ss_pred --HHHHHHHHHhhcCCchhhHHHHHHHHhhCc------cchhHHHHHHHHHHccCCHHHHHHHHHHHHhcCCChhHhHHH
Confidence 2222111 22222 3367777777777888898999999999876 456677777
Q ss_pred hcHHHHHHh
Q psy3020 140 SNRSAAFAK 148 (150)
Q Consensus 140 ~n~a~~~~~ 148 (150)
|.+..-|-+
T Consensus 1674 fKkwLeyEk 1682 (1710)
T KOG1070|consen 1674 FKKWLEYEK 1682 (1710)
T ss_pred HHHHHHHHH
Confidence 777776654
No 308
>KOG4507|consensus
Probab=84.53 E-value=4.2 Score=33.01 Aligned_cols=104 Identities=12% Similarity=0.093 Sum_probs=63.3
Q ss_pred HHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHH----------
Q psy3020 18 STYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMI---------- 86 (150)
Q Consensus 18 ~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~---------- 86 (150)
.....+++-.|.+...|++-+.+ +..| ...+|+ +=+..++...|.+..-.-.-.....+
T Consensus 200 ~~~~~glq~~~~sw~lH~~as~YWR~~G--------~~~~A~--~Ca~~a~hf~~~h~kdi~lLSlaTiL~RaG~sadA~ 269 (886)
T KOG4507|consen 200 HLIHEGLQKNTSSWVLHNMASFYWRIKG--------EPYQAV--ECAMRALHFSSRHNKDIALLSLATVLHRAGFSADAA 269 (886)
T ss_pred HHHHHhhhcCchhHHHHHHHHHHHHHcC--------Chhhhh--HHHHHHhhhCCcccccchhhhHHHHHHHcccccchh
Confidence 45567888888888888887777 8888 555566 55555666666543310000000000
Q ss_pred ----HHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 87 ----KEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 87 ----~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
..++..+.. ...++-+|+++.++++|.-.+.+|..+.+.+|.-..
T Consensus 270 iILhAA~~dA~~~------t~n~y~l~~i~aml~~~N~S~~~ydha~k~~p~f~q 318 (886)
T KOG4507|consen 270 VILHAALDDADFF------TSNYYTLGNIYAMLGEYNHSVLCYDHALQARPGFEQ 318 (886)
T ss_pred heeehhccCCccc------cccceeHHHHHHHHhhhhhhhhhhhhhhccCcchhH
Confidence 001111111 334667888888899999999999999888886543
No 309
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=84.29 E-value=1.9 Score=29.14 Aligned_cols=32 Identities=25% Similarity=0.278 Sum_probs=28.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
..++..+|..+.+.|++++|+++|.++.+...
T Consensus 36 r~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~ 67 (177)
T PF10602_consen 36 RMALEDLADHYCKIGDLEEALKAYSRARDYCT 67 (177)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcC
Confidence 67889999999999999999999999877543
No 310
>KOG1585|consensus
Probab=84.26 E-value=6.8 Score=28.33 Aligned_cols=29 Identities=10% Similarity=0.061 Sum_probs=21.5
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQ 32 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~ 32 (150)
+.+|...++|++|..++.+|.+-.-+|..
T Consensus 38 AvafRnAk~feKakdcLlkA~~~yEnnrs 66 (308)
T KOG1585|consen 38 AVAFRNAKKFEKAKDCLLKASKGYENNRS 66 (308)
T ss_pred HHHHHhhccHHHHHHHHHHHHHHHHhccc
Confidence 56788888888888888888865554443
No 311
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=84.10 E-value=3.8 Score=27.33 Aligned_cols=43 Identities=9% Similarity=-0.070 Sum_probs=37.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEM 45 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~ 45 (150)
-|..+...|+|.+|+..++.+..-.|..+.+.-.++.| ..+++
T Consensus 50 ~~~l~i~r~~w~dA~rlLr~l~~~~~~~p~~kALlA~CL~~~~D 93 (160)
T PF09613_consen 50 DGWLHIVRGDWDDALRLLRELEERAPGFPYAKALLALCLYALGD 93 (160)
T ss_pred HHHHHHHhCCHHHHHHHHHHHhccCCCChHHHHHHHHHHHHcCC
Confidence 46788999999999999999999999999888888888 88774
No 312
>KOG3616|consensus
Probab=83.41 E-value=3.6 Score=34.42 Aligned_cols=20 Identities=0% Similarity=0.247 Sum_probs=16.5
Q ss_pred hhHHHHhhCCHHHHHHHHHH
Q psy3020 3 TYSCLSYLGRYKESISTYEE 22 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~ 22 (150)
-|..+.+..++++|+++|++
T Consensus 667 agdlfeki~d~dkale~fkk 686 (1636)
T KOG3616|consen 667 AGDLFEKIHDFDKALECFKK 686 (1636)
T ss_pred hhhHHHHhhCHHHHHHHHHc
Confidence 36778888899999998876
No 313
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=83.41 E-value=1.1 Score=34.82 Aligned_cols=29 Identities=31% Similarity=0.354 Sum_probs=26.1
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
+..|..+|.....+|+++-|.++|.++=.
T Consensus 347 ~~~W~~Lg~~AL~~g~~~lAe~c~~k~~d 375 (443)
T PF04053_consen 347 PEKWKQLGDEALRQGNIELAEECYQKAKD 375 (443)
T ss_dssp HHHHHHHHHHHHHTTBHHHHHHHHHHCT-
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHhhcC
Confidence 77999999999999999999999998643
No 314
>KOG0985|consensus
Probab=83.05 E-value=19 Score=31.58 Aligned_cols=41 Identities=15% Similarity=0.127 Sum_probs=31.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhcc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMND 47 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~ 47 (150)
.+|.+....|..++|++.|-+| +++..+....++ ...|.++
T Consensus 1109 qlakAQL~~~~v~dAieSyika-----dDps~y~eVi~~a~~~~~~e 1150 (1666)
T KOG0985|consen 1109 QLAKAQLQGGLVKDAIESYIKA-----DDPSNYLEVIDVASRTGKYE 1150 (1666)
T ss_pred HHHHHHHhcCchHHHHHHHHhc-----CCcHHHHHHHHHHHhcCcHH
Confidence 4788889999999999999775 566667777766 6666443
No 315
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=82.84 E-value=1.9 Score=24.11 Aligned_cols=23 Identities=22% Similarity=0.417 Sum_probs=17.2
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLK 25 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~ 25 (150)
.|.-.-+.|++++|++.|.++++
T Consensus 11 ~Av~~D~~g~~~~A~~~Y~~ai~ 33 (69)
T PF04212_consen 11 KAVEADEAGNYEEALELYKEAIE 33 (69)
T ss_dssp HHHHHHHTTSHHHHHHHHHHHHH
T ss_pred HHHHHHHCCCHHHHHHHHHHHHH
Confidence 35566778888888888887763
No 316
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=82.70 E-value=4.8 Score=25.53 Aligned_cols=33 Identities=21% Similarity=0.293 Sum_probs=28.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
+..-..+|..+...|++++|+.+|-+||.+-|+
T Consensus 63 Fl~qV~lGE~L~~~G~~~~aa~hf~nAl~V~~q 95 (121)
T PF02064_consen 63 FLQQVQLGEQLLAQGDYEEAAEHFYNALKVCPQ 95 (121)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHTSSS
T ss_pred HHHHHHHHHHHHhCCCHHHHHHHHHHHHHhCCC
Confidence 556677999999999999999999999999995
No 317
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=82.63 E-value=4.2 Score=23.47 Aligned_cols=30 Identities=33% Similarity=0.427 Sum_probs=21.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+..+...|...=..|+|++|++.|..+|+.
T Consensus 6 ai~Lv~~A~~eD~~gny~eA~~lY~~ale~ 35 (75)
T cd02680 6 AHFLVTQAFDEDEKGNAEEAIELYTEAVEL 35 (75)
T ss_pred HHHHHHHHHHhhHhhhHHHHHHHHHHHHHH
Confidence 334444454455789999999999999875
No 318
>KOG0546|consensus
Probab=82.44 E-value=2.7 Score=31.62 Aligned_cols=37 Identities=19% Similarity=0.236 Sum_probs=34.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
..+++.+|..+....++++|++.+..+....|++...
T Consensus 309 tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~~i 345 (372)
T KOG0546|consen 309 TKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDKAI 345 (372)
T ss_pred CcHHHHHHhHHHhhhchhhhHHHHHHhhccCcchHHH
Confidence 7889999999999999999999999999999988654
No 319
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=81.95 E-value=5 Score=26.51 Aligned_cols=43 Identities=16% Similarity=-0.011 Sum_probs=37.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEM 45 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~ 45 (150)
-|..+...|+|.+|+..+.....-.|..+.+.-.++.| +.+|+
T Consensus 50 dg~l~i~rg~w~eA~rvlr~l~~~~~~~p~~kAL~A~CL~al~D 93 (153)
T TIGR02561 50 DGWLLIARGNYDEAARILRELLSSAGAPPYGKALLALCLNAKGD 93 (153)
T ss_pred HHHHHHHcCCHHHHHHHHHhhhccCCCchHHHHHHHHHHHhcCC
Confidence 47789999999999999999988888888888888888 88884
No 320
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=81.91 E-value=1.8 Score=25.04 Aligned_cols=21 Identities=33% Similarity=0.638 Sum_probs=14.3
Q ss_pred hHHHHhhCCHHHHHHHHHHhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGL 24 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL 24 (150)
|.-.-+.|+|++|+.+|.++|
T Consensus 13 Ave~D~~g~y~eAl~~Y~~ai 33 (77)
T cd02683 13 AVELDQEGRFQEALVCYQEGI 33 (77)
T ss_pred HHHHHHhccHHHHHHHHHHHH
Confidence 445566788887777777665
No 321
>KOG2300|consensus
Probab=81.43 E-value=26 Score=28.04 Aligned_cols=104 Identities=12% Similarity=0.061 Sum_probs=60.1
Q ss_pred hhhHHHHhhC--CHHHHHHHHHHhhccCCCcH-HH--HHHHHHH--HhhhhccCCCCCchhhhhCChHHHHhhccCCCCC
Q psy3020 2 TTYSCLSYLG--RYKESISTYEEGLKLDPNNE-QM--KEAIKDV--RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTK 74 (150)
Q Consensus 2 ~lg~~~~~~~--~~~~A~~~~~~aL~~~p~~~-~a--~~~l~~~--~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~ 74 (150)
++|..+...| +...+++|.+..+...|.+- ++ +..+|.+ ...... +.|- ..+.++..+....+
T Consensus 12 GlAe~~rt~~PPkIkk~IkClqA~~~~~is~~veart~LqLg~lL~~yT~N~--------elAk--sHLekA~~i~~~ip 81 (629)
T KOG2300|consen 12 GLAEHFRTSGPPKIKKCIKCLQAIFQFQISFLVEARTHLQLGALLLRYTKNV--------ELAK--SHLEKAWLISKSIP 81 (629)
T ss_pred HHHHHHhhcCChhHHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHhccH--------HHHH--HHHHHHHHHHcccc
Confidence 5677788888 88999999999999999754 44 4566766 333322 2222 23333322221111
Q ss_pred CCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhc-CHHHHHHHHHHHHhhcCCCc
Q psy3020 75 PFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQAN-NFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 75 ~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~-~~~~A~~~~~~al~~~p~~~ 136 (150)
.+. +.|.++...++..+.... .+..|.....+|+++..+.|
T Consensus 82 ~fy---------------------dvKf~a~SlLa~lh~~~~~s~~~~KalLrkaielsq~~p 123 (629)
T KOG2300|consen 82 SFY---------------------DVKFQAASLLAHLHHQLAQSFPPAKALLRKAIELSQSVP 123 (629)
T ss_pred cHH---------------------hhhhHHHHHHHHHHHHhcCCCchHHHHHHHHHHHhcCCc
Confidence 111 022555555666665444 66667777777777766665
No 322
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=81.34 E-value=3.8 Score=27.04 Aligned_cols=47 Identities=15% Similarity=0.052 Sum_probs=37.9
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
+..-..-|.++...|+|.+|+..+++..+-.|..+..--.++.|+.-
T Consensus 44 ~e~d~~dg~l~i~rg~w~eA~rvlr~l~~~~~~~p~~kAL~A~CL~a 90 (153)
T TIGR02561 44 KELDMFDGWLLIARGNYDEAARILRELLSSAGAPPYGKALLALCLNA 90 (153)
T ss_pred cccchhHHHHHHHcCCHHHHHHHHHhhhccCCCchHHHHHHHHHHHh
Confidence 44455668888999999999999999999988878776667777654
No 323
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=81.19 E-value=9.7 Score=27.88 Aligned_cols=48 Identities=19% Similarity=0.207 Sum_probs=42.0
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..++..+...+...|+++.+++.+++.+.++|-+...|..+=.+|.+.
T Consensus 153 ~~~l~~lae~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~ 200 (280)
T COG3629 153 IKALTKLAEALIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYLVN 200 (280)
T ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHHHhcCccchHHHHHHHHHHHHc
Confidence 677788888888999999999999999999999999988877777653
No 324
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=81.09 E-value=13 Score=27.46 Aligned_cols=21 Identities=14% Similarity=0.184 Sum_probs=15.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEE 22 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~ 22 (150)
.++.||...|+.+.|...+..
T Consensus 173 ~la~~~l~~g~~e~A~~iL~~ 193 (304)
T COG3118 173 LLAECLLAAGDVEAAQAILAA 193 (304)
T ss_pred HHHHHHHHcCChHHHHHHHHh
Confidence 578899999998876665544
No 325
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=80.92 E-value=6.3 Score=22.43 Aligned_cols=28 Identities=36% Similarity=0.469 Sum_probs=18.0
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
..+...|...=+.|++++|+.+|.++++
T Consensus 9 ~~li~~Av~~d~~g~~~eAl~~Y~~a~e 36 (77)
T smart00745 9 KELISKALKADEAGDYEEALELYKKAIE 36 (77)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3344455555567888887777777765
No 326
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=80.66 E-value=1.9 Score=24.91 Aligned_cols=22 Identities=27% Similarity=0.363 Sum_probs=16.9
Q ss_pred hHHHHhhCCHHHHHHHHHHhhc
Q psy3020 4 YSCLSYLGRYKESISTYEEGLK 25 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~ 25 (150)
|.-.-+.|+|++|+..|..+++
T Consensus 13 Ave~D~~g~y~eA~~~Y~~aie 34 (76)
T cd02681 13 AVQRDQEGRYSEAVFYYKEAAQ 34 (76)
T ss_pred HHHHHHccCHHHHHHHHHHHHH
Confidence 4455677899998888888874
No 327
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=80.51 E-value=6.6 Score=22.40 Aligned_cols=29 Identities=28% Similarity=0.354 Sum_probs=19.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
+..+...|.-.=..|+|++|+..|.++++
T Consensus 6 A~~l~~~Av~~D~~g~y~eA~~~Y~~aie 34 (75)
T cd02678 6 AIELVKKAIEEDNAGNYEEALRLYQHALE 34 (75)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 34445555555577888888888887766
No 328
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=80.07 E-value=4.5 Score=18.46 Aligned_cols=28 Identities=14% Similarity=0.116 Sum_probs=23.8
Q ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 104 LLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 104 ~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
.|..+-..+.+.|+++.|.+.|....+.
T Consensus 3 ty~~ll~a~~~~g~~~~a~~~~~~M~~~ 30 (34)
T PF13812_consen 3 TYNALLRACAKAGDPDAALQLFDEMKEQ 30 (34)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 5677788889999999999999988763
No 329
>KOG1464|consensus
Probab=79.60 E-value=7.2 Score=28.70 Aligned_cols=53 Identities=17% Similarity=0.245 Sum_probs=38.7
Q ss_pred chhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 53 PFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 53 ~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+.++|+ ..+.+.+.+.++-.+|= +.++..+..+.++.++|++..+.|++.+..
T Consensus 42 ~p~~Al--~sF~kVlelEgEKgeWG------------------------FKALKQmiKI~f~l~~~~eMm~~Y~qlLTY 94 (440)
T KOG1464|consen 42 EPKEAL--SSFQKVLELEGEKGEWG------------------------FKALKQMIKINFRLGNYKEMMERYKQLLTY 94 (440)
T ss_pred CHHHHH--HHHHHHHhcccccchhH------------------------HHHHHHHHHHHhccccHHHHHHHHHHHHHH
Confidence 445566 66666666666543332 678888899999999999999999888753
No 330
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=79.14 E-value=21 Score=26.14 Aligned_cols=44 Identities=20% Similarity=0.190 Sum_probs=38.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~ 46 (150)
++..+...|+++.+..++++.+..||-+..+|..+-.. ...|+.
T Consensus 159 lae~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~g~~ 203 (280)
T COG3629 159 LAEALIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQ 203 (280)
T ss_pred HHHHHHhcccHHHHHHHHHHHHhcCccchHHHHHHHHHHHHcCCc
Confidence 56778888999999999999999999999999988777 777743
No 331
>KOG2396|consensus
Probab=79.04 E-value=8.8 Score=30.49 Aligned_cols=43 Identities=9% Similarity=0.088 Sum_probs=36.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
+..|........+.+.+.+.-..|.+++..+|+++.+|...|.
T Consensus 105 ~~lW~~yi~f~kk~~~~~~v~ki~~~~l~~Hp~~~dLWI~aA~ 147 (568)
T KOG2396|consen 105 VKLWLSYIAFCKKKKTYGEVKKIFAAMLAKHPNNPDLWIYAAK 147 (568)
T ss_pred HHHHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCchhHHhhhh
Confidence 6777777766667777999999999999999999999987664
No 332
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=78.65 E-value=4.9 Score=23.21 Aligned_cols=37 Identities=14% Similarity=0.129 Sum_probs=22.3
Q ss_pred hHHHHhhCCHHHHHHHHHHh-------hccCCCcHHHHHHHHHH
Q psy3020 4 YSCLSYLGRYKESISTYEEG-------LKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~a-------L~~~p~~~~a~~~l~~~ 40 (150)
|.-+-+.|++++|+.+|+++ ++..||++.-..-...+
T Consensus 13 AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~~yr~ki 56 (75)
T cd02682 13 AVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRLIYEQMI 56 (75)
T ss_pred HHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHHHHHHHH
Confidence 34455667777766666655 55777777654444444
No 333
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=78.63 E-value=7.7 Score=26.94 Aligned_cols=43 Identities=9% Similarity=0.110 Sum_probs=34.4
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCC----cHHHHHHHHHH-Hhhhhc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPN----NEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~----~~~a~~~l~~~-~~~g~~ 46 (150)
||+.| ...+.++|+..|-++|++.+. |++++.+|+.+ ...++.
T Consensus 147 LAtyY-~krD~~Kt~~ll~~~L~l~~~~~~~n~eil~sLas~~~~~~~~ 194 (203)
T PF11207_consen 147 LATYY-TKRDPEKTIQLLLRALELSNPDDNFNPEILKSLASIYQKLKNY 194 (203)
T ss_pred HHHHH-HccCHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHhcch
Confidence 34444 478899999999999998764 58999999999 888843
No 334
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=77.55 E-value=9.2 Score=21.95 Aligned_cols=30 Identities=17% Similarity=0.131 Sum_probs=20.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+..+...|...=..|+|++|+..|..+|+.
T Consensus 6 Ai~lv~~Av~~D~~g~y~eA~~lY~~ale~ 35 (75)
T cd02684 6 AIALVVQAVKKDQRGDAAAALSLYCSALQY 35 (75)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHHH
Confidence 344445555555778888888888887763
No 335
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=77.32 E-value=8 Score=29.87 Aligned_cols=29 Identities=28% Similarity=0.324 Sum_probs=26.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
...++..|-+++..++|.+|++.|+..+.
T Consensus 164 is~~YyvGFaylMlrRY~DAir~f~~iL~ 192 (404)
T PF10255_consen 164 ISTYYYVGFAYLMLRRYADAIRTFSQILL 192 (404)
T ss_pred eehHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 67889999999999999999999999875
No 336
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=77.28 E-value=9.6 Score=21.60 Aligned_cols=26 Identities=31% Similarity=0.489 Sum_probs=16.9
Q ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 105 LKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 105 ~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
+...|...=..|++++|+.+|..+++
T Consensus 9 l~~~Av~~D~~g~~~~Al~~Y~~a~e 34 (75)
T cd02656 9 LIKQAVKEDEDGNYEEALELYKEALD 34 (75)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 34444444466888888887777765
No 337
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=76.79 E-value=4.4 Score=28.90 Aligned_cols=29 Identities=17% Similarity=0.251 Sum_probs=25.4
Q ss_pred HHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 16 SISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 16 A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
|..+|.+|+.+.|++...+..+|.+ ...|
T Consensus 1 A~~~Y~~A~~l~P~~G~p~nQLAvl~~~~~ 30 (278)
T PF10373_consen 1 AERYYRKAIRLLPSNGNPYNQLAVLASYQG 30 (278)
T ss_dssp HHHHHHHHHHH-TTBSHHHHHHHHHHHHTT
T ss_pred CHHHHHHHHHhCCCCCCcccchhhhhcccc
Confidence 7889999999999999999999998 6655
No 338
>KOG2300|consensus
Probab=76.73 E-value=18 Score=28.80 Aligned_cols=110 Identities=19% Similarity=0.134 Sum_probs=66.2
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCC-cHHHHH--HHHHH-HhhhhccCCCCCchhhhhCChHHHHhh-ccCCCCCCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPN-NEQMKE--AIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQL-QLDPRTKPF 76 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~-~~~a~~--~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l-~~~~~~~~~ 76 (150)
.+|.....-+-++.|..+|..|+++... +..+.. .+|.+ ...++. .++.+.+ .+.|.+...
T Consensus 372 LlGlys~sv~~~enAe~hf~~a~k~t~~~dl~a~~nlnlAi~YL~~~~~--------------ed~y~~ld~i~p~nt~s 437 (629)
T KOG2300|consen 372 LLGLYSHSVNCYENAEFHFIEATKLTESIDLQAFCNLNLAISYLRIGDA--------------EDLYKALDLIGPLNTNS 437 (629)
T ss_pred HHhhHhhhcchHHHHHHHHHHHHHhhhHHHHHHHHHHhHHHHHHHhccH--------------HHHHHHHHhcCCCCCCc
Confidence 3566667778899999999999987653 444443 34444 334422 1222222 223332211
Q ss_pred CCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHH
Q psy3020 77 LSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAA 145 (150)
Q Consensus 77 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~ 145 (150)
+. ... .+...++-.|-..+.++++.||.....+.++.- |+.-.+.+-.|
T Consensus 438 ~s----sq~--------------l~a~~~~v~glfaf~qn~lnEaK~~l~e~Lkma--naed~~rL~a~ 486 (629)
T KOG2300|consen 438 LS----SQR--------------LEASILYVYGLFAFKQNDLNEAKRFLRETLKMA--NAEDLNRLTAC 486 (629)
T ss_pred ch----HHH--------------HHHHHHHHHHHHHHHhccHHHHHHHHHHHHhhc--chhhHHHHHHH
Confidence 10 011 226677888989999999999999999999887 34333333333
No 339
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=76.27 E-value=9.6 Score=21.91 Aligned_cols=29 Identities=17% Similarity=0.267 Sum_probs=19.1
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
..+...+.-.=..|+|++|+.+|..+|+.
T Consensus 7 ~~l~~~Ave~d~~~~y~eA~~~Y~~~i~~ 35 (75)
T cd02677 7 AELIRLALEKEEEGDYEAAFEFYRAGVDL 35 (75)
T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHHHHH
Confidence 33444444445668888888888888763
No 340
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=76.12 E-value=33 Score=26.32 Aligned_cols=29 Identities=17% Similarity=0.237 Sum_probs=21.8
Q ss_pred CHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 117 NFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 117 ~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
..++|+..|.++.+++|+.... .|.+.++
T Consensus 241 ~ldkAi~~Y~kgFe~~~~~Y~G-IN~AtLL 269 (374)
T PF13281_consen 241 SLDKAIEWYRKGFEIEPDYYSG-INAATLL 269 (374)
T ss_pred HHHHHHHHHHHHHcCCccccch-HHHHHHH
Confidence 4889999999999999865444 4455444
No 341
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=76.01 E-value=3.6 Score=23.48 Aligned_cols=21 Identities=19% Similarity=0.374 Sum_probs=11.9
Q ss_pred hHHHHhhCCHHHHHHHHHHhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGL 24 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL 24 (150)
|..+-+.|++++|+.+|..++
T Consensus 15 Av~~d~~g~~~eAl~~Y~~a~ 35 (77)
T smart00745 15 ALKADEAGDYEEALELYKKAI 35 (77)
T ss_pred HHHHHHcCCHHHHHHHHHHHH
Confidence 344555666666666655544
No 342
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=75.55 E-value=5.5 Score=26.22 Aligned_cols=36 Identities=19% Similarity=0.188 Sum_probs=31.2
Q ss_pred hhhHHHHhhC-CHHHHHHHHHHhhccCCCcHHHHHHH
Q psy3020 2 TTYSCLSYLG-RYKESISTYEEGLKLDPNNEQMKEAI 37 (150)
Q Consensus 2 ~lg~~~~~~~-~~~~A~~~~~~aL~~~p~~~~a~~~l 37 (150)
.+|..+...| ++++|..+|-.||.+-|+-.+....+
T Consensus 95 ~~GE~L~~~g~~~~ega~hf~nAl~Vc~qP~~LL~iy 131 (148)
T TIGR00985 95 QLGEELMAQGTNVDEGAVHFYNALKVYPQPQQLLSIY 131 (148)
T ss_pred HHHHHHHhCCCchHHHHHHHHHHHHhCCCHHHHHHHH
Confidence 5788999999 99999999999999999977765544
No 343
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=75.10 E-value=9 Score=28.42 Aligned_cols=42 Identities=26% Similarity=0.187 Sum_probs=34.1
Q ss_pred HHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 107 DKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 107 ~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
..+..+...|.+.+|++...+++.+||=+...+.-+-.++..
T Consensus 284 kva~~yle~g~~neAi~l~qr~ltldpL~e~~nk~lm~~la~ 325 (361)
T COG3947 284 KVARAYLEAGKPNEAIQLHQRALTLDPLSEQDNKGLMASLAT 325 (361)
T ss_pred HHHHHHHHcCChHHHHHHHHHHhhcChhhhHHHHHHHHHHHH
Confidence 345556789999999999999999999988887766665544
No 344
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=74.78 E-value=2.9 Score=24.37 Aligned_cols=19 Identities=26% Similarity=0.475 Sum_probs=12.4
Q ss_pred CCHHHHHHHHHHhhccCCC
Q psy3020 11 GRYKESISTYEEGLKLDPN 29 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~ 29 (150)
+.|+.|.+..++||+.|-.
T Consensus 3 ~~~~~A~~~I~kaL~~dE~ 21 (79)
T cd02679 3 GYYKQAFEEISKALRADEW 21 (79)
T ss_pred hHHHHHHHHHHHHhhhhhc
Confidence 3466777777777776654
No 345
>KOG1914|consensus
Probab=73.92 E-value=35 Score=27.66 Aligned_cols=72 Identities=11% Similarity=0.133 Sum_probs=46.3
Q ss_pred HHhhccCCCcHHHHHHHHHHHhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHHHHhcCcchhhhHH
Q psy3020 21 EEGLKLDPNNEQMKEAIKDVRNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIKEIQKDPSLMTLEA 100 (150)
Q Consensus 21 ~~aL~~~p~~~~a~~~l~~~~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~ 100 (150)
++-++.+|.|..+|..|..-.... .+.+.- ..+...++-.|..
T Consensus 10 ~~rie~nP~di~sw~~lire~qt~--------~~~~~R--~~YEq~~~~FP~s--------------------------- 52 (656)
T KOG1914|consen 10 RERIEENPYDIDSWSQLIREAQTQ--------PIDKVR--ETYEQLVNVFPSS--------------------------- 52 (656)
T ss_pred HHHHhcCCccHHHHHHHHHHHccC--------CHHHHH--HHHHHHhccCCCC---------------------------
Confidence 677889999999999887651111 223333 3444444545543
Q ss_pred HhHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 101 KISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 101 ~~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
+.+|..-....+..++|+...+.|.+||.
T Consensus 53 -~r~W~~yi~~El~skdfe~VEkLF~RCLv 81 (656)
T KOG1914|consen 53 -PRAWKLYIERELASKDFESVEKLFSRCLV 81 (656)
T ss_pred -cHHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence 55666666666777788888888877775
No 346
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=73.78 E-value=11 Score=29.73 Aligned_cols=40 Identities=15% Similarity=0.069 Sum_probs=35.2
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
|.-++..|+|.++.-.-....+++| .+.++.-+|-+ ....
T Consensus 469 AEyLysqgey~kc~~ys~WL~~iaP-S~~~~RLlGl~l~e~k 509 (549)
T PF07079_consen 469 AEYLYSQGEYHKCYLYSSWLTKIAP-SPQAYRLLGLCLMENK 509 (549)
T ss_pred HHHHHhcccHHHHHHHHHHHHHhCC-cHHHHHHHHHHHHHHh
Confidence 4457889999999999999999999 89999999988 6666
No 347
>COG1747 Uncharacterized N-terminal domain of the transcription elongation factor GreA [Function unknown]
Probab=73.64 E-value=47 Score=26.90 Aligned_cols=131 Identities=15% Similarity=0.156 Sum_probs=72.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhh--CC--------hHHHHh-hcc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLF--SD--------PNIFVQ-LQL 69 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~--~~--------~~~~~~-l~~ 69 (150)
.++.||.+. .-+.=...+++..+.+=+++..-..|+.. .... .. .+...+.+++ .. ..++.. ...
T Consensus 104 el~q~y~en-~n~~l~~lWer~ve~dfnDvv~~ReLa~~yEkik-~s-k~a~~f~Ka~yrfI~~~q~~~i~evWeKL~~~ 180 (711)
T COG1747 104 ELLQCYKEN-GNEQLYSLWERLVEYDFNDVVIGRELADKYEKIK-KS-KAAEFFGKALYRFIPRRQNAAIKEVWEKLPEL 180 (711)
T ss_pred HHHHHHHhc-CchhhHHHHHHHHHhcchhHHHHHHHHHHHHHhc-hh-hHHHHHHHHHHHhcchhhhhhHHHHHHHHHHh
Confidence 356677776 44566677788777777777776677666 3322 11 0000111111 00 112222 122
Q ss_pred CCCCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHH-HHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHH
Q psy3020 70 DPRTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKG-NAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSA 144 (150)
Q Consensus 70 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g-~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~ 144 (150)
-|++.+ .++..+..+++..+. .+...+.+-- .-|-...+|.+|++.....++.|..+..+.-++-.
T Consensus 181 i~dD~D-----~fl~l~~kiqt~lg~----~~~~Vl~qdv~~~Ys~~eN~~eai~Ilk~il~~d~k~~~ar~~~i~ 247 (711)
T COG1747 181 IGDDKD-----FFLRLQKKIQTKLGE----GRGSVLMQDVYKKYSENENWTEAIRILKHILEHDEKDVWARKEIIE 247 (711)
T ss_pred ccccHH-----HHHHHHHHHHHhhcc----chHHHHHHHHHHHhccccCHHHHHHHHHHHhhhcchhhhHHHHHHH
Confidence 333333 456666666655443 2223333322 33346789999999999999999988777655543
No 348
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=73.18 E-value=32 Score=24.77 Aligned_cols=40 Identities=15% Similarity=0.047 Sum_probs=32.7
Q ss_pred hHHHHHHHHHHHH----hcCHHHHHHHHHHHHhhcCCCcchhhcHH
Q psy3020 102 ISLLKDKGNAALQ----ANNFKEAIEAYSEAIKLDGTNHILFSNRS 143 (150)
Q Consensus 102 ~~~~~~~g~~~~~----~~~~~~A~~~~~~al~~~p~~~~~~~n~a 143 (150)
+.+...+|.++.. ..++.+|+..|.++.+... ....++++
T Consensus 187 ~~a~~~lg~~y~~G~Gv~~d~~~A~~wy~~Aa~~g~--~~a~~~~~ 230 (292)
T COG0790 187 PDAQLLLGRMYEKGLGVPRDLKKAFRWYKKAAEQGD--GAACYNLG 230 (292)
T ss_pred HHHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHCCC--HHHHHHHH
Confidence 6677888877753 4689999999999999988 77777777
No 349
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=72.08 E-value=41 Score=25.58 Aligned_cols=36 Identities=19% Similarity=0.205 Sum_probs=27.3
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCC-cHHHHHHHHHH
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPN-NEQMKEAIKDV 40 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~-~~~a~~~l~~~ 40 (150)
..+.+.|-+..|++..+-.|.+||+ |+-.-.-..+.
T Consensus 111 ~~L~~RG~~rTAlE~~KlLlsLdp~~DP~g~ll~ID~ 147 (360)
T PF04910_consen 111 QSLGRRGCWRTALEWCKLLLSLDPDEDPLGVLLFIDY 147 (360)
T ss_pred HHHHhcCcHHHHHHHHHHHHhcCCCCCcchhHHHHHH
Confidence 4566778899999999999999998 77654444443
No 350
>KOG4056|consensus
Probab=71.45 E-value=8.9 Score=24.89 Aligned_cols=36 Identities=11% Similarity=0.056 Sum_probs=31.9
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAI 37 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l 37 (150)
.+|..|+..|+++++..++-.||.+.|.-...+.-+
T Consensus 86 ~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqLL~vl 121 (143)
T KOG4056|consen 86 QLGEELLAQGNEEEGAEHLANAIVVCGQPAQLLQVL 121 (143)
T ss_pred HhHHHHHHccCHHHHHHHHHHHHhhcCCHHHHHHHH
Confidence 579999999999999999999999999988776544
No 351
>PF12583 TPPII_N: Tripeptidyl peptidase II N terminal; InterPro: IPR022232 This entry represents a region of approximately 190 amino acids in length and is found in association with PF00082 from PFAM. The members are serine peptidases belonging to MEROPS peptidase family S8A, tripeptidyl peptidase II (TPPII), clan SB. They are a crucial component of the proteolytic cascade acting downstream of the 26S proteasome in the ubiquitin-proteasome pathway. It is an amino peptidase belonging to the subtilase family removing tripeptides from the free N terminus of oligopeptides. ; PDB: 3LXU_X.
Probab=70.62 E-value=20 Score=23.16 Aligned_cols=44 Identities=11% Similarity=0.009 Sum_probs=31.1
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAA 145 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~ 145 (150)
.++++.+-+..+.+-+.+.|...|.+.++.+|++-.++.-+-..
T Consensus 76 ~EaLRDfq~~~iaKle~e~Ae~vY~el~~~~P~HLpaHla~i~~ 119 (139)
T PF12583_consen 76 SEALRDFQCSWIAKLEPENAEQVYEELLEAHPDHLPAHLAMIQN 119 (139)
T ss_dssp HHHHHHHHHHHHTTS-HHHHHHHHHHHHHH-TT-THHHHHHHHH
T ss_pred HHHHHHHHHHHHHhhCHHHHHHHHHHHHHHCcchHHHHHHHHHc
Confidence 45555556666677888999999999999999998877655443
No 352
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=70.41 E-value=13 Score=25.65 Aligned_cols=40 Identities=13% Similarity=0.308 Sum_probs=33.3
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
...+.+.|.+++|.+.+++... ||++..-...|..+ +..+
T Consensus 118 V~VCm~~g~Fk~A~eiLkr~~~-d~~~~~~r~kL~~II~~Kd 158 (200)
T cd00280 118 VAVCMENGEFKKAEEVLKRLFS-DPESQKLRMKLLMIIREKD 158 (200)
T ss_pred HHHHHhcCchHHHHHHHHHHhc-CCCchhHHHHHHHHHHccc
Confidence 3567889999999999999998 99998887777777 5544
No 353
>KOG2047|consensus
Probab=70.19 E-value=63 Score=26.94 Aligned_cols=31 Identities=23% Similarity=0.219 Sum_probs=27.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
...|..++.-|.+.|.|+.|-..|++++..-
T Consensus 248 g~Lw~SLAdYYIr~g~~ekarDvyeeai~~v 278 (835)
T KOG2047|consen 248 GFLWCSLADYYIRSGLFEKARDVYEEAIQTV 278 (835)
T ss_pred HHHHHHHHHHHHHhhhhHHHHHHHHHHHHhh
Confidence 5678899999999999999999999998743
No 354
>KOG3617|consensus
Probab=69.78 E-value=6.6 Score=33.34 Aligned_cols=29 Identities=7% Similarity=0.022 Sum_probs=24.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
-.+.+.+|..|-..|++.+|+..|++|-.
T Consensus 967 ~AAcYhlaR~YEn~g~v~~Av~FfTrAqa 995 (1416)
T KOG3617|consen 967 KAACYHLARMYENDGDVVKAVKFFTRAQA 995 (1416)
T ss_pred HHHHHHHHHHhhhhHHHHHHHHHHHHHHH
Confidence 56677889889899999999999988744
No 355
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=69.30 E-value=10 Score=17.47 Aligned_cols=30 Identities=20% Similarity=0.278 Sum_probs=21.8
Q ss_pred HHHHHHHHHHHH----hcCHHHHHHHHHHHHhhc
Q psy3020 103 SLLKDKGNAALQ----ANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 103 ~~~~~~g~~~~~----~~~~~~A~~~~~~al~~~ 132 (150)
.+...+|..+.. ..+..+|+..|+++.+..
T Consensus 2 ~a~~~lg~~~~~G~g~~~d~~~A~~~~~~Aa~~g 35 (36)
T smart00671 2 EAQYNLGQMYEYGLGVKKDLEKALEYYKKAAELG 35 (36)
T ss_pred HHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHcc
Confidence 356667777753 358899999999987653
No 356
>PF12583 TPPII_N: Tripeptidyl peptidase II N terminal; InterPro: IPR022232 This entry represents a region of approximately 190 amino acids in length and is found in association with PF00082 from PFAM. The members are serine peptidases belonging to MEROPS peptidase family S8A, tripeptidyl peptidase II (TPPII), clan SB. They are a crucial component of the proteolytic cascade acting downstream of the 26S proteasome in the ubiquitin-proteasome pathway. It is an amino peptidase belonging to the subtilase family removing tripeptides from the free N terminus of oligopeptides. ; PDB: 3LXU_X.
Probab=68.92 E-value=10 Score=24.48 Aligned_cols=33 Identities=18% Similarity=0.094 Sum_probs=25.2
Q ss_pred HhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 8 SYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
...-+.+.|-..|+..++..|++..+|..+...
T Consensus 87 iaKle~e~Ae~vY~el~~~~P~HLpaHla~i~~ 119 (139)
T PF12583_consen 87 IAKLEPENAEQVYEELLEAHPDHLPAHLAMIQN 119 (139)
T ss_dssp HTTS-HHHHHHHHHHHHHH-TT-THHHHHHHHH
T ss_pred HHhhCHHHHHHHHHHHHHHCcchHHHHHHHHHc
Confidence 334456789999999999999999999988776
No 357
>KOG2997|consensus
Probab=68.43 E-value=8.1 Score=28.86 Aligned_cols=41 Identities=20% Similarity=0.212 Sum_probs=35.5
Q ss_pred HHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 100 AKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 100 ~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
.++.+++..|...-+.|..-+|+..|+.|+++.|+-...+.
T Consensus 17 kkA~~l~~~av~~Eq~G~l~dai~fYR~AlqI~~diEs~~r 57 (366)
T KOG2997|consen 17 KKAIALYEKAVLKEQDGSLYDAINFYRDALQIVPDIESKYR 57 (366)
T ss_pred HHHHHHHHHHHHHhhcCcHHHHHHHHHhhhcCCchHHHHHH
Confidence 45778888888888999999999999999999998766655
No 358
>KOG2581|consensus
Probab=68.18 E-value=13 Score=28.77 Aligned_cols=35 Identities=14% Similarity=0.179 Sum_probs=32.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
+..++.+|.+..-+++|+.|.+++..|+..-|++.
T Consensus 247 ARY~yY~GrIkaiqldYssA~~~~~qa~rkapq~~ 281 (493)
T KOG2581|consen 247 ARYLYYLGRIKAIQLDYSSALEYFLQALRKAPQHA 281 (493)
T ss_pred HHHHHHHhhHHHhhcchhHHHHHHHHHHHhCcchh
Confidence 77788899999999999999999999999999854
No 359
>KOG3783|consensus
Probab=68.01 E-value=21 Score=28.55 Aligned_cols=46 Identities=4% Similarity=0.128 Sum_probs=40.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
...++.+|+...-+.+|..|-.++.....+..++...|.+++.|.+
T Consensus 303 ~l~~fE~aw~~v~~~~~~~aad~~~~L~desdWS~a~Y~Yfa~cc~ 348 (546)
T KOG3783|consen 303 SLMVFERAWLSVGQHQYSRAADSFDLLRDESDWSHAFYTYFAGCCL 348 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhhhhHHHHHHHHHHHH
Confidence 5677889999999999999999999999999999999999984443
No 360
>KOG0276|consensus
Probab=67.82 E-value=8.9 Score=31.28 Aligned_cols=30 Identities=30% Similarity=0.309 Sum_probs=27.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
...|..+|......+++..|.+||.++-.+
T Consensus 666 ~~Kw~~Lg~~al~~~~l~lA~EC~~~a~d~ 695 (794)
T KOG0276|consen 666 EVKWRQLGDAALSAGELPLASECFLRARDL 695 (794)
T ss_pred hHHHHHHHHHHhhcccchhHHHHHHhhcch
Confidence 788999999999999999999999998654
No 361
>KOG4151|consensus
Probab=67.79 E-value=23 Score=29.57 Aligned_cols=61 Identities=16% Similarity=0.275 Sum_probs=38.3
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCC----cHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCC
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPN----NEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPR 72 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~----~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~ 72 (150)
|+.++..+++.+|..-|..++.+-|. .+..+...+.+ ..+|-. ++..++ ++..-++...|.
T Consensus 60 ~n~~~~K~d~~~~~~~~~~~~~llp~~~~~~a~~~~~~~s~~m~~~l~------~~~~~~--~E~~la~~~~p~ 125 (748)
T KOG4151|consen 60 GNKLFQKRDYEGAMFRYDCAIKLLPKDHHVVATLRSNQASCYMQLGLG------EYPKAI--PECELALESQPR 125 (748)
T ss_pred hhHHhhhhhhhccchhhhhhheeccccchhhhhHHHHHHHHHhhcCcc------chhhhc--Cchhhhhhccch
Confidence 66788889999999999999999982 22333444555 444422 445555 444445555554
No 362
>PF02184 HAT: HAT (Half-A-TPR) repeat; InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=67.69 E-value=6.9 Score=18.60 Aligned_cols=26 Identities=15% Similarity=0.174 Sum_probs=20.3
Q ss_pred CHHHHHHHHHHhhccCCCcHHHHHHHH
Q psy3020 12 RYKESISTYEEGLKLDPNNEQMKEAIK 38 (150)
Q Consensus 12 ~~~~A~~~~~~aL~~~p~~~~a~~~l~ 38 (150)
.++.|...|++.+...|+ +..|...|
T Consensus 2 E~dRAR~IyeR~v~~hp~-~k~WikyA 27 (32)
T PF02184_consen 2 EFDRARSIYERFVLVHPE-VKNWIKYA 27 (32)
T ss_pred hHHHHHHHHHHHHHhCCC-chHHHHHH
Confidence 578999999999999987 55555444
No 363
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=67.28 E-value=59 Score=25.45 Aligned_cols=38 Identities=13% Similarity=-0.016 Sum_probs=20.8
Q ss_pred hhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhc
Q psy3020 9 YLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 9 ~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~ 46 (150)
..|..+.|+.+-++|-..-|.-+.+....-.. -.-|++
T Consensus 166 r~GareaAr~yAe~Aa~~Ap~l~WA~~AtLe~r~~~gdW 204 (531)
T COG3898 166 RLGAREAARHYAERAAEKAPQLPWAARATLEARCAAGDW 204 (531)
T ss_pred hcccHHHHHHHHHHHHhhccCCchHHHHHHHHHHhcCCh
Confidence 34566666666666666666666655544333 333433
No 364
>KOG2047|consensus
Probab=67.04 E-value=74 Score=26.56 Aligned_cols=122 Identities=11% Similarity=0.134 Sum_probs=66.3
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcH----HHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC-
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNE----QMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP- 75 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~----~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~- 75 (150)
.+|..|...|+.+.|...|++|+++.=.-. .+|..-|.. .... .++.|+ .-+.++.. .|..+.
T Consensus 392 ~faklYe~~~~l~~aRvifeka~~V~y~~v~dLa~vw~~waemElrh~--------~~~~Al--~lm~~A~~-vP~~~~~ 460 (835)
T KOG2047|consen 392 EFAKLYENNGDLDDARVIFEKATKVPYKTVEDLAEVWCAWAEMELRHE--------NFEAAL--KLMRRATH-VPTNPEL 460 (835)
T ss_pred HHHHHHHhcCcHHHHHHHHHHhhcCCccchHHHHHHHHHHHHHHHhhh--------hHHHHH--HHHHhhhc-CCCchhh
Confidence 367889999999999999999999875433 344444544 2222 344555 33333322 233221
Q ss_pred -CCCC-----------hHHHHHHHHHhcCcchhhh---------HHH---hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 76 -FLSD-----------PSYVQMIKEIQKDPSLMTL---------EAK---ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 76 -~~~~-----------~~~~~~~~~l~~~~~~~~~---------~~~---~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+++. ...-.+..+++..-.-... +-+ |....+.+..+-...-|+++.+.|++.+.+
T Consensus 461 ~~yd~~~pvQ~rlhrSlkiWs~y~DleEs~gtfestk~vYdriidLriaTPqii~NyAmfLEeh~yfeesFk~YErgI~L 540 (835)
T KOG2047|consen 461 EYYDNSEPVQARLHRSLKIWSMYADLEESLGTFESTKAVYDRIIDLRIATPQIIINYAMFLEEHKYFEESFKAYERGISL 540 (835)
T ss_pred hhhcCCCcHHHHHHHhHHHHHHHHHHHHHhccHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhHHHHHHHHHHHcCCcc
Confidence 1111 0000011111111000000 000 555666777777778889999999999998
Q ss_pred cCC
Q psy3020 132 DGT 134 (150)
Q Consensus 132 ~p~ 134 (150)
.|.
T Consensus 541 Fk~ 543 (835)
T KOG2047|consen 541 FKW 543 (835)
T ss_pred CCC
Confidence 874
No 365
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=66.27 E-value=18 Score=23.80 Aligned_cols=33 Identities=15% Similarity=0.160 Sum_probs=29.5
Q ss_pred hHHHHHHHHHHHHhc-CHHHHHHHHHHHHhhcCC
Q psy3020 102 ISLLKDKGNAALQAN-NFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~-~~~~A~~~~~~al~~~p~ 134 (150)
+..-..+|..+...| +.+++..+|-+||.+-|+
T Consensus 90 Fl~eV~~GE~L~~~g~~~~ega~hf~nAl~Vc~q 123 (148)
T TIGR00985 90 FLQEVQLGEELMAQGTNVDEGAVHFYNALKVYPQ 123 (148)
T ss_pred HHHHHHHHHHHHhCCCchHHHHHHHHHHHHhCCC
Confidence 566677999999999 999999999999999985
No 366
>KOG1258|consensus
Probab=65.77 E-value=73 Score=25.98 Aligned_cols=105 Identities=9% Similarity=-0.039 Sum_probs=58.1
Q ss_pred hhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHHHHH
Q psy3020 9 YLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQMIK 87 (150)
Q Consensus 9 ~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 87 (150)
..|++..|...|++...--|+..++-..-+.. +..|..+ +... . ..+.........+...+
T Consensus 378 ~~~n~~~A~~~lq~i~~e~pg~v~~~l~~~~~e~r~~~~~-----~~~~-~--~~l~s~~~~~~~~~~i~---------- 439 (577)
T KOG1258|consen 378 SNGNFDDAKVILQRIESEYPGLVEVVLRKINWERRKGNLE-----DANY-K--NELYSSIYEGKENNGIL---------- 439 (577)
T ss_pred hhccHHHHHHHHHHHHhhCCchhhhHHHHHhHHHHhcchh-----hhhH-H--HHHHHHhcccccCcchh----------
Confidence 34678888888888877777777776666666 5555332 1110 0 01111111111111000
Q ss_pred HHhcCcchhhhHHHhHHHHHHHHHH-HHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHH
Q psy3020 88 EIQKDPSLMTLEAKISLLKDKGNAA-LQANNFKEAIEAYSEAIKLDGTNHILFSNRSAA 145 (150)
Q Consensus 88 ~l~~~~~~~~~~~~~~~~~~~g~~~-~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~ 145 (150)
...+....+.. .-.++-++|...+.+++++.|++-.+|..+-..
T Consensus 440 --------------~~l~~~~~r~~~~i~~d~~~a~~~l~~~~~~~~~~k~~~~~~~~~ 484 (577)
T KOG1258|consen 440 --------------EKLYVKFARLRYKIREDADLARIILLEANDILPDCKVLYLELIRF 484 (577)
T ss_pred --------------HHHHHHHHHHHHHHhcCHHHHHHHHHHhhhcCCccHHHHHHHHHH
Confidence 22233333333 246888999999999999999987777654443
No 367
>KOG1585|consensus
Probab=65.41 E-value=25 Score=25.56 Aligned_cols=24 Identities=21% Similarity=0.279 Sum_probs=16.1
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhcc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKL 26 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~ 26 (150)
.|....++..+.++.+.|++|..+
T Consensus 77 aamLake~~klsEvvdl~eKAs~l 100 (308)
T KOG1585|consen 77 AAMLAKELSKLSEVVDLYEKASEL 100 (308)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHH
Confidence 345566667777777777777653
No 368
>KOG3617|consensus
Probab=64.55 E-value=89 Score=27.13 Aligned_cols=32 Identities=16% Similarity=0.342 Sum_probs=27.9
Q ss_pred hhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHH
Q psy3020 97 TLEAKISLLKDKGNAALQANNFKEAIEAYSEA 128 (150)
Q Consensus 97 ~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~a 128 (150)
.+..+...+..+|..++++|.|..|.+-|++|
T Consensus 1140 ~e~~R~~vLeqvae~c~qQG~Yh~AtKKfTQA 1171 (1416)
T KOG3617|consen 1140 NEQERKQVLEQVAELCLQQGAYHAATKKFTQA 1171 (1416)
T ss_pred cHHHHHHHHHHHHHHHHhccchHHHHHHHhhh
Confidence 33466888999999999999999999999987
No 369
>PRK15180 Vi polysaccharide biosynthesis protein TviD; Provisional
Probab=64.22 E-value=13 Score=29.66 Aligned_cols=118 Identities=5% Similarity=-0.026 Sum_probs=63.7
Q ss_pred hhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhh-CChHHHHhhccC---------CCCCCCC
Q psy3020 9 YLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLF-SDPNIFVQLQLD---------PRTKPFL 77 (150)
Q Consensus 9 ~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~-~~~~~~~~l~~~---------~~~~~~~ 77 (150)
..|+.-.|-+-..-+|+-.|.++..-.-.+.+ ..+| +++.++ ...+....+..- ..+...-
T Consensus 301 ~~gd~~aas~~~~~~lr~~~~~p~~i~l~~~i~~~lg--------~ye~~~~~~s~~~~~~~s~~~~~~~~~r~~~~l~r 372 (831)
T PRK15180 301 ADGDIIAASQQLFAALRNQQQDPVLIQLRSVIFSHLG--------YYEQAYQDISDVEKIIGTTDSTLRCRLRSLHGLAR 372 (831)
T ss_pred hccCHHHHHHHHHHHHHhCCCCchhhHHHHHHHHHhh--------hHHHHHHHhhchhhhhcCCchHHHHHHHhhhchhh
Confidence 34677777888888899999888877777888 8888 444444 111111111111 1000000
Q ss_pred CChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 78 SDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 78 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
-+..+.-+...+....+. ++...-.+...-..|-++++.-++++.+.++|.....|.
T Consensus 373 ~~~a~s~a~~~l~~eie~------~ei~~iaa~sa~~l~~~d~~~~~wk~~~~~~~~~~~g~v 429 (831)
T PRK15180 373 WREALSTAEMMLSNEIED------EEVLTVAAGSADALQLFDKSYHYWKRVLLLNPETQSGWV 429 (831)
T ss_pred HHHHHHHHHHHhccccCC------hhheeeecccHHHHhHHHHHHHHHHHHhccCChhcccce
Confidence 000111111112222211 333333344445678899999999999999998554443
No 370
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=64.09 E-value=13 Score=27.35 Aligned_cols=30 Identities=27% Similarity=0.283 Sum_probs=28.1
Q ss_pred CCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 11 GRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
+..++++..|.+|++++|+...++...|..
T Consensus 272 ~~~~~~~~~~~~a~~~~~~~~k~~~~~a~~ 301 (352)
T PF02259_consen 272 ESSDEILKYYKEATKLDPSWEKAWHSWALF 301 (352)
T ss_pred ccHHHHHHHHHHHHHhChhHHHHHHHHHHH
Confidence 888999999999999999999999999876
No 371
>KOG1310|consensus
Probab=63.48 E-value=11 Score=30.43 Aligned_cols=47 Identities=15% Similarity=0.171 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHH---hcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 103 SLLKDKGNAALQ---ANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 103 ~~~~~~g~~~~~---~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
..+-+++.++++ .|+.-.|++.+..|+++||..-.+|+.++.++.++
T Consensus 409 ~~l~nraa~lmkRkW~~d~~~AlrDch~Alrln~s~~kah~~la~aL~el 458 (758)
T KOG1310|consen 409 YLLENRAAALMKRKWRGDSYLALRDCHVALRLNPSIQKAHFRLARALNEL 458 (758)
T ss_pred HHHHhHHHHHHhhhccccHHHHHHhHHhhccCChHHHHHHHHHHHHHHHH
Confidence 334444444443 35666777777777777777777777777776654
No 372
>KOG4056|consensus
Probab=63.11 E-value=15 Score=23.83 Aligned_cols=37 Identities=24% Similarity=0.212 Sum_probs=31.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
+..-..+|..++..|+++++..++-.||.+-|.-..+
T Consensus 81 Fmqqv~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqL 117 (143)
T KOG4056|consen 81 FMQQVQLGEELLAQGNEEEGAEHLANAIVVCGQPAQL 117 (143)
T ss_pred HHHHHHhHHHHHHccCHHHHHHHHHHHHhhcCCHHHH
Confidence 5555779999999999999999999999999975443
No 373
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=62.70 E-value=22 Score=25.33 Aligned_cols=43 Identities=9% Similarity=0.082 Sum_probs=30.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc------HHHHHHHHHH-Hhhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN------EQMKEAIKDV-RNQE 44 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~------~~a~~~l~~~-~~~g 44 (150)
.+|.-|+..|+|++|++.|+.++..-... ..+...+..| ..+|
T Consensus 183 ~~A~ey~~~g~~~~A~~~l~~~~~~yr~egW~~l~~~~l~~l~~Ca~~~~ 232 (247)
T PF11817_consen 183 EMAEEYFRLGDYDKALKLLEPAASSYRREGWWSLLTEVLWRLLECAKRLG 232 (247)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHhC
Confidence 47899999999999999999996543321 1233455556 6666
No 374
>PF08238 Sel1: Sel1 repeat; InterPro: IPR006597 Sel1-like repeats are tetratricopeptide repeat sequences originally identified in a Caenorhabditis elegans receptor molecule which is a key negative regulator of the Notch pathway []. Mammalian homologues have since been identified although these mainly pancreatic proteins have yet to have a function assigned.; PDB: 2XM6_A 3RJV_A 1OUV_A 1KLX_A.
Probab=62.63 E-value=16 Score=17.21 Aligned_cols=30 Identities=13% Similarity=0.234 Sum_probs=19.4
Q ss_pred HHHHHHH--HHHHHh-----cCHHHHHHHHHHHHhhc
Q psy3020 103 SLLKDKG--NAALQA-----NNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 103 ~~~~~~g--~~~~~~-----~~~~~A~~~~~~al~~~ 132 (150)
.+...+| ..+... .++++|++.|+++.+..
T Consensus 2 ~A~~~lg~~~~~~~g~~g~~~d~~~A~~~~~~Aa~~g 38 (39)
T PF08238_consen 2 EAQYNLGMYYMYYNGKGGVPKDYEKAFKWYEKAAEQG 38 (39)
T ss_dssp HHHHHHHHHHHHHHTSTSSCHHHHHHHHHHHHHHHTT
T ss_pred hHHHHHHHHHhhhhccCCccccccchHHHHHHHHHcc
Confidence 4566667 333322 25789999999987653
No 375
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=62.41 E-value=53 Score=25.67 Aligned_cols=36 Identities=19% Similarity=0.259 Sum_probs=27.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
..++..--...++.++|.-|-..-++.|++.|....
T Consensus 300 ~LaLr~AM~~~~K~KNf~tAa~FArRLLel~p~~~~ 335 (422)
T PF06957_consen 300 ILALRSAMSQAFKLKNFITAASFARRLLELNPSPEV 335 (422)
T ss_dssp HHHHHHHHHHCCCTTBHHHHHHHHHHHHCT--SCHH
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHHcCCCHHH
Confidence 555666666678999999999999999999997543
No 376
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=62.22 E-value=26 Score=20.39 Aligned_cols=30 Identities=13% Similarity=0.051 Sum_probs=20.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+-.....|..+=..|+.++|+.+|++++..
T Consensus 8 A~~~I~kaL~~dE~g~~e~Al~~Y~~gi~~ 37 (79)
T cd02679 8 AFEEISKALRADEWGDKEQALAHYRKGLRE 37 (79)
T ss_pred HHHHHHHHhhhhhcCCHHHHHHHHHHHHHH
Confidence 334455555555668888888888888763
No 377
>KOG3807|consensus
Probab=61.97 E-value=28 Score=26.51 Aligned_cols=28 Identities=14% Similarity=0.150 Sum_probs=23.9
Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 106 KDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 106 ~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
..++.+..++|+..+|++.++...+-.|
T Consensus 279 RRLAMCARklGrlrEA~K~~RDL~ke~p 306 (556)
T KOG3807|consen 279 RRLAMCARKLGRLREAVKIMRDLMKEFP 306 (556)
T ss_pred HHHHHHHHHhhhHHHHHHHHHHHhhhcc
Confidence 4567777899999999999999888777
No 378
>KOG1464|consensus
Probab=61.22 E-value=21 Score=26.37 Aligned_cols=37 Identities=14% Similarity=0.181 Sum_probs=27.9
Q ss_pred hCCHHHHHHHHHHhhccCCCcHH----HHHHHHHH-Hhhhhc
Q psy3020 10 LGRYKESISTYEEGLKLDPNNEQ----MKEAIKDV-RNQEMN 46 (150)
Q Consensus 10 ~~~~~~A~~~~~~aL~~~p~~~~----a~~~l~~~-~~~g~~ 46 (150)
...+++|+..|+++|+++|.-.+ ++..+..+ ..++.+
T Consensus 40 e~~p~~Al~sF~kVlelEgEKgeWGFKALKQmiKI~f~l~~~ 81 (440)
T KOG1464|consen 40 EDEPKEALSSFQKVLELEGEKGEWGFKALKQMIKINFRLGNY 81 (440)
T ss_pred ccCHHHHHHHHHHHHhcccccchhHHHHHHHHHHHHhccccH
Confidence 35789999999999999998654 44555566 677744
No 379
>PRK15180 Vi polysaccharide biosynthesis protein TviD; Provisional
Probab=61.00 E-value=88 Score=25.30 Aligned_cols=94 Identities=16% Similarity=0.225 Sum_probs=53.8
Q ss_pred hhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhcc--C-CCCCchhhhhCChHHHHhhccCCCCCCCCCChHHHH
Q psy3020 9 YLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMND--M-NRGDPFANLFSDPNIFVQLQLDPRTKPFLSDPSYVQ 84 (150)
Q Consensus 9 ~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~--~-~~~~~~~~a~~~~~~~~~l~~~~~~~~~~~~~~~~~ 84 (150)
..|+.++|+..+-..-+++|+-........++ +..++-. + .....++. .++.+.+-
T Consensus 710 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~---------------- 769 (831)
T PRK15180 710 TEGRLDEALSVLISLKRIEPDVSRLMREYKQIIRLFNESRKDGGSTITSYEH----LDYAKKLL---------------- 769 (831)
T ss_pred ccccHHHHHHHHHhhhccCccHHHHHHHHHHHHHHhhhhcccCCcccchhhh----Hhhhhhhe----------------
Confidence 35889999999999999999987766555554 3333110 0 00001110 11111110
Q ss_pred HHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 85 MIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 85 ~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
--+. ..+.++...+.-.+...+|.+|++++.+.-+.+.
T Consensus 770 -----~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 807 (831)
T PRK15180 770 -----VFDS------ENAYALKYAALNAMHLRDYTQALQYWQRLEKVNG 807 (831)
T ss_pred -----eecc------chHHHHHHHHhhHhHHHHHHHHHHHHHHHHhccC
Confidence 0011 1144455555556788999999999999888764
No 380
>COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]
Probab=60.87 E-value=92 Score=25.52 Aligned_cols=41 Identities=17% Similarity=0.000 Sum_probs=25.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHH-HHhhcCCCcchhhcH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSE-AIKLDGTNHILFSNR 142 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~-al~~~p~~~~~~~n~ 142 (150)
.....+++......|....++..+.+ +....|++......+
T Consensus 101 ~~~~~~L~~ale~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 142 (620)
T COG3914 101 CPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHL 142 (620)
T ss_pred chHHHHHHHHHHHhhhHHHHHHHHHHHHHhcCcchHHHHhhH
Confidence 55566666666555555555555555 777888777665555
No 381
>KOG0546|consensus
Probab=60.68 E-value=19 Score=27.32 Aligned_cols=42 Identities=17% Similarity=0.415 Sum_probs=34.6
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
.|.++....++++|++++..+....|++......+..+ ....
T Consensus 315 r~~~~~~~~~~~~a~~~~~~a~~~~p~d~~i~~~~~~~~~~~~ 357 (372)
T KOG0546|consen 315 RGQAYKLLKNYDEALEDLKKAKQKAPNDKAIEEELENVRQKKK 357 (372)
T ss_pred HHhHHHhhhchhhhHHHHHHhhccCcchHHHHHHHHHhhhHHH
Confidence 46778888999999999999999999999888777766 4444
No 382
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=59.52 E-value=16 Score=27.47 Aligned_cols=42 Identities=10% Similarity=0.105 Sum_probs=35.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRS 143 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a 143 (150)
+..|.....--.+.|-|.+.-..|.+++..+|.|..+|...+
T Consensus 107 ~k~w~~y~~Y~~k~k~y~~~~nI~~~~l~khP~nvdlWI~~c 148 (435)
T COG5191 107 PKIWSQYAAYVIKKKMYGEMKNIFAECLTKHPLNVDLWIYCC 148 (435)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeec
Confidence 677777776667888999999999999999999999998643
No 383
>KOG2422|consensus
Probab=59.34 E-value=97 Score=25.44 Aligned_cols=33 Identities=18% Similarity=0.221 Sum_probs=27.7
Q ss_pred HhhCCHHHHHHHHHHhhccCCC-cHHHHHHHHHH
Q psy3020 8 SYLGRYKESISTYEEGLKLDPN-NEQMKEAIKDV 40 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~~p~-~~~a~~~l~~~ 40 (150)
.+.|=+..|.+.++-.|++||. |+-+-..+.++
T Consensus 353 ~~RGC~rTA~E~cKlllsLdp~eDPl~~l~~ID~ 386 (665)
T KOG2422|consen 353 AQRGCWRTALEWCKLLLSLDPSEDPLGILYLIDI 386 (665)
T ss_pred HhcCChHHHHHHHHHHhhcCCcCCchhHHHHHHH
Confidence 4557788999999999999998 88877776666
No 384
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=59.25 E-value=57 Score=22.57 Aligned_cols=34 Identities=18% Similarity=0.216 Sum_probs=27.8
Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 106 KDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 106 ~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
.....++++.|.|++|.+.+.+..+ +|++.....
T Consensus 115 ~~aV~VCm~~g~Fk~A~eiLkr~~~-d~~~~~~r~ 148 (200)
T cd00280 115 EQAVAVCMENGEFKKAEEVLKRLFS-DPESQKLRM 148 (200)
T ss_pred HHHHHHHHhcCchHHHHHHHHHHhc-CCCchhHHH
Confidence 3455678899999999999999999 888876633
No 385
>PF07219 HemY_N: HemY protein N-terminus; InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=57.28 E-value=43 Score=20.51 Aligned_cols=39 Identities=18% Similarity=0.225 Sum_probs=30.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
.......|.+.+-.|++..|.+...++-+..++.+..|.
T Consensus 59 a~~al~~Gl~al~~G~~~~A~k~~~~a~~~~~~~~l~~L 97 (108)
T PF07219_consen 59 AQRALSRGLIALAEGDWQRAEKLLAKAAKLSDNPLLNYL 97 (108)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHHHHH
Confidence 666677888889999999999999999777554444444
No 386
>KOG2041|consensus
Probab=57.05 E-value=73 Score=27.01 Aligned_cols=26 Identities=23% Similarity=0.225 Sum_probs=15.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSE 127 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~ 127 (150)
...+-.+|..+...|-.++|+++|-+
T Consensus 852 s~llp~~a~mf~svGMC~qAV~a~Lr 877 (1189)
T KOG2041|consen 852 SELLPVMADMFTSVGMCDQAVEAYLR 877 (1189)
T ss_pred cchHHHHHHHHHhhchHHHHHHHHHh
Confidence 34445556666666666666666654
No 387
>KOG2422|consensus
Probab=55.57 E-value=1.2e+02 Score=25.03 Aligned_cols=120 Identities=15% Similarity=0.123 Sum_probs=65.7
Q ss_pred HhhCCHHHHHHHHHHhhcc------------CCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCC
Q psy3020 8 SYLGRYKESISTYEEGLKL------------DPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTK 74 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~------------~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~ 74 (150)
.+..-|++|...|.-|..+ .|-+.+....++.+ ...|+.+..++ -+..++ =.+.. ...|...
T Consensus 249 ~hs~sYeqaq~~F~~av~~~d~n~v~~lL~ssPYHvdsLLqva~~~r~qgD~e~aad-LieR~L--y~~d~--a~hp~F~ 323 (665)
T KOG2422|consen 249 EHSNSYEQAQRDFYLAVIVHDPNNVLILLISSPYHVDSLLQVADIFRFQGDREMAAD-LIERGL--YVFDR--ALHPNFI 323 (665)
T ss_pred ecchHHHHHHHHHHHHHhhcCCcceeeeeccCCcchhHHHHHHHHHHHhcchhhHHH-HHHHHH--HHHHH--Hhccccc
Confidence 3456677888888777654 34556677899999 88886542222 222222 11111 2233332
Q ss_pred CCCCCh--HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC-CcchhhcHH
Q psy3020 75 PFLSDP--SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT-NHILFSNRS 143 (150)
Q Consensus 75 ~~~~~~--~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~-~~~~~~n~a 143 (150)
.+.... .|. .|.. .....+++.--..+.++|-+.-|.+.+.-.++++|. +|.+-..+-
T Consensus 324 ~~sg~cRL~y~--------~~eN---R~FyL~l~r~m~~l~~RGC~rTA~E~cKlllsLdp~eDPl~~l~~I 384 (665)
T KOG2422|consen 324 PFSGNCRLPYI--------YPEN---RQFYLALFRYMQSLAQRGCWRTALEWCKLLLSLDPSEDPLGILYLI 384 (665)
T ss_pred cccccccCccc--------chhh---HHHHHHHHHHHHHHHhcCChHHHHHHHHHHhhcCCcCCchhHHHHH
Confidence 221110 000 0000 000334444444556899999999999999999999 776544433
No 388
>PF07219 HemY_N: HemY protein N-terminus; InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=55.08 E-value=47 Score=20.33 Aligned_cols=42 Identities=12% Similarity=-0.051 Sum_probs=28.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
-|..-...|+++.|.+...++-+..++..-.+..-+.. ..+|
T Consensus 65 ~Gl~al~~G~~~~A~k~~~~a~~~~~~~~l~~L~AA~AA~~~g 107 (108)
T PF07219_consen 65 RGLIALAEGDWQRAEKLLAKAAKLSDNPLLNYLLAARAAQAQG 107 (108)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHcC
Confidence 46667788999999999999977654444444444555 5544
No 389
>KOG4814|consensus
Probab=54.94 E-value=38 Score=28.13 Aligned_cols=37 Identities=14% Similarity=0.258 Sum_probs=30.7
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIK 38 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~ 38 (150)
.+.-||.++.+.+.|++.+++|=+.||.++--...+.
T Consensus 399 ~l~~CYL~L~QLD~A~E~~~EAE~~d~~~~l~q~~~~ 435 (872)
T KOG4814|consen 399 ALQVCYLKLEQLDNAVEVYQEAEEVDRQSPLCQLLML 435 (872)
T ss_pred HHHHHHhhHHHHHHHHHHHHHHHhhccccHHHHHHHH
Confidence 3678999999999999999999999999886554443
No 390
>KOG1586|consensus
Probab=54.91 E-value=80 Score=22.93 Aligned_cols=103 Identities=10% Similarity=0.048 Sum_probs=52.5
Q ss_pred hhhHHHHhh-CCHHHHHHHHHHhhc---cCCCcHHHHHHH---HHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCC
Q psy3020 2 TTYSCLSYL-GRYKESISTYEEGLK---LDPNNEQMKEAI---KDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRT 73 (150)
Q Consensus 2 ~lg~~~~~~-~~~~~A~~~~~~aL~---~~p~~~~a~~~l---~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~ 73 (150)
.||..|... .+++.|+.+|++|-+ .+..++-+...+ +.+ -..+ ++.+++ ..+....+..-++
T Consensus 118 ~iaEiyEsdl~d~ekaI~~YE~Aae~yk~ees~ssANKC~lKvA~yaa~le--------qY~~Ai--~iyeqva~~s~~n 187 (288)
T KOG1586|consen 118 EIAEIYESDLQDFEKAIAHYEQAAEYYKGEESVSSANKCLLKVAQYAAQLE--------QYSKAI--DIYEQVARSSLDN 187 (288)
T ss_pred hHHHHHhhhHHHHHHHHHHHHHHHHHHcchhhhhhHHHHHHHHHHHHHHHH--------HHHHHH--HHHHHHHHHhccc
Confidence 356666655 788888888888865 333344444433 333 4444 555555 3332222211111
Q ss_pred CCCCCChHHHHHHHHHhcCcchhhhHHHhHHH-HHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 74 KPFLSDPSYVQMIKEIQKDPSLMTLEAKISLL-KDKGNAALQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 74 ~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~-~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
.. .+. .+..+ ...|.+++-..+.-.+-..+++-.+++|...
T Consensus 188 ~L-------LKy---------------s~KdyflkAgLChl~~~D~v~a~~ALeky~~~dP~F~ 229 (288)
T KOG1586|consen 188 NL-------LKY---------------SAKDYFLKAGLCHLCKADEVNAQRALEKYQELDPAFT 229 (288)
T ss_pred hH-------HHh---------------HHHHHHHHHHHHhHhcccHHHHHHHHHHHHhcCCccc
Confidence 10 000 02222 3334444555676777777777777777643
No 391
>KOG0985|consensus
Probab=54.23 E-value=33 Score=30.21 Aligned_cols=29 Identities=31% Similarity=0.394 Sum_probs=25.5
Q ss_pred HHhHHHHHHHHHHHHhcCHHHHHHHHHHH
Q psy3020 100 AKISLLKDKGNAALQANNFKEAIEAYSEA 128 (150)
Q Consensus 100 ~~~~~~~~~g~~~~~~~~~~~A~~~~~~a 128 (150)
+.+..|..++.+..+.|...+|++.|-+|
T Consensus 1102 n~p~vWsqlakAQL~~~~v~dAieSyika 1130 (1666)
T KOG0985|consen 1102 NEPAVWSQLAKAQLQGGLVKDAIESYIKA 1130 (1666)
T ss_pred CChHHHHHHHHHHHhcCchHHHHHHHHhc
Confidence 33889999999999999999999999775
No 392
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=53.02 E-value=59 Score=20.91 Aligned_cols=43 Identities=19% Similarity=0.270 Sum_probs=27.5
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhh-------ccCCCcHHHHH----HHHHH-Hhhh
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGL-------KLDPNNEQMKE----AIKDV-RNQE 44 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL-------~~~p~~~~a~~----~l~~~-~~~g 44 (150)
+|..++..+|+|++++..-+++| +++.+....|. ..+.. ..+|
T Consensus 60 ~Ls~A~~~Lgry~e~L~sA~~aL~YFNRRGEL~qdeGklWIaaVfsra~Al~~~G 114 (144)
T PF12968_consen 60 GLSGALAGLGRYDECLQSADRALRYFNRRGELHQDEGKLWIAAVFSRAVALEGLG 114 (144)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHHHHHH--TTSTHHHHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHhhccHHHHHHHHHHHHHHHhhccccccccchhHHHHHHHHHHHHHhcC
Confidence 46788899999998766655555 57777666553 33444 5666
No 393
>COG4259 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=52.66 E-value=32 Score=21.26 Aligned_cols=37 Identities=8% Similarity=0.041 Sum_probs=33.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
+..+..+|..+...|+-+.|++.|+.--.+.|++...
T Consensus 72 PG~HAhLGlLys~~G~~e~a~~eFetEKalFPES~~f 108 (121)
T COG4259 72 PGYHAHLGLLYSNSGKDEQAVREFETEKALFPESGVF 108 (121)
T ss_pred CcHHHHHHHHHhhcCChHHHHHHHHHhhhhCccchhH
Confidence 7788999999999999999999999999999987643
No 394
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=52.01 E-value=70 Score=22.67 Aligned_cols=47 Identities=13% Similarity=-0.028 Sum_probs=35.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
...+.++|.+....|+.-+|+-+|-+++-..--.+.+.-|+...+.+
T Consensus 16 G~p~nQLAvl~~~~~~~l~avy~y~Rsl~~~~Pf~~A~~NL~~lf~~ 62 (278)
T PF10373_consen 16 GNPYNQLAVLASYQGDDLDAVYYYIRSLAVRIPFPSARENLQKLFEK 62 (278)
T ss_dssp SHHHHHHHHHHHHTT-HHHHHHHHHHHHSSSB--HHHHHHHHHHHHH
T ss_pred CCcccchhhhhccccchHHHHHHHHHHHhcCCCcHHHHHHHHHHHHH
Confidence 67899999999999999999999999987554457777777766554
No 395
>PF09670 Cas_Cas02710: CRISPR-associated protein (Cas_Cas02710)
Probab=51.91 E-value=1.1e+02 Score=23.53 Aligned_cols=30 Identities=23% Similarity=0.125 Sum_probs=22.9
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcHH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNEQ 32 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~ 32 (150)
.+..+++.++|..|.+.+...++-=|.+..
T Consensus 137 ~a~~l~n~~~y~aA~~~l~~l~~rl~~~~~ 166 (379)
T PF09670_consen 137 RAKELFNRYDYGAAARILEELLRRLPGREE 166 (379)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHHhCCchhh
Confidence 356678999999999999999985344333
No 396
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=51.87 E-value=77 Score=22.95 Aligned_cols=37 Identities=19% Similarity=0.137 Sum_probs=28.3
Q ss_pred HhhCCHHHHHHHHHHhhccC-CCcHHHHHHHHHH-Hhhh
Q psy3020 8 SYLGRYKESISTYEEGLKLD-PNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 8 ~~~~~~~~A~~~~~~aL~~~-p~~~~a~~~l~~~-~~~g 44 (150)
...|+++.|..+|.|+=... .-++.....++.+ +..|
T Consensus 4 ~~~~~~~~A~~~~~K~~~~~~~~~~~~~~~La~~~yn~G 42 (278)
T PF08631_consen 4 WKQGDLDLAEHMYSKAKDLLNSLDPDMAEELARVCYNIG 42 (278)
T ss_pred hhhCCHHHHHHHHHHhhhHHhcCCcHHHHHHHHHHHHHH
Confidence 46799999999999998776 5566666677776 6666
No 397
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=51.32 E-value=59 Score=23.31 Aligned_cols=61 Identities=11% Similarity=0.155 Sum_probs=48.4
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCC
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKP 75 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~ 75 (150)
..+...+...+|+...+.-++.+|.+......+-++ --.| +..+|. ..+.-+-.+.|++..
T Consensus 9 seLL~~~sL~dai~~a~~qVkakPtda~~RhflfqLlcvaG--------dw~kAl--~Ql~l~a~l~p~~t~ 70 (273)
T COG4455 9 SELLDDNSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAG--------DWEKAL--AQLNLAATLSPQDTV 70 (273)
T ss_pred HHHHHhccHHHHHHHHHHHHhcCCccccchhHHHHHHhhcc--------hHHHHH--HHHHHHhhcCcccch
Confidence 457788999999999999999999999988888777 7788 556666 666666677776553
No 398
>PF05053 Menin: Menin; InterPro: IPR007747 MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23 [].; GO: 0005634 nucleus; PDB: 3RE2_A 3U84_B 3U86_A 3U88_B 3U85_A.
Probab=51.18 E-value=1.4e+02 Score=24.53 Aligned_cols=30 Identities=17% Similarity=0.327 Sum_probs=25.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
...|..+|..+++.++|.+|+.++-+|-..
T Consensus 318 vYPYty~gg~~yR~~~~~eA~~~Wa~aa~V 347 (618)
T PF05053_consen 318 VYPYTYLGGYYYRHKRYREALRSWAEAADV 347 (618)
T ss_dssp SHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred cccceehhhHHHHHHHHHHHHHHHHHHHHH
Confidence 788899999999999999999999888653
No 399
>COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones]
Probab=50.52 E-value=1.1e+02 Score=23.09 Aligned_cols=33 Identities=6% Similarity=-0.029 Sum_probs=28.0
Q ss_pred HHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 100 AKISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 100 ~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
...++|.++|..+.+.++-+.+.+.+.+.++..
T Consensus 113 e~~ea~~n~aeyY~qi~D~~ng~~~~~~~~~~a 145 (412)
T COG5187 113 EGSEADRNIAEYYCQIMDIQNGFEWMRRLMRDA 145 (412)
T ss_pred HHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHH
Confidence 338999999999999999999999888877643
No 400
>KOG4459|consensus
Probab=50.37 E-value=31 Score=27.13 Aligned_cols=41 Identities=12% Similarity=0.186 Sum_probs=35.6
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
=.+|++.|++.+|++.-...|--+|++-.+...+... ..+|
T Consensus 140 ~~ay~k~~~l~kAv~aa~tflv~~Pdde~ik~~ldyYq~~l~ 181 (471)
T KOG4459|consen 140 QFAYFKVGELEKAVAAAHTFLVANPDDEDIKQNLDYYQTMLG 181 (471)
T ss_pred HHHHHHhhhHHHHHHhcceeeecCCcHHHHHHHHHHHHhccC
Confidence 3688999999999999999999999999998888877 5555
No 401
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=50.06 E-value=61 Score=22.61 Aligned_cols=58 Identities=9% Similarity=-0.047 Sum_probs=37.9
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHH-HHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQ-MKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDP 71 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~-a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~ 71 (150)
.||+++...|.+++|+..++..- +++... .....|++ ...| +-.+|. ..+..++..++
T Consensus 131 RLArvq~q~~k~D~AL~~L~t~~--~~~w~~~~~elrGDill~kg--------~k~~Ar--~ay~kAl~~~~ 190 (207)
T COG2976 131 RLARVQLQQKKADAALKTLDTIK--EESWAAIVAELRGDILLAKG--------DKQEAR--AAYEKALESDA 190 (207)
T ss_pred HHHHHHHHhhhHHHHHHHHhccc--cccHHHHHHHHhhhHHHHcC--------chHHHH--HHHHHHHHccC
Confidence 47888899999999998877632 222222 23445888 7888 445566 66666666654
No 402
>PF14689 SPOB_a: Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=49.22 E-value=44 Score=18.26 Aligned_cols=30 Identities=23% Similarity=0.025 Sum_probs=22.1
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
+.-....-..+.+.|++++|.++..+...-
T Consensus 23 ~~NhLqvI~gllqlg~~~~a~eYi~~~~~~ 52 (62)
T PF14689_consen 23 FLNHLQVIYGLLQLGKYEEAKEYIKELSKD 52 (62)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 555556666778999999999998887653
No 403
>KOG0890|consensus
Probab=49.14 E-value=2.6e+02 Score=27.23 Aligned_cols=120 Identities=10% Similarity=0.003 Sum_probs=66.6
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhc-cCCCCCCCCCChH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQ-LDPRTKPFLSDPS 81 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~-~~~~~~~~~~~~~ 81 (150)
|..-...|+++.|....-.|.+.. -++++...|.. +..| +-..|+ ..+...+. ..|+...-..+
T Consensus 1677 AriaR~aG~~q~A~nall~A~e~r--~~~i~~E~AK~lW~~g--------d~~~Al--~~Lq~~l~~~~~~~~~~~~~-- 1742 (2382)
T KOG0890|consen 1677 ARIARLAGHLQRAQNALLNAKESR--LPEIVLERAKLLWQTG--------DELNAL--SVLQEILSKNFPDLHTPYTD-- 1742 (2382)
T ss_pred HHHHHhcccHHHHHHHHHhhhhcc--cchHHHHHHHHHHhhc--------cHHHHH--HHHHHHHHhhcccccCCccc--
Confidence 334445688888888888888777 45677777888 7777 334455 44444442 22221111000
Q ss_pred HHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCH--HHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 82 YVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNF--KEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 82 ~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~--~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
.|..-+......+....+...-..+++ .+-++.|..+.+++|.....+|.+|.=|.
T Consensus 1743 ----------~p~~~n~~i~~~~~L~~~~~~~es~n~~s~~ilk~Y~~~~ail~ewe~~hy~l~~yy~ 1800 (2382)
T KOG0890|consen 1743 ----------TPQSVNLLIFKKAKLKITKYLEESGNFESKDILKYYHDAKAILPEWEDKHYHLGKYYD 1800 (2382)
T ss_pred ----------cchhhhhhhhhhHHHHHHHHHHHhcchhHHHHHHHHHHHHHHcccccCceeeHHHHHH
Confidence 000000000012222233333333443 47789999999999988888899985443
No 404
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=48.29 E-value=19 Score=23.66 Aligned_cols=22 Identities=23% Similarity=0.333 Sum_probs=17.5
Q ss_pred HHHHHHHHHhcCHHHHHHHHHH
Q psy3020 106 KDKGNAALQANNFKEAIEAYSE 127 (150)
Q Consensus 106 ~~~g~~~~~~~~~~~A~~~~~~ 127 (150)
.....+++++|+|++|++++..
T Consensus 107 r~vWk~yl~~~~fd~Al~~~~~ 128 (147)
T PF05131_consen 107 RDVWKIYLDKGDFDEALQYCKT 128 (147)
T ss_pred HHHHHHHHhcCcHHHHHHHccC
Confidence 3455668899999999998876
No 405
>COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=45.82 E-value=1.3e+02 Score=22.66 Aligned_cols=45 Identities=11% Similarity=0.002 Sum_probs=31.3
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhcc----CCC----cHHHHHHHHHH-Hhhhhcc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKL----DPN----NEQMKEAIKDV-RNQEMND 47 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~----~p~----~~~a~~~l~~~-~~~g~~~ 47 (150)
+++-..+.+++++|+..|.+.|.. |.. -......++.+ ..+|+..
T Consensus 9 ~a~~~v~~~~~~~ai~~yk~iL~kg~s~dek~~nEqE~tvlel~~lyv~~g~~~ 62 (421)
T COG5159 9 LANNAVKSNDIEKAIGEYKRILGKGVSKDEKTLNEQEATVLELFKLYVSKGDYC 62 (421)
T ss_pred HHHHhhhhhhHHHHHHHHHHHhcCCCChhhhhhhHHHHHHHHHHHHHHhcCCcc
Confidence 466677889999999999999976 211 11234667777 6777554
No 406
>PF15469 Sec5: Exocyst complex component Sec5
Probab=43.96 E-value=97 Score=20.77 Aligned_cols=21 Identities=24% Similarity=0.366 Sum_probs=15.5
Q ss_pred HHhhCCHHHHHHHHHHhhccC
Q psy3020 7 LSYLGRYKESISTYEEGLKLD 27 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~ 27 (150)
+.+.|+|+.++..|.+|..+-
T Consensus 96 ~i~~~dy~~~i~dY~kak~l~ 116 (182)
T PF15469_consen 96 CIKKGDYDQAINDYKKAKSLF 116 (182)
T ss_pred HHHcCcHHHHHHHHHHHHHHH
Confidence 356688888888888876543
No 407
>PF08626 TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC9, TRAPP II complex subunit; InterPro: IPR013935 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway. To date, two kinds of TRAPP complexes have been studied, TRAPPI and TRAPP II. These complexes differ in subunit composition []. TRAPP I binds vesicles derived from the endoplasmic reticulum bringing them closer to the acceptor membrane. Trs120 is a subunit specific to the TRAPP II complex [] along with Trs65p and Trs130p(TRAPPC10). It is suggested that Trs120p is required for the stability of the Trs130p subunit, suggesting that these two proteins might interact in some way []. It is likely that there is a complex function for TRAPP II in multiple pathways [].
Probab=43.17 E-value=33 Score=30.49 Aligned_cols=33 Identities=30% Similarity=0.391 Sum_probs=28.9
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
....+.+|..+...|++.+|++.|.+|++.-..
T Consensus 242 gR~~k~~gd~~LlaG~~~dAl~~y~~a~~~~k~ 274 (1185)
T PF08626_consen 242 GRLQKVLGDLYLLAGRWPDALKEYTEAIEILKS 274 (1185)
T ss_pred hhhhhhhhhHHHHcCCHHHHHHHHHHHHHHHhh
Confidence 566788999999999999999999999986553
No 408
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=42.49 E-value=1.7e+02 Score=23.10 Aligned_cols=33 Identities=12% Similarity=0.099 Sum_probs=20.0
Q ss_pred hhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhhc
Q psy3020 9 YLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMN 46 (150)
Q Consensus 9 ~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~ 46 (150)
+.|+++.|.+. ....++...|..||.. ...|+.
T Consensus 330 ~lg~L~~A~~~-----a~~~~~~~~W~~Lg~~AL~~g~~ 363 (443)
T PF04053_consen 330 QLGNLDIALEI-----AKELDDPEKWKQLGDEALRQGNI 363 (443)
T ss_dssp HCT-HHHHHHH-----CCCCSTHHHHHHHHHHHHHTTBH
T ss_pred hcCCHHHHHHH-----HHhcCcHHHHHHHHHHHHHcCCH
Confidence 44455544442 2334577788999988 777854
No 409
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=42.49 E-value=1.1e+02 Score=20.73 Aligned_cols=33 Identities=15% Similarity=0.003 Sum_probs=30.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
+..+..++.++...|+.++|.+...++..+.|.
T Consensus 144 ~~~~~~~a~~l~~~G~~~eA~~~~~~~~~lyP~ 176 (193)
T PF11846_consen 144 PNVYQRYALALALLGDPEEARQWLARARRLYPA 176 (193)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCc
Confidence 677888899999999999999999999999994
No 410
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=41.22 E-value=24 Score=27.32 Aligned_cols=22 Identities=27% Similarity=0.351 Sum_probs=20.7
Q ss_pred hhHHHHhhCCHHHHHHHHHHhh
Q psy3020 3 TYSCLSYLGRYKESISTYEEGL 24 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL 24 (150)
+|-||+=++||.+|++.|...|
T Consensus 170 vGFaylMlrRY~DAir~f~~iL 191 (404)
T PF10255_consen 170 VGFAYLMLRRYADAIRTFSQIL 191 (404)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 6999999999999999999887
No 411
>PF13226 DUF4034: Domain of unknown function (DUF4034)
Probab=40.71 E-value=51 Score=24.18 Aligned_cols=33 Identities=6% Similarity=0.153 Sum_probs=29.0
Q ss_pred HHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhhh
Q psy3020 13 YKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEM 45 (150)
Q Consensus 13 ~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~ 45 (150)
.+.|..++.+|+.++|..+.+...+..+ ...|+
T Consensus 115 ~d~A~~~ll~A~~l~pr~~~A~~~m~~~s~~fge 148 (277)
T PF13226_consen 115 CDQAVAALLKAIELSPRPVAAAIGMINISAYFGE 148 (277)
T ss_pred HHHHHHHHHHHHhcCCCchHHHHHHHHHHhhcCC
Confidence 4679999999999999999999999888 77773
No 412
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=40.19 E-value=1.8e+02 Score=22.88 Aligned_cols=110 Identities=14% Similarity=0.049 Sum_probs=64.8
Q ss_pred CHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC--C-----CCh--H
Q psy3020 12 RYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF--L-----SDP--S 81 (150)
Q Consensus 12 ~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~--~-----~~~--~ 81 (150)
+...|.++-.+++++.|+...+-..-+.. ...|+. .+.- .-+...-+..|.-..+ + .+. +
T Consensus 244 dp~~Ar~~A~~a~KL~pdlvPaav~AAralf~d~~~--------rKg~--~ilE~aWK~ePHP~ia~lY~~ar~gdta~d 313 (531)
T COG3898 244 DPASARDDALEANKLAPDLVPAAVVAARALFRDGNL--------RKGS--KILETAWKAEPHPDIALLYVRARSGDTALD 313 (531)
T ss_pred ChHHHHHHHHHHhhcCCccchHHHHHHHHHHhccch--------hhhh--hHHHHHHhcCCChHHHHHHHHhcCCCcHHH
Confidence 46678899999999999999887776666 555532 2221 2223333333321110 0 000 0
Q ss_pred HHHHHHHHh-cCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc
Q psy3020 82 YVQMIKEIQ-KDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI 137 (150)
Q Consensus 82 ~~~~~~~l~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~ 137 (150)
=.+....+. -.| +..+....+....+..|+|..|-..-+.+....|....
T Consensus 314 RlkRa~~L~slk~------nnaes~~~va~aAlda~e~~~ARa~Aeaa~r~~pres~ 364 (531)
T COG3898 314 RLKRAKKLESLKP------NNAESSLAVAEAALDAGEFSAARAKAEAAAREAPRESA 364 (531)
T ss_pred HHHHHHHHHhcCc------cchHHHHHHHHHHHhccchHHHHHHHHHHhhhCchhhH
Confidence 011111111 112 22667777888888899999999999999999986543
No 413
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=40.15 E-value=50 Score=23.50 Aligned_cols=32 Identities=9% Similarity=0.096 Sum_probs=26.0
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
......+|..++..|+|++|++.|+.+...-.
T Consensus 178 ~~l~~~~A~ey~~~g~~~~A~~~l~~~~~~yr 209 (247)
T PF11817_consen 178 SYLSLEMAEEYFRLGDYDKALKLLEPAASSYR 209 (247)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH
Confidence 34445689999999999999999999966544
No 414
>KOG1550|consensus
Probab=40.11 E-value=2e+02 Score=23.30 Aligned_cols=105 Identities=10% Similarity=0.018 Sum_probs=60.2
Q ss_pred hhhHHHHhhC-----CHHHHHHHHHHhhccCCCcHHHHHHHHHHHhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 2 TTYSCLSYLG-----RYKESISTYEEGLKLDPNNEQMKEAIKDVRNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 2 ~lg~~~~~~~-----~~~~A~~~~~~aL~~~p~~~~a~~~l~~~~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
.+|.+|.+-. +++.|+..|.++-++... .+...++.+...|. ...++..|+ .-+..+.
T Consensus 293 ~lg~~Y~~g~~~~~~d~~~A~~~~~~aA~~g~~--~a~~~lg~~~~~g~----~~~d~~~A~--~yy~~Aa--------- 355 (552)
T KOG1550|consen 293 GLGRLYLQGLGVEKIDYEKALKLYTKAAELGNP--DAQYLLGVLYETGT----KERDYRRAF--EYYSLAA--------- 355 (552)
T ss_pred HHHHHHhcCCCCccccHHHHHHHHHHHHhcCCc--hHHHHHHHHHHcCC----ccccHHHHH--HHHHHHH---------
Confidence 4788888843 678899999999877654 44455555522231 011344444 2111111
Q ss_pred CCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHH----HhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 77 LSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAAL----QANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 77 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~----~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
.... ..+...++.++. ...+...|+.+|.++.+.++ +.+.+-++..+
T Consensus 356 -------------~~G~--------~~A~~~la~~y~~G~gv~r~~~~A~~~~k~aA~~g~--~~A~~~~~~~~ 406 (552)
T KOG1550|consen 356 -------------KAGH--------ILAIYRLALCYELGLGVERNLELAFAYYKKAAEKGN--PSAAYLLGAFY 406 (552)
T ss_pred -------------HcCC--------hHHHHHHHHHHHhCCCcCCCHHHHHHHHHHHHHccC--hhhHHHHHHHH
Confidence 0111 445555565554 34688899999999999883 44444444443
No 415
>KOG2709|consensus
Probab=40.10 E-value=18 Score=28.20 Aligned_cols=24 Identities=21% Similarity=0.551 Sum_probs=21.9
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLK 25 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~ 25 (150)
..|.||-+.+++++|+..|+++|.
T Consensus 27 ~~gl~~dE~~~~e~a~~~Ye~gl~ 50 (560)
T KOG2709|consen 27 EQGLCYDEVNDWENALAMYEKGLN 50 (560)
T ss_pred HhhcchhhhcCHHHHHHHHHHHHH
Confidence 468899999999999999999986
No 416
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised.
Probab=39.83 E-value=51 Score=20.31 Aligned_cols=29 Identities=7% Similarity=-0.005 Sum_probs=25.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
..-+..++..|+.+|.+++|++...+...
T Consensus 39 ~~~~~eL~~lY~~kg~h~~AL~ll~~l~~ 67 (108)
T PF10366_consen 39 HGKYQELVDLYQGKGLHRKALELLKKLAD 67 (108)
T ss_pred cCCHHHHHHHHHccCccHHHHHHHHHHhc
Confidence 45678899999999999999999998877
No 417
>KOG1538|consensus
Probab=39.12 E-value=2e+02 Score=24.42 Aligned_cols=26 Identities=19% Similarity=0.279 Sum_probs=13.3
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHH
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEA 128 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~a 128 (150)
..+...|+.+.+..+|+||.+.|-+|
T Consensus 805 dVy~pyaqwLAE~DrFeEAqkAfhkA 830 (1081)
T KOG1538|consen 805 DVYMPYAQWLAENDRFEEAQKAFHKA 830 (1081)
T ss_pred cccchHHHHhhhhhhHHHHHHHHHHh
Confidence 34444555555555555555555443
No 418
>KOG2114|consensus
Probab=39.07 E-value=48 Score=28.29 Aligned_cols=29 Identities=24% Similarity=0.421 Sum_probs=26.3
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
...+...|.-++.+|+|++|+..|-++|.
T Consensus 368 ~~i~~kYgd~Ly~Kgdf~~A~~qYI~tI~ 396 (933)
T KOG2114|consen 368 AEIHRKYGDYLYGKGDFDEATDQYIETIG 396 (933)
T ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHcc
Confidence 56778889999999999999999999986
No 419
>KOG2114|consensus
Probab=37.62 E-value=2.7e+02 Score=24.12 Aligned_cols=29 Identities=24% Similarity=0.347 Sum_probs=23.7
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhc-cCCCcH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLK-LDPNNE 31 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~-~~p~~~ 31 (150)
-|.-++..|++++|...|-+.+. ++|...
T Consensus 374 Ygd~Ly~Kgdf~~A~~qYI~tI~~le~s~V 403 (933)
T KOG2114|consen 374 YGDYLYGKGDFDEATDQYIETIGFLEPSEV 403 (933)
T ss_pred HHHHHHhcCCHHHHHHHHHHHcccCChHHH
Confidence 47778999999999999999987 565433
No 420
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=36.79 E-value=1.6e+02 Score=21.13 Aligned_cols=46 Identities=17% Similarity=0.204 Sum_probs=34.4
Q ss_pred hHHHHHHHHHHHH----hcCHHHHHHHHHHHHhhcCCC-cchhhcHHHHHH
Q psy3020 102 ISLLKDKGNAALQ----ANNFKEAIEAYSEAIKLDGTN-HILFSNRSAAFA 147 (150)
Q Consensus 102 ~~~~~~~g~~~~~----~~~~~~A~~~~~~al~~~p~~-~~~~~n~a~~~~ 147 (150)
+.+...+|..+.. ..++.+|++.|++|.+..-.. ....++++.+|.
T Consensus 109 ~~a~~~lg~~~~~G~gv~~d~~~A~~~~~~Aa~~g~~~a~~~~~~l~~~~~ 159 (292)
T COG0790 109 AEALFNLGLMYANGRGVPLDLVKALKYYEKAAKLGNVEAALAMYRLGLAYL 159 (292)
T ss_pred HHHHHhHHHHHhcCCCcccCHHHHHHHHHHHHHcCChhHHHHHHHHHHHHH
Confidence 5677788888875 559999999999999886544 344666777664
No 421
>PF10952 DUF2753: Protein of unknown function (DUF2753); InterPro: IPR020206 This entry represents a group of uncharacterised proteins.
Probab=36.64 E-value=82 Score=20.33 Aligned_cols=30 Identities=17% Similarity=0.169 Sum_probs=24.9
Q ss_pred HHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 104 LLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 104 ~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
.+..+|...++.+++-.++-+|.+|+.+--
T Consensus 3 ~htllAd~a~~~~~~l~si~hYQqAls~se 32 (140)
T PF10952_consen 3 KHTLLADQAFKEADPLRSILHYQQALSLSE 32 (140)
T ss_pred hHHHHHHHHhhcccHHHHHHHHHHHHHHHH
Confidence 456788888999999999999999987543
No 422
>PF04097 Nic96: Nup93/Nic96; InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=36.23 E-value=2.2e+02 Score=23.44 Aligned_cols=34 Identities=18% Similarity=0.283 Sum_probs=19.9
Q ss_pred hHHHHHHHHHH--HHhcCHHHHHHHHHHHHhhcCCCc
Q psy3020 102 ISLLKDKGNAA--LQANNFKEAIEAYSEAIKLDGTNH 136 (150)
Q Consensus 102 ~~~~~~~g~~~--~~~~~~~~A~~~~~~al~~~p~~~ 136 (150)
...+..+..++ +..|+|++|++..++ +.+-|.++
T Consensus 503 ~~~Ll~L~~ff~~~~~g~~~~AL~~i~~-L~liP~~~ 538 (613)
T PF04097_consen 503 FQLLLDLAEFFDLYHAGQYEQALDIIEK-LDLIPLDP 538 (613)
T ss_dssp HHHHHHHHHHHHHHHTT-HHHHHHHHHH-TT-S-S-H
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHh-CCCCCCCH
Confidence 44445555544 689999999987765 45667543
No 423
>KOG0530|consensus
Probab=35.28 E-value=1.9e+02 Score=21.49 Aligned_cols=42 Identities=12% Similarity=0.063 Sum_probs=19.8
Q ss_pred hHHHHHHHHHHHHhcCHH-HHHHHHHHHHhhcCCCcchhhcHH
Q psy3020 102 ISLLKDKGNAALQANNFK-EAIEAYSEAIKLDGTNHILFSNRS 143 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~-~A~~~~~~al~~~p~~~~~~~n~a 143 (150)
...|..+-.+.-..|+.. .=++....++..|..|.-+|..|-
T Consensus 112 YQvWHHRr~ive~l~d~s~rELef~~~~l~~DaKNYHaWshRq 154 (318)
T KOG0530|consen 112 YQVWHHRRVIVELLGDPSFRELEFTKLMLDDDAKNYHAWSHRQ 154 (318)
T ss_pred hhHHHHHHHHHHHhcCcccchHHHHHHHHhccccchhhhHHHH
Confidence 444444444444444444 444444455555555444444443
No 424
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=34.93 E-value=1.8e+02 Score=25.47 Aligned_cols=99 Identities=17% Similarity=0.234 Sum_probs=49.8
Q ss_pred hhHHHHhhC--CHHHHHHHHHHhhccCCCcHH-HHHHH---HH---H--HhhhhccCCCCCchhhhhCChHHHHhhccCC
Q psy3020 3 TYSCLSYLG--RYKESISTYEEGLKLDPNNEQ-MKEAI---KD---V--RNQEMNDMNRGDPFANLFSDPNIFVQLQLDP 71 (150)
Q Consensus 3 lg~~~~~~~--~~~~A~~~~~~aL~~~p~~~~-a~~~l---~~---~--~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~ 71 (150)
+=+||.+.+ ++++|+....+.-+.++..++ +...+ .+ + .++|-+ +++=++ .+...-..||
T Consensus 818 IlTa~vkk~Pp~le~aL~~I~~l~~~~~~~ae~alkyl~fLvDvn~Ly~~ALG~Y------Dl~Lal---~VAq~SQkDP 888 (928)
T PF04762_consen 818 ILTAYVKKSPPDLEEALQLIKELREEDPESAEEALKYLCFLVDVNKLYDVALGTY------DLELAL---MVAQQSQKDP 888 (928)
T ss_pred HHHHHHhcCchhHHHHHHHHHHHHhcChHHHHHHHhHheeeccHHHHHHHHhhhc------CHHHHH---HHHHHhccCh
Confidence 345677777 777777777776655444333 21111 11 1 233332 222222 2222334455
Q ss_pred CCCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHH
Q psy3020 72 RTKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEA 128 (150)
Q Consensus 72 ~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~a 128 (150)
. +|.-.++.+..-+. ...+-.+-..+++|+.|++.+.++
T Consensus 889 K--------EYLPfL~~L~~l~~----------~~rry~ID~hLkRy~kAL~~L~~~ 927 (928)
T PF04762_consen 889 K--------EYLPFLQELQKLPP----------LYRRYKIDDHLKRYEKALRHLSAC 927 (928)
T ss_pred H--------HHHHHHHHHHhCCh----------hheeeeHhhhhCCHHHHHHHHHhh
Confidence 4 35555555543331 123444456789999999987764
No 425
>PF12753 Nro1: Nuclear pore complex subunit Nro1; InterPro: IPR024318 In fission yeast, Nro1 is a positive regulator of the stability of Sre1N, the sterol regulatory element-binding protein, which is an ER membrane-bound transcription factor that controls adaptation to low oxygen-growth []. In addition, the fission yeast Nro1 is a direct inhibitor of a protein that inhibits SreN1 degradation, Ofd1 (an oxoglutamate deoxygenase). The outcome of this reactivity is that Ofd1 acts as an oxygen sensor that regulates the binding of Nro1 to Ofd1 to control the stability of Sre1N []. This entry also represents ETT1, an Nro1 ortholog []. ETT1 is required for correct translation termination and probably involved in regulation of hypoxic gene expression in association TPA1 []. It inhibits replication of Brome mosaic virus [].; GO: 0005515 protein binding, 0005634 nucleus; PDB: 3QTM_B 3MSV_B 3QTN_B.
Probab=34.88 E-value=46 Score=25.73 Aligned_cols=31 Identities=16% Similarity=0.153 Sum_probs=23.0
Q ss_pred CHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 117 NFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 117 ~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
-...|++++++|.. .++|..|.+.|.+++.+
T Consensus 333 l~~~Al~yL~kA~d--~ddPetWv~vAEa~I~L 363 (404)
T PF12753_consen 333 LIKKALEYLKKAQD--EDDPETWVDVAEAMIDL 363 (404)
T ss_dssp HHHHHHHHHHHHHH--S--TTHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhc--cCChhHHHHHHHHHhhh
Confidence 34577788887766 66799999999999865
No 426
>KOG0529|consensus
Probab=34.74 E-value=2.2e+02 Score=22.28 Aligned_cols=45 Identities=11% Similarity=0.147 Sum_probs=36.5
Q ss_pred hHHHHHHHHHHHHhc--CHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 102 ISLLKDKGNAALQAN--NFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~--~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
..+|..+-+++.+.+ ++..=++.++++++.||.|--+|.+|=.+.
T Consensus 109 Y~aW~hR~w~L~~~p~~~~~~EL~lcek~L~~D~RNfh~W~YRRfV~ 155 (421)
T KOG0529|consen 109 YGAWHHRKWVLQKNPHSDWNTELQLCEKALKQDPRNFHAWHYRRFVV 155 (421)
T ss_pred HHHHHHHHHHHHhCCCchHHHHHHHHHHHHhcCcccccchHHHHHHH
Confidence 788999888887554 467888999999999999988887775543
No 427
>KOG1463|consensus
Probab=33.03 E-value=1.7e+02 Score=22.47 Aligned_cols=46 Identities=15% Similarity=0.189 Sum_probs=30.2
Q ss_pred hHHHHHHHHHHH--HhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHH
Q psy3020 102 ISLLKDKGNAAL--QANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFA 147 (150)
Q Consensus 102 ~~~~~~~g~~~~--~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~ 147 (150)
..++...+.++. ...+|+.|+..|..-+.-||--...+.++-..++
T Consensus 285 i~AmkavAeA~~nRSLkdF~~AL~~yk~eL~~D~ivr~Hl~~Lyd~lL 332 (411)
T KOG1463|consen 285 IDAMKAVAEAFGNRSLKDFEKALADYKKELAEDPIVRSHLQSLYDNLL 332 (411)
T ss_pred hHHHHHHHHHhcCCcHHHHHHHHHHhHHHHhcChHHHHHHHHHHHHHH
Confidence 445555555553 4578999999999999988865544444444333
No 428
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=31.66 E-value=2e+02 Score=20.78 Aligned_cols=25 Identities=12% Similarity=0.201 Sum_probs=22.6
Q ss_pred hhhHHHHhhC-CHHHHHHHHHHhhcc
Q psy3020 2 TTYSCLSYLG-RYKESISTYEEGLKL 26 (150)
Q Consensus 2 ~lg~~~~~~~-~~~~A~~~~~~aL~~ 26 (150)
++|......+ ++++|...+++|+++
T Consensus 40 n~G~~l~~~~~~~~~a~~wL~~a~~~ 65 (278)
T PF08631_consen 40 NIGKSLLSKKDKYEEAVKWLQRAYDI 65 (278)
T ss_pred HHHHHHHHcCCChHHHHHHHHHHHHH
Confidence 5788889999 999999999999887
No 429
>KOG2041|consensus
Probab=31.03 E-value=2.4e+02 Score=24.20 Aligned_cols=30 Identities=20% Similarity=0.243 Sum_probs=25.9
Q ss_pred HHhHHHHHHHHHHHHhcCHHHHHHHHHHHH
Q psy3020 100 AKISLLKDKGNAALQANNFKEAIEAYSEAI 129 (150)
Q Consensus 100 ~~~~~~~~~g~~~~~~~~~~~A~~~~~~al 129 (150)
.+..+|.+.|..+..+..|++|.++|..+-
T Consensus 794 ~~e~A~r~ig~~fa~~~~We~A~~yY~~~~ 823 (1189)
T KOG2041|consen 794 GKEDAFRNIGETFAEMMEWEEAAKYYSYCG 823 (1189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 348899999999999999999999988753
No 430
>KOG2997|consensus
Probab=29.72 E-value=93 Score=23.55 Aligned_cols=32 Identities=19% Similarity=0.241 Sum_probs=26.2
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcHHHHH
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNEQMKE 35 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~ 35 (150)
|....+.|..-+|+..|..|+++.|+-...+.
T Consensus 26 av~~Eq~G~l~dai~fYR~AlqI~~diEs~~r 57 (366)
T KOG2997|consen 26 AVLKEQDGSLYDAINFYRDALQIVPDIESKYR 57 (366)
T ss_pred HHHHhhcCcHHHHHHHHHhhhcCCchHHHHHH
Confidence 44456779999999999999999998666665
No 431
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=28.80 E-value=1.5e+02 Score=18.52 Aligned_cols=36 Identities=19% Similarity=0.290 Sum_probs=29.9
Q ss_pred HHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 5 SCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 5 ~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
..+...+.+...+..++..+..+|.++..+..+..+
T Consensus 15 ~~~~~~~~~~~l~~yLe~~~~~~~~~~~~~~~li~l 50 (140)
T smart00299 15 ELFEKRNLLEELIPYLESALKLNSENPALQTKLIEL 50 (140)
T ss_pred HHHHhCCcHHHHHHHHHHHHccCccchhHHHHHHHH
Confidence 345556889999999999999998888888888776
No 432
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=28.29 E-value=1.9e+02 Score=20.85 Aligned_cols=43 Identities=12% Similarity=0.023 Sum_probs=31.0
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCc---chhhcHHHH
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNH---ILFSNRSAA 145 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~---~~~~n~a~~ 145 (150)
..-..+-..+.-.|+|+.|...++-+-++.|+.. .+|.++-.|
T Consensus 36 ~~RhflfqLlcvaGdw~kAl~Ql~l~a~l~p~~t~~a~lyr~lir~ 81 (273)
T COG4455 36 GGRHFLFQLLCVAGDWEKALAQLNLAATLSPQDTVGASLYRHLIRC 81 (273)
T ss_pred cchhHHHHHHhhcchHHHHHHHHHHHhhcCcccchHHHHHHHHHHH
Confidence 3334455566778999999999999999999863 455554444
No 433
>PF08969 USP8_dimer: USP8 dimerisation domain; InterPro: IPR015063 This domain is predominantly found in the amino terminal region of Ubiquitin carboxyl-terminal hydrolase 8 (USP8). It has no known function. ; PDB: 2XZE_B 2A9U_A.
Probab=28.23 E-value=1.5e+02 Score=18.24 Aligned_cols=31 Identities=16% Similarity=0.102 Sum_probs=25.4
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
+..+...+..+...|+.+.|.-.|-+.+.+-
T Consensus 38 a~~l~~~A~~~~~egd~E~AYvl~~R~~~L~ 68 (115)
T PF08969_consen 38 ANKLLREAEEYRQEGDEEQAYVLYMRYLTLV 68 (115)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH
Confidence 6677788999999999999999999887654
No 434
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.18 E-value=1.7e+02 Score=20.37 Aligned_cols=38 Identities=8% Similarity=0.009 Sum_probs=28.0
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
.||.+-.+.|++.+|.+.|++.-. |..-++.-...+++
T Consensus 172 ALglAa~kagd~a~A~~~F~qia~-Da~aprnirqRAq~ 209 (221)
T COG4649 172 ALGLAAYKAGDFAKAKSWFVQIAN-DAQAPRNIRQRAQI 209 (221)
T ss_pred HHhHHHHhccchHHHHHHHHHHHc-cccCcHHHHHHHHH
Confidence 467788888999999999988765 65556655555555
No 435
>PF09797 NatB_MDM20: N-acetyltransferase B complex (NatB) non catalytic subunit; InterPro: IPR019183 This is the non-catalytic subunit of the N-terminal acetyltransferase B complex (NatB). The NatB complex catalyses the acetylation of the amino-terminal methionine residue of all proteins beginning with Met-Asp or Met-Glu and of some proteins beginning with Met-Asn or Met-Met. In Saccharomyces cerevisiae (Baker's yeast) this subunit is called MDM20 and in Schizosaccharomyces pombe (Fission yeast) it is called Arm1. NatB acetylates the Tpm1 protein and regulates and tropomyocin-actin interactions. This subunit is required by the NatB complex for the N-terminal acetylation of Tpm1 [].
Probab=27.65 E-value=1.5e+02 Score=22.43 Aligned_cols=34 Identities=24% Similarity=0.233 Sum_probs=30.1
Q ss_pred CCHHHHHHHHHHhhccCCCcHHHHHHHHHH-Hhhh
Q psy3020 11 GRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQE 44 (150)
Q Consensus 11 ~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g 44 (150)
...-+|+-.++.++..+|.|......+..+ ..+|
T Consensus 197 ~~l~~Ai~lLE~~l~~s~~n~~~~LlLvrlY~~LG 231 (365)
T PF09797_consen 197 EYLLQAIALLEHALKKSPHNYQLKLLLVRLYSLLG 231 (365)
T ss_pred HHHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHcC
Confidence 445579999999999999999999999998 8888
No 436
>COG2015 Alkyl sulfatase and related hydrolases [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=27.35 E-value=1.6e+02 Score=23.84 Aligned_cols=48 Identities=15% Similarity=0.159 Sum_probs=41.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHhc
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAKA 149 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~~ 149 (150)
...-..++.-.+.+|+|.=+.+..++++--+|+|..+....|.++-++
T Consensus 452 adrVl~la~ea~~kGdyrW~a~lln~~VfAdp~n~~Ar~L~Ad~lEQL 499 (655)
T COG2015 452 ADRVLELAREAFDKGDYRWAAELLNQAVFADPGNKAARELQADALEQL 499 (655)
T ss_pred HHHHHHHHHHHHhcccchHHHHHHhhHHhcCCccHHHHHHHHhHHHHh
Confidence 344566788889999999999999999999999999999888888765
No 437
>PF04505 Dispanin: Interferon-induced transmembrane protein; InterPro: IPR007593 This family includes the human leukocyte antigen CD225, which is an interferon inducible transmembrane protein, and is associated with interferon induced cell growth suppression [].; GO: 0009607 response to biotic stimulus, 0016021 integral to membrane
Probab=27.27 E-value=97 Score=18.03 Aligned_cols=27 Identities=22% Similarity=0.209 Sum_probs=20.4
Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhc
Q psy3020 106 KDKGNAALQANNFKEAIEAYSEAIKLD 132 (150)
Q Consensus 106 ~~~g~~~~~~~~~~~A~~~~~~al~~~ 132 (150)
....+..+..||+++|.+.-++|....
T Consensus 41 s~kv~~~~~~Gd~~~A~~aS~~Ak~~~ 67 (82)
T PF04505_consen 41 SSKVRSRYAAGDYEGARRASRKAKKWS 67 (82)
T ss_pred chhhHHHHHCCCHHHHHHHHHHhHHHH
Confidence 335566788999999999888876543
No 438
>COG2909 MalT ATP-dependent transcriptional regulator [Transcription]
Probab=26.62 E-value=4.4e+02 Score=23.05 Aligned_cols=37 Identities=19% Similarity=0.245 Sum_probs=28.6
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
.......|.+.+-+|++++|..+...+.++.......
T Consensus 497 ~~~~sv~~~a~~~~G~~~~Al~~~~~a~~~a~~~~~~ 533 (894)
T COG2909 497 IVALSVLGEAAHIRGELTQALALMQQAEQMARQHDVY 533 (894)
T ss_pred hhhhhhhhHHHHHhchHHHHHHHHHHHHHHHHHcccH
Confidence 5667778888888999999999888888875544333
No 439
>PF12739 TRAPPC-Trs85: ER-Golgi trafficking TRAPP I complex 85 kDa subunit; InterPro: IPR024420 This entry represents Trs85, a subunit of the TRAPP III complex []. Trs85 is a multimeric guanine nucleotide-exchange factor for Ypt1, required for membrane expansion during autophagy and the CVT pathway. It directs Ypt1 to the phagophore assembly site [, , , ].
Probab=26.25 E-value=1.3e+02 Score=23.38 Aligned_cols=44 Identities=11% Similarity=0.050 Sum_probs=32.1
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAF 146 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~ 146 (150)
......+|...+..|+|+-|...|.-+..-.-+ ..+|...|-|+
T Consensus 208 E~q~R~LAD~aFml~Dy~~A~s~Y~~~k~Df~~-Dkaw~~~A~~~ 251 (414)
T PF12739_consen 208 EAQMRRLADLAFMLRDYELAYSTYRLLKKDFKN-DKAWKYLAGAQ 251 (414)
T ss_pred HHHHHHHHHHHHHHccHHHHHHHHHHHHHHHhh-chhHHHHHhHH
Confidence 445677999999999999999999998875543 33444445443
No 440
>KOG4563|consensus
Probab=25.87 E-value=1.3e+02 Score=23.19 Aligned_cols=33 Identities=33% Similarity=0.462 Sum_probs=28.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
...+...|+-.+..++++.|+..|..|..+-..
T Consensus 41 ~e~lv~~G~~~~~~~d~~~Avda~s~A~~l~~e 73 (400)
T KOG4563|consen 41 LEELVQAGRRALCNNDIDKAVDALSEATELSDE 73 (400)
T ss_pred HHHHHHhhhHHHhcccHHHHHHHHHHHHHHHHH
Confidence 667788899999999999999999999987554
No 441
>KOG3616|consensus
Probab=25.84 E-value=55 Score=27.93 Aligned_cols=34 Identities=21% Similarity=0.299 Sum_probs=28.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCcHHHHH
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNNEQMKE 35 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~ 35 (150)
.+++-+...|++++|-++|..|++++.-|....+
T Consensus 1000 k~a~~ledegk~edaskhyveaiklntynitwcq 1033 (1636)
T KOG3616|consen 1000 KLAMFLEDEGKFEDASKHYVEAIKLNTYNITWCQ 1033 (1636)
T ss_pred HHhhhhhhccchhhhhHhhHHHhhcccccchhhh
Confidence 4567788899999999999999999988876554
No 442
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=25.28 E-value=2.5e+02 Score=19.71 Aligned_cols=64 Identities=11% Similarity=0.185 Sum_probs=43.6
Q ss_pred HHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCC----cchhhcHHHHHHhc
Q psy3020 81 SYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTN----HILFSNRSAAFAKA 149 (150)
Q Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~----~~~~~n~a~~~~~~ 149 (150)
.....+-.++..|.- +.+.....++..| .+.+.++|+..+.+++++.+.+ +..+..++-+|.++
T Consensus 124 ~A~~~fL~~E~~~~l----~t~elq~aLAtyY-~krD~~Kt~~ll~~~L~l~~~~~~~n~eil~sLas~~~~~ 191 (203)
T PF11207_consen 124 EALRRFLQLEGTPEL----ETAELQYALATYY-TKRDPEKTIQLLLRALELSNPDDNFNPEILKSLASIYQKL 191 (203)
T ss_pred HHHHHHHHHcCCCCC----CCHHHHHHHHHHH-HccCHHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHh
Confidence 344444445555443 2256666666554 5888999999999999987664 67777788777664
No 443
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=25.08 E-value=97 Score=24.54 Aligned_cols=43 Identities=14% Similarity=0.117 Sum_probs=37.1
Q ss_pred HHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhhcHHHHHHh
Q psy3020 106 KDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFSNRSAAFAK 148 (150)
Q Consensus 106 ~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~n~a~~~~~ 148 (150)
..+..+|.+.++.+-|+..--+.|-++|-...-+...|.|+-.
T Consensus 232 tklv~CYL~~rkpdlALnh~hrsI~lnP~~frnHLrqAavfR~ 274 (569)
T PF15015_consen 232 TKLVTCYLRMRKPDLALNHSHRSINLNPSYFRNHLRQAAVFRR 274 (569)
T ss_pred HHHHHhhhhcCCCchHHHHHhhhhhcCcchhhHHHHHHHHHHH
Confidence 3467778899999999999999999999998888888888643
No 444
>KOG0687|consensus
Probab=24.62 E-value=3.3e+02 Score=20.93 Aligned_cols=30 Identities=17% Similarity=0.115 Sum_probs=27.0
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
.+++..++.-+.+.|+-+.|++.+++..+.
T Consensus 104 ~ea~~~kaeYycqigDkena~~~~~~t~~k 133 (393)
T KOG0687|consen 104 REAMLRKAEYYCQIGDKENALEALRKTYEK 133 (393)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHHH
Confidence 789999999999999999999999887664
No 445
>PF00244 14-3-3: 14-3-3 protein; InterPro: IPR023410 The 14-3-3 proteins are a large family of approximately 30kDa acidic proteins which exist primarily as homo- and heterodimeric within all eukaryotic cells [, ]. There is a high degree of sequence identity and conservation between all the 14-3-3 isotypes, particularly in the regions which form the dimer interface or line the central ligand binding channel of the dimeric molecule. Each 14-3-3 protein sequence can be roughly divided into three sections: a divergent amino terminus, the conserved core region and a divergent carboxyl terminus. The conserved middle core region of the 14-3-3s encodes an amphipathic groove that forms the main functional domain, a cradle for interacting with client proteins. The monomer consists of nine helices organised in an antiparallel manner, forming an L-shaped structure. The interior of the L-structure is composed of four helices: H3 and H5, which contain many charged and polar amino acids, and H7 and H9, which contain hydrophobic amino acids. These four helices form the concave amphipathic groove that interacts with target peptides. 14-3-3 proteins mainly bind proteins containing phosphothreonine or phosphoserine motifs however exceptions to this rule do exist. Extensive investigation of the 14-3-3 binding site of the mammalian serine/threonine kinase Raf-1 has produced a consensus sequence for 14-3-3-binding, RSxpSxP (in the single-letter amino-acid code, where x denotes any amino acid and p indicates that the next residue is phosphorylated). 14-3-3 proteins appear to effect intracellular signalling in one of three ways - by direct regulation of the catalytic activity of the bound protein, by regulating interactions between the bound protein and other molecules in the cell by sequestration or modification or by controlling the subcellular localisation of the bound ligand. Proteins appear to initially bind to a single dominant site and then subsequently to many, much weaker secondary interaction sites. The 14-3-3 dimer is capable of changing the conformation of its bound ligand whilst itself undergoing minimal structural alteration. This entry represents the structural domain found in 14-3-3 proteins.; PDB: 2O8P_A 3AXY_D 2C74_A 2C63_A 4DX0_A 1YWT_A 3P1O_A 3P1N_A 4DAU_A 3U9X_A ....
Probab=24.57 E-value=78 Score=22.50 Aligned_cols=29 Identities=10% Similarity=0.218 Sum_probs=24.2
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCCcH
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPNNE 31 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~~~ 31 (150)
++......++|++.+++.++++..+|+..
T Consensus 7 ~Aklaeq~eRy~dmv~~mk~~~~~~~eLt 35 (236)
T PF00244_consen 7 LAKLAEQAERYDDMVEYMKQLIEMNPELT 35 (236)
T ss_dssp HHHHHHHTTHHHHHHHHHHHHHHTSS---
T ss_pred HHHHHHHhcCHHHHHHHHHHHHccCCCCC
Confidence 67788899999999999999999998744
No 446
>KOG0890|consensus
Probab=24.32 E-value=4e+02 Score=26.14 Aligned_cols=32 Identities=9% Similarity=0.115 Sum_probs=28.8
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
+....+.+..++..|+-..|+...++-++.+-
T Consensus 1702 ~~i~~E~AK~lW~~gd~~~Al~~Lq~~l~~~~ 1733 (2382)
T KOG0890|consen 1702 PEIVLERAKLLWQTGDELNALSVLQEILSKNF 1733 (2382)
T ss_pred chHHHHHHHHHHhhccHHHHHHHHHHHHHhhc
Confidence 78889999999999999999999999996553
No 447
>PF09670 Cas_Cas02710: CRISPR-associated protein (Cas_Cas02710)
Probab=23.15 E-value=2.2e+02 Score=21.90 Aligned_cols=42 Identities=5% Similarity=0.083 Sum_probs=29.6
Q ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcc--hhhcHHHHH
Q psy3020 105 LKDKGNAALQANNFKEAIEAYSEAIKLDGTNHI--LFSNRSAAF 146 (150)
Q Consensus 105 ~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~--~~~n~a~~~ 146 (150)
.......++..++|..|.+.+..++..-|.+.. .+..++.+|
T Consensus 134 ~~~~a~~l~n~~~y~aA~~~l~~l~~rl~~~~~~~~~~~l~~~y 177 (379)
T PF09670_consen 134 EWRRAKELFNRYDYGAAARILEELLRRLPGREEYQRYKDLCEGY 177 (379)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHhCCchhhHHHHHHHHHHH
Confidence 345566778999999999999999986444433 555555555
No 448
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=23.00 E-value=2.1e+02 Score=22.48 Aligned_cols=29 Identities=24% Similarity=0.321 Sum_probs=22.3
Q ss_pred HHHHHHHHHHhcCHHHHHHHHHHHHhhcC
Q psy3020 105 LKDKGNAALQANNFKEAIEAYSEAIKLDG 133 (150)
Q Consensus 105 ~~~~g~~~~~~~~~~~A~~~~~~al~~~p 133 (150)
....|-.+++.|+|++|+..|+..|..=|
T Consensus 207 ~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~ 235 (422)
T PF06957_consen 207 RLKEGYKLFTAGKFEEAIEIFRSILHSIP 235 (422)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHhh
Confidence 34457777899999999999999987544
No 449
>TIGR02757 conserved hypothetical protein TIGR02757. Members of this uncharacterized protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighborhoods show little conservation.
Probab=22.98 E-value=53 Score=23.37 Aligned_cols=36 Identities=19% Similarity=0.153 Sum_probs=28.9
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
..-..+|.+-.+..++..|++.-...-+++|++|.-
T Consensus 182 rvar~LgL~~Rk~~d~kaa~ElT~~Lr~~dp~DPvK 217 (229)
T TIGR02757 182 RIAKKLKLLKRKSYDLKAAIEITEALRELNPEDPIK 217 (229)
T ss_pred HHHHHhCCcccCchhHHHHHHHHHHHHhcCCCCCcc
Confidence 333556767778889999999999999999999864
No 450
>KOG2518|consensus
Probab=22.60 E-value=4.3e+02 Score=21.58 Aligned_cols=33 Identities=12% Similarity=0.122 Sum_probs=27.2
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCC
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~ 134 (150)
-..-...|..++..|+-.+|.++|++++.+.|.
T Consensus 96 R~~n~~~a~~ll~~G~~~~A~~~fqr~VdIT~~ 128 (556)
T KOG2518|consen 96 RKKNLDAAEQLLAEGKESNARECFQRCVDITPE 128 (556)
T ss_pred HHHhHHHHHHHHHcCCHHHHHHHHHHhccCcHH
Confidence 334455677788899999999999999999985
No 451
>PF04010 DUF357: Protein of unknown function (DUF357); InterPro: IPR023140 This domain is found in a family of proteins, which have no known function.; PDB: 2OO2_A 2PMR_A.
Probab=22.28 E-value=1.7e+02 Score=16.77 Aligned_cols=27 Identities=19% Similarity=0.325 Sum_probs=22.0
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEA 128 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~a 128 (150)
+..|..=|..++.+|++..|+.++.=+
T Consensus 35 A~~Y~~D~~~fl~~gD~v~Ala~~sYa 61 (75)
T PF04010_consen 35 AESYLEDGKYFLEKGDYVNALACFSYA 61 (75)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 778888899999999999999887643
No 452
>PHA01810 hypothetical protein
Probab=21.97 E-value=1e+02 Score=17.68 Aligned_cols=18 Identities=11% Similarity=0.429 Sum_probs=14.7
Q ss_pred CHHHHHHHHHHHHhhcCC
Q psy3020 117 NFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 117 ~~~~A~~~~~~al~~~p~ 134 (150)
-|.+.++.|++.++.+|-
T Consensus 14 aytemlqlfnkliqwnpa 31 (100)
T PHA01810 14 AYTEMLQLFNKLIQWNPA 31 (100)
T ss_pred HHHHHHHHHHHHHhcCcc
Confidence 356788889999999985
No 453
>PF09674 DUF2400: Protein of unknown function (DUF2400); InterPro: IPR014127 Members of this uncharacterised protein family are found sporadically, so far only among spirochetes, epsilon and delta proteobacteria, and Bacteroides. The function is unknown and its gene neighbourhoods show little conservation.
Probab=21.95 E-value=55 Score=23.35 Aligned_cols=37 Identities=14% Similarity=0.174 Sum_probs=30.0
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcch
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHIL 138 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~ 138 (150)
...-..+|.+-.+..++..|++.-...-+++|++|.-
T Consensus 184 ~~var~LGL~~rk~~d~k~A~elT~~lr~~~p~DPvK 220 (232)
T PF09674_consen 184 FRVARKLGLLKRKSADWKAARELTEALREFDPDDPVK 220 (232)
T ss_pred HHHHHHcCCccCCCccHHHHHHHHHHHHhcCCCCCcc
Confidence 3444556777778899999999999999999999864
No 454
>KOG4151|consensus
Probab=21.88 E-value=1.1e+02 Score=25.88 Aligned_cols=61 Identities=15% Similarity=0.146 Sum_probs=45.3
Q ss_pred HHHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH-HhhhhccCCCCCchhhhhCChHHHHhhccCCCCCCC
Q psy3020 6 CLSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV-RNQEMNDMNRGDPFANLFSDPNIFVQLQLDPRTKPF 76 (150)
Q Consensus 6 ~~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~-~~~g~~~~~~~~~~~~a~~~~~~~~~l~~~~~~~~~ 76 (150)
++...++|.+++...+-|+...|....+++..+.+ ...+ .+.-++ .++.-....+|...++
T Consensus 102 m~~~l~~~~~~~~E~~la~~~~p~i~~~Ll~r~~~y~al~--------k~d~a~--rdl~i~~~~~p~~~~~ 163 (748)
T KOG4151|consen 102 MQLGLGEYPKAIPECELALESQPRISKALLKRARKYEALN--------KLDLAV--RDLRIVEKMDPSNVSA 163 (748)
T ss_pred hhcCccchhhhcCchhhhhhccchHHHHHhhhhhHHHHHH--------HHHHHH--HHHHHHhcCCCCcchH
Confidence 34567899999999999999999999998887777 6666 334445 4555566777776554
No 455
>PF09311 Rab5-bind: Rabaptin-like protein; InterPro: IPR015390 This domain is predominantly found in Rabaptin and allows for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes []. ; PDB: 3NF1_A 3CEQ_B 3EDT_H 1X79_C 1TU3_F.
Probab=21.86 E-value=2.7e+02 Score=18.88 Aligned_cols=32 Identities=13% Similarity=0.131 Sum_probs=26.7
Q ss_pred HHhHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 100 AKISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 100 ~~~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
.+...+.++-.-+...|+|+.|+..++.+++-
T Consensus 138 ~rl~tL~nlv~q~~~q~r~evav~~~KqalEd 169 (181)
T PF09311_consen 138 ARLRTLHNLVIQYESQGRYEVAVPLCKQALED 169 (181)
T ss_dssp HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44788888888999999999999999998863
No 456
>KOG4279|consensus
Probab=21.83 E-value=4e+02 Score=23.15 Aligned_cols=22 Identities=32% Similarity=0.447 Sum_probs=19.0
Q ss_pred HHhcCHHHHHHHHHHHHhhcCC
Q psy3020 113 LQANNFKEAIEAYSEAIKLDGT 134 (150)
Q Consensus 113 ~~~~~~~~A~~~~~~al~~~p~ 134 (150)
.-.++|..|++.-+...+++|-
T Consensus 377 VLAnd~~kaiqAae~mfKLk~P 398 (1226)
T KOG4279|consen 377 VLANDYQKAIQAAEMMFKLKPP 398 (1226)
T ss_pred hhccCHHHHHHHHHHHhccCCc
Confidence 3569999999999999999983
No 457
>TIGR02710 CRISPR-associated protein, TIGR02710 family. Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Archaea), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia).
Probab=21.80 E-value=1.9e+02 Score=22.42 Aligned_cols=41 Identities=10% Similarity=-0.030 Sum_probs=30.4
Q ss_pred HHHHHHHHhcCHHHHHHHHHHHHhhcCC-----CcchhhcHHHHHH
Q psy3020 107 DKGNAALQANNFKEAIEAYSEAIKLDGT-----NHILFSNRSAAFA 147 (150)
Q Consensus 107 ~~g~~~~~~~~~~~A~~~~~~al~~~p~-----~~~~~~n~a~~~~ 147 (150)
.....++..++|..|.+.|.+++...+. ....+.+++.+|.
T Consensus 135 ~~~r~l~n~~dy~aA~~~~~~L~~r~l~~~~~~~~~~~~~l~~~y~ 180 (380)
T TIGR02710 135 GYARRAINAFDYLFAHARLETLLRRLLSAVNHTFYEAMIKLTRAYL 180 (380)
T ss_pred HHHHHHHHhcChHHHHHHHHHHHhcccChhhhhHHHHHHHHHHHHH
Confidence 3555678999999999999999998753 2345566666654
No 458
>cd00215 PTS_IIA_lac PTS_IIA, PTS system, lactose/cellobiose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion. Separation into subunits is thought to occur after phosphorylation.
Probab=21.79 E-value=2e+02 Score=17.41 Aligned_cols=28 Identities=25% Similarity=0.133 Sum_probs=17.3
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
.....-+.-..+.|+|++|-+...++=+
T Consensus 16 rs~~~eAl~~a~~g~fe~A~~~l~ea~~ 43 (97)
T cd00215 16 RSKALEALKAAKEGDFAEAEELLEEAND 43 (97)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3344444455677888877777766643
No 459
>PF02255 PTS_IIA: PTS system, Lactose/Cellobiose specific IIA subunit; InterPro: IPR003188 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) [, ] is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC []. These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose [, , , ]. The lactose/cellobiose-specific family are one of four structurally and functionally distinct group IIA PTS system enzymes. This family of proteins normally function as a homotrimer, stabilised by a centrally located metal ion []. Separation into subunits is thought to occur after phosphorylation.; GO: 0005351 sugar:hydrogen symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane; PDB: 3L8R_D 2E2A_B 1E2A_C 3K1S_C 2LRK_C 2LRL_A 2WY2_A 1WCR_A 2WWV_C.
Probab=21.74 E-value=2e+02 Score=17.34 Aligned_cols=28 Identities=29% Similarity=0.208 Sum_probs=16.1
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAI 129 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al 129 (150)
+......+.-..+.|+|++|.+.+.+|=
T Consensus 14 Ars~~~eAl~~a~~~~fe~A~~~l~~a~ 41 (96)
T PF02255_consen 14 ARSLAMEALKAAREGDFEEAEELLKEAD 41 (96)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 3444444555567777777777666653
No 460
>COG4941 Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription]
Probab=21.72 E-value=3.8e+02 Score=20.69 Aligned_cols=39 Identities=18% Similarity=0.088 Sum_probs=33.0
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhhcCCCcchhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKLDGTNHILFS 140 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~ 140 (150)
.-.+..+|..+.+.|+.++|-..|.+|+.+-++.+..-+
T Consensus 365 ~~~h~~RadlL~rLgr~~eAr~aydrAi~La~~~aer~~ 403 (415)
T COG4941 365 HLYHAARADLLARLGRVEEARAAYDRAIALARNAAERAF 403 (415)
T ss_pred cccHHHHHHHHHHhCChHHHHHHHHHHHHhcCChHHHHH
Confidence 456677899999999999999999999999998765443
No 461
>PF08626 TRAPPC9-Trs120: Transport protein Trs120 or TRAPPC9, TRAPP II complex subunit; InterPro: IPR013935 The trafficking protein particle complex TRAPP is a multi-protein complex needed in the early stages of the secretory pathway. To date, two kinds of TRAPP complexes have been studied, TRAPPI and TRAPP II. These complexes differ in subunit composition []. TRAPP I binds vesicles derived from the endoplasmic reticulum bringing them closer to the acceptor membrane. Trs120 is a subunit specific to the TRAPP II complex [] along with Trs65p and Trs130p(TRAPPC10). It is suggested that Trs120p is required for the stability of the Trs130p subunit, suggesting that these two proteins might interact in some way []. It is likely that there is a complex function for TRAPP II in multiple pathways [].
Probab=21.61 E-value=77 Score=28.34 Aligned_cols=27 Identities=15% Similarity=0.320 Sum_probs=23.6
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCC
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDP 28 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p 28 (150)
.+|..+.-.|++.+|++.|..|+.+-.
T Consensus 247 ~~gd~~LlaG~~~dAl~~y~~a~~~~k 273 (1185)
T PF08626_consen 247 VLGDLYLLAGRWPDALKEYTEAIEILK 273 (1185)
T ss_pred hhhhHHHHcCCHHHHHHHHHHHHHHHh
Confidence 478999999999999999999987543
No 462
>KOG2518|consensus
Probab=21.59 E-value=79 Score=25.48 Aligned_cols=27 Identities=11% Similarity=0.223 Sum_probs=24.0
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDPN 29 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p~ 29 (150)
.|.-+...|+..+|.++|++++.+.|+
T Consensus 102 ~a~~ll~~G~~~~A~~~fqr~VdIT~~ 128 (556)
T KOG2518|consen 102 AAEQLLAEGKESNARECFQRCVDITPE 128 (556)
T ss_pred HHHHHHHcCCHHHHHHHHHHhccCcHH
Confidence 466788899999999999999999875
No 463
>PRK09591 celC cellobiose phosphotransferase system IIA component; Reviewed
Probab=21.54 E-value=2.1e+02 Score=17.58 Aligned_cols=28 Identities=18% Similarity=0.170 Sum_probs=17.5
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
.....-+.-..+.|+|++|-+...+|=+
T Consensus 21 rs~~~eAl~~ak~gdf~~A~~~l~eA~~ 48 (104)
T PRK09591 21 RTEVHEAFAAMREGNFDLAEQKLNQSNE 48 (104)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3344444455677888887777776644
No 464
>KOG0686|consensus
Probab=21.39 E-value=2.2e+02 Score=22.48 Aligned_cols=30 Identities=27% Similarity=0.409 Sum_probs=25.5
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHHhh
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAIKL 131 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al~~ 131 (150)
-.++..+|..+..-|+++.|+++|.++-..
T Consensus 150 Rra~~Dl~dhy~~cG~l~~Alr~YsR~RdY 179 (466)
T KOG0686|consen 150 RRALEDLGDHYLDCGQLDNALRCYSRARDY 179 (466)
T ss_pred HHHHHHHHHHHHHhccHHHHHhhhhhhhhh
Confidence 567788999999999999999999995443
No 465
>COG5600 Transcription-associated recombination protein [DNA replication, recombination, and repair]
Probab=21.39 E-value=85 Score=24.26 Aligned_cols=29 Identities=10% Similarity=-0.055 Sum_probs=26.8
Q ss_pred hhhHHHHhhCCHHHHHHHHHHhhccCCCc
Q psy3020 2 TTYSCLSYLGRYKESISTYEEGLKLDPNN 30 (150)
Q Consensus 2 ~lg~~~~~~~~~~~A~~~~~~aL~~~p~~ 30 (150)
-+|.+|....++.+|..+++.|+..-|.-
T Consensus 225 YLG~~~l~~en~heA~~~L~~aFl~c~~l 253 (413)
T COG5600 225 YLGIYYLLNENFHEAFLHLNEAFLQCPWL 253 (413)
T ss_pred HHHHHHHHHHhHHHHHHHHHHHHHhChhh
Confidence 47999999999999999999999999983
No 466
>PRK15326 type III secretion system needle complex protein PrgI; Provisional
Probab=21.32 E-value=1.9e+02 Score=16.96 Aligned_cols=25 Identities=12% Similarity=0.255 Sum_probs=17.6
Q ss_pred CHHHHHHHHHHhhccCCCcHHHHHH
Q psy3020 12 RYKESISTYEEGLKLDPNNEQMKEA 36 (150)
Q Consensus 12 ~~~~A~~~~~~aL~~~p~~~~a~~~ 36 (150)
...+++..-.++|+.+|+|+-++-.
T Consensus 22 ~~~~~l~~Al~~l~~~pdnP~~LA~ 46 (80)
T PRK15326 22 NLQTQVTEALDKLAAKPSDPALLAA 46 (80)
T ss_pred HHHHHHHHHHHHhhcCCCCHHHHHH
Confidence 3445666666778899999987533
No 467
>TIGR00823 EIIA-LAC phosphotransferase system enzyme II, lactose-specific, factor III. operon. While the Lac permeases consist of two polypeptide chains (IIA and IICB), the Chb permease of E. coli consists of three (IIA, IIB and IIC). In B. subtilis, a PTS permease similar to the Chb permease of E. coli is believed to transport lichenan (a b-1,3;1,4 glucan) degradation products, oligosaccharides of 2-4 glucose units. This model is specific for the IIA subunit of the Lac PTS family.
Probab=21.18 E-value=2.1e+02 Score=17.40 Aligned_cols=28 Identities=18% Similarity=0.016 Sum_probs=16.9
Q ss_pred HHHHHHHHHHHHhcCHHHHHHHHHHHHh
Q psy3020 103 SLLKDKGNAALQANNFKEAIEAYSEAIK 130 (150)
Q Consensus 103 ~~~~~~g~~~~~~~~~~~A~~~~~~al~ 130 (150)
.....-+.-..+.|+|++|-+...+|=+
T Consensus 18 rs~~~eAl~~a~~gdfe~A~~~l~eA~~ 45 (99)
T TIGR00823 18 RSKALEALKAAKAGDFAKARALVEQAGM 45 (99)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3344444445677888877777666543
No 468
>KOG2460|consensus
Probab=20.67 E-value=76 Score=25.63 Aligned_cols=23 Identities=13% Similarity=0.232 Sum_probs=20.5
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhc
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLK 25 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~ 25 (150)
++..|...++|.+|+..|.+|..
T Consensus 428 iA~sY~a~~K~~EAlALy~Ra~s 450 (593)
T KOG2460|consen 428 IAVSYQAKKKYSEALALYVRAYS 450 (593)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH
Confidence 67889999999999999999874
No 469
>KOG3807|consensus
Probab=20.51 E-value=4.1e+02 Score=20.57 Aligned_cols=26 Identities=27% Similarity=0.427 Sum_probs=21.7
Q ss_pred hhHHHHhhCCHHHHHHHHHHhhccCC
Q psy3020 3 TYSCLSYLGRYKESISTYEEGLKLDP 28 (150)
Q Consensus 3 lg~~~~~~~~~~~A~~~~~~aL~~~p 28 (150)
+++|-.++|+..+|++.+.-..+--|
T Consensus 281 LAMCARklGrlrEA~K~~RDL~ke~p 306 (556)
T KOG3807|consen 281 LAMCARKLGRLREAVKIMRDLMKEFP 306 (556)
T ss_pred HHHHHHHhhhHHHHHHHHHHHhhhcc
Confidence 67888999999999998888776666
No 470
>PF10345 Cohesin_load: Cohesin loading factor; InterPro: IPR019440 Cohesin loading factor is a conserved protein that has been characterised in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes, but is dispensable in G2 when cohesion has been established. It is often referred to as Ssl3 in Schizosaccharomyces pombe (Fission yeast), and Scc4 in Saccharomyces cerevisiae (Baker's yeast). It complexes with Mis4 [].
Probab=20.41 E-value=4.9e+02 Score=21.37 Aligned_cols=113 Identities=13% Similarity=-0.090 Sum_probs=0.0
Q ss_pred hHHHHhhCCHHHHHHHHHHhhccCCCcH--------------------------HHHHHHHHH-HhhhhccCCCCCchhh
Q psy3020 4 YSCLSYLGRYKESISTYEEGLKLDPNNE--------------------------QMKEAIKDV-RNQEMNDMNRGDPFAN 56 (150)
Q Consensus 4 g~~~~~~~~~~~A~~~~~~aL~~~p~~~--------------------------~a~~~l~~~-~~~g~~~~~~~~~~~~ 56 (150)
|.+....+..+.|.+.++++++.=-+.. ..+...+-+ ...+++. ....
T Consensus 308 ~l~~~~~~~~~ks~k~~~k~l~~i~~~~~~~~~~~~~sl~~~~~~~~~~~~l~~~~~~y~~~~~~~~~~~~-----~a~~ 382 (608)
T PF10345_consen 308 GLHNLYKGSMDKSEKFLEKALKQIEKLKIKSPSAPSESLSEASERIQWLRYLQCYLLFYQIWCNFIRGDWS-----KATQ 382 (608)
T ss_pred HHHHhhccCchHHHHHHHHHHHHHHHhhccCCCCCCcCHHHHHHhHHHHHHHHHHHHHHHHHHHHHCcCHH-----HHHH
Q ss_pred hhCChHHHHhhccCCC-CCCCCCChHHHHHHHHHhcCcchhhhHHHhHHHHHHHHHHHHhcCHHHHHHHHH--------H
Q psy3020 57 LFSDPNIFVQLQLDPR-TKPFLSDPSYVQMIKEIQKDPSLMTLEAKISLLKDKGNAALQANNFKEAIEAYS--------E 127 (150)
Q Consensus 57 a~~~~~~~~~l~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~A~~~~~--------~ 127 (150)
.+ ..+.......|. ....+. +..+.-.|..+...|+.+.|...|. .
T Consensus 383 ~l--~~~~~~~~~~~~~~~~~~~-----------------------~~~~yL~gl~~q~~g~l~~A~~~y~~~~~~~~~~ 437 (608)
T PF10345_consen 383 EL--EFMRQLCQRSPSKLYESLY-----------------------PLLHYLLGLYYQSTGDLEAALYQYQKPRFLLCEA 437 (608)
T ss_pred HH--HHHHHHHhcCccchhhhhh-----------------------HHHHHHHHHHHHHcCCHHHHHHHHhhhHHhhhhh
Q ss_pred HHhhcCCC---cchhhcHHHHH
Q psy3020 128 AIKLDGTN---HILFSNRSAAF 146 (150)
Q Consensus 128 al~~~p~~---~~~~~n~a~~~ 146 (150)
+....+.+ ..+-.|+..++
T Consensus 438 ~~~~~~~~El~ila~LNl~~I~ 459 (608)
T PF10345_consen 438 ANRKSKFRELYILAALNLAIIL 459 (608)
T ss_pred hccCCcchHHHHHHHHHHHHHh
No 471
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=20.34 E-value=2.4e+02 Score=17.77 Aligned_cols=28 Identities=25% Similarity=0.266 Sum_probs=22.7
Q ss_pred hHHHHHHHHHHHHhcCHHHHHHHHHHHH
Q psy3020 102 ISLLKDKGNAALQANNFKEAIEAYSEAI 129 (150)
Q Consensus 102 ~~~~~~~g~~~~~~~~~~~A~~~~~~al 129 (150)
+.-|..-+..+-..|++.+|.+.|..+|
T Consensus 99 A~fY~~wA~~le~~~~~~~A~~I~~~Gi 126 (126)
T PF08311_consen 99 ALFYEEWAEFLEKRGNFKKADEIYQLGI 126 (126)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHhhC
Confidence 5666777788889999999999999875
No 472
>COG1747 Uncharacterized N-terminal domain of the transcription elongation factor GreA [Function unknown]
Probab=20.16 E-value=5e+02 Score=21.44 Aligned_cols=34 Identities=18% Similarity=0.166 Sum_probs=27.8
Q ss_pred HHhhCCHHHHHHHHHHhhccCCCcHHHHHHHHHH
Q psy3020 7 LSYLGRYKESISTYEEGLKLDPNNEQMKEAIKDV 40 (150)
Q Consensus 7 ~~~~~~~~~A~~~~~~aL~~~p~~~~a~~~l~~~ 40 (150)
|....++.+|++.....|+.|..|..|...+...
T Consensus 215 Ys~~eN~~eai~Ilk~il~~d~k~~~ar~~~i~~ 248 (711)
T COG1747 215 YSENENWTEAIRILKHILEHDEKDVWARKEIIEN 248 (711)
T ss_pred hccccCHHHHHHHHHHHhhhcchhhhHHHHHHHH
Confidence 5567888999999999999998888888777654
Done!