Query psy398
Match_columns 162
No_of_seqs 107 out of 260
Neff 5.8
Searched_HMMs 46136
Date Fri Aug 16 22:11:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy398.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/398hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0687|consensus 100.0 5.5E-56 1.2E-60 380.0 16.3 155 1-161 53-210 (393)
2 PF10602 RPN7: 26S proteasome 100.0 1.5E-48 3.3E-53 310.0 17.8 139 17-161 1-142 (177)
3 COG5187 RPN7 26S proteasome re 100.0 4.9E-43 1.1E-47 298.1 14.6 154 1-161 57-221 (412)
4 KOG0686|consensus 100.0 3.5E-28 7.7E-33 213.4 13.8 145 13-161 111-258 (466)
5 PF09976 TPR_21: Tetratricopep 97.6 0.0057 1.2E-07 46.0 14.8 122 25-159 23-145 (145)
6 cd00189 TPR Tetratricopeptide 97.1 0.017 3.7E-07 36.3 10.5 94 53-159 2-95 (100)
7 PF13424 TPR_12: Tetratricopep 96.5 0.011 2.3E-07 39.5 6.1 72 49-121 3-77 (78)
8 TIGR02795 tol_pal_ybgF tol-pal 96.5 0.074 1.6E-06 36.9 10.5 100 53-159 4-103 (119)
9 TIGR02521 type_IV_pilW type IV 96.2 0.11 2.5E-06 39.0 11.0 62 52-118 66-127 (234)
10 PF12688 TPR_5: Tetratrico pep 96.2 0.066 1.4E-06 40.2 9.2 101 53-160 3-103 (120)
11 TIGR02552 LcrH_SycD type III s 95.4 0.36 7.7E-06 34.8 10.4 95 52-159 18-112 (135)
12 PF13414 TPR_11: TPR repeat; P 95.4 0.095 2E-06 33.9 6.6 63 52-119 4-67 (69)
13 KOG1840|consensus 95.3 1.8 4E-05 40.1 16.7 127 32-160 261-395 (508)
14 TIGR02521 type_IV_pilW type IV 95.3 0.4 8.6E-06 36.0 10.6 97 52-159 100-196 (234)
15 PRK11788 tetratricopeptide rep 95.3 0.9 2E-05 38.4 13.8 101 51-159 107-207 (389)
16 COG4235 Cytochrome c biogenesi 94.9 0.41 9E-06 41.3 10.6 117 27-161 137-259 (287)
17 PRK10370 formate-dependent nit 94.7 1.7 3.8E-05 34.7 13.3 116 27-159 54-171 (198)
18 PF13176 TPR_7: Tetratricopept 94.5 0.057 1.2E-06 31.7 3.3 26 53-78 1-26 (36)
19 PRK11788 tetratricopeptide rep 94.5 0.77 1.7E-05 38.8 11.5 94 53-159 216-309 (389)
20 PRK11189 lipoprotein NlpI; Pro 94.4 2.6 5.6E-05 35.4 14.3 86 27-119 41-127 (296)
21 PRK02603 photosystem I assembl 94.3 0.93 2E-05 34.8 10.6 67 51-119 35-101 (172)
22 TIGR03302 OM_YfiO outer membra 94.1 2.3 5E-05 33.6 12.9 124 32-159 53-193 (235)
23 PF14938 SNAP: Soluble NSF att 94.0 0.57 1.2E-05 39.2 9.6 108 48-159 32-142 (282)
24 TIGR02917 PEP_TPR_lipo putativ 94.0 0.81 1.8E-05 41.5 11.1 95 52-159 194-288 (899)
25 TIGR02917 PEP_TPR_lipo putativ 93.7 0.62 1.3E-05 42.3 9.8 97 51-159 56-152 (899)
26 PF13432 TPR_16: Tetratricopep 93.6 0.7 1.5E-05 29.4 7.4 59 56-119 2-60 (65)
27 TIGR00990 3a0801s09 mitochondr 93.5 2.1 4.5E-05 39.4 13.1 60 54-118 368-427 (615)
28 PF12569 NARP1: NMDA receptor- 93.5 4.5 9.9E-05 37.5 15.1 96 47-155 190-285 (517)
29 CHL00033 ycf3 photosystem I as 93.4 2.6 5.6E-05 32.0 11.5 103 48-155 32-136 (168)
30 TIGR02795 tol_pal_ybgF tol-pal 93.2 1.1 2.3E-05 30.9 8.4 70 48-119 36-105 (119)
31 TIGR00990 3a0801s09 mitochondr 93.1 3 6.5E-05 38.4 13.4 99 48-159 328-426 (615)
32 PLN03088 SGT1, suppressor of 92.8 1.8 3.9E-05 37.7 11.0 92 55-159 6-97 (356)
33 PF14938 SNAP: Soluble NSF att 92.7 2 4.3E-05 35.9 10.8 109 47-159 71-182 (282)
34 PF12895 Apc3: Anaphase-promot 92.5 0.96 2.1E-05 30.5 7.3 82 64-157 2-83 (84)
35 PF13181 TPR_8: Tetratricopept 92.4 0.28 6E-06 27.5 3.7 27 52-78 2-28 (34)
36 PRK10803 tol-pal system protei 92.2 1.3 2.8E-05 37.3 9.0 88 28-119 159-246 (263)
37 PF07719 TPR_2: Tetratricopept 91.9 0.32 6.9E-06 27.0 3.6 27 52-78 2-28 (34)
38 cd00189 TPR Tetratricopeptide 91.9 1.5 3.2E-05 27.1 7.1 62 52-118 35-96 (100)
39 KOG0548|consensus 91.5 6.5 0.00014 36.7 13.3 91 55-159 362-453 (539)
40 PF00515 TPR_1: Tetratricopept 91.2 0.42 9.1E-06 26.8 3.6 27 52-78 2-28 (34)
41 PRK10049 pgaA outer membrane p 90.6 3.5 7.6E-05 39.4 11.2 96 51-159 359-454 (765)
42 KOG1497|consensus 90.3 7 0.00015 34.9 11.8 108 47-158 99-210 (399)
43 PF13174 TPR_6: Tetratricopept 90.0 0.43 9.3E-06 26.2 2.8 27 52-78 1-27 (33)
44 cd05804 StaR_like StaR_like; a 89.8 5.4 0.00012 33.2 10.6 99 51-158 114-212 (355)
45 PF13374 TPR_10: Tetratricopep 89.8 0.64 1.4E-05 26.6 3.6 28 51-78 2-29 (42)
46 TIGR03302 OM_YfiO outer membra 89.7 8.6 0.00019 30.3 11.7 68 51-120 33-100 (235)
47 PRK15359 type III secretion sy 89.6 7.1 0.00015 29.3 10.4 93 54-159 27-119 (144)
48 KOG4234|consensus 89.5 8.4 0.00018 32.6 11.2 94 30-128 70-167 (271)
49 PF13428 TPR_14: Tetratricopep 89.5 0.64 1.4E-05 28.1 3.5 27 52-78 2-28 (44)
50 PRK11447 cellulose synthase su 88.8 4.3 9.4E-05 40.6 10.7 94 53-159 605-698 (1157)
51 KOG1840|consensus 88.8 7.9 0.00017 36.0 11.6 118 41-158 189-309 (508)
52 PF13429 TPR_15: Tetratricopep 87.3 2.2 4.8E-05 34.9 6.5 95 52-159 147-241 (280)
53 PRK12370 invasion protein regu 87.0 9.4 0.0002 35.0 11.0 63 52-119 339-401 (553)
54 PRK11189 lipoprotein NlpI; Pro 86.6 14 0.0003 31.0 11.1 26 136-161 240-265 (296)
55 KOG1173|consensus 86.3 2.8 6.1E-05 39.5 7.2 89 53-160 314-408 (611)
56 PRK11447 cellulose synthase su 86.0 9.3 0.0002 38.3 11.1 98 52-159 304-412 (1157)
57 PRK12370 invasion protein regu 85.9 11 0.00025 34.5 11.0 96 52-159 373-468 (553)
58 KOG1070|consensus 85.9 7.1 0.00015 40.6 10.1 66 53-121 1460-1528(1710)
59 KOG1155|consensus 85.3 14 0.00031 34.4 11.1 93 53-153 434-528 (559)
60 smart00028 TPR Tetratricopepti 85.2 2.1 4.5E-05 21.4 3.7 27 52-78 2-28 (34)
61 KOG0547|consensus 84.7 6.2 0.00013 37.0 8.5 78 53-141 117-197 (606)
62 PF12862 Apc5: Anaphase-promot 84.0 3.6 7.7E-05 28.9 5.4 62 62-127 9-76 (94)
63 PF12895 Apc3: Anaphase-promot 83.9 4 8.7E-05 27.4 5.5 59 52-116 26-84 (84)
64 PRK04841 transcriptional regul 83.7 14 0.00029 35.3 10.7 104 52-159 492-600 (903)
65 KOG2908|consensus 83.3 13 0.00028 33.3 9.6 93 60-154 84-179 (380)
66 COG1849 Uncharacterized protei 83.0 7 0.00015 28.3 6.5 55 22-78 4-68 (90)
67 KOG1125|consensus 83.0 2.9 6.4E-05 39.3 5.8 91 53-160 432-526 (579)
68 PF10300 DUF3808: Protein of u 82.3 22 0.00048 32.3 11.1 97 54-159 270-374 (468)
69 KOG1155|consensus 82.2 11 0.00024 35.1 9.0 106 4-117 420-534 (559)
70 PF09976 TPR_21: Tetratricopep 81.3 19 0.00042 26.6 12.0 86 63-155 23-108 (145)
71 CHL00033 ycf3 photosystem I as 80.8 22 0.00047 26.8 11.6 75 50-129 71-152 (168)
72 PF13431 TPR_17: Tetratricopep 80.0 1.4 3E-05 25.5 1.7 22 51-72 13-34 (34)
73 cd05804 StaR_like StaR_like; a 79.4 13 0.00027 31.0 8.0 63 52-116 149-212 (355)
74 PRK15174 Vi polysaccharide exp 79.4 25 0.00054 33.2 10.7 62 53-119 248-313 (656)
75 PRK15174 Vi polysaccharide exp 79.2 31 0.00067 32.5 11.3 96 52-160 285-380 (656)
76 PRK14574 hmsH outer membrane p 79.1 19 0.00041 35.4 10.0 94 53-159 36-129 (822)
77 KOG0553|consensus 78.9 29 0.00063 30.3 10.1 93 51-156 81-173 (304)
78 TIGR00540 hemY_coli hemY prote 78.7 30 0.00066 30.3 10.5 90 54-155 121-210 (409)
79 KOG3677|consensus 78.6 2.1 4.5E-05 39.3 3.2 124 24-160 167-300 (525)
80 PF13424 TPR_12: Tetratricopep 78.2 16 0.00034 23.7 9.2 69 89-160 5-74 (78)
81 PRK10049 pgaA outer membrane p 77.8 31 0.00067 33.0 11.0 94 52-159 50-143 (765)
82 PF13429 TPR_15: Tetratricopep 77.1 8.2 0.00018 31.6 6.1 92 54-158 81-172 (280)
83 KOG3785|consensus 76.6 21 0.00045 32.7 8.8 50 41-94 49-98 (557)
84 PF14559 TPR_19: Tetratricopep 75.8 4.5 9.7E-05 25.6 3.4 53 62-119 2-54 (68)
85 COG4259 Uncharacterized protei 75.0 9.8 0.00021 28.6 5.4 65 27-96 48-116 (121)
86 PF13525 YfiO: Outer membrane 75.0 39 0.00086 26.7 13.3 110 47-159 38-168 (203)
87 KOG0624|consensus 74.7 23 0.00051 32.2 8.6 64 51-129 223-286 (504)
88 PF13371 TPR_9: Tetratricopept 74.4 5.8 0.00013 25.4 3.7 30 53-83 31-60 (73)
89 PF05843 Suf: Suppressor of fo 74.4 27 0.00058 29.3 8.7 66 54-121 73-138 (280)
90 COG5107 RNA14 Pre-mRNA 3'-end 73.4 26 0.00057 32.9 8.8 96 53-158 304-423 (660)
91 TIGR01716 RGG_Cterm transcript 73.3 39 0.00084 26.6 9.0 60 85-147 124-183 (220)
92 KOG0553|consensus 73.3 34 0.00074 29.9 9.1 78 21-122 117-198 (304)
93 TIGR02552 LcrH_SycD type III s 73.3 30 0.00065 24.5 7.7 65 51-120 51-115 (135)
94 PF13041 PPR_2: PPR repeat fam 71.6 17 0.00037 22.0 5.3 30 53-83 5-34 (50)
95 PRK10866 outer membrane biogen 71.2 40 0.00087 27.8 8.9 58 55-114 179-236 (243)
96 PLN03081 pentatricopeptide (PP 71.2 42 0.00091 31.5 9.9 58 53-114 292-349 (697)
97 PF08671 SinI: Anti-repressor 71.0 6.5 0.00014 22.8 2.9 24 16-52 1-24 (30)
98 PF10345 Cohesin_load: Cohesin 71.0 35 0.00076 31.8 9.3 66 88-156 360-428 (608)
99 PRK04841 transcriptional regul 70.6 81 0.0017 30.1 11.9 98 59-160 661-759 (903)
100 PF13525 YfiO: Outer membrane 70.5 19 0.00042 28.5 6.6 65 53-119 7-71 (203)
101 PF13432 TPR_16: Tetratricopep 70.2 7.9 0.00017 24.3 3.6 29 50-78 30-58 (65)
102 PF09986 DUF2225: Uncharacteri 69.7 33 0.00072 28.0 8.0 43 52-94 166-209 (214)
103 PRK15179 Vi polysaccharide bio 69.6 51 0.0011 31.7 10.3 96 51-159 86-181 (694)
104 KOG2114|consensus 69.4 68 0.0015 32.0 11.0 84 41-128 360-462 (933)
105 PLN03088 SGT1, suppressor of 69.4 61 0.0013 28.2 10.0 63 52-119 37-99 (356)
106 PF04100 Vps53_N: Vps53-like, 69.3 61 0.0013 28.8 10.1 47 21-69 23-78 (383)
107 KOG1174|consensus 69.2 26 0.00057 32.5 7.8 25 54-78 235-259 (564)
108 PRK10866 outer membrane biogen 69.0 64 0.0014 26.6 13.6 112 41-157 61-200 (243)
109 PRK09782 bacteriophage N4 rece 69.0 68 0.0015 32.2 11.3 25 135-159 680-704 (987)
110 TIGR00756 PPR pentatricopeptid 68.4 9.1 0.0002 20.4 3.2 27 54-81 3-29 (35)
111 PF07721 TPR_4: Tetratricopept 68.1 8.1 0.00017 20.8 2.8 23 53-75 3-25 (26)
112 PF01535 PPR: PPR repeat; Int 68.1 8.2 0.00018 20.4 2.9 25 54-78 3-27 (31)
113 KOG2758|consensus 67.6 94 0.002 28.1 10.6 73 48-122 126-199 (432)
114 KOG2076|consensus 67.2 19 0.00042 35.6 6.9 97 52-160 450-550 (895)
115 PF12854 PPR_1: PPR repeat 67.0 9.1 0.0002 22.0 3.0 24 54-77 10-33 (34)
116 smart00745 MIT Microtubule Int 66.6 31 0.00067 23.0 6.1 51 56-121 13-70 (77)
117 PRK09782 bacteriophage N4 rece 66.5 56 0.0012 32.8 10.1 92 50-158 43-134 (987)
118 COG4649 Uncharacterized protei 65.4 78 0.0017 26.2 11.8 129 18-158 60-193 (221)
119 PF07079 DUF1347: Protein of u 65.3 64 0.0014 30.2 9.5 88 60-156 15-103 (549)
120 PLN03218 maturation of RBCL 1; 65.3 85 0.0018 31.8 11.2 62 53-118 616-677 (1060)
121 PRK15359 type III secretion sy 65.0 55 0.0012 24.4 8.7 65 51-120 58-122 (144)
122 PF09613 HrpB1_HrpK: Bacterial 64.7 56 0.0012 25.9 8.0 69 81-158 2-70 (160)
123 PF13371 TPR_9: Tetratricopept 64.5 34 0.00073 21.7 6.2 58 58-120 2-59 (73)
124 KOG2300|consensus 63.5 1.4E+02 0.0029 28.5 11.3 109 47-159 3-116 (629)
125 PLN03098 LPA1 LOW PSII ACCUMUL 63.3 46 0.001 30.7 8.2 49 29-80 92-140 (453)
126 PF10345 Cohesin_load: Cohesin 63.3 1.1E+02 0.0023 28.6 10.9 105 47-156 97-203 (608)
127 PRK10747 putative protoheme IX 63.2 63 0.0014 28.2 9.0 57 58-119 160-216 (398)
128 KOG2376|consensus 62.4 1.5E+02 0.0033 28.5 11.8 126 29-160 89-252 (652)
129 KOG1127|consensus 62.0 70 0.0015 32.7 9.6 100 50-161 491-591 (1238)
130 KOG0543|consensus 61.8 99 0.0021 28.1 9.9 95 55-158 212-317 (397)
131 KOG1156|consensus 61.7 50 0.0011 31.9 8.4 67 48-119 368-434 (700)
132 cd02680 MIT_calpain7_2 MIT: do 61.1 14 0.00031 25.7 3.6 51 62-120 17-67 (75)
133 PRK15331 chaperone protein Sic 60.3 86 0.0019 25.0 10.6 100 47-159 33-132 (165)
134 PRK12798 chemotaxis protein; R 60.2 43 0.00092 30.6 7.4 68 83-155 251-318 (421)
135 PF14561 TPR_20: Tetratricopep 60.1 57 0.0012 22.9 10.2 79 28-115 4-84 (90)
136 PRK10370 formate-dependent nit 59.9 25 0.00055 27.9 5.4 62 54-120 110-174 (198)
137 TIGR03504 FimV_Cterm FimV C-te 59.0 13 0.00029 23.1 2.9 25 54-78 2-26 (44)
138 PLN03081 pentatricopeptide (PP 59.0 1.1E+02 0.0023 28.8 10.1 57 53-117 362-418 (697)
139 PRK14574 hmsH outer membrane p 58.7 96 0.0021 30.5 10.0 61 53-121 104-164 (822)
140 PF13812 PPR_3: Pentatricopept 58.4 17 0.00036 19.6 3.1 27 54-81 4-30 (34)
141 PLN03218 maturation of RBCL 1; 58.2 1.2E+02 0.0027 30.7 10.9 25 54-78 510-534 (1060)
142 KOG1129|consensus 58.1 27 0.00058 31.6 5.6 87 63-161 336-424 (478)
143 PF14559 TPR_19: Tetratricopep 58.1 43 0.00094 20.8 6.4 47 32-84 11-57 (68)
144 PRK02603 photosystem I assembl 57.2 82 0.0018 23.8 9.9 32 50-82 71-102 (172)
145 PF10255 Paf67: RNA polymerase 57.1 21 0.00046 32.3 4.9 76 80-160 110-192 (404)
146 KOG2002|consensus 56.8 18 0.00039 36.2 4.7 66 52-120 681-746 (1018)
147 PRK15363 pathogenicity island 56.2 99 0.0021 24.4 11.0 100 48-160 32-131 (157)
148 COG1516 FliS Flagellin-specifi 56.2 92 0.002 24.0 8.0 64 69-137 5-84 (132)
149 PF04010 DUF357: Protein of un 56.0 22 0.00048 24.6 3.9 26 53-78 37-62 (75)
150 COG3063 PilF Tfp pilus assembl 55.0 31 0.00067 29.3 5.3 64 53-119 105-168 (250)
151 PRK06771 hypothetical protein; 53.1 70 0.0015 23.3 6.2 56 21-78 21-79 (93)
152 COG5071 RPN5 26S proteasome re 53.0 1.7E+02 0.0038 26.3 10.5 100 54-157 134-237 (439)
153 PF14853 Fis1_TPR_C: Fis1 C-te 52.7 35 0.00075 22.0 4.2 38 51-89 1-38 (53)
154 PLN03098 LPA1 LOW PSII ACCUMUL 52.7 1.9E+02 0.0042 26.7 11.9 33 51-84 75-107 (453)
155 KOG2300|consensus 52.3 1.4E+02 0.0031 28.3 9.5 101 53-159 369-472 (629)
156 KOG2047|consensus 52.0 16 0.00035 35.4 3.4 33 53-86 250-282 (835)
157 KOG0548|consensus 51.9 97 0.0021 29.2 8.4 90 56-158 7-96 (539)
158 PF11207 DUF2989: Protein of u 51.7 95 0.0021 25.6 7.5 51 54-107 144-195 (203)
159 COG5010 TadD Flp pilus assembl 51.5 1.6E+02 0.0034 25.3 10.8 20 139-158 175-194 (257)
160 cd02681 MIT_calpain7_1 MIT: do 50.5 64 0.0014 22.3 5.5 23 56-78 11-33 (76)
161 KOG0551|consensus 49.4 99 0.0022 27.9 7.7 70 47-119 77-148 (390)
162 PRK10153 DNA-binding transcrip 48.9 47 0.001 30.7 6.0 62 52-119 421-482 (517)
163 COG1729 Uncharacterized protei 48.7 58 0.0012 27.9 6.0 66 51-119 178-244 (262)
164 TIGR03362 VI_chp_7 type VI sec 48.7 1.8E+02 0.0038 25.2 9.1 74 56-131 218-291 (301)
165 KOG4162|consensus 48.5 1.2E+02 0.0026 29.9 8.7 98 54-159 653-781 (799)
166 PF01895 PhoU: PhoU domain; I 47.2 53 0.0011 21.3 4.6 52 17-70 23-83 (88)
167 PF10300 DUF3808: Protein of u 46.2 95 0.0021 28.2 7.4 96 57-159 194-294 (468)
168 KOG2076|consensus 45.7 2.6E+02 0.0057 28.0 10.6 91 53-157 175-266 (895)
169 KOG4555|consensus 45.3 1.6E+02 0.0034 23.5 10.7 96 49-154 41-144 (175)
170 PRK14957 DNA polymerase III su 45.2 2.7E+02 0.0058 26.2 10.4 94 4-101 186-295 (546)
171 TIGR00540 hemY_coli hemY prote 44.9 2.1E+02 0.0046 24.9 9.8 96 57-159 159-290 (409)
172 PF13512 TPR_18: Tetratricopep 43.9 1.1E+02 0.0024 23.8 6.5 50 51-102 47-96 (142)
173 KOG3060|consensus 43.7 2.2E+02 0.0048 24.8 12.1 106 4-120 108-221 (289)
174 KOG3616|consensus 43.7 1.1E+02 0.0024 30.8 7.7 75 41-122 756-837 (1636)
175 KOG0624|consensus 43.3 1.8E+02 0.004 26.6 8.5 103 48-157 103-221 (504)
176 PF02064 MAS20: MAS20 protein 42.9 56 0.0012 24.7 4.6 75 17-94 24-105 (121)
177 COG4105 ComL DNA uptake lipopr 42.3 2.2E+02 0.0047 24.3 12.4 111 41-156 64-191 (254)
178 PRK10747 putative protoheme IX 42.3 2.3E+02 0.0051 24.7 10.5 62 54-119 121-182 (398)
179 COG5159 RPN6 26S proteasome re 41.8 1.5E+02 0.0033 26.4 7.7 131 21-159 87-233 (421)
180 KOG2003|consensus 41.5 2.6E+02 0.0056 26.6 9.4 70 53-129 594-663 (840)
181 PF10130 PIN_2: PIN domain; I 40.7 74 0.0016 24.2 5.0 61 67-129 36-96 (133)
182 KOG2003|consensus 40.1 79 0.0017 29.9 5.9 62 54-119 663-725 (840)
183 PF12569 NARP1: NMDA receptor- 39.8 2.6E+02 0.0055 26.1 9.3 96 52-157 39-135 (517)
184 PF03704 BTAD: Bacterial trans 39.2 1.5E+02 0.0032 21.5 8.4 67 50-121 61-127 (146)
185 COG1283 NptA Na+/phosphate sym 38.9 2.9E+02 0.0062 26.1 9.4 100 17-119 361-480 (533)
186 cd02682 MIT_AAA_Arch MIT: doma 38.8 1.3E+02 0.0029 20.8 7.8 52 55-121 10-68 (75)
187 PF12862 Apc5: Anaphase-promot 38.2 82 0.0018 21.8 4.6 31 48-78 38-68 (94)
188 PRK10803 tol-pal system protei 37.3 2.5E+02 0.0054 23.5 11.2 96 55-157 146-242 (263)
189 PLN03077 Protein ECB2; Provisi 37.3 2.5E+02 0.0054 27.0 9.1 61 53-117 556-616 (857)
190 KOG3617|consensus 37.2 52 0.0011 33.2 4.5 75 58-154 919-1002(1416)
191 PF12925 APP_E2: E2 domain of 36.8 2.4E+02 0.0051 23.1 8.5 89 21-119 72-162 (193)
192 PF04212 MIT: MIT (microtubule 36.6 1.2E+02 0.0026 19.7 6.8 23 56-78 10-32 (69)
193 KOG2908|consensus 36.0 1.5E+02 0.0033 26.7 6.8 77 1-78 57-142 (380)
194 KOG1173|consensus 35.8 1.2E+02 0.0027 28.9 6.5 97 48-159 343-441 (611)
195 PLN03077 Protein ECB2; Provisi 35.7 3.1E+02 0.0066 26.4 9.4 54 56-118 529-582 (857)
196 PF14643 DUF4455: Domain of un 35.6 3.4E+02 0.0075 24.7 11.6 117 19-150 312-443 (473)
197 PF10938 YfdX: YfdX protein; 35.0 1.6E+02 0.0034 22.9 6.1 102 54-161 5-146 (155)
198 KOG1129|consensus 34.7 95 0.0021 28.2 5.3 93 56-161 295-387 (478)
199 smart00101 14_3_3 14-3-3 homol 33.7 83 0.0018 26.5 4.7 32 50-82 168-201 (244)
200 KOG4626|consensus 33.6 66 0.0014 31.5 4.4 48 51-99 116-180 (966)
201 cd02677 MIT_SNX15 MIT: domain 33.2 1.6E+02 0.0035 20.1 5.7 50 62-120 17-67 (75)
202 KOG2002|consensus 32.5 2.2E+02 0.0047 29.0 7.8 94 52-156 271-366 (1018)
203 smart00386 HAT HAT (Half-A-TPR 32.4 60 0.0013 16.7 2.6 18 65-83 1-18 (33)
204 PRK14965 DNA polymerase III su 31.9 4.2E+02 0.009 24.8 9.4 95 4-102 186-295 (576)
205 KOG1127|consensus 31.7 2.4E+02 0.0051 29.1 7.9 97 53-159 598-705 (1238)
206 KOG2047|consensus 31.7 2.3E+02 0.0049 27.9 7.6 93 28-121 363-456 (835)
207 TIGR02397 dnaX_nterm DNA polym 31.6 3.1E+02 0.0067 22.9 10.3 96 4-102 184-293 (355)
208 TIGR02561 HrpB1_HrpK type III 31.5 2.6E+02 0.0057 22.1 7.2 68 82-158 3-70 (153)
209 KOG4626|consensus 31.2 1.4E+02 0.003 29.5 6.1 28 51-78 456-483 (966)
210 cd08053 Yqbg Putative Head-Tai 30.9 2.1E+02 0.0045 20.7 6.0 53 13-68 15-72 (121)
211 KOG1125|consensus 30.9 4.7E+02 0.01 25.0 9.4 125 20-160 324-458 (579)
212 PRK14720 transcript cleavage f 30.5 1.4E+02 0.003 30.0 6.2 56 52-120 117-172 (906)
213 PF04053 Coatomer_WDAD: Coatom 30.1 4.2E+02 0.0092 24.1 9.7 80 53-158 349-428 (443)
214 cd08638 DNA_pol_A_theta DNA po 30.0 2.2E+02 0.0047 25.1 6.9 40 13-55 1-40 (373)
215 PF03357 Snf7: Snf7; InterPro 29.2 1.2E+02 0.0027 22.6 4.7 70 22-100 7-78 (171)
216 PRK10316 hypothetical protein; 28.7 1.1E+02 0.0023 25.5 4.4 52 80-133 45-96 (209)
217 KOG3081|consensus 28.2 4.1E+02 0.0089 23.3 11.0 128 25-160 85-235 (299)
218 KOG0276|consensus 27.8 87 0.0019 30.4 4.2 80 53-158 668-747 (794)
219 PRK14970 DNA polymerase III su 27.4 3.9E+02 0.0085 22.8 9.5 94 5-102 176-284 (367)
220 PRK15363 pathogenicity island 27.0 3.1E+02 0.0068 21.6 9.6 70 47-121 65-134 (157)
221 PF13414 TPR_11: TPR repeat; P 26.7 1.6E+02 0.0035 18.2 7.2 61 91-160 5-66 (69)
222 KOG0796|consensus 26.5 2.5E+02 0.0054 24.8 6.5 20 20-40 113-132 (319)
223 PRK06078 pyrimidine-nucleoside 26.3 1.7E+02 0.0036 26.9 5.6 81 52-140 238-318 (434)
224 COG5010 TadD Flp pilus assembl 26.3 4.2E+02 0.009 22.7 9.2 87 53-153 136-223 (257)
225 PF11564 BpuJI_N: Restriction 26.1 1.2E+02 0.0026 26.3 4.3 110 2-118 26-150 (285)
226 PF07219 HemY_N: HemY protein 26.0 2.5E+02 0.0054 20.0 5.8 49 108-158 32-85 (108)
227 cd02678 MIT_VPS4 MIT: domain c 25.7 2.1E+02 0.0046 19.1 7.2 22 57-78 12-33 (75)
228 KOG2796|consensus 25.7 4.7E+02 0.01 23.2 12.3 142 5-159 128-279 (366)
229 KOG1156|consensus 25.7 4E+02 0.0087 26.0 8.1 92 53-157 77-168 (700)
230 PF02071 NSF: Aromatic-di-Alan 25.3 39 0.00084 15.6 0.7 10 146-155 3-12 (12)
231 KOG1941|consensus 25.2 71 0.0015 29.4 3.0 80 53-133 124-205 (518)
232 PHA02774 E1; Provisional 25.1 3.5E+02 0.0077 26.0 7.7 126 17-160 299-432 (613)
233 PF07889 DUF1664: Protein of u 24.3 3.2E+02 0.007 20.8 6.1 62 9-72 45-113 (126)
234 PF03037 KMP11: Kinetoplastid 24.1 79 0.0017 22.3 2.4 31 1-32 1-31 (90)
235 PF10516 SHNi-TPR: SHNi-TPR; 24.0 1.4E+02 0.0031 17.8 3.3 26 53-78 3-28 (38)
236 smart00671 SEL1 Sel1-like repe 23.9 1.3E+02 0.0028 16.1 3.4 27 52-78 2-32 (36)
237 KOG1128|consensus 23.7 3.2E+02 0.007 27.0 7.1 28 52-80 425-452 (777)
238 TIGR00985 3a0801s04tom mitocho 23.5 3.6E+02 0.0079 21.1 8.3 42 50-92 89-131 (148)
239 PF11272 DUF3072: Protein of u 23.3 1.2E+02 0.0027 20.1 3.1 34 2-38 21-54 (57)
240 PRK06585 holA DNA polymerase I 23.3 4.6E+02 0.0099 22.1 11.8 91 5-99 151-258 (343)
241 PRK14951 DNA polymerase III su 23.3 6.6E+02 0.014 24.0 9.6 96 3-102 190-300 (618)
242 PF12688 TPR_5: Tetratrico pep 23.1 2.8E+02 0.0061 20.4 5.5 35 44-78 31-65 (120)
243 PF08631 SPO22: Meiosis protei 23.0 4.4E+02 0.0095 21.8 9.9 72 48-121 32-115 (278)
244 PF09295 ChAPs: ChAPs (Chs5p-A 22.8 3.5E+02 0.0076 24.3 7.0 57 53-114 236-292 (395)
245 cd06571 Bac_DnaA_C C-terminal 22.4 2.3E+02 0.0049 19.5 4.6 36 95-134 47-83 (90)
246 KOG1760|consensus 22.4 3.2E+02 0.007 21.0 5.6 41 19-66 84-124 (131)
247 PF08559 Cut8_C: Cut8 six-heli 22.2 3.7E+02 0.008 20.7 7.0 40 104-145 60-103 (143)
248 PRK10153 DNA-binding transcrip 22.2 5.1E+02 0.011 24.0 8.1 58 51-115 453-510 (517)
249 KOG1258|consensus 22.1 7E+02 0.015 23.9 10.4 105 54-162 82-199 (577)
250 KOG4648|consensus 22.0 6.3E+02 0.014 23.3 9.0 45 55-99 101-148 (536)
251 PRK08691 DNA polymerase III su 21.9 7.6E+02 0.017 24.2 9.9 92 3-98 185-291 (709)
252 cd02683 MIT_1 MIT: domain cont 21.9 2.7E+02 0.0059 19.0 7.4 24 55-78 10-33 (77)
253 PF10440 WIYLD: Ubiquitin-bind 21.7 2.5E+02 0.0055 19.0 4.5 39 68-110 9-47 (65)
254 PF04733 Coatomer_E: Coatomer 21.4 5E+02 0.011 21.9 8.4 97 54-160 105-229 (290)
255 PF02255 PTS_IIA: PTS system, 21.3 1.5E+02 0.0033 21.2 3.6 26 132-157 14-39 (96)
256 PF10187 Nefa_Nip30_N: N-termi 21.2 2E+02 0.0044 20.9 4.3 48 17-72 37-87 (102)
257 PF01109 GM_CSF: Granulocyte-m 21.2 1.7E+02 0.0037 22.3 3.9 18 104-122 11-28 (122)
258 PRK14953 DNA polymerase III su 21.2 6.5E+02 0.014 23.1 10.5 97 4-102 186-295 (486)
259 PF02840 Prp18: Prp18 domain; 21.1 4E+02 0.0088 20.7 7.2 52 24-78 35-86 (144)
260 PRK09591 celC cellobiose phosp 21.0 1.4E+02 0.0031 21.8 3.4 26 132-157 20-45 (104)
261 PHA03025 hypothetical protein; 21.0 68 0.0015 21.7 1.5 22 23-44 21-42 (68)
262 PF03704 BTAD: Bacterial trans 20.8 1.5E+02 0.0032 21.5 3.6 44 53-96 98-145 (146)
263 COG4785 NlpI Lipoprotein NlpI, 20.6 5.2E+02 0.011 22.3 7.1 112 28-149 43-157 (297)
264 KOG2518|consensus 20.0 1.3E+02 0.0029 28.4 3.7 47 47-100 92-138 (556)
No 1
>KOG0687|consensus
Probab=100.00 E-value=5.5e-56 Score=380.00 Aligned_cols=155 Identities=41% Similarity=0.760 Sum_probs=154.3
Q ss_pred CchhHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH---hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHh
Q psy398 1 MAPYYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE---KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSK 77 (162)
Q Consensus 1 m~p~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk---~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~ 77 (162)
|||||+.+|+.+.|++|+.++++|.++|++++++|+++++ +|+| +.|||+|+..+|+|||+|||+++|+++++++
T Consensus 53 Map~Ye~lce~~~i~~D~~~l~~m~~~neeki~eld~~iedaeenlG--E~ev~ea~~~kaeYycqigDkena~~~~~~t 130 (393)
T KOG0687|consen 53 MAPLYEYLCESLVIKLDQDLLNSMKKANEEKIKELDEKIEDAEENLG--ESEVREAMLRKAEYYCQIGDKENALEALRKT 130 (393)
T ss_pred cchHHHHHHhhcceeccHHHHHHHHHhhHHHHHHHHHHHHHHHHhcc--hHHHHHHHHHHHHHHHHhccHHHHHHHHHHH
Confidence 9999999999999999999999999999999999999999 9999 9999999999999999999999999999999
Q ss_pred cCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHh
Q psy398 78 FADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 78 ~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld 157 (162)
+ ++++++|||||++|++||+|+|+ +|...|+++|+||+.++|+ |||||||||||||+|||+|++||||+||.||+|
T Consensus 131 ~-~ktvs~g~kiDVvf~~iRlglfy-~D~~lV~~~iekak~liE~--GgDWeRrNRlKvY~Gly~msvR~Fk~Aa~Lfld 206 (393)
T KOG0687|consen 131 Y-EKTVSLGHKIDVVFYKIRLGLFY-LDHDLVTESIEKAKSLIEE--GGDWERRNRLKVYQGLYCMSVRNFKEAADLFLD 206 (393)
T ss_pred H-HHHhhcccchhhHHHHHHHHHhh-ccHHHHHHHHHHHHHHHHh--CCChhhhhhHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 9 99999999999999999999999 9999999999999999999 999999999999999999999999999999999
Q ss_pred hccC
Q psy398 158 VIRS 161 (162)
Q Consensus 158 ~~~T 161 (162)
|+||
T Consensus 207 ~vsT 210 (393)
T KOG0687|consen 207 SVST 210 (393)
T ss_pred Hccc
Confidence 9998
No 2
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=100.00 E-value=1.5e-48 Score=310.05 Aligned_cols=139 Identities=37% Similarity=0.588 Sum_probs=138.0
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH---hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHH
Q psy398 17 DEKFYTEMKDANEKRLKELDLEHE---KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVF 93 (162)
Q Consensus 17 D~~~~~~~~~~n~~~l~~Le~~lk---~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l 93 (162)
|++|+++|+++|++++++||.+|+ +|+| ++|||+|+.++|+||++|||+++|+++|.+++ +||+++||+|||+|
T Consensus 1 D~~w~~~~~~~~~~~~~~Le~elk~~~~n~~--kesir~~~~~l~~~~~~~Gd~~~A~k~y~~~~-~~~~~~~~~id~~l 77 (177)
T PF10602_consen 1 DEEWIEETKAKNAEELEKLEAELKDAKSNLG--KESIRMALEDLADHYCKIGDLEEALKAYSRAR-DYCTSPGHKIDMCL 77 (177)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--hHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHh-hhcCCHHHHHHHHH
Confidence 899999999999999999999999 9999 99999999999999999999999999999999 99999999999999
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhccC
Q psy398 94 TLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVIRS 161 (162)
Q Consensus 94 ~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~T 161 (162)
++|||+||+ +||.+|.++|+||+.++++ ||||+|+||||||+||++|.+|+|+.||++||+++||
T Consensus 78 ~~irv~i~~-~d~~~v~~~i~ka~~~~~~--~~d~~~~nrlk~~~gL~~l~~r~f~~AA~~fl~~~~t 142 (177)
T PF10602_consen 78 NVIRVAIFF-GDWSHVEKYIEKAESLIEK--GGDWERRNRLKVYEGLANLAQRDFKEAAELFLDSLST 142 (177)
T ss_pred HHHHHHHHh-CCHHHHHHHHHHHHHHHhc--cchHHHHHHHHHHHHHHHHHhchHHHHHHHHHccCcC
Confidence 999999999 9999999999999999999 9999999999999999999999999999999999998
No 3
>COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones]
Probab=100.00 E-value=4.9e-43 Score=298.08 Aligned_cols=154 Identities=28% Similarity=0.457 Sum_probs=150.3
Q ss_pred CchhHHHHHHhc-------CCCcCHHHHHHHHHHHHHHHHHHHHHHH----hcCCCchHHHHHHHHhhHhHHHhhcCHHH
Q psy398 1 MAPYYKEVCEEF-------GWKMDEKFYTEMKDANEKRLKELDLEHE----KNMMDEEDQVSGIWQAKLDYLCSIGDRAE 69 (162)
Q Consensus 1 m~p~Y~~l~~~~-------~~~~D~~~~~~~~~~n~~~l~~Le~~lk----~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~ 69 (162)
|||||+++|++. .+++|..++++|-++|++++++|+++|+ .| | +.|+.+++..+|+|||+|||+++
T Consensus 57 maplYkyL~E~~n~kt~a~~ikfD~~~~n~l~kkneeki~Elde~i~~~eedn-g--E~e~~ea~~n~aeyY~qi~D~~n 133 (412)
T COG5187 57 MAPLYKYLAEKGNPKTSASVIKFDRGRMNTLLKKNEEKIEELDERIREKEEDN-G--ETEGSEADRNIAEYYCQIMDIQN 133 (412)
T ss_pred hhHHHHHHHhccCCcccchheehhhHHHHHHHHhhHHHHHHHHHHHHHHhhcc-c--chHHHHHHHHHHHHHHHHhhhhh
Confidence 899999999987 4789999999999999999999999998 55 9 99999999999999999999999
Q ss_pred HHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHH
Q psy398 70 AFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYC 149 (162)
Q Consensus 70 A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk 149 (162)
+++...++. .+.+++|.|||+.|+.||+|+++ +|...|...|++++.++|+ ||||+||||+|+|.|+|+|+.|+|+
T Consensus 134 g~~~~~~~~-~~a~stg~KiDv~l~kiRlg~~y-~d~~vV~e~lE~~~~~iEk--GgDWeRrNRyK~Y~Gi~~m~~RnFk 209 (412)
T COG5187 134 GFEWMRRLM-RDAMSTGLKIDVFLCKIRLGLIY-GDRKVVEESLEVADDIIEK--GGDWERRNRYKVYKGIFKMMRRNFK 209 (412)
T ss_pred HHHHHHHHH-HHHHhcccchhhHHHHHHHHHhh-ccHHHHHHHHHHHHHHHHh--CCCHHhhhhHHHHHHHHHHHHHhhH
Confidence 999999999 99999999999999999999999 9999999999999999999 9999999999999999999999999
Q ss_pred HHHHHhHhhccC
Q psy398 150 RAAELFVDVIRS 161 (162)
Q Consensus 150 ~AA~lfld~~~T 161 (162)
+||.||+|++||
T Consensus 210 eAa~Ll~d~l~t 221 (412)
T COG5187 210 EAAILLSDILPT 221 (412)
T ss_pred HHHHHHHHHhcc
Confidence 999999999998
No 4
>KOG0686|consensus
Probab=99.95 E-value=3.5e-28 Score=213.35 Aligned_cols=145 Identities=19% Similarity=0.244 Sum_probs=133.3
Q ss_pred CCCcCHHHHHHHHHHHHHHHHHHHHHHH---hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHH
Q psy398 13 GWKMDEKFYTEMKDANEKRLKELDLEHE---KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRI 89 (162)
Q Consensus 13 ~~~~D~~~~~~~~~~n~~~l~~Le~~lk---~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~ki 89 (162)
+.++|++|+++..++...++++|+.+|| +|+| +||||.|+.|+|+||...|++++|+++|.|+| ||||+.+|.|
T Consensus 111 ~~~~D~~WvE~~~~~a~~~le~L~~eLk~yK~n~i--KEsiRra~~Dl~dhy~~cG~l~~Alr~YsR~R-dYCTs~khvI 187 (466)
T KOG0686|consen 111 GYLLDEKWVETNNKKAVLKLEKLDNELKSYKDNLI--KESIRRALEDLGDHYLDCGQLDNALRCYSRAR-DYCTSAKHVI 187 (466)
T ss_pred ccccchHHHHHhhHHHHHHHHHHHHHHHHhhcchh--hHHHHHHHHHHHHHHHHhccHHHHHhhhhhhh-hhhcchHHHH
Confidence 3788999999999999999999999999 9999 99999999999999999999999999999999 9999999999
Q ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhccC
Q psy398 90 DAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVIRS 161 (162)
Q Consensus 90 D~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~T 161 (162)
+||+++|||+|.. +||.||.+++.+|++.++..+..--+--.|++++.||..|..|+|+.||+.||.+..+
T Consensus 188 nm~ln~i~VSI~~-~nw~hv~sy~~~A~st~~~~~~~~q~v~~kl~C~agLa~L~lkkyk~aa~~fL~~~~~ 258 (466)
T KOG0686|consen 188 NMCLNLILVSIYM-GNWGHVLSYISKAESTPDANENLAQEVPAKLKCAAGLANLLLKKYKSAAKYFLLAEFD 258 (466)
T ss_pred HHHHHHHHHHHhh-cchhhhhhHHHHHHhCchhhhhHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHhCCCC
Confidence 9999999999999 9999999999999999742001122345899999999999999999999999998765
No 5
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=97.63 E-value=0.0057 Score=46.01 Aligned_cols=122 Identities=12% Similarity=0.052 Sum_probs=91.1
Q ss_pred HHHHHHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc
Q psy398 25 KDANEKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG 103 (162)
Q Consensus 25 ~~~n~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~ 103 (162)
...+..++.+.=.++. ++-+ ..--..+.+.+|..+..-||+++|.+.|.... +......-+--+.+.+.+|-+..
T Consensus 23 ~~~~~~~~~~~~~~l~~~~~~--s~ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~-~~~~d~~l~~~a~l~LA~~~~~~- 98 (145)
T PF09976_consen 23 QAGDPAKAEAAAEQLAKDYPS--SPYAALAALQLAKAAYEQGDYDEAKAALEKAL-ANAPDPELKPLARLRLARILLQQ- 98 (145)
T ss_pred HCCCHHHHHHHHHHHHHHCCC--ChHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH-hhCCCHHHHHHHHHHHHHHHHHc-
Confidence 3444455554333444 5555 45678899999999999999999999999999 77766666666778888888888
Q ss_pred CCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 104 CDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 104 ~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
+++....+.+.. +. +..|. ....-..|--++..|++.+|-..|-.++
T Consensus 99 ~~~d~Al~~L~~----~~---~~~~~--~~~~~~~Gdi~~~~g~~~~A~~~y~~Al 145 (145)
T PF09976_consen 99 GQYDEALATLQQ----IP---DEAFK--ALAAELLGDIYLAQGDYDEARAAYQKAL 145 (145)
T ss_pred CCHHHHHHHHHh----cc---CcchH--HHHHHHHHHHHHHCCCHHHHHHHHHHhC
Confidence 999888887744 22 34553 3455579999999999999999886653
No 6
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=97.13 E-value=0.017 Score=36.31 Aligned_cols=94 Identities=19% Similarity=0.188 Sum_probs=67.1
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
++..+|..+.+.|+.+.|++.+.++. ....... .+.+.+..+.+.. +++....+.+.++-.+... +.
T Consensus 2 ~~~~~a~~~~~~~~~~~A~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~-~~~~~a~~~~~~~~~~~~~----~~---- 68 (100)
T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKAL-ELDPDNA---DAYYNLAAAYYKL-GKYEEALEDYEKALELDPD----NA---- 68 (100)
T ss_pred HHHHHHHHHHHHhcHHHHHHHHHHHH-hcCCccH---HHHHHHHHHHHHH-HHHHHHHHHHHHHHhCCCc----ch----
Confidence 46788999999999999999999988 5544332 4555566666666 7777777777766654321 22
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
......|..++..+++..|...|..++
T Consensus 69 ~~~~~~~~~~~~~~~~~~a~~~~~~~~ 95 (100)
T cd00189 69 KAYYNLGLAYYKLGKYEEALEAYEKAL 95 (100)
T ss_pred hHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 344566777788899999998887664
No 7
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=96.53 E-value=0.011 Score=39.46 Aligned_cols=72 Identities=15% Similarity=0.148 Sum_probs=51.3
Q ss_pred HHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChH-HHHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHhhhh
Q psy398 49 QVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKS-HRIDAVFTLFRIAYFHG--CDIQAMTAAIEKATELIE 121 (162)
Q Consensus 49 eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g-~kiD~~l~~iRv~~f~~--~d~~~v~~~i~ka~~l~e 121 (162)
++-.++..+|..|.+.|+.+.|++.|.+++ +.....| ...++..+...+|..+. +|+....+++.+|-++.+
T Consensus 3 ~~a~~~~~la~~~~~~~~~~~A~~~~~~al-~~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~~ 77 (78)
T PF13424_consen 3 DTANAYNNLARVYRELGRYDEALDYYEKAL-DIEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKALDIFE 77 (78)
T ss_dssp HHHHHHHHHHHHHHHTT-HHHHHHHHHHHH-HHHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHcCCHHHHHHHHHHHH-HHHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhhc
Confidence 456788999999999999999999999988 6542222 22344555555554442 888888888888877654
No 8
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=96.49 E-value=0.074 Score=36.91 Aligned_cols=100 Identities=13% Similarity=0.040 Sum_probs=72.4
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
.+..+|..+.+.|+++.|.+.|.+.. .........-+..+.+.++.+-. +++....+++.++-..-. ++....
T Consensus 4 ~~~~~~~~~~~~~~~~~A~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~-~~~~~A~~~~~~~~~~~p---~~~~~~-- 76 (119)
T TIGR02795 4 AYYDAALLVLKAGDYADAIQAFQAFL-KKYPKSTYAPNAHYWLGEAYYAQ-GKYADAAKAFLAVVKKYP---KSPKAP-- 76 (119)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHH-HHCCCccccHHHHHHHHHHHHhh-ccHHHHHHHHHHHHHHCC---CCCccc--
Confidence 45678999999999999999999988 65544445567888888888888 999999999998877632 122111
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
......|..+...+++.+|.+.|-..+
T Consensus 77 ~~~~~~~~~~~~~~~~~~A~~~~~~~~ 103 (119)
T TIGR02795 77 DALLKLGMSLQELGDKEKAKATLQQVI 103 (119)
T ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHH
Confidence 123344555567888888888876543
No 9
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=96.22 E-value=0.11 Score=39.01 Aligned_cols=62 Identities=13% Similarity=0.046 Sum_probs=31.6
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATE 118 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~ 118 (162)
.++..+|..|...|+.++|.+.|.++. ....... +..+.+..+.+.. +++....+.+.++-.
T Consensus 66 ~~~~~la~~~~~~~~~~~A~~~~~~al-~~~~~~~---~~~~~~~~~~~~~-g~~~~A~~~~~~~~~ 127 (234)
T TIGR02521 66 LAYLALALYYQQLGELEKAEDSFRRAL-TLNPNNG---DVLNNYGTFLCQQ-GKYEQAMQQFEQAIE 127 (234)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-hhCCCCH---HHHHHHHHHHHHc-ccHHHHHHHHHHHHh
Confidence 345556666666666666666666665 4332211 2333334444444 555555555555543
No 10
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=96.16 E-value=0.066 Score=40.22 Aligned_cols=101 Identities=16% Similarity=0.056 Sum_probs=74.7
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
++..+|.-+-.-|+-++|+..|+++. ....+.-++....+.+-..---. +........+.++- ++ -+|-+-..
T Consensus 3 ~~~~~A~a~d~~G~~~~Ai~~Y~~Al-~~gL~~~~~~~a~i~lastlr~L-G~~deA~~~L~~~~---~~--~p~~~~~~ 75 (120)
T PF12688_consen 3 ALYELAWAHDSLGREEEAIPLYRRAL-AAGLSGADRRRALIQLASTLRNL-GRYDEALALLEEAL---EE--FPDDELNA 75 (120)
T ss_pred hHHHHHHHHHhcCCHHHHHHHHHHHH-HcCCCchHHHHHHHHHHHHHHHc-CCHHHHHHHHHHHH---HH--CCCccccH
Confidence 56788889999999999999999999 76655556666555544444444 66666666665553 33 12222367
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
.+++..++.+...+..++|.+.||.+++
T Consensus 76 ~l~~f~Al~L~~~gr~~eAl~~~l~~la 103 (120)
T PF12688_consen 76 ALRVFLALALYNLGRPKEALEWLLEALA 103 (120)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 7889999999999999999999998764
No 11
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=95.43 E-value=0.36 Score=34.78 Aligned_cols=95 Identities=17% Similarity=-0.061 Sum_probs=67.0
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
.+...+|..|.+.|+.++|...|.++. ...... .++...+-.+.+-. +++......+.++-.+-. .+++
T Consensus 18 ~~~~~~a~~~~~~~~~~~A~~~~~~~~-~~~p~~---~~~~~~la~~~~~~-~~~~~A~~~~~~~~~~~p----~~~~-- 86 (135)
T TIGR02552 18 EQIYALAYNLYQQGRYDEALKLFQLLA-AYDPYN---SRYWLGLAACCQML-KEYEEAIDAYALAAALDP----DDPR-- 86 (135)
T ss_pred HHHHHHHHHHHHcccHHHHHHHHHHHH-HhCCCc---HHHHHHHHHHHHHH-HHHHHHHHHHHHHHhcCC----CChH--
Confidence 457789999999999999999999988 543322 24444444444445 777777777777655421 2333
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
...+.|..++..+++.+|.+.|-.++
T Consensus 87 --~~~~la~~~~~~g~~~~A~~~~~~al 112 (135)
T TIGR02552 87 --PYFHAAECLLALGEPESALKALDLAI 112 (135)
T ss_pred --HHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 44667888889999999999887654
No 12
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=95.39 E-value=0.095 Score=33.86 Aligned_cols=63 Identities=19% Similarity=0.179 Sum_probs=49.6
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcC-CHHHHHHHHHHHHhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGC-DIQAMTAAIEKATEL 119 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~-d~~~v~~~i~ka~~l 119 (162)
..+..+|..|.+.||++.|++.|+++. +...+. .++.+.+-.+-... + ++....+++.+|-++
T Consensus 4 ~~~~~~g~~~~~~~~~~~A~~~~~~ai-~~~p~~---~~~~~~~g~~~~~~-~~~~~~A~~~~~~al~l 67 (69)
T PF13414_consen 4 EAWYNLGQIYFQQGDYEEAIEYFEKAI-ELDPNN---AEAYYNLGLAYMKL-GKDYEEAIEDFEKALKL 67 (69)
T ss_dssp HHHHHHHHHHHHTTHHHHHHHHHHHHH-HHSTTH---HHHHHHHHHHHHHT-TTHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-HcCCCC---HHHHHHHHHHHHHh-CccHHHHHHHHHHHHHc
Confidence 467889999999999999999999999 766443 34666666666666 7 688888888887654
No 13
>KOG1840|consensus
Probab=95.27 E-value=1.8 Score=40.08 Aligned_cols=127 Identities=15% Similarity=0.127 Sum_probs=94.4
Q ss_pred HHHHHHHHH---hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCC-----CCCChHHHHHHHHHHHHHHHHhc
Q psy398 32 LKELDLEHE---KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFAD-----KTNPKSHRIDAVFTLFRIAYFHG 103 (162)
Q Consensus 32 l~~Le~~lk---~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~-----~~~s~g~kiD~~l~~iRv~~f~~ 103 (162)
+.-++..+. +.+|-.-..|-..+.++|.-|++.|++++|...+.++. + +.++.+..--....++-+.--.
T Consensus 261 v~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al-~I~~~~~~~~~~~v~~~l~~~~~~~~~~- 338 (508)
T KOG1840|consen 261 VNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERAL-EIYEKLLGASHPEVAAQLSELAAILQSM- 338 (508)
T ss_pred HHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhccCChHHHHHHHHHHH-HHHHHhhccChHHHHHHHHHHHHHHHHh-
Confidence 333444455 78883345666679999999999999999999998865 3 2344555555555566666566
Q ss_pred CCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 104 CDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 104 ~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
+.+..+..+..++-+++...-|.|----++.+.=-|.-+..+|+|++|-++|-.++.
T Consensus 339 ~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~ 395 (508)
T KOG1840|consen 339 NEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQ 395 (508)
T ss_pred cchhHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHHHHH
Confidence 889999999998888876323556556788888888889999999999999988763
No 14
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=95.26 E-value=0.4 Score=36.04 Aligned_cols=97 Identities=10% Similarity=0.031 Sum_probs=66.8
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
.++..+|..|...|+.++|.+.|.++. .... .....+....+..+.+.. +++......+.++-.... .+
T Consensus 100 ~~~~~~~~~~~~~g~~~~A~~~~~~~~-~~~~-~~~~~~~~~~l~~~~~~~-g~~~~A~~~~~~~~~~~~----~~---- 168 (234)
T TIGR02521 100 DVLNNYGTFLCQQGKYEQAMQQFEQAI-EDPL-YPQPARSLENAGLCALKA-GDFDKAEKYLTRALQIDP----QR---- 168 (234)
T ss_pred HHHHHHHHHHHHcccHHHHHHHHHHHH-hccc-cccchHHHHHHHHHHHHc-CCHHHHHHHHHHHHHhCc----CC----
Confidence 466778899999999999999999988 5322 223345566666677777 888888888888776532 11
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
...-...|..+...++|.+|...|-.++
T Consensus 169 ~~~~~~la~~~~~~~~~~~A~~~~~~~~ 196 (234)
T TIGR02521 169 PESLLELAELYYLRGQYKDARAYLERYQ 196 (234)
T ss_pred hHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 1223345666777888888887776543
No 15
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=95.26 E-value=0.9 Score=38.37 Aligned_cols=101 Identities=16% Similarity=-0.034 Sum_probs=58.9
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
..++..+|.-|.+.|+.+.|.+.|.++. +... ........++++.... +++....+.+.++-..-. ...+..
T Consensus 107 ~~~~~~La~~~~~~g~~~~A~~~~~~~l-~~~~---~~~~~~~~la~~~~~~-g~~~~A~~~~~~~~~~~~---~~~~~~ 178 (389)
T PRK11788 107 LLALQELGQDYLKAGLLDRAEELFLQLV-DEGD---FAEGALQQLLEIYQQE-KDWQKAIDVAERLEKLGG---DSLRVE 178 (389)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHH-cCCc---chHHHHHHHHHHHHHh-chHHHHHHHHHHHHHhcC---CcchHH
Confidence 3567778888888888888888888887 5422 2345566666666666 777766666555443211 011111
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
...+-...|..++..+++.+|.+.|-.++
T Consensus 179 ~~~~~~~la~~~~~~~~~~~A~~~~~~al 207 (389)
T PRK11788 179 IAHFYCELAQQALARGDLDAARALLKKAL 207 (389)
T ss_pred HHHHHHHHHHHHHhCCCHHHHHHHHHHHH
Confidence 12222234445556677777766665543
No 16
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=94.86 E-value=0.41 Score=41.31 Aligned_cols=117 Identities=14% Similarity=0.192 Sum_probs=84.7
Q ss_pred HHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCC-
Q psy398 27 ANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCD- 105 (162)
Q Consensus 27 ~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d- 105 (162)
..+..+.+|+..|..|-+ +. .+|.-+|..|...|+...|+.+|.++. .- .|...|+.+..-++-++. .|
T Consensus 137 ~~~~l~a~Le~~L~~nP~--d~---egW~~Lg~~ym~~~~~~~A~~AY~~A~-rL---~g~n~~~~~g~aeaL~~~-a~~ 206 (287)
T COG4235 137 EMEALIARLETHLQQNPG--DA---EGWDLLGRAYMALGRASDALLAYRNAL-RL---AGDNPEILLGLAEALYYQ-AGQ 206 (287)
T ss_pred cHHHHHHHHHHHHHhCCC--Cc---hhHHHHHHHHHHhcchhHHHHHHHHHH-Hh---CCCCHHHHHHHHHHHHHh-cCC
Confidence 356667778888888888 63 488899999999999999999999988 32 355678888888887777 44
Q ss_pred --HHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHH---hHhhccC
Q psy398 106 --IQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAEL---FVDVIRS 161 (162)
Q Consensus 106 --~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~l---fld~~~T 161 (162)
...+..-+.+|-.+ |-. --|-.-|-|.....+++|++|+.. +|+++|.
T Consensus 207 ~~ta~a~~ll~~al~~-------D~~-~iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~ 259 (287)
T COG4235 207 QMTAKARALLRQALAL-------DPA-NIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPA 259 (287)
T ss_pred cccHHHHHHHHHHHhc-------CCc-cHHHHHHHHHHHHHcccHHHHHHHHHHHHhcCCC
Confidence 44444444444433 211 234456889999999999999865 5666654
No 17
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=94.68 E-value=1.7 Score=34.65 Aligned_cols=116 Identities=16% Similarity=0.186 Sum_probs=78.4
Q ss_pred HHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCC-
Q psy398 27 ANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCD- 105 (162)
Q Consensus 27 ~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d- 105 (162)
..++.+..|+..++.|-. +. ++|..+|..|...|+.+.|+.+|.+++ ..... ..++.+.+-.+-++..++
T Consensus 54 ~~~~~i~~l~~~L~~~P~--~~---~~w~~Lg~~~~~~g~~~~A~~a~~~Al-~l~P~---~~~~~~~lA~aL~~~~g~~ 124 (198)
T PRK10370 54 TPEAQLQALQDKIRANPQ--NS---EQWALLGEYYLWRNDYDNALLAYRQAL-QLRGE---NAELYAALATVLYYQAGQH 124 (198)
T ss_pred hHHHHHHHHHHHHHHCCC--CH---HHHHHHHHHHHHCCCHHHHHHHHHHHH-HhCCC---CHHHHHHHHHHHHHhcCCC
Confidence 345666777777776555 43 489999999999999999999999998 43322 234444444433223133
Q ss_pred -HHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 106 -IQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 106 -~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
.......+.+|-.+- ..+. ......|+.++..++|.+|...|-..+
T Consensus 125 ~~~~A~~~l~~al~~d----P~~~----~al~~LA~~~~~~g~~~~Ai~~~~~aL 171 (198)
T PRK10370 125 MTPQTREMIDKALALD----ANEV----TALMLLASDAFMQADYAQAIELWQKVL 171 (198)
T ss_pred CcHHHHHHHHHHHHhC----CCCh----hHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 367777777776652 1232 344567888999999999998886543
No 18
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=94.54 E-value=0.057 Score=31.65 Aligned_cols=26 Identities=12% Similarity=0.068 Sum_probs=22.6
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
++..+|+.|.+.||.++|+++|.+++
T Consensus 1 al~~Lg~~~~~~g~~~~Ai~~y~~aL 26 (36)
T PF13176_consen 1 ALNNLGRIYRQQGDYEKAIEYYEQAL 26 (36)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 46789999999999999999999955
No 19
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=94.50 E-value=0.77 Score=38.80 Aligned_cols=94 Identities=13% Similarity=-0.055 Sum_probs=48.5
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
++..+|..|++.|+.++|.+.|.++. +.. +....+....++.+..-. +++....+.+.++...- .+ +
T Consensus 216 ~~~~la~~~~~~g~~~~A~~~~~~~~-~~~--p~~~~~~~~~l~~~~~~~-g~~~~A~~~l~~~~~~~-----p~----~ 282 (389)
T PRK11788 216 ASILLGDLALAQGDYAAAIEALERVE-EQD--PEYLSEVLPKLMECYQAL-GDEAEGLEFLRRALEEY-----PG----A 282 (389)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHH-HHC--hhhHHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHhC-----CC----c
Confidence 44455566666666666666666655 321 122233333444444444 55555555555554431 11 1
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
..-...|-.++..+++.+|..+|-+.+
T Consensus 283 ~~~~~la~~~~~~g~~~~A~~~l~~~l 309 (389)
T PRK11788 283 DLLLALAQLLEEQEGPEAAQALLREQL 309 (389)
T ss_pred hHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 111445566667778888877776554
No 20
>PRK11189 lipoprotein NlpI; Provisional
Probab=94.39 E-value=2.6 Score=35.45 Aligned_cols=86 Identities=17% Similarity=0.060 Sum_probs=44.1
Q ss_pred HHHHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCC
Q psy398 27 ANEKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCD 105 (162)
Q Consensus 27 ~n~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d 105 (162)
..+..+.++..-|. ..+. ++.--..+..+|..|.+.|+.+.|...|.++. ...... .+..+.+-.+.... ++
T Consensus 41 ~~e~~i~~~~~~l~~~~~~--~~~~a~~~~~~g~~~~~~g~~~~A~~~~~~Al-~l~P~~---~~a~~~lg~~~~~~-g~ 113 (296)
T PRK11189 41 QQEVILARLNQILASRDLT--DEERAQLHYERGVLYDSLGLRALARNDFSQAL-ALRPDM---ADAYNYLGIYLTQA-GN 113 (296)
T ss_pred HHHHHHHHHHHHHccccCC--cHhhHHHHHHHHHHHHHCCCHHHHHHHHHHHH-HcCCCC---HHHHHHHHHHHHHC-CC
Confidence 34445555555554 2333 33334456666666666666666666666665 332221 23344444444444 56
Q ss_pred HHHHHHHHHHHHhh
Q psy398 106 IQAMTAAIEKATEL 119 (162)
Q Consensus 106 ~~~v~~~i~ka~~l 119 (162)
+........+|-.+
T Consensus 114 ~~~A~~~~~~Al~l 127 (296)
T PRK11189 114 FDAAYEAFDSVLEL 127 (296)
T ss_pred HHHHHHHHHHHHHh
Confidence 66555555555443
No 21
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=94.27 E-value=0.93 Score=34.77 Aligned_cols=67 Identities=13% Similarity=0.042 Sum_probs=52.7
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
...+..+|..|.+.|+.++|+..|.++. .............+.+..+.... +++......+.+|-..
T Consensus 35 a~~~~~lg~~~~~~g~~~~A~~~~~~al-~~~~~~~~~~~~~~~la~~~~~~-g~~~~A~~~~~~al~~ 101 (172)
T PRK02603 35 AFVYYRDGMSAQADGEYAEALENYEEAL-KLEEDPNDRSYILYNMGIIYASN-GEHDKALEYYHQALEL 101 (172)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH-HHhhccchHHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHh
Confidence 4467899999999999999999999988 54433333456677777777777 9999999988888776
No 22
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=94.12 E-value=2.3 Score=33.61 Aligned_cols=124 Identities=19% Similarity=0.166 Sum_probs=67.7
Q ss_pred HHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc-----CCH
Q psy398 32 LKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG-----CDI 106 (162)
Q Consensus 32 l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~-----~d~ 106 (162)
+..++.-++.+.. ......++..+|.-|.+.|+.+.|+..|.+.. ..........+. +..+-.+.+.- .+.
T Consensus 53 ~~~~~~~~~~~p~--~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~l-~~~p~~~~~~~a-~~~~g~~~~~~~~~~~~~~ 128 (235)
T TIGR03302 53 IKYFEALESRYPF--SPYAEQAQLDLAYAYYKSGDYAEAIAAADRFI-RLHPNHPDADYA-YYLRGLSNYNQIDRVDRDQ 128 (235)
T ss_pred HHHHHHHHHhCCC--chhHHHHHHHHHHHHHhcCCHHHHHHHHHHHH-HHCcCCCchHHH-HHHHHHHHHHhcccccCCH
Confidence 4444444444444 45567889999999999999999999999998 555443333222 22222222210 111
Q ss_pred HHHHHHHHHHHhhhhh-cCCC-chhhhhHH----------HHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 107 QAMTAAIEKATELIEG-ISGG-DWSARNKL----------KAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 107 ~~v~~~i~ka~~l~e~-~~gg-DW~rrnrL----------Kvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
......+.-.+..++. |... -|.-..++ ....|-.++..|+|.+|...|-..+
T Consensus 129 ~~~~~A~~~~~~~~~~~p~~~~~~~a~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~A~~~~~~al 193 (235)
T TIGR03302 129 TAAREAFEAFQELIRRYPNSEYAPDAKKRMDYLRNRLAGKELYVARFYLKRGAYVAAINRFETVV 193 (235)
T ss_pred HHHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCChHHHHHHHHHHH
Confidence 2233333333333333 1111 12211111 1245677788899999888776543
No 23
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=94.05 E-value=0.57 Score=39.16 Aligned_cols=108 Identities=16% Similarity=0.148 Sum_probs=76.0
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHH--HHhcCCHHHHHHHHHHHHhhhhhcCC
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIA--YFHGCDIQAMTAAIEKATELIEGISG 125 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~--~f~~~d~~~v~~~i~ka~~l~e~~~g 125 (162)
++..+.+..-|.-|-.-|+.++|.++|.++- +-....+..-+..-+....+ +.. .|+..+.+++.+|-.+.-+ .
T Consensus 32 e~Aa~~y~~Aa~~fk~~~~~~~A~~ay~kAa-~~~~~~~~~~~Aa~~~~~Aa~~~k~-~~~~~Ai~~~~~A~~~y~~--~ 107 (282)
T PF14938_consen 32 EEAADLYEKAANCFKLAKDWEKAAEAYEKAA-DCYEKLGDKFEAAKAYEEAANCYKK-GDPDEAIECYEKAIEIYRE--A 107 (282)
T ss_dssp HHHHHHHHHHHHHHHHTT-CHHHHHHHHHHH-HHHHHTT-HHHHHHHHHHHHHHHHH-TTHHHHHHHHHHHHHHHHH--C
T ss_pred HHHHHHHHHHHHHHHHHhccchhHHHHHHHH-HHHHHcCCHHHHHHHHHHHHHHHHh-hCHHHHHHHHHHHHHHHHh--c
Confidence 6778888889999999999999999999975 43333344444443333333 344 6888889999999998777 8
Q ss_pred CchhhhhHHHHHHHHHHhhh-hcHHHHHHHhHhhc
Q psy398 126 GDWSARNKLKAYEGVYCLAI-RQYCRAAELFVDVI 159 (162)
Q Consensus 126 gDW~rrnrLKvy~gl~~l~~-R~fk~AA~lfld~~ 159 (162)
|++.+-.+...--|-.+-.. +++..|.++|-.++
T Consensus 108 G~~~~aA~~~~~lA~~ye~~~~d~e~Ai~~Y~~A~ 142 (282)
T PF14938_consen 108 GRFSQAAKCLKELAEIYEEQLGDYEKAIEYYQKAA 142 (282)
T ss_dssp T-HHHHHHHHHHHHHHHCCTT--HHHHHHHHHHHH
T ss_pred CcHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 89998888887777777777 88888888776553
No 24
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=93.98 E-value=0.81 Score=41.54 Aligned_cols=95 Identities=13% Similarity=0.060 Sum_probs=61.9
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
.++.-+|..+...|+.+.|.+.|.++. +..... .+..+.++.+.+.. +++......+.++...... +.
T Consensus 194 ~~~~~~~~~~~~~g~~~~A~~~~~~a~-~~~p~~---~~~~~~~~~~~~~~-g~~~~A~~~~~~~~~~~~~----~~--- 261 (899)
T TIGR02917 194 DALLLKGDLLLSLGNIELALAAYRKAI-ALRPNN---PAVLLALATILIEA-GEFEEAEKHADALLKKAPN----SP--- 261 (899)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHH-hhCCCC---HHHHHHHHHHHHHc-CCHHHHHHHHHHHHHhCCC----Cc---
Confidence 456677888888888888888888887 544332 34566777777777 8888888777776654221 11
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
....+.|+..+..++|.+|...|-.++
T Consensus 262 -~~~~~~~~~~~~~~~~~~A~~~~~~~l 288 (899)
T TIGR02917 262 -LAHYLKALVDFQKKNYEDARETLQDAL 288 (899)
T ss_pred -hHHHHHHHHHHHhcCHHHHHHHHHHHH
Confidence 122345555666666666666665443
No 25
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=93.69 E-value=0.62 Score=42.28 Aligned_cols=97 Identities=19% Similarity=0.100 Sum_probs=67.0
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
..++..+|.-|++.|+.++|...|.++. ...... .+....+.++.... +++..+...+.++. +.++..
T Consensus 56 ~~~~~~l~~~~~~~g~~~~A~~~~~~~~-~~~~~~---~~~~~~~a~~~~~~-g~~~~a~~~~~~~~-------~~~~~~ 123 (899)
T TIGR02917 56 AEARFLLGKIYLALGDYAAAEKELRKAL-SLGYPK---NQVLPLLARAYLLQ-GKFQQVLDELPGKT-------LLDDEG 123 (899)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH-HcCCCh---hhhHHHHHHHHHHC-CCHHHHHHhhcccc-------cCCchh
Confidence 4678889999999999999999999998 544433 34556666777777 77766655544321 123444
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
........|..+...++|.+|.+.|-.++
T Consensus 124 ~~~~~~~~~~~~~~~~~~~~A~~~~~~a~ 152 (899)
T TIGR02917 124 AAELLALRGLAYLGLGQLELAQKSYEQAL 152 (899)
T ss_pred hHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 55666677777777777777777775543
No 26
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=93.55 E-value=0.7 Score=29.43 Aligned_cols=59 Identities=27% Similarity=0.256 Sum_probs=49.2
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.+|..|.+.|+++.|.++|.+++ ..- ..-.++.+.+-++.+.. +++......+.++-..
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~l-~~~---P~~~~a~~~lg~~~~~~-g~~~~A~~~~~~a~~~ 60 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQAL-KQD---PDNPEAWYLLGRILYQQ-GRYDEALAYYERALEL 60 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHHH-CCS---TTHHHHHHHHHHHHHHT-T-HHHHHHHHHHHHHH
T ss_pred hHHHHHHHcCCHHHHHHHHHHHH-HHC---CCCHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHH
Confidence 57899999999999999999999 544 33778888999999888 9999988888887554
No 27
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=93.52 E-value=2.1 Score=39.45 Aligned_cols=60 Identities=10% Similarity=0.164 Sum_probs=24.7
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATE 118 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~ 118 (162)
+..+|..|...|++++|+..|.+++ +.... -.++.+++-.+.+.. +++........+|-.
T Consensus 368 ~~~la~~~~~~g~~~eA~~~~~~al-~~~p~---~~~~~~~lg~~~~~~-g~~~~A~~~~~kal~ 427 (615)
T TIGR00990 368 YIKRASMNLELGDPDKAEEDFDKAL-KLNSE---DPDIYYHRAQLHFIK-GEFAQAGKDYQKSID 427 (615)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH-HhCCC---CHHHHHHHHHHHHHc-CCHHHHHHHHHHHHH
Confidence 3344444444444444444444444 22111 123344444444444 444444444444433
No 28
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=93.46 E-value=4.5 Score=37.48 Aligned_cols=96 Identities=18% Similarity=0.162 Sum_probs=77.1
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCC
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGG 126 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~gg 126 (162)
.+.+.+++.-+|.||-.+|+++.|++...+++ +.|.|. +|+.+...||---. ||+......++.|+.| | ..
T Consensus 190 p~~~lw~~~~lAqhyd~~g~~~~Al~~Id~aI-~htPt~---~ely~~KarilKh~-G~~~~Aa~~~~~Ar~L-D---~~ 260 (517)
T PF12569_consen 190 PSTLLWTLYFLAQHYDYLGDYEKALEYIDKAI-EHTPTL---VELYMTKARILKHA-GDLKEAAEAMDEAREL-D---LA 260 (517)
T ss_pred chHHHHHHHHHHHHHHHhCCHHHHHHHHHHHH-hcCCCc---HHHHHHHHHHHHHC-CCHHHHHHHHHHHHhC-C---hh
Confidence 46678899999999999999999999999999 888664 89999999999888 9999999999999998 3 23
Q ss_pred chhhhhHHHHHHHHHHhhhhcHHHHHHHh
Q psy398 127 DWSARNKLKAYEGVYCLAIRQYCRAAELF 155 (162)
Q Consensus 127 DW~rrnrLKvy~gl~~l~~R~fk~AA~lf 155 (162)
| |- +-.=.+=|+|-.+++++|-+.+
T Consensus 261 D--Ry--iNsK~aKy~LRa~~~e~A~~~~ 285 (517)
T PF12569_consen 261 D--RY--INSKCAKYLLRAGRIEEAEKTA 285 (517)
T ss_pred h--HH--HHHHHHHHHHHCCCHHHHHHHH
Confidence 4 21 1222345677777887776543
No 29
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=93.39 E-value=2.6 Score=32.02 Aligned_cols=103 Identities=13% Similarity=-0.083 Sum_probs=66.6
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCc
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGD 127 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggD 127 (162)
..-...+..+|..+...|+.+.|+..|.+++ .....+.....+...+-.+.... +++........+|-.+... -+
T Consensus 32 ~~~a~~~~~~g~~~~~~g~~~~A~~~~~~al-~l~~~~~~~~~~~~~lg~~~~~~-g~~~eA~~~~~~Al~~~~~--~~- 106 (168)
T CHL00033 32 EKEAFTYYRDGMSAQSEGEYAEALQNYYEAM-RLEIDPYDRSYILYNIGLIHTSN-GEHTKALEYYFQALERNPF--LP- 106 (168)
T ss_pred hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH-hccccchhhHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHhCcC--cH-
Confidence 3446677889999999999999999999998 55433222333555555566666 8999999999888876332 22
Q ss_pred hhhhhHHHHHH--HHHHhhhhcHHHHHHHh
Q psy398 128 WSARNKLKAYE--GVYCLAIRQYCRAAELF 155 (162)
Q Consensus 128 W~rrnrLKvy~--gl~~l~~R~fk~AA~lf 155 (162)
....|.-.+|. |-..+..++|..|-..|
T Consensus 107 ~~~~~la~i~~~~~~~~~~~g~~~~A~~~~ 136 (168)
T CHL00033 107 QALNNMAVICHYRGEQAIEQGDSEIAEAWF 136 (168)
T ss_pred HHHHHHHHHHHHhhHHHHHcccHHHHHHHH
Confidence 22345445544 43444566665443333
No 30
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=93.16 E-value=1.1 Score=30.91 Aligned_cols=70 Identities=14% Similarity=-0.020 Sum_probs=54.9
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.....++..+|.-|.+.|+.+.|.+.|.+.. ..........++.+.+.++.... +++......+.++-..
T Consensus 36 ~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~-~~~p~~~~~~~~~~~~~~~~~~~-~~~~~A~~~~~~~~~~ 105 (119)
T TIGR02795 36 TYAPNAHYWLGEAYYAQGKYADAAKAFLAVV-KKYPKSPKAPDALLKLGMSLQEL-GDKEKAKATLQQVIKR 105 (119)
T ss_pred cccHHHHHHHHHHHHhhccHHHHHHHHHHHH-HHCCCCCcccHHHHHHHHHHHHh-CChHHHHHHHHHHHHH
Confidence 3345678889999999999999999999988 54433334456777777777778 9999888888887766
No 31
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=93.05 E-value=3 Score=38.43 Aligned_cols=99 Identities=11% Similarity=0.084 Sum_probs=69.5
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCc
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGD 127 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggD 127 (162)
...-.++..+|.+|...|+.+.|++.|.+++ +.... ..+..+.+..+.+.. +++......+.+|-.+- ..+
T Consensus 328 ~~~a~a~~~lg~~~~~~g~~~eA~~~~~kal-~l~P~---~~~~~~~la~~~~~~-g~~~eA~~~~~~al~~~----p~~ 398 (615)
T TIGR00990 328 EKEAIALNLRGTFKCLKGKHLEALADLSKSI-ELDPR---VTQSYIKRASMNLEL-GDPDKAEEDFDKALKLN----SED 398 (615)
T ss_pred hhhHHHHHHHHHHHHHcCCHHHHHHHHHHHH-HcCCC---cHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHhC----CCC
Confidence 4455788999999999999999999999998 54332 123445555566667 89999999998885542 112
Q ss_pred hhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 128 WSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 128 W~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
. ..-...|..++..++|.+|...|-.++
T Consensus 399 ~----~~~~~lg~~~~~~g~~~~A~~~~~kal 426 (615)
T TIGR00990 399 P----DIYYHRAQLHFIKGEFAQAGKDYQKSI 426 (615)
T ss_pred H----HHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 2 233455666777788888777775554
No 32
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=92.77 E-value=1.8 Score=37.65 Aligned_cols=92 Identities=17% Similarity=0.087 Sum_probs=57.4
Q ss_pred HhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHH
Q psy398 55 QAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKL 134 (162)
Q Consensus 55 ~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrL 134 (162)
...|.-...-|+++.|++.|.+++ ...... .+..+.+-.+.+.. +++......+.+|-.+- .+. ...
T Consensus 6 ~~~a~~a~~~~~~~~Ai~~~~~Al-~~~P~~---~~a~~~~a~~~~~~-g~~~eAl~~~~~Al~l~-----P~~---~~a 72 (356)
T PLN03088 6 EDKAKEAFVDDDFALAVDLYTQAI-DLDPNN---AELYADRAQANIKL-GNFTEAVADANKAIELD-----PSL---AKA 72 (356)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHH-HhCCCC---HHHHHHHHHHHHHc-CCHHHHHHHHHHHHHhC-----cCC---HHH
Confidence 456777777788888888888877 433322 23445555555556 77777777777776652 121 112
Q ss_pred HHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 135 KAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 135 Kvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
....|+.++..++|.+|...|-.++
T Consensus 73 ~~~lg~~~~~lg~~~eA~~~~~~al 97 (356)
T PLN03088 73 YLRKGTACMKLEEYQTAKAALEKGA 97 (356)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 2334777777888888887776554
No 33
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=92.70 E-value=2 Score=35.88 Aligned_cols=109 Identities=16% Similarity=0.111 Sum_probs=75.4
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc---CCHHHHHHHHHHHHhhhhhc
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG---CDIQAMTAAIEKATELIEGI 123 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~---~d~~~v~~~i~ka~~l~e~~ 123 (162)
..+.-.++..-+..|-+. +.+.|+++|.++. +--...|.---..-...++|-.+- +|+.....+..+|-.+.+.
T Consensus 71 ~~~Aa~~~~~Aa~~~k~~-~~~~Ai~~~~~A~-~~y~~~G~~~~aA~~~~~lA~~ye~~~~d~e~Ai~~Y~~A~~~y~~- 147 (282)
T PF14938_consen 71 KFEAAKAYEEAANCYKKG-DPDEAIECYEKAI-EIYREAGRFSQAAKCLKELAEIYEEQLGDYEKAIEYYQKAAELYEQ- 147 (282)
T ss_dssp HHHHHHHHHHHHHHHHHT-THHHHHHHHHHHH-HHHHHCT-HHHHHHHHHHHHHHHCCTT--HHHHHHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHHHHhh-CHHHHHHHHHHHH-HHHHhcCcHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHH-
Confidence 455566677777777666 9999999999988 655555555555666666776662 3678888888899999876
Q ss_pred CCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 124 SGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 124 ~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
.+.....+....--|-.++..++|.+|.++|-...
T Consensus 148 -e~~~~~a~~~~~~~A~l~~~l~~y~~A~~~~e~~~ 182 (282)
T PF14938_consen 148 -EGSPHSAAECLLKAADLYARLGRYEEAIEIYEEVA 182 (282)
T ss_dssp -TT-HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred -CCChhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 55544455555555666777889999999986543
No 34
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=92.55 E-value=0.96 Score=30.51 Aligned_cols=82 Identities=11% Similarity=0.166 Sum_probs=57.1
Q ss_pred hcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHh
Q psy398 64 IGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCL 143 (162)
Q Consensus 64 iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l 143 (162)
-|++++|+..|.+.. +...+.. .-+..+.+.++.+-. +++..+...+.+ ... + +. ........|-.++
T Consensus 2 ~~~y~~Ai~~~~k~~-~~~~~~~-~~~~~~~la~~~~~~-~~y~~A~~~~~~-~~~-~---~~----~~~~~~l~a~~~~ 69 (84)
T PF12895_consen 2 QGNYENAIKYYEKLL-ELDPTNP-NSAYLYNLAQCYFQQ-GKYEEAIELLQK-LKL-D---PS----NPDIHYLLARCLL 69 (84)
T ss_dssp TT-HHHHHHHHHHHH-HHHCGTH-HHHHHHHHHHHHHHT-THHHHHHHHHHC-HTH-H---HC----HHHHHHHHHHHHH
T ss_pred CccHHHHHHHHHHHH-HHCCCCh-hHHHHHHHHHHHHHC-CCHHHHHHHHHH-hCC-C---CC----CHHHHHHHHHHHH
Confidence 478999999999998 5444322 666778788888888 999888888877 221 1 11 1244445588999
Q ss_pred hhhcHHHHHHHhHh
Q psy398 144 AIRQYCRAAELFVD 157 (162)
Q Consensus 144 ~~R~fk~AA~lfld 157 (162)
..++|.+|-+.|-.
T Consensus 70 ~l~~y~eAi~~l~~ 83 (84)
T PF12895_consen 70 KLGKYEEAIKALEK 83 (84)
T ss_dssp HTT-HHHHHHHHHH
T ss_pred HhCCHHHHHHHHhc
Confidence 99999999988753
No 35
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=92.40 E-value=0.28 Score=27.45 Aligned_cols=27 Identities=15% Similarity=0.182 Sum_probs=24.5
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.++..+|..|.+.||.++|.+.|.+++
T Consensus 2 ~~~~~lg~~y~~~~~~~~A~~~~~~a~ 28 (34)
T PF13181_consen 2 EAYYNLGKIYEQLGDYEEALEYFEKAL 28 (34)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 357889999999999999999999987
No 36
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=92.16 E-value=1.3 Score=37.34 Aligned_cols=88 Identities=11% Similarity=-0.027 Sum_probs=68.3
Q ss_pred HHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHH
Q psy398 28 NEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQ 107 (162)
Q Consensus 28 n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~ 107 (162)
..+-+..++.=++..-. ......++..+|..|...||++.|+..|.+.. ..-..+...-|..+.+..+.... +|+.
T Consensus 159 y~~Ai~af~~fl~~yP~--s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv-~~yP~s~~~~dAl~klg~~~~~~-g~~~ 234 (263)
T PRK10803 159 QDDAIVAFQNFVKKYPD--STYQPNANYWLGQLNYNKGKKDDAAYYFASVV-KNYPKSPKAADAMFKVGVIMQDK-GDTA 234 (263)
T ss_pred HHHHHHHHHHHHHHCcC--CcchHHHHHHHHHHHHHcCCHHHHHHHHHHHH-HHCCCCcchhHHHHHHHHHHHHc-CCHH
Confidence 44445556555554444 44557889999999999999999999999999 65566678889999988888888 8988
Q ss_pred HHHHHHHHHHhh
Q psy398 108 AMTAAIEKATEL 119 (162)
Q Consensus 108 ~v~~~i~ka~~l 119 (162)
...+.+.++-..
T Consensus 235 ~A~~~~~~vi~~ 246 (263)
T PRK10803 235 KAKAVYQQVIKK 246 (263)
T ss_pred HHHHHHHHHHHH
Confidence 888888766443
No 37
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=91.94 E-value=0.32 Score=26.99 Aligned_cols=27 Identities=15% Similarity=0.257 Sum_probs=23.6
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.++..+|..|.+.|++++|.++|.+++
T Consensus 2 ~~~~~lg~~~~~~~~~~~A~~~~~~al 28 (34)
T PF07719_consen 2 EAWYYLGQAYYQLGNYEEAIEYFEKAL 28 (34)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 357889999999999999999999987
No 38
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=91.87 E-value=1.5 Score=27.08 Aligned_cols=62 Identities=18% Similarity=0.141 Sum_probs=46.3
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATE 118 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~ 118 (162)
.++..+|..|...|+.+.|.+.|.++. ....... ++.+.+..+.+.. +++......+.++..
T Consensus 35 ~~~~~~~~~~~~~~~~~~a~~~~~~~~-~~~~~~~---~~~~~~~~~~~~~-~~~~~a~~~~~~~~~ 96 (100)
T cd00189 35 DAYYNLAAAYYKLGKYEEALEDYEKAL-ELDPDNA---KAYYNLGLAYYKL-GKYEEALEAYEKALE 96 (100)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH-hCCCcch---hHHHHHHHHHHHH-HhHHHHHHHHHHHHc
Confidence 678899999999999999999999988 5443332 5566666666666 777777777666544
No 39
>KOG0548|consensus
Probab=91.49 E-value=6.5 Score=36.75 Aligned_cols=91 Identities=18% Similarity=0.217 Sum_probs=62.9
Q ss_pred HhhHhHHHhhcCHHHHHHHHHHhcCCCC-CChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhH
Q psy398 55 QAKLDYLCSIGDRAEAFKLADSKFADKT-NPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNK 133 (162)
Q Consensus 55 ~~~a~~~~~iGD~~~A~~ay~~~~~~~~-~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnr 133 (162)
...|.=+.+-||+.+|++.|+++. ... ....--=.-.++.++++ .+...|.-|+..++. ...|. |
T Consensus 362 r~kGne~Fk~gdy~~Av~~YteAI-kr~P~Da~lYsNRAac~~kL~--------~~~~aL~Da~~~ieL--~p~~~---k 427 (539)
T KOG0548|consen 362 REKGNEAFKKGDYPEAVKHYTEAI-KRDPEDARLYSNRAACYLKLG--------EYPEALKDAKKCIEL--DPNFI---K 427 (539)
T ss_pred HHHHHHHHhccCHHHHHHHHHHHH-hcCCchhHHHHHHHHHHHHHh--------hHHHHHHHHHHHHhc--CchHH---H
Confidence 456999999999999999999998 444 22222223445555554 233444447777775 44544 4
Q ss_pred HHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 134 LKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 134 LKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
.-.+.|..+-+.++|.+|.+.|-+.+
T Consensus 428 gy~RKg~al~~mk~ydkAleay~eal 453 (539)
T KOG0548|consen 428 AYLRKGAALRAMKEYDKALEAYQEAL 453 (539)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45567999999999999999987654
No 40
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=91.19 E-value=0.42 Score=26.81 Aligned_cols=27 Identities=15% Similarity=0.187 Sum_probs=24.0
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.++..+|.-|...|+.+.|+++|++++
T Consensus 2 ~~~~~~g~~~~~~~~~~~A~~~~~~al 28 (34)
T PF00515_consen 2 EAYYNLGNAYFQLGDYEEALEYYQRAL 28 (34)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhCCchHHHHHHHHHH
Confidence 468899999999999999999999987
No 41
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=90.59 E-value=3.5 Score=39.37 Aligned_cols=96 Identities=13% Similarity=-0.058 Sum_probs=77.7
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
..++.-+|..+...|++++|.+.+.++. ...... .++.+.+..+-.-. ++.....+.+.+|..+- .|
T Consensus 359 ~~a~~~~a~~l~~~g~~~eA~~~l~~al-~~~P~n---~~l~~~lA~l~~~~-g~~~~A~~~l~~al~l~-----Pd--- 425 (765)
T PRK10049 359 LQGQSLLSQVAKYSNDLPQAEMRARELA-YNAPGN---QGLRIDYASVLQAR-GWPRAAENELKKAEVLE-----PR--- 425 (765)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH-HhCCCC---HHHHHHHHHHHHhc-CCHHHHHHHHHHHHhhC-----CC---
Confidence 4567789999999999999999999988 544332 57888888888888 99999999998888762 22
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
...+.+..|+..+..++|.+|-.++=+.+
T Consensus 426 ~~~l~~~~a~~al~~~~~~~A~~~~~~ll 454 (765)
T PRK10049 426 NINLEVEQAWTALDLQEWRQMDVLTDDVV 454 (765)
T ss_pred ChHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 23488999999999999999988875543
No 42
>KOG1497|consensus
Probab=90.27 E-value=7 Score=34.88 Aligned_cols=108 Identities=15% Similarity=0.052 Sum_probs=89.8
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCC----CChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhh
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKT----NPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEG 122 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~----~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~ 122 (162)
++.+--....+|.-|.+-++...|..+..-.- -.+ ...+.|+-.|+.+-|+.+.- +|......+|.+|.-++-.
T Consensus 99 eEqv~~irl~LAsiYE~Eq~~~~aaq~L~~I~-~~tg~~~~d~~~kl~l~iriarlyLe~-~d~veae~~inRaSil~a~ 176 (399)
T KOG1497|consen 99 EEQVASIRLHLASIYEKEQNWRDAAQVLVGIP-LDTGQKAYDVEQKLLLCIRIARLYLED-DDKVEAEAYINRASILQAE 176 (399)
T ss_pred HHHHHHHHHHHHHHHHHhhhHHHHHHHHhccC-cccchhhhhhHHHHHHHHHHHHHHHhc-CcHHHHHHHHHHHHHhhhc
Confidence 56666677899999999999999998887654 332 56788999999999999999 9999999999999998765
Q ss_pred cCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 123 ISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 123 ~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
....+-.--+||..+=.+=..|+|-+||+.+.+.
T Consensus 177 --~~Ne~Lqie~kvc~ARvlD~krkFlEAAqrYyel 210 (399)
T KOG1497|consen 177 --SSNEQLQIEYKVCYARVLDYKRKFLEAAQRYYEL 210 (399)
T ss_pred --ccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4556667788998888888899999999987653
No 43
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=89.96 E-value=0.43 Score=26.16 Aligned_cols=27 Identities=15% Similarity=0.117 Sum_probs=24.6
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
+|+..+|.-|.+.|+.++|.+.|.+..
T Consensus 1 ~a~~~~a~~~~~~g~~~~A~~~~~~~~ 27 (33)
T PF13174_consen 1 DALYRLARCYYKLGDYDEAIEYFQRLI 27 (33)
T ss_dssp HHHHHHHHHHHHHCHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHccCHHHHHHHHHHHH
Confidence 477889999999999999999999987
No 44
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=89.79 E-value=5.4 Score=33.24 Aligned_cols=99 Identities=12% Similarity=-0.022 Sum_probs=68.7
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
..++..+|..+...|+++.|.+.+++++ ...... ......+-.+-... ++++.....+.++-.+... +.+.
T Consensus 114 ~~~~~~~a~~~~~~G~~~~A~~~~~~al-~~~p~~---~~~~~~la~i~~~~-g~~~eA~~~l~~~l~~~~~----~~~~ 184 (355)
T cd05804 114 WYLLGMLAFGLEEAGQYDRAEEAARRAL-ELNPDD---AWAVHAVAHVLEMQ-GRFKEGIAFMESWRDTWDC----SSML 184 (355)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH-hhCCCC---cHHHHHHHHHHHHc-CCHHHHHHHHHhhhhccCC----Ccch
Confidence 3455567888899999999999999988 554333 23445556666677 8888888888887765431 1122
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
.-..-...|..++..|++.+|...|=..
T Consensus 185 ~~~~~~~la~~~~~~G~~~~A~~~~~~~ 212 (355)
T cd05804 185 RGHNWWHLALFYLERGDYEAALAIYDTH 212 (355)
T ss_pred hHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 2223345788889999999998887554
No 45
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=89.78 E-value=0.64 Score=26.64 Aligned_cols=28 Identities=14% Similarity=0.182 Sum_probs=24.2
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
-.++..+|..|...|++++|.+.+.+++
T Consensus 2 a~~~~~la~~~~~~g~~~~A~~~~~~al 29 (42)
T PF13374_consen 2 ASALNNLANAYRAQGRYEEALELLEEAL 29 (42)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhhhcchhhHHHHHHH
Confidence 3567899999999999999999999977
No 46
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=89.67 E-value=8.6 Score=30.28 Aligned_cols=68 Identities=18% Similarity=0.054 Sum_probs=53.4
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
-+.+..+|.-|.+.|+.+.|...|.+.. ..-......-+..+.+-.+-+.. +++....+...++-..-
T Consensus 33 ~~~~~~~g~~~~~~~~~~~A~~~~~~~~-~~~p~~~~~~~a~~~la~~~~~~-~~~~~A~~~~~~~l~~~ 100 (235)
T TIGR03302 33 AEELYEEAKEALDSGDYTEAIKYFEALE-SRYPFSPYAEQAQLDLAYAYYKS-GDYAEAIAAADRFIRLH 100 (235)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH-HhCCCchhHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHHHC
Confidence 4467899999999999999999999988 54444445556667777777777 89999888888887653
No 47
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=89.64 E-value=7.1 Score=29.29 Aligned_cols=93 Identities=15% Similarity=-0.095 Sum_probs=68.0
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhH
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNK 133 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnr 133 (162)
+..+|.-+.+.|+++.|.+.|.++. ..... ..+..+.+-.+..-. +++........+|-.+- .++ ..
T Consensus 27 ~~~~g~~~~~~g~~~~A~~~~~~al-~~~P~---~~~a~~~lg~~~~~~-g~~~~A~~~y~~Al~l~----p~~----~~ 93 (144)
T PRK15359 27 VYASGYASWQEGDYSRAVIDFSWLV-MAQPW---SWRAHIALAGTWMML-KEYTTAINFYGHALMLD----ASH----PE 93 (144)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH-HcCCC---cHHHHHHHHHHHHHH-hhHHHHHHHHHHHHhcC----CCC----cH
Confidence 4467888899999999999999988 54433 235556666666666 88888888888887762 122 22
Q ss_pred HHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 134 LKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 134 LKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
....-|+.+...|++.+|.+.|-.++
T Consensus 94 a~~~lg~~l~~~g~~~eAi~~~~~Al 119 (144)
T PRK15359 94 PVYQTGVCLKMMGEPGLAREAFQTAI 119 (144)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 34566888888999999999887664
No 48
>KOG4234|consensus
Probab=89.51 E-value=8.4 Score=32.56 Aligned_cols=94 Identities=22% Similarity=0.195 Sum_probs=61.6
Q ss_pred HHHHHHHHHHHhcCCCchHHH----HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCC
Q psy398 30 KRLKELDLEHEKNMMDEEDQV----SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCD 105 (162)
Q Consensus 30 ~~l~~Le~~lk~Nlg~~~~eI----r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d 105 (162)
.+.++|-++++.++.+|+.+- .+-+..-|+-++.-||+++|...|..++ +-|.++..+.--++..=|-+-+. -
T Consensus 70 l~eEeLmae~E~i~~deek~k~~~kad~lK~EGN~~F~ngdyeeA~skY~~Al-e~cp~~~~e~rsIly~Nraaa~i--K 146 (271)
T KOG4234|consen 70 LKEEELMAEIEKIFSDEEKDKAIEKADSLKKEGNELFKNGDYEEANSKYQEAL-ESCPSTSTEERSILYSNRAAALI--K 146 (271)
T ss_pred hhHHHHHHHHHHhcCcHHHHHHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHH-HhCccccHHHHHHHHhhhHHHHH--H
Confidence 344567777776666322221 2234456788999999999999999999 99999988887666666655554 2
Q ss_pred HHHHHHHHHHHHhhhhhcCCCch
Q psy398 106 IQAMTAAIEKATELIEGISGGDW 128 (162)
Q Consensus 106 ~~~v~~~i~ka~~l~e~~~ggDW 128 (162)
...-...|+-+...|+- +.-.
T Consensus 147 l~k~e~aI~dcsKaiel--~pty 167 (271)
T KOG4234|consen 147 LRKWESAIEDCSKAIEL--NPTY 167 (271)
T ss_pred hhhHHHHHHHHHhhHhc--Cchh
Confidence 33334455555555665 4444
No 49
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=89.46 E-value=0.64 Score=28.06 Aligned_cols=27 Identities=22% Similarity=0.302 Sum_probs=25.1
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.++..+|..|.+.|+.+.|.++|++++
T Consensus 2 ~~~~~la~~~~~~G~~~~A~~~~~~~l 28 (44)
T PF13428_consen 2 AAWLALARAYRRLGQPDEAERLLRRAL 28 (44)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 367889999999999999999999988
No 50
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=88.80 E-value=4.3 Score=40.55 Aligned_cols=94 Identities=18% Similarity=0.148 Sum_probs=67.2
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
.+..+|+.|.+.|++++|+++|.++. ..... -.|..+.+.++-+-. +++....+.+.++-..- ..+-..
T Consensus 605 ~~~~La~~~~~~g~~~~A~~~y~~al-~~~P~---~~~a~~~la~~~~~~-g~~~eA~~~l~~ll~~~----p~~~~~-- 673 (1157)
T PRK11447 605 IDLTLADWAQQRGDYAAARAAYQRVL-TREPG---NADARLGLIEVDIAQ-GDLAAARAQLAKLPATA----NDSLNT-- 673 (1157)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHH-HhCCC---CHHHHHHHHHHHHHC-CCHHHHHHHHHHHhccC----CCChHH--
Confidence 45679999999999999999999998 54332 357888889988888 99998888888665431 111111
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
....|..+...+++.+|.++|-..+
T Consensus 674 --~~~la~~~~~~g~~~eA~~~~~~al 698 (1157)
T PRK11447 674 --QRRVALAWAALGDTAAAQRTFNRLI 698 (1157)
T ss_pred --HHHHHHHHHhCCCHHHHHHHHHHHh
Confidence 1223555567788888887776554
No 51
>KOG1840|consensus
Probab=88.76 E-value=7.9 Score=35.95 Aligned_cols=118 Identities=14% Similarity=0.089 Sum_probs=75.8
Q ss_pred hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHH-HHHHHhc--CCHHHHHHHHHHHH
Q psy398 41 KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLF-RIAYFHG--CDIQAMTAAIEKAT 117 (162)
Q Consensus 41 ~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~i-Rv~~f~~--~d~~~v~~~i~ka~ 117 (162)
.+.+++.-.+......+|.-|...|++++|...+..+...-+-+.|++.=.+=..+ -+|+++. +.+.......++|=
T Consensus 189 ~~~~~~~P~~~~~~~~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL 268 (508)
T KOG1840|consen 189 KGLGDEDPERLRTLRNLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEAL 268 (508)
T ss_pred HhcccCCchHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHHH
Confidence 44454466777777779999999999999999999987343334443322222222 2455442 56666666667776
Q ss_pred hhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 118 ELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 118 ~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
+..+..=|.+-.+..=.-.=-|+.+..+|+|++|..++-.+
T Consensus 269 ~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~A 309 (508)
T KOG1840|consen 269 TIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERA 309 (508)
T ss_pred HHHHHhcCCCCHHHHHHHHHHHHHHhccCChHHHHHHHHHH
Confidence 66665446666554333333355568899999999887654
No 52
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=87.33 E-value=2.2 Score=34.94 Aligned_cols=95 Identities=17% Similarity=0.175 Sum_probs=65.6
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
..+...|.++.+.|+.++|+++|.+++ +...... ++...++.+-+.. ++...+...+......... ..+
T Consensus 147 ~~~~~~a~~~~~~G~~~~A~~~~~~al-~~~P~~~---~~~~~l~~~li~~-~~~~~~~~~l~~~~~~~~~--~~~---- 215 (280)
T PF13429_consen 147 RFWLALAEIYEQLGDPDKALRDYRKAL-ELDPDDP---DARNALAWLLIDM-GDYDEAREALKRLLKAAPD--DPD---- 215 (280)
T ss_dssp HHHHHHHHHHHHCCHHHHHHHHHHHHH-HH-TT-H---HHHHHHHHHHCTT-CHHHHHHHHHHHHHHH-HT--SCC----
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-HcCCCCH---HHHHHHHHHHHHC-CChHHHHHHHHHHHHHCcC--HHH----
Confidence 457889999999999999999999999 5443322 3444555566666 7888877778777776532 222
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
+-..-|..++..+++++|-..|-.++
T Consensus 216 --~~~~la~~~~~lg~~~~Al~~~~~~~ 241 (280)
T PF13429_consen 216 --LWDALAAAYLQLGRYEEALEYLEKAL 241 (280)
T ss_dssp --HCHHHHHHHHHHT-HHHHHHHHHHHH
T ss_pred --HHHHHHHHhccccccccccccccccc
Confidence 22344778888899999998886654
No 53
>PRK12370 invasion protein regulator; Provisional
Probab=87.01 E-value=9.4 Score=35.01 Aligned_cols=63 Identities=6% Similarity=-0.126 Sum_probs=40.0
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.++..+|.-+...|+.++|.++|.+++ +.....- +..+.+-.+.+.. +++......+++|-.+
T Consensus 339 ~a~~~lg~~~~~~g~~~~A~~~~~~Al-~l~P~~~---~a~~~lg~~l~~~-G~~~eAi~~~~~Al~l 401 (553)
T PRK12370 339 QALGLLGLINTIHSEYIVGSLLFKQAN-LLSPISA---DIKYYYGWNLFMA-GQLEEALQTINECLKL 401 (553)
T ss_pred HHHHHHHHHHHHccCHHHHHHHHHHHH-HhCCCCH---HHHHHHHHHHHHC-CCHHHHHHHHHHHHhc
Confidence 355667777777778888888887777 4433221 2344455555566 7777777777776665
No 54
>PRK11189 lipoprotein NlpI; Provisional
Probab=86.58 E-value=14 Score=31.04 Aligned_cols=26 Identities=27% Similarity=0.349 Sum_probs=21.3
Q ss_pred HHHHHHHhhhhcHHHHHHHhHhhccC
Q psy398 136 AYEGVYCLAIRQYCRAAELFVDVIRS 161 (162)
Q Consensus 136 vy~gl~~l~~R~fk~AA~lfld~~~T 161 (162)
.|.|..+...+++.+|...|-.++++
T Consensus 240 ~~Lg~~~~~~g~~~~A~~~~~~Al~~ 265 (296)
T PRK11189 240 FYLAKYYLSLGDLDEAAALFKLALAN 265 (296)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHh
Confidence 36688899999999999999776654
No 55
>KOG1173|consensus
Probab=86.26 E-value=2.8 Score=39.48 Aligned_cols=89 Identities=22% Similarity=0.246 Sum_probs=69.6
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHH--hcCCCCCChHHHHHHHHHHHHHHHHhc----CCHHHHHHHHHHHHhhhhhcCCC
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADS--KFADKTNPKSHRIDAVFTLFRIAYFHG----CDIQAMTAAIEKATELIEGISGG 126 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~--~~~~~~~s~g~kiD~~l~~iRv~~f~~----~d~~~v~~~i~ka~~l~e~~~gg 126 (162)
.|...|-||..||..++|.+.|++ +. |.+-+++ .|||=+. +..+...+....|-.++. |.
T Consensus 314 sW~aVg~YYl~i~k~seARry~SKat~l-D~~fgpa----------Wl~fghsfa~e~EhdQAmaaY~tAarl~~---G~ 379 (611)
T KOG1173|consen 314 SWFAVGCYYLMIGKYSEARRYFSKATTL-DPTFGPA----------WLAFGHSFAGEGEHDQAMAAYFTAARLMP---GC 379 (611)
T ss_pred chhhHHHHHHHhcCcHHHHHHHHHHhhc-CccccHH----------HHHHhHHhhhcchHHHHHHHHHHHHHhcc---CC
Confidence 589999999999999999999998 44 4554443 4554332 566777777888888876 43
Q ss_pred chhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 127 DWSARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 127 DW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
. ....|-|+-++.+.+++-|.+.|..+++
T Consensus 380 h-----lP~LYlgmey~~t~n~kLAe~Ff~~A~a 408 (611)
T KOG1173|consen 380 H-----LPSLYLGMEYMRTNNLKLAEKFFKQALA 408 (611)
T ss_pred c-----chHHHHHHHHHHhccHHHHHHHHHHHHh
Confidence 3 5678999999999999999999988764
No 56
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=86.00 E-value=9.3 Score=38.26 Aligned_cols=98 Identities=14% Similarity=0.064 Sum_probs=64.6
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChH--HH---------HHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKS--HR---------IDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g--~k---------iD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
+++..+|..|.+.|+.++|...|+++. ....... .+ ..+.+..-.+.+.. +++......+.+|-.+-
T Consensus 304 ~a~~~Lg~~~~~~g~~~eA~~~l~~Al-~~~p~~~~~~~~~~ll~~~~~~~~~~~g~~~~~~-g~~~eA~~~~~~Al~~~ 381 (1157)
T PRK11447 304 EALGALGQAYSQQGDRARAVAQFEKAL-ALDPHSSNRDKWESLLKVNRYWLLIQQGDAALKA-NNLAQAERLYQQARQVD 381 (1157)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-HhCCCccchhHHHHHHHhhhHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHhC
Confidence 567889999999999999999999988 5443221 11 12223333444556 78888877777777652
Q ss_pred hhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 121 EGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 121 e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
..+-. .....|-.++..++|.+|.+.|-.++
T Consensus 382 ----P~~~~----a~~~Lg~~~~~~g~~~eA~~~y~~aL 412 (1157)
T PRK11447 382 ----NTDSY----AVLGLGDVAMARKDYAAAERYYQQAL 412 (1157)
T ss_pred ----CCCHH----HHHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 11211 22234667777888888888887655
No 57
>PRK12370 invasion protein regulator; Provisional
Probab=85.90 E-value=11 Score=34.46 Aligned_cols=96 Identities=15% Similarity=0.047 Sum_probs=61.8
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
.++..+|..|...|+.++|+..|.++. .-..... ...+....+.+.. +++......+.++-.... .++..
T Consensus 373 ~a~~~lg~~l~~~G~~~eAi~~~~~Al-~l~P~~~---~~~~~~~~~~~~~-g~~eeA~~~~~~~l~~~~---p~~~~-- 442 (553)
T PRK12370 373 DIKYYYGWNLFMAGQLEEALQTINECL-KLDPTRA---AAGITKLWITYYH-TGIDDAIRLGDELRSQHL---QDNPI-- 442 (553)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHH-hcCCCCh---hhHHHHHHHHHhc-cCHHHHHHHHHHHHHhcc---ccCHH--
Confidence 367889999999999999999999988 4433311 1122233345556 676655555554433211 11222
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
...+.|..+...|++.+|...|....
T Consensus 443 --~~~~la~~l~~~G~~~eA~~~~~~~~ 468 (553)
T PRK12370 443 --LLSMQVMFLSLKGKHELARKLTKEIS 468 (553)
T ss_pred --HHHHHHHHHHhCCCHHHHHHHHHHhh
Confidence 34567888888999999998876543
No 58
>KOG1070|consensus
Probab=85.90 E-value=7.1 Score=40.60 Aligned_cols=66 Identities=12% Similarity=0.283 Sum_probs=59.3
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCC---ChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTN---PKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIE 121 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~---s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e 121 (162)
-|...=-|+.+++|.++|.+...+++ +|. -.--++.+|.+.+-+-.+| ++-..+++-+++|-+.++
T Consensus 1460 ~WI~YMaf~LelsEiekAR~iaerAL--~tIN~REeeEKLNiWiA~lNlEn~y-G~eesl~kVFeRAcqycd 1528 (1710)
T KOG1070|consen 1460 LWIRYMAFHLELSEIEKARKIAERAL--KTINFREEEEKLNIWIAYLNLENAY-GTEESLKKVFERACQYCD 1528 (1710)
T ss_pred HHHHHHHHHhhhhhhHHHHHHHHHHh--hhCCcchhHHHHHHHHHHHhHHHhh-CcHHHHHHHHHHHHHhcc
Confidence 46677789999999999999999999 666 5567999999999999999 999999999999999876
No 59
>KOG1155|consensus
Probab=85.27 E-value=14 Score=34.39 Aligned_cols=93 Identities=19% Similarity=0.184 Sum_probs=54.5
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG--CDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~--~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
-|..+|+=|.+++.+++|.+||.+++ .-.-+.|. + +.|+|=.+- +|......+.+|-=...+. .|.--+.
T Consensus 434 lw~aLG~CY~kl~~~~eAiKCykrai-~~~dte~~---~---l~~LakLye~l~d~~eAa~~yek~v~~~~~-eg~~~~~ 505 (559)
T KOG1155|consen 434 LWVALGECYEKLNRLEEAIKCYKRAI-LLGDTEGS---A---LVRLAKLYEELKDLNEAAQYYEKYVEVSEL-EGEIDDE 505 (559)
T ss_pred HHHHHHHHHHHhccHHHHHHHHHHHH-hccccchH---H---HHHHHHHHHHHHhHHHHHHHHHHHHHHHHh-hcccchH
Confidence 46788888888888888888888888 44433332 2 344443331 4444444444444333221 0222223
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHH
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAE 153 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~ 153 (162)
--+-.++-+.|..-..||.+|..
T Consensus 506 t~ka~~fLA~~f~k~~~~~~As~ 528 (559)
T KOG1155|consen 506 TIKARLFLAEYFKKMKDFDEASY 528 (559)
T ss_pred HHHHHHHHHHHHHhhcchHHHHH
Confidence 44455567888888888888865
No 60
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=85.21 E-value=2.1 Score=21.36 Aligned_cols=27 Identities=15% Similarity=0.140 Sum_probs=24.2
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.++..+|..|...|+.+.|..+|.++.
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~~ 28 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKAL 28 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 356789999999999999999999987
No 61
>KOG0547|consensus
Probab=84.65 E-value=6.2 Score=37.00 Aligned_cols=78 Identities=19% Similarity=0.129 Sum_probs=53.4
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHH-HHHHhcCCHHHHHHHHHHHHhhhhhcCCCchh--
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFR-IAYFHGCDIQAMTAAIEKATELIEGISGGDWS-- 129 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iR-v~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~-- 129 (162)
++..+|+.|++-|.+++|+++|+.++ +.|..- -+|..=| -++...+||+.|.+--++|=.+ ..|..
T Consensus 117 ~lK~~GN~~f~~kkY~eAIkyY~~AI-~l~p~e-----piFYsNraAcY~~lgd~~~Vied~TkALEl-----~P~Y~KA 185 (606)
T KOG0547|consen 117 ALKTKGNKFFRNKKYDEAIKYYTQAI-ELCPDE-----PIFYSNRAACYESLGDWEKVIEDCTKALEL-----NPDYVKA 185 (606)
T ss_pred HHHhhhhhhhhcccHHHHHHHHHHHH-hcCCCC-----chhhhhHHHHHHHHhhHHHHHHHHHHHhhc-----CcHHHHH
Confidence 46789999999999999999999999 888763 2233333 2333227888888877777665 34443
Q ss_pred hhhHHHHHHHHH
Q psy398 130 ARNKLKAYEGVY 141 (162)
Q Consensus 130 rrnrLKvy~gl~ 141 (162)
.--|-+++++|-
T Consensus 186 l~RRA~A~E~lg 197 (606)
T KOG0547|consen 186 LLRRASAHEQLG 197 (606)
T ss_pred HHHHHHHHHhhc
Confidence 123455677663
No 62
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=84.00 E-value=3.6 Score=28.88 Aligned_cols=62 Identities=23% Similarity=0.266 Sum_probs=48.3
Q ss_pred HhhcCHHHHHHHHHHhcCCCCCChHHHH------HHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCc
Q psy398 62 CSIGDRAEAFKLADSKFADKTNPKSHRI------DAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGD 127 (162)
Q Consensus 62 ~~iGD~~~A~~ay~~~~~~~~~s~g~ki------D~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggD 127 (162)
.+.||+..|++...+.+ |.+...+..- =..+++.++-..+ +++......+..|=.+..+ -+|
T Consensus 9 ~~~~dy~~A~d~L~~~f-D~~~~~~~~~~~~~~~~all~lA~~~~~~-G~~~~A~~~l~eAi~~Are--~~D 76 (94)
T PF12862_consen 9 LRSGDYSEALDALHRYF-DYAKQSNNSSSNSGLAYALLNLAELHRRF-GHYEEALQALEEAIRLARE--NGD 76 (94)
T ss_pred HHcCCHHHHHHHHHHHH-HHHhhcccchhhHHHHHHHHHHHHHHHHh-CCHHHHHHHHHHHHHHHHH--HCC
Confidence 46899999999999999 8876555433 2456677777778 9999999999888888776 566
No 63
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=83.91 E-value=4 Score=27.37 Aligned_cols=59 Identities=20% Similarity=0.144 Sum_probs=44.9
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKA 116 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka 116 (162)
..+..+|.=|++.|++++|++.+.+ . .... + -.++.+.+-|..+.. +++....+.+++|
T Consensus 26 ~~~~~la~~~~~~~~y~~A~~~~~~-~-~~~~--~-~~~~~~l~a~~~~~l-~~y~eAi~~l~~~ 84 (84)
T PF12895_consen 26 AYLYNLAQCYFQQGKYEEAIELLQK-L-KLDP--S-NPDIHYLLARCLLKL-GKYEEAIKALEKA 84 (84)
T ss_dssp HHHHHHHHHHHHTTHHHHHHHHHHC-H-THHH--C-HHHHHHHHHHHHHHT-T-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHH-h-CCCC--C-CHHHHHHHHHHHHHh-CCHHHHHHHHhcC
Confidence 3566789999999999999999987 4 2211 1 256666778888888 9999998888875
No 64
>PRK04841 transcriptional regulator MalT; Provisional
Probab=83.69 E-value=14 Score=35.27 Aligned_cols=104 Identities=13% Similarity=-0.009 Sum_probs=71.7
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHH---HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCch
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSH---RIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDW 128 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~---kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW 128 (162)
.+...+|..+...||.+.|...+.++. ......|. .+-....+-.+.+.. +++......+.++-.+.+. .|..
T Consensus 492 ~a~~~lg~~~~~~G~~~~A~~~~~~al-~~~~~~g~~~~~~~~~~~la~~~~~~-G~~~~A~~~~~~al~~~~~--~~~~ 567 (903)
T PRK04841 492 VATSVLGEVHHCKGELARALAMMQQTE-QMARQHDVYHYALWSLLQQSEILFAQ-GFLQAAYETQEKAFQLIEE--QHLE 567 (903)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-HHHhhhcchHHHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHHHHH--hccc
Confidence 456678888899999999999999877 54333332 333444555666777 9999999999999988775 3321
Q ss_pred h--hhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 129 S--ARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 129 ~--rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
. ...-+....|......+++.+|..++-.++
T Consensus 568 ~~~~~~~~~~~la~~~~~~G~~~~A~~~~~~al 600 (903)
T PRK04841 568 QLPMHEFLLRIRAQLLWEWARLDEAEQCARKGL 600 (903)
T ss_pred cccHHHHHHHHHHHHHHHhcCHHHHHHHHHHhH
Confidence 1 123334456766777889988887776654
No 65
>KOG2908|consensus
Probab=83.25 E-value=13 Score=33.25 Aligned_cols=93 Identities=12% Similarity=0.199 Sum_probs=71.3
Q ss_pred HHHhhcCHHHHHHHHHHhcCCCCCChH---HHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHH
Q psy398 60 YLCSIGDRAEAFKLADSKFADKTNPKS---HRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKA 136 (162)
Q Consensus 60 ~~~~iGD~~~A~~ay~~~~~~~~~s~g---~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKv 136 (162)
+.-+++|+++|++..+++. ++....+ .++=+...+.|+-+.. +|.+.+++.|...++.+|...|-+---++++--
T Consensus 84 ~~~~~~D~~~al~~Le~i~-~~~~~~~e~~av~~~~t~~~r~~L~i-~DLk~~kk~ldd~~~~ld~~~~v~~~Vh~~fY~ 161 (380)
T KOG2908|consen 84 VSEQISDKDEALEFLEKII-EKLKEYKEPDAVIYILTEIARLKLEI-NDLKEIKKLLDDLKSMLDSLDGVTSNVHSSFYS 161 (380)
T ss_pred HHHHhccHHHHHHHHHHHH-HHHHhhccchhHHHHHHHHHHHHHhc-ccHHHHHHHHHHHHHHHhcccCCChhhhhhHHH
Confidence 3457889999999999988 6654443 4666677788888888 999999999999999998755555545677766
Q ss_pred HHHHHHhhhhcHHHHHHH
Q psy398 137 YEGVYCLAIRQYCRAAEL 154 (162)
Q Consensus 137 y~gl~~l~~R~fk~AA~l 154 (162)
-..-|+=.+|||..+=+.
T Consensus 162 lssqYyk~~~d~a~yYr~ 179 (380)
T KOG2908|consen 162 LSSQYYKKIGDFASYYRH 179 (380)
T ss_pred HHHHHHHHHHhHHHHHHH
Confidence 677777788888766443
No 66
>COG1849 Uncharacterized protein conserved in archaea [Function unknown]
Probab=83.00 E-value=7 Score=28.31 Aligned_cols=55 Identities=22% Similarity=0.247 Sum_probs=39.9
Q ss_pred HHHHHHHHHHHHHHHHHHH-----hc---CCCchHHHHHH--HHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 22 TEMKDANEKRLKELDLEHE-----KN---MMDEEDQVSGI--WQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 22 ~~~~~~n~~~l~~Le~~lk-----~N---lg~~~~eIr~~--~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
+.+.++-++=++.|++.|+ .+ .+ +.-+.|+ +..=|.||..-||+.+|+.|.+=+.
T Consensus 4 ~~l~ekiekYi~~leeaL~~~k~~~~~~s~a--e~~~~ma~~Y~~Dakyf~ekGD~vtAfa~~sYa~ 68 (90)
T COG1849 4 EELAEKIEKYIELLEEALKEIKSRPGDRSAA--EDFVDMAESYFEDAKYFLEKGDYVTAFAALSYAH 68 (90)
T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCcchHH--HHHHHHHHHHHHHHHHHHHcCcHHHHHHHHHHHH
Confidence 3456667777778888777 22 22 3345554 6777899999999999999987765
No 67
>KOG1125|consensus
Probab=82.97 E-value=2.9 Score=39.28 Aligned_cols=91 Identities=13% Similarity=0.180 Sum_probs=59.7
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCC----HHHHHHHHHHHHhhhhhcCCCch
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCD----IQAMTAAIEKATELIEGISGGDW 128 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d----~~~v~~~i~ka~~l~e~~~ggDW 128 (162)
.+.-+|-.|.-.|++++|+.||+.++ .....-.. .-=|+|--. -| -..|..| .+|-+|- .-.
T Consensus 432 vQ~~LGVLy~ls~efdraiDcf~~AL-~v~Pnd~~------lWNRLGAtL-AN~~~s~EAIsAY-~rALqLq-----P~y 497 (579)
T KOG1125|consen 432 VQSGLGVLYNLSGEFDRAVDCFEAAL-QVKPNDYL------LWNRLGATL-ANGNRSEEAISAY-NRALQLQ-----PGY 497 (579)
T ss_pred HHhhhHHHHhcchHHHHHHHHHHHHH-hcCCchHH------HHHHhhHHh-cCCcccHHHHHHH-HHHHhcC-----CCe
Confidence 34568999999999999999999998 44433221 123555544 33 2333333 3444442 111
Q ss_pred hhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 129 SARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 129 ~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
. | .+-=-||.||-.+.|++|++.||.+|+
T Consensus 498 V-R--~RyNlgIS~mNlG~ykEA~~hlL~AL~ 526 (579)
T KOG1125|consen 498 V-R--VRYNLGISCMNLGAYKEAVKHLLEALS 526 (579)
T ss_pred e-e--eehhhhhhhhhhhhHHHHHHHHHHHHH
Confidence 1 2 223469999999999999999999875
No 68
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=82.33 E-value=22 Score=32.30 Aligned_cols=97 Identities=13% Similarity=0.082 Sum_probs=68.8
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHH-HHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTL-FRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~-iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
+.-.|.++...|+.++|++.|.++. +-...-++.=.+|+.= .=..++. +||.....+..+..+ + ..|. +.
T Consensus 270 l~~~gR~~~~~g~~~~Ai~~~~~a~-~~q~~~~Ql~~l~~~El~w~~~~~-~~w~~A~~~f~~L~~---~---s~WS-ka 340 (468)
T PF10300_consen 270 LFFEGRLERLKGNLEEAIESFERAI-ESQSEWKQLHHLCYFELAWCHMFQ-HDWEEAAEYFLRLLK---E---SKWS-KA 340 (468)
T ss_pred HHHHHHHHHHhcCHHHHHHHHHHhc-cchhhHHhHHHHHHHHHHHHHHHH-chHHHHHHHHHHHHh---c---cccH-HH
Confidence 4568899999999999999999988 6555555555544332 2223445 899988877766555 2 6896 55
Q ss_pred HHHHHHHHHHhhhhcH-------HHHHHHhHhhc
Q psy398 133 KLKAYEGVYCLAIRQY-------CRAAELFVDVI 159 (162)
Q Consensus 133 rLKvy~gl~~l~~R~f-------k~AA~lfld~~ 159 (162)
-+.-..|..++..++. ++|.++|-.+.
T Consensus 341 ~Y~Y~~a~c~~~l~~~~~~~~~~~~a~~l~~~vp 374 (468)
T PF10300_consen 341 FYAYLAAACLLMLGREEEAKEHKKEAEELFRKVP 374 (468)
T ss_pred HHHHHHHHHHHhhccchhhhhhHHHHHHHHHHHH
Confidence 5555677777777777 88888887643
No 69
>KOG1155|consensus
Probab=82.21 E-value=11 Score=35.12 Aligned_cols=106 Identities=11% Similarity=0.097 Sum_probs=73.2
Q ss_pred hHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH-----hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 4 YYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE-----KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 4 ~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk-----~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
||+..|+ --|-|+-....+-+.- .++.++++.+| -+.| +++ -+|+..+|+.|.+-+|.++|..+|.+-.
T Consensus 420 YfqkA~~--~kPnDsRlw~aLG~CY-~kl~~~~eAiKCykrai~~~--dte-~~~l~~LakLye~l~d~~eAa~~yek~v 493 (559)
T KOG1155|consen 420 YFQKALE--LKPNDSRLWVALGECY-EKLNRLEEAIKCYKRAILLG--DTE-GSALVRLAKLYEELKDLNEAAQYYEKYV 493 (559)
T ss_pred HHHHHHh--cCCCchHHHHHHHHHH-HHhccHHHHHHHHHHHHhcc--ccc-hHHHHHHHHHHHHHHhHHHHHHHHHHHH
Confidence 3444443 3467888888887776 88999999999 5666 553 5788999999999999999999999977
Q ss_pred CCCCCChHH----HHHHHHHHHHHHHHhcCCHHHHHHHHHHHH
Q psy398 79 ADKTNPKSH----RIDAVFTLFRIAYFHGCDIQAMTAAIEKAT 117 (162)
Q Consensus 79 ~~~~~s~g~----kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~ 117 (162)
+--..-|. -+-.++-+.+--+-. .||.....|-.++-
T Consensus 494 -~~~~~eg~~~~~t~ka~~fLA~~f~k~-~~~~~As~Ya~~~~ 534 (559)
T KOG1155|consen 494 -EVSELEGEIDDETIKARLFLAEYFKKM-KDFDEASYYATLVL 534 (559)
T ss_pred -HHHHhhcccchHHHHHHHHHHHHHHhh-cchHHHHHHHHHHh
Confidence 42222222 333444444333334 78888777765543
No 70
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=81.29 E-value=19 Score=26.58 Aligned_cols=86 Identities=23% Similarity=0.130 Sum_probs=69.6
Q ss_pred hhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHH
Q psy398 63 SIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYC 142 (162)
Q Consensus 63 ~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~ 142 (162)
..||...+.+.+.... ..-.++.......|.+-.+.+.. +|+......+.++-.- ..|-..++..+..-+-.+
T Consensus 23 ~~~~~~~~~~~~~~l~-~~~~~s~ya~~A~l~lA~~~~~~-g~~~~A~~~l~~~~~~-----~~d~~l~~~a~l~LA~~~ 95 (145)
T PF09976_consen 23 QAGDPAKAEAAAEQLA-KDYPSSPYAALAALQLAKAAYEQ-GDYDEAKAALEKALAN-----APDPELKPLARLRLARIL 95 (145)
T ss_pred HCCCHHHHHHHHHHHH-HHCCCChHHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHhh-----CCCHHHHHHHHHHHHHHH
Confidence 5889999988899988 66666667777888888899888 9999998888887664 345556666677778888
Q ss_pred hhhhcHHHHHHHh
Q psy398 143 LAIRQYCRAAELF 155 (162)
Q Consensus 143 l~~R~fk~AA~lf 155 (162)
+..++|.+|...+
T Consensus 96 ~~~~~~d~Al~~L 108 (145)
T PF09976_consen 96 LQQGQYDEALATL 108 (145)
T ss_pred HHcCCHHHHHHHH
Confidence 9999999998886
No 71
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=80.79 E-value=22 Score=26.83 Aligned_cols=75 Identities=15% Similarity=-0.023 Sum_probs=50.1
Q ss_pred HHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHH-------HHHHHhcCCHHHHHHHHHHHHhhhhh
Q psy398 50 VSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLF-------RIAYFHGCDIQAMTAAIEKATELIEG 122 (162)
Q Consensus 50 Ir~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~i-------Rv~~f~~~d~~~v~~~i~ka~~l~e~ 122 (162)
.-.++..+|.-|...|+.+.|++.|.+++ ......+ +....+. |+.+.. +++.....+..+|....++
T Consensus 71 ~~~~~~~lg~~~~~~g~~~eA~~~~~~Al-~~~~~~~---~~~~~la~i~~~~~~~~~~~-g~~~~A~~~~~~a~~~~~~ 145 (168)
T CHL00033 71 RSYILYNIGLIHTSNGEHTKALEYYFQAL-ERNPFLP---QALNNMAVICHYRGEQAIEQ-GDSEIAEAWFDQAAEYWKQ 145 (168)
T ss_pred hHHHHHHHHHHHHHcCCHHHHHHHHHHHH-HhCcCcH---HHHHHHHHHHHHhhHHHHHc-ccHHHHHHHHHHHHHHHHH
Confidence 34588999999999999999999999988 5432222 2222222 233356 7888888888888776554
Q ss_pred cCCCchh
Q psy398 123 ISGGDWS 129 (162)
Q Consensus 123 ~~ggDW~ 129 (162)
..|.|.+
T Consensus 146 a~~~~p~ 152 (168)
T CHL00033 146 AIALAPG 152 (168)
T ss_pred HHHhCcc
Confidence 2234433
No 72
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=79.99 E-value=1.4 Score=25.53 Aligned_cols=22 Identities=18% Similarity=0.252 Sum_probs=19.3
Q ss_pred HHHHHhhHhHHHhhcCHHHHHH
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFK 72 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ 72 (162)
-+++..+|..|+..||.+.|.+
T Consensus 13 ~~a~~nla~~~~~~g~~~~A~~ 34 (34)
T PF13431_consen 13 AEAYNNLANLYLNQGDYEEAIA 34 (34)
T ss_pred HHHHHHHHHHHHHCcCHHhhcC
Confidence 3588999999999999999863
No 73
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=79.43 E-value=13 Score=31.04 Aligned_cols=63 Identities=17% Similarity=-0.014 Sum_probs=42.8
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCC-CChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKT-NPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKA 116 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~-~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka 116 (162)
.++..+|.-|.+.|+.++|...+.+.. +-. ..+...+...+++.++.+.. +++......++++
T Consensus 149 ~~~~~la~i~~~~g~~~eA~~~l~~~l-~~~~~~~~~~~~~~~~la~~~~~~-G~~~~A~~~~~~~ 212 (355)
T cd05804 149 WAVHAVAHVLEMQGRFKEGIAFMESWR-DTWDCSSMLRGHNWWHLALFYLER-GDYEAALAIYDTH 212 (355)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHhhh-hccCCCcchhHHHHHHHHHHHHHC-CCHHHHHHHHHHH
Confidence 456667777777888888888877776 432 23444555666777777777 7777777777665
No 74
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=79.41 E-value=25 Score=33.15 Aligned_cols=62 Identities=6% Similarity=-0.070 Sum_probs=37.0
Q ss_pred HHHhhHhHHHhhcCHHH----HHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 53 IWQAKLDYLCSIGDRAE----AFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~----A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
++..+|..|.+.|+.++ |...|++++ .-... -..+...+..+-+.. +++......+.++-.+
T Consensus 248 ~~~~Lg~~l~~~G~~~eA~~~A~~~~~~Al-~l~P~---~~~a~~~lg~~l~~~-g~~~eA~~~l~~al~l 313 (656)
T PRK15174 248 LRRSLGLAYYQSGRSREAKLQAAEHWRHAL-QFNSD---NVRIVTLYADALIRT-GQNEKAIPLLQQSLAT 313 (656)
T ss_pred HHHHHHHHHHHcCCchhhHHHHHHHHHHHH-hhCCC---CHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHh
Confidence 44557777777777664 677777766 43322 124445555555555 6666666666666554
No 75
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=79.22 E-value=31 Score=32.50 Aligned_cols=96 Identities=17% Similarity=0.078 Sum_probs=67.5
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
.++..+|..+.+.|+.++|+..|.+++ .-.... -++...+.++..-. +++......+.++-.. .++..
T Consensus 285 ~a~~~lg~~l~~~g~~~eA~~~l~~al-~l~P~~---~~a~~~La~~l~~~-G~~~eA~~~l~~al~~-----~P~~~-- 352 (656)
T PRK15174 285 RIVTLYADALIRTGQNEKAIPLLQQSL-ATHPDL---PYVRAMYARALRQV-GQYTAASDEFVQLARE-----KGVTS-- 352 (656)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHH-HhCCCC---HHHHHHHHHHHHHC-CCHHHHHHHHHHHHHh-----Cccch--
Confidence 467788999999999999999999998 543332 24455566666667 8888777777655543 22211
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
....+.|..+...+++.+|...|-.++.
T Consensus 353 -~~~~~~a~al~~~G~~deA~~~l~~al~ 380 (656)
T PRK15174 353 -KWNRYAAAALLQAGKTSEAESVFEHYIQ 380 (656)
T ss_pred -HHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 1233457777889999999998876653
No 76
>PRK14574 hmsH outer membrane protein; Provisional
Probab=79.09 E-value=19 Score=35.37 Aligned_cols=94 Identities=13% Similarity=0.020 Sum_probs=53.8
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
...+.+-..++-||.+.|+..|.++......+.....| ++.+.... ++...+..+++++- +. ..-| .
T Consensus 36 ~~y~~aii~~r~Gd~~~Al~~L~qaL~~~P~~~~av~d----ll~l~~~~-G~~~~A~~~~eka~---~p--~n~~---~ 102 (822)
T PRK14574 36 TQYDSLIIRARAGDTAPVLDYLQEESKAGPLQSGQVDD----WLQIAGWA-GRDQEVIDVYERYQ---SS--MNIS---S 102 (822)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHhhCccchhhHHH----HHHHHHHc-CCcHHHHHHHHHhc---cC--CCCC---H
Confidence 45567777778888888888888877333343334443 33344444 66667777766665 11 1111 2
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
......|..+...++|.+|.++|-.++
T Consensus 103 ~~llalA~ly~~~gdyd~Aiely~kaL 129 (822)
T PRK14574 103 RGLASAARAYRNEKRWDQALALWQSSL 129 (822)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 233334556666777777777665443
No 77
>KOG0553|consensus
Probab=78.85 E-value=29 Score=30.33 Aligned_cols=93 Identities=18% Similarity=0.183 Sum_probs=60.7
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
-+-+.+-|+=+.+-+|+++|+..|++++ ..+.+ |.+++.=|-+... -.......|.-++.-+.- | .+
T Consensus 81 AE~LK~eGN~~m~~~~Y~eAv~kY~~AI-~l~P~-----nAVyycNRAAAy~--~Lg~~~~AVkDce~Al~i----D-p~ 147 (304)
T KOG0553|consen 81 AESLKNEGNKLMKNKDYQEAVDKYTEAI-ELDPT-----NAVYYCNRAAAYS--KLGEYEDAVKDCESALSI----D-PH 147 (304)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHH-hcCCC-----cchHHHHHHHHHH--HhcchHHHHHHHHHHHhc----C-hH
Confidence 3345678888999999999999999999 76654 4455555554433 222333445555555432 2 12
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHHHhH
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAELFV 156 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~lfl 156 (162)
-.|.-.--|+.++..++|.+|.+.|=
T Consensus 148 yskay~RLG~A~~~~gk~~~A~~ayk 173 (304)
T KOG0553|consen 148 YSKAYGRLGLAYLALGKYEEAIEAYK 173 (304)
T ss_pred HHHHHHHHHHHHHccCcHHHHHHHHH
Confidence 33444456999999999999988763
No 78
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=78.69 E-value=30 Score=30.26 Aligned_cols=90 Identities=11% Similarity=0.072 Sum_probs=58.4
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhH
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNK 133 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnr 133 (162)
..-.|..+.+.||.+.|.+.+.++. +.. ++..+.+-++..|+.+.. +++......+++....- ..+. .
T Consensus 121 ~llaA~aa~~~g~~~~A~~~l~~a~-~~~--p~~~l~~~~~~a~l~l~~-~~~~~Al~~l~~l~~~~----P~~~----~ 188 (409)
T TIGR00540 121 LIKAAEAAQQRGDEARANQHLEEAA-ELA--GNDNILVEIARTRILLAQ-NELHAARHGVDKLLEMA----PRHK----E 188 (409)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH-HhC--CcCchHHHHHHHHHHHHC-CCHHHHHHHHHHHHHhC----CCCH----H
Confidence 4456777888899999999888877 443 344455566668888888 88887777766655541 1111 2
Q ss_pred HHHHHHHHHhhhhcHHHHHHHh
Q psy398 134 LKAYEGVYCLAIRQYCRAAELF 155 (162)
Q Consensus 134 LKvy~gl~~l~~R~fk~AA~lf 155 (162)
.--..|..++..+|+.+|.++|
T Consensus 189 ~l~ll~~~~~~~~d~~~a~~~l 210 (409)
T TIGR00540 189 VLKLAEEAYIRSGAWQALDDII 210 (409)
T ss_pred HHHHHHHHHHHHhhHHHHHHHH
Confidence 2335556667777777666654
No 79
>KOG3677|consensus
Probab=78.60 E-value=2.1 Score=39.28 Aligned_cols=124 Identities=15% Similarity=0.166 Sum_probs=76.4
Q ss_pred HHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChH--HH--------HHHHH
Q psy398 24 MKDANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKS--HR--------IDAVF 93 (162)
Q Consensus 24 ~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g--~k--------iD~~l 93 (162)
..++|+.+++.|..--++|.. .-.|...+..+ -.......-++++..-. |..+-.+ .+ -=-++
T Consensus 167 ~~krn~deie~l~si~sn~wn--iy~iLnil~sl----v~ksqi~~ql~~~~s~~-dp~~va~~~g~s~~y~~LgyfsL~ 239 (525)
T KOG3677|consen 167 AEKRNKDEIENLQSIFSNNWN--IYWILNILHSL----VDKSQISIQLTASVSNK-DPALVALIFGASQPYANLGYFSLL 239 (525)
T ss_pred hhhcCHHHHHHHHhcccccch--HHHHHHHHHHH----HHHHHHHHHHHHHhcCC-ChhhhhhhhccccHHHhhhHHHHH
Confidence 456677777766654445555 66666544433 23334455555555544 3221111 01 11356
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 94 TLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 94 ~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
.++|+-++. +|+..+..+|+.-..-+- |--... +.--|-|.++|+.|+|.+|-+-|++++.
T Consensus 240 GLlR~H~lL-gDhQat~q~idi~pk~iy---~t~p~c--~VTY~VGFayLmmrryadai~~F~niLl 300 (525)
T KOG3677|consen 240 GLLRMHILL-GDHQATSQILDIMPKEIY---GTEPMC--RVTYQVGFAYLMMRRYADAIRVFLNILL 300 (525)
T ss_pred HHHHHHHHh-hhhHhhhhhhhcCchhhc---Ccccce--eEeeehhHHHHHHHHHHHHHHHHHHHHH
Confidence 789999999 999887777776555442 222222 2226889999999999999999999874
No 80
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=78.24 E-value=16 Score=23.75 Aligned_cols=69 Identities=19% Similarity=0.108 Sum_probs=49.9
Q ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchh-hhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 89 IDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWS-ARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 89 iD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~-rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
..+..++-.+-... +++....++.++|-.+.+. -|+.. .....-.--|..+...++|.+|.+.|-.++.
T Consensus 5 a~~~~~la~~~~~~-~~~~~A~~~~~~al~~~~~--~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~ 74 (78)
T PF13424_consen 5 ANAYNNLARVYREL-GRYDEALDYYEKALDIEEQ--LGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKALD 74 (78)
T ss_dssp HHHHHHHHHHHHHT-T-HHHHHHHHHHHHHHHHH--TTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHc-CCHHHHHHHHHHHHHHHHH--HCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHh
Confidence 45666777777778 9999999999999999766 55532 1233333456666779999999999987763
No 81
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=77.77 E-value=31 Score=33.00 Aligned_cols=94 Identities=5% Similarity=-0.121 Sum_probs=63.4
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
.++..+|..+...|+.++|++.|.+++ ...... .+..+.+..+-+.. +++......+.++-..- +.+ .
T Consensus 50 ~~~~~lA~~~~~~g~~~~A~~~~~~al-~~~P~~---~~a~~~la~~l~~~-g~~~eA~~~l~~~l~~~----P~~---~ 117 (765)
T PRK10049 50 RGYAAVAVAYRNLKQWQNSLTLWQKAL-SLEPQN---DDYQRGLILTLADA-GQYDEALVKAKQLVSGA----PDK---A 117 (765)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-HhCCCC---HHHHHHHHHHHHHC-CCHHHHHHHHHHHHHhC----CCC---H
Confidence 347789999999999999999999987 442221 33445566666666 88888888888776652 112 1
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
+ ....|..+...+++.+|...|-.++
T Consensus 118 ~--~~~la~~l~~~g~~~~Al~~l~~al 143 (765)
T PRK10049 118 N--LLALAYVYKRAGRHWDELRAMTQAL 143 (765)
T ss_pred H--HHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 1 2223667778888888887775543
No 82
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=77.06 E-value=8.2 Score=31.55 Aligned_cols=92 Identities=15% Similarity=0.185 Sum_probs=35.6
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhH
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNK 133 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnr 133 (162)
...++.+ ...|+.++|.+.+.+.+ +... .-+.....+.+..-. ++|..+...|+++.....- . ....
T Consensus 81 ~~~l~~l-~~~~~~~~A~~~~~~~~-~~~~----~~~~l~~~l~~~~~~-~~~~~~~~~l~~~~~~~~~--~----~~~~ 147 (280)
T PF13429_consen 81 YERLIQL-LQDGDPEEALKLAEKAY-ERDG----DPRYLLSALQLYYRL-GDYDEAEELLEKLEELPAA--P----DSAR 147 (280)
T ss_dssp -------------------------------------------H-HHHT-T-HHHHHHHHHHHHH-T---------T-HH
T ss_pred ccccccc-ccccccccccccccccc-cccc----ccchhhHHHHHHHHH-hHHHHHHHHHHHHHhccCC--C----CCHH
Confidence 4456666 68899999999888887 6542 235556677777777 9999999999998865432 1 2234
Q ss_pred HHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 134 LKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 134 LKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
+-+..|-.+...|+..+|.+.|-.+
T Consensus 148 ~~~~~a~~~~~~G~~~~A~~~~~~a 172 (280)
T PF13429_consen 148 FWLALAEIYEQLGDPDKALRDYRKA 172 (280)
T ss_dssp HHHHHHHHHHHCCHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 5556677777788888887777443
No 83
>KOG3785|consensus
Probab=76.65 E-value=21 Score=32.66 Aligned_cols=50 Identities=12% Similarity=0.165 Sum_probs=34.8
Q ss_pred hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHH
Q psy398 41 KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFT 94 (162)
Q Consensus 41 ~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~ 94 (162)
.|++ +||=-....=+|.-|+..||+++|+..|+-.. .+.+. ...+++-|.
T Consensus 49 ~~~~--~EEE~~~~lWia~C~fhLgdY~~Al~~Y~~~~-~~~~~-~~el~vnLA 98 (557)
T KOG3785|consen 49 LNLD--REEEDSLQLWIAHCYFHLGDYEEALNVYTFLM-NKDDA-PAELGVNLA 98 (557)
T ss_pred hccc--hhhhHHHHHHHHHHHHhhccHHHHHHHHHHHh-ccCCC-CcccchhHH
Confidence 6888 54445555667888889999999999999988 54433 334444443
No 84
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=75.83 E-value=4.5 Score=25.59 Aligned_cols=53 Identities=15% Similarity=0.168 Sum_probs=36.6
Q ss_pred HhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 62 CSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 62 ~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.+-|+++.|++.|.++. ..... -.++.+.+.++.+.. +++....+.++++-..
T Consensus 2 l~~~~~~~A~~~~~~~l-~~~p~---~~~~~~~la~~~~~~-g~~~~A~~~l~~~~~~ 54 (68)
T PF14559_consen 2 LKQGDYDEAIELLEKAL-QRNPD---NPEARLLLAQCYLKQ-GQYDEAEELLERLLKQ 54 (68)
T ss_dssp HHTTHHHHHHHHHHHHH-HHTTT---SHHHHHHHHHHHHHT-T-HHHHHHHHHCCHGG
T ss_pred hhccCHHHHHHHHHHHH-HHCCC---CHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHH
Confidence 45688999999999988 44322 456666777777777 8887777777665554
No 85
>COG4259 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=75.02 E-value=9.8 Score=28.61 Aligned_cols=65 Identities=14% Similarity=0.003 Sum_probs=44.2
Q ss_pred HHHHHHHHHHHHHH----hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHH
Q psy398 27 ANEKRLKELDLEHE----KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLF 96 (162)
Q Consensus 27 ~n~~~l~~Le~~lk----~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~i 96 (162)
.-++++++||.-++ .|+-. --..|-.+|-.|.++|+.+.|+.-|..-- .--.-+|.-||.++.-.
T Consensus 48 s~e~Q~~~le~~~ek~~ak~~~v----pPG~HAhLGlLys~~G~~e~a~~eFetEK-alFPES~~fmDFLmk~~ 116 (121)
T COG4259 48 SKEAQTAALEKYLEKIGAKNGAV----PPGYHAHLGLLYSNSGKDEQAVREFETEK-ALFPESGVFMDFLMKNG 116 (121)
T ss_pred CHHHHHHHHHHHHHHHhhcCCCC----CCcHHHHHHHHHhhcCChHHHHHHHHHhh-hhCccchhHHHHHHHcc
Confidence 34556666666655 44330 02347789999999999999999998644 33456678888877643
No 86
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=74.96 E-value=39 Score=26.66 Aligned_cols=110 Identities=20% Similarity=0.221 Sum_probs=68.2
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHH---------hcCCHHHHHHHHHHHH
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYF---------HGCDIQAMTAAIEKAT 117 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f---------~~~d~~~v~~~i~ka~ 117 (162)
-.-...+...+|.=|.+.||++.|...|.+.+..+..+. .+|-++.++=++.+ . .|...+.+.+...+
T Consensus 38 s~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~~--~~~~A~Y~~g~~~~~~~~~~~~~~-~D~~~~~~A~~~~~ 114 (203)
T PF13525_consen 38 SPYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKLYPNSP--KADYALYMLGLSYYKQIPGILRSD-RDQTSTRKAIEEFE 114 (203)
T ss_dssp STTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-T--THHHHHHHHHHHHHHHHHHHH-TT----HHHHHHHHHHH
T ss_pred ChHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCc--chhhHHHHHHHHHHHhCccchhcc-cChHHHHHHHHHHH
Confidence 345678899999999999999999999999772344433 34444333333322 3 56667778888888
Q ss_pred hhhhhcCCCchhhh---------hHHH---HHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 118 ELIEGISGGDWSAR---------NKLK---AYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 118 ~l~e~~~ggDW~rr---------nrLK---vy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
.++.......+... ++|- .+-|-+++-.+.|..|..-|-..+
T Consensus 115 ~li~~yP~S~y~~~A~~~l~~l~~~la~~e~~ia~~Y~~~~~y~aA~~r~~~v~ 168 (203)
T PF13525_consen 115 ELIKRYPNSEYAEEAKKRLAELRNRLAEHELYIARFYYKRGKYKAAIIRFQYVI 168 (203)
T ss_dssp HHHHH-TTSTTHHHHHHHHHHHHHHHHHHHHHHHHHHHCTT-HHHHHHHHHHHH
T ss_pred HHHHHCcCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHH
Confidence 88877444444322 2221 356778888888888887665543
No 87
>KOG0624|consensus
Probab=74.66 E-value=23 Score=32.16 Aligned_cols=64 Identities=17% Similarity=0.278 Sum_probs=53.7
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWS 129 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~ 129 (162)
-+++...+..++.+||.+++++..++.+ . ..+.||. .--|+ .....|.+.|+-|++.+++ ++|.
T Consensus 223 Te~~ykis~L~Y~vgd~~~sL~~iRECL-K--ldpdHK~--------Cf~~Y-KklkKv~K~les~e~~ie~---~~~t 286 (504)
T KOG0624|consen 223 TEGHYKISQLLYTVGDAENSLKEIRECL-K--LDPDHKL--------CFPFY-KKLKKVVKSLESAEQAIEE---KHWT 286 (504)
T ss_pred hHHHHHHHHHHHhhhhHHHHHHHHHHHH-c--cCcchhh--------HHHHH-HHHHHHHHHHHHHHHHHhh---hhHH
Confidence 5689999999999999999999999988 3 3456775 44567 8888999999999999996 8995
No 88
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=74.40 E-value=5.8 Score=25.42 Aligned_cols=30 Identities=17% Similarity=0.400 Sum_probs=18.8
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCC
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTN 83 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~ 83 (162)
.+..+|..|.+.|+++.|.+.+.+.. +.+.
T Consensus 31 ~~~~~a~~~~~~g~~~~A~~~l~~~l-~~~p 60 (73)
T PF13371_consen 31 LWLQRARCLFQLGRYEEALEDLERAL-ELSP 60 (73)
T ss_pred hhHHHHHHHHHhccHHHHHHHHHHHH-HHCC
Confidence 45566666666666666666666666 4443
No 89
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=74.35 E-value=27 Score=29.25 Aligned_cols=66 Identities=17% Similarity=0.239 Sum_probs=52.8
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIE 121 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e 121 (162)
|..-.+|+..+||.++|-..|+++. .......+.-.++-..|+.-.-+ ||...|.+--.++..+..
T Consensus 73 ~~~Y~~~l~~~~d~~~aR~lfer~i-~~l~~~~~~~~iw~~~i~fE~~~-Gdl~~v~~v~~R~~~~~~ 138 (280)
T PF05843_consen 73 WLEYLDFLIKLNDINNARALFERAI-SSLPKEKQSKKIWKKFIEFESKY-GDLESVRKVEKRAEELFP 138 (280)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHC-CTSSCHHHCHHHHHHHHHHHHHH-S-HHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHhCcHHHHHHHHHHHH-HhcCchhHHHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHHhh
Confidence 5566689999999999999999999 77555554667888888998889 899999998888888764
No 90
>COG5107 RNA14 Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor [RNA processing and modification]
Probab=73.36 E-value=26 Score=32.88 Aligned_cols=96 Identities=23% Similarity=0.285 Sum_probs=61.8
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHhhh-------hh-
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG--CDIQAMTAAIEKATELI-------EG- 122 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~--~d~~~v~~~i~ka~~l~-------e~- 122 (162)
.|.+-..|+..|+|+++|++...+.. ++|.|.-.+ ++-.++ +|-..|....++..+.+ +.
T Consensus 304 vw~dys~Y~~~isd~q~al~tv~rg~-~~spsL~~~---------lse~yel~nd~e~v~~~fdk~~q~L~r~ys~~~s~ 373 (660)
T COG5107 304 VWFDYSEYLIGISDKQKALKTVERGI-EMSPSLTMF---------LSEYYELVNDEEAVYGCFDKCTQDLKRKYSMGESE 373 (660)
T ss_pred HHHHHHHHHhhccHHHHHHHHHHhcc-cCCCchhee---------HHHHHhhcccHHHHhhhHHHHHHHHHHHHhhhhhh
Confidence 57888999999999999999999999 999884333 333332 56666666555444332 11
Q ss_pred -cCCCc--h--------hhhhHHHHHHHHHHhh---hhcHHHHHHHhHhh
Q psy398 123 -ISGGD--W--------SARNKLKAYEGVYCLA---IRQYCRAAELFVDV 158 (162)
Q Consensus 123 -~~ggD--W--------~rrnrLKvy~gl~~l~---~R~fk~AA~lfld~ 158 (162)
.++.| . -|+|+|-...-+|.=. +|-.+.|-.+|..+
T Consensus 374 ~~s~~D~N~e~~~Ell~kr~~k~t~v~C~~~N~v~r~~Gl~aaR~~F~k~ 423 (660)
T COG5107 374 SASKVDNNFEYSKELLLKRINKLTFVFCVHLNYVLRKRGLEAARKLFIKL 423 (660)
T ss_pred hhccccCCccccHHHHHHHHhhhhhHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 11222 2 2778777665555433 44557777888764
No 91
>TIGR01716 RGG_Cterm transcriptional activator, Rgg/GadR/MutR family, C-terminal domain. This model describes the whole, except for a 60 residue N-terminal helix-turn-helix DNA-binding domain (PFAM pfam01381) of the family of proteins related to the transcriptional regulator Rgg, also called RopB. Rgg is required for secretion of several proteins, including a cysteine proteinase associated with virulence. GadR is a positive regulator of a glutamate-dependent acid resistance mechanism. MutR is a transcriptional activator for mutacin biosynthesis genes in Streptococcus mutans. This family appears restricted to the low-GC Gram-positive bacteria, including at least eight members in Lactococcus lactis.
Probab=73.35 E-value=39 Score=26.57 Aligned_cols=60 Identities=12% Similarity=0.141 Sum_probs=48.3
Q ss_pred hHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhc
Q psy398 85 KSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQ 147 (162)
Q Consensus 85 ~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~ 147 (162)
..-.+.++.+.+-+.+.. ++...+...+.+.+.++.. ..+=--|..++-|+|++....++
T Consensus 124 ~~~i~~il~N~~~~~i~~-~~~~~a~~~l~~l~~l~~~--~~~~~~ki~~~f~~~l~~y~~g~ 183 (220)
T TIGR01716 124 RRRVIQLLLNIAVLLIEK-NEFSYAQYFLEKLEKILDP--EDDLYERILFNFLKGIILYKEGQ 183 (220)
T ss_pred HHHHHHHHHHHHHHHHHh-hHHHHHHHHHHHHHHHhch--hhhHHHHHHHHHHHHHHHHHcCC
Confidence 345677888999999999 9999999999999999752 12433588888899999888776
No 92
>KOG0553|consensus
Probab=73.34 E-value=34 Score=29.90 Aligned_cols=78 Identities=13% Similarity=0.106 Sum_probs=47.9
Q ss_pred HHHHHHHHHHHHHHHHHHHH----hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHH
Q psy398 21 YTEMKDANEKRLKELDLEHE----KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLF 96 (162)
Q Consensus 21 ~~~~~~~n~~~l~~Le~~lk----~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~i 96 (162)
+=..++..--+|.+-+..++ -=.+ +..-..+|.-+|.=|..-|+.++|.++|.+++ +-
T Consensus 117 yycNRAAAy~~Lg~~~~AVkDce~Al~i--Dp~yskay~RLG~A~~~~gk~~~A~~aykKaL-el--------------- 178 (304)
T KOG0553|consen 117 YYCNRAAAYSKLGEYEDAVKDCESALSI--DPHYSKAYGRLGLAYLALGKYEEAIEAYKKAL-EL--------------- 178 (304)
T ss_pred HHHHHHHHHHHhcchHHHHHHHHHHHhc--ChHHHHHHHHHHHHHHccCcHHHHHHHHHhhh-cc---------------
Confidence 33444555555555555555 2223 44455566666666666666666666666666 21
Q ss_pred HHHHHhcCCHHHHHHHHHHHHhhhhh
Q psy398 97 RIAYFHGCDIQAMTAAIEKATELIEG 122 (162)
Q Consensus 97 Rv~~f~~~d~~~v~~~i~ka~~l~e~ 122 (162)
- -|+..++++|..|+..+.+
T Consensus 179 -----d-P~Ne~~K~nL~~Ae~~l~e 198 (304)
T KOG0553|consen 179 -----D-PDNESYKSNLKIAEQKLNE 198 (304)
T ss_pred -----C-CCcHHHHHHHHHHHHHhcC
Confidence 1 6788888888888888776
No 93
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=73.28 E-value=30 Score=24.50 Aligned_cols=65 Identities=12% Similarity=0.035 Sum_probs=48.8
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
...+..+|..|...|+.+.|.+.|.++. .-... ..+..+.+-.+-... +++......+.++-.+.
T Consensus 51 ~~~~~~la~~~~~~~~~~~A~~~~~~~~-~~~p~---~~~~~~~la~~~~~~-g~~~~A~~~~~~al~~~ 115 (135)
T TIGR02552 51 SRYWLGLAACCQMLKEYEEAIDAYALAA-ALDPD---DPRPYFHAAECLLAL-GEPESALKALDLAIEIC 115 (135)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcCCC---ChHHHHHHHHHHHHc-CCHHHHHHHHHHHHHhc
Confidence 4567789999999999999999999987 43322 234445555566666 89999988888887774
No 94
>PF13041 PPR_2: PPR repeat family
Probab=71.59 E-value=17 Score=21.96 Aligned_cols=30 Identities=20% Similarity=0.278 Sum_probs=24.8
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCC
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTN 83 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~ 83 (162)
.+.-+-+.|++.|+.++|++.|.+|. +...
T Consensus 5 ~yn~li~~~~~~~~~~~a~~l~~~M~-~~g~ 34 (50)
T PF13041_consen 5 TYNTLISGYCKAGKFEEALKLFKEMK-KRGI 34 (50)
T ss_pred HHHHHHHHHHHCcCHHHHHHHHHHHH-HcCC
Confidence 35567788999999999999999999 5443
No 95
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=71.25 E-value=40 Score=27.78 Aligned_cols=58 Identities=7% Similarity=-0.064 Sum_probs=48.1
Q ss_pred HhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Q psy398 55 QAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIE 114 (162)
Q Consensus 55 ~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ 114 (162)
...|.||.+.|.+..|..-+..+. +.=..+.+.-+.++.+++..... +.....++.+.
T Consensus 179 ~~ia~~Y~~~~~y~AA~~r~~~v~-~~Yp~t~~~~eal~~l~~ay~~l-g~~~~a~~~~~ 236 (243)
T PRK10866 179 LSVAEYYTKRGAYVAVVNRVEQML-RDYPDTQATRDALPLMENAYRQL-QLNAQADKVAK 236 (243)
T ss_pred HHHHHHHHHcCchHHHHHHHHHHH-HHCCCCchHHHHHHHHHHHHHHc-CChHHHHHHHH
Confidence 368999999999999999999999 66666788999999999988888 66666655443
No 96
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=71.15 E-value=42 Score=31.45 Aligned_cols=58 Identities=17% Similarity=0.068 Sum_probs=32.2
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIE 114 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ 114 (162)
.|..+..-|++.|+.++|++.|.+|. .....+... ..-.+|+..... +++........
T Consensus 292 t~n~li~~y~~~g~~~eA~~lf~~M~-~~g~~pd~~--t~~~ll~a~~~~-g~~~~a~~i~~ 349 (697)
T PLN03081 292 AWNSMLAGYALHGYSEEALCLYYEMR-DSGVSIDQF--TFSIMIRIFSRL-ALLEHAKQAHA 349 (697)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHH-HcCCCCCHH--HHHHHHHHHHhc-cchHHHHHHHH
Confidence 46667777777777777777777776 433322221 334445544444 55554444333
No 97
>PF08671 SinI: Anti-repressor SinI; InterPro: IPR010981 The SinR repressor is part of a group of Sin (sporulation inhibition) proteins in Bacillus subtilis that regulate the commitment to sporulation in response to extreme adversity []. SinR is a tetrameric repressor protein that binds to the promoters of genes essential for entry into sporulation and prevents their transcription. This repression is overcome through the activity of SinI, which disrupts the SinR tetramer through the formation of a SinI-SinR heterodimer, thereby allowing sporulation to proceed. The SinR structure consists of two domains: a dimerisation domain stabilised by a hydrophobic core, and a DNA-binding domain that is identical to domains of the bacteriophage 434 CI and Cro proteins that regulate prophage induction. The dimerisation domain is a four-helical bundle formed from two helices from the C-terminal residues of SinR and two helices from the central residues of SinI. These regions in SinR and SinI are similar in both structure and sequence. The interaction of SinR monomers to form tetramers is weaker than between SinR and SinI, since SinI can effectively disrupt SinR tetramers. This entry represents the dimerisation domain in both SinI and SinR proteins.; GO: 0005488 binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1B0N_A 2YAL_A.
Probab=71.01 E-value=6.5 Score=22.82 Aligned_cols=24 Identities=17% Similarity=0.327 Sum_probs=14.7
Q ss_pred cCHHHHHHHHHHHHHHHHHHHHHHHhcCCCchHHHHH
Q psy398 16 MDEKFYTEMKDANEKRLKELDLEHEKNMMDEEDQVSG 52 (162)
Q Consensus 16 ~D~~~~~~~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~ 52 (162)
+|++|++-|.++. ++|+++++||.
T Consensus 1 LD~EW~~Li~eA~-------------~~Gls~eeir~ 24 (30)
T PF08671_consen 1 LDEEWVELIKEAK-------------ESGLSKEEIRE 24 (30)
T ss_dssp --HHHHHHHHHHH-------------HTT--HHHHHH
T ss_pred CCHHHHHHHHHHH-------------HcCCCHHHHHH
Confidence 5888888887664 44556888875
No 98
>PF10345 Cohesin_load: Cohesin loading factor; InterPro: IPR019440 Cohesin loading factor is a conserved protein that has been characterised in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes, but is dispensable in G2 when cohesion has been established. It is often referred to as Ssl3 in Schizosaccharomyces pombe (Fission yeast), and Scc4 in Saccharomyces cerevisiae (Baker's yeast). It complexes with Mis4 [].
Probab=70.95 E-value=35 Score=31.80 Aligned_cols=66 Identities=15% Similarity=0.095 Sum_probs=52.2
Q ss_pred HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCc---hhhhhHHHHHHHHHHhhhhcHHHHHHHhH
Q psy398 88 RIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGD---WSARNKLKAYEGVYCLAIRQYCRAAELFV 156 (162)
Q Consensus 88 kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggD---W~rrnrLKvy~gl~~l~~R~fk~AA~lfl 156 (162)
++-+.++++=..+-. +||..+...++.+.+.... -++ -.-...+-.-.|+|+...|+...|-..|.
T Consensus 360 ~~~~~~y~~~~~~~~-~~~~~a~~~l~~~~~~~~~--~~~~~~~~~~~~~~yL~gl~~q~~g~l~~A~~~y~ 428 (608)
T PF10345_consen 360 QCYLLFYQIWCNFIR-GDWSKATQELEFMRQLCQR--SPSKLYESLYPLLHYLLGLYYQSTGDLEAALYQYQ 428 (608)
T ss_pred HHHHHHHHHHHHHHC-cCHHHHHHHHHHHHHHHhc--CccchhhhhhHHHHHHHHHHHHHcCCHHHHHHHHh
Confidence 344666677777777 9999999999999998765 333 22346666789999999999999999998
No 99
>PRK04841 transcriptional regulator MalT; Provisional
Probab=70.65 E-value=81 Score=30.08 Aligned_cols=98 Identities=11% Similarity=0.034 Sum_probs=73.2
Q ss_pred hHHHhhcCHHHHHHHHHHhcCCCCCChHHH-HHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHH
Q psy398 59 DYLCSIGDRAEAFKLADSKFADKTNPKSHR-IDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAY 137 (162)
Q Consensus 59 ~~~~~iGD~~~A~~ay~~~~~~~~~s~g~k-iD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy 137 (162)
.++...|+.+.|.+...... .......+. ......+.|+..-. +++......+.+|-...+. .|.--...+..+.
T Consensus 661 ~~~~~~g~~~~A~~~l~~~~-~~~~~~~~~~~~~~~~~a~~~~~~-g~~~~A~~~l~~al~~~~~--~g~~~~~a~~~~~ 736 (903)
T PRK04841 661 IYWQMTGDKEAAANWLRQAP-KPEFANNHFLQGQWRNIARAQILL-GQFDEAEIILEELNENARS--LRLMSDLNRNLIL 736 (903)
T ss_pred HHHHHCCCHHHHHHHHHhcC-CCCCccchhHHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHHHHH--hCchHHHHHHHHH
Confidence 55566789999999988877 544333322 22345667777777 8999999999999888665 5554456788889
Q ss_pred HHHHHhhhhcHHHHHHHhHhhcc
Q psy398 138 EGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 138 ~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
.|+.+...|+..+|...|..++.
T Consensus 737 la~a~~~~G~~~~A~~~L~~Al~ 759 (903)
T PRK04841 737 LNQLYWQQGRKSEAQRVLLEALK 759 (903)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHH
Confidence 99999999999999999887653
No 100
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=70.54 E-value=19 Score=28.45 Aligned_cols=65 Identities=25% Similarity=0.157 Sum_probs=51.8
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.+...|.-+.+.||...|.+.|++.. +.-.++...-+..|.++...+-. +|+......++.--..
T Consensus 7 ~lY~~a~~~~~~g~y~~Ai~~f~~l~-~~~P~s~~a~~A~l~la~a~y~~-~~y~~A~~~~~~fi~~ 71 (203)
T PF13525_consen 7 ALYQKALEALQQGDYEEAIKLFEKLI-DRYPNSPYAPQAQLMLAYAYYKQ-GDYEEAIAAYERFIKL 71 (203)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHHHHHH-HH-TTSTTHHHHHHHHHHHHHHT-T-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHH-HHCCCChHHHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHH
Confidence 56788999999999999999999999 66666678889999999988888 9998887777764443
No 101
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=70.16 E-value=7.9 Score=24.35 Aligned_cols=29 Identities=17% Similarity=0.185 Sum_probs=25.5
Q ss_pred HHHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 50 VSGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 50 Ir~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
-.+++..+|.-+.+.|+.+.|...|.++.
T Consensus 30 ~~~a~~~lg~~~~~~g~~~~A~~~~~~a~ 58 (65)
T PF13432_consen 30 NPEAWYLLGRILYQQGRYDEALAYYERAL 58 (65)
T ss_dssp HHHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 36788899999999999999999999987
No 102
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=69.72 E-value=33 Score=27.97 Aligned_cols=43 Identities=14% Similarity=0.046 Sum_probs=32.2
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCCh-HHHHHHHHH
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPK-SHRIDAVFT 94 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~-g~kiD~~l~ 94 (162)
..+.-+|+...+.|+.++|.+.|++....+..+. +..+|+.-.
T Consensus 166 ~l~YLigeL~rrlg~~~eA~~~fs~vi~~~~~s~~~~l~~~AR~ 209 (214)
T PF09986_consen 166 TLLYLIGELNRRLGNYDEAKRWFSRVIGSKKASKEPKLKDMARD 209 (214)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHHHcCCCCCCcHHHHHHHHH
Confidence 3566799999999999999999999993444443 366665543
No 103
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=69.65 E-value=51 Score=31.74 Aligned_cols=96 Identities=13% Similarity=-0.038 Sum_probs=68.9
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
-+++.-+|+-..+.|.+++|...+..+..-..-..+-++++...+.|.+ .+.+.+..++.+++. +.+.
T Consensus 86 ~~~~~~La~i~~~~g~~~ea~~~l~~~~~~~Pd~~~a~~~~a~~L~~~~--------~~eeA~~~~~~~l~~----~p~~ 153 (694)
T PRK15179 86 ELFQVLVARALEAAHRSDEGLAVWRGIHQRFPDSSEAFILMLRGVKRQQ--------GIEAGRAEIELYFSG----GSSS 153 (694)
T ss_pred HHHHHHHHHHHHHcCCcHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHhc--------cHHHHHHHHHHHhhc----CCCC
Confidence 4578889999999999999999999998345567777787777776654 455666677777653 3332
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
.--..-.|..+..+|.|.+|-.+|=+.+
T Consensus 154 -~~~~~~~a~~l~~~g~~~~A~~~y~~~~ 181 (694)
T PRK15179 154 -AREILLEAKSWDEIGQSEQADACFERLS 181 (694)
T ss_pred -HHHHHHHHHHHHHhcchHHHHHHHHHHH
Confidence 2334456777778888888877775544
No 104
>KOG2114|consensus
Probab=69.43 E-value=68 Score=31.98 Aligned_cols=84 Identities=24% Similarity=0.318 Sum_probs=56.7
Q ss_pred hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCCh------------HHHHHHHHHHHHHHHHhcCCHHH
Q psy398 41 KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPK------------SHRIDAVFTLFRIAYFHGCDIQA 108 (162)
Q Consensus 41 ~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~------------g~kiD~~l~~iRv~~f~~~d~~~ 108 (162)
.+++ .+-++..|.+-|+|+++-||.++|..-|.++. ++--++ ..-.+.+=++++-|+-. .|+..
T Consensus 360 ~~~d--~d~~~~i~~kYgd~Ly~Kgdf~~A~~qYI~tI-~~le~s~Vi~kfLdaq~IknLt~YLe~L~~~gla~-~dhtt 435 (933)
T KOG2114|consen 360 QHLD--EDTLAEIHRKYGDYLYGKGDFDEATDQYIETI-GFLEPSEVIKKFLDAQRIKNLTSYLEALHKKGLAN-SDHTT 435 (933)
T ss_pred cCCC--HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHc-ccCChHHHHHHhcCHHHHHHHHHHHHHHHHccccc-chhHH
Confidence 5556 99999999999999999999999999999999 654332 22334555666666666 55543
Q ss_pred HH-------HHHHHHHhhhhhcCCCch
Q psy398 109 MT-------AAIEKATELIEGISGGDW 128 (162)
Q Consensus 109 v~-------~~i~ka~~l~e~~~ggDW 128 (162)
+. +-++|....|.+.+.|.|
T Consensus 436 lLLncYiKlkd~~kL~efI~~~~~g~~ 462 (933)
T KOG2114|consen 436 LLLNCYIKLKDVEKLTEFISKCDKGEW 462 (933)
T ss_pred HHHHHHHHhcchHHHHHHHhcCCCcce
Confidence 32 123444555554333556
No 105
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=69.40 E-value=61 Score=28.15 Aligned_cols=63 Identities=17% Similarity=0.144 Sum_probs=49.1
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.++..+|.-|.+.|+++.|+..+.+++ ...... ....+.+-.+.+.. +++......+.+|-.+
T Consensus 37 ~a~~~~a~~~~~~g~~~eAl~~~~~Al-~l~P~~---~~a~~~lg~~~~~l-g~~~eA~~~~~~al~l 99 (356)
T PLN03088 37 ELYADRAQANIKLGNFTEAVADANKAI-ELDPSL---AKAYLRKGTACMKL-EEYQTAKAALEKGASL 99 (356)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-HhCcCC---HHHHHHHHHHHHHh-CCHHHHHHHHHHHHHh
Confidence 467899999999999999999999998 544332 23345555555666 8999999999998886
No 106
>PF04100 Vps53_N: Vps53-like, N-terminal ; InterPro: IPR007234 Vps53 complexes with Vps52 and Vps54 to form a multi-subunit complex involved in regulating membrane trafficking events [].
Probab=69.31 E-value=61 Score=28.83 Aligned_cols=47 Identities=19% Similarity=0.213 Sum_probs=35.6
Q ss_pred HHHHHHHHHHHHHHHHHHHH---------hcCCCchHHHHHHHHhhHhHHHhhcCHHH
Q psy398 21 YTEMKDANEKRLKELDLEHE---------KNMMDEEDQVSGIWQAKLDYLCSIGDRAE 69 (162)
Q Consensus 21 ~~~~~~~n~~~l~~Le~~lk---------~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~ 69 (162)
++.+...-+.++.+|+.+|. .+.+ ...+..+...+.+.+.+|.+...
T Consensus 23 ld~~i~~l~~~i~~ld~eI~~~v~~q~~~~~~~--~~~l~~a~~~i~~L~~~i~~ik~ 78 (383)
T PF04100_consen 23 LDELIAKLRKEIRELDEEIKELVREQSSSGQDA--EEDLEEAQEAIQELFEKISEIKS 78 (383)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhccccc--cccHHHHHHHHHHHHHHHHHHHH
Confidence 56666777778888888888 2334 67788888899999999988544
No 107
>KOG1174|consensus
Probab=69.21 E-value=26 Score=32.47 Aligned_cols=25 Identities=24% Similarity=0.081 Sum_probs=21.9
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
+..+|+-|+.+||..+|.--|++++
T Consensus 235 l~~lak~~~~~Gdn~~a~~~Fe~~~ 259 (564)
T KOG1174|consen 235 MMALGKCLYYNGDYFQAEDIFSSTL 259 (564)
T ss_pred HHHHhhhhhhhcCchHHHHHHHHHh
Confidence 4478999999999999999999865
No 108
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=69.04 E-value=64 Score=26.56 Aligned_cols=112 Identities=15% Similarity=0.135 Sum_probs=73.5
Q ss_pred hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHH----------------HhcC
Q psy398 41 KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAY----------------FHGC 104 (162)
Q Consensus 41 ~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~----------------f~~~ 104 (162)
..-+ -.....+...+|.-|.+.||++.|...|++.. .. -+....+|-++.++=++. .. .
T Consensus 61 ~yP~--s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi-~~-~P~~~~~~~a~Y~~g~~~~~~~~~~~~~~~~~~~~~-r 135 (243)
T PRK10866 61 RYPF--GPYSQQVQLDLIYAYYKNADLPLAQAAIDRFI-RL-NPTHPNIDYVLYMRGLTNMALDDSALQGFFGVDRSD-R 135 (243)
T ss_pred hCCC--ChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH-Hh-CcCCCchHHHHHHHHHhhhhcchhhhhhccCCCccc-c
Confidence 4445 56777889999999999999999999999988 33 233345564444433321 11 3
Q ss_pred CHHHHHHHHHHHHhhhhhcCCCchhh---------hhHHH---HHHHHHHhhhhcHHHHHHHhHh
Q psy398 105 DIQAMTAAIEKATELIEGISGGDWSA---------RNKLK---AYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 105 d~~~v~~~i~ka~~l~e~~~ggDW~r---------rnrLK---vy~gl~~l~~R~fk~AA~lfld 157 (162)
|.....+.+...+.+++...+..... ||+|- .+-|=+++-.+.|..|+.-|=.
T Consensus 136 D~~~~~~A~~~~~~li~~yP~S~ya~~A~~rl~~l~~~la~~e~~ia~~Y~~~~~y~AA~~r~~~ 200 (243)
T PRK10866 136 DPQHARAAFRDFSKLVRGYPNSQYTTDATKRLVFLKDRLAKYELSVAEYYTKRGAYVAVVNRVEQ 200 (243)
T ss_pred CHHHHHHHHHHHHHHHHHCcCChhHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCchHHHHHHHHH
Confidence 77778888888999988844444332 23332 2556667777777777665543
No 109
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=69.03 E-value=68 Score=32.21 Aligned_cols=25 Identities=8% Similarity=-0.129 Sum_probs=14.7
Q ss_pred HHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 135 KAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 135 Kvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
...-|..++..+++.+|...|-.++
T Consensus 680 ~~nLA~al~~lGd~~eA~~~l~~Al 704 (987)
T PRK09782 680 IRQLAYVNQRLDDMAATQHYARLVI 704 (987)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 3345566666666666666665544
No 110
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=68.42 E-value=9.1 Score=20.44 Aligned_cols=27 Identities=26% Similarity=0.538 Sum_probs=22.5
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCC
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADK 81 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~ 81 (162)
+..+-.-|++.|+.++|.+.|.++. ..
T Consensus 3 ~n~li~~~~~~~~~~~a~~~~~~M~-~~ 29 (35)
T TIGR00756 3 YNTLIDGLCKAGRVEEALELFKEML-ER 29 (35)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH-Hc
Confidence 3455677999999999999999998 54
No 111
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=68.11 E-value=8.1 Score=20.78 Aligned_cols=23 Identities=30% Similarity=0.172 Sum_probs=20.1
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHH
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLAD 75 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~ 75 (162)
++..+|.-+...||.++|...++
T Consensus 3 a~~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 3 ARLALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHh
Confidence 56789999999999999998775
No 112
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=68.07 E-value=8.2 Score=20.39 Aligned_cols=25 Identities=20% Similarity=0.408 Sum_probs=21.5
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
+..+-+-|++.|+.++|.+.|.++.
T Consensus 3 y~~li~~~~~~~~~~~a~~~~~~M~ 27 (31)
T PF01535_consen 3 YNSLISGYCKMGQFEEALEVFDEMR 27 (31)
T ss_pred HHHHHHHHHccchHHHHHHHHHHHh
Confidence 3456678999999999999999988
No 113
>KOG2758|consensus
Probab=67.58 E-value=94 Score=28.06 Aligned_cols=73 Identities=7% Similarity=0.023 Sum_probs=55.2
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHH-HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhh
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSH-RIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEG 122 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~-kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~ 122 (162)
.|-.+.....|.|-+.+|.+..|-...---| --+..+.. -+...-..+---|.. .||+....-+.|.+..||.
T Consensus 126 ~e~i~~lykyakfqyeCGNY~gAs~yLY~~r-~l~~~~d~n~lsalwGKlASEIL~-qnWd~A~edL~rLre~IDs 199 (432)
T KOG2758|consen 126 PERIETLYKYAKFQYECGNYSGASDYLYFYR-ALVSDPDRNYLSALWGKLASEILT-QNWDGALEDLTRLREYIDS 199 (432)
T ss_pred HHHHHHHHHHHHHHHhccCcccHHHHHHHHH-HhcCCcchhhHHHHHHHHHHHHHH-hhHHHHHHHHHHHHHHHcc
Confidence 4556788899999999999999987654444 33333333 566666666667777 9999999999999999875
No 114
>KOG2076|consensus
Probab=67.16 E-value=19 Score=35.63 Aligned_cols=97 Identities=12% Similarity=0.180 Sum_probs=55.0
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhh----hcCCCc
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIE----GISGGD 127 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e----~~~ggD 127 (162)
..|..+|.-|-..|.++.|.++|.+.+ .-.. .-+|+-+++--|-... ++...+-..|+..- .+| + +.-
T Consensus 450 ~vw~~~a~c~~~l~e~e~A~e~y~kvl-~~~p---~~~D~Ri~Lasl~~~~-g~~EkalEtL~~~~-~~D~~~~e--~~a 521 (895)
T KOG2076|consen 450 FVWYKLARCYMELGEYEEAIEFYEKVL-ILAP---DNLDARITLASLYQQL-GNHEKALETLEQII-NPDGRNAE--ACA 521 (895)
T ss_pred hhhHHHHHHHHHHhhHHHHHHHHHHHH-hcCC---CchhhhhhHHHHHHhc-CCHHHHHHHHhccc-CCCccchh--hcc
Confidence 357778888888888888888888877 2221 2256666666666666 77774443333322 111 1 345
Q ss_pred hhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 128 WSARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 128 W~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
|.+.-|+-.-.---+-.++++.+ |+++.+
T Consensus 522 ~~~e~ri~~~r~d~l~~~gk~E~----fi~t~~ 550 (895)
T KOG2076|consen 522 WEPERRILAHRCDILFQVGKREE----FINTAS 550 (895)
T ss_pred ccHHHHHHHHHHHHHHHhhhHHH----HHHHHH
Confidence 77766666433333333455444 666554
No 115
>PF12854 PPR_1: PPR repeat
Probab=67.00 E-value=9.1 Score=21.96 Aligned_cols=24 Identities=33% Similarity=0.570 Sum_probs=20.7
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSK 77 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~ 77 (162)
+.-+-+=||+.|+.++|++.+.+|
T Consensus 10 y~~lI~~~Ck~G~~~~A~~l~~~M 33 (34)
T PF12854_consen 10 YNTLIDGYCKAGRVDEAFELFDEM 33 (34)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHhC
Confidence 455667899999999999999887
No 116
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=66.57 E-value=31 Score=22.98 Aligned_cols=51 Identities=18% Similarity=0.238 Sum_probs=33.5
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCC-------HHHHHHHHHHHHhhhh
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCD-------IQAMTAAIEKATELIE 121 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d-------~~~v~~~i~ka~~l~e 121 (162)
..|.-+-+.|++++|+.+|..+. +. ++..+... .| +..+..|+.+|+.+-.
T Consensus 13 ~~Av~~d~~g~~~eAl~~Y~~a~-e~----------l~~~~~~~----~~~~~~~~~~~k~~eyl~raE~lk~ 70 (77)
T smart00745 13 SKALKADEAGDYEEALELYKKAI-EY----------LLEGIKVE----SDSKRREAVKAKAAEYLDRAEEIKK 70 (77)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHH-HH----------HHHHhccC----CCHHHHHHHHHHHHHHHHHHHHHHH
Confidence 45556667899999999999987 32 22222211 12 5678888888888743
No 117
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=66.48 E-value=56 Score=32.80 Aligned_cols=92 Identities=12% Similarity=-0.019 Sum_probs=65.5
Q ss_pred HHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchh
Q psy398 50 VSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWS 129 (162)
Q Consensus 50 Ir~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~ 129 (162)
..+.+...|..+..-||.++|+..|.++. +..... ..+.+.+.++.+.. ++......++++|-.+- ..|-.
T Consensus 43 ~~~~~f~~a~~~~~~Gd~~~A~~~l~~Al-~~dP~n---~~~~~~LA~~yl~~-g~~~~A~~~~~kAv~ld----P~n~~ 113 (987)
T PRK09782 43 VIYPRLDKALKAQKNNDEATAIREFEYIH-QQVPDN---IPLTLYLAEAYRHF-GHDDRARLLLEDQLKRH----PGDAR 113 (987)
T ss_pred HHHHHHHHHHHHHhCCCHHHHHHHHHHHH-HhCCCC---HHHHHHHHHHHHHC-CCHHHHHHHHHHHHhcC----cccHH
Confidence 34567888888888899999999999998 544444 55669999999999 99888888888877762 12322
Q ss_pred hhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 130 ARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 130 rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
-.+.| ..+ ++|.+|+..+-..
T Consensus 114 ~~~~L------a~i--~~~~kA~~~ye~l 134 (987)
T PRK09782 114 LERSL------AAI--PVEVKSVTTVEEL 134 (987)
T ss_pred HHHHH------HHh--ccChhHHHHHHHH
Confidence 22222 122 8888888776543
No 118
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=65.40 E-value=78 Score=26.22 Aligned_cols=129 Identities=19% Similarity=0.192 Sum_probs=83.7
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH----hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHH
Q psy398 18 EKFYTEMKDANEKRLKELDLEHE----KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVF 93 (162)
Q Consensus 18 ~~~~~~~~~~n~~~l~~Le~~lk----~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l 93 (162)
.+++..++-.++.+..+-=.-++ .|.| .-=.-+.+..|-...+-||-..|+++|..+- ..+..+-.-=|+
T Consensus 60 d~flaAL~lA~~~k~d~Alaaf~~lektg~g---~YpvLA~mr~at~~a~kgdta~AV~aFdeia-~dt~~P~~~rd~-- 133 (221)
T COG4649 60 DAFLAALKLAQENKTDDALAAFTDLEKTGYG---SYPVLARMRAATLLAQKGDTAAAVAAFDEIA-ADTSIPQIGRDL-- 133 (221)
T ss_pred HHHHHHHHHHHcCCchHHHHHHHHHHhcCCC---cchHHHHHHHHHHHhhcccHHHHHHHHHHHh-ccCCCcchhhHH--
Confidence 34555555555544443333222 6666 2334456677888999999999999999988 444443322254
Q ss_pred HHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCc-hhhhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 94 TLFRIAYFHGCDIQAMTAAIEKATELIEGISGGD-WSARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 94 ~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggD-W~rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
.-+|-++.. .|.-....-..+++.|- +| ---|---|=.-|+...-.++|.+|.+.|.-.
T Consensus 134 ARlraa~lL-vD~gsy~dV~srvepLa-----~d~n~mR~sArEALglAa~kagd~a~A~~~F~qi 193 (221)
T COG4649 134 ARLRAAYLL-VDNGSYDDVSSRVEPLA-----GDGNPMRHSAREALGLAAYKAGDFAKAKSWFVQI 193 (221)
T ss_pred HHHHHHHHH-hccccHHHHHHHhhhcc-----CCCChhHHHHHHHHhHHHHhccchHHHHHHHHHH
Confidence 567778877 77666666666666763 33 1123334446799999999999999999753
No 119
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=65.33 E-value=64 Score=30.23 Aligned_cols=88 Identities=13% Similarity=0.163 Sum_probs=69.9
Q ss_pred HHHhhcCHHHHHHHHHHhcCCCCCC-hHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHH
Q psy398 60 YLCSIGDRAEAFKLADSKFADKTNP-KSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYE 138 (162)
Q Consensus 60 ~~~~iGD~~~A~~ay~~~~~~~~~s-~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~ 138 (162)
-+-+.|+...|.+.|++.+++.+.+ .--+=.+....|-=++|. +|.+.+...+..-+... | +-.-+-.++
T Consensus 15 ~Lqkq~~~~esEkifskI~~e~~~~~f~lkeEvl~grilnAffl-~nld~Me~~l~~l~~~~----~----~s~~l~LF~ 85 (549)
T PF07079_consen 15 ILQKQKKFQESEKIFSKIYDEKESSPFLLKEEVLGGRILNAFFL-NNLDLMEKQLMELRQQF----G----KSAYLPLFK 85 (549)
T ss_pred HHHHHhhhhHHHHHHHHHHHHhhcchHHHHHHHHhhHHHHHHHH-hhHHHHHHHHHHHHHhc----C----CchHHHHHH
Confidence 3456789999999999999344433 334447888999999999 99999999998887763 2 345677899
Q ss_pred HHHHhhhhcHHHHHHHhH
Q psy398 139 GVYCLAIRQYCRAAELFV 156 (162)
Q Consensus 139 gl~~l~~R~fk~AA~lfl 156 (162)
||++-.++.|.+|-+.|-
T Consensus 86 ~L~~Y~~k~~~kal~~ls 103 (549)
T PF07079_consen 86 ALVAYKQKEYRKALQALS 103 (549)
T ss_pred HHHHHHhhhHHHHHHHHH
Confidence 999999999999988764
No 120
>PLN03218 maturation of RBCL 1; Provisional
Probab=65.28 E-value=85 Score=31.82 Aligned_cols=62 Identities=15% Similarity=0.113 Sum_probs=35.3
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATE 118 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~ 118 (162)
.+..+..-|++.|+.++|++.|.+|. .....+. .-..-.+|...... +++..+...+..+..
T Consensus 616 tynsLI~ay~k~G~~deAl~lf~eM~-~~Gv~PD--~~TynsLI~a~~k~-G~~eeA~~l~~eM~k 677 (1060)
T PLN03218 616 VYTIAVNSCSQKGDWDFALSIYDDMK-KKGVKPD--EVFFSALVDVAGHA-GDLDKAFEILQDARK 677 (1060)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHH-HcCCCCC--HHHHHHHHHHHHhC-CCHHHHHHHHHHHHH
Confidence 45556666777777777777777777 5433332 11233444444444 666666666555543
No 121
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=65.01 E-value=55 Score=24.37 Aligned_cols=65 Identities=8% Similarity=-0.042 Sum_probs=49.1
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
..++..+|.-+.+.|+.+.|..+|.++. .-... -.+..+++-.+-... +++....+...+|-.+.
T Consensus 58 ~~a~~~lg~~~~~~g~~~~A~~~y~~Al-~l~p~---~~~a~~~lg~~l~~~-g~~~eAi~~~~~Al~~~ 122 (144)
T PRK15359 58 WRAHIALAGTWMMLKEYTTAINFYGHAL-MLDAS---HPEPVYQTGVCLKMM-GEPGLAREAFQTAIKMS 122 (144)
T ss_pred HHHHHHHHHHHHHHhhHHHHHHHHHHHH-hcCCC---CcHHHHHHHHHHHHc-CCHHHHHHHHHHHHHhC
Confidence 5678899999999999999999999999 53322 234445555555556 88888888888887763
No 122
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=64.65 E-value=56 Score=25.93 Aligned_cols=69 Identities=12% Similarity=0.139 Sum_probs=50.9
Q ss_pred CCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 81 KTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 81 ~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
|...++-.+.-++.++.+|+.. +|..-+..-+.-.+-|==+ ..=+.+++|.-+|..|++.+|..+|=+.
T Consensus 2 y~~C~~~iv~gLie~~~~al~~-~~~~D~e~lL~ALrvLRP~--------~~e~~~~~~~l~i~r~~w~dA~rlLr~l 70 (160)
T PF09613_consen 2 YLQCSDEIVGGLIEVLSVALRL-GDPDDAEALLDALRVLRPE--------FPELDLFDGWLHIVRGDWDDALRLLREL 70 (160)
T ss_pred cCcCcHHHHHHHHHHHHHHHcc-CChHHHHHHHHHHHHhCCC--------chHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 4455677788888888888888 8877777777777766210 2336678888888888888888887653
No 123
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=64.47 E-value=34 Score=21.69 Aligned_cols=58 Identities=14% Similarity=0.118 Sum_probs=40.3
Q ss_pred HhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 58 LDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 58 a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
+.-|.+.+|++.|++++.+++.-...++ ...+..-++-+-. +++....+.++++-+.-
T Consensus 2 ~~~~~~~~~~~~A~~~~~~~l~~~p~~~----~~~~~~a~~~~~~-g~~~~A~~~l~~~l~~~ 59 (73)
T PF13371_consen 2 KQIYLQQEDYEEALEVLERALELDPDDP----ELWLQRARCLFQL-GRYEEALEDLERALELS 59 (73)
T ss_pred HHHHHhCCCHHHHHHHHHHHHHhCcccc----hhhHHHHHHHHHh-ccHHHHHHHHHHHHHHC
Confidence 4668899999999999999882223333 3344444555556 78888888888777653
No 124
>KOG2300|consensus
Probab=63.49 E-value=1.4e+02 Score=28.45 Aligned_cols=109 Identities=13% Similarity=0.057 Sum_probs=76.2
Q ss_pred hHHHHHHHHhhHhHHHhhc--CHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcC
Q psy398 47 EDQVSGIWQAKLDYLCSIG--DRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGIS 124 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iG--D~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ 124 (162)
...+-.|+.-+|+||-..| ....+++|...+. ..-.+..-..=.-|.+=.+-++|..|+++.+++++||=...+.
T Consensus 3 qdAva~aLlGlAe~~rt~~PPkIkk~IkClqA~~-~~~is~~veart~LqLg~lL~~yT~N~elAksHLekA~~i~~~-- 79 (629)
T KOG2300|consen 3 QDAVAEALLGLAEHFRTSGPPKIKKCIKCLQAIF-QFQISFLVEARTHLQLGALLLRYTKNVELAKSHLEKAWLISKS-- 79 (629)
T ss_pred hhHHHHHHHHHHHHHhhcCChhHHHHHHHHHHHh-ccCChHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHcc--
Confidence 4677889999999999999 9999999999999 7666655444344444455566657899999999999887765
Q ss_pred CCchhhhhHHHHHH---HHHHhhhhcHHHHHHHhHhhc
Q psy398 125 GGDWSARNKLKAYE---GVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 125 ggDW~rrnrLKvy~---gl~~l~~R~fk~AA~lfld~~ 159 (162)
=+... +-|+..|. -+|+...+.|..|-.++..++
T Consensus 80 ip~fy-dvKf~a~SlLa~lh~~~~~s~~~~KalLrkai 116 (629)
T KOG2300|consen 80 IPSFY-DVKFQAASLLAHLHHQLAQSFPPAKALLRKAI 116 (629)
T ss_pred cccHH-hhhhHHHHHHHHHHHHhcCCCchHHHHHHHHH
Confidence 33333 34455543 455556667766666555443
No 125
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=63.33 E-value=46 Score=30.65 Aligned_cols=49 Identities=12% Similarity=0.079 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCC
Q psy398 29 EKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFAD 80 (162)
Q Consensus 29 ~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~ 80 (162)
++-+.-++.-|+-|-. ..+.+.+|..+|..|...|+.++|+++|.+++ +
T Consensus 92 eEAIa~f~rALeL~Pd--~aeA~~A~yNLAcaya~LGr~dEAla~LrrAL-e 140 (453)
T PLN03098 92 KDALAQFETALELNPN--PDEAQAAYYNKACCHAYREEGKKAADCLRTAL-R 140 (453)
T ss_pred HHHHHHHHHHHhhCCC--chHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH-H
Confidence 4445555555557777 77778889999999999999999999999999 5
No 126
>PF10345 Cohesin_load: Cohesin loading factor; InterPro: IPR019440 Cohesin loading factor is a conserved protein that has been characterised in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes, but is dispensable in G2 when cohesion has been established. It is often referred to as Ssl3 in Schizosaccharomyces pombe (Fission yeast), and Scc4 in Saccharomyces cerevisiae (Baker's yeast). It complexes with Mis4 [].
Probab=63.28 E-value=1.1e+02 Score=28.60 Aligned_cols=105 Identities=16% Similarity=0.112 Sum_probs=80.4
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHH-HHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHhhhhhcC
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSH-RIDAVFTLFRIAYFHG-CDIQAMTAAIEKATELIEGIS 124 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~-kiD~~l~~iRv~~f~~-~d~~~v~~~i~ka~~l~e~~~ 124 (162)
+.-.++.. -++..|.+.+-+. |++..++.. .++.+.++ -.=-+|.++|+.+-.. +|+..+-.++.++..+...
T Consensus 97 d~k~~~~~-ll~~i~~~~~~~~-a~~~l~~~I-~~~~~~~~~~w~~~frll~~~l~~~~~d~~~Al~~L~~~~~~a~~-- 171 (608)
T PF10345_consen 97 DLKFRCQF-LLARIYFKTNPKA-ALKNLDKAI-EDSETYGHSAWYYAFRLLKIQLALQHKDYNAALENLQSIAQLANQ-- 171 (608)
T ss_pred HHHHHHHH-HHHHHHHhcCHHH-HHHHHHHHH-HHHhccCchhHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHhhh--
Confidence 44444443 4599999999999 999999988 77766333 3455677777766642 5999999999999999876
Q ss_pred CCchhhhhHHHHHHHHHHhhhhcHHHHHHHhH
Q psy398 125 GGDWSARNKLKAYEGVYCLAIRQYCRAAELFV 156 (162)
Q Consensus 125 ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfl 156 (162)
-|||.-+--+-+++|+-.+..+....+.+..=
T Consensus 172 ~~d~~~~v~~~l~~~~l~l~~~~~~d~~~~l~ 203 (608)
T PF10345_consen 172 RGDPAVFVLASLSEALLHLRRGSPDDVLELLQ 203 (608)
T ss_pred cCCHHHHHHHHHHHHHHHhcCCCchhHHHHHH
Confidence 68999999999999999998887766655543
No 127
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=63.20 E-value=63 Score=28.23 Aligned_cols=57 Identities=4% Similarity=-0.060 Sum_probs=43.0
Q ss_pred HhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 58 LDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 58 a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
+..+...||.+.|++.+++.. +...... ++...+.++.+-. +||..+.+.+.+..+.
T Consensus 160 a~l~l~~g~~~~Al~~l~~~~-~~~P~~~---~al~ll~~~~~~~-gdw~~a~~~l~~l~k~ 216 (398)
T PRK10747 160 VRIQLARNENHAARHGVDKLL-EVAPRHP---EVLRLAEQAYIRT-GAWSSLLDILPSMAKA 216 (398)
T ss_pred HHHHHHCCCHHHHHHHHHHHH-hcCCCCH---HHHHHHHHHHHHH-HhHHHHHHHHHHHHHc
Confidence 888999999999999999988 5443332 4455556677777 9999999777777654
No 128
>KOG2376|consensus
Probab=62.43 E-value=1.5e+02 Score=28.52 Aligned_cols=126 Identities=17% Similarity=0.177 Sum_probs=82.3
Q ss_pred HHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHH--------------------
Q psy398 29 EKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSH-------------------- 87 (162)
Q Consensus 29 ~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~-------------------- 87 (162)
.-++.++|+.|+ -+ |.++.. +....-.|.++++-|++++|++.|.... +..+.-+
T Consensus 89 ~Yrlnk~Dealk~~~-~~~~~~-~~ll~L~AQvlYrl~~ydealdiY~~L~--kn~~dd~d~~~r~nl~a~~a~l~~~~~ 164 (652)
T KOG2376|consen 89 EYRLNKLDEALKTLK-GLDRLD-DKLLELRAQVLYRLERYDEALDIYQHLA--KNNSDDQDEERRANLLAVAAALQVQLL 164 (652)
T ss_pred HHHcccHHHHHHHHh-cccccc-hHHHHHHHHHHHHHhhHHHHHHHHHHHH--hcCCchHHHHHHHHHHHHHHhhhHHHH
Confidence 346777777777 22 322333 4456678899999999999999998874 2222222
Q ss_pred ---------HHHHHHHHHHHHHHhcCCHHHHHHHHHHH-----HhhhhhcCCCchhh---hhHHHHHHHHHHhhhhcHHH
Q psy398 88 ---------RIDAVFTLFRIAYFHGCDIQAMTAAIEKA-----TELIEGISGGDWSA---RNKLKAYEGVYCLAIRQYCR 150 (162)
Q Consensus 88 ---------kiD~~l~~iRv~~f~~~d~~~v~~~i~ka-----~~l~e~~~ggDW~r---rnrLKvy~gl~~l~~R~fk~ 150 (162)
--|.+|+..=+-+.. ++|....+.+.+| +.+.++ +.++-+- -|-.+|=-+--+..+|+=.+
T Consensus 165 q~v~~v~e~syel~yN~Ac~~i~~-gky~qA~elL~kA~~~~~e~l~~~-d~~eEeie~el~~IrvQlayVlQ~~Gqt~e 242 (652)
T KOG2376|consen 165 QSVPEVPEDSYELLYNTACILIEN-GKYNQAIELLEKALRICREKLEDE-DTNEEEIEEELNPIRVQLAYVLQLQGQTAE 242 (652)
T ss_pred HhccCCCcchHHHHHHHHHHHHhc-ccHHHHHHHHHHHHHHHHHhhccc-ccchhhHHHHHHHHHHHHHHHHHHhcchHH
Confidence 235566665556666 8999999999999 444332 1122333 35666666666777888999
Q ss_pred HHHHhHhhcc
Q psy398 151 AAELFVDVIR 160 (162)
Q Consensus 151 AA~lfld~~~ 160 (162)
|...+.+++.
T Consensus 243 a~~iy~~~i~ 252 (652)
T KOG2376|consen 243 ASSIYVDIIK 252 (652)
T ss_pred HHHHHHHHHH
Confidence 9988777653
No 129
>KOG1127|consensus
Probab=62.02 E-value=70 Score=32.69 Aligned_cols=100 Identities=14% Similarity=0.083 Sum_probs=64.8
Q ss_pred HHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh-hhhcCCCch
Q psy398 50 VSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL-IEGISGGDW 128 (162)
Q Consensus 50 Ir~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l-~e~~~ggDW 128 (162)
.--|+--+|+||+.+-|+..|.+||.+++ +-..|. ..-.-........- .+|..+...+-.+.+- +-..---.|
T Consensus 491 ~apaf~~LG~iYrd~~Dm~RA~kCf~KAF-eLDatd---aeaaaa~adtyae~-~~we~a~~I~l~~~qka~a~~~k~nW 565 (1238)
T KOG1127|consen 491 LAPAFAFLGQIYRDSDDMKRAKKCFDKAF-ELDATD---AEAAAASADTYAEE-STWEEAFEICLRAAQKAPAFACKENW 565 (1238)
T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHh-cCCchh---hhhHHHHHHHhhcc-ccHHHHHHHHHHHhhhchHHHHHhhh
Confidence 33588899999999999999999999999 644331 11222334444555 7777766554333221 110000134
Q ss_pred hhhhHHHHHHHHHHhhhhcHHHHHHHhHhhccC
Q psy398 129 SARNKLKAYEGVYCLAIRQYCRAAELFVDVIRS 161 (162)
Q Consensus 129 ~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~T 161 (162)
-+ .|+|+|...++.+|-.-|-.++.|
T Consensus 566 ~~-------rG~yyLea~n~h~aV~~fQsALR~ 591 (1238)
T KOG1127|consen 566 VQ-------RGPYYLEAHNLHGAVCEFQSALRT 591 (1238)
T ss_pred hh-------ccccccCccchhhHHHHHHHHhcC
Confidence 43 799999999999998888776654
No 130
>KOG0543|consensus
Probab=61.75 E-value=99 Score=28.08 Aligned_cols=95 Identities=17% Similarity=0.109 Sum_probs=49.7
Q ss_pred HhhHhHHHhhcCHHHHHHHHHHhcC--CCCCChHH---------HHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhc
Q psy398 55 QAKLDYLCSIGDRAEAFKLADSKFA--DKTNPKSH---------RIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGI 123 (162)
Q Consensus 55 ~~~a~~~~~iGD~~~A~~ay~~~~~--~~~~s~g~---------kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~ 123 (162)
...|++|++-|++..|...|+++.. +++.+... ++=+.+++.=+-+-. ++|. ..|..+...++-
T Consensus 212 ke~Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl-~~~~---~Ai~~c~kvLe~- 286 (397)
T KOG0543|consen 212 KERGNVLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKL-KEYK---EAIESCNKVLEL- 286 (397)
T ss_pred HHhhhHHHhhchHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHhhHHHHHHHhh-hhHH---HHHHHHHHHHhc-
Confidence 3589999999999999999999540 23322211 111222222222222 3444 445555555543
Q ss_pred CCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 124 SGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 124 ~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
+..-. |-=--.|=.++..++|..|-..|.-+
T Consensus 287 -~~~N~---KALyRrG~A~l~~~e~~~A~~df~ka 317 (397)
T KOG0543|consen 287 -DPNNV---KALYRRGQALLALGEYDLARDDFQKA 317 (397)
T ss_pred -CCCch---hHHHHHHHHHHhhccHHHHHHHHHHH
Confidence 22211 11113466666777777776666543
No 131
>KOG1156|consensus
Probab=61.71 E-value=50 Score=31.89 Aligned_cols=67 Identities=19% Similarity=0.123 Sum_probs=60.0
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
+-+.....-++.||-..||++.|++-...++ +.|.| .++..+...||.--. |+.+....-+..|+++
T Consensus 368 ttllWt~y~laqh~D~~g~~~~A~~yId~AI-dHTPT---liEly~~KaRI~kH~-G~l~eAa~~l~ea~el 434 (700)
T KOG1156|consen 368 TTLLWTLYFLAQHYDKLGDYEVALEYIDLAI-DHTPT---LIELYLVKARIFKHA-GLLDEAAAWLDEAQEL 434 (700)
T ss_pred hHHHHHHHHHHHHHHHcccHHHHHHHHHHHh-ccCch---HHHHHHHHHHHHHhc-CChHHHHHHHHHHHhc
Confidence 5677888899999999999999999999999 88877 689999999999988 8999888888888876
No 132
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=61.12 E-value=14 Score=25.66 Aligned_cols=51 Identities=14% Similarity=0.124 Sum_probs=34.3
Q ss_pred HhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 62 CSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 62 ~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
=..|.+++|++.|..+. ++|... +=+..-..||=-+-. ....++++|+.+-
T Consensus 17 D~~gny~eA~~lY~~al-e~~~~e--kn~~~k~~i~~K~~~-----~a~~yl~RAE~Lk 67 (75)
T cd02680 17 DEKGNAEEAIELYTEAV-ELCINT--SNETMDQALQTKLKQ-----LARQALDRAEALK 67 (75)
T ss_pred hHhhhHHHHHHHHHHHH-HHHHHh--cChhhHHHHHHHHHH-----HHHHHHHHHHHHH
Confidence 36789999999999999 888762 334444444433322 4457777777763
No 133
>PRK15331 chaperone protein SicA; Provisional
Probab=60.30 E-value=86 Score=25.00 Aligned_cols=100 Identities=15% Similarity=0.011 Sum_probs=63.7
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCC
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGG 126 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~gg 126 (162)
.++..++....|--++..|++++|.+.|.-.. -+.... ++ -++.+-=+.=.. +++......-.-|-.+ +. .
T Consensus 33 s~~~le~iY~~Ay~~y~~Gk~~eA~~~F~~L~-~~d~~n-~~--Y~~GLaa~~Q~~-k~y~~Ai~~Y~~A~~l-~~---~ 103 (165)
T PRK15331 33 PQDMMDGLYAHAYEFYNQGRLDEAETFFRFLC-IYDFYN-PD--YTMGLAAVCQLK-KQFQKACDLYAVAFTL-LK---N 103 (165)
T ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH-HhCcCc-HH--HHHHHHHHHHHH-HHHHHHHHHHHHHHHc-cc---C
Confidence 36778899999999999999999999999854 332222 22 222222222222 3444444333333333 11 1
Q ss_pred chhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 127 DWSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 127 DW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
| .|.-.|.|.-+|..|+-..|-.+|-.++
T Consensus 104 d----p~p~f~agqC~l~l~~~~~A~~~f~~a~ 132 (165)
T PRK15331 104 D----YRPVFFTGQCQLLMRKAAKARQCFELVN 132 (165)
T ss_pred C----CCccchHHHHHHHhCCHHHHHHHHHHHH
Confidence 2 3446789999999999999999987665
No 134
>PRK12798 chemotaxis protein; Reviewed
Probab=60.19 E-value=43 Score=30.61 Aligned_cols=68 Identities=18% Similarity=0.162 Sum_probs=58.3
Q ss_pred CChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHh
Q psy398 83 NPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELF 155 (162)
Q Consensus 83 ~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lf 155 (162)
.....+.++.|.+-|-++.- +......-..++|..+.+. +-....+.+.|+|...+.+-++..|.+.+
T Consensus 251 ~d~~~q~~lYL~iAR~Ali~-Gk~~lA~~As~~A~~L~~~----~~~~~~ra~LY~aaa~v~s~~~~~al~~L 318 (421)
T PRK12798 251 MDPERQRELYLRIARAALID-GKTELARFASERALKLADP----DSADAARARLYRGAALVASDDAESALEEL 318 (421)
T ss_pred cCchhHHHHHHHHHHHHHHc-CcHHHHHHHHHHHHHhccC----CCcchHHHHHHHHHHccCcccHHHHHHHH
Confidence 34566788999999999999 9999999999999999863 33447889999999999999999888765
No 135
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=60.07 E-value=57 Score=22.89 Aligned_cols=79 Identities=15% Similarity=0.114 Sum_probs=48.6
Q ss_pred HHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCCh--HHHHHHHHHHHHHHHHhcCC
Q psy398 28 NEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPK--SHRIDAVFTLFRIAYFHGCD 105 (162)
Q Consensus 28 n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~--g~kiD~~l~~iRv~~f~~~d 105 (162)
....++.|+..+..|-. +. ++...+|..+..-|+.+.|++.+-... ....+- +.-=-.++.++++- |.+
T Consensus 4 ~~~~~~al~~~~a~~P~--D~---~ar~~lA~~~~~~g~~e~Al~~Ll~~v-~~dr~~~~~~ar~~ll~~f~~l---g~~ 74 (90)
T PF14561_consen 4 DAPDIAALEAALAANPD--DL---DARYALADALLAAGDYEEALDQLLELV-RRDRDYEDDAARKRLLDIFELL---GPG 74 (90)
T ss_dssp S-HHHHHHHHHHHHSTT---H---HHHHHHHHHHHHTT-HHHHHHHHHHHH-CC-TTCCCCHHHHHHHHHHHHH----TT
T ss_pred CcccHHHHHHHHHcCCC--CH---HHHHHHHHHHHHCCCHHHHHHHHHHHH-HhCccccccHHHHHHHHHHHHc---CCC
Confidence 34568889999998888 75 456699999999999999999998876 433221 22212344444433 344
Q ss_pred HHHHHHHHHH
Q psy398 106 IQAMTAAIEK 115 (162)
Q Consensus 106 ~~~v~~~i~k 115 (162)
.+.|.+|=.|
T Consensus 75 ~plv~~~RRk 84 (90)
T PF14561_consen 75 DPLVSEYRRK 84 (90)
T ss_dssp -HHHHHHHHH
T ss_pred ChHHHHHHHH
Confidence 5566555443
No 136
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=59.86 E-value=25 Score=27.89 Aligned_cols=62 Identities=15% Similarity=0.210 Sum_probs=33.2
Q ss_pred HHhhHh-HHHhhcC--HHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 54 WQAKLD-YLCSIGD--RAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 54 ~~~~a~-~~~~iGD--~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
+..+|. +|.+-|+ .+.|.+.+.+++ ...... .+..+.+--..+.. +|+.....+..++-++.
T Consensus 110 ~~~lA~aL~~~~g~~~~~~A~~~l~~al-~~dP~~---~~al~~LA~~~~~~-g~~~~Ai~~~~~aL~l~ 174 (198)
T PRK10370 110 YAALATVLYYQAGQHMTPQTREMIDKAL-ALDANE---VTALMLLASDAFMQ-ADYAQAIELWQKVLDLN 174 (198)
T ss_pred HHHHHHHHHHhcCCCCcHHHHHHHHHHH-HhCCCC---hhHHHHHHHHHHHc-CCHHHHHHHHHHHHhhC
Confidence 345555 3455555 366666666666 222211 23444444455555 66666666666666654
No 137
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=59.03 E-value=13 Score=23.08 Aligned_cols=25 Identities=16% Similarity=0.057 Sum_probs=22.7
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.+++|.=|.++||++.|.+......
T Consensus 2 kLdLA~ayie~Gd~e~Ar~lL~evl 26 (44)
T TIGR03504 2 KLDLARAYIEMGDLEGARELLEEVI 26 (44)
T ss_pred chHHHHHHHHcCChHHHHHHHHHHH
Confidence 4688999999999999999999888
No 138
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=59.03 E-value=1.1e+02 Score=28.77 Aligned_cols=57 Identities=16% Similarity=-0.028 Sum_probs=35.7
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKAT 117 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~ 117 (162)
.+..+.+.|++.|+.+.|.+.|.++. ++.+. ..-.+|.-.... ++...+.+.+++..
T Consensus 362 ~~~~Li~~y~k~G~~~~A~~vf~~m~-~~d~~------t~n~lI~~y~~~-G~~~~A~~lf~~M~ 418 (697)
T PLN03081 362 ANTALVDLYSKWGRMEDARNVFDRMP-RKNLI------SWNALIAGYGNH-GRGTKAVEMFERMI 418 (697)
T ss_pred ehHHHHHHHHHCCCHHHHHHHHHhCC-CCCee------eHHHHHHHHHHc-CCHHHHHHHHHHHH
Confidence 45678888999999999999998888 54322 123344433344 56555555555543
No 139
>PRK14574 hmsH outer membrane protein; Provisional
Probab=58.73 E-value=96 Score=30.54 Aligned_cols=61 Identities=16% Similarity=0.037 Sum_probs=39.0
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIE 121 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e 121 (162)
+...+|..|...|+.++|++.|+++. +..... .++.+.++.+.... +... +.+..++.+..
T Consensus 104 ~llalA~ly~~~gdyd~Aiely~kaL-~~dP~n---~~~l~gLa~~y~~~-~q~~---eAl~~l~~l~~ 164 (822)
T PRK14574 104 GLASAARAYRNEKRWDQALALWQSSL-KKDPTN---PDLISGMIMTQADA-GRGG---VVLKQATELAE 164 (822)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHH-hhCCCC---HHHHHHHHHHHhhc-CCHH---HHHHHHHHhcc
Confidence 34456889999999999999999999 555444 34445554444444 4444 44555555543
No 140
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=58.42 E-value=17 Score=19.62 Aligned_cols=27 Identities=26% Similarity=0.335 Sum_probs=22.4
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCC
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADK 81 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~ 81 (162)
+..+-.-|++.|+.+.|.+.|..|. +.
T Consensus 4 y~~ll~a~~~~g~~~~a~~~~~~M~-~~ 30 (34)
T PF13812_consen 4 YNALLRACAKAGDPDAALQLFDEMK-EQ 30 (34)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH-Hh
Confidence 4556677899999999999999998 54
No 141
>PLN03218 maturation of RBCL 1; Provisional
Probab=58.16 E-value=1.2e+02 Score=30.68 Aligned_cols=25 Identities=24% Similarity=0.163 Sum_probs=12.1
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
+..+..-|++.|+.++|++.|.+|.
T Consensus 510 ynaLI~gy~k~G~~eeAl~lf~~M~ 534 (1060)
T PLN03218 510 FGALIDGCARAGQVAKAFGAYGIMR 534 (1060)
T ss_pred HHHHHHHHHHCcCHHHHHHHHHHHH
Confidence 3444444455555555555554444
No 142
>KOG1129|consensus
Probab=58.12 E-value=27 Score=31.61 Aligned_cols=87 Identities=11% Similarity=0.153 Sum_probs=61.3
Q ss_pred hhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHH--HH
Q psy398 63 SIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYE--GV 140 (162)
Q Consensus 63 ~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~--gl 140 (162)
=-|.-|-|+.-|++.+ .--+.++. +|+-|=++-++++.++.+...+.+|.++.+. .|. +--|.. |-
T Consensus 336 Y~~~PE~AlryYRRiL-qmG~~spe----Lf~NigLCC~yaqQ~D~~L~sf~RAlstat~--~~~-----aaDvWYNlg~ 403 (478)
T KOG1129|consen 336 YDNNPEMALRYYRRIL-QMGAQSPE----LFCNIGLCCLYAQQIDLVLPSFQRALSTATQ--PGQ-----AADVWYNLGF 403 (478)
T ss_pred cCCChHHHHHHHHHHH-HhcCCChH----HHhhHHHHHHhhcchhhhHHHHHHHHhhccC--cch-----hhhhhhccce
Confidence 3567888999999988 66555554 3455555555559999999999999999875 221 112322 34
Q ss_pred HHhhhhcHHHHHHHhHhhccC
Q psy398 141 YCLAIRQYCRAAELFVDVIRS 161 (162)
Q Consensus 141 ~~l~~R~fk~AA~lfld~~~T 161 (162)
-.+.+|||.-|.++|--+++.
T Consensus 404 vaV~iGD~nlA~rcfrlaL~~ 424 (478)
T KOG1129|consen 404 VAVTIGDFNLAKRCFRLALTS 424 (478)
T ss_pred eEEeccchHHHHHHHHHHhcc
Confidence 456789999999999877653
No 143
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=58.06 E-value=43 Score=20.83 Aligned_cols=47 Identities=13% Similarity=0.100 Sum_probs=32.8
Q ss_pred HHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCC
Q psy398 32 LKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNP 84 (162)
Q Consensus 32 l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s 84 (162)
++.++.-+..+-. ..+++ ..+|..|.+.|+.++|.+.+.+.. .....
T Consensus 11 ~~~~~~~l~~~p~--~~~~~---~~la~~~~~~g~~~~A~~~l~~~~-~~~~~ 57 (68)
T PF14559_consen 11 IELLEKALQRNPD--NPEAR---LLLAQCYLKQGQYDEAEELLERLL-KQDPD 57 (68)
T ss_dssp HHHHHHHHHHTTT--SHHHH---HHHHHHHHHTT-HHHHHHHHHCCH-GGGTT
T ss_pred HHHHHHHHHHCCC--CHHHH---HHHHHHHHHcCCHHHHHHHHHHHH-HHCcC
Confidence 3444444555555 55544 479999999999999999999988 54443
No 144
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=57.21 E-value=82 Score=23.80 Aligned_cols=32 Identities=16% Similarity=0.050 Sum_probs=27.9
Q ss_pred HHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCC
Q psy398 50 VSGIWQAKLDYLCSIGDRAEAFKLADSKFADKT 82 (162)
Q Consensus 50 Ir~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~ 82 (162)
.-.++..+|.-|.+.|+.+.|.++|.+++ ...
T Consensus 71 ~~~~~~~la~~~~~~g~~~~A~~~~~~al-~~~ 102 (172)
T PRK02603 71 RSYILYNMGIIYASNGEHDKALEYYHQAL-ELN 102 (172)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHH-HhC
Confidence 34678999999999999999999999998 543
No 145
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=57.11 E-value=21 Score=32.27 Aligned_cols=76 Identities=17% Similarity=0.225 Sum_probs=49.6
Q ss_pred CCCCChHHHHHHHH---HHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCC-ch---hhhhHHHHHHHHHHhhhhcHHHHH
Q psy398 80 DKTNPKSHRIDAVF---TLFRIAYFHGCDIQAMTAAIEKATELIEGISGG-DW---SARNKLKAYEGVYCLAIRQYCRAA 152 (162)
Q Consensus 80 ~~~~s~g~kiD~~l---~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~gg-DW---~rrnrLKvy~gl~~l~~R~fk~AA 152 (162)
++..++-.++==.| .++||-... ||+....+-++-.+- .+ .+ =| ...=.+-=|-|..+|+.|.|..|.
T Consensus 110 ~~g~~~l~~~LGYFSligLlRvh~LL-GDY~~Alk~l~~idl--~~--~~l~~~V~~~~is~~YyvGFaylMlrRY~DAi 184 (404)
T PF10255_consen 110 EYGSSPLYKMLGYFSLIGLLRVHCLL-GDYYQALKVLENIDL--NK--KGLYTKVPACHISTYYYVGFAYLMLRRYADAI 184 (404)
T ss_pred ccccccHHHHhhHHHHHHHHHHHHhc-cCHHHHHHHhhccCc--cc--chhhccCcchheehHHHHHHHHHHHHHHHHHH
Confidence 45555555655444 456999999 999887766654432 11 11 11 111222338999999999999999
Q ss_pred HHhHhhcc
Q psy398 153 ELFVDVIR 160 (162)
Q Consensus 153 ~lfld~~~ 160 (162)
+.|..++.
T Consensus 185 r~f~~iL~ 192 (404)
T PF10255_consen 185 RTFSQILL 192 (404)
T ss_pred HHHHHHHH
Confidence 99988763
No 146
>KOG2002|consensus
Probab=56.77 E-value=18 Score=36.20 Aligned_cols=66 Identities=21% Similarity=0.259 Sum_probs=57.8
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
++|..+|+=|...|.+-.|++.|+.++ .+.- ....-+|..++-|+-+.. +.|...+.++.+|..+.
T Consensus 681 dv~lNlah~~~e~~qy~~AIqmYe~~l-kkf~-~~~~~~vl~~Lara~y~~-~~~~eak~~ll~a~~~~ 746 (1018)
T KOG2002|consen 681 DVWLNLAHCYVEQGQYRLAIQMYENCL-KKFY-KKNRSEVLHYLARAWYEA-GKLQEAKEALLKARHLA 746 (1018)
T ss_pred ceeeeHHHHHHHHHHHHHHHHHHHHHH-HHhc-ccCCHHHHHHHHHHHHHh-hhHHHHHHHHHHHHHhC
Confidence 466799999999999999999999998 4443 666788999999999999 99999999999998874
No 147
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=56.22 E-value=99 Score=24.41 Aligned_cols=100 Identities=17% Similarity=0.011 Sum_probs=67.6
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCc
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGD 127 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggD 127 (162)
.+..++...+|--+++.|++++|.+.|.-+. .+... -.+-.+++==+.=-. ++|......-.+|-.+- .|
T Consensus 32 ~~~l~~lY~~A~~ly~~G~l~~A~~~f~~L~-~~Dp~---~~~y~~gLG~~~Q~~-g~~~~AI~aY~~A~~L~-----~d 101 (157)
T PRK15363 32 TQPLNTLYRYAMQLMEVKEFAGAARLFQLLT-IYDAW---SFDYWFRLGECCQAQ-KHWGEAIYAYGRAAQIK-----ID 101 (157)
T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH-HhCcc---cHHHHHHHHHHHHHH-hhHHHHHHHHHHHHhcC-----CC
Confidence 6778889999999999999999999998854 22211 112222222222223 56666666666666652 11
Q ss_pred hhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 128 WSARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 128 W~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
-.+.--+.|+-+|..|+...|-+.|--++.
T Consensus 102 ---dp~~~~~ag~c~L~lG~~~~A~~aF~~Ai~ 131 (157)
T PRK15363 102 ---APQAPWAAAECYLACDNVCYAIKALKAVVR 131 (157)
T ss_pred ---CchHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 245677899999999999999999976653
No 148
>COG1516 FliS Flagellin-specific chaperone FliS [Cell motility and secretion / Intracellular trafficking and secretion / Posttranslational modification, protein turnover, chaperones]
Probab=56.16 E-value=92 Score=24.03 Aligned_cols=64 Identities=23% Similarity=0.295 Sum_probs=47.8
Q ss_pred HHHHHHHHhcCCCCCChHHHHHHHHH-------HHHHHHHhcCCHHHHHHHHHHHHhhh---------hhcCCCchhhhh
Q psy398 69 EAFKLADSKFADKTNPKSHRIDAVFT-------LFRIAYFHGCDIQAMTAAIEKATELI---------EGISGGDWSARN 132 (162)
Q Consensus 69 ~A~~ay~~~~~~~~~s~g~kiD~~l~-------~iRv~~f~~~d~~~v~~~i~ka~~l~---------e~~~ggDW~rrn 132 (162)
+|.++|.... --+.|+..-+-|++. ..|.++.. +|+.....+|.||.+.+ |+ ||+|. .|
T Consensus 5 ~~~~aY~qn~-V~taSP~~Li~MLyeg~l~~l~~A~~aie~-~~i~~k~~~i~ka~~Ii~eL~~~Ld~E~--Ggeia-~n 79 (132)
T COG1516 5 NAYQAYQQNQ-VNTASPHKLILMLYEGALKFLKRAKEAIEQ-EDIEEKNESIDKAIDIITELRASLDYEK--GGEIA-QN 79 (132)
T ss_pred HHHHHHHHHH-HhcCCHHHHHHHHHHHHHHHHHHHHHHHHh-ccHHHHHHHHHHHHHHHHHHHHhcCHhh--cchHH-HH
Confidence 4788998877 678888888877654 45667777 99999999999999976 44 67765 34
Q ss_pred HHHHH
Q psy398 133 KLKAY 137 (162)
Q Consensus 133 rLKvy 137 (162)
=...|
T Consensus 80 L~~LY 84 (132)
T COG1516 80 LDALY 84 (132)
T ss_pred HHHHH
Confidence 44433
No 149
>PF04010 DUF357: Protein of unknown function (DUF357); InterPro: IPR023140 This domain is found in a family of proteins, which have no known function.; PDB: 2OO2_A 2PMR_A.
Probab=55.97 E-value=22 Score=24.56 Aligned_cols=26 Identities=15% Similarity=0.162 Sum_probs=21.2
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.+..=|.||.+.||..+|+.|+.=..
T Consensus 37 ~Y~~D~~~fl~~gD~v~Ala~~sYa~ 62 (75)
T PF04010_consen 37 SYLEDGKYFLEKGDYVNALACFSYAH 62 (75)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 47788999999999999999987543
No 150
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=55.04 E-value=31 Score=29.34 Aligned_cols=64 Identities=17% Similarity=0.198 Sum_probs=40.1
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.++.-|-|+|+-|..++|..-|+++. ..+ .-++.-|..-++--+++-. ++......++.++-.+
T Consensus 105 VLNNYG~FLC~qg~~~eA~q~F~~Al-~~P-~Y~~~s~t~eN~G~Cal~~-gq~~~A~~~l~raL~~ 168 (250)
T COG3063 105 VLNNYGAFLCAQGRPEEAMQQFERAL-ADP-AYGEPSDTLENLGLCALKA-GQFDQAEEYLKRALEL 168 (250)
T ss_pred hhhhhhHHHHhCCChHHHHHHHHHHH-hCC-CCCCcchhhhhhHHHHhhc-CCchhHHHHHHHHHHh
Confidence 46678888888888888888888887 322 2234444444444444445 5555666666555443
No 151
>PRK06771 hypothetical protein; Provisional
Probab=53.06 E-value=70 Score=23.31 Aligned_cols=56 Identities=21% Similarity=0.265 Sum_probs=38.6
Q ss_pred HHHHHHHHHHHHHHHHHHHH---hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 21 YTEMKDANEKRLKELDLEHE---KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 21 ~~~~~~~n~~~l~~Le~~lk---~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
+..++++++.+++++|.+|. +.+|..+.+ . ...+-..-+.+-|.+-+|.+.|+..-
T Consensus 21 l~~~~~~~~~~~k~ie~~L~~I~~~~Gi~~~~-~-~~~~e~~~Li~~Gkki~AIK~~Re~t 79 (93)
T PRK06771 21 LTKIEKKTDARLKRMEDRLQLITKEMGIVDRE-P-PVNKELRQLMEEGQTVTAVKRVREAF 79 (93)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCc-c-cccHHHHHHHHcCCchHHHHHHHHHc
Confidence 45678899999999999999 555533331 0 11111233778899999999998865
No 152
>COG5071 RPN5 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=52.98 E-value=1.7e+02 Score=26.25 Aligned_cols=100 Identities=17% Similarity=0.090 Sum_probs=75.6
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhc-CC-CCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKF-AD-KTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~-~~-~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
-.++..-+...||...|....-..- +- -..-.+-++...|..+|+++.. +||.....+..|+..-.-+ ..|-+ .
T Consensus 134 T~~L~~ikee~Gdi~sA~Dilcn~pVETygs~~~Sekv~fiLEQ~rL~vl~-~Dy~~A~~~~kKI~KK~Fe--~~d~~-s 209 (439)
T COG5071 134 TQLLSQIKEEQGDIKSAQDILCNEPVETYGSFDLSEKVAFILEQVRLFLLR-SDYYMASTYTKKINKKFFE--KEDVQ-S 209 (439)
T ss_pred HHHHHHHHHHhcchhHHHHHHhcCchhhccchhHHHHHHHHHHHHHHHHhc-cchHHHHHHHHHHHHHHhc--cccHH-H
Confidence 4578888999999999988665422 01 1234577899999999999999 9999999999998775433 34544 5
Q ss_pred hHHHHHHHHHH--hhhhcHHHHHHHhHh
Q psy398 132 NKLKAYEGVYC--LAIRQYCRAAELFVD 157 (162)
Q Consensus 132 nrLKvy~gl~~--l~~R~fk~AA~lfld 157 (162)
-|+|-|+=+-. |+.|.|-.|++-+-+
T Consensus 210 lKlkyYeL~V~i~Lh~R~Yl~v~~y~~~ 237 (439)
T COG5071 210 LKLKYYELKVRIGLHDRAYLDVCKYYRA 237 (439)
T ss_pred HHHHHHHHhheeecccHHHHHHHHHHHH
Confidence 78888987765 458999999976654
No 153
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=52.75 E-value=35 Score=21.95 Aligned_cols=38 Identities=8% Similarity=-0.010 Sum_probs=26.9
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHH
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRI 89 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~ki 89 (162)
|+.+.-+|-=++++||+++|.+.....+ ..-....+-.
T Consensus 1 Rd~lY~lAig~ykl~~Y~~A~~~~~~lL-~~eP~N~Qa~ 38 (53)
T PF14853_consen 1 RDCLYYLAIGHYKLGEYEKARRYCDALL-EIEPDNRQAQ 38 (53)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHH-HHTTS-HHHH
T ss_pred ChhHHHHHHHHHHhhhHHHHHHHHHHHH-hhCCCcHHHH
Confidence 5666777777899999999999999888 5444444433
No 154
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=52.72 E-value=1.9e+02 Score=26.71 Aligned_cols=33 Identities=12% Similarity=-0.060 Sum_probs=28.5
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCC
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNP 84 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s 84 (162)
-++|.++|.-|.+.|++++|+.+|++++ +-...
T Consensus 75 a~a~~NLG~AL~~lGryeEAIa~f~rAL-eL~Pd 107 (453)
T PLN03098 75 AEDAVNLGLSLFSKGRVKDALAQFETAL-ELNPN 107 (453)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH-hhCCC
Confidence 5688999999999999999999999988 55443
No 155
>KOG2300|consensus
Probab=52.32 E-value=1.4e+02 Score=28.29 Aligned_cols=101 Identities=15% Similarity=0.017 Sum_probs=61.6
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhh---hcCCCchh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIE---GISGGDWS 129 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e---~~~ggDW~ 129 (162)
-|.-+|-|-+..|-+++|...|..+. ..|.+... +.|+..-+++.| -.......+ -++-+.+- ..+-..--
T Consensus 369 ih~LlGlys~sv~~~enAe~hf~~a~-k~t~~~dl---~a~~nlnlAi~Y-L~~~~~ed~-y~~ld~i~p~nt~s~ssq~ 442 (629)
T KOG2300|consen 369 IHMLLGLYSHSVNCYENAEFHFIEAT-KLTESIDL---QAFCNLNLAISY-LRIGDAEDL-YKALDLIGPLNTNSLSSQR 442 (629)
T ss_pred HHHHHhhHhhhcchHHHHHHHHHHHH-HhhhHHHH---HHHHHHhHHHHH-HHhccHHHH-HHHHHhcCCCCCCcchHHH
Confidence 35668899999999999999999988 56544333 455555566555 222221111 12222221 00011112
Q ss_pred hhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 130 ARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 130 rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
.+|-.+-..|++.+.+|+|.+|-..+-+++
T Consensus 443 l~a~~~~v~glfaf~qn~lnEaK~~l~e~L 472 (629)
T KOG2300|consen 443 LEASILYVYGLFAFKQNDLNEAKRFLRETL 472 (629)
T ss_pred HHHHHHHHHHHHHHHhccHHHHHHHHHHHH
Confidence 356666688999999999999977765554
No 156
>KOG2047|consensus
Probab=51.96 E-value=16 Score=35.43 Aligned_cols=33 Identities=15% Similarity=0.226 Sum_probs=28.5
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChH
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKS 86 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g 86 (162)
-|..+|+||-++|++++|-..|.++. ..+++..
T Consensus 250 Lw~SLAdYYIr~g~~ekarDvyeeai-~~v~tvr 282 (835)
T KOG2047|consen 250 LWCSLADYYIRSGLFEKARDVYEEAI-QTVMTVR 282 (835)
T ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHH-Hhheehh
Confidence 46789999999999999999999999 6666654
No 157
>KOG0548|consensus
Probab=51.87 E-value=97 Score=29.20 Aligned_cols=90 Identities=19% Similarity=0.208 Sum_probs=64.5
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHH
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLK 135 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLK 135 (162)
.+|+-.++.||++.|+.+|+.+. .-..+ .|++ .-=|.+-+. -...+.+.+.-|+..++. -.||-.- -
T Consensus 7 ~kgnaa~s~~d~~~ai~~~t~ai-~l~p~-nhvl----ySnrsaa~a--~~~~~~~al~da~k~~~l--~p~w~kg---y 73 (539)
T KOG0548|consen 7 EKGNAAFSSGDFETAIRLFTEAI-MLSPT-NHVL----YSNRSAAYA--SLGSYEKALKDATKTRRL--NPDWAKG---Y 73 (539)
T ss_pred HHHHhhcccccHHHHHHHHHHHH-ccCCC-ccch----hcchHHHHH--HHhhHHHHHHHHHHHHhc--CCchhhH---H
Confidence 57788899999999999999988 44444 6653 444555443 445667778888888887 7899732 1
Q ss_pred HHHHHHHhhhhcHHHHHHHhHhh
Q psy398 136 AYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 136 vy~gl~~l~~R~fk~AA~lfld~ 158 (162)
.-.|..++.-++|.+|-.-|-+.
T Consensus 74 ~r~Gaa~~~lg~~~eA~~ay~~G 96 (539)
T KOG0548|consen 74 SRKGAALFGLGDYEEAILAYSEG 96 (539)
T ss_pred HHhHHHHHhcccHHHHHHHHHHH
Confidence 12578888889999887766543
No 158
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=51.71 E-value=95 Score=25.61 Aligned_cols=51 Identities=16% Similarity=0.128 Sum_probs=34.2
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCCh-HHHHHHHHHHHHHHHHhcCCHH
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPK-SHRIDAVFTLFRIAYFHGCDIQ 107 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~-g~kiD~~l~~iRv~~f~~~d~~ 107 (162)
...+|-||.+. |.++|...|-+++ +-.... .--.|+..+++-+..-. +++.
T Consensus 144 q~aLAtyY~kr-D~~Kt~~ll~~~L-~l~~~~~~~n~eil~sLas~~~~~-~~~e 195 (203)
T PF11207_consen 144 QYALATYYTKR-DPEKTIQLLLRAL-ELSNPDDNFNPEILKSLASIYQKL-KNYE 195 (203)
T ss_pred HHHHHHHHHcc-CHHHHHHHHHHHH-HhcCCCCCCCHHHHHHHHHHHHHh-cchh
Confidence 45677888765 8888888888877 544433 55566777777666666 5544
No 159
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=51.53 E-value=1.6e+02 Score=25.28 Aligned_cols=20 Identities=20% Similarity=0.190 Sum_probs=12.9
Q ss_pred HHHHhhhhcHHHHHHHhHhh
Q psy398 139 GVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 139 gl~~l~~R~fk~AA~lfld~ 158 (162)
|++++..+|++.|..+|+..
T Consensus 175 gms~~L~gd~~~A~~lll~a 194 (257)
T COG5010 175 GMSLLLRGDLEDAETLLLPA 194 (257)
T ss_pred HHHHHHcCCHHHHHHHHHHH
Confidence 56666666666666666654
No 160
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=50.53 E-value=64 Score=22.29 Aligned_cols=23 Identities=13% Similarity=-0.174 Sum_probs=17.7
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhc
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.+|-=.=+.|++++|+.+|..+.
T Consensus 11 ~~Ave~D~~g~y~eA~~~Y~~ai 33 (76)
T cd02681 11 RLAVQRDQEGRYSEAVFYYKEAA 33 (76)
T ss_pred HHHHHHHHccCHHHHHHHHHHHH
Confidence 34455567789999999999887
No 161
>KOG0551|consensus
Probab=49.43 E-value=99 Score=27.86 Aligned_cols=70 Identities=11% Similarity=0.081 Sum_probs=51.9
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHH--HHhcCCHHHHHHHHHHHHhh
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIA--YFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~--~f~~~d~~~v~~~i~ka~~l 119 (162)
..|+-..+.+=|++|.+.-++-.|..+|++-++.+|.- ..|..++.-=|-+ +++ +|+..+.+-..+|..+
T Consensus 77 p~E~Aen~KeeGN~~fK~Kryk~A~~~Yt~Glk~kc~D--~dlnavLY~NRAAa~~~l-~NyRs~l~Dcs~al~~ 148 (390)
T KOG0551|consen 77 PHEQAENYKEEGNEYFKEKRYKDAVESYTEGLKKKCAD--PDLNAVLYTNRAAAQLYL-GNYRSALNDCSAALKL 148 (390)
T ss_pred hHHHHHHHHHHhHHHHHhhhHHHHHHHHHHHHhhcCCC--ccHHHHHHhhHHHHHHHH-HHHHHHHHHHHHHHhc
Confidence 46789999999999999999999999999999455554 4555555555544 555 6776666666665554
No 162
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=48.88 E-value=47 Score=30.73 Aligned_cols=62 Identities=16% Similarity=0.088 Sum_probs=45.4
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
.++..+|-.+..-|+.+.|..++.++. +-..+ .+....+-++..+. |+.........+|-.+
T Consensus 421 ~~~~ala~~~~~~g~~~~A~~~l~rAl-~L~ps----~~a~~~lG~~~~~~-G~~~eA~~~~~~A~~L 482 (517)
T PRK10153 421 RIYEILAVQALVKGKTDEAYQAINKAI-DLEMS----WLNYVLLGKVYELK-GDNRLAADAYSTAFNL 482 (517)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHH-HcCCC----HHHHHHHHHHHHHc-CCHHHHHHHHHHHHhc
Confidence 467777777777788888888888888 54432 34555666666777 8888888888888776
No 163
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=48.72 E-value=58 Score=27.87 Aligned_cols=66 Identities=14% Similarity=0.059 Sum_probs=50.7
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHH-HHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRI-DAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~ki-D~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
-.|+.=+|+-|+.-||++.|-+.|.++..++..+ .|. |+.|-+-.+-.-. ++.......+.+.-+-
T Consensus 178 ~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s--~KApdallKlg~~~~~l-~~~d~A~atl~qv~k~ 244 (262)
T COG1729 178 PNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKS--PKAPDALLKLGVSLGRL-GNTDEACATLQQVIKR 244 (262)
T ss_pred chhHHHHHHHHHhcccchHHHHHHHHHHHhCCCC--CCChHHHHHHHHHHHHh-cCHHHHHHHHHHHHHH
Confidence 4688889999999999999999999988344443 334 8888887777777 8887777777665443
No 164
>TIGR03362 VI_chp_7 type VI secretion-associated protein, VC_A0119 family. This protein family is one of two related families in type VI secretion systems that contain an ImpA-related N-terminal domain (pfam06812).
Probab=48.67 E-value=1.8e+02 Score=25.18 Aligned_cols=74 Identities=11% Similarity=0.147 Sum_probs=64.4
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
+-+.-.++.|.++.|+......+ ..+.+..+++=.-|.+.|+.... +=..+....+......++..+=.+||.-
T Consensus 218 ~eA~~l~~~~gl~~Al~~L~~~~-~~~~s~R~rf~~rL~~A~l~~~~-g~~~lA~~ll~~L~~~~~~~~L~~WEP~ 291 (301)
T TIGR03362 218 EEARALAAEGGLEAALQRLQQRL-AQAREPRERFHWRLLLARLLEQA-GKAELAQQLYAALDQQIQQLGLAEWEPA 291 (301)
T ss_pred HHHHHHHHcCCHHHHHHHHHhhc-ccCCChHHHHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHHHHhCCccccChH
Confidence 34667788899999999999988 88999999999999999999999 8889999999988888876555699854
No 165
>KOG4162|consensus
Probab=48.52 E-value=1.2e+02 Score=29.90 Aligned_cols=98 Identities=21% Similarity=0.129 Sum_probs=54.9
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHH-------------HHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAV-------------FTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~-------------l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
|.-.|+.|..+|.-+.|.-|.-++. .-+..+-.+.-+. ...+-+++. -|-.+|..-...|+-+.
T Consensus 653 wllaa~~~~~~~~~~~a~~CL~Ea~-~~~~l~~~~~~~~G~~~~~~~~~~EA~~af~~Al~--ldP~hv~s~~Ala~~ll 729 (799)
T KOG4162|consen 653 WLLAADLFLLSGNDDEARSCLLEAS-KIDPLSASVYYLRGLLLEVKGQLEEAKEAFLVALA--LDPDHVPSMTALAELLL 729 (799)
T ss_pred HHHHHHHHHhcCCchHHHHHHHHHH-hcchhhHHHHHHhhHHHHHHHhhHHHHHHHHHHHh--cCCCCcHHHHHHHHHHH
Confidence 4456667777776666666666655 3222222222111 111112222 25555666666666666
Q ss_pred hhcCCCchhhhhHHH------------------HHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 121 EGISGGDWSARNKLK------------------AYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 121 e~~~ggDW~rrnrLK------------------vy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
+. | .+|+.-+ -|-|--....||+++|+++|.-++
T Consensus 730 e~---G--~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k~~Gd~~~Aaecf~aa~ 781 (799)
T KOG4162|consen 730 EL---G--SPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFKKLGDSKQAAECFQAAL 781 (799)
T ss_pred Hh---C--CcchHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHccchHHHHHHHHHHH
Confidence 64 4 3444433 267777888999999999997654
No 166
>PF01895 PhoU: PhoU domain; InterPro: IPR008170 This family contains phosphate regulatory proteins including PhoU. PhoU proteins are known to play a role in the regulation of phosphate uptake. The PhoU domain is composed of a three helix bundle []. The PhoU protein contains two copies of this domain. The domain binds to an iron cluster via its conserved E/DXXXD motif. Deletion of PhoU activates constitutive expression of the phosphate ABC transporter and allows phosphate transport, but causes a growth defect; suggesting that the protein has some secondary function []. ; PDB: 2I0M_A 1T72_B 1T8B_A 1SUM_B 1VCT_A 2BKN_A 2BKP_A 2BKO_A.
Probab=47.23 E-value=53 Score=21.32 Aligned_cols=52 Identities=25% Similarity=0.163 Sum_probs=32.7
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH-------hcCCCchHHH--HHHHHhhHhHHHhhcCHHHH
Q psy398 17 DEKFYTEMKDANEKRLKELDLEHE-------KNMMDEEDQV--SGIWQAKLDYLCSIGDRAEA 70 (162)
Q Consensus 17 D~~~~~~~~~~n~~~l~~Le~~lk-------~Nlg~~~~eI--r~~~~~~a~~~~~iGD~~~A 70 (162)
|.++.+.+.+ .+.++.++..++. .+.. +..++ -.++.....++.||||.-..
T Consensus 23 d~~~a~~i~~-~e~~id~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~i~~~lERigD~~~n 83 (88)
T PF01895_consen 23 DSELAQEIIQ-LEEEIDELYREIRRQILKILKNQN-PLEELRELVGLLRIARDLERIGDHAVN 83 (88)
T ss_dssp -HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHC-GHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHH-hHHHHHHHHHHHHHHHHHHHhhCC-CHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6677766644 4777777777775 2212 01223 34588899999999997544
No 167
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=46.23 E-value=95 Score=28.20 Aligned_cols=96 Identities=21% Similarity=0.178 Sum_probs=55.0
Q ss_pred hHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhh-----cCCCchhhh
Q psy398 57 KLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEG-----ISGGDWSAR 131 (162)
Q Consensus 57 ~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~-----~~ggDW~rr 131 (162)
+..++-=.||++.+++...++. +...--+..-++++...-..+...-+++.....+..|+.+++. |.|.=
T Consensus 194 ll~~vGF~gdR~~GL~~L~~~~-~~~~i~~~la~L~LL~y~~~~~~~~~~~~~~~~~~~a~~lL~~~~~~yP~s~l---- 268 (468)
T PF10300_consen 194 LLSFVGFSGDRELGLRLLWEAS-KSENIRSPLAALVLLWYHLVVPSFLGIDGEDVPLEEAEELLEEMLKRYPNSAL---- 268 (468)
T ss_pred HHhhcCcCCcHHHHHHHHHHHh-ccCCcchHHHHHHHHHHHHHHHHHcCCcccCCCHHHHHHHHHHHHHhCCCcHH----
Confidence 3344445699999999999999 6555455555555554444432112222245566666666543 22332
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
+..++|-.+...|+..+|-++|-.++
T Consensus 269 --fl~~~gR~~~~~g~~~~Ai~~~~~a~ 294 (468)
T PF10300_consen 269 --FLFFEGRLERLKGNLEEAIESFERAI 294 (468)
T ss_pred --HHHHHHHHHHHhcCHHHHHHHHHHhc
Confidence 34566666666666666666665544
No 168
>KOG2076|consensus
Probab=45.67 E-value=2.6e+02 Score=28.03 Aligned_cols=91 Identities=15% Similarity=0.144 Sum_probs=62.2
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh-
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR- 131 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr- 131 (162)
++.-+|+=|.+.||.++|+.+.-.+- .-. ...+ |+++.+--.+..+ ++|....-+-.||=++- +.+|.-.
T Consensus 175 ay~tL~~IyEqrGd~eK~l~~~llAA-HL~-p~d~--e~W~~ladls~~~-~~i~qA~~cy~rAI~~~----p~n~~~~~ 245 (895)
T KOG2076|consen 175 AYYTLGEIYEQRGDIEKALNFWLLAA-HLN-PKDY--ELWKRLADLSEQL-GNINQARYCYSRAIQAN----PSNWELIY 245 (895)
T ss_pred hHHHHHHHHHHcccHHHHHHHHHHHH-hcC-CCCh--HHHHHHHHHHHhc-ccHHHHHHHHHHHHhcC----CcchHHHH
Confidence 67788888899999999988887765 322 2222 8888888888888 88888888888887763 4466532
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHh
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld 157 (162)
+|--.|+ -.|+.+.|+..|+.
T Consensus 246 ers~L~~-----~~G~~~~Am~~f~~ 266 (895)
T KOG2076|consen 246 ERSSLYQ-----KTGDLKRAMETFLQ 266 (895)
T ss_pred HHHHHHH-----HhChHHHHHHHHHH
Confidence 3332233 34666666666654
No 169
>KOG4555|consensus
Probab=45.35 E-value=1.6e+02 Score=23.51 Aligned_cols=96 Identities=23% Similarity=0.223 Sum_probs=62.2
Q ss_pred HHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCch
Q psy398 49 QVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDW 128 (162)
Q Consensus 49 eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW 128 (162)
+....+.-.|--.++.||++.|++-|.+++ .-|.---+...---.-.|+. ++-.....-+++|=++ .||-
T Consensus 41 e~S~~LEl~~valaE~g~Ld~AlE~F~qal-~l~P~raSayNNRAQa~RLq----~~~e~ALdDLn~AleL-----ag~~ 110 (175)
T KOG4555|consen 41 KASRELELKAIALAEAGDLDGALELFGQAL-CLAPERASAYNNRAQALRLQ----GDDEEALDDLNKALEL-----AGDQ 110 (175)
T ss_pred HHHHHHHHHHHHHHhccchHHHHHHHHHHH-HhcccchHhhccHHHHHHHc----CChHHHHHHHHHHHHh-----cCcc
Confidence 445567778889999999999999999988 66644444443334445543 7777788888888777 3443
Q ss_pred hhhhH-HHHHHHH-HHh------hhhcHHHHHHH
Q psy398 129 SARNK-LKAYEGV-YCL------AIRQYCRAAEL 154 (162)
Q Consensus 129 ~rrnr-LKvy~gl-~~l------~~R~fk~AA~l 154 (162)
.|..+ --|-.|+ |.+ +.+||..||+|
T Consensus 111 trtacqa~vQRg~lyRl~g~dd~AR~DFe~AA~L 144 (175)
T KOG4555|consen 111 TRTACQAFVQRGLLYRLLGNDDAARADFEAAAQL 144 (175)
T ss_pred chHHHHHHHHHHHHHHHhCchHHHHHhHHHHHHh
Confidence 33221 1223343 332 45688888875
No 170
>PRK14957 DNA polymerase III subunits gamma and tau; Provisional
Probab=45.23 E-value=2.7e+02 Score=26.19 Aligned_cols=94 Identities=9% Similarity=0.034 Sum_probs=61.7
Q ss_pred hHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH-------hcCCCchHHHHH--------HHHhhHhHHHhhcCHH
Q psy398 4 YYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE-------KNMMDEEDQVSG--------IWQAKLDYLCSIGDRA 68 (162)
Q Consensus 4 ~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk-------~Nlg~~~~eIr~--------~~~~~a~~~~~iGD~~ 68 (162)
+.+.++.+.+.+++++.++.+-+...--+-.+...++ ++.. .+.|+. .+.++.+.+. -||..
T Consensus 186 ~L~~il~~egi~~e~~Al~~Ia~~s~GdlR~alnlLek~i~~~~~~It--~~~V~~~l~~~~~~~v~~ll~Al~-~~d~~ 262 (546)
T PRK14957 186 QLKIILAKENINSDEQSLEYIAYHAKGSLRDALSLLDQAISFCGGELK--QAQIKQMLGIIDSEEVYSIINAII-DNDPK 262 (546)
T ss_pred HHHHHHHHcCCCCCHHHHHHHHHHcCCCHHHHHHHHHHHHHhccCCCC--HHHHHHHHccCCHHHHHHHHHHHH-cCCHH
Confidence 4566677778888888887766655443333333332 2233 455555 2444444443 37999
Q ss_pred HHHHHHHHhcCCCCCChHHHHHHHH-HHHHHHHH
Q psy398 69 EAFKLADSKFADKTNPKSHRIDAVF-TLFRIAYF 101 (162)
Q Consensus 69 ~A~~ay~~~~~~~~~s~g~kiD~~l-~~iRv~~f 101 (162)
.++.....+. ++.......+|++. ..+|+.++
T Consensus 263 ~~l~~~~~l~-~~~~~~~~~l~~l~~~~~r~~~~ 295 (546)
T PRK14957 263 AILPAIKNLA-LTESSADAVLDRIAEIWFACCIY 295 (546)
T ss_pred HHHHHHHHHH-HhCCCHHHHHHHHHHHHHHHHHH
Confidence 9999999999 88888899999888 67775544
No 171
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=44.93 E-value=2.1e+02 Score=24.94 Aligned_cols=96 Identities=14% Similarity=0.086 Sum_probs=61.8
Q ss_pred hHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh-----------------
Q psy398 57 KLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL----------------- 119 (162)
Q Consensus 57 ~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l----------------- 119 (162)
.+..+.+.|+.+.|.+.+++.. +.. +.+. ++...+..+.+-. +||..+.+.+.+..+.
T Consensus 159 ~a~l~l~~~~~~~Al~~l~~l~-~~~--P~~~-~~l~ll~~~~~~~-~d~~~a~~~l~~l~k~~~~~~~~~~~l~~~a~~ 233 (409)
T TIGR00540 159 RTRILLAQNELHAARHGVDKLL-EMA--PRHK-EVLKLAEEAYIRS-GAWQALDDIIDNMAKAGLFDDEEFADLEQKAEI 233 (409)
T ss_pred HHHHHHHCCCHHHHHHHHHHHH-HhC--CCCH-HHHHHHHHHHHHH-hhHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHH
Confidence 4777777888888888888877 432 2333 5666667777777 8887666666665543
Q ss_pred --hhhcCCC----------chh-------hhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 120 --IEGISGG----------DWS-------ARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 120 --~e~~~gg----------DW~-------rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
+++ +. -|. ...++....|-.++..+++.+|-+.+-+.+
T Consensus 234 ~~l~~--~~~~~~~~~L~~~~~~~p~~~~~~~~l~~~~a~~l~~~g~~~~A~~~l~~~l 290 (409)
T TIGR00540 234 GLLDE--AMADEGIDGLLNWWKNQPRHRRHNIALKIALAEHLIDCDDHDSAQEIIFDGL 290 (409)
T ss_pred HHHHH--HHHhcCHHHHHHHHHHCCHHHhCCHHHHHHHHHHHHHCCChHHHHHHHHHHH
Confidence 100 00 132 134667777888888888888887776654
No 172
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=43.90 E-value=1.1e+02 Score=23.76 Aligned_cols=50 Identities=16% Similarity=0.216 Sum_probs=39.7
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHh
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFH 102 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~ 102 (162)
..+.++++.=|+..|+++.|+.+|.+-+ +--+....+|-++.++=++-+.
T Consensus 47 ~qAqL~l~yayy~~~~y~~A~a~~~rFi--rLhP~hp~vdYa~Y~~gL~~~~ 96 (142)
T PF13512_consen 47 EQAQLDLAYAYYKQGDYEEAIAAYDRFI--RLHPTHPNVDYAYYMRGLSYYE 96 (142)
T ss_pred HHHHHHHHHHHHHccCHHHHHHHHHHHH--HhCCCCCCccHHHHHHHHHHHH
Confidence 5788999999999999999999999977 3333444688887776666555
No 173
>KOG3060|consensus
Probab=43.70 E-value=2.2e+02 Score=24.77 Aligned_cols=106 Identities=20% Similarity=0.198 Sum_probs=66.0
Q ss_pred hHHHHHHhcCCCcCHHHHHHHHH------HHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHh
Q psy398 4 YYKEVCEEFGWKMDEKFYTEMKD------ANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSK 77 (162)
Q Consensus 4 ~Y~~l~~~~~~~~D~~~~~~~~~------~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~ 77 (162)
||..++++- |-|.......-+ +|-.-+++|.+=++.=++ | -++|..+++-|.+.||++.|.=||+++
T Consensus 108 ~y~~lL~dd--pt~~v~~KRKlAilka~GK~l~aIk~ln~YL~~F~~--D---~EAW~eLaeiY~~~~~f~kA~fClEE~ 180 (289)
T KOG3060|consen 108 YYESLLEDD--PTDTVIRKRKLAILKAQGKNLEAIKELNEYLDKFMN--D---QEAWHELAEIYLSEGDFEKAAFCLEEL 180 (289)
T ss_pred HHHHHhccC--cchhHHHHHHHHHHHHcCCcHHHHHHHHHHHHHhcC--c---HHHHHHHHHHHHhHhHHHHHHHHHHHH
Confidence 566666654 556655543222 223334444444443334 4 458999999999999999999999998
Q ss_pred cCCCCCChHHHHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHhhh
Q psy398 78 FADKTNPKSHRIDAVFTLFRIAYFHG--CDIQAMTAAIEKATELI 120 (162)
Q Consensus 78 ~~~~~~s~g~kiD~~l~~iRv~~f~~--~d~~~v~~~i~ka~~l~ 120 (162)
. -.++..|..=--+.=+| +..| .|.....+|.++|=++.
T Consensus 181 l--l~~P~n~l~f~rlae~~--Yt~gg~eN~~~arkyy~~alkl~ 221 (289)
T KOG3060|consen 181 L--LIQPFNPLYFQRLAEVL--YTQGGAENLELARKYYERALKLN 221 (289)
T ss_pred H--HcCCCcHHHHHHHHHHH--HHHhhHHHHHHHHHHHHHHHHhC
Confidence 8 55666665533333332 3332 56777777777776663
No 174
>KOG3616|consensus
Probab=43.70 E-value=1.1e+02 Score=30.78 Aligned_cols=75 Identities=20% Similarity=0.107 Sum_probs=51.9
Q ss_pred hcCCCchHHHHHH-HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHH------HHHHHHHHHhcCCHHHHHHHH
Q psy398 41 KNMMDEEDQVSGI-WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAV------FTLFRIAYFHGCDIQAMTAAI 113 (162)
Q Consensus 41 ~Nlg~~~~eIr~~-~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~------l~~iRv~~f~~~d~~~v~~~i 113 (162)
+|+- +--++-+ +-..|+||+.-||++-|.+.|.+.- -.+.-|||. -.-.|++..+-+--..+..||
T Consensus 756 dniq--dqk~~s~yy~~iadhyan~~dfe~ae~lf~e~~-----~~~dai~my~k~~kw~da~kla~e~~~~e~t~~~yi 828 (1636)
T KOG3616|consen 756 DNIQ--DQKTASGYYGEIADHYANKGDFEIAEELFTEAD-----LFKDAIDMYGKAGKWEDAFKLAEECHGPEATISLYI 828 (1636)
T ss_pred HHhh--hhccccccchHHHHHhccchhHHHHHHHHHhcc-----hhHHHHHHHhccccHHHHHHHHHHhcCchhHHHHHH
Confidence 5555 5555555 4468999999999999999998753 124455553 345666665425566778899
Q ss_pred HHHHhhhhh
Q psy398 114 EKATELIEG 122 (162)
Q Consensus 114 ~ka~~l~e~ 122 (162)
.||+.+-+.
T Consensus 829 akaedldeh 837 (1636)
T KOG3616|consen 829 AKAEDLDEH 837 (1636)
T ss_pred HhHHhHHhh
Confidence 999998654
No 175
>KOG0624|consensus
Probab=43.31 E-value=1.8e+02 Score=26.60 Aligned_cols=103 Identities=20% Similarity=0.147 Sum_probs=70.0
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHH-----------HHHhcCCHHHHHHHHHHH
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRI-----------AYFHGCDIQAMTAAIEKA 116 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv-----------~~f~~~d~~~v~~~i~ka 116 (162)
.+.--|....|.-+.+.|.++.|..-|...+ +...|.|...+.-=.++-+ ..+-++|...|..+|+
T Consensus 103 pDF~~ARiQRg~vllK~Gele~A~~DF~~vl-~~~~s~~~~~eaqskl~~~~e~~~l~~ql~s~~~~GD~~~ai~~i~-- 179 (504)
T KOG0624|consen 103 PDFMAARIQRGVVLLKQGELEQAEADFDQVL-QHEPSNGLVLEAQSKLALIQEHWVLVQQLKSASGSGDCQNAIEMIT-- 179 (504)
T ss_pred ccHHHHHHHhchhhhhcccHHHHHHHHHHHH-hcCCCcchhHHHHHHHHhHHHHHHHHHHHHHHhcCCchhhHHHHHH--
Confidence 3344567788999999999999999999999 8888877776643332222 1111256666655554
Q ss_pred HhhhhhcCCCchh---hhhHHHHHH--HHHHhhhhcHHHHHHHhHh
Q psy398 117 TELIEGISGGDWS---ARNKLKAYE--GVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 117 ~~l~e~~~ggDW~---rrnrLKvy~--gl~~l~~R~fk~AA~lfld 157 (162)
.++| -.-|+ |+-|-|+|. |=-..++-|.+.|++|=-|
T Consensus 180 -~llE---i~~Wda~l~~~Rakc~i~~~e~k~AI~Dlk~askLs~D 221 (504)
T KOG0624|consen 180 -HLLE---IQPWDASLRQARAKCYIAEGEPKKAIHDLKQASKLSQD 221 (504)
T ss_pred -HHHh---cCcchhHHHHHHHHHHHhcCcHHHHHHHHHHHHhcccc
Confidence 4555 36787 667777764 4556778888888887444
No 176
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=42.94 E-value=56 Score=24.69 Aligned_cols=75 Identities=12% Similarity=0.060 Sum_probs=37.6
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH-------hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHH
Q psy398 17 DEKFYTEMKDANEKRLKELDLEHE-------KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRI 89 (162)
Q Consensus 17 D~~~~~~~~~~n~~~l~~Le~~lk-------~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~ki 89 (162)
|+++-.++.++.+++.+.-+.+.. +... .+..=+-...+|+-+..-|+.++|..+|.+++ -=|..++.-+
T Consensus 24 dP~frkkL~~rrk~~~~~~~~~~~~~~~~~~d~~~--~e~~Fl~qV~lGE~L~~~G~~~~aa~hf~nAl-~V~~qP~~LL 100 (121)
T PF02064_consen 24 DPDFRKKLRERRKKQAKAQEQAKSEKLPDPNDPEE--MERFFLQQVQLGEQLLAQGDYEEAAEHFYNAL-KVCPQPAELL 100 (121)
T ss_dssp -------------------------SSSHHHHHHH--HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHH-HTSSSHHHHH
T ss_pred ChHHHHHHHHHHHHHHHHHhhhhcccccCcCCHHH--HHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHH-HhCCCHHHHH
Confidence 566666666555544443111111 1112 34445567789999999999999999999999 8888888777
Q ss_pred HHHHH
Q psy398 90 DAVFT 94 (162)
Q Consensus 90 D~~l~ 94 (162)
.|.=.
T Consensus 101 ~i~q~ 105 (121)
T PF02064_consen 101 QIYQK 105 (121)
T ss_dssp HHHHH
T ss_pred HHHHh
Confidence 66543
No 177
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=42.30 E-value=2.2e+02 Score=24.32 Aligned_cols=111 Identities=17% Similarity=0.155 Sum_probs=78.4
Q ss_pred hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc-----CCHHHHHHHHHH
Q psy398 41 KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG-----CDIQAMTAAIEK 115 (162)
Q Consensus 41 ~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~-----~d~~~v~~~i~k 115 (162)
...+ ..+ +.++.+++-=+++.||++.|+....+-. +.-++..-+|-++.+.=++.|++ +|...+...+..
T Consensus 64 ~p~s--~~~-~qa~l~l~yA~Yk~~~y~~A~~~~drFi--~lyP~~~n~dY~~YlkgLs~~~~i~~~~rDq~~~~~A~~~ 138 (254)
T COG4105 64 HPFS--PYS-EQAQLDLAYAYYKNGEYDLALAYIDRFI--RLYPTHPNADYAYYLKGLSYFFQIDDVTRDQSAARAAFAA 138 (254)
T ss_pred CCCC--ccc-HHHHHHHHHHHHhcccHHHHHHHHHHHH--HhCCCCCChhHHHHHHHHHHhccCCccccCHHHHHHHHHH
Confidence 5555 555 9999999999999999999999999966 34455667898988888887775 699999999999
Q ss_pred HHhhhhhcCCCchhhh------------hHHHHHHHHHHhhhhcHHHHHHHhH
Q psy398 116 ATELIEGISGGDWSAR------------NKLKAYEGVYCLAIRQYCRAAELFV 156 (162)
Q Consensus 116 a~~l~e~~~ggDW~rr------------nrLKvy~gl~~l~~R~fk~AA~lfl 156 (162)
.+.++...-......- +..-..=|-|++-.+.+-.|+.=|-
T Consensus 139 f~~~i~ryPnS~Ya~dA~~~i~~~~d~LA~~Em~IaryY~kr~~~~AA~nR~~ 191 (254)
T COG4105 139 FKELVQRYPNSRYAPDAKARIVKLNDALAGHEMAIARYYLKRGAYVAAINRFE 191 (254)
T ss_pred HHHHHHHCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHH
Confidence 9999887222222111 2222344555555555556655443
No 178
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=42.28 E-value=2.3e+02 Score=24.66 Aligned_cols=62 Identities=16% Similarity=0.166 Sum_probs=46.5
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l 119 (162)
+.-.+....+.||.+.|...|.++. +.+.+.. +-..+...++.+.. +++......++++...
T Consensus 121 ~llaA~aA~~~g~~~~A~~~l~~A~-~~~~~~~--~~~~l~~a~l~l~~-g~~~~Al~~l~~~~~~ 182 (398)
T PRK10747 121 YLLAAEAAQQRGDEARANQHLERAA-ELADNDQ--LPVEITRVRIQLAR-NENHAARHGVDKLLEV 182 (398)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH-hcCCcch--HHHHHHHHHHHHHC-CCHHHHHHHHHHHHhc
Confidence 4444666699999999999999998 5444433 33344557888888 9999998888887665
No 179
>COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=41.85 E-value=1.5e+02 Score=26.44 Aligned_cols=131 Identities=15% Similarity=0.145 Sum_probs=76.1
Q ss_pred HHHHHHHHHHHHHHHHHHHH--hcCCCchHHHHHHH--------HhhHhHHHhhcCHHHHHHHHHHh------cCCCCCC
Q psy398 21 YTEMKDANEKRLKELDLEHE--KNMMDEEDQVSGIW--------QAKLDYLCSIGDRAEAFKLADSK------FADKTNP 84 (162)
Q Consensus 21 ~~~~~~~n~~~l~~Le~~lk--~Nlg~~~~eIr~~~--------~~~a~~~~~iGD~~~A~~ay~~~------~~~~~~s 84 (162)
+.++-++-...-+-|+..|+ +.++ +.-+|+-. ..+.-.|++.|.+..|+....-. ++||..
T Consensus 87 irtLiekf~~~~dsl~dqi~v~~~~i--ewA~rEkr~fLr~~Le~Kli~l~y~~~~YsdalalIn~ll~ElKk~DDK~~- 163 (421)
T COG5159 87 IRTLIEKFPYSSDSLEDQIKVLTALI--EWADREKRKFLRLELECKLIYLLYKTGKYSDALALINPLLHELKKYDDKIN- 163 (421)
T ss_pred HHHHHHhcCCCCccHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHhhcCccc-
Confidence 33333333333344455555 4445 44444321 23455677788888777654432 213321
Q ss_pred hHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 85 KSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 85 ~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
-+++-+.-=.+.... .|.+.-+..++.|+.+... .=+-.....-|-.-.||+...-|||+.|..-|+++.
T Consensus 164 ---Li~vhllESKvyh~i-rnv~KskaSLTaArt~Ans-~YCPpqlqa~lDL~sGIlhcdd~dyktA~SYF~Ea~ 233 (421)
T COG5159 164 ---LITVHLLESKVYHEI-RNVSKSKASLTAARTLANS-AYCPPQLQAQLDLLSGILHCDDRDYKTASSYFIEAL 233 (421)
T ss_pred ---eeehhhhhHHHHHHH-HhhhhhhhHHHHHHHHhhc-cCCCHHHHHHHHHhccceeeccccchhHHHHHHHHH
Confidence 122222222333333 6777777788888777432 122344567888899999999999999999999875
No 180
>KOG2003|consensus
Probab=41.51 E-value=2.6e+02 Score=26.64 Aligned_cols=70 Identities=16% Similarity=0.105 Sum_probs=49.7
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWS 129 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~ 129 (162)
.+..+|+.|-+-||+..|+.||-..+ .|-.+.-..|.-+ ---.+.. +=|.....|++||.-+. |.-..|.
T Consensus 594 ilskl~dlydqegdksqafq~~ydsy-ryfp~nie~iewl---~ayyidt-qf~ekai~y~ekaaliq--p~~~kwq 663 (840)
T KOG2003|consen 594 ILSKLADLYDQEGDKSQAFQCHYDSY-RYFPCNIETIEWL---AAYYIDT-QFSEKAINYFEKAALIQ--PNQSKWQ 663 (840)
T ss_pred HHHHHHHHhhcccchhhhhhhhhhcc-cccCcchHHHHHH---HHHHHhh-HHHHHHHHHHHHHHhcC--ccHHHHH
Confidence 35689999999999999999999988 7777665555432 1222333 55777888888886543 3356786
No 181
>PF10130 PIN_2: PIN domain; InterPro: IPR019298 This entry represents a set of bacterial and archaeal proteins that are predicted to be RNases (from similarities to 5'-exonucleases).
Probab=40.68 E-value=74 Score=24.15 Aligned_cols=61 Identities=18% Similarity=0.206 Sum_probs=45.1
Q ss_pred HHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchh
Q psy398 67 RAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWS 129 (162)
Q Consensus 67 ~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~ 129 (162)
.+++.+...+.. .+..-+...+...+..++-.+.. -+-..+..++.+|..++..++..||-
T Consensus 36 ~~Ei~kh~~~I~-~k~~l~~~~~~~~l~~l~~~I~i-v~~~~~~~~~~~A~~~~~~~D~~D~p 96 (133)
T PF10130_consen 36 LEEIEKHLPKIA-KKSKLSEEELEEVLNILFSRIKI-VPEEIYSENIEEAREIIRDRDPDDWP 96 (133)
T ss_pred HHHHHHHHHHHH-HHhCCCHHHHHHHHHHHHhheEE-ecHHHhHHHHHHHHHHhcCCCcchHH
Confidence 445666666666 66666677777888888777777 78889999999999998544456764
No 182
>KOG2003|consensus
Probab=40.11 E-value=79 Score=29.93 Aligned_cols=62 Identities=24% Similarity=0.233 Sum_probs=50.0
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHhh
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG-CDIQAMTAAIEKATEL 119 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~-~d~~~v~~~i~ka~~l 119 (162)
.+-.|.-|-|.|.+++|+..|..+- .|- -.-+|++--++||+-..| .|.......++|++.+
T Consensus 663 qlmiasc~rrsgnyqka~d~yk~~h-rkf---pedldclkflvri~~dlgl~d~key~~klek~eki 725 (840)
T KOG2003|consen 663 QLMIASCFRRSGNYQKAFDLYKDIH-RKF---PEDLDCLKFLVRIAGDLGLKDAKEYADKLEKAEKI 725 (840)
T ss_pred HHHHHHHHHhcccHHHHHHHHHHHH-HhC---ccchHHHHHHHHHhccccchhHHHHHHHHHHHHHH
Confidence 3457888999999999999999876 333 245899999999997775 6888888888888876
No 183
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=39.79 E-value=2.6e+02 Score=26.08 Aligned_cols=96 Identities=16% Similarity=0.234 Sum_probs=56.1
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG-CDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~-~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
..+..+|++|.+.|++++|.++|...+ +.......-...+....-+-.-.. .+......-.. .+.+...-+++-|
T Consensus 39 ~~~E~rA~ll~kLg~~~eA~~~y~~Li-~rNPdn~~Yy~~L~~~~g~~~~~~~~~~~~~~~~y~---~l~~~yp~s~~~~ 114 (517)
T PF12569_consen 39 AVLEKRAELLLKLGRKEEAEKIYRELI-DRNPDNYDYYRGLEEALGLQLQLSDEDVEKLLELYD---ELAEKYPRSDAPR 114 (517)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHH-HHCCCcHHHHHHHHHHHhhhcccccccHHHHHHHHH---HHHHhCccccchh
Confidence 345689999999999999999999999 888777666655555442221110 12233333222 2222222356666
Q ss_pred hhHHHHHHHHHHhhhhcHHHHHHHhHh
Q psy398 131 RNKLKAYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 131 rnrLKvy~gl~~l~~R~fk~AA~lfld 157 (162)
|--|....| -.|+..+..+|.
T Consensus 115 rl~L~~~~g------~~F~~~~~~yl~ 135 (517)
T PF12569_consen 115 RLPLDFLEG------DEFKERLDEYLR 135 (517)
T ss_pred HhhcccCCH------HHHHHHHHHHHH
Confidence 655544432 356666655554
No 184
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=39.18 E-value=1.5e+02 Score=21.50 Aligned_cols=67 Identities=15% Similarity=0.068 Sum_probs=51.9
Q ss_pred HHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhh
Q psy398 50 VSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIE 121 (162)
Q Consensus 50 Ir~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e 121 (162)
...+...++.++..-|+.+.|+..+.+.. ..... -=.++..+||.-.-. ++...+.+...+.+..+.
T Consensus 61 ~~~~~~~l~~~~~~~~~~~~a~~~~~~~l-~~dP~---~E~~~~~lm~~~~~~-g~~~~A~~~Y~~~~~~l~ 127 (146)
T PF03704_consen 61 YLDALERLAEALLEAGDYEEALRLLQRAL-ALDPY---DEEAYRLLMRALAAQ-GRRAEALRVYERYRRRLR 127 (146)
T ss_dssp HHHHHHHHHHHHHHTT-HHHHHHHHHHHH-HHSTT----HHHHHHHHHHHHHT-T-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhccCHHHHHHHHHHHH-hcCCC---CHHHHHHHHHHHHHC-cCHHHHHHHHHHHHHHHH
Confidence 34567788999999999999999999987 43332 224678889999888 999999999999988764
No 185
>COG1283 NptA Na+/phosphate symporter [Inorganic ion transport and metabolism]
Probab=38.92 E-value=2.9e+02 Score=26.12 Aligned_cols=100 Identities=17% Similarity=0.115 Sum_probs=60.2
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH--------hcCCCchHHHHHH--HHhhHhHHHhhcCHHHHHHHHHHhcCCCC----
Q psy398 17 DEKFYTEMKDANEKRLKELDLEHE--------KNMMDEEDQVSGI--WQAKLDYLCSIGDRAEAFKLADSKFADKT---- 82 (162)
Q Consensus 17 D~~~~~~~~~~n~~~l~~Le~~lk--------~Nlg~~~~eIr~~--~~~~a~~~~~iGD~~~A~~ay~~~~~~~~---- 82 (162)
|.+.++++. +-+..+++++++|+ +++. |+|.|.. +.+.+.=+.+|||.-+-+-.+.+.--+..
T Consensus 361 ~~~~~~~i~-~~e~~vd~~~~~Ik~YL~~ls~~~Ls--e~es~r~~~iid~a~~lE~IgDiie~l~~~~~kk~~~~~~fs 437 (533)
T COG1283 361 DAKKVKEIR-KLEDAVDRLYEEIKLYLARLSKEGLS--EEESRRWAEIIDAAINLEHIGDIIERLLELADKKIANGRAFS 437 (533)
T ss_pred chHHHHHHH-HHHHHHHHHHHHHHHHHHHhccccCC--HHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHhcCCCCC
Confidence 666665554 44667888888888 7777 6666654 78889999999997766655544220111
Q ss_pred ----CChHHHHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHhh
Q psy398 83 ----NPKSHRIDAVFTLFRIAYFHG--CDIQAMTAAIEKATEL 119 (162)
Q Consensus 83 ----~s~g~kiD~~l~~iRv~~f~~--~d~~~v~~~i~ka~~l 119 (162)
.-..+-.+++...+|+++-.- +|.+...+.+++=+.+
T Consensus 438 e~~~~el~~l~~~~~~n~~~a~~~l~~~D~~~ar~lv~~k~~~ 480 (533)
T COG1283 438 EDGLEELDALFALTLENLRLAISVLVTGDLELARRLVERKKRV 480 (533)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 112344556666666654431 5555555555444443
No 186
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=38.83 E-value=1.3e+02 Score=20.81 Aligned_cols=52 Identities=8% Similarity=0.059 Sum_probs=35.3
Q ss_pred HhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHH-------HHHhcCCHHHHHHHHHHHHhhhh
Q psy398 55 QAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRI-------AYFHGCDIQAMTAAIEKATELIE 121 (162)
Q Consensus 55 ~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv-------~~f~~~d~~~v~~~i~ka~~l~e 121 (162)
..+|--+-+.|++++|+.+|.... ++++.+++. .+|. ..+++|+++|+.|-+
T Consensus 10 a~~AVe~D~~gr~~eAi~~Y~~aI-----------e~L~q~~~~~pD~~~k~~yr----~ki~eY~~Rae~Lk~ 68 (75)
T cd02682 10 AINAVKAEKEGNAEDAITNYKKAI-----------EVLSQIVKNYPDSPTRLIYE----QMINEYKRRIEVLEK 68 (75)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHH-----------HHHHHHHHhCCChHHHHHHH----HHHHHHHHHHHHHHH
Confidence 446777788999999999999887 333333333 2222 367888888887744
No 187
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=38.18 E-value=82 Score=21.81 Aligned_cols=31 Identities=10% Similarity=0.077 Sum_probs=28.4
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
...+.+...+|..+...|+.++|+++++++.
T Consensus 38 ~~~~~all~lA~~~~~~G~~~~A~~~l~eAi 68 (94)
T PF12862_consen 38 SGLAYALLNLAELHRRFGHYEEALQALEEAI 68 (94)
T ss_pred HHHHHHHHHHHHHHHHhCCHHHHHHHHHHHH
Confidence 4678899999999999999999999999987
No 188
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=37.30 E-value=2.5e+02 Score=23.55 Aligned_cols=96 Identities=10% Similarity=0.024 Sum_probs=61.0
Q ss_pred HhhHh-HHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhH
Q psy398 55 QAKLD-YLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNK 133 (162)
Q Consensus 55 ~~~a~-~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnr 133 (162)
.+.|. .+.+-|++++|+.+|.... ..-..+...-+..+.+-.+.+-. +|+.....+..++-..-- +..|..-..
T Consensus 146 Y~~A~~l~~~~~~y~~Ai~af~~fl-~~yP~s~~a~~A~y~LG~~y~~~-g~~~~A~~~f~~vv~~yP---~s~~~~dAl 220 (263)
T PRK10803 146 YNAAIALVQDKSRQDDAIVAFQNFV-KKYPDSTYQPNANYWLGQLNYNK-GKKDDAAYYFASVVKNYP---KSPKAADAM 220 (263)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHH-HHCcCCcchHHHHHHHHHHHHHc-CCHHHHHHHHHHHHHHCC---CCcchhHHH
Confidence 33443 3456699999999999988 44334444456667666666667 888877777666544421 334433333
Q ss_pred HHHHHHHHHhhhhcHHHHHHHhHh
Q psy398 134 LKAYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 134 LKvy~gl~~l~~R~fk~AA~lfld 157 (162)
++ .|..+...+++.+|-+.|-.
T Consensus 221 ~k--lg~~~~~~g~~~~A~~~~~~ 242 (263)
T PRK10803 221 FK--VGVIMQDKGDTAKAKAVYQQ 242 (263)
T ss_pred HH--HHHHHHHcCCHHHHHHHHHH
Confidence 33 35556677888888877644
No 189
>PLN03077 Protein ECB2; Provisional
Probab=37.28 E-value=2.5e+02 Score=27.00 Aligned_cols=61 Identities=10% Similarity=0.052 Sum_probs=33.4
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKAT 117 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~ 117 (162)
.|..+..-|++.|+.++|++.|.+|. +....+.... ...+|.-.... +++......+....
T Consensus 556 s~n~lI~~~~~~G~~~~A~~lf~~M~-~~g~~Pd~~T--~~~ll~a~~~~-g~v~ea~~~f~~M~ 616 (857)
T PLN03077 556 SWNILLTGYVAHGKGSMAVELFNRMV-ESGVNPDEVT--FISLLCACSRS-GMVTQGLEYFHSME 616 (857)
T ss_pred hHHHHHHHHHHcCCHHHHHHHHHHHH-HcCCCCCccc--HHHHHHHHhhc-ChHHHHHHHHHHHH
Confidence 35667777888888888888888887 5433322211 22223322222 55555555554443
No 190
>KOG3617|consensus
Probab=37.23 E-value=52 Score=33.23 Aligned_cols=75 Identities=27% Similarity=0.370 Sum_probs=0.0
Q ss_pred HhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh------
Q psy398 58 LDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR------ 131 (162)
Q Consensus 58 a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr------ 131 (162)
|.|..+.|+.+.|+..|..+. |+ |+++||--|. ++ +.||..+.++ .||-..-
T Consensus 919 gqYlES~GemdaAl~~Y~~A~-D~-----------fs~VrI~C~q-Gk-------~~kAa~iA~e--sgd~AAcYhlaR~ 976 (1416)
T KOG3617|consen 919 GQYLESVGEMDAALSFYSSAK-DY-----------FSMVRIKCIQ-GK-------TDKAARIAEE--SGDKAACYHLARM 976 (1416)
T ss_pred HHHHhcccchHHHHHHHHHhh-hh-----------hhheeeEeec-cC-------chHHHHHHHh--cccHHHHHHHHHH
Q ss_pred --hHHHHHHHHHHhh-hhcHHHHHHH
Q psy398 132 --NKLKAYEGVYCLA-IRQYCRAAEL 154 (162)
Q Consensus 132 --nrLKvy~gl~~l~-~R~fk~AA~l 154 (162)
|-=+|-++|.... .+-|+.|-.+
T Consensus 977 YEn~g~v~~Av~FfTrAqafsnAIRl 1002 (1416)
T KOG3617|consen 977 YENDGDVVKAVKFFTRAQAFSNAIRL 1002 (1416)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHH
No 191
>PF12925 APP_E2: E2 domain of amyloid precursor protein; InterPro: IPR024329 Amyloid-beta precursor protein (APP, or A4) is associated with Alzheimer's disease (AD), because one of its breakdown products, amyloid-beta (A-beta), aggregates to form amyloid or senile plaques [, ]. Mutations in APP or in proteins that process APP have been linked with early-onset, familial AD. Individuals with Down's syndrome carry an extra copy of chromosome 21, which contains the APP gene, and almost invariably develop amyloid plaques and Alzheimer's symptoms. APP is important for the neurogenesis and neuronal regeneration, either through the intact protein, or through its many breakdown products []. APP consists of a large N-terminal extracellular region containing heparin-binding and copper-binding sites, a short hydrophobic transmembrane domain, and a short C-terminal intracellular domain. The N-terminal region is similar in structure to cysteine-rich growth factors and appears to function as a cell surface receptor, contributing to neurite growth, neuronal adhesion, axonogenesis and cell mobility []. APP acts as a kinesin I membrane receptor to mediate the axonal transport of beta-secretase and presenilin 1. The N-terminal domain can regulate neurite outgrowth through its binding to heparin and collagen I and IV, which are components of the extracellular matrix. APP is also coupled to apoptosis-inducing pathways, and is involved in copper homeostasis/oxidative stress through copper ion reduction, where copper-metallated APP induces neuronal death []. The C-terminal intracellular domain appears to be involved in transcription regulation through protein-protein interactions. APP can promote transcription activation through binding to APBB1/Tip60, and may bind to the adaptor protein FE65 to transactivate a wide variety of different promoters. APP can be processed by different sets of enzymes: In the non-amyloidogenic (non-plaque-forming) pathway, APP is cleaved by alpha-secretase to yield a soluble N-terminal sAPP-alpha (neuroprotective) and a membrane-bound CTF-alpha. CTF-alpha is broken-down by presenilin-containing gamma-secretase to yield soluble p3 and membrane-bound AICD (nuclear signalling). In the amyloidogenic pathway (plaque-forming), APP is broken down by beta-secretase to yield soluble sAPP-beta and membrane-bound CTF-beta. CTF-beta is broken down by gamma-secretase to yield soluble amyloid-beta and membrane-bound AICD. Amyloid-beta is required for neuronal function, but can aggregate to form amyloid plaques that seem to disrupt brain cells by clogging points of cell-cell contact. The E2 domain is the largest of the conserved domains in the amyloidogenic glycoproteins. The structure of E2 consists of two coiled-coil sub-structures connected through a continuous helix, and bears an unexpected resemblance to the spectrin family of protein structures. E2 can reversibly dimerise in solution, and the dimerisation occurs along the longest dimension of the molecule in an antiparallel orientation, which enables the N-terminal substructure of one monomer to pack against the C-terminal substructure of a second monomer. The high degree of conservation of residues at the putative dimer interface suggests that the E2 dimer observed in the crystal could be physiologically relevant. Heparin sulphate proteoglycans, the putative ligands for the precursor present in extracellular matrix, bind to E2 at a conserved and positively charged site near the dimer interface [].; PDB: 3K6B_A 3K66_A 1TKN_A 3NYL_A 3NYJ_A 3UMH_A 3UMK_A 3UMI_A 3QMK_B 3PMR_B ....
Probab=36.79 E-value=2.4e+02 Score=23.14 Aligned_cols=89 Identities=17% Similarity=0.224 Sum_probs=51.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcC-HHHHHHHHHHhcCCCCCChHHHH-HHHHHHHHH
Q psy398 21 YTEMKDANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGD-RAEAFKLADSKFADKTNPKSHRI-DAVFTLFRI 98 (162)
Q Consensus 21 ~~~~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD-~~~A~~ay~~~~~~~~~s~g~ki-D~~l~~iRv 98 (162)
-+.|.+.-++.+..||.+-. .. +-.+...|.... .+.++| +..|+++|..++ .-..+..|+| -++-..||+
T Consensus 72 k~~m~~rFQ~~v~aLE~e~~--~e--r~qL~~~H~qRV--~a~Lnerkr~al~~y~~al-~~~ppn~~~vl~~Lk~yiRa 144 (193)
T PF12925_consen 72 KKEMTQRFQKTVQALEQEAA--AE--RQQLVETHQQRV--QAMLNERKRAALENYTAAL-QADPPNPHKVLKALKKYIRA 144 (193)
T ss_dssp HHHHHHHHHHHHHHHHHHHH--HH--HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH-TCSS--HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHH--HH--HHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHH-hcCCCChHHHHHHHHHHHHH
Confidence 33455555555555655533 22 555555565554 566665 678999999999 5555555555 444477764
Q ss_pred HHHhcCCHHHHHHHHHHHHhh
Q psy398 99 AYFHGCDIQAMTAAIEKATEL 119 (162)
Q Consensus 99 ~~f~~~d~~~v~~~i~ka~~l 119 (162)
+. .|-.+.-+.-.-....
T Consensus 145 --~~-KDR~Htl~h~~H~~~~ 162 (193)
T PF12925_consen 145 --EE-KDRQHTLRHFEHLRMV 162 (193)
T ss_dssp --HH-HHHHHHHHHHHHHHHH
T ss_pred --HH-HhHHHHHHHHHHHHhc
Confidence 34 6666666655554443
No 192
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=36.55 E-value=1.2e+02 Score=19.69 Aligned_cols=23 Identities=22% Similarity=0.151 Sum_probs=17.8
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhc
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
..|-=+-+-|+++.|+..|....
T Consensus 10 ~~Av~~D~~g~~~~A~~~Y~~ai 32 (69)
T PF04212_consen 10 KKAVEADEAGNYEEALELYKEAI 32 (69)
T ss_dssp HHHHHHHHTTSHHHHHHHHHHHH
T ss_pred HHHHHHHHCCCHHHHHHHHHHHH
Confidence 34445566899999999998877
No 193
>KOG2908|consensus
Probab=36.00 E-value=1.5e+02 Score=26.68 Aligned_cols=77 Identities=22% Similarity=0.175 Sum_probs=47.4
Q ss_pred CchhHHHHHHhcCCCcCHHHH--------HHHHHHHHHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHH
Q psy398 1 MAPYYKEVCEEFGWKMDEKFY--------TEMKDANEKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAF 71 (162)
Q Consensus 1 m~p~Y~~l~~~~~~~~D~~~~--------~~~~~~n~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~ 71 (162)
|.|+|...+.++.-++.+=-+ +.+. .+++-++=||.=++ -+.-+|++.|---+...|..+..|||+.++.
T Consensus 57 ~l~lY~NFvsefe~kINplslvei~l~~~~~~~-D~~~al~~Le~i~~~~~~~~e~~av~~~~t~~~r~~L~i~DLk~~k 135 (380)
T KOG2908|consen 57 LLQLYLNFVSEFETKINPLSLVEILLVVSEQIS-DKDEALEFLEKIIEKLKEYKEPDAVIYILTEIARLKLEINDLKEIK 135 (380)
T ss_pred HHHHHHHHHHHHhhccChHHHHHHHHHHHHHhc-cHHHHHHHHHHHHHHHHhhccchhHHHHHHHHHHHHHhcccHHHHH
Confidence 357888888888766554322 2222 12222333332222 2223224488888999999999999999998
Q ss_pred HHHHHhc
Q psy398 72 KLADSKF 78 (162)
Q Consensus 72 ~ay~~~~ 78 (162)
+......
T Consensus 136 k~ldd~~ 142 (380)
T KOG2908|consen 136 KLLDDLK 142 (380)
T ss_pred HHHHHHH
Confidence 8877644
No 194
>KOG1173|consensus
Probab=35.83 E-value=1.2e+02 Score=28.87 Aligned_cols=97 Identities=13% Similarity=0.127 Sum_probs=70.3
Q ss_pred HHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHhhhhhcCC
Q psy398 48 DQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG--CDIQAMTAAIEKATELIEGISG 125 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~--~d~~~v~~~i~ka~~l~e~~~g 125 (162)
-.---||...|+=|+--|.-+.|..||..+. . ...-.| +-.+=+|+.++ ++..+..++...|-+.. .
T Consensus 343 ~~fgpaWl~fghsfa~e~EhdQAmaaY~tAa-r-l~~G~h-----lP~LYlgmey~~t~n~kLAe~Ff~~A~ai~----P 411 (611)
T KOG1173|consen 343 PTFGPAWLAFGHSFAGEGEHDQAMAAYFTAA-R-LMPGCH-----LPSLYLGMEYMRTNNLKLAEKFFKQALAIA----P 411 (611)
T ss_pred ccccHHHHHHhHHhhhcchHHHHHHHHHHHH-H-hccCCc-----chHHHHHHHHHHhccHHHHHHHHHHHHhcC----C
Confidence 3344699999999999999999999999976 2 222122 22344677764 68888889888888875 3
Q ss_pred CchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 126 GDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 126 gDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
.|.--.| +.||-.-.-..|.+|...|..++
T Consensus 412 ~Dplv~~----Elgvvay~~~~y~~A~~~f~~~l 441 (611)
T KOG1173|consen 412 SDPLVLH----ELGVVAYTYEEYPEALKYFQKAL 441 (611)
T ss_pred Ccchhhh----hhhheeehHhhhHHHHHHHHHHH
Confidence 4655455 44555566788999999998775
No 195
>PLN03077 Protein ECB2; Provisional
Probab=35.71 E-value=3.1e+02 Score=26.41 Aligned_cols=54 Identities=17% Similarity=0.065 Sum_probs=35.5
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATE 118 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~ 118 (162)
.+-+.|++.|+.+.|.+.|..+. ....+-. .+|.....+ ++...+...+++...
T Consensus 529 aLi~~y~k~G~~~~A~~~f~~~~-~d~~s~n-------~lI~~~~~~-G~~~~A~~lf~~M~~ 582 (857)
T PLN03077 529 ALLDLYVRCGRMNYAWNQFNSHE-KDVVSWN-------ILLTGYVAH-GKGSMAVELFNRMVE 582 (857)
T ss_pred HHHHHHHHcCCHHHHHHHHHhcC-CChhhHH-------HHHHHHHHc-CCHHHHHHHHHHHHH
Confidence 45689999999999999998874 4444433 333333344 666666666665443
No 196
>PF14643 DUF4455: Domain of unknown function (DUF4455)
Probab=35.60 E-value=3.4e+02 Score=24.67 Aligned_cols=117 Identities=15% Similarity=0.246 Sum_probs=76.3
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcC---------------HHHHHHHHHHhcCCCCC
Q psy398 19 KFYTEMKDANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGD---------------RAEAFKLADSKFADKTN 83 (162)
Q Consensus 19 ~~~~~~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD---------------~~~A~~ay~~~~~~~~~ 83 (162)
.++.......+..++.++.-+++... ....--.++..|+.+++. ...-++..+...+..+.
T Consensus 312 plv~~~q~~~e~~le~l~~~~E~~a~----~~~~~~~~L~~f~~~~~~lwd~h~~~l~~~e~~l~~~l~~~r~~~~~~~q 387 (473)
T PF14643_consen 312 PLVGELQSEFEEELEKLDKSFEELAK----QTEAQSEDLFKFFQEAAQLWDEHRKKLSKQEEELEKRLEQCREKHDQENQ 387 (473)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence 35667778888888888877762111 112233457777776664 33333444444434456
Q ss_pred ChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHH
Q psy398 84 PKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCR 150 (162)
Q Consensus 84 s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~ 150 (162)
..-..+|+++..+|-+ .+-..+..++++|.++++. -++++..|.+-..-....|-.
T Consensus 388 ~~E~~Ld~~~d~lRq~----s~ee~L~~~l~~~~~~Ld~-------Ie~~Y~~fh~~~~~~~~~yP~ 443 (473)
T PF14643_consen 388 EKEAKLDIALDRLRQA----SSEEKLKEHLEKALDLLDQ-------IEEEYEDFHKKQTAIVMEYPE 443 (473)
T ss_pred HHHHHHHHHHHHHHhC----CCHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHH
Confidence 6778899999999987 8899999999999999876 455555555555544555433
No 197
>PF10938 YfdX: YfdX protein; InterPro: IPR021236 YfdX is a protein found in Proteobacteria of unknown function. The protein coding for this gene is regulated by EvgA in Escherichia coli []. ; PDB: 3DZA_C.
Probab=34.96 E-value=1.6e+02 Score=22.86 Aligned_cols=102 Identities=23% Similarity=0.178 Sum_probs=60.1
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHH------------HHH------HHHHhcCCHHHHHHHHHH
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFT------------LFR------IAYFHGCDIQAMTAAIEK 115 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~------------~iR------v~~f~~~d~~~v~~~i~k 115 (162)
+...|.....-||.+.|.+...++. .+-..... .+=.|. .|- +.=++ ......+..+.+
T Consensus 5 ~i~~Ar~aL~~g~~~~A~~~L~~A~-~~l~~~~~-~~p~~~~~~~~~~~~~~~~iPI~~~~~v~d~~-~~~~~~~~ai~~ 81 (155)
T PF10938_consen 5 DIQKARLALFQGDTDEAKKLLEDAQ-GKLDAARA-DDPKLAKAEKILPPAKDDLIPIDAEVIVIDDY-VPTPEKKAAIKT 81 (155)
T ss_dssp HHHHHHHHHCTT-HHHHHHHHHHHH-HHHTS-HH-HHHCCB-TT-S--SSSS-EEEEEEEEEEE-------HHHHHHHHH
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH-HHHHHHHh-cChHhHhhhhccccCCCceEEEeeEEEEeecc-CChHHHHHHHHH
Confidence 4456666677799999888887765 21111110 000011 111 11124 578899999999
Q ss_pred HHhhhhhcCCCchh-hh---------------------hHHHHHHHHHHhhhhcHHHHHHHhHhhccC
Q psy398 116 ATELIEGISGGDWS-AR---------------------NKLKAYEGVYCLAIRQYCRAAELFVDVIRS 161 (162)
Q Consensus 116 a~~l~e~~~ggDW~-rr---------------------nrLKvy~gl~~l~~R~fk~AA~lfld~~~T 161 (162)
|+..+.. ||+. -+ -.=.|+.+..+|..++|.+|...+-.+..|
T Consensus 82 a~~~l~~---g~~~~A~~~L~~~~~ei~~~~~~lPL~~~~~av~~A~~ll~~~k~~eA~~aL~~A~~~ 146 (155)
T PF10938_consen 82 ANELLKK---GDKQAAREILKLAGSEIDITTALLPLAQTPAAVKQAAALLDEGKYYEANAALKQALDG 146 (155)
T ss_dssp HHHHHHT---T-HHHHHHHHHHTT-EEEEEEEEEEHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHT
T ss_pred HHHHHhC---CCHHHHHHHHHHhcccceeeeeeCCHHhhHHHHHHHHHHHHCCCHHHHHHHHHHHhcC
Confidence 9999996 7775 23 233468999999999999999888766544
No 198
>KOG1129|consensus
Probab=34.70 E-value=95 Score=28.22 Aligned_cols=93 Identities=12% Similarity=0.165 Sum_probs=55.7
Q ss_pred hhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHH
Q psy398 56 AKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLK 135 (162)
Q Consensus 56 ~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLK 135 (162)
..|.-+...|..+.|++.|.+..+-..+... .+..|-++.|++++-.....|-.++=++ |.- ..-|-
T Consensus 295 g~ARi~eam~~~~~a~~lYk~vlk~~~~nvE-----aiAcia~~yfY~~~PE~AlryYRRiLqm-----G~~---speLf 361 (478)
T KOG1129|consen 295 GQARIHEAMEQQEDALQLYKLVLKLHPINVE-----AIACIAVGYFYDNNPEMALRYYRRILQM-----GAQ---SPELF 361 (478)
T ss_pred hhHHHHHHHHhHHHHHHHHHHHHhcCCccce-----eeeeeeeccccCCChHHHHHHHHHHHHh-----cCC---ChHHH
Confidence 3455555566666666666666522222211 1233445666644445555555555554 211 13355
Q ss_pred HHHHHHHhhhhcHHHHHHHhHhhccC
Q psy398 136 AYEGVYCLAIRQYCRAAELFVDVIRS 161 (162)
Q Consensus 136 vy~gl~~l~~R~fk~AA~lfld~~~T 161 (162)
+=-||-||...+|.-+-.+|..++||
T Consensus 362 ~NigLCC~yaqQ~D~~L~sf~RAlst 387 (478)
T KOG1129|consen 362 CNIGLCCLYAQQIDLVLPSFQRALST 387 (478)
T ss_pred hhHHHHHHhhcchhhhHHHHHHHHhh
Confidence 56799999999999999999999887
No 199
>smart00101 14_3_3 14-3-3 homologues. 14-3-3 homologues mediates signal transduction by binding to phosphoserine-containing proteins. They are involved in growth factor signalling and also interact with MEK kinases.
Probab=33.66 E-value=83 Score=26.45 Aligned_cols=32 Identities=19% Similarity=0.124 Sum_probs=24.2
Q ss_pred HHHH-HHhhHhHHHhh-cCHHHHHHHHHHhcCCCC
Q psy398 50 VSGI-WQAKLDYLCSI-GDRAEAFKLADSKFADKT 82 (162)
Q Consensus 50 Ir~~-~~~~a~~~~~i-GD~~~A~~ay~~~~~~~~ 82 (162)
||=| .+..+-||+.| ++.+.|.+.-..++ +..
T Consensus 168 irLgLaLN~SVF~yEI~~~~~~A~~lAk~af-d~A 201 (244)
T smart00101 168 IRLGLALNFSVFYYEILNSPDRACNLAKQAF-DEA 201 (244)
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH-HHH
Confidence 4554 56788999999 79999997777777 544
No 200
>KOG4626|consensus
Probab=33.60 E-value=66 Score=31.54 Aligned_cols=48 Identities=10% Similarity=0.187 Sum_probs=37.8
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCC-----------------ChHHHHHHHHHHHHHH
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTN-----------------PKSHRIDAVFTLFRIA 99 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~-----------------s~g~kiD~~l~~iRv~ 99 (162)
-+++..+|+.|-+.|+++.|+..|+.+. +... ....-.+.++.-+|+-
T Consensus 116 ae~ysn~aN~~kerg~~~~al~~y~~ai-el~p~fida~inla~al~~~~~~~~a~~~~~~alqln 180 (966)
T KOG4626|consen 116 AEAYSNLANILKERGQLQDALALYRAAI-ELKPKFIDAYINLAAALVTQGDLELAVQCFFEALQLN 180 (966)
T ss_pred HHHHHHHHHHHHHhchHHHHHHHHHHHH-hcCchhhHHHhhHHHHHHhcCCCcccHHHHHHHHhcC
Confidence 4577889999999999999999999998 6554 4455667777777754
No 201
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=33.24 E-value=1.6e+02 Score=20.11 Aligned_cols=50 Identities=18% Similarity=0.147 Sum_probs=27.6
Q ss_pred HhhcCHHHHHHHHHHhcCCCCCC-hHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 62 CSIGDRAEAFKLADSKFADKTNP-KSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 62 ~~iGD~~~A~~ay~~~~~~~~~s-~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
-..|++++|+.+|...+ ++=.. ...--| ...|-++- ..+..|+.+|+.+.
T Consensus 17 d~~~~y~eA~~~Y~~~i-~~~~~~~k~e~~---~~~k~~ir-----~K~~eYl~RAE~i~ 67 (75)
T cd02677 17 EEEGDYEAAFEFYRAGV-DLLLKGVQGDSS---PERREAVK-----RKIAEYLKRAEEIL 67 (75)
T ss_pred HHHhhHHHHHHHHHHHH-HHHHHHhccCCC---HHHHHHHH-----HHHHHHHHHHHHHH
Confidence 45589999999998877 32100 000000 01122222 36788888888874
No 202
>KOG2002|consensus
Probab=32.52 E-value=2.2e+02 Score=28.96 Aligned_cols=94 Identities=13% Similarity=0.036 Sum_probs=70.7
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
++..-+|.||+.-||++.+...+.-+. ..+.....+-+-++.+-|.---. +|+.....|--++.+.-+ .
T Consensus 271 ~~l~~LAn~fyfK~dy~~v~~la~~ai-~~t~~~~~~aes~Y~~gRs~Ha~-Gd~ekA~~yY~~s~k~~~---------d 339 (1018)
T KOG2002|consen 271 VALNHLANHFYFKKDYERVWHLAEHAI-KNTENKSIKAESFYQLGRSYHAQ-GDFEKAFKYYMESLKADN---------D 339 (1018)
T ss_pred HHHHHHHHHHhhcccHHHHHHHHHHHH-HhhhhhHHHHHHHHHHHHHHHhh-ccHHHHHHHHHHHHccCC---------C
Confidence 466789999999999999999999999 77766666668899999988888 999988888887776532 2
Q ss_pred hHHHHHHHHHHhh--hhcHHHHHHHhH
Q psy398 132 NKLKAYEGVYCLA--IRQYCRAAELFV 156 (162)
Q Consensus 132 nrLKvy~gl~~l~--~R~fk~AA~lfl 156 (162)
|-+=.+.||-.|. .++++.|..+|=
T Consensus 340 ~~~l~~~GlgQm~i~~~dle~s~~~fE 366 (1018)
T KOG2002|consen 340 NFVLPLVGLGQMYIKRGDLEESKFCFE 366 (1018)
T ss_pred CccccccchhHHHHHhchHHHHHHHHH
Confidence 2333455665544 456677766663
No 203
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=32.42 E-value=60 Score=16.74 Aligned_cols=18 Identities=28% Similarity=0.357 Sum_probs=14.6
Q ss_pred cCHHHHHHHHHHhcCCCCC
Q psy398 65 GDRAEAFKLADSKFADKTN 83 (162)
Q Consensus 65 GD~~~A~~ay~~~~~~~~~ 83 (162)
|+.+.|.+.|.++. ..+.
T Consensus 1 ~~~~~~r~i~e~~l-~~~~ 18 (33)
T smart00386 1 GDIERARKIYERAL-EKFP 18 (33)
T ss_pred CcHHHHHHHHHHHH-HHCC
Confidence 67888999999988 6665
No 204
>PRK14965 DNA polymerase III subunits gamma and tau; Provisional
Probab=31.93 E-value=4.2e+02 Score=24.83 Aligned_cols=95 Identities=5% Similarity=0.018 Sum_probs=56.2
Q ss_pred hHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH-------hcCCCchHHHHHH--------HHhhHhHHHhhcCHH
Q psy398 4 YYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE-------KNMMDEEDQVSGI--------WQAKLDYLCSIGDRA 68 (162)
Q Consensus 4 ~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk-------~Nlg~~~~eIr~~--------~~~~a~~~~~iGD~~ 68 (162)
+-+.++.+.+..+|.+-++.+-+...--+-..-..|. ++.. .++|... ..++.+. ..-||..
T Consensus 186 ~L~~i~~~egi~i~~~al~~la~~a~G~lr~al~~Ldqliay~g~~It--~edV~~llG~~~~~~l~~ll~a-l~~~d~~ 262 (576)
T PRK14965 186 RLRYIADQEGISISDAALALVARKGDGSMRDSLSTLDQVLAFCGDAVG--DDDVAELLGVVDRRLLLDISAA-VFGRDTR 262 (576)
T ss_pred HHHHHHHHhCCCCCHHHHHHHHHHcCCCHHHHHHHHHHHHHhccCCCC--HHHHHHHhCCCCHHHHHHHHHH-HHcCCHH
Confidence 3445556667777777766655444433333222222 2222 3444332 2222222 2347999
Q ss_pred HHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHh
Q psy398 69 EAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFH 102 (162)
Q Consensus 69 ~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~ 102 (162)
.|+..+.++. .........++.++..+|--+..
T Consensus 263 ~al~~l~~l~-~~G~~~~~~l~~Ll~~~RdLl~~ 295 (576)
T PRK14965 263 ALLEIVERVD-EFGYNMRQFCQELIDHLRNLVVL 295 (576)
T ss_pred HHHHHHHHHH-HhCCCHHHHHHHHHHHHHHHHHH
Confidence 9999999999 77778888888888888766554
No 205
>KOG1127|consensus
Probab=31.68 E-value=2.4e+02 Score=29.14 Aligned_cols=97 Identities=16% Similarity=0.138 Sum_probs=64.8
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchh---
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWS--- 129 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~--- 129 (162)
.|..+|+=|.+.|-+..|+|+|+++- .-.. .| .+....+++.. .|.-.-+..|..-...+-+ -.+|.
T Consensus 598 ~W~gLGeAY~~sGry~~AlKvF~kAs-~LrP--~s----~y~~fk~A~~e-cd~GkYkeald~l~~ii~~--~s~e~~~q 667 (1238)
T KOG1127|consen 598 LWLGLGEAYPESGRYSHALKVFTKAS-LLRP--LS----KYGRFKEAVME-CDNGKYKEALDALGLIIYA--FSLERTGQ 667 (1238)
T ss_pred HHHHHHHHHHhcCceehHHHhhhhhH-hcCc--Hh----HHHHHHHHHHH-HHhhhHHHHHHHHHHHHHH--HHHHHHhh
Confidence 67889999999999999999998866 2221 12 23444555555 5554444444444444433 23443
Q ss_pred --------hhhHHHHHHHHHHhhhhcHHHHHHHhHhhc
Q psy398 130 --------ARNKLKAYEGVYCLAIRQYCRAAELFVDVI 159 (162)
Q Consensus 130 --------rrnrLKvy~gl~~l~~R~fk~AA~lfld~~ 159 (162)
|-.+.-...|++.++.--|+++-+.|.-++
T Consensus 668 ~gLaE~~ir~akd~~~~gf~~kavd~~eksie~f~~~l 705 (1238)
T KOG1127|consen 668 NGLAESVIRDAKDSAITGFQKKAVDFFEKSIESFIVSL 705 (1238)
T ss_pred hhHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHH
Confidence 667777888999888888888888887554
No 206
>KOG2047|consensus
Probab=31.67 E-value=2.3e+02 Score=27.91 Aligned_cols=93 Identities=9% Similarity=0.060 Sum_probs=68.2
Q ss_pred HHHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCH
Q psy398 28 NEKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDI 106 (162)
Q Consensus 28 n~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~ 106 (162)
.++++.--.+.++ -|--.---+...-|...|+||...||++.|...|+++..-.-.+..+--.+...-.-+-+-+ .++
T Consensus 363 ~~~~i~tyteAv~~vdP~ka~Gs~~~Lw~~faklYe~~~~l~~aRvifeka~~V~y~~v~dLa~vw~~waemElrh-~~~ 441 (835)
T KOG2047|consen 363 AAEQINTYTEAVKTVDPKKAVGSPGTLWVEFAKLYENNGDLDDARVIFEKATKVPYKTVEDLAEVWCAWAEMELRH-ENF 441 (835)
T ss_pred hHHHHHHHHHHHHccCcccCCCChhhHHHHHHHHHHhcCcHHHHHHHHHHhhcCCccchHHHHHHHHHHHHHHHhh-hhH
Confidence 4455555555555 11110013566778999999999999999999999988233356666677788888888888 999
Q ss_pred HHHHHHHHHHHhhhh
Q psy398 107 QAMTAAIEKATELIE 121 (162)
Q Consensus 107 ~~v~~~i~ka~~l~e 121 (162)
...-+-+.+|..++-
T Consensus 442 ~~Al~lm~~A~~vP~ 456 (835)
T KOG2047|consen 442 EAALKLMRRATHVPT 456 (835)
T ss_pred HHHHHHHHhhhcCCC
Confidence 999999999988864
No 207
>TIGR02397 dnaX_nterm DNA polymerase III, subunit gamma and tau. This model represents the well-conserved first ~ 365 amino acids of the translation of the dnaX gene. The full-length product of the dnaX gene in the model bacterium E. coli is the DNA polymerase III tau subunit. A translational frameshift leads to early termination and a truncated protein subunit gamma, about 1/3 shorter than tau and present in roughly equal amounts. This frameshift mechanism is not necessarily universal for species with DNA polymerase III but appears conserved in the exterme thermophile Thermus thermophilis.
Probab=31.57 E-value=3.1e+02 Score=22.95 Aligned_cols=96 Identities=15% Similarity=0.210 Sum_probs=59.1
Q ss_pred hHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH------hcCCCchHHHHHH--------HHhhHhHHHhhcCHHH
Q psy398 4 YYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE------KNMMDEEDQVSGI--------WQAKLDYLCSIGDRAE 69 (162)
Q Consensus 4 ~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk------~Nlg~~~~eIr~~--------~~~~a~~~~~iGD~~~ 69 (162)
+-+..+.+.+.++|++.++.+-+...-.+..+...++ ...+ +.+.|+.. ..++.+... .||...
T Consensus 184 ~l~~~~~~~g~~i~~~a~~~l~~~~~g~~~~a~~~lekl~~~~~~~i-t~~~v~~~~~~~~~~~i~~l~~ai~-~~~~~~ 261 (355)
T TIGR02397 184 RLKKILDKEGIKIEDEALELIARAADGSLRDALSLLDQLISFGNGNI-TYEDVNELLGLVDDEKLIELLEAIL-NKDTAE 261 (355)
T ss_pred HHHHHHHHcCCCCCHHHHHHHHHHcCCChHHHHHHHHHHHhhcCCCC-CHHHHHHHhCCCCHHHHHHHHHHHH-cCCHHH
Confidence 3445556678888888777666554444444444443 1111 13334332 233333333 389999
Q ss_pred HHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHh
Q psy398 70 AFKLADSKFADKTNPKSHRIDAVFTLFRIAYFH 102 (162)
Q Consensus 70 A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~ 102 (162)
|+..+..+. +....+-..+.++...+|--+..
T Consensus 262 a~~~~~~l~-~~~~~~~~il~~l~~~~r~l~~~ 293 (355)
T TIGR02397 262 ALKILDEIL-ESGVDPEKFLEDLIEILRDLLLI 293 (355)
T ss_pred HHHHHHHHH-HcCCCHHHHHHHHHHHHHHHHHH
Confidence 999999999 76667778888888888876555
No 208
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=31.45 E-value=2.6e+02 Score=22.08 Aligned_cols=68 Identities=15% Similarity=0.195 Sum_probs=50.4
Q ss_pred CCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 82 TNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 82 ~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
..=++..+--+..++++|+.. .|..-+..-|...+-|-=+ - .=+-.+.|.-+|..|++.+|...|=+.
T Consensus 3 ~qCs~~iv~gLi~~~~~aL~~-~d~~D~e~lLdALrvLrP~--~------~e~d~~dg~l~i~rg~w~eA~rvlr~l 70 (153)
T TIGR02561 3 IQCSNRLLGGLIEVLMYALRS-ADPYDAQAMLDALRVLRPN--L------KELDMFDGWLLIARGNYDEAARILREL 70 (153)
T ss_pred CcCcHHHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHHhCCC--c------cccchhHHHHHHHcCCHHHHHHHHHhh
Confidence 334456667778888899998 8888888888888777211 1 124578899999999999999888654
No 209
>KOG4626|consensus
Probab=31.20 E-value=1.4e+02 Score=29.46 Aligned_cols=28 Identities=11% Similarity=0.058 Sum_probs=16.8
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
-+||..+|.-|-..|..++|+.+|+.++
T Consensus 456 AeAhsNLasi~kDsGni~~AI~sY~~aL 483 (966)
T KOG4626|consen 456 AEAHSNLASIYKDSGNIPEAIQSYRTAL 483 (966)
T ss_pred HHHHhhHHHHhhccCCcHHHHHHHHHHH
Confidence 3455666666666666666666666655
No 210
>cd08053 Yqbg Putative Head-Tail Connector Protein Yqbg from Bacillus subtilis and similar proteins. The uncharacterized Bacillus subtilis Yqbg protein, whose gene is part of the unusual genetic element called skin, shows a similar structure to the connector proteins gp6 and gp15 from bacteriophage HK97 and SPP1, respectively. gp6 and gp15 are critical in the assembly of the connector, a specialized structure that serves as an interface for head and tail attachment, as well as a point at which DNA exits the head during infection by the bacteriophage. They form dodecameric ring structures that comprise the middle ring of the connector, located between the portal protein (attached to the head) and the gp7/gp16 ring (attached to the tail). The Yqbg gene is surrounded with genes similar to genes in the Bacillus subtilis prophage-like element PBSX, which encode for proteins comprising contractile-tailed phage-like particles that are produced upon mitomycin C treatment. Yqbg likely acts as a
Probab=30.94 E-value=2.1e+02 Score=20.74 Aligned_cols=53 Identities=11% Similarity=0.149 Sum_probs=43.1
Q ss_pred CCCcCHHHHHHHHHHHHHHHHHHHHHHH-h----cCCCchHHHHHHHHhhHhHHHhhcCHH
Q psy398 13 GWKMDEKFYTEMKDANEKRLKELDLEHE-K----NMMDEEDQVSGIWQAKLDYLCSIGDRA 68 (162)
Q Consensus 13 ~~~~D~~~~~~~~~~n~~~l~~Le~~lk-~----Nlg~~~~eIr~~~~~~a~~~~~iGD~~ 68 (162)
+-+++++..+...++....+..+--- . . ... .+.|++|..+.++|++.+|...
T Consensus 15 g~~~~~~~f~~~~~rA~~~Id~it~~-r~~~~~~~~~--~e~vk~A~c~~ae~~~~~~~~~ 72 (121)
T cd08053 15 GKERPEELLKKDILRASAKIDKITFG-RFDDLGYPEI--PEEVKLAVCALAEYIALIDGDE 72 (121)
T ss_pred CCcCCHHHHHHHHHHHHHHHHhhhcC-cccccccccc--cHHHHHHHHHHHHHHHHhCCcc
Confidence 55667889999999999988877652 2 1 556 7899999999999999999864
No 211
>KOG1125|consensus
Probab=30.89 E-value=4.7e+02 Score=25.03 Aligned_cols=125 Identities=14% Similarity=0.080 Sum_probs=71.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCC------CChHHHHHHH
Q psy398 20 FYTEMKDANEKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKT------NPKSHRIDAV 92 (162)
Q Consensus 20 ~~~~~~~~n~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~------~s~g~kiD~~ 92 (162)
|+--+...|+.+..-+.+-.+ -++- --=.+++.++|-.|.--|-=..|++++.+=+ ... .+.+..-+-.
T Consensus 324 ~LG~~qaENE~E~~ai~AL~rcl~Ld---P~NleaLmaLAVSytNeg~q~~Al~~L~~Wi-~~~p~y~~l~~a~~~~~~~ 399 (579)
T KOG1125|consen 324 KLGITQAENENEQNAISALRRCLELD---PTNLEALMALAVSYTNEGLQNQALKMLDKWI-RNKPKYVHLVSAGENEDFE 399 (579)
T ss_pred HhhhHhhhccchHHHHHHHHHHHhcC---CccHHHHHHHHHHHhhhhhHHHHHHHHHHHH-HhCccchhccccCcccccc
Confidence 345577888888777766655 1111 1113477899999999999999999988843 111 1100000000
Q ss_pred H--HHHHHHHHhcCCHHHHHHH-HHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 93 F--TLFRIAYFHGCDIQAMTAA-IEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 93 l--~~iRv~~f~~~d~~~v~~~-i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
. ..+-. ..+.+|+.. ++-|.....+ -|. .+...-||-+=..|+|.+|..||=.+|.
T Consensus 400 ~~~s~~~~-----~~l~~i~~~fLeaa~~~~~~---~Dp----dvQ~~LGVLy~ls~efdraiDcf~~AL~ 458 (579)
T KOG1125|consen 400 NTKSFLDS-----SHLAHIQELFLEAARQLPTK---IDP----DVQSGLGVLYNLSGEFDRAVDCFEAALQ 458 (579)
T ss_pred CCcCCCCH-----HHHHHHHHHHHHHHHhCCCC---CCh----hHHhhhHHHHhcchHHHHHHHHHHHHHh
Confidence 0 00000 122233332 3334444321 243 3555667777789999999999987764
No 212
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=30.49 E-value=1.4e+02 Score=30.02 Aligned_cols=56 Identities=7% Similarity=-0.041 Sum_probs=40.4
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELI 120 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~ 120 (162)
.|+.-+|+.|-+.|+.++|..+|.+++ ..... =...+=++|+++ ..- ++.||..+.
T Consensus 117 ~Al~~LA~~Ydk~g~~~ka~~~yer~L-~~D~~------n~~aLNn~AY~~-ae~-----dL~KA~~m~ 172 (906)
T PRK14720 117 LALRTLAEAYAKLNENKKLKGVWERLV-KADRD------NPEIVKKLATSY-EEE-----DKEKAITYL 172 (906)
T ss_pred HHHHHHHHHHHHcCChHHHHHHHHHHH-hcCcc------cHHHHHHHHHHH-HHh-----hHHHHHHHH
Confidence 588889999999999999999999999 76622 223344577777 322 566665554
No 213
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=30.13 E-value=4.2e+02 Score=24.07 Aligned_cols=80 Identities=16% Similarity=0.125 Sum_probs=47.2
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
-|..+|+...+.||.+-|.+||.++- +.. - +.-+.... +|...+.+-.+.|+. .|||..
T Consensus 349 ~W~~Lg~~AL~~g~~~lAe~c~~k~~-d~~--------~---L~lLy~~~-g~~~~L~kl~~~a~~------~~~~n~-- 407 (443)
T PF04053_consen 349 KWKQLGDEALRQGNIELAEECYQKAK-DFS--------G---LLLLYSST-GDREKLSKLAKIAEE------RGDINI-- 407 (443)
T ss_dssp HHHHHHHHHHHTTBHHHHHHHHHHCT--HH--------H---HHHHHHHC-T-HHHHHHHHHHHHH------TT-HHH--
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHhhc-Ccc--------c---cHHHHHHh-CCHHHHHHHHHHHHH------ccCHHH--
Confidence 57788888899999999999998887 432 1 11133344 676555544444333 356542
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
.-...+..+|+++-.++|+++
T Consensus 408 -----af~~~~~lgd~~~cv~lL~~~ 428 (443)
T PF04053_consen 408 -----AFQAALLLGDVEECVDLLIET 428 (443)
T ss_dssp -----HHHHHHHHT-HHHHHHHHHHT
T ss_pred -----HHHHHHHcCCHHHHHHHHHHc
Confidence 224455567887777777765
No 214
>cd08638 DNA_pol_A_theta DNA polymerase theta is a low-fidelity family A enzyme implicated in translesion synthesis and in somatic hypermutation. DNA polymerase theta is a low-fidelity family A enzyme implicated in translesion synthesis (TLS) and in somatic hypermutation (SHM). DNA-dependent DNA polymerases can be classified in six main groups based upon phylogenetic relationships with E. coli polymerase I (classA), E. coli polymerase II (class B), E.coli polymerase III (class C), euryarchaaeota polymerase II (class D), human polymerase beta (class x), E. coli UmuC/DinB and eukaryotic RAP 30/Xeroderma pigmentosum variant (class Y). Family A polymerase functions primarily to fill DNA gaps that arise during DNA repair, recombination and replication. Pol theta is an exception among family A polymerases and generates processive single base substitutions. Family A polymerase are found primarily in organisms related to prokaryotes and include prokaryotic DNA polymerase I (pol I) ,mitochondri
Probab=29.96 E-value=2.2e+02 Score=25.13 Aligned_cols=40 Identities=20% Similarity=0.215 Sum_probs=32.2
Q ss_pred CCCcCHHHHHHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHH
Q psy398 13 GWKMDEKFYTEMKDANEKRLKELDLEHEKNMMDEEDQVSGIWQ 55 (162)
Q Consensus 13 ~~~~D~~~~~~~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~ 55 (162)
|+++|.++++.+.+.-..++++||.++.. .. ++++...+.
T Consensus 1 Gi~~d~~~l~~~~~~l~~~~~~le~~~~~-st--~~~~L~~l~ 40 (373)
T cd08638 1 GIGFDPEELERQRALLQAKLKELEEEAYR-ST--SKEVLEQLK 40 (373)
T ss_pred CeEeCHHHHHHHHHHHHHHHHHHHHHHHh-cc--hHHHHHHHH
Confidence 56899999999999999999999999985 23 455555443
No 215
>PF03357 Snf7: Snf7; InterPro: IPR005024 This is a family of eukaryotic proteins which are variously described as either hypothetical protein, developmental protein or related to yeast SNF7. The family contains human CHMP1. CHMP1 (CHromatin Modifying Protein; CHarged Multivesicular body Protein), is encoded by an alternative open reading frame in the PRSM1 gene [] and is conserved in both complex and simple eukaryotes. CHMP1 contains a predicted bipartite nuclear localisation signal and distributes as distinct forms to the cytoplasm and the nuclear matrix in all cell lines tested. Human CHMP1 is strongly implicated in multivesicular body formation. A multivesicular body is a vesicle-filled endosome that targets proteins to the interior of lysosomes. Immunocytochemistry and biochemical fractionation localise CHMP1 to early endosomes and CHMP1 physically interacts with SKD1/VPS4, a highly conserved protein directly linked to multivesicular body sorting in yeast. Similar to the action of a mutant SKD1 protein, over expression of a fusion derivative of human CHMP1 dilates endosomal compartments and disrupts the normal distribution of several endosomal markers. Genetic studies in Saccharomyces cerevisiae (Baker's yeast) further support a conserved role of CHMP1 in vesicle trafficking. Deletion of CHM1, the budding yeast homologue of CHMP1, results in defective sorting of carboxypeptidases S and Y and produces abnormal, multi-lamellar prevacuolar compartments. This phenotype classifies CHM1 as a member of the class E vacuolar protein sorting genes []. ; GO: 0015031 protein transport; PDB: 2V6X_B 2W2U_D 2GD5_D 3FRT_B 3FRV_A 4ABM_D 3EAB_H 3HTU_D.
Probab=29.15 E-value=1.2e+02 Score=22.62 Aligned_cols=70 Identities=11% Similarity=0.108 Sum_probs=39.5
Q ss_pred HHHHHHHHHHHHHHHHHHH--hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHH
Q psy398 22 TEMKDANEKRLKELDLEHE--KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIA 99 (162)
Q Consensus 22 ~~~~~~n~~~l~~Le~~lk--~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~ 99 (162)
......=+.+..+|+.+++ .... +..++.|....|..|++. .+.+++.. +.+.+.-..++-+...|.-+
T Consensus 7 k~~~~~L~~~~~~le~~i~~~~~~~--k~~~~~~~~~~A~~~lk~------~k~~~k~~-~~~~~~~~~l~~~~~~ie~a 77 (171)
T PF03357_consen 7 KKTIRRLEKQIKRLEKKIKKLEKKA--KKAIKKGNKERAKIYLKR------KKRLEKQL-EKLLNQLSNLESVLLQIETA 77 (171)
T ss_dssp HHHHHHHHHHHHHHHHHHHHCHHHH--HHHHCTT-HHHHHHHHHH------HHHHHHHH-HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHcCChHHHHHHHHH------HHHHHHHH-HHHHHHHHHHHHHHHHHHHH
Confidence 3445555666777777777 5555 666666666777766654 23334444 44445555555555555544
Q ss_pred H
Q psy398 100 Y 100 (162)
Q Consensus 100 ~ 100 (162)
.
T Consensus 78 ~ 78 (171)
T PF03357_consen 78 Q 78 (171)
T ss_dssp H
T ss_pred H
Confidence 3
No 216
>PRK10316 hypothetical protein; Provisional
Probab=28.71 E-value=1.1e+02 Score=25.50 Aligned_cols=52 Identities=19% Similarity=0.323 Sum_probs=39.3
Q ss_pred CCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhH
Q psy398 80 DKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNK 133 (162)
Q Consensus 80 ~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnr 133 (162)
.+....|+.-=.=..+-|..||. ++....+.-+..|+.+++. ...||-+-.|
T Consensus 45 ~~lS~dG~~A~~DI~~AR~Alf~-G~~~~Ak~ll~~A~~~l~~-a~~D~~~f~k 96 (209)
T PRK10316 45 ERISEQGLYAMRDVQVARLALFH-GDPEKAKELTNQASALLSD-DSTDWAKFAK 96 (209)
T ss_pred HHhhHHHHHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHHHHHh-hhccHHHHHh
Confidence 44555566554556778999999 9999999999999999874 3448876543
No 217
>KOG3081|consensus
Probab=28.18 E-value=4.1e+02 Score=23.26 Aligned_cols=128 Identities=17% Similarity=0.046 Sum_probs=72.6
Q ss_pred HHHHHHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhc-------CCCCCChHHHHHHHHHHH
Q psy398 25 KDANEKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKF-------ADKTNPKSHRIDAVFTLF 96 (162)
Q Consensus 25 ~~~n~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~-------~~~~~s~g~kiD~~l~~i 96 (162)
...-+..+++|.+.+- .+-+ +..-.+..-|.+|+.-||.++|+++..+-. +-.+..--||+|..-..+
T Consensus 85 e~~~~~~~~~l~E~~a~~~~~----sn~i~~l~aa~i~~~~~~~deAl~~~~~~~~lE~~Al~VqI~lk~~r~d~A~~~l 160 (299)
T KOG3081|consen 85 ESNKKSILASLYELVADSTDG----SNLIDLLLAAIIYMHDGDFDEALKALHLGENLEAAALNVQILLKMHRFDLAEKEL 160 (299)
T ss_pred cchhHHHHHHHHHHHHhhccc----hhHHHHHHhhHHhhcCCChHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3344555666666665 2222 222345567899999999999998776511 112333456666666666
Q ss_pred HHHHHhcCCHHHHHHHHHHHHhhhhhcCCCch---------------hhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 97 RIAYFHGCDIQAMTAAIEKATELIEGISGGDW---------------SARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 97 Rv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW---------------~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
+..... |-+++-+.++.|=--+- +||.- .+...+--=.++-+|..|+|.+|-.++.+++.
T Consensus 161 k~mq~i--ded~tLtQLA~awv~la--~ggek~qdAfyifeE~s~k~~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~ 235 (299)
T KOG3081|consen 161 KKMQQI--DEDATLTQLAQAWVKLA--TGGEKIQDAFYIFEELSEKTPPTPLLLNGQAVCHLQLGRYEEAESLLEEALD 235 (299)
T ss_pred HHHHcc--chHHHHHHHHHHHHHHh--ccchhhhhHHHHHHHHhcccCCChHHHccHHHHHHHhcCHHHHHHHHHHHHh
Confidence 665443 33333333332221111 13321 12223333456778999999999999888764
No 218
>KOG0276|consensus
Probab=27.75 E-value=87 Score=30.42 Aligned_cols=80 Identities=15% Similarity=0.102 Sum_probs=50.9
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
-|..+|++..+-|++..|.+|+.+++ |... ++-+.--. +|...+.. +-++.++ .-
T Consensus 668 Kw~~Lg~~al~~~~l~lA~EC~~~a~-d~~~-----------LlLl~t~~-g~~~~l~~----la~~~~~--------~g 722 (794)
T KOG0276|consen 668 KWRQLGDAALSAGELPLASECFLRAR-DLGS-----------LLLLYTSS-GNAEGLAV----LASLAKK--------QG 722 (794)
T ss_pred HHHHHHHHHhhcccchhHHHHHHhhc-chhh-----------hhhhhhhc-CChhHHHH----HHHHHHh--------hc
Confidence 47789999999999999999999999 6531 11122223 55443332 2222221 12
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
|..+ .-+.++..+++.+..++++++
T Consensus 723 ~~N~-AF~~~~l~g~~~~C~~lLi~t 747 (794)
T KOG0276|consen 723 KNNL-AFLAYFLSGDYEECLELLIST 747 (794)
T ss_pred ccch-HHHHHHHcCCHHHHHHHHHhc
Confidence 2222 346677789999999999876
No 219
>PRK14970 DNA polymerase III subunits gamma and tau; Provisional
Probab=27.39 E-value=3.9e+02 Score=22.81 Aligned_cols=94 Identities=14% Similarity=0.127 Sum_probs=58.6
Q ss_pred HHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH-------hcCCCchHHHHHH--------HHhhHhHHHhhcCHHH
Q psy398 5 YKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE-------KNMMDEEDQVSGI--------WQAKLDYLCSIGDRAE 69 (162)
Q Consensus 5 Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk-------~Nlg~~~~eIr~~--------~~~~a~~~~~iGD~~~ 69 (162)
-...+.+.+.+++++-++.+-......+..+..+++ .+.. .+.|+.. ..++.+.. .-||..+
T Consensus 176 l~~~~~~~g~~i~~~al~~l~~~~~gdlr~~~~~lekl~~y~~~~it--~~~v~~~~~~~~~~~if~l~~ai-~~~~~~~ 252 (367)
T PRK14970 176 LAGIAVKEGIKFEDDALHIIAQKADGALRDALSIFDRVVTFCGKNIT--RQAVTENLNILDYDTYINVTDLI-LENKIPE 252 (367)
T ss_pred HHHHHHHcCCCCCHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCCCC--HHHHHHHhCCCCHHHHHHHHHHH-HcCCHHH
Confidence 344455667888888777766655544444444444 2222 3344432 22333333 3479999
Q ss_pred HHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHh
Q psy398 70 AFKLADSKFADKTNPKSHRIDAVFTLFRIAYFH 102 (162)
Q Consensus 70 A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~ 102 (162)
|+..+.... .....+-..+.++...+|.-+..
T Consensus 253 a~~~~~~l~-~~~~~~~~il~~l~~~fr~ll~~ 284 (367)
T PRK14970 253 LLLAFNEIL-RKGFDGHHFIAGLASHFRDLMVS 284 (367)
T ss_pred HHHHHHHHH-HcCCCHHHHHHHHHHHHHHHHHh
Confidence 999999998 66677777888887777766555
No 220
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=27.05 E-value=3.1e+02 Score=21.56 Aligned_cols=70 Identities=9% Similarity=-0.064 Sum_probs=49.5
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhh
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIE 121 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e 121 (162)
+-+..++|..+|-=+-+.|++++|+.+|.++..=....+..-..+.. .-+.. +|.....+....|-..+.
T Consensus 65 Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~ddp~~~~~ag~----c~L~l-G~~~~A~~aF~~Ai~~~~ 134 (157)
T PRK15363 65 DAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAE----CYLAC-DNVCYAIKALKAVVRICG 134 (157)
T ss_pred CcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCchHHHHHHH----HHHHc-CCHHHHHHHHHHHHHHhc
Confidence 55667899999999999999999999999976112223333333332 23334 788888888888888773
No 221
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=26.74 E-value=1.6e+02 Score=18.16 Aligned_cols=61 Identities=23% Similarity=0.226 Sum_probs=43.5
Q ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhh-cHHHHHHHhHhhcc
Q psy398 91 AVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIR-QYCRAAELFVDVIR 160 (162)
Q Consensus 91 ~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R-~fk~AA~lfld~~~ 160 (162)
..+..=.+.+.. +++.....+..+|-.+- ..++. .-...|+.++..+ +|.+|-+.|-.++.
T Consensus 5 ~~~~~g~~~~~~-~~~~~A~~~~~~ai~~~----p~~~~----~~~~~g~~~~~~~~~~~~A~~~~~~al~ 66 (69)
T PF13414_consen 5 AWYNLGQIYFQQ-GDYEEAIEYFEKAIELD----PNNAE----AYYNLGLAYMKLGKDYEEAIEDFEKALK 66 (69)
T ss_dssp HHHHHHHHHHHT-THHHHHHHHHHHHHHHS----TTHHH----HHHHHHHHHHHTTTHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHc-CCHHHHHHHHHHHHHcC----CCCHH----HHHHHHHHHHHhCccHHHHHHHHHHHHH
Confidence 344445556666 89999999988888862 22333 5556788888888 89999999887763
No 222
>KOG0796|consensus
Probab=26.52 E-value=2.5e+02 Score=24.81 Aligned_cols=20 Identities=15% Similarity=0.145 Sum_probs=10.6
Q ss_pred HHHHHHHHHHHHHHHHHHHHH
Q psy398 20 FYTEMKDANEKRLKELDLEHE 40 (162)
Q Consensus 20 ~~~~~~~~n~~~l~~Le~~lk 40 (162)
..+.+..+ ++++..|+++|+
T Consensus 113 ~~~e~~~k-~~~v~~l~e~I~ 132 (319)
T KOG0796|consen 113 RSEEAARK-AEKVHELEEKIG 132 (319)
T ss_pred hhhHHHHH-HHHHHHHHHHHH
Confidence 33333333 556666666665
No 223
>PRK06078 pyrimidine-nucleoside phosphorylase; Reviewed
Probab=26.30 E-value=1.7e+02 Score=26.86 Aligned_cols=81 Identities=15% Similarity=0.135 Sum_probs=59.6
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
++..++|. .+|+-.+..++ ...+ . .-+++...|+++..--..++..+.-....+.++.|+..++. |--|++-
T Consensus 238 ~~~~plG~---~iGna~Ev~Ea-~~vL-~-G~~~~~~~d~v~~~A~~~L~~~g~~~~~~eg~~~a~e~l~s--GkAl~kf 309 (434)
T PRK06078 238 DMSQPLGR---AIGNALEVLEA-IDTL-Q-GKGPKDLTELVLTLGSQMVVLAGKAKTLEEAREHLIEVMNN--GKALEKF 309 (434)
T ss_pred CCCccccc---cCCCHHHHHHH-HHHH-C-CCCcccHHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHHHhc--cHHHHHH
Confidence 34445665 89999999988 5555 2 23556789999999999999843344577888999999886 6577777
Q ss_pred hHHHHHHHH
Q psy398 132 NKLKAYEGV 140 (162)
Q Consensus 132 nrLKvy~gl 140 (162)
.++-.++|=
T Consensus 310 ~~~v~aqGg 318 (434)
T PRK06078 310 KEFLSAQGG 318 (434)
T ss_pred HHHHHHhCC
Confidence 777777763
No 224
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=26.28 E-value=4.2e+02 Score=22.71 Aligned_cols=87 Identities=14% Similarity=0.125 Sum_probs=57.4
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCC-CCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFAD-KTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSAR 131 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~-~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rr 131 (162)
+|..+|-=|-+.|+.+.|-.+|.+++ + +...+.-.=.+.++. +.. +|.....+++..|..... .|
T Consensus 136 ~~~~lgaaldq~Gr~~~Ar~ay~qAl-~L~~~~p~~~nNlgms~----~L~-gd~~~A~~lll~a~l~~~----ad---- 201 (257)
T COG5010 136 AWNLLGAALDQLGRFDEARRAYRQAL-ELAPNEPSIANNLGMSL----LLR-GDLEDAETLLLPAYLSPA----AD---- 201 (257)
T ss_pred hhhHHHHHHHHccChhHHHHHHHHHH-HhccCCchhhhhHHHHH----HHc-CCHHHHHHHHHHHHhCCC----Cc----
Confidence 56789999999999999999999998 4 333333333333322 234 899999999999887643 12
Q ss_pred hHHHHHHHHHHhhhhcHHHHHH
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAE 153 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~ 153 (162)
++..-=-.+..-.+|+|..|-.
T Consensus 202 ~~v~~NLAl~~~~~g~~~~A~~ 223 (257)
T COG5010 202 SRVRQNLALVVGLQGDFREAED 223 (257)
T ss_pred hHHHHHHHHHHhhcCChHHHHh
Confidence 2222223444456778876643
No 225
>PF11564 BpuJI_N: Restriction endonuclease BpuJI - N terminal; InterPro: IPR021108 There are four classes of restriction endonucleases: types I, II,III and IV. All types of enzymes recognise specific short DNA sequences and carry out the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. They differ in their recognition sequence, subunit composition, cleavage position, and cofactor requirements [, ], as summarised below: Type I enzymes (3.1.21.3 from EC) cleave at sites remote from recognition site; require both ATP and S-adenosyl-L-methionine to function; multifunctional protein with both restriction and methylase (2.1.1.72 from EC) activities. Type II enzymes (3.1.21.4 from EC) cleave within or at short specific distances from recognition site; most require magnesium; single function (restriction) enzymes independent of methylase. Type III enzymes (3.1.21.5 from EC) cleave at sites a short distance from recognition site; require ATP (but doesn't hydrolyse it); S-adenosyl-L-methionine stimulates reaction but is not required; exists as part of a complex with a modification methylase methylase (2.1.1.72 from EC). Type IV enzymes target methylated DNA. Type II restriction endonucleases (3.1.21.4 from EC) are components of prokaryotic DNA restriction-modification mechanisms that protect the organism against invading foreign DNA. These site-specific deoxyribonucleases catalyse the endonucleolytic cleavage of DNA to give specific double-stranded fragments with terminal 5'-phosphates. Of the 3000 restriction endonucleases that have been characterised, most are homodimeric or tetrameric enzymes that cleave target DNA at sequence-specific sites close to the recognition site. For homodimeric enzymes, the recognition site is usually a palindromic sequence 4-8 bp in length. Most enzymes require magnesium ions as a cofactor for catalysis. Although they can vary in their mode of recognition, many restriction endonucleases share a similar structural core comprising four beta-strands and one alpha-helix, as well as a similar mechanism of cleavage, suggesting a common ancestral origin []. However, there is still considerable diversity amongst restriction endonucleases [, ]. The target site recognition process triggers large conformational changes of the enzyme and the target DNA, leading to the activation of the catalytic centres. Like other DNA binding proteins, restriction enzymes are capable of non-specific DNA binding as well, which is the prerequisite for efficient target site location by facilitated diffusion. Non-specific binding usually does not involve interactions with the bases but only with the DNA backbone []. BpuJI is a type II restriction endonuclease which recognises the asymmetric sequence 5'-CCCGT and cuts at multiple sites in the surrounding area of the target sequence. The BpuJI protein consists of two distinct modules; an N-terminal DNA recognition domain, and a C-terminal dimerisation and catalysis domain. The N-terminal domain is composed of two winged-helix subdomains and a disrupted linker subdomain. Target sequence recognition occurs through major groove contacts of amino acids in the winged-helix subdomains []. ; PDB: 2VLA_A.
Probab=26.05 E-value=1.2e+02 Score=26.30 Aligned_cols=110 Identities=16% Similarity=0.212 Sum_probs=68.2
Q ss_pred chhHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH--hcCCCchHHHHHHH-------------HhhHhHHHhhcC
Q psy398 2 APYYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE--KNMMDEEDQVSGIW-------------QAKLDYLCSIGD 66 (162)
Q Consensus 2 ~p~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk--~Nlg~~~~eIr~~~-------------~~~a~~~~~iGD 66 (162)
.|||...+++++.-.-.++.+++.+.-.+=+.. +...| .|- -+||..++ ...+..+..-+|
T Consensus 26 L~~~At~Is~~~~l~~~~F~~~~N~~l~~~~~n-~~tqKTi~Nw---rTEIag~LFG~~~e~~g~~~~s~~a~~L~e~qd 101 (285)
T PF11564_consen 26 LPYYATIISDICPLDKKEFNQQLNNALSEFLGN-DSTQKTINNW---RTEIAGALFGFYYEKNGVIYPSRRAKELIEDQD 101 (285)
T ss_dssp HHHHHHHHHHH-SEEHHHHHHHHHHHHHHHH-T-T--HHHHHHH---HHHCCCCCS--EEEETTEEEE-HHHHHHHHH--
T ss_pred HHHHHHHHHhccCCCHHHHHHHHHHHHHHhhcC-chhHHHHHHH---HHHHHHHHhhheeccCceeccChhHHHHhhccc
Confidence 589999999988766667777766654444433 33334 332 25554222 246889999999
Q ss_pred HHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHh
Q psy398 67 RAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATE 118 (162)
Q Consensus 67 ~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~ 118 (162)
+.+.++.+-=.+ .+..+ -.|.+-+...|..|+-+ .-..-+.+-+.+|++
T Consensus 102 l~~FFK~fl~~F-QyP~g-h~K~~~I~~~iq~~I~f-kP~~yiL~ll~~a~~ 150 (285)
T PF11564_consen 102 LPEFFKNFLFNF-QYPGG-HIKSQEIIELIQNGIQF-KPFQYILQLLQEAEQ 150 (285)
T ss_dssp HHHHHHHHHHH---BS-S-SS-HHHHHHHHHTT--B--HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhhc-cCCCc-cchHHHHHHHHhcCCee-ccHHHHHHHHHHhhc
Confidence 999999998888 88766 35777788888888888 777777777777765
No 226
>PF07219 HemY_N: HemY protein N-terminus; InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=26.02 E-value=2.5e+02 Score=20.04 Aligned_cols=49 Identities=16% Similarity=0.264 Sum_probs=34.1
Q ss_pred HHHHHHHHHHhhhhhcCCCchh-----hhhHHHHHHHHHHhhhhcHHHHHHHhHhh
Q psy398 108 AMTAAIEKATELIEGISGGDWS-----ARNKLKAYEGVYCLAIRQYCRAAELFVDV 158 (162)
Q Consensus 108 ~v~~~i~ka~~l~e~~~ggDW~-----rrnrLKvy~gl~~l~~R~fk~AA~lfld~ 158 (162)
.+-.-+...-.++.. -..|- +|.+-...+|+-.+..+||+.|.+.+..+
T Consensus 32 ll~~ll~~~~~~p~~--~~~~~~~rr~~ka~~al~~Gl~al~~G~~~~A~k~~~~a 85 (108)
T PF07219_consen 32 LLLRLLRRLLSLPSR--VRRWRRRRRRRKAQRALSRGLIALAEGDWQRAEKLLAKA 85 (108)
T ss_pred HHHHHHHHHHhChHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 556666666666544 34563 23334458999999999999999988654
No 227
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=25.71 E-value=2.1e+02 Score=19.09 Aligned_cols=22 Identities=23% Similarity=0.175 Sum_probs=17.9
Q ss_pred hHhHHHhhcCHHHHHHHHHHhc
Q psy398 57 KLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 57 ~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.|--.-..|+++.|+..|..+.
T Consensus 12 ~Av~~D~~g~y~eA~~~Y~~ai 33 (75)
T cd02678 12 KAIEEDNAGNYEEALRLYQHAL 33 (75)
T ss_pred HHHHHHHcCCHHHHHHHHHHHH
Confidence 4455667899999999999887
No 228
>KOG2796|consensus
Probab=25.70 E-value=4.7e+02 Score=23.17 Aligned_cols=142 Identities=13% Similarity=0.155 Sum_probs=85.3
Q ss_pred HHHHHHhcCCCcCH-HHHHHHHHHHHHHHHHHHHHHHhcCCCchHHHHH-------HHHhhHhHHHhhcCHHHHHHHHHH
Q psy398 5 YKEVCEEFGWKMDE-KFYTEMKDANEKRLKELDLEHEKNMMDEEDQVSG-------IWQAKLDYLCSIGDRAEAFKLADS 76 (162)
Q Consensus 5 Y~~l~~~~~~~~D~-~~~~~~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~-------~~~~~a~~~~~iGD~~~A~~ay~~ 76 (162)
|..+-..+|.|... ..+..++---++.+..++.. |+- |++|+. .+.-.+.++.-.|.+.-++.+|.+
T Consensus 128 hAe~~~~lgnpqesLdRl~~L~~~V~~ii~~~e~~---~~~--ESsv~lW~KRl~~Vmy~~~~~llG~kEy~iS~d~~~~ 202 (366)
T KOG2796|consen 128 HAELQQYLGNPQESLDRLHKLKTVVSKILANLEQG---LAE--ESSIRLWRKRLGRVMYSMANCLLGMKEYVLSVDAYHS 202 (366)
T ss_pred HHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHHHhc---cch--hhHHHHHHHHHHHHHHHHHHHHhcchhhhhhHHHHHH
Confidence 44444555554422 33444444444555544443 333 455543 355667788888899999999999
Q ss_pred hcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHH--HHHHHHhhhhcHHHHHHH
Q psy398 77 KFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKA--YEGVYCLAIRQYCRAAEL 154 (162)
Q Consensus 77 ~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKv--y~gl~~l~~R~fk~AA~l 154 (162)
.+ ++...... -+.=.+-||++.+ +|.+.+++|.+..++.-.+-+|+. |+.-| ..++-++.+.||..|-.-
T Consensus 203 vi-~~~~e~~p--~L~s~Lgr~~MQ~-GD~k~a~~yf~~vek~~~kL~~~q----~~~~V~~n~a~i~lg~nn~a~a~r~ 274 (366)
T KOG2796|consen 203 VI-KYYPEQEP--QLLSGLGRISMQI-GDIKTAEKYFQDVEKVTQKLDGLQ----GKIMVLMNSAFLHLGQNNFAEAHRF 274 (366)
T ss_pred HH-HhCCcccH--HHHHHHHHHHHhc-ccHHHHHHHHHHHHHHHhhhhccc----hhHHHHhhhhhheecccchHHHHHH
Confidence 99 66544333 3456678999999 999999999885554322111222 22222 334445667778777776
Q ss_pred hHhhc
Q psy398 155 FVDVI 159 (162)
Q Consensus 155 fld~~ 159 (162)
|-+++
T Consensus 275 ~~~i~ 279 (366)
T KOG2796|consen 275 FTEIL 279 (366)
T ss_pred Hhhcc
Confidence 65544
No 229
>KOG1156|consensus
Probab=25.69 E-value=4e+02 Score=26.01 Aligned_cols=92 Identities=13% Similarity=0.051 Sum_probs=50.1
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARN 132 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrn 132 (162)
.|.-+|-++-..-|+.+|+|||+.+++=......--=|+.+-.|.|+-+- + +..-=..+++- ---.|-|
T Consensus 77 CwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmRd~~-~-------~~~tr~~LLql---~~~~ra~ 145 (700)
T KOG1156|consen 77 CWHVLGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMRDYE-G-------YLETRNQLLQL---RPSQRAS 145 (700)
T ss_pred hHHHHHHHHhhhhhHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHHHhhh-h-------HHHHHHHHHHh---hhhhHHH
Confidence 46677778888889999999999988222233333336666666555433 1 00001111111 0001223
Q ss_pred HHHHHHHHHHhhhhcHHHHHHHhHh
Q psy398 133 KLKAYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 133 rLKvy~gl~~l~~R~fk~AA~lfld 157 (162)
.+. .+++.-..++++.|.+..=.
T Consensus 146 w~~--~Avs~~L~g~y~~A~~il~e 168 (700)
T KOG1156|consen 146 WIG--FAVAQHLLGEYKMALEILEE 168 (700)
T ss_pred HHH--HHHHHHHHHHHHHHHHHHHH
Confidence 333 35566667788888776543
No 230
>PF02071 NSF: Aromatic-di-Alanine (AdAR) repeat ; InterPro: IPR000744 Regulated exocytosis of neurotransmitters and hormones, as well as intracellular traffic, requires fusion of two lipid bilayers. SNARE proteins are thought to form a protein bridge, the SNARE complex, between an incoming vesicle and the acceptor compartment. SNARE proteins contribute to the specificity of membrane fusion, implying that the mechanisms by which SNAREs are targeted to subcellular compartments are important for specific docking and fusion of vesicles. This mechanism involves a family of conserved proteins, members of which appear to function at all sites of constitutive and regulated secretion in eukaryotes []. Among them are 2 types of cytosolic protein, NSF (N-ethyl-maleimide-sensitive protein) and the SNAPs (alpha-, beta- and gamma-soluble NSF attachment proteins). The yeast vesicular fusion protein, sec17, a cytoplasmic peripheral membrane protein involved in vesicular transport between the endoplasmic reticulum and the golgi apparatus, shows a high degree of sequence similarity to the alpha-SNAP family. SNAP-25 and its non-neuronal homologue Syndet/SNAP-23 are synthesized as soluble proteins in the cytosol. Both SNAP-25 and Syndet/SNAP-23 are palmitoylated at cysteine residues clustered in a loop between two N- and C-terminal coils and palmitoylation is essential for membrane binding and plasma membrane targeting. The C-terminal and the N-terminal helices of SNAP-25, are each targeted to the plasma membrane by two distinct cysteine-rich domains and appear to regulate the availability of SNAP to form complexes with SNARE [].; GO: 0006886 intracellular protein transport
Probab=25.32 E-value=39 Score=15.56 Aligned_cols=10 Identities=40% Similarity=0.660 Sum_probs=7.1
Q ss_pred hcHHHHHHHh
Q psy398 146 RQYCRAAELF 155 (162)
Q Consensus 146 R~fk~AA~lf 155 (162)
..|.+||++|
T Consensus 3 ~~y~~Aa~~y 12 (12)
T PF02071_consen 3 KCYEKAAECY 12 (12)
T ss_pred HHHHHHHhhC
Confidence 4577888776
No 231
>KOG1941|consensus
Probab=25.24 E-value=71 Score=29.35 Aligned_cols=80 Identities=11% Similarity=0.087 Sum_probs=56.0
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhc--CCHHHHHHHHHHHHhhhhhcCCCchhh
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHG--CDIQAMTAAIEKATELIEGISGGDWSA 130 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~--~d~~~v~~~i~ka~~l~e~~~ggDW~r 130 (162)
++.-++.=+...|-++.+++.|++++ .+.+.++..+==+=-.+-+|-+|+ +|+.....+..||-.++....-+||..
T Consensus 124 ~~l~~~~Ahlgls~fq~~Lesfe~A~-~~A~~~~D~~LElqvcv~Lgslf~~l~D~~Kal~f~~kA~~lv~s~~l~d~~~ 202 (518)
T KOG1941|consen 124 VSLSMGNAHLGLSVFQKALESFEKAL-RYAHNNDDAMLELQVCVSLGSLFAQLKDYEKALFFPCKAAELVNSYGLKDWSL 202 (518)
T ss_pred hhhhHHHHhhhHHHHHHHHHHHHHHH-HHhhccCCceeeeehhhhHHHHHHHHHhhhHHhhhhHhHHHHHHhcCcCchhH
Confidence 34456777778888999999999999 887777665422222233444444 788888888999999876544479985
Q ss_pred hhH
Q psy398 131 RNK 133 (162)
Q Consensus 131 rnr 133 (162)
+-|
T Consensus 203 kyr 205 (518)
T KOG1941|consen 203 KYR 205 (518)
T ss_pred HHH
Confidence 533
No 232
>PHA02774 E1; Provisional
Probab=25.12 E-value=3.5e+02 Score=26.01 Aligned_cols=126 Identities=15% Similarity=0.162 Sum_probs=84.4
Q ss_pred CHHHHHHHHHHHHHHHH----HHHHHHH----hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHH
Q psy398 17 DEKFYTEMKDANEKRLK----ELDLEHE----KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHR 88 (162)
Q Consensus 17 D~~~~~~~~~~n~~~l~----~Le~~lk----~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~k 88 (162)
=++|+....--+-+-.+ +|..-+. +|+. +|+ ++|-||+...|-+.--++|-+.. ....=...-
T Consensus 299 ~PeWI~~Qt~i~h~~~~~~~FdlS~MVQWAyDn~~~--dEs------~iay~ya~~a~~d~NA~afl~sn-~Q~k~vkdc 369 (613)
T PHA02774 299 LPEWIARQTLLSHQLAEAEQFDLSKMVQWAYDNDYT--DES------EIAYEYALLADEDSNAAAFLKSN-NQAKYVKDC 369 (613)
T ss_pred CcHHHHHHHHhccccccCCceecceeeeeccCCCCc--chh------hHHHHHHHhhcccHHHHHHHhcc-cchHHHHHH
Confidence 35677554433333212 2333333 7777 777 78999999999888888888777 555556777
Q ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhhcCCCchhhhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 89 IDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEGISGGDWSARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 89 iD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
.+||.+.-|.-+-. .. +...|.+-=.-+++ +|||-.--++=-|+|+-.+ .|-.+-+.|+..+|
T Consensus 370 ~~mvr~ykr~em~~-ms---~~~~i~~r~~~~~~--~g~w~~iv~fL~~q~v~~~---~fl~~lk~~l~~~P 432 (613)
T PHA02774 370 ATMVRHYKRAEMRE-MS---MSQWIKKRCDKVEG--EGDWKPIVKFLRYQGVEFI---SFLTALKDFLKGIP 432 (613)
T ss_pred HHHHHHHHHHHHHH-hC---HHHHHHHHHhhcCC--CCCHHHHHHHHHccCccHH---HHHHHHHHHHhcCC
Confidence 88888888876555 33 33444443333444 7899988888888888665 58888888887665
No 233
>PF07889 DUF1664: Protein of unknown function (DUF1664); InterPro: IPR012458 The members of this family are hypothetical plant proteins of unknown function. The region featured in this family is approximately 100 amino acids long.
Probab=24.31 E-value=3.2e+02 Score=20.76 Aligned_cols=62 Identities=8% Similarity=0.163 Sum_probs=38.1
Q ss_pred HHhcCCCcCH--HHHHHHHHHHHHHHHHHHHHHH-----hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHH
Q psy398 9 CEEFGWKMDE--KFYTEMKDANEKRLKELDLEHE-----KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFK 72 (162)
Q Consensus 9 ~~~~~~~~D~--~~~~~~~~~n~~~l~~Le~~lk-----~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ 72 (162)
|+..+-.+|+ +-+..+++.=.++++.|+.++. ...+ +++|-....++...-..++....++.
T Consensus 45 ~~~v~kql~~vs~~l~~tKkhLsqRId~vd~klDe~~ei~~~i--~~eV~~v~~dv~~i~~dv~~v~~~V~ 113 (126)
T PF07889_consen 45 VASVSKQLEQVSESLSSTKKHLSQRIDRVDDKLDEQKEISKQI--KDEVTEVREDVSQIGDDVDSVQQMVE 113 (126)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHH--HHHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 4444444443 3366778888889999999988 5556 66666666666555555444444443
No 234
>PF03037 KMP11: Kinetoplastid membrane protein 11; InterPro: IPR004132 Kinetoplastid membrane protein 11 is a major cell surface glycoprotein of the parasite Leishmania donovani. It stimulates T-cell proliferation and may play a role in the immunlogy of the dieases Leishmaniasis.; GO: 0006952 defense response, 0008284 positive regulation of cell proliferation
Probab=24.10 E-value=79 Score=22.27 Aligned_cols=31 Identities=29% Similarity=0.592 Sum_probs=20.5
Q ss_pred CchhHHHHHHhcCCCcCHHHHHHHHHHHHHHH
Q psy398 1 MAPYYKEVCEEFGWKMDEKFYTEMKDANEKRL 32 (162)
Q Consensus 1 m~p~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l 32 (162)
||.-|.....+++ .+|.++-..|++.|++=.
T Consensus 1 mattyeefaakld-rld~ef~kkm~eqn~kff 31 (90)
T PF03037_consen 1 MATTYEEFAAKLD-RLDAEFNKKMQEQNKKFF 31 (90)
T ss_pred CcccHHHHHHHHH-HHHHHHHHHHHHHHHhhh
Confidence 6677777666554 567777777777776543
No 235
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=24.00 E-value=1.4e+02 Score=17.82 Aligned_cols=26 Identities=4% Similarity=-0.012 Sum_probs=18.7
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
.+..+|+.-...++++.|.+=|.+++
T Consensus 3 v~~~Lgeisle~e~f~qA~~D~~~aL 28 (38)
T PF10516_consen 3 VYDLLGEISLENENFEQAIEDYEKAL 28 (38)
T ss_pred HHHHHHHHHHHhccHHHHHHHHHHHH
Confidence 45667777777777777777777665
No 236
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=23.89 E-value=1.3e+02 Score=16.10 Aligned_cols=27 Identities=11% Similarity=-0.119 Sum_probs=19.3
Q ss_pred HHHHhhHhHHHhh-c---CHHHHHHHHHHhc
Q psy398 52 GIWQAKLDYLCSI-G---DRAEAFKLADSKF 78 (162)
Q Consensus 52 ~~~~~~a~~~~~i-G---D~~~A~~ay~~~~ 78 (162)
++...+|.+|..- | |.+.|+..|.++-
T Consensus 2 ~a~~~lg~~~~~G~g~~~d~~~A~~~~~~Aa 32 (36)
T smart00671 2 EAQYNLGQMYEYGLGVKKDLEKALEYYKKAA 32 (36)
T ss_pred HHHHHHHHHHHcCCCCCcCHHHHHHHHHHHH
Confidence 3566777777543 3 8899999888864
No 237
>KOG1128|consensus
Probab=23.65 E-value=3.2e+02 Score=26.97 Aligned_cols=28 Identities=11% Similarity=0.367 Sum_probs=22.1
Q ss_pred HHHHhhHhHHHhhcCHHHHHHHHHHhcCC
Q psy398 52 GIWQAKLDYLCSIGDRAEAFKLADSKFAD 80 (162)
Q Consensus 52 ~~~~~~a~~~~~iGD~~~A~~ay~~~~~~ 80 (162)
..|-....+|+..|+..+|.+...+-. +
T Consensus 425 emw~~vi~CY~~lg~~~kaeei~~q~l-e 452 (777)
T KOG1128|consen 425 EMWDPVILCYLLLGQHGKAEEINRQEL-E 452 (777)
T ss_pred HHHHHHHHHHHHhcccchHHHHHHHHh-c
Confidence 346678888888888888888888777 5
No 238
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=23.47 E-value=3.6e+02 Score=21.06 Aligned_cols=42 Identities=7% Similarity=-0.166 Sum_probs=34.5
Q ss_pred HHHHHHhhHhHHHhhc-CHHHHHHHHHHhcCCCCCChHHHHHHH
Q psy398 50 VSGIWQAKLDYLCSIG-DRAEAFKLADSKFADKTNPKSHRIDAV 92 (162)
Q Consensus 50 Ir~~~~~~a~~~~~iG-D~~~A~~ay~~~~~~~~~s~g~kiD~~ 92 (162)
.=+....+|+-++..| +.++|..+|-+++ -=|..++.-|-|.
T Consensus 89 ~Fl~eV~~GE~L~~~g~~~~ega~hf~nAl-~Vc~qP~~LL~iy 131 (148)
T TIGR00985 89 FFLQEVQLGEELMAQGTNVDEGAVHFYNAL-KVYPQPQQLLSIY 131 (148)
T ss_pred HHHHHHHHHHHHHhCCCchHHHHHHHHHHH-HhCCCHHHHHHHH
Confidence 3445778999999999 9999999999999 8888887766544
No 239
>PF11272 DUF3072: Protein of unknown function (DUF3072); InterPro: IPR021425 This bacterial family of proteins has no known function.
Probab=23.34 E-value=1.2e+02 Score=20.11 Aligned_cols=34 Identities=29% Similarity=0.478 Sum_probs=25.7
Q ss_pred chhHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHH
Q psy398 2 APYYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLE 38 (162)
Q Consensus 2 ~p~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~ 38 (162)
+.|.+.||++.|-++|..+ ++....+.+++|+.+
T Consensus 21 ~syL~tL~e~Age~~~~~L---tkaeAs~rId~L~~~ 54 (57)
T PF11272_consen 21 ASYLKTLSEEAGEPFPDDL---TKAEASERIDELQAQ 54 (57)
T ss_pred HHHHHHHHHHhCCCCCCcc---cHHHHHHHHHHHHHH
Confidence 4588999999998888744 556677777777765
No 240
>PRK06585 holA DNA polymerase III subunit delta; Reviewed
Probab=23.33 E-value=4.6e+02 Score=22.13 Aligned_cols=91 Identities=14% Similarity=0.072 Sum_probs=64.1
Q ss_pred HHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH-------h--cCCCchHHHHHHH--------HhhHhHHHhhcCH
Q psy398 5 YKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE-------K--NMMDEEDQVSGIW--------QAKLDYLCSIGDR 67 (162)
Q Consensus 5 Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk-------~--Nlg~~~~eIr~~~--------~~~a~~~~~iGD~ 67 (162)
-+..+.+.|..+|.+-++.+-..-...+..+..+|+ . ... .+.|.... .++.+. .-.||.
T Consensus 151 i~~~~~~~g~~i~~~a~~~L~~~~g~dl~~l~~EleKL~ly~~~~~~It--~edV~~lv~~~~e~~if~l~da-i~~~~~ 227 (343)
T PRK06585 151 IDDELAEAGLRITPDARALLVALLGGDRLASRNEIEKLALYAHGKGEIT--LDDVRAVVGDASALSLDDAADA-ALAGDL 227 (343)
T ss_pred HHHHHHHCCCCCCHHHHHHHHHHhCCCHHHHHHHHHHHHHhcCCCCCCC--HHHHHHHhCCcccccHHHHHHH-HHCCCH
Confidence 345577789999999999988888777777777777 1 233 56665542 222222 446799
Q ss_pred HHHHHHHHHhcCCCCCChHHHHHHHHHHHHHH
Q psy398 68 AEAFKLADSKFADKTNPKSHRIDAVFTLFRIA 99 (162)
Q Consensus 68 ~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~ 99 (162)
..|++.+.+.+ .....+-..+-++...+|.=
T Consensus 228 ~~a~~~l~~ll-~~g~~p~~il~~L~~~~r~L 258 (343)
T PRK06585 228 AAFERALDRAL-AEGTAPVLILRAALRHFQRL 258 (343)
T ss_pred HHHHHHHHHHH-HcCCCHHHHHHHHHHHHHHH
Confidence 99999999999 76667766777766666653
No 241
>PRK14951 DNA polymerase III subunits gamma and tau; Provisional
Probab=23.31 E-value=6.6e+02 Score=24.02 Aligned_cols=96 Identities=10% Similarity=0.025 Sum_probs=59.1
Q ss_pred hhHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHH----HH---hcCCCchHHHHH--------HHHhhHhHHHhhcCH
Q psy398 3 PYYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLE----HE---KNMMDEEDQVSG--------IWQAKLDYLCSIGDR 67 (162)
Q Consensus 3 p~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~----lk---~Nlg~~~~eIr~--------~~~~~a~~~~~iGD~ 67 (162)
.+.+.++.+.+..+|.+-++.+-+...-.+-.+... +. .... .+.|+. -..++.+...+ ||.
T Consensus 190 ~~L~~i~~~egi~ie~~AL~~La~~s~GslR~al~lLdq~ia~~~~~It--~~~V~~~Lg~~~~~~i~~LldaL~~-~d~ 266 (618)
T PRK14951 190 EHLTQVLAAENVPAEPQALRLLARAARGSMRDALSLTDQAIAFGSGQLQ--EAAVRQMLGSVDRSHVFRLIDALAQ-GDG 266 (618)
T ss_pred HHHHHHHHHcCCCCCHHHHHHHHHHcCCCHHHHHHHHHHHHHhcCCCcC--HHHHHHHHcCCCHHHHHHHHHHHHc-CCH
Confidence 456667777788888876666555443333222222 22 1122 344443 22334444444 799
Q ss_pred HHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHh
Q psy398 68 AEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFH 102 (162)
Q Consensus 68 ~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~ 102 (162)
..+++.+..+. +........++.++..+|=-+..
T Consensus 267 ~~al~~l~~l~-~~G~~~~~il~~l~~~~~~~~~~ 300 (618)
T PRK14951 267 RTVVETADELR-LNGLSAASTLEEMAAVLQRMAVL 300 (618)
T ss_pred HHHHHHHHHHH-HcCCCHHHHHHHHHHHHHHHHHH
Confidence 99999999999 88888888888887777755544
No 242
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=23.07 E-value=2.8e+02 Score=20.45 Aligned_cols=35 Identities=14% Similarity=0.056 Sum_probs=30.9
Q ss_pred CCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 44 MDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 44 g~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
|++.+.-+.++..+|.=|-..|+.++|+..++...
T Consensus 31 gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~ 65 (120)
T PF12688_consen 31 GLSGADRRRALIQLASTLRNLGRYDEALALLEEAL 65 (120)
T ss_pred CCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 44466777899999999999999999999999988
No 243
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=23.03 E-value=4.4e+02 Score=21.81 Aligned_cols=72 Identities=14% Similarity=0.064 Sum_probs=51.6
Q ss_pred HHHHHHHHhhHhHHHhhc-CHHHHHHHHHHhcCCCCCC-----------hHHHHHHHHHHHHHHHHhcCCHHHHHHHHHH
Q psy398 48 DQVSGIWQAKLDYLCSIG-DRAEAFKLADSKFADKTNP-----------KSHRIDAVFTLFRIAYFHGCDIQAMTAAIEK 115 (162)
Q Consensus 48 ~eIr~~~~~~a~~~~~iG-D~~~A~~ay~~~~~~~~~s-----------~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~k 115 (162)
+++-+...+.|.-..+.+ +++.|.+..++++ +.+.. ...++=++-.+++.-+.. ++...+.+.+.-
T Consensus 32 ~~La~~~yn~G~~l~~~~~~~~~a~~wL~~a~-~~l~~~~~~~~~~~~~~elr~~iL~~La~~~l~~-~~~~~~~ka~~~ 109 (278)
T PF08631_consen 32 EELARVCYNIGKSLLSKKDKYEEAVKWLQRAY-DILEKPGKMDKLSPDGSELRLSILRLLANAYLEW-DTYESVEKALNA 109 (278)
T ss_pred HHHHHHHHHHHHHHHHcCCChHHHHHHHHHHH-HHHHhhhhccccCCcHHHHHHHHHHHHHHHHHcC-CChHHHHHHHHH
Confidence 677778888888888888 9999999988888 76422 245666777777878777 777666555544
Q ss_pred HHhhhh
Q psy398 116 ATELIE 121 (162)
Q Consensus 116 a~~l~e 121 (162)
.+.+..
T Consensus 110 l~~l~~ 115 (278)
T PF08631_consen 110 LRLLES 115 (278)
T ss_pred HHHHHH
Confidence 444433
No 244
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=22.83 E-value=3.5e+02 Score=24.31 Aligned_cols=57 Identities=12% Similarity=0.170 Sum_probs=45.9
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHH
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIE 114 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ 114 (162)
.+.-.|+|+.+.|+.+.|++...++. .- . -.-.+..+.+.++.+.. +||....-.++
T Consensus 236 LL~~Qa~fLl~k~~~~lAL~iAk~av-~l-s--P~~f~~W~~La~~Yi~~-~d~e~ALlaLN 292 (395)
T PF09295_consen 236 LLNLQAEFLLSKKKYELALEIAKKAV-EL-S--PSEFETWYQLAECYIQL-GDFENALLALN 292 (395)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHH-Hh-C--chhHHHHHHHHHHHHhc-CCHHHHHHHHh
Confidence 45578999999999999999999988 32 2 34577889999999999 99988766665
No 245
>cd06571 Bac_DnaA_C C-terminal domain of bacterial DnaA proteins. The DNA-binding C-terminal domain of DnaA contains a helix-turn-helix motif that specifically interacts with the DnaA box, a 9-mer motif that occurs repetitively in the replication origin oriC. Multiple copies of DnaA, which is an ATPase, bind to 9-mers at the origin and form an initial complex in which the DNA strands are being separated in an ATP-dependent step.
Probab=22.39 E-value=2.3e+02 Score=19.54 Aligned_cols=36 Identities=28% Similarity=0.317 Sum_probs=28.1
Q ss_pred HHHHHHHhc-CCHHHHHHHHHHHHhhhhhcCCCchhhhhHH
Q psy398 95 LFRIAYFHG-CDIQAMTAAIEKATELIEGISGGDWSARNKL 134 (162)
Q Consensus 95 ~iRv~~f~~-~d~~~v~~~i~ka~~l~e~~~ggDW~rrnrL 134 (162)
+-.||=+|| .|++.|...+.+.+..+++ |.+-+..+
T Consensus 47 ~~~Ig~~fg~r~hStV~~a~~ri~~~~~~----d~~~~~~v 83 (90)
T cd06571 47 LPEIGRAFGGRDHSTVLHAVRKIEELLEE----DPELKEDV 83 (90)
T ss_pred HHHHHHHhCCCCHhHHHHHHHHHHHHHHh----CHHHHHHH
Confidence 445788887 7999999999999999975 55555444
No 246
>KOG1760|consensus
Probab=22.36 E-value=3.2e+02 Score=21.05 Aligned_cols=41 Identities=27% Similarity=0.188 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcC
Q psy398 19 KFYTEMKDANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGD 66 (162)
Q Consensus 19 ~~~~~~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD 66 (162)
+-++.+++.-.++++.|+.+++. .+-||.- -++..|++.||
T Consensus 84 ~~LEe~ke~l~k~i~~les~~e~------I~~~m~~-LK~~LYaKFgd 124 (131)
T KOG1760|consen 84 DQLEEKKETLEKEIEELESELES------ISARMDE-LKKVLYAKFGD 124 (131)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH------HHHHHHH-HHHHHHHHhcc
Confidence 45677777888888888888552 1222222 36778888887
No 247
>PF08559 Cut8_C: Cut8 six-helix bundle; InterPro: IPR013868 In Schizosaccharomyces pombe (Fission yeast), Cut8 is a nuclear envelope protein that physically interacts with and tethers 26S proteasome in the nucleus resulting in the nuclear accumulation of proteasomes []. Cut8 is a proteasome substrate and amino terminal residues 1-72 are polyubiquitinated and function as a degron tag. Ubiquitination of the amino terminal is essential to the function of Cut8. Lysine residues in the amino terminal 72 amino acids of Cut8 are required for physical interaction with the proteasome. In fission yeast the function of Cut8 has been demonstrated to be regulated by ubiquitin-conjugating Rhp6/Ubc2/Rad6 and ligating enzymes Ubr1. Cut8 homologs have been identified in Drosophila melanogaster (Fruit fly), Anopheles gambiae (African malaria mosquito) and Dictyostelium discoideum (Slime mold). ; PDB: 3Q5W_A 3Q5X_A.
Probab=22.21 E-value=3.7e+02 Score=20.65 Aligned_cols=40 Identities=18% Similarity=0.197 Sum_probs=28.9
Q ss_pred CCHHHHHHHHHHHHhhhhhcCCCchh----hhhHHHHHHHHHHhhh
Q psy398 104 CDIQAMTAAIEKATELIEGISGGDWS----ARNKLKAYEGVYCLAI 145 (162)
Q Consensus 104 ~d~~~v~~~i~ka~~l~e~~~ggDW~----rrnrLKvy~gl~~l~~ 145 (162)
.+|...-+++..|..++-+ =.+|+ +.-|-++|+.|...-.
T Consensus 60 ~~~~~sL~fl~~at~~v~~--LP~w~~~~~n~~k~~~ye~ls~~w~ 103 (143)
T PF08559_consen 60 QQWSTSLEFLDEATNIVHK--LPNWDNQSHNYYKDKCYEQLSNAWA 103 (143)
T ss_dssp --HHHHHHHHHHHHHHHHT--S---SSCGGGHHHHHHHHHHHHHHH
T ss_pred cCHHHHHHHHHHHHHHHHH--CCCCCCHHHHHHHHHHHHHHHHHHH
Confidence 6999999999999999987 88998 4467778998876543
No 248
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=22.17 E-value=5.1e+02 Score=24.00 Aligned_cols=58 Identities=9% Similarity=0.013 Sum_probs=37.8
Q ss_pred HHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHH
Q psy398 51 SGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEK 115 (162)
Q Consensus 51 r~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~k 115 (162)
-++|..+|.++...|+.++|.+.|.++. .-..+.- .+.++.=++|+ .|.+.|.=++.+
T Consensus 453 ~~a~~~lG~~~~~~G~~~eA~~~~~~A~-~L~P~~p-----t~~~~~~~~f~-~~~~~~~~~~~~ 510 (517)
T PRK10153 453 WLNYVLLGKVYELKGDNRLAADAYSTAF-NLRPGEN-----TLYWIENLVFQ-TSVETVVPYLYR 510 (517)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHH-hcCCCCc-----hHHHHHhcccc-ccHHHHHHHHHh
Confidence 3577777777777777777777777766 3333322 35666777777 777776655544
No 249
>KOG1258|consensus
Probab=22.15 E-value=7e+02 Score=23.88 Aligned_cols=105 Identities=15% Similarity=0.298 Sum_probs=0.0
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhh--cCCCchh--
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEG--ISGGDWS-- 129 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~--~~ggDW~-- 129 (162)
|.+.|+|=.++|..+++.+.|++.. .+.-+.+|+.++..+...--.+|-..|....+.|...+.. .|+.=|+
T Consensus 82 W~kfA~~E~klg~~~~s~~Vfergv----~aip~SvdlW~~Y~~f~~n~~~d~~~lr~~fe~A~~~vG~dF~S~~lWdky 157 (577)
T KOG1258|consen 82 WKKFADYEYKLGNAENSVKVFERGV----QAIPLSVDLWLSYLAFLKNNNGDPETLRDLFERAKSYVGLDFLSDPLWDKY 157 (577)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHH----HhhhhHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhcccchhccHHHHHH
Q ss_pred ---------hhhHHHHHHHHHHhhhhcHHHHHHHhHhhccCC
Q psy398 130 ---------ARNKLKAYEGVYCLAIRQYCRAAELFVDVIRSR 162 (162)
Q Consensus 130 ---------rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~T~ 162 (162)
+++-..+|+=+--...-.|..==..|..-+-+.
T Consensus 158 ie~en~qks~k~v~~iyeRileiP~~~~~~~f~~f~~~l~~~ 199 (577)
T KOG1258|consen 158 IEFENGQKSWKRVANIYERILEIPLHQLNRHFDRFKQLLNQN 199 (577)
T ss_pred HHHHhccccHHHHHHHHHHHHhhhhhHhHHHHHHHHHHHhcC
No 250
>KOG4648|consensus
Probab=21.97 E-value=6.3e+02 Score=23.28 Aligned_cols=45 Identities=16% Similarity=0.025 Sum_probs=30.2
Q ss_pred HhhHhHHHhhcCHHHHHHHHHHhcCCCC---CChHHHHHHHHHHHHHH
Q psy398 55 QAKLDYLCSIGDRAEAFKLADSKFADKT---NPKSHRIDAVFTLFRIA 99 (162)
Q Consensus 55 ~~~a~~~~~iGD~~~A~~ay~~~~~~~~---~s~g~kiD~~l~~iRv~ 99 (162)
.+.|+-|.+-|.+++|+.||++..+-+. +..-.+.-..|.+=|.+
T Consensus 101 KE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA 148 (536)
T KOG4648|consen 101 KERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFA 148 (536)
T ss_pred HHhhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHHHHHH
Confidence 4689999999999999999999773333 33333443444444443
No 251
>PRK08691 DNA polymerase III subunits gamma and tau; Validated
Probab=21.93 E-value=7.6e+02 Score=24.22 Aligned_cols=92 Identities=10% Similarity=0.007 Sum_probs=57.3
Q ss_pred hhHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH-------hcCCCchHHHHH--------HHHhhHhHHHhhcCH
Q psy398 3 PYYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE-------KNMMDEEDQVSG--------IWQAKLDYLCSIGDR 67 (162)
Q Consensus 3 p~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk-------~Nlg~~~~eIr~--------~~~~~a~~~~~iGD~ 67 (162)
.+.+.+|..-++.+|.+-++.+-+...--+-.+...|. .... .+.|+. -..++.+.+.. ||.
T Consensus 185 ~~L~~Il~kEgi~id~eAL~~Ia~~A~GslRdAlnLLDqaia~g~g~It--~e~V~~lLG~~d~~~If~LldAL~~-~d~ 261 (709)
T PRK08691 185 DHLAHVLDSEKIAYEPPALQLLGRAAAGSMRDALSLLDQAIALGSGKVA--ENDVRQMIGAVDKQYLYELLTGIIN-QDG 261 (709)
T ss_pred HHHHHHHHHcCCCcCHHHHHHHHHHhCCCHHHHHHHHHHHHHhcCCCcC--HHHHHHHHcccCHHHHHHHHHHHHc-CCH
Confidence 34566777778888888877777666544433333332 1122 333333 24444555544 899
Q ss_pred HHHHHHHHHhcCCCCCChHHHHHHHHHHHHH
Q psy398 68 AEAFKLADSKFADKTNPKSHRIDAVFTLFRI 98 (162)
Q Consensus 68 ~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv 98 (162)
..+++.+..+. +......+.+.-++..+|=
T Consensus 262 ~~al~~l~~L~-~~G~d~~~~l~~L~~~l~~ 291 (709)
T PRK08691 262 AALLAKAQEMA-ACAVGFDNALGELAILLQQ 291 (709)
T ss_pred HHHHHHHHHHH-HhCCCHHHHHHHHHHHHHH
Confidence 99999999999 7777777776655555543
No 252
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=21.87 E-value=2.7e+02 Score=18.98 Aligned_cols=24 Identities=17% Similarity=-0.015 Sum_probs=19.2
Q ss_pred HhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 55 QAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 55 ~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
...|.=.-+.|++++|+.+|....
T Consensus 10 ~~~Ave~D~~g~y~eAl~~Y~~ai 33 (77)
T cd02683 10 LKRAVELDQEGRFQEALVCYQEGI 33 (77)
T ss_pred HHHHHHHHHhccHHHHHHHHHHHH
Confidence 345566678899999999999887
No 253
>PF10440 WIYLD: Ubiquitin-binding WIYLD domain; InterPro: IPR018848 This entry represents a presumed domain which has been predicted to contain three alpha helices. It was named the WIYLD domain based on the pattern of the ost conserved residues []. This domain appears to be specific to plant SET-domain proteins. ; GO: 0018024 histone-lysine N-methyltransferase activity
Probab=21.66 E-value=2.5e+02 Score=19.01 Aligned_cols=39 Identities=8% Similarity=-0.017 Sum_probs=28.7
Q ss_pred HHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHH
Q psy398 68 AEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMT 110 (162)
Q Consensus 68 ~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~ 110 (162)
+..-+|+..|. ..-.+..++.-++..++.+. |+||.++.
T Consensus 9 ~R~daA~dam~-~lG~~~~~v~~vl~~LL~lY---~~nW~lIE 47 (65)
T PF10440_consen 9 ERIDAALDAMR-QLGFSKKQVRPVLKNLLKLY---DGNWELIE 47 (65)
T ss_pred HHHHHHHHHHH-HcCCCHHHHHHHHHHHHHHH---cCCchhhh
Confidence 55666777777 77777777888888887765 57787776
No 254
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=21.38 E-value=5e+02 Score=21.92 Aligned_cols=97 Identities=20% Similarity=0.196 Sum_probs=65.4
Q ss_pred HHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhhhh-----------
Q psy398 54 WQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDIQAMTAAIEKATELIEG----------- 122 (162)
Q Consensus 54 ~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~~~v~~~i~ka~~l~e~----------- 122 (162)
..-.|..|...|+.+.|++...+ . .-+++..-.|.+.+-. +..++..+.+.++++.-|.
T Consensus 105 ~~~~A~i~~~~~~~~~AL~~l~~-----~----~~lE~~al~Vqi~L~~-~R~dlA~k~l~~~~~~~eD~~l~qLa~awv 174 (290)
T PF04733_consen 105 QLLAATILFHEGDYEEALKLLHK-----G----GSLELLALAVQILLKM-NRPDLAEKELKNMQQIDEDSILTQLAEAWV 174 (290)
T ss_dssp HHHHHHHHCCCCHHHHHHCCCTT-----T----TCHHHHHHHHHHHHHT-T-HHHHHHHHHHHHCCSCCHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHc-----c----CcccHHHHHHHHHHHc-CCHHHHHHHHHHHHhcCCcHHHHHHHHHHH
Confidence 34566778889999999874432 2 3368888888888888 8899999888888743111
Q ss_pred --cCCCchh---------------hhhHHHHHHHHHHhhhhcHHHHHHHhHhhcc
Q psy398 123 --ISGGDWS---------------ARNKLKAYEGVYCLAIRQYCRAAELFVDVIR 160 (162)
Q Consensus 123 --~~ggDW~---------------rrnrLKvy~gl~~l~~R~fk~AA~lfld~~~ 160 (162)
..|||-- ....+-.-.++.+|..|+|.+|-+++.+++.
T Consensus 175 ~l~~g~e~~~~A~y~f~El~~~~~~t~~~lng~A~~~l~~~~~~eAe~~L~~al~ 229 (290)
T PF04733_consen 175 NLATGGEKYQDAFYIFEELSDKFGSTPKLLNGLAVCHLQLGHYEEAEELLEEALE 229 (290)
T ss_dssp HHHHTTTCCCHHHHHHHHHHCCS--SHHHHHHHHHHHHHCT-HHHHHHHHHHHCC
T ss_pred HHHhCchhHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 0144311 1134445678889999999999999888763
No 255
>PF02255 PTS_IIA: PTS system, Lactose/Cellobiose specific IIA subunit; InterPro: IPR003188 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) [, ] is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC []. These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose [, , , ]. The lactose/cellobiose-specific family are one of four structurally and functionally distinct group IIA PTS system enzymes. This family of proteins normally function as a homotrimer, stabilised by a centrally located metal ion []. Separation into subunits is thought to occur after phosphorylation.; GO: 0005351 sugar:hydrogen symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane; PDB: 3L8R_D 2E2A_B 1E2A_C 3K1S_C 2LRK_C 2LRL_A 2WY2_A 1WCR_A 2WWV_C.
Probab=21.28 E-value=1.5e+02 Score=21.17 Aligned_cols=26 Identities=19% Similarity=0.202 Sum_probs=20.7
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHh
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld 157 (162)
.|-.++++|.....+||.+|-+++-+
T Consensus 14 Ars~~~eAl~~a~~~~fe~A~~~l~~ 39 (96)
T PF02255_consen 14 ARSLAMEALKAAREGDFEEAEELLKE 39 (96)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHH
Confidence 67788999999999999888887654
No 256
>PF10187 Nefa_Nip30_N: N-terminal domain of NEFA-interacting nuclear protein NIP30; InterPro: IPR019331 This is a the N-terminal 100 amino acids of a family of proteins conserved from plants to humans. The full-length protein has putatively been called NEFA-interacting nuclear protein NIP30, however no reference could be found to confirm this.
Probab=21.21 E-value=2e+02 Score=20.89 Aligned_cols=48 Identities=19% Similarity=0.447 Sum_probs=34.4
Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHH-hcC--CCchHHHHHHHHhhHhHHHhhcCHHHHHH
Q psy398 17 DEKFYTEMKDANEKRLKELDLEHE-KNM--MDEEDQVSGIWQAKLDYLCSIGDRAEAFK 72 (162)
Q Consensus 17 D~~~~~~~~~~n~~~l~~Le~~lk-~Nl--g~~~~eIr~~~~~~a~~~~~iGD~~~A~~ 72 (162)
...|++.+++.-.++.++.++.++ .|. |+.++| ..||-.+-....+.+
T Consensus 37 ~rsLye~LqenK~~Kq~efeE~~K~kn~~r~LDedE--------~eFLd~v~~~~~~~E 87 (102)
T PF10187_consen 37 GRSLYERLQENKAAKQEEFEEKHKLKNQFRGLDEDE--------IEFLDEVEEKKRAEE 87 (102)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHhcccccCCCCHHH--------HHHHHHHHHHHHHHH
Confidence 457899999999999999999999 666 333444 466666655544443
No 257
>PF01109 GM_CSF: Granulocyte-macrophage colony-stimulating factor; InterPro: IPR000773 Granulocyte-macrophage colony-stimulating factor (GMCSF) is a cytokine that acts in hematopoiesis to stimulate growth and differentiation of hematopoietic precursor cells from various lineages including granulocytes, macrophages, eosinophils and erythrocytes [, ]. GMCSF is a glycoprotein of ~120 residues that contains 4 conserved cysteines that participate in disulphide bond formation. The crystal structure of recombinant human GMCSF has been determined []. There are two molecules in the asymmetric unit, which are related by an approximate non-crystallographic 2-fold axis. The overall structure, which is highly compact and globular with a predominantly hydrophobic core, is characterised by a 4-alpha-helix bundle. The helices are arranged in a left-handed anti-parallel fashion, with two overhand connections. Within the connections is a two-stranded anti-parallel beta-sheet. The tertiary structure has a topology similar to that of Sus scrofa (pig) growth factor and interferon-beta. Most of the proposed critical regions for receptor binding are located on a continuous surface at one end of the molecule that includes the C terminus [].; GO: 0005129 granulocyte macrophage colony-stimulating factor receptor binding, 0008083 growth factor activity, 0006955 immune response, 0005576 extracellular region; PDB: 3CXE_B 1CSG_A 2GMF_B.
Probab=21.20 E-value=1.7e+02 Score=22.29 Aligned_cols=18 Identities=22% Similarity=0.294 Sum_probs=11.1
Q ss_pred CCHHHHHHHHHHHHhhhhh
Q psy398 104 CDIQAMTAAIEKATELIEG 122 (162)
Q Consensus 104 ~d~~~v~~~i~ka~~l~e~ 122 (162)
+-|.||.. |..|-+|++.
T Consensus 11 ~p~~hvda-ikeal~ll~~ 28 (122)
T PF01109_consen 11 RPWKHVDA-IKEALSLLNN 28 (122)
T ss_dssp --HHHHHH-HHHHHHHHHT
T ss_pred chHHHHHH-HHHHHHHHhc
Confidence 45777766 7777777653
No 258
>PRK14953 DNA polymerase III subunits gamma and tau; Provisional
Probab=21.16 E-value=6.5e+02 Score=23.10 Aligned_cols=97 Identities=18% Similarity=0.136 Sum_probs=61.5
Q ss_pred hHHHHHHhcCCCcCHHHHHHHHHHHHHHHHHHHHHHH-----hcCCCchHHHHHH--------HHhhHhHHHhhcCHHHH
Q psy398 4 YYKEVCEEFGWKMDEKFYTEMKDANEKRLKELDLEHE-----KNMMDEEDQVSGI--------WQAKLDYLCSIGDRAEA 70 (162)
Q Consensus 4 ~Y~~l~~~~~~~~D~~~~~~~~~~n~~~l~~Le~~lk-----~Nlg~~~~eIr~~--------~~~~a~~~~~iGD~~~A 70 (162)
+-+..|...++++|++-++.+-+...--+-.+...+. .+--.+.+.|+.. ..++.+.. ..||.+.|
T Consensus 186 ~L~~i~k~egi~id~~al~~La~~s~G~lr~al~~Ldkl~~~~~~~It~~~V~~~lg~~~~~~vf~Li~ai-~~~d~~~a 264 (486)
T PRK14953 186 YLKRICNEEKIEYEEKALDLLAQASEGGMRDAASLLDQASTYGEGKVTIKVVEEFLGIVSQESVRKFLNLL-LESDVDEA 264 (486)
T ss_pred HHHHHHHHcCCCCCHHHHHHHHHHcCCCHHHHHHHHHHHHHhcCCCcCHHHHHHHhCCCCHHHHHHHHHHH-HCCCHHHH
Confidence 5566677778888888777765554444443333333 1100114444442 22333332 35799999
Q ss_pred HHHHHHhcCCCCCChHHHHHHHHHHHHHHHHh
Q psy398 71 FKLADSKFADKTNPKSHRIDAVFTLFRIAYFH 102 (162)
Q Consensus 71 ~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~ 102 (162)
+..+..+. +........++++...+|--+..
T Consensus 265 l~~l~~L~-~~g~~~~~iL~~L~~~~rdll~~ 295 (486)
T PRK14953 265 IKFLRTLE-EKGYNLNKFWKQLEEEIRNILLN 295 (486)
T ss_pred HHHHHHHH-HcCCCHHHHHHHHHHHHHHHHHH
Confidence 99999999 88888888899998888865554
No 259
>PF02840 Prp18: Prp18 domain; InterPro: IPR004098 The splicing factor Prp18 is required for the second step of pre-mRNA splicing. PRP18 appears to be primarily associated with the U5 snRNP. The structure of a large fragment of the Saccharomyces cerevisiae Prp18 is known []. This fragment is fully active in yeast splicing in vitro and includes the sequences of Prp18 that have been evolutionarily conserved. The core structure consists of five alpha-helices that adopt a novel fold. The most highly conserved region of Prp18, a nearly invariant stretch of 19 aa, forms part of a loop between two alpha-helices and may interact with the U5 small nuclear ribonucleoprotein particles [].; GO: 0008380 RNA splicing, 0005681 spliceosomal complex; PDB: 1DVK_A.
Probab=21.12 E-value=4e+02 Score=20.72 Aligned_cols=52 Identities=12% Similarity=0.027 Sum_probs=39.3
Q ss_pred HHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhc
Q psy398 24 MKDANEKRLKELDLEHEKNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKF 78 (162)
Q Consensus 24 ~~~~n~~~l~~Le~~lk~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~ 78 (162)
+-..-..-+.-|=..|+.+-. .++|+..+.... ++++-+|+.+|..+|-++-
T Consensus 35 ~~~qtk~~l~PL~~~Lk~~~l--~~dil~~L~~Iv-~~~q~r~y~~And~Yl~Ls 86 (144)
T PF02840_consen 35 TYKQTKRYLKPLFKKLKKRTL--PEDILDSLATIV-YHLQQREYVKANDAYLKLS 86 (144)
T ss_dssp CHHHHHHHHHHHHHHHHCT-S---HHHHHHHHHHH-HHHCCCGHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHhCCC--CHHHHHHHHHHH-HHHHHhHHHHHHHHHHHHH
Confidence 344455567777777886667 799999999887 5588999999999999854
No 260
>PRK09591 celC cellobiose phosphotransferase system IIA component; Reviewed
Probab=21.03 E-value=1.4e+02 Score=21.81 Aligned_cols=26 Identities=8% Similarity=0.057 Sum_probs=21.3
Q ss_pred hHHHHHHHHHHhhhhcHHHHHHHhHh
Q psy398 132 NKLKAYEGVYCLAIRQYCRAAELFVD 157 (162)
Q Consensus 132 nrLKvy~gl~~l~~R~fk~AA~lfld 157 (162)
.|-.++++|-..-.+||.+|-+++-+
T Consensus 20 Ars~~~eAl~~ak~gdf~~A~~~l~e 45 (104)
T PRK09591 20 ARTEVHEAFAAMREGNFDLAEQKLNQ 45 (104)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 67788899999999999888877644
No 261
>PHA03025 hypothetical protein; Provisional
Probab=20.96 E-value=68 Score=21.72 Aligned_cols=22 Identities=27% Similarity=0.454 Sum_probs=18.7
Q ss_pred HHHHHHHHHHHHHHHHHHhcCC
Q psy398 23 EMKDANEKRLKELDLEHEKNMM 44 (162)
Q Consensus 23 ~~~~~n~~~l~~Le~~lk~Nlg 44 (162)
-+.+.|+++|.++=+++++|++
T Consensus 21 ~~~k~nakkl~ei~e~~knnmi 42 (68)
T PHA03025 21 DIGKENAKKLAEICEKIKNNMI 42 (68)
T ss_pred ehhhhhHHHHHHHHHhhhCcEE
Confidence 4678999999999999997765
No 262
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=20.84 E-value=1.5e+02 Score=21.51 Aligned_cols=44 Identities=18% Similarity=0.228 Sum_probs=27.8
Q ss_pred HHHhhHhHHHhhcCHHHHHHHHHHhc----CCCCCChHHHHHHHHHHH
Q psy398 53 IWQAKLDYLCSIGDRAEAFKLADSKF----ADKTNPKSHRIDAVFTLF 96 (162)
Q Consensus 53 ~~~~~a~~~~~iGD~~~A~~ay~~~~----~~~~~s~g~kiD~~l~~i 96 (162)
++..+-.-|.+.|+...|++.|.+.. ++..++++..+.-+..-|
T Consensus 98 ~~~~lm~~~~~~g~~~~A~~~Y~~~~~~l~~elg~~Ps~~~~~l~~~i 145 (146)
T PF03704_consen 98 AYRLLMRALAAQGRRAEALRVYERYRRRLREELGIEPSPETRALYREI 145 (146)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHHHS----HHHHHHHHHH
T ss_pred HHHHHHHHHHHCcCHHHHHHHHHHHHHHHHHHhCcCcCHHHHHHHHHh
Confidence 56677888999999999999999864 134556666665544433
No 263
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=20.58 E-value=5.2e+02 Score=22.30 Aligned_cols=112 Identities=19% Similarity=0.186 Sum_probs=78.5
Q ss_pred HHHHHHHHHHHHH-hcCCCchHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHHHhcCCH
Q psy398 28 NEKRLKELDLEHE-KNMMDEEDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAYFHGCDI 106 (162)
Q Consensus 28 n~~~l~~Le~~lk-~Nlg~~~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~f~~~d~ 106 (162)
.+-.+.++++-|- ..+. ++|--.-+.++|.+|-+.|=..-|.--|+.++ +...+|--+|+.+=|.+-.-+|+
T Consensus 43 qEV~iarlsqlL~~~~l~--~eeRA~l~fERGvlYDSlGL~~LAR~DftQaL-----ai~P~m~~vfNyLG~Yl~~a~~f 115 (297)
T COG4785 43 QEVILARMSQILASRALT--DEERAQLLFERGVLYDSLGLRALARNDFSQAL-----AIRPDMPEVFNYLGIYLTQAGNF 115 (297)
T ss_pred HHHHHHHHHHHHHhccCC--hHHHHHHHHHhcchhhhhhHHHHHhhhhhhhh-----hcCCCcHHHHHHHHHHHHhcccc
Confidence 3455777777777 6666 77766668999999999998887777666666 55667777888877766654788
Q ss_pred HHHHHHHHHHHhhhhhcCCCchhhhhHHHH--HHHHHHhhhhcHH
Q psy398 107 QAMTAAIEKATELIEGISGGDWSARNKLKA--YEGVYCLAIRQYC 149 (162)
Q Consensus 107 ~~v~~~i~ka~~l~e~~~ggDW~rrnrLKv--y~gl~~l~~R~fk 149 (162)
+..-...+-.=.+ | .+-+..++||==. |-|=|.|+++||-
T Consensus 116 daa~eaFds~~EL-D--p~y~Ya~lNRgi~~YY~gR~~LAq~d~~ 157 (297)
T COG4785 116 DAAYEAFDSVLEL-D--PTYNYAHLNRGIALYYGGRYKLAQDDLL 157 (297)
T ss_pred hHHHHHhhhHhcc-C--CcchHHHhccceeeeecCchHhhHHHHH
Confidence 7766655544333 2 2457778888544 6777778887774
No 264
>KOG2518|consensus
Probab=20.04 E-value=1.3e+02 Score=28.40 Aligned_cols=47 Identities=13% Similarity=0.034 Sum_probs=37.5
Q ss_pred hHHHHHHHHhhHhHHHhhcCHHHHHHHHHHhcCCCCCChHHHHHHHHHHHHHHH
Q psy398 47 EDQVSGIWQAKLDYLCSIGDRAEAFKLADSKFADKTNPKSHRIDAVFTLFRIAY 100 (162)
Q Consensus 47 ~~eIr~~~~~~a~~~~~iGD~~~A~~ay~~~~~~~~~s~g~kiD~~l~~iRv~~ 100 (162)
.-+-|....+.|..+.+-|+..+|.++|.++. +-|. +|.-.+|++.-
T Consensus 92 Rr~~R~~n~~~a~~ll~~G~~~~A~~~fqr~V-dIT~------~ma~~lI~~~r 138 (556)
T KOG2518|consen 92 RRERRKKNLDAAEQLLAEGKESNARECFQRCV-DITP------EMAHKLIQYLR 138 (556)
T ss_pred HHHHHHHhHHHHHHHHHcCCHHHHHHHHHHhc-cCcH------HHHHHHHHHHH
Confidence 45668889999999999999999999999999 6665 45555565543
Done!