Query psy11525
Match_columns 244
No_of_seqs 156 out of 484
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 20:38:41 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11525.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11525hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1524|consensus 100.0 3E-55 6.5E-60 404.2 20.9 241 1-241 496-736 (737)
2 KOG3616|consensus 100.0 3E-31 6.5E-36 252.9 14.7 209 1-229 377-612 (1636)
3 KOG1538|consensus 100.0 2E-27 4.3E-32 224.6 16.4 213 2-232 497-717 (1081)
4 KOG3617|consensus 99.8 9.6E-19 2.1E-23 169.4 11.1 144 89-232 739-885 (1416)
5 KOG2041|consensus 99.7 7.4E-17 1.6E-21 154.3 11.3 145 88-232 673-823 (1189)
6 PF04053 Coatomer_WDAD: Coatom 99.7 2.6E-14 5.7E-19 134.7 22.4 186 2-203 199-443 (443)
7 KOG0292|consensus 99.3 1.7E-11 3.7E-16 120.2 14.6 165 52-221 567-781 (1202)
8 KOG2041|consensus 99.2 3.3E-10 7.1E-15 109.4 13.5 152 88-239 770-944 (1189)
9 KOG3616|consensus 98.9 7.9E-09 1.7E-13 100.6 12.1 135 81-215 794-950 (1636)
10 KOG0276|consensus 98.9 2.3E-08 4.9E-13 95.6 13.2 129 89-217 648-777 (794)
11 KOG2247|consensus 98.6 1.9E-08 4.2E-13 94.2 2.8 114 108-222 4-118 (615)
12 KOG1538|consensus 98.1 2E-05 4.3E-10 76.5 9.9 148 84-234 664-833 (1081)
13 PF04053 Coatomer_WDAD: Coatom 97.5 0.024 5.3E-07 53.9 19.8 119 80-200 260-398 (443)
14 KOG1920|consensus 97.4 0.0011 2.3E-08 68.3 10.3 98 105-202 950-1053(1265)
15 KOG3617|consensus 97.0 0.011 2.4E-07 59.5 12.8 138 91-231 951-1105(1416)
16 PF04840 Vps16_C: Vps16, C-ter 97.0 0.016 3.5E-07 52.8 13.3 109 80-202 179-289 (319)
17 PRK11788 tetratricopeptide rep 97.0 0.017 3.7E-07 52.5 13.5 78 153-231 218-308 (389)
18 PF13429 TPR_15: Tetratricopep 97.0 0.0011 2.4E-08 58.3 4.9 136 88-232 18-173 (280)
19 TIGR00990 3a0801s09 mitochondr 96.8 0.054 1.2E-06 53.2 16.0 143 89-231 376-568 (615)
20 TIGR02521 type_IV_pilW type IV 96.7 0.075 1.6E-06 43.3 14.0 141 88-228 41-225 (234)
21 TIGR00990 3a0801s09 mitochondr 96.7 0.054 1.2E-06 53.2 15.2 136 89-232 342-494 (615)
22 KOG1920|consensus 96.7 0.018 3.9E-07 59.7 11.7 79 155-233 945-1027(1265)
23 TIGR02521 type_IV_pilW type IV 96.7 0.11 2.5E-06 42.2 14.9 79 153-231 103-195 (234)
24 TIGR02917 PEP_TPR_lipo putativ 96.6 0.071 1.5E-06 52.6 15.0 105 88-200 577-694 (899)
25 PLN03077 Protein ECB2; Provisi 96.5 0.036 7.9E-07 56.5 12.2 142 84-234 329-484 (857)
26 PLN03081 pentatricopeptide (PP 96.2 0.1 2.2E-06 52.1 13.4 152 82-233 263-454 (697)
27 PF11768 DUF3312: Protein of u 96.2 0.23 5.1E-06 48.0 15.1 117 2-123 302-449 (545)
28 PLN03218 maturation of RBCL 1; 96.2 0.12 2.6E-06 54.2 14.3 50 84-133 585-640 (1060)
29 PRK15174 Vi polysaccharide exp 96.1 0.26 5.6E-06 49.1 16.1 44 89-132 87-135 (656)
30 PRK11788 tetratricopeptide rep 96.1 0.11 2.3E-06 47.4 12.3 45 88-132 45-94 (389)
31 TIGR02917 PEP_TPR_lipo putativ 96.1 0.31 6.7E-06 48.1 16.4 116 85-200 438-592 (899)
32 PRK12370 invasion protein regu 96.0 0.2 4.4E-06 48.7 14.6 75 92-175 318-398 (553)
33 PRK15359 type III secretion sy 96.0 0.084 1.8E-06 42.1 9.6 79 88-174 34-117 (144)
34 PLN03081 pentatricopeptide (PP 95.9 0.22 4.7E-06 49.7 14.6 153 80-232 362-555 (697)
35 KOG2114|consensus 95.9 0.57 1.2E-05 47.5 17.0 171 63-233 319-530 (933)
36 COG5010 TadD Flp pilus assembl 95.7 0.11 2.3E-06 45.9 9.7 107 88-202 110-229 (257)
37 PRK15363 pathogenicity island 95.7 0.21 4.6E-06 41.0 10.8 83 83-173 36-127 (157)
38 PF12895 Apc3: Anaphase-promot 95.6 0.052 1.1E-06 38.9 6.5 76 90-174 1-83 (84)
39 PRK11189 lipoprotein NlpI; Pro 95.6 0.78 1.7E-05 40.9 15.4 46 88-133 74-124 (296)
40 PRK09782 bacteriophage N4 rece 95.6 0.38 8.3E-06 50.2 15.0 144 89-232 520-704 (987)
41 TIGR02552 LcrH_SycD type III s 95.4 0.12 2.5E-06 39.8 8.4 80 86-173 25-109 (135)
42 PRK15174 Vi polysaccharide exp 95.3 0.51 1.1E-05 47.0 14.5 152 81-232 45-239 (656)
43 PLN03218 maturation of RBCL 1; 95.3 0.38 8.2E-06 50.7 13.8 53 81-133 440-498 (1060)
44 PRK11447 cellulose synthase su 95.0 0.81 1.8E-05 48.5 15.5 144 88-231 471-663 (1157)
45 cd00189 TPR Tetratricopeptide 95.0 0.12 2.6E-06 34.9 6.5 81 87-175 9-94 (100)
46 PRK11447 cellulose synthase su 94.9 1.8 4E-05 45.9 17.9 145 88-232 38-208 (1157)
47 PRK10747 putative protoheme IX 94.8 0.37 8E-06 44.9 11.3 101 89-201 95-213 (398)
48 PF13429 TPR_15: Tetratricopep 94.7 0.48 1E-05 41.5 11.2 138 83-231 115-274 (280)
49 PLN03077 Protein ECB2; Provisi 94.6 0.55 1.2E-05 48.0 12.6 53 82-134 226-280 (857)
50 PRK12370 invasion protein regu 94.5 1.2 2.5E-05 43.5 14.3 105 89-202 349-468 (553)
51 PF09976 TPR_21: Tetratricopep 94.5 1.6 3.5E-05 34.4 12.9 86 84-174 17-110 (145)
52 KOG0985|consensus 94.4 0.52 1.1E-05 48.9 11.5 49 90-138 1087-1135(1666)
53 smart00299 CLH Clathrin heavy 93.8 1.2 2.6E-05 34.8 10.6 113 80-202 9-123 (140)
54 PLN03088 SGT1, suppressor of 93.8 0.35 7.6E-06 44.6 8.6 82 86-175 10-96 (356)
55 TIGR00540 hemY_coli hemY prote 93.6 2.1 4.6E-05 39.9 13.7 104 89-201 95-213 (409)
56 PF09976 TPR_21: Tetratricopep 93.4 0.77 1.7E-05 36.3 9.0 79 88-175 58-144 (145)
57 KOG1840|consensus 93.3 0.58 1.3E-05 45.4 9.5 140 87-231 292-475 (508)
58 TIGR00540 hemY_coli hemY prote 93.3 4.7 0.0001 37.5 15.5 112 112-232 89-214 (409)
59 TIGR03302 OM_YfiO outer membra 93.2 4.6 9.9E-05 34.0 14.3 48 86-133 41-96 (235)
60 PF13432 TPR_16: Tetratricopep 93.1 0.32 6.9E-06 32.7 5.5 47 87-133 6-57 (65)
61 PF14559 TPR_19: Tetratricopep 93.0 0.25 5.5E-06 33.3 4.9 46 89-134 2-52 (68)
62 PF13414 TPR_11: TPR repeat; P 93.0 0.38 8.2E-06 32.6 5.7 62 106-175 2-64 (69)
63 PRK09782 bacteriophage N4 rece 92.7 7.5 0.00016 40.9 17.2 144 89-232 487-670 (987)
64 PRK10049 pgaA outer membrane p 92.7 1.6 3.6E-05 44.2 12.2 120 87-215 319-470 (765)
65 CHL00033 ycf3 photosystem I as 92.6 1.1 2.3E-05 36.2 9.0 98 86-197 43-149 (168)
66 PRK15359 type III secretion sy 92.2 2.5 5.4E-05 33.5 10.5 94 96-199 14-116 (144)
67 KOG1125|consensus 92.0 0.21 4.6E-06 48.4 4.6 84 88-187 440-530 (579)
68 KOG3785|consensus 91.7 2.9 6.3E-05 39.0 11.2 132 84-230 28-176 (557)
69 KOG0276|consensus 91.6 14 0.0003 36.8 16.2 53 150-202 667-722 (794)
70 KOG2034|consensus 91.5 3.5 7.5E-05 42.2 12.5 145 84-230 364-583 (911)
71 PRK15179 Vi polysaccharide bio 91.5 2.2 4.8E-05 43.0 11.3 110 118-233 60-182 (694)
72 cd00189 TPR Tetratricopeptide 91.2 1.5 3.2E-05 29.3 7.2 59 109-175 2-60 (100)
73 PF09295 ChAPs: ChAPs (Chs5p-A 91.0 3.8 8.2E-05 38.5 11.7 108 81-200 171-293 (395)
74 PRK15331 chaperone protein Sic 90.9 1.9 4.1E-05 35.6 8.5 81 85-173 44-129 (165)
75 PF04733 Coatomer_E: Coatomer 90.9 0.66 1.4E-05 41.6 6.4 48 88-135 112-159 (290)
76 COG2956 Predicted N-acetylgluc 90.8 1.2 2.7E-05 40.8 7.9 102 87-201 44-167 (389)
77 KOG1840|consensus 90.8 2.4 5.1E-05 41.2 10.3 121 87-219 334-487 (508)
78 PRK10370 formate-dependent nit 90.7 1.9 4E-05 36.4 8.6 102 91-200 52-169 (198)
79 PF00637 Clathrin: Region in C 90.7 0.028 6E-07 44.3 -2.5 51 153-203 74-124 (143)
80 PF14938 SNAP: Soluble NSF att 90.4 1.6 3.4E-05 38.7 8.3 16 84-99 41-56 (282)
81 PF13371 TPR_9: Tetratricopept 90.4 0.84 1.8E-05 31.1 5.3 47 87-133 4-55 (73)
82 TIGR02795 tol_pal_ybgF tol-pal 90.3 3.6 7.8E-05 30.1 9.1 48 86-133 10-65 (119)
83 PRK10747 putative protoheme IX 90.3 1.9 4.2E-05 40.1 9.2 81 85-174 301-386 (398)
84 PRK02603 photosystem I assembl 90.3 2 4.3E-05 34.9 8.3 84 88-185 45-136 (172)
85 PF13432 TPR_16: Tetratricopep 89.4 0.76 1.7E-05 30.8 4.3 56 112-175 2-57 (65)
86 KOG0292|consensus 89.3 12 0.00025 38.8 13.9 108 80-200 592-723 (1202)
87 KOG2114|consensus 88.8 9.5 0.00021 39.0 12.9 123 109-231 420-547 (933)
88 KOG0550|consensus 88.3 2.3 4.9E-05 40.2 7.8 135 105-239 247-428 (486)
89 smart00299 CLH Clathrin heavy 88.2 2.3 5E-05 33.1 6.9 38 164-201 71-108 (140)
90 PF13424 TPR_12: Tetratricopep 88.1 0.63 1.4E-05 32.4 3.3 63 108-175 6-72 (78)
91 KOG1155|consensus 87.8 5.1 0.00011 38.5 9.9 85 105-200 430-532 (559)
92 KOG2063|consensus 87.7 6.8 0.00015 40.5 11.4 106 49-186 278-384 (877)
93 TIGR02795 tol_pal_ybgF tol-pal 87.7 8.8 0.00019 27.9 9.8 85 108-200 3-101 (119)
94 PLN03088 SGT1, suppressor of 87.6 4.9 0.00011 37.0 9.7 100 109-216 4-117 (356)
95 COG2956 Predicted N-acetylgluc 86.9 16 0.00035 33.7 12.2 125 89-225 152-302 (389)
96 PRK11189 lipoprotein NlpI; Pro 86.9 12 0.00027 33.2 11.7 45 88-132 108-157 (296)
97 KOG2280|consensus 86.9 12 0.00026 37.9 12.2 110 80-202 686-797 (829)
98 KOG2076|consensus 86.4 37 0.0008 35.1 15.5 106 81-196 143-262 (895)
99 PRK10049 pgaA outer membrane p 86.1 29 0.00064 35.2 15.1 44 89-132 26-74 (765)
100 TIGR03302 OM_YfiO outer membra 86.1 9.2 0.0002 32.1 10.0 33 88-120 80-120 (235)
101 PF12931 Sec16_C: Sec23-bindin 85.0 1.6 3.5E-05 39.0 5.0 43 171-216 4-47 (284)
102 cd05804 StaR_like StaR_like; a 84.9 4.6 9.9E-05 36.1 7.9 81 88-176 124-213 (355)
103 PRK14574 hmsH outer membrane p 84.4 22 0.00048 36.7 13.3 139 83-232 39-196 (822)
104 PRK15179 Vi polysaccharide bio 84.2 19 0.00042 36.4 12.6 47 89-135 67-114 (694)
105 PF03983 SHD1: SLA1 homology d 83.7 2.8 6E-05 29.7 4.6 42 49-90 21-62 (70)
106 PF14559 TPR_19: Tetratricopep 83.4 3.6 7.8E-05 27.5 5.1 23 153-175 29-51 (68)
107 COG3071 HemY Uncharacterized e 83.0 17 0.00036 34.1 10.7 105 89-201 95-213 (400)
108 PF10602 RPN7: 26S proteasome 82.7 5.3 0.00011 33.1 6.8 81 106-197 35-115 (177)
109 PRK14574 hmsH outer membrane p 82.6 18 0.00038 37.4 11.8 106 83-200 372-509 (822)
110 PRK10803 tol-pal system protei 82.6 14 0.00029 32.8 9.7 78 88-173 153-241 (263)
111 TIGR02552 LcrH_SycD type III s 82.4 8.2 0.00018 29.2 7.4 81 111-199 21-109 (135)
112 PF00400 WD40: WD domain, G-be 82.3 2.7 5.8E-05 24.9 3.7 24 2-25 14-38 (39)
113 KOG1129|consensus 81.6 1.6 3.5E-05 40.2 3.5 78 89-176 376-456 (478)
114 KOG0985|consensus 81.3 6.4 0.00014 41.3 7.9 76 155-232 1054-1131(1666)
115 PF04840 Vps16_C: Vps16, C-ter 81.2 20 0.00043 32.7 10.5 88 84-188 214-301 (319)
116 KOG2076|consensus 81.1 24 0.00053 36.3 11.8 106 86-199 422-550 (895)
117 cd05804 StaR_like StaR_like; a 80.8 30 0.00066 30.7 11.6 48 153-200 152-211 (355)
118 PF13414 TPR_11: TPR repeat; P 80.6 4.6 0.0001 27.0 4.9 46 87-132 12-63 (69)
119 CHL00033 ycf3 photosystem I as 80.1 16 0.00036 29.2 8.8 65 106-175 34-98 (168)
120 KOG3081|consensus 80.0 8.9 0.00019 34.4 7.5 50 86-135 116-165 (299)
121 PLN03098 LPA1 LOW PSII ACCUMUL 79.5 33 0.00072 32.9 11.6 47 88-134 85-139 (453)
122 COG4783 Putative Zn-dependent 77.9 41 0.00088 32.5 11.6 39 153-200 412-450 (484)
123 KOG4626|consensus 76.9 5 0.00011 40.0 5.4 45 88-132 126-175 (966)
124 PF07719 TPR_2: Tetratricopept 76.6 6.6 0.00014 22.3 4.1 26 108-133 2-27 (34)
125 PRK04841 transcriptional regul 76.2 41 0.00089 34.3 12.2 46 89-134 584-639 (903)
126 PF12895 Apc3: Anaphase-promot 75.8 8.3 0.00018 27.1 5.2 45 88-132 35-83 (84)
127 PF03704 BTAD: Bacterial trans 75.8 14 0.0003 28.8 7.0 47 88-134 72-123 (146)
128 PRK14720 transcript cleavage f 75.4 26 0.00056 36.5 10.3 47 83-129 36-87 (906)
129 KOG1586|consensus 75.4 17 0.00036 32.2 7.7 40 90-135 26-69 (288)
130 KOG3081|consensus 75.3 60 0.0013 29.2 11.3 63 154-216 113-180 (299)
131 PF05131 Pep3_Vps18: Pep3/Vps1 75.3 4.3 9.3E-05 32.8 3.9 39 179-217 107-145 (147)
132 COG3063 PilF Tfp pilus assembl 75.2 58 0.0012 28.7 14.5 146 85-230 42-231 (250)
133 PF13428 TPR_14: Tetratricopep 74.1 5.6 0.00012 24.7 3.5 26 108-133 2-27 (44)
134 PF10366 Vps39_1: Vacuolar sor 73.8 31 0.00068 26.2 8.2 61 127-202 6-66 (108)
135 PRK10370 formate-dependent nit 73.7 52 0.0011 27.5 11.1 81 153-233 77-172 (198)
136 PF12234 Rav1p_C: RAVE protein 73.4 49 0.0011 33.2 11.4 97 108-204 388-508 (631)
137 KOG1126|consensus 72.7 18 0.00039 35.9 8.1 126 106-231 420-583 (638)
138 KOG1155|consensus 72.7 35 0.00076 33.0 9.7 108 91-198 343-489 (559)
139 KOG1173|consensus 72.2 14 0.00031 36.2 7.2 78 90-175 426-515 (611)
140 KOG1585|consensus 72.1 70 0.0015 28.6 10.8 47 140-186 79-134 (308)
141 PF10602 RPN7: 26S proteasome 72.0 17 0.00036 30.1 6.8 83 85-179 43-143 (177)
142 PF07719 TPR_2: Tetratricopept 71.8 5.5 0.00012 22.6 2.9 22 154-175 6-27 (34)
143 PF13176 TPR_7: Tetratricopept 71.8 7 0.00015 23.2 3.4 20 155-174 5-24 (36)
144 KOG0307|consensus 71.4 1.4E+02 0.0031 31.6 14.8 59 162-222 487-545 (1049)
145 cd02577 PSTD1 PSTD1: Pseudouri 71.3 7.8 0.00017 35.4 5.1 43 157-199 157-206 (319)
146 PF10366 Vps39_1: Vacuolar sor 71.3 9.5 0.00021 29.1 4.8 32 159-190 36-67 (108)
147 PF15016 DUF4520: Domain of un 71.0 32 0.00069 25.3 7.2 62 9-89 13-75 (85)
148 KOG0553|consensus 70.5 11 0.00023 34.2 5.6 48 88-135 125-177 (304)
149 KOG0772|consensus 70.0 6.3 0.00014 38.2 4.2 36 2-38 320-356 (641)
150 PF00515 TPR_1: Tetratricopept 67.8 14 0.0003 21.1 4.0 26 108-133 2-27 (34)
151 COG5290 IkappaB kinase complex 67.0 23 0.00049 36.5 7.5 49 155-203 928-976 (1243)
152 PF13374 TPR_10: Tetratricopep 67.0 7.1 0.00015 23.1 2.7 21 154-174 7-27 (42)
153 KOG1585|consensus 65.7 30 0.00064 30.9 7.2 96 107-216 24-137 (308)
154 PF14938 SNAP: Soluble NSF att 65.5 21 0.00045 31.5 6.5 84 93-197 30-116 (282)
155 PF13428 TPR_14: Tetratricopep 65.4 11 0.00023 23.4 3.4 21 154-174 6-26 (44)
156 PF13176 TPR_7: Tetratricopept 65.1 13 0.00027 22.1 3.5 23 110-132 2-24 (36)
157 PF13371 TPR_9: Tetratricopept 65.0 28 0.00061 23.3 5.8 49 114-174 2-54 (73)
158 PF12894 Apc4_WD40: Anaphase-p 64.9 12 0.00026 24.2 3.6 26 2-27 14-40 (47)
159 PF07721 TPR_4: Tetratricopept 64.3 9.7 0.00021 21.0 2.7 20 154-173 6-25 (26)
160 PF13424 TPR_12: Tetratricopep 64.2 24 0.00053 24.0 5.4 21 154-174 10-30 (78)
161 PRK04841 transcriptional regul 64.0 44 0.00095 34.1 9.3 107 89-200 463-598 (903)
162 PRK10153 DNA-binding transcrip 63.7 35 0.00076 33.2 8.1 21 153-173 457-477 (517)
163 KOG2280|consensus 63.6 1.2E+02 0.0025 31.2 11.6 137 84-233 443-600 (829)
164 KOG2034|consensus 63.3 8.8 0.00019 39.4 4.0 48 175-222 358-405 (911)
165 PF13181 TPR_8: Tetratricopept 63.1 11 0.00024 21.4 3.0 23 153-175 5-27 (34)
166 COG3071 HemY Uncharacterized e 61.9 37 0.0008 31.9 7.5 59 105-172 326-384 (400)
167 PLN03098 LPA1 LOW PSII ACCUMUL 61.3 49 0.0011 31.7 8.4 62 105-175 73-138 (453)
168 PRK02603 photosystem I assembl 61.1 35 0.00075 27.4 6.6 64 107-175 35-98 (172)
169 KOG4626|consensus 61.0 1.1E+02 0.0023 31.1 10.7 22 153-174 392-413 (966)
170 COG3063 PilF Tfp pilus assembl 60.7 64 0.0014 28.4 8.3 78 89-174 114-198 (250)
171 KOG1126|consensus 60.4 43 0.00094 33.4 8.0 104 87-202 464-584 (638)
172 KOG0543|consensus 60.2 30 0.00064 32.6 6.6 87 88-175 218-317 (397)
173 PF04190 DUF410: Protein of un 58.6 33 0.0007 30.2 6.4 32 152-183 93-124 (260)
174 KOG2376|consensus 58.2 2E+02 0.0044 28.7 12.8 79 153-231 114-201 (652)
175 PF07163 Pex26: Pex26 protein; 57.7 1.5E+02 0.0032 26.9 11.3 127 82-216 39-196 (309)
176 KOG1156|consensus 57.4 1.5E+02 0.0033 29.9 11.1 43 91-133 54-101 (700)
177 PF08631 SPO22: Meiosis protei 57.4 10 0.00022 33.5 3.0 62 116-177 2-64 (278)
178 KOG2002|consensus 57.0 23 0.0005 36.9 5.6 104 89-200 623-741 (1018)
179 COG4235 Cytochrome c biogenesi 56.0 66 0.0014 29.0 7.9 30 105-134 154-183 (287)
180 KOG3060|consensus 55.3 35 0.00077 30.5 5.9 92 108-199 53-178 (289)
181 KOG2066|consensus 55.3 93 0.002 31.9 9.4 51 88-138 366-424 (846)
182 smart00028 TPR Tetratricopepti 54.5 24 0.00051 18.1 3.3 26 108-133 2-27 (34)
183 KOG0686|consensus 54.5 39 0.00085 32.1 6.3 31 107-137 150-180 (466)
184 PF07035 Mic1: Colon cancer-as 53.8 1.3E+02 0.0027 24.9 10.7 84 104-201 26-115 (167)
185 TIGR00756 PPR pentatricopeptid 53.6 19 0.00041 20.0 2.8 19 170-188 8-26 (35)
186 KOG3785|consensus 52.4 1.9E+02 0.0041 27.4 10.3 96 80-182 360-461 (557)
187 PF11817 Foie-gras_1: Foie gra 52.3 20 0.00044 31.1 4.0 50 87-138 187-236 (247)
188 PF04762 IKI3: IKI3 family; I 51.7 40 0.00087 35.3 6.6 63 113-176 864-928 (928)
189 TIGR02561 HrpB1_HrpK type III 51.6 55 0.0012 26.7 6.0 18 88-105 54-71 (153)
190 smart00320 WD40 WD40 repeats. 51.5 25 0.00053 18.4 3.1 24 2-25 15-39 (40)
191 KOG0307|consensus 51.3 1.1E+02 0.0024 32.4 9.4 109 83-192 492-631 (1049)
192 KOG0547|consensus 51.0 1.9E+02 0.0041 28.4 10.3 85 148-232 461-564 (606)
193 PF01535 PPR: PPR repeat; Int 50.1 17 0.00036 19.9 2.1 16 182-197 7-22 (31)
194 cd02576 PseudoU_synth_ScPUS7 P 50.0 28 0.0006 32.3 4.7 44 156-199 164-213 (371)
195 PF00637 Clathrin: Region in C 49.7 3.7 8E-05 32.0 -1.0 36 164-199 72-107 (143)
196 KOG2002|consensus 49.5 53 0.0011 34.4 6.8 81 88-173 280-366 (1018)
197 PF13812 PPR_3: Pentatricopept 48.9 38 0.00082 18.9 3.7 17 181-197 7-23 (34)
198 smart00668 CTLH C-terminal to 48.7 44 0.00096 21.5 4.4 35 185-219 11-51 (58)
199 PF11768 DUF3312: Protein of u 48.4 24 0.00052 34.5 4.1 42 88-129 504-545 (545)
200 KOG0553|consensus 48.3 72 0.0016 29.0 6.8 78 154-231 86-175 (304)
201 PF04097 Nic96: Nup93/Nic96; 48.3 1E+02 0.0022 30.7 8.7 101 81-187 327-452 (613)
202 PRK15180 Vi polysaccharide bio 47.6 1.3E+02 0.0029 29.5 8.8 80 114-201 296-383 (831)
203 PF12854 PPR_1: PPR repeat 46.8 33 0.00072 20.1 3.2 25 80-104 9-33 (34)
204 PF10300 DUF3808: Protein of u 45.9 2.1E+02 0.0046 27.3 10.2 92 90-186 245-345 (468)
205 smart00685 DM14 Repeats in fly 45.1 63 0.0014 22.1 4.6 41 162-202 8-48 (59)
206 KOG1128|consensus 45.0 77 0.0017 32.3 7.0 103 88-200 495-612 (777)
207 PF13174 TPR_6: Tetratricopept 44.9 24 0.00053 19.5 2.4 18 155-172 6-23 (33)
208 KOG1129|consensus 44.4 2.4E+02 0.0052 26.5 9.6 106 127-233 186-318 (478)
209 COG1729 Uncharacterized protei 42.8 1.7E+02 0.0037 26.1 8.3 85 83-174 146-240 (262)
210 PF12688 TPR_5: Tetratrico pep 42.3 1.6E+02 0.0034 22.7 9.3 20 155-174 81-100 (120)
211 PF12569 NARP1: NMDA receptor- 41.7 91 0.002 30.5 7.0 73 128-200 202-287 (517)
212 PF13041 PPR_2: PPR repeat fam 41.6 34 0.00073 21.5 2.8 17 171-187 12-28 (50)
213 COG4783 Putative Zn-dependent 41.1 3.5E+02 0.0075 26.3 12.0 78 87-175 315-400 (484)
214 KOG4499|consensus 40.1 1.3E+02 0.0029 26.7 7.0 78 3-80 161-252 (310)
215 PF09613 HrpB1_HrpK: Bacterial 39.7 71 0.0015 26.3 5.1 51 88-148 54-106 (160)
216 PF09295 ChAPs: ChAPs (Chs5p-A 39.6 65 0.0014 30.4 5.5 43 89-131 245-292 (395)
217 PF11349 DUF3151: Protein of u 38.8 70 0.0015 25.2 4.6 87 82-171 12-114 (129)
218 PF08625 Utp13: Utp13 specific 38.6 2E+02 0.0044 22.9 7.9 65 83-147 2-72 (141)
219 KOG4234|consensus 38.5 1.5E+02 0.0033 25.8 7.0 47 185-231 144-194 (271)
220 PF08450 SGL: SMP-30/Gluconola 37.9 1.9E+02 0.0041 24.3 7.8 77 2-80 136-223 (246)
221 PF02736 Myosin_N: Myosin N-te 36.5 88 0.0019 19.4 4.1 22 55-76 17-38 (42)
222 PF02064 MAS20: MAS20 protein 34.9 1.1E+02 0.0024 23.9 5.3 37 152-188 66-106 (121)
223 KOG1070|consensus 34.6 7E+02 0.015 27.9 12.3 135 89-230 1541-1695(1710)
224 KOG2796|consensus 34.3 2.3E+02 0.0051 25.7 7.7 90 105-201 210-312 (366)
225 KOG1586|consensus 34.1 3.4E+02 0.0074 24.2 9.4 39 107-145 54-99 (288)
226 KOG0772|consensus 33.4 38 0.00082 33.1 2.8 30 2-31 460-490 (641)
227 COG2976 Uncharacterized protei 33.3 1.9E+02 0.004 24.9 6.7 81 89-174 64-151 (207)
228 PF13763 DUF4167: Domain of un 32.7 59 0.0013 23.6 3.1 32 109-140 41-72 (80)
229 PF12688 TPR_5: Tetratrico pep 32.6 2.3E+02 0.005 21.8 8.9 79 112-200 6-100 (120)
230 KOG0495|consensus 32.3 4E+02 0.0087 27.3 9.6 43 90-132 663-710 (913)
231 PF02259 FAT: FAT domain; Int 31.8 1.3E+02 0.0029 26.5 6.1 31 106-136 145-175 (352)
232 PF12862 Apc5: Anaphase-promot 31.8 77 0.0017 23.0 3.8 22 176-203 55-76 (94)
233 cd00200 WD40 WD40 domain, foun 31.0 2.2E+02 0.0047 22.8 6.9 28 2-29 12-40 (289)
234 PF10516 SHNi-TPR: SHNi-TPR; 30.3 91 0.002 19.1 3.3 23 152-174 4-26 (38)
235 PF11817 Foie-gras_1: Foie gra 30.3 3.4E+02 0.0074 23.3 8.2 56 125-195 163-218 (247)
236 KOG2168|consensus 29.6 90 0.0019 32.2 4.9 31 154-184 627-657 (835)
237 PRK05441 murQ N-acetylmuramic 29.5 4.2E+02 0.009 23.8 10.0 83 126-209 209-295 (299)
238 KOG2066|consensus 29.2 3.4E+02 0.0075 28.0 8.7 118 85-202 512-688 (846)
239 COG2103 Predicted sugar phosph 29.1 1.5E+02 0.0033 26.7 5.7 47 164-210 247-294 (298)
240 COG2976 Uncharacterized protei 28.5 3.8E+02 0.0083 23.0 8.2 83 84-175 95-185 (207)
241 PF04762 IKI3: IKI3 family; I 28.0 61 0.0013 34.0 3.5 42 63-104 678-720 (928)
242 KOG2880|consensus 28.0 1.2E+02 0.0026 28.3 5.0 25 150-174 36-60 (424)
243 KOG0686|consensus 27.9 1.4E+02 0.0031 28.5 5.5 22 222-243 246-268 (466)
244 cd00200 WD40 WD40 domain, foun 27.5 3.2E+02 0.0069 21.8 7.6 27 2-28 222-249 (289)
245 COG3898 Uncharacterized membra 27.5 5.6E+02 0.012 24.6 9.7 82 112-202 125-215 (531)
246 PF05944 Phage_term_smal: Phag 27.4 2.9E+02 0.0063 21.9 6.6 56 142-200 12-73 (132)
247 PRK14720 transcript cleavage f 27.3 7.4E+02 0.016 26.2 11.1 91 123-220 119-218 (906)
248 KOG3060|consensus 27.3 2.1E+02 0.0045 25.7 6.2 44 89-132 131-179 (289)
249 PF12569 NARP1: NMDA receptor- 26.7 6.1E+02 0.013 24.8 12.0 77 88-172 204-285 (517)
250 PF04097 Nic96: Nup93/Nic96; 26.7 1.2E+02 0.0027 30.1 5.3 35 166-200 418-452 (613)
251 PF07575 Nucleopor_Nup85: Nup8 26.7 49 0.0011 32.4 2.5 53 92-144 406-462 (566)
252 PRK10803 tol-pal system protei 26.0 1.9E+02 0.0042 25.4 6.0 46 88-133 190-243 (263)
253 PF06977 SdiA-regulated: SdiA- 25.9 2.3E+02 0.005 24.8 6.4 24 2-25 24-47 (248)
254 PRK12570 N-acetylmuramic acid- 25.7 1.7E+02 0.0037 26.3 5.6 52 159-210 239-292 (296)
255 KOG0302|consensus 25.6 56 0.0012 30.7 2.5 24 2-25 350-375 (440)
256 PF10300 DUF3808: Protein of u 25.4 2.5E+02 0.0055 26.9 7.0 49 153-201 271-331 (468)
257 PF08662 eIF2A: Eukaryotic tra 25.0 1E+02 0.0022 25.5 3.9 27 2-28 103-133 (194)
258 TIGR00274 N-acetylmuramic acid 24.8 1.9E+02 0.0042 25.9 5.8 44 164-207 244-288 (291)
259 KOG2003|consensus 24.7 6.8E+02 0.015 24.6 10.0 45 89-134 501-551 (840)
260 KOG0290|consensus 24.5 1.5E+02 0.0032 27.1 4.8 28 2-29 153-182 (364)
261 COG1729 Uncharacterized protei 24.4 2E+02 0.0043 25.6 5.6 53 112-175 146-204 (262)
262 PF13525 YfiO: Outer membrane 24.4 2.4E+02 0.0053 23.3 6.1 47 86-132 13-67 (203)
263 KOG2247|consensus 24.3 1.6E+02 0.0035 28.8 5.3 67 5-76 515-584 (615)
264 PF08662 eIF2A: Eukaryotic tra 24.2 81 0.0018 26.2 3.1 17 2-18 62-78 (194)
265 PF15015 NYD-SP12_N: Spermatog 24.1 3.3E+02 0.0071 26.4 7.2 106 89-200 194-323 (569)
266 PRK15331 chaperone protein Sic 23.9 4.2E+02 0.009 21.9 9.5 101 112-221 42-153 (165)
267 KOG4340|consensus 23.9 1.7E+02 0.0037 27.0 5.2 56 80-135 12-72 (459)
268 PF13431 TPR_17: Tetratricopep 23.7 96 0.0021 18.1 2.5 23 105-127 11-33 (34)
269 KOG0889|consensus 23.3 4.8E+02 0.01 31.8 9.4 118 80-202 2415-2552(3550)
270 KOG2168|consensus 23.1 1E+02 0.0023 31.8 4.0 36 165-200 625-660 (835)
271 PF04049 APC8: Anaphase promot 22.4 1.2E+02 0.0027 24.1 3.7 45 182-226 81-127 (142)
272 COG3629 DnrI DNA-binding trans 22.3 3E+02 0.0064 24.7 6.4 49 88-136 163-216 (280)
273 PF01436 NHL: NHL repeat; Int 22.2 1.4E+02 0.0031 16.5 2.9 17 3-19 5-21 (28)
274 PF13838 Clathrin_H_link: Clat 22.2 1.1E+02 0.0024 21.3 2.9 23 82-104 10-32 (66)
275 COG3118 Thioredoxin domain-con 22.2 6E+02 0.013 23.1 16.1 99 88-186 144-260 (304)
276 KOG0645|consensus 22.2 2.6E+02 0.0057 25.3 5.9 27 2-28 64-91 (312)
277 cd08312 Death_MyD88 Death doma 22.1 2.9E+02 0.0064 19.5 5.9 19 99-117 4-26 (79)
278 KOG1188|consensus 22.0 2.3E+02 0.0049 26.4 5.6 65 2-73 168-236 (376)
279 PF06957 COPI_C: Coatomer (COP 22.0 2.3E+02 0.0049 27.1 5.8 133 85-225 211-352 (422)
280 smart00035 CLa CLUSTERIN alpha 21.6 76 0.0017 27.3 2.4 60 156-219 66-128 (216)
281 PF05131 Pep3_Vps18: Pep3/Vps1 21.6 59 0.0013 26.2 1.6 22 83-104 108-129 (147)
282 COG5010 TadD Flp pilus assembl 21.5 4.9E+02 0.011 23.1 7.4 82 88-181 144-234 (257)
283 PF14669 Asp_Glu_race_2: Putat 21.4 79 0.0017 27.2 2.4 23 165-187 184-206 (233)
No 1
>KOG1524|consensus
Probab=100.00 E-value=3e-55 Score=404.16 Aligned_cols=241 Identities=44% Similarity=0.762 Sum_probs=237.8
Q ss_pred CcceeeecCCCceEEEEeCCeEEEEEcCCccccCcccceeeeEeeccCCCCCCCeEEeEeCCEEEEEecCCcEEEeecCc
Q psy11525 1 MVQSICWNTDVNILAAMQDSALCVWFFPAVVFADQGLLRKTVLLKDIGEFGKSPSIVSFVKNHLTIRRYDGTVINYPISP 80 (244)
Q Consensus 1 ~v~~v~W~~~~~vlv~~~~~~l~vw~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~v~f~g~~v~~~~~~G~~~~~~~~p 80 (244)
||..++||+.+||||+.+|+.+.|||||+..|+|+++|++|.+++++++||++|++++|.|++|+++|+||.+++++++|
T Consensus 496 mVhtLawndttNiLcglqDt~fsVWy~pn~vyvDrdiLpkTlierdt~efgKnpqIvsFvgNqvtirrsdG~LlpisV~p 575 (737)
T KOG1524|consen 496 MVHTLAWNDTTNILCGLQDTCFSVWYYPNEVYVDRDILPKTLIERDTTEFGKNPQIVSFVGNQVTIRRSDGALLPISVNP 575 (737)
T ss_pred hhhhhhhccccceeeeeccceEEEEEcCCcceecccccchhheecchhhccCCcceeeeeccEEEEEeccCceEeeeccc
Confidence 78899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cHHHHHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHH
Q psy11525 81 YISVLHSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYL 160 (244)
Q Consensus 81 ~~~~l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~ 160 (244)
|++.|++|+..++|++|+++||++++.+||++||.+|+.+.++.++|.+|+++++.++|++|++|+.+.+++.++|+..+
T Consensus 576 y~~iL~e~~sssKWeqavRLCrfv~eqTMWAtlAa~Av~~~~m~~~EiAYaA~~~idKVsyin~iK~ltske~~mA~~~l 655 (737)
T KOG1524|consen 576 YPEILHEYLSSSKWEQAVRLCRFVQEQTMWATLAAVAVRKHQMQISEIAYAAALQIDKVSYINHIKALTSKEEQMAENSL 655 (737)
T ss_pred cHHHHHHHhccchHHHHHHHHHhccchHHHHHHHHHHHhhccccHHHHHHHHhhchhhHHHHHHHhccCcHHHHHHHHHH
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHcCCcccHHHHHhcccccccCcc
Q psy11525 161 LGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLDNLEKIETNEKFLRLQSEVIMSGL 240 (244)
Q Consensus 161 ~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~~~~e~~~~f~~~~~~~~~~~~ 240 (244)
+.|+..|||.+++++|.+++|+.+...+++|+|||+|+.+|+.+++.||.+|++||+..|++||+++|+++..++|||..
T Consensus 656 ~~G~~~eAe~iLl~~gl~~qav~lni~m~nW~RALEl~~K~K~~v~~Vl~yR~KyLk~~g~~EtdplyL~~~~~~eiD~i 735 (737)
T KOG1524|consen 656 MLGRMLEAETILLHGGLIEQAVGLNIRMHNWRRALELSQKHKELVPRVLQYRRKYLKALGREETDPLYLPLVAKEEIDNI 735 (737)
T ss_pred HhccchhhhHHHHhcchHHHhhhhhhhhhhHHHHHHHHHhHHHHHHHHHHHHHHHHHHhcccccCchhhhhhhhcccccc
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999954
Q ss_pred c
Q psy11525 241 R 241 (244)
Q Consensus 241 ~ 241 (244)
+
T Consensus 736 ~ 736 (737)
T KOG1524|consen 736 S 736 (737)
T ss_pred c
Confidence 3
No 2
>KOG3616|consensus
Probab=99.97 E-value=3e-31 Score=252.95 Aligned_cols=209 Identities=19% Similarity=0.295 Sum_probs=182.3
Q ss_pred CcceeeecCCCceEEEEeCCeEEEEEcCCccccCcccceeeeEeeccCCCCCCCeEEeEeCCEEEEEecCCcEEEee---
Q psy11525 1 MVQSICWNTDVNILAAMQDSALCVWFFPAVVFADQGLLRKTVLLKDIGEFGKSPSIVSFVKNHLTIRRYDGTVINYP--- 77 (244)
Q Consensus 1 ~v~~v~W~~~~~vlv~~~~~~l~vw~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~v~f~g~~v~~~~~~G~~~~~~--- 77 (244)
+|++++|+|-+||+|++++++||+||+|..| .+++.+.++|++..+.|..|++-.+.
T Consensus 377 fcsymqwvp~sdvivaqn~dnl~iwyn~d~p--------------------eqvt~~kikgdv~~l~r~~~kteviv~e~ 436 (1636)
T KOG3616|consen 377 FCSYMQWVPGSDVIVAQNGDNLCIWYNIDAP--------------------EQVTMFKIKGDVEALERDADKTEVIVMEG 436 (1636)
T ss_pred HhhhheeccCcceEEecCCCceEEEecCCcc--------------------hhheeeEecccchhhhcCCCceeEEEeec
Confidence 4899999999999999999999999999988 78888899999999999878773221
Q ss_pred ----cCccHHHHHHH---HHcCCHHHHHHHHhcc----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhh
Q psy11525 78 ----ISPYISVLHSY---AASHSWPQALSLCRTL----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIK 146 (244)
Q Consensus 78 ----~~p~~~~l~~~---l~~~~~~~A~~~~~~~----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~ 146 (244)
-..+++.+++| ++.|+|++|..+.+.+ ++.+||..++++||+++++-+|+|||+++||+.++++|.++.
T Consensus 437 takvayeld~~~iefgaaid~~df~ra~afles~~~~~da~amw~~laelale~~nl~iaercfaai~dvak~r~lhd~~ 516 (1636)
T KOG3616|consen 437 TAKVAYELDEGLIEFGAAIDDGDFDRATAFLESLEMGPDAEAMWIRLAELALEAGNLFIAERCFAAIGDVAKARFLHDIL 516 (1636)
T ss_pred ceeEEEEecCcceecccccccCchHHHHHHHHhhccCccHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHH
Confidence 12678999999 9999999999999886 578999999999999999999999999999999999999999
Q ss_pred CCCcHH------------HHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCC-chhHHHHHHH
Q psy11525 147 GIPVKA------------AQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKT-HIDTVLYQRK 213 (244)
Q Consensus 147 ~~~~~~------------~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~-~~~~vl~~r~ 213 (244)
++.+.. ..+|.++++..+|++||.+|++.+..+.||-||+.|++||+|+.+|+..+. .++.+-.-+-
T Consensus 517 eiadeas~~~ggdgt~fykvra~lail~kkfk~ae~ifleqn~te~aigmy~~lhkwde~i~lae~~~~p~~eklk~sy~ 596 (1636)
T KOG3616|consen 517 EIADEASIEIGGDGTDFYKVRAMLAILEKKFKEAEMIFLEQNATEEAIGMYQELHKWDEAIALAEAKGHPALEKLKRSYL 596 (1636)
T ss_pred HHHHHHhHhhCCCCchHHHHHHHHHHHHhhhhHHHHHHHhcccHHHHHHHHHHHHhHHHHHHHHHhcCChHHHHHHHHHH
Confidence 886532 458999999999999999999999999999999999999999999999984 4787333334
Q ss_pred HHHHHcCCcccHHHHH
Q psy11525 214 KYLDNLEKIETNEKFL 229 (244)
Q Consensus 214 ~yl~~~~~~e~~~~f~ 229 (244)
++|+.+||+|.....+
T Consensus 597 q~l~dt~qd~ka~elk 612 (1636)
T KOG3616|consen 597 QALMDTGQDEKAAELK 612 (1636)
T ss_pred HHHHhcCchhhhhhhc
Confidence 4448889988765543
No 3
>KOG1538|consensus
Probab=99.95 E-value=2e-27 Score=224.56 Aligned_cols=213 Identities=17% Similarity=0.197 Sum_probs=180.2
Q ss_pred cceeeecCCC-ceEEEEeCCeEEEEEcCCccccCcccceeeeEeeccCCCCCCCeEEeEeCCEEEEEecCCcEEEeecCc
Q psy11525 2 VQSICWNTDV-NILAAMQDSALCVWFFPAVVFADQGLLRKTVLLKDIGEFGKSPSIVSFVKNHLTIRRYDGTVINYPISP 80 (244)
Q Consensus 2 v~~v~W~~~~-~vlv~~~~~~l~vw~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~v~f~g~~v~~~~~~G~~~~~~~~p 80 (244)
|+||+||+.. ||||...++.|.| +++- .| +..++ -+..+++|.|++++|++ |..+.-..+|
T Consensus 497 ~nSV~wNT~~E~MlcfT~~g~L~V----R~~~-----~P-~h~qk------~~G~VvG~~gsk~FCL~--~~~i~~~evp 558 (1081)
T KOG1538|consen 497 ANSVAWNTQCEDMLCFTGGGYLNV----RAST-----FP-VHRQK------LQGFVVGYNGSKIFCLH--VFSISAVEVP 558 (1081)
T ss_pred CceEEeeccccceEEEecCCceEE----Eecc-----CC-cchhc------ceEEEEEecCceEEEEE--eeeeeccccc
Confidence 7899999975 8999888999988 4441 01 00011 34468999999999997 6666666789
Q ss_pred cHHHHHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCC-----cHHHHH
Q psy11525 81 YISVLHSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIP-----VKAAQQ 155 (244)
Q Consensus 81 ~~~~l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~-----~~~~~~ 155 (244)
+...+++||+.|.|.+|++++|.+.+.++|+.||..||+++++++|+++|.+..|..+++.+-++++.. .++.+.
T Consensus 559 ~~~~m~q~Ieag~f~ea~~iaclgVv~~DW~~LA~~ALeAL~f~~ARkAY~rVRdl~~L~li~EL~~~k~rge~P~~iLl 638 (1081)
T KOG1538|consen 559 QSAPMYQYIERGLFKEAYQIACLGVTDTDWRELAMEALEALDFETARKAYIRVRDLRYLELISELEERKKRGETPNDLLL 638 (1081)
T ss_pred ccccchhhhhccchhhhhcccccceecchHHHHHHHHHhhhhhHHHHHHHHHHhccHHHHHHHHHHHHHhcCCCchHHHH
Confidence 999999999999999999999999999999999999999999999999999999999888877776542 356789
Q ss_pred HHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCc-hhHHHHHHHHHHHHcCCccc-HHHHHhcc
Q psy11525 156 AEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKTH-IDTVLYQRKKYLDNLEKIET-NEKFLRLQ 232 (244)
Q Consensus 156 A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~-~~~vl~~r~~yl~~~~~~e~-~~~f~~~~ 232 (244)
|.++++.|+|.||.++|+++|.-.+|++||.|++|||.|.+..+...+. -+.+++.|+.|..+.++... +|....++
T Consensus 639 A~~~Ay~gKF~EAAklFk~~G~enRAlEmyTDlRMFD~aQE~~~~g~~~eKKmL~RKRA~WAr~~kePkaAAEmLiSaG 717 (1081)
T KOG1538|consen 639 ADVFAYQGKFHEAAKLFKRSGHENRALEMYTDLRMFDYAQEFLGSGDPKEKKMLIRKRADWARNIKEPKAAAEMLISAG 717 (1081)
T ss_pred HHHHHhhhhHHHHHHHHHHcCchhhHHHHHHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHhhhcCCcHHHHHHhhccc
Confidence 9999999999999999999999999999999999999999999887765 57888999999988777544 44444444
No 4
>KOG3617|consensus
Probab=99.78 E-value=9.6e-19 Score=169.39 Aligned_cols=144 Identities=17% Similarity=0.203 Sum_probs=136.5
Q ss_pred HHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCC-cHHHHHHHHHHHcCCHHH
Q psy11525 89 AASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIP-VKAAQQAEMYLLGGNISE 167 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~-~~~~~~A~ia~~~G~~~e 167 (244)
+..|+++.|++..++++++..|..+|.+|++.++||+|..|..++++++.++.+++.+..+ +.+++.|.+|..+|+.+|
T Consensus 739 vtiG~MD~AfksI~~IkS~~vW~nmA~McVkT~RLDVAkVClGhm~~aRgaRAlR~a~q~~~e~eakvAvLAieLgMlEe 818 (1416)
T KOG3617|consen 739 VTIGSMDAAFKSIQFIKSDSVWDNMASMCVKTRRLDVAKVCLGHMKNARGARALRRAQQNGEEDEAKVAVLAIELGMLEE 818 (1416)
T ss_pred EEeccHHHHHHHHHHHhhhHHHHHHHHHhhhhccccHHHHhhhhhhhhhhHHHHHHHHhCCcchhhHHHHHHHHHhhHHH
Confidence 8899999999999999999999999999999999999999999999999999999999886 567899999999999999
Q ss_pred HHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcC-CchhHHHHHHHHHHHHcCCc-ccHHHHHhcc
Q psy11525 168 AESILLQHGLIFRAIQVSILTHNWDRALELALRHK-THIDTVLYQRKKYLDNLEKI-ETNEKFLRLQ 232 (244)
Q Consensus 168 Ae~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~-~~~~~vl~~r~~yl~~~~~~-e~~~~f~~~~ 232 (244)
|+.+|.+|.|.|.-.++|+..|+|++|+++|+.++ .|+....+.+++||++.+.. -+.+.|.|.+
T Consensus 819 A~~lYr~ckR~DLlNKlyQs~g~w~eA~eiAE~~DRiHLr~Tyy~yA~~Lear~Di~~AleyyEK~~ 885 (1416)
T KOG3617|consen 819 ALILYRQCKRYDLLNKLYQSQGMWSEAFEIAETKDRIHLRNTYYNYAKYLEARRDIEAALEYYEKAG 885 (1416)
T ss_pred HHHHHHHHHHHHHHHHHHHhcccHHHHHHHHhhccceehhhhHHHHHHHHHhhccHHHHHHHHHhcC
Confidence 99999999999999999999999999999999998 89999999999999998884 5677787775
No 5
>KOG2041|consensus
Probab=99.70 E-value=7.4e-17 Score=154.35 Aligned_cols=145 Identities=20% Similarity=0.224 Sum_probs=123.6
Q ss_pred HHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHH
Q psy11525 88 YAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISE 167 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~e 167 (244)
.++.-..++|.++++--+++.+|+.||+.|+...+|+.||.+|.+++|+..+.+++++..+.+++.++|++..+.|.|+|
T Consensus 673 Lve~vgledA~qfiEdnPHprLWrllAe~Al~Kl~l~tAE~AFVrc~dY~Gik~vkrl~~i~s~~~q~aei~~~~g~fee 752 (1189)
T KOG2041|consen 673 LVEAVGLEDAIQFIEDNPHPRLWRLLAEYALFKLALDTAEHAFVRCGDYAGIKLVKRLRTIHSKEQQRAEISAFYGEFEE 752 (1189)
T ss_pred HHHHhchHHHHHHHhcCCchHHHHHHHHHHHHHHhhhhHhhhhhhhccccchhHHHHhhhhhhHHHHhHhHhhhhcchhH
Confidence 46777889999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhH-HH--HHH--HHHH-HHcCCcccHHHHHhcc
Q psy11525 168 AESILLQHGLIFRAIQVSILTHNWDRALELALRHKTHIDT-VL--YQR--KKYL-DNLEKIETNEKFLRLQ 232 (244)
Q Consensus 168 Ae~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~-vl--~~r--~~yl-~~~~~~e~~~~f~~~~ 232 (244)
||++|+.+.+-|.||+||..+++|-+.++|-+..+.+.++ .+ +.| ..|+ +-..++|+.+.|.+..
T Consensus 753 aek~yld~drrDLAielr~klgDwfrV~qL~r~g~~d~dD~~~e~A~r~ig~~fa~~~~We~A~~yY~~~~ 823 (1189)
T KOG2041|consen 753 AEKLYLDADRRDLAIELRKKLGDWFRVYQLIRNGGSDDDDEGKEDAFRNIGETFAEMMEWEEAAKYYSYCG 823 (1189)
T ss_pred hhhhhhccchhhhhHHHHHhhhhHHHHHHHHHccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence 9999999999999999999999999999999986654222 11 111 2222 4445566666555543
No 6
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=99.65 E-value=2.6e-14 Score=134.65 Aligned_cols=186 Identities=21% Similarity=0.292 Sum_probs=142.6
Q ss_pred cceeeecCCCceEEEEeCCeEEEEEcCCccccCcccceeeeEeeccCCCCCCCeEEeEeC--CEEEEEecCCcEEEeecC
Q psy11525 2 VQSICWNTDVNILAAMQDSALCVWFFPAVVFADQGLLRKTVLLKDIGEFGKSPSIVSFVK--NHLTIRRYDGTVINYPIS 79 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~~~~l~vw~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~v~f~g--~~v~~~~~~G~~~~~~~~ 79 (244)
|.|..|+.| +|+..+.++|. |++.-- +.+.+ .+.+...++++.+ ++|+|...++.+....+.
T Consensus 199 IkSg~W~~d--~fiYtT~~~lk--Yl~~Ge---------~~~i~---~ld~~~yllgy~~~~~~ly~~Dr~~~v~~~~ld 262 (443)
T PF04053_consen 199 IKSGCWVED--CFIYTTSNHLK--YLVNGE---------TGIIA---HLDKPLYLLGYLPKENRLYLIDRDGNVISYELD 262 (443)
T ss_dssp -SEEEEETT--EEEEE-TTEEE--EEETTE---------EEEEE---E-SS--EEEEEETTTTEEEEE-TT--EEEEE--
T ss_pred eEEEEEEcC--EEEEEcCCeEE--EEEcCC---------cceEE---EcCCceEEEEEEccCCEEEEEECCCCEEEEEEC
Confidence 678999776 88888888554 344311 11111 2445667889999 999999989988666321
Q ss_pred ----------------------------c------------------cHH--------HHHHH---HHcCCHHHHHHHHh
Q psy11525 80 ----------------------------P------------------YIS--------VLHSY---AASHSWPQALSLCR 102 (244)
Q Consensus 80 ----------------------------p------------------~~~--------~l~~~---l~~~~~~~A~~~~~ 102 (244)
| +++ .-++| |+.|+++.|+++|+
T Consensus 263 ~~~~~fk~av~~~d~~~v~~~i~~~~ll~~i~~~~~~~i~~fL~~~G~~e~AL~~~~D~~~rFeLAl~lg~L~~A~~~a~ 342 (443)
T PF04053_consen 263 LSELEFKTAVLRGDFEEVLRMIAASNLLPNIPKDQGQSIARFLEKKGYPELALQFVTDPDHRFELALQLGNLDIALEIAK 342 (443)
T ss_dssp HHHHHHHHHHHTT-HHH-----HHHHTGGG--HHHHHHHHHHHHHTT-HHHHHHHSS-HHHHHHHHHHCT-HHHHHHHCC
T ss_pred HHHHHHHHHHHcCChhhhhhhhhhhhhcccCChhHHHHHHHHHHHCCCHHHHHhhcCChHHHhHHHHhcCCHHHHHHHHH
Confidence 1 111 11223 99999999999999
Q ss_pred ccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHH
Q psy11525 103 TLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLIFRAI 182 (244)
Q Consensus 103 ~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai 182 (244)
.++.+..|++||..||..+++++|++||.+.+|++++.+|..+..-.++-..+|.++...|++..|...++-.|++++.+
T Consensus 343 ~~~~~~~W~~Lg~~AL~~g~~~lAe~c~~k~~d~~~L~lLy~~~g~~~~L~kl~~~a~~~~~~n~af~~~~~lgd~~~cv 422 (443)
T PF04053_consen 343 ELDDPEKWKQLGDEALRQGNIELAEECYQKAKDFSGLLLLYSSTGDREKLSKLAKIAEERGDINIAFQAALLLGDVEECV 422 (443)
T ss_dssp CCSTHHHHHHHHHHHHHTTBHHHHHHHHHHCT-HHHHHHHHHHCT-HHHHHHHHHHHHHTT-HHHHHHHHHHHT-HHHHH
T ss_pred hcCcHHHHHHHHHHHHHcCCHHHHHHHHHhhcCccccHHHHHHhCCHHHHHHHHHHHHHccCHHHHHHHHHHcCCHHHHH
Confidence 99999999999999999999999999999999999999999999888888899999999999999999999999999999
Q ss_pred HHHHHhcCHHHHHHHHHhcCC
Q psy11525 183 QVSILTHNWDRALELALRHKT 203 (244)
Q Consensus 183 ~m~~~l~~W~~Al~LA~~~~~ 203 (244)
++..+.+++.+|.-.|++|+|
T Consensus 423 ~lL~~~~~~~~A~~~A~ty~~ 443 (443)
T PF04053_consen 423 DLLIETGRLPEAALFARTYGP 443 (443)
T ss_dssp HHHHHTT-HHHHHHHHHHTT-
T ss_pred HHHHHcCCchHHHHHHHhcCC
Confidence 999999999999999999875
No 7
>KOG0292|consensus
Probab=99.34 E-value=1.7e-11 Score=120.19 Aligned_cols=165 Identities=18% Similarity=0.182 Sum_probs=126.6
Q ss_pred CCCeEEeEeCCEEEEEecCCcEEEeecC---------------------------------------ccHHHHHHH----
Q psy11525 52 KSPSIVSFVKNHLTIRRYDGTVINYPIS---------------------------------------PYISVLHSY---- 88 (244)
Q Consensus 52 ~~~~~v~f~g~~v~~~~~~G~~~~~~~~---------------------------------------p~~~~l~~~---- 88 (244)
+...+...+|+.|+|...+|......+- .|++.-.+|
T Consensus 567 ~~iyitkv~gn~V~cl~rd~~~~~~~IDptEy~FKlALi~k~ydeVl~lI~ns~LvGqaiIaYLqKkgypeiAL~FVkD~ 646 (1202)
T KOG0292|consen 567 KPIYITKVKGNKVFCLNRDGEIECLTIDPTEYRFKLALLNKKYDEVLHLIKNSNLVGQAIIAYLQKKGYPEIALHFVKDE 646 (1202)
T ss_pred cceEEEEeeCCEEEEEecCCCeEEEeechHHHHHHHHHHhhhhHHHHHHHHhcCcccHHHHHHHHhcCCcceeeeeecCc
Confidence 3445667899999999999988544221 122222222
Q ss_pred -------HHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHH
Q psy11525 89 -------AASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLL 161 (244)
Q Consensus 89 -------l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~ 161 (244)
|+.|+.+.|++.|+.++.+..|..||+.||..+|-++||.||.+..+++|+.||.-+..-.++..+++.++..
T Consensus 647 ~tRF~LaLe~gnle~ale~akkldd~d~w~rLge~Al~qgn~~IaEm~yQ~~knfekLsfLYliTgn~eKL~Km~~iae~ 726 (1202)
T KOG0292|consen 647 RTRFELALECGNLEVALEAAKKLDDKDVWERLGEEALRQGNHQIAEMCYQRTKNFEKLSFLYLITGNLEKLSKMMKIAEI 726 (1202)
T ss_pred chheeeehhcCCHHHHHHHHHhcCcHHHHHHHHHHHHHhcchHHHHHHHHHhhhhhheeEEEEEeCCHHHHHHHHHHHHh
Confidence 8999999999999999999999999999999999999999999999999999988888877777777777766
Q ss_pred cCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHcCC
Q psy11525 162 GGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLDNLEK 221 (244)
Q Consensus 162 ~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~~~ 221 (244)
..+..---+.-+..|+.++-++....-|+-+.|...|..|+.+ + ..++++++.+.
T Consensus 727 r~D~~~~~qnalYl~dv~ervkIl~n~g~~~laylta~~~G~~--~---~ae~l~ee~~~ 781 (1202)
T KOG0292|consen 727 RNDATGQFQNALYLGDVKERVKILENGGQLPLAYLTAAAHGLE--D---QAEKLGEELEK 781 (1202)
T ss_pred hhhhHHHHHHHHHhccHHHHHHHHHhcCcccHHHHHHhhcCcH--H---HHHHHHHhhcc
Confidence 6555555555566677777777777777888888888888753 3 44555555443
No 8
>KOG2041|consensus
Probab=99.18 E-value=3.3e-10 Score=109.37 Aligned_cols=152 Identities=18% Similarity=0.228 Sum_probs=115.7
Q ss_pred HHHcCCHHHHHHHHhccC-------CHHH-------------HHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhC
Q psy11525 88 YAASHSWPQALSLCRTLN-------DDIL-------------WACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKG 147 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~-------~~~l-------------W~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~ 147 (244)
++..|+|-+-+++.+.+. -+.. |...+++....++.+--..||.++++++.++.+-+...
T Consensus 770 r~klgDwfrV~qL~r~g~~d~dD~~~e~A~r~ig~~fa~~~~We~A~~yY~~~~~~e~~~ecly~le~f~~LE~la~~Lp 849 (1189)
T KOG2041|consen 770 RKKLGDWFRVYQLIRNGGSDDDDEGKEDAFRNIGETFAEMMEWEEAAKYYSYCGDTENQIECLYRLELFGELEVLARTLP 849 (1189)
T ss_pred HHhhhhHHHHHHHHHccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchHhHHHHHHHHHhhhhHHHHHHhcC
Confidence 478899999999999952 1334 55555555555555555556666666665555444333
Q ss_pred CCcH-HHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcC-CchhHHHHHHHHHH-HHcCCccc
Q psy11525 148 IPVK-AAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHK-THIDTVLYQRKKYL-DNLEKIET 224 (244)
Q Consensus 148 ~~~~-~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~-~~~~~vl~~r~~yl-~~~~~~e~ 224 (244)
..++ ...+|.++...|+.++|.+.|++.|.|..|+.-+..|++|.+|.+||+++. +++.++++..+.+| +..+..|+
T Consensus 850 e~s~llp~~a~mf~svGMC~qAV~a~Lr~s~pkaAv~tCv~LnQW~~avelaq~~~l~qv~tliak~aaqll~~~~~~ea 929 (1189)
T KOG2041|consen 850 EDSELLPVMADMFTSVGMCDQAVEAYLRRSLPKAAVHTCVELNQWGEAVELAQRFQLPQVQTLIAKQAAQLLADANHMEA 929 (1189)
T ss_pred cccchHHHHHHHHHhhchHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHhhcchHHH
Confidence 3222 246899999999999999999999999999999999999999999999999 55999998887777 66677899
Q ss_pred HHHHHhcccccccCc
Q psy11525 225 NEKFLRLQSEVIMSG 239 (244)
Q Consensus 225 ~~~f~~~~~~~~~~~ 239 (244)
+++.||++.-.+--+
T Consensus 930 Ie~~Rka~~~~daar 944 (1189)
T KOG2041|consen 930 IEKDRKAGRHLDAAR 944 (1189)
T ss_pred HHHhhhcccchhHHH
Confidence 999999976544333
No 9
>KOG3616|consensus
Probab=98.94 E-value=7.9e-09 Score=100.63 Aligned_cols=135 Identities=18% Similarity=0.254 Sum_probs=111.0
Q ss_pred cHHHHHHHHHcCCHHHHHHHHhcc----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHH-----------h
Q psy11525 81 YISVLHSYAASHSWPQALSLCRTL----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINH-----------I 145 (244)
Q Consensus 81 ~~~~l~~~l~~~~~~~A~~~~~~~----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~-----------i 145 (244)
+..+..-|-..|+|++|++++..- .+-+++-.-|+.--++|.+..|++.|..+|.+++.-...+ +
T Consensus 794 ~~dai~my~k~~kw~da~kla~e~~~~e~t~~~yiakaedldehgkf~eaeqlyiti~~p~~aiqmydk~~~~ddmirlv 873 (1636)
T KOG3616|consen 794 FKDAIDMYGKAGKWEDAFKLAEECHGPEATISLYIAKAEDLDEHGKFAEAEQLYITIGEPDKAIQMYDKHGLDDDMIRLV 873 (1636)
T ss_pred hHHHHHHHhccccHHHHHHHHHHhcCchhHHHHHHHhHHhHHhhcchhhhhheeEEccCchHHHHHHHhhCcchHHHHHH
Confidence 345667789999999999999773 4667888888899999999999999999999876622221 2
Q ss_pred hC-----CCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCc--hhHHHHHHHHH
Q psy11525 146 KG-----IPVKAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKTH--IDTVLYQRKKY 215 (244)
Q Consensus 146 ~~-----~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~--~~~vl~~r~~y 215 (244)
.+ +.+...+.|.-+...|+.++||..|+++|++..|++||++.++|++|+++|++.+.. -+.|++..++-
T Consensus 874 ~k~h~d~l~dt~~~f~~e~e~~g~lkaae~~flea~d~kaavnmyk~s~lw~dayriaktegg~n~~k~v~flwaks 950 (1636)
T KOG3616|consen 874 EKHHGDHLHDTHKHFAKELEAEGDLKAAEEHFLEAGDFKAAVNMYKASELWEDAYRIAKTEGGANAEKHVAFLWAKS 950 (1636)
T ss_pred HHhChhhhhHHHHHHHHHHHhccChhHHHHHHHhhhhHHHHHHHhhhhhhHHHHHHHHhccccccHHHHHHHHHHHh
Confidence 11 234557789999999999999999999999999999999999999999999998865 57777776543
No 10
>KOG0276|consensus
Probab=98.90 E-value=2.3e-08 Score=95.59 Aligned_cols=129 Identities=16% Similarity=0.129 Sum_probs=119.5
Q ss_pred HHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHH
Q psy11525 89 AASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEA 168 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eA 168 (244)
|+.|+++.|+++|...+++.-|++||..|+..+++..|.+||.+-.|+..+..+.....-.+.....|..+-..|+.+-|
T Consensus 648 l~lgrl~iA~~la~e~~s~~Kw~~Lg~~al~~~~l~lA~EC~~~a~d~~~LlLl~t~~g~~~~l~~la~~~~~~g~~N~A 727 (794)
T KOG0276|consen 648 LKLGRLDIAFDLAVEANSEVKWRQLGDAALSAGELPLASECFLRARDLGSLLLLYTSSGNAEGLAVLASLAKKQGKNNLA 727 (794)
T ss_pred hhcCcHHHHHHHHHhhcchHHHHHHHHHHhhcccchhHHHHHHhhcchhhhhhhhhhcCChhHHHHHHHHHHhhcccchH
Confidence 89999999999999999999999999999999999999999999999999999888877666667789999999999999
Q ss_pred HHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCc-hhHHHHHHHHHHH
Q psy11525 169 ESILLQHGLIFRAIQVSILTHNWDRALELALRHKTH-IDTVLYQRKKYLD 217 (244)
Q Consensus 169 e~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~-~~~vl~~r~~yl~ 217 (244)
.-.|...|++++++++....++..+|--+|+++.+. +..++.+.+.=|.
T Consensus 728 F~~~~l~g~~~~C~~lLi~t~r~peAal~ArtYlps~vs~iv~~wk~~l~ 777 (794)
T KOG0276|consen 728 FLAYFLSGDYEECLELLISTQRLPEAALFARTYLPSQVSRIVELWKEDLS 777 (794)
T ss_pred HHHHHHcCCHHHHHHHHHhcCcCcHHHHHHhhhChHHHHHHHHHHHHHhh
Confidence 999999999999999999999999999999999976 7777777766663
No 11
>KOG2247|consensus
Probab=98.61 E-value=1.9e-08 Score=94.16 Aligned_cols=114 Identities=14% Similarity=-0.012 Sum_probs=101.4
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHH
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSIL 187 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~ 187 (244)
+.|.+.+..|+...+..+|.+.|+++||..++..|..+...++.+...|.+.-..-.||.|++.+++.+.+..|++|..|
T Consensus 4 ~~~~~~~~~a~~d~~~~~airiyr~ledaalv~pi~~~w~~e~~nlavaca~tiv~~YD~agq~~le~n~tg~aldm~wD 83 (615)
T KOG2247|consen 4 KVIPCTLTKAQEDFKCVSAIRIYRRLEDAALVGPIIHRWRPEGHNLAVACANTIVIYYDKAGQVILELNPTGKALDMAWD 83 (615)
T ss_pred cchhhHHHhhhhhccchHHHHHHHHhhhhhccccceeeEecCCCceehhhhhhHHHhhhhhcceecccCCchhHhhhhhc
Confidence 57999999999999999999999999999999999999999999988888888999999999999999999999999999
Q ss_pred hcCHHHHHHHHHhcCCc-hhHHHHHHHHHHHHcCCc
Q psy11525 188 THNWDRALELALRHKTH-IDTVLYQRKKYLDNLEKI 222 (244)
Q Consensus 188 l~~W~~Al~LA~~~~~~-~~~vl~~r~~yl~~~~~~ 222 (244)
+ +||.|+-+|++.++- +-.+-..++.-|+.+|..
T Consensus 84 k-egdvlavlAek~~piylwd~n~eytqqLE~gg~~ 118 (615)
T KOG2247|consen 84 K-EGDVLAVLAEKTGPIYLWDVNSEYTQQLESGGTS 118 (615)
T ss_pred c-ccchhhhhhhcCCCeeechhhhhhHHHHhccCcc
Confidence 9 999999999999965 555444555555555543
No 12
>KOG1538|consensus
Probab=98.09 E-value=2e-05 Score=76.49 Aligned_cols=148 Identities=20% Similarity=0.193 Sum_probs=103.2
Q ss_pred HHHHHHHcCCHHHHHHHHhccC---------CHHHHHH-------HHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhC
Q psy11525 84 VLHSYAASHSWPQALSLCRTLN---------DDILWAC-------LAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKG 147 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~~---------~~~lW~~-------LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~ 147 (244)
+|.-|-..+.|+.|-++...+. ....|+. .|++-+.+|+.+-|... ++|-+-+..+.+|.+
T Consensus 664 AlEmyTDlRMFD~aQE~~~~g~~~eKKmL~RKRA~WAr~~kePkaAAEmLiSaGe~~KAi~i---~~d~gW~d~lidI~r 740 (1081)
T KOG1538|consen 664 ALEMYTDLRMFDYAQEFLGSGDPKEKKMLIRKRADWARNIKEPKAAAEMLISAGEHVKAIEI---CGDHGWVDMLIDIAR 740 (1081)
T ss_pred HHHHHHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHhhhcCCcHHHHHHhhcccchhhhhhh---hhcccHHHHHHHHHh
Confidence 3334455566666666665542 2344542 44555555555554433 334445556666654
Q ss_pred CCcH-----HHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHcCC-
Q psy11525 148 IPVK-----AAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLDNLEK- 221 (244)
Q Consensus 148 ~~~~-----~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~~~- 221 (244)
.-++ -...|.-..-+.-+..|.++|.+.|+...-++|....++|++|..||++|..-+++|.+.+.+||.++.+
T Consensus 741 kld~~ere~l~~~a~ylk~l~~~gLAaeIF~k~gD~ksiVqlHve~~~W~eAFalAe~hPe~~~dVy~pyaqwLAE~DrF 820 (1081)
T KOG1538|consen 741 KLDKAEREPLLLCATYLKKLDSPGLAAEIFLKMGDLKSLVQLHVETQRWDEAFALAEKHPEFKDDVYMPYAQWLAENDRF 820 (1081)
T ss_pred hcchhhhhHHHHHHHHHhhccccchHHHHHHHhccHHHHhhheeecccchHhHhhhhhCccccccccchHHHHhhhhhhH
Confidence 3221 2345555555667889999999999999999999999999999999999987789999999999977766
Q ss_pred cccHHHHHhcccc
Q psy11525 222 IETNEKFLRLQSE 234 (244)
Q Consensus 222 ~e~~~~f~~~~~~ 234 (244)
+|+-.-|-+++++
T Consensus 821 eEAqkAfhkAGr~ 833 (1081)
T KOG1538|consen 821 EEAQKAFHKAGRQ 833 (1081)
T ss_pred HHHHHHHHHhcch
Confidence 7888889988865
No 13
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=97.45 E-value=0.024 Score=53.92 Aligned_cols=119 Identities=13% Similarity=0.010 Sum_probs=72.3
Q ss_pred ccHHHHHHH---HHcCCHHHHHHHHhc--c---CCHHHHHHHHHHHHHcCCHHHHHHH----------HHHhCChhHHHH
Q psy11525 80 PYISVLHSY---AASHSWPQALSLCRT--L---NDDILWACLAGMATYSRDLATSEEA----------YAAIEQVDKVMY 141 (244)
Q Consensus 80 p~~~~l~~~---l~~~~~~~A~~~~~~--~---~~~~lW~~LA~~Al~~~~l~~A~~~----------y~~l~d~~~v~~ 141 (244)
+++...++| +-.++++++.+..+. + ..+..=..++...-+.|-.++|... -..+|+.+.+
T Consensus 260 ~ld~~~~~fk~av~~~d~~~v~~~i~~~~ll~~i~~~~~~~i~~fL~~~G~~e~AL~~~~D~~~rFeLAl~lg~L~~A-- 337 (443)
T PF04053_consen 260 ELDLSELEFKTAVLRGDFEEVLRMIAASNLLPNIPKDQGQSIARFLEKKGYPELALQFVTDPDHRFELALQLGNLDIA-- 337 (443)
T ss_dssp E--HHHHHHHHHHHTT-HHH-----HHHHTGGG--HHHHHHHHHHHHHTT-HHHHHHHSS-HHHHHHHHHHCT-HHHH--
T ss_pred EECHHHHHHHHHHHcCChhhhhhhhhhhhhcccCChhHHHHHHHHHHHCCCHHHHHhhcCChHHHhHHHHhcCCHHHH--
Confidence 568888888 999999996555441 1 1244445556555555555555432 1223333222
Q ss_pred HHHhhCCCc--HHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 142 INHIKGIPV--KAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 142 l~~i~~~~~--~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
++-++++++ +-.+.|..|+.+|+++.||+.|.++++++...=+|.-.|+=+.=.++|+.
T Consensus 338 ~~~a~~~~~~~~W~~Lg~~AL~~g~~~lAe~c~~k~~d~~~L~lLy~~~g~~~~L~kl~~~ 398 (443)
T PF04053_consen 338 LEIAKELDDPEKWKQLGDEALRQGNIELAEECYQKAKDFSGLLLLYSSTGDREKLSKLAKI 398 (443)
T ss_dssp HHHCCCCSTHHHHHHHHHHHHHTTBHHHHHHHHHHCT-HHHHHHHHHHCT-HHHHHHHHHH
T ss_pred HHHHHhcCcHHHHHHHHHHHHHcCCHHHHHHHHHhhcCccccHHHHHHhCCHHHHHHHHHH
Confidence 233333433 44789999999999999999999999999999999999998775555543
No 14
>KOG1920|consensus
Probab=97.40 E-value=0.0011 Score=68.31 Aligned_cols=98 Identities=16% Similarity=0.131 Sum_probs=80.7
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHH-----HHHHHHHHHcCCHHHHHHHHH-HCCCH
Q psy11525 105 NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKA-----AQQAEMYLLGGNISEAESILL-QHGLI 178 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~-----~~~A~ia~~~G~~~eAe~~~l-~~g~~ 178 (244)
..+.+|...|.+....|+++-|..||..+||++.+..+-.-....... ..++.-+..+|++-||.++.+ .++++
T Consensus 950 ~~~~~~~~Aal~Ye~~GklekAl~a~~~~~dWr~~l~~a~ql~~~~de~~~~a~~L~s~L~e~~kh~eAa~il~e~~sd~ 1029 (1265)
T KOG1920|consen 950 REELMSDEAALMYERCGKLEKALKAYKECGDWREALSLAAQLSEGKDELVILAEELVSRLVEQRKHYEAAKILLEYLSDP 1029 (1265)
T ss_pred HHhccccHHHHHHHHhccHHHHHHHHHHhccHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHcccchhHHHHHHHHhcCH
Confidence 356788999999999999999999999999999998777655543322 235556678888877777665 56999
Q ss_pred HHHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 179 FRAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 179 ~~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
.+|+..+++-..|++|+++|..++
T Consensus 1030 ~~av~ll~ka~~~~eAlrva~~~~ 1053 (1265)
T KOG1920|consen 1030 EEAVALLCKAKEWEEALRVASKAK 1053 (1265)
T ss_pred HHHHHHHhhHhHHHHHHHHHHhcc
Confidence 999999999999999999999988
No 15
>KOG3617|consensus
Probab=97.04 E-value=0.011 Score=59.47 Aligned_cols=138 Identities=17% Similarity=0.156 Sum_probs=102.7
Q ss_pred cCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCH--HHH
Q psy11525 91 SHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNI--SEA 168 (244)
Q Consensus 91 ~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~--~eA 168 (244)
.|+.++|-++|+.-.+...=--||++.-..+++.-|..-|.+-.-+. ..-++-+.++...+.+.+++..|.- -+|
T Consensus 951 qGk~~kAa~iA~esgd~AAcYhlaR~YEn~g~v~~Av~FfTrAqafs---nAIRlcKEnd~~d~L~nlal~s~~~d~v~a 1027 (1416)
T KOG3617|consen 951 QGKTDKAARIAEESGDKAACYHLARMYENDGDVVKAVKFFTRAQAFS---NAIRLCKENDMKDRLANLALMSGGSDLVSA 1027 (1416)
T ss_pred ccCchHHHHHHHhcccHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH---HHHHHHHhcCHHHHHHHHHhhcCchhHHHH
Confidence 57778888888887777777788999888999988888888755332 2223333455556778888887754 567
Q ss_pred HHHHHHCC-CHHHHHHHHHHhcCHHHHHHHHHh-------------cCCc-hhHHHHHHHHHHHHcCCcccHHHHHhc
Q psy11525 169 ESILLQHG-LIFRAIQVSILTHNWDRALELALR-------------HKTH-IDTVLYQRKKYLDNLEKIETNEKFLRL 231 (244)
Q Consensus 169 e~~~l~~g-~~~~Ai~m~~~l~~W~~Al~LA~~-------------~~~~-~~~vl~~r~~yl~~~~~~e~~~~f~~~ 231 (244)
...|-..| ..++|+++|..-||..+||+||=. .+.. -..++.+++.|+.+..+-|.+-.|+-.
T Consensus 1028 ArYyEe~g~~~~~AVmLYHkAGm~~kALelAF~tqQf~aL~lIa~DLd~~sDp~ll~RcadFF~~~~qyekAV~lL~~ 1105 (1416)
T KOG3617|consen 1028 ARYYEELGGYAHKAVMLYHKAGMIGKALELAFRTQQFSALDLIAKDLDAGSDPKLLRRCADFFENNQQYEKAVNLLCL 1105 (1416)
T ss_pred HHHHHHcchhhhHHHHHHHhhcchHHHHHHHHhhcccHHHHHHHHhcCCCCCHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence 88999998 999999999999999999999844 2222 356788889999887776665555433
No 16
>PF04840 Vps16_C: Vps16, C-terminal region; InterPro: IPR006925 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=97.04 E-value=0.016 Score=52.81 Aligned_cols=109 Identities=17% Similarity=0.142 Sum_probs=85.6
Q ss_pred ccHHHHHHHHHcCCHHHHHHHHhcc--CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHH
Q psy11525 80 PYISVLHSYAASHSWPQALSLCRTL--NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAE 157 (244)
Q Consensus 80 p~~~~l~~~l~~~~~~~A~~~~~~~--~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ 157 (244)
++.+++.+.+..|+...|-++.... ..+..|-.--. +|+..++++.+..+-.-++-|=...-..+
T Consensus 179 Sl~~Ti~~li~~~~~k~A~kl~k~Fkv~dkrfw~lki~-------------aLa~~~~w~eL~~fa~skKsPIGyepFv~ 245 (319)
T PF04840_consen 179 SLNDTIRKLIEMGQEKQAEKLKKEFKVPDKRFWWLKIK-------------ALAENKDWDELEKFAKSKKSPIGYEPFVE 245 (319)
T ss_pred CHHHHHHHHHHCCCHHHHHHHHHHcCCcHHHHHHHHHH-------------HHHhcCCHHHHHHHHhCCCCCCChHHHHH
Confidence 6778899999999999999999874 67788875544 45555555555444333444444456778
Q ss_pred HHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 158 MYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 158 ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
.++-.|..++|....-+... +.-++||..+|+|.+|.+.|.+.+
T Consensus 246 ~~~~~~~~~eA~~yI~k~~~-~~rv~~y~~~~~~~~A~~~A~~~k 289 (319)
T PF04840_consen 246 ACLKYGNKKEASKYIPKIPD-EERVEMYLKCGDYKEAAQEAFKEK 289 (319)
T ss_pred HHHHCCCHHHHHHHHHhCCh-HHHHHHHHHCCCHHHHHHHHHHcC
Confidence 88899999999999999888 999999999999999999999887
No 17
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=97.03 E-value=0.017 Score=52.53 Aligned_cols=78 Identities=13% Similarity=0.042 Sum_probs=43.6
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCC--CHH-------HHHHHHHHhcCHHHHHHHHHhc---CCchhHHHHHHHHHHHHcC
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHG--LIF-------RAIQVSILTHNWDRALELALRH---KTHIDTVLYQRKKYLDNLE 220 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g--~~~-------~Ai~m~~~l~~W~~Al~LA~~~---~~~~~~vl~~r~~yl~~~~ 220 (244)
...|.++...|++++|.++|.+.- .+. .....|...+++++|++..++. .++.... ....+.+...|
T Consensus 218 ~~la~~~~~~g~~~~A~~~~~~~~~~~p~~~~~~~~~l~~~~~~~g~~~~A~~~l~~~~~~~p~~~~~-~~la~~~~~~g 296 (389)
T PRK11788 218 ILLGDLALAQGDYAAAIEALERVEEQDPEYLSEVLPKLMECYQALGDEAEGLEFLRRALEEYPGADLL-LALAQLLEEQE 296 (389)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHChhhHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCchHH-HHHHHHHHHhC
Confidence 346777777888888888777654 221 2245677778888877766542 2332222 33344444444
Q ss_pred C-cccHHHHHhc
Q psy11525 221 K-IETNEKFLRL 231 (244)
Q Consensus 221 ~-~e~~~~f~~~ 231 (244)
+ .++...|.++
T Consensus 297 ~~~~A~~~l~~~ 308 (389)
T PRK11788 297 GPEAAQALLREQ 308 (389)
T ss_pred CHHHHHHHHHHH
Confidence 3 4444445444
No 18
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=96.96 E-value=0.0011 Score=58.27 Aligned_cols=136 Identities=18% Similarity=0.228 Sum_probs=32.2
Q ss_pred HHHcCCHHHHHHHHhcc-------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-------NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYL 160 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~ 160 (244)
+...|++++|+++.+.. .....|..+|..|...++.+.|+.+|.++=..+.. .+....+.+.+ .
T Consensus 18 ~~~~~~~~~Al~~L~~~~~~~~~~~~~~~~~~~a~La~~~~~~~~A~~ay~~l~~~~~~--------~~~~~~~l~~l-~ 88 (280)
T PF13429_consen 18 LYQRGDYEKALEVLKKAAQKIAPPDDPEYWRLLADLAWSLGDYDEAIEAYEKLLASDKA--------NPQDYERLIQL-L 88 (280)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc--------ccccccccccc-c
Confidence 36789999999999432 35789999999999999999999999976433222 11112334444 5
Q ss_pred HcCCHHHHHHHHHHC-------CCHHHHHHHHHHhcCHHHHHHHHHh--cCC---chhHHHHHHHHHHHHcCCc-ccHHH
Q psy11525 161 LGGNISEAESILLQH-------GLIFRAIQVSILTHNWDRALELALR--HKT---HIDTVLYQRKKYLDNLEKI-ETNEK 227 (244)
Q Consensus 161 ~~G~~~eAe~~~l~~-------g~~~~Ai~m~~~l~~W~~Al~LA~~--~~~---~~~~vl~~r~~yl~~~~~~-e~~~~ 227 (244)
..|++++|.+++.++ .....++.++...++|+++.++-++ ..+ .-..++..+..++.+.|+. ++.+.
T Consensus 89 ~~~~~~~A~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~a~~~~~~G~~~~A~~~ 168 (280)
T PF13429_consen 89 QDGDPEEALKLAEKAYERDGDPRYLLSALQLYYRLGDYDEAEELLEKLEELPAAPDSARFWLALAEIYEQLGDPDKALRD 168 (280)
T ss_dssp -----------------------------H-HHHTT-HHHHHHHHHHHHH-T---T-HHHHHHHHHHHHHCCHHHHHHHH
T ss_pred ccccccccccccccccccccccchhhHHHHHHHHHhHHHHHHHHHHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHH
Confidence 777888887666543 3344678888899999998777555 222 2346677788888888774 66666
Q ss_pred HHhcc
Q psy11525 228 FLRLQ 232 (244)
Q Consensus 228 f~~~~ 232 (244)
|+++-
T Consensus 169 ~~~al 173 (280)
T PF13429_consen 169 YRKAL 173 (280)
T ss_dssp HHHHH
T ss_pred HHHHH
Confidence 66653
No 19
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=96.83 E-value=0.054 Score=53.19 Aligned_cols=143 Identities=13% Similarity=0.030 Sum_probs=75.8
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC---------------------hhHH-HH
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ---------------------VDKV-MY 141 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d---------------------~~~v-~~ 141 (244)
...|++++|....+.. .....|..+|......++++-|..+|.+.-+ ++.. ..
T Consensus 376 ~~~g~~~eA~~~~~~al~~~p~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~ 455 (615)
T TIGR00990 376 LELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMAT 455 (615)
T ss_pred HHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCccCHHHHHHHHHHHHHCCCHHHHHHH
Confidence 5666777777665543 3466777777777777777777777654411 1100 11
Q ss_pred HHHhhCC-C---cHHHHHHHHHHHcCCHHHHHHHHHHCCCH---------------HHHHHHHHHhcCHHHHHHHHHh--
Q psy11525 142 INHIKGI-P---VKAAQQAEMYLLGGNISEAESILLQHGLI---------------FRAIQVSILTHNWDRALELALR-- 200 (244)
Q Consensus 142 l~~i~~~-~---~~~~~~A~ia~~~G~~~eAe~~~l~~g~~---------------~~Ai~m~~~l~~W~~Al~LA~~-- 200 (244)
+++.... + +-....|.++...|++++|.+.|.++-.. ..+...+...+++++|+++.++
T Consensus 456 ~~~al~~~P~~~~~~~~lg~~~~~~g~~~~A~~~~~~Al~l~p~~~~~~~~~~~l~~~a~~~~~~~~~~~eA~~~~~kAl 535 (615)
T TIGR00990 456 FRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTAIELEKETKPMYMNVLPLINKALALFQWKQDFIEAENLCEKAL 535 (615)
T ss_pred HHHHHHhCCCChHHHHHHHHHHHHccCHHHHHHHHHHHHhcCCccccccccHHHHHHHHHHHHHHhhhHHHHHHHHHHHH
Confidence 1111111 1 11234677777777777777777663211 1233344556777777777654
Q ss_pred -cCCchhHHHHHHHHHHHHcCC-cccHHHHHhc
Q psy11525 201 -HKTHIDTVLYQRKKYLDNLEK-IETNEKFLRL 231 (244)
Q Consensus 201 -~~~~~~~vl~~r~~yl~~~~~-~e~~~~f~~~ 231 (244)
.++.....+......+.+.|+ .|+++.|.++
T Consensus 536 ~l~p~~~~a~~~la~~~~~~g~~~eAi~~~e~A 568 (615)
T TIGR00990 536 IIDPECDIAVATMAQLLLQQGDVDEALKLFERA 568 (615)
T ss_pred hcCCCcHHHHHHHHHHHHHccCHHHHHHHHHHH
Confidence 233322223333333444444 5666666665
No 20
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=96.75 E-value=0.075 Score=43.33 Aligned_cols=141 Identities=13% Similarity=0.129 Sum_probs=74.8
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC---------------------hhH-HH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ---------------------VDK-VM 140 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d---------------------~~~-v~ 140 (244)
+...|++++|...+... .+...|..+|......++++.|+..|.+.-. ++. ..
T Consensus 41 ~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~g~~~~A~~ 120 (234)
T TIGR02521 41 YLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQ 120 (234)
T ss_pred HHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcccHHHHHH
Confidence 46666777766665542 2355666666666666666666666543211 111 01
Q ss_pred HHHHhhCCC------cHHHHHHHHHHHcCCHHHHHHHHHHCCC-----H---HHHHHHHHHhcCHHHHHHHHHhc---CC
Q psy11525 141 YINHIKGIP------VKAAQQAEMYLLGGNISEAESILLQHGL-----I---FRAIQVSILTHNWDRALELALRH---KT 203 (244)
Q Consensus 141 ~l~~i~~~~------~~~~~~A~ia~~~G~~~eAe~~~l~~g~-----~---~~Ai~m~~~l~~W~~Al~LA~~~---~~ 203 (244)
.++++.... ......|.++...|++++|.+.|.++-. + ..--..+...++|++|++..++. .+
T Consensus 121 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~~ 200 (234)
T TIGR02521 121 QFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYLERYQQTYN 200 (234)
T ss_pred HHHHHHhccccccchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence 111211111 1123467778888888888888866411 1 12235667788888887666542 22
Q ss_pred chhHHHHHHHHHHHHcCCcccHHHH
Q psy11525 204 HIDTVLYQRKKYLDNLEKIETNEKF 228 (244)
Q Consensus 204 ~~~~vl~~r~~yl~~~~~~e~~~~f 228 (244)
.....+..........|+.+....+
T Consensus 201 ~~~~~~~~~~~~~~~~~~~~~a~~~ 225 (234)
T TIGR02521 201 QTAESLWLGIRIARALGDVAAAQRY 225 (234)
T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHH
Confidence 2223333444445555555544444
No 21
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=96.73 E-value=0.054 Score=53.18 Aligned_cols=136 Identities=14% Similarity=-0.014 Sum_probs=88.6
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcC
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGG 163 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G 163 (244)
...|++++|....+.. .....|..+|......++++.|+.+|.+.=+. --...+-....|.++...|
T Consensus 342 ~~~g~~~eA~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~--------~p~~~~~~~~lg~~~~~~g 413 (615)
T TIGR00990 342 CLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKL--------NSEDPDIYYHRAQLHFIKG 413 (615)
T ss_pred HHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh--------CCCCHHHHHHHHHHHHHcC
Confidence 5688999999888764 34667888898888888998888887764111 0111223356788888889
Q ss_pred CHHHHHHHHHHCCC--HH------HHHHHHHHhcCHHHHHHHHHh---cCCchhHHHHHHHHHHHHcCC-cccHHHHHhc
Q psy11525 164 NISEAESILLQHGL--IF------RAIQVSILTHNWDRALELALR---HKTHIDTVLYQRKKYLDNLEK-IETNEKFLRL 231 (244)
Q Consensus 164 ~~~eAe~~~l~~g~--~~------~Ai~m~~~l~~W~~Al~LA~~---~~~~~~~vl~~r~~yl~~~~~-~e~~~~f~~~ 231 (244)
++++|...|.++=. |+ .--.++..++++++|+....+ ..+....++......+...|+ .++.+.|.++
T Consensus 414 ~~~~A~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~P~~~~~~~~lg~~~~~~g~~~~A~~~~~~A 493 (615)
T TIGR00990 414 EFAQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFDTA 493 (615)
T ss_pred CHHHHHHHHHHHHHcCccCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHccCHHHHHHHHHHH
Confidence 99988888876611 11 112345677888888877543 334444555555666666665 5666666665
Q ss_pred c
Q psy11525 232 Q 232 (244)
Q Consensus 232 ~ 232 (244)
.
T Consensus 494 l 494 (615)
T TIGR00990 494 I 494 (615)
T ss_pred H
Confidence 4
No 22
>KOG1920|consensus
Probab=96.72 E-value=0.018 Score=59.66 Aligned_cols=79 Identities=18% Similarity=0.182 Sum_probs=62.8
Q ss_pred HHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHH----HHcCCcccHHHHHh
Q psy11525 155 QAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYL----DNLEKIETNEKFLR 230 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl----~~~~~~e~~~~f~~ 230 (244)
.|.-+...+++++|.-+|..+|..++|++-|...++|.+|+.+|.+....-+++.....+-. ++....|+...-+.
T Consensus 945 ya~hL~~~~~~~~Aal~Ye~~GklekAl~a~~~~~dWr~~l~~a~ql~~~~de~~~~a~~L~s~L~e~~kh~eAa~il~e 1024 (1265)
T KOG1920|consen 945 YADHLREELMSDEAALMYERCGKLEKALKAYKECGDWREALSLAAQLSEGKDELVILAEELVSRLVEQRKHYEAAKILLE 1024 (1265)
T ss_pred HHHHHHHhccccHHHHHHHHhccHHHHHHHHHHhccHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHcccchhHHHHHHH
Confidence 56677788999999999999999999999999999999999999998876666654433332 44455677777666
Q ss_pred ccc
Q psy11525 231 LQS 233 (244)
Q Consensus 231 ~~~ 233 (244)
|.+
T Consensus 1025 ~~s 1027 (1265)
T KOG1920|consen 1025 YLS 1027 (1265)
T ss_pred Hhc
Confidence 654
No 23
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=96.71 E-value=0.11 Score=42.22 Aligned_cols=79 Identities=11% Similarity=0.013 Sum_probs=44.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCCC-------H---HHHHHHHHHhcCHHHHHHHHHh---cCCchhHHHHHHHHHHHHc
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHGL-------I---FRAIQVSILTHNWDRALELALR---HKTHIDTVLYQRKKYLDNL 219 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g~-------~---~~Ai~m~~~l~~W~~Al~LA~~---~~~~~~~vl~~r~~yl~~~ 219 (244)
...|.++...|++++|.+.|.++-. + ...-..+...+++++|.+...+ ..+............+...
T Consensus 103 ~~~~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~~~ 182 (234)
T TIGR02521 103 NNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLR 182 (234)
T ss_pred HHHHHHHHHcccHHHHHHHHHHHHhccccccchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCChHHHHHHHHHHHHc
Confidence 3467777888888888888877521 1 1123456777888888776654 2222222333344444455
Q ss_pred CC-cccHHHHHhc
Q psy11525 220 EK-IETNEKFLRL 231 (244)
Q Consensus 220 ~~-~e~~~~f~~~ 231 (244)
|+ .++...|.++
T Consensus 183 ~~~~~A~~~~~~~ 195 (234)
T TIGR02521 183 GQYKDARAYLERY 195 (234)
T ss_pred CCHHHHHHHHHHH
Confidence 55 4555555554
No 24
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=96.60 E-value=0.071 Score=52.58 Aligned_cols=105 Identities=14% Similarity=0.141 Sum_probs=59.5
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHc
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLG 162 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~ 162 (244)
|...|++++|..+++.. .+...|..+|......++++.|..+|..+-.. -..........|.++...
T Consensus 577 ~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~--------~~~~~~~~~~l~~~~~~~ 648 (899)
T TIGR02917 577 YLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLAL--------QPDSALALLLLADAYAVM 648 (899)
T ss_pred HHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh--------CCCChHHHHHHHHHHHHc
Confidence 46677777777777664 35677888888888888888777777653110 000111223455666666
Q ss_pred CCHHHHHHHHHHCCC--------HHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 163 GNISEAESILLQHGL--------IFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 163 G~~~eAe~~~l~~g~--------~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
|++++|...|.+.-. ......++...++|++|+++.+.
T Consensus 649 ~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~ 694 (899)
T TIGR02917 649 KNYAKAITSLKRALELKPDNTEAQIGLAQLLLAAKRTESAKKIAKS 694 (899)
T ss_pred CCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 666666666654311 11224444555555555555544
No 25
>PLN03077 Protein ECB2; Provisional
Probab=96.45 E-value=0.036 Score=56.50 Aligned_cols=142 Identities=15% Similarity=0.106 Sum_probs=82.6
Q ss_pred HHHHHHHcCCHHHHHHHHhccC--CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcH--HHHHHHHH
Q psy11525 84 VLHSYAASHSWPQALSLCRTLN--DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVK--AAQQAEMY 159 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~~--~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~--~~~~A~ia 159 (244)
.+.-|...|++++|.++.+.+. +...|..+.....+.++++.|...|..+..- .+ .++. ....-..+
T Consensus 329 Li~~y~k~g~~~~A~~vf~~m~~~d~~s~n~li~~~~~~g~~~~A~~lf~~M~~~-------g~--~Pd~~t~~~ll~a~ 399 (857)
T PLN03077 329 LIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYALMEQD-------NV--SPDEITIASVLSAC 399 (857)
T ss_pred HHHHHHhcCCHHHHHHHHhhCCCCCeeeHHHHHHHHHhCCCHHHHHHHHHHHHHh-------CC--CCCceeHHHHHHHH
Confidence 3444577777777777777763 3456777777777777777777777654211 01 1221 11122344
Q ss_pred HHcCCHHHHHHHHHHCCC---------HHHHHHHHHHhcCHHHHHHHHHhcCC-chhHHHHHHHHHHHHcCCcccHHHHH
Q psy11525 160 LLGGNISEAESILLQHGL---------IFRAIQVSILTHNWDRALELALRHKT-HIDTVLYQRKKYLDNLEKIETNEKFL 229 (244)
Q Consensus 160 ~~~G~~~eAe~~~l~~g~---------~~~Ai~m~~~l~~W~~Al~LA~~~~~-~~~~vl~~r~~yl~~~~~~e~~~~f~ 229 (244)
...|++++|.+++-..-+ ...-|+||...+++++|.++-++... .+.+--..=.-|.+.....|+...|+
T Consensus 400 ~~~g~~~~a~~l~~~~~~~g~~~~~~~~n~Li~~y~k~g~~~~A~~vf~~m~~~d~vs~~~mi~~~~~~g~~~eA~~lf~ 479 (857)
T PLN03077 400 ACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEALIFFR 479 (857)
T ss_pred hccchHHHHHHHHHHHHHhCCCcchHHHHHHHHHHHHcCCHHHHHHHHHhCCCCCeeeHHHHHHHHHHCCCHHHHHHHHH
Confidence 556788888877754321 12347888888888888888776542 22111112233444445567778888
Q ss_pred hcccc
Q psy11525 230 RLQSE 234 (244)
Q Consensus 230 ~~~~~ 234 (244)
++.+.
T Consensus 480 ~m~~~ 484 (857)
T PLN03077 480 QMLLT 484 (857)
T ss_pred HHHhC
Confidence 77544
No 26
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=96.17 E-value=0.1 Score=52.06 Aligned_cols=152 Identities=11% Similarity=0.063 Sum_probs=94.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcc--CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC----hhHHHH---HH---------
Q psy11525 82 ISVLHSYAASHSWPQALSLCRTL--NDDILWACLAGMATYSRDLATSEEAYAAIEQ----VDKVMY---IN--------- 143 (244)
Q Consensus 82 ~~~l~~~l~~~~~~~A~~~~~~~--~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d----~~~v~~---l~--------- 143 (244)
...+.-|...|++++|.++...+ .+...|..|.....+.++.+.|...|..+.. ++.+-+ +.
T Consensus 263 n~Li~~y~k~g~~~~A~~vf~~m~~~~~vt~n~li~~y~~~g~~~eA~~lf~~M~~~g~~pd~~t~~~ll~a~~~~g~~~ 342 (697)
T PLN03081 263 CALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLE 342 (697)
T ss_pred HHHHHHHHHCCCHHHHHHHHHhCCCCChhHHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHhccchH
Confidence 34556689999999999999887 4567899999999999999999999987742 111111 11
Q ss_pred -------HhhCC---CcH--HHHHHHHHHHcCCHHHHHHHHHHCCCHH-----HHHHHHHHhcCHHHHHHHHHhc---C-
Q psy11525 144 -------HIKGI---PVK--AAQQAEMYLLGGNISEAESILLQHGLIF-----RAIQVSILTHNWDRALELALRH---K- 202 (244)
Q Consensus 144 -------~i~~~---~~~--~~~~A~ia~~~G~~~eAe~~~l~~g~~~-----~Ai~m~~~l~~W~~Al~LA~~~---~- 202 (244)
.+.+. ++. ..-....+...|++++|+++|-+...++ .-|..|...|++++|+++-++. +
T Consensus 343 ~a~~i~~~m~~~g~~~d~~~~~~Li~~y~k~G~~~~A~~vf~~m~~~d~~t~n~lI~~y~~~G~~~~A~~lf~~M~~~g~ 422 (697)
T PLN03081 343 HAKQAHAGLIRTGFPLDIVANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGV 422 (697)
T ss_pred HHHHHHHHHHHhCCCCCeeehHHHHHHHHHCCCHHHHHHHHHhCCCCCeeeHHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence 11100 011 1123445556788888888888765543 4577888888999998887763 2
Q ss_pred -CchhHHHHHHHHHHHHcCCcccHHHHHhccc
Q psy11525 203 -THIDTVLYQRKKYLDNLEKIETNEKFLRLQS 233 (244)
Q Consensus 203 -~~~~~vl~~r~~yl~~~~~~e~~~~f~~~~~ 233 (244)
|..-+....=.-+......++..+.|.++.+
T Consensus 423 ~Pd~~T~~~ll~a~~~~g~~~~a~~~f~~m~~ 454 (697)
T PLN03081 423 APNHVTFLAVLSACRYSGLSEQGWEIFQSMSE 454 (697)
T ss_pred CCCHHHHHHHHHHHhcCCcHHHHHHHHHHHHH
Confidence 2222211111222233333566666766643
No 27
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=96.16 E-value=0.23 Score=48.04 Aligned_cols=117 Identities=13% Similarity=0.190 Sum_probs=70.9
Q ss_pred cceeeecCCCceEEEEeC-CeEEEEEcCCccccCcccc----------eeeeEeecc-------C-----------CCCC
Q psy11525 2 VQSICWNTDVNILAAMQD-SALCVWFFPAVVFADQGLL----------RKTVLLKDI-------G-----------EFGK 52 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~~-~~l~vw~~~~~~~~~~~~l----------~~~~~~~~~-------~-----------~~~~ 52 (244)
.+.++|+|++.+|+..++ |.|.+|=..-.+ +.-.++ ..+++-... . ..+.
T Consensus 302 P~~iaWHp~gai~~V~s~qGelQ~FD~ALsp-i~~qLlsEd~~P~~~L~Ls~yf~~~~~L~~iqW~~~~~~~~~~~~~~~ 380 (545)
T PF11768_consen 302 PTLIAWHPDGAIFVVGSEQGELQCFDMALSP-IKMQLLSEDATPKSTLQLSKYFRVSSSLVHIQWAPAPQLSSQGEFYAD 380 (545)
T ss_pred ceEEEEcCCCcEEEEEcCCceEEEEEeecCc-cceeeccccCCCccEEeeehhccCcchhheeEeccCCCccccCCCcCC
Confidence 467899999988887776 888887322222 111111 111111000 0 0001
Q ss_pred --CCeEEeEeCCEEEEEecCCcEEEeecCccHHHHHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCH
Q psy11525 53 --SPSIVSFVKNHLTIRRYDGTVINYPISPYISVLHSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDL 123 (244)
Q Consensus 53 --~~~~v~f~g~~v~~~~~~G~~~~~~~~p~~~~l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l 123 (244)
.-.++.|.+|=+.|.+-.+...+.+.....+...+||..++.++|.++...+ .|...+.+|...+.-
T Consensus 381 ~~~~L~l~f~~GPl~vl~~~~G~~~~G~l~~~eL~~~yl~~~qi~eAi~lL~sm----nW~~~g~~C~~~L~~ 449 (545)
T PF11768_consen 381 TYDLLLLVFERGPLAVLRFKLGVFTRGDLGLVELISQYLRCDQIEEAINLLLSM----NWNTMGEQCFHCLSA 449 (545)
T ss_pred ccceEEEEEcCCCeEEEEEeeccccCCcccHHHHHHHHHhcCCHHHHHHHHHhC----CccccHHHHHHHHHH
Confidence 1135567777777776444444444445667888999999999999999988 577777776665443
No 28
>PLN03218 maturation of RBCL 1; Provisional
Probab=96.15 E-value=0.12 Score=54.25 Aligned_cols=50 Identities=8% Similarity=0.136 Sum_probs=29.6
Q ss_pred HHHHHHHcCCHHHHHHHHhcc------CCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 84 VLHSYAASHSWPQALSLCRTL------NDDILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
.+.-|...|++++|.++.+.+ .+...|..+...+.+.++++.|.+.|..+
T Consensus 585 LI~ay~k~G~ldeA~elf~~M~e~gi~p~~~tynsLI~ay~k~G~~deAl~lf~eM 640 (1060)
T PLN03218 585 LMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGDWDFALSIYDDM 640 (1060)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHcCCCCChHHHHHHHHHHHhcCCHHHHHHHHHHH
Confidence 333456666666666666554 23456666666666666666666655544
No 29
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=96.15 E-value=0.26 Score=49.11 Aligned_cols=44 Identities=18% Similarity=0.039 Sum_probs=30.9
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
+..|++++|....+.. .....|..+|......++++.|..+|.+
T Consensus 87 l~~g~~~~A~~~l~~~l~~~P~~~~a~~~la~~l~~~g~~~~Ai~~l~~ 135 (656)
T PRK15174 87 LASSQPDAVLQVVNKLLAVNVCQPEDVLLVASVLLKSKQYATVADLAEQ 135 (656)
T ss_pred hhcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 6677777777777664 3456777777777777777777766654
No 30
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=96.12 E-value=0.11 Score=47.36 Aligned_cols=45 Identities=9% Similarity=0.065 Sum_probs=27.4
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
++..|++++|....... ++...|..+|......++++.|...|.+
T Consensus 45 ~~~~~~~~~A~~~~~~al~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~~ 94 (389)
T PRK11788 45 FLLNEQPDKAIDLFIEMLKVDPETVELHLALGNLFRRRGEVDRAIRIHQN 94 (389)
T ss_pred HHhcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHHcCcHHHHHHHHHH
Confidence 45666666666666554 2334566666666666666666665554
No 31
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=96.10 E-value=0.31 Score=48.05 Aligned_cols=116 Identities=22% Similarity=0.234 Sum_probs=68.9
Q ss_pred HHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh---------------------CChhH
Q psy11525 85 LHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAI---------------------EQVDK 138 (244)
Q Consensus 85 l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l---------------------~d~~~ 138 (244)
...|+..|++++|.+++..+ .....|..+|......++++.|+.+|.+. |++..
T Consensus 438 ~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~a~~~~~~~~~~~~~la~~~~~~g~~~~ 517 (899)
T TIGR02917 438 ILSYLRSGQFDKALAAAKKLEKKQPDNASLHNLLGAIYLGKGDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDD 517 (899)
T ss_pred HHHHHhcCCHHHHHHHHHHHHHhCCCCcHHHHHHHHHHHhCCCHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHCCCHHH
Confidence 34567888888888888765 35678999999999999999999888753 22211
Q ss_pred H-HHHHHhhCCC-c---HHHHHHHHHHHcCCHHHHHHHHHHC--------CCHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 139 V-MYINHIKGIP-V---KAAQQAEMYLLGGNISEAESILLQH--------GLIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 139 v-~~l~~i~~~~-~---~~~~~A~ia~~~G~~~eAe~~~l~~--------g~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
. ..++++.... + .....+.++...|++++|...|.+. .....-...|...+++++|+++.++
T Consensus 518 A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~ 592 (899)
T TIGR02917 518 AIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEKAAELNPQEIEPALALAQYYLGKGQLKKALAILNE 592 (899)
T ss_pred HHHHHHHHHHhCcCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCccchhHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 1 2222222211 1 1234566666777777777766553 0011123444555555555555443
No 32
>PRK12370 invasion protein regulator; Provisional
Probab=96.05 E-value=0.2 Score=48.70 Aligned_cols=75 Identities=11% Similarity=0.041 Sum_probs=48.9
Q ss_pred CCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCC-CcHHHHHHHHHHHcCCH
Q psy11525 92 HSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGI-PVKAAQQAEMYLLGGNI 165 (244)
Q Consensus 92 ~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~-~~~~~~~A~ia~~~G~~ 165 (244)
+++++|...++.. ++...|..+|......++++.|+.+|.+.-. +... .+.....|.++...|++
T Consensus 318 ~~~~~A~~~~~~Al~ldP~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~---------l~P~~~~a~~~lg~~l~~~G~~ 388 (553)
T PRK12370 318 NAMIKAKEHAIKATELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANL---------LSPISADIKYYYGWNLFMAGQL 388 (553)
T ss_pred hHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHccCHHHHHHHHHHHHH---------hCCCCHHHHHHHHHHHHHCCCH
Confidence 4478888887764 4678899999999999999999988865211 1110 11123456666666666
Q ss_pred HHHHHHHHHC
Q psy11525 166 SEAESILLQH 175 (244)
Q Consensus 166 ~eAe~~~l~~ 175 (244)
++|...|.++
T Consensus 389 ~eAi~~~~~A 398 (553)
T PRK12370 389 EEALQTINEC 398 (553)
T ss_pred HHHHHHHHHH
Confidence 6666666654
No 33
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=95.95 E-value=0.084 Score=42.13 Aligned_cols=79 Identities=14% Similarity=0.004 Sum_probs=56.8
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHc
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLG 162 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~ 162 (244)
+...|++++|...++.. .....|..+|..+...++++-|..+|.+.-.. --.-.+.....|.++...
T Consensus 34 ~~~~g~~~~A~~~~~~al~~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l--------~p~~~~a~~~lg~~l~~~ 105 (144)
T PRK15359 34 SWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYTTAINFYGHALML--------DASHPEPVYQTGVCLKMM 105 (144)
T ss_pred HHHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc--------CCCCcHHHHHHHHHHHHc
Confidence 47889999998877764 46788888999999999999888888763211 111123345678888888
Q ss_pred CCHHHHHHHHHH
Q psy11525 163 GNISEAESILLQ 174 (244)
Q Consensus 163 G~~~eAe~~~l~ 174 (244)
|++++|...|.+
T Consensus 106 g~~~eAi~~~~~ 117 (144)
T PRK15359 106 GEPGLAREAFQT 117 (144)
T ss_pred CCHHHHHHHHHH
Confidence 888888777743
No 34
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=95.94 E-value=0.22 Score=49.69 Aligned_cols=153 Identities=12% Similarity=0.104 Sum_probs=100.4
Q ss_pred ccHHHHHHHHHcCCHHHHHHHHhcc--CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC----------------------
Q psy11525 80 PYISVLHSYAASHSWPQALSLCRTL--NDDILWACLAGMATYSRDLATSEEAYAAIEQ---------------------- 135 (244)
Q Consensus 80 p~~~~l~~~l~~~~~~~A~~~~~~~--~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d---------------------- 135 (244)
.+...+.-|.+.|++++|.++-..+ .+-..|..|.....++++.+.|...|.++..
T Consensus 362 ~~~~Li~~y~k~G~~~~A~~vf~~m~~~d~~t~n~lI~~y~~~G~~~~A~~lf~~M~~~g~~Pd~~T~~~ll~a~~~~g~ 441 (697)
T PLN03081 362 ANTALVDLYSKWGRMEDARNVFDRMPRKNLISWNALIAGYGNHGRGTKAVEMFERMIAEGVAPNHVTFLAVLSACRYSGL 441 (697)
T ss_pred ehHHHHHHHHHCCCHHHHHHHHHhCCCCCeeeHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhcCCc
Confidence 3455666678899999999998886 3556799999999999999999888887543
Q ss_pred hhH-HHHHHHhhC---C-Cc--HHHHHHHHHHHcCCHHHHHHHHHHCCC------HHHHHHHHHHhcCHHHHHHHHHhc-
Q psy11525 136 VDK-VMYINHIKG---I-PV--KAAQQAEMYLLGGNISEAESILLQHGL------IFRAIQVSILTHNWDRALELALRH- 201 (244)
Q Consensus 136 ~~~-v~~l~~i~~---~-~~--~~~~~A~ia~~~G~~~eAe~~~l~~g~------~~~Ai~m~~~l~~W~~Al~LA~~~- 201 (244)
++. .++++.+.+ + ++ ...-+..++...|++++|.++|.+.+. +...+..|...++++.|...+++.
T Consensus 442 ~~~a~~~f~~m~~~~g~~p~~~~y~~li~~l~r~G~~~eA~~~~~~~~~~p~~~~~~~Ll~a~~~~g~~~~a~~~~~~l~ 521 (697)
T PLN03081 442 SEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLY 521 (697)
T ss_pred HHHHHHHHHHHHHhcCCCCCccchHhHHHHHHhcCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHh
Confidence 111 122333321 1 11 223456777788999999999988764 344578888899999998888763
Q ss_pred --CCc-hhHHHHHHHHHHHHcCCcccHHHHHhcc
Q psy11525 202 --KTH-IDTVLYQRKKYLDNLEKIETNEKFLRLQ 232 (244)
Q Consensus 202 --~~~-~~~vl~~r~~yl~~~~~~e~~~~f~~~~ 232 (244)
++. ...-...-.-|......+++.+.|.++.
T Consensus 522 ~~~p~~~~~y~~L~~~y~~~G~~~~A~~v~~~m~ 555 (697)
T PLN03081 522 GMGPEKLNNYVVLLNLYNSSGRQAEAAKVVETLK 555 (697)
T ss_pred CCCCCCCcchHHHHHHHHhCCCHHHHHHHHHHHH
Confidence 222 2222223333445544566666666654
No 35
>KOG2114|consensus
Probab=95.94 E-value=0.57 Score=47.47 Aligned_cols=171 Identities=13% Similarity=0.154 Sum_probs=112.5
Q ss_pred EEEEEecCCcEEEeecCccHHHHHHHHHcCCHHHHHHHHhccC-----CHHHHHHHHHHHHHcCCHHHHHHHHHH-hCCh
Q psy11525 63 HLTIRRYDGTVINYPISPYISVLHSYAASHSWPQALSLCRTLN-----DDILWACLAGMATYSRDLATSEEAYAA-IEQV 136 (244)
Q Consensus 63 ~v~~~~~~G~~~~~~~~p~~~~l~~~l~~~~~~~A~~~~~~~~-----~~~lW~~LA~~Al~~~~l~~A~~~y~~-l~d~ 136 (244)
...+.++||.+-...-.++...|.--..++.|..|+.+|+... .+..-+.-|...-..|+++.|.-=|.+ +|-.
T Consensus 319 ~~~vltsdg~~~~L~ek~le~kL~iL~kK~ly~~Ai~LAk~~~~d~d~~~~i~~kYgd~Ly~Kgdf~~A~~qYI~tI~~l 398 (933)
T KOG2114|consen 319 DCLVLTSDGVVHELIEKDLETKLDILFKKNLYKVAINLAKSQHLDEDTLAEIHRKYGDYLYGKGDFDEATDQYIETIGFL 398 (933)
T ss_pred cEEEEecCCceeeeeeccHHHHHHHHHHhhhHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHcccC
Confidence 6777888888877777788888888899999999999999853 355667778888888899888887774 3322
Q ss_pred hHH----------------HHHHHhhCC----CcHHHHHHHHHHHcCCHHHHHHHHHHCC------CHHHHHHHHHHhcC
Q psy11525 137 DKV----------------MYINHIKGI----PVKAAQQAEMYLLGGNISEAESILLQHG------LIFRAIQVSILTHN 190 (244)
Q Consensus 137 ~~v----------------~~l~~i~~~----~~~~~~~A~ia~~~G~~~eAe~~~l~~g------~~~~Ai~m~~~l~~ 190 (244)
+-- .||+.+.+- .+.....-.++.-+++.+.=.+.--+.. +.+.|++..++-+=
T Consensus 399 e~s~Vi~kfLdaq~IknLt~YLe~L~~~gla~~dhttlLLncYiKlkd~~kL~efI~~~~~g~~~fd~e~al~Ilr~sny 478 (933)
T KOG2114|consen 399 EPSEVIKKFLDAQRIKNLTSYLEALHKKGLANSDHTTLLLNCYIKLKDVEKLTEFISKCDKGEWFFDVETALEILRKSNY 478 (933)
T ss_pred ChHHHHHHhcCHHHHHHHHHHHHHHHHcccccchhHHHHHHHHHHhcchHHHHHHHhcCCCcceeeeHHHHHHHHHHhCh
Confidence 211 222222221 1122334456666677665444333333 48888888888888
Q ss_pred HHHHHHHHHhcCCc---hhHHHHHHH------HHHHHcCCcccHHHHHhccc
Q psy11525 191 WDRALELALRHKTH---IDTVLYQRK------KYLDNLEKIETNEKFLRLQS 233 (244)
Q Consensus 191 W~~Al~LA~~~~~~---~~~vl~~r~------~yl~~~~~~e~~~~f~~~~~ 233 (244)
.++|..||++++.| ++.++..-+ +|++.+.-.|+..-..+|++
T Consensus 479 l~~a~~LA~k~~~he~vl~ille~~~ny~eAl~yi~slp~~e~l~~l~kyGk 530 (933)
T KOG2114|consen 479 LDEAELLATKFKKHEWVLDILLEDLHNYEEALRYISSLPISELLRTLNKYGK 530 (933)
T ss_pred HHHHHHHHHHhccCHHHHHHHHHHhcCHHHHHHHHhcCCHHHHHHHHHHHHH
Confidence 88888888888765 455554444 34455555666666666654
No 36
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=95.68 E-value=0.11 Score=45.85 Aligned_cols=107 Identities=20% Similarity=0.251 Sum_probs=83.7
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHc
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLG 162 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~ 162 (244)
.+..|+|.+|....+.. .+...|.-+|....+.|+++-|++.|... ++-.-..+.....+|..+++.
T Consensus 110 ~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qA--------l~L~~~~p~~~nNlgms~~L~ 181 (257)
T COG5010 110 QIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQA--------LELAPNEPSIANNLGMSLLLR 181 (257)
T ss_pred HHHhcchHHHHHHHHHHhccCCCChhhhhHHHHHHHHccChhHHHHHHHHH--------HHhccCCchhhhhHHHHHHHc
Confidence 49999999999999886 56889999999999999999999999763 222223344456789999999
Q ss_pred CCHHHHHHHHHHC---C----CHHHH-HHHHHHhcCHHHHHHHHHhcC
Q psy11525 163 GNISEAESILLQH---G----LIFRA-IQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 163 G~~~eAe~~~l~~---g----~~~~A-i~m~~~l~~W~~Al~LA~~~~ 202 (244)
|+++.|+.+++++ + ++... .-.....+++++|-+++.+.-
T Consensus 182 gd~~~A~~lll~a~l~~~ad~~v~~NLAl~~~~~g~~~~A~~i~~~e~ 229 (257)
T COG5010 182 GDLEDAETLLLPAYLSPAADSRVRQNLALVVGLQGDFREAEDIAVQEL 229 (257)
T ss_pred CCHHHHHHHHHHHHhCCCCchHHHHHHHHHHhhcCChHHHHhhccccc
Confidence 9999999999875 1 22222 234567789999999988765
No 37
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=95.66 E-value=0.21 Score=40.96 Aligned_cols=83 Identities=22% Similarity=0.213 Sum_probs=65.8
Q ss_pred HHHHHH----HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHH
Q psy11525 83 SVLHSY----AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAA 153 (244)
Q Consensus 83 ~~l~~~----l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~ 153 (244)
+.+|.+ .+.|++++|.++-+.+ ....-|-.||...-..+++.-|..||.+....+ .+-+...-
T Consensus 36 ~~lY~~A~~ly~~G~l~~A~~~f~~L~~~Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~--------~ddp~~~~ 107 (157)
T PRK15363 36 NTLYRYAMQLMEVKEFAGAARLFQLLTIYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIK--------IDAPQAPW 107 (157)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcC--------CCCchHHH
Confidence 445554 8999999999888775 578999999999999999999999999865321 12344556
Q ss_pred HHHHHHHHcCCHHHHHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILL 173 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l 173 (244)
..|.+.+..|+.+.|.+.|.
T Consensus 108 ~ag~c~L~lG~~~~A~~aF~ 127 (157)
T PRK15363 108 AAAECYLACDNVCYAIKALK 127 (157)
T ss_pred HHHHHHHHcCCHHHHHHHHH
Confidence 78899999999999888763
No 38
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=95.62 E-value=0.052 Score=38.87 Aligned_cols=76 Identities=17% Similarity=0.080 Sum_probs=52.3
Q ss_pred HcCCHHHHHHHHhcc----C---CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHc
Q psy11525 90 ASHSWPQALSLCRTL----N---DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLG 162 (244)
Q Consensus 90 ~~~~~~~A~~~~~~~----~---~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~ 162 (244)
+.|+|++|..+.+.+ + ....|-.+|....+.++++-|...+.+ ..... ...+.....|.++..+
T Consensus 1 ~~~~y~~Ai~~~~k~~~~~~~~~~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~~~--------~~~~~~~l~a~~~~~l 71 (84)
T PF12895_consen 1 DQGNYENAIKYYEKLLELDPTNPNSAYLYNLAQCYFQQGKYEEAIELLQK-LKLDP--------SNPDIHYLLARCLLKL 71 (84)
T ss_dssp HTT-HHHHHHHHHHHHHHHCGTHHHHHHHHHHHHHHHTTHHHHHHHHHHC-HTHHH--------CHHHHHHHHHHHHHHT
T ss_pred CCccHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHCCCHHHHHHHHHH-hCCCC--------CCHHHHHHHHHHHHHh
Confidence 368899999988875 3 346788899999999998888877766 11111 0012233458999999
Q ss_pred CCHHHHHHHHHH
Q psy11525 163 GNISEAESILLQ 174 (244)
Q Consensus 163 G~~~eAe~~~l~ 174 (244)
|++++|.+.|.+
T Consensus 72 ~~y~eAi~~l~~ 83 (84)
T PF12895_consen 72 GKYEEAIKALEK 83 (84)
T ss_dssp T-HHHHHHHHHH
T ss_pred CCHHHHHHHHhc
Confidence 999999988865
No 39
>PRK11189 lipoprotein NlpI; Provisional
Probab=95.60 E-value=0.78 Score=40.94 Aligned_cols=46 Identities=13% Similarity=0.072 Sum_probs=37.5
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
|...|++++|....... +....|..+|......++++.|..+|.+.
T Consensus 74 ~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~A 124 (296)
T PRK11189 74 YDSLGLRALARNDFSQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSV 124 (296)
T ss_pred HHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 46789999998755543 56789999999999999999999888764
No 40
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=95.58 E-value=0.38 Score=50.24 Aligned_cols=144 Identities=13% Similarity=0.001 Sum_probs=79.1
Q ss_pred HHcCCHHHHHHHHhcc----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh---------------------CChhHH-HHH
Q psy11525 89 AASHSWPQALSLCRTL----NDDILWACLAGMATYSRDLATSEEAYAAI---------------------EQVDKV-MYI 142 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l---------------------~d~~~v-~~l 142 (244)
...|++++|....+.. .....|..+|..+++.++++.|+.+|.+. |+++.. ..+
T Consensus 520 ~~~Gr~eeAi~~~rka~~~~p~~~a~~~la~all~~Gd~~eA~~~l~qAL~l~P~~~~l~~~La~~l~~~Gr~~eAl~~~ 599 (987)
T PRK09782 520 YQVEDYATALAAWQKISLHDMSNEDLLAAANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLHAQRYIPGQPELALNDL 599 (987)
T ss_pred HHCCCHHHHHHHHHHHhccCCCcHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCccHHHHHHHHHHHHhCCCHHHHHHHH
Confidence 4788888888777653 23456778888888888888888877432 222221 112
Q ss_pred HHhhCC-CcH--HHHHHHHHHHcCCHHHHHHHHHHC-----CCHHHH---HHHHHHhcCHHHHHHHHHh---cCCchhHH
Q psy11525 143 NHIKGI-PVK--AAQQAEMYLLGGNISEAESILLQH-----GLIFRA---IQVSILTHNWDRALELALR---HKTHIDTV 208 (244)
Q Consensus 143 ~~i~~~-~~~--~~~~A~ia~~~G~~~eAe~~~l~~-----g~~~~A---i~m~~~l~~W~~Al~LA~~---~~~~~~~v 208 (244)
++...+ ++. ....|.++...|++++|+..|.++ +.+..- -..+.+.+++++|++..++ ..|.-...
T Consensus 600 ~~AL~l~P~~~a~~~LA~~l~~lG~~deA~~~l~~AL~l~Pd~~~a~~nLG~aL~~~G~~eeAi~~l~~AL~l~P~~~~a 679 (987)
T PRK09782 600 TRSLNIAPSANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKGLPDDPAL 679 (987)
T ss_pred HHHHHhCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHH
Confidence 222221 111 133566666667777776666553 111100 1245667778887665543 33443444
Q ss_pred HHHHHHHHHHcCC-cccHHHHHhcc
Q psy11525 209 LYQRKKYLDNLEK-IETNEKFLRLQ 232 (244)
Q Consensus 209 l~~r~~yl~~~~~-~e~~~~f~~~~ 232 (244)
......-+...|+ .++...|+++.
T Consensus 680 ~~nLA~al~~lGd~~eA~~~l~~Al 704 (987)
T PRK09782 680 IRQLAYVNQRLDDMAATQHYARLVI 704 (987)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 5555555566666 35555565553
No 41
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=95.44 E-value=0.12 Score=39.80 Aligned_cols=80 Identities=23% Similarity=0.236 Sum_probs=56.5
Q ss_pred HHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHH
Q psy11525 86 HSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYL 160 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~ 160 (244)
..++..|++++|.+..+.. .+...|..+|......++++-|..+|.+.-... ....+.....|.++.
T Consensus 25 ~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~--------p~~~~~~~~la~~~~ 96 (135)
T TIGR02552 25 YNLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD--------PDDPRPYFHAAECLL 96 (135)
T ss_pred HHHHHcccHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--------CCChHHHHHHHHHHH
Confidence 3458899999999887664 467899999988888888888888776543211 111223355677888
Q ss_pred HcCCHHHHHHHHH
Q psy11525 161 LGGNISEAESILL 173 (244)
Q Consensus 161 ~~G~~~eAe~~~l 173 (244)
..|++++|.+.|.
T Consensus 97 ~~g~~~~A~~~~~ 109 (135)
T TIGR02552 97 ALGEPESALKALD 109 (135)
T ss_pred HcCCHHHHHHHHH
Confidence 8888887777664
No 42
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=95.33 E-value=0.51 Score=47.05 Aligned_cols=152 Identities=10% Similarity=-0.148 Sum_probs=92.3
Q ss_pred cHHHHHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC-----hhH------------
Q psy11525 81 YISVLHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ-----VDK------------ 138 (244)
Q Consensus 81 ~~~~l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d-----~~~------------ 138 (244)
+.....+.+..|++++|..+++.. .+...+..||...+..++++-|...|.++-. ...
T Consensus 45 ~~~~~~~~~~~g~~~~A~~l~~~~l~~~p~~~~~l~~l~~~~l~~g~~~~A~~~l~~~l~~~P~~~~a~~~la~~l~~~g 124 (656)
T PRK15174 45 IILFAIACLRKDETDVGLTLLSDRVLTAKNGRDLLRRWVISPLASSQPDAVLQVVNKLLAVNVCQPEDVLLVASVLLKSK 124 (656)
T ss_pred HHHHHHHHHhcCCcchhHHHhHHHHHhCCCchhHHHHHhhhHhhcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHcC
Confidence 445667779999999999998875 5788999999999999999999998877522 111
Q ss_pred -----HHHHHHhhCC-Cc---HHHHHHHHHHHcCCHHHHHHHHHHC-----CCHHH--HHHHHHHhcCHHHHHHHHHhcC
Q psy11525 139 -----VMYINHIKGI-PV---KAAQQAEMYLLGGNISEAESILLQH-----GLIFR--AIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 139 -----v~~l~~i~~~-~~---~~~~~A~ia~~~G~~~eAe~~~l~~-----g~~~~--Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
+..+++...+ ++ .....|.++...|++++|...|.+. +.+.. ..-.+...+++++|+...++.-
T Consensus 125 ~~~~Ai~~l~~Al~l~P~~~~a~~~la~~l~~~g~~~eA~~~~~~~~~~~P~~~~a~~~~~~l~~~g~~~eA~~~~~~~l 204 (656)
T PRK15174 125 QYATVADLAEQAWLAFSGNSQIFALHLRTLVLMDKELQAISLARTQAQEVPPRGDMIATCLSFLNKSRLPEDHDLARALL 204 (656)
T ss_pred CHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHCCChHHHHHHHHHHHHhCCCCHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 1222222222 11 1234567777788888887776432 11111 1223566788888887766521
Q ss_pred ----CchhHHHHHHHHHHHHcCC-cccHHHHHhcc
Q psy11525 203 ----THIDTVLYQRKKYLDNLEK-IETNEKFLRLQ 232 (244)
Q Consensus 203 ----~~~~~vl~~r~~yl~~~~~-~e~~~~f~~~~ 232 (244)
.+...........+.+.|+ .+....|+++.
T Consensus 205 ~~~~~~~~~~~~~l~~~l~~~g~~~eA~~~~~~al 239 (656)
T PRK15174 205 PFFALERQESAGLAVDTLCAVGKYQEAIQTGESAL 239 (656)
T ss_pred hcCCCcchhHHHHHHHHHHHCCCHHHHHHHHHHHH
Confidence 1111111222333344444 56666666654
No 43
>PLN03218 maturation of RBCL 1; Provisional
Probab=95.28 E-value=0.38 Score=50.65 Aligned_cols=53 Identities=19% Similarity=0.239 Sum_probs=27.7
Q ss_pred cHHHHHHHHHcCCHHHHHHHHhcc------CCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 81 YISVLHSYAASHSWPQALSLCRTL------NDDILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 81 ~~~~l~~~l~~~~~~~A~~~~~~~------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
|...+..|...|++++|.++.+.+ .+...|..|-..+.+.++++.|.+.|..+
T Consensus 440 yn~LL~a~~k~g~~e~A~~lf~~M~~~Gl~pD~~tynsLI~~y~k~G~vd~A~~vf~eM 498 (1060)
T PLN03218 440 FNMLMSVCASSQDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEM 498 (1060)
T ss_pred HHHHHHHHHhCcCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHhCcCHHHHHHHHHHH
Confidence 344444455556666655555443 24455555555555555555555544443
No 44
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=94.98 E-value=0.81 Score=48.54 Aligned_cols=144 Identities=15% Similarity=0.112 Sum_probs=84.9
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC-----hhHH--------------HHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ-----VDKV--------------MYIN 143 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d-----~~~v--------------~~l~ 143 (244)
+...|++++|.+..+.. .+...|..+|......++++.|+..|.++-. .... ..+.
T Consensus 471 ~~~~g~~~eA~~~~~~Al~~~P~~~~~~~~LA~~~~~~G~~~~A~~~l~~al~~~P~~~~~~~a~al~l~~~~~~~~Al~ 550 (1157)
T PRK11447 471 LENQGKWAQAAELQRQRLALDPGSVWLTYRLAQDLRQAGQRSQADALMRRLAQQKPNDPEQVYAYGLYLSGSDRDRAALA 550 (1157)
T ss_pred HHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHhCCCHHHHHH
Confidence 46789999999988774 3567788899999999999999999886521 1111 0111
Q ss_pred HhhCCCc-----------------HHHHHHHHHHHcCCHHHHHHHHHHCCCH----HHHHHHHHHhcCHHHHHHHHHh--
Q psy11525 144 HIKGIPV-----------------KAAQQAEMYLLGGNISEAESILLQHGLI----FRAIQVSILTHNWDRALELALR-- 200 (244)
Q Consensus 144 ~i~~~~~-----------------~~~~~A~ia~~~G~~~eAe~~~l~~g~~----~~Ai~m~~~l~~W~~Al~LA~~-- 200 (244)
.+..+.. .....|..+...|++++|.+++.....- ...-.++...+++++|++..++
T Consensus 551 ~l~~l~~~~~~~~~~~l~~~l~~~~~l~~a~~l~~~G~~~eA~~~l~~~p~~~~~~~~La~~~~~~g~~~~A~~~y~~al 630 (1157)
T PRK11447 551 HLNTLPRAQWNSNIQELAQRLQSDQVLETANRLRDSGKEAEAEALLRQQPPSTRIDLTLADWAQQRGDYAAARAAYQRVL 630 (1157)
T ss_pred HHHhCCchhcChhHHHHHHHHhhhHHHHHHHHHHHCCCHHHHHHHHHhCCCCchHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 1111110 0123456667778888888877643221 1123456777888888776554
Q ss_pred -cCCchhHHHHHHHHHHHHcCC-cccHHHHHhc
Q psy11525 201 -HKTHIDTVLYQRKKYLDNLEK-IETNEKFLRL 231 (244)
Q Consensus 201 -~~~~~~~vl~~r~~yl~~~~~-~e~~~~f~~~ 231 (244)
..|.-.+.+......+...|+ .++.+.|.+.
T Consensus 631 ~~~P~~~~a~~~la~~~~~~g~~~eA~~~l~~l 663 (1157)
T PRK11447 631 TREPGNADARLGLIEVDIAQGDLAAARAQLAKL 663 (1157)
T ss_pred HhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 334322333333333434444 5555566544
No 45
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=94.97 E-value=0.12 Score=34.91 Aligned_cols=81 Identities=15% Similarity=0.068 Sum_probs=54.0
Q ss_pred HHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHH
Q psy11525 87 SYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLL 161 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~ 161 (244)
.+...|++++|.+.+... .+...|..+|......++++.|..+|...-+.. ..........|.++..
T Consensus 9 ~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~--------~~~~~~~~~~~~~~~~ 80 (100)
T cd00189 9 LYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELD--------PDNAKAYYNLGLAYYK 80 (100)
T ss_pred HHHHHhcHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--------CcchhHHHHHHHHHHH
Confidence 346789999999888764 234678888888888888888887776542210 1111233456777777
Q ss_pred cCCHHHHHHHHHHC
Q psy11525 162 GGNISEAESILLQH 175 (244)
Q Consensus 162 ~G~~~eAe~~~l~~ 175 (244)
.|++++|...|.++
T Consensus 81 ~~~~~~a~~~~~~~ 94 (100)
T cd00189 81 LGKYEEALEAYEKA 94 (100)
T ss_pred HHhHHHHHHHHHHH
Confidence 77777777776553
No 46
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=94.92 E-value=1.8 Score=45.93 Aligned_cols=145 Identities=14% Similarity=-0.041 Sum_probs=101.1
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC-----hhHHHHHHHh--hCCCc-HHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ-----VDKVMYINHI--KGIPV-KAAQ 154 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d-----~~~v~~l~~i--~~~~~-~~~~ 154 (244)
+...++.+.|.+....+ +++..+..++..++..++.+-|++.+.++-. .........+ ...+. ....
T Consensus 38 ~~~~~~~d~a~~~l~kl~~~~p~~p~~~~~~~~~~l~~g~~~~A~~~l~~l~~~~P~~~~~~~~~~~~~~~~~~~~~~l~ 117 (1157)
T PRK11447 38 GEATHREDLVRQSLYRLELIDPNNPDVIAARFRLLLRQGDSDGAQKLLDRLSQLAPDSNAYRSSRTTMLLSTPEGRQALQ 117 (1157)
T ss_pred HHhhCChHHHHHHHHHHHccCCCCHHHHHHHHHHHHhCCCHHHHHHHHHHHHhhCCCChHHHHHHHHHHhcCCchhhHHH
Confidence 48899999999888775 5789999999999999999999999998843 2222211111 11122 2366
Q ss_pred HHHHHHHcCCHHHHHHHHHHC-----CCHHHHHHHHHH----hcCHHHHHHHHHhc---CCchhHHHHHHHHHHHHcCC-
Q psy11525 155 QAEMYLLGGNISEAESILLQH-----GLIFRAIQVSIL----THNWDRALELALRH---KTHIDTVLYQRKKYLDNLEK- 221 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~-----g~~~~Ai~m~~~----l~~W~~Al~LA~~~---~~~~~~vl~~r~~yl~~~~~- 221 (244)
.|.++...|++++|.+.|.+. .....|++.+.. .++|++|++.-++. .|....+.....+.+...|+
T Consensus 118 ~A~ll~~~g~~~eA~~~~~~~l~~~p~~~~la~~y~~~~~~~~g~~~~A~~~L~~ll~~~P~~~~~~~~LA~ll~~~g~~ 197 (1157)
T PRK11447 118 QARLLATTGRTEEALASYDKLFNGAPPELDLAVEYWRLVAKLPAQRPEAINQLQRLNADYPGNTGLRNTLALLLFSSGRR 197 (1157)
T ss_pred HHHHHHhCCCHHHHHHHHHHHccCCCCChHHHHHHHHHHhhCCccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHccCCH
Confidence 788999999999999999874 234556655554 49999998877653 24333444455555555555
Q ss_pred cccHHHHHhcc
Q psy11525 222 IETNEKFLRLQ 232 (244)
Q Consensus 222 ~e~~~~f~~~~ 232 (244)
.|+.+.|.++.
T Consensus 198 ~eAl~~l~~~~ 208 (1157)
T PRK11447 198 DEGFAVLEQMA 208 (1157)
T ss_pred HHHHHHHHHHh
Confidence 67778787764
No 47
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=94.84 E-value=0.37 Score=44.89 Aligned_cols=101 Identities=16% Similarity=0.044 Sum_probs=70.1
Q ss_pred HHcCCHHHHHHHHhccC----CHHHHHHH-HHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCC-Cc----HHHHHHHH
Q psy11525 89 AASHSWPQALSLCRTLN----DDILWACL-AGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGI-PV----KAAQQAEM 158 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~~----~~~lW~~L-A~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~-~~----~~~~~A~i 158 (244)
+..|+|+.|.+...... ++.+-..+ |..|.+.++++.|..-|.+.. +. ++ .....|++
T Consensus 95 ~~eGd~~~A~k~l~~~~~~~~~p~l~~llaA~aA~~~g~~~~A~~~l~~A~------------~~~~~~~~~~~l~~a~l 162 (398)
T PRK10747 95 LAEGDYQQVEKLMTRNADHAEQPVVNYLLAAEAAQQRGDEARANQHLERAA------------ELADNDQLPVEITRVRI 162 (398)
T ss_pred HhCCCHHHHHHHHHHHHhcccchHHHHHHHHHHHHHCCCHHHHHHHHHHHH------------hcCCcchHHHHHHHHHH
Confidence 55799999998887652 24444444 555588888887776665532 22 22 12355899
Q ss_pred HHHcCCHHHHHHHHHHCCC-----H---HHHHHHHHHhcCHHHHHHHHHhc
Q psy11525 159 YLLGGNISEAESILLQHGL-----I---FRAIQVSILTHNWDRALELALRH 201 (244)
Q Consensus 159 a~~~G~~~eAe~~~l~~g~-----~---~~Ai~m~~~l~~W~~Al~LA~~~ 201 (244)
++..|++++|.+.+.+.-. + ....++|...++|++|+++-.+.
T Consensus 163 ~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll~~~~~~~gdw~~a~~~l~~l 213 (398)
T PRK10747 163 QLARNENHAARHGVDKLLEVAPRHPEVLRLAEQAYIRTGAWSSLLDILPSM 213 (398)
T ss_pred HHHCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 9999999999998876421 1 35688999999999999555443
No 48
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=94.73 E-value=0.48 Score=41.46 Aligned_cols=138 Identities=18% Similarity=0.159 Sum_probs=72.8
Q ss_pred HHHHHHHHcCCHHHHHHHHhcc-------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc-HHH-
Q psy11525 83 SVLHSYAASHSWPQALSLCRTL-------NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV-KAA- 153 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~-------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~-~~~- 153 (244)
..+.-+...++|+++.++.+.+ .+...|..+|....+.|+.+.|+++|.+. ++ ..|+ ...
T Consensus 115 ~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~a~~~~~~G~~~~A~~~~~~a--------l~---~~P~~~~~~ 183 (280)
T PF13429_consen 115 SALQLYYRLGDYDEAEELLEKLEELPAAPDSARFWLALAEIYEQLGDPDKALRDYRKA--------LE---LDPDDPDAR 183 (280)
T ss_dssp ---H-HHHTT-HHHHHHHHHHHHH-T---T-HHHHHHHHHHHHHCCHHHHHHHHHHHH--------HH---H-TT-HHHH
T ss_pred HHHHHHHHHhHHHHHHHHHHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHH--------HH---cCCCCHHHH
Confidence 3444567888898888876663 35678899999999999998888888752 11 1121 222
Q ss_pred -HHHHHHHHcCCHHHHHHHH---HHC----CCHHH-HHHHHHHhcCHHHHHHHHHhc---CCchhHHHHHHHHHHHHcCC
Q psy11525 154 -QQAEMYLLGGNISEAESIL---LQH----GLIFR-AIQVSILTHNWDRALELALRH---KTHIDTVLYQRKKYLDNLEK 221 (244)
Q Consensus 154 -~~A~ia~~~G~~~eAe~~~---l~~----g~~~~-Ai~m~~~l~~W~~Al~LA~~~---~~~~~~vl~~r~~yl~~~~~ 221 (244)
..+.+++..|++++|..++ .+. ...-. --..|..+|+.++|+...++. ++.--.++......|.+.|+
T Consensus 184 ~~l~~~li~~~~~~~~~~~l~~~~~~~~~~~~~~~~la~~~~~lg~~~~Al~~~~~~~~~~p~d~~~~~~~a~~l~~~g~ 263 (280)
T PF13429_consen 184 NALAWLLIDMGDYDEAREALKRLLKAAPDDPDLWDALAAAYLQLGRYEEALEYLEKALKLNPDDPLWLLAYADALEQAGR 263 (280)
T ss_dssp HHHHHHHCTTCHHHHHHHHHHHHHHH-HTSCCHCHHHHHHHHHHT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHT----
T ss_pred HHHHHHHHHCCChHHHHHHHHHHHHHCcCHHHHHHHHHHHhccccccccccccccccccccccccccccccccccccccc
Confidence 2344555557777644444 332 22222 246677888888888776552 33334556666777777776
Q ss_pred c-ccHHHHHhc
Q psy11525 222 I-ETNEKFLRL 231 (244)
Q Consensus 222 ~-e~~~~f~~~ 231 (244)
. ++....+++
T Consensus 264 ~~~A~~~~~~~ 274 (280)
T PF13429_consen 264 KDEALRLRRQA 274 (280)
T ss_dssp -----------
T ss_pred ccccccccccc
Confidence 5 444444444
No 49
>PLN03077 Protein ECB2; Provisional
Probab=94.56 E-value=0.55 Score=47.98 Aligned_cols=53 Identities=11% Similarity=0.127 Sum_probs=43.5
Q ss_pred HHHHHHHHHcCCHHHHHHHHhccC--CHHHHHHHHHHHHHcCCHHHHHHHHHHhC
Q psy11525 82 ISVLHSYAASHSWPQALSLCRTLN--DDILWACLAGMATYSRDLATSEEAYAAIE 134 (244)
Q Consensus 82 ~~~l~~~l~~~~~~~A~~~~~~~~--~~~lW~~LA~~Al~~~~l~~A~~~y~~l~ 134 (244)
...+.-|...|++++|.++...+. +...|..+-....+.++++.|...|..+.
T Consensus 226 n~Li~~y~k~g~~~~A~~lf~~m~~~d~~s~n~li~~~~~~g~~~eAl~lf~~M~ 280 (857)
T PLN03077 226 NALITMYVKCGDVVSARLVFDRMPRRDCISWNAMISGYFENGECLEGLELFFTMR 280 (857)
T ss_pred hHHHHHHhcCCCHHHHHHHHhcCCCCCcchhHHHHHHHHhCCCHHHHHHHHHHHH
Confidence 445556789999999999998874 45679999999999999999988888764
No 50
>PRK12370 invasion protein regulator; Provisional
Probab=94.52 E-value=1.2 Score=43.49 Aligned_cols=105 Identities=7% Similarity=-0.077 Sum_probs=72.8
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCC-cHHHHHHHHHHHc
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIP-VKAAQQAEMYLLG 162 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~-~~~~~~A~ia~~~ 162 (244)
...|++++|....+.. +....|..+|......++++.|...|.+.-.. .... .-....+.++...
T Consensus 349 ~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~G~~~eAi~~~~~Al~l---------~P~~~~~~~~~~~~~~~~ 419 (553)
T PRK12370 349 TIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINECLKL---------DPTRAAAGITKLWITYYH 419 (553)
T ss_pred HHccCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhc---------CCCChhhHHHHHHHHHhc
Confidence 6789999999988774 45778999999999999999998887763211 1111 1112334556678
Q ss_pred CCHHHHHHHHHHC------CCHH---HHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 163 GNISEAESILLQH------GLIF---RAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 163 G~~~eAe~~~l~~------g~~~---~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
|++++|...|.+. +.+. ..-..+..+|+.++|.+...+..
T Consensus 420 g~~eeA~~~~~~~l~~~~p~~~~~~~~la~~l~~~G~~~eA~~~~~~~~ 468 (553)
T PRK12370 420 TGIDDAIRLGDELRSQHLQDNPILLSMQVMFLSLKGKHELARKLTKEIS 468 (553)
T ss_pred cCHHHHHHHHHHHHHhccccCHHHHHHHHHHHHhCCCHHHHHHHHHHhh
Confidence 9999998888653 2222 12345567899999999887754
No 51
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=94.52 E-value=1.6 Score=34.38 Aligned_cols=86 Identities=16% Similarity=0.040 Sum_probs=53.8
Q ss_pred HHHHHHHcCCHHHHHHHHhcc-----CC---HHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHH
Q psy11525 84 VLHSYAASHSWPQALSLCRTL-----ND---DILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQ 155 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~-----~~---~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~ 155 (244)
.+...+..+++..+-..+..+ .+ ..-+-.+|..++..++++-|...|..+-+-..=..+..+ -..+.
T Consensus 17 ~~~~~~~~~~~~~~~~~~~~l~~~~~~s~ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~-----a~l~L 91 (145)
T PF09976_consen 17 QALQALQAGDPAKAEAAAEQLAKDYPSSPYAALAALQLAKAAYEQGDYDEAKAALEKALANAPDPELKPL-----ARLRL 91 (145)
T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCHHHHHH-----HHHHH
Confidence 344446788888887755444 22 344556778888888888888888764432100111111 12467
Q ss_pred HHHHHHcCCHHHHHHHHHH
Q psy11525 156 AEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 156 A~ia~~~G~~~eAe~~~l~ 174 (244)
|.+.+..|++++|...+-.
T Consensus 92 A~~~~~~~~~d~Al~~L~~ 110 (145)
T PF09976_consen 92 ARILLQQGQYDEALATLQQ 110 (145)
T ss_pred HHHHHHcCCHHHHHHHHHh
Confidence 8888888888888888754
No 52
>KOG0985|consensus
Probab=94.36 E-value=0.52 Score=48.90 Aligned_cols=49 Identities=16% Similarity=0.223 Sum_probs=45.6
Q ss_pred HcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhH
Q psy11525 90 ASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDK 138 (244)
Q Consensus 90 ~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~ 138 (244)
.-+..++|+.+|...+++..|.+||++-|+.+.+..|...|.+-.|++.
T Consensus 1087 ~i~~ldRA~efAe~~n~p~vWsqlakAQL~~~~v~dAieSyikadDps~ 1135 (1666)
T KOG0985|consen 1087 NIGSLDRAYEFAERCNEPAVWSQLAKAQLQGGLVKDAIESYIKADDPSN 1135 (1666)
T ss_pred HhhhHHHHHHHHHhhCChHHHHHHHHHHHhcCchHHHHHHHHhcCCcHH
Confidence 3578999999999999999999999999999999999999999999763
No 53
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=93.77 E-value=1.2 Score=34.79 Aligned_cols=113 Identities=12% Similarity=0.112 Sum_probs=71.7
Q ss_pred ccHHHHHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHH-hhCCCcHHHHHHHH
Q psy11525 80 PYISVLHSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINH-IKGIPVKAAQQAEM 158 (244)
Q Consensus 80 p~~~~l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~-i~~~~~~~~~~A~i 158 (244)
|.+..+..|.+.+.....+.+.+.+.... . +..++ .+....+|+....-..+.+++. -... .....+.+
T Consensus 9 ~~~~vv~~~~~~~~~~~l~~yLe~~~~~~---~-~~~~~----~~~li~ly~~~~~~~ll~~l~~~~~~y--d~~~~~~~ 78 (140)
T smart00299 9 DVSEVVELFEKRNLLEELIPYLESALKLN---S-ENPAL----QTKLIELYAKYDPQKEIERLDNKSNHY--DIEKVGKL 78 (140)
T ss_pred CHHHHHHHHHhCCcHHHHHHHHHHHHccC---c-cchhH----HHHHHHHHHHHCHHHHHHHHHhccccC--CHHHHHHH
Confidence 44556666677788888888888753221 0 12222 2334455666555555666662 1111 12345667
Q ss_pred HHHcCCHHHHHHHHHHCCCHHHHHHHHHHh-cCHHHHHHHHHhcC
Q psy11525 159 YLLGGNISEAESILLQHGLIFRAIQVSILT-HNWDRALELALRHK 202 (244)
Q Consensus 159 a~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l-~~W~~Al~LA~~~~ 202 (244)
+...|.+++|.-+|.+.|....|++...+. ++.+.|++.|++.+
T Consensus 79 c~~~~l~~~~~~l~~k~~~~~~Al~~~l~~~~d~~~a~~~~~~~~ 123 (140)
T smart00299 79 CEKAKLYEEAVELYKKDGNFKDAIVTLIEHLGNYEKAIEYFVKQN 123 (140)
T ss_pred HHHcCcHHHHHHHHHhhcCHHHHHHHHHHcccCHHHHHHHHHhCC
Confidence 777778888888888888888888888776 77888888888744
No 54
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=93.76 E-value=0.35 Score=44.56 Aligned_cols=82 Identities=11% Similarity=0.066 Sum_probs=60.7
Q ss_pred HHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHH
Q psy11525 86 HSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYL 160 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~ 160 (244)
......|+|++|....+.. .....|..+|...+..++++.|..+|.+.=+.+ -.......++|.++.
T Consensus 10 ~~a~~~~~~~~Ai~~~~~Al~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~--------P~~~~a~~~lg~~~~ 81 (356)
T PLN03088 10 KEAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANKAIELD--------PSLAKAYLRKGTACM 81 (356)
T ss_pred HHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhC--------cCCHHHHHHHHHHHH
Confidence 3457889999999988774 567899999999999999998888875532111 111223456788889
Q ss_pred HcCCHHHHHHHHHHC
Q psy11525 161 LGGNISEAESILLQH 175 (244)
Q Consensus 161 ~~G~~~eAe~~~l~~ 175 (244)
.+|+|++|...|.++
T Consensus 82 ~lg~~~eA~~~~~~a 96 (356)
T PLN03088 82 KLEEYQTAKAALEKG 96 (356)
T ss_pred HhCCHHHHHHHHHHH
Confidence 999999999988654
No 55
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=93.65 E-value=2.1 Score=39.91 Aligned_cols=104 Identities=13% Similarity=0.024 Sum_probs=64.3
Q ss_pred HHcCCHHHHHHHHhcc----CC-HHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcH--HHHHHHHHHH
Q psy11525 89 AASHSWPQALSLCRTL----ND-DILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVK--AAQQAEMYLL 161 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~----~~-~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~--~~~~A~ia~~ 161 (244)
+..|+|+.|.+..... +. ....-..|+.|.+.++.+.|...|.+. .+. ...... ....|++++.
T Consensus 95 ~~~g~~~~A~~~l~~~~~~~~~~~~~~llaA~aa~~~g~~~~A~~~l~~a--------~~~-~p~~~l~~~~~~a~l~l~ 165 (409)
T TIGR00540 95 LAEGDYAKAEKLIAKNADHAAEPVLNLIKAAEAAQQRGDEARANQHLEEA--------AEL-AGNDNILVEIARTRILLA 165 (409)
T ss_pred HhCCCHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH--------HHh-CCcCchHHHHHHHHHHHH
Confidence 6678888888877654 11 222345566677777777777666552 111 111111 2235888888
Q ss_pred cCCHHHHHHHHHHC-----CCHH---HHHHHHHHhcCHHHHHHHHHhc
Q psy11525 162 GGNISEAESILLQH-----GLIF---RAIQVSILTHNWDRALELALRH 201 (244)
Q Consensus 162 ~G~~~eAe~~~l~~-----g~~~---~Ai~m~~~l~~W~~Al~LA~~~ 201 (244)
.|+++.|.+.+.+- +++. .+..++...++|++|+++..+.
T Consensus 166 ~~~~~~Al~~l~~l~~~~P~~~~~l~ll~~~~~~~~d~~~a~~~l~~l 213 (409)
T TIGR00540 166 QNELHAARHGVDKLLEMAPRHKEVLKLAEEAYIRSGAWQALDDIIDNM 213 (409)
T ss_pred CCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 99998887776552 2222 4578888899999887776553
No 56
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=93.40 E-value=0.77 Score=36.30 Aligned_cols=79 Identities=14% Similarity=0.109 Sum_probs=54.3
Q ss_pred HHHcCCHHHHHHHHhccC----C----HHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTLN----D----DILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMY 159 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~----~----~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia 159 (244)
++..|++++|....+.+. + ...+-.||...+..++++-|...+..+.+-..- +......|.++
T Consensus 58 ~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~~~~~~d~Al~~L~~~~~~~~~---------~~~~~~~Gdi~ 128 (145)
T PF09976_consen 58 AYEQGDYDEAKAALEKALANAPDPELKPLARLRLARILLQQGQYDEALATLQQIPDEAFK---------ALAAELLGDIY 128 (145)
T ss_pred HHHCCCHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHhccCcchH---------HHHHHHHHHHH
Confidence 488999999999887752 1 235666777777777777776666443221111 11245689999
Q ss_pred HHcCCHHHHHHHHHHC
Q psy11525 160 LLGGNISEAESILLQH 175 (244)
Q Consensus 160 ~~~G~~~eAe~~~l~~ 175 (244)
+-.|++++|...|.++
T Consensus 129 ~~~g~~~~A~~~y~~A 144 (145)
T PF09976_consen 129 LAQGDYDEARAAYQKA 144 (145)
T ss_pred HHCCCHHHHHHHHHHh
Confidence 9999999999988763
No 57
>KOG1840|consensus
Probab=93.32 E-value=0.58 Score=45.36 Aligned_cols=140 Identities=15% Similarity=0.106 Sum_probs=85.8
Q ss_pred HHHHcCCHHHHHHHHhcc-------------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCC-----
Q psy11525 87 SYAASHSWPQALSLCRTL-------------NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGI----- 148 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-------------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~----- 148 (244)
-|...|+|.+|...|+.. .....-..++.......+++.|..-|++.. ..+.+.-..
T Consensus 292 ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~~~~~~~~~~~Eea~~l~q~al-----~i~~~~~g~~~~~~ 366 (508)
T KOG1840|consen 292 LYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSELAAILQSMNEYEEAKKLLQKAL-----KIYLDAPGEDNVNL 366 (508)
T ss_pred HHhccCChHHHHHHHHHHHHHHHHhhccChHHHHHHHHHHHHHHHHhcchhHHHHHHHHHH-----HHHHhhccccchHH
Confidence 378999999999988774 235667777888888888999998888432 222222221
Q ss_pred CcHHHHHHHHHHHcCCHHHHHHHHHHC--------C--CHHHHH------HHHHHhcCH-------HHHHHHHHhcCCch
Q psy11525 149 PVKAAQQAEMYLLGGNISEAESILLQH--------G--LIFRAI------QVSILTHNW-------DRALELALRHKTHI 205 (244)
Q Consensus 149 ~~~~~~~A~ia~~~G~~~eAe~~~l~~--------g--~~~~Ai------~m~~~l~~W-------~~Al~LA~~~~~~~ 205 (244)
......+|.++...|+|+||+++|.++ | ++.-++ ..|-++.+. ++|..+.+.-++.-
T Consensus 367 a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~~~~~~~~~~~~~~~l~~la~~~~~~k~~~~a~~l~~~~~~i~~~~g~~~ 446 (508)
T KOG1840|consen 367 AKIYANLAELYLKMGKYKEAEELYKKAIQILRELLGKKDYGVGKPLNQLAEAYEELKKYEEAEQLFEEAKDIMKLCGPDH 446 (508)
T ss_pred HHHHHHHHHHHHHhcchhHHHHHHHHHHHHHHhcccCcChhhhHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHhCCCC
Confidence 123467999999999999999999775 1 111110 112233332 34444444444332
Q ss_pred hHHHHHH---HHHHHHcCCcccHHHHHhc
Q psy11525 206 DTVLYQR---KKYLDNLEKIETNEKFLRL 231 (244)
Q Consensus 206 ~~vl~~r---~~yl~~~~~~e~~~~f~~~ 231 (244)
.++.+-+ ..-+...|..|.+..|...
T Consensus 447 ~~~~~~~~nL~~~Y~~~g~~e~a~~~~~~ 475 (508)
T KOG1840|consen 447 PDVTYTYLNLAALYRAQGNYEAAEELEEK 475 (508)
T ss_pred CchHHHHHHHHHHHHHcccHHHHHHHHHH
Confidence 2223233 3333778888888887765
No 58
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=93.31 E-value=4.7 Score=37.55 Aligned_cols=112 Identities=14% Similarity=0.004 Sum_probs=71.3
Q ss_pred HHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcH-HHHHHHHHHHcCCHHHHHHHHHH----CCCHHH-----H
Q psy11525 112 CLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVK-AAQQAEMYLLGGNISEAESILLQ----HGLIFR-----A 181 (244)
Q Consensus 112 ~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~-~~~~A~ia~~~G~~~eAe~~~l~----~g~~~~-----A 181 (244)
.-|..++..|+++.|++...+.-+ ....+.. ....|+++...|++++|.+.|.+ .+.... .
T Consensus 89 ~~glla~~~g~~~~A~~~l~~~~~---------~~~~~~~~~llaA~aa~~~g~~~~A~~~l~~a~~~~p~~~l~~~~~~ 159 (409)
T TIGR00540 89 EEALLKLAEGDYAKAEKLIAKNAD---------HAAEPVLNLIKAAEAAQQRGDEARANQHLEEAAELAGNDNILVEIAR 159 (409)
T ss_pred HHHHHHHhCCCHHHHHHHHHHHhh---------cCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCcCchHHHHHH
Confidence 346778889999999988744211 1111112 35678999999999999999988 344332 3
Q ss_pred HHHHHHhcCHHHHHHHHHhc---CCchhHHHHHHHHHH-HHcCCcccHHHHHhcc
Q psy11525 182 IQVSILTHNWDRALELALRH---KTHIDTVLYQRKKYL-DNLEKIETNEKFLRLQ 232 (244)
Q Consensus 182 i~m~~~l~~W~~Al~LA~~~---~~~~~~vl~~r~~yl-~~~~~~e~~~~f~~~~ 232 (244)
.+++...+++++|++.++.. .|+-..++.....-+ +.....+..+.+.+..
T Consensus 160 a~l~l~~~~~~~Al~~l~~l~~~~P~~~~~l~ll~~~~~~~~d~~~a~~~l~~l~ 214 (409)
T TIGR00540 160 TRILLAQNELHAARHGVDKLLEMAPRHKEVLKLAEEAYIRSGAWQALDDIIDNMA 214 (409)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 67777889999999887663 344344554444444 4444454555554443
No 59
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=93.20 E-value=4.6 Score=34.03 Aligned_cols=48 Identities=19% Similarity=0.060 Sum_probs=38.5
Q ss_pred HHHHHcCCHHHHHHHHhcc----C-C---HHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 86 HSYAASHSWPQALSLCRTL----N-D---DILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~----~-~---~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
..++..|+|++|....+.+ . + ...|-.+|......++++.|...|...
T Consensus 41 ~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~ 96 (235)
T TIGR03302 41 KEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRF 96 (235)
T ss_pred HHHHHcCCHHHHHHHHHHHHHhCCCchhHHHHHHHHHHHHHhcCCHHHHHHHHHHH
Confidence 3458999999999988764 1 1 256788999999999999999988865
No 60
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=93.09 E-value=0.32 Score=32.68 Aligned_cols=47 Identities=9% Similarity=0.138 Sum_probs=41.2
Q ss_pred HHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 87 SYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
.++..|+|++|.+..+.+ .....|..+|......++++-|...|.+.
T Consensus 6 ~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a 57 (65)
T PF13432_consen 6 ALYQQGDYDEAIAAFEQALKQDPDNPEAWYLLGRILYQQGRYDEALAYYERA 57 (65)
T ss_dssp HHHHCTHHHHHHHHHHHHHCCSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHcCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 468999999999999886 47899999999999999999999988864
No 61
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=92.97 E-value=0.25 Score=33.33 Aligned_cols=46 Identities=13% Similarity=0.079 Sum_probs=39.6
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhC
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIE 134 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~ 134 (244)
++.|+|++|.++.+.. .+...|..+|...++.++++-|+..+.++-
T Consensus 2 l~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~ 52 (68)
T PF14559_consen 2 LKQGDYDEAIELLEKALQRNPDNPEARLLLAQCYLKQGQYDEAEELLERLL 52 (68)
T ss_dssp HHTTHHHHHHHHHHHHHHHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCH
T ss_pred hhccCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 6789999999999885 578999999999999999999998887654
No 62
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=92.96 E-value=0.38 Score=32.62 Aligned_cols=62 Identities=15% Similarity=0.109 Sum_probs=45.4
Q ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcC-CHHHHHHHHHHC
Q psy11525 106 DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGG-NISEAESILLQH 175 (244)
Q Consensus 106 ~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G-~~~eAe~~~l~~ 175 (244)
+...|..+|......++++.|+.+|.+.=+.+ .+.......+|.+...+| ++++|.+.|.++
T Consensus 2 ~a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~--------p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~a 64 (69)
T PF13414_consen 2 NAEAWYNLGQIYFQQGDYEEAIEYFEKAIELD--------PNNAEAYYNLGLAYMKLGKDYEEAIEDFEKA 64 (69)
T ss_dssp SHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHS--------TTHHHHHHHHHHHHHHTTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC--------CCCHHHHHHHHHHHHHhCccHHHHHHHHHHH
Confidence 45789999999999999999999998632211 111123356888999999 688888877653
No 63
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=92.72 E-value=7.5 Score=40.88 Aligned_cols=144 Identities=14% Similarity=-0.069 Sum_probs=72.7
Q ss_pred HHcCCHHHHHHHHhcc--CCHHHHHHH--HHHHHHcCCHHHHHHHHHHhCCh--------------------hH-HHHHH
Q psy11525 89 AASHSWPQALSLCRTL--NDDILWACL--AGMATYSRDLATSEEAYAAIEQV--------------------DK-VMYIN 143 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~--~~~~lW~~L--A~~Al~~~~l~~A~~~y~~l~d~--------------------~~-v~~l~ 143 (244)
+..++.++|....... ..+..|..+ |......++++.|+.+|.++... .. ..+++
T Consensus 487 l~~~~~~eAi~a~~~Al~~~Pd~~~~L~lA~al~~~Gr~eeAi~~~rka~~~~p~~~a~~~la~all~~Gd~~eA~~~l~ 566 (987)
T PRK09782 487 YRDTLPGVALYAWLQAEQRQPDAWQHRAVAYQAYQVEDYATALAAWQKISLHDMSNEDLLAAANTAQAAGNGAARDRWLQ 566 (987)
T ss_pred HHhCCcHHHHHHHHHHHHhCCchHHHHHHHHHHHHCCCHHHHHHHHHHHhccCCCcHHHHHHHHHHHHCCCHHHHHHHHH
Confidence 4458888888833331 122355554 55557899999999999765331 11 12222
Q ss_pred HhhCC-CcHHHHHHHHHHHc---CCHHHHHHHHHHCCC--H-----HHHHHHHHHhcCHHHHHHHHHh---cCCchhHHH
Q psy11525 144 HIKGI-PVKAAQQAEMYLLG---GNISEAESILLQHGL--I-----FRAIQVSILTHNWDRALELALR---HKTHIDTVL 209 (244)
Q Consensus 144 ~i~~~-~~~~~~~A~ia~~~---G~~~eAe~~~l~~g~--~-----~~Ai~m~~~l~~W~~Al~LA~~---~~~~~~~vl 209 (244)
..... ++.....+.++... |++++|...|.++=. | ...-.++..++++++|+....+ .+|.-....
T Consensus 567 qAL~l~P~~~~l~~~La~~l~~~Gr~~eAl~~~~~AL~l~P~~~a~~~LA~~l~~lG~~deA~~~l~~AL~l~Pd~~~a~ 646 (987)
T PRK09782 567 QAEQRGLGDNALYWWLHAQRYIPGQPELALNDLTRSLNIAPSANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQ 646 (987)
T ss_pred HHHhcCCccHHHHHHHHHHHHhCCCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHH
Confidence 22222 11111222233333 666666666655421 1 0112345666777776654433 344433444
Q ss_pred HHHHHHHHHcCC-cccHHHHHhcc
Q psy11525 210 YQRKKYLDNLEK-IETNEKFLRLQ 232 (244)
Q Consensus 210 ~~r~~yl~~~~~-~e~~~~f~~~~ 232 (244)
......|.+.|+ .++.+.|+++.
T Consensus 647 ~nLG~aL~~~G~~eeAi~~l~~AL 670 (987)
T PRK09782 647 AALGYALWDSGDIAQSREMLERAH 670 (987)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHH
Confidence 444444555554 45556666553
No 64
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=92.71 E-value=1.6 Score=44.18 Aligned_cols=120 Identities=18% Similarity=0.142 Sum_probs=81.0
Q ss_pred HHHHcCCHHHHHHHHhccCCH--------------------HHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhh
Q psy11525 87 SYAASHSWPQALSLCRTLNDD--------------------ILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIK 146 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~~~~--------------------~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~ 146 (244)
-+++.+++++|.+.+..+... ..+..+|......++++-|+.+|..+-. ..-
T Consensus 319 a~~~~g~~~eA~~~l~~~~~~~P~~~~~~~~~~~~p~~~~~~a~~~~a~~l~~~g~~~eA~~~l~~al~--------~~P 390 (765)
T PRK10049 319 SLLESENYPGALTVTAHTINNSPPFLRLYGSPTSIPNDDWLQGQSLLSQVAKYSNDLPQAEMRARELAY--------NAP 390 (765)
T ss_pred HHHhcccHHHHHHHHHHHhhcCCceEeecCCCCCCCCchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH--------hCC
Confidence 458999999999997765221 2346788888888999988888654211 111
Q ss_pred CCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCH---------HHHHHHHHHhcCHHHHHHHHHhc---CCchhHHHHHHHH
Q psy11525 147 GIPVKAAQQAEMYLLGGNISEAESILLQHGLI---------FRAIQVSILTHNWDRALELALRH---KTHIDTVLYQRKK 214 (244)
Q Consensus 147 ~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~---------~~Ai~m~~~l~~W~~Al~LA~~~---~~~~~~vl~~r~~ 214 (244)
...+.....|.++...|++++|++.|.++-.. ..|+ .+..+++|++|..+.... .|.-..+..+.+.
T Consensus 391 ~n~~l~~~lA~l~~~~g~~~~A~~~l~~al~l~Pd~~~l~~~~a~-~al~~~~~~~A~~~~~~ll~~~Pd~~~~~~~~~~ 469 (765)
T PRK10049 391 GNQGLRIDYASVLQARGWPRAAENELKKAEVLEPRNINLEVEQAW-TALDLQEWRQMDVLTDDVVAREPQDPGVQRLARA 469 (765)
T ss_pred CCHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCCCChHHHHHHHH-HHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHHHH
Confidence 11122356788999999999999999886442 3344 788999999999887653 2333344444444
Q ss_pred H
Q psy11525 215 Y 215 (244)
Q Consensus 215 y 215 (244)
+
T Consensus 470 ~ 470 (765)
T PRK10049 470 R 470 (765)
T ss_pred H
Confidence 4
No 65
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=92.63 E-value=1.1 Score=36.21 Aligned_cols=98 Identities=14% Similarity=0.086 Sum_probs=59.2
Q ss_pred HHHHHcCCHHHHHHHHhcc-----C---CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHH
Q psy11525 86 HSYAASHSWPQALSLCRTL-----N---DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAE 157 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~-----~---~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ 157 (244)
..+...|++++|....+.. + ....|..+|......++++.|..+|.+.-.. -....+.....|.
T Consensus 43 ~~~~~~g~~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~--------~~~~~~~~~~la~ 114 (168)
T CHL00033 43 MSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALER--------NPFLPQALNNMAV 114 (168)
T ss_pred HHHHHcCCHHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh--------CcCcHHHHHHHHH
Confidence 3357889999999887664 2 2247999999999999999999988653211 0111122234455
Q ss_pred HHHHcCCHHHHHHHHHHCCCHHHHHHHHH-HhcCHHHHHHH
Q psy11525 158 MYLLGGNISEAESILLQHGLIFRAIQVSI-LTHNWDRALEL 197 (244)
Q Consensus 158 ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~-~l~~W~~Al~L 197 (244)
++...|+ .+.+.|+++.|+..+. .+..|++|+.+
T Consensus 115 i~~~~~~------~~~~~g~~~~A~~~~~~a~~~~~~a~~~ 149 (168)
T CHL00033 115 ICHYRGE------QAIEQGDSEIAEAWFDQAAEYWKQAIAL 149 (168)
T ss_pred HHHHhhH------HHHHcccHHHHHHHHHHHHHHHHHHHHh
Confidence 5554443 3345666666665553 23445555543
No 66
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=92.23 E-value=2.5 Score=33.51 Aligned_cols=94 Identities=6% Similarity=-0.132 Sum_probs=60.6
Q ss_pred HHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhC-CCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 96 QALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKG-IPVKAAQQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 96 ~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~-~~~~~~~~A~ia~~~G~~~eAe~~~l~ 174 (244)
..++-+...+.+ .|..+|..+.+.++++-|..+|.+.=+ +.. ..+-....|.++...|++++|...|-+
T Consensus 14 ~~~~~al~~~p~-~~~~~g~~~~~~g~~~~A~~~~~~al~---------~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~ 83 (144)
T PRK15359 14 DILKQLLSVDPE-TVYASGYASWQEGDYSRAVIDFSWLVM---------AQPWSWRAHIALAGTWMMLKEYTTAINFYGH 83 (144)
T ss_pred HHHHHHHHcCHH-HHHHHHHHHHHcCCHHHHHHHHHHHHH---------cCCCcHHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence 344444445443 377889999999999999888876211 111 112335689999999999999998877
Q ss_pred CCC-----HH---HHHHHHHHhcCHHHHHHHHH
Q psy11525 175 HGL-----IF---RAIQVSILTHNWDRALELAL 199 (244)
Q Consensus 175 ~g~-----~~---~Ai~m~~~l~~W~~Al~LA~ 199 (244)
+-. ++ ..-..+..+|+.++|+....
T Consensus 84 Al~l~p~~~~a~~~lg~~l~~~g~~~eAi~~~~ 116 (144)
T PRK15359 84 ALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQ 116 (144)
T ss_pred HHhcCCCCcHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 632 11 11234567778877765443
No 67
>KOG1125|consensus
Probab=92.04 E-value=0.21 Score=48.39 Aligned_cols=84 Identities=21% Similarity=0.213 Sum_probs=63.4
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCCh--hHHHHHHHhhCCCcHHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQV--DKVMYINHIKGIPVKAAQQAEMYL 160 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~--~~v~~l~~i~~~~~~~~~~A~ia~ 160 (244)
|--+|.|++|+..-+.. ++..+|.+||..--...+-..|..+|.+.-+. ..|+ -.-.+|..++
T Consensus 440 y~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR----------~RyNlgIS~m 509 (579)
T KOG1125|consen 440 YNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVR----------VRYNLGISCM 509 (579)
T ss_pred HhcchHHHHHHHHHHHHHhcCCchHHHHHHhhHHhcCCcccHHHHHHHHHHHhcCCCeee----------eehhhhhhhh
Confidence 45688999988877663 57899999999888888899999999875431 1111 1124788999
Q ss_pred HcCCHHHHHHHHHHCCCHHHHHHHHHH
Q psy11525 161 LGGNISEAESILLQHGLIFRAIQVSIL 187 (244)
Q Consensus 161 ~~G~~~eAe~~~l~~g~~~~Ai~m~~~ 187 (244)
-+|.|+||.++|+. ||.|...
T Consensus 510 NlG~ykEA~~hlL~------AL~mq~k 530 (579)
T KOG1125|consen 510 NLGAYKEAVKHLLE------ALSMQRK 530 (579)
T ss_pred hhhhHHHHHHHHHH------HHHhhhc
Confidence 99999999999986 5666554
No 68
>KOG3785|consensus
Probab=91.67 E-value=2.9 Score=39.00 Aligned_cols=132 Identities=14% Similarity=0.123 Sum_probs=82.2
Q ss_pred HHHHHHHcCCHHHHHHHHhccC--------CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhh-CCCc----
Q psy11525 84 VLHSYAASHSWPQALSLCRTLN--------DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIK-GIPV---- 150 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~~--------~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~-~~~~---- 150 (244)
.|..||..++|..|..+.++-. +-.+|- ..||.++||+..+-...... +.++
T Consensus 28 ~Ledfls~rDytGAislLefk~~~~~EEE~~~~lWi---------------a~C~fhLgdY~~Al~~Y~~~~~~~~~~~e 92 (557)
T KOG3785|consen 28 ELEDFLSNRDYTGAISLLEFKLNLDREEEDSLQLWI---------------AHCYFHLGDYEEALNVYTFLMNKDDAPAE 92 (557)
T ss_pred hHHHHHhcccchhHHHHHHHhhccchhhhHHHHHHH---------------HHHHHhhccHHHHHHHHHHHhccCCCCcc
Confidence 3889999999999999987731 223453 46888999987664333322 2211
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHH----HhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHcCCcccHH
Q psy11525 151 KAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSI----LTHNWDRALELALRHKTHIDTVLYQRKKYLDNLEKIETNE 226 (244)
Q Consensus 151 ~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~----~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~~~~e~~~ 226 (244)
.....|...-++|.|.||+.+--++.+-..-+++.. .+++=++-+.+-....-..++-|.+..-+.+...-.|+++
T Consensus 93 l~vnLAcc~FyLg~Y~eA~~~~~ka~k~pL~~RLlfhlahklndEk~~~~fh~~LqD~~EdqLSLAsvhYmR~HYQeAId 172 (557)
T KOG3785|consen 93 LGVNLACCKFYLGQYIEAKSIAEKAPKTPLCIRLLFHLAHKLNDEKRILTFHSSLQDTLEDQLSLASVHYMRMHYQEAID 172 (557)
T ss_pred cchhHHHHHHHHHHHHHHHHHHhhCCCChHHHHHHHHHHHHhCcHHHHHHHHHHHhhhHHHHHhHHHHHHHHHHHHHHHH
Confidence 235688899999999999999999988777666543 3455444444444333223333333333334444445555
Q ss_pred HHHh
Q psy11525 227 KFLR 230 (244)
Q Consensus 227 ~f~~ 230 (244)
.|+.
T Consensus 173 vYkr 176 (557)
T KOG3785|consen 173 VYKR 176 (557)
T ss_pred HHHH
Confidence 5544
No 69
>KOG0276|consensus
Probab=91.64 E-value=14 Score=36.83 Aligned_cols=53 Identities=19% Similarity=0.122 Sum_probs=41.9
Q ss_pred cHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHH---HHHHHHhcC
Q psy11525 150 VKAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDR---ALELALRHK 202 (244)
Q Consensus 150 ~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~---Al~LA~~~~ 202 (244)
.|-.+.|..++..|++..|++.|.++.++.-.+=++...|+=+- +-.+|++.+
T Consensus 667 ~Kw~~Lg~~al~~~~l~lA~EC~~~a~d~~~LlLl~t~~g~~~~l~~la~~~~~~g 722 (794)
T KOG0276|consen 667 VKWRQLGDAALSAGELPLASECFLRARDLGSLLLLYTSSGNAEGLAVLASLAKKQG 722 (794)
T ss_pred HHHHHHHHHHhhcccchhHHHHHHhhcchhhhhhhhhhcCChhHHHHHHHHHHhhc
Confidence 35567899999999999999999999999999999999888773 333444444
No 70
>KOG2034|consensus
Probab=91.49 E-value=3.5 Score=42.21 Aligned_cols=145 Identities=17% Similarity=0.204 Sum_probs=94.1
Q ss_pred HHHHHHHcCCHHHHHHHHhccC--CHHHHHHHHHHHHHcCCHHHHHHHHHHhCC-hhHH---------------H-----
Q psy11525 84 VLHSYAASHSWPQALSLCRTLN--DDILWACLAGMATYSRDLATSEEAYAAIEQ-VDKV---------------M----- 140 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~~--~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d-~~~v---------------~----- 140 (244)
.=.-||+.|+|+.|.++|+.-+ -+..-..=|..+.+.+++..|..+|+..-. +.-| -
T Consensus 364 vWk~yLd~g~y~kAL~~ar~~p~~le~Vl~~qAdf~f~~k~y~~AA~~yA~t~~~FEEVaLKFl~~~~~~~L~~~L~KKL 443 (911)
T KOG2034|consen 364 VWKTYLDKGEFDKALEIARTRPDALETVLLKQADFLFQDKEYLRAAEIYAETLSSFEEVALKFLEINQERALRTFLDKKL 443 (911)
T ss_pred HHHHHHhcchHHHHHHhccCCHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhhHHHHHHHHHhcCCHHHHHHHHHHHH
Confidence 3344799999999999999863 366777889999999999999999997632 1100 1
Q ss_pred ---------------------HHHHhhCCC--cHH-----------------------------HHHHHHHHHcCCHHHH
Q psy11525 141 ---------------------YINHIKGIP--VKA-----------------------------AQQAEMYLLGGNISEA 168 (244)
Q Consensus 141 ---------------------~l~~i~~~~--~~~-----------------------------~~~A~ia~~~G~~~eA 168 (244)
+|.++..+. ++. .-.-.++...|+.+++
T Consensus 444 ~~lt~~dk~q~~~Lv~WLlel~L~~Ln~l~~~de~~~en~~~~~~~~~re~~~~~~~~~~~~nretv~~l~~~~~~~e~l 523 (911)
T KOG2034|consen 444 DRLTPEDKTQRDALVTWLLELYLEQLNDLDSTDEEALENWRLEYDEVQREFSKFLVLHKDELNRETVYQLLASHGRQEEL 523 (911)
T ss_pred hhCChHHHHHHHHHHHHHHHHHHHHHhcccccChhHHHHHHHHHHHHHHHHHHHHHhhHHhhhHHHHHHHHHHccCHHHH
Confidence 122222221 000 0023456667777777
Q ss_pred HHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHcCCcccHHHHHh
Q psy11525 169 ESILLQHGLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLDNLEKIETNEKFLR 230 (244)
Q Consensus 169 e~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~~~~e~~~~f~~ 230 (244)
-..-.--++++.-+--+++-+.|++||+.-.+... .++.+...--|-..-..++.....+
T Consensus 524 l~fA~l~~d~~~vv~~~~q~e~yeeaLevL~~~~~--~el~yk~ap~Li~~~p~~tV~~wm~ 583 (911)
T KOG2034|consen 524 LQFANLIKDYEFVVSYWIQQENYEEALEVLLNQRN--PELFYKYAPELITHSPKETVSAWMA 583 (911)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--hhhHHHhhhHHHhcCcHHHHHHHHH
Confidence 76666667778888888888999999888777643 2555555555544445555544443
No 71
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=91.48 E-value=2.2 Score=43.03 Aligned_cols=110 Identities=9% Similarity=0.009 Sum_probs=63.6
Q ss_pred HHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCC---------HHHHHHHHHHh
Q psy11525 118 TYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGL---------IFRAIQVSILT 188 (244)
Q Consensus 118 l~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~---------~~~Ai~m~~~l 188 (244)
..++...++..+..++. .+.++..+..+-....|.+...+|+++||+.++..+-. ... ......+
T Consensus 60 ~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~La~i~~~~g~~~ea~~~l~~~~~~~Pd~~~a~~~~-a~~L~~~ 133 (694)
T PRK15179 60 ERHAAVHKPAAALPELL-----DYVRRYPHTELFQVLVARALEAAHRSDEGLAVWRGIHQRFPDSSEAFILM-LRGVKRQ 133 (694)
T ss_pred HHhhhhcchHhhHHHHH-----HHHHhccccHHHHHHHHHHHHHcCCcHHHHHHHHHHHhhCCCcHHHHHHH-HHHHHHh
Confidence 33444444444444433 23334334444456677777788888888777766521 122 2334567
Q ss_pred cCHHHHHHHHHhc---CCchhHHHHHHHHHHHHcCC-cccHHHHHhccc
Q psy11525 189 HNWDRALELALRH---KTHIDTVLYQRKKYLDNLEK-IETNEKFLRLQS 233 (244)
Q Consensus 189 ~~W~~Al~LA~~~---~~~~~~vl~~r~~yl~~~~~-~e~~~~f~~~~~ 233 (244)
++.|+|+..|++. +++--+.+..+..-|.+.|+ +|+++.|+++..
T Consensus 134 ~~~eeA~~~~~~~l~~~p~~~~~~~~~a~~l~~~g~~~~A~~~y~~~~~ 182 (694)
T PRK15179 134 QGIEAGRAEIELYFSGGSSSAREILLEAKSWDEIGQSEQADACFERLSR 182 (694)
T ss_pred ccHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHhcchHHHHHHHHHHHh
Confidence 7777777777663 34445556666777777776 466777777653
No 72
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=91.18 E-value=1.5 Score=29.25 Aligned_cols=59 Identities=17% Similarity=0.032 Sum_probs=37.4
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 109 LWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 109 lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
.|..+|......++++.|...|...-+... .........|.++...|++++|.+.|.++
T Consensus 2 ~~~~~a~~~~~~~~~~~A~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 60 (100)
T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKALELDP--------DNADAYYNLAAAYYKLGKYEEALEDYEKA 60 (100)
T ss_pred HHHHHHHHHHHHhcHHHHHHHHHHHHhcCC--------ccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 366778888888888888777654321100 00012345677888888888888877654
No 73
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=91.02 E-value=3.8 Score=38.55 Aligned_cols=108 Identities=18% Similarity=0.151 Sum_probs=77.1
Q ss_pred cHHHHHHHH-HcCCHHHHHHHHhcc--CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCC--cH--HH
Q psy11525 81 YISVLHSYA-ASHSWPQALSLCRTL--NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIP--VK--AA 153 (244)
Q Consensus 81 ~~~~l~~~l-~~~~~~~A~~~~~~~--~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~--~~--~~ 153 (244)
+-++|.+|+ ..++++.|..+.+.+ ..+..+..||+..+...+- ...++.+++..+.. +. ..
T Consensus 171 Lv~~Ll~~l~~t~~~~~ai~lle~L~~~~pev~~~LA~v~l~~~~E------------~~AI~ll~~aL~~~p~d~~LL~ 238 (395)
T PF09295_consen 171 LVDTLLKYLSLTQRYDEAIELLEKLRERDPEVAVLLARVYLLMNEE------------VEAIRLLNEALKENPQDSELLN 238 (395)
T ss_pred HHHHHHHHHhhcccHHHHHHHHHHHHhcCCcHHHHHHHHHHhcCcH------------HHHHHHHHHHHHhCCCCHHHHH
Confidence 456777774 457999999999986 4466677777776654443 23344455444321 22 24
Q ss_pred HHHHHHHHcCCHHHHHHHHHHC--------CCHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 154 QQAEMYLLGGNISEAESILLQH--------GLIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~~--------g~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
.+|..++..|+++.|.++..++ .-|....+.|+.+++|+.||.....
T Consensus 239 ~Qa~fLl~k~~~~lAL~iAk~av~lsP~~f~~W~~La~~Yi~~~d~e~ALlaLNs 293 (395)
T PF09295_consen 239 LQAEFLLSKKKYELALEIAKKAVELSPSEFETWYQLAECYIQLGDFENALLALNS 293 (395)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHhCchhHHHHHHHHHHHHhcCCHHHHHHHHhc
Confidence 5788999999999999999887 3567779999999999999965554
No 74
>PRK15331 chaperone protein SicA; Provisional
Probab=90.95 E-value=1.9 Score=35.64 Aligned_cols=81 Identities=14% Similarity=0.046 Sum_probs=63.0
Q ss_pred HHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHH
Q psy11525 85 LHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMY 159 (244)
Q Consensus 85 l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia 159 (244)
-++|...|++++|-.+-+++ .+..-|.-||...-..++++.|..+|......+ .+-+.+.-..|.+.
T Consensus 44 Ay~~y~~Gk~~eA~~~F~~L~~~d~~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~--------~~dp~p~f~agqC~ 115 (165)
T PRK15331 44 AYEFYNQGRLDEAETFFRFLCIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLL--------KNDYRPVFFTGQCQ 115 (165)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc--------cCCCCccchHHHHH
Confidence 34569999999998877665 568899999999999999999999998754221 12344556788999
Q ss_pred HHcCCHHHHHHHHH
Q psy11525 160 LLGGNISEAESILL 173 (244)
Q Consensus 160 ~~~G~~~eAe~~~l 173 (244)
+.+|+.+.|.+.|-
T Consensus 116 l~l~~~~~A~~~f~ 129 (165)
T PRK15331 116 LLMRKAAKARQCFE 129 (165)
T ss_pred HHhCCHHHHHHHHH
Confidence 99999998888543
No 75
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=90.93 E-value=0.66 Score=41.64 Aligned_cols=48 Identities=15% Similarity=0.057 Sum_probs=36.9
Q ss_pred HHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC
Q psy11525 88 YAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQ 135 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d 135 (244)
++..|++++|++++....+-..=...-...|..++++.|++-+..+.+
T Consensus 112 ~~~~~~~~~AL~~l~~~~~lE~~al~Vqi~L~~~R~dlA~k~l~~~~~ 159 (290)
T PF04733_consen 112 LFHEGDYEEALKLLHKGGSLELLALAVQILLKMNRPDLAEKELKNMQQ 159 (290)
T ss_dssp HCCCCHHHHHHCCCTTTTCHHHHHHHHHHHHHTT-HHHHHHHHHHHHC
T ss_pred HHHcCCHHHHHHHHHccCcccHHHHHHHHHHHcCCHHHHHHHHHHHHh
Confidence 366899999999999887666666777788888888888877766653
No 76
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=90.83 E-value=1.2 Score=40.76 Aligned_cols=102 Identities=21% Similarity=0.270 Sum_probs=66.0
Q ss_pred HHHHcCCHHHHHHHHhcc---CCH--HHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHh-hCCCcH--------H
Q psy11525 87 SYAASHSWPQALSLCRTL---NDD--ILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHI-KGIPVK--------A 152 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~---~~~--~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i-~~~~~~--------~ 152 (244)
.||-+++-+.|+++--.+ +++ ..=-.||.- |+.=|.+++.-.+... .+.++. .
T Consensus 44 NfLLs~Q~dKAvdlF~e~l~~d~~t~e~~ltLGnL-------------fRsRGEvDRAIRiHQ~L~~spdlT~~qr~lAl 110 (389)
T COG2956 44 NFLLSNQPDKAVDLFLEMLQEDPETFEAHLTLGNL-------------FRSRGEVDRAIRIHQTLLESPDLTFEQRLLAL 110 (389)
T ss_pred HHHhhcCcchHHHHHHHHHhcCchhhHHHHHHHHH-------------HHhcchHHHHHHHHHHHhcCCCCchHHHHHHH
Confidence 467778888888754443 222 222234433 5555555554333322 233331 1
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCCC-HHHH-------HHHHHHhcCHHHHHHHHHhc
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHGL-IFRA-------IQVSILTHNWDRALELALRH 201 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g~-~~~A-------i~m~~~l~~W~~Al~LA~~~ 201 (244)
-..|+=++..|-+|-||++|..--+ ++.| +.+|+.-..|++|++.|++.
T Consensus 111 ~qL~~Dym~aGl~DRAE~~f~~L~de~efa~~AlqqLl~IYQ~treW~KAId~A~~L 167 (389)
T COG2956 111 QQLGRDYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAIDVAERL 167 (389)
T ss_pred HHHHHHHHHhhhhhHHHHHHHHHhcchhhhHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 2468889999999999999998755 4443 78999999999999999853
No 77
>KOG1840|consensus
Probab=90.78 E-value=2.4 Score=41.22 Aligned_cols=121 Identities=21% Similarity=0.208 Sum_probs=87.5
Q ss_pred HHHHcCCHHHHHHHHhcc-------------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc---
Q psy11525 87 SYAASHSWPQALSLCRTL-------------NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV--- 150 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-------------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~--- 150 (244)
.+-..+++++|..+.+.. .....-.+||..-...|+++-|+.-|..+ +.-.++...-..
T Consensus 334 ~~~~~~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~a-----i~~~~~~~~~~~~~~ 408 (508)
T KOG1840|consen 334 ILQSMNEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKA-----IQILRELLGKKDYGV 408 (508)
T ss_pred HHHHhcchhHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHHH-----HHHHHhcccCcChhh
Confidence 346778888888776543 24678899999999999999999988752 233333322211
Q ss_pred --HHHHHHHHHHHcCCHHHHHHHHHHCCCHHHH---------------HHHHHHhcCHHHHHHHHHhcCCchhHHHHHHH
Q psy11525 151 --KAAQQAEMYLLGGNISEAESILLQHGLIFRA---------------IQVSILTHNWDRALELALRHKTHIDTVLYQRK 213 (244)
Q Consensus 151 --~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~A---------------i~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~ 213 (244)
....+|..+...+++.+|+++|.++..+..+ ...|..+|++|.|.+++.. |+.-|.
T Consensus 409 ~~~l~~la~~~~~~k~~~~a~~l~~~~~~i~~~~g~~~~~~~~~~~nL~~~Y~~~g~~e~a~~~~~~-------~~~~~~ 481 (508)
T KOG1840|consen 409 GKPLNQLAEAYEELKKYEEAEQLFEEAKDIMKLCGPDHPDVTYTYLNLAALYRAQGNYEAAEELEEK-------VLNARE 481 (508)
T ss_pred hHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHhCCCCCchHHHHHHHHHHHHHcccHHHHHHHHHH-------HHHHHH
Confidence 2356788888888999999999999887754 4679999999999998763 455666
Q ss_pred HHHHHc
Q psy11525 214 KYLDNL 219 (244)
Q Consensus 214 ~yl~~~ 219 (244)
..+.+-
T Consensus 482 ~~~~~~ 487 (508)
T KOG1840|consen 482 QRLGTA 487 (508)
T ss_pred HcCCCC
Confidence 655443
No 78
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=90.67 E-value=1.9 Score=36.36 Aligned_cols=102 Identities=15% Similarity=0.148 Sum_probs=57.7
Q ss_pred cCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHH-HHcCC
Q psy11525 91 SHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMY-LLGGN 164 (244)
Q Consensus 91 ~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia-~~~G~ 164 (244)
..+.+++....+.. ++...|..||...+..++++.|..+|.+.-... -+..+-....|.+. ...|+
T Consensus 52 ~~~~~~~i~~l~~~L~~~P~~~~~w~~Lg~~~~~~g~~~~A~~a~~~Al~l~--------P~~~~~~~~lA~aL~~~~g~ 123 (198)
T PRK10370 52 QQTPEAQLQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLR--------GENAELYAALATVLYYQAGQ 123 (198)
T ss_pred chhHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhC--------CCCHHHHHHHHHHHHHhcCC
Confidence 44445555544442 678899999999999999999999996532211 00111123345543 34455
Q ss_pred --HHHHHHHHHHCCCHH----HH----HHHHHHhcCHHHHHHHHHh
Q psy11525 165 --ISEAESILLQHGLIF----RA----IQVSILTHNWDRALELALR 200 (244)
Q Consensus 165 --~~eAe~~~l~~g~~~----~A----i~m~~~l~~W~~Al~LA~~ 200 (244)
+++|.++|.+.=..+ .| =..+..+|+|++|+..-++
T Consensus 124 ~~~~~A~~~l~~al~~dP~~~~al~~LA~~~~~~g~~~~Ai~~~~~ 169 (198)
T PRK10370 124 HMTPQTREMIDKALALDANEVTALMLLASDAFMQADYAQAIELWQK 169 (198)
T ss_pred CCcHHHHHHHHHHHHhCCCChhHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 477777766541110 01 1223456666666655544
No 79
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=90.65 E-value=0.028 Score=44.31 Aligned_cols=51 Identities=16% Similarity=0.144 Sum_probs=46.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCC
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKT 203 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~ 203 (244)
...+.++...|.+++|.-+|.+.|++++|+++...++.|++|++.|++.+.
T Consensus 74 ~~~~~~c~~~~l~~~a~~Ly~~~~~~~~al~i~~~~~~~~~a~e~~~~~~~ 124 (143)
T PF00637_consen 74 DKALRLCEKHGLYEEAVYLYSKLGNHDEALEILHKLKDYEEAIEYAKKVDD 124 (143)
T ss_dssp THHHHHHHTTTSHHHHHHHHHCCTTHTTCSSTSSSTHCSCCCTTTGGGCSS
T ss_pred HHHHHHHHhcchHHHHHHHHHHcccHHHHHHHHHHHccHHHHHHHHHhcCc
Confidence 456788889999999999999999999999999999999999999999874
No 80
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=90.41 E-value=1.6 Score=38.71 Aligned_cols=16 Identities=13% Similarity=0.470 Sum_probs=8.6
Q ss_pred HHHHHHHcCCHHHHHH
Q psy11525 84 VLHSYAASHSWPQALS 99 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~ 99 (244)
+-..|-..|+|++|..
T Consensus 41 Aa~~fk~~~~~~~A~~ 56 (282)
T PF14938_consen 41 AANCFKLAKDWEKAAE 56 (282)
T ss_dssp HHHHHHHTT-CHHHHH
T ss_pred HHHHHHHHhccchhHH
Confidence 3334566677776664
No 81
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=90.35 E-value=0.84 Score=31.14 Aligned_cols=47 Identities=17% Similarity=0.179 Sum_probs=40.6
Q ss_pred HHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 87 SYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
-|+..++|+.|.+.+..+ .+...|...|......+++..|..+|.+.
T Consensus 4 ~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~ 55 (73)
T PF13371_consen 4 IYLQQEDYEEALEVLERALELDPDDPELWLQRARCLFQLGRYEEALEDLERA 55 (73)
T ss_pred HHHhCCCHHHHHHHHHHHHHhCcccchhhHHHHHHHHHhccHHHHHHHHHHH
Confidence 368899999999999885 57889999999999999999998888753
No 82
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=90.35 E-value=3.6 Score=30.07 Aligned_cols=48 Identities=10% Similarity=0.058 Sum_probs=32.6
Q ss_pred HHHHHcCCHHHHHHHHhccC----C----HHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 86 HSYAASHSWPQALSLCRTLN----D----DILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~~----~----~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
..++..|++++|.+.+.... . ...+..+|......++++-|...|..+
T Consensus 10 ~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~ 65 (119)
T TIGR02795 10 LLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAV 65 (119)
T ss_pred HHHHHcCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHH
Confidence 34577888888888776641 1 345666777777777777777776653
No 83
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=90.33 E-value=1.9 Score=40.08 Aligned_cols=81 Identities=15% Similarity=0.079 Sum_probs=51.7
Q ss_pred HHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHH
Q psy11525 85 LHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMY 159 (244)
Q Consensus 85 l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia 159 (244)
++-.+..++.++|.+.++.. ++.....++|..|+..++++.|+..|.+.-. +..-.......|.+.
T Consensus 301 l~~~l~~~~~~~al~~~e~~lk~~P~~~~l~l~lgrl~~~~~~~~~A~~~le~al~---------~~P~~~~~~~La~~~ 371 (398)
T PRK10747 301 LIPRLKTNNPEQLEKVLRQQIKQHGDTPLLWSTLGQLLMKHGEWQEASLAFRAALK---------QRPDAYDYAWLADAL 371 (398)
T ss_pred HHhhccCCChHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHh---------cCCCHHHHHHHHHHH
Confidence 34446668888888877654 4566777888888888888888877754321 111111224567777
Q ss_pred HHcCCHHHHHHHHHH
Q psy11525 160 LLGGNISEAESILLQ 174 (244)
Q Consensus 160 ~~~G~~~eAe~~~l~ 174 (244)
...|+.++|.++|.+
T Consensus 372 ~~~g~~~~A~~~~~~ 386 (398)
T PRK10747 372 DRLHKPEEAAAMRRD 386 (398)
T ss_pred HHcCCHHHHHHHHHH
Confidence 777777777666653
No 84
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=90.32 E-value=2 Score=34.89 Aligned_cols=84 Identities=14% Similarity=0.062 Sum_probs=56.4
Q ss_pred HHHcCCHHHHHHHHhccC--------CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTLN--------DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMY 159 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~--------~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia 159 (244)
+...|++++|...++... ....|..+|......++++.|..+|.+.=+. -..........|.++
T Consensus 45 ~~~~g~~~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~--------~p~~~~~~~~lg~~~ 116 (172)
T PRK02603 45 AQADGEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALEL--------NPKQPSALNNIAVIY 116 (172)
T ss_pred HHHcCCHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh--------CcccHHHHHHHHHHH
Confidence 478899999999887642 1358999999999999999999888642211 111112234567777
Q ss_pred HHcCCHHHHHHHHHHCCCHHHHHHHH
Q psy11525 160 LLGGNISEAESILLQHGLIFRAIQVS 185 (244)
Q Consensus 160 ~~~G~~~eAe~~~l~~g~~~~Ai~m~ 185 (244)
...|+...|. |+.+.|+..+
T Consensus 117 ~~~g~~~~a~------~~~~~A~~~~ 136 (172)
T PRK02603 117 HKRGEKAEEA------GDQDEAEALF 136 (172)
T ss_pred HHcCChHhHh------hCHHHHHHHH
Confidence 7878776655 4455565554
No 85
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=89.36 E-value=0.76 Score=30.76 Aligned_cols=56 Identities=16% Similarity=0.078 Sum_probs=37.1
Q ss_pred HHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 112 CLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 112 ~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
.+|...++.++++.|+.+|..+-... -.-.+-....|.++...|++++|...|.+.
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~l~~~--------P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a 57 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQALKQD--------PDNPEAWYLLGRILYQQGRYDEALAYYERA 57 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHHHCCS--------TTHHHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred hHHHHHHHcCCHHHHHHHHHHHHHHC--------CCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 46778888888888888887643221 000112346788999999999998887653
No 86
>KOG0292|consensus
Probab=89.30 E-value=12 Score=38.84 Aligned_cols=108 Identities=16% Similarity=0.110 Sum_probs=81.5
Q ss_pred ccHHHHHHH---HHcCCHHHHHHHHhcc--CCHHHHHHHH-------------------HHHHHcCCHHHHHHHHHHhCC
Q psy11525 80 PYISVLHSY---AASHSWPQALSLCRTL--NDDILWACLA-------------------GMATYSRDLATSEEAYAAIEQ 135 (244)
Q Consensus 80 p~~~~l~~~---l~~~~~~~A~~~~~~~--~~~~lW~~LA-------------------~~Al~~~~l~~A~~~y~~l~d 135 (244)
+.+.+=+.| |-.++|++-+.+.+.. +-.+.-..|- ..||+.+|+++|..+-.+++|
T Consensus 592 ~IDptEy~FKlALi~k~ydeVl~lI~ns~LvGqaiIaYLqKkgypeiAL~FVkD~~tRF~LaLe~gnle~ale~akkldd 671 (1202)
T KOG0292|consen 592 TIDPTEYRFKLALLNKKYDEVLHLIKNSNLVGQAIIAYLQKKGYPEIALHFVKDERTRFELALECGNLEVALEAAKKLDD 671 (1202)
T ss_pred eechHHHHHHHHHHhhhhHHHHHHHHhcCcccHHHHHHHHhcCCcceeeeeecCcchheeeehhcCCHHHHHHHHHhcCc
Confidence 567777777 8889999999999874 2333333222 357888888888877777776
Q ss_pred hhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 136 VDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 136 ~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
.+.- .+.|+-|+.+|+.+-||..|-+...+++.--+|.-.|+-|+--++++.
T Consensus 672 ~d~w-------------~rLge~Al~qgn~~IaEm~yQ~~knfekLsfLYliTgn~eKL~Km~~i 723 (1202)
T KOG0292|consen 672 KDVW-------------ERLGEEALRQGNHQIAEMCYQRTKNFEKLSFLYLITGNLEKLSKMMKI 723 (1202)
T ss_pred HHHH-------------HHHHHHHHHhcchHHHHHHHHHhhhhhheeEEEEEeCCHHHHHHHHHH
Confidence 6443 467899999999999999999999998888888777777776555543
No 87
>KOG2114|consensus
Probab=88.78 E-value=9.5 Score=39.04 Aligned_cols=123 Identities=16% Similarity=0.029 Sum_probs=79.9
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcH---HHHHHHHHHHcCCHHHHHHHHHHCCCHHHH-HHH
Q psy11525 109 LWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVK---AAQQAEMYLLGGNISEAESILLQHGLIFRA-IQV 184 (244)
Q Consensus 109 lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~---~~~~A~ia~~~G~~~eAe~~~l~~g~~~~A-i~m 184 (244)
+|..-...--..-......-||.+++|+.++.-.-+.....+- ....-+++--.+-.++|+-+-.+.+.-..+ .-+
T Consensus 420 Le~L~~~gla~~dhttlLLncYiKlkd~~kL~efI~~~~~g~~~fd~e~al~Ilr~snyl~~a~~LA~k~~~he~vl~il 499 (933)
T KOG2114|consen 420 LEALHKKGLANSDHTTLLLNCYIKLKDVEKLTEFISKCDKGEWFFDVETALEILRKSNYLDEAELLATKFKKHEWVLDIL 499 (933)
T ss_pred HHHHHHcccccchhHHHHHHHHHHhcchHHHHHHHhcCCCcceeeeHHHHHHHHHHhChHHHHHHHHHHhccCHHHHHHH
Confidence 3433333333334466788999999999998644443332111 122335555666678888888887774444 557
Q ss_pred HHHhcCHHHHHHHHHhcCCc-hhHHHHHHHHHHHHcCCcccHHHHHhc
Q psy11525 185 SILTHNWDRALELALRHKTH-IDTVLYQRKKYLDNLEKIETNEKFLRL 231 (244)
Q Consensus 185 ~~~l~~W~~Al~LA~~~~~~-~~~vl~~r~~yl~~~~~~e~~~~f~~~ 231 (244)
+.+++++++|++.-++..+. +-..+..+-+.|.+-..+||...+.+.
T Consensus 500 le~~~ny~eAl~yi~slp~~e~l~~l~kyGk~Ll~h~P~~t~~ili~~ 547 (933)
T KOG2114|consen 500 LEDLHNYEEALRYISSLPISELLRTLNKYGKILLEHDPEETMKILIEL 547 (933)
T ss_pred HHHhcCHHHHHHHHhcCCHHHHHHHHHHHHHHHHhhChHHHHHHHHHH
Confidence 78899999999999999866 555566667777666667776655543
No 88
>KOG0550|consensus
Probab=88.25 E-value=2.3 Score=40.21 Aligned_cols=135 Identities=16% Similarity=0.021 Sum_probs=92.1
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChh--------H-------H-----------HHHHHhhCCCc----HHHH
Q psy11525 105 NDDILWACLAGMATYSRDLATSEEAYAAIEQVD--------K-------V-----------MYINHIKGIPV----KAAQ 154 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~--------~-------v-----------~~l~~i~~~~~----~~~~ 154 (244)
+.=..|+.=|..+-+.|++..|.+||..-=..+ + | .-.....++.+ ...+
T Consensus 247 k~le~~k~~gN~~fk~G~y~~A~E~Yteal~idP~n~~~naklY~nra~v~~rLgrl~eaisdc~~Al~iD~syikall~ 326 (486)
T KOG0550|consen 247 KKLEVKKERGNDAFKNGNYRKAYECYTEALNIDPSNKKTNAKLYGNRALVNIRLGRLREAISDCNEALKIDSSYIKALLR 326 (486)
T ss_pred HHHHHHHhhhhhHhhccchhHHHHHHHHhhcCCccccchhHHHHHHhHhhhcccCCchhhhhhhhhhhhcCHHHHHHHHH
Confidence 344679999999999999999999998532211 1 1 11111112222 2357
Q ss_pred HHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCC-----------chhHHHHHHHHHH------H
Q psy11525 155 QAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKT-----------HIDTVLYQRKKYL------D 217 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~-----------~~~~vl~~r~~yl------~ 217 (244)
+|...+.++++++|.+-|-++..-..-.+..+.+++-..||.-+++.+. ..+.--.+|..-| .
T Consensus 327 ra~c~l~le~~e~AV~d~~~a~q~~~s~e~r~~l~~A~~aLkkSkRkd~ykilGi~~~as~~eikkayrk~AL~~Hpd~~ 406 (486)
T KOG0550|consen 327 RANCHLALEKWEEAVEDYEKAMQLEKDCEIRRTLREAQLALKKSKRKDWYKILGISRNASDDEIKKAYRKLALVHHPDKN 406 (486)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHhhhhhHHHHhhhhhhcccchhhhHHHHHHHHhCCCcC
Confidence 8999999999999999999998888888888888888888887766331 1122233444444 2
Q ss_pred HcCCcccHHHHHhcccccccCc
Q psy11525 218 NLEKIETNEKFLRLQSEVIMSG 239 (244)
Q Consensus 218 ~~~~~e~~~~f~~~~~~~~~~~ 239 (244)
...+.|+..+|+..+.+-.|++
T Consensus 407 agsq~eaE~kFkevgeAy~il~ 428 (486)
T KOG0550|consen 407 AGSQKEAEAKFKEVGEAYTILS 428 (486)
T ss_pred cchhHHHHHHHHHHHHHHHHhc
Confidence 3446899999999987766653
No 89
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=88.22 E-value=2.3 Score=33.14 Aligned_cols=38 Identities=16% Similarity=0.212 Sum_probs=32.3
Q ss_pred CHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhc
Q psy11525 164 NISEAESILLQHGLIFRAIQVSILTHNWDRALELALRH 201 (244)
Q Consensus 164 ~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~ 201 (244)
+.+.|-++..+.+.+..|+-+|...++|++|++++-.+
T Consensus 71 d~~~~~~~c~~~~l~~~~~~l~~k~~~~~~Al~~~l~~ 108 (140)
T smart00299 71 DIEKVGKLCEKAKLYEEAVELYKKDGNFKDAIVTLIEH 108 (140)
T ss_pred CHHHHHHHHHHcCcHHHHHHHHHhhcCHHHHHHHHHHc
Confidence 36778888888888888899999899999998888776
No 90
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=88.10 E-value=0.63 Score=32.38 Aligned_cols=63 Identities=13% Similarity=0.037 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCC----CcHHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGI----PVKAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~----~~~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
..+..+|......++++.|..+|.+.-+ ..+.+.+. ......+|.+....|++++|.+.|.++
T Consensus 6 ~~~~~la~~~~~~~~~~~A~~~~~~al~-----~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~a 72 (78)
T PF13424_consen 6 NAYNNLARVYRELGRYDEALDYYEKALD-----IEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKA 72 (78)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHH-----HHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH-----HHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4567788888888888888888876332 21111110 011235788888888888888887654
No 91
>KOG1155|consensus
Probab=87.82 E-value=5.1 Score=38.49 Aligned_cols=85 Identities=18% Similarity=0.193 Sum_probs=61.0
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHH---hCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCH---
Q psy11525 105 NDDILWACLAGMATYSRDLATSEEAYAA---IEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLI--- 178 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~~~y~~---l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~--- 178 (244)
.+..+|..||+--.+-.+++.|++||.+ +||+.+ +...+.|.+.-.+++.++|.+.|.+.=..
T Consensus 430 nDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~-----------~~l~~LakLye~l~d~~eAa~~yek~v~~~~~ 498 (559)
T KOG1155|consen 430 NDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEG-----------SALVRLAKLYEELKDLNEAAQYYEKYVEVSEL 498 (559)
T ss_pred CchHHHHHHHHHHHHhccHHHHHHHHHHHHhccccch-----------HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHh
Confidence 4789999999999999999999999985 344411 22356778888888888888888765331
Q ss_pred --------HHH----HHHHHHhcCHHHHHHHHHh
Q psy11525 179 --------FRA----IQVSILTHNWDRALELALR 200 (244)
Q Consensus 179 --------~~A----i~m~~~l~~W~~Al~LA~~ 200 (244)
.+| -+-+...++|++|-.-|..
T Consensus 499 eg~~~~~t~ka~~fLA~~f~k~~~~~~As~Ya~~ 532 (559)
T KOG1155|consen 499 EGEIDDETIKARLFLAEYFKKMKDFDEASYYATL 532 (559)
T ss_pred hcccchHHHHHHHHHHHHHHhhcchHHHHHHHHH
Confidence 122 2345677788888775544
No 92
>KOG2063|consensus
Probab=87.73 E-value=6.8 Score=40.49 Aligned_cols=106 Identities=18% Similarity=0.190 Sum_probs=65.5
Q ss_pred CCCCCCeEEeEeCCEEEEEecCCcEEEeecCccHHHHHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHH
Q psy11525 49 EFGKSPSIVSFVKNHLTIRRYDGTVINYPISPYISVLHSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEE 128 (244)
Q Consensus 49 ~~~~~~~~v~f~g~~v~~~~~~G~~~~~~~~p~~~~l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~ 128 (244)
+++....+++..++.++...-+|...-..++.+.....+++...+|++|..+|+..+.
T Consensus 278 ~~~~~~~l~s~~~~i~~~~~~s~v~~L~p~~~~~~qi~~lL~~k~fe~ai~L~e~~~~---------------------- 335 (877)
T KOG2063|consen 278 PLSNGRSLLSAHNGIIFVASLSNVWILVPVSNFEKQIQDLLQEKSFEEAISLAEILDS---------------------- 335 (877)
T ss_pred cccccceeeecCCcEEEEEeccceEEEEeccchHHHHHHHHHhhhHHHHHHHHhccCC----------------------
Confidence 4556666777677655555434444333332388889999999999999999887654
Q ss_pred HHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHH-HHHHHHH
Q psy11525 129 AYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLIF-RAIQVSI 186 (244)
Q Consensus 129 ~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~-~Ai~m~~ 186 (244)
.+....+.+..++.+ ..|...-.+|+|++|-+.|.++..=. --+.||=
T Consensus 336 -----~~p~~~~~i~~~~~l-----~~a~~lf~q~~f~ea~~~F~~~~~d~~~vi~lfP 384 (877)
T KOG2063|consen 336 -----PNPKEKRQISCIKIL-----IDAFELFLQKQFEEAMSLFEKSEIDPRHVISLFP 384 (877)
T ss_pred -----CChHHHHHHHHHHHH-----HHHHHHHHhhhHHHHHHHHHhhccChHHHHHhch
Confidence 122222222222111 11667778889999999998886544 3444544
No 93
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=87.68 E-value=8.8 Score=27.92 Aligned_cols=85 Identities=14% Similarity=0.060 Sum_probs=54.8
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhC---CCcHHHHHHHHHHHcCCHHHHHHHHHHCCC-------
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKG---IPVKAAQQAEMYLLGGNISEAESILLQHGL------- 177 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~---~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~------- 177 (244)
..+-.+|...+..++++-|...|..+-. .-.. ..+.....|.++...|++++|.+.|.+.-.
T Consensus 3 ~~~~~~~~~~~~~~~~~~A~~~~~~~~~--------~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~~ 74 (119)
T TIGR02795 3 EAYYDAALLVLKAGDYADAIQAFQAFLK--------KYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKSPK 74 (119)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHH--------HCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCCCc
Confidence 4567788888999999988888776521 1000 011234578888888999888888875421
Q ss_pred ----HHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 178 ----IFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 178 ----~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
....-.++..++++++|+..-.+
T Consensus 75 ~~~~~~~~~~~~~~~~~~~~A~~~~~~ 101 (119)
T TIGR02795 75 APDALLKLGMSLQELGDKEKAKATLQQ 101 (119)
T ss_pred ccHHHHHHHHHHHHhCChHHHHHHHHH
Confidence 12334566677777777665543
No 94
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=87.55 E-value=4.9 Score=36.99 Aligned_cols=100 Identities=14% Similarity=0.031 Sum_probs=65.0
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHC-----CCHH---H
Q psy11525 109 LWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQH-----GLIF---R 180 (244)
Q Consensus 109 lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~-----g~~~---~ 180 (244)
.+...|..+...++++.|..+|.+.= +.--........+|.+++.+|++++|...|.++ +.+. .
T Consensus 4 ~l~~~a~~a~~~~~~~~Ai~~~~~Al--------~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a~~~ 75 (356)
T PLN03088 4 DLEDKAKEAFVDDDFALAVDLYTQAI--------DLDPNNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLR 75 (356)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHH--------HhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHHHHH
Confidence 46677999999999999999886531 111111123356788999999999999988776 1111 1
Q ss_pred HHHHHHHhcCHHHHHHHHHh---cCCc---hhHHHHHHHHHH
Q psy11525 181 AIQVSILTHNWDRALELALR---HKTH---IDTVLYQRKKYL 216 (244)
Q Consensus 181 Ai~m~~~l~~W~~Al~LA~~---~~~~---~~~vl~~r~~yl 216 (244)
.-..|..++++++|+...++ .++. +...+......+
T Consensus 76 lg~~~~~lg~~~eA~~~~~~al~l~P~~~~~~~~l~~~~~kl 117 (356)
T PLN03088 76 KGTACMKLEEYQTAKAALEKGASLAPGDSRFTKLIKECDEKI 117 (356)
T ss_pred HHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHH
Confidence 12466788999999887655 2332 444444444444
No 95
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=86.93 E-value=16 Score=33.70 Aligned_cols=125 Identities=14% Similarity=0.043 Sum_probs=68.8
Q ss_pred HHcCCHHHHHHHHhccC----------CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc----HHHH
Q psy11525 89 AASHSWPQALSLCRTLN----------DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV----KAAQ 154 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~~----------~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~----~~~~ 154 (244)
=..++|+.|+..|+.+. -....-.||..++...+++-|. ..+.+....+. -...
T Consensus 152 Q~treW~KAId~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~~~d~A~------------~~l~kAlqa~~~cvRAsi~ 219 (389)
T COG2956 152 QATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRAR------------ELLKKALQADKKCVRASII 219 (389)
T ss_pred HHhhHHHHHHHHHHHHHHcCCccchhHHHHHHHHHHHHHhhhhhHHHHH------------HHHHHHHhhCccceehhhh
Confidence 34556666666665431 1233444555555555554433 33333333322 2466
Q ss_pred HHHHHHHcCCHHHHHHHHHHCCCH---------HHHHHHHHHhcCHHHHHHHHHh---cCCchhHHHHHHHHHHHHcCCc
Q psy11525 155 QAEMYLLGGNISEAESILLQHGLI---------FRAIQVSILTHNWDRALELALR---HKTHIDTVLYQRKKYLDNLEKI 222 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~g~~---------~~Ai~m~~~l~~W~~Al~LA~~---~~~~~~~vl~~r~~yl~~~~~~ 222 (244)
.|.+++..|+|..|.+.+-..-+- +.-.+.|..+|+-++.+....+ +.+..+-++...+.-+..-|..
T Consensus 220 lG~v~~~~g~y~~AV~~~e~v~eQn~~yl~evl~~L~~~Y~~lg~~~~~~~fL~~~~~~~~g~~~~l~l~~lie~~~G~~ 299 (389)
T COG2956 220 LGRVELAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQLGKPAEGLNFLRRAMETNTGADAELMLADLIELQEGID 299 (389)
T ss_pred hhHHHHhccchHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHccCCccHHHHHHHHHHHhhChH
Confidence 899999999999998877654221 2235678889988887766544 3333444444444444444444
Q ss_pred ccH
Q psy11525 223 ETN 225 (244)
Q Consensus 223 e~~ 225 (244)
++-
T Consensus 300 ~Aq 302 (389)
T COG2956 300 AAQ 302 (389)
T ss_pred HHH
Confidence 433
No 96
>PRK11189 lipoprotein NlpI; Provisional
Probab=86.91 E-value=12 Score=33.17 Aligned_cols=45 Identities=7% Similarity=0.065 Sum_probs=38.0
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
+...|++++|....... .....|..+|......++++.|..+|.+
T Consensus 108 ~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~~~eA~~~~~~ 157 (296)
T PRK11189 108 LTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGRYELAQDDLLA 157 (296)
T ss_pred HHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 48899999999876664 4578899999999999999999988864
No 97
>KOG2280|consensus
Probab=86.85 E-value=12 Score=37.93 Aligned_cols=110 Identities=19% Similarity=0.195 Sum_probs=79.8
Q ss_pred ccHHHHHHHHHcCCHHHHHHHHhcc--CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHH
Q psy11525 80 PYISVLHSYAASHSWPQALSLCRTL--NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAE 157 (244)
Q Consensus 80 p~~~~l~~~l~~~~~~~A~~~~~~~--~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ 157 (244)
.+.+++.+.+..|+-.+|-++.... .++.+|=.-. ++.+.+++..-+.-+-.=++-+=-..=.-+
T Consensus 686 Sl~dTv~~li~~g~~k~a~ql~~~FkipdKr~~wLk~-------------~aLa~~~kweeLekfAkskksPIGy~PFVe 752 (829)
T KOG2280|consen 686 SLHDTVTTLILIGQNKRAEQLKSDFKIPDKRLWWLKL-------------TALADIKKWEELEKFAKSKKSPIGYLPFVE 752 (829)
T ss_pred cHHHHHHHHHHccchHHHHHHHHhcCCcchhhHHHHH-------------HHHHhhhhHHHHHHHHhccCCCCCchhHHH
Confidence 4567888889999999999999874 4555553222 233455554433322222222222334557
Q ss_pred HHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 158 MYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 158 ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
.++..|+-+||.+.+-+++...+-+++|...|++.+|.++|-+++
T Consensus 753 ~c~~~~n~~EA~KYiprv~~l~ekv~ay~~~~~~~eAad~A~~~r 797 (829)
T KOG2280|consen 753 ACLKQGNKDEAKKYIPRVGGLQEKVKAYLRVGDVKEAADLAAEHR 797 (829)
T ss_pred HHHhcccHHHHhhhhhccCChHHHHHHHHHhccHHHHHHHHHHhc
Confidence 888999999999999999999999999999999999999998876
No 98
>KOG2076|consensus
Probab=86.43 E-value=37 Score=35.08 Aligned_cols=106 Identities=16% Similarity=0.134 Sum_probs=63.7
Q ss_pred cHHHHHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCC-CcHHHH
Q psy11525 81 YISVLHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGI-PVKAAQ 154 (244)
Q Consensus 81 ~~~~l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~-~~~~~~ 154 (244)
+.++..-|.. |++++|.++|..+ ....-|..||...-+.|+++-+-.+... .-++..- .+.-.+
T Consensus 143 l~eAN~lfar-g~~eeA~~i~~EvIkqdp~~~~ay~tL~~IyEqrGd~eK~l~~~ll---------AAHL~p~d~e~W~~ 212 (895)
T KOG2076|consen 143 LGEANNLFAR-GDLEEAEEILMEVIKQDPRNPIAYYTLGEIYEQRGDIEKALNFWLL---------AAHLNPKDYELWKR 212 (895)
T ss_pred HHHHHHHHHh-CCHHHHHHHHHHHHHhCccchhhHHHHHHHHHHcccHHHHHHHHHH---------HHhcCCCChHHHHH
Confidence 3455555665 9999999999886 4577788888766666655433332211 1111111 112256
Q ss_pred HHHHHHHcCCHHHHHHHHHHC--------CCHHHHHHHHHHhcCHHHHHH
Q psy11525 155 QAEMYLLGGNISEAESILLQH--------GLIFRAIQVSILTHNWDRALE 196 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~--------g~~~~Ai~m~~~l~~W~~Al~ 196 (244)
.|.++..+|+++.|--+|.++ -.+.+-..||..+|.-.+|++
T Consensus 213 ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~~ers~L~~~~G~~~~Am~ 262 (895)
T KOG2076|consen 213 LADLSEQLGNINQARYCYSRAIQANPSNWELIYERSSLYQKTGDLKRAME 262 (895)
T ss_pred HHHHHHhcccHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHhChHHHHHH
Confidence 777888888888877777654 112334566777777777654
No 99
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=86.08 E-value=29 Score=35.21 Aligned_cols=44 Identities=7% Similarity=-0.050 Sum_probs=20.3
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
.-.|+.++|.+++... .....|..+|..+...++++.|..+|.+
T Consensus 26 ~~~g~~~~A~~~~~~~~~~~~~~a~~~~~lA~~~~~~g~~~~A~~~~~~ 74 (765)
T PRK10049 26 LWAGQDAEVITVYNRYRVHMQLPARGYAAVAVAYRNLKQWQNSLTLWQK 74 (765)
T ss_pred HHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 3445555554444332 1122355555555555555555544444
No 100
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=86.07 E-value=9.2 Score=32.12 Aligned_cols=33 Identities=9% Similarity=0.064 Sum_probs=22.6
Q ss_pred HHHcCCHHHHHHHHhcc----C-CH---HHHHHHHHHHHHc
Q psy11525 88 YAASHSWPQALSLCRTL----N-DD---ILWACLAGMATYS 120 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~----~-~~---~lW~~LA~~Al~~ 120 (244)
|...|+|++|....... + +. ..|..+|....+.
T Consensus 80 ~~~~~~~~~A~~~~~~~l~~~p~~~~~~~a~~~~g~~~~~~ 120 (235)
T TIGR03302 80 YYKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYNQ 120 (235)
T ss_pred HHhcCCHHHHHHHHHHHHHHCcCCCchHHHHHHHHHHHHHh
Confidence 58899999999999875 1 11 2466666665543
No 101
>PF12931 Sec16_C: Sec23-binding domain of Sec16; PDB: 3MZK_C.
Probab=85.03 E-value=1.6 Score=38.96 Aligned_cols=43 Identities=23% Similarity=0.255 Sum_probs=32.7
Q ss_pred HHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCc-hhHHHHHHHHHH
Q psy11525 171 ILLQHGLIFRAIQVSILTHNWDRALELALRHKTH-IDTVLYQRKKYL 216 (244)
Q Consensus 171 ~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~-~~~vl~~r~~yl 216 (244)
-++..|+.++|++++.+-++|..||-||...+++ .+. -.++|+
T Consensus 4 ~~Ll~G~~~~Av~~al~~~~wa~ALlLAs~~g~e~~~~---v~~~y~ 47 (284)
T PF12931_consen 4 QLLLVGNREEAVELALDNGLWAHALLLASSLGPELWKK---VVQEYF 47 (284)
T ss_dssp HHHHTT-HHHHHHHHHHTT-HHHHHHHHHTS-HHHHHH---HHHHHH
T ss_pred HHHhCCCHHHHHHHHHHCCChHHHHHHHHhcCHHHHHH---HHHHHH
Confidence 4788999999999999999999999999999876 444 334455
No 102
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=84.90 E-value=4.6 Score=36.10 Aligned_cols=81 Identities=14% Similarity=0.046 Sum_probs=60.7
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcH----HHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVK----AAQQAEM 158 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~----~~~~A~i 158 (244)
+...|++++|.+.++.. .....+..+|....+.++++.|+..|.+.-+... ..+.. ....|.+
T Consensus 124 ~~~~G~~~~A~~~~~~al~~~p~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~--------~~~~~~~~~~~~la~~ 195 (355)
T cd05804 124 LEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWD--------CSSMLRGHNWWHLALF 195 (355)
T ss_pred HHHcCCHHHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhccC--------CCcchhHHHHHHHHHH
Confidence 48899999999998885 3456778899999999999999988876432211 01111 2357889
Q ss_pred HHHcCCHHHHHHHHHHCC
Q psy11525 159 YLLGGNISEAESILLQHG 176 (244)
Q Consensus 159 a~~~G~~~eAe~~~l~~g 176 (244)
+...|++++|..+|-+..
T Consensus 196 ~~~~G~~~~A~~~~~~~~ 213 (355)
T cd05804 196 YLERGDYEAALAIYDTHI 213 (355)
T ss_pred HHHCCCHHHHHHHHHHHh
Confidence 999999999999998853
No 103
>PRK14574 hmsH outer membrane protein; Provisional
Probab=84.45 E-value=22 Score=36.71 Aligned_cols=139 Identities=10% Similarity=-0.024 Sum_probs=83.2
Q ss_pred HHHHHHHHcCCHHHHHHHHhcc----C-C-HHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHH-
Q psy11525 83 SVLHSYAASHSWPQALSLCRTL----N-D-DILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQ- 155 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~----~-~-~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~- 155 (244)
...+.....|+++.|+...+.. + + +... .++..+...++.+.|+..+.+.-+..-. ....+.
T Consensus 39 ~~aii~~r~Gd~~~Al~~L~qaL~~~P~~~~av~-dll~l~~~~G~~~~A~~~~eka~~p~n~----------~~~~lla 107 (822)
T PRK14574 39 DSLIIRARAGDTAPVLDYLQEESKAGPLQSGQVD-DWLQIAGWAGRDQEVIDVYERYQSSMNI----------SSRGLAS 107 (822)
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHhhCccchhhHH-HHHHHHHHcCCcHHHHHHHHHhccCCCC----------CHHHHHH
Confidence 3444457899999999988774 1 2 1233 6666666778888777666554321111 112333
Q ss_pred -HHHHHHcCCHHHHHHHHHHC-----CCHH---HHHHHHHHhcCHHHHHHHHHhcCCc-hhHHHHHHHHHHH-HcC-Ccc
Q psy11525 156 -AEMYLLGGNISEAESILLQH-----GLIF---RAIQVSILTHNWDRALELALRHKTH-IDTVLYQRKKYLD-NLE-KIE 223 (244)
Q Consensus 156 -A~ia~~~G~~~eAe~~~l~~-----g~~~---~Ai~m~~~l~~W~~Al~LA~~~~~~-~~~vl~~r~~yl~-~~~-~~e 223 (244)
|.++...|++++|.++|.+. +.++ .....|.+.++.++|++.+++.... -+...+.-.-|+. ..+ ..+
T Consensus 108 lA~ly~~~gdyd~Aiely~kaL~~dP~n~~~l~gLa~~y~~~~q~~eAl~~l~~l~~~dp~~~~~l~layL~~~~~~~~~ 187 (822)
T PRK14574 108 AARAYRNEKRWDQALALWQSSLKKDPTNPDLISGMIMTQADAGRGGVVLKQATELAERDPTVQNYMTLSYLNRATDRNYD 187 (822)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHhcccCcchHHHHHHHHHHHhcchHHH
Confidence 66888889999999988876 2211 1146678889999999998886532 1111122234442 222 235
Q ss_pred cHHHHHhcc
Q psy11525 224 TNEKFLRLQ 232 (244)
Q Consensus 224 ~~~~f~~~~ 232 (244)
+.+.|+++.
T Consensus 188 AL~~~ekll 196 (822)
T PRK14574 188 ALQASSEAV 196 (822)
T ss_pred HHHHHHHHH
Confidence 666666653
No 104
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=84.20 E-value=19 Score=36.36 Aligned_cols=47 Identities=13% Similarity=0.191 Sum_probs=34.9
Q ss_pred HHcCCHHHHHHHHhc-cCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC
Q psy11525 89 AASHSWPQALSLCRT-LNDDILWACLAGMATYSRDLATSEEAYAAIEQ 135 (244)
Q Consensus 89 l~~~~~~~A~~~~~~-~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d 135 (244)
...-.+-++..+++. +.+...-..||....+.+.++.|++.+.++-.
T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~La~i~~~~g~~~ea~~~l~~~~~ 114 (694)
T PRK15179 67 KPAAALPELLDYVRRYPHTELFQVLVARALEAAHRSDEGLAVWRGIHQ 114 (694)
T ss_pred chHhhHHHHHHHHHhccccHHHHHHHHHHHHHcCCcHHHHHHHHHHHh
Confidence 334456666777766 46688888899999999999988888776543
No 105
>PF03983 SHD1: SLA1 homology domain 1, SHD1 ; InterPro: IPR007131 The SLA1 homology domain is found in the cytoskeleton assembly control protein SLA1, which is responsible for the correct formation of the actin cytoskeleton.; GO: 0008092 cytoskeletal protein binding, 0030674 protein binding, bridging, 0042802 identical protein binding, 0043130 ubiquitin binding; PDB: 2HBP_A.
Probab=83.72 E-value=2.8 Score=29.70 Aligned_cols=42 Identities=14% Similarity=0.302 Sum_probs=29.0
Q ss_pred CCCCCCeEEeEeCCEEEEEecCCcEEEeecCccHHHHHHHHH
Q psy11525 49 EFGKSPSIVSFVKNHLTIRRYDGTVINYPISPYISVLHSYAA 90 (244)
Q Consensus 49 ~~~~~~~~v~f~g~~v~~~~~~G~~~~~~~~p~~~~l~~~l~ 90 (244)
.|.....++++.+|+|.+++.||..+.+.+.-++..=.+|++
T Consensus 21 ~f~VeA~fv~~~dgkV~L~k~nG~~i~VP~~kLS~~D~~yve 62 (70)
T PF03983_consen 21 KFKVEAEFVGVNDGKVHLHKTNGVKIAVPLEKLSDEDQEYVE 62 (70)
T ss_dssp --EEEEEEEEEETTEEEEE-TTS-EEEEETTSB-HHHHHHHH
T ss_pred CEEEEEEEEEeeCCEEEEEecCCeEEEeEhHHcCHHHHHHHH
Confidence 343455778999999999999999998887777766555543
No 106
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=83.35 E-value=3.6 Score=27.46 Aligned_cols=23 Identities=35% Similarity=0.407 Sum_probs=14.2
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 153 AQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
...|.+++..|++++|++++.+.
T Consensus 29 ~~la~~~~~~g~~~~A~~~l~~~ 51 (68)
T PF14559_consen 29 LLLAQCYLKQGQYDEAEELLERL 51 (68)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHCC
T ss_pred HHHHHHHHHcCCHHHHHHHHHHH
Confidence 34666777777777777666544
No 107
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=82.99 E-value=17 Score=34.15 Aligned_cols=105 Identities=16% Similarity=0.077 Sum_probs=72.1
Q ss_pred HHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHH-HHHHHhhCCCc-----HHHHHHHHHHHc
Q psy11525 89 AASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKV-MYINHIKGIPV-----KAAQQAEMYLLG 162 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v-~~l~~i~~~~~-----~~~~~A~ia~~~ 162 (244)
+..|+|+.|-+.+..- +.-+. ....++-+|.++=..+||.+.. +++.++.+.++ .+..++++.+..
T Consensus 95 l~eG~~~qAEkl~~rn------ae~~e--~p~l~~l~aA~AA~qrgd~~~an~yL~eaae~~~~~~l~v~ltrarlll~~ 166 (400)
T COG3071 95 LFEGDFQQAEKLLRRN------AEHGE--QPVLAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAVELTRARLLLNR 166 (400)
T ss_pred HhcCcHHHHHHHHHHh------hhcCc--chHHHHHHHHHHHHhcccHHHHHHHHHHHhccCCCchHHHHHHHHHHHHhC
Confidence 5678888887777652 11111 1122345677777788887777 67888888732 346789999999
Q ss_pred CCHHHHHHHHHH----CCCH----HHHHHHHHHhcCHHHHHHHHHhc
Q psy11525 163 GNISEAESILLQ----HGLI----FRAIQVSILTHNWDRALELALRH 201 (244)
Q Consensus 163 G~~~eAe~~~l~----~g~~----~~Ai~m~~~l~~W~~Al~LA~~~ 201 (244)
|++.-|..-..+ .-+. ..|.+.|.++|.|...+.+-.+.
T Consensus 167 ~d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~~~g~~~~ll~~l~~L 213 (400)
T COG3071 167 RDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKL 213 (400)
T ss_pred CCchhHHHHHHHHHHhCcCChHHHHHHHHHHHHhccHHHHHHHHHHH
Confidence 999888765443 2332 45788999999999999877653
No 108
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=82.74 E-value=5.3 Score=33.14 Aligned_cols=81 Identities=16% Similarity=0.086 Sum_probs=56.5
Q ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHH
Q psy11525 106 DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVS 185 (244)
Q Consensus 106 ~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~ 185 (244)
....|..+|....+.|+++-|.+||.++.++.. ...+.-+.-...-+++++.|++....... .+|-.+-
T Consensus 35 ir~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~-----~~~~~id~~l~~irv~i~~~d~~~v~~~i------~ka~~~~ 103 (177)
T PF10602_consen 35 IRMALEDLADHYCKIGDLEEALKAYSRARDYCT-----SPGHKIDMCLNVIRVAIFFGDWSHVEKYI------EKAESLI 103 (177)
T ss_pred HHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcC-----CHHHHHHHHHHHHHHHHHhCCHHHHHHHH------HHHHHHH
Confidence 357888999999999999999999999877531 11111223455678999999998877664 3444555
Q ss_pred HHhcCHHHHHHH
Q psy11525 186 ILTHNWDRALEL 197 (244)
Q Consensus 186 ~~l~~W~~Al~L 197 (244)
-..++|++--+|
T Consensus 104 ~~~~d~~~~nrl 115 (177)
T PF10602_consen 104 EKGGDWERRNRL 115 (177)
T ss_pred hccchHHHHHHH
Confidence 556668775444
No 109
>PRK14574 hmsH outer membrane protein; Provisional
Probab=82.63 E-value=18 Score=37.40 Aligned_cols=106 Identities=16% Similarity=0.133 Sum_probs=72.7
Q ss_pred HHHHHHHHcCCHHHHHHHHhccCC--H--------------HHHHH----HHHHHHHcCCHHHHHHHHHHhCChhHHHHH
Q psy11525 83 SVLHSYAASHSWPQALSLCRTLND--D--------------ILWAC----LAGMATYSRDLATSEEAYAAIEQVDKVMYI 142 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~~~--~--------------~lW~~----LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l 142 (244)
...+.+++.+++++|..++..+.. + ..|.. +|...+-.+++..|++.+.++
T Consensus 372 ~L~yA~ld~e~~~~A~~~l~~~~~~~p~~~~~~~~~~~~pn~d~~~~~~l~a~~~~~~gdl~~Ae~~le~l--------- 442 (822)
T PRK14574 372 DLYYSLNESEQLDKAYQFAVNYSEQTPYQVGVYGLPGKEPNDDWIEGQTLLVQSLVALNDLPTAQKKLEDL--------- 442 (822)
T ss_pred HHHHHHHhcccHHHHHHHHHHHHhcCCcEEeccCCCCCCCCccHHHHHHHHHHHHHHcCCHHHHHHHHHHH---------
Confidence 457788999999999999988633 2 23332 466667777777777655442
Q ss_pred HHhhCC-C---cHHHHHHHHHHHcCCHHHHHHHHHHC--CCHH--H----HHHHHHHhcCHHHHHHHHHh
Q psy11525 143 NHIKGI-P---VKAAQQAEMYLLGGNISEAESILLQH--GLIF--R----AIQVSILTHNWDRALELALR 200 (244)
Q Consensus 143 ~~i~~~-~---~~~~~~A~ia~~~G~~~eAe~~~l~~--g~~~--~----Ai~m~~~l~~W~~Al~LA~~ 200 (244)
... | +.-..+|.+..-.|.+.+|++.+..+ -.|+ . -..-+.++++|++|-.+..+
T Consensus 443 ---~~~aP~n~~l~~~~A~v~~~Rg~p~~A~~~~k~a~~l~P~~~~~~~~~~~~al~l~e~~~A~~~~~~ 509 (822)
T PRK14574 443 ---SSTAPANQNLRIALASIYLARDLPRKAEQELKAVESLAPRSLILERAQAETAMALQEWHQMELLTDD 509 (822)
T ss_pred ---HHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHhhhCCccHHHHHHHHHHHHhhhhHHHHHHHHHH
Confidence 222 1 12356899999999999999999653 1222 1 14566789999999777743
No 110
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=82.55 E-value=14 Score=32.76 Aligned_cols=78 Identities=6% Similarity=0.015 Sum_probs=49.3
Q ss_pred HHHcCCHHHHHHHHhcc-----C---CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhC---CCcHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----N---DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKG---IPVKAAQQA 156 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~---~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~---~~~~~~~~A 156 (244)
++..|+|++|....+.. + .+..|-.+|......++++-|...|..+= ..--. .++-....|
T Consensus 153 ~~~~~~y~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv--------~~yP~s~~~~dAl~klg 224 (263)
T PRK10803 153 VQDKSRQDDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVV--------KNYPKSPKAADAMFKVG 224 (263)
T ss_pred HHhcCCHHHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHH--------HHCCCCcchhHHHHHHH
Confidence 36679999999877664 1 24688889999999999988887766531 11111 111223356
Q ss_pred HHHHHcCCHHHHHHHHH
Q psy11525 157 EMYLLGGNISEAESILL 173 (244)
Q Consensus 157 ~ia~~~G~~~eAe~~~l 173 (244)
.++...|++++|.+.|.
T Consensus 225 ~~~~~~g~~~~A~~~~~ 241 (263)
T PRK10803 225 VIMQDKGDTAKAKAVYQ 241 (263)
T ss_pred HHHHHcCCHHHHHHHHH
Confidence 66656666666655553
No 111
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=82.36 E-value=8.2 Score=29.24 Aligned_cols=81 Identities=11% Similarity=-0.128 Sum_probs=50.6
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCC--CHH------HHH
Q psy11525 111 ACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHG--LIF------RAI 182 (244)
Q Consensus 111 ~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g--~~~------~Ai 182 (244)
..+|..+...++++-|...|..+-... -...+-...+|.++...|++++|...|.++- .++ ..=
T Consensus 21 ~~~a~~~~~~~~~~~A~~~~~~~~~~~--------p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la 92 (135)
T TIGR02552 21 YALAYNLYQQGRYDEALKLFQLLAAYD--------PYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAA 92 (135)
T ss_pred HHHHHHHHHcccHHHHHHHHHHHHHhC--------CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHH
Confidence 456777888888888888875432110 0011223467888888999999998887761 111 112
Q ss_pred HHHHHhcCHHHHHHHHH
Q psy11525 183 QVSILTHNWDRALELAL 199 (244)
Q Consensus 183 ~m~~~l~~W~~Al~LA~ 199 (244)
..|...+++++|+..-+
T Consensus 93 ~~~~~~g~~~~A~~~~~ 109 (135)
T TIGR02552 93 ECLLALGEPESALKALD 109 (135)
T ss_pred HHHHHcCCHHHHHHHHH
Confidence 35677788887766543
No 112
>PF00400 WD40: WD domain, G-beta repeat; InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=82.26 E-value=2.7 Score=24.86 Aligned_cols=24 Identities=42% Similarity=0.833 Sum_probs=18.5
Q ss_pred cceeeecCCCceEEEEe-CCeEEEE
Q psy11525 2 VQSICWNTDVNILAAMQ-DSALCVW 25 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~-~~~l~vw 25 (244)
|++++|+++.+.|++.+ |+++.+|
T Consensus 14 i~~i~~~~~~~~~~s~~~D~~i~vw 38 (39)
T PF00400_consen 14 INSIAWSPDGNFLASGSSDGTIRVW 38 (39)
T ss_dssp EEEEEEETTSSEEEEEETTSEEEEE
T ss_pred EEEEEEecccccceeeCCCCEEEEE
Confidence 67899999976666654 5888877
No 113
>KOG1129|consensus
Probab=81.64 E-value=1.6 Score=40.25 Aligned_cols=78 Identities=18% Similarity=0.151 Sum_probs=57.2
Q ss_pred HHcCCHHHHHHHHhc-cCCHHHHHHHHHHHHHcCCHHHHHHHHHHh--CChhHHHHHHHhhCCCcHHHHHHHHHHHcCCH
Q psy11525 89 AASHSWPQALSLCRT-LNDDILWACLAGMATYSRDLATSEEAYAAI--EQVDKVMYINHIKGIPVKAAQQAEMYLLGGNI 165 (244)
Q Consensus 89 l~~~~~~~A~~~~~~-~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l--~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~ 165 (244)
+..-.|++|...+.. .+....|-.|+..|+.-|++..|.+||+-. .|..- .+.....|.+++..|+.
T Consensus 376 ~~L~sf~RAlstat~~~~aaDvWYNlg~vaV~iGD~nlA~rcfrlaL~~d~~h----------~ealnNLavL~~r~G~i 445 (478)
T KOG1129|consen 376 LVLPSFQRALSTATQPGQAADVWYNLGFVAVTIGDFNLAKRCFRLALTSDAQH----------GEALNNLAVLAARSGDI 445 (478)
T ss_pred hhHHHHHHHHhhccCcchhhhhhhccceeEEeccchHHHHHHHHHHhccCcch----------HHHHHhHHHHHhhcCch
Confidence 445567788888774 677889999999999999999999999742 11111 12234568888899999
Q ss_pred HHHHHHHHHCC
Q psy11525 166 SEAESILLQHG 176 (244)
Q Consensus 166 ~eAe~~~l~~g 176 (244)
++|..+|-.+.
T Consensus 446 ~~Arsll~~A~ 456 (478)
T KOG1129|consen 446 LGARSLLNAAK 456 (478)
T ss_pred HHHHHHHHHhh
Confidence 99988876553
No 114
>KOG0985|consensus
Probab=81.31 E-value=6.4 Score=41.33 Aligned_cols=76 Identities=25% Similarity=0.194 Sum_probs=59.3
Q ss_pred HHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHH-HhcCHHHHHHHHHhcCCc-hhHHHHHHHHHHHHcCCcccHHHHHhcc
Q psy11525 155 QAEMYLLGGNISEAESILLQHGLIFRAIQVSI-LTHNWDRALELALRHKTH-IDTVLYQRKKYLDNLEKIETNEKFLRLQ 232 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~-~l~~W~~Al~LA~~~~~~-~~~vl~~r~~yl~~~~~~e~~~~f~~~~ 232 (244)
.|.++...+-|+||..+|.+.+....|++... +.+..|||.+.|++-+.+ +=. ...+..|++.--.++++.|.|+-
T Consensus 1054 ia~iai~~~LyEEAF~ifkkf~~n~~A~~VLie~i~~ldRA~efAe~~n~p~vWs--qlakAQL~~~~v~dAieSyikad 1131 (1666)
T KOG0985|consen 1054 IAEIAIENQLYEEAFAIFKKFDMNVSAIQVLIENIGSLDRAYEFAERCNEPAVWS--QLAKAQLQGGLVKDAIESYIKAD 1131 (1666)
T ss_pred HHHHHhhhhHHHHHHHHHHHhcccHHHHHHHHHHhhhHHHHHHHHHhhCChHHHH--HHHHHHHhcCchHHHHHHHHhcC
Confidence 78999999999999999999999999999776 579999999999997743 211 13344455555567777777763
No 115
>PF04840 Vps16_C: Vps16, C-terminal region; InterPro: IPR006925 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=81.23 E-value=20 Score=32.67 Aligned_cols=88 Identities=18% Similarity=0.238 Sum_probs=63.4
Q ss_pred HHHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcC
Q psy11525 84 VLHSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGG 163 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G 163 (244)
.+..+.+.++|++-.+++..-.++.-|.-+...|+..++...|.+ ||.+ +++ ....++++..|
T Consensus 214 ki~aLa~~~~w~eL~~fa~skKsPIGyepFv~~~~~~~~~~eA~~------------yI~k---~~~--~~rv~~y~~~~ 276 (319)
T PF04840_consen 214 KIKALAENKDWDELEKFAKSKKSPIGYEPFVEACLKYGNKKEASK------------YIPK---IPD--EERVEMYLKCG 276 (319)
T ss_pred HHHHHHhcCCHHHHHHHHhCCCCCCChHHHHHHHHHCCCHHHHHH------------HHHh---CCh--HHHHHHHHHCC
Confidence 344568999999999999988889999999999999998876653 2332 222 23345667777
Q ss_pred CHHHHHHHHHHCCCHHHHHHHHHHh
Q psy11525 164 NISEAESILLQHGLIFRAIQVSILT 188 (244)
Q Consensus 164 ~~~eAe~~~l~~g~~~~Ai~m~~~l 188 (244)
++.+|.+.-.+.++++.-.+++...
T Consensus 277 ~~~~A~~~A~~~kd~~~L~~i~~~~ 301 (319)
T PF04840_consen 277 DYKEAAQEAFKEKDIDLLKQILKRC 301 (319)
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHC
Confidence 7777777777777777666665544
No 116
>KOG2076|consensus
Probab=81.13 E-value=24 Score=36.32 Aligned_cols=106 Identities=15% Similarity=0.070 Sum_probs=72.6
Q ss_pred HHHHHcCCHHHHHHHHhcc------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHH
Q psy11525 86 HSYAASHSWPQALSLCRTL------NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMY 159 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia 159 (244)
..|+..|++.+|.++.-.+ ++...|-.+|..-+.-++.+.|..+|-++= ...-...+--...+.+.
T Consensus 422 ~al~~~~~~~~Al~~l~~i~~~~~~~~~~vw~~~a~c~~~l~e~e~A~e~y~kvl--------~~~p~~~D~Ri~Lasl~ 493 (895)
T KOG2076|consen 422 DALTNIGKYKEALRLLSPITNREGYQNAFVWYKLARCYMELGEYEEAIEFYEKVL--------ILAPDNLDARITLASLY 493 (895)
T ss_pred HHHHhcccHHHHHHHHHHHhcCccccchhhhHHHHHHHHHHhhHHHHHHHHHHHH--------hcCCCchhhhhhHHHHH
Confidence 3468889999999876554 456799999999999999999998887631 11111122335678899
Q ss_pred HHcCCHHHHHHHHHHCCCHH-----------------HHHHHHHHhcCHHHHHHHHH
Q psy11525 160 LLGGNISEAESILLQHGLIF-----------------RAIQVSILTHNWDRALELAL 199 (244)
Q Consensus 160 ~~~G~~~eAe~~~l~~g~~~-----------------~Ai~m~~~l~~W~~Al~LA~ 199 (244)
..+|++++|-+.+-+--.|| .-..++...|+.++=++.+.
T Consensus 494 ~~~g~~EkalEtL~~~~~~D~~~~e~~a~~~e~ri~~~r~d~l~~~gk~E~fi~t~~ 550 (895)
T KOG2076|consen 494 QQLGNHEKALETLEQIINPDGRNAEACAWEPERRILAHRCDILFQVGKREEFINTAS 550 (895)
T ss_pred HhcCCHHHHHHHHhcccCCCccchhhccccHHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence 99999999988888744444 11445566666666555444
No 117
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=80.76 E-value=30 Score=30.73 Aligned_cols=48 Identities=10% Similarity=0.081 Sum_probs=30.6
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCCC-----HH-------HHHHHHHHhcCHHHHHHHHHh
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHGL-----IF-------RAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g~-----~~-------~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
...|+++...|++++|..+|.+.=. +. ..-.++...|++++|+++.++
T Consensus 152 ~~la~i~~~~g~~~eA~~~l~~~l~~~~~~~~~~~~~~~~la~~~~~~G~~~~A~~~~~~ 211 (355)
T cd05804 152 HAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLERGDYEAALAIYDT 211 (355)
T ss_pred HHHHHHHHHcCCHHHHHHHHHhhhhccCCCcchhHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 3467777777888888777665322 11 133466777777777776665
No 118
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=80.61 E-value=4.6 Score=27.03 Aligned_cols=46 Identities=13% Similarity=0.120 Sum_probs=38.9
Q ss_pred HHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcC-CHHHHHHHHHH
Q psy11525 87 SYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSR-DLATSEEAYAA 132 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~-~l~~A~~~y~~ 132 (244)
.+...|+|++|....... .+...|..+|......+ +.+-|..+|.+
T Consensus 12 ~~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~ 63 (69)
T PF13414_consen 12 IYFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGLAYMKLGKDYEEAIEDFEK 63 (69)
T ss_dssp HHHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHTTTHHHHHHHHHHH
T ss_pred HHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhCccHHHHHHHHHH
Confidence 358899999999988774 57889999999999998 68888888765
No 119
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=80.13 E-value=16 Score=29.17 Aligned_cols=65 Identities=9% Similarity=-0.015 Sum_probs=46.6
Q ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 106 DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 106 ~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
....|..+|..+...++++.|..+|.+.-.... .-.........+|.++...|++++|...|.++
T Consensus 34 ~a~~~~~~g~~~~~~g~~~~A~~~~~~al~l~~-----~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~A 98 (168)
T CHL00033 34 EAFTYYRDGMSAQSEGEYAEALQNYYEAMRLEI-----DPYDRSYILYNIGLIHTSNGEHTKALEYYFQA 98 (168)
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHhccc-----cchhhHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 356899999999999999999999887532210 00000112356789999999999999998775
No 120
>KOG3081|consensus
Probab=80.03 E-value=8.9 Score=34.35 Aligned_cols=50 Identities=12% Similarity=0.022 Sum_probs=22.7
Q ss_pred HHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC
Q psy11525 86 HSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQ 135 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d 135 (244)
+=|++.+++++|.+....+.+=..-+.=-..-++..+++.|+....++.+
T Consensus 116 ~i~~~~~~~deAl~~~~~~~~lE~~Al~VqI~lk~~r~d~A~~~lk~mq~ 165 (299)
T KOG3081|consen 116 IIYMHDGDFDEALKALHLGENLEAAALNVQILLKMHRFDLAEKELKKMQQ 165 (299)
T ss_pred HHhhcCCChHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHc
Confidence 33566666666666665543222222112222333344555544444444
No 121
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=79.51 E-value=33 Score=32.88 Aligned_cols=47 Identities=15% Similarity=0.085 Sum_probs=39.3
Q ss_pred HHHcCCHHHHHHHHhcc---C--CH---HHHHHHHHHHHHcCCHHHHHHHHHHhC
Q psy11525 88 YAASHSWPQALSLCRTL---N--DD---ILWACLAGMATYSRDLATSEEAYAAIE 134 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~---~--~~---~lW~~LA~~Al~~~~l~~A~~~y~~l~ 134 (244)
+...|++++|...++.. + .. ..|.++|..-...++++.|..||.+.=
T Consensus 85 L~~lGryeEAIa~f~rALeL~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrAL 139 (453)
T PLN03098 85 LFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTAL 139 (453)
T ss_pred HHHcCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 48899999999998773 2 22 359999999999999999999999643
No 122
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=77.90 E-value=41 Score=32.46 Aligned_cols=39 Identities=23% Similarity=0.166 Sum_probs=23.5
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
-..|+.+..+|+-.+|...+ =++|.-.|+|++|+..+..
T Consensus 412 ~~LAqay~~~g~~~~a~~A~---------AE~~~~~G~~~~A~~~l~~ 450 (484)
T COG4783 412 DLLAQAYAELGNRAEALLAR---------AEGYALAGRLEQAIIFLMR 450 (484)
T ss_pred HHHHHHHHHhCchHHHHHHH---------HHHHHhCCCHHHHHHHHHH
Confidence 34566666666665554333 2456667777777777654
No 123
>KOG4626|consensus
Probab=76.86 E-value=5 Score=40.04 Aligned_cols=45 Identities=18% Similarity=0.122 Sum_probs=39.0
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
+-+.|++++|+++-+.+ +--..|-.||..-+..++++.|-.||..
T Consensus 126 ~kerg~~~~al~~y~~aiel~p~fida~inla~al~~~~~~~~a~~~~~~ 175 (966)
T KOG4626|consen 126 LKERGQLQDALALYRAAIELKPKFIDAYINLAAALVTQGDLELAVQCFFE 175 (966)
T ss_pred HHHhchHHHHHHHHHHHHhcCchhhHHHhhHHHHHHhcCCCcccHHHHHH
Confidence 37899999999987775 3467899999999999999999999974
No 124
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=76.55 E-value=6.6 Score=22.26 Aligned_cols=26 Identities=12% Similarity=0.096 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
..|..+|......++++.|.++|.+.
T Consensus 2 ~~~~~lg~~~~~~~~~~~A~~~~~~a 27 (34)
T PF07719_consen 2 EAWYYLGQAYYQLGNYEEAIEYFEKA 27 (34)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 57899999999999999999999763
No 125
>PRK04841 transcriptional regulator MalT; Provisional
Probab=76.19 E-value=41 Score=34.33 Aligned_cols=46 Identities=17% Similarity=0.011 Sum_probs=33.5
Q ss_pred HHcCCHHHHHHHHhccC----------CHHHHHHHHHHHHHcCCHHHHHHHHHHhC
Q psy11525 89 AASHSWPQALSLCRTLN----------DDILWACLAGMATYSRDLATSEEAYAAIE 134 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~~----------~~~lW~~LA~~Al~~~~l~~A~~~y~~l~ 134 (244)
...|++++|...+...- ....+..+|..++..++.+.|.+.+.+..
T Consensus 584 ~~~G~~~~A~~~~~~al~~~~~~~~~~~~~~~~~la~~~~~~G~~~~A~~~l~~a~ 639 (903)
T PRK04841 584 WEWARLDEAEQCARKGLEVLSNYQPQQQLQCLAMLAKISLARGDLDNARRYLNRLE 639 (903)
T ss_pred HHhcCHHHHHHHHHHhHHhhhccCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 56799999987776531 12345557888889999999888877764
No 126
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=75.84 E-value=8.3 Score=27.13 Aligned_cols=45 Identities=16% Similarity=0.141 Sum_probs=34.9
Q ss_pred HHHcCCHHHHHHHHhcc----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL----NDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~----~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
+...|+|++|+.+++.. .+....-.+|...++-++++.|..+|.+
T Consensus 35 ~~~~~~y~~A~~~~~~~~~~~~~~~~~~l~a~~~~~l~~y~eAi~~l~~ 83 (84)
T PF12895_consen 35 YFQQGKYEEAIELLQKLKLDPSNPDIHYLLARCLLKLGKYEEAIKALEK 83 (84)
T ss_dssp HHHTTHHHHHHHHHHCHTHHHCHHHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHCCCHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHhCCHHHHHHHHhc
Confidence 48999999999999664 2334555669999999999999988864
No 127
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=75.77 E-value=14 Score=28.79 Aligned_cols=47 Identities=15% Similarity=0.149 Sum_probs=40.2
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhC
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIE 134 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~ 134 (244)
+++.|++++|.++|+.+ -++..|..|-..-...|+...|.+.|.++.
T Consensus 72 ~~~~~~~~~a~~~~~~~l~~dP~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~~ 123 (146)
T PF03704_consen 72 LLEAGDYEEALRLLQRALALDPYDEEAYRLLMRALAAQGRRAEALRVYERYR 123 (146)
T ss_dssp HHHTT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHhccCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHCcCHHHHHHHHHHHH
Confidence 58899999999999986 468999999999999999999999999865
No 128
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=75.38 E-value=26 Score=36.55 Aligned_cols=47 Identities=13% Similarity=0.123 Sum_probs=34.3
Q ss_pred HHHHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHH
Q psy11525 83 SVLHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEA 129 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~ 129 (244)
..+..|...+++++|.++|+.. ..-..|-.+|....+.++++-|...
T Consensus 36 ~Li~~~~~~~~~deai~i~~~~l~~~P~~i~~yy~~G~l~~q~~~~~~~~lv 87 (906)
T PRK14720 36 DLIDAYKSENLTDEAKDICEEHLKEHKKSISALYISGILSLSRRPLNDSNLL 87 (906)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHhCCcceehHHHHHHHHHhhcchhhhhhh
Confidence 3444457999999999999875 4567888888877777775444433
No 129
>KOG1586|consensus
Probab=75.36 E-value=17 Score=32.23 Aligned_cols=40 Identities=10% Similarity=0.092 Sum_probs=22.2
Q ss_pred HcCCHHHHHHHHhcc----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC
Q psy11525 90 ASHSWPQALSLCRTL----NDDILWACLAGMATYSRDLATSEEAYAAIEQ 135 (244)
Q Consensus 90 ~~~~~~~A~~~~~~~----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d 135 (244)
-.+++++|..++... .-..-|...|..-++ |..++.+.|+
T Consensus 26 g~~k~eeAadl~~~Aan~yklaK~w~~AG~aflk------aA~~h~k~~s 69 (288)
T KOG1586|consen 26 GSNKYEEAAELYERAANMYKLAKNWSAAGDAFLK------AADLHLKAGS 69 (288)
T ss_pred CCcchHHHHHHHHHHHHHHHHHHhHHHHHHHHHH------HHHHHHhcCC
Confidence 356788888776654 234555555555444 4444444443
No 130
>KOG3081|consensus
Probab=75.33 E-value=60 Score=29.21 Aligned_cols=63 Identities=19% Similarity=0.196 Sum_probs=46.7
Q ss_pred HHHHHHHHcCCHHHHHHHHHHCCCHHHH---HHHHHHhcCHHHHHHHHHhcC-Cc-hhHHHHHHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILLQHGLIFRA---IQVSILTHNWDRALELALRHK-TH-IDTVLYQRKKYL 216 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~~g~~~~A---i~m~~~l~~W~~Al~LA~~~~-~~-~~~vl~~r~~yl 216 (244)
..|.+++..|+|++|.++.-+.+..+.+ +.....+++.|-|....++.. +. -.++..+.+.|.
T Consensus 113 ~aa~i~~~~~~~deAl~~~~~~~~lE~~Al~VqI~lk~~r~d~A~~~lk~mq~ided~tLtQLA~awv 180 (299)
T KOG3081|consen 113 LAAIIYMHDGDFDEALKALHLGENLEAAALNVQILLKMHRFDLAEKELKKMQQIDEDATLTQLAQAWV 180 (299)
T ss_pred HhhHHhhcCCChHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHccchHHHHHHHHHHHH
Confidence 4688999999999999999885555544 566777888888877666654 33 345555888888
No 131
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=75.28 E-value=4.3 Score=32.83 Aligned_cols=39 Identities=18% Similarity=0.207 Sum_probs=28.9
Q ss_pred HHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHH
Q psy11525 179 FRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLD 217 (244)
Q Consensus 179 ~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~ 217 (244)
+..=++|.+.++||+|+++|+......+.|+..++.+|-
T Consensus 107 r~vWk~yl~~~~fd~Al~~~~~~~~~~d~V~~~qa~~lf 145 (147)
T PF05131_consen 107 RDVWKIYLDKGDFDEALQYCKTNPAQRDQVLIKQADHLF 145 (147)
T ss_pred HHHHHHHHhcCcHHHHHHHccCCHHHHHHHHHHHHHHHh
Confidence 445566777788888888888764457888888887773
No 132
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=75.21 E-value=58 Score=28.67 Aligned_cols=146 Identities=10% Similarity=0.036 Sum_probs=91.4
Q ss_pred HHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh-------CC----hh-----------
Q psy11525 85 LHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAI-------EQ----VD----------- 137 (244)
Q Consensus 85 l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l-------~d----~~----------- 137 (244)
-..|++.|++..|-.-.+.. +....|..+|.....-|+.+.|.+.|.+- || +.
T Consensus 42 al~YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdVLNNYG~FLC~qg~~~e 121 (250)
T COG3063 42 ALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPEE 121 (250)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCChhhHHHHHHHHHhcCCCccchhhhhhHHHHhCCChHH
Confidence 34589999999999888774 56789999999999999999999999853 11 11
Q ss_pred -HHHHHHHhhC-----CCcHHHHHHHHHHHcCCHHHHHHHHHHC-----CCHHHHHH---HHHHhcCHHHHHHHHHhcC-
Q psy11525 138 -KVMYINHIKG-----IPVKAAQQAEMYLLGGNISEAESILLQH-----GLIFRAIQ---VSILTHNWDRALELALRHK- 202 (244)
Q Consensus 138 -~v~~l~~i~~-----~~~~~~~~A~ia~~~G~~~eAe~~~l~~-----g~~~~Ai~---m~~~l~~W~~Al~LA~~~~- 202 (244)
+.+|-+-+.. ..+.....|-+++..|+++.|+..|.++ +.+....+ +..+-|++-+|-..-+...
T Consensus 122 A~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~l~~a~~~~~~~~y~~Ar~~~~~~~~ 201 (250)
T COG3063 122 AMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYKAGDYAPARLYLERYQQ 201 (250)
T ss_pred HHHHHHHHHhCCCCCCcchhhhhhHHHHhhcCCchhHHHHHHHHHHhCcCCChHHHHHHHHHHhcccchHHHHHHHHHHh
Confidence 2233222221 1234566788889999999999988776 22222222 3344455555544444432
Q ss_pred --CchhHHHHHHHHHHHHcCCcccHHHHHh
Q psy11525 203 --THIDTVLYQRKKYLDNLEKIETNEKFLR 230 (244)
Q Consensus 203 --~~~~~vl~~r~~yl~~~~~~e~~~~f~~ 230 (244)
......|-+--+..+..|...+..+|.+
T Consensus 202 ~~~~~A~sL~L~iriak~~gd~~~a~~Y~~ 231 (250)
T COG3063 202 RGGAQAESLLLGIRIAKRLGDRAAAQRYQA 231 (250)
T ss_pred cccccHHHHHHHHHHHHHhccHHHHHHHHH
Confidence 1133444444444455555555555443
No 133
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=74.08 E-value=5.6 Score=24.69 Aligned_cols=26 Identities=15% Similarity=0.118 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
..|..+|......|+++-|+++|.++
T Consensus 2 ~~~~~la~~~~~~G~~~~A~~~~~~~ 27 (44)
T PF13428_consen 2 AAWLALARAYRRLGQPDEAERLLRRA 27 (44)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 57899999999999999999999864
No 134
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised.
Probab=73.82 E-value=31 Score=26.20 Aligned_cols=61 Identities=16% Similarity=0.163 Sum_probs=37.0
Q ss_pred HHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 127 EEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 127 ~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
-+||... +.+++.-+-++.+. =++++.|+.+.+.|++.+-+.+|..-++-++||++-++..
T Consensus 6 lk~Yl~~-~~~~l~~llr~~N~--------------C~~~~~e~~L~~~~~~~eL~~lY~~kg~h~~AL~ll~~l~ 66 (108)
T PF10366_consen 6 LKCYLET-NPSLLGPLLRLPNY--------------CDLEEVEEVLKEHGKYQELVDLYQGKGLHRKALELLKKLA 66 (108)
T ss_pred HHHHHHh-CHHHHHHHHccCCc--------------CCHHHHHHHHHHcCCHHHHHHHHHccCccHHHHHHHHHHh
Confidence 4778887 55555444443321 1456666666666777777777777777777766665543
No 135
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=73.73 E-value=52 Score=27.48 Aligned_cols=81 Identities=14% Similarity=0.011 Sum_probs=52.0
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCCC---------HHHHHHHHHHhcCH--HHHHHHHHh---cCCchhHHHHHHH-HHHH
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHGL---------IFRAIQVSILTHNW--DRALELALR---HKTHIDTVLYQRK-KYLD 217 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g~---------~~~Ai~m~~~l~~W--~~Al~LA~~---~~~~~~~vl~~r~-~yl~ 217 (244)
...|.+++..|++++|...|.++-+ ...|.-++...+++ ++|.++..+ .++.--+.+.... .|++
T Consensus 77 ~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~~~al~~LA~~~~~ 156 (198)
T PRK10370 77 ALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALMLLASDAFM 156 (198)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCChhHHHHHHHHHHH
Confidence 5689999999999999999988743 22334445666664 777666544 4444233344444 4444
Q ss_pred HcCCcccHHHHHhccc
Q psy11525 218 NLEKIETNEKFLRLQS 233 (244)
Q Consensus 218 ~~~~~e~~~~f~~~~~ 233 (244)
.....+++..|+++.+
T Consensus 157 ~g~~~~Ai~~~~~aL~ 172 (198)
T PRK10370 157 QADYAQAIELWQKVLD 172 (198)
T ss_pred cCCHHHHHHHHHHHHh
Confidence 4444788888888754
No 136
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=73.42 E-value=49 Score=33.15 Aligned_cols=97 Identities=14% Similarity=0.084 Sum_probs=69.3
Q ss_pred HHHHHHHHHHHH---cCCHHHHHHHHHHhCChhHHHHHHHhhC-CC--------------c-----HHHHHHHHHHHcCC
Q psy11525 108 ILWACLAGMATY---SRDLATSEEAYAAIEQVDKVMYINHIKG-IP--------------V-----KAAQQAEMYLLGGN 164 (244)
Q Consensus 108 ~lW~~LA~~Al~---~~~l~~A~~~y~~l~d~~~v~~l~~i~~-~~--------------~-----~~~~~A~ia~~~G~ 164 (244)
..-..+|+..-. .++.--+.--|.++|....+.-|-++.. .+ + ...+.|..++-..+
T Consensus 388 ~~~E~iAr~~~~~~~~~dp~~~al~YlAl~Kk~vL~~L~k~a~~~~~~~k~~~Fl~ndF~~~rwr~AAlKNAyaLlsk~R 467 (631)
T PF12234_consen 388 SQFEKIARNEFQKNDTKDPSDCALFYLALGKKKVLAGLWKMASWHKEQQKMAKFLSNDFTEPRWRTAALKNAYALLSKHR 467 (631)
T ss_pred HHHHHHHHHHHhhccCCCHHHHHHHHHHhccHHHHHHHHHhhhcccccHHHHHHHhhcCCChHHHHHHHHhHHHHHhccc
Confidence 334455554444 3566556667778887776666666321 11 1 12457888888999
Q ss_pred HHHHHHHHHHCCCHHHHHHHHH-HhcCHHHHHHHHHhcCCc
Q psy11525 165 ISEAESILLQHGLIFRAIQVSI-LTHNWDRALELALRHKTH 204 (244)
Q Consensus 165 ~~eAe~~~l~~g~~~~Ai~m~~-~l~~W~~Al~LA~~~~~~ 204 (244)
|+.|...|+=+|+...|++.+. .+++..-|+-+|+-+..+
T Consensus 468 y~~AAaFFLLag~l~dAv~V~~~~l~D~qLAi~i~Rl~e~d 508 (631)
T PF12234_consen 468 YEYAAAFFLLAGSLKDAVNVCLRQLNDPQLAIAIARLYEGD 508 (631)
T ss_pred HHHHHHHHHhcccHHHHHHHHHHHccChhHHHHHHHHHcCC
Confidence 9999999999999999999774 579999999999887643
No 137
>KOG1126|consensus
Probab=72.74 E-value=18 Score=35.93 Aligned_cols=126 Identities=19% Similarity=0.218 Sum_probs=79.4
Q ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhH-HHHHHHhh---------------------CCCcHH--H--HHHHHH
Q psy11525 106 DDILWACLAGMATYSRDLATSEEAYAAIEQVDK-VMYINHIK---------------------GIPVKA--A--QQAEMY 159 (244)
Q Consensus 106 ~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~-v~~l~~i~---------------------~~~~~~--~--~~A~ia 159 (244)
.+.-|..+|.-.-..++.+.|.+||.+--.++- ..|-.-+. .+...+ + -+|.++
T Consensus 420 sPesWca~GNcfSLQkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~~rhYnAwYGlG~vy 499 (638)
T KOG1126|consen 420 SPESWCALGNCFSLQKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFDKAMKSFRKALGVDPRHYNAWYGLGTVY 499 (638)
T ss_pred CcHHHHHhcchhhhhhHHHHHHHHHHHhhccCCccchhhhhcCChhhhhHHHHhHHHHHHhhhcCCchhhHHHHhhhhhe
Confidence 568899999988889999999999998766553 21111111 111111 1 278899
Q ss_pred HHcCCHHHHHHHHHHCCCH---H-----HHHHHHHHhcCHHHHHHHHHhcC-CchhHH--HHHHHHHHHHcCC-cccHHH
Q psy11525 160 LLGGNISEAESILLQHGLI---F-----RAIQVSILTHNWDRALELALRHK-THIDTV--LYQRKKYLDNLEK-IETNEK 227 (244)
Q Consensus 160 ~~~G~~~eAe~~~l~~g~~---~-----~Ai~m~~~l~~W~~Al~LA~~~~-~~~~~v--l~~r~~yl~~~~~-~e~~~~ 227 (244)
+-+|+++.||-.|.++=.+ . -+-.++..+++-|+||++-++-- .+-+.. .+.|..-|...++ .|+...
T Consensus 500 ~Kqek~e~Ae~~fqkA~~INP~nsvi~~~~g~~~~~~k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il~~~~~~~eal~~ 579 (638)
T KOG1126|consen 500 LKQEKLEFAEFHFQKAVEINPSNSVILCHIGRIQHQLKRKDKALQLYEKAIHLDPKNPLCKYHRASILFSLGRYVEALQE 579 (638)
T ss_pred eccchhhHHHHHHHhhhcCCccchhHHhhhhHHHHHhhhhhHHHHHHHHHHhcCCCCchhHHHHHHHHHhhcchHHHHHH
Confidence 9999999999999887332 1 23467788899999988876632 111122 2344455544444 455554
Q ss_pred HHhc
Q psy11525 228 FLRL 231 (244)
Q Consensus 228 f~~~ 231 (244)
+.+.
T Consensus 580 LEeL 583 (638)
T KOG1126|consen 580 LEEL 583 (638)
T ss_pred HHHH
Confidence 4444
No 138
>KOG1155|consensus
Probab=72.70 E-value=35 Score=33.01 Aligned_cols=108 Identities=12% Similarity=0.125 Sum_probs=74.0
Q ss_pred cCCHHHHHHHHhc---c--CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhH----------------------HHHHH
Q psy11525 91 SHSWPQALSLCRT---L--NDDILWACLAGMATYSRDLATSEEAYAAIEQVDK----------------------VMYIN 143 (244)
Q Consensus 91 ~~~~~~A~~~~~~---~--~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~----------------------v~~l~ 143 (244)
.++-+.|+.+-+. + ..-..|..+|..-++-+|-..|..||++-=|+.. +-+.+
T Consensus 343 r~eHEKAv~YFkRALkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYeim~Mh~YaLyYfq 422 (559)
T KOG1155|consen 343 RSEHEKAVMYFKRALKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEIMKMHFYALYYFQ 422 (559)
T ss_pred HHhHHHHHHHHHHHHhcCcchhHHHHHhhHHHHHhcccHHHHHHHHHHHhcCchhHHHHhhhhHHHHHhcchHHHHHHHH
Confidence 3444555544433 2 4578999999999999999999999998665442 24455
Q ss_pred HhhCCCc--HH--HHHHHHHHHcCCHHHHHHHHHHC---CCH-----HHHHHHHHHhcCHHHHHHHH
Q psy11525 144 HIKGIPV--KA--AQQAEMYLLGGNISEAESILLQH---GLI-----FRAIQVSILTHNWDRALELA 198 (244)
Q Consensus 144 ~i~~~~~--~~--~~~A~ia~~~G~~~eAe~~~l~~---g~~-----~~Ai~m~~~l~~W~~Al~LA 198 (244)
+...... .- .-+|+++..+++.+||.+.|.++ |+. -.-=++|..+.+-++|...=
T Consensus 423 kA~~~kPnDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~~~l~~LakLye~l~d~~eAa~~y 489 (559)
T KOG1155|consen 423 KALELKPNDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEGSALVRLAKLYEELKDLNEAAQYY 489 (559)
T ss_pred HHHhcCCCchHHHHHHHHHHHHhccHHHHHHHHHHHHhccccchHHHHHHHHHHHHHHhHHHHHHHH
Confidence 5554422 22 34789999999999999999875 443 12236777777777775443
No 139
>KOG1173|consensus
Probab=72.18 E-value=14 Score=36.18 Aligned_cols=78 Identities=14% Similarity=0.072 Sum_probs=52.4
Q ss_pred HcCCHHHHHHHHhcc-------C-----CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHH
Q psy11525 90 ASHSWPQALSLCRTL-------N-----DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAE 157 (244)
Q Consensus 90 ~~~~~~~A~~~~~~~-------~-----~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ 157 (244)
..+.|.+|.+.-+.. . -+.+|..||..+.+.+.++.|..+|.+ .|...-.-.+...-.|.
T Consensus 426 ~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~--------aL~l~~k~~~~~asig~ 497 (611)
T KOG1173|consen 426 TYEEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQK--------ALLLSPKDASTHASIGY 497 (611)
T ss_pred hHhhhHHHHHHHHHHHHHhhhccccccchhHHHHhHHHHHHHHhhHHHHHHHHHH--------HHHcCCCchhHHHHHHH
Confidence 355556665544332 1 355688899988888888888877764 12222222344566788
Q ss_pred HHHHcCCHHHHHHHHHHC
Q psy11525 158 MYLLGGNISEAESILLQH 175 (244)
Q Consensus 158 ia~~~G~~~eAe~~~l~~ 175 (244)
+...+|+++.|...|-++
T Consensus 498 iy~llgnld~Aid~fhKa 515 (611)
T KOG1173|consen 498 IYHLLGNLDKAIDHFHKA 515 (611)
T ss_pred HHHHhcChHHHHHHHHHH
Confidence 999999999998888776
No 140
>KOG1585|consensus
Probab=72.13 E-value=70 Score=28.62 Aligned_cols=47 Identities=19% Similarity=0.193 Sum_probs=24.6
Q ss_pred HHHHHhhCCCcH---HHHHHHHHHHcCCHHHHHHHH------HHCCCHHHHHHHHH
Q psy11525 140 MYINHIKGIPVK---AAQQAEMYLLGGNISEAESIL------LQHGLIFRAIQVSI 186 (244)
Q Consensus 140 ~~l~~i~~~~~~---~~~~A~ia~~~G~~~eAe~~~------l~~g~~~~Ai~m~~ 186 (244)
..++++..+++- ..+.-++++..|..+-|...+ +++-+|++|+++|+
T Consensus 79 mLake~~klsEvvdl~eKAs~lY~E~GspdtAAmaleKAak~lenv~Pd~AlqlYq 134 (308)
T KOG1585|consen 79 MLAKELSKLSEVVDLYEKASELYVECGSPDTAAMALEKAAKALENVKPDDALQLYQ 134 (308)
T ss_pred HHHHHHHHhHHHHHHHHHHHHHHHHhCCcchHHHHHHHHHHHhhcCCHHHHHHHHH
Confidence 344444444432 233445666666665554443 33456677777766
No 141
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=71.99 E-value=17 Score=30.10 Aligned_cols=83 Identities=14% Similarity=0.111 Sum_probs=54.4
Q ss_pred HHHHHHcCCHHHHHHHHhccC--------CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc----H-
Q psy11525 85 LHSYAASHSWPQALSLCRTLN--------DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV----K- 151 (244)
Q Consensus 85 l~~~l~~~~~~~A~~~~~~~~--------~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~----~- 151 (244)
-..|.+.|+++.|++.....- .=.+|-.+-+.++..+|+..+... +.+++.+.+ .
T Consensus 43 ~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~~irv~i~~~d~~~v~~~------------i~ka~~~~~~~~d~~ 110 (177)
T PF10602_consen 43 ADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLNVIRVAIFFGDWSHVEKY------------IEKAESLIEKGGDWE 110 (177)
T ss_pred HHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHHHHHHHHHhCCHHHHHHH------------HHHHHHHHhccchHH
Confidence 334699999999999877652 235666677777776666555433 233332211 1
Q ss_pred -----HHHHHHHHHHcCCHHHHHHHHHHCCCHH
Q psy11525 152 -----AAQQAEMYLLGGNISEAESILLQHGLIF 179 (244)
Q Consensus 152 -----~~~~A~ia~~~G~~~eAe~~~l~~g~~~ 179 (244)
..-.|-..+..|+|.+|.++|+.+..-.
T Consensus 111 ~~nrlk~~~gL~~l~~r~f~~AA~~fl~~~~t~ 143 (177)
T PF10602_consen 111 RRNRLKVYEGLANLAQRDFKEAAELFLDSLSTF 143 (177)
T ss_pred HHHHHHHHHHHHHHHhchHHHHHHHHHccCcCC
Confidence 1335677788899999999999986444
No 142
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=71.77 E-value=5.5 Score=22.59 Aligned_cols=22 Identities=23% Similarity=0.193 Sum_probs=15.9
Q ss_pred HHHHHHHHcCCHHHHHHHHHHC
Q psy11525 154 QQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
.+|.++...|++++|.+.|.++
T Consensus 6 ~lg~~~~~~~~~~~A~~~~~~a 27 (34)
T PF07719_consen 6 YLGQAYYQLGNYEEAIEYFEKA 27 (34)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHhCCHHHHHHHHHHH
Confidence 4678888888888888877653
No 143
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=71.76 E-value=7 Score=23.22 Aligned_cols=20 Identities=20% Similarity=0.225 Sum_probs=9.6
Q ss_pred HHHHHHHcCCHHHHHHHHHH
Q psy11525 155 QAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~ 174 (244)
+|.++...|++++|.++|.+
T Consensus 5 Lg~~~~~~g~~~~Ai~~y~~ 24 (36)
T PF13176_consen 5 LGRIYRQQGDYEKAIEYYEQ 24 (36)
T ss_dssp HHHHHHHCT-HHHHHHHHHH
T ss_pred HHHHHHHcCCHHHHHHHHHH
Confidence 44555555555555555443
No 144
>KOG0307|consensus
Probab=71.42 E-value=1.4e+02 Score=31.58 Aligned_cols=59 Identities=22% Similarity=0.188 Sum_probs=43.3
Q ss_pred cCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHcCCc
Q psy11525 162 GGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLDNLEKI 222 (244)
Q Consensus 162 ~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~~~~ 222 (244)
.++++..-..-+=.|++..|++.+..-++.-+|+-+|..++++ ..-..|.+|+...+.+
T Consensus 487 ~~d~d~~Is~alitgd~~~aV~~cl~~~~~a~AliiA~~gg~e--l~~~t~~~Y~~k~~~k 545 (1049)
T KOG0307|consen 487 DSDIDGLISEALITGDFKSAVELCLEANKMADALIIAHAGGTE--LLESTRDKYLAKSNSK 545 (1049)
T ss_pred CCcHHHHHHHHHHhccHHHHHHHHHhhhHHHHHHHHHhcCCHH--HHHHHHHHHHHHhCCh
Confidence 3456666666666688888888888888899999999988875 4444788888665444
No 145
>cd02577 PSTD1 PSTD1: Pseudouridine synthase, a subgroup of the TruD family. This group consists of several hypothetical archeal pseudouridine synthases assigned to the TruD family of psuedouridine synthases. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). The TruD family is comprised of proteins related to Escherichia coli TruD.
Probab=71.33 E-value=7.8 Score=35.38 Aligned_cols=43 Identities=26% Similarity=0.319 Sum_probs=35.1
Q ss_pred HHHHHcCCHHHHHHHHHHC-------CCHHHHHHHHHHhcCHHHHHHHHH
Q psy11525 157 EMYLLGGNISEAESILLQH-------GLIFRAIQVSILTHNWDRALELAL 199 (244)
Q Consensus 157 ~ia~~~G~~~eAe~~~l~~-------g~~~~Ai~m~~~l~~W~~Al~LA~ 199 (244)
=.+++.|++++|..+++.. +...+|.+++.+-++|++|+.+.-
T Consensus 157 G~~ll~gd~~~Av~~il~~~~~~~~~~~~~~ar~~~~~~gd~~~~l~~~p 206 (319)
T cd02577 157 GKLILEGDYEEAAKTYLTARYPLEETGDKKRALRLFPKWGDWERCLKEYP 206 (319)
T ss_pred HHHHHccCHHHHHHHHhcCCcccccchHHHHHHHHHHhcCCHHHHHHHhh
Confidence 3677889999999999953 345678888999999999998753
No 146
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised.
Probab=71.31 E-value=9.5 Score=29.10 Aligned_cols=32 Identities=16% Similarity=0.135 Sum_probs=28.1
Q ss_pred HHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcC
Q psy11525 159 YLLGGNISEAESILLQHGLIFRAIQVSILTHN 190 (244)
Q Consensus 159 a~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~ 190 (244)
....|+|.+=..+|-..|..++|+++.+++.+
T Consensus 36 L~~~~~~~eL~~lY~~kg~h~~AL~ll~~l~~ 67 (108)
T PF10366_consen 36 LKEHGKYQELVDLYQGKGLHRKALELLKKLAD 67 (108)
T ss_pred HHHcCCHHHHHHHHHccCccHHHHHHHHHHhc
Confidence 35678999999999999999999999998776
No 147
>PF15016 DUF4520: Domain of unknown function (DUF4520)
Probab=71.04 E-value=32 Score=25.29 Aligned_cols=62 Identities=19% Similarity=0.244 Sum_probs=42.6
Q ss_pred CCCceEEEEeCCeEEEEEcCCccccCcccceeeeEeeccCCCCCCCeEEeEeCCEEEEEecCCcEEEee-cCccHHHHHH
Q psy11525 9 TDVNILAAMQDSALCVWFFPAVVFADQGLLRKTVLLKDIGEFGKSPSIVSFVKNHLTIRRYDGTVINYP-ISPYISVLHS 87 (244)
Q Consensus 9 ~~~~vlv~~~~~~l~vw~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~v~f~g~~v~~~~~~G~~~~~~-~~p~~~~l~~ 87 (244)
|...-|.|++|+++.+. |.|+++|.... +|+... ..+.+.|.-+||....+. ..| ....+
T Consensus 13 ~~~GrF~AysDgrVr~~------F~Drt~L~l~~------~~~~~~-----~~~~c~l~~pDG~~~~i~i~~p--~~y~r 73 (85)
T PF15016_consen 13 PGVGRFTAYSDGRVRVH------FDDRTILTLIW------NFSSRE-----ELGWCKLTFPDGQQVLIQIEHP--GPYER 73 (85)
T ss_pred CCceEEEEEcCCeEEEE------EcCCCEEEEEe------CCCCcc-----cCCEEEEEccCCCEEEEEccCC--chhHH
Confidence 45578999999998874 88888887442 121221 778899999999987664 345 44445
Q ss_pred HH
Q psy11525 88 YA 89 (244)
Q Consensus 88 ~l 89 (244)
|+
T Consensus 74 YV 75 (85)
T PF15016_consen 74 YV 75 (85)
T ss_pred HH
Confidence 53
No 148
>KOG0553|consensus
Probab=70.50 E-value=11 Score=34.20 Aligned_cols=48 Identities=15% Similarity=0.146 Sum_probs=41.2
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ 135 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d 135 (244)
|.+.|.+++|++-|+.. ..-..|.+||..-+..+++..|+++|.+-=+
T Consensus 125 y~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~~gk~~~A~~aykKaLe 177 (304)
T KOG0553|consen 125 YSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLALGKYEEAIEAYKKALE 177 (304)
T ss_pred HHHhcchHHHHHHHHHHHhcChHHHHHHHHHHHHHHccCcHHHHHHHHHhhhc
Confidence 69999999999999884 2367888999999999999999999987544
No 149
>KOG0772|consensus
Probab=70.03 E-value=6.3 Score=38.21 Aligned_cols=36 Identities=25% Similarity=0.361 Sum_probs=26.0
Q ss_pred cceeeecCCCc-eEEEEeCCeEEEEEcCCccccCcccc
Q psy11525 2 VQSICWNTDVN-ILAAMQDSALCVWFFPAVVFADQGLL 38 (244)
Q Consensus 2 v~~v~W~~~~~-vlv~~~~~~l~vw~~~~~~~~~~~~l 38 (244)
+++++||+|.+ |-.++.|+.+.+|-+++-. +.++..
T Consensus 320 ~tsC~~nrdg~~iAagc~DGSIQ~W~~~~~~-v~p~~~ 356 (641)
T KOG0772|consen 320 VTSCAWNRDGKLIAAGCLDGSIQIWDKGSRT-VRPVMK 356 (641)
T ss_pred ceeeecCCCcchhhhcccCCceeeeecCCcc-cccceE
Confidence 68999999875 4455999999999875433 444443
No 150
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=67.82 E-value=14 Score=21.09 Aligned_cols=26 Identities=15% Similarity=0.024 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
..|-.+|......++++.|..+|.+.
T Consensus 2 ~~~~~~g~~~~~~~~~~~A~~~~~~a 27 (34)
T PF00515_consen 2 EAYYNLGNAYFQLGDYEEALEYYQRA 27 (34)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhCCchHHHHHHHHH
Confidence 47899999999999999999999863
No 151
>COG5290 IkappaB kinase complex, IKAP component [Transcription]
Probab=67.04 E-value=23 Score=36.49 Aligned_cols=49 Identities=12% Similarity=-0.041 Sum_probs=42.6
Q ss_pred HHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcCC
Q psy11525 155 QAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHKT 203 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~ 203 (244)
.|.-....+.+.+|...|---|.+.+|+..|..-+||.+++.++.+..+
T Consensus 928 fa~~l~~n~~~~~aa~aye~~gK~~Ea~gay~sA~mwrec~si~~q~~~ 976 (1243)
T COG5290 928 FAGNLVDNLYHISAAKAYEVEGKYIEAHGAYDSALMWRECGSISTQEKG 976 (1243)
T ss_pred HHHHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhhhcc
Confidence 4555556778999999999999999999999999999999999887654
No 152
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=66.97 E-value=7.1 Score=23.06 Aligned_cols=21 Identities=29% Similarity=0.328 Sum_probs=14.4
Q ss_pred HHHHHHHHcCCHHHHHHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~ 174 (244)
.+|.++...|++++|+.++.+
T Consensus 7 ~la~~~~~~g~~~~A~~~~~~ 27 (42)
T PF13374_consen 7 NLANAYRAQGRYEEALELLEE 27 (42)
T ss_dssp HHHHHHHHCT-HHHHHHHHHH
T ss_pred HHHHHHHhhhhcchhhHHHHH
Confidence 456777777888888877754
No 153
>KOG1585|consensus
Probab=65.72 E-value=30 Score=30.90 Aligned_cols=96 Identities=16% Similarity=0.108 Sum_probs=56.9
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHH-HHHHhhC-CCc---------HHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 107 DILWACLAGMATYSRDLATSEEAYAAIEQVDKVM-YINHIKG-IPV---------KAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 107 ~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~-~l~~i~~-~~~---------~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
..+|..-|.+ +.-|..||+.-.++.+.. .|.+..+ .+. ...+.|.++-...++.|++.+|
T Consensus 24 kad~dgaas~------yekAAvafRnAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~--- 94 (308)
T KOG1585|consen 24 KADWDGAASL------YEKAAVAFRNAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLY--- 94 (308)
T ss_pred CCCchhhHHH------HHHHHHHHHhhccHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHhHHHHHHH---
Confidence 4567666655 455999999999988763 3333332 111 1234566666777788888776
Q ss_pred CCHHHHHHHHHHhcCHHHHH-------HHHHhcCCchhHHHHHHHHHH
Q psy11525 176 GLIFRAIQVSILTHNWDRAL-------ELALRHKTHIDTVLYQRKKYL 216 (244)
Q Consensus 176 g~~~~Ai~m~~~l~~W~~Al-------~LA~~~~~~~~~vl~~r~~yl 216 (244)
++|.+||...|.-+-|- ++-+.-+| +.-+.++++-+
T Consensus 95 ---eKAs~lY~E~GspdtAAmaleKAak~lenv~P--d~AlqlYqral 137 (308)
T KOG1585|consen 95 ---EKASELYVECGSPDTAAMALEKAAKALENVKP--DDALQLYQRAL 137 (308)
T ss_pred ---HHHHHHHHHhCCcchHHHHHHHHHHHhhcCCH--HHHHHHHHHHH
Confidence 45677777777766542 22222223 45555665555
No 154
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=65.48 E-value=21 Score=31.47 Aligned_cols=84 Identities=19% Similarity=0.199 Sum_probs=43.9
Q ss_pred CHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc--H-HHHHHHHHHHcCCHHHHH
Q psy11525 93 SWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV--K-AAQQAEMYLLGGNISEAE 169 (244)
Q Consensus 93 ~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~--~-~~~~A~ia~~~G~~~eAe 169 (244)
+|++|. .+....|...-..++++-|..||.+..+.. .++....+ . ....|.++ ..+++++|.
T Consensus 30 ~~e~Aa---------~~y~~Aa~~fk~~~~~~~A~~ay~kAa~~~-----~~~~~~~~Aa~~~~~Aa~~~-k~~~~~~Ai 94 (282)
T PF14938_consen 30 DYEEAA---------DLYEKAANCFKLAKDWEKAAEAYEKAADCY-----EKLGDKFEAAKAYEEAANCY-KKGDPDEAI 94 (282)
T ss_dssp HHHHHH---------HHHHHHHHHHHHTT-CHHHHHHHHHHHHHH-----HHTT-HHHHHHHHHHHHHHH-HHTTHHHHH
T ss_pred CHHHHH---------HHHHHHHHHHHHHhccchhHHHHHHHHHHH-----HHcCCHHHHHHHHHHHHHHH-HhhCHHHHH
Confidence 566665 456666666677777777777777654322 11111100 1 12233333 334666666
Q ss_pred HHHHHCCCHHHHHHHHHHhcCHHHHHHH
Q psy11525 170 SILLQHGLIFRAIQVSILTHNWDRALEL 197 (244)
Q Consensus 170 ~~~l~~g~~~~Ai~m~~~l~~W~~Al~L 197 (244)
+.| ..|+++|+..|++++|-+.
T Consensus 95 ~~~------~~A~~~y~~~G~~~~aA~~ 116 (282)
T PF14938_consen 95 ECY------EKAIEIYREAGRFSQAAKC 116 (282)
T ss_dssp HHH------HHHHHHHHHCT-HHHHHHH
T ss_pred HHH------HHHHHHHHhcCcHHHHHHH
Confidence 654 3466777777777776443
No 155
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=65.44 E-value=11 Score=23.36 Aligned_cols=21 Identities=33% Similarity=0.319 Sum_probs=11.0
Q ss_pred HHHHHHHHcCCHHHHHHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~ 174 (244)
..|..+..+|++++|+++|.+
T Consensus 6 ~la~~~~~~G~~~~A~~~~~~ 26 (44)
T PF13428_consen 6 ALARAYRRLGQPDEAERLLRR 26 (44)
T ss_pred HHHHHHHHcCCHHHHHHHHHH
Confidence 345555555555555555543
No 156
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=65.10 E-value=13 Score=22.07 Aligned_cols=23 Identities=17% Similarity=0.154 Sum_probs=19.8
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 110 WACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 110 W~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
|..||....+.++++-|+.+|.+
T Consensus 2 l~~Lg~~~~~~g~~~~Ai~~y~~ 24 (36)
T PF13176_consen 2 LNNLGRIYRQQGDYEKAIEYYEQ 24 (36)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHH
Confidence 67899999999999999999987
No 157
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=65.04 E-value=28 Score=23.27 Aligned_cols=49 Identities=16% Similarity=0.022 Sum_probs=29.2
Q ss_pred HHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCC----cHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 114 AGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIP----VKAAQQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 114 A~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~----~~~~~~A~ia~~~G~~~eAe~~~l~ 174 (244)
+..++..++++.|.+++.+ +..+. ......|.+....|++++|.+.|-+
T Consensus 2 ~~~~~~~~~~~~A~~~~~~------------~l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~ 54 (73)
T PF13371_consen 2 KQIYLQQEDYEEALEVLER------------ALELDPDDPELWLQRARCLFQLGRYEEALEDLER 54 (73)
T ss_pred HHHHHhCCCHHHHHHHHHH------------HHHhCcccchhhHHHHHHHHHhccHHHHHHHHHH
Confidence 3445556666666555443 22221 1234578888888888888777654
No 158
>PF12894 Apc4_WD40: Anaphase-promoting complex subunit 4 WD40 domain
Probab=64.87 E-value=12 Score=24.16 Aligned_cols=26 Identities=19% Similarity=0.542 Sum_probs=19.8
Q ss_pred cceeeecCCCceEEE-EeCCeEEEEEc
Q psy11525 2 VQSICWNTDVNILAA-MQDSALCVWFF 27 (244)
Q Consensus 2 v~~v~W~~~~~vlv~-~~~~~l~vw~~ 27 (244)
|+.++|+|..|+++. ..++.+.+|-.
T Consensus 14 v~~~~w~P~mdLiA~~t~~g~v~v~Rl 40 (47)
T PF12894_consen 14 VSCMSWCPTMDLIALGTEDGEVLVYRL 40 (47)
T ss_pred EEEEEECCCCCEEEEEECCCeEEEEEC
Confidence 678999999998776 44677877644
No 159
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=64.27 E-value=9.7 Score=20.97 Aligned_cols=20 Identities=35% Similarity=0.381 Sum_probs=15.2
Q ss_pred HHHHHHHHcCCHHHHHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILL 173 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l 173 (244)
..|..+...|++++|+.++-
T Consensus 6 ~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 6 ALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHcCCHHHHHHHHh
Confidence 46778888888888887763
No 160
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=64.16 E-value=24 Score=24.05 Aligned_cols=21 Identities=24% Similarity=0.165 Sum_probs=13.1
Q ss_pred HHHHHHHHcCCHHHHHHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~ 174 (244)
..|.++..+|++++|...|.+
T Consensus 10 ~la~~~~~~~~~~~A~~~~~~ 30 (78)
T PF13424_consen 10 NLARVYRELGRYDEALDYYEK 30 (78)
T ss_dssp HHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHH
Confidence 456677777777777776643
No 161
>PRK04841 transcriptional regulator MalT; Provisional
Probab=63.96 E-value=44 Score=34.12 Aligned_cols=107 Identities=16% Similarity=0.037 Sum_probs=63.2
Q ss_pred HHcCCHHHHHHHHhcc----CCH------HHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCC---cHHHHH
Q psy11525 89 AASHSWPQALSLCRTL----NDD------ILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIP---VKAAQQ 155 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~----~~~------~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~---~~~~~~ 155 (244)
...|++++|...++.. ... ..+..+|......++++.|+..|.+.-. ..++..... ......
T Consensus 463 ~~~g~~~~A~~~~~~al~~~~~~~~~~~~~a~~~lg~~~~~~G~~~~A~~~~~~al~-----~~~~~g~~~~~~~~~~~l 537 (903)
T PRK04841 463 INDGDPEEAERLAELALAELPLTWYYSRIVATSVLGEVHHCKGELARALAMMQQTEQ-----MARQHDVYHYALWSLLQQ 537 (903)
T ss_pred HhCCCHHHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH-----HHhhhcchHHHHHHHHHH
Confidence 5788999998877663 111 2234566666777788777777655321 111111100 012356
Q ss_pred HHHHHHcCCHHHHHHHHHHC-------CC---HHHH------HHHHHHhcCHHHHHHHHHh
Q psy11525 156 AEMYLLGGNISEAESILLQH-------GL---IFRA------IQVSILTHNWDRALELALR 200 (244)
Q Consensus 156 A~ia~~~G~~~eAe~~~l~~-------g~---~~~A------i~m~~~l~~W~~Al~LA~~ 200 (244)
|.++...|++++|+..+.++ |. +-.+ -.++...|+|++|.+.+.+
T Consensus 538 a~~~~~~G~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~la~~~~~~G~~~~A~~~~~~ 598 (903)
T PRK04841 538 SEILFAQGFLQAAYETQEKAFQLIEEQHLEQLPMHEFLLRIRAQLLWEWARLDEAEQCARK 598 (903)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHhccccccHHHHHHHHHHHHHHHhcCHHHHHHHHHH
Confidence 88889999999998876553 11 1111 1345667999988776654
No 162
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=63.74 E-value=35 Score=33.25 Aligned_cols=21 Identities=19% Similarity=0.178 Sum_probs=11.0
Q ss_pred HHHHHHHHHcCCHHHHHHHHH
Q psy11525 153 AQQAEMYLLGGNISEAESILL 173 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l 173 (244)
..+|.++.+.|++++|...|.
T Consensus 457 ~~lG~~~~~~G~~~eA~~~~~ 477 (517)
T PRK10153 457 VLLGKVYELKGDNRLAADAYS 477 (517)
T ss_pred HHHHHHHHHcCCHHHHHHHHH
Confidence 344555555555555555443
No 163
>KOG2280|consensus
Probab=63.63 E-value=1.2e+02 Score=31.17 Aligned_cols=137 Identities=17% Similarity=0.102 Sum_probs=83.8
Q ss_pred HHHHHHHcCCHHHHHHHHhccCCHH---HHHHH--HHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhC-CCc------H
Q psy11525 84 VLHSYAASHSWPQALSLCRTLNDDI---LWACL--AGMATYSRDLATSEEAYAAIEQVDKVMYINHIKG-IPV------K 151 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~~~~~---lW~~L--A~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~-~~~------~ 151 (244)
.+.+++..+.|..|.++|..++-+. .|-.+ |..-++. .|..--..++.|.+ +.. .
T Consensus 443 vi~Rl~~r~~Y~vaIQva~~l~~p~~~~~~Vl~~Wa~~kI~~-------------~d~~d~~vld~I~~kls~~~~~~iS 509 (829)
T KOG2280|consen 443 VIDRLVDRHLYSVAIQVAKLLNLPESQGDRVLLEWARRKIKQ-------------SDKMDEEVLDKIDEKLSAKLTPGIS 509 (829)
T ss_pred hhHHHHhcchhHHHHHHHHHhCCccccccHHHHHHHHHHHhc-------------cCccchHHHHHHHHHhcccCCCcee
Confidence 3455688999999999999976544 45432 2222222 12222233333321 111 1
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcC-Cc-hhHHHHHHHHHH------HHc-CCc
Q psy11525 152 AAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHK-TH-IDTVLYQRKKYL------DNL-EKI 222 (244)
Q Consensus 152 ~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~-~~-~~~vl~~r~~yl------~~~-~~~ 222 (244)
....|.-|-..|+++.|-++...-.+-..=+.+...+++.+.||.=|.+-+ ++ +-+|+..=.+.+ +.. .+.
T Consensus 510 y~~iA~~Ay~~GR~~LA~kLle~E~~~~~qV~lLL~m~~~~~AL~kaies~d~~Li~~Vllhlk~~~~~s~l~~~l~~~p 589 (829)
T KOG2280|consen 510 YAAIARRAYQEGRFELARKLLELEPRSGEQVPLLLKMKDSSLALKKAIESGDTDLIIQVLLHLKNKLNRSSLFMTLRNQP 589 (829)
T ss_pred HHHHHHHHHhcCcHHHHHHHHhcCCCccchhHHHhccchHHHHHHHHHhcCCchhHHHHHHHHHHHHHHHHHHHHHHhch
Confidence 244677788899999999999988888888999999999999998887765 44 444443322222 222 334
Q ss_pred ccHHHHHhccc
Q psy11525 223 ETNEKFLRLQS 233 (244)
Q Consensus 223 e~~~~f~~~~~ 233 (244)
-+...|+++.+
T Consensus 590 ~a~~lY~~~~r 600 (829)
T KOG2280|consen 590 LALSLYRQFMR 600 (829)
T ss_pred hhhHHHHHHHH
Confidence 45555555543
No 164
>KOG2034|consensus
Probab=63.28 E-value=8.8 Score=39.42 Aligned_cols=48 Identities=19% Similarity=0.245 Sum_probs=39.5
Q ss_pred CCCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHcCCc
Q psy11525 175 HGLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLDNLEKI 222 (244)
Q Consensus 175 ~g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~~~~ 222 (244)
.+..+.+=+.|.|.|+++.|++.|.....++++|+..++.|+.+.+.-
T Consensus 358 ndE~R~vWk~yLd~g~y~kAL~~ar~~p~~le~Vl~~qAdf~f~~k~y 405 (911)
T KOG2034|consen 358 NDEARDVWKTYLDKGEFDKALEIARTRPDALETVLLKQADFLFQDKEY 405 (911)
T ss_pred ccchHHHHHHHHhcchHHHHHHhccCCHHHHHHHHHHHHHHHHhhhHH
Confidence 345677788899999999999999987555999999999999666543
No 165
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=63.10 E-value=11 Score=21.40 Aligned_cols=23 Identities=17% Similarity=0.161 Sum_probs=17.5
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 153 AQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
...|.++..+|++++|.+.|.++
T Consensus 5 ~~lg~~y~~~~~~~~A~~~~~~a 27 (34)
T PF13181_consen 5 YNLGKIYEQLGDYEEALEYFEKA 27 (34)
T ss_dssp HHHHHHHHHTTSHHHHHHHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHH
Confidence 35678888888888888887654
No 166
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=61.95 E-value=37 Score=31.92 Aligned_cols=59 Identities=22% Similarity=0.203 Sum_probs=34.1
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHH
Q psy11525 105 NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESIL 172 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~ 172 (244)
+++.+|..||.-|++++.+.-|...|.+-- +...-.+.....|+.+..+|+..+|++.+
T Consensus 326 ~~p~L~~tLG~L~~k~~~w~kA~~~leaAl---------~~~~s~~~~~~la~~~~~~g~~~~A~~~r 384 (400)
T COG3071 326 EDPLLLSTLGRLALKNKLWGKASEALEAAL---------KLRPSASDYAELADALDQLGEPEEAEQVR 384 (400)
T ss_pred CChhHHHHHHHHHHHhhHHHHHHHHHHHHH---------hcCCChhhHHHHHHHHHHcCChHHHHHHH
Confidence 567788888888888888887776665211 00111112244555555566555555544
No 167
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=61.33 E-value=49 Score=31.75 Aligned_cols=62 Identities=8% Similarity=-0.116 Sum_probs=46.0
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc----HHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 105 NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV----KAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~----~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
++...|..+|......++++.|..+|.+.= .+..-.. .....|.++..+|++++|...|.++
T Consensus 73 ~~a~a~~NLG~AL~~lGryeEAIa~f~rAL---------eL~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrA 138 (453)
T PLN03098 73 KTAEDAVNLGLSLFSKGRVKDALAQFETAL---------ELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTA 138 (453)
T ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHHH---------hhCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 456789999999999999999999986411 1111111 1356899999999999999988764
No 168
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=61.12 E-value=35 Score=27.44 Aligned_cols=64 Identities=13% Similarity=0.068 Sum_probs=44.7
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 107 DILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 107 ~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
...+..+|......++++.|..+|.+.=.... .-..........|.++...|++++|...|.++
T Consensus 35 a~~~~~lg~~~~~~g~~~~A~~~~~~al~~~~-----~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~a 98 (172)
T PRK02603 35 AFVYYRDGMSAQADGEYAEALENYEEALKLEE-----DPNDRSYILYNMGIIYASNGEHDKALEYYHQA 98 (172)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhh-----ccchHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 44688899999999999999999985321100 00000112356789999999999999988765
No 169
>KOG4626|consensus
Probab=61.02 E-value=1.1e+02 Score=31.08 Aligned_cols=22 Identities=23% Similarity=0.308 Sum_probs=14.3
Q ss_pred HHHHHHHHHcCCHHHHHHHHHH
Q psy11525 153 AQQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~ 174 (244)
...|.++-.+|++++|-..|..
T Consensus 392 nNLa~i~kqqgnl~~Ai~~Yke 413 (966)
T KOG4626|consen 392 NNLASIYKQQGNLDDAIMCYKE 413 (966)
T ss_pred hhHHHHHHhcccHHHHHHHHHH
Confidence 3466777777777777665543
No 170
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=60.68 E-value=64 Score=28.38 Aligned_cols=78 Identities=14% Similarity=0.046 Sum_probs=61.3
Q ss_pred HHcCCHHHHHHHHhcc-------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHH
Q psy11525 89 AASHSWPQALSLCRTL-------NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLL 161 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~ 161 (244)
.+.|++++|+..-+.. ....-|.+++.-+++.++++-|+.-|.+ .|+.-...+...+.+|.....
T Consensus 114 C~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~r--------aL~~dp~~~~~~l~~a~~~~~ 185 (250)
T COG3063 114 CAQGRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKR--------ALELDPQFPPALLELARLHYK 185 (250)
T ss_pred HhCCChHHHHHHHHHHHhCCCCCCcchhhhhhHHHHhhcCCchhHHHHHHH--------HHHhCcCCChHHHHHHHHHHh
Confidence 6788999999876653 4578899999999999999999987765 344444555666778888889
Q ss_pred cCCHHHHHHHHHH
Q psy11525 162 GGNISEAESILLQ 174 (244)
Q Consensus 162 ~G~~~eAe~~~l~ 174 (244)
.|++-.|...|..
T Consensus 186 ~~~y~~Ar~~~~~ 198 (250)
T COG3063 186 AGDYAPARLYLER 198 (250)
T ss_pred cccchHHHHHHHH
Confidence 9999888776643
No 171
>KOG1126|consensus
Probab=60.36 E-value=43 Score=33.39 Aligned_cols=104 Identities=12% Similarity=0.041 Sum_probs=62.5
Q ss_pred HHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHH----HHHHH
Q psy11525 87 SYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKA----AQQAE 157 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~----~~~A~ 157 (244)
+++....+|.|...-+.. .+-..|--||...++...++.|+.-|.+ +.+++..+ ...|.
T Consensus 464 E~~~~ee~d~a~~~fr~Al~~~~rhYnAwYGlG~vy~Kqek~e~Ae~~fqk------------A~~INP~nsvi~~~~g~ 531 (638)
T KOG1126|consen 464 ESIATEEFDKAMKSFRKALGVDPRHYNAWYGLGTVYLKQEKLEFAEFHFQK------------AVEINPSNSVILCHIGR 531 (638)
T ss_pred hhhhhHHHHhHHHHHHhhhcCCchhhHHHHhhhhheeccchhhHHHHHHHh------------hhcCCccchhHHhhhhH
Confidence 446666677766655442 4557788888888888888877755543 33332221 23456
Q ss_pred HHHHcCCHHHHHHHHHHCCC--------HHHHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 158 MYLLGGNISEAESILLQHGL--------IFRAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 158 ia~~~G~~~eAe~~~l~~g~--------~~~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
+.-..|+.++|-++|-++=. ...-...+..+++.++|+..-+..+
T Consensus 532 ~~~~~k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il~~~~~~~eal~~LEeLk 584 (638)
T KOG1126|consen 532 IQHQLKRKDKALQLYEKAIHLDPKNPLCKYHRASILFSLGRYVEALQELEELK 584 (638)
T ss_pred HHHHhhhhhHHHHHHHHHHhcCCCCchhHHHHHHHHHhhcchHHHHHHHHHHH
Confidence 66667777777777766521 1222445566777777777666544
No 172
>KOG0543|consensus
Probab=60.20 E-value=30 Score=32.60 Aligned_cols=87 Identities=11% Similarity=0.024 Sum_probs=55.7
Q ss_pred HHHcCCHHHHHHHHh----ccC----CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHH-HHHHHhhCCCcHH----HH
Q psy11525 88 YAASHSWPQALSLCR----TLN----DDILWACLAGMATYSRDLATSEEAYAAIEQVDKV-MYINHIKGIPVKA----AQ 154 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~----~~~----~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v-~~l~~i~~~~~~~----~~ 154 (244)
|...|+|..|...=+ .+. .+.-=...+......-++++| -||.+++++..+ +...++..+...+ =|
T Consensus 218 ~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA-~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyR 296 (397)
T KOG0543|consen 218 LFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLA-ACYLKLKEYKEAIESCNKVLELDPNNVKALYR 296 (397)
T ss_pred HHhhchHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHhhHHH-HHHHhhhhHHHHHHHHHHHHhcCCCchhHHHH
Confidence 366666666655421 121 222334455555555556655 589999998754 5556666654433 36
Q ss_pred HHHHHHHcCCHHHHHHHHHHC
Q psy11525 155 QAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~~ 175 (244)
.|+.++..|.|+.|...|.++
T Consensus 297 rG~A~l~~~e~~~A~~df~ka 317 (397)
T KOG0543|consen 297 RGQALLALGEYDLARDDFQKA 317 (397)
T ss_pred HHHHHHhhccHHHHHHHHHHH
Confidence 899999999999999988765
No 173
>PF04190 DUF410: Protein of unknown function (DUF410) ; InterPro: IPR007317 This is a family of conserved eukaryotic proteins with undetermined function.; PDB: 3LKU_E 2WPV_G.
Probab=58.56 E-value=33 Score=30.23 Aligned_cols=32 Identities=25% Similarity=-0.015 Sum_probs=26.5
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHH
Q psy11525 152 AAQQAEMYLLGGNISEAESILLQHGLIFRAIQ 183 (244)
Q Consensus 152 ~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~ 183 (244)
+...|+.+...|++.+|+.+|+.++++.....
T Consensus 93 H~~~a~~~~~e~~~~~A~~Hfl~~~~~~~~~~ 124 (260)
T PF04190_consen 93 HHLLAEKLWKEGNYYEAERHFLLGTDPSAFAY 124 (260)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHTS-HHHHHHH
T ss_pred HHHHHHHHHhhccHHHHHHHHHhcCChhHHHH
Confidence 45689999999999999999999999887765
No 174
>KOG2376|consensus
Probab=58.22 E-value=2e+02 Score=28.67 Aligned_cols=79 Identities=14% Similarity=-0.004 Sum_probs=40.1
Q ss_pred HHHHHHHHHcCCHHHHHHHHHH---CCCHHHHHHHHHHhcCHHHHHHH--HHhcC--Cc-hhHHHHHHHHHH-HHcCCcc
Q psy11525 153 AQQAEMYLLGGNISEAESILLQ---HGLIFRAIQVSILTHNWDRALEL--ALRHK--TH-IDTVLYQRKKYL-DNLEKIE 223 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~---~g~~~~Ai~m~~~l~~W~~Al~L--A~~~~--~~-~~~vl~~r~~yl-~~~~~~e 223 (244)
.+.|.++--+|+|++|-++|.. ++..+.=.+.+..+.---.++.. -+..+ +. --+.++.++--+ +.....+
T Consensus 114 ~L~AQvlYrl~~ydealdiY~~L~kn~~dd~d~~~r~nl~a~~a~l~~~~~q~v~~v~e~syel~yN~Ac~~i~~gky~q 193 (652)
T KOG2376|consen 114 ELRAQVLYRLERYDEALDIYQHLAKNNSDDQDEERRANLLAVAAALQVQLLQSVPEVPEDSYELLYNTACILIENGKYNQ 193 (652)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHhhhHHHHHhccCCCcchHHHHHHHHHHHHhcccHHH
Confidence 5678888889999999988864 34444333333322222222221 11111 11 234444444444 3444456
Q ss_pred cHHHHHhc
Q psy11525 224 TNEKFLRL 231 (244)
Q Consensus 224 ~~~~f~~~ 231 (244)
+++..+++
T Consensus 194 A~elL~kA 201 (652)
T KOG2376|consen 194 AIELLEKA 201 (652)
T ss_pred HHHHHHHH
Confidence 66666666
No 175
>PF07163 Pex26: Pex26 protein; InterPro: IPR010797 This family consists of Pex26 and related mammalian proteins. Pex26 is a type II peroxisomal membrane protein that recruits Pex6-Pex1 complexes to peroxisomes []. Mutations in Pex26 can lead to human disorders [].; GO: 0032403 protein complex binding, 0045046 protein import into peroxisome membrane, 0005779 integral to peroxisomal membrane
Probab=57.69 E-value=1.5e+02 Score=26.92 Aligned_cols=127 Identities=17% Similarity=0.145 Sum_probs=73.8
Q ss_pred HHHHHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCC-----------HHHHHHHHHHhCChhHH--HHHHHh---
Q psy11525 82 ISVLHSYAASHSWPQALSLCRTLNDDILWACLAGMATYSRD-----------LATSEEAYAAIEQVDKV--MYINHI--- 145 (244)
Q Consensus 82 ~~~l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~-----------l~~A~~~y~~l~d~~~v--~~l~~i--- 145 (244)
+++-...+=..||..|+..|+.+ |..|+..|...-+ =-+.+.|++.++....| +.++..
T Consensus 39 e~Aad~LvV~rdF~aal~tCerg-----lqsL~~~a~~ee~~~~~~evK~sLcvvGIQALAEmnrWreVLsWvlqyYq~p 113 (309)
T PF07163_consen 39 EEAADLLVVHRDFQAALETCERG-----LQSLASDADAEEPAGSSLEVKCSLCVVGIQALAEMNRWREVLSWVLQYYQVP 113 (309)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-----HHHhcccccccccccchhhhhhhhhhhhHHHHHHHhhHHHHHHHHHHHhcCc
Confidence 44455557889999999999875 7777655443222 12678899999998887 344444
Q ss_pred hCCCcHHHH-HHHHHHHcCCH---HHHHHHHHHCCC------HHHHHHHHH-----HhcCHHHHHHHHHhcCCchhHHHH
Q psy11525 146 KGIPVKAAQ-QAEMYLLGGNI---SEAESILLQHGL------IFRAIQVSI-----LTHNWDRALELALRHKTHIDTVLY 210 (244)
Q Consensus 146 ~~~~~~~~~-~A~ia~~~G~~---~eAe~~~l~~g~------~~~Ai~m~~-----~l~~W~~Al~LA~~~~~~~~~vl~ 210 (244)
++++.+-.. +--++.-.|.+ .|+....+++-. +..-.|+|. =+|.|++|-+++....---++
T Consensus 114 EklPpkIleLCILLysKv~Ep~amlev~~~WL~~p~Nq~lp~y~~vaELyLl~VLlPLG~~~eAeelv~gs~af~Ee--- 190 (309)
T PF07163_consen 114 EKLPPKILELCILLYSKVQEPAAMLEVASAWLQDPSNQSLPEYGTVAELYLLHVLLPLGHFSEAEELVVGSAAFTEE--- 190 (309)
T ss_pred ccCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHhCcccCCchhhHHHHHHHHHHHHhccccHHHHHHHHhcCCCCCHH---
Confidence 344444322 11222223333 344455555422 233345554 479999999999654422234
Q ss_pred HHHHHH
Q psy11525 211 QRKKYL 216 (244)
Q Consensus 211 ~r~~yl 216 (244)
.|+..|
T Consensus 191 Qr~~aL 196 (309)
T PF07163_consen 191 QRQEAL 196 (309)
T ss_pred HHHHHH
Confidence 555555
No 176
>KOG1156|consensus
Probab=57.41 E-value=1.5e+02 Score=29.85 Aligned_cols=43 Identities=14% Similarity=0.209 Sum_probs=33.7
Q ss_pred cCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 91 SHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 91 ~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
.|+-++|+..++.+ .+.--|-.+|-.--..++++.|.+||++-
T Consensus 54 lg~~~ea~~~vr~glr~d~~S~vCwHv~gl~~R~dK~Y~eaiKcy~nA 101 (700)
T KOG1156|consen 54 LGKKEEAYELVRLGLRNDLKSHVCWHVLGLLQRSDKKYDEAIKCYRNA 101 (700)
T ss_pred ccchHHHHHHHHHHhccCcccchhHHHHHHHHhhhhhHHHHHHHHHHH
Confidence 56677888888876 45677888888888888888888888753
No 177
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=57.37 E-value=10 Score=33.51 Aligned_cols=62 Identities=15% Similarity=0.056 Sum_probs=36.8
Q ss_pred HHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcC-CHHHHHHHHHHCCC
Q psy11525 116 MATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGG-NISEAESILLQHGL 177 (244)
Q Consensus 116 ~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G-~~~eAe~~~l~~g~ 177 (244)
.|...+++++|+.+|.++.+......-...+++.+.--..|.-....+ ++++|-..+.++-+
T Consensus 2 ~A~~~~~~~~A~~~~~K~~~~~~~~~~~~~~~La~~~yn~G~~l~~~~~~~~~a~~wL~~a~~ 64 (278)
T PF08631_consen 2 LAWKQGDLDLAEHMYSKAKDLLNSLDPDMAEELARVCYNIGKSLLSKKDKYEEAVKWLQRAYD 64 (278)
T ss_pred cchhhCCHHHHHHHHHHhhhHHhcCCcHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHH
Confidence 478899999999999998875532222222222222233455555566 67666666655433
No 178
>KOG2002|consensus
Probab=56.99 E-value=23 Score=36.91 Aligned_cols=104 Identities=12% Similarity=0.084 Sum_probs=67.8
Q ss_pred HHcCCHHHHHHHHhc---cCCHHHHHH--HHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcC
Q psy11525 89 AASHSWPQALSLCRT---LNDDILWAC--LAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGG 163 (244)
Q Consensus 89 l~~~~~~~A~~~~~~---~~~~~lW~~--LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G 163 (244)
.+.+..+.|+++=.. .+...+|+. +|..--..++++.|.-.|..+.+. +....+.-...|++++.+|
T Consensus 623 k~kk~~~KAlq~y~kvL~~dpkN~yAANGIgiVLA~kg~~~~A~dIFsqVrEa--------~~~~~dv~lNlah~~~e~~ 694 (1018)
T KOG2002|consen 623 KEKKHQEKALQLYGKVLRNDPKNMYAANGIGIVLAEKGRFSEARDIFSQVREA--------TSDFEDVWLNLAHCYVEQG 694 (1018)
T ss_pred HHHHHHHHHHHHHHHHHhcCcchhhhccchhhhhhhccCchHHHHHHHHHHHH--------HhhCCceeeeHHHHHHHHH
Confidence 456677778776443 256788887 444445667888888888775432 2234555677899999999
Q ss_pred CHHHHHHHHHHCCCHHH----------HHHHHHHhcCHHHHHHHHHh
Q psy11525 164 NISEAESILLQHGLIFR----------AIQVSILTHNWDRALELALR 200 (244)
Q Consensus 164 ~~~eAe~~~l~~g~~~~----------Ai~m~~~l~~W~~Al~LA~~ 200 (244)
+|..|-+.|-.+-.-.. -=+.|-+.++|.+|.+.+.+
T Consensus 695 qy~~AIqmYe~~lkkf~~~~~~~vl~~Lara~y~~~~~~eak~~ll~ 741 (1018)
T KOG2002|consen 695 QYRLAIQMYENCLKKFYKKNRSEVLHYLARAWYEAGKLQEAKEALLK 741 (1018)
T ss_pred HHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHhhhHHHHHHHHHH
Confidence 99999888876632211 12334456678777766554
No 179
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=56.01 E-value=66 Score=29.03 Aligned_cols=30 Identities=17% Similarity=0.065 Sum_probs=27.2
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHhC
Q psy11525 105 NDDILWACLAGMATYSRDLATSEEAYAAIE 134 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~ 134 (244)
++..-|..||...+..++++.|..+|.+-.
T Consensus 154 ~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~ 183 (287)
T COG4235 154 GDAEGWDLLGRAYMALGRASDALLAYRNAL 183 (287)
T ss_pred CCchhHHHHHHHHHHhcchhHHHHHHHHHH
Confidence 567899999999999999999999998754
No 180
>KOG3060|consensus
Probab=55.29 E-value=35 Score=30.46 Aligned_cols=92 Identities=10% Similarity=0.023 Sum_probs=56.2
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHhCCh-------hHH---------------HHHHHhhCCCcH----HHHHHHHHHH
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAIEQV-------DKV---------------MYINHIKGIPVK----AAQQAEMYLL 161 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l~d~-------~~v---------------~~l~~i~~~~~~----~~~~A~ia~~ 161 (244)
.+..++.-.||..++.++|..|+-.+.|- .++ +++.++.+-+.. ..+.-.+.--
T Consensus 53 ~l~EqV~IAAld~~~~~lAq~C~~~L~~~fp~S~RV~~lkam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAilka 132 (289)
T KOG3060|consen 53 TLYEQVFIAALDTGRDDLAQKCINQLRDRFPGSKRVGKLKAMLLEATGNYKEAIEYYESLLEDDPTDTVIRKRKLAILKA 132 (289)
T ss_pred HHHHHHHHHHHHhcchHHHHHHHHHHHHhCCCChhHHHHHHHHHHHhhchhhHHHHHHHHhccCcchhHHHHHHHHHHHH
Confidence 45678899999999999999999987651 111 222222221111 1233334445
Q ss_pred cCCHHHHHHHHHHC-----C---CHHHHHHHHHHhcCHHHHHHHHH
Q psy11525 162 GGNISEAESILLQH-----G---LIFRAIQVSILTHNWDRALELAL 199 (244)
Q Consensus 162 ~G~~~eAe~~~l~~-----g---~~~~Ai~m~~~l~~W~~Al~LA~ 199 (244)
+|+--+|-+.+..- + -|-.+-++|..++++++|.---+
T Consensus 133 ~GK~l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClE 178 (289)
T KOG3060|consen 133 QGKNLEAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLE 178 (289)
T ss_pred cCCcHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHhHHHHHHHHHH
Confidence 67766776655432 2 24456789999999999854333
No 181
>KOG2066|consensus
Probab=55.25 E-value=93 Score=31.93 Aligned_cols=51 Identities=4% Similarity=-0.023 Sum_probs=41.8
Q ss_pred HHHcCCHHHHHHHHhcc-------CCHHHHHHHHHHHHHcCCHHHH-HHHHHHhCChhH
Q psy11525 88 YAASHSWPQALSLCRTL-------NDDILWACLAGMATYSRDLATS-EEAYAAIEQVDK 138 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-------~~~~lW~~LA~~Al~~~~l~~A-~~~y~~l~d~~~ 138 (244)
+++.+.+++|+.+++.. +....|+..-..-+..+.++.| .+|+.-+|+-..
T Consensus 366 ll~~k~yeeAl~~~k~~~~~~~~~~i~kv~~~yI~HLl~~~~y~~Aas~~p~m~gn~~~ 424 (846)
T KOG2066|consen 366 LLEKKKYEEALDAAKASIGNEERFVIKKVGKTYIDHLLFEGKYDEAASLCPKMLGNNAA 424 (846)
T ss_pred HHHhhHHHHHHHHHHhccCCccccchHHHHHHHHHHHHhcchHHHHHhhhHHHhcchHH
Confidence 49999999999999885 3578999999999998888765 678888888543
No 182
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=54.55 E-value=24 Score=18.06 Aligned_cols=26 Identities=15% Similarity=-0.029 Sum_probs=21.0
Q ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 108 ILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 108 ~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
..|..+|......++++.|..+|.+.
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~ 27 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKA 27 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 35778888889999999999888653
No 183
>KOG0686|consensus
Probab=54.49 E-value=39 Score=32.09 Aligned_cols=31 Identities=10% Similarity=-0.029 Sum_probs=25.0
Q ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHhCChh
Q psy11525 107 DILWACLAGMATYSRDLATSEEAYAAIEQVD 137 (244)
Q Consensus 107 ~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~ 137 (244)
..--..||..-+..|+++.|.|||.+..|+.
T Consensus 150 Rra~~Dl~dhy~~cG~l~~Alr~YsR~RdYC 180 (466)
T KOG0686|consen 150 RRALEDLGDHYLDCGQLDNALRCYSRARDYC 180 (466)
T ss_pred HHHHHHHHHHHHHhccHHHHHhhhhhhhhhh
Confidence 4456678888889999999999999977754
No 184
>PF07035 Mic1: Colon cancer-associated protein Mic1-like; InterPro: IPR009755 This entry represents the C terminus (approximately 160 residues) of a number of proteins that resemble colon cancer-associated protein Mic1.
Probab=53.84 E-value=1.3e+02 Score=24.92 Aligned_cols=84 Identities=18% Similarity=0.093 Sum_probs=47.7
Q ss_pred cCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCC-----HHHHHHHHHHCC-C
Q psy11525 104 LNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGN-----ISEAESILLQHG-L 177 (244)
Q Consensus 104 ~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~-----~~eAe~~~l~~g-~ 177 (244)
-..+.++..+-...+..+++.. +..+-.-.-++|.. ..|...+..|. ++-|-+++.+-| .
T Consensus 26 ~~~~~L~~lli~lLi~~~~~~~-------------L~qllq~~Vi~DSk-~lA~~LLs~~~~~~~~~Ql~lDMLkRL~~~ 91 (167)
T PF07035_consen 26 PVQHELYELLIDLLIRNGQFSQ-------------LHQLLQYHVIPDSK-PLACQLLSLGNQYPPAYQLGLDMLKRLGTA 91 (167)
T ss_pred CCCHHHHHHHHHHHHHcCCHHH-------------HHHHHhhcccCCcH-HHHHHHHHhHccChHHHHHHHHHHHHhhhh
Confidence 3667788888777777666543 32222222233322 12333333333 566666777766 6
Q ss_pred HHHHHHHHHHhcCHHHHHHHHHhc
Q psy11525 178 IFRAIQVSILTHNWDRALELALRH 201 (244)
Q Consensus 178 ~~~Ai~m~~~l~~W~~Al~LA~~~ 201 (244)
.+.-++....-|+.-+|+++|++.
T Consensus 92 ~~~iievLL~~g~vl~ALr~ar~~ 115 (167)
T PF07035_consen 92 YEEIIEVLLSKGQVLEALRYARQY 115 (167)
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHc
Confidence 666667777777777777777663
No 185
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=53.56 E-value=19 Score=19.98 Aligned_cols=19 Identities=16% Similarity=0.065 Sum_probs=11.3
Q ss_pred HHHHHCCCHHHHHHHHHHh
Q psy11525 170 SILLQHGLIFRAIQVSILT 188 (244)
Q Consensus 170 ~~~l~~g~~~~Ai~m~~~l 188 (244)
..|.+.|++++|++++.++
T Consensus 8 ~~~~~~~~~~~a~~~~~~M 26 (35)
T TIGR00756 8 DGLCKAGRVEEALELFKEM 26 (35)
T ss_pred HHHHHCCCHHHHHHHHHHH
Confidence 3455666666666666554
No 186
>KOG3785|consensus
Probab=52.42 E-value=1.9e+02 Score=27.38 Aligned_cols=96 Identities=19% Similarity=0.157 Sum_probs=63.7
Q ss_pred ccHHHHHHH-HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHH
Q psy11525 80 PYISVLHSY-AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAA 153 (244)
Q Consensus 80 p~~~~l~~~-l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~ 153 (244)
|=...+..+ .-..+|++-+-....+ +++..--.+|+.-+..++...||+.|.++.+.++=.-+- --.
T Consensus 360 pGRQsmAs~fFL~~qFddVl~YlnSi~sYF~NdD~Fn~N~AQAk~atgny~eaEelf~~is~~~ikn~~~-------Y~s 432 (557)
T KOG3785|consen 360 PGRQSMASYFFLSFQFDDVLTYLNSIESYFTNDDDFNLNLAQAKLATGNYVEAEELFIRISGPEIKNKIL-------YKS 432 (557)
T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchhhhHHHHHHHHhcChHHHHHHHhhhcChhhhhhHH-------HHH
Confidence 444444433 4466788877776654 566666789999999999999999999988765211100 113
Q ss_pred HHHHHHHHcCCHHHHHHHHHHCCCHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILLQHGLIFRAI 182 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai 182 (244)
..|.+++..|+.+.|=++|++.+-+-++.
T Consensus 433 ~LArCyi~nkkP~lAW~~~lk~~t~~e~f 461 (557)
T KOG3785|consen 433 MLARCYIRNKKPQLAWDMMLKTNTPSERF 461 (557)
T ss_pred HHHHHHHhcCCchHHHHHHHhcCCchhHH
Confidence 45667777777777777776666665543
No 187
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=52.30 E-value=20 Score=31.11 Aligned_cols=50 Identities=14% Similarity=0.066 Sum_probs=25.3
Q ss_pred HHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhH
Q psy11525 87 SYAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDK 138 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~ 138 (244)
+|...|+|++|.++.+.+ ...|+.=+-..+...=+..+.+|+.++||...
T Consensus 187 ey~~~g~~~~A~~~l~~~--~~~yr~egW~~l~~~~l~~l~~Ca~~~~~~~~ 236 (247)
T PF11817_consen 187 EYFRLGDYDKALKLLEPA--ASSYRREGWWSLLTEVLWRLLECAKRLGDVED 236 (247)
T ss_pred HHHHCCCHHHHHHHHHHH--HHHHHhCCcHHHHHHHHHHHHHHHHHhCCHHH
Confidence 346666777666666665 22322222222222223356666666666554
No 188
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=51.71 E-value=40 Score=35.32 Aligned_cols=63 Identities=13% Similarity=0.068 Sum_probs=42.7
Q ss_pred HHHHHHHcCCHHHHHHHHHHhC-Ch-hHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCC
Q psy11525 113 LAGMATYSRDLATSEEAYAAIE-QV-DKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHG 176 (244)
Q Consensus 113 LA~~Al~~~~l~~A~~~y~~l~-d~-~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g 176 (244)
|-..||..-||+.|.-.=.+.. |+ ..+=||++++.+++.. ++-.|=.++|+|+.|-+.+.++|
T Consensus 864 Ly~~ALG~YDl~Lal~VAq~SQkDPKEYLPfL~~L~~l~~~~-rry~ID~hLkRy~kAL~~L~~~G 928 (928)
T PF04762_consen 864 LYDVALGTYDLELALMVAQQSQKDPKEYLPFLQELQKLPPLY-RRYKIDDHLKRYEKALRHLSACG 928 (928)
T ss_pred HHHHHhhhcCHHHHHHHHHHhccChHHHHHHHHHHHhCChhh-eeeeHhhhhCCHHHHHHHHHhhC
Confidence 4445666666666654443332 32 2456788888876544 77788889999999998888876
No 189
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=51.64 E-value=55 Score=26.74 Aligned_cols=18 Identities=22% Similarity=0.259 Sum_probs=15.7
Q ss_pred HHHcCCHHHHHHHHhccC
Q psy11525 88 YAASHSWPQALSLCRTLN 105 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~ 105 (244)
++..|+|.+|+++.+.+.
T Consensus 54 ~i~rg~w~eA~rvlr~l~ 71 (153)
T TIGR02561 54 LIARGNYDEAARILRELL 71 (153)
T ss_pred HHHcCCHHHHHHHHHhhh
Confidence 389999999999998874
No 190
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=51.53 E-value=25 Score=18.39 Aligned_cols=24 Identities=29% Similarity=0.621 Sum_probs=15.6
Q ss_pred cceeeecCCCceEEEE-eCCeEEEE
Q psy11525 2 VQSICWNTDVNILAAM-QDSALCVW 25 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~-~~~~l~vw 25 (244)
+.++.|+++.+.+++. .++.+.+|
T Consensus 15 i~~~~~~~~~~~~~~~~~d~~~~~~ 39 (40)
T smart00320 15 VTSVAFSPDGKYLASASDDGTIKLW 39 (40)
T ss_pred eeEEEECCCCCEEEEecCCCeEEEc
Confidence 5678888866665544 45666665
No 191
>KOG0307|consensus
Probab=51.29 E-value=1.1e+02 Score=32.42 Aligned_cols=109 Identities=12% Similarity=0.127 Sum_probs=62.0
Q ss_pred HHHHHHHHcCCHHHHHHHHhcc-------------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC-hhHH------HHH
Q psy11525 83 SVLHSYAASHSWPQALSLCRTL-------------NDDILWACLAGMATYSRDLATSEEAYAAIEQ-VDKV------MYI 142 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~-------------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d-~~~v------~~l 142 (244)
..+-+.+-.|++..|++.|-.- .++ +|.+-....+...+-......|+-+.. .+-+ .+-
T Consensus 492 ~~Is~alitgd~~~aV~~cl~~~~~a~AliiA~~gg~e-l~~~t~~~Y~~k~~~k~s~li~a~v~~d~~~~ve~~~~k~W 570 (1049)
T KOG0307|consen 492 GLISEALITGDFKSAVELCLEANKMADALIIAHAGGTE-LLESTRDKYLAKSNSKLSRLIYAMVNRDLDDYVETCEVKQW 570 (1049)
T ss_pred HHHHHHHHhccHHHHHHHHHhhhHHHHHHHHHhcCCHH-HHHHHHHHHHHHhCChHHHHHHHHHhhhHHHHHhhcchhhH
Confidence 4455556677777777776442 344 888877777776666656655554433 2111 111
Q ss_pred HH----hhCCCc--H-H---HHHHHHHHHcC-CHHHHHHHHHHCCCHHHHHHHHHHhcCHH
Q psy11525 143 NH----IKGIPV--K-A---AQQAEMYLLGG-NISEAESILLQHGLIFRAIQVSILTHNWD 192 (244)
Q Consensus 143 ~~----i~~~~~--~-~---~~~A~ia~~~G-~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~ 192 (244)
++ |-...+ + . ...|.-....| ....|.-.|+-+|.+++....|...++-+
T Consensus 571 ke~la~i~t~~~~~~~~elc~~Lg~rl~~~g~~~~~a~lcYi~agsv~k~v~~w~~~~~~~ 631 (1049)
T KOG0307|consen 571 KETLAAICTYAQTDEFSELCDMLGDRLENAGDLTSAAILCYICAGSVDKLVEIWLKALDLE 631 (1049)
T ss_pred HHHHHHHHHhcchhhHHHHHHHHHHHHhhccchhhhhhHHhhhccChhhhHHHHHHhcccc
Confidence 11 111111 0 0 12333334445 34667788999999999999888777765
No 192
>KOG0547|consensus
Probab=51.02 E-value=1.9e+02 Score=28.45 Aligned_cols=85 Identities=14% Similarity=0.009 Sum_probs=55.7
Q ss_pred CCcHHHHHHHHHHHcCCHHHHHHHHHHCCC---------------HHHHHHHHHHhcCHHHHHHHHHhc---CCchhHHH
Q psy11525 148 IPVKAAQQAEMYLLGGNISEAESILLQHGL---------------IFRAIQVSILTHNWDRALELALRH---KTHIDTVL 209 (244)
Q Consensus 148 ~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~---------------~~~Ai~m~~~l~~W~~Al~LA~~~---~~~~~~vl 209 (244)
.++-....|+++.-+++|+.|.+.|-++=+ +-+|+-..+--.++++|++|..+- ++.-+.-.
T Consensus 461 ~~Evy~~fAeiLtDqqqFd~A~k~YD~ai~LE~~~~~~~v~~~plV~Ka~l~~qwk~d~~~a~~Ll~KA~e~Dpkce~A~ 540 (606)
T KOG0547|consen 461 CPEVYNLFAEILTDQQQFDKAVKQYDKAIELEPREHLIIVNAAPLVHKALLVLQWKEDINQAENLLRKAIELDPKCEQAY 540 (606)
T ss_pred CchHHHHHHHHHhhHHhHHHHHHHHHHHHhhccccccccccchhhhhhhHhhhchhhhHHHHHHHHHHHHccCchHHHHH
Confidence 344557899999999999999999977632 335666666667778888877653 23212211
Q ss_pred H-HHHHHHHHcCCcccHHHHHhcc
Q psy11525 210 Y-QRKKYLDNLEKIETNEKFLRLQ 232 (244)
Q Consensus 210 ~-~r~~yl~~~~~~e~~~~f~~~~ 232 (244)
. +.+=-|++..-.|+++.|.+..
T Consensus 541 ~tlaq~~lQ~~~i~eAielFEksa 564 (606)
T KOG0547|consen 541 ETLAQFELQRGKIDEAIELFEKSA 564 (606)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHH
Confidence 1 2222235555589999999874
No 193
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=50.10 E-value=17 Score=19.90 Aligned_cols=16 Identities=25% Similarity=0.272 Sum_probs=7.4
Q ss_pred HHHHHHhcCHHHHHHH
Q psy11525 182 IQVSILTHNWDRALEL 197 (244)
Q Consensus 182 i~m~~~l~~W~~Al~L 197 (244)
|+.|...++|++|.++
T Consensus 7 i~~~~~~~~~~~a~~~ 22 (31)
T PF01535_consen 7 ISGYCKMGQFEEALEV 22 (31)
T ss_pred HHHHHccchHHHHHHH
Confidence 3444444444444444
No 194
>cd02576 PseudoU_synth_ScPUS7 PseudoU_synth_ScPUS7: Pseudouridine synthase, TruD family. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). Saccharomyces cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA), psi13 in cytoplasmic tRNAs and psi35 in pre-tRNATyr. Psi35 in yeast U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35.
Probab=49.99 E-value=28 Score=32.29 Aligned_cols=44 Identities=20% Similarity=0.129 Sum_probs=37.0
Q ss_pred HHHHHHcCCHHHHHHHHHHCC------CHHHHHHHHHHhcCHHHHHHHHH
Q psy11525 156 AEMYLLGGNISEAESILLQHG------LIFRAIQVSILTHNWDRALELAL 199 (244)
Q Consensus 156 A~ia~~~G~~~eAe~~~l~~g------~~~~Ai~m~~~l~~W~~Al~LA~ 199 (244)
.-.+++.|++++|..+++... ...+|-+++.+.++|++|+++.-
T Consensus 164 iG~~ll~~~~~~Av~~~l~~~~~~~~~~~~~~r~~~~~~~~~~~~l~~~p 213 (371)
T cd02576 164 IGIAILKENWKEAVDLILKPRSELEEGDLVEAREIWKETGDAKAALKKLP 213 (371)
T ss_pred HHHHHHhhhHHHHHHHHhCCCcccccHHHHHHHHHHHhcCCHHHHHHhch
Confidence 345677899999999999875 67888999999999999998763
No 195
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=49.68 E-value=3.7 Score=31.96 Aligned_cols=36 Identities=28% Similarity=0.311 Sum_probs=29.6
Q ss_pred CHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHH
Q psy11525 164 NISEAESILLQHGLIFRAIQVSILTHNWDRALELAL 199 (244)
Q Consensus 164 ~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~ 199 (244)
+.+.|.++..+.|.++.|+-+|..+|+|++|++++.
T Consensus 72 d~~~~~~~c~~~~l~~~a~~Ly~~~~~~~~al~i~~ 107 (143)
T PF00637_consen 72 DLDKALRLCEKHGLYEEAVYLYSKLGNHDEALEILH 107 (143)
T ss_dssp -CTHHHHHHHTTTSHHHHHHHHHCCTTHTTCSSTSS
T ss_pred CHHHHHHHHHhcchHHHHHHHHHHcccHHHHHHHHH
Confidence 567888888888999999999999999999888633
No 196
>KOG2002|consensus
Probab=49.54 E-value=53 Score=34.40 Aligned_cols=81 Identities=12% Similarity=0.224 Sum_probs=49.7
Q ss_pred HHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHH--HHHHHhhCCCcH----HHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKV--MYINHIKGIPVK----AAQQAEMYLL 161 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v--~~l~~i~~~~~~----~~~~A~ia~~ 161 (244)
|...|+++.+|.++...-..+ ...++.+..+----|||-+.||+++. .|.+.++.-++. ....|.+.++
T Consensus 280 fyfK~dy~~v~~la~~ai~~t-----~~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~GlgQm~i~ 354 (1018)
T KOG2002|consen 280 FYFKKDYERVWHLAEHAIKNT-----ENKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYIK 354 (1018)
T ss_pred HhhcccHHHHHHHHHHHHHhh-----hhhHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHccCCCCccccccchhHHHHH
Confidence 488899999998887742221 22233333333445677777777665 445555544443 2447788888
Q ss_pred cCCHHHHHHHHH
Q psy11525 162 GGNISEAESILL 173 (244)
Q Consensus 162 ~G~~~eAe~~~l 173 (244)
.|++++|...|-
T Consensus 355 ~~dle~s~~~fE 366 (1018)
T KOG2002|consen 355 RGDLEESKFCFE 366 (1018)
T ss_pred hchHHHHHHHHH
Confidence 888888865553
No 197
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=48.89 E-value=38 Score=18.86 Aligned_cols=17 Identities=24% Similarity=0.257 Sum_probs=7.7
Q ss_pred HHHHHHHhcCHHHHHHH
Q psy11525 181 AIQVSILTHNWDRALEL 197 (244)
Q Consensus 181 Ai~m~~~l~~W~~Al~L 197 (244)
.+..|...++|++|+++
T Consensus 7 ll~a~~~~g~~~~a~~~ 23 (34)
T PF13812_consen 7 LLRACAKAGDPDAALQL 23 (34)
T ss_pred HHHHHHHCCCHHHHHHH
Confidence 34444444444444443
No 198
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=48.75 E-value=44 Score=21.48 Aligned_cols=35 Identities=29% Similarity=0.455 Sum_probs=22.7
Q ss_pred HHHhcCHHHHHHHHHhcCCch------hHHHHHHHHHHHHc
Q psy11525 185 SILTHNWDRALELALRHKTHI------DTVLYQRKKYLDNL 219 (244)
Q Consensus 185 ~~~l~~W~~Al~LA~~~~~~~------~~vl~~r~~yl~~~ 219 (244)
-...|+|+.|++.+..+.+.+ -...-++++|++-.
T Consensus 11 ~i~~g~~~~a~~~~~~~~~~l~~~~~~l~f~L~~q~~lell 51 (58)
T smart00668 11 LILKGDWDEALEWLSSLKPPLLERNSKLEFELRKQKFLELV 51 (58)
T ss_pred HHHcCCHHHHHHHHHHcCHHHhccCCCchhHHHHHHHHHHH
Confidence 355789999999998876532 23344667777433
No 199
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=48.44 E-value=24 Score=34.54 Aligned_cols=42 Identities=24% Similarity=0.224 Sum_probs=38.9
Q ss_pred HHHcCCHHHHHHHHhccCCHHHHHHHHHHHHHcCCHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTLNDDILWACLAGMATYSRDLATSEEA 129 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~~~~l~~A~~~ 129 (244)
.+..++|+.|+.+|--+....++..+=-.|+..|++..|+.|
T Consensus 504 LLR~~rfekAFlLAvdi~~~DLFmdlh~~A~~~ge~~La~~A 545 (545)
T PF11768_consen 504 LLRYQRFEKAFLLAVDIGDRDLFMDLHYLAKDKGELALAEVA 545 (545)
T ss_pred HHHhhHHHHHHHHHHhccchHHHHHHHHHHHhccchhhhhcC
Confidence 399999999999999999999999999999999999988753
No 200
>KOG0553|consensus
Probab=48.34 E-value=72 Score=28.98 Aligned_cols=78 Identities=12% Similarity=0.009 Sum_probs=44.9
Q ss_pred HHHHHHHHcCCHHHHHHHHHHCCCHH--HHH------HHHHHhcCHHHHHHHHHh---cCCc-hhHHHHHHHHHHHHcCC
Q psy11525 154 QQAEMYLLGGNISEAESILLQHGLIF--RAI------QVSILTHNWDRALELALR---HKTH-IDTVLYQRKKYLDNLEK 221 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~~g~~~--~Ai------~m~~~l~~W~~Al~LA~~---~~~~-~~~vl~~r~~yl~~~~~ 221 (244)
..|.-++-.++|++|...|-++=..+ .|| .-|..||+++.|++=+++ .+++ .+.-.++-.-|+-..+-
T Consensus 86 ~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~~gk~ 165 (304)
T KOG0553|consen 86 NEGNKLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLALGKY 165 (304)
T ss_pred HHHHHHHHhhhHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcchHHHHHHHHHHHhcChHHHHHHHHHHHHHHccCcH
Confidence 35667777788888888776652221 221 346788888888766554 4444 33333333444444444
Q ss_pred cccHHHHHhc
Q psy11525 222 IETNEKFLRL 231 (244)
Q Consensus 222 ~e~~~~f~~~ 231 (244)
.++++.|+|+
T Consensus 166 ~~A~~aykKa 175 (304)
T KOG0553|consen 166 EEAIEAYKKA 175 (304)
T ss_pred HHHHHHHHhh
Confidence 5666665555
No 201
>PF04097 Nic96: Nup93/Nic96; InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=48.27 E-value=1e+02 Score=30.65 Aligned_cols=101 Identities=12% Similarity=0.127 Sum_probs=61.9
Q ss_pred cHHHHHHH---HHcCCHHHHHHHHhcc---C----CHHHHHHHHHHHHHcCCHHHHHHHHHHhCCh--hHH---HHHHH-
Q psy11525 81 YISVLHSY---AASHSWPQALSLCRTL---N----DDILWACLAGMATYSRDLATSEEAYAAIEQV--DKV---MYINH- 144 (244)
Q Consensus 81 ~~~~l~~~---l~~~~~~~A~~~~~~~---~----~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~--~~v---~~l~~- 144 (244)
+...+.+| .+..+-.+|.++.-.+ . ....+.++.+..+++++++ .-+|++ ++. ..|++
T Consensus 327 ~arLI~~Y~~~F~~td~~~Al~Y~~li~~~~~~~~~~l~~~~l~eLvletref~------~LLG~i~~dG~r~~G~i~~~ 400 (613)
T PF04097_consen 327 FARLIGQYTRSFEITDPREALQYLYLICLFKDPEQRNLFHECLRELVLETREFD------LLLGDINPDGSRTPGLIERR 400 (613)
T ss_dssp HHHHHHHHHHTTTTT-HHHHHHHHHGGGGS-SCCHHHHHHHHHHHHHHHH--HH------HHHEEE-TTS-EEE-HHHHT
T ss_pred HHHHHHHHHHHHhccCHHHHHHHHHHHHHcCCchHHHHHHHHHHHHHHccCCHH------HHCCCCCCCCccccceeecc
Confidence 44556666 6677888888876554 2 2567999999999999875 334442 221 23444
Q ss_pred --hhCCC---c----HHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHH
Q psy11525 145 --IKGIP---V----KAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSIL 187 (244)
Q Consensus 145 --i~~~~---~----~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~ 187 (244)
+.+.. + -....|.-+...|++++|-.+|.-+|+++.++++...
T Consensus 401 ~~Li~~~~~~~~~~~i~~~~A~~~e~~g~~~dAi~Ly~La~~~d~vl~lln~ 452 (613)
T PF04097_consen 401 LSLIKFDDDEDFLREIIEQAAREAEERGRFEDAILLYHLAEEYDKVLSLLNR 452 (613)
T ss_dssp GGGGT-SSSSHHHHHHHHHHHHHHHHCT-HHHHHHHHHHTT-HHHHHHHHHH
T ss_pred ccccCCCCcHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHhhHHHHHHHHHH
Confidence 22222 1 1245777888888888888888888888888887654
No 202
>PRK15180 Vi polysaccharide biosynthesis protein TviD; Provisional
Probab=47.57 E-value=1.3e+02 Score=29.50 Aligned_cols=80 Identities=11% Similarity=0.021 Sum_probs=52.3
Q ss_pred HHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH------CCCHHHHHHHH--
Q psy11525 114 AGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQ------HGLIFRAIQVS-- 185 (244)
Q Consensus 114 A~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~------~g~~~~Ai~m~-- 185 (244)
-..++..|++..|.+-.. ..|++-...++...+.+.|...+|.|+.|-+..-- +|.--+-+.|+
T Consensus 296 i~k~~~~gd~~aas~~~~--------~~lr~~~~~p~~i~l~~~i~~~lg~ye~~~~~~s~~~~~~~s~~~~~~~~~r~~ 367 (831)
T PRK15180 296 ITKQLADGDIIAASQQLF--------AALRNQQQDPVLIQLRSVIFSHLGYYEQAYQDISDVEKIIGTTDSTLRCRLRSL 367 (831)
T ss_pred HHHHhhccCHHHHHHHHH--------HHHHhCCCCchhhHHHHHHHHHhhhHHHHHHHhhchhhhhcCCchHHHHHHHhh
Confidence 345666778766643221 22455555677777889999999999887665532 33333334444
Q ss_pred HHhcCHHHHHHHHHhc
Q psy11525 186 ILTHNWDRALELALRH 201 (244)
Q Consensus 186 ~~l~~W~~Al~LA~~~ 201 (244)
..|++|++|+.+|.-.
T Consensus 368 ~~l~r~~~a~s~a~~~ 383 (831)
T PRK15180 368 HGLARWREALSTAEMM 383 (831)
T ss_pred hchhhHHHHHHHHHHH
Confidence 4689999999999763
No 203
>PF12854 PPR_1: PPR repeat
Probab=46.77 E-value=33 Score=20.09 Aligned_cols=25 Identities=16% Similarity=0.375 Sum_probs=20.4
Q ss_pred ccHHHHHHHHHcCCHHHHHHHHhcc
Q psy11525 80 PYISVLHSYAASHSWPQALSLCRTL 104 (244)
Q Consensus 80 p~~~~l~~~l~~~~~~~A~~~~~~~ 104 (244)
.|...+.-|.+.|++++|+++-+.+
T Consensus 9 ty~~lI~~~Ck~G~~~~A~~l~~~M 33 (34)
T PF12854_consen 9 TYNTLIDGYCKAGRVDEAFELFDEM 33 (34)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHhC
Confidence 4667777889999999999987765
No 204
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=45.95 E-value=2.1e+02 Score=27.34 Aligned_cols=92 Identities=12% Similarity=0.111 Sum_probs=61.8
Q ss_pred HcCCHHHHHHHHhcc----CCHHHHHH-HHHHHHHcCCHHHHHHHHHHhCC-hhHHHHHHHhhCCCcHHHHHHHHHHHcC
Q psy11525 90 ASHSWPQALSLCRTL----NDDILWAC-LAGMATYSRDLATSEEAYAAIEQ-VDKVMYINHIKGIPVKAAQQAEMYLLGG 163 (244)
Q Consensus 90 ~~~~~~~A~~~~~~~----~~~~lW~~-LA~~Al~~~~l~~A~~~y~~l~d-~~~v~~l~~i~~~~~~~~~~A~ia~~~G 163 (244)
...+.+.|-++.... ++..+|.- -|+.....++++.|..+|.+.-+ -.....++.+ -.-..|-..++++
T Consensus 245 ~~~~~~~a~~lL~~~~~~yP~s~lfl~~~gR~~~~~g~~~~Ai~~~~~a~~~q~~~~Ql~~l-----~~~El~w~~~~~~ 319 (468)
T PF10300_consen 245 EDVPLEEAEELLEEMLKRYPNSALFLFFEGRLERLKGNLEEAIESFERAIESQSEWKQLHHL-----CYFELAWCHMFQH 319 (468)
T ss_pred cCCCHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHhcCHHHHHHHHHHhccchhhHHhHHHH-----HHHHHHHHHHHHc
Confidence 456677777776664 67888887 57888899999999999996543 2222222221 1123577888899
Q ss_pred CHHHHHHHHHHC---CCHHHHHHHHH
Q psy11525 164 NISEAESILLQH---GLIFRAIQVSI 186 (244)
Q Consensus 164 ~~~eAe~~~l~~---g~~~~Ai~m~~ 186 (244)
++++|.+.|.+. +.+-.|+-.|.
T Consensus 320 ~w~~A~~~f~~L~~~s~WSka~Y~Y~ 345 (468)
T PF10300_consen 320 DWEEAAEYFLRLLKESKWSKAFYAYL 345 (468)
T ss_pred hHHHHHHHHHHHHhccccHHHHHHHH
Confidence 999998887654 56666655544
No 205
>smart00685 DM14 Repeats in fly CG4713, worm Y37H9A.3 and human FLJ20241.
Probab=45.07 E-value=63 Score=22.05 Aligned_cols=41 Identities=15% Similarity=0.070 Sum_probs=33.0
Q ss_pred cCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 162 GGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 162 ~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
+.+|..|+.--.+.|+...|-++-+-...++.|++.++...
T Consensus 8 ~~~yk~Aa~~AK~~gd~~kAr~~~R~~K~~~~~I~~~~aG~ 48 (59)
T smart00685 8 QEQYKQAALQAKRAGDEEKARRHLRIAKQFDDAIKAARAGR 48 (59)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHhhHHHHHHHHHCCC
Confidence 34678888888888888888888888888888888776644
No 206
>KOG1128|consensus
Probab=44.99 E-value=77 Score=32.26 Aligned_cols=103 Identities=14% Similarity=0.039 Sum_probs=0.0
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHh--CChhHHHHHHHhhCCCcHHHHHHHHHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAI--EQVDKVMYINHIKGIPVKAAQQAEMYL 160 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l--~d~~~v~~l~~i~~~~~~~~~~A~ia~ 160 (244)
++..++|++|.+..+.- -...-|-.+|..|++.++...|..||.++ ++++.....+.+ +..++
T Consensus 495 ~~~~~~fs~~~~hle~sl~~nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~eaWnNl----------s~ayi 564 (777)
T KOG1128|consen 495 ILSNKDFSEADKHLERSLEINPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEAWNNL----------STAYI 564 (777)
T ss_pred cccchhHHHHHHHHHHHhhcCccchhHHHhccHHHHHHhhhHHHHHHHHHHhhcCCCchhhhhhh----------hHHHH
Q ss_pred HcCCHHHHHHHHHHCCCHHHH--------HHHHHHhcCHHHHHHHHHh
Q psy11525 161 LGGNISEAESILLQHGLIFRA--------IQVSILTHNWDRALELALR 200 (244)
Q Consensus 161 ~~G~~~eAe~~~l~~g~~~~A--------i~m~~~l~~W~~Al~LA~~ 200 (244)
..|+-.+|-..+.++.....= +-.-.|.+.|++|++.+.+
T Consensus 565 ~~~~k~ra~~~l~EAlKcn~~~w~iWENymlvsvdvge~eda~~A~~r 612 (777)
T KOG1128|consen 565 RLKKKKRAFRKLKEALKCNYQHWQIWENYMLVSVDVGEFEDAIKAYHR 612 (777)
T ss_pred HHhhhHHHHHHHHHHhhcCCCCCeeeechhhhhhhcccHHHHHHHHHH
No 207
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=44.91 E-value=24 Score=19.50 Aligned_cols=18 Identities=28% Similarity=0.279 Sum_probs=9.2
Q ss_pred HHHHHHHcCCHHHHHHHH
Q psy11525 155 QAEMYLLGGNISEAESIL 172 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~ 172 (244)
+|.+....|++++|.+.|
T Consensus 6 ~a~~~~~~g~~~~A~~~~ 23 (33)
T PF13174_consen 6 LARCYYKLGDYDEAIEYF 23 (33)
T ss_dssp HHHHHHHHCHHHHHHHHH
T ss_pred HHHHHHHccCHHHHHHHH
Confidence 444555555555555544
No 208
>KOG1129|consensus
Probab=44.43 E-value=2.4e+02 Score=26.45 Aligned_cols=106 Identities=16% Similarity=0.020 Sum_probs=66.6
Q ss_pred HHHHHHhCChhHHHHHHHhhC------CCcH---------HHHHHHHHHHcCCHHHHHHHHHHCC----CHH---HHHHH
Q psy11525 127 EEAYAAIEQVDKVMYINHIKG------IPVK---------AAQQAEMYLLGGNISEAESILLQHG----LIF---RAIQV 184 (244)
Q Consensus 127 ~~~y~~l~d~~~v~~l~~i~~------~~~~---------~~~~A~ia~~~G~~~eAe~~~l~~g----~~~---~Ai~m 184 (244)
+-.|.+..|+.++..+-.... ...+ ..++|.+++-+|.+.+||+-|.++= .++ .--+.
T Consensus 186 ey~fyhenDv~~aH~~~~~~~~~~~a~~s~~~~~~~dwwWk~Q~gkCylrLgm~r~AekqlqssL~q~~~~dTfllLskv 265 (478)
T KOG1129|consen 186 EYLFYHENDVQKAHSLCQAVLEVERAKPSGSTGCTLDWWWKQQMGKCYLRLGMPRRAEKQLQSSLTQFPHPDTFLLLSKV 265 (478)
T ss_pred HHHHHhhhhHHHHHHHHHHHHHHHhccccccccchHhHHHHHHHHHHHHHhcChhhhHHHHHHHhhcCCchhHHHHHHHH
Confidence 456778888877764322211 1111 2568999999999999999887662 222 34578
Q ss_pred HHHhcCHHHHHHHHHhcC---C-chhHHHHHHHHHHHHcCC-cccHHHHHhccc
Q psy11525 185 SILTHNWDRALELALRHK---T-HIDTVLYQRKKYLDNLEK-IETNEKFLRLQS 233 (244)
Q Consensus 185 ~~~l~~W~~Al~LA~~~~---~-~~~~vl~~r~~yl~~~~~-~e~~~~f~~~~~ 233 (244)
|..+.+-++||.+-..-= | ++.-++.. ++-.+.+++ +++.+.|+...+
T Consensus 266 Y~ridQP~~AL~~~~~gld~fP~~VT~l~g~-ARi~eam~~~~~a~~lYk~vlk 318 (478)
T KOG1129|consen 266 YQRIDQPERALLVIGEGLDSFPFDVTYLLGQ-ARIHEAMEQQEDALQLYKLVLK 318 (478)
T ss_pred HHHhccHHHHHHHHhhhhhcCCchhhhhhhh-HHHHHHHHhHHHHHHHHHHHHh
Confidence 999999999998876521 1 23333333 334455555 566677766654
No 209
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=42.81 E-value=1.7e+02 Score=26.06 Aligned_cols=85 Identities=15% Similarity=0.095 Sum_probs=50.4
Q ss_pred HHHHHHHHcCCHHHHHHHHhcc----------CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHH
Q psy11525 83 SVLHSYAASHSWPQALSLCRTL----------NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKA 152 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~----------~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~ 152 (244)
..-+.++.+|+|.+|-..-... .+...| ||+.-...++++-|...|.++- +-.-+-.+-+|..
T Consensus 146 ~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yW--LGe~~y~qg~y~~Aa~~f~~~~-----k~~P~s~KApdal 218 (262)
T COG1729 146 NAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYW--LGESLYAQGDYEDAAYIFARVV-----KDYPKSPKAPDAL 218 (262)
T ss_pred HHHHHHHHcCCHHHHHHHHHHHHHcCCCCcccchhHHH--HHHHHHhcccchHHHHHHHHHH-----HhCCCCCCChHHH
Confidence 4445568888888887755443 234555 5666666666666666665421 1111111123344
Q ss_pred HHHHHHHHHcCCHHHHHHHHHH
Q psy11525 153 AQQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~ 174 (244)
.+.|.+..-+|+-++|-..|-+
T Consensus 219 lKlg~~~~~l~~~d~A~atl~q 240 (262)
T COG1729 219 LKLGVSLGRLGNTDEACATLQQ 240 (262)
T ss_pred HHHHHHHHHhcCHHHHHHHHHH
Confidence 6778888888888888777654
No 210
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=42.29 E-value=1.6e+02 Score=22.74 Aligned_cols=20 Identities=30% Similarity=0.096 Sum_probs=11.0
Q ss_pred HHHHHHHcCCHHHHHHHHHH
Q psy11525 155 QAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 155 ~A~ia~~~G~~~eAe~~~l~ 174 (244)
.|..+...|+.+||-..++.
T Consensus 81 ~Al~L~~~gr~~eAl~~~l~ 100 (120)
T PF12688_consen 81 LALALYNLGRPKEALEWLLE 100 (120)
T ss_pred HHHHHHHCCCHHHHHHHHHH
Confidence 34444455666666665554
No 211
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=41.68 E-value=91 Score=30.46 Aligned_cols=73 Identities=15% Similarity=0.109 Sum_probs=49.6
Q ss_pred HHHHHhCChhHH-HHHHHhhCC----CcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHH--------HHHHHHhcCHHHH
Q psy11525 128 EAYAAIEQVDKV-MYINHIKGI----PVKAAQQAEMYLLGGNISEAESILLQHGLIFRA--------IQVSILTHNWDRA 194 (244)
Q Consensus 128 ~~y~~l~d~~~v-~~l~~i~~~----~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~A--------i~m~~~l~~W~~A 194 (244)
.+|-.+|++... .+|++.... .+.....|.|+-..|++++|.+.+-.+...|.| ..-....|+-++|
T Consensus 202 qhyd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilKh~G~~~~Aa~~~~~Ar~LD~~DRyiNsK~aKy~LRa~~~e~A 281 (517)
T PF12569_consen 202 QHYDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILKHAGDLKEAAEAMDEARELDLADRYINSKCAKYLLRAGRIEEA 281 (517)
T ss_pred HHHHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHhCChhhHHHHHHHHHHHHHCCCHHHH
Confidence 345666666543 556655544 334567899999999999999877766555443 5556667777777
Q ss_pred HHHHHh
Q psy11525 195 LELALR 200 (244)
Q Consensus 195 l~LA~~ 200 (244)
.+++..
T Consensus 282 ~~~~~~ 287 (517)
T PF12569_consen 282 EKTASL 287 (517)
T ss_pred HHHHHh
Confidence 777765
No 212
>PF13041 PPR_2: PPR repeat family
Probab=41.65 E-value=34 Score=21.49 Aligned_cols=17 Identities=12% Similarity=-0.046 Sum_probs=7.8
Q ss_pred HHHHCCCHHHHHHHHHH
Q psy11525 171 ILLQHGLIFRAIQVSIL 187 (244)
Q Consensus 171 ~~l~~g~~~~Ai~m~~~ 187 (244)
.|.++|++++|.+++.+
T Consensus 12 ~~~~~~~~~~a~~l~~~ 28 (50)
T PF13041_consen 12 GYCKAGKFEEALKLFKE 28 (50)
T ss_pred HHHHCcCHHHHHHHHHH
Confidence 34444444444444443
No 213
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=41.11 E-value=3.5e+02 Score=26.30 Aligned_cols=78 Identities=18% Similarity=0.140 Sum_probs=58.0
Q ss_pred HHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc---HHHHHHHH
Q psy11525 87 SYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV---KAAQQAEM 158 (244)
Q Consensus 87 ~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~---~~~~~A~i 158 (244)
++...|++++|......+ ++.-.|...+...++.++..-|...|.+.-+. .+. -....|..
T Consensus 315 ~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l-----------~P~~~~l~~~~a~a 383 (484)
T COG4783 315 QTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALAL-----------DPNSPLLQLNLAQA 383 (484)
T ss_pred HHHHhcccchHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCChHHHHHHHHHHHhc-----------CCCccHHHHHHHHH
Confidence 346678888888877665 67888888999999999998888777653211 122 12457889
Q ss_pred HHHcCCHHHHHHHHHHC
Q psy11525 159 YLLGGNISEAESILLQH 175 (244)
Q Consensus 159 a~~~G~~~eAe~~~l~~ 175 (244)
++..|+++||..++...
T Consensus 384 ll~~g~~~eai~~L~~~ 400 (484)
T COG4783 384 LLKGGKPQEAIRILNRY 400 (484)
T ss_pred HHhcCChHHHHHHHHHH
Confidence 99999999999888764
No 214
>KOG4499|consensus
Probab=40.12 E-value=1.3e+02 Score=26.74 Aligned_cols=78 Identities=9% Similarity=0.088 Sum_probs=47.2
Q ss_pred ceeeecCCCceEEEEeCCeEEE--EE--cCCccccCccccee-eeEeeccCCCCC--------CCeEEeEeCCEEEEEec
Q psy11525 3 QSICWNTDVNILAAMQDSALCV--WF--FPAVVFADQGLLRK-TVLLKDIGEFGK--------SPSIVSFVKNHLTIRRY 69 (244)
Q Consensus 3 ~~v~W~~~~~vlv~~~~~~l~v--w~--~~~~~~~~~~~l~~-~~~~~~~~~~~~--------~~~~v~f~g~~v~~~~~ 69 (244)
+.++|+.+.-.|-.++..+..| |= +|.-...+|..+-. .+.+...+.+|+ +..+..|.|++|.-..+
T Consensus 161 Ngl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~~V~~~dp 240 (310)
T KOG4499|consen 161 NGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGGTVQKVDP 240 (310)
T ss_pred ccccccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCcEEEEECC
Confidence 5688999999999999888888 65 56555555554421 111111112222 22466688888877765
Q ss_pred -CCcEEEeecCc
Q psy11525 70 -DGTVINYPISP 80 (244)
Q Consensus 70 -~G~~~~~~~~p 80 (244)
+|+++.+...|
T Consensus 241 ~tGK~L~eiklP 252 (310)
T KOG4499|consen 241 TTGKILLEIKLP 252 (310)
T ss_pred CCCcEEEEEEcC
Confidence 67776654433
No 215
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=39.65 E-value=71 Score=26.27 Aligned_cols=51 Identities=18% Similarity=0.116 Sum_probs=30.9
Q ss_pred HHHcCCHHHHHHHHhccC--CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCC
Q psy11525 88 YAASHSWPQALSLCRTLN--DDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGI 148 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~--~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~ 148 (244)
+|..|+|.+|.++.+.+. .+..|.+-|-. ..|+..+||..-=.+-..+.+.
T Consensus 54 ~i~r~~w~dA~rlLr~l~~~~~~~p~~kALl----------A~CL~~~~D~~Wr~~A~evle~ 106 (160)
T PF09613_consen 54 HIVRGDWDDALRLLRELEERAPGFPYAKALL----------ALCLYALGDPSWRRYADEVLES 106 (160)
T ss_pred HHHhCCHHHHHHHHHHHhccCCCChHHHHHH----------HHHHHHcCChHHHHHHHHHHhc
Confidence 499999999999999873 33334333322 3455666665544444444443
No 216
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=39.58 E-value=65 Score=30.35 Aligned_cols=43 Identities=16% Similarity=0.163 Sum_probs=36.0
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHH
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYA 131 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~ 131 (244)
+..++++.|..+|+.. ..-.-|..||+..++.++++-|.-+.-
T Consensus 245 l~k~~~~lAL~iAk~av~lsP~~f~~W~~La~~Yi~~~d~e~ALlaLN 292 (395)
T PF09295_consen 245 LSKKKYELALEIAKKAVELSPSEFETWYQLAECYIQLGDFENALLALN 292 (395)
T ss_pred HhcCCHHHHHHHHHHHHHhCchhHHHHHHHHHHHHhcCCHHHHHHHHh
Confidence 8999999999999886 356789999999999999987764433
No 217
>PF11349 DUF3151: Protein of unknown function (DUF3151); InterPro: IPR014487 This group represents an uncharacterised conserved protein.
Probab=38.80 E-value=70 Score=25.23 Aligned_cols=87 Identities=17% Similarity=0.105 Sum_probs=56.1
Q ss_pred HHHHHHHHHcCCHHHHHHHHhc-cCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhh----------CCCc
Q psy11525 82 ISVLHSYAASHSWPQALSLCRT-LNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIK----------GIPV 150 (244)
Q Consensus 82 ~~~l~~~l~~~~~~~A~~~~~~-~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~----------~~~~ 150 (244)
+.....-+..|.=. -.++.. -.....|..||+.|+..++. ++.-+|++.|==..+..|++-= +.++
T Consensus 12 d~~a~~~l~~g~~~--~~vaa~~P~ss~aWA~LAe~al~~g~~-v~AYAyARTGYHRGLD~LRr~GWkG~GPVPw~HePN 88 (129)
T PF11349_consen 12 DPAARAALAAGADP--AEVAAAHPASSLAWAALAEEALAAGRP-VTAYAYARTGYHRGLDQLRRNGWKGHGPVPWSHEPN 88 (129)
T ss_pred CHHHHHHHHcCCCH--HHHHHHCCCchHHHHHHHHHHHhCCCc-hhhhhhhhccccccHHHHHHCCCCCCCCCCCccCCc
Confidence 33333445555433 334444 46788999999999999965 7788999999777777777652 1122
Q ss_pred HH-----HHHHHHHHHcCCHHHHHHH
Q psy11525 151 KA-----AQQAEMYLLGGNISEAESI 171 (244)
Q Consensus 151 ~~-----~~~A~ia~~~G~~~eAe~~ 171 (244)
.- .-.|+-+...|.-+|++.+
T Consensus 89 rGfLRal~aLa~AA~~IGE~dE~~Rc 114 (129)
T PF11349_consen 89 RGFLRALAALARAAQAIGETDEYDRC 114 (129)
T ss_pred cHHHHHHHHHHHHHHHhCChhHHHHH
Confidence 11 2245666677877777654
No 218
>PF08625 Utp13: Utp13 specific WD40 associated domain; InterPro: IPR013934 A large ribonuclear protein complex is required for the processing of the small-ribosomal-subunit rRNA - the small-subunit (SSU) processome [, ]. This preribosomal complex contains the U3 snoRNA and at least 40 proteins, which have the following properties: They are nucleolar. They are able to coimmunoprecipitate with the U3 snoRNA and Mpp10 (a protein specific to the SSU processome). They are required for 18S rRNA biogenesis. There appears to be a linkage between polymerase I transcription and the formation of the SSU processome; as some, but not all, of the SSU processome components are required for pre-rRNA transcription initiation. These SSU processome components have been termed t-Utps. They form a pre-complex with pre-18S rRNA in the absence of snoRNA U3 and other SSU processome components. It has been proposed that the t-Utp complex proteins are both rDNA and rRNA binding proteins that are involved in the initiation of pre18S rRNA transcription. Initially binding to rDNA then associating with the 5' end of the nascent pre18S rRNA. The t-Utpcomplex forms the nucleus around which the rest of the SSU processome components, including snoRNA U3, assemble []. From electron microscopy the SSU processome may correspond to the terminal knobs visualized at the 5' ends of nascent 18S rRNA. Utp13 is a nucleolar protein and component of the small subunit (SSU) processome containing the U3 snoRNA that is involved in processing of pre-18S rRNA []. Upt13 is also a component of the Pwp2 complex that forms part of a stable particle subunit independent of the U3 small nucleolar ribonucleoprotein that is essential for the initial assembly steps of the 90S pre-ribosome []. Components of the Pwp2 complex are: Utp1 (Pwp2), Utp6, Utp12 (Dip2), Utp13, Utp18, and Utp21. The relationship between the Pwp2 complex and the t-Utps complex [] that also associates with the 5' end of nascent pre-18S rRNA is unclear. ; GO: 0006364 rRNA processing, 0032040 small-subunit processome
Probab=38.62 E-value=2e+02 Score=22.91 Aligned_cols=65 Identities=14% Similarity=0.186 Sum_probs=42.9
Q ss_pred HHHHHHHHcCCHHHHHHHHhccCC-HHHHHHHHHHH---HHcCCH--HHHHHHHHHhCChhHHHHHHHhhC
Q psy11525 83 SVLHSYAASHSWPQALSLCRTLND-DILWACLAGMA---TYSRDL--ATSEEAYAAIEQVDKVMYINHIKG 147 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~~~-~~lW~~LA~~A---l~~~~l--~~A~~~y~~l~d~~~v~~l~~i~~ 147 (244)
..|..++..|+|.+|+.+|-.++. ..+...+-... -..... ..-..+-..+.+-.....++.++.
T Consensus 2 Q~L~N~l~~~~y~~Al~LAl~L~~P~~ll~i~~~~~~~~~~~~~~g~~~l~~~i~~L~~~~l~~LL~~ir~ 72 (141)
T PF08625_consen 2 QELSNLLRQKDYKEALRLALKLDHPFRLLKILKDLLETEEDEDSIGSEELDEVIKKLDDEQLEKLLRFIRD 72 (141)
T ss_pred chHHHHHHhhhHHHHHHHHHhcCCcHHHHHHHHHHHhcccccccchHHHHHHHHHhcCHHHHHHHHHHHHH
Confidence 357889999999999999988865 45555554444 222221 334456667777667677776664
No 219
>KOG4234|consensus
Probab=38.48 E-value=1.5e+02 Score=25.82 Aligned_cols=47 Identities=21% Similarity=0.382 Sum_probs=27.3
Q ss_pred HHHhcCHHHHHHHHHh---cCCchhHHHHHHHHHHHHcCC-cccHHHHHhc
Q psy11525 185 SILTHNWDRALELALR---HKTHIDTVLYQRKKYLDNLEK-IETNEKFLRL 231 (244)
Q Consensus 185 ~~~l~~W~~Al~LA~~---~~~~~~~vl~~r~~yl~~~~~-~e~~~~f~~~ 231 (244)
++.|+.|+.|++-+.+ .++....-|.+|+..++++.+ +++.+-|++.
T Consensus 144 ~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAeayek~ek~eealeDyKki 194 (271)
T KOG4234|consen 144 LIKLRKWESAIEDCSKAIELNPTYEKALERRAEAYEKMEKYEEALEDYKKI 194 (271)
T ss_pred HHHhhhHHHHHHHHHhhHhcCchhHHHHHHHHHHHHhhhhHHHHHHHHHHH
Confidence 4567777777765544 344434445566666665544 5666666654
No 220
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=37.86 E-value=1.9e+02 Score=24.33 Aligned_cols=77 Identities=17% Similarity=0.234 Sum_probs=41.0
Q ss_pred cceeeecCCCceEEEEeCCeEEEEEcCCccccCcc-cceeeeEeeccC----------CCCCCCeEEeEeCCEEEEEecC
Q psy11525 2 VQSICWNTDVNILAAMQDSALCVWFFPAVVFADQG-LLRKTVLLKDIG----------EFGKSPSIVSFVKNHLTIRRYD 70 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~~~~l~vw~~~~~~~~~~~-~l~~~~~~~~~~----------~~~~~~~~v~f~g~~v~~~~~~ 70 (244)
.+.++|++|.+.|...+..+-.+|.++-.. +.. +-....+..... +-..+.-+..+.++.|.+.+++
T Consensus 136 pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~--~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~~~~~I~~~~p~ 213 (246)
T PF08450_consen 136 PNGIAFSPDGKTLYVADSFNGRIWRFDLDA--DGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADWGGGRIVVFDPD 213 (246)
T ss_dssp EEEEEEETTSSEEEEEETTTTEEEEEEEET--TTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEETTTEEEEEETT
T ss_pred ccceEECCcchheeecccccceeEEEeccc--cccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEcCCCEEEEECCC
Confidence 367899999887765554444456555432 111 111111111000 1112334556778999999999
Q ss_pred CcEEEeecCc
Q psy11525 71 GTVINYPISP 80 (244)
Q Consensus 71 G~~~~~~~~p 80 (244)
|+++.....|
T Consensus 214 G~~~~~i~~p 223 (246)
T PF08450_consen 214 GKLLREIELP 223 (246)
T ss_dssp SCEEEEEE-S
T ss_pred ccEEEEEcCC
Confidence 9987654433
No 221
>PF02736 Myosin_N: Myosin N-terminal SH3-like domain; InterPro: IPR004009 This domain has an SH3-like fold. It is found at the N terminus of many but not all myosins. The function of this domain is unknown.; GO: 0003774 motor activity, 0005524 ATP binding, 0016459 myosin complex; PDB: 2EC6_A 2W4H_M 1O1E_P 1O1D_D 1O18_A 1O1C_P 1O1B_D 1O1F_A 2W4A_M 2W4G_M ....
Probab=36.47 E-value=88 Score=19.42 Aligned_cols=22 Identities=18% Similarity=0.344 Sum_probs=18.4
Q ss_pred eEEeEeCCEEEEEecCCcEEEe
Q psy11525 55 SIVSFVKNHLTIRRYDGTVINY 76 (244)
Q Consensus 55 ~~v~f~g~~v~~~~~~G~~~~~ 76 (244)
.++...|+.++++..||+.++.
T Consensus 17 ~I~~~~g~~vtV~~~~G~~~tv 38 (42)
T PF02736_consen 17 EIIEEEGDKVTVKTEDGKEVTV 38 (42)
T ss_dssp EEEEEESSEEEEEETTTEEEEE
T ss_pred EEEEEcCCEEEEEECCCCEEEe
Confidence 4567799999999999998765
No 222
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=34.92 E-value=1.1e+02 Score=23.89 Aligned_cols=37 Identities=22% Similarity=0.145 Sum_probs=28.5
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHH----CCCHHHHHHHHHHh
Q psy11525 152 AAQQAEMYLLGGNISEAESILLQ----HGLIFRAIQVSILT 188 (244)
Q Consensus 152 ~~~~A~ia~~~G~~~eAe~~~l~----~g~~~~Ai~m~~~l 188 (244)
+..+|+.++..|++++|..+|-+ ++.|..-+.+|+..
T Consensus 66 qV~lGE~L~~~G~~~~aa~hf~nAl~V~~qP~~LL~i~q~t 106 (121)
T PF02064_consen 66 QVQLGEQLLAQGDYEEAAEHFYNALKVCPQPAELLQIYQKT 106 (121)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHTSSSHHHHHHHHHHH
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHHhCCCHHHHHHHHHhh
Confidence 45688888999999999888766 48888888887753
No 223
>KOG1070|consensus
Probab=34.60 E-value=7e+02 Score=27.94 Aligned_cols=135 Identities=15% Similarity=0.134 Sum_probs=66.5
Q ss_pred HHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH-hCChhHHHHHHHhhCCCcHHHHHHHHHHHc
Q psy11525 89 AASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAA-IEQVDKVMYINHIKGIPVKAAQQAEMYLLG 162 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~-l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~ 162 (244)
-...++++|-++.+.+ ++...|...|...+...+-+.|.+...+ +..+.+=+ ..+--...|.+---.
T Consensus 1541 ~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~e-------Hv~~IskfAqLEFk~ 1613 (1710)
T KOG1070|consen 1541 EKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLPKQE-------HVEFISKFAQLEFKY 1613 (1710)
T ss_pred HHhhcchhHHHHHHHHHHHhcchhhHHHHHHHHHhcccHHHHHHHHHHHHHhhcchhh-------hHHHHHHHHHHHhhc
Confidence 3455666666666554 6788899998888887776666554432 22221100 001112234444444
Q ss_pred CCHHHHHHHHHH--------CCCHHHHHHHHHHhcCHHHHHHH---HHhcCCchhHHHHHHHHHH---HHcCCcccHHHH
Q psy11525 163 GNISEAESILLQ--------HGLIFRAIQVSILTHNWDRALEL---ALRHKTHIDTVLYQRKKYL---DNLEKIETNEKF 228 (244)
Q Consensus 163 G~~~eAe~~~l~--------~g~~~~Ai~m~~~l~~W~~Al~L---A~~~~~~~~~vl~~r~~yl---~~~~~~e~~~~f 228 (244)
|+.+.+-.+|.. -..|.-=|+|=+..+.-+.+-.| +-..+...+.+-.++.+|| ++-|.+++++.-
T Consensus 1614 GDaeRGRtlfEgll~ayPKRtDlW~VYid~eik~~~~~~vR~lfeRvi~l~l~~kkmKfffKkwLeyEk~~Gde~~vE~V 1693 (1710)
T KOG1070|consen 1614 GDAERGRTLFEGLLSAYPKRTDLWSVYIDMEIKHGDIKYVRDLFERVIELKLSIKKMKFFFKKWLEYEKSHGDEKNVEYV 1693 (1710)
T ss_pred CCchhhHHHHHHHHhhCccchhHHHHHHHHHHccCCHHHHHHHHHHHHhcCCChhHhHHHHHHHHHHHHhcCchhhHHHH
Confidence 554443334421 11222224443333332222222 1223334556666788888 556777777655
Q ss_pred Hh
Q psy11525 229 LR 230 (244)
Q Consensus 229 ~~ 230 (244)
+.
T Consensus 1694 Ka 1695 (1710)
T KOG1070|consen 1694 KA 1695 (1710)
T ss_pred HH
Confidence 43
No 224
>KOG2796|consensus
Probab=34.25 E-value=2.3e+02 Score=25.73 Aligned_cols=90 Identities=19% Similarity=0.166 Sum_probs=55.6
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcH---HHHHHHHHHHcCCHHHHHHHHHHC------
Q psy11525 105 NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVK---AAQQAEMYLLGGNISEAESILLQH------ 175 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~---~~~~A~ia~~~G~~~eAe~~~l~~------ 175 (244)
..+.+=..|+.++|+.|+.+.|+.-|.+.+ ++ -.++.++..+ ...+|.+.+-..+|-+|-..|-++
T Consensus 210 ~~p~L~s~Lgr~~MQ~GD~k~a~~yf~~ve---k~--~~kL~~~q~~~~V~~n~a~i~lg~nn~a~a~r~~~~i~~~D~~ 284 (366)
T KOG2796|consen 210 QEPQLLSGLGRISMQIGDIKTAEKYFQDVE---KV--TQKLDGLQGKIMVLMNSAFLHLGQNNFAEAHRFFTEILRMDPR 284 (366)
T ss_pred ccHHHHHHHHHHHHhcccHHHHHHHHHHHH---HH--HhhhhccchhHHHHhhhhhheecccchHHHHHHHhhccccCCC
Confidence 357899999999999999999999888765 11 1112222211 233455555666777777777443
Q ss_pred ----CCHHHHHHHHHHhcCHHHHHHHHHhc
Q psy11525 176 ----GLIFRAIQVSILTHNWDRALELALRH 201 (244)
Q Consensus 176 ----g~~~~Ai~m~~~l~~W~~Al~LA~~~ 201 (244)
..-+.-+.|| +|+-.+|++..++.
T Consensus 285 ~~~a~NnKALcllY--lg~l~DAiK~~e~~ 312 (366)
T KOG2796|consen 285 NAVANNNKALCLLY--LGKLKDALKQLEAM 312 (366)
T ss_pred chhhhchHHHHHHH--HHHHHHHHHHHHHH
Confidence 2223335555 46667777666553
No 225
>KOG1586|consensus
Probab=34.07 E-value=3.4e+02 Score=24.19 Aligned_cols=39 Identities=8% Similarity=0.130 Sum_probs=19.8
Q ss_pred HHHHHHHHHHHHHcCC-------HHHHHHHHHHhCChhHHHHHHHh
Q psy11525 107 DILWACLAGMATYSRD-------LATSEEAYAAIEQVDKVMYINHI 145 (244)
Q Consensus 107 ~~lW~~LA~~Al~~~~-------l~~A~~~y~~l~d~~~v~~l~~i 145 (244)
...+..+|..-++.+. +-.|-.||.+..-...|+-|+..
T Consensus 54 G~aflkaA~~h~k~~skhDaat~YveA~~cykk~~~~eAv~cL~~a 99 (288)
T KOG1586|consen 54 GDAFLKAADLHLKAGSKHDAATTYVEAANCYKKVDPEEAVNCLEKA 99 (288)
T ss_pred HHHHHHHHHHHHhcCCchhHHHHHHHHHHHhhccChHHHHHHHHHH
Confidence 3455566666666665 33344555555334444444433
No 226
>KOG0772|consensus
Probab=33.37 E-value=38 Score=33.08 Aligned_cols=30 Identities=23% Similarity=0.399 Sum_probs=26.5
Q ss_pred cceeeecCC-CceEEEEeCCeEEEEEcCCcc
Q psy11525 2 VQSICWNTD-VNILAAMQDSALCVWFFPAVV 31 (244)
Q Consensus 2 v~~v~W~~~-~~vlv~~~~~~l~vw~~~~~~ 31 (244)
|..++|+|. .-||++..|++.+|||-|+.+
T Consensus 460 vv~~~WhpkLNQi~~gsgdG~~~vyYdp~~S 490 (641)
T KOG0772|consen 460 VVRCLWHPKLNQIFAGSGDGTAHVYYDPNES 490 (641)
T ss_pred EEEEeecchhhheeeecCCCceEEEECcccc
Confidence 567899996 468999999999999999988
No 227
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=33.28 E-value=1.9e+02 Score=24.85 Aligned_cols=81 Identities=15% Similarity=0.092 Sum_probs=37.5
Q ss_pred HHcCCHHHHHHHHhcc--CCHHHHHHHHH-----HHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHH
Q psy11525 89 AASHSWPQALSLCRTL--NDDILWACLAG-----MATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLL 161 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~--~~~~lW~~LA~-----~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~ 161 (244)
+..+.-+.+..+..+. +.++.+..||. ..++.++++.|+.-....-+..+=..|..+ ...|.|.+.+.
T Consensus 64 ~~ak~~~~~~~~ekf~~~n~~t~Ya~laaL~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l-----~~lRLArvq~q 138 (207)
T COG2976 64 VQAKKPKSIAAAEKFVQANGKTIYAVLAALELAKAEVEANNLDKAEAQLKQALAQTKDENLKAL-----AALRLARVQLQ 138 (207)
T ss_pred HhcCCchhHHHHHHHHhhccccHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHccchhHHHHHH-----HHHHHHHHHHH
Confidence 5555555555555554 22566666553 344444444444322211100000111111 13467777777
Q ss_pred cCCHHHHHHHHHH
Q psy11525 162 GGNISEAESILLQ 174 (244)
Q Consensus 162 ~G~~~eAe~~~l~ 174 (244)
+|++++|-+++-.
T Consensus 139 ~~k~D~AL~~L~t 151 (207)
T COG2976 139 QKKADAALKTLDT 151 (207)
T ss_pred hhhHHHHHHHHhc
Confidence 7777776655543
No 228
>PF13763 DUF4167: Domain of unknown function (DUF4167)
Probab=32.68 E-value=59 Score=23.61 Aligned_cols=32 Identities=22% Similarity=0.149 Sum_probs=27.6
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHhCChhHHH
Q psy11525 109 LWACLAGMATYSRDLATSEEAYAAIEQVDKVM 140 (244)
Q Consensus 109 lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~ 140 (244)
-|..||..|+..+|--.||-+|.+...+-.+-
T Consensus 41 KY~~LArDA~ssGDrV~aEny~QHAeHY~Ril 72 (80)
T PF13763_consen 41 KYNQLARDAQSSGDRVLAENYLQHAEHYFRIL 72 (80)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHH
Confidence 58899999999999999999999888776553
No 229
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=32.59 E-value=2.3e+02 Score=21.80 Aligned_cols=79 Identities=19% Similarity=-0.013 Sum_probs=46.0
Q ss_pred HHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcH-----HHHHHHHHHHcCCHHHHHHHHHHCCCH------HH
Q psy11525 112 CLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVK-----AAQQAEMYLLGGNISEAESILLQHGLI------FR 180 (244)
Q Consensus 112 ~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~-----~~~~A~ia~~~G~~~eAe~~~l~~g~~------~~ 180 (244)
.+|..--..|+-+-|+..|.+--.. .+.+. ....|..+..+|++++|..+|.+.-.- ..
T Consensus 6 ~~A~a~d~~G~~~~Ai~~Y~~Al~~----------gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~p~~~~~~ 75 (120)
T PF12688_consen 6 ELAWAHDSLGREEEAIPLYRRALAA----------GLSGADRRRALIQLASTLRNLGRYDEALALLEEALEEFPDDELNA 75 (120)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHc----------CCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccccH
Confidence 3444445556666666666542110 11221 234677778889999999888765211 22
Q ss_pred HHH-----HHHHhcCHHHHHHHHHh
Q psy11525 181 AIQ-----VSILTHNWDRALELALR 200 (244)
Q Consensus 181 Ai~-----m~~~l~~W~~Al~LA~~ 200 (244)
++. ....+|++++|+++...
T Consensus 76 ~l~~f~Al~L~~~gr~~eAl~~~l~ 100 (120)
T PF12688_consen 76 ALRVFLALALYNLGRPKEALEWLLE 100 (120)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 333 23677888888887654
No 230
>KOG0495|consensus
Probab=32.29 E-value=4e+02 Score=27.32 Aligned_cols=43 Identities=21% Similarity=0.288 Sum_probs=36.9
Q ss_pred HcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 90 ASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 90 ~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
..++.++|.++|+.. .-..+|..+|+.--+.++++.|+.+|..
T Consensus 663 ~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~ 710 (913)
T KOG0495|consen 663 YLDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQ 710 (913)
T ss_pred HhhhHHHHHHHHHHHHHhCCchHHHHHHHhHHHHHHHHHHHHHHHHHh
Confidence 357789999999774 3589999999999999999999999964
No 231
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=31.76 E-value=1.3e+02 Score=26.47 Aligned_cols=31 Identities=16% Similarity=0.156 Sum_probs=27.3
Q ss_pred CHHHHHHHHHHHHHcCCHHHHHHHHHHhCCh
Q psy11525 106 DDILWACLAGMATYSRDLATSEEAYAAIEQV 136 (244)
Q Consensus 106 ~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~ 136 (244)
....|-.+|..|.++|++++|..+..++...
T Consensus 145 ~~~~~l~~a~~aRk~g~~~~A~~~l~~~~~~ 175 (352)
T PF02259_consen 145 LAETWLKFAKLARKAGNFQLALSALNRLFQL 175 (352)
T ss_pred HHHHHHHHHHHHHHCCCcHHHHHHHHHHhcc
Confidence 3578999999999999999999999887763
No 232
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=31.76 E-value=77 Score=22.97 Aligned_cols=22 Identities=27% Similarity=0.323 Sum_probs=13.4
Q ss_pred CCHHHHHHHHHHhcCHHHHHHHHHhcCC
Q psy11525 176 GLIFRAIQVSILTHNWDRALELALRHKT 203 (244)
Q Consensus 176 g~~~~Ai~m~~~l~~W~~Al~LA~~~~~ 203 (244)
|++++|++. +++|+++|++.+-
T Consensus 55 G~~~~A~~~------l~eAi~~Are~~D 76 (94)
T PF12862_consen 55 GHYEEALQA------LEEAIRLARENGD 76 (94)
T ss_pred CCHHHHHHH------HHHHHHHHHHHCC
Confidence 444455444 3578888887763
No 233
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=30.99 E-value=2.2e+02 Score=22.77 Aligned_cols=28 Identities=25% Similarity=0.492 Sum_probs=22.0
Q ss_pred cceeeecCCCceEEEEe-CCeEEEEEcCC
Q psy11525 2 VQSICWNTDVNILAAMQ-DSALCVWFFPA 29 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~-~~~l~vw~~~~ 29 (244)
|+++.|+++.+.+++.+ ++.+.+|-...
T Consensus 12 i~~~~~~~~~~~l~~~~~~g~i~i~~~~~ 40 (289)
T cd00200 12 VTCVAFSPDGKLLATGSGDGTIKVWDLET 40 (289)
T ss_pred EEEEEEcCCCCEEEEeecCcEEEEEEeeC
Confidence 67899999987777665 88999986544
No 234
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=30.27 E-value=91 Score=19.12 Aligned_cols=23 Identities=17% Similarity=0.021 Sum_probs=17.8
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 152 AAQQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 152 ~~~~A~ia~~~G~~~eAe~~~l~ 174 (244)
..+.|++.+..++|++|..=|.+
T Consensus 4 ~~~Lgeisle~e~f~qA~~D~~~ 26 (38)
T PF10516_consen 4 YDLLGEISLENENFEQAIEDYEK 26 (38)
T ss_pred HHHHHHHHHHhccHHHHHHHHHH
Confidence 45678899999999988875544
No 235
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=30.26 E-value=3.4e+02 Score=23.34 Aligned_cols=56 Identities=18% Similarity=0.180 Sum_probs=38.3
Q ss_pred HHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHH
Q psy11525 125 TSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRAL 195 (244)
Q Consensus 125 ~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al 195 (244)
-|...|.+.+..++..+| ...+|+-+...|++++|.++|... ...|+.-|-|.-.-
T Consensus 163 ~A~~~f~~~~~~R~~~~l---------~~~~A~ey~~~g~~~~A~~~l~~~------~~~yr~egW~~l~~ 218 (247)
T PF11817_consen 163 KAYEQFKKYGQNRMASYL---------SLEMAEEYFRLGDYDKALKLLEPA------ASSYRREGWWSLLT 218 (247)
T ss_pred HHHHHHHHhccchHHHHH---------HHHHHHHHHHCCCHHHHHHHHHHH------HHHHHhCCcHHHHH
Confidence 466667777765555543 346888888899999999888765 56666666665443
No 236
>KOG2168|consensus
Probab=29.57 E-value=90 Score=32.25 Aligned_cols=31 Identities=16% Similarity=0.049 Sum_probs=16.5
Q ss_pred HHHHHHHHcCCHHHHHHHHHHCCCHHHHHHH
Q psy11525 154 QQAEMYLLGGNISEAESILLQHGLIFRAIQV 184 (244)
Q Consensus 154 ~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m 184 (244)
..|.-+-..|+|++|..+|.-+|.++.|++.
T Consensus 627 ~vA~~a~~~G~~~~sI~LY~lag~yd~al~l 657 (835)
T KOG2168|consen 627 EVASEADEDGLFEDAILLYHLAGDYDKALEL 657 (835)
T ss_pred HHHHHHHhcCCHHHHHHHHHHhhhhhHHHHH
Confidence 3455555555555555555555555555443
No 237
>PRK05441 murQ N-acetylmuramic acid-6-phosphate etherase; Reviewed
Probab=29.52 E-value=4.2e+02 Score=23.77 Aligned_cols=83 Identities=22% Similarity=0.163 Sum_probs=54.9
Q ss_pred HHHHHHHhCChhHHHHHHHhhCCCcHHHHHH--HHHHHcC-CHHHHHHHHHHC-CCHHHHHHHHHHhcCHHHHHHHHHhc
Q psy11525 126 SEEAYAAIEQVDKVMYINHIKGIPVKAAQQA--EMYLLGG-NISEAESILLQH-GLIFRAIQVSILTHNWDRALELALRH 201 (244)
Q Consensus 126 A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A--~ia~~~G-~~~eAe~~~l~~-g~~~~Ai~m~~~l~~W~~Al~LA~~~ 201 (244)
....+.+.|.+ .-.+..+++-.++|...+| .+....| ..++|++.+.++ |.++.||-|...-..+++|.++..++
T Consensus 209 st~~~~~~gkv-~~n~mvd~~~~n~kl~~ra~~i~~~~~~~~~~~a~~~l~~~~~~vk~a~~~~~~~~~~~~a~~~l~~~ 287 (299)
T PRK05441 209 STGVMIRLGKV-YGNLMVDVKATNEKLVDRAVRIVMEATGVSREEAEAALEAADGSVKLAIVMILTGLDAAEAKALLARH 287 (299)
T ss_pred HHHHHHHccHH-HHHHHHHhcCCHHHHHHHHHHHHHHHHCcCHHHHHHHHHHhCCCcHHHHHHHHhCCCHHHHHHHHHHc
Confidence 44445555552 2223334444444443333 2333334 478899999985 89999999999999999999999999
Q ss_pred CCchhHHH
Q psy11525 202 KTHIDTVL 209 (244)
Q Consensus 202 ~~~~~~vl 209 (244)
+.++...+
T Consensus 288 ~g~~~~~~ 295 (299)
T PRK05441 288 GGFLRKAL 295 (299)
T ss_pred CCCHHHHH
Confidence 88866644
No 238
>KOG2066|consensus
Probab=29.16 E-value=3.4e+02 Score=28.04 Aligned_cols=118 Identities=17% Similarity=0.198 Sum_probs=77.7
Q ss_pred HHHHHHcCCHHHHHHHHhccCCHHHH------------------------HHHHHHHHHcCCHHHHHHHHHHhCChhHHH
Q psy11525 85 LHSYAASHSWPQALSLCRTLNDDILW------------------------ACLAGMATYSRDLATSEEAYAAIEQVDKVM 140 (244)
Q Consensus 85 l~~~l~~~~~~~A~~~~~~~~~~~lW------------------------~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~ 140 (244)
++.|+..+++++|+++--.+++.... .+.+..-+++.+.---.+.|-.+++....+
T Consensus 512 a~LYl~d~~Y~~Al~~ylklk~~~vf~lI~k~nL~d~i~~~Iv~Lmll~skka~~lLldn~d~ip~a~Vveql~~~P~~l 591 (846)
T KOG2066|consen 512 AHLYLYDNKYEKALPIYLKLQDKDVFDLIKKHNLFDQIKDQIVLLMLLDSKKAIDLLLDNRDSISPSEVVEQLEDNPKLL 591 (846)
T ss_pred HHHHHHccChHHHHHHHHhccChHHHHHHHHHhhHHHHHHHHHHHHccchhhHHHHHhhccccCCHHHHHHHHhcChHHH
Confidence 44568888888888765554332211 144555666665444455566666655553
Q ss_pred HHH--HhhCCC--------c------------------------HHHHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHH
Q psy11525 141 YIN--HIKGIP--------V------------------------KAAQQAEMYLLGGNISEAESILLQHGLIFRAIQVSI 186 (244)
Q Consensus 141 ~l~--~i~~~~--------~------------------------~~~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~ 186 (244)
+.- +..... + ......+++...|-++|-.-++-+.|.-.+|+.|-+
T Consensus 592 ~~YL~kl~~rd~~~~~~y~dk~I~LYAEyDrk~LLPFLr~s~~Y~lekA~eiC~q~~~~~E~VYlLgrmGn~k~AL~lII 671 (846)
T KOG2066|consen 592 YCYLHKLFKRDHFMGSEYHDKQIELYAEYDRKKLLPFLRKSQNYNLEKALEICSQKNFYEELVYLLGRMGNAKEALKLII 671 (846)
T ss_pred HHHHHHHhhcCccccchhhhHHHHHHHHHhHhhhhHHHHhcCCCCHHHHHHHHHhhCcHHHHHHHHHhhcchHHHHHHHH
Confidence 322 221111 0 012356888888999999999999999999998764
Q ss_pred -HhcCHHHHHHHHHhcC
Q psy11525 187 -LTHNWDRALELALRHK 202 (244)
Q Consensus 187 -~l~~W~~Al~LA~~~~ 202 (244)
++++-++|++.++.++
T Consensus 672 ~el~die~AIefvKeq~ 688 (846)
T KOG2066|consen 672 NELRDIEKAIEFVKEQD 688 (846)
T ss_pred HHhhCHHHHHHHHHhcC
Confidence 6899999999999876
No 239
>COG2103 Predicted sugar phosphate isomerase [General function prediction only]
Probab=29.10 E-value=1.5e+02 Score=26.66 Aligned_cols=47 Identities=23% Similarity=0.196 Sum_probs=40.0
Q ss_pred CHHHHHHHHHHC-CCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHH
Q psy11525 164 NISEAESILLQH-GLIFRAIQVSILTHNWDRALELALRHKTHIDTVLY 210 (244)
Q Consensus 164 ~~~eAe~~~l~~-g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~ 210 (244)
..++|+.+|.++ +.++.||-|-..-..-++|-++..+|+..+...+.
T Consensus 247 ~~~~A~~~L~~~~~~vK~AIvm~~~~~~a~~A~~~L~~~~g~lr~Al~ 294 (298)
T COG2103 247 SAEEAEALLEEAGGNVKLAIVMLLTGLSAEEAKRLLERAGGFLRQALS 294 (298)
T ss_pred CHHHHHHHHHHcCCccHhHHHHHHhCCCHHHHHHHHHHccChHHHHHh
Confidence 579999999998 57899999999999999999999999876655443
No 240
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=28.45 E-value=3.8e+02 Score=22.99 Aligned_cols=83 Identities=12% Similarity=0.110 Sum_probs=53.0
Q ss_pred HHHHHHHcCCHHHHHHHHhcc---CCHHH-----HHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHH
Q psy11525 84 VLHSYAASHSWPQALSLCRTL---NDDIL-----WACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQ 155 (244)
Q Consensus 84 ~l~~~l~~~~~~~A~~~~~~~---~~~~l-----W~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~ 155 (244)
.-..+++.|++++|..-.+.. ..+.. =.+||+.-++.+.+|.|....-.+.+-+.+..+ ...+
T Consensus 95 lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~t~~~~~w~~~~---------~elr 165 (207)
T COG2976 95 LAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAALKTLDTIKEESWAAIV---------AELR 165 (207)
T ss_pred HHHHHHhhccHHHHHHHHHHHHccchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccccccHHHHH---------HHHh
Confidence 345579999999999988853 22222 234556666666666655555444444333322 3457
Q ss_pred HHHHHHcCCHHHHHHHHHHC
Q psy11525 156 AEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 156 A~ia~~~G~~~eAe~~~l~~ 175 (244)
|.+++-.|+-++|-.-|-++
T Consensus 166 GDill~kg~k~~Ar~ay~kA 185 (207)
T COG2976 166 GDILLAKGDKQEARAAYEKA 185 (207)
T ss_pred hhHHHHcCchHHHHHHHHHH
Confidence 88899999998888777654
No 241
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=28.02 E-value=61 Score=34.02 Aligned_cols=42 Identities=12% Similarity=0.275 Sum_probs=29.3
Q ss_pred EEEEEecCCcEEEeecCccH-HHHHHHHHcCCHHHHHHHHhcc
Q psy11525 63 HLTIRRYDGTVINYPISPYI-SVLHSYAASHSWPQALSLCRTL 104 (244)
Q Consensus 63 ~v~~~~~~G~~~~~~~~p~~-~~l~~~l~~~~~~~A~~~~~~~ 104 (244)
.|.++-+-|.+-++.--++- ..+-++|..++|.+|+.+||..
T Consensus 678 ~vVLQmPRGNLEtI~PR~LVL~~ir~~Ld~~~Y~~Af~~~Rkh 720 (928)
T PF04762_consen 678 SVVLQMPRGNLETIYPRALVLAGIRKLLDAKDYKEAFELCRKH 720 (928)
T ss_pred eEEEEcCCCchhhhccHhHHHHHHHHHHhhccHHHHHHHHHHh
Confidence 45666666666543322332 4566789999999999999986
No 242
>KOG2880|consensus
Probab=27.96 E-value=1.2e+02 Score=28.32 Aligned_cols=25 Identities=28% Similarity=0.444 Sum_probs=20.8
Q ss_pred cHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 150 VKAAQQAEMYLLGGNISEAESILLQ 174 (244)
Q Consensus 150 ~~~~~~A~ia~~~G~~~eAe~~~l~ 174 (244)
..-.++|.+++..|+++.|.-+|++
T Consensus 36 ~ei~rmA~VY~~EgN~enafvLy~r 60 (424)
T KOG2880|consen 36 TEILRMANVYLEEGNVENAFVLYLR 60 (424)
T ss_pred HHHHHHHHHHHhcCCcchhhhHHHH
Confidence 3457899999999999999888775
No 243
>KOG0686|consensus
Probab=27.85 E-value=1.4e+02 Score=28.49 Aligned_cols=22 Identities=18% Similarity=0.167 Sum_probs=13.0
Q ss_pred cccHHHHHhccc-ccccCcccCC
Q psy11525 222 IETNEKFLRLQS-EVIMSGLRTP 243 (244)
Q Consensus 222 ~e~~~~f~~~~~-~~~~~~~~~~ 243 (244)
.++...|..+.. ..+-+.++||
T Consensus 246 k~aa~~fL~~~~~~~d~~~ivtp 268 (466)
T KOG0686|consen 246 KSAAKYFLLAEFDHCDYPEIVTP 268 (466)
T ss_pred HHHHHHHHhCCCCccCccceecc
Confidence 456666666652 3445566666
No 244
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=27.54 E-value=3.2e+02 Score=21.76 Aligned_cols=27 Identities=30% Similarity=0.519 Sum_probs=22.4
Q ss_pred cceeeecCCCceEEEEe-CCeEEEEEcC
Q psy11525 2 VQSICWNTDVNILAAMQ-DSALCVWFFP 28 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~-~~~l~vw~~~ 28 (244)
+.++.|+++..++++.+ ++.+.+|-..
T Consensus 222 i~~~~~~~~~~~~~~~~~~~~i~i~~~~ 249 (289)
T cd00200 222 VNSVAFSPDGYLLASGSEDGTIRVWDLR 249 (289)
T ss_pred eEEEEEcCCCcEEEEEcCCCcEEEEEcC
Confidence 56789999988888888 8999998544
No 245
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=27.52 E-value=5.6e+02 Score=24.64 Aligned_cols=82 Identities=13% Similarity=0.098 Sum_probs=48.3
Q ss_pred HHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHH-HHHHHHHcCCHHHHH----HHHHHCCCH----HHHH
Q psy11525 112 CLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQ-QAEMYLLGGNISEAE----SILLQHGLI----FRAI 182 (244)
Q Consensus 112 ~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~-~A~ia~~~G~~~eAe----~~~l~~g~~----~~Ai 182 (244)
.-|+.++..|+.+-|.+.|-++-+..-.+.+- ++ .---+.-+|..+-|. ..+-++-.. ..-+
T Consensus 125 LeAQaal~eG~~~~Ar~kfeAMl~dPEtRllG---------LRgLyleAqr~GareaAr~yAe~Aa~~Ap~l~WA~~AtL 195 (531)
T COG3898 125 LEAQAALLEGDYEDARKKFEAMLDDPETRLLG---------LRGLYLEAQRLGAREAARHYAERAAEKAPQLPWAARATL 195 (531)
T ss_pred HHHHHHHhcCchHHHHHHHHHHhcChHHHHHh---------HHHHHHHHHhcccHHHHHHHHHHHHhhccCCchHHHHHH
Confidence 34778899999999999998754433222210 01 001122345554333 333333322 3348
Q ss_pred HHHHHhcCHHHHHHHHHhcC
Q psy11525 183 QVSILTHNWDRALELALRHK 202 (244)
Q Consensus 183 ~m~~~l~~W~~Al~LA~~~~ 202 (244)
+-++.-|+|+.||+|.....
T Consensus 196 e~r~~~gdWd~AlkLvd~~~ 215 (531)
T COG3898 196 EARCAAGDWDGALKLVDAQR 215 (531)
T ss_pred HHHHhcCChHHHHHHHHHHH
Confidence 88999999999999986643
No 246
>PF05944 Phage_term_smal: Phage small terminase subunit; InterPro: IPR010270 This entry is represented by Bacteriophage P2, GpM. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several phage small terminase subunit proteins as well as some related bacterial sequences []. M protein is probably an endonuclease which directs cos cleavage. The Q, P and M proteins are needed to package DNA into proheads and for the conversion of proheads to capsids.; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0019069 viral capsid assembly
Probab=27.40 E-value=2.9e+02 Score=21.85 Aligned_cols=56 Identities=21% Similarity=0.125 Sum_probs=33.4
Q ss_pred HHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCC--HHHH----HHHHHHhcCHHHHHHHHHh
Q psy11525 142 INHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGL--IFRA----IQVSILTHNWDRALELALR 200 (244)
Q Consensus 142 l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~--~~~A----i~m~~~l~~W~~Al~LA~~ 200 (244)
|+.+..+++|.....+ ++-+|.-=.+-++.+|. .+.- +==..|.|+|+.|+++|.-
T Consensus 12 Lk~iqs~e~K~~~Kr~---lLP~Y~p~v~g~L~~g~g~qd~Vl~~~mvW~~D~Gd~~~AL~~a~y 73 (132)
T PF05944_consen 12 LKQIQSIERKAEYKRE---LLPKYLPWVEGVLASGSGAQDDVLMTVMVWLFDVGDFDGALDIAEY 73 (132)
T ss_pred HhhcccHHHHHHHHHH---HHHhHHHHHHHHHHcCCCCcCchHHhhHhhhhcccCHHHHHHHHHH
Confidence 3333334444433333 55577777777777655 3322 2234799999999999864
No 247
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=27.29 E-value=7.4e+02 Score=26.20 Aligned_cols=91 Identities=22% Similarity=0.170 Sum_probs=59.9
Q ss_pred HHHHHHHHHHhCChhHH-HHHHHhhCCCcHH----HHHHHHHHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHH
Q psy11525 123 LATSEEAYAAIEQVDKV-MYINHIKGIPVKA----AQQAEMYLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALEL 197 (244)
Q Consensus 123 l~~A~~~y~~l~d~~~v-~~l~~i~~~~~~~----~~~A~ia~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~L 197 (244)
+-.-..||.++|+..+. ...+++.+....+ ...|..+... ++++|++++. +|+++|.+-.++..+.++
T Consensus 119 l~~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~AY~~ae~-dL~KA~~m~~------KAV~~~i~~kq~~~~~e~ 191 (906)
T PRK14720 119 LRTLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKKLATSYEEE-DKEKAITYLK------KAIYRFIKKKQYVGIEEI 191 (906)
T ss_pred HHHHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHHh-hHHHHHHHHH------HHHHHHHhhhcchHHHHH
Confidence 33456788888887766 3455555553322 3567777777 9999999985 578999999999988886
Q ss_pred HHhcC---Cc-hhHHHHHHHHHHHHcC
Q psy11525 198 ALRHK---TH-IDTVLYQRKKYLDNLE 220 (244)
Q Consensus 198 A~~~~---~~-~~~vl~~r~~yl~~~~ 220 (244)
=++.. ++ ++.++..-++-+.+.|
T Consensus 192 W~k~~~~~~~d~d~f~~i~~ki~~~~~ 218 (906)
T PRK14720 192 WSKLVHYNSDDFDFFLRIERKVLGHRE 218 (906)
T ss_pred HHHHHhcCcccchHHHHHHHHHHhhhc
Confidence 55532 33 4555544444444433
No 248
>KOG3060|consensus
Probab=27.26 E-value=2.1e+02 Score=25.69 Aligned_cols=44 Identities=18% Similarity=0.178 Sum_probs=32.1
Q ss_pred HHcCCHHHHHHHH-----hccCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 89 AASHSWPQALSLC-----RTLNDDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 89 l~~~~~~~A~~~~-----~~~~~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
-..|+=-+|++-. .++.+...|..||+..+..+++.-|.-||-.
T Consensus 131 ka~GK~l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClEE 179 (289)
T KOG3060|consen 131 KAQGKNLEAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLEE 179 (289)
T ss_pred HHcCCcHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHhHHHHHHHHHHH
Confidence 4455555666533 2347889999999999998888888888765
No 249
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=26.71 E-value=6.1e+02 Score=24.78 Aligned_cols=77 Identities=22% Similarity=0.210 Sum_probs=46.2
Q ss_pred HHHcCCHHHHHHHHhcc----CC-HHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHc
Q psy11525 88 YAASHSWPQALSLCRTL----ND-DILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLG 162 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~----~~-~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~ 162 (244)
|-..|++++|..++... ++ ..+...-|..--+.|++.-|..++-.....+...-. .+...+.-.+..
T Consensus 204 yd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilKh~G~~~~Aa~~~~~Ar~LD~~DRy--------iNsK~aKy~LRa 275 (517)
T PF12569_consen 204 YDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILKHAGDLKEAAEAMDEARELDLADRY--------INSKCAKYLLRA 275 (517)
T ss_pred HHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHhCChhhHH--------HHHHHHHHHHHC
Confidence 46789999999999864 22 556666666666666666555544433322222100 134566667777
Q ss_pred CCHHHHHHHH
Q psy11525 163 GNISEAESIL 172 (244)
Q Consensus 163 G~~~eAe~~~ 172 (244)
|++++|+++.
T Consensus 276 ~~~e~A~~~~ 285 (517)
T PF12569_consen 276 GRIEEAEKTA 285 (517)
T ss_pred CCHHHHHHHH
Confidence 7777777653
No 250
>PF04097 Nic96: Nup93/Nic96; InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=26.68 E-value=1.2e+02 Score=30.10 Aligned_cols=35 Identities=26% Similarity=0.284 Sum_probs=29.0
Q ss_pred HHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 166 SEAESILLQHGLIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 166 ~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
..|..-.-..|++..||.+|.-.+++++|+++..+
T Consensus 418 ~~~A~~~e~~g~~~dAi~Ly~La~~~d~vl~lln~ 452 (613)
T PF04097_consen 418 EQAAREAEERGRFEDAILLYHLAEEYDKVLSLLNR 452 (613)
T ss_dssp HHHHHHHHHCT-HHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHhhHHHHHHHHHH
Confidence 45677788899999999999999999999998765
No 251
>PF07575 Nucleopor_Nup85: Nup85 Nucleoporin; InterPro: IPR011502 This is a family of nucleoporins conserved from yeast to human. Nup85 Nucleoporin is an essential component of the nuclear pore complex (NPC) that seems to be required for NPC assembly and maintenance. As part of the NPC Nup107-160 subcomplex plays a role in RNA export and in tethering NUP98/Nup98 and NUP153 to the nucleus. The Nup107-160 complex seems to be required for spindle assembly during mitosis. NUP85 is required for membrane clustering of CCL2-activated CCR2. Seems to be involved in CCR2-mediated chemotaxis of monocytes and may link activated CCR2 to the phosphatidyl-inositol-3-kinase-Rac-lammellipodium protrusion cascade [, , ]. ; PDB: 3F3F_D 3F3P_G 3F3G_G 3EWE_B.
Probab=26.65 E-value=49 Score=32.42 Aligned_cols=53 Identities=11% Similarity=0.068 Sum_probs=34.5
Q ss_pred CCHHHHHHHHhcc----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHH
Q psy11525 92 HSWPQALSLCRTL----NDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINH 144 (244)
Q Consensus 92 ~~~~~A~~~~~~~----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~ 144 (244)
.+.+.++++|+.. ..+...+.+|..+++.+++..|...|.+.||..++..+-.
T Consensus 406 ~~~~k~l~iC~~~~L~~~a~~I~~~~~~~~~~~~~~g~AL~~~~ra~d~~~v~~i~~ 462 (566)
T PF07575_consen 406 DDAEKLLEICAELGLEDVAREICKILGQRLLKEGRYGEALSWFIRAGDYSLVTRIAD 462 (566)
T ss_dssp HHHHHHHHHHHHHT-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHH
Confidence 4567888888874 3578889999999999999999999999999988865433
No 252
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=26.04 E-value=1.9e+02 Score=25.41 Aligned_cols=46 Identities=9% Similarity=-0.035 Sum_probs=35.9
Q ss_pred HHHcCCHHHHHHHHhccC--------CHHHHHHHHHHHHHcCCHHHHHHHHHHh
Q psy11525 88 YAASHSWPQALSLCRTLN--------DDILWACLAGMATYSRDLATSEEAYAAI 133 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~~--------~~~lW~~LA~~Al~~~~l~~A~~~y~~l 133 (244)
|...|++++|....+.+. ....|..+|......++.+-|.+.|..+
T Consensus 190 y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g~~~~A~~~~~~v 243 (263)
T PRK10803 190 NYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQV 243 (263)
T ss_pred HHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 488999999999887752 4677888888877778888777777653
No 253
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=25.91 E-value=2.3e+02 Score=24.79 Aligned_cols=24 Identities=25% Similarity=0.426 Sum_probs=17.6
Q ss_pred cceeeecCCCceEEEEeCCeEEEE
Q psy11525 2 VQSICWNTDVNILAAMQDSALCVW 25 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~~~~l~vw 25 (244)
++.+.|+++++.|.|.+|..=.++
T Consensus 24 ~SGLTy~pd~~tLfaV~d~~~~i~ 47 (248)
T PF06977_consen 24 LSGLTYNPDTGTLFAVQDEPGEIY 47 (248)
T ss_dssp EEEEEEETTTTEEEEEETTTTEEE
T ss_pred ccccEEcCCCCeEEEEECCCCEEE
Confidence 578999999987777766544443
No 254
>PRK12570 N-acetylmuramic acid-6-phosphate etherase; Reviewed
Probab=25.66 E-value=1.7e+02 Score=26.28 Aligned_cols=52 Identities=13% Similarity=0.091 Sum_probs=41.9
Q ss_pred HHHcC-CHHHHHHHHHHC-CCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHH
Q psy11525 159 YLLGG-NISEAESILLQH-GLIFRAIQVSILTHNWDRALELALRHKTHIDTVLY 210 (244)
Q Consensus 159 a~~~G-~~~eAe~~~l~~-g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~ 210 (244)
....| ..++|++.+.++ |.+..||=|...-..-++|.++..++++++...|.
T Consensus 239 ~~~~~~~~~~a~~~l~~~~~~vk~ai~~~~~~~~~~~a~~~l~~~~~~~~~~l~ 292 (296)
T PRK12570 239 MQATGCSEDEAKELLKESDNDVKLAILMILTGMDVEQARAALSHADGFLRKAIE 292 (296)
T ss_pred HHHHCcCHHHHHHHHHHhCCccHHHHHHHHhCCCHHHHHHHHHHcCChHHHHHH
Confidence 33344 468899999985 78999999999988999999999998887766554
No 255
>KOG0302|consensus
Probab=25.60 E-value=56 Score=30.70 Aligned_cols=24 Identities=33% Similarity=0.733 Sum_probs=20.2
Q ss_pred cceeeecC--CCceEEEEeCCeEEEE
Q psy11525 2 VQSICWNT--DVNILAAMQDSALCVW 25 (244)
Q Consensus 2 v~~v~W~~--~~~vlv~~~~~~l~vw 25 (244)
++||.|+| ++.+.+.-.|+++++|
T Consensus 350 ItsieW~p~e~s~iaasg~D~QitiW 375 (440)
T KOG0302|consen 350 ITSIEWHPHEDSVIAASGEDNQITIW 375 (440)
T ss_pred eeEEEeccccCceEEeccCCCcEEEE
Confidence 68999997 5677777889999999
No 256
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=25.41 E-value=2.5e+02 Score=26.86 Aligned_cols=49 Identities=22% Similarity=0.255 Sum_probs=35.2
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHCCCHHHH------------HHHHHHhcCHHHHHHHHHhc
Q psy11525 153 AQQAEMYLLGGNISEAESILLQHGLIFRA------------IQVSILTHNWDRALELALRH 201 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eAe~~~l~~g~~~~A------------i~m~~~l~~W~~Al~LA~~~ 201 (244)
...|++....|+.++|.+.|-++...... .--+.-+++|++|.+-+...
T Consensus 271 ~~~gR~~~~~g~~~~Ai~~~~~a~~~q~~~~Ql~~l~~~El~w~~~~~~~w~~A~~~f~~L 331 (468)
T PF10300_consen 271 FFEGRLERLKGNLEEAIESFERAIESQSEWKQLHHLCYFELAWCHMFQHDWEEAAEYFLRL 331 (468)
T ss_pred HHHHHHHHHhcCHHHHHHHHHHhccchhhHHhHHHHHHHHHHHHHHHHchHHHHHHHHHHH
Confidence 35799999999999999999987642211 11245578888887766553
No 257
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=24.99 E-value=1e+02 Score=25.54 Aligned_cols=27 Identities=22% Similarity=0.442 Sum_probs=20.2
Q ss_pred cceeeecCCCceEEEEe----CCeEEEEEcC
Q psy11525 2 VQSICWNTDVNILAAMQ----DSALCVWFFP 28 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~----~~~l~vw~~~ 28 (244)
++.|.|+|+++.++..+ .+.|.+|=..
T Consensus 103 ~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~ 133 (194)
T PF08662_consen 103 RNTISWSPDGRFLVLAGFGNLNGDLEFWDVR 133 (194)
T ss_pred ceEEEECCCCCEEEEEEccCCCcEEEEEECC
Confidence 56899999998888765 3567887444
No 258
>TIGR00274 N-acetylmuramic acid 6-phosphate etherase. This protein, MurQ, is involved in recycling components of the bacterial murein sacculus turned over during cell growth. The cell wall metabolite anhydro-N-acetylmuramic acid (anhMurNAc) is converted by a kinase, AnmK, to MurNAc-phosphate, then converted to N-acetylglucosamine-phosphate by this etherase, called MurQ. This family of proteins is similar to the C-terminal half of a number of vertebrate glucokinase regulator proteins and contains a Prosite pattern which is shared by this group of proteins in a region of local similarity.
Probab=24.79 E-value=1.9e+02 Score=25.88 Aligned_cols=44 Identities=27% Similarity=0.135 Sum_probs=37.8
Q ss_pred CHHHHHHHHHHC-CCHHHHHHHHHHhcCHHHHHHHHHhcCCchhH
Q psy11525 164 NISEAESILLQH-GLIFRAIQVSILTHNWDRALELALRHKTHIDT 207 (244)
Q Consensus 164 ~~~eAe~~~l~~-g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~ 207 (244)
..++|++.+.++ |.+..||=|-..-..+++|.++..+++.++..
T Consensus 244 ~~~~a~~~l~~~~~~vk~Ai~~~~~~~~~~~a~~~l~~~~g~~~~ 288 (291)
T TIGR00274 244 NKELAEQTLLAADQNVKLAIVMILSTLSASEAKVLLDRHGGFLRQ 288 (291)
T ss_pred CHHHHHHHHHHhCCCcHHHHHHHHhCCCHHHHHHHHHHcCCcHHH
Confidence 467899999986 78999999999999999999999888877544
No 259
>KOG2003|consensus
Probab=24.74 E-value=6.8e+02 Score=24.61 Aligned_cols=45 Identities=16% Similarity=0.200 Sum_probs=30.0
Q ss_pred HHcCCHHHHHHHHhc-cCC-----HHHHHHHHHHHHHcCCHHHHHHHHHHhC
Q psy11525 89 AASHSWPQALSLCRT-LND-----DILWACLAGMATYSRDLATSEEAYAAIE 134 (244)
Q Consensus 89 l~~~~~~~A~~~~~~-~~~-----~~lW~~LA~~Al~~~~l~~A~~~y~~l~ 134 (244)
...|+++.|...-+. +.+ +.+ -..|-.+-..++++.|..||.++.
T Consensus 501 f~ngd~dka~~~ykeal~ndasc~eal-fniglt~e~~~~ldeald~f~klh 551 (840)
T KOG2003|consen 501 FANGDLDKAAEFYKEALNNDASCTEAL-FNIGLTAEALGNLDEALDCFLKLH 551 (840)
T ss_pred eecCcHHHHHHHHHHHHcCchHHHHHH-HHhcccHHHhcCHHHHHHHHHHHH
Confidence 456777777776655 322 222 245666777888999999998765
No 260
>KOG0290|consensus
Probab=24.52 E-value=1.5e+02 Score=27.14 Aligned_cols=28 Identities=25% Similarity=0.603 Sum_probs=19.6
Q ss_pred cceeeecC-CCceEEEEe-CCeEEEEEcCC
Q psy11525 2 VQSICWNT-DVNILAAMQ-DSALCVWFFPA 29 (244)
Q Consensus 2 v~~v~W~~-~~~vlv~~~-~~~l~vw~~~~ 29 (244)
.+||-||. |++.+...+ |++.+||=--.
T Consensus 153 lTSFDWne~dp~~igtSSiDTTCTiWdie~ 182 (364)
T KOG0290|consen 153 LTSFDWNEVDPNLIGTSSIDTTCTIWDIET 182 (364)
T ss_pred ccccccccCCcceeEeecccCeEEEEEEee
Confidence 47899996 888887766 56666664333
No 261
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.41 E-value=2e+02 Score=25.63 Aligned_cols=53 Identities=25% Similarity=0.167 Sum_probs=36.9
Q ss_pred HHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc------HHHHHHHHHHHcCCHHHHHHHHHHC
Q psy11525 112 CLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV------KAAQQAEMYLLGGNISEAESILLQH 175 (244)
Q Consensus 112 ~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~------~~~~~A~ia~~~G~~~eAe~~~l~~ 175 (244)
..|-..++.+++..|+..|. ..++..+. -.=.+|+...-+|+|++|..+|+..
T Consensus 146 ~~A~~~~ksgdy~~A~~~F~-----------~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~ 204 (262)
T COG1729 146 NAALDLYKSGDYAEAEQAFQ-----------AFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARV 204 (262)
T ss_pred HHHHHHHHcCCHHHHHHHHH-----------HHHHcCCCCcccchhHHHHHHHHHhcccchHHHHHHHHH
Confidence 34555666677777776665 33444433 2345899999999999999999875
No 262
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=24.39 E-value=2.4e+02 Score=23.31 Aligned_cols=47 Identities=17% Similarity=0.058 Sum_probs=35.7
Q ss_pred HHHHHcCCHHHHHHHHhccC--------CHHHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11525 86 HSYAASHSWPQALSLCRTLN--------DDILWACLAGMATYSRDLATSEEAYAA 132 (244)
Q Consensus 86 ~~~l~~~~~~~A~~~~~~~~--------~~~lW~~LA~~Al~~~~l~~A~~~y~~ 132 (244)
..+++.|+|++|.+.-+.+. .+..+-.+|......++++.|...|.+
T Consensus 13 ~~~~~~g~y~~Ai~~f~~l~~~~P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~ 67 (203)
T PF13525_consen 13 LEALQQGDYEEAIKLFEKLIDRYPNSPYAPQAQLMLAYAYYKQGDYEEAIAAYER 67 (203)
T ss_dssp HHHHHCT-HHHHHHHHHHHHHH-TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 34699999999999888762 356677788888888888888888776
No 263
>KOG2247|consensus
Probab=24.35 E-value=1.6e+02 Score=28.82 Aligned_cols=67 Identities=15% Similarity=0.244 Sum_probs=42.3
Q ss_pred eeec---CCCceEEEEeCCeEEEEEcCCccccCcccceeeeEeeccCCCCCCCeEEeEeCCEEEEEecCCcEEEe
Q psy11525 5 ICWN---TDVNILAAMQDSALCVWFFPAVVFADQGLLRKTVLLKDIGEFGKSPSIVSFVKNHLTIRRYDGTVINY 76 (244)
Q Consensus 5 v~W~---~~~~vlv~~~~~~l~vw~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~v~f~g~~v~~~~~~G~~~~~ 76 (244)
..|- .+.|.|..++..++.+++-.+-... ..+.+..+++.++...-..+..++.|+|..++|++-.+
T Consensus 515 a~~~~~~~~~n~~~~~~~~~i~~~~~~~~~~~-----~~~~~~v~~~~~~~~~~~~~l~~~~vtc~~~~~~l~~v 584 (615)
T KOG2247|consen 515 AIWETFTIDKNTFAVFDSQNIYVFLLSKQHIQ-----GESVIYVSATRLPHAYVPLSLNKGIVTCLMSNGKLSSV 584 (615)
T ss_pred eeEEEEeccccccccccccceeeEeecccccc-----ceeEEEEccccccCCccCchhcCceEEEEEeCCcccee
Confidence 4555 4779999999999999876654421 22223333333322222346688999999999988543
No 264
>PF08662 eIF2A: Eukaryotic translation initiation factor eIF2A; InterPro: IPR013979 This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=24.23 E-value=81 Score=26.17 Aligned_cols=17 Identities=18% Similarity=0.567 Sum_probs=14.1
Q ss_pred cceeeecCCCceEEEEe
Q psy11525 2 VQSICWNTDVNILAAMQ 18 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~ 18 (244)
|..|+|+|+++-|+.+.
T Consensus 62 I~~~~WsP~g~~favi~ 78 (194)
T PF08662_consen 62 IHDVAWSPNGNEFAVIY 78 (194)
T ss_pred eEEEEECcCCCEEEEEE
Confidence 67899999999887664
No 265
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=24.08 E-value=3.3e+02 Score=26.38 Aligned_cols=106 Identities=19% Similarity=0.141 Sum_probs=68.4
Q ss_pred HHcCCHHHHHHHHhccC--------CHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHH--HHHHHhhCCCc---HHHHH
Q psy11525 89 AASHSWPQALSLCRTLN--------DDILWACLAGMATYSRDLATSEEAYAAIEQVDKV--MYINHIKGIPV---KAAQQ 155 (244)
Q Consensus 89 l~~~~~~~A~~~~~~~~--------~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v--~~l~~i~~~~~---~~~~~ 155 (244)
.+.|+|..|+.+|..+. ...+=..++.. ++ .-...||-+++..+.+ ...+.|.--+. .++++
T Consensus 194 ~Aa~rF~taLelcskg~a~~k~~~~~~~di~~vaSf-Ie----tklv~CYL~~rkpdlALnh~hrsI~lnP~~frnHLrq 268 (569)
T PF15015_consen 194 VAAGRFRTALELCSKGAALSKPFKASAEDISSVASF-IE----TKLVTCYLRMRKPDLALNHSHRSINLNPSYFRNHLRQ 268 (569)
T ss_pred HHHHHHHHHHHHHhhhhhccCCCCCChhhHHHHHHH-HH----HHHHHhhhhcCCCchHHHHHhhhhhcCcchhhHHHHH
Confidence 88999999999999872 23333333322 11 1245799999987755 44555554444 46888
Q ss_pred HHHHHHcCCHHHHHHHHH-------HCC----CHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 156 AEMYLLGGNISEAESILL-------QHG----LIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 156 A~ia~~~G~~~eAe~~~l-------~~g----~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
|.+.--+.+|.||.+-+. -+| +.-.-|++|-.. +-++|+..|..
T Consensus 269 AavfR~LeRy~eAarSamia~ymywl~g~~~q~~S~lIklyWqa-miEeAiTr~es 323 (569)
T PF15015_consen 269 AAVFRRLERYSEAARSAMIADYMYWLSGGSEQRISKLIKLYWQA-MIEEAITRAES 323 (569)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCCchHhHHHHHHHHHHH-HHHHHHhcccc
Confidence 999999999999887443 333 334456665433 55777766665
No 266
>PRK15331 chaperone protein SicA; Provisional
Probab=23.94 E-value=4.2e+02 Score=21.90 Aligned_cols=101 Identities=16% Similarity=0.075 Sum_probs=67.0
Q ss_pred HHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHCCC--------HHHHHH
Q psy11525 112 CLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQAEMYLLGGNISEAESILLQHGL--------IFRAIQ 183 (244)
Q Consensus 112 ~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A~ia~~~G~~~eAe~~~l~~g~--------~~~Ai~ 183 (244)
..|-..-+.|+++-|+..|.-+--++-.. .+-..-.|.+...+|+|++|...|..++. +..|=+
T Consensus 42 ~~Ay~~y~~Gk~~eA~~~F~~L~~~d~~n--------~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~~~dp~p~f~agq 113 (165)
T PRK15331 42 AHAYEFYNQGRLDEAETFFRFLCIYDFYN--------PDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQ 113 (165)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCcCc--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccCCCCccchHHH
Confidence 35667788999999999998765322111 11234578888899999999999987743 356778
Q ss_pred HHHHhcCHHHHHH---HHHhcCCchhHHHHHHHHHHHHcCC
Q psy11525 184 VSILTHNWDRALE---LALRHKTHIDTVLYQRKKYLDNLEK 221 (244)
Q Consensus 184 m~~~l~~W~~Al~---LA~~~~~~~~~vl~~r~~yl~~~~~ 221 (244)
.|..+++-+.|.. +|..+. .-..+-.+.+.||+...+
T Consensus 114 C~l~l~~~~~A~~~f~~a~~~~-~~~~l~~~A~~~L~~l~~ 153 (165)
T PRK15331 114 CQLLMRKAAKARQCFELVNERT-EDESLRAKALVYLEALKT 153 (165)
T ss_pred HHHHhCCHHHHHHHHHHHHhCc-chHHHHHHHHHHHHHHHc
Confidence 8888888888754 344432 323433455777766544
No 267
>KOG4340|consensus
Probab=23.87 E-value=1.7e+02 Score=27.04 Aligned_cols=56 Identities=14% Similarity=0.166 Sum_probs=44.5
Q ss_pred ccHHHHHHHHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC
Q psy11525 80 PYISVLHSYAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ 135 (244)
Q Consensus 80 p~~~~l~~~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d 135 (244)
.++..++..|...+|.+|+++...- .+..--..||--.-...++..|..||..++.
T Consensus 12 eftaviy~lI~d~ry~DaI~~l~s~~Er~p~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL~q 72 (459)
T KOG4340|consen 12 EFTAVVYRLIRDARYADAIQLLGSELERSPRSRAGLSLLGYCYYRLQEFALAAECYEQLGQ 72 (459)
T ss_pred chHHHHHHHHHHhhHHHHHHHHHHHHhcCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4778899999999999999988663 2555666677777777888888899888875
No 268
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=23.73 E-value=96 Score=18.05 Aligned_cols=23 Identities=17% Similarity=0.157 Sum_probs=19.3
Q ss_pred CCHHHHHHHHHHHHHcCCHHHHH
Q psy11525 105 NDDILWACLAGMATYSRDLATSE 127 (244)
Q Consensus 105 ~~~~lW~~LA~~Al~~~~l~~A~ 127 (244)
++...|..||..-...|+++.|+
T Consensus 11 ~n~~a~~nla~~~~~~g~~~~A~ 33 (34)
T PF13431_consen 11 NNAEAYNNLANLYLNQGDYEEAI 33 (34)
T ss_pred CCHHHHHHHHHHHHHCcCHHhhc
Confidence 46788999999988888888775
No 269
>KOG0889|consensus
Probab=23.29 E-value=4.8e+02 Score=31.76 Aligned_cols=118 Identities=15% Similarity=0.120 Sum_probs=79.8
Q ss_pred ccHHHHHHHHHcCC--HHHHHHHHhccCCHHHHHH-HHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCcHHHHHH
Q psy11525 80 PYISVLHSYAASHS--WPQALSLCRTLNDDILWAC-LAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPVKAAQQA 156 (244)
Q Consensus 80 p~~~~l~~~l~~~~--~~~A~~~~~~~~~~~lW~~-LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~~~~~~A 156 (244)
+++..|.+|+.++. |-.+..+.+.. .|.. .=....+.-.+|.-.+-|..+.+-++...+=+=+-. -.+-..|
T Consensus 2415 ~lpp~Li~yl~kt~~~wh~~I~lLE~~----~~~~~~~~~~~~~~~~dsl~elY~~L~E~Dm~~Glwrrr~~-~~eT~~a 2489 (3550)
T KOG0889|consen 2415 ELPPHLIKYLGKTYNLWHTSIRLLEDH----QSNKEMENTKGDESCLDSLAELYRSLNEEDMFYGLWRRRAK-FPETMVA 2489 (3550)
T ss_pred CCCHHHHHHHhccchHHHHHHHHHHHH----HHHHHhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhc-cHHHHHH
Confidence 47899999976643 55555555443 3433 122455566677777888888888777554332221 1234567
Q ss_pred HHHHHcCCHHHHHHHHHHCCC----------------H-HHHHHHHHHhcCHHHHHHHHHhcC
Q psy11525 157 EMYLLGGNISEAESILLQHGL----------------I-FRAIQVSILTHNWDRALELALRHK 202 (244)
Q Consensus 157 ~ia~~~G~~~eAe~~~l~~g~----------------~-~~Ai~m~~~l~~W~~Al~LA~~~~ 202 (244)
..+...|++++|+.+|.++-- | +.=|+..+.|.+||--.++++..+
T Consensus 2490 ~s~eQ~G~~e~AQ~lyekaq~Ka~~~~~~~~~~Ey~lWed~WI~Ca~eL~QWdvl~e~~k~~~ 2552 (3550)
T KOG0889|consen 2490 LSYEQLGFWEEAQSLYEKAQVKAREGAIPYSESEYKLWEDHWIRCASELQQWDVLTEFGKHEG 2552 (3550)
T ss_pred HHHHHhhhHHHHhhHHHHHHHHHhcccCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence 888899999999999987621 1 233778889999999999996644
No 270
>KOG2168|consensus
Probab=23.05 E-value=1e+02 Score=31.80 Aligned_cols=36 Identities=36% Similarity=0.387 Sum_probs=32.8
Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHh
Q psy11525 165 ISEAESILLQHGLIFRAIQVSILTHNWDRALELALR 200 (244)
Q Consensus 165 ~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~ 200 (244)
..+|..-.-..|++++||.+|.-.++++.|++++.+
T Consensus 625 ~~~vA~~a~~~G~~~~sI~LY~lag~yd~al~link 660 (835)
T KOG2168|consen 625 ILEVASEADEDGLFEDAILLYHLAGDYDKALELINK 660 (835)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHhhhhhHHHHHHHH
Confidence 467888888899999999999999999999999876
No 271
>PF04049 APC8: Anaphase promoting complex subunit 8 / Cdc23 ; InterPro: IPR007192 The anaphase-promoting complex is composed of eight protein subunits, including BimE (APC1), CDC27 (APC3), CDC16 (APC6), and CDC23 (APC8). This entry is for CDC23.; GO: 0030071 regulation of mitotic metaphase/anaphase transition, 0005680 anaphase-promoting complex
Probab=22.40 E-value=1.2e+02 Score=24.09 Aligned_cols=45 Identities=20% Similarity=0.222 Sum_probs=31.7
Q ss_pred HHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHH--HHcCCcccHH
Q psy11525 182 IQVSILTHNWDRALELALRHKTHIDTVLYQRKKYL--DNLEKIETNE 226 (244)
Q Consensus 182 i~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl--~~~~~~e~~~ 226 (244)
=+-|.|++.++||-.+.+.-.......|+++.+|| +....+++.+
T Consensus 81 AksyFD~kEy~RaA~~L~~~~s~~~~FL~lYs~YLa~EKr~~Ee~~~ 127 (142)
T PF04049_consen 81 AKSYFDCKEYDRAAHVLKDCKSPKALFLRLYSRYLAGEKRKEEEMEE 127 (142)
T ss_pred HHHHhchhHHHHHHHHHccCCCchHHHHHHHHHHHHHHHHHhhhhHh
Confidence 35677888888887777665555778999999999 3333344443
No 272
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=22.32 E-value=3e+02 Score=24.75 Aligned_cols=49 Identities=12% Similarity=0.091 Sum_probs=41.7
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCCh
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQV 136 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~ 136 (244)
++..++++.+......+ -++..|..|=+.++..++...|+++|..+...
T Consensus 163 ~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~~ 216 (280)
T COG3629 163 LIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKT 216 (280)
T ss_pred HHhcccHHHHHHHHHHHHhcCccchHHHHHHHHHHHHcCCchHHHHHHHHHHHH
Confidence 47788888888776664 47999999999999999999999999987754
No 273
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=22.23 E-value=1.4e+02 Score=16.53 Aligned_cols=17 Identities=24% Similarity=0.302 Sum_probs=11.5
Q ss_pred ceeeecCCCceEEEEeC
Q psy11525 3 QSICWNTDVNILAAMQD 19 (244)
Q Consensus 3 ~~v~W~~~~~vlv~~~~ 19 (244)
.+++.+++.+++|+=.+
T Consensus 5 ~gvav~~~g~i~VaD~~ 21 (28)
T PF01436_consen 5 HGVAVDSDGNIYVADSG 21 (28)
T ss_dssp EEEEEETTSEEEEEECC
T ss_pred cEEEEeCCCCEEEEECC
Confidence 45677777777777544
No 274
>PF13838 Clathrin_H_link: Clathrin-H-link; PDB: 2XZG_A 3GD1_I 1BPO_C 1C9I_B 1C9L_A.
Probab=22.23 E-value=1.1e+02 Score=21.32 Aligned_cols=23 Identities=4% Similarity=0.143 Sum_probs=17.0
Q ss_pred HHHHHHHHHcCCHHHHHHHHhcc
Q psy11525 82 ISVLHSYAASHSWPQALSLCRTL 104 (244)
Q Consensus 82 ~~~l~~~l~~~~~~~A~~~~~~~ 104 (244)
...+.+++..|+|++|.++|-.-
T Consensus 10 ~~~F~~l~~~g~y~eAA~~AA~s 32 (66)
T PF13838_consen 10 VQQFNELFSQGQYEEAAKVAANS 32 (66)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHS
T ss_pred HHHHHHHHHcCCHHHHHHHHHhC
Confidence 34566679999999999988653
No 275
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=22.23 E-value=6e+02 Score=23.14 Aligned_cols=99 Identities=13% Similarity=0.099 Sum_probs=54.4
Q ss_pred HHHcCCHHHHHHHHhcc-----CCHHHHHHHHHHHHHcCCHHHHHHHHHHhCC----------hhHHHHHHHhhCCCcHH
Q psy11525 88 YAASHSWPQALSLCRTL-----NDDILWACLAGMATYSRDLATSEEAYAAIEQ----------VDKVMYINHIKGIPVKA 152 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~~-----~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d----------~~~v~~l~~i~~~~~~~ 152 (244)
.++.+++.+|-.+.... .....=..||+.-+..++.+-|+..+..+-. ...+.++.+....++..
T Consensus 144 ~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~~~~~~~~l~a~i~ll~qaa~~~~~~ 223 (304)
T COG3118 144 LIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHGLQAQIELLEQAAATPEIQ 223 (304)
T ss_pred hhhccchhhHHHHHHHHHHhCcccchHHHHHHHHHHHcCChHHHHHHHHhCcccchhhHHHHHHHHHHHHHHHhcCCCHH
Confidence 36777777777654442 2233444566666677777666666665321 12355666666666655
Q ss_pred HHHHHHHHHcCCHHHH---HHHHHHCCCHHHHHHHHH
Q psy11525 153 AQQAEMYLLGGNISEA---ESILLQHGLIFRAIQVSI 186 (244)
Q Consensus 153 ~~~A~ia~~~G~~~eA---e~~~l~~g~~~~Ai~m~~ 186 (244)
.....++.-=++++-| ...+...|+++.|++...
T Consensus 224 ~l~~~~aadPdd~~aa~~lA~~~~~~g~~e~Ale~Ll 260 (304)
T COG3118 224 DLQRRLAADPDDVEAALALADQLHLVGRNEAALEHLL 260 (304)
T ss_pred HHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 5555555555555433 334555555555555443
No 276
>KOG0645|consensus
Probab=22.22 E-value=2.6e+02 Score=25.26 Aligned_cols=27 Identities=26% Similarity=0.593 Sum_probs=21.0
Q ss_pred cceeeecCCCceEEEEe-CCeEEEEEcC
Q psy11525 2 VQSICWNTDVNILAAMQ-DSALCVWFFP 28 (244)
Q Consensus 2 v~~v~W~~~~~vlv~~~-~~~l~vw~~~ 28 (244)
|.+|+|.|....|++-+ |.+.++|--.
T Consensus 64 VRsvAwsp~g~~La~aSFD~t~~Iw~k~ 91 (312)
T KOG0645|consen 64 VRSVAWSPHGRYLASASFDATVVIWKKE 91 (312)
T ss_pred eeeeeecCCCcEEEEeeccceEEEeecC
Confidence 57899999988887755 7888887444
No 277
>cd08312 Death_MyD88 Death domain of Myeloid Differentation primary response protein MyD88. Death Domain (DD) of Myeloid Differentiation primary response protein 88 (MyD88). MyD88 is an adaptor protein involved in interleukin-1 receptor (IL-1R)- and Toll-like receptor (TLR)-induced activation of nuclear factor-kappaB (NF-kB) and mitogen activated protein kinase pathways that lead to the induction of proinflammatory cytokines. It is a key component in the signaling pathway of pathogen recognition in the innate immune system. MyD88 contains an N-terminal DD and a C-terminal Toll/IL-1 Receptor (TIR) homology domain that mediates interaction with TLRs and IL-1R. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and
Probab=22.10 E-value=2.9e+02 Score=19.50 Aligned_cols=19 Identities=32% Similarity=0.340 Sum_probs=14.1
Q ss_pred HHHhccCCH----HHHHHHHHHH
Q psy11525 99 SLCRTLNDD----ILWACLAGMA 117 (244)
Q Consensus 99 ~~~~~~~~~----~lW~~LA~~A 117 (244)
++|..++.+ .+|+.||...
T Consensus 4 ~L~~~Ldp~~~~g~DWr~LA~~L 26 (79)
T cd08312 4 RLSLFLNPPRVVAADWTALAEEM 26 (79)
T ss_pred HHHHHcCCCCCcccCHHHHHHHc
Confidence 567777544 7999999764
No 278
>KOG1188|consensus
Probab=22.00 E-value=2.3e+02 Score=26.38 Aligned_cols=65 Identities=17% Similarity=0.236 Sum_probs=39.9
Q ss_pred cceeeecC-CCceEEEEeCCeEEEEEcCCccccCcccceeeeEeeccCCCCCCCeEEeEeCC---EEEEEecCCcE
Q psy11525 2 VQSICWNT-DVNILAAMQDSALCVWFFPAVVFADQGLLRKTVLLKDIGEFGKSPSIVSFVKN---HLTIRRYDGTV 73 (244)
Q Consensus 2 v~~v~W~~-~~~vlv~~~~~~l~vw~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~v~f~g~---~v~~~~~~G~~ 73 (244)
|+++.|.| ++|+|+.-+-+.|+-.+..+..--+-.|+. +. +.+-.+..++|.|. +|.|.+..+..
T Consensus 168 VT~lrFHP~~pnlLlSGSvDGLvnlfD~~~d~EeDaL~~---vi----N~~sSI~~igw~~~~ykrI~clTH~Etf 236 (376)
T KOG1188|consen 168 VTQLRFHPSDPNLLLSGSVDGLVNLFDTKKDNEEDALLH---VI----NHGSSIHLIGWLSKKYKRIMCLTHMETF 236 (376)
T ss_pred ceeEEecCCCCCeEEeecccceEEeeecCCCcchhhHHH---hh----cccceeeeeeeecCCcceEEEEEccCce
Confidence 67899997 789999877666666566554422222222 11 44455566677665 48888766544
No 279
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=21.97 E-value=2.3e+02 Score=27.07 Aligned_cols=133 Identities=15% Similarity=0.099 Sum_probs=68.9
Q ss_pred HHHHHHcCCHHHHHHHHhccCCHHHHHHHHHHHHH-cCCHHHHHHHHHHhCChhHHHHHHHhhCC-Cc----HHHHHHHH
Q psy11525 85 LHSYAASHSWPQALSLCRTLNDDILWACLAGMATY-SRDLATSEEAYAAIEQVDKVMYINHIKGI-PV----KAAQQAEM 158 (244)
Q Consensus 85 l~~~l~~~~~~~A~~~~~~~~~~~lW~~LA~~Al~-~~~l~~A~~~y~~l~d~~~v~~l~~i~~~-~~----~~~~~A~i 158 (244)
-|++...|+|++|....+.+-... --..+. ..+.+.+......+.+|-....++-.++. ++ ...|..++
T Consensus 211 gyk~~t~gKF~eA~~~Fr~iL~~i-----~l~vv~~~~E~~e~~eli~icrEYilgl~iEl~Rr~l~~~~~~~~kR~lEL 285 (422)
T PF06957_consen 211 GYKLFTAGKFEEAIEIFRSILHSI-----PLLVVESREEEDEAKELIEICREYILGLSIELERRELPKDPVEDQKRNLEL 285 (422)
T ss_dssp HHHHHHTT-HHHHHHHHHHHHHHH-----HC--BSSCHHHHHHHHHHHHHHHHHHHHHHHHHHCTS-TTTHHHHHHHHHH
T ss_pred HHHHHhcCCHHHHHHHHHHHHHHh-----heeeecCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhHHHHHHH
Confidence 355699999999999888751000 000000 01122233333333333333334433332 21 22577899
Q ss_pred HHHcCCHHHHHHHHHHCCCHHHHHHHHHHhcCHHHHHHHHHhc---CCchhHHHHHHHHHHHHcCCcccH
Q psy11525 159 YLLGGNISEAESILLQHGLIFRAIQVSILTHNWDRALELALRH---KTHIDTVLYQRKKYLDNLEKIETN 225 (244)
Q Consensus 159 a~~~G~~~eAe~~~l~~g~~~~Ai~m~~~l~~W~~Al~LA~~~---~~~~~~vl~~r~~yl~~~~~~e~~ 225 (244)
++|+-.++.--.+..-+ ...|+.+.-++.++--|-.+|++. ++. ..+....++.+....+..++
T Consensus 286 AAYFThc~LQp~H~~La--Lr~AM~~~~K~KNf~tAa~FArRLLel~p~-~~~a~qArKil~~~e~~~tD 352 (422)
T PF06957_consen 286 AAYFTHCKLQPSHLILA--LRSAMSQAFKLKNFITAASFARRLLELNPS-PEVAEQARKILQACERNPTD 352 (422)
T ss_dssp HHHHCCS---HHHHHHH--HHHHHHHCCCTTBHHHHHHHHHHHHCT--S-CHHHHHHHHHHHHHCCS--B
T ss_pred HHHHhcCCCcHHHHHHH--HHHHHHHHHHhccHHHHHHHHHHHHHcCCC-HHHHHHHHHHHHHHhcCCCC
Confidence 99988876654444332 466788888888888888888763 222 34445667777655554443
No 280
>smart00035 CLa CLUSTERIN alpha chain.
Probab=21.65 E-value=76 Score=27.32 Aligned_cols=60 Identities=20% Similarity=0.175 Sum_probs=36.9
Q ss_pred HHHHHHcCCHHHHHHHHH-HC--CCHHHHHHHHHHhcCHHHHHHHHHhcCCchhHHHHHHHHHHHHc
Q psy11525 156 AEMYLLGGNISEAESILL-QH--GLIFRAIQVSILTHNWDRALELALRHKTHIDTVLYQRKKYLDNL 219 (244)
Q Consensus 156 A~ia~~~G~~~eAe~~~l-~~--g~~~~Ai~m~~~l~~W~~Al~LA~~~~~~~~~vl~~r~~yl~~~ 219 (244)
+.+.-+.++++.-.+++. .| .+|.. -+++. ..++||+||+......+++|...+.+++++
T Consensus 66 sgClkm~dqCEKCqeiLsvDCs~~~P~q-~~Lr~---El~eAL~LaE~ftqqYd~lL~~~q~~m~nT 128 (216)
T smart00035 66 TGCLKMKDQCEKCQEILSVDCSTNNPDQ-PQLRQ---ELDESLQLAERFTQQYDQLLQSYQKKMLNT 128 (216)
T ss_pred HHHHHHHHHHHHHHHHHhhhccCCCcch-HHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHhH
Confidence 334445555555544442 22 22221 24444 578999999998877888888888877543
No 281
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=21.57 E-value=59 Score=26.20 Aligned_cols=22 Identities=27% Similarity=0.492 Sum_probs=18.0
Q ss_pred HHHHHHHHcCCHHHHHHHHhcc
Q psy11525 83 SVLHSYAASHSWPQALSLCRTL 104 (244)
Q Consensus 83 ~~l~~~l~~~~~~~A~~~~~~~ 104 (244)
..=.-|++.|+|++|+++|+.-
T Consensus 108 ~vWk~yl~~~~fd~Al~~~~~~ 129 (147)
T PF05131_consen 108 DVWKIYLDKGDFDEALQYCKTN 129 (147)
T ss_pred HHHHHHHhcCcHHHHHHHccCC
Confidence 3444579999999999999883
No 282
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=21.45 E-value=4.9e+02 Score=23.14 Aligned_cols=82 Identities=22% Similarity=0.125 Sum_probs=58.5
Q ss_pred HHHcCCHHHHHHHHhc-----cCCHHHHHHHHHHHHHcCCHHHHHHHHHHhCChhHHHHHHHhhCCCc--H--HHHHHHH
Q psy11525 88 YAASHSWPQALSLCRT-----LNDDILWACLAGMATYSRDLATSEEAYAAIEQVDKVMYINHIKGIPV--K--AAQQAEM 158 (244)
Q Consensus 88 ~l~~~~~~~A~~~~~~-----~~~~~lW~~LA~~Al~~~~l~~A~~~y~~l~d~~~v~~l~~i~~~~~--~--~~~~A~i 158 (244)
|...|++++|-..-+. ...+..-.+||..-+-.++++.|+.-+.. ....+. . ....|.+
T Consensus 144 ldq~Gr~~~Ar~ay~qAl~L~~~~p~~~nNlgms~~L~gd~~~A~~lll~------------a~l~~~ad~~v~~NLAl~ 211 (257)
T COG5010 144 LDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAETLLLP------------AYLSPAADSRVRQNLALV 211 (257)
T ss_pred HHHccChhHHHHHHHHHHHhccCCchhhhhHHHHHHHcCCHHHHHHHHHH------------HHhCCCCchHHHHHHHHH
Confidence 5889999988664443 37889999999999999999999865543 222221 1 2345677
Q ss_pred HHHcCCHHHHHHHHHHCCCHHHH
Q psy11525 159 YLLGGNISEAESILLQHGLIFRA 181 (244)
Q Consensus 159 a~~~G~~~eAe~~~l~~g~~~~A 181 (244)
.-..|++++|+++-.+--..+.+
T Consensus 212 ~~~~g~~~~A~~i~~~e~~~~~~ 234 (257)
T COG5010 212 VGLQGDFREAEDIAVQELLSEQA 234 (257)
T ss_pred HhhcCChHHHHhhccccccchhH
Confidence 77899999999987665444444
No 283
>PF14669 Asp_Glu_race_2: Putative aspartate racemase
Probab=21.37 E-value=79 Score=27.20 Aligned_cols=23 Identities=30% Similarity=0.245 Sum_probs=18.4
Q ss_pred HHHHHHHHHHCCCHHHHHHHHHH
Q psy11525 165 ISEAESILLQHGLIFRAIQVSIL 187 (244)
Q Consensus 165 ~~eAe~~~l~~g~~~~Ai~m~~~ 187 (244)
.+-|.++|+++|.++.|+.+.++
T Consensus 184 vn~AaEiFL~sgsidGA~~vLre 206 (233)
T PF14669_consen 184 VNIAAEIFLKSGSIDGALWVLRE 206 (233)
T ss_pred HHHHHHHHHHcCCchHHHHHHhc
Confidence 46688888888888888888763
Done!