Query psy11108
Match_columns 106
No_of_seqs 107 out of 207
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 18:08:52 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11108.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11108hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2041|consensus 99.9 3E-26 6.5E-31 196.6 4.9 97 8-104 679-779 (1189)
2 KOG1538|consensus 99.7 1.9E-17 4.1E-22 142.1 4.5 94 10-103 568-676 (1081)
3 KOG1524|consensus 99.7 1.6E-16 3.5E-21 133.7 6.4 90 11-103 595-688 (737)
4 KOG3616|consensus 98.8 1.9E-09 4E-14 94.9 3.8 87 17-103 473-575 (1636)
5 PF04053 Coatomer_WDAD: Coatom 98.6 1.3E-07 2.8E-12 78.4 7.6 88 7-94 333-433 (443)
6 KOG0292|consensus 97.8 4.6E-05 9.9E-10 68.3 5.5 88 8-95 659-759 (1202)
7 KOG3617|consensus 97.4 0.00028 6.2E-09 63.4 5.9 80 12-94 751-831 (1416)
8 KOG0276|consensus 97.0 0.0018 3.8E-08 56.6 6.1 86 8-93 653-751 (794)
9 KOG2247|consensus 96.9 0.00026 5.6E-09 60.3 0.3 82 21-102 3-87 (615)
10 PF13414 TPR_11: TPR repeat; P 94.6 0.17 3.7E-06 30.1 5.7 62 20-89 2-64 (69)
11 PF13432 TPR_16: Tetratricopep 94.0 0.45 9.7E-06 28.0 6.6 56 26-89 2-57 (65)
12 PF09976 TPR_21: Tetratricopep 93.4 1.2 2.6E-05 30.6 8.8 81 3-88 25-110 (145)
13 TIGR02552 LcrH_SycD type III s 93.1 0.63 1.4E-05 30.7 6.7 65 17-89 47-111 (135)
14 PRK10747 putative protoheme IX 91.1 1.6 3.5E-05 35.1 8.1 72 7-89 313-387 (398)
15 PF10602 RPN7: 26S proteasome 90.8 1 2.3E-05 32.8 6.2 72 22-93 37-143 (177)
16 TIGR02521 type_IV_pilW type IV 90.4 2.2 4.8E-05 29.2 7.3 30 18-47 62-91 (234)
17 PRK15359 type III secretion sy 90.3 1.9 4.1E-05 29.9 6.9 64 18-89 55-118 (144)
18 PRK10370 formate-dependent nit 89.9 3.3 7.2E-05 30.4 8.3 73 17-89 69-170 (198)
19 PF12895 Apc3: Anaphase-promot 89.8 1.2 2.6E-05 27.7 5.1 60 20-88 24-83 (84)
20 PF09976 TPR_21: Tetratricopep 89.4 4.7 0.0001 27.6 8.3 57 24-89 88-144 (145)
21 cd00189 TPR Tetratricopeptide 88.6 2.7 5.9E-05 23.6 5.8 62 20-89 33-94 (100)
22 COG5010 TadD Flp pilus assembl 88.4 1.6 3.6E-05 34.4 5.9 66 16-89 129-194 (257)
23 TIGR02521 type_IV_pilW type IV 88.2 3.7 7.9E-05 28.1 7.1 27 21-47 31-57 (234)
24 PF13424 TPR_12: Tetratricopep 88.0 0.44 9.6E-06 29.0 2.1 62 22-89 6-72 (78)
25 PF14559 TPR_19: Tetratricopep 87.7 2.2 4.8E-05 24.9 5.1 50 32-89 2-51 (68)
26 PLN03081 pentatricopeptide (PP 87.0 4 8.8E-05 35.0 8.0 77 8-92 478-557 (697)
27 PF07719 TPR_2: Tetratricopept 86.4 1.2 2.5E-05 22.7 2.9 23 67-89 5-27 (34)
28 PF11349 DUF3151: Protein of u 86.3 1 2.3E-05 32.1 3.5 72 13-85 27-114 (129)
29 PLN03077 Protein ECB2; Provisi 86.1 4.2 9E-05 35.7 7.8 74 8-90 641-718 (857)
30 PRK11189 lipoprotein NlpI; Pro 86.0 6.1 0.00013 30.5 8.0 62 17-88 94-157 (296)
31 PF13374 TPR_10: Tetratricopep 86.0 1.1 2.4E-05 23.6 2.8 22 68-89 7-28 (42)
32 PRK10153 DNA-binding transcrip 85.7 3.2 7E-05 35.3 6.7 65 17-90 416-480 (517)
33 PF13176 TPR_7: Tetratricopept 85.7 1.3 2.7E-05 23.8 2.9 21 68-88 4-24 (36)
34 KOG3616|consensus 85.7 0.21 4.6E-06 45.5 -0.4 77 24-100 768-861 (1636)
35 PRK12370 invasion protein regu 84.9 4.9 0.00011 33.8 7.4 31 17-47 334-364 (553)
36 PLN03088 SGT1, suppressor of 84.8 3.8 8.3E-05 32.8 6.5 65 17-89 32-96 (356)
37 COG3071 HemY Uncharacterized e 84.1 8.5 0.00018 32.2 8.3 71 7-88 313-386 (400)
38 PRK11447 cellulose synthase su 84.1 8.1 0.00018 35.5 8.9 74 16-89 57-138 (1157)
39 PF07721 TPR_4: Tetratricopept 83.9 2 4.3E-05 21.6 3.0 21 67-87 5-25 (26)
40 PF13371 TPR_9: Tetratricopept 83.7 5.5 0.00012 23.5 5.5 50 28-89 2-55 (73)
41 cd00189 TPR Tetratricopeptide 83.6 5.5 0.00012 22.3 5.3 58 24-89 3-60 (100)
42 PRK15179 Vi polysaccharide bio 81.9 7.1 0.00015 34.5 7.4 85 6-90 70-181 (694)
43 PRK11788 tetratricopeptide rep 81.3 9.6 0.00021 29.4 7.3 23 67-89 218-240 (389)
44 PF13428 TPR_14: Tetratricopep 80.8 3.2 7E-05 23.0 3.4 23 67-89 5-27 (44)
45 PF07719 TPR_2: Tetratricopept 80.8 4.4 9.6E-05 20.4 3.7 27 21-47 1-27 (34)
46 PF00515 TPR_1: Tetratricopept 80.7 2.8 6E-05 21.5 2.9 23 67-89 5-27 (34)
47 CHL00033 ycf3 photosystem I as 80.2 15 0.00034 25.4 7.5 64 21-89 35-98 (168)
48 TIGR00540 hemY_coli hemY prote 79.9 12 0.00027 30.0 7.7 69 12-89 323-396 (409)
49 PRK15359 type III secretion sy 79.3 11 0.00025 25.9 6.5 62 21-90 24-85 (144)
50 TIGR02917 PEP_TPR_lipo putativ 79.1 13 0.00027 30.9 7.6 30 18-47 598-627 (899)
51 PF13181 TPR_8: Tetratricopept 79.0 3.9 8.5E-05 20.8 3.2 23 67-89 5-27 (34)
52 PF13429 TPR_15: Tetratricopep 78.3 0.68 1.5E-05 34.9 0.0 32 18-49 41-72 (280)
53 KOG2041|consensus 76.0 7.7 0.00017 35.5 5.8 81 22-102 797-895 (1189)
54 TIGR00990 3a0801s09 mitochondr 75.9 14 0.00029 31.3 7.1 74 17-90 395-494 (615)
55 smart00028 TPR Tetratricopepti 75.5 4.8 0.0001 18.3 2.7 23 67-89 5-27 (34)
56 PRK11788 tetratricopeptide rep 74.9 26 0.00056 27.0 8.0 29 19-47 67-95 (389)
57 COG4235 Cytochrome c biogenesi 74.9 14 0.00031 29.5 6.6 42 7-48 141-183 (287)
58 PF13428 TPR_14: Tetratricopep 74.7 6.1 0.00013 21.8 3.4 27 21-47 1-27 (44)
59 TIGR02917 PEP_TPR_lipo putativ 74.1 20 0.00044 29.7 7.6 30 18-47 190-219 (899)
60 PF13429 TPR_15: Tetratricopep 73.3 27 0.0006 26.1 7.6 82 8-89 133-240 (280)
61 COG2976 Uncharacterized protei 72.0 40 0.00087 25.9 8.2 40 6-45 70-113 (207)
62 PF13432 TPR_16: Tetratricopep 71.3 5.6 0.00012 23.1 2.8 22 68-89 2-23 (65)
63 PF12862 Apc5: Anaphase-promot 69.8 6.7 0.00015 25.3 3.2 61 29-89 6-67 (94)
64 PF13174 TPR_6: Tetratricopept 68.9 7.2 0.00016 19.4 2.6 22 67-88 4-25 (33)
65 PF13424 TPR_12: Tetratricopep 68.8 6.3 0.00014 23.7 2.8 22 68-89 10-31 (78)
66 PRK09782 bacteriophage N4 rece 68.6 31 0.00067 31.8 8.1 31 17-47 538-568 (987)
67 PF04053 Coatomer_WDAD: Coatom 67.6 8.3 0.00018 32.3 4.0 27 67-93 351-377 (443)
68 TIGR00990 3a0801s09 mitochondr 66.9 34 0.00074 28.9 7.6 20 69-88 405-424 (615)
69 PF13414 TPR_11: TPR repeat; P 66.6 6.8 0.00015 22.9 2.5 23 67-89 7-29 (69)
70 PRK15174 Vi polysaccharide exp 65.7 36 0.00079 29.4 7.6 30 18-47 107-136 (656)
71 TIGR03302 OM_YfiO outer membra 64.7 50 0.0011 23.7 7.8 64 18-89 30-96 (235)
72 KOG1155|consensus 64.3 37 0.00081 29.5 7.2 61 18-89 429-492 (559)
73 PF02259 FAT: FAT domain; Int 62.5 40 0.00086 25.5 6.6 76 8-87 129-208 (352)
74 PRK11447 cellulose synthase su 62.4 43 0.00092 30.9 7.8 73 17-89 599-697 (1157)
75 KOG0985|consensus 62.2 13 0.00028 35.4 4.4 46 7-52 1090-1135(1666)
76 PF08631 SPO22: Meiosis protei 62.1 8.7 0.00019 29.5 2.9 60 30-89 2-62 (278)
77 PRK15174 Vi polysaccharide exp 61.9 43 0.00094 29.0 7.4 31 17-47 72-102 (656)
78 TIGR02795 tol_pal_ybgF tol-pal 60.6 38 0.00082 21.0 6.9 58 23-88 4-64 (119)
79 PRK15363 pathogenicity island 60.4 61 0.0013 23.7 7.0 65 17-89 65-129 (157)
80 PLN03098 LPA1 LOW PSII ACCUMUL 59.6 48 0.001 28.3 7.1 63 16-89 70-138 (453)
81 CHL00033 ycf3 photosystem I as 59.2 45 0.00097 23.0 6.0 68 21-89 72-139 (168)
82 TIGR02552 LcrH_SycD type III s 58.9 36 0.00079 22.1 5.2 60 22-89 18-77 (135)
83 TIGR00540 hemY_coli hemY prote 58.6 44 0.00095 26.8 6.5 23 27-49 90-112 (409)
84 cd05804 StaR_like StaR_like; a 57.4 59 0.0013 24.7 6.8 23 68-90 191-213 (355)
85 PRK10049 pgaA outer membrane p 56.6 56 0.0012 28.7 7.3 62 17-89 389-453 (765)
86 PF04733 Coatomer_E: Coatomer 56.4 95 0.0021 24.3 8.5 80 10-89 120-227 (290)
87 PRK15331 chaperone protein Sic 56.1 24 0.00052 26.1 4.3 33 55-87 25-61 (165)
88 KOG2076|consensus 55.9 60 0.0013 30.0 7.4 61 18-88 204-266 (895)
89 PRK12370 invasion protein regu 55.6 64 0.0014 27.2 7.3 73 17-89 368-467 (553)
90 PLN03088 SGT1, suppressor of 55.3 58 0.0013 26.0 6.7 60 22-89 3-62 (356)
91 KOG1920|consensus 54.9 42 0.00091 32.0 6.4 74 24-97 955-1034(1265)
92 PRK10747 putative protoheme IX 54.5 51 0.0011 26.5 6.3 22 68-89 158-179 (398)
93 KOG3060|consensus 53.7 43 0.00094 26.9 5.6 58 23-88 54-111 (289)
94 cd05804 StaR_like StaR_like; a 53.5 61 0.0013 24.6 6.4 58 23-88 116-173 (355)
95 PRK09782 bacteriophage N4 rece 51.4 84 0.0018 29.1 7.8 75 15-89 603-703 (987)
96 KOG1840|consensus 51.0 24 0.00052 30.3 4.0 61 24-89 328-393 (508)
97 KOG0543|consensus 49.1 51 0.0011 27.7 5.5 65 25-90 249-318 (397)
98 PF11817 Foie-gras_1: Foie gra 48.6 61 0.0013 24.5 5.6 43 38-89 162-204 (247)
99 PF07219 HemY_N: HemY protein 47.3 48 0.001 22.0 4.4 27 65-91 61-87 (108)
100 PRK10049 pgaA outer membrane p 46.4 1.1E+02 0.0024 27.0 7.5 29 18-46 46-74 (765)
101 PF00637 Clathrin: Region in C 45.6 23 0.00051 23.7 2.7 55 42-98 50-105 (143)
102 KOG2796|consensus 45.1 54 0.0012 26.9 5.0 29 20-48 211-239 (366)
103 KOG1538|consensus 44.9 29 0.00064 31.6 3.7 64 33-97 759-838 (1081)
104 COG3063 PilF Tfp pilus assembl 44.6 92 0.002 24.6 6.1 63 18-88 135-198 (250)
105 TIGR02795 tol_pal_ybgF tol-pal 44.4 75 0.0016 19.6 5.8 63 21-89 39-102 (119)
106 COG1729 Uncharacterized protei 43.6 77 0.0017 25.1 5.6 71 9-89 162-241 (262)
107 PF12854 PPR_1: PPR repeat 42.7 31 0.00067 18.2 2.4 16 73-88 17-32 (34)
108 KOG1840|consensus 42.6 76 0.0017 27.3 5.8 61 24-89 202-267 (508)
109 PRK10803 tol-pal system protei 40.5 1.7E+02 0.0037 22.6 7.9 73 8-88 163-242 (263)
110 PF14938 SNAP: Soluble NSF att 40.0 24 0.00053 26.9 2.3 27 67-93 159-185 (282)
111 TIGR00756 PPR pentatricopeptid 39.8 47 0.001 16.0 2.7 17 73-89 10-26 (35)
112 PRK02603 photosystem I assembl 39.2 1.3E+02 0.0028 20.8 6.8 27 21-47 72-98 (172)
113 COG1729 Uncharacterized protei 39.1 82 0.0018 24.9 5.1 52 27-89 147-204 (262)
114 PF10516 SHNi-TPR: SHNi-TPR; 39.1 51 0.0011 18.4 3.0 24 66-89 4-27 (38)
115 PRK10370 formate-dependent nit 38.6 1E+02 0.0022 22.4 5.4 31 17-47 140-170 (198)
116 PF01535 PPR: PPR repeat; Int 37.7 40 0.00086 16.2 2.2 18 72-89 9-26 (31)
117 PRK02603 photosystem I assembl 36.5 1.4E+02 0.0031 20.6 8.0 61 21-89 35-98 (172)
118 KOG1129|consensus 35.2 78 0.0017 26.8 4.6 71 10-90 383-456 (478)
119 PF12569 NARP1: NMDA receptor- 34.9 44 0.00095 28.6 3.2 21 68-88 43-63 (517)
120 PF07720 TPR_3: Tetratricopept 34.7 54 0.0012 17.9 2.6 17 69-85 7-23 (36)
121 KOG1127|consensus 34.7 32 0.00069 32.5 2.4 73 6-89 838-910 (1238)
122 PF11846 DUF3366: Domain of un 34.1 86 0.0019 22.4 4.3 32 61-92 140-173 (193)
123 PF13431 TPR_17: Tetratricopep 34.0 56 0.0012 17.2 2.5 25 17-41 9-33 (34)
124 PF13041 PPR_2: PPR repeat fam 33.9 56 0.0012 18.1 2.6 18 73-90 13-30 (50)
125 KOG3785|consensus 32.8 1.6E+02 0.0035 25.3 6.0 70 24-93 326-423 (557)
126 KOG3785|consensus 32.4 1.4E+02 0.003 25.6 5.7 62 25-93 397-458 (557)
127 PF02064 MAS20: MAS20 protein 32.3 70 0.0015 22.3 3.4 25 65-89 65-89 (121)
128 PRK11189 lipoprotein NlpI; Pro 32.3 2.3E+02 0.005 21.7 6.8 58 21-89 64-124 (296)
129 PRK15363 pathogenicity island 31.6 1.9E+02 0.0041 21.1 5.7 60 26-93 40-99 (157)
130 PLN03218 maturation of RBCL 1; 31.1 1.9E+02 0.0042 27.1 6.8 29 19-47 612-640 (1060)
131 PLN03081 pentatricopeptide (PP 30.7 2.7E+02 0.006 23.9 7.4 73 20-92 289-389 (697)
132 PRK15331 chaperone protein Sic 30.1 1.9E+02 0.004 21.4 5.5 66 16-89 66-131 (165)
133 PHA02608 67 prohead core prote 30.0 56 0.0012 21.5 2.4 31 30-60 6-36 (80)
134 KOG1126|consensus 29.3 91 0.002 27.8 4.2 74 17-90 417-516 (638)
135 PF04762 IKI3: IKI3 family; I 28.3 48 0.0011 30.4 2.5 59 31-90 868-928 (928)
136 COG3063 PilF Tfp pilus assembl 28.1 3.1E+02 0.0066 21.8 7.7 30 18-47 66-95 (250)
137 KOG3081|consensus 26.3 67 0.0015 26.0 2.7 21 69-89 213-233 (299)
138 COG4783 Putative Zn-dependent 25.6 3.9E+02 0.0085 23.1 7.3 31 17-47 336-366 (484)
139 KOG2404|consensus 25.4 51 0.0011 27.8 1.9 24 13-36 258-281 (477)
140 KOG0292|consensus 24.6 1.5E+02 0.0033 28.0 4.9 63 11-92 639-701 (1202)
141 KOG2066|consensus 24.4 2.2E+02 0.0048 26.2 5.8 53 7-61 676-728 (846)
142 smart00299 CLH Clathrin heavy 23.9 1.3E+02 0.0027 20.0 3.5 27 71-97 90-117 (140)
143 COG4700 Uncharacterized protei 23.9 81 0.0017 24.7 2.7 24 66-89 92-115 (251)
144 PLN03218 maturation of RBCL 1; 23.9 3.9E+02 0.0084 25.1 7.5 14 75-88 731-744 (1060)
145 PF08311 Mad3_BUB1_I: Mad3/BUB 23.9 84 0.0018 21.5 2.6 21 68-88 104-124 (126)
146 KOG3617|consensus 22.5 84 0.0018 29.7 2.9 86 12-97 903-1001(1416)
147 PF13812 PPR_3: Pentatricopept 22.0 1.1E+02 0.0024 14.9 2.2 24 24-47 4-27 (34)
148 PLN03077 Protein ECB2; Provisi 21.9 4E+02 0.0086 23.5 6.9 29 20-48 322-350 (857)
149 TIGR03276 Phn-HD phosphonate d 21.7 42 0.00091 25.0 0.8 32 72-103 123-156 (179)
150 KOG1155|consensus 21.6 4.7E+02 0.01 23.0 7.0 72 18-89 361-458 (559)
151 COG1447 CelC Phosphotransferas 21.5 1.2E+02 0.0026 20.9 2.9 25 66-90 22-46 (105)
152 COG4783 Putative Zn-dependent 21.2 5.5E+02 0.012 22.3 7.5 31 16-46 301-331 (484)
153 KOG0495|consensus 20.9 4.8E+02 0.01 24.1 7.1 22 67-88 655-676 (913)
154 PF14346 DUF4398: Domain of un 20.5 1.4E+02 0.003 19.4 3.0 56 35-90 17-72 (103)
155 cd02680 MIT_calpain7_2 MIT: do 20.4 65 0.0014 20.7 1.3 15 75-89 18-32 (75)
156 COG4235 Cytochrome c biogenesi 20.4 89 0.0019 25.1 2.4 26 67-92 160-185 (287)
157 PF13525 YfiO: Outer membrane 20.4 3.4E+02 0.0073 19.5 5.4 59 24-88 8-67 (203)
158 PF12234 Rav1p_C: RAVE protein 20.1 1.4E+02 0.0029 26.6 3.6 52 46-97 432-486 (631)
No 1
>KOG2041|consensus
Probab=99.92 E-value=3e-26 Score=196.65 Aligned_cols=97 Identities=29% Similarity=0.404 Sum_probs=95.0
Q ss_pred HhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHH
Q psy11108 8 IASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQL 87 (106)
Q Consensus 8 ~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l 87 (106)
|-+|..|++++|||.||++||+.||.+++|++||.||+||+||.+|.|+++|+.+.++++++|||.+|.|+|+|||++|+
T Consensus 679 ledA~qfiEdnPHprLWrllAe~Al~Kl~l~tAE~AFVrc~dY~Gik~vkrl~~i~s~~~q~aei~~~~g~feeaek~yl 758 (1189)
T KOG2041|consen 679 LEDAIQFIEDNPHPRLWRLLAEYALFKLALDTAEHAFVRCGDYAGIKLVKRLRTIHSKEQQRAEISAFYGEFEEAEKLYL 758 (1189)
T ss_pred hHHHHHHHhcCCchHHHHHHHHHHHHHHhhhhHhhhhhhhccccchhHHHHhhhhhhHHHHhHhHhhhhcchhHhhhhhh
Confidence 67899999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred HcCCccccccCC----Ccccc
Q psy11108 88 QADRRTWKSKPS----YWTRF 104 (106)
Q Consensus 88 ~ag~~~~Aiemr----~W~r~ 104 (106)
.++|.|+||+|| +|+||
T Consensus 759 d~drrDLAielr~klgDwfrV 779 (1189)
T KOG2041|consen 759 DADRRDLAIELRKKLGDWFRV 779 (1189)
T ss_pred ccchhhhhHHHHHhhhhHHHH
Confidence 999999999987 89997
No 2
>KOG1538|consensus
Probab=99.68 E-value=1.9e-17 Score=142.12 Aligned_cols=94 Identities=17% Similarity=0.119 Sum_probs=85.2
Q ss_pred hhhhhhcCCCCHHH------HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCC-----cHHHHHHHHHHHcCC
Q psy11108 10 SRHSFVLCPYVAGQ------ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALP-----DDIMKHAYIAQHQGR 78 (106)
Q Consensus 10 ~~~~f~~~~~~~~l------W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~-----~~~~~~Aei~ay~Gk 78 (106)
+|-.|-+++..+|+ |+.||..||++++|++|++||+|++|.+++++|..|++.+ .++++.|++++|+||
T Consensus 568 eag~f~ea~~iaclgVv~~DW~~LA~~ALeAL~f~~ARkAY~rVRdl~~L~li~EL~~~k~rge~P~~iLlA~~~Ay~gK 647 (1081)
T KOG1538|consen 568 ERGLFKEAYQIACLGVTDTDWRELAMEALEALDFETARKAYIRVRDLRYLELISELEERKKRGETPNDLLLADVFAYQGK 647 (1081)
T ss_pred hccchhhhhcccccceecchHHHHHHHHHhhhhhHHHHHHHHHHhccHHHHHHHHHHHHHhcCCCchHHHHHHHHHhhhh
Confidence 56678888888876 9999999999999999999999999999999999998763 367999999999999
Q ss_pred HHHHHHHHHHcCCccccccC----CCccc
Q psy11108 79 FQTAENYQLQADRRTWKSKP----SYWTR 103 (106)
Q Consensus 79 f~EAe~i~l~ag~~~~Aiem----r~W~r 103 (106)
|.||+++|+++|+...|+|| |+|+-
T Consensus 648 F~EAAklFk~~G~enRAlEmyTDlRMFD~ 676 (1081)
T KOG1538|consen 648 FHEAAKLFKRSGHENRALEMYTDLRMFDY 676 (1081)
T ss_pred HHHHHHHHHHcCchhhHHHHHHHHHHHHH
Confidence 99999999999999999995 56653
No 3
>KOG1524|consensus
Probab=99.65 E-value=1.6e-16 Score=133.72 Aligned_cols=90 Identities=20% Similarity=0.189 Sum_probs=85.8
Q ss_pred hhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 11 RHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 11 ~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
-|.|++. +++|++||..|..+.++-++|.||+++++.++|++|++|+++++++-++|++.++.||..|||.+++++|
T Consensus 595 LCrfv~e---qTMWAtlAa~Av~~~~m~~~EiAYaA~~~idKVsyin~iK~ltske~~mA~~~l~~G~~~eAe~iLl~~g 671 (737)
T KOG1524|consen 595 LCRFVQE---QTMWATLAAVAVRKHQMQISEIAYAAALQIDKVSYINHIKALTSKEEQMAENSLMLGRMLEAETILLHGG 671 (737)
T ss_pred HHHhccc---hHHHHHHHHHHHhhccccHHHHHHHHhhchhhHHHHHHHhccCcHHHHHHHHHHHhccchhhhHHHHhcc
Confidence 3678877 8999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred Ccc----ccccCCCccc
Q psy11108 91 RRT----WKSKPSYWTR 103 (106)
Q Consensus 91 ~~~----~Aiemr~W~r 103 (106)
... ++|+|++|.|
T Consensus 672 l~~qav~lni~m~nW~R 688 (737)
T KOG1524|consen 672 LIEQAVGLNIRMHNWRR 688 (737)
T ss_pred hHHHhhhhhhhhhhHHH
Confidence 996 7788999998
No 4
>KOG3616|consensus
Probab=98.85 E-value=1.9e-09 Score=94.92 Aligned_cols=87 Identities=15% Similarity=0.208 Sum_probs=76.5
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHH------------HHHHHHHHHcCCHHHHHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDI------------MKHAYIAQHQGRFQTAEN 84 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~------------~~~Aei~ay~Gkf~EAe~ 84 (106)
.+...++|..|++.||+..+|-||++||.+++|+.++++++.+.++-+.. ..+|.++++-.+|+|||.
T Consensus 473 ~~da~amw~~laelale~~nl~iaercfaai~dvak~r~lhd~~eiadeas~~~ggdgt~fykvra~lail~kkfk~ae~ 552 (1636)
T KOG3616|consen 473 GPDAEAMWIRLAELALEAGNLFIAERCFAAIGDVAKARFLHDILEIADEASIEIGGDGTDFYKVRAMLAILEKKFKEAEM 552 (1636)
T ss_pred CccHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhCCCCchHHHHHHHHHHHHhhhhHHHH
Confidence 34557889999999999999999999999999999999999998875432 357999999999999999
Q ss_pred HHHHcCCccccccC----CCccc
Q psy11108 85 YQLQADRRTWKSKP----SYWTR 103 (106)
Q Consensus 85 i~l~ag~~~~Aiem----r~W~r 103 (106)
||++.+-.+.||-| ++|++
T Consensus 553 ifleqn~te~aigmy~~lhkwde 575 (1636)
T KOG3616|consen 553 IFLEQNATEEAIGMYQELHKWDE 575 (1636)
T ss_pred HHHhcccHHHHHHHHHHHHhHHH
Confidence 99999999988875 58875
No 5
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=98.62 E-value=1.3e-07 Score=78.38 Aligned_cols=88 Identities=14% Similarity=0.087 Sum_probs=65.9
Q ss_pred HHhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHH-------------HHH
Q psy11108 7 VIASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHA-------------YIA 73 (106)
Q Consensus 7 ~~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~A-------------ei~ 73 (106)
-|..|.+..+.-+.+.-|+.||..||..+|+++||+||.+++|++++.||-.+-.-.++-...| ..+
T Consensus 333 ~L~~A~~~a~~~~~~~~W~~Lg~~AL~~g~~~lAe~c~~k~~d~~~L~lLy~~~g~~~~L~kl~~~a~~~~~~n~af~~~ 412 (443)
T PF04053_consen 333 NLDIALEIAKELDDPEKWKQLGDEALRQGNIELAEECYQKAKDFSGLLLLYSSTGDREKLSKLAKIAEERGDINIAFQAA 412 (443)
T ss_dssp -HHHHHHHCCCCSTHHHHHHHHHHHHHTTBHHHHHHHHHHCT-HHHHHHHHHHCT-HHHHHHHHHHHHHTT-HHHHHHHH
T ss_pred CHHHHHHHHHhcCcHHHHHHHHHHHHHcCCHHHHHHHHHhhcCccccHHHHHHhCCHHHHHHHHHHHHHccCHHHHHHHH
Confidence 3677889999999999999999999999999999999999999999999987754332222223 345
Q ss_pred HHcCCHHHHHHHHHHcCCccc
Q psy11108 74 QHQGRFQTAENYQLQADRRTW 94 (106)
Q Consensus 74 ay~Gkf~EAe~i~l~ag~~~~ 94 (106)
.+.|++++--++++++||.-+
T Consensus 413 ~~lgd~~~cv~lL~~~~~~~~ 433 (443)
T PF04053_consen 413 LLLGDVEECVDLLIETGRLPE 433 (443)
T ss_dssp HHHT-HHHHHHHHHHTT-HHH
T ss_pred HHcCCHHHHHHHHHHcCCchH
Confidence 566777777777777777543
No 6
>KOG0292|consensus
Probab=97.76 E-value=4.6e-05 Score=68.28 Aligned_cols=88 Identities=8% Similarity=-0.029 Sum_probs=71.8
Q ss_pred HhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHH-------HhhCCCc------HHHHHHHHHH
Q psy11108 8 IASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIK-------SLRALPD------DIMKHAYIAQ 74 (106)
Q Consensus 8 ~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~-------~l~~~~~------~~~~~Aei~a 74 (106)
|-.|.+..++-..+..|+.|++.||...|-.|||.||.+++++++|.|+- +|.++.+ +..-+=..++
T Consensus 659 le~ale~akkldd~d~w~rLge~Al~qgn~~IaEm~yQ~~knfekLsfLYliTgn~eKL~Km~~iae~r~D~~~~~qnal 738 (1202)
T KOG0292|consen 659 LEVALEAAKKLDDKDVWERLGEEALRQGNHQIAEMCYQRTKNFEKLSFLYLITGNLEKLSKMMKIAEIRNDATGQFQNAL 738 (1202)
T ss_pred HHHHHHHHHhcCcHHHHHHHHHHHHHhcchHHHHHHHHHhhhhhheeEEEEEeCCHHHHHHHHHHHHhhhhhHHHHHHHH
Confidence 55688888888889999999999999999999999999999999998763 3332221 1223346789
Q ss_pred HcCCHHHHHHHHHHcCCcccc
Q psy11108 75 HQGRFQTAENYQLQADRRTWK 95 (106)
Q Consensus 75 y~Gkf~EAe~i~l~ag~~~~A 95 (106)
|.|..+|=-+|+.++|+..+|
T Consensus 739 Yl~dv~ervkIl~n~g~~~la 759 (1202)
T KOG0292|consen 739 YLGDVKERVKILENGGQLPLA 759 (1202)
T ss_pred HhccHHHHHHHHHhcCcccHH
Confidence 999999999999999998755
No 7
>KOG3617|consensus
Probab=97.43 E-value=0.00028 Score=63.43 Aligned_cols=80 Identities=16% Similarity=0.140 Sum_probs=71.9
Q ss_pred hhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCC-cHHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 12 HSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALP-DDIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 12 ~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~-~~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
..||+. ++.|..+|.++.+...||+|..|.-.+++.++++-+.+-++-+ ..+.+-|-++.-.|-.+||+.+|.+|.
T Consensus 751 I~~IkS---~~vW~nmA~McVkT~RLDVAkVClGhm~~aRgaRAlR~a~q~~~e~eakvAvLAieLgMlEeA~~lYr~ck 827 (1416)
T KOG3617|consen 751 IQFIKS---DSVWDNMASMCVKTRRLDVAKVCLGHMKNARGARALRRAQQNGEEDEAKVAVLAIELGMLEEALILYRQCK 827 (1416)
T ss_pred HHHHhh---hHHHHHHHHHhhhhccccHHHHhhhhhhhhhhHHHHHHHHhCCcchhhHHHHHHHHHhhHHHHHHHHHHHH
Confidence 457777 8999999999999999999999999999999999999987766 456777888889999999999999999
Q ss_pred Cccc
Q psy11108 91 RRTW 94 (106)
Q Consensus 91 ~~~~ 94 (106)
|.|+
T Consensus 828 R~DL 831 (1416)
T KOG3617|consen 828 RYDL 831 (1416)
T ss_pred HHHH
Confidence 9874
No 8
>KOG0276|consensus
Probab=96.99 E-value=0.0018 Score=56.57 Aligned_cols=86 Identities=8% Similarity=0.112 Sum_probs=62.1
Q ss_pred HhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHH------------HHH-HHH
Q psy11108 8 IASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMK------------HAY-IAQ 74 (106)
Q Consensus 8 ~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~------------~Ae-i~a 74 (106)
+..|.+-..+-..+.=|+.|+..||.+.+|..|..||.+-+|+.+|-++...-.--.--.. .|- .+.
T Consensus 653 l~iA~~la~e~~s~~Kw~~Lg~~al~~~~l~lA~EC~~~a~d~~~LlLl~t~~g~~~~l~~la~~~~~~g~~N~AF~~~~ 732 (794)
T KOG0276|consen 653 LDIAFDLAVEANSEVKWRQLGDAALSAGELPLASECFLRARDLGSLLLLYTSSGNAEGLAVLASLAKKQGKNNLAFLAYF 732 (794)
T ss_pred HHHHHHHHHhhcchHHHHHHHHHHhhcccchhHHHHHHhhcchhhhhhhhhhcCChhHHHHHHHHHHhhcccchHHHHHH
Confidence 3445665556666788999999999999999999999999999999988765322110011 122 234
Q ss_pred HcCCHHHHHHHHHHcCCcc
Q psy11108 75 HQGRFQTAENYQLQADRRT 93 (106)
Q Consensus 75 y~Gkf~EAe~i~l~ag~~~ 93 (106)
..|++++...++...+|.-
T Consensus 733 l~g~~~~C~~lLi~t~r~p 751 (794)
T KOG0276|consen 733 LSGDYEECLELLISTQRLP 751 (794)
T ss_pred HcCCHHHHHHHHHhcCcCc
Confidence 6788888888888877653
No 9
>KOG2247|consensus
Probab=96.90 E-value=0.00026 Score=60.35 Aligned_cols=82 Identities=9% Similarity=-0.134 Sum_probs=76.2
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcCCccccccCC-
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQADRRTWKSKPS- 99 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag~~~~Aiemr- 99 (106)
.+-|++....|+...+-.+|.+.|++++|..++--|..++..++.++..|-+.-..-.||.|++.+++.+.+..|++|+
T Consensus 3 ~~~~~~~~~~a~~d~~~~~airiyr~ledaalv~pi~~~w~~e~~nlavaca~tiv~~YD~agq~~le~n~tg~aldm~w 82 (615)
T KOG2247|consen 3 LKVIPCTLTKAQEDFKCVSAIRIYRRLEDAALVGPIIHRWRPEGHNLAVACANTIVIYYDKAGQVILELNPTGKALDMAW 82 (615)
T ss_pred ccchhhHHHhhhhhccchHHHHHHHHhhhhhccccceeeEecCCCceehhhhhhHHHhhhhhcceecccCCchhHhhhhh
Confidence 3569999999999999999999999999999999999999999999999999999999999999999999999999985
Q ss_pred --Ccc
Q psy11108 100 --YWT 102 (106)
Q Consensus 100 --~W~ 102 (106)
.|+
T Consensus 83 Dkegd 87 (615)
T KOG2247|consen 83 DKEGD 87 (615)
T ss_pred ccccc
Confidence 555
No 10
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=94.64 E-value=0.17 Score=30.08 Aligned_cols=62 Identities=8% Similarity=-0.011 Sum_probs=48.2
Q ss_pred CHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcC-CHHHHHHHHHHc
Q psy11108 20 VAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQG-RFQTAENYQLQA 89 (106)
Q Consensus 20 ~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~G-kf~EAe~i~l~a 89 (106)
.+..|..+|...+...+++.|...|.+.=+++ .+-+.--..+|.++..+| ++++|.+.|.++
T Consensus 2 ~a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~--------p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~a 64 (69)
T PF13414_consen 2 NAEAWYNLGQIYFQQGDYEEAIEYFEKAIELD--------PNNAEAYYNLGLAYMKLGKDYEEAIEDFEKA 64 (69)
T ss_dssp SHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHS--------TTHHHHHHHHHHHHHHTTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcC--------CCCHHHHHHHHHHHHHhCccHHHHHHHHHHH
Confidence 36789999999999999999999998763332 111112357899999999 799999998774
No 11
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=94.05 E-value=0.45 Score=28.00 Aligned_cols=56 Identities=18% Similarity=0.163 Sum_probs=40.1
Q ss_pred HHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 26 FLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 26 ~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.+|...+..++++-|+..|.++-...- +-.+--...|.++.-+|++++|...|.++
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~l~~~P--------~~~~a~~~lg~~~~~~g~~~~A~~~~~~a 57 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQALKQDP--------DNPEAWYLLGRILYQQGRYDEALAYYERA 57 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHHHCCST--------THHHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred hHHHHHHHcCCHHHHHHHHHHHHHHCC--------CCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 367888999999999998886633320 00112256799999999999999998764
No 12
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=93.40 E-value=1.2 Score=30.57 Aligned_cols=81 Identities=19% Similarity=0.099 Sum_probs=49.7
Q ss_pred chhHHHhh-hhhhhcCCCCHHH----HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcC
Q psy11108 3 GQYAVIAS-RHSFVLCPYVAGQ----ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQG 77 (106)
Q Consensus 3 ~~~~~~~~-~~~f~~~~~~~~l----W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~G 77 (106)
|+...+.. ...|++++|.... +-.||...+..++++-|...|..+-+-..=.-+..+ -.++.|.|++-+|
T Consensus 25 ~~~~~~~~~~~~l~~~~~~s~ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~-----a~l~LA~~~~~~~ 99 (145)
T PF09976_consen 25 GDPAKAEAAAEQLAKDYPSSPYAALAALQLAKAAYEQGDYDEAKAALEKALANAPDPELKPL-----ARLRLARILLQQG 99 (145)
T ss_pred CCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCHHHHHH-----HHHHHHHHHHHcC
Confidence 33444443 6778888887622 666889999999999999999876432210011100 1245566666666
Q ss_pred CHHHHHHHHHH
Q psy11108 78 RFQTAENYQLQ 88 (106)
Q Consensus 78 kf~EAe~i~l~ 88 (106)
+|++|.+++.+
T Consensus 100 ~~d~Al~~L~~ 110 (145)
T PF09976_consen 100 QYDEALATLQQ 110 (145)
T ss_pred CHHHHHHHHHh
Confidence 66666666644
No 13
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=93.06 E-value=0.63 Score=30.73 Aligned_cols=65 Identities=15% Similarity=0.000 Sum_probs=48.3
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.+..+..|..+|.......+++-|...|.++-+.. .+.++.....|.++...|++++|.+.|.++
T Consensus 47 ~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~--------p~~~~~~~~la~~~~~~g~~~~A~~~~~~a 111 (135)
T TIGR02552 47 DPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD--------PDDPRPYFHAAECLLALGEPESALKALDLA 111 (135)
T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC--------CCChHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 34467888899988888888988888877654321 112233456799999999999999999775
No 14
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=91.06 E-value=1.6 Score=35.09 Aligned_cols=72 Identities=11% Similarity=-0.074 Sum_probs=57.6
Q ss_pred HHhhhhhhhcCCCC-HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHH--HHHHHHHHHcCCHHHHH
Q psy11108 7 VIASRHSFVLCPYV-AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDI--MKHAYIAQHQGRFQTAE 83 (106)
Q Consensus 7 ~~~~~~~f~~~~~~-~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~--~~~Aei~ay~Gkf~EAe 83 (106)
.++-+.++.+.+|. +.+..++|..++...+++-|+..|.+.-.. -|+.. ...|.+.--+|+.++|.
T Consensus 313 al~~~e~~lk~~P~~~~l~l~lgrl~~~~~~~~~A~~~le~al~~-----------~P~~~~~~~La~~~~~~g~~~~A~ 381 (398)
T PRK10747 313 LEKVLRQQIKQHGDTPLLWSTLGQLLMKHGEWQEASLAFRAALKQ-----------RPDAYDYAWLADALDRLHKPEEAA 381 (398)
T ss_pred HHHHHHHHHhhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhc-----------CCCHHHHHHHHHHHHHcCCHHHHH
Confidence 35566678888777 556889999999999999999998754322 24432 68899999999999999
Q ss_pred HHHHHc
Q psy11108 84 NYQLQA 89 (106)
Q Consensus 84 ~i~l~a 89 (106)
++|.++
T Consensus 382 ~~~~~~ 387 (398)
T PRK10747 382 AMRRDG 387 (398)
T ss_pred HHHHHH
Confidence 999986
No 15
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=90.84 E-value=1 Score=32.84 Aligned_cols=72 Identities=15% Similarity=0.106 Sum_probs=51.9
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHcCCch-----hhH--------------------HHHHhhCCCcH----------H
Q psy11108 22 GQASFLSGVSQKKERFYHVEQALVNCQDYA-----GLQ--------------------LIKSLRALPDD----------I 66 (106)
Q Consensus 22 ~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~-----~v~--------------------~I~~l~~~~~~----------~ 66 (106)
..|..||..-.+-+|++-|.++|.+++|+. +++ +|.+.+.+.++ .
T Consensus 37 ~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~~irv~i~~~d~~~v~~~i~ka~~~~~~~~d~~~~nrlk 116 (177)
T PF10602_consen 37 MALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLNVIRVAIFFGDWSHVEKYIEKAESLIEKGGDWERRNRLK 116 (177)
T ss_pred HHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHhccchHHHHHHHH
Confidence 348899999999999999999999998875 121 22222222211 2
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHcCCcc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQADRRT 93 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~ag~~~ 93 (106)
+..|-..+.+|+|.+|.++|+++....
T Consensus 117 ~~~gL~~l~~r~f~~AA~~fl~~~~t~ 143 (177)
T PF10602_consen 117 VYEGLANLAQRDFKEAAELFLDSLSTF 143 (177)
T ss_pred HHHHHHHHHhchHHHHHHHHHccCcCC
Confidence 345777788999999999999986543
No 16
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=90.42 E-value=2.2 Score=29.17 Aligned_cols=30 Identities=13% Similarity=-0.061 Sum_probs=21.6
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
+.+...|..+|.......+++-|...|.+.
T Consensus 62 p~~~~~~~~la~~~~~~~~~~~A~~~~~~a 91 (234)
T TIGR02521 62 PDDYLAYLALALYYQQLGELEKAEDSFRRA 91 (234)
T ss_pred cccHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 344666777888888888888877777654
No 17
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=90.30 E-value=1.9 Score=29.91 Aligned_cols=64 Identities=11% Similarity=0.043 Sum_probs=42.1
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+.+...|..||......++++-|..+|.+.-... -.-++.-...|.++...|+++||...|.++
T Consensus 55 P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~--------p~~~~a~~~lg~~l~~~g~~~eAi~~~~~A 118 (144)
T PRK15359 55 PWSWRAHIALAGTWMMLKEYTTAINFYGHALMLD--------ASHPEPVYQTGVCLKMMGEPGLAREAFQTA 118 (144)
T ss_pred CCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcC--------CCCcHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4445667777777777777777776666542211 011223456788889999999999999875
No 18
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=89.94 E-value=3.3 Score=30.39 Aligned_cols=73 Identities=16% Similarity=0.054 Sum_probs=49.4
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCch-----hhH--------------------HHHHhhCCCc----HHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYA-----GLQ--------------------LIKSLRALPD----DIM 67 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~-----~v~--------------------~I~~l~~~~~----~~~ 67 (106)
++++...|..||...+...+++.|..+|.+.-... ... .+++.-++.. --.
T Consensus 69 ~P~~~~~w~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~~~al~ 148 (198)
T PRK10370 69 NPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALM 148 (198)
T ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCChhHHH
Confidence 46678889999999999999999999998653221 111 1111111111 124
Q ss_pred HHHHHHHHcCCHHHHHHHHHHc
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..|.++.-.|+|++|...|.+.
T Consensus 149 ~LA~~~~~~g~~~~Ai~~~~~a 170 (198)
T PRK10370 149 LLASDAFMQADYAQAIELWQKV 170 (198)
T ss_pred HHHHHHHHcCCHHHHHHHHHHH
Confidence 5688888889999999988774
No 19
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=89.77 E-value=1.2 Score=27.68 Aligned_cols=60 Identities=12% Similarity=0.026 Sum_probs=43.3
Q ss_pred CHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 20 VAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 20 ~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
++..|-.||..-....+++-|...+.+ ..... ...+.....|.++.-+|+++||.+.|.+
T Consensus 24 ~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~~~--------~~~~~~~l~a~~~~~l~~y~eAi~~l~~ 83 (84)
T PF12895_consen 24 NSAYLYNLAQCYFQQGKYEEAIELLQK-LKLDP--------SNPDIHYLLARCLLKLGKYEEAIKALEK 83 (84)
T ss_dssp HHHHHHHHHHHHHHTTHHHHHHHHHHC-HTHHH--------CHHHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHCCCHHHHHHHHHH-hCCCC--------CCHHHHHHHHHHHHHhCCHHHHHHHHhc
Confidence 456788899999999999888877766 11111 1123344569999999999999999876
No 20
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=89.37 E-value=4.7 Score=27.58 Aligned_cols=57 Identities=18% Similarity=0.063 Sum_probs=38.8
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+-.||..-+..++++-|...+..+.+-.+--. -...+|+|+.-.|++++|.+.|.++
T Consensus 88 ~l~LA~~~~~~~~~d~Al~~L~~~~~~~~~~~---------~~~~~Gdi~~~~g~~~~A~~~y~~A 144 (145)
T PF09976_consen 88 RLRLARILLQQGQYDEALATLQQIPDEAFKAL---------AAELLGDIYLAQGDYDEARAAYQKA 144 (145)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHhccCcchHHH---------HHHHHHHHHHHCCCHHHHHHHHHHh
Confidence 55566666777777766666544333222111 2357899999999999999999764
No 21
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=88.63 E-value=2.7 Score=23.65 Aligned_cols=62 Identities=10% Similarity=0.016 Sum_probs=44.8
Q ss_pred CHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 20 VAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 20 ~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
++..|..+|.......+++-|...|.++-+.. ...+.-....|.++.-.|++++|..+|.++
T Consensus 33 ~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 94 (100)
T cd00189 33 NADAYYNLAAAYYKLGKYEEALEDYEKALELD--------PDNAKAYYNLGLAYYKLGKYEEALEAYEKA 94 (100)
T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC--------CcchhHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 34678888888888888888888777653321 111123456788999999999999998775
No 22
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=88.37 E-value=1.6 Score=34.39 Aligned_cols=66 Identities=20% Similarity=0.167 Sum_probs=49.0
Q ss_pred cCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 16 LCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 16 ~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.+++....|..||..-.+.++++-|+.+|...-++.. .-+..---.|..+...|+++.|+++++.+
T Consensus 129 l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~--------~~p~~~nNlgms~~L~gd~~~A~~lll~a 194 (257)
T COG5010 129 LAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAP--------NEPSIANNLGMSLLLRGDLEDAETLLLPA 194 (257)
T ss_pred cCCCChhhhhHHHHHHHHccChhHHHHHHHHHHHhcc--------CCchhhhhHHHHHHHcCCHHHHHHHHHHH
Confidence 4567788899999999999999999999986544321 11222224588888888888888888874
No 23
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=88.22 E-value=3.7 Score=28.06 Aligned_cols=27 Identities=11% Similarity=-0.082 Sum_probs=22.5
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
+..+..+|...+...+++.|...|.+.
T Consensus 31 ~~~~~~la~~~~~~~~~~~A~~~~~~~ 57 (234)
T TIGR02521 31 AKIRVQLALGYLEQGDLEVAKENLDKA 57 (234)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 556888999999999999998888764
No 24
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=88.03 E-value=0.44 Score=28.98 Aligned_cols=62 Identities=15% Similarity=0.110 Sum_probs=43.7
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcH-----HHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 22 GQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDD-----IMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 22 ~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~-----~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..+..||..-....+++-|...|.++-++ .+.+.+ ..+ -...|.+....|++++|.+.|.++
T Consensus 6 ~~~~~la~~~~~~~~~~~A~~~~~~al~~-----~~~~~~-~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~a 72 (78)
T PF13424_consen 6 NAYNNLARVYRELGRYDEALDYYEKALDI-----EEQLGD-DHPDTANTLNNLGECYYRLGDYEEALEYYQKA 72 (78)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHH-----HHHTTT-HHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHH-----HHHHCC-CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 34677888888999999999988875333 222211 111 135789999999999999999875
No 25
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=87.69 E-value=2.2 Score=24.93 Aligned_cols=50 Identities=16% Similarity=0.141 Sum_probs=31.2
Q ss_pred HhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 32 QKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 32 L~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
++.++++-|...|.++- +.--+-++--...|.++.-+|++++|.+++.+.
T Consensus 2 l~~~~~~~A~~~~~~~l--------~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~ 51 (68)
T PF14559_consen 2 LKQGDYDEAIELLEKAL--------QRNPDNPEARLLLAQCYLKQGQYDEAEELLERL 51 (68)
T ss_dssp HHTTHHHHHHHHHHHHH--------HHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCC
T ss_pred hhccCHHHHHHHHHHHH--------HHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 45666676666655431 111111112356899999999999999999875
No 26
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=86.97 E-value=4 Score=34.99 Aligned_cols=77 Identities=8% Similarity=-0.027 Sum_probs=56.5
Q ss_pred HhhhhhhhcCC---CCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHH
Q psy11108 8 IASRHSFVLCP---YVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAEN 84 (106)
Q Consensus 8 ~~~~~~f~~~~---~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~ 84 (106)
+.+|.++++.. |....|..|...+-..++++.|++++.++-+..- +....-.....+++-.|++++|++
T Consensus 478 ~~eA~~~~~~~~~~p~~~~~~~Ll~a~~~~g~~~~a~~~~~~l~~~~p--------~~~~~y~~L~~~y~~~G~~~~A~~ 549 (697)
T PLN03081 478 LDEAYAMIRRAPFKPTVNMWAALLTACRIHKNLELGRLAAEKLYGMGP--------EKLNNYVVLLNLYNSSGRQAEAAK 549 (697)
T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHhCCCC--------CCCcchHHHHHHHHhCCCHHHHHH
Confidence 56677777764 4566799999999999999999998876633210 001123566889999999999999
Q ss_pred HHHHcCCc
Q psy11108 85 YQLQADRR 92 (106)
Q Consensus 85 i~l~ag~~ 92 (106)
++.+....
T Consensus 550 v~~~m~~~ 557 (697)
T PLN03081 550 VVETLKRK 557 (697)
T ss_pred HHHHHHHc
Confidence 99875433
No 27
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=86.39 E-value=1.2 Score=22.72 Aligned_cols=23 Identities=17% Similarity=0.147 Sum_probs=19.3
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...|.++...|+|++|.+.|.++
T Consensus 5 ~~lg~~~~~~~~~~~A~~~~~~a 27 (34)
T PF07719_consen 5 YYLGQAYYQLGNYEEAIEYFEKA 27 (34)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHhCCHHHHHHHHHHH
Confidence 35789999999999999999874
No 28
>PF11349 DUF3151: Protein of unknown function (DUF3151); InterPro: IPR014487 This group represents an uncharacterised conserved protein.
Probab=86.35 E-value=1 Score=32.09 Aligned_cols=72 Identities=14% Similarity=0.027 Sum_probs=51.9
Q ss_pred hhhcCCCCHHH-HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhh----------CCCcHHHHH-----HHHHHHc
Q psy11108 13 SFVLCPYVAGQ-ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLR----------ALPDDIMKH-----AYIAQHQ 76 (106)
Q Consensus 13 ~f~~~~~~~~l-W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~----------~~~~~~~~~-----Aei~ay~ 76 (106)
+-+..+|..++ |..||+.||...+ .|+-.||+|.|=-++|+-+.+-- +.+++-.++ |..+.-.
T Consensus 27 ~vaa~~P~ss~aWA~LAe~al~~g~-~v~AYAyARTGYHRGLD~LRr~GWkG~GPVPw~HePNrGfLRal~aLa~AA~~I 105 (129)
T PF11349_consen 27 EVAAAHPASSLAWAALAEEALAAGR-PVTAYAYARTGYHRGLDQLRRNGWKGHGPVPWSHEPNRGFLRALAALARAAQAI 105 (129)
T ss_pred HHHHHCCCchHHHHHHHHHHHhCCC-chhhhhhhhccccccHHHHHHCCCCCCCCCCCccCCccHHHHHHHHHHHHHHHh
Confidence 44556666555 9999999999985 56678999999999988777642 234444433 4666677
Q ss_pred CCHHHHHHH
Q psy11108 77 GRFQTAENY 85 (106)
Q Consensus 77 Gkf~EAe~i 85 (106)
|.-+|++++
T Consensus 106 GE~dE~~Rc 114 (129)
T PF11349_consen 106 GETDEYDRC 114 (129)
T ss_pred CChhHHHHH
Confidence 888888765
No 29
>PLN03077 Protein ECB2; Provisional
Probab=86.07 E-value=4.2 Score=35.74 Aligned_cols=74 Identities=8% Similarity=-0.023 Sum_probs=56.7
Q ss_pred HhhhhhhhcCC---CCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCC-CcHHHHHHHHHHHcCCHHHHH
Q psy11108 8 IASRHSFVLCP---YVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRAL-PDDIMKHAYIAQHQGRFQTAE 83 (106)
Q Consensus 8 ~~~~~~f~~~~---~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~-~~~~~~~Aei~ay~Gkf~EAe 83 (106)
+.+|.++++.. |.+..|..|-..+-...|++.++.+..++-+ ++.- ...-+..+.+++..|+++||+
T Consensus 641 ~~eA~~~~~~m~~~pd~~~~~aLl~ac~~~~~~e~~e~~a~~l~~---------l~p~~~~~y~ll~n~ya~~g~~~~a~ 711 (857)
T PLN03077 641 LTEAYNFINKMPITPDPAVWGALLNACRIHRHVELGELAAQHIFE---------LDPNSVGYYILLCNLYADAGKWDEVA 711 (857)
T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHcCChHHHHHHHHHHHh---------hCCCCcchHHHHHHHHHHCCChHHHH
Confidence 56788888774 5677899999998889999999988655432 2221 234577889999999999999
Q ss_pred HHHHHcC
Q psy11108 84 NYQLQAD 90 (106)
Q Consensus 84 ~i~l~ag 90 (106)
++.....
T Consensus 712 ~vr~~M~ 718 (857)
T PLN03077 712 RVRKTMR 718 (857)
T ss_pred HHHHHHH
Confidence 9997754
No 30
>PRK11189 lipoprotein NlpI; Provisional
Probab=86.04 E-value=6.1 Score=30.49 Aligned_cols=62 Identities=15% Similarity=0.109 Sum_probs=40.3
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCC--chhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQD--YAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d--~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
.+..+..|..+|...+...+++-|..+|.+.-. .... .--...|.++.+.|++++|.+.|.+
T Consensus 94 ~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~----------~a~~~lg~~l~~~g~~~eA~~~~~~ 157 (296)
T PRK11189 94 RPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLELDPTYN----------YAYLNRGIALYYGGRYELAQDDLLA 157 (296)
T ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCH----------HHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 344567788888888888888888888776521 1110 1124556777777777777776655
No 31
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=85.96 E-value=1.1 Score=23.63 Aligned_cols=22 Identities=27% Similarity=0.282 Sum_probs=18.3
Q ss_pred HHHHHHHHcCCHHHHHHHHHHc
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..|.++..+|++++|+.++.++
T Consensus 7 ~la~~~~~~g~~~~A~~~~~~a 28 (42)
T PF13374_consen 7 NLANAYRAQGRYEEALELLEEA 28 (42)
T ss_dssp HHHHHHHHCT-HHHHHHHHHHH
T ss_pred HHHHHHHhhhhcchhhHHHHHH
Confidence 3588899999999999999875
No 32
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=85.72 E-value=3.2 Score=35.27 Aligned_cols=65 Identities=12% Similarity=-0.057 Sum_probs=49.7
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
.+..+..+..+|..++...+++-|+.+|.+.-+.. .-.......|.++.+.|++++|...|.++=
T Consensus 416 ~~~~~~~~~ala~~~~~~g~~~~A~~~l~rAl~L~---------ps~~a~~~lG~~~~~~G~~~eA~~~~~~A~ 480 (517)
T PRK10153 416 LNVLPRIYEILAVQALVKGKTDEAYQAINKAIDLE---------MSWLNYVLLGKVYELKGDNRLAADAYSTAF 480 (517)
T ss_pred CcCChHHHHHHHHHHHhcCCHHHHHHHHHHHHHcC---------CCHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 34446778889999999999999998888754443 111234678999999999999999998863
No 33
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=85.69 E-value=1.3 Score=23.83 Aligned_cols=21 Identities=29% Similarity=0.392 Sum_probs=18.4
Q ss_pred HHHHHHHHcCCHHHHHHHHHH
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
..|.++.-+|+|++|..+|.+
T Consensus 4 ~Lg~~~~~~g~~~~Ai~~y~~ 24 (36)
T PF13176_consen 4 NLGRIYRQQGDYEKAIEYYEQ 24 (36)
T ss_dssp HHHHHHHHCT-HHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHH
Confidence 468999999999999999988
No 34
>KOG3616|consensus
Probab=85.67 E-value=0.21 Score=45.47 Aligned_cols=77 Identities=13% Similarity=0.122 Sum_probs=50.3
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCCchh-hHHH----------HHhhCC--C----cHHHHHHHHHHHcCCHHHHHHHH
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQDYAG-LQLI----------KSLRAL--P----DDIMKHAYIAQHQGRFQTAENYQ 86 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d~~~-v~~I----------~~l~~~--~----~~~~~~Aei~ay~Gkf~EAe~i~ 86 (106)
+-..|..=-..++|++||.-|+.-+-+.- ++.- +--++- + +.-+-+|+=+--+|||.|||++|
T Consensus 768 y~~iadhyan~~dfe~ae~lf~e~~~~~dai~my~k~~kw~da~kla~e~~~~e~t~~~yiakaedldehgkf~eaeqly 847 (1636)
T KOG3616|consen 768 YGEIADHYANKGDFEIAEELFTEADLFKDAIDMYGKAGKWEDAFKLAEECHGPEATISLYIAKAEDLDEHGKFAEAEQLY 847 (1636)
T ss_pred chHHHHHhccchhHHHHHHHHHhcchhHHHHHHHhccccHHHHHHHHHHhcCchhHHHHHHHhHHhHHhhcchhhhhhee
Confidence 34455555567889999999987643321 1110 000110 1 11245677777899999999999
Q ss_pred HHcCCccccccCCC
Q psy11108 87 LQADRRTWKSKPSY 100 (106)
Q Consensus 87 l~ag~~~~Aiemr~ 100 (106)
+.-|.|++||+|++
T Consensus 848 iti~~p~~aiqmyd 861 (1636)
T KOG3616|consen 848 ITIGEPDKAIQMYD 861 (1636)
T ss_pred EEccCchHHHHHHH
Confidence 99999999999863
No 35
>PRK12370 invasion protein regulator; Provisional
Probab=84.90 E-value=4.9 Score=33.81 Aligned_cols=31 Identities=3% Similarity=-0.067 Sum_probs=25.7
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
++.++..|..||...+...+++-|..+|.+.
T Consensus 334 dP~~~~a~~~lg~~~~~~g~~~~A~~~~~~A 364 (553)
T PRK12370 334 DHNNPQALGLLGLINTIHSEYIVGSLLFKQA 364 (553)
T ss_pred CCCCHHHHHHHHHHHHHccCHHHHHHHHHHH
Confidence 4556778999999999999999999888663
No 36
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=84.80 E-value=3.8 Score=32.75 Aligned_cols=65 Identities=12% Similarity=0.047 Sum_probs=47.0
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
++.++..|..+|...+...+++-|...|.++-+..- ....--.+.|.++...|+|++|...|.++
T Consensus 32 ~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P--------~~~~a~~~lg~~~~~lg~~~eA~~~~~~a 96 (356)
T PLN03088 32 DPNNAELYADRAQANIKLGNFTEAVADANKAIELDP--------SLAKAYLRKGTACMKLEEYQTAKAALEKG 96 (356)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCc--------CCHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 344577788888888888888888877765432211 11123467899999999999999999875
No 37
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=84.14 E-value=8.5 Score=32.22 Aligned_cols=71 Identities=17% Similarity=0.134 Sum_probs=54.0
Q ss_pred HHhhhhhhhcCCCC-HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcH--HHHHHHHHHHcCCHHHHH
Q psy11108 7 VIASRHSFVLCPYV-AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDD--IMKHAYIAQHQGRFQTAE 83 (106)
Q Consensus 7 ~~~~~~~f~~~~~~-~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~--~~~~Aei~ay~Gkf~EAe 83 (106)
.|+-+...++.+|. |++|.+||.-+++.+.+.-|+.+|.+ .++..++. -.-.|+++.-+|+..+|+
T Consensus 313 l~k~~e~~l~~h~~~p~L~~tLG~L~~k~~~w~kA~~~lea-----------Al~~~~s~~~~~~la~~~~~~g~~~~A~ 381 (400)
T COG3071 313 LIKAAEKWLKQHPEDPLLLSTLGRLALKNKLWGKASEALEA-----------ALKLRPSASDYAELADALDQLGEPEEAE 381 (400)
T ss_pred HHHHHHHHHHhCCCChhHHHHHHHHHHHhhHHHHHHHHHHH-----------HHhcCCChhhHHHHHHHHHHcCChHHHH
Confidence 34555556666655 89999999999999999999998873 34344432 245699999999999999
Q ss_pred HHHHH
Q psy11108 84 NYQLQ 88 (106)
Q Consensus 84 ~i~l~ 88 (106)
+.+.+
T Consensus 382 ~~r~e 386 (400)
T COG3071 382 QVRRE 386 (400)
T ss_pred HHHHH
Confidence 88765
No 38
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=84.05 E-value=8.1 Score=35.45 Aligned_cols=74 Identities=9% Similarity=-0.042 Sum_probs=55.4
Q ss_pred cCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcC-----CchhhHHHHHhhC--CCc-HHHHHHHHHHHcCCHHHHHHHHH
Q psy11108 16 LCPYVAGQASFLSGVSQKKERFYHVEQALVNCQ-----DYAGLQLIKSLRA--LPD-DIMKHAYIAQHQGRFQTAENYQL 87 (106)
Q Consensus 16 ~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~-----d~~~v~~I~~l~~--~~~-~~~~~Aei~ay~Gkf~EAe~i~l 87 (106)
.++.++..+..++..++..++.+-|.+.+.++. +...-++-..+.. .+. ..+..|.+++-.|++++|.+.|.
T Consensus 57 ~~p~~p~~~~~~~~~~l~~g~~~~A~~~l~~l~~~~P~~~~~~~~~~~~~~~~~~~~~~l~~A~ll~~~g~~~eA~~~~~ 136 (1157)
T PRK11447 57 IDPNNPDVIAARFRLLLRQGDSDGAQKLLDRLSQLAPDSNAYRSSRTTMLLSTPEGRQALQQARLLATTGRTEEALASYD 136 (1157)
T ss_pred cCCCCHHHHHHHHHHHHhCCCHHHHHHHHHHHHhhCCCChHHHHHHHHHHhcCCchhhHHHHHHHHHhCCCHHHHHHHHH
Confidence 356678889999999999999999999999983 3332222222222 222 24788999999999999999998
Q ss_pred Hc
Q psy11108 88 QA 89 (106)
Q Consensus 88 ~a 89 (106)
+.
T Consensus 137 ~~ 138 (1157)
T PRK11447 137 KL 138 (1157)
T ss_pred HH
Confidence 84
No 39
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=83.90 E-value=2 Score=21.63 Aligned_cols=21 Identities=24% Similarity=0.085 Sum_probs=18.3
Q ss_pred HHHHHHHHHcCCHHHHHHHHH
Q psy11108 67 MKHAYIAQHQGRFQTAENYQL 87 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l 87 (106)
...|.++.-+|++++|+.++-
T Consensus 5 ~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 5 LALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHcCCHHHHHHHHh
Confidence 457899999999999999874
No 40
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=83.66 E-value=5.5 Score=23.50 Aligned_cols=50 Identities=16% Similarity=0.145 Sum_probs=33.7
Q ss_pred HHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCC----cHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 28 SGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALP----DDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 28 A~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~----~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+...+...+++.|.+++.+ +-.+. ..-...|.++...|+|++|.+.|.+.
T Consensus 2 ~~~~~~~~~~~~A~~~~~~------------~l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~ 55 (73)
T PF13371_consen 2 KQIYLQQEDYEEALEVLER------------ALELDPDDPELWLQRARCLFQLGRYEEALEDLERA 55 (73)
T ss_pred HHHHHhCCCHHHHHHHHHH------------HHHhCcccchhhHHHHHHHHHhccHHHHHHHHHHH
Confidence 4455666666666555443 32222 22356899999999999999998774
No 41
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=83.59 E-value=5.5 Score=22.30 Aligned_cols=58 Identities=12% Similarity=0.012 Sum_probs=39.4
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
|..+|.......+++-|...|.+.-+..- ..+.-....|.++...|++++|.++|.++
T Consensus 3 ~~~~a~~~~~~~~~~~A~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 60 (100)
T cd00189 3 LLNLGNLYYKLGDYDEALEYYEKALELDP--------DNADAYYNLAAAYYKLGKYEEALEDYEKA 60 (100)
T ss_pred HHHHHHHHHHHhcHHHHHHHHHHHHhcCC--------ccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 56677777778888888777664422110 00012346789999999999999999775
No 42
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=81.92 E-value=7.1 Score=34.54 Aligned_cols=85 Identities=11% Similarity=-0.006 Sum_probs=65.0
Q ss_pred HHHhhhhhhhcCCCC-HHHHHHHHHHHHhcccHHHHHHHHHHcCCc--h-------hhHHHHHhhCCCc-----------
Q psy11108 6 AVIASRHSFVLCPYV-AGQASFLSGVSQKKERFYHVEQALVNCQDY--A-------GLQLIKSLRALPD----------- 64 (106)
Q Consensus 6 ~~~~~~~~f~~~~~~-~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~--~-------~v~~I~~l~~~~~----------- 64 (106)
+-++.+-+|+..|++ +..-..||......+.++-|+.-+.++-.+ + +...+.+.++.++
T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~La~i~~~~g~~~ea~~~l~~~~~~~Pd~~~a~~~~a~~L~~~~~~eeA~~~~~~~l~~ 149 (694)
T PRK15179 70 AALPELLDYVRRYPHTELFQVLVARALEAAHRSDEGLAVWRGIHQRFPDSSEAFILMLRGVKRQQGIEAGRAEIELYFSG 149 (694)
T ss_pred hhHHHHHHHHHhccccHHHHHHHHHHHHHcCCcHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHhccHHHHHHHHHHHhhc
Confidence 457888999999999 677899999999999999999999887443 1 2233333333221
Q ss_pred ------HHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 65 ------DIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 65 ------~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
.....|..+.-.|+++||..+|-+.-
T Consensus 150 ~p~~~~~~~~~a~~l~~~g~~~~A~~~y~~~~ 181 (694)
T PRK15179 150 GSSSAREILLEAKSWDEIGQSEQADACFERLS 181 (694)
T ss_pred CCCCHHHHHHHHHHHHHhcchHHHHHHHHHHH
Confidence 23578999999999999999998854
No 43
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=81.26 E-value=9.6 Score=29.39 Aligned_cols=23 Identities=17% Similarity=0.134 Sum_probs=15.0
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...|.++...|++++|.++|.+.
T Consensus 218 ~~la~~~~~~g~~~~A~~~~~~~ 240 (389)
T PRK11788 218 ILLGDLALAQGDYAAAIEALERV 240 (389)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHH
Confidence 34566666777777777766654
No 44
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=80.78 E-value=3.2 Score=22.97 Aligned_cols=23 Identities=22% Similarity=0.148 Sum_probs=20.3
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...|..+.-+|++++|+++|.+.
T Consensus 5 ~~la~~~~~~G~~~~A~~~~~~~ 27 (44)
T PF13428_consen 5 LALARAYRRLGQPDEAERLLRRA 27 (44)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHH
Confidence 46789999999999999999874
No 45
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=80.76 E-value=4.4 Score=20.41 Aligned_cols=27 Identities=7% Similarity=0.014 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
+..|..||...+..++++-|..+|.+.
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~a 27 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKA 27 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 357899999999999999999999864
No 46
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=80.67 E-value=2.8 Score=21.54 Aligned_cols=23 Identities=17% Similarity=0.200 Sum_probs=19.1
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..+|.++..+|++++|...|.++
T Consensus 5 ~~~g~~~~~~~~~~~A~~~~~~a 27 (34)
T PF00515_consen 5 YNLGNAYFQLGDYEEALEYYQRA 27 (34)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHhCCchHHHHHHHHH
Confidence 35799999999999999999874
No 47
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=80.23 E-value=15 Score=25.42 Aligned_cols=64 Identities=14% Similarity=0.005 Sum_probs=46.7
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...|-.+|.......+++-|...|.++-.... +. .....-....|.++...|++++|...|.++
T Consensus 35 a~~~~~~g~~~~~~g~~~~A~~~~~~al~l~~----~~-~~~~~~~~~lg~~~~~~g~~~eA~~~~~~A 98 (168)
T CHL00033 35 AFTYYRDGMSAQSEGEYAEALQNYYEAMRLEI----DP-YDRSYILYNIGLIHTSNGEHTKALEYYFQA 98 (168)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHhccc----cc-hhhHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 45699999999999999999999987744311 00 000112345689999999999999999875
No 48
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=79.86 E-value=12 Score=30.04 Aligned_cols=69 Identities=13% Similarity=0.031 Sum_probs=49.5
Q ss_pred hhhhcCCCC-H--HHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcH--HHHHHHHHHHcCCHHHHHHHH
Q psy11108 12 HSFVLCPYV-A--GQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDD--IMKHAYIAQHQGRFQTAENYQ 86 (106)
Q Consensus 12 ~~f~~~~~~-~--~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~--~~~~Aei~ay~Gkf~EAe~i~ 86 (106)
....+.+|. + .+-.+||..++..++++-|++.|.+... ++..|+. ..-.|+++.-.|+.++|.++|
T Consensus 323 e~~lk~~p~~~~~~ll~sLg~l~~~~~~~~~A~~~le~a~a---------~~~~p~~~~~~~La~ll~~~g~~~~A~~~~ 393 (409)
T TIGR00540 323 EKQAKNVDDKPKCCINRALGQLLMKHGEFIEAADAFKNVAA---------CKEQLDANDLAMAADAFDQAGDKAEAAAMR 393 (409)
T ss_pred HHHHHhCCCChhHHHHHHHHHHHHHcccHHHHHHHHHHhHH---------hhcCCCHHHHHHHHHHHHHcCCHHHHHHHH
Confidence 344455554 4 6778999999999999999988773221 1112332 234599999999999999999
Q ss_pred HHc
Q psy11108 87 LQA 89 (106)
Q Consensus 87 l~a 89 (106)
.++
T Consensus 394 ~~~ 396 (409)
T TIGR00540 394 QDS 396 (409)
T ss_pred HHH
Confidence 985
No 49
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=79.33 E-value=11 Score=25.92 Aligned_cols=62 Identities=10% Similarity=-0.190 Sum_probs=44.2
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
|..|..+|......++++-|...|.++-+.+- .-..--...|.++.-.|++++|...|.++-
T Consensus 24 p~~~~~~g~~~~~~g~~~~A~~~~~~al~~~P--------~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al 85 (144)
T PRK15359 24 PETVYASGYASWQEGDYSRAVIDFSWLVMAQP--------WSWRAHIALAGTWMMLKEYTTAINFYGHAL 85 (144)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCC--------CcHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 33477789999999999988888776322210 001123568999999999999999998763
No 50
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=79.11 E-value=13 Score=30.92 Aligned_cols=30 Identities=7% Similarity=-0.003 Sum_probs=23.2
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
++++..|..|+..-....+++-|...|.++
T Consensus 598 ~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~ 627 (899)
T TIGR02917 598 PDSPEAWLMLGRAQLAAGDLNKAVSSFKKL 627 (899)
T ss_pred CCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 445677888888888888888888877654
No 51
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=78.99 E-value=3.9 Score=20.79 Aligned_cols=23 Identities=22% Similarity=0.291 Sum_probs=20.3
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...|.++...|++++|...|.++
T Consensus 5 ~~lg~~y~~~~~~~~A~~~~~~a 27 (34)
T PF13181_consen 5 YNLGKIYEQLGDYEEALEYFEKA 27 (34)
T ss_dssp HHHHHHHHHTTSHHHHHHHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHH
Confidence 45789999999999999999875
No 52
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=78.29 E-value=0.68 Score=34.86 Aligned_cols=32 Identities=9% Similarity=0.001 Sum_probs=0.0
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHcCC
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNCQD 49 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d 49 (106)
+..+..|+.+|..|....+++.|+.+|.++-.
T Consensus 41 ~~~~~~~~~~a~La~~~~~~~~A~~ay~~l~~ 72 (280)
T PF13429_consen 41 PDDPEYWRLLADLAWSLGDYDEAIEAYEKLLA 72 (280)
T ss_dssp --------------------------------
T ss_pred cccccccccccccccccccccccccccccccc
Confidence 45578899999999999999999999997743
No 53
>KOG2041|consensus
Probab=75.98 E-value=7.7 Score=35.47 Aligned_cols=81 Identities=14% Similarity=0.112 Sum_probs=62.6
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHcCCchhh-HHHHHhhC----------CCcHH---HHHHHHHHHcCCHHHHHHHHH
Q psy11108 22 GQASFLSGVSQKKERFYHVEQALVNCQDYAGL-QLIKSLRA----------LPDDI---MKHAYIAQHQGRFQTAENYQL 87 (106)
Q Consensus 22 ~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v-~~I~~l~~----------~~~~~---~~~Aei~ay~Gkf~EAe~i~l 87 (106)
.-|+.+|..=-+-+.++-|.+.|..++|.... +.+-+++. +|++. -..|++.+--|--++|-+.|+
T Consensus 797 ~A~r~ig~~fa~~~~We~A~~yY~~~~~~e~~~ecly~le~f~~LE~la~~Lpe~s~llp~~a~mf~svGMC~qAV~a~L 876 (1189)
T KOG2041|consen 797 DAFRNIGETFAEMMEWEEAAKYYSYCGDTENQIECLYRLELFGELEVLARTLPEDSELLPVMADMFTSVGMCDQAVEAYL 876 (1189)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccchHhHHHHHHHHHhhhhHHHHHHhcCcccchHHHHHHHHHhhchHHHHHHHHH
Confidence 44999998888889999999999999887643 33333332 33322 257999999999999999999
Q ss_pred HcCCcccccc----CCCcc
Q psy11108 88 QADRRTWKSK----PSYWT 102 (106)
Q Consensus 88 ~ag~~~~Aie----mr~W~ 102 (106)
+.+.|..|+. |++|.
T Consensus 877 r~s~pkaAv~tCv~LnQW~ 895 (1189)
T KOG2041|consen 877 RRSLPKAAVHTCVELNQWG 895 (1189)
T ss_pred hccCcHHHHHHHHHHHHHH
Confidence 9999987765 88993
No 54
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=75.86 E-value=14 Score=31.30 Aligned_cols=74 Identities=11% Similarity=-0.006 Sum_probs=44.8
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCc--hhhH--------------------HHHHh-hCCCc-H--HHHHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDY--AGLQ--------------------LIKSL-RALPD-D--IMKHA 70 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~--~~v~--------------------~I~~l-~~~~~-~--~~~~A 70 (106)
.+.++..|..+|......++++-|...|.+.-+. ..+. .+++. +..|+ . -...|
T Consensus 395 ~p~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~P~~~~~~~~lg 474 (615)
T TIGR00990 395 NSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYG 474 (615)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCccCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHH
Confidence 3455778888888888888888888888765221 1111 11111 11111 1 13457
Q ss_pred HHHHHcCCHHHHHHHHHHcC
Q psy11108 71 YIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 71 ei~ay~Gkf~EAe~i~l~ag 90 (106)
.++.-.|++++|.+.|.++-
T Consensus 475 ~~~~~~g~~~~A~~~~~~Al 494 (615)
T TIGR00990 475 ELLLDQNKFDEAIEKFDTAI 494 (615)
T ss_pred HHHHHccCHHHHHHHHHHHH
Confidence 77777788888887777753
No 55
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=75.52 E-value=4.8 Score=18.29 Aligned_cols=23 Identities=17% Similarity=0.177 Sum_probs=19.9
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...|.++...|++++|...|.++
T Consensus 5 ~~~a~~~~~~~~~~~a~~~~~~~ 27 (34)
T smart00028 5 YNLGNAYLKLGDYDEALEYYEKA 27 (34)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHH
Confidence 46789999999999999999764
No 56
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=74.95 E-value=26 Score=26.99 Aligned_cols=29 Identities=7% Similarity=-0.055 Sum_probs=19.9
Q ss_pred CCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 19 YVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 19 ~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
.+...|..||.......+++-|...|.++
T Consensus 67 ~~~~~~~~la~~~~~~g~~~~A~~~~~~~ 95 (389)
T PRK11788 67 ETVELHLALGNLFRRRGEVDRAIRIHQNL 95 (389)
T ss_pred ccHHHHHHHHHHHHHcCcHHHHHHHHHHH
Confidence 34556777777777777777777666654
No 57
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=74.86 E-value=14 Score=29.55 Aligned_cols=42 Identities=17% Similarity=-0.040 Sum_probs=33.5
Q ss_pred HHhhhhhhhcCCCC-HHHHHHHHHHHHhcccHHHHHHHHHHcC
Q psy11108 7 VIASRHSFVLCPYV-AGQASFLSGVSQKKERFYHVEQALVNCQ 48 (106)
Q Consensus 7 ~~~~~~~f~~~~~~-~~lW~~LA~~AL~~~~l~iAe~Af~rl~ 48 (106)
.|+.....+..+|. ..-|-.||..-|...+++.|..||.+-.
T Consensus 141 l~a~Le~~L~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~ 183 (287)
T COG4235 141 LIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNAL 183 (287)
T ss_pred HHHHHHHHHHhCCCCchhHHHHHHHHHHhcchhHHHHHHHHHH
Confidence 45555566655554 6679999999999999999999999764
No 58
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=74.74 E-value=6.1 Score=21.81 Aligned_cols=27 Identities=11% Similarity=-0.018 Sum_probs=24.1
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
|..|..||..-...++++-|++.|.++
T Consensus 1 p~~~~~la~~~~~~G~~~~A~~~~~~~ 27 (44)
T PF13428_consen 1 PAAWLALARAYRRLGQPDEAERLLRRA 27 (44)
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 457899999999999999999999875
No 59
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=74.08 E-value=20 Score=29.67 Aligned_cols=30 Identities=7% Similarity=-0.116 Sum_probs=18.8
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
++++..|..+|......++++-|...|.++
T Consensus 190 ~~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 219 (899)
T TIGR02917 190 PGNVDALLLKGDLLLSLGNIELALAAYRKA 219 (899)
T ss_pred CCChHHHHHHHHHHHhcCCHHHHHHHHHHH
Confidence 344556666666666666666666666554
No 60
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=73.29 E-value=27 Score=26.06 Aligned_cols=82 Identities=16% Similarity=0.039 Sum_probs=48.0
Q ss_pred HhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHc-----CCchhh-HHHHHhhCCCc-----------------
Q psy11108 8 IASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNC-----QDYAGL-QLIKSLRALPD----------------- 64 (106)
Q Consensus 8 ~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl-----~d~~~v-~~I~~l~~~~~----------------- 64 (106)
|..+..-....+.+..|..+|....+.++.+-|..+|.+. +|.... .++.-+-+..+
T Consensus 133 l~~~~~~~~~~~~~~~~~~~a~~~~~~G~~~~A~~~~~~al~~~P~~~~~~~~l~~~li~~~~~~~~~~~l~~~~~~~~~ 212 (280)
T PF13429_consen 133 LEKLEELPAAPDSARFWLALAEIYEQLGDPDKALRDYRKALELDPDDPDARNALAWLLIDMGDYDEAREALKRLLKAAPD 212 (280)
T ss_dssp HHHHHH-T---T-HHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHCTTCHHHHHHHHHHHHHHH-HT
T ss_pred HHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCChHHHHHHHHHHHHHCcC
Confidence 3333333333456788999999999999999999998754 112111 12222211111
Q ss_pred -HH--HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 65 -DI--MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 65 -~~--~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+. ...|.++...|++++|-.+|.+.
T Consensus 213 ~~~~~~~la~~~~~lg~~~~Al~~~~~~ 240 (280)
T PF13429_consen 213 DPDLWDALAAAYLQLGRYEEALEYLEKA 240 (280)
T ss_dssp SCCHCHHHHHHHHHHT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHhcccccccccccccccc
Confidence 11 23578888889999999988875
No 61
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=71.97 E-value=40 Score=25.89 Aligned_cols=40 Identities=20% Similarity=0.083 Sum_probs=25.7
Q ss_pred HHHhhhhhhhcCCCCHHH----HHHHHHHHHhcccHHHHHHHHH
Q psy11108 6 AVIASRHSFVLCPYVAGQ----ASFLSGVSQKKERFYHVEQALV 45 (106)
Q Consensus 6 ~~~~~~~~f~~~~~~~~l----W~~LA~~AL~~~~l~iAe~Af~ 45 (106)
..++.+..|+.+|++..- =-.||....++.+|+-|+.-..
T Consensus 70 ~~~~~~ekf~~~n~~t~Ya~laaL~lAk~~ve~~~~d~A~aqL~ 113 (207)
T COG2976 70 KSIAAAEKFVQANGKTIYAVLAALELAKAEVEANNLDKAEAQLK 113 (207)
T ss_pred hhHHHHHHHHhhccccHHHHHHHHHHHHHHHhhccHHHHHHHHH
Confidence 457778889998765433 2345566667777777765444
No 62
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=71.25 E-value=5.6 Score=23.12 Aligned_cols=22 Identities=27% Similarity=0.117 Sum_probs=19.5
Q ss_pred HHHHHHHHcCCHHHHHHHHHHc
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..|..+.-.|+|++|.++|.+.
T Consensus 2 ~~a~~~~~~g~~~~A~~~~~~~ 23 (65)
T PF13432_consen 2 ALARALYQQGDYDEAIAAFEQA 23 (65)
T ss_dssp HHHHHHHHCTHHHHHHHHHHHH
T ss_pred hHHHHHHHcCCHHHHHHHHHHH
Confidence 4688999999999999999874
No 63
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=69.84 E-value=6.7 Score=25.32 Aligned_cols=61 Identities=18% Similarity=0.134 Sum_probs=42.7
Q ss_pred HHHHhcccHHHHHHHHHHcCCchhhHHHHHhh-CCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 29 GVSQKKERFYHVEQALVNCQDYAGLQLIKSLR-ALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 29 ~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~-~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..++..+|+..|.....+.=|+.+-+.-.... ...-.-+..|.+...+|++++|.+.+.++
T Consensus 6 ~~~~~~~dy~~A~d~L~~~fD~~~~~~~~~~~~~~~~all~lA~~~~~~G~~~~A~~~l~eA 67 (94)
T PF12862_consen 6 LNALRSGDYSEALDALHRYFDYAKQSNNSSSNSGLAYALLNLAELHRRFGHYEEALQALEEA 67 (94)
T ss_pred HHHHHcCCHHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 35788889999988888777775554332210 01122466899999999999999988765
No 64
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=68.92 E-value=7.2 Score=19.36 Aligned_cols=22 Identities=18% Similarity=0.200 Sum_probs=19.0
Q ss_pred HHHHHHHHHcCCHHHHHHHHHH
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
...|.++...|++++|.++|.+
T Consensus 4 ~~~a~~~~~~g~~~~A~~~~~~ 25 (33)
T PF13174_consen 4 YRLARCYYKLGDYDEAIEYFQR 25 (33)
T ss_dssp HHHHHHHHHHCHHHHHHHHHHH
T ss_pred HHHHHHHHHccCHHHHHHHHHH
Confidence 4678899999999999999875
No 65
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=68.80 E-value=6.3 Score=23.68 Aligned_cols=22 Identities=27% Similarity=0.409 Sum_probs=18.5
Q ss_pred HHHHHHHHcCCHHHHHHHHHHc
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..|.++..+|+|++|...|.++
T Consensus 10 ~la~~~~~~~~~~~A~~~~~~a 31 (78)
T PF13424_consen 10 NLARVYRELGRYDEALDYYEKA 31 (78)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHHH
Confidence 4688999999999999999885
No 66
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=68.60 E-value=31 Score=31.84 Aligned_cols=31 Identities=6% Similarity=-0.127 Sum_probs=19.9
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
.++.+..|..+|..++..++++-|..+|.+.
T Consensus 538 ~~p~~~a~~~la~all~~Gd~~eA~~~l~qA 568 (987)
T PRK09782 538 HDMSNEDLLAAANTAQAAGNGAARDRWLQQA 568 (987)
T ss_pred cCCCcHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 3444555667777777777777777776543
No 67
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=67.60 E-value=8.3 Score=32.27 Aligned_cols=27 Identities=22% Similarity=0.139 Sum_probs=23.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHcCCcc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQADRRT 93 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~ag~~~ 93 (106)
-+.|+.++-+|+++-||..|.+++..+
T Consensus 351 ~~Lg~~AL~~g~~~lAe~c~~k~~d~~ 377 (443)
T PF04053_consen 351 KQLGDEALRQGNIELAEECYQKAKDFS 377 (443)
T ss_dssp HHHHHHHHHTTBHHHHHHHHHHCT-HH
T ss_pred HHHHHHHHHcCCHHHHHHHHHhhcCcc
Confidence 477999999999999999999998764
No 68
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=66.87 E-value=34 Score=28.91 Aligned_cols=20 Identities=20% Similarity=0.195 Sum_probs=9.9
Q ss_pred HHHHHHHcCCHHHHHHHHHH
Q psy11108 69 HAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 69 ~Aei~ay~Gkf~EAe~i~l~ 88 (106)
.|.++...|++++|...|.+
T Consensus 405 lg~~~~~~g~~~~A~~~~~k 424 (615)
T TIGR00990 405 RAQLHFIKGEFAQAGKDYQK 424 (615)
T ss_pred HHHHHHHcCCHHHHHHHHHH
Confidence 44444455555555554444
No 69
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=66.60 E-value=6.8 Score=22.88 Aligned_cols=23 Identities=26% Similarity=0.299 Sum_probs=20.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...|.++...|+|++|...|.++
T Consensus 7 ~~~g~~~~~~~~~~~A~~~~~~a 29 (69)
T PF13414_consen 7 YNLGQIYFQQGDYEEAIEYFEKA 29 (69)
T ss_dssp HHHHHHHHHTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHcCCHHHHHHHHHHH
Confidence 46799999999999999999885
No 70
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=65.67 E-value=36 Score=29.45 Aligned_cols=30 Identities=10% Similarity=0.107 Sum_probs=19.7
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
+.++..|..||...+..++++-|..+|.+.
T Consensus 107 P~~~~a~~~la~~l~~~g~~~~Ai~~l~~A 136 (656)
T PRK15174 107 VCQPEDVLLVASVLLKSKQYATVADLAEQA 136 (656)
T ss_pred CCChHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 344556777777777777777776666554
No 71
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=64.68 E-value=50 Score=23.72 Aligned_cols=64 Identities=9% Similarity=0.020 Sum_probs=46.2
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCC---cHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALP---DDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~---~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...+..+-.+|...+...+++-|...|.++-.. -..-+ ..-...|.++.-.|++++|...|.+.
T Consensus 30 ~~~~~~~~~~g~~~~~~~~~~~A~~~~~~~~~~--------~p~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~ 96 (235)
T TIGR03302 30 EWPAEELYEEAKEALDSGDYTEAIKYFEALESR--------YPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRF 96 (235)
T ss_pred cCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh--------CCCchhHHHHHHHHHHHHHhcCCHHHHHHHHHHH
Confidence 344666899999999999999998877654111 11111 12256789999999999999999874
No 72
>KOG1155|consensus
Probab=64.31 E-value=37 Score=29.51 Aligned_cols=61 Identities=16% Similarity=0.122 Sum_probs=43.2
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHH---cCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVN---CQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~r---l~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+..+.+|-.||..=.+-.+++-|.+||-| ++|..+.-+ .+.|...=-.++.+||++.|.+.
T Consensus 429 PnDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~~~l-----------~~LakLye~l~d~~eAa~~yek~ 492 (559)
T KOG1155|consen 429 PNDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEGSAL-----------VRLAKLYEELKDLNEAAQYYEKY 492 (559)
T ss_pred CCchHHHHHHHHHHHHhccHHHHHHHHHHHHhccccchHHH-----------HHHHHHHHHHHhHHHHHHHHHHH
Confidence 44588999999999888999999999986 466633222 23455555566677777776653
No 73
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=62.49 E-value=40 Score=25.55 Aligned_cols=76 Identities=16% Similarity=0.029 Sum_probs=52.3
Q ss_pred Hhhhhhhhc----CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHH
Q psy11108 8 IASRHSFVL----CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAE 83 (106)
Q Consensus 8 ~~~~~~f~~----~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe 83 (106)
+.-+.-+.+ .......|-.+|..|-+.+++++|..++.++.++....-.. .+.-.+..|.+..-.|+..+|-
T Consensus 129 l~~R~~~l~~~~~~~~~~~~~l~~a~~aRk~g~~~~A~~~l~~~~~~~~~~~~~----~~~v~~e~akllw~~g~~~~Ai 204 (352)
T PF02259_consen 129 LSLRRLVLSLILLPEELAETWLKFAKLARKAGNFQLALSALNRLFQLNPSSESL----LPRVFLEYAKLLWAQGEQEEAI 204 (352)
T ss_pred HHHHHHHHhcccchhHHHHHHHHHHHHHHHCCCcHHHHHHHHHHhccCCcccCC----CcchHHHHHHHHHHcCCHHHHH
Confidence 333444444 44556779999999999999999999999887753111000 3344556788888888888887
Q ss_pred HHHH
Q psy11108 84 NYQL 87 (106)
Q Consensus 84 ~i~l 87 (106)
+.+.
T Consensus 205 ~~L~ 208 (352)
T PF02259_consen 205 QKLR 208 (352)
T ss_pred HHHH
Confidence 7653
No 74
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=62.35 E-value=43 Score=30.88 Aligned_cols=73 Identities=12% Similarity=-0.006 Sum_probs=46.8
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCC--chhh--------------------HHHHHhhCCC--cH--HHHHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQD--YAGL--------------------QLIKSLRALP--DD--IMKHA 70 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d--~~~v--------------------~~I~~l~~~~--~~--~~~~A 70 (106)
.++++..+-.||...++..+++-|...|.++-. .... ..++++.+.. +. ....|
T Consensus 599 ~p~~~~~~~~La~~~~~~g~~~~A~~~y~~al~~~P~~~~a~~~la~~~~~~g~~~eA~~~l~~ll~~~p~~~~~~~~la 678 (1157)
T PRK11447 599 QPPSTRIDLTLADWAQQRGDYAAARAAYQRVLTREPGNADARLGLIEVDIAQGDLAAARAQLAKLPATANDSLNTQRRVA 678 (1157)
T ss_pred CCCCchHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHhccCCCChHHHHHHH
Confidence 345567788999999999999999988875532 1111 1222222221 11 23457
Q ss_pred HHHHHcCCHHHHHHHHHHc
Q psy11108 71 YIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 71 ei~ay~Gkf~EAe~i~l~a 89 (106)
.++.-.|++++|.++|.+.
T Consensus 679 ~~~~~~g~~~eA~~~~~~a 697 (1157)
T PRK11447 679 LAWAALGDTAAAQRTFNRL 697 (1157)
T ss_pred HHHHhCCCHHHHHHHHHHH
Confidence 7777888888888888775
No 75
>KOG0985|consensus
Probab=62.23 E-value=13 Score=35.40 Aligned_cols=46 Identities=9% Similarity=-0.021 Sum_probs=42.4
Q ss_pred HHhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchh
Q psy11108 7 VIASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAG 52 (106)
Q Consensus 7 ~~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~ 52 (106)
-|-.|.+|.+....|..|+.||.+-|+++-..-|-..|++..|.+-
T Consensus 1090 ~ldRA~efAe~~n~p~vWsqlakAQL~~~~v~dAieSyikadDps~ 1135 (1666)
T KOG0985|consen 1090 SLDRAYEFAERCNEPAVWSQLAKAQLQGGLVKDAIESYIKADDPSN 1135 (1666)
T ss_pred hHHHHHHHHHhhCChHHHHHHHHHHHhcCchHHHHHHHHhcCCcHH
Confidence 3567999999999999999999999999999999999999999863
No 76
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=62.14 E-value=8.7 Score=29.52 Aligned_cols=60 Identities=10% Similarity=0.041 Sum_probs=35.9
Q ss_pred HHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcC-CHHHHHHHHHHc
Q psy11108 30 VSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQG-RFQTAENYQLQA 89 (106)
Q Consensus 30 ~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~G-kf~EAe~i~l~a 89 (106)
.|....|+++|+..|.|+.+..-...-+..+.+.+--.-.|.-+.-.+ ++++|-..+.++
T Consensus 2 ~A~~~~~~~~A~~~~~K~~~~~~~~~~~~~~~La~~~yn~G~~l~~~~~~~~~a~~wL~~a 62 (278)
T PF08631_consen 2 LAWKQGDLDLAEHMYSKAKDLLNSLDPDMAEELARVCYNIGKSLLSKKDKYEEAVKWLQRA 62 (278)
T ss_pred cchhhCCHHHHHHHHHHhhhHHhcCCcHHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHH
Confidence 478899999999999999887521111111111111122344444555 788887776664
No 77
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=61.93 E-value=43 Score=28.98 Aligned_cols=31 Identities=6% Similarity=-0.250 Sum_probs=26.9
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
.+.|+..+..|+...+..++++-|...|.++
T Consensus 72 ~p~~~~~l~~l~~~~l~~g~~~~A~~~l~~~ 102 (656)
T PRK15174 72 AKNGRDLLRRWVISPLASSQPDAVLQVVNKL 102 (656)
T ss_pred CCCchhHHHHHhhhHhhcCCHHHHHHHHHHH
Confidence 4556777999999999999999999999876
No 78
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=60.56 E-value=38 Score=20.98 Aligned_cols=58 Identities=14% Similarity=0.004 Sum_probs=31.4
Q ss_pred HHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhC---CCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 23 QASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRA---LPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 23 lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~---~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
.+-.+|...+...+++-|...|..+-.. -.+ .+......|.++.-.|++++|.++|.+
T Consensus 4 ~~~~~~~~~~~~~~~~~A~~~~~~~~~~--------~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~ 64 (119)
T TIGR02795 4 AYYDAALLVLKAGDYADAIQAFQAFLKK--------YPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLA 64 (119)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHH--------CCCccccHHHHHHHHHHHHhhccHHHHHHHHHH
Confidence 3455666677777777776666554110 000 011234456666666666666666654
No 79
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=60.44 E-value=61 Score=23.66 Aligned_cols=65 Identities=9% Similarity=-0.146 Sum_probs=50.8
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
++.+..-|--||...-..+++.-|-.+|.+..-.+. +-+.+..-.|......|+.++|.+-|..+
T Consensus 65 Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~--------ddp~~~~~ag~c~L~lG~~~~A~~aF~~A 129 (157)
T PRK15363 65 DAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKI--------DAPQAPWAAAECYLACDNVCYAIKALKAV 129 (157)
T ss_pred CcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCC--------CCchHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 445566799999999999999999999987754421 33455677899999999999999999763
No 80
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=59.62 E-value=48 Score=28.28 Aligned_cols=63 Identities=14% Similarity=-0.025 Sum_probs=48.5
Q ss_pred cCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcH------HHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 16 LCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDD------IMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 16 ~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~------~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.+++++..|..||..=....+++-|..+|.+.=. --|+. ....|-.++.+|+++||...|.++
T Consensus 70 ~dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALe-----------L~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrA 138 (453)
T PLN03098 70 ADVKTAEDAVNLGLSLFSKGRVKDALAQFETALE-----------LNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTA 138 (453)
T ss_pred CCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHh-----------hCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4667788899999988888889988888865211 11221 256799999999999999999885
No 81
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=59.25 E-value=45 Score=23.03 Aligned_cols=68 Identities=12% Similarity=-0.024 Sum_probs=39.6
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+..|..||.......+++-|..+|.+.-....-. ......+-.--...|+++.-.|++++|...|.++
T Consensus 72 ~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~~-~~~~~~la~i~~~~~~~~~~~g~~~~A~~~~~~a 139 (168)
T CHL00033 72 SYILYNIGLIHTSNGEHTKALEYYFQALERNPFL-PQALNNMAVICHYRGEQAIEQGDSEIAEAWFDQA 139 (168)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCc-HHHHHHHHHHHHHhhHHHHHcccHHHHHHHHHHH
Confidence 4579999999999999999999987654321100 0000000001122344445788888666666553
No 82
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=58.89 E-value=36 Score=22.06 Aligned_cols=60 Identities=13% Similarity=-0.029 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 22 GQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 22 ~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..=-.+|...+...+++-|...|.++-... -.-+.--...|.++..+|++++|..+|.++
T Consensus 18 ~~~~~~a~~~~~~~~~~~A~~~~~~~~~~~--------p~~~~~~~~la~~~~~~~~~~~A~~~~~~~ 77 (135)
T TIGR02552 18 EQIYALAYNLYQQGRYDEALKLFQLLAAYD--------PYNSRYWLGLAACCQMLKEYEEAIDAYALA 77 (135)
T ss_pred HHHHHHHHHHHHcccHHHHHHHHHHHHHhC--------CCcHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 334567777888888888888775542211 001112346788999999999999998876
No 83
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=58.55 E-value=44 Score=26.85 Aligned_cols=23 Identities=9% Similarity=0.115 Sum_probs=16.8
Q ss_pred HHHHHHhcccHHHHHHHHHHcCC
Q psy11108 27 LSGVSQKKERFYHVEQALVNCQD 49 (106)
Q Consensus 27 LA~~AL~~~~l~iAe~Af~rl~d 49 (106)
-|..++..+|++.|++...+.-+
T Consensus 90 ~glla~~~g~~~~A~~~l~~~~~ 112 (409)
T TIGR00540 90 EALLKLAEGDYAKAEKLIAKNAD 112 (409)
T ss_pred HHHHHHhCCCHHHHHHHHHHHhh
Confidence 35667888888888888865433
No 84
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=57.45 E-value=59 Score=24.69 Aligned_cols=23 Identities=13% Similarity=0.034 Sum_probs=18.1
Q ss_pred HHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
..|.++..+|++++|..+|.+..
T Consensus 191 ~la~~~~~~G~~~~A~~~~~~~~ 213 (355)
T cd05804 191 HLALFYLERGDYEAALAIYDTHI 213 (355)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHh
Confidence 46778888888888888888764
No 85
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=56.56 E-value=56 Score=28.71 Aligned_cols=62 Identities=15% Similarity=0.079 Sum_probs=39.9
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCc---HHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPD---DIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~---~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.+.++.+|..||...+...+.+-|+..+.++-.. -|+ -....|.++.-.|+|++|++++-+.
T Consensus 389 ~P~n~~l~~~lA~l~~~~g~~~~A~~~l~~al~l-----------~Pd~~~l~~~~a~~al~~~~~~~A~~~~~~l 453 (765)
T PRK10049 389 APGNQGLRIDYASVLQARGWPRAAENELKKAEVL-----------EPRNINLEVEQAWTALDLQEWRQMDVLTDDV 453 (765)
T ss_pred CCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh-----------CCCChHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 3445666777776666666666666555443221 122 1246788888999999999999664
No 86
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=56.35 E-value=95 Score=24.26 Aligned_cols=80 Identities=16% Similarity=0.049 Sum_probs=48.5
Q ss_pred hhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCch----hhHHHHHh--------------------hCC--C
Q psy11108 10 SRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYA----GLQLIKSL--------------------RAL--P 63 (106)
Q Consensus 10 ~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~----~v~~I~~l--------------------~~~--~ 63 (106)
+|.+......+..+=.+.....|.-.+++.|++-|....+.+ -+++.... .+. +
T Consensus 120 ~AL~~l~~~~~lE~~al~Vqi~L~~~R~dlA~k~l~~~~~~~eD~~l~qLa~awv~l~~g~e~~~~A~y~f~El~~~~~~ 199 (290)
T PF04733_consen 120 EALKLLHKGGSLELLALAVQILLKMNRPDLAEKELKNMQQIDEDSILTQLAEAWVNLATGGEKYQDAFYIFEELSDKFGS 199 (290)
T ss_dssp HHHCCCTTTTCHHHHHHHHHHHHHTT-HHHHHHHHHHHHCCSCCHHHHHHHHHHHHHHHTTTCCCHHHHHHHHHHCCS--
T ss_pred HHHHHHHccCcccHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHhCchhHHHHHHHHHHHHhccCC
Confidence 444455555444455667778888888999999888775322 22322211 111 1
Q ss_pred cHHH--HHHHHHHHcCCHHHHHHHHHHc
Q psy11108 64 DDIM--KHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 64 ~~~~--~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
++.+ -+|-..+.+|+|+||+.++.++
T Consensus 200 t~~~lng~A~~~l~~~~~~eAe~~L~~a 227 (290)
T PF04733_consen 200 TPKLLNGLAVCHLQLGHYEEAEELLEEA 227 (290)
T ss_dssp SHHHHHHHHHHHHHCT-HHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 2233 3567788999999999999885
No 87
>PRK15331 chaperone protein SicA; Provisional
Probab=56.07 E-value=24 Score=26.08 Aligned_cols=33 Identities=36% Similarity=0.655 Sum_probs=23.9
Q ss_pred HHHHhhCCCcHHH----HHHHHHHHcCCHHHHHHHHH
Q psy11108 55 LIKSLRALPDDIM----KHAYIAQHQGRFQTAENYQL 87 (106)
Q Consensus 55 ~I~~l~~~~~~~~----~~Aei~ay~Gkf~EAe~i~l 87 (106)
.++.+..+++..+ ..|.-.-.+||++||+++|.
T Consensus 25 tlk~l~gis~~~le~iY~~Ay~~y~~Gk~~eA~~~F~ 61 (165)
T PRK15331 25 TLKDVHGIPQDMMDGLYAHAYEFYNQGRLDEAETFFR 61 (165)
T ss_pred CHHHHhCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH
Confidence 3556667777665 45666667899999999986
No 88
>KOG2076|consensus
Probab=55.94 E-value=60 Score=29.97 Aligned_cols=61 Identities=16% Similarity=0.156 Sum_probs=42.4
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHc--CCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNC--QDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl--~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
+....+|..||..+.+..+++.|..||.+. -+....+++ ..++.++--.|+...|..-|++
T Consensus 204 p~d~e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~----------~ers~L~~~~G~~~~Am~~f~~ 266 (895)
T KOG2076|consen 204 PKDYELWKRLADLSEQLGNINQARYCYSRAIQANPSNWELI----------YERSSLYQKTGDLKRAMETFLQ 266 (895)
T ss_pred CCChHHHHHHHHHHHhcccHHHHHHHHHHHHhcCCcchHHH----------HHHHHHHHHhChHHHHHHHHHH
Confidence 344688999999999999999999999875 122222222 2345555555887777777766
No 89
>PRK12370 invasion protein regulator; Provisional
Probab=55.59 E-value=64 Score=27.17 Aligned_cols=73 Identities=14% Similarity=0.163 Sum_probs=45.9
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCc--h-----h---------------hHHHHHhhCC--Cc-H--HHHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDY--A-----G---------------LQLIKSLRAL--PD-D--IMKH 69 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~--~-----~---------------v~~I~~l~~~--~~-~--~~~~ 69 (106)
++.++..|..||...+..++++-|...|.+.-.. . . +...+++... ++ + ....
T Consensus 368 ~P~~~~a~~~lg~~l~~~G~~~eAi~~~~~Al~l~P~~~~~~~~~~~~~~~~g~~eeA~~~~~~~l~~~~p~~~~~~~~l 447 (553)
T PRK12370 368 SPISADIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAGITKLWITYYHTGIDDAIRLGDELRSQHLQDNPILLSMQ 447 (553)
T ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCChhhHHHHHHHHHhccCHHHHHHHHHHHHHhccccCHHHHHHH
Confidence 4455777899999888889988888877764111 1 0 1111111111 11 1 2346
Q ss_pred HHHHHHcCCHHHHHHHHHHc
Q psy11108 70 AYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 70 Aei~ay~Gkf~EAe~i~l~a 89 (106)
|.++..+|+++||...+.+.
T Consensus 448 a~~l~~~G~~~eA~~~~~~~ 467 (553)
T PRK12370 448 VMFLSLKGKHELARKLTKEI 467 (553)
T ss_pred HHHHHhCCCHHHHHHHHHHh
Confidence 78888899999999988764
No 90
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=55.26 E-value=58 Score=25.98 Aligned_cols=60 Identities=12% Similarity=-0.015 Sum_probs=42.5
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 22 GQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 22 ~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..+...|..++...+++-|...|.+.-...- .-..--..+|.++...|++++|...+.++
T Consensus 3 ~~l~~~a~~a~~~~~~~~Ai~~~~~Al~~~P--------~~~~a~~~~a~~~~~~g~~~eAl~~~~~A 62 (356)
T PLN03088 3 KDLEDKAKEAFVDDDFALAVDLYTQAIDLDP--------NNAELYADRAQANIKLGNFTEAVADANKA 62 (356)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHhCC--------CCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3456779999999999999988875421100 01112346788999999999999988775
No 91
>KOG1920|consensus
Probab=54.89 E-value=42 Score=32.01 Aligned_cols=74 Identities=9% Similarity=0.013 Sum_probs=54.0
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHH-----HHHHHHHcCCHHHHHHHHHH-cCCcccccc
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMK-----HAYIAQHQGRFQTAENYQLQ-ADRRTWKSK 97 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~-----~Aei~ay~Gkf~EAe~i~l~-ag~~~~Aie 97 (106)
|.--|.+=...++++-|-+||..++|.+.+.-|-.....+..++. .+.-+.-+||+-||.++.+. ++.+..|+.
T Consensus 955 ~~~Aal~Ye~~GklekAl~a~~~~~dWr~~l~~a~ql~~~~de~~~~a~~L~s~L~e~~kh~eAa~il~e~~sd~~~av~ 1034 (1265)
T KOG1920|consen 955 SDEAALMYERCGKLEKALKAYKECGDWREALSLAAQLSEGKDELVILAEELVSRLVEQRKHYEAAKILLEYLSDPEEAVA 1034 (1265)
T ss_pred ccHHHHHHHHhccHHHHHHHHHHhccHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHcccchhHHHHHHHHhcCHHHHHH
Confidence 555666667788999999999999999999888776666554443 45556678888888888777 344444443
No 92
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=54.46 E-value=51 Score=26.45 Aligned_cols=22 Identities=9% Similarity=-0.055 Sum_probs=17.8
Q ss_pred HHHHHHHHcCCHHHHHHHHHHc
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..|+++..+|++++|.+.+.+.
T Consensus 158 ~~a~l~l~~g~~~~Al~~l~~~ 179 (398)
T PRK10747 158 TRVRIQLARNENHAARHGVDKL 179 (398)
T ss_pred HHHHHHHHCCCHHHHHHHHHHH
Confidence 5588888888888888888774
No 93
>KOG3060|consensus
Probab=53.68 E-value=43 Score=26.92 Aligned_cols=58 Identities=10% Similarity=0.021 Sum_probs=42.3
Q ss_pred HHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 23 QASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 23 lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
+-..+.-+||....+++|.+|+-+++| +...-+.-....|..+=-.|++++|.++|-.
T Consensus 54 l~EqV~IAAld~~~~~lAq~C~~~L~~--------~fp~S~RV~~lkam~lEa~~~~~~A~e~y~~ 111 (289)
T KOG3060|consen 54 LYEQVFIAALDTGRDDLAQKCINQLRD--------RFPGSKRVGKLKAMLLEATGNYKEAIEYYES 111 (289)
T ss_pred HHHHHHHHHHHhcchHHHHHHHHHHHH--------hCCCChhHHHHHHHHHHHhhchhhHHHHHHH
Confidence 346778889999999999999987654 2211112234678888888999999999865
No 94
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=53.50 E-value=61 Score=24.60 Aligned_cols=58 Identities=14% Similarity=0.111 Sum_probs=40.0
Q ss_pred HHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 23 QASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 23 lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
....+|..-+..++++-|+..|.+.-... .+...-....|+++...|+++||..+|.+
T Consensus 116 ~~~~~a~~~~~~G~~~~A~~~~~~al~~~--------p~~~~~~~~la~i~~~~g~~~eA~~~l~~ 173 (355)
T cd05804 116 LLGMLAFGLEEAGQYDRAEEAARRALELN--------PDDAWAVHAVAHVLEMQGRFKEGIAFMES 173 (355)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHhhC--------CCCcHHHHHHHHHHHHcCCHHHHHHHHHh
Confidence 34567778888889999988888763321 01122334568888888888888888765
No 95
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=51.36 E-value=84 Score=29.10 Aligned_cols=75 Identities=13% Similarity=0.151 Sum_probs=48.1
Q ss_pred hcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCC--chhhH--------------------HHHHhhCC-CcH---HHH
Q psy11108 15 VLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQD--YAGLQ--------------------LIKSLRAL-PDD---IMK 68 (106)
Q Consensus 15 ~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d--~~~v~--------------------~I~~l~~~-~~~---~~~ 68 (106)
++..|++..|..+|....+..+.+-|+.+|.+.-. ++-.. .+++.-++ |+. ...
T Consensus 603 L~l~P~~~a~~~LA~~l~~lG~~deA~~~l~~AL~l~Pd~~~a~~nLG~aL~~~G~~eeAi~~l~~AL~l~P~~~~a~~n 682 (987)
T PRK09782 603 LNIAPSANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKGLPDDPALIRQ 682 (987)
T ss_pred HHhCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHH
Confidence 34445678899999999999999999888875422 11111 11111111 221 245
Q ss_pred HHHHHHHcCCHHHHHHHHHHc
Q psy11108 69 HAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 69 ~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.|.++...|++++|+..|.++
T Consensus 683 LA~al~~lGd~~eA~~~l~~A 703 (987)
T PRK09782 683 LAYVNQRLDDMAATQHYARLV 703 (987)
T ss_pred HHHHHHHCCCHHHHHHHHHHH
Confidence 678888888899998888886
No 96
>KOG1840|consensus
Probab=50.99 E-value=24 Score=30.31 Aligned_cols=61 Identities=16% Similarity=0.170 Sum_probs=36.5
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCC-----cHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALP-----DDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~-----~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
-..++..-....+++-|.+=|.+. +..+...-... .-....|+++..+|||+||+.+|.++
T Consensus 328 l~~~~~~~~~~~~~Eea~~l~q~a-----l~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~a 393 (508)
T KOG1840|consen 328 LSELAAILQSMNEYEEAKKLLQKA-----LKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKA 393 (508)
T ss_pred HHHHHHHHHHhcchhHHHHHHHHH-----HHHHHhhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHHH
Confidence 334444444555677777666622 22222111111 12246799999999999999999886
No 97
>KOG0543|consensus
Probab=49.09 E-value=51 Score=27.70 Aligned_cols=65 Identities=8% Similarity=0.043 Sum_probs=47.1
Q ss_pred HHHHHHHHhcccHHHHHHHHHHcCCchhh-HHHHHhhCCCcH---H-HHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 25 SFLSGVSQKKERFYHVEQALVNCQDYAGL-QLIKSLRALPDD---I-MKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 25 ~~LA~~AL~~~~l~iAe~Af~rl~d~~~v-~~I~~l~~~~~~---~-~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
...+.......++++| .|+.+++++... +.+.+.-.+++. . -++|+.++-.|.|+.|...|.++=
T Consensus 249 ~~~~~~~k~~~~lNlA-~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~ 318 (397)
T KOG0543|consen 249 QKKAEALKLACHLNLA-ACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQALLALGEYDLARDDFQKAL 318 (397)
T ss_pred HHHHHHHHHHHhhHHH-HHHHhhhhHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhhccHHHHHHHHHHHH
Confidence 3455556666667776 488889888743 555555555443 3 378999999999999999999863
No 98
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=48.60 E-value=61 Score=24.48 Aligned_cols=43 Identities=9% Similarity=-0.011 Sum_probs=30.5
Q ss_pred HHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 38 YHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 38 ~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+-|..-|.+.+..++..+|. ...|+-+...|++++|.++|...
T Consensus 162 ~~A~~~f~~~~~~R~~~~l~---------~~~A~ey~~~g~~~~A~~~l~~~ 204 (247)
T PF11817_consen 162 EKAYEQFKKYGQNRMASYLS---------LEMAEEYFRLGDYDKALKLLEPA 204 (247)
T ss_pred HHHHHHHHHhccchHHHHHH---------HHHHHHHHHCCCHHHHHHHHHHH
Confidence 34444455555544444443 67899999999999999999886
No 99
>PF07219 HemY_N: HemY protein N-terminus; InterPro: IPR010817 This entry represents the N terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. These are membrane protein involved in a late step of protoheme IX synthesis [].
Probab=47.31 E-value=48 Score=22.01 Aligned_cols=27 Identities=26% Similarity=0.388 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHcCC
Q psy11108 65 DIMKHAYIAQHQGRFQTAENYQLQADR 91 (106)
Q Consensus 65 ~~~~~Aei~ay~Gkf~EAe~i~l~ag~ 91 (106)
..+..|-++...|++..|++...++..
T Consensus 61 ~al~~Gl~al~~G~~~~A~k~~~~a~~ 87 (108)
T PF07219_consen 61 RALSRGLIALAEGDWQRAEKLLAKAAK 87 (108)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHh
Confidence 456789999999999999999999743
No 100
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=46.44 E-value=1.1e+02 Score=26.95 Aligned_cols=29 Identities=3% Similarity=-0.084 Sum_probs=19.0
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHH
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVN 46 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~r 46 (106)
+.....|..||.......+++.|...|.+
T Consensus 46 ~~~a~~~~~lA~~~~~~g~~~~A~~~~~~ 74 (765)
T PRK10049 46 QLPARGYAAVAVAYRNLKQWQNSLTLWQK 74 (765)
T ss_pred CCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 33344577777777777777777776666
No 101
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=45.57 E-value=23 Score=23.69 Aligned_cols=55 Identities=11% Similarity=0.176 Sum_probs=35.8
Q ss_pred HHHHHcCCchh-hHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcCCccccccC
Q psy11108 42 QALVNCQDYAG-LQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQADRRTWKSKP 98 (106)
Q Consensus 42 ~Af~rl~d~~~-v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag~~~~Aiem 98 (106)
..|.+.+++.. ..|++.....+ .-..+.++--.|.+++|.-+|.+.|+.+.|+++
T Consensus 50 ~ly~~~~~~~~l~~~L~~~~~yd--~~~~~~~c~~~~l~~~a~~Ly~~~~~~~~al~i 105 (143)
T PF00637_consen 50 ELYIKYDPYEKLLEFLKTSNNYD--LDKALRLCEKHGLYEEAVYLYSKLGNHDEALEI 105 (143)
T ss_dssp HHHHCTTTCCHHHHTTTSSSSS---CTHHHHHHHTTTSHHHHHHHHHCCTTHTTCSST
T ss_pred HHHHhcCCchHHHHHcccccccC--HHHHHHHHHhcchHHHHHHHHHHcccHHHHHHH
Confidence 34444455333 35555333322 234467777788999999999999999988874
No 102
>KOG2796|consensus
Probab=45.08 E-value=54 Score=26.87 Aligned_cols=29 Identities=14% Similarity=0.044 Sum_probs=26.2
Q ss_pred CHHHHHHHHHHHHhcccHHHHHHHHHHcC
Q psy11108 20 VAGQASFLSGVSQKKERFYHVEQALVNCQ 48 (106)
Q Consensus 20 ~~~lW~~LA~~AL~~~~l~iAe~Af~rl~ 48 (106)
.|.|-+.|+..+|+.+|.+.|++-|.++.
T Consensus 211 ~p~L~s~Lgr~~MQ~GD~k~a~~yf~~ve 239 (366)
T KOG2796|consen 211 EPQLLSGLGRISMQIGDIKTAEKYFQDVE 239 (366)
T ss_pred cHHHHHHHHHHHHhcccHHHHHHHHHHHH
Confidence 48899999999999999999999998654
No 103
>KOG1538|consensus
Probab=44.89 E-value=29 Score=31.63 Aligned_cols=64 Identities=17% Similarity=0.129 Sum_probs=46.0
Q ss_pred hcccHHHHHHHHHHcCCchhhHHHHHhhC----------------CCcHHHHHHHHHHHcCCHHHHHHHHHHcCCccccc
Q psy11108 33 KKERFYHVEQALVNCQDYAGLQLIKSLRA----------------LPDDIMKHAYIAQHQGRFQTAENYQLQADRRTWKS 96 (106)
Q Consensus 33 ~~~~l~iAe~Af~rl~d~~~v~~I~~l~~----------------~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag~~~~Ai 96 (106)
....+..|-+-|..+||...|-=+ +++. +++-.+..|.-++-..+|.||.+.|.++|+...|+
T Consensus 759 ~l~~~gLAaeIF~k~gD~ksiVql-Hve~~~W~eAFalAe~hPe~~~dVy~pyaqwLAE~DrFeEAqkAfhkAGr~~EA~ 837 (1081)
T KOG1538|consen 759 KLDSPGLAAEIFLKMGDLKSLVQL-HVETQRWDEAFALAEKHPEFKDDVYMPYAQWLAENDRFEEAQKAFHKAGRQREAV 837 (1081)
T ss_pred hccccchHHHHHHHhccHHHHhhh-eeecccchHhHhhhhhCccccccccchHHHHhhhhhhHHHHHHHHHHhcchHHHH
Confidence 344567788889999988765222 1111 12233567888889999999999999999998887
Q ss_pred c
Q psy11108 97 K 97 (106)
Q Consensus 97 e 97 (106)
+
T Consensus 838 ~ 838 (1081)
T KOG1538|consen 838 Q 838 (1081)
T ss_pred H
Confidence 6
No 104
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=44.63 E-value=92 Score=24.63 Aligned_cols=63 Identities=16% Similarity=0.160 Sum_probs=42.0
Q ss_pred CCCHH-HHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 18 PYVAG-QASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 18 ~~~~~-lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
|++++ -|..++..||+.++++-|+.-|.|- +....+.+...+.+|....-.|++-+|...|..
T Consensus 135 Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~ra--------L~~dp~~~~~~l~~a~~~~~~~~y~~Ar~~~~~ 198 (250)
T COG3063 135 YGEPSDTLENLGLCALKAGQFDQAEEYLKRA--------LELDPQFPPALLELARLHYKAGDYAPARLYLER 198 (250)
T ss_pred CCCcchhhhhhHHHHhhcCCchhHHHHHHHH--------HHhCcCCChHHHHHHHHHHhcccchHHHHHHHH
Confidence 44444 4999999999999999999888764 333334444455556666666666666555443
No 105
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=44.44 E-value=75 Score=19.58 Aligned_cols=63 Identities=14% Similarity=0.153 Sum_probs=43.6
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHcCCch-hhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNCQDYA-GLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~-~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+..+-.+|...+...+++-|...|..+-... .-. ..+.--...|.++.-.|++++|.+.|.+.
T Consensus 39 ~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~------~~~~~~~~~~~~~~~~~~~~~A~~~~~~~ 102 (119)
T TIGR02795 39 PNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKSP------KAPDALLKLGMSLQELGDKEKAKATLQQV 102 (119)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCCC------cccHHHHHHHHHHHHhCChHHHHHHHHHH
Confidence 3456778999999999999888887653210 000 00112356788899999999999999864
No 106
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.58 E-value=77 Score=25.08 Aligned_cols=71 Identities=13% Similarity=0.059 Sum_probs=48.2
Q ss_pred hhhhhhhcCCCCHHH------HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCC---CcHHHHHHHHHHHcCCH
Q psy11108 9 ASRHSFVLCPYVAGQ------ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRAL---PDDIMKHAYIAQHQGRF 79 (106)
Q Consensus 9 ~~~~~f~~~~~~~~l------W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~---~~~~~~~Aei~ay~Gkf 79 (106)
+.=.+||+.||.... | |++.=...++++=|...|.++ +++.-+- |+--++.|.++.-.|+-
T Consensus 162 ~~F~~fi~~YP~s~~~~nA~yW--LGe~~y~qg~y~~Aa~~f~~~--------~k~~P~s~KApdallKlg~~~~~l~~~ 231 (262)
T COG1729 162 QAFQAFIKKYPNSTYTPNAYYW--LGESLYAQGDYEDAAYIFARV--------VKDYPKSPKAPDALLKLGVSLGRLGNT 231 (262)
T ss_pred HHHHHHHHcCCCCcccchhHHH--HHHHHHhcccchHHHHHHHHH--------HHhCCCCCCChHHHHHHHHHHHHhcCH
Confidence 334578888887654 6 676666777777777666543 2333222 23346789999999999
Q ss_pred HHHHHHHHHc
Q psy11108 80 QTAENYQLQA 89 (106)
Q Consensus 80 ~EAe~i~l~a 89 (106)
++|-.+|.+-
T Consensus 232 d~A~atl~qv 241 (262)
T COG1729 232 DEACATLQQV 241 (262)
T ss_pred HHHHHHHHHH
Confidence 9999998774
No 107
>PF12854 PPR_1: PPR repeat
Probab=42.66 E-value=31 Score=18.18 Aligned_cols=16 Identities=19% Similarity=0.055 Sum_probs=12.3
Q ss_pred HHHcCCHHHHHHHHHH
Q psy11108 73 AQHQGRFQTAENYQLQ 88 (106)
Q Consensus 73 ~ay~Gkf~EAe~i~l~ 88 (106)
+.-.|+++||.++|-+
T Consensus 17 ~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 17 YCKAGRVDEAFELFDE 32 (34)
T ss_pred HHHCCCHHHHHHHHHh
Confidence 3457889999988865
No 108
>KOG1840|consensus
Probab=42.61 E-value=76 Score=27.29 Aligned_cols=61 Identities=15% Similarity=0.152 Sum_probs=43.0
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHH----H-HHHHHHHHcCCHHHHHHHHHHc
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDI----M-KHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~----~-~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
-..||.+-...++|+.|+.-|.+.=+. +.+-.-...+. + -.|.++.-+++|+||..+|.++
T Consensus 202 ~~~La~~y~~~g~~e~A~~l~k~Al~~-----l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~A 267 (508)
T KOG1840|consen 202 LRNLAEMYAVQGRLEKAEPLCKQALRI-----LEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEA 267 (508)
T ss_pred HHHHHHHHHHhccHHHHHHHHHHHHHH-----HHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHH
Confidence 455889999999999999888754222 22221122211 1 3689999999999999999876
No 109
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=40.54 E-value=1.7e+02 Score=22.61 Aligned_cols=73 Identities=18% Similarity=0.156 Sum_probs=51.9
Q ss_pred HhhhhhhhcCCCCH----HHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCC---CcHHHHHHHHHHHcCCHH
Q psy11108 8 IASRHSFVLCPYVA----GQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRAL---PDDIMKHAYIAQHQGRFQ 80 (106)
Q Consensus 8 ~~~~~~f~~~~~~~----~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~---~~~~~~~Aei~ay~Gkf~ 80 (106)
|..-..|++.||.. ..+-.||..-....+++-|...|.++ |+.--+- ++--...|.++.-.|+++
T Consensus 163 i~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~v--------v~~yP~s~~~~dAl~klg~~~~~~g~~~ 234 (263)
T PRK10803 163 IVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASV--------VKNYPKSPKAADAMFKVGVIMQDKGDTA 234 (263)
T ss_pred HHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHH--------HHHCCCCcchhHHHHHHHHHHHHcCCHH
Confidence 34456788888875 46788898889999999998887654 2222111 122245688888999999
Q ss_pred HHHHHHHH
Q psy11108 81 TAENYQLQ 88 (106)
Q Consensus 81 EAe~i~l~ 88 (106)
+|.++|.+
T Consensus 235 ~A~~~~~~ 242 (263)
T PRK10803 235 KAKAVYQQ 242 (263)
T ss_pred HHHHHHHH
Confidence 99999975
No 110
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=39.99 E-value=24 Score=26.93 Aligned_cols=27 Identities=19% Similarity=0.298 Sum_probs=22.1
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHcCCcc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQADRRT 93 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~ag~~~ 93 (106)
...|.+++-.|+|++|-++|.+.+...
T Consensus 159 ~~~A~l~~~l~~y~~A~~~~e~~~~~~ 185 (282)
T PF14938_consen 159 LKAADLYARLGRYEEAIEIYEEVAKKC 185 (282)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHTC
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHh
Confidence 577999999999999999998876543
No 111
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=39.76 E-value=47 Score=16.03 Aligned_cols=17 Identities=18% Similarity=0.132 Sum_probs=13.5
Q ss_pred HHHcCCHHHHHHHHHHc
Q psy11108 73 AQHQGRFQTAENYQLQA 89 (106)
Q Consensus 73 ~ay~Gkf~EAe~i~l~a 89 (106)
+.-.|++++|.++|.+.
T Consensus 10 ~~~~~~~~~a~~~~~~M 26 (35)
T TIGR00756 10 LCKAGRVEEALELFKEM 26 (35)
T ss_pred HHHCCCHHHHHHHHHHH
Confidence 45678899999998764
No 112
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=39.20 E-value=1.3e+02 Score=20.85 Aligned_cols=27 Identities=0% Similarity=-0.208 Sum_probs=19.7
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
...|..||......++++-|..+|.+.
T Consensus 72 ~~~~~~la~~~~~~g~~~~A~~~~~~a 98 (172)
T PRK02603 72 SYILYNMGIIYASNGEHDKALEYYHQA 98 (172)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 356777888888888888887777643
No 113
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.11 E-value=82 Score=24.91 Aligned_cols=52 Identities=15% Similarity=0.098 Sum_probs=39.5
Q ss_pred HHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCc------HHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 27 LSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPD------DIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 27 LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~------~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.|...+..+++..|+.+|. +.|++.|+ ..--.||...-+|+|++|+.+|+..
T Consensus 147 ~A~~~~ksgdy~~A~~~F~-----------~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~ 204 (262)
T COG1729 147 AALDLYKSGDYAEAEQAFQ-----------AFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARV 204 (262)
T ss_pred HHHHHHHcCCHHHHHHHHH-----------HHHHcCCCCcccchhHHHHHHHHHhcccchHHHHHHHHH
Confidence 4667788888888888875 23334443 3456799999999999999999984
No 114
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=39.06 E-value=51 Score=18.35 Aligned_cols=24 Identities=17% Similarity=0.004 Sum_probs=20.4
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 66 IMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 66 ~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
-.+.|+|.+-.++|++|..=|.++
T Consensus 4 ~~~Lgeisle~e~f~qA~~D~~~a 27 (38)
T PF10516_consen 4 YDLLGEISLENENFEQAIEDYEKA 27 (38)
T ss_pred HHHHHHHHHHhccHHHHHHHHHHH
Confidence 357899999999999999887764
No 115
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=38.62 E-value=1e+02 Score=22.42 Aligned_cols=31 Identities=3% Similarity=-0.067 Sum_probs=26.5
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
++.++..+..||..+.+.++++-|..+|.++
T Consensus 140 dP~~~~al~~LA~~~~~~g~~~~Ai~~~~~a 170 (198)
T PRK10370 140 DANEVTALMLLASDAFMQADYAQAIELWQKV 170 (198)
T ss_pred CCCChhHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4456778999999999999999999998876
No 116
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=37.72 E-value=40 Score=16.15 Aligned_cols=18 Identities=17% Similarity=0.021 Sum_probs=13.1
Q ss_pred HHHHcCCHHHHHHHHHHc
Q psy11108 72 IAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 72 i~ay~Gkf~EAe~i~l~a 89 (106)
.+.-.|++++|.++|.+.
T Consensus 9 ~~~~~~~~~~a~~~~~~M 26 (31)
T PF01535_consen 9 GYCKMGQFEEALEVFDEM 26 (31)
T ss_pred HHHccchHHHHHHHHHHH
Confidence 345578888888888764
No 117
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=36.49 E-value=1.4e+02 Score=20.60 Aligned_cols=61 Identities=13% Similarity=-0.018 Sum_probs=43.7
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCC---cHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALP---DDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~---~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
...+-.+|...+...+++-|...|.+.-+.. .+.+ ......|.++...|++++|..+|.++
T Consensus 35 a~~~~~lg~~~~~~g~~~~A~~~~~~al~~~--------~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~a 98 (172)
T PRK02603 35 AFVYYRDGMSAQADGEYAEALENYEEALKLE--------EDPNDRSYILYNMGIIYASNGEHDKALEYYHQA 98 (172)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHHh--------hccchHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3447889999999999999988887542210 0111 12356789999999999999998875
No 118
>KOG1129|consensus
Probab=35.20 E-value=78 Score=26.75 Aligned_cols=71 Identities=17% Similarity=0.012 Sum_probs=47.3
Q ss_pred hhhhhhcCCCC-HHHHHHHHHHHHhcccHHHHHHHHHHc--CCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHH
Q psy11108 10 SRHSFVLCPYV-AGQASFLSGVSQKKERFYHVEQALVNC--QDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQ 86 (106)
Q Consensus 10 ~~~~f~~~~~~-~~lW~~LA~~AL~~~~l~iAe~Af~rl--~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~ 86 (106)
.|-+-...+.. ...|=.|+..|..-+|++.|.+||.-+ .|..--.-+ .-.|-+.+-.|+.++|-.+|
T Consensus 383 RAlstat~~~~aaDvWYNlg~vaV~iGD~nlA~rcfrlaL~~d~~h~eal----------nNLavL~~r~G~i~~Arsll 452 (478)
T KOG1129|consen 383 RALSTATQPGQAADVWYNLGFVAVTIGDFNLAKRCFRLALTSDAQHGEAL----------NNLAVLAARSGDILGARSLL 452 (478)
T ss_pred HHHhhccCcchhhhhhhccceeEEeccchHHHHHHHHHHhccCcchHHHH----------HhHHHHHhhcCchHHHHHHH
Confidence 33344443333 344999999999999999999999743 222211111 12366677889999999998
Q ss_pred HHcC
Q psy11108 87 LQAD 90 (106)
Q Consensus 87 l~ag 90 (106)
.-+.
T Consensus 453 ~~A~ 456 (478)
T KOG1129|consen 453 NAAK 456 (478)
T ss_pred HHhh
Confidence 7764
No 119
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=34.88 E-value=44 Score=28.64 Aligned_cols=21 Identities=29% Similarity=0.256 Sum_probs=0.0
Q ss_pred HHHHHHHHcCCHHHHHHHHHH
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
.+|++++..|+++||+.+|..
T Consensus 43 ~rA~ll~kLg~~~eA~~~y~~ 63 (517)
T PF12569_consen 43 KRAELLLKLGRKEEAEKIYRE 63 (517)
T ss_pred HHHHHHHHcCCHHHHHHHHHH
No 120
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=34.74 E-value=54 Score=17.90 Aligned_cols=17 Identities=24% Similarity=0.499 Sum_probs=13.3
Q ss_pred HHHHHHHcCCHHHHHHH
Q psy11108 69 HAYIAQHQGRFQTAENY 85 (106)
Q Consensus 69 ~Aei~ay~Gkf~EAe~i 85 (106)
.|-..--+|++++|.++
T Consensus 7 ~a~~~y~~~ky~~A~~~ 23 (36)
T PF07720_consen 7 LAYNFYQKGKYDEAIHF 23 (36)
T ss_dssp HHHHHHHTT-HHHHHHH
T ss_pred HHHHHHHHhhHHHHHHH
Confidence 36667789999999999
No 121
>KOG1127|consensus
Probab=34.72 E-value=32 Score=32.49 Aligned_cols=73 Identities=25% Similarity=0.242 Sum_probs=55.9
Q ss_pred HHHhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHH
Q psy11108 6 AVIASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENY 85 (106)
Q Consensus 6 ~~~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i 85 (106)
..|++...+-.+ .+-|..||...|+..|++-|+.||.+++.++-..+..- +.+|.|.--.|+.-++-.+
T Consensus 838 CfIks~~sep~~---~~~W~NlgvL~l~n~d~E~A~~af~~~qSLdP~nl~~W--------lG~Ali~eavG~ii~~~~l 906 (1238)
T KOG1127|consen 838 CFIKSRFSEPTC---HCQWLNLGVLVLENQDFEHAEPAFSSVQSLDPLNLVQW--------LGEALIPEAVGRIIERLIL 906 (1238)
T ss_pred hhhhhhhccccc---hhheeccceeEEecccHHHhhHHHHhhhhcCchhhHHH--------HHHHHhHHHHHHHHHHHHH
Confidence 346666666555 78899999999999999999999999887766655554 3466777777777777777
Q ss_pred HHHc
Q psy11108 86 QLQA 89 (106)
Q Consensus 86 ~l~a 89 (106)
|-++
T Consensus 907 faHs 910 (1238)
T KOG1127|consen 907 FAHS 910 (1238)
T ss_pred HHhh
Confidence 7763
No 122
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=34.09 E-value=86 Score=22.43 Aligned_cols=32 Identities=22% Similarity=0.208 Sum_probs=24.1
Q ss_pred CCCcHHH--HHHHHHHHcCCHHHHHHHHHHcCCc
Q psy11108 61 ALPDDIM--KHAYIAQHQGRFQTAENYQLQADRR 92 (106)
Q Consensus 61 ~~~~~~~--~~Aei~ay~Gkf~EAe~i~l~ag~~ 92 (106)
..|++.+ ..+.++.++|+.+||++...+.-..
T Consensus 140 ~~P~~~~~~~~a~~l~~~G~~~eA~~~~~~~~~l 173 (193)
T PF11846_consen 140 RRPDPNVYQRYALALALLGDPEEARQWLARARRL 173 (193)
T ss_pred hCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh
Confidence 3456553 3499999999999999988876443
No 123
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=34.00 E-value=56 Score=17.16 Aligned_cols=25 Identities=12% Similarity=0.061 Sum_probs=20.9
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVE 41 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe 41 (106)
+|.++..|..||..=...++++.|+
T Consensus 9 ~P~n~~a~~nla~~~~~~g~~~~A~ 33 (34)
T PF13431_consen 9 NPNNAEAYNNLANLYLNQGDYEEAI 33 (34)
T ss_pred CCCCHHHHHHHHHHHHHCcCHHhhc
Confidence 4566888999999988888888875
No 124
>PF13041 PPR_2: PPR repeat family
Probab=33.86 E-value=56 Score=18.06 Aligned_cols=18 Identities=17% Similarity=0.204 Sum_probs=14.4
Q ss_pred HHHcCCHHHHHHHHHHcC
Q psy11108 73 AQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 73 ~ay~Gkf~EAe~i~l~ag 90 (106)
+.-.|++++|.++|.+..
T Consensus 13 ~~~~~~~~~a~~l~~~M~ 30 (50)
T PF13041_consen 13 YCKAGKFEEALKLFKEMK 30 (50)
T ss_pred HHHCcCHHHHHHHHHHHH
Confidence 446789999999998754
No 125
>KOG3785|consensus
Probab=32.78 E-value=1.6e+02 Score=25.27 Aligned_cols=70 Identities=17% Similarity=0.179 Sum_probs=45.4
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCCch------------------------hhHHHHHhhCC--CcH--HHHHHHHHHH
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQDYA------------------------GLQLIKSLRAL--PDD--IMKHAYIAQH 75 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d~~------------------------~v~~I~~l~~~--~~~--~~~~Aei~ay 75 (106)
...|++.-=..-.+.||+.-|.-+|.-. -+-++++|+.. +++ ..--|...+-
T Consensus 326 ~aalGQe~gSreHlKiAqqffqlVG~Sa~ecDTIpGRQsmAs~fFL~~qFddVl~YlnSi~sYF~NdD~Fn~N~AQAk~a 405 (557)
T KOG3785|consen 326 FAALGQETGSREHLKIAQQFFQLVGESALECDTIPGRQSMASYFFLSFQFDDVLTYLNSIESYFTNDDDFNLNLAQAKLA 405 (557)
T ss_pred HHHhhhhcCcHHHHHHHHHHHHHhcccccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchhhhHHHHHHHH
Confidence 3344444444456889999888877532 23455566543 222 2445777778
Q ss_pred cCCHHHHHHHHHHcCCcc
Q psy11108 76 QGRFQTAENYQLQADRRT 93 (106)
Q Consensus 76 ~Gkf~EAe~i~l~ag~~~ 93 (106)
.|++.|||++|++-.-++
T Consensus 406 tgny~eaEelf~~is~~~ 423 (557)
T KOG3785|consen 406 TGNYVEAEELFIRISGPE 423 (557)
T ss_pred hcChHHHHHHHhhhcChh
Confidence 999999999999876665
No 126
>KOG3785|consensus
Probab=32.41 E-value=1.4e+02 Score=25.60 Aligned_cols=62 Identities=11% Similarity=-0.020 Sum_probs=46.2
Q ss_pred HHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcCCcc
Q psy11108 25 SFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQADRRT 93 (106)
Q Consensus 25 ~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag~~~ 93 (106)
-.+|++=+-.++..-||+.|.++.+.+.=.=+-. -...|..+...|+++-|=.+|++.+-+.
T Consensus 397 ~N~AQAk~atgny~eaEelf~~is~~~ikn~~~Y-------~s~LArCyi~nkkP~lAW~~~lk~~t~~ 458 (557)
T KOG3785|consen 397 LNLAQAKLATGNYVEAEELFIRISGPEIKNKILY-------KSMLARCYIRNKKPQLAWDMMLKTNTPS 458 (557)
T ss_pred hHHHHHHHHhcChHHHHHHHhhhcChhhhhhHHH-------HHHHHHHHHhcCCchHHHHHHHhcCCch
Confidence 4688999999999999999999998863211111 1235777788888888888888876664
No 127
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=32.29 E-value=70 Score=22.34 Aligned_cols=25 Identities=16% Similarity=0.133 Sum_probs=20.9
Q ss_pred HHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 65 DIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 65 ~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
.++..||.++-+|++++|...|-++
T Consensus 65 ~qV~lGE~L~~~G~~~~aa~hf~nA 89 (121)
T PF02064_consen 65 QQVQLGEQLLAQGDYEEAAEHFYNA 89 (121)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHH
Confidence 5688999999999999999988774
No 128
>PRK11189 lipoprotein NlpI; Provisional
Probab=32.28 E-value=2.3e+02 Score=21.67 Aligned_cols=58 Identities=12% Similarity=-0.048 Sum_probs=41.4
Q ss_pred HHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCc---HHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 21 AGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPD---DIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 21 ~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~---~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+..|-.+|..-...++.+-|...|.+. ++.-|+ --...|.++...|++++|...|.++
T Consensus 64 a~~~~~~g~~~~~~g~~~~A~~~~~~A-----------l~l~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~A 124 (296)
T PRK11189 64 AQLHYERGVLYDSLGLRALARNDFSQA-----------LALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSV 124 (296)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHH-----------HHcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 455888888777778888777665544 222222 2346799999999999999988775
No 129
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=31.55 E-value=1.9e+02 Score=21.07 Aligned_cols=60 Identities=10% Similarity=-0.069 Sum_probs=42.3
Q ss_pred HHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcCCcc
Q psy11108 26 FLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQADRRT 93 (106)
Q Consensus 26 ~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag~~~ 93 (106)
.+|-.-.+.++++-|++.|.-+--++--... =-.-.|-++--.|+|.+|-..|-++...+
T Consensus 40 ~~A~~ly~~G~l~~A~~~f~~L~~~Dp~~~~--------y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~ 99 (157)
T PRK15363 40 RYAMQLMEVKEFAGAARLFQLLTIYDAWSFD--------YWFRLGECCQAQKHWGEAIYAYGRAAQIK 99 (157)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHhCcccHH--------HHHHHHHHHHHHhhHHHHHHHHHHHHhcC
Confidence 4566678899999999999866433322111 11335888888999999999998887654
No 130
>PLN03218 maturation of RBCL 1; Provisional
Probab=31.13 E-value=1.9e+02 Score=27.05 Aligned_cols=29 Identities=3% Similarity=-0.098 Sum_probs=17.3
Q ss_pred CCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 19 YVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 19 ~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
|.+..|..|...-.+.++++-|.+.|...
T Consensus 612 p~~~tynsLI~ay~k~G~~deAl~lf~eM 640 (1060)
T PLN03218 612 GTPEVYTIAVNSCSQKGDWDFALSIYDDM 640 (1060)
T ss_pred CChHHHHHHHHHHHhcCCHHHHHHHHHHH
Confidence 44556666666666666666666555433
No 131
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=30.72 E-value=2.7e+02 Score=23.89 Aligned_cols=73 Identities=14% Similarity=0.035 Sum_probs=42.6
Q ss_pred CHHHHHHHHHHHHhcccHHHHHHHHHHcCC-------chhhHHHHHhhCC-------------------CcHHHHHH--H
Q psy11108 20 VAGQASFLSGVSQKKERFYHVEQALVNCQD-------YAGLQLIKSLRAL-------------------PDDIMKHA--Y 71 (106)
Q Consensus 20 ~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d-------~~~v~~I~~l~~~-------------------~~~~~~~A--e 71 (106)
+..-|..|...-.+.++.+-|...|....+ +-+..+|+-.-+. ++.....+ .
T Consensus 289 ~~vt~n~li~~y~~~g~~~eA~~lf~~M~~~g~~pd~~t~~~ll~a~~~~g~~~~a~~i~~~m~~~g~~~d~~~~~~Li~ 368 (697)
T PLN03081 289 TTVAWNSMLAGYALHGYSEEALCLYYEMRDSGVSIDQFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVD 368 (697)
T ss_pred ChhHHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHhccchHHHHHHHHHHHHhCCCCCeeehHHHHH
Confidence 455688888888888888888888866522 1111222211111 12222333 3
Q ss_pred HHHHcCCHHHHHHHHHHcCCc
Q psy11108 72 IAQHQGRFQTAENYQLQADRR 92 (106)
Q Consensus 72 i~ay~Gkf~EAe~i~l~ag~~ 92 (106)
.+.-.|++++|.++|.+...+
T Consensus 369 ~y~k~G~~~~A~~vf~~m~~~ 389 (697)
T PLN03081 369 LYSKWGRMEDARNVFDRMPRK 389 (697)
T ss_pred HHHHCCCHHHHHHHHHhCCCC
Confidence 444589999999998765444
No 132
>PRK15331 chaperone protein SicA; Provisional
Probab=30.12 E-value=1.9e+02 Score=21.37 Aligned_cols=66 Identities=11% Similarity=-0.058 Sum_probs=51.5
Q ss_pred cCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 16 LCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 16 ~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
-++.++.-|--||..--...+++-|..+|..+...+ .+-|.+....|.-.+..|+..+|.+-|..+
T Consensus 66 ~d~~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~--------~~dp~p~f~agqC~l~l~~~~~A~~~f~~a 131 (165)
T PRK15331 66 YDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLL--------KNDYRPVFFTGQCQLLMRKAAKARQCFELV 131 (165)
T ss_pred hCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc--------cCCCCccchHHHHHHHhCCHHHHHHHHHHH
Confidence 345557789999988888999999999998664432 134556788899999999999999988653
No 133
>PHA02608 67 prohead core protein; Provisional
Probab=29.98 E-value=56 Score=21.51 Aligned_cols=31 Identities=10% Similarity=0.105 Sum_probs=26.8
Q ss_pred HHHhcccHHHHHHHHHHcCCchhhHHHHHhh
Q psy11108 30 VSQKKERFYHVEQALVNCQDYAGLQLIKSLR 60 (106)
Q Consensus 30 ~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~ 60 (106)
.|.+..+|-.|++.|..+=..+...+|+..+
T Consensus 6 eAIKS~DLV~akK~F~~~Me~rt~~li~e~k 36 (80)
T PHA02608 6 EAIKSGDLVEAKKEFASIMEARTEALIEEEK 36 (80)
T ss_pred HHHhcCcHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5889999999999999988888888887653
No 134
>KOG1126|consensus
Probab=29.29 E-value=91 Score=27.81 Aligned_cols=74 Identities=8% Similarity=-0.051 Sum_probs=49.3
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchh----hHHHHHhhCCC------------------cHHH----HHH
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAG----LQLIKSLRALP------------------DDIM----KHA 70 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~----v~~I~~l~~~~------------------~~~~----~~A 70 (106)
++..|.-|.++|..=-..+|-++|-+||-|.-.++- -..+.-=+.+. .++- =.|
T Consensus 417 ~~~sPesWca~GNcfSLQkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~~rhYnAwYGlG 496 (638)
T KOG1126|consen 417 DPNSPESWCALGNCFSLQKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFDKAMKSFRKALGVDPRHYNAWYGLG 496 (638)
T ss_pred CCCCcHHHHHhcchhhhhhHHHHHHHHHHHhhccCCccchhhhhcCChhhhhHHHHhHHHHHHhhhcCCchhhHHHHhhh
Confidence 445588899999888888999999999998654433 11111111111 0110 136
Q ss_pred HHHHHcCCHHHHHHHHHHcC
Q psy11108 71 YIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 71 ei~ay~Gkf~EAe~i~l~ag 90 (106)
.|+.-++|++.||-.|.++=
T Consensus 497 ~vy~Kqek~e~Ae~~fqkA~ 516 (638)
T KOG1126|consen 497 TVYLKQEKLEFAEFHFQKAV 516 (638)
T ss_pred hheeccchhhHHHHHHHhhh
Confidence 89999999999999887763
No 135
>PF04762 IKI3: IKI3 family; InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=28.29 E-value=48 Score=30.37 Aligned_cols=59 Identities=15% Similarity=0.162 Sum_probs=42.0
Q ss_pred HHhcccHHHHHHHHHHc-CCc-hhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 31 SQKKERFYHVEQALVNC-QDY-AGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 31 AL~~~~l~iAe~Af~rl-~d~-~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
||---||+.|--.=.+. +|+ .||=||++|+.++. ..++=.|=.+.|||+.|-+.+.++|
T Consensus 868 ALG~YDl~Lal~VAq~SQkDPKEYLPfL~~L~~l~~-~~rry~ID~hLkRy~kAL~~L~~~G 928 (928)
T PF04762_consen 868 ALGTYDLELALMVAQQSQKDPKEYLPFLQELQKLPP-LYRRYKIDDHLKRYEKALRHLSACG 928 (928)
T ss_pred HhhhcCHHHHHHHHHHhccChHHHHHHHHHHHhCCh-hheeeeHhhhhCCHHHHHHHHHhhC
Confidence 33333555544333333 343 36789999999975 4488999999999999999998887
No 136
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=28.08 E-value=3.1e+02 Score=21.75 Aligned_cols=30 Identities=10% Similarity=-0.108 Sum_probs=25.6
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
+....-|..+|.....-++.+.|.+.|.+.
T Consensus 66 Ps~~~a~~~~A~~Yq~~Ge~~~A~e~YrkA 95 (250)
T COG3063 66 PSYYLAHLVRAHYYQKLGENDLADESYRKA 95 (250)
T ss_pred cccHHHHHHHHHHHHHcCChhhHHHHHHHH
Confidence 334666999999999999999999999863
No 137
>KOG3081|consensus
Probab=26.27 E-value=67 Score=25.98 Aligned_cols=21 Identities=29% Similarity=0.370 Sum_probs=17.9
Q ss_pred HHHHHHHcCCHHHHHHHHHHc
Q psy11108 69 HAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 69 ~Aei~ay~Gkf~EAe~i~l~a 89 (106)
+|-+.+.+|||+||+.++..+
T Consensus 213 ~Av~~l~~~~~eeAe~lL~ea 233 (299)
T KOG3081|consen 213 QAVCHLQLGRYEEAESLLEEA 233 (299)
T ss_pred HHHHHHHhcCHHHHHHHHHHH
Confidence 467788999999999998764
No 138
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=25.56 E-value=3.9e+02 Score=23.14 Aligned_cols=31 Identities=6% Similarity=-0.030 Sum_probs=20.3
Q ss_pred CCCCHHHHHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 17 CPYVAGQASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 17 ~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
.+.++-+|...+..-++....+-|...|.+.
T Consensus 336 ~P~N~~~~~~~~~i~~~~nk~~~A~e~~~ka 366 (484)
T COG4783 336 QPDNPYYLELAGDILLEANKAKEAIERLKKA 366 (484)
T ss_pred CCCCHHHHHHHHHHHHHcCChHHHHHHHHHH
Confidence 4455667777777777777766666666544
No 139
>KOG2404|consensus
Probab=25.43 E-value=51 Score=27.82 Aligned_cols=24 Identities=13% Similarity=0.026 Sum_probs=19.5
Q ss_pred hhhcCCCCHHHHHHHHHHHHhccc
Q psy11108 13 SFVLCPYVAGQASFLSGVSQKKER 36 (106)
Q Consensus 13 ~f~~~~~~~~lW~~LA~~AL~~~~ 36 (106)
-||.-++-..+|+.||.+||..+-
T Consensus 258 gfidpndr~~~wKfLAAEalRG~G 281 (477)
T KOG2404|consen 258 GFIDPNDRTALWKFLAAEALRGLG 281 (477)
T ss_pred CccCCCCchhHHHHHHHHHhccCc
Confidence 367777778899999999998753
No 140
>KOG0292|consensus
Probab=24.61 E-value=1.5e+02 Score=27.97 Aligned_cols=63 Identities=16% Similarity=-0.033 Sum_probs=45.4
Q ss_pred hhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 11 RHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 11 ~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
|--||++... ==..||+.+|+++|-.+=.+++|.+. --+.++.++.+|+.+=||-.|.+-.
T Consensus 639 AL~FVkD~~t------RF~LaLe~gnle~ale~akkldd~d~-------------w~rLge~Al~qgn~~IaEm~yQ~~k 699 (1202)
T KOG0292|consen 639 ALHFVKDERT------RFELALECGNLEVALEAAKKLDDKDV-------------WERLGEEALRQGNHQIAEMCYQRTK 699 (1202)
T ss_pred eeeeecCcch------heeeehhcCCHHHHHHHHHhcCcHHH-------------HHHHHHHHHHhcchHHHHHHHHHhh
Confidence 4457777422 12458999999999777666665432 2467999999999999999998854
Q ss_pred Cc
Q psy11108 91 RR 92 (106)
Q Consensus 91 ~~ 92 (106)
..
T Consensus 700 nf 701 (1202)
T KOG0292|consen 700 NF 701 (1202)
T ss_pred hh
Confidence 33
No 141
>KOG2066|consensus
Probab=24.40 E-value=2.2e+02 Score=26.24 Aligned_cols=53 Identities=17% Similarity=0.022 Sum_probs=39.5
Q ss_pred HHhhhhhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhC
Q psy11108 7 VIASRHSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRA 61 (106)
Q Consensus 7 ~~~~~~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~ 61 (106)
-|..|.+|+|....+.||..|-..++++=++-.|--- -..+.+-+..|.+|..
T Consensus 676 die~AIefvKeq~D~eLWe~LI~~~ldkPe~~~~ll~--i~~~~dpl~ii~kip~ 728 (846)
T KOG2066|consen 676 DIEKAIEFVKEQDDSELWEDLINYSLDKPEFIKALLN--IGEHEDPLLIIRKIPD 728 (846)
T ss_pred CHHHHHHHHHhcCCHHHHHHHHHHhhcCcHHHHHHHH--hhhcccHHHHHhcCCC
Confidence 4678999999999999999999999999888766511 1223455566666653
No 142
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=23.94 E-value=1.3e+02 Score=20.04 Aligned_cols=27 Identities=11% Similarity=0.101 Sum_probs=14.2
Q ss_pred HHHHHcCCHHHHHHHHHHc-CCcccccc
Q psy11108 71 YIAQHQGRFQTAENYQLQA-DRRTWKSK 97 (106)
Q Consensus 71 ei~ay~Gkf~EAe~i~l~a-g~~~~Aie 97 (106)
.++.-.|.+++|-+++++. +.++.|++
T Consensus 90 ~l~~k~~~~~~Al~~~l~~~~d~~~a~~ 117 (140)
T smart00299 90 ELYKKDGNFKDAIVTLIEHLGNYEKAIE 117 (140)
T ss_pred HHHHhhcCHHHHHHHHHHcccCHHHHHH
Confidence 4444455556666665555 44554444
No 143
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=23.93 E-value=81 Score=24.67 Aligned_cols=24 Identities=25% Similarity=0.176 Sum_probs=21.7
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHc
Q psy11108 66 IMKHAYIAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 66 ~~~~Aei~ay~Gkf~EAe~i~l~a 89 (106)
..+.|+-++-.|++.||+..|.++
T Consensus 92 r~rLa~al~elGr~~EA~~hy~qa 115 (251)
T COG4700 92 RYRLANALAELGRYHEAVPHYQQA 115 (251)
T ss_pred HHHHHHHHHHhhhhhhhHHHHHHH
Confidence 467899999999999999999995
No 144
>PLN03218 maturation of RBCL 1; Provisional
Probab=23.88 E-value=3.9e+02 Score=25.12 Aligned_cols=14 Identities=7% Similarity=0.017 Sum_probs=7.5
Q ss_pred HcCCHHHHHHHHHH
Q psy11108 75 HQGRFQTAENYQLQ 88 (106)
Q Consensus 75 y~Gkf~EAe~i~l~ 88 (106)
..|++++|.++|.+
T Consensus 731 k~G~~eeAlelf~e 744 (1060)
T PLN03218 731 EGNQLPKALEVLSE 744 (1060)
T ss_pred HCCCHHHHHHHHHH
Confidence 35555555555553
No 145
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=23.87 E-value=84 Score=21.52 Aligned_cols=21 Identities=19% Similarity=0.313 Sum_probs=17.6
Q ss_pred HHHHHHHHcCCHHHHHHHHHH
Q psy11108 68 KHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 68 ~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
.-|.++--.|+|++|.++|..
T Consensus 104 ~wA~~le~~~~~~~A~~I~~~ 124 (126)
T PF08311_consen 104 EWAEFLEKRGNFKKADEIYQL 124 (126)
T ss_dssp HHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHHHHHh
Confidence 468999999999999999975
No 146
>KOG3617|consensus
Probab=22.48 E-value=84 Score=29.69 Aligned_cols=86 Identities=13% Similarity=0.103 Sum_probs=56.1
Q ss_pred hhhhcCCCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHH-------------HHHHHHHHcCC
Q psy11108 12 HSFVLCPYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAGLQLIKSLRALPDDIM-------------KHAYIAQHQGR 78 (106)
Q Consensus 12 ~~f~~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~-------------~~Aei~ay~Gk 78 (106)
..||.....++||+--|..-=..++++.|-.-|...+|+=.+--|+=+.-..++.- ..|..+=-.|+
T Consensus 903 e~Yv~~~~d~~L~~WWgqYlES~GemdaAl~~Y~~A~D~fs~VrI~C~qGk~~kAa~iA~esgd~AAcYhlaR~YEn~g~ 982 (1416)
T KOG3617|consen 903 EQYVRRKRDESLYSWWGQYLESVGEMDAALSFYSSAKDYFSMVRIKCIQGKTDKAARIAEESGDKAACYHLARMYENDGD 982 (1416)
T ss_pred HHHHHhccchHHHHHHHHHHhcccchHHHHHHHHHhhhhhhheeeEeeccCchHHHHHHHhcccHHHHHHHHHHhhhhHH
Confidence 35777777899999999988889999999999999888633222222221111111 23455556777
Q ss_pred HHHHHHHHHHcCCcccccc
Q psy11108 79 FQTAENYQLQADRRTWKSK 97 (106)
Q Consensus 79 f~EAe~i~l~ag~~~~Aie 97 (106)
+.+|-..|.++--..-||.
T Consensus 983 v~~Av~FfTrAqafsnAIR 1001 (1416)
T KOG3617|consen 983 VVKAVKFFTRAQAFSNAIR 1001 (1416)
T ss_pred HHHHHHHHHHHHHHHHHHH
Confidence 8888887777655444443
No 147
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=21.98 E-value=1.1e+02 Score=14.86 Aligned_cols=24 Identities=13% Similarity=-0.143 Sum_probs=12.2
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHc
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNC 47 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl 47 (106)
|..+-....+..+++-|...|...
T Consensus 4 y~~ll~a~~~~g~~~~a~~~~~~M 27 (34)
T PF13812_consen 4 YNALLRACAKAGDPDAALQLFDEM 27 (34)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHH
Confidence 445555555555555555544443
No 148
>PLN03077 Protein ECB2; Provisional
Probab=21.88 E-value=4e+02 Score=23.54 Aligned_cols=29 Identities=7% Similarity=-0.016 Sum_probs=18.5
Q ss_pred CHHHHHHHHHHHHhcccHHHHHHHHHHcC
Q psy11108 20 VAGQASFLSGVSQKKERFYHVEQALVNCQ 48 (106)
Q Consensus 20 ~~~lW~~LA~~AL~~~~l~iAe~Af~rl~ 48 (106)
....|..|...-.+.++++-|++.|..+.
T Consensus 322 d~~~~n~Li~~y~k~g~~~~A~~vf~~m~ 350 (857)
T PLN03077 322 DVSVCNSLIQMYLSLGSWGEAEKVFSRME 350 (857)
T ss_pred chHHHHHHHHHHHhcCCHHHHHHHHhhCC
Confidence 34556666666666677777766666554
No 149
>TIGR03276 Phn-HD phosphonate degradation operons associated HDIG domain protein. This small clade of proteins are found adjacent to other genes implicated in the catabolism of phosphonates. They are members of the TIGR00277 domain family and contain a series of five invariant histidines (the domain in general has only four).
Probab=21.73 E-value=42 Score=25.03 Aligned_cols=32 Identities=9% Similarity=0.043 Sum_probs=22.9
Q ss_pred HHHHcCC--HHHHHHHHHHcCCccccccCCCccc
Q psy11108 72 IAQHQGR--FQTAENYQLQADRRTWKSKPSYWTR 103 (106)
Q Consensus 72 i~ay~Gk--f~EAe~i~l~ag~~~~Aiemr~W~r 103 (106)
.+.+||- -.+...-|.+....+.++.||+||.
T Consensus 123 sL~~QGG~~~~~e~~~f~~~p~~~dav~lR~wDd 156 (179)
T TIGR03276 123 SLELQGGPFTAAEADAFERDPHAADAIRLRRWDD 156 (179)
T ss_pred HHHHcCCCCCHHHHHHHHhCccHHHHHHHHHcch
Confidence 3557773 3333445888888899999999985
No 150
>KOG1155|consensus
Probab=21.62 E-value=4.7e+02 Score=22.99 Aligned_cols=72 Identities=10% Similarity=0.008 Sum_probs=54.4
Q ss_pred CCCHHHHHHHHHHHHhcccHHHHHHHHHHcCCchh----------------------hHHHHHhhCCCc--HH--HHHHH
Q psy11108 18 PYVAGQASFLSGVSQKKERFYHVEQALVNCQDYAG----------------------LQLIKSLRALPD--DI--MKHAY 71 (106)
Q Consensus 18 ~~~~~lW~~LA~~AL~~~~l~iAe~Af~rl~d~~~----------------------v~~I~~l~~~~~--~~--~~~Ae 71 (106)
+.....|...|.+=++=.|-..|-.+|++.=|+.- |-|+++-....+ +- +-.|+
T Consensus 361 p~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYeim~Mh~YaLyYfqkA~~~kPnDsRlw~aLG~ 440 (559)
T KOG1155|consen 361 PKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEIMKMHFYALYYFQKALELKPNDSRLWVALGE 440 (559)
T ss_pred cchhHHHHHhhHHHHHhcccHHHHHHHHHHHhcCchhHHHHhhhhHHHHHhcchHHHHHHHHHHHhcCCCchHHHHHHHH
Confidence 34466799999999999999999999998754432 344555444422 22 35689
Q ss_pred HHHHcCCHHHHHHHHHHc
Q psy11108 72 IAQHQGRFQTAENYQLQA 89 (106)
Q Consensus 72 i~ay~Gkf~EAe~i~l~a 89 (106)
.+.-.++.+||.+.|.++
T Consensus 441 CY~kl~~~~eAiKCykra 458 (559)
T KOG1155|consen 441 CYEKLNRLEEAIKCYKRA 458 (559)
T ss_pred HHHHhccHHHHHHHHHHH
Confidence 999999999999999885
No 151
>COG1447 CelC Phosphotransferase system cellobiose-specific component IIA [Carbohydrate transport and metabolism]
Probab=21.54 E-value=1.2e+02 Score=20.90 Aligned_cols=25 Identities=28% Similarity=0.333 Sum_probs=20.8
Q ss_pred HHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 66 IMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 66 ~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
....|--++-.|+|++|+....+++
T Consensus 22 ~~~eAl~~Ak~g~f~~A~~~i~eA~ 46 (105)
T COG1447 22 KAYEALKAAKEGDFEEAEELIQEAN 46 (105)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 4567788889999999999988765
No 152
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=21.19 E-value=5.5e+02 Score=22.27 Aligned_cols=31 Identities=3% Similarity=-0.302 Sum_probs=23.4
Q ss_pred cCCCCHHHHHHHHHHHHhcccHHHHHHHHHH
Q psy11108 16 LCPYVAGQASFLSGVSQKKERFYHVEQALVN 46 (106)
Q Consensus 16 ~~~~~~~lW~~LA~~AL~~~~l~iAe~Af~r 46 (106)
+++.-+..|--.|..-+..++++-|++.+..
T Consensus 301 ~~~~~~aa~YG~A~~~~~~~~~d~A~~~l~~ 331 (484)
T COG4783 301 SKRGGLAAQYGRALQTYLAGQYDEALKLLQP 331 (484)
T ss_pred hCccchHHHHHHHHHHHHhcccchHHHHHHH
Confidence 3334466677788888888889988888776
No 153
>KOG0495|consensus
Probab=20.87 E-value=4.8e+02 Score=24.13 Aligned_cols=22 Identities=18% Similarity=0.125 Sum_probs=14.8
Q ss_pred HHHHHHHHHcCCHHHHHHHHHH
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
++.+..--|+|+++||-+++-+
T Consensus 655 mKs~~~er~ld~~eeA~rllEe 676 (913)
T KOG0495|consen 655 MKSANLERYLDNVEEALRLLEE 676 (913)
T ss_pred HHHhHHHHHhhhHHHHHHHHHH
Confidence 3456667777888887776643
No 154
>PF14346 DUF4398: Domain of unknown function (DUF4398)
Probab=20.53 E-value=1.4e+02 Score=19.38 Aligned_cols=56 Identities=18% Similarity=0.169 Sum_probs=35.0
Q ss_pred ccHHHHHHHHHHcCCchhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHHcC
Q psy11108 35 ERFYHVEQALVNCQDYAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQAD 90 (106)
Q Consensus 35 ~~l~iAe~Af~rl~d~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ag 90 (106)
.++.-|+.++.+.+...-.++=-..-..=...+..|+.+.-.|+|.+|..+..++.
T Consensus 17 ~~l~~A~~ai~~A~~~~a~~~Ap~el~~A~~~L~~A~~a~~~~~y~~A~~~A~~A~ 72 (103)
T PF14346_consen 17 EELSDAEAAIQRAEAAGAEQYAPVELKEAREKLQRAKAALDDGDYERARRLAEQAQ 72 (103)
T ss_pred HHHHHHHHHHHHHHHcCCccccHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 45677777777665553222110000111246788999999999999998887753
No 155
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=20.44 E-value=65 Score=20.68 Aligned_cols=15 Identities=20% Similarity=0.220 Sum_probs=12.8
Q ss_pred HcCCHHHHHHHHHHc
Q psy11108 75 HQGRFQTAENYQLQA 89 (106)
Q Consensus 75 y~Gkf~EAe~i~l~a 89 (106)
-.|+++||-++|.++
T Consensus 18 ~~gny~eA~~lY~~a 32 (75)
T cd02680 18 EKGNAEEAIELYTEA 32 (75)
T ss_pred HhhhHHHHHHHHHHH
Confidence 568999999999875
No 156
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=20.41 E-value=89 Score=25.06 Aligned_cols=26 Identities=19% Similarity=-0.169 Sum_probs=23.7
Q ss_pred HHHHHHHHHcCCHHHHHHHHHHcCCc
Q psy11108 67 MKHAYIAQHQGRFQTAENYQLQADRR 92 (106)
Q Consensus 67 ~~~Aei~ay~Gkf~EAe~i~l~ag~~ 92 (106)
...|++++.+|++.+|..-|-++.|.
T Consensus 160 ~~Lg~~ym~~~~~~~A~~AY~~A~rL 185 (287)
T COG4235 160 DLLGRAYMALGRASDALLAYRNALRL 185 (287)
T ss_pred HHHHHHHHHhcchhHHHHHHHHHHHh
Confidence 57899999999999999999998766
No 157
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=20.40 E-value=3.4e+02 Score=19.53 Aligned_cols=59 Identities=14% Similarity=0.106 Sum_probs=39.3
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHcCC-chhhHHHHHhhCCCcHHHHHHHHHHHcCCHHHHHHHHHH
Q psy11108 24 ASFLSGVSQKKERFYHVEQALVNCQD-YAGLQLIKSLRALPDDIMKHAYIAQHQGRFQTAENYQLQ 88 (106)
Q Consensus 24 W~~LA~~AL~~~~l~iAe~Af~rl~d-~~~v~~I~~l~~~~~~~~~~Aei~ay~Gkf~EAe~i~l~ 88 (106)
.=..|..++..+++.-|...|..+.+ ++.=.+. ++..+..|...-..|++++|...|.+
T Consensus 8 lY~~a~~~~~~g~y~~Ai~~f~~l~~~~P~s~~a------~~A~l~la~a~y~~~~y~~A~~~~~~ 67 (203)
T PF13525_consen 8 LYQKALEALQQGDYEEAIKLFEKLIDRYPNSPYA------PQAQLMLAYAYYKQGDYEEAIAAYER 67 (203)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHHHHH-TTSTTH------HHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCChHH------HHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 34568889999999999988887632 1111111 12346778888899999999988765
No 158
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=20.07 E-value=1.4e+02 Score=26.62 Aligned_cols=52 Identities=15% Similarity=0.151 Sum_probs=38.0
Q ss_pred HcCCchhhHHHHHhhCCC---cHHHHHHHHHHHcCCHHHHHHHHHHcCCcccccc
Q psy11108 46 NCQDYAGLQLIKSLRALP---DDIMKHAYIAQHQGRFQTAENYQLQADRRTWKSK 97 (106)
Q Consensus 46 rl~d~~~v~~I~~l~~~~---~~~~~~Aei~ay~Gkf~EAe~i~l~ag~~~~Aie 97 (106)
.-++-.+++|+.+==..+ ...+++|.+++-..||..|+..|+=+|...-|++
T Consensus 432 ~~~~~k~~~Fl~ndF~~~rwr~AAlKNAyaLlsk~Ry~~AAaFFLLag~l~dAv~ 486 (631)
T PF12234_consen 432 HKEQQKMAKFLSNDFTEPRWRTAALKNAYALLSKHRYEYAAAFFLLAGSLKDAVN 486 (631)
T ss_pred ccccHHHHHHHhhcCCChHHHHHHHHhHHHHHhcccHHHHHHHHHhcccHHHHHH
Confidence 445556666655321111 1457899999999999999999999999887765
Done!