Query psy11102
Match_columns 87
No_of_seqs 93 out of 95
Neff 6.2
Searched_HMMs 46136
Date Fri Aug 16 18:02:17 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy11102.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/11102hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2041|consensus 99.9 1.3E-27 2.9E-32 194.0 8.8 83 3-85 810-892 (1189)
2 KOG3617|consensus 98.9 2.4E-09 5.2E-14 89.7 7.4 84 3-86 1036-1121(1416)
3 KOG1538|consensus 98.3 1.1E-06 2.4E-11 72.7 5.8 83 3-85 704-787 (1081)
4 KOG1920|consensus 97.9 6.4E-05 1.4E-09 64.9 9.0 81 3-83 953-1038(1265)
5 KOG1538|consensus 97.4 0.00054 1.2E-08 57.3 7.2 83 2-84 645-729 (1081)
6 KOG3616|consensus 97.4 0.00052 1.1E-08 58.3 6.8 84 2-85 661-746 (1636)
7 KOG3616|consensus 97.4 0.00071 1.5E-08 57.5 7.5 83 3-85 720-805 (1636)
8 PF00637 Clathrin: Region in C 97.3 6.2E-05 1.3E-09 49.2 0.1 65 17-83 44-108 (143)
9 PF12895 Apc3: Anaphase-promot 97.3 0.00073 1.6E-08 40.6 4.6 69 2-70 2-83 (84)
10 KOG3617|consensus 96.9 0.0061 1.3E-07 52.3 8.4 80 2-83 980-1060(1416)
11 PF04840 Vps16_C: Vps16, C-ter 96.9 0.0069 1.5E-07 45.9 7.9 64 21-85 214-289 (319)
12 PF09976 TPR_21: Tetratricopep 96.6 0.017 3.8E-07 37.9 7.4 70 2-71 61-144 (145)
13 KOG0276|consensus 96.5 0.018 3.9E-07 47.8 8.3 81 3-84 667-747 (794)
14 KOG2041|consensus 96.5 0.01 2.2E-07 50.3 6.8 83 3-85 748-836 (1189)
15 PF04053 Coatomer_WDAD: Coatom 96.4 0.046 1E-06 43.2 9.8 82 3-85 348-429 (443)
16 TIGR00990 3a0801s09 mitochondr 96.2 0.057 1.2E-06 43.1 9.4 69 2-70 412-492 (615)
17 PF14938 SNAP: Soluble NSF att 96.1 0.0092 2E-07 43.6 4.1 14 4-17 30-43 (282)
18 PF04053 Coatomer_WDAD: Coatom 96.0 0.058 1.3E-06 42.6 8.6 83 2-86 360-443 (443)
19 PF13429 TPR_15: Tetratricopep 95.9 0.013 2.7E-07 42.1 4.2 70 2-71 90-172 (280)
20 TIGR00990 3a0801s09 mitochondr 95.8 0.11 2.5E-06 41.5 9.6 70 2-71 344-425 (615)
21 PLN03088 SGT1, suppressor of 95.6 0.068 1.5E-06 40.6 7.3 70 2-71 15-96 (356)
22 PLN03077 Protein ECB2; Provisi 95.4 0.27 5.9E-06 40.8 10.4 46 19-64 428-474 (857)
23 PF13432 TPR_16: Tetratricopep 95.3 0.043 9.4E-07 30.9 4.2 53 10-71 5-57 (65)
24 TIGR02552 LcrH_SycD type III s 95.3 0.15 3.2E-06 32.2 7.1 69 2-70 30-110 (135)
25 PF13414 TPR_11: TPR repeat; P 95.2 0.041 9E-07 31.2 3.9 59 3-70 4-63 (69)
26 PF13371 TPR_9: Tetratricopept 95.2 0.067 1.5E-06 30.6 4.8 49 23-71 3-55 (73)
27 smart00299 CLH Clathrin heavy 95.2 0.28 6.1E-06 31.7 8.2 66 17-84 43-108 (140)
28 PRK15359 type III secretion sy 95.2 0.16 3.5E-06 33.6 7.1 50 21-70 64-117 (144)
29 TIGR02521 type_IV_pilW type IV 95.1 0.22 4.8E-06 32.6 7.7 69 3-71 113-195 (234)
30 PF14559 TPR_19: Tetratricopep 95.0 0.077 1.7E-06 29.9 4.6 42 2-43 4-53 (68)
31 cd00189 TPR Tetratricopeptide 95.0 0.2 4.4E-06 27.3 6.2 68 3-70 14-93 (100)
32 TIGR02917 PEP_TPR_lipo putativ 94.8 0.42 9.2E-06 37.6 9.7 68 3-70 479-558 (899)
33 TIGR02917 PEP_TPR_lipo putativ 94.8 0.18 3.9E-06 39.6 7.6 70 2-71 716-796 (899)
34 PRK11788 tetratricopeptide rep 94.8 0.27 5.8E-06 36.2 8.1 70 2-71 227-308 (389)
35 TIGR02795 tol_pal_ybgF tol-pal 94.8 0.23 4.9E-06 30.1 6.5 69 2-70 15-101 (119)
36 PRK11447 cellulose synthase su 94.7 0.42 9.1E-06 41.3 10.0 67 3-69 543-627 (1157)
37 smart00299 CLH Clathrin heavy 94.6 0.2 4.3E-06 32.4 6.2 51 3-54 83-134 (140)
38 PF04840 Vps16_C: Vps16, C-ter 94.3 0.21 4.6E-06 37.8 6.8 50 6-55 266-317 (319)
39 TIGR02521 type_IV_pilW type IV 94.3 0.3 6.6E-06 31.9 6.9 66 3-68 79-158 (234)
40 PF09295 ChAPs: ChAPs (Chs5p-A 93.8 0.81 1.7E-05 35.9 9.2 79 2-80 182-290 (395)
41 KOG0553|consensus 93.8 0.27 5.8E-06 37.6 6.3 71 2-72 94-176 (304)
42 cd00189 TPR Tetratricopeptide 93.4 0.53 1.1E-05 25.5 5.8 54 9-71 7-60 (100)
43 PRK11788 tetratricopeptide rep 93.2 0.54 1.2E-05 34.5 7.1 48 20-67 112-163 (389)
44 PF14938 SNAP: Soluble NSF att 93.0 0.14 3.1E-06 37.3 3.7 22 5-26 97-118 (282)
45 KOG1920|consensus 92.7 0.32 7E-06 42.9 5.9 84 3-86 894-993 (1265)
46 PF13429 TPR_15: Tetratricopep 92.7 0.27 5.8E-06 35.2 4.8 69 3-71 160-240 (280)
47 KOG2280|consensus 92.5 0.43 9.4E-06 40.5 6.2 64 22-85 722-797 (829)
48 KOG0548|consensus 92.2 0.57 1.2E-05 38.2 6.4 85 3-87 312-424 (539)
49 PF13424 TPR_12: Tetratricopep 92.1 0.092 2E-06 30.5 1.5 60 9-74 12-75 (78)
50 PLN03081 pentatricopeptide (PP 91.8 2.2 4.7E-05 34.7 9.5 41 3-43 440-490 (697)
51 PLN03081 pentatricopeptide (PP 91.7 1.4 3E-05 35.8 8.2 53 19-71 263-316 (697)
52 PRK10803 tol-pal system protei 91.7 1.3 2.9E-05 32.6 7.5 56 4-68 145-203 (263)
53 PRK02603 photosystem I assembl 91.7 1.5 3.2E-05 29.3 7.2 58 3-60 49-121 (172)
54 PF12688 TPR_5: Tetratrico pep 91.7 1.9 4.1E-05 28.3 7.5 69 2-70 14-100 (120)
55 KOG2003|consensus 91.6 0.85 1.8E-05 37.6 6.8 28 1-28 502-550 (840)
56 TIGR02552 LcrH_SycD type III s 91.6 0.73 1.6E-05 28.9 5.4 52 11-71 26-77 (135)
57 PRK10370 formate-dependent nit 91.6 1.5 3.2E-05 30.6 7.3 69 3-71 87-170 (198)
58 PRK15174 Vi polysaccharide exp 91.4 1.8 4E-05 35.4 8.7 25 42-66 281-305 (656)
59 PLN03077 Protein ECB2; Provisi 91.4 2.4 5.1E-05 35.3 9.4 53 18-70 326-379 (857)
60 PRK15174 Vi polysaccharide exp 91.3 2.4 5.2E-05 34.7 9.2 66 3-68 90-167 (656)
61 PF13176 TPR_7: Tetratricopept 90.9 0.5 1.1E-05 24.2 3.4 23 48-70 2-24 (36)
62 KOG2076|consensus 90.9 0.98 2.1E-05 38.9 6.8 48 37-84 198-267 (895)
63 KOG4626|consensus 90.7 0.54 1.2E-05 39.8 5.0 54 9-71 395-448 (966)
64 PRK10049 pgaA outer membrane p 90.7 1.8 3.9E-05 35.9 8.1 68 2-70 62-141 (765)
65 PF14559 TPR_19: Tetratricopep 90.6 0.73 1.6E-05 25.7 4.2 49 14-71 3-51 (68)
66 PLN03218 maturation of RBCL 1; 90.3 3.9 8.4E-05 35.8 10.0 48 21-68 655-707 (1060)
67 KOG3060|consensus 90.3 0.95 2.1E-05 34.3 5.6 69 2-70 99-179 (289)
68 PLN03218 maturation of RBCL 1; 90.3 2.6 5.6E-05 36.9 8.9 82 3-84 451-568 (1060)
69 KOG1586|consensus 90.2 0.53 1.1E-05 35.4 4.1 10 8-17 60-69 (288)
70 PRK11447 cellulose synthase su 89.9 4 8.7E-05 35.4 9.8 65 2-66 474-550 (1157)
71 PRK11189 lipoprotein NlpI; Pro 89.9 2.6 5.6E-05 30.9 7.6 49 22-70 105-157 (296)
72 PF03704 BTAD: Bacterial trans 89.9 1.5 3.3E-05 28.3 5.8 54 18-71 65-122 (146)
73 TIGR03302 OM_YfiO outer membra 89.8 4.1 8.8E-05 28.0 8.3 20 21-40 76-95 (235)
74 PRK12370 invasion protein regu 89.7 2.1 4.6E-05 34.1 7.5 70 2-71 385-467 (553)
75 PF00637 Clathrin: Region in C 88.8 0.021 4.5E-07 37.1 -3.8 54 3-57 84-137 (143)
76 PLN03088 SGT1, suppressor of 88.7 3.1 6.6E-05 31.6 7.4 52 10-70 10-61 (356)
77 KOG1585|consensus 88.7 2.1 4.5E-05 32.5 6.4 79 5-83 34-135 (308)
78 PRK15359 type III secretion sy 88.4 3 6.6E-05 27.4 6.5 64 19-82 28-116 (144)
79 KOG1586|consensus 88.2 2.6 5.7E-05 31.8 6.6 80 2-82 27-117 (288)
80 PRK09782 bacteriophage N4 rece 87.8 2.8 6.1E-05 36.4 7.5 68 3-70 590-668 (987)
81 PF09145 Ubiq-assoc: Ubiquitin 87.8 0.41 8.9E-06 26.9 1.6 23 4-26 19-41 (46)
82 PRK02603 photosystem I assembl 87.5 2.9 6.4E-05 27.9 6.1 53 10-71 43-98 (172)
83 PF11817 Foie-gras_1: Foie gra 87.4 2.7 5.9E-05 30.3 6.2 70 6-84 162-244 (247)
84 PRK10370 formate-dependent nit 87.1 4.2 9.1E-05 28.3 6.9 31 41-71 69-99 (198)
85 PF00515 TPR_1: Tetratricopept 87.0 1.2 2.6E-05 21.8 3.1 25 47-71 3-27 (34)
86 PRK10049 pgaA outer membrane p 85.5 5.9 0.00013 32.9 8.0 69 2-70 28-108 (765)
87 PF07719 TPR_2: Tetratricopept 85.5 1.6 3.5E-05 21.0 3.1 25 47-71 3-27 (34)
88 PF04097 Nic96: Nup93/Nic96; 85.5 5.5 0.00012 32.7 7.7 81 3-83 341-452 (613)
89 cd05804 StaR_like StaR_like; a 85.2 9.1 0.0002 27.7 8.1 71 2-72 127-213 (355)
90 TIGR02795 tol_pal_ybgF tol-pal 85.2 6 0.00013 23.6 7.2 55 8-71 8-65 (119)
91 PRK10747 putative protoheme IX 84.8 7.1 0.00015 29.8 7.7 68 2-69 200-287 (398)
92 PRK15363 pathogenicity island 84.7 2.6 5.7E-05 29.2 4.8 55 12-81 79-133 (157)
93 PRK12370 invasion protein regu 84.5 4.8 0.0001 32.1 6.8 22 47-68 374-395 (553)
94 KOG1155|consensus 84.3 3.5 7.5E-05 33.8 5.9 73 2-74 343-464 (559)
95 PF12931 Sec16_C: Sec23-bindin 83.8 4.3 9.3E-05 30.0 5.9 63 12-74 69-145 (284)
96 KOG0547|consensus 83.6 2.6 5.7E-05 34.7 4.9 73 2-74 475-566 (606)
97 KOG1125|consensus 83.1 1.6 3.5E-05 35.9 3.7 52 11-71 439-490 (579)
98 KOG0543|consensus 82.6 7.2 0.00016 30.9 6.9 70 3-72 222-318 (397)
99 TIGR00540 hemY_coli hemY prote 82.4 7.9 0.00017 29.5 7.0 69 2-70 131-212 (409)
100 PF12854 PPR_1: PPR repeat 82.3 2.1 4.7E-05 21.7 2.8 23 62-84 11-33 (34)
101 PF13181 TPR_8: Tetratricopept 81.7 3.1 6.8E-05 20.1 3.2 25 47-71 3-27 (34)
102 PF14555 UBA_4: UBA-like domai 80.8 3.8 8.2E-05 22.0 3.6 27 4-30 14-41 (43)
103 PRK10747 putative protoheme IX 80.6 7.5 0.00016 29.7 6.3 67 2-68 131-210 (398)
104 PRK14574 hmsH outer membrane p 80.3 10 0.00023 32.4 7.6 69 3-71 82-162 (822)
105 PRK15363 pathogenicity island 80.2 4.7 0.0001 28.0 4.7 48 25-72 45-96 (157)
106 PF10602 RPN7: 26S proteasome 79.7 8.1 0.00018 26.7 5.8 24 5-28 39-62 (177)
107 PF13812 PPR_3: Pentatricopept 79.0 3.5 7.5E-05 19.8 2.8 24 61-84 4-27 (34)
108 KOG0276|consensus 78.1 12 0.00026 31.7 7.1 76 6-82 618-703 (794)
109 KOG4162|consensus 78.0 21 0.00045 30.7 8.5 64 3-66 651-739 (799)
110 TIGR00756 PPR pentatricopeptid 77.7 2.9 6.2E-05 19.8 2.3 21 64-84 6-26 (35)
111 PF01535 PPR: PPR repeat; Int 77.1 3.5 7.6E-05 19.2 2.5 24 20-43 5-28 (31)
112 KOG2280|consensus 77.1 8.1 0.00018 33.1 5.9 51 6-56 774-824 (829)
113 TIGR03302 OM_YfiO outer membra 76.5 6.4 0.00014 27.0 4.5 52 21-72 172-230 (235)
114 PRK15179 Vi polysaccharide bio 76.1 8 0.00017 32.4 5.7 52 20-71 159-214 (694)
115 PRK11189 lipoprotein NlpI; Pro 75.6 20 0.00043 26.2 7.2 30 42-71 95-124 (296)
116 TIGR00540 hemY_coli hemY prote 75.6 28 0.00061 26.5 8.2 70 2-71 200-289 (409)
117 CHL00033 ycf3 photosystem I as 75.4 20 0.00044 23.5 7.2 54 3-56 49-117 (168)
118 PF12234 Rav1p_C: RAVE protein 75.3 32 0.00069 28.9 8.8 81 3-84 404-491 (631)
119 PF09205 DUF1955: Domain of un 75.2 9.6 0.00021 26.5 5.0 50 21-70 92-145 (161)
120 PF13374 TPR_10: Tetratricopep 74.3 6.4 0.00014 19.4 3.2 25 48-72 5-29 (42)
121 KOG2005|consensus 74.2 18 0.00039 31.1 7.2 75 8-82 184-276 (878)
122 PF02259 FAT: FAT domain; Int 73.9 8 0.00017 27.9 4.6 26 21-46 4-29 (352)
123 PRK15179 Vi polysaccharide bio 73.9 9.9 0.00021 31.9 5.7 70 3-72 100-181 (694)
124 PF13428 TPR_14: Tetratricopep 73.2 6.7 0.00015 20.5 3.2 24 47-70 3-26 (44)
125 PF13414 TPR_11: TPR repeat; P 72.4 4.6 9.9E-05 22.5 2.5 25 46-70 4-28 (69)
126 PF07035 Mic1: Colon cancer-as 72.4 30 0.00065 24.1 7.9 66 18-83 32-114 (167)
127 CHL00033 ycf3 photosystem I as 72.3 23 0.0005 23.3 6.3 50 22-71 42-98 (168)
128 KOG0686|consensus 71.8 6.2 0.00013 31.8 3.8 39 46-84 151-193 (466)
129 PRK14720 transcript cleavage f 71.7 16 0.00035 31.8 6.6 76 3-79 97-190 (906)
130 PF09205 DUF1955: Domain of un 71.6 16 0.00034 25.5 5.3 69 13-81 67-143 (161)
131 PRK09782 bacteriophage N4 rece 70.9 29 0.00064 30.3 8.0 49 23-71 651-703 (987)
132 KOG0985|consensus 70.8 32 0.00069 31.3 8.1 79 3-83 1147-1245(1666)
133 PRK10803 tol-pal system protei 70.3 22 0.00047 26.2 6.3 69 3-71 157-243 (263)
134 KOG3060|consensus 69.7 18 0.00039 27.5 5.7 48 4-60 156-203 (289)
135 PF12569 NARP1: NMDA receptor- 69.6 17 0.00036 29.5 6.0 52 10-70 12-63 (517)
136 KOG1155|consensus 69.6 13 0.00029 30.5 5.3 60 4-73 434-494 (559)
137 KOG0547|consensus 68.7 21 0.00046 29.6 6.3 39 2-40 128-174 (606)
138 PF14561 TPR_20: Tetratricopep 68.7 12 0.00027 23.1 4.0 42 30-71 7-48 (90)
139 PF10602 RPN7: 26S proteasome 68.4 5.4 0.00012 27.6 2.6 38 45-82 36-77 (177)
140 PF13424 TPR_12: Tetratricopep 68.3 9 0.00019 21.8 3.2 52 20-81 10-69 (78)
141 PF04097 Nic96: Nup93/Nic96; 67.8 20 0.00044 29.4 6.2 36 6-41 418-453 (613)
142 cd05804 StaR_like StaR_like; a 67.5 26 0.00055 25.4 6.1 48 23-70 122-173 (355)
143 KOG0553|consensus 67.4 12 0.00025 28.8 4.4 46 13-67 92-137 (304)
144 COG2956 Predicted N-acetylgluc 67.4 18 0.00039 28.5 5.5 56 11-73 78-135 (389)
145 KOG2003|consensus 67.3 21 0.00045 29.7 6.0 61 2-62 639-711 (840)
146 KOG2168|consensus 66.7 9.2 0.0002 33.0 4.0 38 45-82 622-659 (835)
147 PF12569 NARP1: NMDA receptor- 66.7 37 0.0008 27.6 7.3 37 46-82 229-286 (517)
148 KOG2376|consensus 66.3 16 0.00035 30.7 5.2 53 18-71 83-136 (652)
149 KOG0985|consensus 64.9 32 0.00068 31.3 6.9 75 2-78 1061-1137(1666)
150 PF10366 Vps39_1: Vacuolar sor 64.8 34 0.00074 21.9 7.5 59 21-82 5-63 (108)
151 KOG0495|consensus 64.5 13 0.00028 32.0 4.4 67 4-70 633-710 (913)
152 KOG1840|consensus 64.4 22 0.00048 28.9 5.7 63 8-75 205-271 (508)
153 PF13041 PPR_2: PPR repeat fam 64.4 8.9 0.00019 20.4 2.5 21 64-84 9-29 (50)
154 KOG0307|consensus 63.4 21 0.00045 31.7 5.6 74 9-83 536-626 (1049)
155 KOG1585|consensus 63.2 14 0.00031 28.2 4.1 32 45-76 150-181 (308)
156 PF10516 SHNi-TPR: SHNi-TPR; 63.0 14 0.0003 19.6 3.0 29 47-75 3-31 (38)
157 smart00028 TPR Tetratricopepti 63.0 12 0.00026 15.9 2.6 23 48-70 4-26 (34)
158 PF13431 TPR_17: Tetratricopep 62.4 13 0.00029 18.7 2.8 26 41-66 9-34 (34)
159 KOG1174|consensus 62.2 35 0.00075 28.0 6.3 66 6-71 421-497 (564)
160 KOG4121|consensus 61.9 47 0.001 29.7 7.4 38 9-46 779-816 (1128)
161 PLN03098 LPA1 LOW PSII ACCUMUL 61.4 56 0.0012 26.4 7.4 49 23-71 83-138 (453)
162 KOG2114|consensus 61.0 45 0.00097 29.2 7.0 63 19-81 435-499 (933)
163 COG2909 MalT ATP-dependent tra 60.9 22 0.00047 31.0 5.2 44 41-84 342-386 (894)
164 COG3629 DnrI DNA-binding trans 60.7 27 0.00058 26.4 5.2 58 18-84 156-213 (280)
165 PF13174 TPR_6: Tetratricopept 60.5 10 0.00023 17.7 2.1 22 49-70 4-25 (33)
166 PRK10866 outer membrane biogen 60.4 31 0.00067 24.8 5.4 20 49-68 73-92 (243)
167 KOG1840|consensus 60.4 18 0.00038 29.5 4.5 69 3-71 255-351 (508)
168 PF15469 Sec5: Exocyst complex 59.8 32 0.0007 23.4 5.2 40 2-41 99-140 (182)
169 KOG4626|consensus 59.2 7.3 0.00016 33.3 2.2 30 42-71 181-210 (966)
170 COG5010 TadD Flp pilus assembl 57.3 40 0.00086 25.3 5.6 51 21-71 106-160 (257)
171 cd02576 PseudoU_synth_ScPUS7 P 56.5 19 0.00041 27.6 3.9 37 2-38 170-212 (371)
172 KOG4340|consensus 56.1 7.9 0.00017 30.5 1.8 46 24-69 121-168 (459)
173 KOG2076|consensus 55.7 53 0.0011 28.8 6.6 68 3-70 428-508 (895)
174 PF10366 Vps39_1: Vacuolar sor 55.4 28 0.00061 22.3 4.0 47 26-73 21-67 (108)
175 PRK15331 chaperone protein Sic 55.1 31 0.00066 24.2 4.4 17 12-28 81-97 (165)
176 COG2956 Predicted N-acetylgluc 54.1 83 0.0018 24.9 7.0 43 28-70 193-239 (389)
177 PF12921 ATP13: Mitochondrial 54.0 60 0.0013 21.2 6.1 50 19-68 6-75 (126)
178 KOG4162|consensus 52.5 24 0.00053 30.3 4.2 68 3-70 698-779 (799)
179 PF12688 TPR_5: Tetratrico pep 51.9 64 0.0014 21.0 5.7 55 8-70 7-63 (120)
180 PF07575 Nucleopor_Nup85: Nup8 51.9 9.1 0.0002 30.8 1.6 75 3-77 373-457 (566)
181 PF08238 Sel1: Sel1 repeat; I 51.9 14 0.00031 18.1 1.9 14 3-16 22-35 (39)
182 PF14165 YtzH: YtzH-like prote 51.6 7.1 0.00015 24.8 0.7 59 21-86 18-76 (87)
183 PRK10866 outer membrane biogen 51.5 78 0.0017 22.7 6.3 13 68-80 185-197 (243)
184 PF04733 Coatomer_E: Coatomer 50.7 20 0.00043 26.6 3.1 69 2-70 115-192 (290)
185 smart00668 CTLH C-terminal to 50.5 26 0.00055 18.9 2.9 22 65-86 8-29 (58)
186 smart00671 SEL1 Sel1-like repe 50.5 14 0.0003 17.8 1.6 14 3-16 19-32 (36)
187 COG5110 RPN1 26S proteasome re 50.4 63 0.0014 27.6 6.1 74 9-82 187-278 (881)
188 PF15015 NYD-SP12_N: Spermatog 50.0 65 0.0014 26.5 6.0 28 60-87 230-260 (569)
189 KOG1126|consensus 49.9 28 0.0006 29.3 4.1 73 3-75 367-451 (638)
190 KOG0890|consensus 49.6 66 0.0014 31.1 6.6 71 2-73 1396-1477(2382)
191 KOG2114|consensus 49.5 86 0.0019 27.6 6.9 67 11-83 701-768 (933)
192 KOG4121|consensus 48.7 32 0.00069 30.7 4.4 36 51-86 778-813 (1128)
193 cd00280 TRFH Telomeric Repeat 48.4 61 0.0013 23.5 5.2 22 62-83 115-136 (200)
194 PF14718 SLT_L: Soluble lytic 48.4 56 0.0012 19.3 5.2 30 45-74 39-68 (71)
195 PRK14574 hmsH outer membrane p 47.5 99 0.0022 26.6 7.1 56 4-68 414-473 (822)
196 PF04121 Nup84_Nup100: Nuclear 47.0 21 0.00046 29.6 3.0 79 2-81 146-234 (697)
197 KOG2168|consensus 46.7 34 0.00074 29.7 4.2 37 5-41 625-661 (835)
198 KOG4340|consensus 46.5 83 0.0018 25.0 6.0 67 18-84 47-117 (459)
199 cd02577 PSTD1 PSTD1: Pseudouri 46.1 81 0.0018 24.0 5.8 33 2-34 162-201 (319)
200 KOG2376|consensus 45.9 1.4E+02 0.003 25.3 7.4 30 42-71 171-201 (652)
201 KOG0548|consensus 45.6 1.1E+02 0.0024 25.3 6.8 70 2-71 15-96 (539)
202 PRK15331 chaperone protein Sic 45.4 34 0.00074 23.9 3.5 14 66-79 113-126 (165)
203 COG1397 DraG ADP-ribosylglycoh 45.3 63 0.0014 24.9 5.1 55 30-84 207-265 (314)
204 cd02679 MIT_spastin MIT: domai 45.3 16 0.00035 22.5 1.6 15 3-17 22-36 (79)
205 KOG1173|consensus 44.2 52 0.0011 27.6 4.8 51 21-72 284-339 (611)
206 COG5071 RPN5 26S proteasome re 43.7 58 0.0013 25.8 4.7 64 20-83 24-94 (439)
207 cd08780 Death_TRADD Death Doma 42.3 34 0.00073 21.8 2.8 39 35-73 17-60 (90)
208 cd02680 MIT_calpain7_2 MIT: do 42.1 21 0.00045 21.8 1.7 13 15-27 19-31 (75)
209 COG3947 Response regulator con 42.0 78 0.0017 24.8 5.2 23 61-83 282-304 (361)
210 PF07721 TPR_4: Tetratricopept 41.9 37 0.00081 15.8 2.4 15 66-80 9-23 (26)
211 KOG3785|consensus 41.9 1.2E+02 0.0026 24.6 6.3 77 2-78 35-124 (557)
212 COG5200 LUC7 U1 snRNP componen 41.9 14 0.00031 27.4 1.1 46 31-76 116-161 (258)
213 KOG4648|consensus 41.5 19 0.00041 29.0 1.8 24 48-71 100-123 (536)
214 KOG4234|consensus 41.4 53 0.0012 24.6 4.1 21 66-86 142-165 (271)
215 PF00627 UBA: UBA/TS-N domain; 41.4 45 0.00097 16.9 2.8 18 62-79 17-35 (37)
216 PRK04841 transcriptional regul 39.7 2.2E+02 0.0047 23.7 7.8 67 3-69 355-433 (903)
217 KOG2002|consensus 39.6 89 0.0019 27.8 5.6 69 3-71 660-742 (1018)
218 KOG0543|consensus 39.5 1.1E+02 0.0024 24.4 5.8 26 58-83 257-282 (397)
219 TIGR03504 FimV_Cterm FimV C-te 39.1 66 0.0014 17.4 3.6 34 20-53 4-38 (44)
220 PF09976 TPR_21: Tetratricopep 38.5 1.1E+02 0.0023 19.6 6.5 19 51-69 54-72 (145)
221 KOG0889|consensus 38.2 32 0.00068 34.3 3.0 58 2-61 2495-2570(3550)
222 KOG0292|consensus 38.1 1.7E+02 0.0036 26.4 7.0 72 13-84 1002-1110(1202)
223 COG2976 Uncharacterized protei 37.0 1.2E+02 0.0026 22.1 5.2 54 20-73 131-187 (207)
224 cd02678 MIT_VPS4 MIT: domain c 37.0 26 0.00057 20.7 1.6 15 13-27 17-31 (75)
225 KOG1126|consensus 37.0 1.3E+02 0.0028 25.5 6.0 69 3-71 537-617 (638)
226 PF10163 EnY2: Transcription f 36.9 99 0.0022 18.8 4.5 36 6-41 4-43 (86)
227 KOG2005|consensus 36.2 1.3E+02 0.0028 26.1 6.0 55 6-61 243-297 (878)
228 COG1939 Ribonuclease III famil 36.0 85 0.0018 21.3 4.1 33 9-41 94-126 (132)
229 KOG4642|consensus 35.7 1.4E+02 0.003 22.8 5.5 69 3-71 24-104 (284)
230 smart00685 DM14 Repeats in fly 35.6 93 0.002 18.2 4.6 38 3-40 9-46 (59)
231 PF07443 HARP: HepA-related pr 35.6 16 0.00035 21.2 0.5 18 26-43 32-49 (55)
232 PF12816 Vps8: Golgi CORVET co 35.4 1.6E+02 0.0034 20.7 6.7 39 46-84 23-65 (196)
233 smart00745 MIT Microtubule Int 35.3 30 0.00064 20.2 1.7 25 12-41 18-42 (77)
234 PF15448 NTS_2: N-terminal seg 34.8 34 0.00075 19.3 1.7 25 63-87 9-33 (51)
235 COG3071 HemY Uncharacterized e 34.8 1.7E+02 0.0036 23.5 6.1 64 7-70 268-353 (400)
236 PF00244 14-3-3: 14-3-3 protei 34.7 1.7E+02 0.0038 21.0 6.0 44 31-82 149-193 (236)
237 PF14669 Asp_Glu_race_2: Putat 34.6 40 0.00086 24.8 2.5 28 4-31 183-210 (233)
238 PF02064 MAS20: MAS20 protein 34.3 1.4E+02 0.0029 19.8 4.9 35 50-84 68-106 (121)
239 PF04003 Utp12: Dip2/Utp12 Fam 34.1 1.1E+02 0.0025 18.7 4.6 47 17-63 1-51 (110)
240 KOG0495|consensus 33.9 1.1E+02 0.0023 26.7 5.2 46 41-86 813-882 (913)
241 PRK10564 maltose regulon perip 33.2 50 0.0011 25.4 2.9 32 46-83 251-282 (303)
242 PF11768 DUF3312: Protein of u 32.2 1.6E+02 0.0036 24.4 5.9 77 9-85 415-521 (545)
243 PF06301 Lambda_Kil: Bacteriop 31.6 32 0.00069 19.1 1.2 21 50-70 13-33 (43)
244 cd02656 MIT MIT: domain contai 31.5 37 0.0008 19.8 1.6 16 12-27 16-31 (75)
245 KOG2002|consensus 31.4 80 0.0017 28.1 4.1 38 38-75 191-229 (1018)
246 PF13281 DUF4071: Domain of un 31.3 1.6E+02 0.0035 23.1 5.5 77 4-83 241-330 (374)
247 KOG3824|consensus 31.0 1.1E+02 0.0023 24.5 4.4 30 42-71 147-176 (472)
248 PF13311 DUF4080: Protein of u 30.7 1.2E+02 0.0026 21.1 4.3 49 14-62 29-78 (190)
249 PF07743 HSCB_C: HSCB C-termin 30.6 47 0.001 19.4 2.0 41 29-70 24-65 (78)
250 PF04190 DUF410: Protein of un 30.5 2.1E+02 0.0047 20.8 5.9 36 44-79 88-124 (260)
251 COG4190 Predicted transcriptio 30.2 30 0.00066 23.8 1.2 24 30-53 51-75 (144)
252 KOG1128|consensus 29.9 1.3E+02 0.0029 26.0 5.1 66 3-68 412-480 (777)
253 PF04212 MIT: MIT (microtubule 29.4 1E+02 0.0022 17.5 3.3 24 48-71 8-31 (69)
254 smart00386 HAT HAT (Half-A-TPR 29.3 48 0.001 15.0 1.6 11 4-14 2-12 (33)
255 cd07653 F-BAR_CIP4-like The F- 28.9 30 0.00064 24.6 1.1 34 29-62 2-35 (251)
256 PF10607 CLTH: CTLH/CRA C-term 28.4 71 0.0015 20.5 2.7 22 65-86 8-29 (145)
257 PF14760 Rnk_N: Rnk N-terminus 28.2 93 0.002 16.6 2.7 27 29-55 9-36 (42)
258 PF05843 Suf: Suppressor of fo 28.1 2.4E+02 0.0052 20.5 6.0 38 34-71 59-96 (280)
259 cd02681 MIT_calpain7_1 MIT: do 28.1 46 0.00099 20.2 1.6 17 12-28 16-32 (76)
260 COG3063 PilF Tfp pilus assembl 27.7 1.6E+02 0.0035 22.0 4.7 62 11-72 135-200 (250)
261 PF11207 DUF2989: Protein of u 27.6 1.9E+02 0.0041 20.9 5.0 49 9-64 148-197 (203)
262 COG3947 Response regulator con 27.3 67 0.0015 25.1 2.7 31 5-35 282-312 (361)
263 PF14669 Asp_Glu_race_2: Putat 27.2 55 0.0012 24.1 2.2 21 61-81 184-204 (233)
264 PF10475 DUF2450: Protein of u 26.8 85 0.0018 23.1 3.2 37 48-84 117-153 (291)
265 cd02683 MIT_1 MIT: domain cont 26.8 48 0.001 20.0 1.6 46 11-58 15-63 (77)
266 KOG1129|consensus 26.6 1.7E+02 0.0038 23.5 5.0 27 42-68 287-313 (478)
267 PF04121 Nup84_Nup100: Nuclear 26.6 70 0.0015 26.6 3.0 19 65-83 140-158 (697)
268 PRK04841 transcriptional regul 26.1 2E+02 0.0043 23.9 5.5 38 45-82 341-378 (903)
269 PF08822 DUF1804: Protein of u 26.0 1.4E+02 0.0031 20.9 4.0 56 3-58 48-111 (165)
270 PF05131 Pep3_Vps18: Pep3/Vps1 26.0 57 0.0012 22.1 2.0 19 65-83 110-128 (147)
271 PRK14720 transcript cleavage f 25.9 4.1E+02 0.0089 23.5 7.5 71 4-80 164-245 (906)
272 cd02684 MIT_2 MIT: domain cont 25.8 52 0.0011 19.7 1.6 24 13-41 17-40 (75)
273 PF12812 PDZ_1: PDZ-like domai 25.6 67 0.0015 19.4 2.1 16 29-44 63-78 (78)
274 PF14649 Spatacsin_C: Spatacsi 25.4 1.3E+02 0.0027 22.9 3.9 33 25-57 263-295 (296)
275 cd02682 MIT_AAA_Arch MIT: doma 25.4 1.7E+02 0.0036 17.8 4.5 51 5-57 9-62 (75)
276 PF08397 IMD: IRSp53/MIM homol 25.2 99 0.0021 21.8 3.2 57 10-67 14-72 (219)
277 KOG3785|consensus 25.1 2.5E+02 0.0055 22.9 5.6 42 3-44 71-120 (557)
278 PRK13910 DNA glycosylase MutY; 24.9 1.1E+02 0.0024 23.0 3.6 19 9-27 43-61 (289)
279 PF06855 DUF1250: Protein of u 24.8 52 0.0011 17.9 1.4 30 40-69 10-39 (46)
280 KOG1064|consensus 24.7 2.8E+02 0.0061 27.1 6.4 28 56-83 1286-1313(2439)
281 cd04389 RhoGAP_KIAA1688 RhoGAP 24.6 84 0.0018 21.7 2.7 32 28-59 104-136 (187)
282 PF08820 DUF1803: Domain of un 24.6 92 0.002 19.8 2.6 32 44-75 27-58 (93)
283 PF11464 Rbsn: Rabenosyn Rab b 24.4 1.3E+02 0.0029 16.4 3.2 33 11-43 1-33 (42)
284 PHA01748 hypothetical protein 24.4 51 0.0011 19.0 1.3 40 30-69 13-53 (60)
285 COG1729 Uncharacterized protei 24.1 2.4E+02 0.0051 21.2 5.1 49 22-70 185-240 (262)
286 KOG1498|consensus 23.9 1.1E+02 0.0025 24.6 3.5 37 8-44 45-81 (439)
287 PF12234 Rav1p_C: RAVE protein 23.9 4.5E+02 0.0097 22.2 9.5 80 3-82 466-552 (631)
288 COG4008 Predicted metal-bindin 23.8 2.4E+02 0.0053 19.3 4.7 22 61-82 128-149 (153)
289 KOG2066|consensus 23.7 2.1E+02 0.0045 25.1 5.2 44 15-58 634-677 (846)
290 KOG2160|consensus 23.6 2.1E+02 0.0045 22.4 4.8 48 30-86 103-156 (342)
291 PF10668 Phage_terminase: Phag 23.5 90 0.002 18.3 2.3 26 4-29 8-33 (60)
292 COG4700 Uncharacterized protei 23.1 3.2E+02 0.007 20.3 7.5 69 2-70 102-185 (251)
293 PF08769 Spo0A_C: Sporulation 22.9 15 0.00033 23.6 -1.2 39 18-56 61-106 (106)
294 PF02845 CUE: CUE domain; Int 22.8 1.1E+02 0.0024 15.8 2.4 14 56-69 26-39 (42)
295 PTZ00373 60S Acidic ribosomal 22.5 2.3E+02 0.0051 18.5 5.0 59 1-59 1-68 (112)
296 PF08025 Antimicrobial_3: Spid 22.1 57 0.0012 17.2 1.1 17 45-61 7-23 (37)
297 TIGR02875 spore_0_A sporulatio 22.1 1.7E+02 0.0037 20.3 3.9 41 17-57 210-257 (262)
298 PF12816 Vps8: Golgi CORVET co 21.9 2.5E+02 0.0055 19.7 4.7 34 7-40 27-64 (196)
299 PF14649 Spatacsin_C: Spatacsi 21.8 2E+02 0.0044 21.8 4.5 67 8-74 52-118 (296)
300 COG3063 PilF Tfp pilus assembl 21.6 3.4E+02 0.0074 20.4 5.5 52 9-70 76-128 (250)
301 KOG2063|consensus 21.4 4E+02 0.0087 23.5 6.5 41 44-84 490-530 (877)
302 PF04010 DUF357: Protein of un 21.1 1.1E+02 0.0023 18.5 2.3 34 3-36 36-70 (75)
303 PF11390 FdsD: NADH-dependant 21.0 1.1E+02 0.0023 18.0 2.2 19 49-67 7-25 (61)
304 smart00544 MA3 Domain in DAP-5 20.8 2.1E+02 0.0046 17.4 4.9 26 18-43 5-30 (113)
305 PF07911 DUF1677: Protein of u 20.5 1.5E+02 0.0033 18.8 3.0 26 58-83 34-63 (91)
306 COG1194 MutY A/G-specific DNA 20.4 81 0.0018 24.6 2.1 62 9-82 84-145 (342)
307 KOG0307|consensus 20.3 2.2E+02 0.0047 25.6 4.8 61 6-66 605-686 (1049)
308 cd04409 RhoGAP_PARG1 RhoGAP_PA 20.3 59 0.0013 22.9 1.2 32 31-62 130-162 (211)
No 1
>KOG2041|consensus
Probab=99.94 E-value=1.3e-27 Score=193.97 Aligned_cols=83 Identities=31% Similarity=0.447 Sum_probs=82.0
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
+.|++|++||+++||.+..|||||++++|++||.+++.||++|++|+.+|+||+++|||+|||+||+|++++|+|+.+||
T Consensus 810 ~~We~A~~yY~~~~~~e~~~ecly~le~f~~LE~la~~Lpe~s~llp~~a~mf~svGMC~qAV~a~Lr~s~pkaAv~tCv 889 (1189)
T KOG2041|consen 810 MEWEEAAKYYSYCGDTENQIECLYRLELFGELEVLARTLPEDSELLPVMADMFTSVGMCDQAVEAYLRRSLPKAAVHTCV 889 (1189)
T ss_pred HHHHHHHHHHHhccchHhHHHHHHHHHhhhhHHHHHHhcCcccchHHHHHHHHHhhchHHHHHHHHHhccCcHHHHHHHH
Confidence 68999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred hcC
Q psy11102 83 QDK 85 (87)
Q Consensus 83 ~~~ 85 (87)
++|
T Consensus 890 ~Ln 892 (1189)
T KOG2041|consen 890 ELN 892 (1189)
T ss_pred HHH
Confidence 987
No 2
>KOG3617|consensus
Probab=98.95 E-value=2.4e-09 Score=89.70 Aligned_cols=84 Identities=19% Similarity=0.195 Sum_probs=79.3
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC-C-chHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS-S-PLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLT 80 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~-~-~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~ 80 (87)
++..+|+.+|.++|-+.|+.|+.++.++|..|+-|+..|+++ + ++|..|+++|-+.-+++.||+..++...+..|+++
T Consensus 1036 ~~~~~AVmLYHkAGm~~kALelAF~tqQf~aL~lIa~DLd~~sDp~ll~RcadFF~~~~qyekAV~lL~~ar~~~~Alql 1115 (1416)
T KOG3617|consen 1036 GYAHKAVMLYHKAGMIGKALELAFRTQQFSALDLIAKDLDAGSDPKLLRRCADFFENNQQYEKAVNLLCLAREFSGALQL 1115 (1416)
T ss_pred hhhhHHHHHHHhhcchHHHHHHHHhhcccHHHHHHHHhcCCCCCHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 467899999999999999999999999999999999999985 3 88999999999999999999999999999999999
Q ss_pred HhhcCC
Q psy11102 81 VVQDKP 86 (87)
Q Consensus 81 cv~~~~ 86 (87)
|.+.|-
T Consensus 1116 C~~~nv 1121 (1416)
T KOG3617|consen 1116 CKNRNV 1121 (1416)
T ss_pred HhcCCC
Confidence 998764
No 3
>KOG1538|consensus
Probab=98.32 E-value=1.1e-06 Score=72.72 Aligned_cols=83 Identities=20% Similarity=0.055 Sum_probs=77.9
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTV 81 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~c 81 (87)
+.+..|+++...+|+..|+|++...+||.|-|-+|.+.|+.. .+-|..++..|.+..-..-|.+.|.|.||.++.+++|
T Consensus 704 kePkaAAEmLiSaGe~~KAi~i~~d~gW~d~lidI~rkld~~ere~l~~~a~ylk~l~~~gLAaeIF~k~gD~ksiVqlH 783 (1081)
T KOG1538|consen 704 KEPKAAAEMLISAGEHVKAIEICGDHGWVDMLIDIARKLDKAEREPLLLCATYLKKLDSPGLAAEIFLKMGDLKSLVQLH 783 (1081)
T ss_pred CCcHHHHHHhhcccchhhhhhhhhcccHHHHHHHHHhhcchhhhhHHHHHHHHHhhccccchHHHHHHHhccHHHHhhhe
Confidence 456789999999999999999999999999999999999985 5779999999999999999999999999999999999
Q ss_pred hhcC
Q psy11102 82 VQDK 85 (87)
Q Consensus 82 v~~~ 85 (87)
|+..
T Consensus 784 ve~~ 787 (1081)
T KOG1538|consen 784 VETQ 787 (1081)
T ss_pred eecc
Confidence 9854
No 4
>KOG1920|consensus
Probab=97.93 E-value=6.4e-05 Score=64.92 Aligned_cols=81 Identities=21% Similarity=0.301 Sum_probs=63.4
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC-CchH---HHHHHHHH-hCCChHHHHHHHHhcCCHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS-SPLL---KPMGEIFV-KYGLCEQAVYVFDKNKHKSSQ 77 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL---~~ia~~F~-~~G~~~~Av~aylk~gd~k~a 77 (87)
+.|++|+-.|..+|+.+||+.||-+.+||.+...+...+..+ +++. .+++..+. +..++|-|.-.-+-++|++.|
T Consensus 953 ~~~~~Aal~Ye~~GklekAl~a~~~~~dWr~~l~~a~ql~~~~de~~~~a~~L~s~L~e~~kh~eAa~il~e~~sd~~~a 1032 (1265)
T KOG1920|consen 953 LMSDEAALMYERCGKLEKALKAYKECGDWREALSLAAQLSEGKDELVILAEELVSRLVEQRKHYEAAKILLEYLSDPEEA 1032 (1265)
T ss_pred ccccHHHHHHHHhccHHHHHHHHHHhccHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHcccchhHHHHHHHHhcCHHHH
Confidence 468999999999999999999999999999999999999886 4544 45555553 445555555566667777777
Q ss_pred HHHHhh
Q psy11102 78 WLTVVQ 83 (87)
Q Consensus 78 i~~cv~ 83 (87)
+.+.++
T Consensus 1033 v~ll~k 1038 (1265)
T KOG1920|consen 1033 VALLCK 1038 (1265)
T ss_pred HHHHhh
Confidence 776654
No 5
>KOG1538|consensus
Probab=97.44 E-value=0.00054 Score=57.27 Aligned_cols=83 Identities=8% Similarity=0.050 Sum_probs=77.1
Q ss_pred chhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC-C-CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPD-S-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWL 79 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~-~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~ 79 (87)
++++.+|+.+|.++|.-.++.|+|.-+-+||...+.+.+-.+ . ..|+.+=|++-.++....-|.+..+-+|+..+||+
T Consensus 645 ~gKF~EAAklFk~~G~enRAlEmyTDlRMFD~aQE~~~~g~~~eKKmL~RKRA~WAr~~kePkaAAEmLiSaGe~~KAi~ 724 (1081)
T KOG1538|consen 645 QGKFHEAAKLFKRSGHENRALEMYTDLRMFDYAQEFLGSGDPKEKKMLIRKRADWARNIKEPKAAAEMLISAGEHVKAIE 724 (1081)
T ss_pred hhhHHHHHHHHHHcCchhhHHHHHHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHhhhcCCcHHHHHHhhcccchhhhhh
Confidence 578999999999999999999999999999999999888654 3 58899999999999999999999999999999999
Q ss_pred HHhhc
Q psy11102 80 TVVQD 84 (87)
Q Consensus 80 ~cv~~ 84 (87)
.|.++
T Consensus 725 i~~d~ 729 (1081)
T KOG1538|consen 725 ICGDH 729 (1081)
T ss_pred hhhcc
Confidence 99875
No 6
>KOG3616|consensus
Probab=97.41 E-value=0.00052 Score=58.29 Aligned_cols=84 Identities=19% Similarity=0.279 Sum_probs=77.2
Q ss_pred chhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhh-CCCC-CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARK-LPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWL 79 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~-L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~ 79 (87)
.+-|++|-.+|++-+|+.|+++||-+-.-|.+-.+|+|- .|+. .+|=+.-|..+...|+.+.||.-|+.+++.++||+
T Consensus 661 ~elydkagdlfeki~d~dkale~fkkgdaf~kaielarfafp~evv~lee~wg~hl~~~~q~daainhfiea~~~~kaie 740 (1636)
T KOG3616|consen 661 GELYDKAGDLFEKIHDFDKALECFKKGDAFGKAIELARFAFPEEVVKLEEAWGDHLEQIGQLDAAINHFIEANCLIKAIE 740 (1636)
T ss_pred hHHHHhhhhHHHHhhCHHHHHHHHHcccHHHHHHHHHHhhCcHHHhhHHHHHhHHHHHHHhHHHHHHHHHHhhhHHHHHH
Confidence 356899999999999999999999999999999999997 5775 68889999999999999999999999999999999
Q ss_pred HHhhcC
Q psy11102 80 TVVQDK 85 (87)
Q Consensus 80 ~cv~~~ 85 (87)
..+..+
T Consensus 741 aai~ak 746 (1636)
T KOG3616|consen 741 AAIGAK 746 (1636)
T ss_pred HHhhhh
Confidence 988754
No 7
>KOG3616|consensus
Probab=97.40 E-value=0.00071 Score=57.51 Aligned_cols=83 Identities=13% Similarity=0.107 Sum_probs=77.1
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC-C--CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPD-S--SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWL 79 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~-~--~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~ 79 (87)
++++.|+..|..++...|+|+.......|.+-..+++++.+ + +.--..|++.+.+.|.++-|-+.|.+.|.++.||+
T Consensus 720 ~q~daainhfiea~~~~kaieaai~akew~kai~ildniqdqk~~s~yy~~iadhyan~~dfe~ae~lf~e~~~~~dai~ 799 (1636)
T KOG3616|consen 720 GQLDAAINHFIEANCLIKAIEAAIGAKEWKKAISILDNIQDQKTASGYYGEIADHYANKGDFEIAEELFTEADLFKDAID 799 (1636)
T ss_pred HhHHHHHHHHHHhhhHHHHHHHHhhhhhhhhhHhHHHHhhhhccccccchHHHHHhccchhHHHHHHHHHhcchhHHHHH
Confidence 67899999999999999999999999999999999999987 3 58889999999999999999999999999999999
Q ss_pred HHhhcC
Q psy11102 80 TVVQDK 85 (87)
Q Consensus 80 ~cv~~~ 85 (87)
++-+.+
T Consensus 800 my~k~~ 805 (1636)
T KOG3616|consen 800 MYGKAG 805 (1636)
T ss_pred HHhccc
Confidence 986543
No 8
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=97.28 E-value=6.2e-05 Score=49.19 Aligned_cols=65 Identities=20% Similarity=0.251 Sum_probs=54.9
Q ss_pred cHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 17 CYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 17 n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
-...++.+|.+.++.+.|++..+.... =-+..++..+.+.|++++|+..|.|.|+++.|++++.+
T Consensus 44 ~~~~L~~ly~~~~~~~~l~~~L~~~~~--yd~~~~~~~c~~~~l~~~a~~Ly~~~~~~~~al~i~~~ 108 (143)
T PF00637_consen 44 LHTLLLELYIKYDPYEKLLEFLKTSNN--YDLDKALRLCEKHGLYEEAVYLYSKLGNHDEALEILHK 108 (143)
T ss_dssp HHHHHHHHHHCTTTCCHHHHTTTSSSS--S-CTHHHHHHHTTTSHHHHHHHHHCCTTHTTCSSTSSS
T ss_pred HHHHHHHHHHhcCCchHHHHHcccccc--cCHHHHHHHHHhcchHHHHHHHHHHcccHHHHHHHHHH
Confidence 357899999999999999999995554 22458888889999999999999999999999997444
No 9
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=97.25 E-value=0.00073 Score=40.61 Aligned_cols=69 Identities=25% Similarity=0.331 Sum_probs=50.8
Q ss_pred chhHHHHHHHHHHhccHH----------HHHHHHHHhCCHHHHHHHHhhCCC--CC-chHHHHHHHHHhCCChHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCYE----------KLISVYTELGDFEALESCARKLPD--SS-PLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~----------k~ie~~~~~ed~d~L~~l~~~L~~--~~-~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.++|++|+.+|.+.-... .++.||++.|+|+.-..+.+.++. .+ ...-.+|.-+...|..++|+++|
T Consensus 2 ~~~y~~Ai~~~~k~~~~~~~~~~~~~~~~la~~~~~~~~y~~A~~~~~~~~~~~~~~~~~~l~a~~~~~l~~y~eAi~~l 81 (84)
T PF12895_consen 2 QGNYENAIKYYEKLLELDPTNPNSAYLYNLAQCYFQQGKYEEAIELLQKLKLDPSNPDIHYLLARCLLKLGKYEEAIKAL 81 (84)
T ss_dssp TT-HHHHHHHHHHHHHHHCGTHHHHHHHHHHHHHHHTTHHHHHHHHHHCHTHHHCHHHHHHHHHHHHHHTT-HHHHHHHH
T ss_pred CccHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHCCCHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHhCCHHHHHHHH
Confidence 578999999988753222 348999999999999999977432 22 45556688888888888888888
Q ss_pred Hh
Q psy11102 69 DK 70 (87)
Q Consensus 69 lk 70 (87)
.|
T Consensus 82 ~~ 83 (84)
T PF12895_consen 82 EK 83 (84)
T ss_dssp HH
T ss_pred hc
Confidence 66
No 10
>KOG3617|consensus
Probab=96.91 E-value=0.0061 Score=52.34 Aligned_cols=80 Identities=14% Similarity=0.183 Sum_probs=69.9
Q ss_pred chhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCC-ChHHHHHHHHhcCCHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYG-LCEQAVYVFDKNKHKSSQWLT 80 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G-~~~~Av~aylk~gd~k~ai~~ 80 (87)
.++..+|+.+|.++.-+..+|.++-.++.-|.|..++-.- +-.-+...|.+|...| ....||..|-|+|-+-+|+++
T Consensus 980 ~g~v~~Av~FfTrAqafsnAIRlcKEnd~~d~L~nlal~s--~~~d~v~aArYyEe~g~~~~~AVmLYHkAGm~~kALel 1057 (1416)
T KOG3617|consen 980 DGDVVKAVKFFTRAQAFSNAIRLCKENDMKDRLANLALMS--GGSDLVSAARYYEELGGYAHKAVMLYHKAGMIGKALEL 1057 (1416)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHhhc--CchhHHHHHHHHHHcchhhhHHHHHHHhhcchHHHHHH
Confidence 4788999999999999999999999999999999987543 3344778899998888 999999999999999999998
Q ss_pred Hhh
Q psy11102 81 VVQ 83 (87)
Q Consensus 81 cv~ 83 (87)
.-+
T Consensus 1058 AF~ 1060 (1416)
T KOG3617|consen 1058 AFR 1060 (1416)
T ss_pred HHh
Confidence 754
No 11
>PF04840 Vps16_C: Vps16, C-terminal region; InterPro: IPR006925 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=96.89 E-value=0.0069 Score=45.85 Aligned_cols=64 Identities=20% Similarity=0.240 Sum_probs=45.7
Q ss_pred HHHHHHHhCCHHHHHHHHhh--CCCC-CchHHHH---------HHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhcC
Q psy11102 21 LISVYTELGDFEALESCARK--LPDS-SPLLKPM---------GEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQDK 85 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~--L~~~-~~lL~~i---------a~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~~ 85 (87)
-|..|...++||+|++++.+ -|-+ .|....| ..+...+.. +.-|+.|+++|+++.|++.|++.+
T Consensus 214 ki~aLa~~~~w~eL~~fa~skKsPIGyepFv~~~~~~~~~~eA~~yI~k~~~-~~rv~~y~~~~~~~~A~~~A~~~k 289 (319)
T PF04840_consen 214 KIKALAENKDWDELEKFAKSKKSPIGYEPFVEACLKYGNKKEASKYIPKIPD-EERVEMYLKCGDYKEAAQEAFKEK 289 (319)
T ss_pred HHHHHHhcCCHHHHHHHHhCCCCCCChHHHHHHHHHCCCHHHHHHHHHhCCh-HHHHHHHHHCCCHHHHHHHHHHcC
Confidence 36788999999999999975 2223 2333222 223333444 777899999999999999999876
No 12
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=96.62 E-value=0.017 Score=37.92 Aligned_cols=70 Identities=17% Similarity=0.321 Sum_probs=56.9
Q ss_pred chhHHHHHHHHHHhccH-----------HHHHHHHHHhCCHHHHHHHHhhCCCC---CchHHHHHHHHHhCCChHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCY-----------EKLISVYTELGDFEALESCARKLPDS---SPLLKPMGEIFVKYGLCEQAVYV 67 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~-----------~k~ie~~~~~ed~d~L~~l~~~L~~~---~~lL~~ia~~F~~~G~~~~Av~a 67 (87)
.+++++|...|.+.-+. -+++.++...|+||.-..+.+..++. .......|+.+...|..++|+++
T Consensus 61 ~g~~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~~~~~~d~Al~~L~~~~~~~~~~~~~~~~Gdi~~~~g~~~~A~~~ 140 (145)
T PF09976_consen 61 QGDYDEAKAALEKALANAPDPELKPLARLRLARILLQQGQYDEALATLQQIPDEAFKALAAELLGDIYLAQGDYDEARAA 140 (145)
T ss_pred CCCHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHhccCcchHHHHHHHHHHHHHHCCCHHHHHHH
Confidence 47888899888766543 24789999999999999999888763 24577889999999999999999
Q ss_pred HHhc
Q psy11102 68 FDKN 71 (87)
Q Consensus 68 ylk~ 71 (87)
|.++
T Consensus 141 y~~A 144 (145)
T PF09976_consen 141 YQKA 144 (145)
T ss_pred HHHh
Confidence 9763
No 13
>KOG0276|consensus
Probab=96.52 E-value=0.018 Score=47.84 Aligned_cols=81 Identities=9% Similarity=-0.053 Sum_probs=73.0
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
.+|++=.++..+++++.++-||+.+..||.+|.-+-... -+.+.|..+|.-=.+.|...-|--+|...||+...+++-+
T Consensus 667 ~Kw~~Lg~~al~~~~l~lA~EC~~~a~d~~~LlLl~t~~-g~~~~l~~la~~~~~~g~~N~AF~~~~l~g~~~~C~~lLi 745 (794)
T KOG0276|consen 667 VKWRQLGDAALSAGELPLASECFLRARDLGSLLLLYTSS-GNAEGLAVLASLAKKQGKNNLAFLAYFLSGDYEECLELLI 745 (794)
T ss_pred HHHHHHHHHHhhcccchhHHHHHHhhcchhhhhhhhhhc-CChhHHHHHHHHHHhhcccchHHHHHHHcCCHHHHHHHHH
Confidence 589999999999999999999999999999998665443 3468899999999999999999999999999999999887
Q ss_pred hc
Q psy11102 83 QD 84 (87)
Q Consensus 83 ~~ 84 (87)
+-
T Consensus 746 ~t 747 (794)
T KOG0276|consen 746 ST 747 (794)
T ss_pred hc
Confidence 65
No 14
>KOG2041|consensus
Probab=96.50 E-value=0.01 Score=50.32 Aligned_cols=83 Identities=16% Similarity=0.165 Sum_probs=72.8
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC--CCc----hHHHHHHHHHhCCChHHHHHHHHhcCCHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPD--SSP----LLKPMGEIFVKYGLCEQAVYVFDKNKHKSS 76 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~--~~~----lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ 76 (87)
|.+++|-.+|--+..-.-+|+++-++|||-..-+|.|.=.. ++. ....||+.|...--.+.|++.|.++|+...
T Consensus 748 g~feeaek~yld~drrDLAielr~klgDwfrV~qL~r~g~~d~dD~~~e~A~r~ig~~fa~~~~We~A~~yY~~~~~~e~ 827 (1189)
T KOG2041|consen 748 GEFEEAEKLYLDADRRDLAIELRKKLGDWFRVYQLIRNGGSDDDDEGKEDAFRNIGETFAEMMEWEEAAKYYSYCGDTEN 827 (1189)
T ss_pred cchhHhhhhhhccchhhhhHHHHHhhhhHHHHHHHHHccCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchHh
Confidence 57899999999999999999999999999999999998433 233 356799999999999999999999999999
Q ss_pred HHHHHhhcC
Q psy11102 77 QWLTVVQDK 85 (87)
Q Consensus 77 ai~~cv~~~ 85 (87)
.+++..+++
T Consensus 828 ~~ecly~le 836 (1189)
T KOG2041|consen 828 QIECLYRLE 836 (1189)
T ss_pred HHHHHHHHH
Confidence 999877654
No 15
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=96.41 E-value=0.046 Score=43.20 Aligned_cols=82 Identities=13% Similarity=-0.043 Sum_probs=67.3
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
..|+.=.+...+.||++-|.+||.+.+||++|.=|--...+ .+-|.+++..=...|...-|-.+++-.||++.-+++-.
T Consensus 348 ~~W~~Lg~~AL~~g~~~lAe~c~~k~~d~~~L~lLy~~~g~-~~~L~kl~~~a~~~~~~n~af~~~~~lgd~~~cv~lL~ 426 (443)
T PF04053_consen 348 EKWKQLGDEALRQGNIELAEECYQKAKDFSGLLLLYSSTGD-REKLSKLAKIAEERGDINIAFQAALLLGDVEECVDLLI 426 (443)
T ss_dssp HHHHHHHHHHHHTTBHHHHHHHHHHCT-HHHHHHHHHHCT--HHHHHHHHHHHHHTT-HHHHHHHHHHHT-HHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHhhcCccccHHHHHHhCC-HHHHHHHHHHHHHccCHHHHHHHHHHcCCHHHHHHHHH
Confidence 46888889999999999999999999999999887776654 46788999998999999999999999999999888876
Q ss_pred hcC
Q psy11102 83 QDK 85 (87)
Q Consensus 83 ~~~ 85 (87)
+.+
T Consensus 427 ~~~ 429 (443)
T PF04053_consen 427 ETG 429 (443)
T ss_dssp HTT
T ss_pred HcC
Confidence 543
No 16
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=96.20 E-value=0.057 Score=43.14 Aligned_cols=69 Identities=19% Similarity=0.230 Sum_probs=52.3
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.++|++|..+|.++-. +..+..+++..|+++.-....+.. |.+...+..+|..+...|..++|++.|.
T Consensus 412 ~g~~~~A~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~P~~~~~~~~lg~~~~~~g~~~~A~~~~~ 491 (615)
T TIGR00990 412 KGEFAQAGKDYQKSIDLDPDFIFSHIQLGVTQYKEGSIASSMATFRRCKKNFPEAPDVYNYYGELLLDQNKFDEAIEKFD 491 (615)
T ss_pred cCCHHHHHHHHHHHHHcCccCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHccCHHHHHHHHH
Confidence 4678888888877732 224567888899998877776553 4455778889999999999999988775
Q ss_pred h
Q psy11102 70 K 70 (87)
Q Consensus 70 k 70 (87)
+
T Consensus 492 ~ 492 (615)
T TIGR00990 492 T 492 (615)
T ss_pred H
Confidence 5
No 17
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=96.08 E-value=0.0092 Score=43.58 Aligned_cols=14 Identities=29% Similarity=0.330 Sum_probs=9.4
Q ss_pred hHHHHHHHHHHhcc
Q psy11102 4 LVSQAREYYEKCHC 17 (87)
Q Consensus 4 ~w~~A~~yY~~~~n 17 (87)
+|+.|+++|.+++|
T Consensus 30 ~~e~Aa~~y~~Aa~ 43 (282)
T PF14938_consen 30 DYEEAADLYEKAAN 43 (282)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHH
Confidence 67777777766654
No 18
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=96.04 E-value=0.058 Score=42.62 Aligned_cols=83 Identities=19% Similarity=0.231 Sum_probs=61.9
Q ss_pred chhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLT 80 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~ 80 (87)
.++.+-|.+.|.++++++++.=.|.-.|+-++|.+|+....+. .--+.-.+.++ .|..++-|+.+++.|+...|-=+
T Consensus 360 ~g~~~lAe~c~~k~~d~~~L~lLy~~~g~~~~L~kl~~~a~~~~~~n~af~~~~~--lgd~~~cv~lL~~~~~~~~A~~~ 437 (443)
T PF04053_consen 360 QGNIELAEECYQKAKDFSGLLLLYSSTGDREKLSKLAKIAEERGDINIAFQAALL--LGDVEECVDLLIETGRLPEAALF 437 (443)
T ss_dssp TTBHHHHHHHHHHCT-HHHHHHHHHHCT-HHHHHHHHHHHHHTT-HHHHHHHHHH--HT-HHHHHHHHHHTT-HHHHHHH
T ss_pred cCCHHHHHHHHHhhcCccccHHHHHHhCCHHHHHHHHHHHHHccCHHHHHHHHHH--cCCHHHHHHHHHHcCCchHHHHH
Confidence 4789999999999999999999999999999999999887654 33333333333 39999999999999998887665
Q ss_pred HhhcCC
Q psy11102 81 VVQDKP 86 (87)
Q Consensus 81 cv~~~~ 86 (87)
.....|
T Consensus 438 A~ty~~ 443 (443)
T PF04053_consen 438 ARTYGP 443 (443)
T ss_dssp HHHTT-
T ss_pred HHhcCC
Confidence 554443
No 19
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=95.94 E-value=0.013 Score=42.10 Aligned_cols=70 Identities=23% Similarity=0.349 Sum_probs=38.7
Q ss_pred chhHHHHHHHHHHh-------ccHHHHHHHHHHhCCHHHHHHHHhhC---C--CC-CchHHHHHHHHHhCCChHHHHHHH
Q psy11102 2 TTLVSQAREYYEKC-------HCYEKLISVYTELGDFEALESCARKL---P--DS-SPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 2 ~~~w~~A~~yY~~~-------~n~~k~ie~~~~~ed~d~L~~l~~~L---~--~~-~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.+++++|.+++.++ ..+..+++++...++++.+.+++..+ + +. ..++...|..+...|..+.|+++|
T Consensus 90 ~~~~~~A~~~~~~~~~~~~~~~~l~~~l~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~a~~~~~~G~~~~A~~~~ 169 (280)
T PF13429_consen 90 DGDPEEALKLAEKAYERDGDPRYLLSALQLYYRLGDYDEAEELLEKLEELPAAPDSARFWLALAEIYEQLGDPDKALRDY 169 (280)
T ss_dssp ----------------------------H-HHHTT-HHHHHHHHHHHHH-T---T-HHHHHHHHHHHHHCCHHHHHHHHH
T ss_pred cccccccccccccccccccccchhhHHHHHHHHHhHHHHHHHHHHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHH
Confidence 46677787777665 45667889999999999999998874 2 22 367889999999999999999988
Q ss_pred Hhc
Q psy11102 69 DKN 71 (87)
Q Consensus 69 lk~ 71 (87)
-++
T Consensus 170 ~~a 172 (280)
T PF13429_consen 170 RKA 172 (280)
T ss_dssp HHH
T ss_pred HHH
Confidence 553
No 20
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=95.82 E-value=0.11 Score=41.46 Aligned_cols=70 Identities=21% Similarity=0.134 Sum_probs=50.2
Q ss_pred chhHHHHHHHHHHhc--------cHHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCH--------CYEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~--------n~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.++|++|..+|.++= -+-....++...|+++.-....+.. |++...+..+|..+...|..++|+..|.
T Consensus 344 ~g~~~eA~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~~p~~~~~~~~lg~~~~~~g~~~~A~~~~~ 423 (615)
T TIGR00990 344 KGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQ 423 (615)
T ss_pred cCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 367788888887752 2345667778888888776655442 4445778888888888999998888775
Q ss_pred hc
Q psy11102 70 KN 71 (87)
Q Consensus 70 k~ 71 (87)
++
T Consensus 424 ka 425 (615)
T TIGR00990 424 KS 425 (615)
T ss_pred HH
Confidence 54
No 21
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=95.64 E-value=0.068 Score=40.56 Aligned_cols=70 Identities=19% Similarity=0.202 Sum_probs=54.3
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.++|+.|+++|.++=. +-....+|..+|+|+.-....+.+ |........+|..+...|..+.|+.+|.
T Consensus 15 ~~~~~~Ai~~~~~Al~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a~~~lg~~~~~lg~~~eA~~~~~ 94 (356)
T PLN03088 15 DDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLRKGTACMKLEEYQTAKAALE 94 (356)
T ss_pred cCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHHHHHHHHHHHHhCCHHHHHHHHH
Confidence 4688889988888732 445677889999998877666553 4445778889999999999999999987
Q ss_pred hc
Q psy11102 70 KN 71 (87)
Q Consensus 70 k~ 71 (87)
++
T Consensus 95 ~a 96 (356)
T PLN03088 95 KG 96 (356)
T ss_pred HH
Confidence 64
No 22
>PLN03077 Protein ECB2; Provisional
Probab=95.36 E-value=0.27 Score=40.78 Aligned_cols=46 Identities=13% Similarity=0.153 Sum_probs=28.0
Q ss_pred HHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHH
Q psy11102 19 EKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQA 64 (87)
Q Consensus 19 ~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~A 64 (87)
..+|++|.+.|++++-.++.+.+++. ...-..+..-+...|..++|
T Consensus 428 n~Li~~y~k~g~~~~A~~vf~~m~~~d~vs~~~mi~~~~~~g~~~eA 474 (857)
T PLN03077 428 NALIEMYSKCKCIDKALEVFHNIPEKDVISWTSIIAGLRLNNRCFEA 474 (857)
T ss_pred HHHHHHHHHcCCHHHHHHHHHhCCCCCeeeHHHHHHHHHHCCCHHHH
Confidence 35677777888888888877777663 23334444444455554444
No 23
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=95.34 E-value=0.043 Score=30.93 Aligned_cols=53 Identities=17% Similarity=0.346 Sum_probs=37.8
Q ss_pred HHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 10 EYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 10 ~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
..|.+.|+++++++++ +++++.-|++...+..+|..+...|..++|+..|.++
T Consensus 5 ~~~~~~g~~~~A~~~~---------~~~l~~~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a 57 (65)
T PF13432_consen 5 RALYQQGDYDEAIAAF---------EQALKQDPDNPEAWYLLGRILYQQGRYDEALAYYERA 57 (65)
T ss_dssp HHHHHCTHHHHHHHHH---------HHHHCCSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHcCCHHHHHHHH---------HHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3455556666666554 3344445777789999999999999999999888654
No 24
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=95.31 E-value=0.15 Score=32.18 Aligned_cols=69 Identities=16% Similarity=0.157 Sum_probs=48.2
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.+++++|.++|.+.-. +..+..+++..++|+.-....+.. |++......+|..+...|..+.|+..|.
T Consensus 30 ~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~ 109 (135)
T TIGR02552 30 QGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALKALD 109 (135)
T ss_pred cccHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 4567777777766532 335777888888888766544432 4445667778888888899888888775
Q ss_pred h
Q psy11102 70 K 70 (87)
Q Consensus 70 k 70 (87)
+
T Consensus 110 ~ 110 (135)
T TIGR02552 110 L 110 (135)
T ss_pred H
Confidence 4
No 25
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=95.22 E-value=0.041 Score=31.24 Aligned_cols=59 Identities=20% Similarity=0.289 Sum_probs=45.0
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCC-ChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYG-LCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G-~~~~Av~aylk 70 (87)
..|..-...|.+-|++++++.+|-+.=++ -|++...+-.+|..+...| ..++|++.|.+
T Consensus 4 ~~~~~~g~~~~~~~~~~~A~~~~~~ai~~---------~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~ 63 (69)
T PF13414_consen 4 EAWYNLGQIYFQQGDYEEAIEYFEKAIEL---------DPNNAEAYYNLGLAYMKLGKDYEEAIEDFEK 63 (69)
T ss_dssp HHHHHHHHHHHHTTHHHHHHHHHHHHHHH---------STTHHHHHHHHHHHHHHTTTHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHc---------CCCCHHHHHHHHHHHHHhCccHHHHHHHHHH
Confidence 34666677777888888888876544322 3666789999999999999 79999988865
No 26
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=95.20 E-value=0.067 Score=30.60 Aligned_cols=49 Identities=18% Similarity=0.321 Sum_probs=35.5
Q ss_pred HHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 23 SVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 23 e~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
++|...++|+...+.++. -|++..+....|..+...|.+++|++.|.++
T Consensus 3 ~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~ 55 (73)
T PF13371_consen 3 QIYLQQEDYEEALEVLERALELDPDDPELWLQRARCLFQLGRYEEALEDLERA 55 (73)
T ss_pred HHHHhCCCHHHHHHHHHHHHHhCcccchhhHHHHHHHHHhccHHHHHHHHHHH
Confidence 345555555554444443 3667788899999999999999999988654
No 27
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=95.19 E-value=0.28 Score=31.71 Aligned_cols=66 Identities=21% Similarity=0.310 Sum_probs=39.4
Q ss_pred cHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 17 CYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 17 n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
-+..+|++|.+.+.-+.++-+.... +.=-+..++....+.|+.++|+..|.|.|+.+.|+++.++.
T Consensus 43 ~~~~li~ly~~~~~~~ll~~l~~~~--~~yd~~~~~~~c~~~~l~~~~~~l~~k~~~~~~Al~~~l~~ 108 (140)
T smart00299 43 LQTKLIELYAKYDPQKEIERLDNKS--NHYDIEKVGKLCEKAKLYEEAVELYKKDGNFKDAIVTLIEH 108 (140)
T ss_pred HHHHHHHHHHHHCHHHHHHHHHhcc--ccCCHHHHHHHHHHcCcHHHHHHHHHhhcCHHHHHHHHHHc
Confidence 3456677776665433333333121 12225566677777777777777777777777777777654
No 28
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=95.16 E-value=0.16 Score=33.56 Aligned_cols=50 Identities=12% Similarity=0.084 Sum_probs=23.4
Q ss_pred HHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 21 LISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+..++...|+++.-....+.. |.+...+..+|.-+...|..+.|+++|.+
T Consensus 64 lg~~~~~~g~~~~A~~~y~~Al~l~p~~~~a~~~lg~~l~~~g~~~eAi~~~~~ 117 (144)
T PRK15359 64 LAGTWMMLKEYTTAINFYGHALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQT 117 (144)
T ss_pred HHHHHHHHhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 333444444444443333331 22334455555555555555555555543
No 29
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=95.12 E-value=0.22 Score=32.60 Aligned_cols=69 Identities=19% Similarity=0.232 Sum_probs=41.1
Q ss_pred hhHHHHHHHHHHhcc----------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHC----------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n----------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
+++++|.++|.++-. ...+..+++..|+++.-....+.. |.+...+..+|..+...|..++|++.|
T Consensus 113 g~~~~A~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~~~~~~~~A~~~~ 192 (234)
T TIGR02521 113 GKYEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELAELYYLRGQYKDARAYL 192 (234)
T ss_pred ccHHHHHHHHHHHHhccccccchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCChHHHHHHHHHHHHcCCHHHHHHHH
Confidence 456666666666521 223456667777777665554442 333345666777777777777777666
Q ss_pred Hhc
Q psy11102 69 DKN 71 (87)
Q Consensus 69 lk~ 71 (87)
.++
T Consensus 193 ~~~ 195 (234)
T TIGR02521 193 ERY 195 (234)
T ss_pred HHH
Confidence 553
No 30
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=95.00 E-value=0.077 Score=29.91 Aligned_cols=42 Identities=21% Similarity=0.207 Sum_probs=25.8
Q ss_pred chhHHHHHHHHHHhccH--------HHHHHHHHHhCCHHHHHHHHhhCCC
Q psy11102 2 TTLVSQAREYYEKCHCY--------EKLISVYTELGDFEALESCARKLPD 43 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~--------~k~ie~~~~~ed~d~L~~l~~~L~~ 43 (87)
.|+|++|+++|.+.-.. -.+++||++.|+|++=.++...+..
T Consensus 4 ~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~ 53 (68)
T PF14559_consen 4 QGDYDEAIELLEKALQRNPDNPEARLLLAQCYLKQGQYDEAEELLERLLK 53 (68)
T ss_dssp TTHHHHHHHHHHHHHHHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCHG
T ss_pred ccCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 46777777777765332 2366666666666666666666644
No 31
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=94.97 E-value=0.2 Score=27.28 Aligned_cols=68 Identities=31% Similarity=0.455 Sum_probs=40.1
Q ss_pred hhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
++|++|..++.++-. .-....++...++++.-.+..... |.....+..+|..+...|..+.|...|.+
T Consensus 14 ~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~ 93 (100)
T cd00189 14 GDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLGKYEEALEAYEK 93 (100)
T ss_pred hcHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 566777777766422 334566677777776655544432 33334566666666666666666665544
No 32
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=94.83 E-value=0.42 Score=37.56 Aligned_cols=68 Identities=21% Similarity=0.252 Sum_probs=41.7
Q ss_pred hhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+++++|.++|.++-. ...++.++...|++++-.+..+. -|++...+..++..+...|..+.|+..|.+
T Consensus 479 ~~~~~A~~~~~~a~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~ 558 (899)
T TIGR02917 479 GDLAKAREAFEKALSIEPDFFPAAANLARIDIQEGNPDDAIQRFEKVLTIDPKNLRAILALAGLYLRTGNEEEAVAWLEK 558 (899)
T ss_pred CCHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 556666666665422 22355566677777654444433 344556677778888788888877665443
No 33
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=94.82 E-value=0.18 Score=39.63 Aligned_cols=70 Identities=14% Similarity=0.133 Sum_probs=51.2
Q ss_pred chhHHHHHHHHHHhcc-------HHHHHHHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 2 TTLVSQAREYYEKCHC-------YEKLISVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n-------~~k~ie~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
.++|++|.++|.++-. ...++.++...|++++-.+..+. -|++...+..+|..+...|..++|++.|.+
T Consensus 716 ~g~~~~A~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~l~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~ 795 (899)
T TIGR02917 716 QKDYPAAIQAYRKALKRAPSSQNAIKLHRALLASGNTAEAVKTLEAWLKTHPNDAVLRTALAELYLAQKDYDKAIKHYRT 795 (899)
T ss_pred CCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCcCHHHHHHHHHH
Confidence 4678888888877532 34577888888888777655554 355557788888888888999888888765
Q ss_pred c
Q psy11102 71 N 71 (87)
Q Consensus 71 ~ 71 (87)
.
T Consensus 796 ~ 796 (899)
T TIGR02917 796 V 796 (899)
T ss_pred H
Confidence 3
No 34
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=94.78 E-value=0.27 Score=36.19 Aligned_cols=70 Identities=20% Similarity=0.174 Sum_probs=46.3
Q ss_pred chhHHHHHHHHHHhcc---------HHHHHHHHHHhCCHHHHHHHHhhCC---CCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC---------YEKLISVYTELGDFEALESCARKLP---DSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n---------~~k~ie~~~~~ed~d~L~~l~~~L~---~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.+++++|.++|.+.-. +..++.+|...|+++.-.+..+.+- .+++....+|..+...|..++|...|.
T Consensus 227 ~g~~~~A~~~~~~~~~~~p~~~~~~~~~l~~~~~~~g~~~~A~~~l~~~~~~~p~~~~~~~la~~~~~~g~~~~A~~~l~ 306 (389)
T PRK11788 227 QGDYAAAIEALERVEEQDPEYLSEVLPKLMECYQALGDEAEGLEFLRRALEEYPGADLLLALAQLLEEQEGPEAAQALLR 306 (389)
T ss_pred CCCHHHHHHHHHHHHHHChhhHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCchHHHHHHHHHHHhCCHHHHHHHHH
Confidence 4667777777776552 2355677778888877776666532 233445677777777888888877775
Q ss_pred hc
Q psy11102 70 KN 71 (87)
Q Consensus 70 k~ 71 (87)
+.
T Consensus 307 ~~ 308 (389)
T PRK11788 307 EQ 308 (389)
T ss_pred HH
Confidence 43
No 35
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=94.76 E-value=0.23 Score=30.08 Aligned_cols=69 Identities=16% Similarity=0.127 Sum_probs=51.0
Q ss_pred chhHHHHHHHHHHhcc-----------HHHHHHHHHHhCCHHHHHHHHhhC----CCC---CchHHHHHHHHHhCCChHH
Q psy11102 2 TTLVSQAREYYEKCHC-----------YEKLISVYTELGDFEALESCARKL----PDS---SPLLKPMGEIFVKYGLCEQ 63 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n-----------~~k~ie~~~~~ed~d~L~~l~~~L----~~~---~~lL~~ia~~F~~~G~~~~ 63 (87)
.++|++|.++|.+.-. .-.+..+++..++|+.-....+.+ |.+ ...+..+|..+...|..++
T Consensus 15 ~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 94 (119)
T TIGR02795 15 AGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQELGDKEK 94 (119)
T ss_pred cCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCCCcccHHHHHHHHHHHHhCChHH
Confidence 4788899999877622 223788999999999777766654 332 2457888888888899888
Q ss_pred HHHHHHh
Q psy11102 64 AVYVFDK 70 (87)
Q Consensus 64 Av~aylk 70 (87)
|+..|.+
T Consensus 95 A~~~~~~ 101 (119)
T TIGR02795 95 AKATLQQ 101 (119)
T ss_pred HHHHHHH
Confidence 8888754
No 36
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=94.68 E-value=0.42 Score=41.29 Aligned_cols=67 Identities=12% Similarity=0.025 Sum_probs=51.7
Q ss_pred hhHHHHHHHHHHhcc------------------HHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHH
Q psy11102 3 TLVSQAREYYEKCHC------------------YEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQA 64 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n------------------~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~A 64 (87)
++|++|..++.+.-. .-..++.+...|++++-.++.+.-|.+.+.+..+|..+...|..++|
T Consensus 543 ~~~~~Al~~l~~l~~~~~~~~~~~l~~~l~~~~~l~~a~~l~~~G~~~eA~~~l~~~p~~~~~~~~La~~~~~~g~~~~A 622 (1157)
T PRK11447 543 DRDRAALAHLNTLPRAQWNSNIQELAQRLQSDQVLETANRLRDSGKEAEAEALLRQQPPSTRIDLTLADWAQQRGDYAAA 622 (1157)
T ss_pred CCHHHHHHHHHhCCchhcChhHHHHHHHHhhhHHHHHHHHHHHCCCHHHHHHHHHhCCCCchHHHHHHHHHHHcCCHHHH
Confidence 566777777665321 12346778889999999999998777777888899999999999999
Q ss_pred HHHHH
Q psy11102 65 VYVFD 69 (87)
Q Consensus 65 v~ayl 69 (87)
++.|.
T Consensus 623 ~~~y~ 627 (1157)
T PRK11447 623 RAAYQ 627 (1157)
T ss_pred HHHHH
Confidence 87764
No 37
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=94.56 E-value=0.2 Score=32.45 Aligned_cols=51 Identities=22% Similarity=0.227 Sum_probs=30.3
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHh-CCHHHHHHHHhhCCCCCchHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTEL-GDFEALESCARKLPDSSPLLKPMGEI 54 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~-ed~d~L~~l~~~L~~~~~lL~~ia~~ 54 (87)
+.|+.|+-+|.+.|+++.+++.+... +|++...+.+++.. +.++...++..
T Consensus 83 ~l~~~~~~l~~k~~~~~~Al~~~l~~~~d~~~a~~~~~~~~-~~~lw~~~~~~ 134 (140)
T smart00299 83 KLYEEAVELYKKDGNFKDAIVTLIEHLGNYEKAIEYFVKQN-NPELWAEVLKA 134 (140)
T ss_pred CcHHHHHHHHHhhcCHHHHHHHHHHcccCHHHHHHHHHhCC-CHHHHHHHHHH
Confidence 45667777777777777777776665 66666666666522 23344444433
No 38
>PF04840 Vps16_C: Vps16, C-terminal region; InterPro: IPR006925 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=94.35 E-value=0.21 Score=37.81 Aligned_cols=50 Identities=18% Similarity=0.370 Sum_probs=42.4
Q ss_pred HHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCC-chH-HHHHHHH
Q psy11102 6 SQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSS-PLL-KPMGEIF 55 (87)
Q Consensus 6 ~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~-~lL-~~ia~~F 55 (87)
++-+++|.++|++..|++..++.+|.+.|..|-+..|.++ +.. ..|-.|.
T Consensus 266 ~~rv~~y~~~~~~~~A~~~A~~~kd~~~L~~i~~~~~~~~~~~~~~~i~~~~ 317 (319)
T PF04840_consen 266 EERVEMYLKCGDYKEAAQEAFKEKDIDLLKQILKRCPGNNDQLIADKIEQML 317 (319)
T ss_pred HHHHHHHHHCCCHHHHHHHHHHcCCHHHHHHHHHHCCCCChHHHHHHHHHHH
Confidence 6788999999999999999999999999999999999864 433 4555443
No 39
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=94.34 E-value=0.3 Score=31.94 Aligned_cols=66 Identities=12% Similarity=0.130 Sum_probs=31.3
Q ss_pred hhHHHHHHHHHHhccH--------HHHHHHHHHhCCHHHHHHHHhhCC------CCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCY--------EKLISVYTELGDFEALESCARKLP------DSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~--------~k~ie~~~~~ed~d~L~~l~~~L~------~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
+++++|.++|.++-.. ..+..++...|+++.-.+..+..- .....+..+|..+...|..++|.+.|
T Consensus 79 ~~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~ 158 (234)
T TIGR02521 79 GELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYL 158 (234)
T ss_pred CCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcccHHHHHHHHHHHHhccccccchHHHHHHHHHHHHcCCHHHHHHHH
Confidence 4555555555544221 123445555555555444444321 11234445555556666666555544
No 40
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=93.82 E-value=0.81 Score=35.86 Aligned_cols=79 Identities=20% Similarity=0.121 Sum_probs=56.3
Q ss_pred chhHHHHHHHHHH-----hccHHHHHHHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCChHHHH-------
Q psy11102 2 TTLVSQAREYYEK-----CHCYEKLISVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGLCEQAV------- 65 (87)
Q Consensus 2 ~~~w~~A~~yY~~-----~~n~~k~ie~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~~~~Av------- 65 (87)
++.|+.|.+++.+ -...-.++.++...+.-.+-.+++.+ .|.+..+|..-|+++.+.|..+.|.
T Consensus 182 t~~~~~ai~lle~L~~~~pev~~~LA~v~l~~~~E~~AI~ll~~aL~~~p~d~~LL~~Qa~fLl~k~~~~lAL~iAk~av 261 (395)
T PF09295_consen 182 TQRYDEAIELLEKLRERDPEVAVLLARVYLLMNEEVEAIRLLNEALKENPQDSELLNLQAEFLLSKKKYELALEIAKKAV 261 (395)
T ss_pred cccHHHHHHHHHHHHhcCCcHHHHHHHHHHhcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHH
Confidence 4567777777776 44445566666665554444444333 3556799999999999999988875
Q ss_pred --------------HHHHhcCCHHHHHHH
Q psy11102 66 --------------YVFDKNKHKSSQWLT 80 (87)
Q Consensus 66 --------------~aylk~gd~k~ai~~ 80 (87)
++|++.||++.|+-+
T Consensus 262 ~lsP~~f~~W~~La~~Yi~~~d~e~ALla 290 (395)
T PF09295_consen 262 ELSPSEFETWYQLAECYIQLGDFENALLA 290 (395)
T ss_pred HhCchhHHHHHHHHHHHHhcCCHHHHHHH
Confidence 579999999999854
No 41
>KOG0553|consensus
Probab=93.79 E-value=0.27 Score=37.55 Aligned_cols=71 Identities=20% Similarity=0.307 Sum_probs=54.8
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhh-C--CCC-CchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARK-L--PDS-SPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~-L--~~~-~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.++|.+|++.|.++=. |..=+..|.++|.|+.-.+=|.+ | +.. ++...++|.-+...|-++.|+++|-
T Consensus 94 ~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~~gk~~~A~~ayk 173 (304)
T KOG0553|consen 94 NKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLALGKYEEAIEAYK 173 (304)
T ss_pred hhhHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcchHHHHHHHHHHHhcChHHHHHHHHHHHHHHccCcHHHHHHHHH
Confidence 4678888888887744 34556678888888766655554 3 444 6999999999999999999999987
Q ss_pred hcC
Q psy11102 70 KNK 72 (87)
Q Consensus 70 k~g 72 (87)
|+=
T Consensus 174 KaL 176 (304)
T KOG0553|consen 174 KAL 176 (304)
T ss_pred hhh
Confidence 753
No 42
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=93.37 E-value=0.53 Score=25.53 Aligned_cols=54 Identities=22% Similarity=0.355 Sum_probs=33.2
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+..|.+.|++++++..+ .+.++.-|........+|..+...|..+.|++.|.+.
T Consensus 7 a~~~~~~~~~~~A~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~ 60 (100)
T cd00189 7 GNLYYKLGDYDEALEYY---------EKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKA 60 (100)
T ss_pred HHHHHHHhcHHHHHHHH---------HHHHhcCCccHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34444555666655543 2223333444466778888888888888888877553
No 43
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=93.21 E-value=0.54 Score=34.53 Aligned_cols=48 Identities=23% Similarity=0.221 Sum_probs=20.7
Q ss_pred HHHHHHHHhCCHHHHHHHHhhCCC----CCchHHHHHHHHHhCCChHHHHHH
Q psy11102 20 KLISVYTELGDFEALESCARKLPD----SSPLLKPMGEIFVKYGLCEQAVYV 67 (87)
Q Consensus 20 k~ie~~~~~ed~d~L~~l~~~L~~----~~~lL~~ia~~F~~~G~~~~Av~a 67 (87)
.++.+|...|+|+.-..+.+...+ +...+..++..+...|..++|++.
T Consensus 112 ~La~~~~~~g~~~~A~~~~~~~l~~~~~~~~~~~~la~~~~~~g~~~~A~~~ 163 (389)
T PRK11788 112 ELGQDYLKAGLLDRAEELFLQLVDEGDFAEGALQQLLEIYQQEKDWQKAIDV 163 (389)
T ss_pred HHHHHHHHCCCHHHHHHHHHHHHcCCcchHHHHHHHHHHHHHhchHHHHHHH
Confidence 344445555555444444333211 123344444444444554444433
No 44
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=92.95 E-value=0.14 Score=37.28 Aligned_cols=22 Identities=14% Similarity=0.226 Sum_probs=10.5
Q ss_pred HHHHHHHHHHhccHHHHHHHHH
Q psy11102 5 VSQAREYYEKCHCYEKLISVYT 26 (87)
Q Consensus 5 w~~A~~yY~~~~n~~k~ie~~~ 26 (87)
+.+|+++|...|++..++.++.
T Consensus 97 ~~~A~~~y~~~G~~~~aA~~~~ 118 (282)
T PF14938_consen 97 YEKAIEIYREAGRFSQAAKCLK 118 (282)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHH
T ss_pred HHHHHHHHHhcCcHHHHHHHHH
Confidence 3445555555555555555444
No 45
>KOG1920|consensus
Probab=92.71 E-value=0.32 Score=42.92 Aligned_cols=84 Identities=8% Similarity=0.098 Sum_probs=47.4
Q ss_pred hhHHHHHHHHHHhc--cHHHHHHHHHHhCCHHHHHHHHhhCCCC-CchH-------------HHHHHHHHhCCChHHHHH
Q psy11102 3 TLVSQAREYYEKCH--CYEKLISVYTELGDFEALESCARKLPDS-SPLL-------------KPMGEIFVKYGLCEQAVY 66 (87)
Q Consensus 3 ~~w~~A~~yY~~~~--n~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL-------------~~ia~~F~~~G~~~~Av~ 66 (87)
++|++|......+| .++..++.--+++.|+..-.+.+-=++. ...+ .+-|-+|...|..++|..
T Consensus 894 ~ry~~AL~hLs~~~~~~~~e~~n~I~kh~Ly~~aL~ly~~~~e~~k~i~~~ya~hL~~~~~~~~Aal~Ye~~GklekAl~ 973 (1265)
T KOG1920|consen 894 KRYEDALSHLSECGETYFPECKNYIKKHGLYDEALALYKPDSEKQKVIYEAYADHLREELMSDEAALMYERCGKLEKALK 973 (1265)
T ss_pred HHHHHHHHHHHHcCccccHHHHHHHHhcccchhhhheeccCHHHHHHHHHHHHHHHHHhccccHHHHHHHHhccHHHHHH
Confidence 56777777777777 4555555555555555443332211111 1222 244556666666666666
Q ss_pred HHHhcCCHHHHHHHHhhcCC
Q psy11102 67 VFDKNKHKSSQWLTVVQDKP 86 (87)
Q Consensus 67 aylk~gd~k~ai~~cv~~~~ 86 (87)
||.-+||.+.|+.+..++++
T Consensus 974 a~~~~~dWr~~l~~a~ql~~ 993 (1265)
T KOG1920|consen 974 AYKECGDWREALSLAAQLSE 993 (1265)
T ss_pred HHHHhccHHHHHHHHHhhcC
Confidence 66666666666666666554
No 46
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=92.71 E-value=0.27 Score=35.20 Aligned_cols=69 Identities=20% Similarity=0.374 Sum_probs=42.0
Q ss_pred hhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
|++++|...|.++=. ...++..+...|+++++.+++..+ |.+..+...+|......|..++|+..|.+
T Consensus 160 G~~~~A~~~~~~al~~~P~~~~~~~~l~~~li~~~~~~~~~~~l~~~~~~~~~~~~~~~~la~~~~~lg~~~~Al~~~~~ 239 (280)
T PF13429_consen 160 GDPDKALRDYRKALELDPDDPDARNALAWLLIDMGDYDEAREALKRLLKAAPDDPDLWDALAAAYLQLGRYEEALEYLEK 239 (280)
T ss_dssp CHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHCTTCHHHHHHHHHHHHHHH-HTSCCHCHHHHHHHHHHT-HHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCChHHHHHHHHHHHHHCcCHHHHHHHHHHHhccccccccccccccc
Confidence 455556655555422 122344555667887755555443 55556677788888888888888887766
Q ss_pred c
Q psy11102 71 N 71 (87)
Q Consensus 71 ~ 71 (87)
.
T Consensus 240 ~ 240 (280)
T PF13429_consen 240 A 240 (280)
T ss_dssp H
T ss_pred c
Confidence 3
No 47
>KOG2280|consensus
Probab=92.47 E-value=0.43 Score=40.49 Aligned_cols=64 Identities=13% Similarity=0.217 Sum_probs=52.4
Q ss_pred HHHHHHhCCHHHHHHHHhhC--CCC-CchHH---------HHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhcC
Q psy11102 22 ISVYTELGDFEALESCARKL--PDS-SPLLK---------PMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQDK 85 (87)
Q Consensus 22 ie~~~~~ed~d~L~~l~~~L--~~~-~~lL~---------~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~~ 85 (87)
+..+...++|++|++++.+- |-+ .|..+ +...++-.++...+=|.+|+++|+++.|+++..+.|
T Consensus 722 ~~aLa~~~kweeLekfAkskksPIGy~PFVe~c~~~~n~~EA~KYiprv~~l~ekv~ay~~~~~~~eAad~A~~~r 797 (829)
T KOG2280|consen 722 LTALADIKKWEELEKFAKSKKSPIGYLPFVEACLKQGNKDEAKKYIPRVGGLQEKVKAYLRVGDVKEAADLAAEHR 797 (829)
T ss_pred HHHHHhhhhHHHHHHHHhccCCCCCchhHHHHHHhcccHHHHhhhhhccCChHHHHHHHHHhccHHHHHHHHHHhc
Confidence 45677899999999999885 335 46544 456678888888899999999999999999998876
No 48
>KOG0548|consensus
Probab=92.20 E-value=0.57 Score=38.23 Aligned_cols=85 Identities=16% Similarity=0.114 Sum_probs=63.7
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC----CCchHHHHHHHHHhCCChHHHHH------------
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPD----SSPLLKPMGEIFVKYGLCEQAVY------------ 66 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~----~~~lL~~ia~~F~~~G~~~~Av~------------ 66 (87)
++++.|++||.++--..+--+++-.+.+.++-.+......- ........|.-|-+.|-++.||.
T Consensus 312 ~~~~~ai~~~~kaLte~Rt~~~ls~lk~~Ek~~k~~e~~a~~~pe~A~e~r~kGne~Fk~gdy~~Av~~YteAIkr~P~D 391 (539)
T KOG0548|consen 312 EDYEGAIKYYQKALTEHRTPDLLSKLKEAEKALKEAERKAYINPEKAEEEREKGNEAFKKGDYPEAVKHYTEAIKRDPED 391 (539)
T ss_pred HhHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHhhChhHHHHHHHHHHHHHhccCHHHHHHHHHHHHhcCCch
Confidence 57788899998876666667788888877777766666543 34556677888999999998886
Q ss_pred ---------HHHhcCCHHHHHH---HHhhcCCC
Q psy11102 67 ---------VFDKNKHKSSQWL---TVVQDKPS 87 (87)
Q Consensus 67 ---------aylk~gd~k~ai~---~cv~~~~~ 87 (87)
||+|+|.+..|+. .|++++|+
T Consensus 392 a~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~ 424 (539)
T KOG0548|consen 392 ARLYSNRAACYLKLGEYPEALKDAKKCIELDPN 424 (539)
T ss_pred hHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCch
Confidence 5677888887775 67777774
No 49
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=92.14 E-value=0.092 Score=30.52 Aligned_cols=60 Identities=20% Similarity=0.349 Sum_probs=39.3
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCC----chHHHHHHHHHhCCChHHHHHHHHhcCCH
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKLPDSS----PLLKPMGEIFVKYGLCEQAVYVFDKNKHK 74 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~----~lL~~ia~~F~~~G~~~~Av~aylk~gd~ 74 (87)
+..|..-|+++++++++-+ -.++.+.+++++ ..+..+|..+...|..++|++.|.++=++
T Consensus 12 a~~~~~~~~~~~A~~~~~~------al~~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i 75 (78)
T PF13424_consen 12 ARVYRELGRYDEALDYYEK------ALDIEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKALDI 75 (78)
T ss_dssp HHHHHHTT-HHHHHHHHHH------HHHHHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHH
T ss_pred HHHHHHcCCHHHHHHHHHH------HHHHHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhh
Confidence 3445555666666655433 333455566543 45788999999999999999999876443
No 50
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=91.85 E-value=2.2 Score=34.74 Aligned_cols=41 Identities=20% Similarity=0.274 Sum_probs=23.7
Q ss_pred hhHHHHHHHHHHhc----------cHHHHHHHHHHhCCHHHHHHHHhhCCC
Q psy11102 3 TLVSQAREYYEKCH----------CYEKLISVYTELGDFEALESCARKLPD 43 (87)
Q Consensus 3 ~~w~~A~~yY~~~~----------n~~k~ie~~~~~ed~d~L~~l~~~L~~ 43 (87)
++.++|.++|..-. -+.-+|++|.+.|++++-.++++..|.
T Consensus 440 g~~~~a~~~f~~m~~~~g~~p~~~~y~~li~~l~r~G~~~eA~~~~~~~~~ 490 (697)
T PLN03081 440 GLSEQGWEIFQSMSENHRIKPRAMHYACMIELLGREGLLDEAYAMIRRAPF 490 (697)
T ss_pred CcHHHHHHHHHHHHHhcCCCCCccchHhHHHHHHhcCCHHHHHHHHHHCCC
Confidence 44555555555432 134466666666666666666666553
No 51
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=91.70 E-value=1.4 Score=35.84 Aligned_cols=53 Identities=21% Similarity=0.342 Sum_probs=39.5
Q ss_pred HHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 19 EKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 19 ~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
..+|++|.+.|++++-.++.+.+++. ...-..+..-|.+.|..+.|.+.|.+.
T Consensus 263 n~Li~~y~k~g~~~~A~~vf~~m~~~~~vt~n~li~~y~~~g~~~eA~~lf~~M 316 (697)
T PLN03081 263 CALIDMYSKCGDIEDARCVFDGMPEKTTVAWNSMLAGYALHGYSEEALCLYYEM 316 (697)
T ss_pred HHHHHHHHHCCCHHHHHHHHHhCCCCChhHHHHHHHHHHhCCCHHHHHHHHHHH
Confidence 35678888888888888888888774 455667777777888888887777443
No 52
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=91.70 E-value=1.3 Score=32.57 Aligned_cols=56 Identities=20% Similarity=0.224 Sum_probs=30.7
Q ss_pred hHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC---CchHHHHHHHHHhCCChHHHHHHH
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS---SPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~---~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.|+.|..++.+.|++.+++.. +..+++.-|++ ...+-.+|..+...|..+.|+..|
T Consensus 145 ~Y~~A~~l~~~~~~y~~Ai~a---------f~~fl~~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f 203 (263)
T PRK10803 145 DYNAAIALVQDKSRQDDAIVA---------FQNFVKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYF 203 (263)
T ss_pred HHHHHHHHHHhcCCHHHHHHH---------HHHHHHHCcCCcchHHHHHHHHHHHHHcCCHHHHHHHH
Confidence 455666665555666666544 34444444543 144555666655666665555544
No 53
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=91.68 E-value=1.5 Score=29.35 Aligned_cols=58 Identities=19% Similarity=0.248 Sum_probs=35.4
Q ss_pred hhHHHHHHHHHHhc-----------cHHHHHHHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCC
Q psy11102 3 TLVSQAREYYEKCH-----------CYEKLISVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGL 60 (87)
Q Consensus 3 ~~w~~A~~yY~~~~-----------n~~k~ie~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~ 60 (87)
+++++|..+|.++= -+..+..++...|+|+.-....+. .|.....+..+|..+...|.
T Consensus 49 g~~~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~lg~~~~~~g~ 121 (172)
T PRK02603 49 GEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALNNIAVIYHKRGE 121 (172)
T ss_pred CCHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCC
Confidence 56777777777653 244556777777887776554443 23344555666766666554
No 54
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=91.67 E-value=1.9 Score=28.28 Aligned_cols=69 Identities=16% Similarity=0.182 Sum_probs=51.2
Q ss_pred chhHHHHHHHHHHhcc-----------HHHHHHHHHHhCCHHHHHHHHhhC----CC---CCchHHHHHHHHHhCCChHH
Q psy11102 2 TTLVSQAREYYEKCHC-----------YEKLISVYTELGDFEALESCARKL----PD---SSPLLKPMGEIFVKYGLCEQ 63 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n-----------~~k~ie~~~~~ed~d~L~~l~~~L----~~---~~~lL~~ia~~F~~~G~~~~ 63 (87)
.|++++|+.+|.++=. +-.+...|-.+|++|+-..+.+.. |+ +..+-.-.+.-+.+.|..++
T Consensus 14 ~G~~~~Ai~~Y~~Al~~gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~p~~~~~~~l~~f~Al~L~~~gr~~e 93 (120)
T PF12688_consen 14 LGREEEAIPLYRRALAAGLSGADRRRALIQLASTLRNLGRYDEALALLEEALEEFPDDELNAALRVFLALALYNLGRPKE 93 (120)
T ss_pred cCCHHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHHCCCHHH
Confidence 5789999999997633 223777888999999877666654 54 33556667778888888888
Q ss_pred HHHHHHh
Q psy11102 64 AVYVFDK 70 (87)
Q Consensus 64 Av~aylk 70 (87)
|++.++.
T Consensus 94 Al~~~l~ 100 (120)
T PF12688_consen 94 ALEWLLE 100 (120)
T ss_pred HHHHHHH
Confidence 8888875
No 55
>KOG2003|consensus
Probab=91.62 E-value=0.85 Score=37.58 Aligned_cols=28 Identities=14% Similarity=0.352 Sum_probs=21.0
Q ss_pred CchhHHHHHHHHHH---------------------hccHHHHHHHHHHh
Q psy11102 1 MTTLVSQAREYYEK---------------------CHCYEKLISVYTEL 28 (87)
Q Consensus 1 ~~~~w~~A~~yY~~---------------------~~n~~k~ie~~~~~ 28 (87)
|+|++++|++.|.. -||++.+++|++++
T Consensus 502 ~ngd~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~kl 550 (840)
T KOG2003|consen 502 ANGDLDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKL 550 (840)
T ss_pred ecCcHHHHHHHHHHHHcCchHHHHHHHHhcccHHHhcCHHHHHHHHHHH
Confidence 57889999998864 46777777777753
No 56
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=91.61 E-value=0.73 Score=28.91 Aligned_cols=52 Identities=12% Similarity=0.084 Sum_probs=34.0
Q ss_pred HHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 11 YYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 11 yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.|.+.|+++++++++ ++++..-|.+......+|..+...|.+++|+..|.++
T Consensus 26 ~~~~~~~~~~A~~~~---------~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~ 77 (135)
T TIGR02552 26 NLYQQGRYDEALKLF---------QLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALA 77 (135)
T ss_pred HHHHcccHHHHHHHH---------HHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344445566655553 2222223556688889999999999999998766544
No 57
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=91.59 E-value=1.5 Score=30.64 Aligned_cols=69 Identities=14% Similarity=0.070 Sum_probs=39.2
Q ss_pred hhHHHHHHHHHHhccH--------HHHHHHH-HHhCCH--HHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCY--------EKLISVY-TELGDF--EALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYV 67 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~--------~k~ie~~-~~~ed~--d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~a 67 (87)
+++++|...|.++-.. -.++.++ +..|++ +.-.++.+.. |++...+..+|..+...|..++|+..
T Consensus 87 g~~~~A~~a~~~Al~l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~~~al~~LA~~~~~~g~~~~Ai~~ 166 (198)
T PRK10370 87 NDYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALMLLASDAFMQADYAQAIEL 166 (198)
T ss_pred CCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCChhHHHHHHHHHHHcCCHHHHHHH
Confidence 4555666666555432 1233443 454552 4433333332 44456777778777888888888888
Q ss_pred HHhc
Q psy11102 68 FDKN 71 (87)
Q Consensus 68 ylk~ 71 (87)
|.+.
T Consensus 167 ~~~a 170 (198)
T PRK10370 167 WQKV 170 (198)
T ss_pred HHHH
Confidence 7764
No 58
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=91.44 E-value=1.8 Score=35.38 Aligned_cols=25 Identities=16% Similarity=0.339 Sum_probs=13.9
Q ss_pred CCCCchHHHHHHHHHhCCChHHHHH
Q psy11102 42 PDSSPLLKPMGEIFVKYGLCEQAVY 66 (87)
Q Consensus 42 ~~~~~lL~~ia~~F~~~G~~~~Av~ 66 (87)
|++...+..+|..+...|..++|+.
T Consensus 281 P~~~~a~~~lg~~l~~~g~~~eA~~ 305 (656)
T PRK15174 281 SDNVRIVTLYADALIRTGQNEKAIP 305 (656)
T ss_pred CCCHHHHHHHHHHHHHCCCHHHHHH
Confidence 3334555566666666666666643
No 59
>PLN03077 Protein ECB2; Provisional
Probab=91.39 E-value=2.4 Score=35.31 Aligned_cols=53 Identities=21% Similarity=0.312 Sum_probs=40.5
Q ss_pred HHHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 18 YEKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+.-+|.+|.+.|+++.-.++.+.+++. ......+..-|.+.|..++|.+.|.+
T Consensus 326 ~n~Li~~y~k~g~~~~A~~vf~~m~~~d~~s~n~li~~~~~~g~~~~A~~lf~~ 379 (857)
T PLN03077 326 CNSLIQMYLSLGSWGEAEKVFSRMETKDAVSWTAMISGYEKNGLPDKALETYAL 379 (857)
T ss_pred HHHHHHHHHhcCCHHHHHHHHhhCCCCCeeeHHHHHHHHHhCCCHHHHHHHHHH
Confidence 446788888889999888888888764 35566777777888888888777764
No 60
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=91.31 E-value=2.4 Score=34.72 Aligned_cols=66 Identities=12% Similarity=0.028 Sum_probs=32.9
Q ss_pred hhHHHHHHHHHHhc--------cHHHHHHHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 3 TLVSQAREYYEKCH--------CYEKLISVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 3 ~~w~~A~~yY~~~~--------n~~k~ie~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
++++.|.+.|.+.- -+..+..++...|+++.-....+. -|++......+|..+...|..++|+.+|
T Consensus 90 g~~~~A~~~l~~~l~~~P~~~~a~~~la~~l~~~g~~~~Ai~~l~~Al~l~P~~~~a~~~la~~l~~~g~~~eA~~~~ 167 (656)
T PRK15174 90 SQPDAVLQVVNKLLAVNVCQPEDVLLVASVLLKSKQYATVADLAEQAWLAFSGNSQIFALHLRTLVLMDKELQAISLA 167 (656)
T ss_pred CCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHCCChHHHHHHH
Confidence 45555555555541 123344555555555554333332 1333455555666666666666665544
No 61
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=90.93 E-value=0.5 Score=24.17 Aligned_cols=23 Identities=26% Similarity=0.563 Sum_probs=19.4
Q ss_pred HHHHHHHHHhCCChHHHHHHHHh
Q psy11102 48 LKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 48 L~~ia~~F~~~G~~~~Av~aylk 70 (87)
+..+|..+.+.|.+++|++.|.+
T Consensus 2 l~~Lg~~~~~~g~~~~Ai~~y~~ 24 (36)
T PF13176_consen 2 LNNLGRIYRQQGDYEKAIEYYEQ 24 (36)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHH
Confidence 56789999999999999999987
No 62
>KOG2076|consensus
Probab=90.92 E-value=0.98 Score=38.87 Aligned_cols=48 Identities=21% Similarity=0.218 Sum_probs=38.4
Q ss_pred HHhhCCC-CCchHHHHHHHHHhCCChHHHHHHHHhc---------------------CCHHHHHHHHhhc
Q psy11102 37 CARKLPD-SSPLLKPMGEIFVKYGLCEQAVYVFDKN---------------------KHKSSQWLTVVQD 84 (87)
Q Consensus 37 l~~~L~~-~~~lL~~ia~~F~~~G~~~~Av~aylk~---------------------gd~k~ai~~cv~~ 84 (87)
++-.|.+ +.++=..+++++.+.|+.++|.-+|.|+ |+.+.|+++..++
T Consensus 198 lAAHL~p~d~e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~~ers~L~~~~G~~~~Am~~f~~l 267 (895)
T KOG2076|consen 198 LAAHLNPKDYELWKRLADLSEQLGNINQARYCYSRAIQANPSNWELIYERSSLYQKTGDLKRAMETFLQL 267 (895)
T ss_pred HHHhcCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHhChHHHHHHHHHHH
Confidence 3455665 4699999999999999999999988764 8888888777654
No 63
>KOG4626|consensus
Probab=90.71 E-value=0.54 Score=39.82 Aligned_cols=54 Identities=20% Similarity=0.339 Sum_probs=38.2
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+..|.|-||+.++|.||-+. +|--|.-..-+..+|.-++..|..+.|+++|.|+
T Consensus 395 a~i~kqqgnl~~Ai~~Ykea---------lrI~P~fAda~~NmGnt~ke~g~v~~A~q~y~rA 448 (966)
T KOG4626|consen 395 ASIYKQQGNLDDAIMCYKEA---------LRIKPTFADALSNMGNTYKEMGDVSAAIQCYTRA 448 (966)
T ss_pred HHHHHhcccHHHHHHHHHHH---------HhcCchHHHHHHhcchHHHHhhhHHHHHHHHHHH
Confidence 45788999999999998653 2323444556777777777777777777777763
No 64
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=90.67 E-value=1.8 Score=35.93 Aligned_cols=68 Identities=15% Similarity=0.258 Sum_probs=46.9
Q ss_pred chhHHHHHHHHHHh-----cc---HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKC-----HC---YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~-----~n---~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.++|++|.++|.++ ++ ...++.++...++++.-....+.+ |++.. +..+|..+...|..++|+.+|.
T Consensus 62 ~g~~~~A~~~~~~al~~~P~~~~a~~~la~~l~~~g~~~eA~~~l~~~l~~~P~~~~-~~~la~~l~~~g~~~~Al~~l~ 140 (765)
T PRK10049 62 LKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAGQYDEALVKAKQLVSGAPDKAN-LLALAYVYKRAGRHWDELRAMT 140 (765)
T ss_pred cCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHH-HHHHHHHHHHCCCHHHHHHHHH
Confidence 46788888888872 22 345567777888888855554443 55545 7778888888888888887774
Q ss_pred h
Q psy11102 70 K 70 (87)
Q Consensus 70 k 70 (87)
+
T Consensus 141 ~ 141 (765)
T PRK10049 141 Q 141 (765)
T ss_pred H
Confidence 3
No 65
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=90.55 E-value=0.73 Score=25.73 Aligned_cols=49 Identities=18% Similarity=0.324 Sum_probs=35.7
Q ss_pred HhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 14 KCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 14 ~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+.|+++.|++.+ +++...-|++.+....+|..+...|..++|.+.+.+.
T Consensus 3 ~~~~~~~A~~~~---------~~~l~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~ 51 (68)
T PF14559_consen 3 KQGDYDEAIELL---------EKALQRNPDNPEARLLLAQCYLKQGQYDEAEELLERL 51 (68)
T ss_dssp HTTHHHHHHHHH---------HHHHHHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCC
T ss_pred hccCHHHHHHHH---------HHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 456666666554 4455556888888889999999999999999888764
No 66
>PLN03218 maturation of RBCL 1; Provisional
Probab=90.34 E-value=3.9 Score=35.82 Aligned_cols=48 Identities=10% Similarity=0.104 Sum_probs=21.5
Q ss_pred HHHHHHHhCCHHHHHHHHhhCCC-----CCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 21 LISVYTELGDFEALESCARKLPD-----SSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L~~-----~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
+|+.|.+.|+++.-.++.+.+.+ +.....-+...+.+.|..+.|.+.|
T Consensus 655 LI~a~~k~G~~eeA~~l~~eM~k~G~~pd~~tynsLI~ay~k~G~~eeA~~lf 707 (1060)
T PLN03218 655 LVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWKKALELY 707 (1060)
T ss_pred HHHHHHhCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHhCCCHHHHHHHH
Confidence 34444555555554444443321 1223444445555555555554433
No 67
>KOG3060|consensus
Probab=90.28 E-value=0.95 Score=34.28 Aligned_cols=69 Identities=23% Similarity=0.312 Sum_probs=51.4
Q ss_pred chhHHHHHHHHHHh---------ccHHHHHHHHHHhCCHHHHHHHHhhCCC---CCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKC---------HCYEKLISVYTELGDFEALESCARKLPD---SSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~---------~n~~k~ie~~~~~ed~d~L~~l~~~L~~---~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
+++|++|.+||..- =...|.+-.-.+-...+.++.|..=|.. +.+.-.+++++|.+.|++++|+-||.
T Consensus 99 ~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAilka~GK~l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClE 178 (289)
T KOG3060|consen 99 TGNYKEAIEYYESLLEDDPTDTVIRKRKLAILKAQGKNLEAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLE 178 (289)
T ss_pred hhchhhHHHHHHHHhccCcchhHHHHHHHHHHHHcCCcHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHhHHHHHHHHHH
Confidence 47889999999752 1235566666666666766666666544 56889999999999999999999986
Q ss_pred h
Q psy11102 70 K 70 (87)
Q Consensus 70 k 70 (87)
-
T Consensus 179 E 179 (289)
T KOG3060|consen 179 E 179 (289)
T ss_pred H
Confidence 4
No 68
>PLN03218 maturation of RBCL 1; Provisional
Probab=90.26 E-value=2.6 Score=36.88 Aligned_cols=82 Identities=11% Similarity=0.101 Sum_probs=53.5
Q ss_pred hhHHHHHHHHHHhcc---------HHHHHHHHHHhCCHHHHHHHHhhCCC-----CCchHHHHHHHHHhCCChHHHHH--
Q psy11102 3 TLVSQAREYYEKCHC---------YEKLISVYTELGDFEALESCARKLPD-----SSPLLKPMGEIFVKYGLCEQAVY-- 66 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n---------~~k~ie~~~~~ed~d~L~~l~~~L~~-----~~~lL~~ia~~F~~~G~~~~Av~-- 66 (87)
++++.|.++|.+... +.-+|.+|.+.|+++...++.+...+ +......+...|.+.|..+.|++
T Consensus 451 g~~e~A~~lf~~M~~~Gl~pD~~tynsLI~~y~k~G~vd~A~~vf~eM~~~Gv~PdvvTynaLI~gy~k~G~~eeAl~lf 530 (1060)
T PLN03218 451 QDIDGALRVLRLVQEAGLKADCKLYTTLISTCAKSGKVDAMFEVFHEMVNAGVEANVHTFGALIDGCARAGQVAKAFGAY 530 (1060)
T ss_pred cCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHhCcCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHCcCHHHHHHHH
Confidence 566667766655432 45677888888888887777777653 22345566677777788777754
Q ss_pred --------------------HHHhcCCHHHHHHHHhhc
Q psy11102 67 --------------------VFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 67 --------------------aylk~gd~k~ai~~cv~~ 84 (87)
+|.+.|+++.|.++.-++
T Consensus 531 ~~M~~~Gv~PD~vTYnsLI~a~~k~G~~deA~~lf~eM 568 (1060)
T PLN03218 531 GIMRSKNVKPDRVVFNALISACGQSGAVDRAFDVLAEM 568 (1060)
T ss_pred HHHHHcCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 445567777777666554
No 69
>KOG1586|consensus
Probab=90.19 E-value=0.53 Score=35.45 Aligned_cols=10 Identities=20% Similarity=0.066 Sum_probs=4.1
Q ss_pred HHHHHHHhcc
Q psy11102 8 AREYYEKCHC 17 (87)
Q Consensus 8 A~~yY~~~~n 17 (87)
|++++.++|+
T Consensus 60 aA~~h~k~~s 69 (288)
T KOG1586|consen 60 AADLHLKAGS 69 (288)
T ss_pred HHHHHHhcCC
Confidence 3444444443
No 70
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=89.95 E-value=4 Score=35.44 Aligned_cols=65 Identities=15% Similarity=0.112 Sum_probs=41.2
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVY 66 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ 66 (87)
.++|++|+++|.++-. .-.++.+|...|++++-....+.+ |++...+-..|..+...|..++|+.
T Consensus 474 ~g~~~eA~~~~~~Al~~~P~~~~~~~~LA~~~~~~G~~~~A~~~l~~al~~~P~~~~~~~a~al~l~~~~~~~~Al~ 550 (1157)
T PRK11447 474 QGKWAQAAELQRQRLALDPGSVWLTYRLAQDLRQAGQRSQADALMRRLAQQKPNDPEQVYAYGLYLSGSDRDRAALA 550 (1157)
T ss_pred CCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHhCCCHHHHHH
Confidence 3678888888887632 235677888889998887777654 3333444444555555555555544
No 71
>PRK11189 lipoprotein NlpI; Provisional
Probab=89.88 E-value=2.6 Score=30.90 Aligned_cols=49 Identities=18% Similarity=0.049 Sum_probs=26.8
Q ss_pred HHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 22 ISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 22 ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
...+...|+|+.-....+.. |+....+..+|..+...|..++|++.|.+
T Consensus 105 g~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~~~eA~~~~~~ 157 (296)
T PRK11189 105 GIYLTQAGNFDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGRYELAQDDLLA 157 (296)
T ss_pred HHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 34455556666544333321 33345566667666677777777665533
No 72
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=89.86 E-value=1.5 Score=28.26 Aligned_cols=54 Identities=17% Similarity=0.242 Sum_probs=38.3
Q ss_pred HHHHHHHHHHhCCHHHHHHHHhhCC----CCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 18 YEKLISVYTELGDFEALESCARKLP----DSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~~L~----~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
..++++.+...|+++...++++.+= -+-..-..+-..+...|....|++.|-++
T Consensus 65 ~~~l~~~~~~~~~~~~a~~~~~~~l~~dP~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~ 122 (146)
T PF03704_consen 65 LERLAEALLEAGDYEEALRLLQRALALDPYDEEAYRLLMRALAAQGRRAEALRVYERY 122 (146)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHhccCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHCcCHHHHHHHHHHH
Confidence 4578888889999999999998862 23355566667777777777777777665
No 73
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=89.84 E-value=4.1 Score=28.03 Aligned_cols=20 Identities=25% Similarity=0.423 Sum_probs=11.3
Q ss_pred HHHHHHHhCCHHHHHHHHhh
Q psy11102 21 LISVYTELGDFEALESCARK 40 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~ 40 (87)
+..+++..++|+.-....+.
T Consensus 76 la~~~~~~~~~~~A~~~~~~ 95 (235)
T TIGR03302 76 LAYAYYKSGDYAEAIAAADR 95 (235)
T ss_pred HHHHHHhcCCHHHHHHHHHH
Confidence 44566666666655554444
No 74
>PRK12370 invasion protein regulator; Provisional
Probab=89.66 E-value=2.1 Score=34.09 Aligned_cols=70 Identities=11% Similarity=-0.025 Sum_probs=40.4
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhhC----CCCC-chHHHHHHHHHhCCChHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARKL----PDSS-PLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~-~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.|++++|..+|.++-. ..-....++..|++++-....+.+ |.++ ..+..+|..+...|..++|.+.|
T Consensus 385 ~G~~~eAi~~~~~Al~l~P~~~~~~~~~~~~~~~~g~~eeA~~~~~~~l~~~~p~~~~~~~~la~~l~~~G~~~eA~~~~ 464 (553)
T PRK12370 385 AGQLEEALQTINECLKLDPTRAAAGITKLWITYYHTGIDDAIRLGDELRSQHLQDNPILLSMQVMFLSLKGKHELARKLT 464 (553)
T ss_pred CCCHHHHHHHHHHHHhcCCCChhhHHHHHHHHHhccCHHHHHHHHHHHHHhccccCHHHHHHHHHHHHhCCCHHHHHHHH
Confidence 4566777777776511 111233466677777555554432 3333 34566777777777777777777
Q ss_pred Hhc
Q psy11102 69 DKN 71 (87)
Q Consensus 69 lk~ 71 (87)
.+.
T Consensus 465 ~~~ 467 (553)
T PRK12370 465 KEI 467 (553)
T ss_pred HHh
Confidence 653
No 75
>PF00637 Clathrin: Region in Clathrin and VPS; InterPro: IPR000547 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. Clathrin is a trimer composed of three heavy chains and three light chains, each monomer projecting outwards like a leg; this three-legged structure is known as a triskelion [, ]. The heavy chains form the legs, their N-terminal beta-propeller regions extending outwards, while their C-terminal alpha-alpha-superhelical regions form the central hub of the triskelion. Peptide motifs can bind between the beta-propeller blades. The light chains appear to have a regulatory role, and may help orient the assembly and disassembly of clathrin coats as they interact with hsc70 uncoating ATPase []. Clathrin triskelia self-polymerise into a curved lattice by twisting individual legs together. The clathrin lattice forms around a vesicle as it buds from the TGN, plasma membrane or endosomes, acting to stabilise the vesicle and facilitate the budding process []. The multiple blades created when the triskelia polymerise are involved in multiple protein interactions, enabling the recruitment of different cargo adaptors and membrane attachment proteins []. This entry represents the 7-fold alpha-alpha-superhelical ARM-type repeat found at the C-terminal of clathrin heavy chains and in VPS (vacuolar protein sorting-associated) proteins. In clathrin heavy chains, the C-terminal 7-fold ARM-type repeats interact to form the central hub of the triskelion. VPS proteins are required for vacuolar assembly and vacuolar traffick, and contain one clathrin-type repeat []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport; PDB: 3LVH_A 3LVG_C 1B89_A 3QIL_L.
Probab=88.80 E-value=0.021 Score=37.10 Aligned_cols=54 Identities=20% Similarity=0.287 Sum_probs=36.3
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVK 57 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~ 57 (87)
+-|+.|+-+|.+.|+++++++++..+++|+.-.+.+...++ ..+...++.++.+
T Consensus 84 ~l~~~a~~Ly~~~~~~~~al~i~~~~~~~~~a~e~~~~~~~-~~l~~~l~~~~l~ 137 (143)
T PF00637_consen 84 GLYEEAVYLYSKLGNHDEALEILHKLKDYEEAIEYAKKVDD-PELWEQLLKYCLD 137 (143)
T ss_dssp TSHHHHHHHHHCCTTHTTCSSTSSSTHCSCCCTTTGGGCSS-SHHHHHHHHHHCT
T ss_pred chHHHHHHHHHHcccHHHHHHHHHHHccHHHHHHHHHhcCc-HHHHHHHHHHHHh
Confidence 56777888888888888888876666777766666666544 4555555555543
No 76
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=88.67 E-value=3.1 Score=31.60 Aligned_cols=52 Identities=15% Similarity=0.120 Sum_probs=34.8
Q ss_pred HHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 10 EYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 10 ~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
.-+...|+++.++++| .+.++.-|.+...+...|..+...|..+.|+..|.+
T Consensus 10 ~~a~~~~~~~~Ai~~~---------~~Al~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~ 61 (356)
T PLN03088 10 KEAFVDDDFALAVDLY---------TQAIDLDPNNAELYADRAQANIKLGNFTEAVADANK 61 (356)
T ss_pred HHHHHcCCHHHHHHHH---------HHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 3333446777777765 233333455667888899999999999888876543
No 77
>KOG1585|consensus
Probab=88.66 E-value=2.1 Score=32.54 Aligned_cols=79 Identities=16% Similarity=0.245 Sum_probs=46.9
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHhC--------------CHHHHHHHHh---hCCCCCchHHHHHHHHHhCCChHHHHHH
Q psy11102 5 VSQAREYYEKCHCYEKLISVYTELG--------------DFEALESCAR---KLPDSSPLLKPMGEIFVKYGLCEQAVYV 67 (87)
Q Consensus 5 w~~A~~yY~~~~n~~k~ie~~~~~e--------------d~d~L~~l~~---~L~~~~~lL~~ia~~F~~~G~~~~Av~a 67 (87)
|.+|+.-|.-+..++|+-.|+-+.- -|++.--|.+ .||+...+..+-.++....|-.+-|.-+
T Consensus 34 yekAAvafRnAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~eKAs~lY~E~GspdtAAma 113 (308)
T KOG1585|consen 34 YEKAAVAFRNAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVECGSPDTAAMA 113 (308)
T ss_pred HHHHHHHHHhhccHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhCCcchHHHH
Confidence 3344444444567777776654332 1233333333 3444456777777777888888887777
Q ss_pred HHhcC------CHHHHHHHHhh
Q psy11102 68 FDKNK------HKSSQWLTVVQ 83 (87)
Q Consensus 68 ylk~g------d~k~ai~~cv~ 83 (87)
..|++ +|+.||+++-+
T Consensus 114 leKAak~lenv~Pd~AlqlYqr 135 (308)
T KOG1585|consen 114 LEKAAKALENVKPDDALQLYQR 135 (308)
T ss_pred HHHHHHHhhcCCHHHHHHHHHH
Confidence 76654 56777776643
No 78
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=88.36 E-value=3 Score=27.39 Aligned_cols=64 Identities=9% Similarity=-0.089 Sum_probs=45.6
Q ss_pred HHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHH---------------------HhcCC
Q psy11102 19 EKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVF---------------------DKNKH 73 (87)
Q Consensus 19 ~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ay---------------------lk~gd 73 (87)
......++..|+|+.-....+.+ |.+.......|..+...|..++|+.+| .+.|+
T Consensus 28 ~~~g~~~~~~g~~~~A~~~~~~al~~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~~~a~~~lg~~l~~~g~ 107 (144)
T PRK15359 28 YASGYASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYTTAINFYGHALMLDASHPEPVYQTGVCLKMMGE 107 (144)
T ss_pred HHHHHHHHHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHcCC
Confidence 34556667777777655444442 555688999999999999999998765 55677
Q ss_pred HHHHHHHHh
Q psy11102 74 KSSQWLTVV 82 (87)
Q Consensus 74 ~k~ai~~cv 82 (87)
++.|+..+.
T Consensus 108 ~~eAi~~~~ 116 (144)
T PRK15359 108 PGLAREAFQ 116 (144)
T ss_pred HHHHHHHHH
Confidence 777776653
No 79
>KOG1586|consensus
Probab=88.23 E-value=2.6 Score=31.80 Aligned_cols=80 Identities=18% Similarity=0.149 Sum_probs=40.3
Q ss_pred chhHHHHHHHHHHhccHHHHHHHHHHhCC-HHHHHHHHhhCCCCC---chHHHHHHHHHhCCChHHHH-------HHHHh
Q psy11102 2 TTLVSQAREYYEKCHCYEKLISVYTELGD-FEALESCARKLPDSS---PLLKPMGEIFVKYGLCEQAV-------YVFDK 70 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~ie~~~~~ed-~d~L~~l~~~L~~~~---~lL~~ia~~F~~~G~~~~Av-------~aylk 70 (87)
++++++|+++|.+++|.=|+..-....|+ |-++-++=-.....| .--.+.+.-|+++ ..+.|| +.|..
T Consensus 27 ~~k~eeAadl~~~Aan~yklaK~w~~AG~aflkaA~~h~k~~skhDaat~YveA~~cykk~-~~~eAv~cL~~aieIyt~ 105 (288)
T KOG1586|consen 27 SNKYEEAAELYERAANMYKLAKNWSAAGDAFLKAADLHLKAGSKHDAATTYVEAANCYKKV-DPEEAVNCLEKAIEIYTD 105 (288)
T ss_pred CcchHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhcCCchhHHHHHHHHHHHhhcc-ChHHHHHHHHHHHHHHHh
Confidence 35789999999999886555544443333 222333322332211 2233444444444 444433 33445
Q ss_pred cCCHHHHHHHHh
Q psy11102 71 NKHKSSQWLTVV 82 (87)
Q Consensus 71 ~gd~k~ai~~cv 82 (87)
.|++..|-.-++
T Consensus 106 ~Grf~~aAk~~~ 117 (288)
T KOG1586|consen 106 MGRFTMAAKHHI 117 (288)
T ss_pred hhHHHHHHhhhh
Confidence 566655544443
No 80
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=87.84 E-value=2.8 Score=36.37 Aligned_cols=68 Identities=16% Similarity=0.141 Sum_probs=40.6
Q ss_pred hhHHHHHHHHHHhcc-------HHHHHHHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHC-------YEKLISVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n-------~~k~ie~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+++++|...|.++=. +..+..++.+.|++++-+...+. -|++...+..+|..+...|..++|++.|.+
T Consensus 590 Gr~~eAl~~~~~AL~l~P~~~a~~~LA~~l~~lG~~deA~~~l~~AL~l~Pd~~~a~~nLG~aL~~~G~~eeAi~~l~~ 668 (987)
T PRK09782 590 GQPELALNDLTRSLNIAPSANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLER 668 (987)
T ss_pred CCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH
Confidence 556666666655442 23445566677777765444433 244456677777777777777777665543
No 81
>PF09145 Ubiq-assoc: Ubiquitin-associated; InterPro: IPR015228 Ubiquitin-associated domains contain approximately 40 residues and bind ubiquitin noncovalently. They adopt a secondary structure consisting of three alpha-helices, and have been identified in various modular proteins involved in protein trafficking, clathrin assembly/disassembly, DNA repair, proteasomal degradation, and cell cycle regulation []. ; PDB: 1PGY_A.
Probab=87.77 E-value=0.41 Score=26.91 Aligned_cols=23 Identities=35% Similarity=0.621 Sum_probs=17.9
Q ss_pred hHHHHHHHHHHhccHHHHHHHHH
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYT 26 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~ 26 (87)
+|++|..||.++.-++.+++-.-
T Consensus 19 sid~A~~yYe~Gi~Ye~~~~~~~ 41 (46)
T PF09145_consen 19 SIDKANDYYERGILYEDLIEKLR 41 (46)
T ss_dssp -SHHHHHHHHHH-SSHHHHHHHH
T ss_pred CHHHHHHHHHcCchHHHHHHHHH
Confidence 58999999999999998877543
No 82
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=87.47 E-value=2.9 Score=27.89 Aligned_cols=53 Identities=21% Similarity=0.251 Sum_probs=35.2
Q ss_pred HHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC---CCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 10 EYYEKCHCYEKLISVYTELGDFEALESCARKLPD---SSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 10 ~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~---~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.+|.+.|+++++++++-+. +..-|+ ....+..+|..+...|.+++|+.+|.++
T Consensus 43 ~~~~~~g~~~~A~~~~~~a---------l~~~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~a 98 (172)
T PRK02603 43 MSAQADGEYAEALENYEEA---------LKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQA 98 (172)
T ss_pred HHHHHcCCHHHHHHHHHHH---------HHHhhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4456666777776655432 111122 1357889999999999999999987653
No 83
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=87.38 E-value=2.7 Score=30.29 Aligned_cols=70 Identities=14% Similarity=0.119 Sum_probs=47.6
Q ss_pred HHHHHHHHHhccHH-------HHHHHHHHhCCHHHHHHHHhhCC-----CC-CchHHHHHHHHHhCCChHHHHHHHHhcC
Q psy11102 6 SQAREYYEKCHCYE-------KLISVYTELGDFEALESCARKLP-----DS-SPLLKPMGEIFVKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 6 ~~A~~yY~~~~n~~-------k~ie~~~~~ed~d~L~~l~~~L~-----~~-~~lL~~ia~~F~~~G~~~~Av~aylk~g 72 (87)
.+|.+-|.+.+... .+++-|+..|+|++-.++.+.+- ++ ..++..+.. ...+|+.+.|
T Consensus 162 ~~A~~~f~~~~~~R~~~~l~~~~A~ey~~~g~~~~A~~~l~~~~~~yr~egW~~l~~~~l~---------~l~~Ca~~~~ 232 (247)
T PF11817_consen 162 EKAYEQFKKYGQNRMASYLSLEMAEEYFRLGDYDKALKLLEPAASSYRREGWWSLLTEVLW---------RLLECAKRLG 232 (247)
T ss_pred HHHHHHHHHhccchHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHhCCcHHHHHHHHH---------HHHHHHHHhC
Confidence 46666676666543 34556677788876666655541 33 455555544 4578999999
Q ss_pred CHHHHHHHHhhc
Q psy11102 73 HKSSQWLTVVQD 84 (87)
Q Consensus 73 d~k~ai~~cv~~ 84 (87)
|++..|.+|+++
T Consensus 233 ~~~~~l~~~leL 244 (247)
T PF11817_consen 233 DVEDYLTTSLEL 244 (247)
T ss_pred CHHHHHHHHHHH
Confidence 999999999986
No 84
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=87.06 E-value=4.2 Score=28.32 Aligned_cols=31 Identities=16% Similarity=0.288 Sum_probs=24.4
Q ss_pred CCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 41 LPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 41 L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
=|++......+|..+...|..+.|+++|-++
T Consensus 69 ~P~~~~~w~~Lg~~~~~~g~~~~A~~a~~~A 99 (198)
T PRK10370 69 NPQNSEQWALLGEYYLWRNDYDNALLAYRQA 99 (198)
T ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 3566677888888888888888888888654
No 85
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=87.03 E-value=1.2 Score=21.80 Aligned_cols=25 Identities=16% Similarity=0.377 Sum_probs=19.4
Q ss_pred hHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 47 LLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 47 lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.+-.+|..+...|..++|+++|.++
T Consensus 3 ~~~~~g~~~~~~~~~~~A~~~~~~a 27 (34)
T PF00515_consen 3 AYYNLGNAYFQLGDYEEALEYYQRA 27 (34)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHhCCchHHHHHHHHH
Confidence 3567888999999999999988663
No 86
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=85.51 E-value=5.9 Score=32.93 Aligned_cols=69 Identities=13% Similarity=0.218 Sum_probs=51.5
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHHHHHHHhh---C-CCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEALESCARK---L-PDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~L~~l~~~---L-~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
.++.++|++.|.++.- +.-++.++...++++.-.++.+. + |.+......+|..+...|..++|+..+-
T Consensus 28 ~g~~~~A~~~~~~~~~~~~~~a~~~~~lA~~~~~~g~~~~A~~~~~~al~~~P~~~~a~~~la~~l~~~g~~~eA~~~l~ 107 (765)
T PRK10049 28 AGQDAEVITVYNRYRVHMQLPARGYAAVAVAYRNLKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAGQYDEALVKAK 107 (765)
T ss_pred cCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHH
Confidence 4667778888877664 56677788888888887777777 4 4455667788888888888888876654
Q ss_pred h
Q psy11102 70 K 70 (87)
Q Consensus 70 k 70 (87)
+
T Consensus 108 ~ 108 (765)
T PRK10049 108 Q 108 (765)
T ss_pred H
Confidence 4
No 87
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=85.50 E-value=1.6 Score=21.00 Aligned_cols=25 Identities=24% Similarity=0.420 Sum_probs=18.7
Q ss_pred hHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 47 LLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 47 lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.+-.+|..+...|..++|++.|.++
T Consensus 3 ~~~~lg~~~~~~~~~~~A~~~~~~a 27 (34)
T PF07719_consen 3 AWYYLGQAYYQLGNYEEAIEYFEKA 27 (34)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHH
Confidence 3567888888888888888888653
No 88
>PF04097 Nic96: Nup93/Nic96; InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=85.47 E-value=5.5 Score=32.66 Aligned_cols=81 Identities=14% Similarity=0.145 Sum_probs=55.6
Q ss_pred hhHHHHHHHHHHhccHH----------HHHHHHHHhCCHHHHHH-----------HHhh------CCC---C-CchHHHH
Q psy11102 3 TLVSQAREYYEKCHCYE----------KLISVYTELGDFEALES-----------CARK------LPD---S-SPLLKPM 51 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~----------k~ie~~~~~ed~d~L~~-----------l~~~------L~~---~-~~lL~~i 51 (87)
.|...|++||..-.... -.-+......+|+.|-- +++. +.. . ..++...
T Consensus 341 td~~~Al~Y~~li~~~~~~~~~~l~~~~l~eLvletref~~LLG~i~~dG~r~~G~i~~~~~Li~~~~~~~~~~~i~~~~ 420 (613)
T PF04097_consen 341 TDPREALQYLYLICLFKDPEQRNLFHECLRELVLETREFDLLLGDINPDGSRTPGLIERRLSLIKFDDDEDFLREIIEQA 420 (613)
T ss_dssp T-HHHHHHHHHGGGGS-SCCHHHHHHHHHHHHHHHH--HHHHHEEE-TTS-EEE-HHHHTGGGGT-SSSSHHHHHHHHHH
T ss_pred cCHHHHHHHHHHHHHcCCchHHHHHHHHHHHHHHccCCHHHHCCCCCCCCccccceeeccccccCCCCcHHHHHHHHHHH
Confidence 57889999999877743 34444556677766652 2333 322 2 4778899
Q ss_pred HHHHHhCCChHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 52 GEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 52 a~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
|..+...|..+.||..|.-+|++..++++.-+
T Consensus 421 A~~~e~~g~~~dAi~Ly~La~~~d~vl~lln~ 452 (613)
T PF04097_consen 421 AREAEERGRFEDAILLYHLAEEYDKVLSLLNR 452 (613)
T ss_dssp HHHHHHCT-HHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHCCCHHHHHHHHHHHhhHHHHHHHHHH
Confidence 99999999999999999999999999987543
No 89
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=85.19 E-value=9.1 Score=27.73 Aligned_cols=71 Identities=15% Similarity=0.177 Sum_probs=51.5
Q ss_pred chhHHHHHHHHHHhc--------cHHHHHHHHHHhCCHHHHHHHHhhCCC-C--Cch-----HHHHHHHHHhCCChHHHH
Q psy11102 2 TTLVSQAREYYEKCH--------CYEKLISVYTELGDFEALESCARKLPD-S--SPL-----LKPMGEIFVKYGLCEQAV 65 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~--------n~~k~ie~~~~~ed~d~L~~l~~~L~~-~--~~l-----L~~ia~~F~~~G~~~~Av 65 (87)
.|++++|.++|.++- -...+..+++..|++++-+...+..-+ . ++. .-..|......|..++|.
T Consensus 127 ~G~~~~A~~~~~~al~~~p~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~~~~~~~~~~~~~la~~~~~~G~~~~A~ 206 (355)
T cd05804 127 AGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLERGDYEAAL 206 (355)
T ss_pred cCCHHHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhccCCCcchhHHHHHHHHHHHHHCCCHHHHH
Confidence 466788888887753 345678899999999998887765322 1 121 225788889999999999
Q ss_pred HHHHhcC
Q psy11102 66 YVFDKNK 72 (87)
Q Consensus 66 ~aylk~g 72 (87)
..|-+..
T Consensus 207 ~~~~~~~ 213 (355)
T cd05804 207 AIYDTHI 213 (355)
T ss_pred HHHHHHh
Confidence 9998764
No 90
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=85.18 E-value=6 Score=23.58 Aligned_cols=55 Identities=22% Similarity=0.225 Sum_probs=32.8
Q ss_pred HHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC---CchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 8 AREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS---SPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 8 A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~---~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.+..+.+.|+++++++.+-.. ++.-|++ ....-.+|..+...|.+++|++.|.++
T Consensus 8 ~~~~~~~~~~~~~A~~~~~~~---------~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~ 65 (119)
T TIGR02795 8 AALLVLKAGDYADAIQAFQAF---------LKKYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAV 65 (119)
T ss_pred HHHHHHHcCCHHHHHHHHHHH---------HHHCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHH
Confidence 334444456666666554322 2223432 235666888888888888888877653
No 91
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=84.84 E-value=7.1 Score=29.79 Aligned_cols=68 Identities=13% Similarity=-0.005 Sum_probs=46.0
Q ss_pred chhHHHHHHHHHHhccH----------------HHHHHHHHHhCCHHHHHHHHhhCCC----CCchHHHHHHHHHhCCCh
Q psy11102 2 TTLVSQAREYYEKCHCY----------------EKLISVYTELGDFEALESCARKLPD----SSPLLKPMGEIFVKYGLC 61 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~----------------~k~ie~~~~~ed~d~L~~l~~~L~~----~~~lL~~ia~~F~~~G~~ 61 (87)
.++|+.|.+.+.+-... ..++.-.-...+-+.|.++.+++|+ +.++....|..+...|..
T Consensus 200 ~gdw~~a~~~l~~l~k~~~~~~~~~~~l~~~a~~~l~~~~~~~~~~~~l~~~w~~lp~~~~~~~~~~~~~A~~l~~~g~~ 279 (398)
T PRK10747 200 TGAWSSLLDILPSMAKAHVGDEEHRAMLEQQAWIGLMDQAMADQGSEGLKRWWKNQSRKTRHQVALQVAMAEHLIECDDH 279 (398)
T ss_pred HHhHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHhCCHHHhCCHHHHHHHHHHHHHCCCH
Confidence 46777777666555421 1112222345667888888888876 346788889999999999
Q ss_pred HHHHHHHH
Q psy11102 62 EQAVYVFD 69 (87)
Q Consensus 62 ~~Av~ayl 69 (87)
++|.....
T Consensus 280 ~~A~~~L~ 287 (398)
T PRK10747 280 DTAQQIIL 287 (398)
T ss_pred HHHHHHHH
Confidence 99877653
No 92
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=84.69 E-value=2.6 Score=29.22 Aligned_cols=55 Identities=9% Similarity=0.062 Sum_probs=35.9
Q ss_pred HHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHH
Q psy11102 12 YEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTV 81 (87)
Q Consensus 12 Y~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~c 81 (87)
+.+-|++++||+||.+....+ |++..-.--.|.-+...|..+.|.++| +.||..|
T Consensus 79 ~Q~~g~~~~AI~aY~~A~~L~---------~ddp~~~~~ag~c~L~lG~~~~A~~aF------~~Ai~~~ 133 (157)
T PRK15363 79 CQAQKHWGEAIYAYGRAAQIK---------IDAPQAPWAAAECYLACDNVCYAIKAL------KAVVRIC 133 (157)
T ss_pred HHHHhhHHHHHHHHHHHHhcC---------CCCchHHHHHHHHHHHcCCHHHHHHHH------HHHHHHh
Confidence 355677888888887665422 333344566777777778888887776 3456666
No 93
>PRK12370 invasion protein regulator; Provisional
Probab=84.52 E-value=4.8 Score=32.11 Aligned_cols=22 Identities=18% Similarity=0.087 Sum_probs=10.0
Q ss_pred hHHHHHHHHHhCCChHHHHHHH
Q psy11102 47 LLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 47 lL~~ia~~F~~~G~~~~Av~ay 68 (87)
....+|..+...|..++|++.|
T Consensus 374 a~~~lg~~l~~~G~~~eAi~~~ 395 (553)
T PRK12370 374 IKYYYGWNLFMAGQLEEALQTI 395 (553)
T ss_pred HHHHHHHHHHHCCCHHHHHHHH
Confidence 3444444444444444444333
No 94
>KOG1155|consensus
Probab=84.30 E-value=3.5 Score=33.76 Aligned_cols=73 Identities=21% Similarity=0.291 Sum_probs=50.0
Q ss_pred chhHHHHHHHHHHh---------------------ccHHHHHHHHHHhCC--------------HHHHHHH---------
Q psy11102 2 TTLVSQAREYYEKC---------------------HCYEKLISVYTELGD--------------FEALESC--------- 37 (87)
Q Consensus 2 ~~~w~~A~~yY~~~---------------------~n~~k~ie~~~~~ed--------------~d~L~~l--------- 37 (87)
.++.++|+.||.++ +|.++||++|-+.=| -=.++.+
T Consensus 343 r~eHEKAv~YFkRALkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYeim~Mh~YaLyYfq 422 (559)
T KOG1155|consen 343 RSEHEKAVMYFKRALKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEIMKMHFYALYYFQ 422 (559)
T ss_pred HHhHHHHHHHHHHHHhcCcchhHHHHHhhHHHHHhcccHHHHHHHHHHHhcCchhHHHHhhhhHHHHHhcchHHHHHHHH
Confidence 35667888887765 678889998865422 1111111
Q ss_pred -HhhC-CCCCchHHHHHHHHHhCCChHHHHHHHHh---cCCH
Q psy11102 38 -ARKL-PDSSPLLKPMGEIFVKYGLCEQAVYVFDK---NKHK 74 (87)
Q Consensus 38 -~~~L-~~~~~lL~~ia~~F~~~G~~~~Av~aylk---~gd~ 74 (87)
+-.+ |-++.++.-+|+-..+.++.++|+.||-+ +||.
T Consensus 423 kA~~~kPnDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dt 464 (559)
T KOG1155|consen 423 KALELKPNDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDT 464 (559)
T ss_pred HHHhcCCCchHHHHHHHHHHHHhccHHHHHHHHHHHHhcccc
Confidence 1122 55689999999999999999999999854 5666
No 95
>PF12931 Sec16_C: Sec23-binding domain of Sec16; PDB: 3MZK_C.
Probab=83.77 E-value=4.3 Score=30.02 Aligned_cols=63 Identities=13% Similarity=-0.015 Sum_probs=42.0
Q ss_pred HHHhccHHHHHHHHHHh-------------CCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHhcCCH
Q psy11102 12 YEKCHCYEKLISVYTEL-------------GDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHK 74 (87)
Q Consensus 12 Y~~~~n~~k~ie~~~~~-------------ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~ 74 (87)
-.-+||.+.+++-+... +|-+-|--|+...+.+ .+.|..+|+.+.+.|....|--||+=+|.+
T Consensus 69 ~v~~g~~~~~v~~l~~~~~~~~~~~~~~~~~Wre~lA~il~N~~~~~~~~l~~LGd~L~~~g~~~aA~iCYllag~~ 145 (284)
T PF12931_consen 69 QVFSGNSPEAVDELVPNSAAPPLEGEWDLDNWRETLAIILSNRTPEDSQALCALGDRLWQRGRVEAAHICYLLAGNP 145 (284)
T ss_dssp HHTTT-HHHHHHHHHH-----HHHHHHHHHSHHHHHHHHHHTS---SS-TT--HHHHHHHTT-HHHHHHHHHHTT--
T ss_pred HHHcCCcHHHHHHhccccccccccccchhcCHHHHHHHHHhCCCcccHHHHHHHHHHHHhCCCcchhHHHHhHcCCc
Confidence 34467777777776654 5667777777776664 688999999999999999999999988754
No 96
>KOG0547|consensus
Probab=83.57 E-value=2.6 Score=34.67 Aligned_cols=73 Identities=18% Similarity=0.217 Sum_probs=56.4
Q ss_pred chhHHHHHHHHHHhccHHH---------------HHHHHHHhCCHHHHHHHHhhCC---CCC-chHHHHHHHHHhCCChH
Q psy11102 2 TTLVSQAREYYEKCHCYEK---------------LISVYTELGDFEALESCARKLP---DSS-PLLKPMGEIFVKYGLCE 62 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k---------------~ie~~~~~ed~d~L~~l~~~L~---~~~-~lL~~ia~~F~~~G~~~ 62 (87)
.|++++|+++|-++-.++. ++-.+.--+|++.-++|++.-- ..+ -.-..+|.+-.+.|..+
T Consensus 475 qqqFd~A~k~YD~ai~LE~~~~~~~v~~~plV~Ka~l~~qwk~d~~~a~~Ll~KA~e~Dpkce~A~~tlaq~~lQ~~~i~ 554 (606)
T KOG0547|consen 475 QQQFDKAVKQYDKAIELEPREHLIIVNAAPLVHKALLVLQWKEDINQAENLLRKAIELDPKCEQAYETLAQFELQRGKID 554 (606)
T ss_pred HHhHHHHHHHHHHHHhhccccccccccchhhhhhhHhhhchhhhHHHHHHHHHHHHccCchHHHHHHHHHHHHHHHhhHH
Confidence 5899999999999877654 3444445588888888888743 333 45788999999999999
Q ss_pred HHHHHHHhcCCH
Q psy11102 63 QAVYVFDKNKHK 74 (87)
Q Consensus 63 ~Av~aylk~gd~ 74 (87)
+||+.|.|.-+.
T Consensus 555 eAielFEksa~l 566 (606)
T KOG0547|consen 555 EAIELFEKSAQL 566 (606)
T ss_pred HHHHHHHHHHHH
Confidence 999999986443
No 97
>KOG1125|consensus
Probab=83.15 E-value=1.6 Score=35.94 Aligned_cols=52 Identities=12% Similarity=0.319 Sum_probs=38.6
Q ss_pred HHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 11 YYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 11 yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+|.-.|+|+|+|+|+--+ .+.=|.+--+=.++|--+++....++||+||-|+
T Consensus 439 Ly~ls~efdraiDcf~~A---------L~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rA 490 (579)
T KOG1125|consen 439 LYNLSGEFDRAVDCFEAA---------LQVKPNDYLLWNRLGATLANGNRSEEAISAYNRA 490 (579)
T ss_pred HHhcchHHHHHHHHHHHH---------HhcCCchHHHHHHhhHHhcCCcccHHHHHHHHHH
Confidence 566677888888876432 1111444477899999999999999999999775
No 98
>KOG0543|consensus
Probab=82.60 E-value=7.2 Score=30.93 Aligned_cols=70 Identities=21% Similarity=0.334 Sum_probs=54.3
Q ss_pred hhHHHHHHHHHHhccH-----------------------HHHHHHHHHhCCHHHHHHHHhh-C---CCCCchHHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCY-----------------------EKLISVYTELGDFEALESCARK-L---PDSSPLLKPMGEIF 55 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~-----------------------~k~ie~~~~~ed~d~L~~l~~~-L---~~~~~lL~~ia~~F 55 (87)
++|..|+.-|.++-.+ --++-|+.+++.|..-.+.|.. | |.|.+-|=+.|.-.
T Consensus 222 gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~ 301 (397)
T KOG0543|consen 222 GKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQAL 301 (397)
T ss_pred chHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHhhHHHHHHHhhhhHHHHHHHHHHHHhcCCCchhHHHHHHHHH
Confidence 5677777777773221 2367789999999987777665 3 33569999999999
Q ss_pred HhCCChHHHHHHHHhcC
Q psy11102 56 VKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 56 ~~~G~~~~Av~aylk~g 72 (87)
...|-++.|...|.|+-
T Consensus 302 l~~~e~~~A~~df~ka~ 318 (397)
T KOG0543|consen 302 LALGEYDLARDDFQKAL 318 (397)
T ss_pred HhhccHHHHHHHHHHHH
Confidence 99999999999998864
No 99
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=82.39 E-value=7.9 Score=29.55 Aligned_cols=69 Identities=13% Similarity=0.172 Sum_probs=47.2
Q ss_pred chhHHHHHHHHHHh----cc----HHH-HHHHHHHhCCHH----HHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 2 TTLVSQAREYYEKC----HC----YEK-LISVYTELGDFE----ALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 2 ~~~w~~A~~yY~~~----~n----~~k-~ie~~~~~ed~d----~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.|+++.|.+||.++ ++ .+. .++.+...++++ .++++.+.-|++...+.-.+......|..+.|.+.+
T Consensus 131 ~g~~~~A~~~l~~a~~~~p~~~l~~~~~~a~l~l~~~~~~~Al~~l~~l~~~~P~~~~~l~ll~~~~~~~~d~~~a~~~l 210 (409)
T TIGR00540 131 RGDEARANQHLEEAAELAGNDNILVEIARTRILLAQNELHAARHGVDKLLEMAPRHKEVLKLAEEAYIRSGAWQALDDII 210 (409)
T ss_pred CCCHHHHHHHHHHHHHhCCcCchHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHhhHHHHHHHH
Confidence 36788888888773 22 122 377777788887 445555555666677888888888888888766655
Q ss_pred Hh
Q psy11102 69 DK 70 (87)
Q Consensus 69 lk 70 (87)
-+
T Consensus 211 ~~ 212 (409)
T TIGR00540 211 DN 212 (409)
T ss_pred HH
Confidence 44
No 100
>PF12854 PPR_1: PPR repeat
Probab=82.28 E-value=2.1 Score=21.69 Aligned_cols=23 Identities=4% Similarity=-0.067 Sum_probs=15.7
Q ss_pred HHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 62 EQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 62 ~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
.-=|.+|.|.|+++.|+++--++
T Consensus 11 ~~lI~~~Ck~G~~~~A~~l~~~M 33 (34)
T PF12854_consen 11 NTLIDGYCKAGRVDEAFELFDEM 33 (34)
T ss_pred HHHHHHHHHCCCHHHHHHHHHhC
Confidence 34467778888888887766544
No 101
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=81.72 E-value=3.1 Score=20.10 Aligned_cols=25 Identities=28% Similarity=0.474 Sum_probs=20.9
Q ss_pred hHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 47 LLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 47 lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.+..+|..+...|..++|.+.|.++
T Consensus 3 ~~~~lg~~y~~~~~~~~A~~~~~~a 27 (34)
T PF13181_consen 3 AYYNLGKIYEQLGDYEEALEYFEKA 27 (34)
T ss_dssp HHHHHHHHHHHTTSHHHHHHHHHHH
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 4667888899999999999988765
No 102
>PF14555 UBA_4: UBA-like domain; PDB: 2DAL_A 3BQ3_A 2L4E_A 2L4F_A 2DZL_A 2L2D_A 2DAM_A 1V92_A 3E21_A.
Probab=80.82 E-value=3.8 Score=21.95 Aligned_cols=27 Identities=22% Similarity=0.269 Sum_probs=21.4
Q ss_pred hHHHHHHHHHHhc-cHHHHHHHHHHhCC
Q psy11102 4 LVSQAREYYEKCH-CYEKLISVYTELGD 30 (87)
Q Consensus 4 ~w~~A~~yY~~~~-n~~k~ie~~~~~ed 30 (87)
+.+.|..|...++ |++.||+.||..++
T Consensus 14 ~~~~A~~~L~~~~wdle~Av~~y~~~~~ 41 (43)
T PF14555_consen 14 DEDVAIQYLEANNWDLEAAVNAYFDDGE 41 (43)
T ss_dssp SHHHHHHHHHHTTT-HHHHHHHHHHSS-
T ss_pred CHHHHHHHHHHcCCCHHHHHHHHHhCCC
Confidence 4578999999988 99999999998653
No 103
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=80.62 E-value=7.5 Score=29.66 Aligned_cols=67 Identities=12% Similarity=0.186 Sum_probs=41.9
Q ss_pred chhHHHHHHHHHHhccH----H-----HHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCY----E-----KLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~----~-----k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.|+++.|.+||.++... . .+.+.+...|+++.-....+.+ |++...+.-.+..+...|..++|++.+
T Consensus 131 ~g~~~~A~~~l~~A~~~~~~~~~~~~l~~a~l~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll~~~~~~~gdw~~a~~~l 210 (398)
T PRK10747 131 RGDEARANQHLERAAELADNDQLPVEITRVRIQLARNENHAARHGVDKLLEVAPRHPEVLRLAEQAYIRTGAWSSLLDIL 210 (398)
T ss_pred CCCHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 35677777777765432 1 2356777777777666655554 444466777777777777777777433
No 104
>PRK14574 hmsH outer membrane protein; Provisional
Probab=80.35 E-value=10 Score=32.36 Aligned_cols=69 Identities=10% Similarity=0.030 Sum_probs=43.4
Q ss_pred hhHHHHHHHHHHhcc------HHHH--HHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHC------YEKL--ISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n------~~k~--ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+++++|..++.++-+ ...+ +..|...++|++-.++.+.+ |++..++..++..+...|..++|++.+.+
T Consensus 82 G~~~~A~~~~eka~~p~n~~~~~llalA~ly~~~gdyd~Aiely~kaL~~dP~n~~~l~gLa~~y~~~~q~~eAl~~l~~ 161 (822)
T PRK14574 82 GRDQEVIDVYERYQSSMNISSRGLASAARAYRNEKRWDQALALWQSSLKKDPTNPDLISGMIMTQADAGRGGVVLKQATE 161 (822)
T ss_pred CCcHHHHHHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHhhcCCHHHHHHHHHH
Confidence 566677777777643 1223 34666667777766666554 44556666667777777777777776665
Q ss_pred c
Q psy11102 71 N 71 (87)
Q Consensus 71 ~ 71 (87)
.
T Consensus 162 l 162 (822)
T PRK14574 162 L 162 (822)
T ss_pred h
Confidence 4
No 105
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=80.21 E-value=4.7 Score=27.98 Aligned_cols=48 Identities=17% Similarity=0.182 Sum_probs=37.7
Q ss_pred HHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHhcC
Q psy11102 25 YTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 25 ~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk~g 72 (87)
+|..|++++-+++-+-| |.+..-.--+|-..+..|.+++|+++|-+++
T Consensus 45 ly~~G~l~~A~~~f~~L~~~Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~ 96 (157)
T PRK15363 45 LMEVKEFAGAARLFQLLTIYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAA 96 (157)
T ss_pred HHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence 46778888887777776 4456778889999999999999998886543
No 106
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=79.66 E-value=8.1 Score=26.66 Aligned_cols=24 Identities=17% Similarity=0.265 Sum_probs=13.9
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHh
Q psy11102 5 VSQAREYYEKCHCYEKLISVYTEL 28 (87)
Q Consensus 5 w~~A~~yY~~~~n~~k~ie~~~~~ 28 (87)
|..=++||.+.|+++.|+++|-+.
T Consensus 39 ~~~l~~~~~~~Gd~~~A~k~y~~~ 62 (177)
T PF10602_consen 39 LEDLADHYCKIGDLEEALKAYSRA 62 (177)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHH
Confidence 334456666666666666666543
No 107
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=78.99 E-value=3.5 Score=19.75 Aligned_cols=24 Identities=4% Similarity=-0.090 Sum_probs=18.9
Q ss_pred hHHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 61 CEQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 61 ~~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
+.-.+.+|.|.|+++.|.++.-++
T Consensus 4 y~~ll~a~~~~g~~~~a~~~~~~M 27 (34)
T PF13812_consen 4 YNALLRACAKAGDPDAALQLFDEM 27 (34)
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHH
Confidence 556788999999999998876544
No 108
>KOG0276|consensus
Probab=78.11 E-value=12 Score=31.72 Aligned_cols=76 Identities=14% Similarity=0.159 Sum_probs=57.9
Q ss_pred HHHHHHHHHhccHHHHHH----------HHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHH
Q psy11102 6 SQAREYYEKCHCYEKLIS----------VYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKS 75 (87)
Q Consensus 6 ~~A~~yY~~~~n~~k~ie----------~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k 75 (87)
.+++.+.++-|-.++|.+ +..++|+++---+|+...+...+ -.++|+.+.+.|....|.+||.++.|.-
T Consensus 618 t~va~Fle~~g~~e~AL~~s~D~d~rFelal~lgrl~iA~~la~e~~s~~K-w~~Lg~~al~~~~l~lA~EC~~~a~d~~ 696 (794)
T KOG0276|consen 618 TKVAHFLESQGMKEQALELSTDPDQRFELALKLGRLDIAFDLAVEANSEVK-WRQLGDAALSAGELPLASECFLRARDLG 696 (794)
T ss_pred hhHHhHhhhccchHhhhhcCCChhhhhhhhhhcCcHHHHHHHHHhhcchHH-HHHHHHHHhhcccchhHHHHHHhhcchh
Confidence 456777888888887775 55677777766666655443222 6789999999999999999999999999
Q ss_pred HHHHHHh
Q psy11102 76 SQWLTVV 82 (87)
Q Consensus 76 ~ai~~cv 82 (87)
+++-++-
T Consensus 697 ~LlLl~t 703 (794)
T KOG0276|consen 697 SLLLLYT 703 (794)
T ss_pred hhhhhhh
Confidence 8876543
No 109
>KOG4162|consensus
Probab=77.95 E-value=21 Score=30.73 Aligned_cols=64 Identities=17% Similarity=0.059 Sum_probs=34.4
Q ss_pred hhHHHHHHHHHHhccHHHHHHHH--------------HHh--------CCHHHHHHHHhhC--CC-CCchHHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVY--------------TEL--------GDFEALESCARKL--PD-SSPLLKPMGEIFVK 57 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~--------------~~~--------ed~d~L~~l~~~L--~~-~~~lL~~ia~~F~~ 57 (87)
..|.-|+.+|.+.+|...+--|+ |+. .+++.++.....| +. ..|-+.-+|..+..
T Consensus 651 ~lwllaa~~~~~~~~~~~a~~CL~Ea~~~~~l~~~~~~~~G~~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Ala~~lle 730 (799)
T KOG4162|consen 651 KLWLLAADLFLLSGNDDEARSCLLEASKIDPLSASVYYLRGLLLEVKGQLEEAKEAFLVALALDPDHVPSMTALAELLLE 730 (799)
T ss_pred HHHHHHHHHHHhcCCchHHHHHHHHHHhcchhhHHHHHHhhHHHHHHHhhHHHHHHHHHHHhcCCCCcHHHHHHHHHHHH
Confidence 45777777777777776664332 333 3344444444443 22 23556666666666
Q ss_pred CCChHHHHH
Q psy11102 58 YGLCEQAVY 66 (87)
Q Consensus 58 ~G~~~~Av~ 66 (87)
.|...-|+.
T Consensus 731 ~G~~~la~~ 739 (799)
T KOG4162|consen 731 LGSPRLAEK 739 (799)
T ss_pred hCCcchHHH
Confidence 555444444
No 110
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=77.75 E-value=2.9 Score=19.76 Aligned_cols=21 Identities=5% Similarity=-0.190 Sum_probs=12.6
Q ss_pred HHHHHHhcCCHHHHHHHHhhc
Q psy11102 64 AVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 64 Av~aylk~gd~k~ai~~cv~~ 84 (87)
-+.+|.+.|++++|+++.-++
T Consensus 6 li~~~~~~~~~~~a~~~~~~M 26 (35)
T TIGR00756 6 LIDGLCKAGRVEEALELFKEM 26 (35)
T ss_pred HHHHHHHCCCHHHHHHHHHHH
Confidence 355666666666666665443
No 111
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=77.11 E-value=3.5 Score=19.22 Aligned_cols=24 Identities=29% Similarity=0.498 Sum_probs=15.1
Q ss_pred HHHHHHHHhCCHHHHHHHHhhCCC
Q psy11102 20 KLISVYTELGDFEALESCARKLPD 43 (87)
Q Consensus 20 k~ie~~~~~ed~d~L~~l~~~L~~ 43 (87)
-+|++|.+.+++++-.++.+.+.+
T Consensus 5 ~li~~~~~~~~~~~a~~~~~~M~~ 28 (31)
T PF01535_consen 5 SLISGYCKMGQFEEALEVFDEMRE 28 (31)
T ss_pred HHHHHHHccchHHHHHHHHHHHhH
Confidence 456666666666666666665543
No 112
>KOG2280|consensus
Probab=77.06 E-value=8.1 Score=33.14 Aligned_cols=51 Identities=10% Similarity=-0.040 Sum_probs=40.7
Q ss_pred HHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHH
Q psy11102 6 SQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFV 56 (87)
Q Consensus 6 ~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~ 56 (87)
.+=+..|.+.|++..|+++.++..|-++|.++-+..+...+.....+..+.
T Consensus 774 ~ekv~ay~~~~~~~eAad~A~~~rd~~~L~ev~~~~~~~~~~~~~~~~~~~ 824 (829)
T KOG2280|consen 774 QEKVKAYLRVGDVKEAADLAAEHRDGAELSEVLSKCTGAPDGATALKIQDQ 824 (829)
T ss_pred HHHHHHHHHhccHHHHHHHHHHhcChHHHHHHHHhcCCCCccchhhhhHhh
Confidence 356788999999999999999999999999999998876544444444443
No 113
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=76.54 E-value=6.4 Score=27.04 Aligned_cols=52 Identities=23% Similarity=0.319 Sum_probs=36.4
Q ss_pred HHHHHHHhCCHHHH----HHHHhhCCCC---CchHHHHHHHHHhCCChHHHHHHHHhcC
Q psy11102 21 LISVYTELGDFEAL----ESCARKLPDS---SPLLKPMGEIFVKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 21 ~ie~~~~~ed~d~L----~~l~~~L~~~---~~lL~~ia~~F~~~G~~~~Av~aylk~g 72 (87)
..+.|+..|+++.- ++.++.-|.+ .+.+..+|.-+...|..++|+..+...+
T Consensus 172 ~a~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~a~~~l~~~~~~lg~~~~A~~~~~~l~ 230 (235)
T TIGR03302 172 VARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLKLGLKDLAQDAAAVLG 230 (235)
T ss_pred HHHHHHHcCChHHHHHHHHHHHHHCCCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHH
Confidence 45677788888753 3444444442 3577889999999999999998776553
No 114
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=76.11 E-value=8 Score=32.43 Aligned_cols=52 Identities=17% Similarity=0.241 Sum_probs=30.7
Q ss_pred HHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 20 KLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 20 k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
....++.++|.+++-..+-+.+ |+....+...|.-++..|..+.|+.+|.++
T Consensus 159 ~~a~~l~~~g~~~~A~~~y~~~~~~~p~~~~~~~~~a~~l~~~G~~~~A~~~~~~a 214 (694)
T PRK15179 159 LEAKSWDEIGQSEQADACFERLSRQHPEFENGYVGWAQSLTRRGALWRARDVLQAG 214 (694)
T ss_pred HHHHHHHHhcchHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 3445566666666655544443 223455666666666666666666666665
No 115
>PRK11189 lipoprotein NlpI; Provisional
Probab=75.62 E-value=20 Score=26.18 Aligned_cols=30 Identities=23% Similarity=0.320 Sum_probs=25.8
Q ss_pred CCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 42 PDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 42 ~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
|++......+|..+...|..++|+++|-++
T Consensus 95 P~~~~a~~~lg~~~~~~g~~~~A~~~~~~A 124 (296)
T PRK11189 95 PDMADAYNYLGIYLTQAGNFDAAYEAFDSV 124 (296)
T ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 555688999999999999999999887665
No 116
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=75.57 E-value=28 Score=26.53 Aligned_cols=70 Identities=9% Similarity=0.031 Sum_probs=48.2
Q ss_pred chhHHHHHHHHHHhccH-----HHH---H-HHH-------HHhCCHHHHHHHHhhCCC----CCchHHHHHHHHHhCCCh
Q psy11102 2 TTLVSQAREYYEKCHCY-----EKL---I-SVY-------TELGDFEALESCARKLPD----SSPLLKPMGEIFVKYGLC 61 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~-----~k~---i-e~~-------~~~ed~d~L~~l~~~L~~----~~~lL~~ia~~F~~~G~~ 61 (87)
.++|+.|.+++.+-... +.. - ..+ -..+..+.|.++...+|+ +..+..-.|..+...|.+
T Consensus 200 ~~d~~~a~~~l~~l~k~~~~~~~~~~~l~~~a~~~~l~~~~~~~~~~~L~~~~~~~p~~~~~~~~l~~~~a~~l~~~g~~ 279 (409)
T TIGR00540 200 SGAWQALDDIIDNMAKAGLFDDEEFADLEQKAEIGLLDEAMADEGIDGLLNWWKNQPRHRRHNIALKIALAEHLIDCDDH 279 (409)
T ss_pred HhhHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHCCHHHhCCHHHHHHHHHHHHHCCCh
Confidence 47888888777665521 111 0 111 123456789999999985 457788889999999999
Q ss_pred HHHHHHHHhc
Q psy11102 62 EQAVYVFDKN 71 (87)
Q Consensus 62 ~~Av~aylk~ 71 (87)
++|.+...+.
T Consensus 280 ~~A~~~l~~~ 289 (409)
T TIGR00540 280 DSAQEIIFDG 289 (409)
T ss_pred HHHHHHHHHH
Confidence 9999887653
No 117
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=75.40 E-value=20 Score=23.55 Aligned_cols=54 Identities=15% Similarity=0.026 Sum_probs=28.0
Q ss_pred hhHHHHHHHHHHhcc-----------HHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHC-----------YEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFV 56 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n-----------~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~ 56 (87)
++|++|..+|.++-. +..+..++...|+++.-....+.. |.....+..+|..+.
T Consensus 49 g~~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~~~~~~~~la~i~~ 117 (168)
T CHL00033 49 GEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMAVICH 117 (168)
T ss_pred CCHHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHH
Confidence 556666666666621 223345666666666655555442 222233445555555
No 118
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=75.29 E-value=32 Score=28.87 Aligned_cols=81 Identities=7% Similarity=-0.035 Sum_probs=54.4
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHH----hCCHHHHHHHH-hhCCCC--CchHHHHHHHHHhCCChHHHHHHHHhcCCHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTE----LGDFEALESCA-RKLPDS--SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKS 75 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~----~ed~d~L~~l~-~~L~~~--~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k 75 (87)
.|+..++-||..-|.. +.+..+++ ...-.++.+.. +...+. ...-.+=|--+.+.-.++.|..-|+=+|+++
T Consensus 404 ~dp~~~al~YlAl~Kk-~vL~~L~k~a~~~~~~~k~~~Fl~ndF~~~rwr~AAlKNAyaLlsk~Ry~~AAaFFLLag~l~ 482 (631)
T PF12234_consen 404 KDPSDCALFYLALGKK-KVLAGLWKMASWHKEQQKMAKFLSNDFTEPRWRTAALKNAYALLSKHRYEYAAAFFLLAGSLK 482 (631)
T ss_pred CCHHHHHHHHHHhccH-HHHHHHHHhhhcccccHHHHHHHhhcCCChHHHHHHHHhHHHHHhcccHHHHHHHHHhcccHH
Confidence 5666788888888875 44455555 33334444433 334332 2333345666678888999999999999999
Q ss_pred HHHHHHhhc
Q psy11102 76 SQWLTVVQD 84 (87)
Q Consensus 76 ~ai~~cv~~ 84 (87)
.|++.|++.
T Consensus 483 dAv~V~~~~ 491 (631)
T PF12234_consen 483 DAVNVCLRQ 491 (631)
T ss_pred HHHHHHHHH
Confidence 999999874
No 119
>PF09205 DUF1955: Domain of unknown function (DUF1955); InterPro: IPR015288 Members of this family are found in hypothetical proteins synthesised by the Archaeal organism Sulfolobus. Their exact function has not, as yet, been determined. ; PDB: 1WY6_A.
Probab=75.24 E-value=9.6 Score=26.55 Aligned_cols=50 Identities=14% Similarity=0.299 Sum_probs=30.6
Q ss_pred HHHHHHHhCCHHHHHHHHhhCC---C-CCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 21 LISVYTELGDFEALESCARKLP---D-SSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L~---~-~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
|++.+...+--|.|+++.+.|- + +.++|-+||.-+.+.|.--.|-+..-+
T Consensus 92 ALd~lv~~~kkDqLdki~~~l~kn~~~~p~~L~kia~Ay~klg~~r~~~ell~~ 145 (161)
T PF09205_consen 92 ALDILVKQGKKDQLDKIYNELKKNEEINPEFLVKIANAYKKLGNTREANELLKE 145 (161)
T ss_dssp HHHHHHHTT-HHHHHHHHHHH-----S-HHHHHHHHHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHHhccHHHHHHHHHHHhhccCCCHHHHHHHHHHHHHhcchhhHHHHHHH
Confidence 3333445566688888888874 2 247788888877777776666655433
No 120
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=74.32 E-value=6.4 Score=19.42 Aligned_cols=25 Identities=16% Similarity=0.325 Sum_probs=19.4
Q ss_pred HHHHHHHHHhCCChHHHHHHHHhcC
Q psy11102 48 LKPMGEIFVKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 48 L~~ia~~F~~~G~~~~Av~aylk~g 72 (87)
+..+|..+...|..++|.+.+.++=
T Consensus 5 ~~~la~~~~~~g~~~~A~~~~~~al 29 (42)
T PF13374_consen 5 LNNLANAYRAQGRYEEALELLEEAL 29 (42)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHhhhhcchhhHHHHHHH
Confidence 5678888889999999998887653
No 121
>KOG2005|consensus
Probab=74.16 E-value=18 Score=31.06 Aligned_cols=75 Identities=13% Similarity=0.113 Sum_probs=52.2
Q ss_pred HHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhC----------------CC--CCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 8 AREYYEKCHCYEKLISVYTELGDFEALESCARKL----------------PD--SSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 8 A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L----------------~~--~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
=+.|+.+-.--..|++.+...|+.|.|.+-++.- |. +..+|..+=..+.+-+.+.+|+..-+
T Consensus 184 iV~f~mkHNAE~eAiDlL~Eve~id~l~~~Vd~~n~~RvclYl~sc~~~lP~Pdd~~ll~~a~~IYlKf~~~~~al~~ai 263 (878)
T KOG2005|consen 184 IVPFHMKHNAEFEAIDLLMEVEGIDLLLDYVDEHNYQRVCLYLTSCVPLLPGPDDVALLRTALKIYLKFNEYPRALVGAI 263 (878)
T ss_pred HHHHHHhccchhHHHHHHHHhhhHhHHHHHhhhhhHHHHHHHHHHHhhcCCCchhhHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3556666666677888888888888777665543 43 34555555556678888888888888
Q ss_pred hcCCHHHHHHHHh
Q psy11102 70 KNKHKSSQWLTVV 82 (87)
Q Consensus 70 k~gd~k~ai~~cv 82 (87)
+.+|.+.+.+.+-
T Consensus 264 ~l~~~~~v~~vf~ 276 (878)
T KOG2005|consen 264 RLDDMKEVKEVFT 276 (878)
T ss_pred hcCcHHHHHHHHH
Confidence 8888877776653
No 122
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=73.86 E-value=8 Score=27.92 Aligned_cols=26 Identities=27% Similarity=0.396 Sum_probs=22.9
Q ss_pred HHHHHHHhCCHHHHHHHHhhCCCCCc
Q psy11102 21 LISVYTELGDFEALESCARKLPDSSP 46 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L~~~~~ 46 (87)
.++|..++++||.|.+.+...+..++
T Consensus 4 ~~eaaWrl~~Wd~l~~~~~~~~~~~~ 29 (352)
T PF02259_consen 4 AAEAAWRLGDWDLLEEYLSQSNEDSP 29 (352)
T ss_pred HHHHHHhcCChhhHHHHHhhccCCCh
Confidence 68999999999999999999987543
No 123
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=73.86 E-value=9.9 Score=31.90 Aligned_cols=70 Identities=13% Similarity=0.045 Sum_probs=52.2
Q ss_pred hhHHHHHHHHHHhccH--------HHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCY--------EKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~--------~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+.+++|..++..+-++ .-.+.++.+.+.+|+=...++.+ |++...+...|.-+...|.+++|.++|-|
T Consensus 100 g~~~ea~~~l~~~~~~~Pd~~~a~~~~a~~L~~~~~~eeA~~~~~~~l~~~p~~~~~~~~~a~~l~~~g~~~~A~~~y~~ 179 (694)
T PRK15179 100 HRSDEGLAVWRGIHQRFPDSSEAFILMLRGVKRQQGIEAGRAEIELYFSGGSSSAREILLEAKSWDEIGQSEQADACFER 179 (694)
T ss_pred CCcHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHhccHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHhcchHHHHHHHHH
Confidence 3455666666554332 35678889999998876666654 55557799999999999999999999987
Q ss_pred cC
Q psy11102 71 NK 72 (87)
Q Consensus 71 ~g 72 (87)
.-
T Consensus 180 ~~ 181 (694)
T PRK15179 180 LS 181 (694)
T ss_pred HH
Confidence 53
No 124
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=73.15 E-value=6.7 Score=20.50 Aligned_cols=24 Identities=8% Similarity=0.227 Sum_probs=17.8
Q ss_pred hHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 47 LLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 47 lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
.+..+|..+...|..++|+++|-+
T Consensus 3 ~~~~la~~~~~~G~~~~A~~~~~~ 26 (44)
T PF13428_consen 3 AWLALARAYRRLGQPDEAERLLRR 26 (44)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHH
Confidence 345677788888888888887765
No 125
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=72.39 E-value=4.6 Score=22.48 Aligned_cols=25 Identities=28% Similarity=0.490 Sum_probs=16.9
Q ss_pred chHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 46 PLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 46 ~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
..+...|..+...|..++|+..|.+
T Consensus 4 ~~~~~~g~~~~~~~~~~~A~~~~~~ 28 (69)
T PF13414_consen 4 EAWYNLGQIYFQQGDYEEAIEYFEK 28 (69)
T ss_dssp HHHHHHHHHHHHTTHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 3456677777777777777766644
No 126
>PF07035 Mic1: Colon cancer-associated protein Mic1-like; InterPro: IPR009755 This entry represents the C terminus (approximately 160 residues) of a number of proteins that resemble colon cancer-associated protein Mic1.
Probab=72.39 E-value=30 Score=24.11 Aligned_cols=66 Identities=23% Similarity=0.304 Sum_probs=52.0
Q ss_pred HHHHHHHHHHhCCHHHHHHHHhh--CCCCC--------------chHHHHHHHHHhCC-ChHHHHHHHHhcCCHHHHHHH
Q psy11102 18 YEKLISVYTELGDFEALESCARK--LPDSS--------------PLLKPMGEIFVKYG-LCEQAVYVFDKNKHKSSQWLT 80 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~~--L~~~~--------------~lL~~ia~~F~~~G-~~~~Av~aylk~gd~k~ai~~ 80 (87)
++.+|+++...|.|..|..+++. ++++. +...-.-+||+..| .++.=+++++..|++-.|+..
T Consensus 32 ~~lli~lLi~~~~~~~L~qllq~~Vi~DSk~lA~~LLs~~~~~~~~~Ql~lDMLkRL~~~~~~iievLL~~g~vl~ALr~ 111 (167)
T PF07035_consen 32 YELLIDLLIRNGQFSQLHQLLQYHVIPDSKPLACQLLSLGNQYPPAYQLGLDMLKRLGTAYEEIIEVLLSKGQVLEALRY 111 (167)
T ss_pred HHHHHHHHHHcCCHHHHHHHHhhcccCCcHHHHHHHHHhHccChHHHHHHHHHHHHhhhhHHHHHHHHHhCCCHHHHHHH
Confidence 56889999999999999999876 44432 22222347899999 888888999999999999987
Q ss_pred Hhh
Q psy11102 81 VVQ 83 (87)
Q Consensus 81 cv~ 83 (87)
.-+
T Consensus 112 ar~ 114 (167)
T PF07035_consen 112 ARQ 114 (167)
T ss_pred HHH
Confidence 754
No 127
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=72.27 E-value=23 Score=23.28 Aligned_cols=50 Identities=16% Similarity=0.092 Sum_probs=33.7
Q ss_pred HHHHHHhCCHHHHHHHHhhC----CCC---CchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 22 ISVYTELGDFEALESCARKL----PDS---SPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 22 ie~~~~~ed~d~L~~l~~~L----~~~---~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
..++...++|++-....+.. |+. ...+..+|..+...|.+++|...|.++
T Consensus 42 g~~~~~~g~~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~A 98 (168)
T CHL00033 42 GMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQA 98 (168)
T ss_pred HHHHHHcCCHHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 44555667777654443332 221 136888999999999999999888653
No 128
>KOG0686|consensus
Probab=71.85 E-value=6.2 Score=31.76 Aligned_cols=39 Identities=10% Similarity=0.237 Sum_probs=32.8
Q ss_pred chHHHHHHHHHhCCChHHHHHHHHhc----CCHHHHHHHHhhc
Q psy11102 46 PLLKPMGEIFVKYGLCEQAVYVFDKN----KHKSSQWLTVVQD 84 (87)
Q Consensus 46 ~lL~~ia~~F~~~G~~~~Av~aylk~----gd~k~ai~~cv~~ 84 (87)
..+.++|.+|...|+.+.|+.+|.|. -.-+--|++|++.
T Consensus 151 ra~~Dl~dhy~~cG~l~~Alr~YsR~RdYCTs~khvInm~ln~ 193 (466)
T KOG0686|consen 151 RALEDLGDHYLDCGQLDNALRCYSRARDYCTSAKHVINMCLNL 193 (466)
T ss_pred HHHHHHHHHHHHhccHHHHHhhhhhhhhhhcchHHHHHHHHHH
Confidence 45788999999999999999999994 4467788888763
No 129
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=71.74 E-value=16 Score=31.79 Aligned_cols=76 Identities=17% Similarity=0.172 Sum_probs=52.2
Q ss_pred hhHHHHHHHHHHh--------ccHHHHHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhC------CChHHH
Q psy11102 3 TLVSQAREYYEKC--------HCYEKLISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKY------GLCEQA 64 (87)
Q Consensus 3 ~~w~~A~~yY~~~--------~n~~k~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~------G~~~~A 64 (87)
.+| .+++||.++ .=+-.+++||=++|++++...+.+.+ |++...|..+|..+... .|..+|
T Consensus 97 ~~~-~~ve~~~~~i~~~~~~k~Al~~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~AY~~ae~dL~KA~~m~~KA 175 (906)
T PRK14720 97 LKW-AIVEHICDKILLYGENKLALRTLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKKLATSYEEEDKEKAITYLKKA 175 (906)
T ss_pred cch-hHHHHHHHHHHhhhhhhHHHHHHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 356 667776632 12346789999999999888877774 55668899999998766 344455
Q ss_pred HHHHHhcCCHHHHHH
Q psy11102 65 VYVFDKNKHKSSQWL 79 (87)
Q Consensus 65 v~aylk~gd~k~ai~ 79 (87)
|+-|+..+++..+.+
T Consensus 176 V~~~i~~kq~~~~~e 190 (906)
T PRK14720 176 IYRFIKKKQYVGIEE 190 (906)
T ss_pred HHHHHhhhcchHHHH
Confidence 666666666665544
No 130
>PF09205 DUF1955: Domain of unknown function (DUF1955); InterPro: IPR015288 Members of this family are found in hypothetical proteins synthesised by the Archaeal organism Sulfolobus. Their exact function has not, as yet, been determined. ; PDB: 1WY6_A.
Probab=71.59 E-value=16 Score=25.53 Aligned_cols=69 Identities=13% Similarity=0.140 Sum_probs=44.4
Q ss_pred HHhccHHHHHHHHHHhCCHHHHHHHHhh-CC-CC-CchHHHHHHHHHhCCChHHH-----HHHHHhcCCHHHHHHHH
Q psy11102 13 EKCHCYEKLISVYTELGDFEALESCARK-LP-DS-SPLLKPMGEIFVKYGLCEQA-----VYVFDKNKHKSSQWLTV 81 (87)
Q Consensus 13 ~~~~n~~k~ie~~~~~ed~d~L~~l~~~-L~-~~-~~lL~~ia~~F~~~G~~~~A-----v~aylk~gd~k~ai~~c 81 (87)
..|||..+-|+||.+.+-..+-..++-. |- .+ ...|.+++.-+.+.+....- ..||-|.|+.+.+-++-
T Consensus 67 s~C~NlKrVi~C~~~~n~~se~vD~ALd~lv~~~kkDqLdki~~~l~kn~~~~p~~L~kia~Ay~klg~~r~~~ell 143 (161)
T PF09205_consen 67 SKCGNLKRVIECYAKRNKLSEYVDLALDILVKQGKKDQLDKIYNELKKNEEINPEFLVKIANAYKKLGNTREANELL 143 (161)
T ss_dssp GG-S-THHHHHHHHHTT---HHHHHHHHHHHHTT-HHHHHHHHHHH-----S-HHHHHHHHHHHHHTT-HHHHHHHH
T ss_pred hhhcchHHHHHHHHHhcchHHHHHHHHHHHHHhccHHHHHHHHHHHhhccCCCHHHHHHHHHHHHHhcchhhHHHHH
Confidence 4789999999999999999888888755 33 34 47788888887655554432 47999999999987764
No 131
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=70.91 E-value=29 Score=30.32 Aligned_cols=49 Identities=12% Similarity=0.134 Sum_probs=38.7
Q ss_pred HHHHHhCCHHHHHHHHhh----CCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 23 SVYTELGDFEALESCARK----LPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 23 e~~~~~ed~d~L~~l~~~----L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.++...|++++-..+.+. -|++...+..+|..+...|..+.|+.+|.++
T Consensus 651 ~aL~~~G~~eeAi~~l~~AL~l~P~~~~a~~nLA~al~~lGd~~eA~~~l~~A 703 (987)
T PRK09782 651 YALWDSGDIAQSREMLERAHKGLPDDPALIRQLAYVNQRLDDMAATQHYARLV 703 (987)
T ss_pred HHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 466778888876665554 3556688999999999999999999988775
No 132
>KOG0985|consensus
Probab=70.81 E-value=32 Score=31.29 Aligned_cols=79 Identities=11% Similarity=0.092 Sum_probs=49.8
Q ss_pred hhHHHHHHHHHHhccH-------HHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHH------
Q psy11102 3 TLVSQAREYYEKCHCY-------EKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFD------ 69 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~-------~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~ayl------ 69 (87)
++|+.=+.|..-++.. ..+|-.|.+.+...+||+.+. ..++.-+..+|+..-..||++.|.=.|-
T Consensus 1147 ~~~edLv~yL~MaRkk~~E~~id~eLi~AyAkt~rl~elE~fi~--gpN~A~i~~vGdrcf~~~~y~aAkl~y~~vSN~a 1224 (1666)
T KOG0985|consen 1147 GKYEDLVKYLLMARKKVREPYIDSELIFAYAKTNRLTELEEFIA--GPNVANIQQVGDRCFEEKMYEAAKLLYSNVSNFA 1224 (1666)
T ss_pred CcHHHHHHHHHHHHHhhcCccchHHHHHHHHHhchHHHHHHHhc--CCCchhHHHHhHHHhhhhhhHHHHHHHHHhhhHH
Confidence 4455555554444332 234555555555555555542 3466789999999999999999987774
Q ss_pred -------hcCCHHHHHHHHhh
Q psy11102 70 -------KNKHKSSQWLTVVQ 83 (87)
Q Consensus 70 -------k~gd~k~ai~~cv~ 83 (87)
+.|++.+|+|+.-.
T Consensus 1225 ~La~TLV~LgeyQ~AVD~aRK 1245 (1666)
T KOG0985|consen 1225 KLASTLVYLGEYQGAVDAARK 1245 (1666)
T ss_pred HHHHHHHHHHHHHHHHHHhhh
Confidence 45666666665443
No 133
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=70.31 E-value=22 Score=26.15 Aligned_cols=69 Identities=14% Similarity=0.121 Sum_probs=48.3
Q ss_pred hhHHHHHHHHHHhc-----------cHHHHHHHHHHhCCHHHHHH----HHhhCCCC---CchHHHHHHHHHhCCChHHH
Q psy11102 3 TLVSQAREYYEKCH-----------CYEKLISVYTELGDFEALES----CARKLPDS---SPLLKPMGEIFVKYGLCEQA 64 (87)
Q Consensus 3 ~~w~~A~~yY~~~~-----------n~~k~ie~~~~~ed~d~L~~----l~~~L~~~---~~lL~~ia~~F~~~G~~~~A 64 (87)
++|++|+..|.+-- =+-.+.++|+..++|++=.. ++..-|++ ...+-.+|..+...|..+.|
T Consensus 157 ~~y~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g~~~~A 236 (263)
T PRK10803 157 SRQDDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKA 236 (263)
T ss_pred CCHHHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcCCHHHH
Confidence 56777776664321 23467788899999876444 44444663 36688899999899999999
Q ss_pred HHHHHhc
Q psy11102 65 VYVFDKN 71 (87)
Q Consensus 65 v~aylk~ 71 (87)
+..|-+.
T Consensus 237 ~~~~~~v 243 (263)
T PRK10803 237 KAVYQQV 243 (263)
T ss_pred HHHHHHH
Confidence 9988643
No 134
>KOG3060|consensus
Probab=69.70 E-value=18 Score=27.54 Aligned_cols=48 Identities=17% Similarity=0.105 Sum_probs=40.5
Q ss_pred hHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCC
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGL 60 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~ 60 (87)
-|-.+++.|..-|.++||+-|| |+++-.-|.++-....+|+..-..|.
T Consensus 156 AW~eLaeiY~~~~~f~kA~fCl---------EE~ll~~P~n~l~f~rlae~~Yt~gg 203 (289)
T KOG3060|consen 156 AWHELAEIYLSEGDFEKAAFCL---------EELLLIQPFNPLYFQRLAEVLYTQGG 203 (289)
T ss_pred HHHHHHHHHHhHhHHHHHHHHH---------HHHHHcCCCcHHHHHHHHHHHHHHhh
Confidence 4899999999999999999996 67777778888888888888766665
No 135
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=69.64 E-value=17 Score=29.51 Aligned_cols=52 Identities=21% Similarity=0.355 Sum_probs=33.8
Q ss_pred HHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 10 EYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 10 ~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
..+..+|+++++++.+ ++-...+.+....++.-|+.+.+.|..++|..+|..
T Consensus 12 ~il~e~g~~~~AL~~L---------~~~~~~I~Dk~~~~E~rA~ll~kLg~~~eA~~~y~~ 63 (517)
T PF12569_consen 12 SILEEAGDYEEALEHL---------EKNEKQILDKLAVLEKRAELLLKLGRKEEAEKIYRE 63 (517)
T ss_pred HHHHHCCCHHHHHHHH---------HhhhhhCCCHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 3445566666666554 333344555567778888888888888888777754
No 136
>KOG1155|consensus
Probab=69.63 E-value=13 Score=30.48 Aligned_cols=60 Identities=22% Similarity=0.243 Sum_probs=43.0
Q ss_pred hHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC-CCchHHHHHHHHHhCCChHHHHHHHHhcCC
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPD-SSPLLKPMGEIFVKYGLCEQAVYVFDKNKH 73 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~-~~~lL~~ia~~F~~~G~~~~Av~aylk~gd 73 (87)
-|..=.+-|++-++.+.||.||-+.=. +.+ +...|-.+|......++.++|+++|.|+=+
T Consensus 434 lw~aLG~CY~kl~~~~eAiKCykrai~----------~~dte~~~l~~LakLye~l~d~~eAa~~yek~v~ 494 (559)
T KOG1155|consen 434 LWVALGECYEKLNRLEEAIKCYKRAIL----------LGDTEGSALVRLAKLYEELKDLNEAAQYYEKYVE 494 (559)
T ss_pred HHHHHHHHHHHhccHHHHHHHHHHHHh----------ccccchHHHHHHHHHHHHHHhHHHHHHHHHHHHH
Confidence 466667888888888888888854211 122 345677888888888888888888887644
No 137
>KOG0547|consensus
Probab=68.73 E-value=21 Score=29.60 Aligned_cols=39 Identities=26% Similarity=0.433 Sum_probs=32.6
Q ss_pred chhHHHHHHHHHHh----cc----HHHHHHHHHHhCCHHHHHHHHhh
Q psy11102 2 TTLVSQAREYYEKC----HC----YEKLISVYTELGDFEALESCARK 40 (87)
Q Consensus 2 ~~~w~~A~~yY~~~----~n----~~k~ie~~~~~ed~d~L~~l~~~ 40 (87)
+++|++|++||.++ .+ |.-..-||-.+|||.+..+-+..
T Consensus 128 ~kkY~eAIkyY~~AI~l~p~epiFYsNraAcY~~lgd~~~Vied~Tk 174 (606)
T KOG0547|consen 128 NKKYDEAIKYYTQAIELCPDEPIFYSNRAACYESLGDWEKVIEDCTK 174 (606)
T ss_pred cccHHHHHHHHHHHHhcCCCCchhhhhHHHHHHHHhhHHHHHHHHHH
Confidence 57899999999875 34 66778899999999999887765
No 138
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=68.71 E-value=12 Score=23.11 Aligned_cols=42 Identities=21% Similarity=0.215 Sum_probs=30.3
Q ss_pred CHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 30 DFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 30 d~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+.+.|+.-+..-|++....-.+|......|..++|++.++..
T Consensus 7 ~~~al~~~~a~~P~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~ 48 (90)
T PF14561_consen 7 DIAALEAALAANPDDLDARYALADALLAAGDYEEALDQLLEL 48 (90)
T ss_dssp HHHHHHHHHHHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred cHHHHHHHHHcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH
Confidence 455666777777777777888888888888888888877753
No 139
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=68.37 E-value=5.4 Score=27.56 Aligned_cols=38 Identities=13% Similarity=0.250 Sum_probs=29.7
Q ss_pred CchHHHHHHHHHhCCChHHHHHHHHhcCC----HHHHHHHHh
Q psy11102 45 SPLLKPMGEIFVKYGLCEQAVYVFDKNKH----KSSQWLTVV 82 (87)
Q Consensus 45 ~~lL~~ia~~F~~~G~~~~Av~aylk~gd----~k~ai~~cv 82 (87)
...+.++|++|.+.|..+.|+++|.+.-+ +..-|++|.
T Consensus 36 r~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l 77 (177)
T PF10602_consen 36 RMALEDLADHYCKIGDLEEALKAYSRARDYCTSPGHKIDMCL 77 (177)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHH
Confidence 36688999999999999999999987544 445555543
No 140
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=68.28 E-value=9 Score=21.83 Aligned_cols=52 Identities=15% Similarity=0.112 Sum_probs=25.4
Q ss_pred HHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChH--------HHHHHHHhcCCHHHHHHHH
Q psy11102 20 KLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCE--------QAVYVFDKNKHKSSQWLTV 81 (87)
Q Consensus 20 k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~--------~Av~aylk~gd~k~ai~~c 81 (87)
.+..+|...|+|+.=.. .+.+.-..+.+.|.-. .--.+|.+.|+++.|++..
T Consensus 10 ~la~~~~~~~~~~~A~~----------~~~~al~~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~ 69 (78)
T PF13424_consen 10 NLARVYRELGRYDEALD----------YYEKALDIEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYY 69 (78)
T ss_dssp HHHHHHHHTT-HHHHHH----------HHHHHHHHHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHH
T ss_pred HHHHHHHHcCCHHHHHH----------HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHH
Confidence 44556666666653332 2334444455555321 1234566666666666554
No 141
>PF04097 Nic96: Nup93/Nic96; InterPro: IPR007231 Nup93/Nic96 is a component of the nuclear pore complex. It is required for the correct assembly of the nuclear pore complex []. In Saccharomyces cerevisiae, Nic96 has been shown to be involved in the distribution and cellular concentration of the GTPase Gsp1 []. The structure of Nic96 has revealed a mostly alpha helical structure [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 2QX5_B 2RFO_A.
Probab=67.79 E-value=20 Score=29.39 Aligned_cols=36 Identities=22% Similarity=0.272 Sum_probs=22.3
Q ss_pred HHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhC
Q psy11102 6 SQAREYYEKCHCYEKLISVYTELGDFEALESCARKL 41 (87)
Q Consensus 6 ~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L 41 (87)
..|+.-....|+++.||.+|-..|+||...+++..+
T Consensus 418 ~~~A~~~e~~g~~~dAi~Ly~La~~~d~vl~lln~~ 453 (613)
T PF04097_consen 418 EQAAREAEERGRFEDAILLYHLAEEYDKVLSLLNRL 453 (613)
T ss_dssp HHHHHHHHHCT-HHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHCCCHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 455666666777777777777777777766665543
No 142
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=67.52 E-value=26 Score=25.37 Aligned_cols=48 Identities=17% Similarity=0.335 Sum_probs=29.8
Q ss_pred HHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 23 SVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 23 e~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
.++...|+++.-++..+.. |++......+|..+...|..++|++.|.+
T Consensus 122 ~~~~~~G~~~~A~~~~~~al~~~p~~~~~~~~la~i~~~~g~~~eA~~~l~~ 173 (355)
T cd05804 122 FGLEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMES 173 (355)
T ss_pred HHHHHcCCHHHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHh
Confidence 3556666776655555542 33445566777777777777777766544
No 143
>KOG0553|consensus
Probab=67.43 E-value=12 Score=28.79 Aligned_cols=46 Identities=24% Similarity=0.256 Sum_probs=26.4
Q ss_pred HHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHH
Q psy11102 13 EKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYV 67 (87)
Q Consensus 13 ~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~a 67 (87)
.+.++|+.|+++|-..=.+ .|.+.=....=|.-..+.|+|+.||+.
T Consensus 92 m~~~~Y~eAv~kY~~AI~l---------~P~nAVyycNRAAAy~~Lg~~~~AVkD 137 (304)
T KOG0553|consen 92 MKNKDYQEAVDKYTEAIEL---------DPTNAVYYCNRAAAYSKLGEYEDAVKD 137 (304)
T ss_pred HHhhhHHHHHHHHHHHHhc---------CCCcchHHHHHHHHHHHhcchHHHHHH
Confidence 4556677777776543221 144444455556666677777777653
No 144
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=67.40 E-value=18 Score=28.51 Aligned_cols=56 Identities=16% Similarity=0.215 Sum_probs=37.9
Q ss_pred HHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC--CchHHHHHHHHHhCCChHHHHHHHHhcCC
Q psy11102 11 YYEKCHCYEKLISVYTELGDFEALESCARKLPDS--SPLLKPMGEIFVKYGLCEQAVYVFDKNKH 73 (87)
Q Consensus 11 yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~--~~lL~~ia~~F~~~G~~~~Av~aylk~gd 73 (87)
+|-+-|..++||.+...+-+- .+|+.. .-.|.++|.-|.++|+.+.|-+.|.-.-|
T Consensus 78 LfRsRGEvDRAIRiHQ~L~~s-------pdlT~~qr~lAl~qL~~Dym~aGl~DRAE~~f~~L~d 135 (389)
T COG2956 78 LFRSRGEVDRAIRIHQTLLES-------PDLTFEQRLLALQQLGRDYMAAGLLDRAEDIFNQLVD 135 (389)
T ss_pred HHHhcchHHHHHHHHHHHhcC-------CCCchHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHhc
Confidence 456667777777766543220 011211 23488999999999999999998887766
No 145
>KOG2003|consensus
Probab=67.25 E-value=21 Score=29.75 Aligned_cols=61 Identities=25% Similarity=0.350 Sum_probs=43.5
Q ss_pred chhHHHHHHHHHHhc--------cHHHHHHHHHHhCCHHHHH----HHHhhCCCCCchHHHHHHHHHhCCChH
Q psy11102 2 TTLVSQAREYYEKCH--------CYEKLISVYTELGDFEALE----SCARKLPDSSPLLKPMGEIFVKYGLCE 62 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~--------n~~k~ie~~~~~ed~d~L~----~l~~~L~~~~~lL~~ia~~F~~~G~~~ 62 (87)
+|.|++|+.|++++- =.-+.+.|+-+.|.|+.-- .+-|..|++-..|.-+-..-...||-+
T Consensus 639 tqf~ekai~y~ekaaliqp~~~kwqlmiasc~rrsgnyqka~d~yk~~hrkfpedldclkflvri~~dlgl~d 711 (840)
T KOG2003|consen 639 TQFSEKAINYFEKAALIQPNQSKWQLMIASCFRRSGNYQKAFDLYKDIHRKFPEDLDCLKFLVRIAGDLGLKD 711 (840)
T ss_pred hHHHHHHHHHHHHHHhcCccHHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCccchHHHHHHHHHhccccchh
Confidence 688999999998764 2335567777888876654 455667887666777666667778754
No 146
>KOG2168|consensus
Probab=66.72 E-value=9.2 Score=32.99 Aligned_cols=38 Identities=11% Similarity=0.123 Sum_probs=35.4
Q ss_pred CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 45 SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 45 ~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
++++.+-|..+...|.++.|+..|..+|++..|+.+..
T Consensus 622 ~~i~~~vA~~a~~~G~~~~sI~LY~lag~yd~al~lin 659 (835)
T KOG2168|consen 622 QKIILEVASEADEDGLFEDAILLYHLAGDYDKALELIN 659 (835)
T ss_pred HHHHHHHHHHHHhcCCHHHHHHHHHHhhhhhHHHHHHH
Confidence 68889999999999999999999999999999998754
No 147
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=66.67 E-value=37 Score=27.58 Aligned_cols=37 Identities=14% Similarity=0.005 Sum_probs=21.3
Q ss_pred chHHHHHHHHHhCCChHHHHH---------------------HHHhcCCHHHHHHHHh
Q psy11102 46 PLLKPMGEIFVKYGLCEQAVY---------------------VFDKNKHKSSQWLTVV 82 (87)
Q Consensus 46 ~lL~~ia~~F~~~G~~~~Av~---------------------aylk~gd~k~ai~~cv 82 (87)
++...=|..+...|....|.+ .++|+|+++.|.+++.
T Consensus 229 ely~~KarilKh~G~~~~Aa~~~~~Ar~LD~~DRyiNsK~aKy~LRa~~~e~A~~~~~ 286 (517)
T PF12569_consen 229 ELYMTKARILKHAGDLKEAAEAMDEARELDLADRYINSKCAKYLLRAGRIEEAEKTAS 286 (517)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHhCChhhHHHHHHHHHHHHHCCCHHHHHHHHH
Confidence 445555555555555555544 3566666666666653
No 148
>KOG2376|consensus
Probab=66.26 E-value=16 Score=30.66 Aligned_cols=53 Identities=13% Similarity=0.216 Sum_probs=46.4
Q ss_pred HHHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 18 YEKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+||| .|.|+++.-|.--+....++.. ..+|.-=|..|-..|.+++|+..|...
T Consensus 83 fEKA-Yc~Yrlnk~Dealk~~~~~~~~~~~ll~L~AQvlYrl~~ydealdiY~~L 136 (652)
T KOG2376|consen 83 FEKA-YCEYRLNKLDEALKTLKGLDRLDDKLLELRAQVLYRLERYDEALDIYQHL 136 (652)
T ss_pred HHHH-HHHHHcccHHHHHHHHhcccccchHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 3664 6999999999988888888875 568999999999999999999999877
No 149
>KOG0985|consensus
Probab=64.93 E-value=32 Score=31.30 Aligned_cols=75 Identities=15% Similarity=0.179 Sum_probs=57.0
Q ss_pred chhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC-C-CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPD-S-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQW 78 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~-~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai 78 (87)
++-+++|-..|.+-.-...++..+. ++...|.+-..=... + ..+-.++|.-=..-|+...|++.|+|+.||-.=+
T Consensus 1061 ~~LyEEAF~ifkkf~~n~~A~~VLi--e~i~~ldRA~efAe~~n~p~vWsqlakAQL~~~~v~dAieSyikadDps~y~ 1137 (1666)
T KOG0985|consen 1061 NQLYEEAFAIFKKFDMNVSAIQVLI--ENIGSLDRAYEFAERCNEPAVWSQLAKAQLQGGLVKDAIESYIKADDPSNYL 1137 (1666)
T ss_pred hhHHHHHHHHHHHhcccHHHHHHHH--HHhhhHHHHHHHHHhhCChHHHHHHHHHHHhcCchHHHHHHHHhcCCcHHHH
Confidence 4568899999999998899988875 344444444433333 3 4667889999999999999999999999987543
No 150
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised.
Probab=64.79 E-value=34 Score=21.86 Aligned_cols=59 Identities=15% Similarity=0.141 Sum_probs=30.9
Q ss_pred HHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 21 LISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
+..||... +-..|..++|.- |+=.+.++-..+.+.|++..=+..|-..|.+++|+++-.
T Consensus 5 Llk~Yl~~-~~~~l~~llr~~--N~C~~~~~e~~L~~~~~~~eL~~lY~~kg~h~~AL~ll~ 63 (108)
T PF10366_consen 5 LLKCYLET-NPSLLGPLLRLP--NYCDLEEVEEVLKEHGKYQELVDLYQGKGLHRKALELLK 63 (108)
T ss_pred HHHHHHHh-CHHHHHHHHccC--CcCCHHHHHHHHHHcCCHHHHHHHHHccCccHHHHHHHH
Confidence 34555555 555566666532 322344455555555555555555555555555555443
No 151
>KOG0495|consensus
Probab=64.47 E-value=13 Score=32.00 Aligned_cols=67 Identities=22% Similarity=0.257 Sum_probs=46.2
Q ss_pred hHHHHHHHHHHhccHHHHHHHHH-------HhCCHHH----HHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYT-------ELGDFEA----LESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~-------~~ed~d~----L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
.++.|..++.|++-...-...++ -++.+++ |++..++.|+-+++-...|..|-+.++.+.|.++|+-
T Consensus 633 e~eraR~llakar~~sgTeRv~mKs~~~er~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~ 710 (913)
T KOG0495|consen 633 ELERARDLLAKARSISGTERVWMKSANLERYLDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQ 710 (913)
T ss_pred cHHHHHHHHHHHhccCCcchhhHHHhHHHHHhhhHHHHHHHHHHHHHhCCchHHHHHHHhHHHHHHHHHHHHHHHHHh
Confidence 45566666666554322222222 2333333 5666777788899999999999999999999999974
No 152
>KOG1840|consensus
Probab=64.40 E-value=22 Score=28.93 Aligned_cols=63 Identities=14% Similarity=0.181 Sum_probs=37.0
Q ss_pred HHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCC----chHHHHHHHHHhCCChHHHHHHHHhcCCHH
Q psy11102 8 AREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSS----PLLKPMGEIFVKYGLCEQAVYVFDKNKHKS 75 (87)
Q Consensus 8 A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~----~lL~~ia~~F~~~G~~~~Av~aylk~gd~k 75 (87)
.+..|...|.+++++..+-+. |..+++.....+ ..+..+|..+.+.|-+.+||..|.++=++.
T Consensus 205 La~~y~~~g~~e~A~~l~k~A-----l~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~ 271 (508)
T KOG1840|consen 205 LAEMYAVQGRLEKAEPLCKQA-----LRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIR 271 (508)
T ss_pred HHHHHHHhccHHHHHHHHHHH-----HHHHHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHH
Confidence 344455555555555544332 222222222234 445568889999999999999998764443
No 153
>PF13041 PPR_2: PPR repeat family
Probab=64.35 E-value=8.9 Score=20.39 Aligned_cols=21 Identities=5% Similarity=-0.141 Sum_probs=12.6
Q ss_pred HHHHHHhcCCHHHHHHHHhhc
Q psy11102 64 AVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 64 Av~aylk~gd~k~ai~~cv~~ 84 (87)
-+.+|.+.|++++|+++.-++
T Consensus 9 li~~~~~~~~~~~a~~l~~~M 29 (50)
T PF13041_consen 9 LISGYCKAGKFEEALKLFKEM 29 (50)
T ss_pred HHHHHHHCcCHHHHHHHHHHH
Confidence 355666666666666665443
No 154
>KOG0307|consensus
Probab=63.42 E-value=21 Score=31.66 Aligned_cols=74 Identities=16% Similarity=0.186 Sum_probs=50.7
Q ss_pred HHHHHHhccH-HHHHHHHHHhCCHHHHHHH-------------HhhCC-C-CCchHHHHHHHHHhCC-ChHHHHHHHHhc
Q psy11102 9 REYYEKCHCY-EKLISVYTELGDFEALESC-------------ARKLP-D-SSPLLKPMGEIFVKYG-LCEQAVYVFDKN 71 (87)
Q Consensus 9 ~~yY~~~~n~-~k~ie~~~~~ed~d~L~~l-------------~~~L~-~-~~~lL~~ia~~F~~~G-~~~~Av~aylk~ 71 (87)
-+||.+.++- .++|-|+.. .|++.+.+. +.-.+ + -.+++-.+|..+...| .-..|+-+|+-.
T Consensus 536 ~~Y~~k~~~k~s~li~a~v~-~d~~~~ve~~~~k~Wke~la~i~t~~~~~~~~elc~~Lg~rl~~~g~~~~~a~lcYi~a 614 (1049)
T KOG0307|consen 536 DKYLAKSNSKLSRLIYAMVN-RDLDDYVETCEVKQWKETLAAICTYAQTDEFSELCDMLGDRLENAGDLTSAAILCYICA 614 (1049)
T ss_pred HHHHHHhCChHHHHHHHHHh-hhHHHHHhhcchhhHHHHHHHHHHhcchhhHHHHHHHHHHHHhhccchhhhhhHHhhhc
Confidence 4577766654 444445444 334443333 33334 2 3799999999999999 888999999999
Q ss_pred CCHHHHHHHHhh
Q psy11102 72 KHKSSQWLTVVQ 83 (87)
Q Consensus 72 gd~k~ai~~cv~ 83 (87)
|.+.+++.+-.+
T Consensus 615 gsv~k~v~~w~~ 626 (1049)
T KOG0307|consen 615 GSVDKLVEIWLK 626 (1049)
T ss_pred cChhhhHHHHHH
Confidence 999998876544
No 155
>KOG1585|consensus
Probab=63.22 E-value=14 Score=28.16 Aligned_cols=32 Identities=16% Similarity=0.147 Sum_probs=25.0
Q ss_pred CchHHHHHHHHHhCCChHHHHHHHHhcCCHHH
Q psy11102 45 SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSS 76 (87)
Q Consensus 45 ~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ 76 (87)
.+++.+++..+..+..+++|..+|+|-|..-.
T Consensus 150 ~el~gk~sr~lVrl~kf~Eaa~a~lKe~~~~~ 181 (308)
T KOG1585|consen 150 FELYGKCSRVLVRLEKFTEAATAFLKEGVAAD 181 (308)
T ss_pred HHHHHHhhhHhhhhHHhhHHHHHHHHhhhHHH
Confidence 47788888888888888888888888776543
No 156
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=63.05 E-value=14 Score=19.63 Aligned_cols=29 Identities=24% Similarity=0.283 Sum_probs=23.0
Q ss_pred hHHHHHHHHHhCCChHHHHHHHHhcCCHH
Q psy11102 47 LLKPMGEIFVKYGLCEQAVYVFDKNKHKS 75 (87)
Q Consensus 47 lL~~ia~~F~~~G~~~~Av~aylk~gd~k 75 (87)
.+..+|+.....+.+++|++-|.++=+++
T Consensus 3 v~~~Lgeisle~e~f~qA~~D~~~aL~i~ 31 (38)
T PF10516_consen 3 VYDLLGEISLENENFEQAIEDYEKALEIQ 31 (38)
T ss_pred HHHHHHHHHHHhccHHHHHHHHHHHHHHH
Confidence 46678999999999999999887654433
No 157
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=63.01 E-value=12 Score=15.94 Aligned_cols=23 Identities=26% Similarity=0.559 Sum_probs=17.2
Q ss_pred HHHHHHHHHhCCChHHHHHHHHh
Q psy11102 48 LKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 48 L~~ia~~F~~~G~~~~Av~aylk 70 (87)
...+|..+...|..+.|+..|.+
T Consensus 4 ~~~~a~~~~~~~~~~~a~~~~~~ 26 (34)
T smart00028 4 LYNLGNAYLKLGDYDEALEYYEK 26 (34)
T ss_pred HHHHHHHHHHHhhHHHHHHHHHH
Confidence 44677788888888888887743
No 158
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=62.36 E-value=13 Score=18.67 Aligned_cols=26 Identities=15% Similarity=0.395 Sum_probs=21.2
Q ss_pred CCCCCchHHHHHHHHHhCCChHHHHH
Q psy11102 41 LPDSSPLLKPMGEIFVKYGLCEQAVY 66 (87)
Q Consensus 41 L~~~~~lL~~ia~~F~~~G~~~~Av~ 66 (87)
=|++......+|..+...|..++|++
T Consensus 9 ~P~n~~a~~nla~~~~~~g~~~~A~~ 34 (34)
T PF13431_consen 9 NPNNAEAYNNLANLYLNQGDYEEAIA 34 (34)
T ss_pred CCCCHHHHHHHHHHHHHCcCHHhhcC
Confidence 36677889999999999998888863
No 159
>KOG1174|consensus
Probab=62.21 E-value=35 Score=27.99 Aligned_cols=66 Identities=17% Similarity=0.170 Sum_probs=53.0
Q ss_pred HHHHHHHHHhc--------cHHHHHHHHHHhCCHHHHHHHHhh-CCC--CCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 6 SQAREYYEKCH--------CYEKLISVYTELGDFEALESCARK-LPD--SSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 6 ~~A~~yY~~~~--------n~~k~ie~~~~~ed~d~L~~l~~~-L~~--~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+||.++|.++= -.-+.+|.+.+.|-+.....|.+. |-. ++.|-..+|+.|+-+.+.+.|.+-|-++
T Consensus 421 EKAKkf~ek~L~~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~~D~~LH~~Lgd~~~A~Ne~Q~am~~y~~A 497 (564)
T KOG1174|consen 421 EKAKKFAEKSLKINPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLIIFPDVNLHNHLGDIMRAQNEPQKAMEYYYKA 497 (564)
T ss_pred HHHHHHHHhhhccCCccHHHHHHHHHHHHhhCccchHHHHHHHHHhhccccHHHHHHHHHHHHhhhHHHHHHHHHHH
Confidence 67888888763 345778999999988888877776 443 4577889999999999999999999764
No 160
>KOG4121|consensus
Probab=61.90 E-value=47 Score=29.66 Aligned_cols=38 Identities=21% Similarity=0.287 Sum_probs=34.4
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCc
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSP 46 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~ 46 (87)
.++..+-|.++.||.+.=+-.||+.|+.+++.|+++++
T Consensus 779 lq~L~~vg~~e~Ai~iAEKY~DfqsLV~lcdqld~kdr 816 (1128)
T KOG4121|consen 779 LQVLCKVGQYEQAIQIAEKYKDFQSLVQLCDQLDQKDR 816 (1128)
T ss_pred HHHHHhcchHHHHHHHHHHhhhHHHHHHHHHhhCchhH
Confidence 56788899999999999999999999999999988764
No 161
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=61.38 E-value=56 Score=26.42 Aligned_cols=49 Identities=6% Similarity=-0.123 Sum_probs=34.1
Q ss_pred HHHHHhCCHHHHHHHHhh---CCCC-Cc---hHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 23 SVYTELGDFEALESCARK---LPDS-SP---LLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 23 e~~~~~ed~d~L~~l~~~---L~~~-~~---lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
..|+..|+|++-....+. +..+ .+ ....+|.-+...|..++|+++|.++
T Consensus 83 ~AL~~lGryeEAIa~f~rALeL~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrA 138 (453)
T PLN03098 83 LSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTA 138 (453)
T ss_pred HHHHHcCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHH
Confidence 345555666555555433 2333 32 3899999999999999999999875
No 162
>KOG2114|consensus
Probab=61.04 E-value=45 Score=29.24 Aligned_cols=63 Identities=21% Similarity=0.144 Sum_probs=46.5
Q ss_pred HHHHHHHHHhCCHHHHHHHHhhCCCCCc--hHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHH
Q psy11102 19 EKLISVYTELGDFEALESCARKLPDSSP--LLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTV 81 (87)
Q Consensus 19 ~k~ie~~~~~ed~d~L~~l~~~L~~~~~--lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~c 81 (87)
.-++.||.+++|.++|.+.++.-+++-- -+...-+.++..+-.++|...=.|.+...-.++.-
T Consensus 435 tlLLncYiKlkd~~kL~efI~~~~~g~~~fd~e~al~Ilr~snyl~~a~~LA~k~~~he~vl~il 499 (933)
T KOG2114|consen 435 TLLLNCYIKLKDVEKLTEFISKCDKGEWFFDVETALEILRKSNYLDEAELLATKFKKHEWVLDIL 499 (933)
T ss_pred HHHHHHHHHhcchHHHHHHHhcCCCcceeeeHHHHHHHHHHhChHHHHHHHHHHhccCHHHHHHH
Confidence 4467899999999999999999996532 26666677777777777777766666654444443
No 163
>COG2909 MalT ATP-dependent transcriptional regulator [Transcription]
Probab=60.94 E-value=22 Score=31.01 Aligned_cols=44 Identities=18% Similarity=0.050 Sum_probs=37.6
Q ss_pred CCCC-CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 41 LPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 41 L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
+|.. .++...-+.+|...|+..+||.==+++||+..|-++-.++
T Consensus 342 ~~~~~~~lH~~Aa~w~~~~g~~~eAI~hAlaA~d~~~aa~lle~~ 386 (894)
T COG2909 342 LAARLKELHRAAAEWFAEHGLPSEAIDHALAAGDPEMAADLLEQL 386 (894)
T ss_pred cCCchhHHHHHHHHHHHhCCChHHHHHHHHhCCCHHHHHHHHHhh
Confidence 3444 5788889999999999999999999999999998876543
No 164
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=60.67 E-value=27 Score=26.37 Aligned_cols=58 Identities=17% Similarity=0.133 Sum_probs=36.7
Q ss_pred HHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 18 YEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
++++++.+..-+++|.+.+....|-.-+|--+. ++.+..++|.+.|+.-.||+.|-++
T Consensus 156 l~~lae~~~~~~~~~~~~~~l~~Li~~dp~~E~---------~~~~lm~~y~~~g~~~~ai~~y~~l 213 (280)
T COG3629 156 LTKLAEALIACGRADAVIEHLERLIELDPYDEP---------AYLRLMEAYLVNGRQSAAIRAYRQL 213 (280)
T ss_pred HHHHHHHHHhcccHHHHHHHHHHHHhcCccchH---------HHHHHHHHHHHcCCchHHHHHHHHH
Confidence 457888888888888888888877443332222 2445566666666666666665443
No 165
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=60.51 E-value=10 Score=17.73 Aligned_cols=22 Identities=18% Similarity=0.389 Sum_probs=13.2
Q ss_pred HHHHHHHHhCCChHHHHHHHHh
Q psy11102 49 KPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 49 ~~ia~~F~~~G~~~~Av~aylk 70 (87)
-.+|..+...|..++|++.|.+
T Consensus 4 ~~~a~~~~~~g~~~~A~~~~~~ 25 (33)
T PF13174_consen 4 YRLARCYYKLGDYDEAIEYFQR 25 (33)
T ss_dssp HHHHHHHHHHCHHHHHHHHHHH
T ss_pred HHHHHHHHHccCHHHHHHHHHH
Confidence 3456666666666666666543
No 166
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=60.42 E-value=31 Score=24.85 Aligned_cols=20 Identities=10% Similarity=-0.155 Sum_probs=10.4
Q ss_pred HHHHHHHHhCCChHHHHHHH
Q psy11102 49 KPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 49 ~~ia~~F~~~G~~~~Av~ay 68 (87)
-.+|.-+-+.|..++|+..|
T Consensus 73 l~la~ayy~~~~y~~A~~~~ 92 (243)
T PRK10866 73 LDLIYAYYKNADLPLAQAAI 92 (243)
T ss_pred HHHHHHHHhcCCHHHHHHHH
Confidence 34455555555555555544
No 167
>KOG1840|consensus
Probab=60.37 E-value=18 Score=29.48 Aligned_cols=69 Identities=19% Similarity=0.272 Sum_probs=47.3
Q ss_pred hhHHHHHHHHHHhccH----------------HHHHHHHHHhCCHHHHHHHHhhC-------C----CC-CchHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCY----------------EKLISVYTELGDFEALESCARKL-------P----DS-SPLLKPMGEI 54 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~----------------~k~ie~~~~~ed~d~L~~l~~~L-------~----~~-~~lL~~ia~~ 54 (87)
+++++|+.+|.++=.. ..+...|+..|+|++-+..+... + +. ...|..++..
T Consensus 255 ~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~~~~ 334 (508)
T KOG1840|consen 255 GKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSELAAI 334 (508)
T ss_pred ccHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhccCChHHHHHHHHHHHHHHHHhhccChHHHHHHHHHHHHH
Confidence 4567777777765332 34566677888887766554432 1 12 3678899999
Q ss_pred HHhCCChHHHHHHHHhc
Q psy11102 55 FVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 55 F~~~G~~~~Av~aylk~ 71 (87)
+.+.+-+++|+..|-+.
T Consensus 335 ~~~~~~~Eea~~l~q~a 351 (508)
T KOG1840|consen 335 LQSMNEYEEAKKLLQKA 351 (508)
T ss_pred HHHhcchhHHHHHHHHH
Confidence 99999999999877644
No 168
>PF15469 Sec5: Exocyst complex component Sec5
Probab=59.77 E-value=32 Score=23.38 Aligned_cols=40 Identities=15% Similarity=0.045 Sum_probs=27.6
Q ss_pred chhHHHHHHHHHHhccHHHHH--HHHHHhCCHHHHHHHHhhC
Q psy11102 2 TTLVSQAREYYEKCHCYEKLI--SVYTELGDFEALESCARKL 41 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~i--e~~~~~ed~d~L~~l~~~L 41 (87)
.++|+.|+..|.++...-.-. +.-.-..-|++.+.++.++
T Consensus 99 ~~dy~~~i~dY~kak~l~~~~~~~~~vf~~v~~eve~ii~~~ 140 (182)
T PF15469_consen 99 KGDYDQAINDYKKAKSLFEKYKQQVPVFQKVWSEVEKIIEEF 140 (182)
T ss_pred cCcHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHH
Confidence 478899999998888765554 4445555667777666655
No 169
>KOG4626|consensus
Probab=59.20 E-value=7.3 Score=33.33 Aligned_cols=30 Identities=23% Similarity=0.248 Sum_probs=26.6
Q ss_pred CCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 42 PDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 42 ~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
|+.-..+..+|..++..|..++|..+|+|+
T Consensus 181 P~l~ca~s~lgnLlka~Grl~ea~~cYlkA 210 (966)
T KOG4626|consen 181 PDLYCARSDLGNLLKAEGRLEEAKACYLKA 210 (966)
T ss_pred cchhhhhcchhHHHHhhcccchhHHHHHHH
Confidence 555677899999999999999999999985
No 170
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=57.30 E-value=40 Score=25.31 Aligned_cols=51 Identities=18% Similarity=0.193 Sum_probs=41.7
Q ss_pred HHHHHHHhCCHHHHHHHHhhC----CCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 21 LISVYTELGDFEALESCARKL----PDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L----~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
.+...++.|+|..-+...+.+ |++.+.....|--+.+.|..++|-.+|.+.
T Consensus 106 ~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qA 160 (257)
T COG5010 106 QGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQA 160 (257)
T ss_pred HHHHHHHhcchHHHHHHHHHHhccCCCChhhhhHHHHHHHHccChhHHHHHHHHH
Confidence 445567889998888877775 445688999999999999999999999765
No 171
>cd02576 PseudoU_synth_ScPUS7 PseudoU_synth_ScPUS7: Pseudouridine synthase, TruD family. This group consists of eukaryotic pseudouridine synthases similar to Saccharomyces cerevisiae Pus7. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). Saccharomyces cerevisiae Pus7 makes psi35 in U2 small nuclear RNA (U2 snRNA), psi13 in cytoplasmic tRNAs and psi35 in pre-tRNATyr. Psi35 in yeast U2 snRNA and psi13 in tRNAs are highly phylogenetically conserved. Psi34 is the mammalian U2 snRNA counterpart of yeast U2 snRNA psi35.
Probab=56.46 E-value=19 Score=27.60 Aligned_cols=37 Identities=16% Similarity=0.034 Sum_probs=28.7
Q ss_pred chhHHHHHHHHHHhc------cHHHHHHHHHHhCCHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCH------CYEKLISVYTELGDFEALESCA 38 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~------n~~k~ie~~~~~ed~d~L~~l~ 38 (87)
.++|++|+.+|.... +..++-+.+...+|++.-.++.
T Consensus 170 ~~~~~~Av~~~l~~~~~~~~~~~~~~r~~~~~~~~~~~~l~~~ 212 (371)
T cd02576 170 KENWKEAVDLILKPRSELEEGDLVEAREIWKETGDAKAALKKL 212 (371)
T ss_pred hhhHHHHHHHHhCCCcccccHHHHHHHHHHHhcCCHHHHHHhc
Confidence 378999999999875 7777888888888887755443
No 172
>KOG4340|consensus
Probab=56.10 E-value=7.9 Score=30.51 Aligned_cols=46 Identities=26% Similarity=0.336 Sum_probs=36.5
Q ss_pred HHHHhCCHHHHHHHHhhCC-CC-CchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 24 VYTELGDFEALESCARKLP-DS-SPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 24 ~~~~~ed~d~L~~l~~~L~-~~-~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
+.|..+|+.+-..|+..+| ++ ...+...|..+-+.|.+|.||+-|-
T Consensus 121 IkYse~Dl~g~rsLveQlp~en~Ad~~in~gCllykegqyEaAvqkFq 168 (459)
T KOG4340|consen 121 IKYSEGDLPGSRSLVEQLPSENEADGQINLGCLLYKEGQYEAAVQKFQ 168 (459)
T ss_pred HhcccccCcchHHHHHhccCCCccchhccchheeeccccHHHHHHHHH
Confidence 3455566666677888888 56 4778899999999999999998774
No 173
>KOG2076|consensus
Probab=55.68 E-value=53 Score=28.77 Aligned_cols=68 Identities=21% Similarity=0.314 Sum_probs=43.5
Q ss_pred hhHHHHHHHHHHhc---------cHHHHHHHHHHhCCHHHH----HHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 3 TLVSQAREYYEKCH---------CYEKLISVYTELGDFEAL----ESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 3 ~~w~~A~~yY~~~~---------n~~k~ie~~~~~ed~d~L----~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
+.+..|..||...- -+-+.++||-.+|-++.- ++++...|++......++......|..|+|+++..
T Consensus 428 ~~~~~Al~~l~~i~~~~~~~~~~vw~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~~D~Ri~Lasl~~~~g~~EkalEtL~ 507 (895)
T KOG2076|consen 428 GKYKEALRLLSPITNREGYQNAFVWYKLARCYMELGEYEEAIEFYEKVLILAPDNLDARITLASLYQQLGNHEKALETLE 507 (895)
T ss_pred ccHHHHHHHHHHHhcCccccchhhhHHHHHHHHHHhhHHHHHHHHHHHHhcCCCchhhhhhHHHHHHhcCCHHHHHHHHh
Confidence 34555555555432 234566666666666543 34455556666777788888888888888888876
Q ss_pred h
Q psy11102 70 K 70 (87)
Q Consensus 70 k 70 (87)
.
T Consensus 508 ~ 508 (895)
T KOG2076|consen 508 Q 508 (895)
T ss_pred c
Confidence 6
No 174
>PF10366 Vps39_1: Vacuolar sorting protein 39 domain 1; InterPro: IPR019452 This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised.
Probab=55.40 E-value=28 Score=22.26 Aligned_cols=47 Identities=17% Similarity=0.270 Sum_probs=29.1
Q ss_pred HHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCC
Q psy11102 26 TELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKH 73 (87)
Q Consensus 26 ~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd 73 (87)
.+...+=.+++....|-+ +....+++..+.+.|++++|.+...+.++
T Consensus 21 lr~~N~C~~~~~e~~L~~-~~~~~eL~~lY~~kg~h~~AL~ll~~l~~ 67 (108)
T PF10366_consen 21 LRLPNYCDLEEVEEVLKE-HGKYQELVDLYQGKGLHRKALELLKKLAD 67 (108)
T ss_pred HccCCcCCHHHHHHHHHH-cCCHHHHHHHHHccCccHHHHHHHHHHhc
Confidence 333344444555444432 33466777777888888888877777766
No 175
>PRK15331 chaperone protein SicA; Provisional
Probab=55.09 E-value=31 Score=24.18 Aligned_cols=17 Identities=12% Similarity=0.194 Sum_probs=8.8
Q ss_pred HHHhccHHHHHHHHHHh
Q psy11102 12 YEKCHCYEKLISVYTEL 28 (87)
Q Consensus 12 Y~~~~n~~k~ie~~~~~ 28 (87)
|..-+++++||++|...
T Consensus 81 ~Q~~k~y~~Ai~~Y~~A 97 (165)
T PRK15331 81 CQLKKQFQKACDLYAVA 97 (165)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 34445555555555543
No 176
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=54.07 E-value=83 Score=24.91 Aligned_cols=43 Identities=16% Similarity=0.179 Sum_probs=30.3
Q ss_pred hCCHHH-HHHHHhhCCC---CCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 28 LGDFEA-LESCARKLPD---SSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 28 ~ed~d~-L~~l~~~L~~---~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
..++|. ++-+-+.|.. ....-..+|+.+...|.+++||+++.+
T Consensus 193 ~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~ 239 (389)
T COG2956 193 SSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALER 239 (389)
T ss_pred hhhHHHHHHHHHHHHhhCccceehhhhhhHHHHhccchHHHHHHHHH
Confidence 356666 4444455543 346677889999999999999987654
No 177
>PF12921 ATP13: Mitochondrial ATPase expression; InterPro: IPR024319 ATPase expression protein 2 (also known as ATP13 in some species) is necessary for the expression of subunit 9 of mitochondrial ATPase. The protein has a basic amino terminal signal sequence that is cleaved upon import into mitochondria [].
Probab=53.96 E-value=60 Score=21.24 Aligned_cols=50 Identities=20% Similarity=0.258 Sum_probs=39.0
Q ss_pred HHHHHHHHHhCCHHHHHHHHhhC--------------CC------CCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 19 EKLISVYTELGDFEALESCARKL--------------PD------SSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 19 ~k~ie~~~~~ed~d~L~~l~~~L--------------~~------~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.-.+-++.+.|+.+.++.++.+. +. +..+|.-|..-|...|....|+..-
T Consensus 6 ~~ii~al~r~g~~~~i~~~i~~~WgI~~~~~~~~~~~~~~spl~Pt~~lL~AIv~sf~~n~~i~~al~~v 75 (126)
T PF12921_consen 6 CNIIYALGRSGQLDSIKSYIKSVWGIDVNGKKKEGDYPPSSPLYPTSRLLIAIVHSFGYNGDIFSALKLV 75 (126)
T ss_pred HHHHHHHhhcCCHHHHHHHHHHhcCCCCCCccccCccCCCCCCCCCHHHHHHHHHHHHhcccHHHHHHHH
Confidence 45678899999999999998764 11 2367888999888888888887653
No 178
>KOG4162|consensus
Probab=52.46 E-value=24 Score=30.33 Aligned_cols=68 Identities=25% Similarity=0.287 Sum_probs=42.5
Q ss_pred hhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHH------HHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHC--------YEKLISVYTELGDFE------ALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d------~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
++|.+|-+-|--+-. ..-+++|+-+.|+=- -|...+|-=|.+++.=..+|..|++.|..++|++||
T Consensus 698 ~~~~EA~~af~~Al~ldP~hv~s~~Ala~~lle~G~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k~~Gd~~~Aaecf 777 (799)
T KOG4162|consen 698 GQLEEAKEAFLVALALDPDHVPSMTALAELLLELGSPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFKKLGDSKQAAECF 777 (799)
T ss_pred HhhHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCcchHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHccchHHHHHHH
Confidence 345555555544432 124455555555321 233333333667888889999999999999999999
Q ss_pred Hh
Q psy11102 69 DK 70 (87)
Q Consensus 69 lk 70 (87)
.-
T Consensus 778 ~a 779 (799)
T KOG4162|consen 778 QA 779 (799)
T ss_pred HH
Confidence 53
No 179
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=51.94 E-value=64 Score=20.96 Aligned_cols=55 Identities=13% Similarity=0.132 Sum_probs=37.8
Q ss_pred HHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC--CchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 8 AREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS--SPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 8 A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~--~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
.+..|-.-|+.+++|..|-+.-. .-|+.. ...+..+|.-+.+.|..++|+..+.+
T Consensus 7 ~A~a~d~~G~~~~Ai~~Y~~Al~--------~gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~ 63 (120)
T PF12688_consen 7 LAWAHDSLGREEEAIPLYRRALA--------AGLSGADRRRALIQLASTLRNLGRYDEALALLEE 63 (120)
T ss_pred HHHHHHhcCCHHHHHHHHHHHHH--------cCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 44556677888999988865422 234432 36677888888888888888877644
No 180
>PF07575 Nucleopor_Nup85: Nup85 Nucleoporin; InterPro: IPR011502 This is a family of nucleoporins conserved from yeast to human. Nup85 Nucleoporin is an essential component of the nuclear pore complex (NPC) that seems to be required for NPC assembly and maintenance. As part of the NPC Nup107-160 subcomplex plays a role in RNA export and in tethering NUP98/Nup98 and NUP153 to the nucleus. The Nup107-160 complex seems to be required for spindle assembly during mitosis. NUP85 is required for membrane clustering of CCL2-activated CCR2. Seems to be involved in CCR2-mediated chemotaxis of monocytes and may link activated CCR2 to the phosphatidyl-inositol-3-kinase-Rac-lammellipodium protrusion cascade [, , ]. ; PDB: 3F3F_D 3F3P_G 3F3G_G 3EWE_B.
Probab=51.91 E-value=9.1 Score=30.84 Aligned_cols=75 Identities=23% Similarity=0.204 Sum_probs=44.7
Q ss_pred hhHHHHHHHHHHhccHHH-HHHHHH------HhCCHHHHHHHHhh--CCCC-CchHHHHHHHHHhCCChHHHHHHHHhcC
Q psy11102 3 TLVSQAREYYEKCHCYEK-LISVYT------ELGDFEALESCARK--LPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k-~ie~~~------~~ed~d~L~~l~~~--L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~g 72 (87)
+-|.-|+.|...+++..+ .|+-+- -..+.+++-++|+. |++. ..+-+..|....+.|..-.|+.-|+|+|
T Consensus 373 ~lW~vai~yL~~c~~~g~~~i~~lL~~~p~~t~~~~~k~l~iC~~~~L~~~a~~I~~~~~~~~~~~~~~g~AL~~~~ra~ 452 (566)
T PF07575_consen 373 SLWQVAIGYLSSCPDEGRERIEELLPRVPLDTNDDAEKLLEICAELGLEDVAREICKILGQRLLKEGRYGEALSWFIRAG 452 (566)
T ss_dssp TTHHHHHHHHHS-SSS-HHHHHHHGGG----SHHHHHHHHHHHHHHT-HHHHHHHHHHHHHHHHHHHHHHHHHHHHH---
T ss_pred chHHHHHHHHHHCChhhHHHHHHHHhhCCCCchHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCC
Confidence 459999999999886442 222222 23445666666666 4453 4667778888888899999999999999
Q ss_pred CHHHH
Q psy11102 73 HKSSQ 77 (87)
Q Consensus 73 d~k~a 77 (87)
|....
T Consensus 453 d~~~v 457 (566)
T PF07575_consen 453 DYSLV 457 (566)
T ss_dssp -----
T ss_pred CHHHH
Confidence 98654
No 181
>PF08238 Sel1: Sel1 repeat; InterPro: IPR006597 Sel1-like repeats are tetratricopeptide repeat sequences originally identified in a Caenorhabditis elegans receptor molecule which is a key negative regulator of the Notch pathway []. Mammalian homologues have since been identified although these mainly pancreatic proteins have yet to have a function assigned.; PDB: 2XM6_A 3RJV_A 1OUV_A 1KLX_A.
Probab=51.87 E-value=14 Score=18.11 Aligned_cols=14 Identities=29% Similarity=0.370 Sum_probs=11.2
Q ss_pred hhHHHHHHHHHHhc
Q psy11102 3 TLVSQAREYYEKCH 16 (87)
Q Consensus 3 ~~w~~A~~yY~~~~ 16 (87)
.|+++|..||.++-
T Consensus 22 ~d~~~A~~~~~~Aa 35 (39)
T PF08238_consen 22 KDYEKAFKWYEKAA 35 (39)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred ccccchHHHHHHHH
Confidence 57889999998763
No 182
>PF14165 YtzH: YtzH-like protein
Probab=51.58 E-value=7.1 Score=24.77 Aligned_cols=59 Identities=8% Similarity=0.179 Sum_probs=38.2
Q ss_pred HHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhcCC
Q psy11102 21 LISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQDKP 86 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~~~ 86 (87)
..+||+-.--++.|++|+.+|-.+..+=..+-..+. =|..|.+-|....-++.++..++
T Consensus 18 q~DccgTvsEcEQieRLvksLm~n~~i~~~ik~~L~-------~Iy~ysq~G~~~~~l~~Hi~~~q 76 (87)
T PF14165_consen 18 QLDCCGTVSECEQIERLVKSLMANPNIDADIKQTLE-------EIYSYSQNGKYAQSLDNHIEFHQ 76 (87)
T ss_pred hhhccCcHHHHHHHHHHHHHHHcCCCcCHHHHHHHH-------HHHHHHccCccccchhhhhHhhH
Confidence 467888888999999999998543222222222222 23677777777777777776543
No 183
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=51.51 E-value=78 Score=22.73 Aligned_cols=13 Identities=8% Similarity=-0.064 Sum_probs=7.9
Q ss_pred HHhcCCHHHHHHH
Q psy11102 68 FDKNKHKSSQWLT 80 (87)
Q Consensus 68 ylk~gd~k~ai~~ 80 (87)
|.|.|.+.+|+.-
T Consensus 185 Y~~~~~y~AA~~r 197 (243)
T PRK10866 185 YTKRGAYVAVVNR 197 (243)
T ss_pred HHHcCchHHHHHH
Confidence 6666666666544
No 184
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=50.73 E-value=20 Score=26.64 Aligned_cols=69 Identities=16% Similarity=0.282 Sum_probs=49.5
Q ss_pred chhHHHHHHHHHHhccHHH---HHHHHHHhCCHHHHHHHHhhCCC-C-CchHHHHHHHHH---hCC-ChHHHHHHHHh
Q psy11102 2 TTLVSQAREYYEKCHCYEK---LISVYTELGDFEALESCARKLPD-S-SPLLKPMGEIFV---KYG-LCEQAVYVFDK 70 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k---~ie~~~~~ed~d~L~~l~~~L~~-~-~~lL~~ia~~F~---~~G-~~~~Av~aylk 70 (87)
.++++.|.++..++++.|- .+++|...+.+|--++..++..+ + +..|..+++-.. +.| -...|...|.-
T Consensus 115 ~~~~~~AL~~l~~~~~lE~~al~Vqi~L~~~R~dlA~k~l~~~~~~~eD~~l~qLa~awv~l~~g~e~~~~A~y~f~E 192 (290)
T PF04733_consen 115 EGDYEEALKLLHKGGSLELLALAVQILLKMNRPDLAEKELKNMQQIDEDSILTQLAEAWVNLATGGEKYQDAFYIFEE 192 (290)
T ss_dssp CCHHHHHHCCCTTTTCHHHHHHHHHHHHHTT-HHHHHHHHHHHHCCSCCHHHHHHHHHHHHHHHTTTCCCHHHHHHHH
T ss_pred cCCHHHHHHHHHccCcccHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHhCchhHHHHHHHHHH
Confidence 4788999999999988874 46899999999998888888865 3 566777776432 233 46666666543
No 185
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=50.51 E-value=26 Score=18.93 Aligned_cols=22 Identities=9% Similarity=-0.097 Sum_probs=17.7
Q ss_pred HHHHHhcCCHHHHHHHHhhcCC
Q psy11102 65 VYVFDKNKHKSSQWLTVVQDKP 86 (87)
Q Consensus 65 v~aylk~gd~k~ai~~cv~~~~ 86 (87)
+.--+..||+..|++.|.+.+|
T Consensus 8 i~~~i~~g~~~~a~~~~~~~~~ 29 (58)
T smart00668 8 IRELILKGDWDEALEWLSSLKP 29 (58)
T ss_pred HHHHHHcCCHHHHHHHHHHcCH
Confidence 4556778999999999988776
No 186
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=50.50 E-value=14 Score=17.79 Aligned_cols=14 Identities=36% Similarity=0.551 Sum_probs=10.4
Q ss_pred hhHHHHHHHHHHhc
Q psy11102 3 TLVSQAREYYEKCH 16 (87)
Q Consensus 3 ~~w~~A~~yY~~~~ 16 (87)
+|+.+|..||.++-
T Consensus 19 ~d~~~A~~~~~~Aa 32 (36)
T smart00671 19 KDLEKALEYYKKAA 32 (36)
T ss_pred cCHHHHHHHHHHHH
Confidence 57788888887764
No 187
>COG5110 RPN1 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=50.39 E-value=63 Score=27.57 Aligned_cols=74 Identities=23% Similarity=0.278 Sum_probs=51.9
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHH----------------HHHHhhCCC--CCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEAL----------------ESCARKLPD--SSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L----------------~~l~~~L~~--~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+.||-+-.--..||+.+...|-.+++ +.++.-||. +..+|+.+=..+.+.|...+||..-+|
T Consensus 187 vpfflkHNaE~dAiDlL~Evg~Iekv~~fVd~~n~~RvclYl~~cv~llp~pedVa~l~ta~~IYlk~~~lt~av~~aiR 266 (881)
T COG5110 187 VPFFLKHNAEFDAIDLLVEVGGIEKVLDFVDTHNYNRVCLYLEDCVPLLPPPEDVALLETALKIYLKMGDLTRAVVGAIR 266 (881)
T ss_pred hHHHHhcccchHHHHHHHHhcchhhhhhhhcccchhHHHHHHHHhhccCCChHHHHHHHHHHHHHHhhhHHHHHHHHHHh
Confidence 45555555566677777776665554 334444454 457788877888888888888888888
Q ss_pred cCCHHHHHHHHh
Q psy11102 71 NKHKSSQWLTVV 82 (87)
Q Consensus 71 ~gd~k~ai~~cv 82 (87)
.++-+.+++-|-
T Consensus 267 l~~~~~i~e~~~ 278 (881)
T COG5110 267 LQKSKEIIEYVR 278 (881)
T ss_pred cccHHHHHHHHH
Confidence 888888887664
No 188
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=50.03 E-value=65 Score=26.49 Aligned_cols=28 Identities=11% Similarity=0.114 Sum_probs=24.1
Q ss_pred ChHHHHHHHHhcCCHHHHHHH---HhhcCCC
Q psy11102 60 LCEQAVYVFDKNKHKSSQWLT---VVQDKPS 87 (87)
Q Consensus 60 ~~~~Av~aylk~gd~k~ai~~---cv~~~~~ 87 (87)
.|++=|.||++.+++..|++- ||.+||+
T Consensus 230 Ietklv~CYL~~rkpdlALnh~hrsI~lnP~ 260 (569)
T PF15015_consen 230 IETKLVTCYLRMRKPDLALNHSHRSINLNPS 260 (569)
T ss_pred HHHHHHHhhhhcCCCchHHHHHhhhhhcCcc
Confidence 578899999999999999984 6778875
No 189
>KOG1126|consensus
Probab=49.95 E-value=28 Score=29.31 Aligned_cols=73 Identities=22% Similarity=0.180 Sum_probs=52.2
Q ss_pred hhHHHHHHHHHHhccHHH----HHH----HHHHhCCHHHHHHHHhhCCC---CCch-HHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYEK----LIS----VYTELGDFEALESCARKLPD---SSPL-LKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k----~ie----~~~~~ed~d~L~~l~~~L~~---~~~l-L~~ia~~F~~~G~~~~Av~aylk 70 (87)
++|++|..+|...+..+. -.| .++++.|--+|.-|++.|=+ ++|. -.-+|+-|--.+.+++|+.+|.|
T Consensus 367 ~~Y~~a~~~F~~~r~~~p~rv~~meiyST~LWHLq~~v~Ls~Laq~Li~~~~~sPesWca~GNcfSLQkdh~~Aik~f~R 446 (638)
T KOG1126|consen 367 IEYDQAERIFSLVRRIEPYRVKGMEIYSTTLWHLQDEVALSYLAQDLIDTDPNSPESWCALGNCFSLQKDHDTAIKCFKR 446 (638)
T ss_pred HHHHHHHHHHHHHHhhccccccchhHHHHHHHHHHhhHHHHHHHHHHHhhCCCCcHHHHHhcchhhhhhHHHHHHHHHHH
Confidence 456666666666655432 233 34678888888888888743 3443 66789999999999999999998
Q ss_pred cCCHH
Q psy11102 71 NKHKS 75 (87)
Q Consensus 71 ~gd~k 75 (87)
+-+++
T Consensus 447 AiQld 451 (638)
T KOG1126|consen 447 AIQLD 451 (638)
T ss_pred hhccC
Confidence 76543
No 190
>KOG0890|consensus
Probab=49.59 E-value=66 Score=31.15 Aligned_cols=71 Identities=17% Similarity=0.158 Sum_probs=54.0
Q ss_pred chhHHHHHHHHHHhc-----------cHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 2 TTLVSQAREYYEKCH-----------CYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~-----------n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
.+.|..|+.|.++.. .+--+-.+|+...|.|+++.+....- .++-|.+--..+...|....|..||.+
T Consensus 1396 c~~y~RalmylEs~~~~ek~~~~~e~l~fllq~lY~~i~dpDgV~Gv~~~r~-a~~sl~~qil~~e~~g~~~da~~Cye~ 1474 (2382)
T KOG0890|consen 1396 CKAYARALMYLESHRSTEKEKETEEALYFLLQNLYGSIHDPDGVEGVSARRF-ADPSLYQQILEHEASGNWADAAACYER 1474 (2382)
T ss_pred hHHHHHHHHHHHHhccccchhHHHHHHHHHHHHHHHhcCCcchhhhHHHHhh-cCccHHHHHHHHHhhccHHHHHHHHHH
Confidence 467999999999821 12223349999999999999988622 234566777789999999999999988
Q ss_pred cCC
Q psy11102 71 NKH 73 (87)
Q Consensus 71 ~gd 73 (87)
+..
T Consensus 1475 ~~q 1477 (2382)
T KOG0890|consen 1475 LIQ 1477 (2382)
T ss_pred hhc
Confidence 754
No 191
>KOG2114|consensus
Probab=49.52 E-value=86 Score=27.59 Aligned_cols=67 Identities=16% Similarity=0.115 Sum_probs=53.6
Q ss_pred HHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 11 YYEKCHCYEKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 11 yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
.|.+++.....+.++....|.+.+..+++.+.+. ..+....=.+|++.+..+.=.+ .+++.++-+++
T Consensus 701 ~~~k~k~~~dl~~~~~q~~d~E~~it~~~~~g~~~p~l~~~~L~yF~~~~~i~~~~~------~v~~vl~~I~~ 768 (933)
T KOG2114|consen 701 SYEKLKEGQDLMLYFQQISDPETVITLCERLGKEDPSLWLHALKYFVSEESIEDCYE------IVYKVLEAIEM 768 (933)
T ss_pred HHhhccchHHHHHHHHHhhChHHHHHHHHHhCccChHHHHHHHHHHhhhcchhhHHH------HHHHHHHHHHh
Confidence 4888999999999999999999999999999886 5777888889999987765433 45555555544
No 192
>KOG4121|consensus
Probab=48.70 E-value=32 Score=30.65 Aligned_cols=36 Identities=25% Similarity=0.142 Sum_probs=32.2
Q ss_pred HHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhcCC
Q psy11102 51 MGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQDKP 86 (87)
Q Consensus 51 ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~~~ 86 (87)
.-..+.++|+.++|++.=.|..|+...+.+|-++.|
T Consensus 778 wlq~L~~vg~~e~Ai~iAEKY~DfqsLV~lcdqld~ 813 (1128)
T KOG4121|consen 778 WLQVLCKVGQYEQAIQIAEKYKDFQSLVQLCDQLDQ 813 (1128)
T ss_pred HHHHHHhcchHHHHHHHHHHhhhHHHHHHHHHhhCc
Confidence 334578999999999999999999999999998876
No 193
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=48.45 E-value=61 Score=23.49 Aligned_cols=22 Identities=27% Similarity=0.078 Sum_probs=19.2
Q ss_pred HHHHHHHHhcCCHHHHHHHHhh
Q psy11102 62 EQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 62 ~~Av~aylk~gd~k~ai~~cv~ 83 (87)
++||-.++|.|.+|.|.++--+
T Consensus 115 ~~aV~VCm~~g~Fk~A~eiLkr 136 (200)
T cd00280 115 EQAVAVCMENGEFKKAEEVLKR 136 (200)
T ss_pred HHHHHHHHhcCchHHHHHHHHH
Confidence 7999999999999999887543
No 194
>PF14718 SLT_L: Soluble lytic murein transglycosylase L domain; PDB: 1QTE_A 1SLY_A 1QSA_A.
Probab=48.43 E-value=56 Score=19.27 Aligned_cols=30 Identities=13% Similarity=0.099 Sum_probs=24.8
Q ss_pred CchHHHHHHHHHhCCChHHHHHHHHhcCCH
Q psy11102 45 SPLLKPMGEIFVKYGLCEQAVYVFDKNKHK 74 (87)
Q Consensus 45 ~~lL~~ia~~F~~~G~~~~Av~aylk~gd~ 74 (87)
.+.+...|.+-.+.|-++.||.+.++.|-.
T Consensus 39 ~~~~~aaA~~A~~~gW~d~AI~a~~~a~~W 68 (71)
T PF14718_consen 39 EEQQLAAAELAHQWGWHDRAIQAAIRAKYW 68 (71)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHTT-T
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHhhh
Confidence 466888999999999999999999998754
No 195
>PRK14574 hmsH outer membrane protein; Provisional
Probab=47.49 E-value=99 Score=26.62 Aligned_cols=56 Identities=13% Similarity=0.053 Sum_probs=41.9
Q ss_pred hHHHH----HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 4 LVSQA----REYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 4 ~w~~A----~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
+|..+ +.+|.-.|++.++.+. |+++...-|.+..++...|+.-...|.+..|.+.|
T Consensus 414 d~~~~~~l~a~~~~~~gdl~~Ae~~---------le~l~~~aP~n~~l~~~~A~v~~~Rg~p~~A~~~~ 473 (822)
T PRK14574 414 DWIEGQTLLVQSLVALNDLPTAQKK---------LEDLSSTAPANQNLRIALASIYLARDLPRKAEQEL 473 (822)
T ss_pred cHHHHHHHHHHHHHHcCCHHHHHHH---------HHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHH
Confidence 45555 5666666666666543 45556666888899999999999999999998776
No 196
>PF04121 Nup84_Nup100: Nuclear pore protein 84 / 107 ; InterPro: IPR007252 Nup84p forms a complex with five proteins, including Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3CQC_A 3CQG_A 3I4R_A 3IKO_I 3JRO_C.
Probab=47.03 E-value=21 Score=29.62 Aligned_cols=79 Identities=14% Similarity=-0.002 Sum_probs=44.8
Q ss_pred chhHHHHHHHHHHhccHHHHHHHHHHhCCHHH-HH-HHHh--hCCCC---CchHHHHHHHHHhC---CChHHHHHHHHhc
Q psy11102 2 TTLVSQAREYYEKCHCYEKLISVYTELGDFEA-LE-SCAR--KLPDS---SPLLKPMGEIFVKY---GLCEQAVYVFDKN 71 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~-L~-~l~~--~L~~~---~~lL~~ia~~F~~~---G~~~~Av~aylk~ 71 (87)
.|++++|.++=..+||.-+|+-..+.....|. +. .+.+ ..+.+ ..+-+.+...+... +.+|.||.+++-
T Consensus 146 ~G~~~eA~~lc~~~gq~wrAasL~G~~~~~dp~~~~~~~~~~~~~~G~~~r~LWk~~c~~ls~~~~~~~yEraiY~~L~- 224 (697)
T PF04121_consen 146 AGRIEEAQELCRERGQPWRAASLCGWQLYHDPNLDPELSDEDERMEGNRSRALWKRACYKLSQNPNLDPYERAIYGALS- 224 (697)
T ss_dssp TT-HHHHHHHHHHTT-HHHHHHHHTTSB-B-TTTSGGGTTT-SS-BSBTTHHHHHHHHHHHHHTSSS-HHHHHHHHHHH-
T ss_pred CCCHHHHHHHHHHCCCHHHHHHHcCccccCCcccccccccccccccCChhHHHHHHHHHHHHhCCCCCHHHHHHHHHHh-
Confidence 46677777777777777777777762221121 11 0001 11112 25566666666554 689999999998
Q ss_pred CCHHHHHHHH
Q psy11102 72 KHKSSQWLTV 81 (87)
Q Consensus 72 gd~k~ai~~c 81 (87)
||+..++..|
T Consensus 225 G~~~~~l~~~ 234 (697)
T PF04121_consen 225 GDLSSVLPVC 234 (697)
T ss_dssp TS---HHHHT
T ss_pred cccHhhhhhc
Confidence 9999999998
No 197
>KOG2168|consensus
Probab=46.66 E-value=34 Score=29.67 Aligned_cols=37 Identities=19% Similarity=0.277 Sum_probs=25.0
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhC
Q psy11102 5 VSQAREYYEKCHCYEKLISVYTELGDFEALESCARKL 41 (87)
Q Consensus 5 w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L 41 (87)
..+|+.--...|.++++|..|...|+||.-.+++.++
T Consensus 625 ~~~vA~~a~~~G~~~~sI~LY~lag~yd~al~link~ 661 (835)
T KOG2168|consen 625 ILEVASEADEDGLFEDAILLYHLAGDYDKALELINKL 661 (835)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHhhhhhHHHHHHHHH
Confidence 3456666666777777777777777777766666554
No 198
>KOG4340|consensus
Probab=46.48 E-value=83 Score=24.97 Aligned_cols=67 Identities=19% Similarity=0.155 Sum_probs=52.7
Q ss_pred HHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHH----HHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 18 YEKLISVYTELGDFEALESCARKLPDSSPLLKP----MGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~----ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
+.-+..|||...+|..--.+-..|....|.+.+ -|.-+-+.|.+..|...---.||...+.+-|+++
T Consensus 47 LSlLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~qYrlY~AQSLY~A~i~ADALrV~~~~~D~~~L~~~~lqL 117 (459)
T KOG4340|consen 47 LSLLGYCYYRLQEFALAAECYEQLGQLHPELEQYRLYQAQSLYKACIYADALRVAFLLLDNPALHSRVLQL 117 (459)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhChHHHHHHHHHHHHHHHhcccHHHHHHHHHhcCCHHHHHHHHHH
Confidence 345678999999998887777777766665554 3566778999999999999999988888888765
No 199
>cd02577 PSTD1 PSTD1: Pseudouridine synthase, a subgroup of the TruD family. This group consists of several hypothetical archeal pseudouridine synthases assigned to the TruD family of psuedouridine synthases. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). The TruD family is comprised of proteins related to Escherichia coli TruD.
Probab=46.07 E-value=81 Score=24.01 Aligned_cols=33 Identities=15% Similarity=0.242 Sum_probs=23.1
Q ss_pred chhHHHHHHHHHH-------hccHHHHHHHHHHhCCHHHH
Q psy11102 2 TTLVSQAREYYEK-------CHCYEKLISVYTELGDFEAL 34 (87)
Q Consensus 2 ~~~w~~A~~yY~~-------~~n~~k~ie~~~~~ed~d~L 34 (87)
.++|+.|+..|.. .++..++...+..-+||++-
T Consensus 162 ~gd~~~Av~~il~~~~~~~~~~~~~~ar~~~~~~gd~~~~ 201 (319)
T cd02577 162 EGDYEEAAKTYLTARYPLEETGDKKRALRLFPKWGDWERC 201 (319)
T ss_pred ccCHHHHHHHHhcCCcccccchHHHHHHHHHHhcCCHHHH
Confidence 4789999998874 24455666677777787754
No 200
>KOG2376|consensus
Probab=45.86 E-value=1.4e+02 Score=25.34 Aligned_cols=30 Identities=30% Similarity=0.480 Sum_probs=25.9
Q ss_pred CCCC-chHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 42 PDSS-PLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 42 ~~~~-~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
|+++ .++-..|....++|.+.+|++...++
T Consensus 171 ~e~syel~yN~Ac~~i~~gky~qA~elL~kA 201 (652)
T KOG2376|consen 171 PEDSYELLYNTACILIENGKYNQAIELLEKA 201 (652)
T ss_pred CcchHHHHHHHHHHHHhcccHHHHHHHHHHH
Confidence 4443 88889999999999999999999887
No 201
>KOG0548|consensus
Probab=45.55 E-value=1.1e+02 Score=25.32 Aligned_cols=70 Identities=19% Similarity=0.348 Sum_probs=52.2
Q ss_pred chhHHHHHHHHHHhcc--------HHHHHHHHHHhCCHHH-HHHH--HhhCCCC-CchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHC--------YEKLISVYTELGDFEA-LESC--ARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n--------~~k~ie~~~~~ed~d~-L~~l--~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
+++++.|+.+|..+=+ +.-=..||..+++|++ |..- ++.|... .+-..+.|.-+.-.|.++.|+.+|-
T Consensus 15 ~~d~~~ai~~~t~ai~l~p~nhvlySnrsaa~a~~~~~~~al~da~k~~~l~p~w~kgy~r~Gaa~~~lg~~~eA~~ay~ 94 (539)
T KOG0548|consen 15 SGDFETAIRLFTEAIMLSPTNHVLYSNRSAAYASLGSYEKALKDATKTRRLNPDWAKGYSRKGAALFGLGDYEEAILAYS 94 (539)
T ss_pred cccHHHHHHHHHHHHccCCCccchhcchHHHHHHHhhHHHHHHHHHHHHhcCCchhhHHHHhHHHHHhcccHHHHHHHHH
Confidence 5789999999988743 2344567888888875 3333 3334444 6778899999999999999999997
Q ss_pred hc
Q psy11102 70 KN 71 (87)
Q Consensus 70 k~ 71 (87)
+.
T Consensus 95 ~G 96 (539)
T KOG0548|consen 95 EG 96 (539)
T ss_pred HH
Confidence 63
No 202
>PRK15331 chaperone protein SicA; Provisional
Probab=45.39 E-value=34 Score=23.93 Aligned_cols=14 Identities=0% Similarity=-0.280 Sum_probs=7.4
Q ss_pred HHHHhcCCHHHHHH
Q psy11102 66 YVFDKNKHKSSQWL 79 (87)
Q Consensus 66 ~aylk~gd~k~ai~ 79 (87)
+||+..|+..+|.+
T Consensus 113 qC~l~l~~~~~A~~ 126 (165)
T PRK15331 113 QCQLLMRKAAKARQ 126 (165)
T ss_pred HHHHHhCCHHHHHH
Confidence 34455555555554
No 203
>COG1397 DraG ADP-ribosylglycohydrolase [Posttranslational modification, protein turnover, chaperones]
Probab=45.33 E-value=63 Score=24.89 Aligned_cols=55 Identities=13% Similarity=0.048 Sum_probs=38.3
Q ss_pred CHHHHHHHHhhCCCCCchHHHHHHHHHhCCCh----HHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 30 DFEALESCARKLPDSSPLLKPMGEIFVKYGLC----EQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 30 d~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~----~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
.+..-+++.+++......+..+.+.+-+-+-+ +.|.-+|+...|++.||-.||++
T Consensus 207 ~~~~a~~~~~~~~~~~~~~~~~~~~ig~g~~~~etvpaAi~~~l~~~d~~eai~~aVNl 265 (314)
T COG1397 207 RIYIAEKIVYNLDHTEAAIELAYQVIGSGGYVIETVPAAIACFLLTPDFEEAILKAVNL 265 (314)
T ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHhcCCceecchHHHHHHHHHhCCCHHHHHHHHhhc
Confidence 33344555665555555566666666555544 56677899999999999999986
No 204
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=45.29 E-value=16 Score=22.47 Aligned_cols=15 Identities=27% Similarity=-0.011 Sum_probs=11.3
Q ss_pred hhHHHHHHHHHHhcc
Q psy11102 3 TLVSQAREYYEKCHC 17 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n 17 (87)
++|+.|+.+|.+|-+
T Consensus 22 g~~e~Al~~Y~~gi~ 36 (79)
T cd02679 22 GDKEQALAHYRKGLR 36 (79)
T ss_pred CCHHHHHHHHHHHHH
Confidence 677888888887754
No 205
>KOG1173|consensus
Probab=44.25 E-value=52 Score=27.57 Aligned_cols=51 Identities=25% Similarity=0.296 Sum_probs=39.2
Q ss_pred HHHHHHHhCCHHHHHHHHhhC----CCC-CchHHHHHHHHHhCCChHHHHHHHHhcC
Q psy11102 21 LISVYTELGDFEALESCARKL----PDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 21 ~ie~~~~~ed~d~L~~l~~~L----~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~g 72 (87)
-|.|++++++.-+|=.|.-+| |+. .+ ---.|-+|-++|-.++|.+.|-|+-
T Consensus 284 ~ia~l~el~~~n~Lf~lsh~LV~~yP~~a~s-W~aVg~YYl~i~k~seARry~SKat 339 (611)
T KOG1173|consen 284 HIACLYELGKSNKLFLLSHKLVDLYPSKALS-WFAVGCYYLMIGKYSEARRYFSKAT 339 (611)
T ss_pred HHHHHHHhcccchHHHHHHHHHHhCCCCCcc-hhhHHHHHHHhcCcHHHHHHHHHHh
Confidence 356999999999997776665 443 23 3357889999999999999998863
No 206
>COG5071 RPN5 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=43.70 E-value=58 Score=25.76 Aligned_cols=64 Identities=23% Similarity=0.348 Sum_probs=51.3
Q ss_pred HHHHHHHHhCCHHHHHHHHhhCCC-C--CchHHHHHHHHHhCC----ChHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 20 KLISVYTELGDFEALESCARKLPD-S--SPLLKPMGEIFVKYG----LCEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 20 k~ie~~~~~ed~d~L~~l~~~L~~-~--~~lL~~ia~~F~~~G----~~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
.-++|-..++..=.+|+-+|.-++ . .+.|..|++.+.|.| +-++-+-.+.|-|+.+.+|.--++
T Consensus 24 ~~~d~~~~le~LL~~EkK~RqasD~~~~~kvl~~i~dLl~S~~~~~~Lneql~~L~kKhGQlk~sI~~MIq 94 (439)
T COG5071 24 NTIDIDANLEKLLIFEKKVRQASDTSTNTKVLIYIADLLFSAGDFQGLNEQLVSLFKKHGQLKQSITSMIQ 94 (439)
T ss_pred hhcchhhHHHHHHHHHHHHhhhccHHHHHHHHHHHHHHHhhcCchhhhhhHHHHHHHHcchHHHHHHHHHH
Confidence 345666677777778888888887 2 388999999999994 556899999999999999986554
No 207
>cd08780 Death_TRADD Death Domain of Tumor Necrosis Factor Receptor 1-Associated Death Domain protein. Death domain (DD) of TRADD (TNF Receptor 1-Associated Death Domain or TNFRSF1A-associated via death domain) protein. TRADD is a central signaling adaptor for TNF-receptor 1 (TNFR1), mediating activation of Nuclear Factor -kappaB (NF-kB) and c-Jun N-terminal kinase (JNK), as well as caspase-dependent apoptosis. It also carries important immunological roles including germinal center formation, DR3-mediated T-cell stimulation, and TNFalpha-mediated inflammatory responses. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including CARD (Caspase activation and recruitment domain), DED (Death Effector Domain), and PYRIN. They serve as adaptors in signaling pathways and can recruit other proteins into s
Probab=42.35 E-value=34 Score=21.83 Aligned_cols=39 Identities=21% Similarity=0.293 Sum_probs=30.8
Q ss_pred HHHHhhCC-----CCCchHHHHHHHHHhCCChHHHHHHHHhcCC
Q psy11102 35 ESCARKLP-----DSSPLLKPMGEIFVKYGLCEQAVYVFDKNKH 73 (87)
Q Consensus 35 ~~l~~~L~-----~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd 73 (87)
..+.|.|. -..+.|..|...+-+-|+.|++.+.+.+-.+
T Consensus 17 K~laR~Lg~~cral~d~~ID~I~~~y~r~gL~EqvyQ~L~~W~~ 60 (90)
T cd08780 17 KPVGRSLQKNCRALRDPAIDNLAYEYDREGLYEQAYQLLRRFIQ 60 (90)
T ss_pred HHHHHHHcccccccchhHHHHHHhhcccccHHHHHHHHHHHHHH
Confidence 45677777 3468899999999999999999998876443
No 208
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=42.07 E-value=21 Score=21.78 Aligned_cols=13 Identities=38% Similarity=0.394 Sum_probs=9.6
Q ss_pred hccHHHHHHHHHH
Q psy11102 15 CHCYEKLISVYTE 27 (87)
Q Consensus 15 ~~n~~k~ie~~~~ 27 (87)
+||+++|+++|..
T Consensus 19 ~gny~eA~~lY~~ 31 (75)
T cd02680 19 KGNAEEAIELYTE 31 (75)
T ss_pred hhhHHHHHHHHHH
Confidence 5788888888764
No 209
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=42.03 E-value=78 Score=24.77 Aligned_cols=23 Identities=0% Similarity=-0.101 Sum_probs=19.2
Q ss_pred hHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 61 CEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 61 ~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
.-.+..+|+++|.++.|+++|-+
T Consensus 282 lgkva~~yle~g~~neAi~l~qr 304 (361)
T COG3947 282 LGKVARAYLEAGKPNEAIQLHQR 304 (361)
T ss_pred HHHHHHHHHHcCChHHHHHHHHH
Confidence 34567889999999999999965
No 210
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=41.95 E-value=37 Score=15.79 Aligned_cols=15 Identities=0% Similarity=-0.263 Sum_probs=7.8
Q ss_pred HHHHhcCCHHHHHHH
Q psy11102 66 YVFDKNKHKSSQWLT 80 (87)
Q Consensus 66 ~aylk~gd~k~ai~~ 80 (87)
.++...||+..|..+
T Consensus 9 ~~~~~~G~~~eA~~~ 23 (26)
T PF07721_consen 9 RALLAQGDPDEAERL 23 (26)
T ss_pred HHHHHcCCHHHHHHH
Confidence 345555555555544
No 211
>KOG3785|consensus
Probab=41.91 E-value=1.2e+02 Score=24.64 Aligned_cols=77 Identities=16% Similarity=0.075 Sum_probs=49.5
Q ss_pred chhHHHHHHHHHHhccHHH---------HHHHHHHhCCHHHHHHHHhhCCC----CCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 2 TTLVSQAREYYEKCHCYEK---------LISVYTELGDFEALESCARKLPD----SSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k---------~ie~~~~~ed~d~L~~l~~~L~~----~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
.+||+-|+.+..-..|.++ .+.|+|++|||+.--..-..+-. +.++=.-+|.-.-=.|++.+|..+-
T Consensus 35 ~rDytGAislLefk~~~~~EEE~~~~lWia~C~fhLgdY~~Al~~Y~~~~~~~~~~~el~vnLAcc~FyLg~Y~eA~~~~ 114 (557)
T KOG3785|consen 35 NRDYTGAISLLEFKLNLDREEEDSLQLWIAHCYFHLGDYEEALNVYTFLMNKDDAPAELGVNLACCKFYLGQYIEAKSIA 114 (557)
T ss_pred cccchhHHHHHHHhhccchhhhHHHHHHHHHHHHhhccHHHHHHHHHHHhccCCCCcccchhHHHHHHHHHHHHHHHHHH
Confidence 3567777777766655544 45699999999876555444422 2355444444444579999999988
Q ss_pred HhcCCHHHHH
Q psy11102 69 DKNKHKSSQW 78 (87)
Q Consensus 69 lk~gd~k~ai 78 (87)
.|+-+...-+
T Consensus 115 ~ka~k~pL~~ 124 (557)
T KOG3785|consen 115 EKAPKTPLCI 124 (557)
T ss_pred hhCCCChHHH
Confidence 8776654433
No 212
>COG5200 LUC7 U1 snRNP component, mediates U1 snRNP association with cap-binding complex [RNA processing and modification]
Probab=41.89 E-value=14 Score=27.36 Aligned_cols=46 Identities=20% Similarity=0.213 Sum_probs=33.0
Q ss_pred HHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHH
Q psy11102 31 FEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSS 76 (87)
Q Consensus 31 ~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ 76 (87)
.++++++...+.++-.+|-.=++-+.+..+.+.|+.+|.++.+++.
T Consensus 116 I~e~~e~~~~~d~sIg~lg~e~dalakrk~V~~a~~~f~el~rl~~ 161 (258)
T COG5200 116 IKEKEELFNRVDESIGELGMEGDALAKRKLVERACSAFNELERLRE 161 (258)
T ss_pred HHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3444555555555557777778888888899999999998887654
No 213
>KOG4648|consensus
Probab=41.46 E-value=19 Score=28.96 Aligned_cols=24 Identities=25% Similarity=0.484 Sum_probs=20.9
Q ss_pred HHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 48 LKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 48 L~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
|++-|+.|-+.|-+++|+.||.+.
T Consensus 100 iKE~GN~yFKQgKy~EAIDCYs~~ 123 (536)
T KOG4648|consen 100 IKERGNTYFKQGKYEEAIDCYSTA 123 (536)
T ss_pred HHHhhhhhhhccchhHHHHHhhhh
Confidence 788888999999999999998875
No 214
>KOG4234|consensus
Probab=41.39 E-value=53 Score=24.55 Aligned_cols=21 Identities=14% Similarity=0.049 Sum_probs=12.5
Q ss_pred HHHHhcCCHHHHHHHH---hhcCC
Q psy11102 66 YVFDKNKHKSSQWLTV---VQDKP 86 (87)
Q Consensus 66 ~aylk~gd~k~ai~~c---v~~~~ 86 (87)
.|++|.+..++||+-| ++++|
T Consensus 142 aa~iKl~k~e~aI~dcsKaiel~p 165 (271)
T KOG4234|consen 142 AALIKLRKWESAIEDCSKAIELNP 165 (271)
T ss_pred HHHHHhhhHHHHHHHHHhhHhcCc
Confidence 3566666666666666 34555
No 215
>PF00627 UBA: UBA/TS-N domain; InterPro: IPR000449 UBA domains are a commonly occurring sequence motif of approximately 45 amino acid residues that are found in diverse proteins involved in the ubiquitin/proteasome pathway, DNA excision-repair, and cell signalling via protein kinases []. The human homologue of yeast Rad23A is one example of a nucleotide excision-repair protein that contains both an internal and a C-terminal UBA domain. The solution structure of human Rad23A UBA(2) showed that the domain forms a compact three-helix bundle []. Comparison of the structures of UBA(1) and UBA(2) reveals that both form very similar folds and have a conserved large hydrophobic surface patch which may be a common protein-interacting surface present in diverse UBA domains. Evidence that ubiquitin binds to UBA domains leads to the prediction that the hydrophobic surface patch of UBA domains interacts with the hydrophobic surface on the five-stranded beta-sheet of ubiquitin []. This domain is similar in sequence to the N-terminal domain of translation elongation factor EF1B (or EF-Ts) from bacteria, mitochondria and chloroplasts. More information about EF1B (EF-Ts) proteins can be found at Protein of the Month: Elongation Factors [].; GO: 0005515 protein binding; PDB: 2DAI_A 2OO9_C 2JUJ_A 1WHC_A 1YLA_A 2O25_B 3K9O_A 3K9P_A 3F92_A 3E46_A ....
Probab=41.35 E-value=45 Score=16.93 Aligned_cols=18 Identities=17% Similarity=-0.051 Sum_probs=13.0
Q ss_pred HHHHHHHHhcC-CHHHHHH
Q psy11102 62 EQAVYVFDKNK-HKSSQWL 79 (87)
Q Consensus 62 ~~Av~aylk~g-d~k~ai~ 79 (87)
+.|++|+.++| ++..|++
T Consensus 17 ~~~~~AL~~~~~nve~A~~ 35 (37)
T PF00627_consen 17 EQAREALRACNGNVERAVD 35 (37)
T ss_dssp HHHHHHHHHTTTSHHHHHH
T ss_pred HHHHHHHHHcCCCHHHHHH
Confidence 46777777776 7887776
No 216
>PRK04841 transcriptional regulator MalT; Provisional
Probab=39.68 E-value=2.2e+02 Score=23.67 Aligned_cols=67 Identities=15% Similarity=0.094 Sum_probs=37.1
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHH-------HhCCHHHHHHHHhhCCC----CC-chHHHHHHHHHhCCChHHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYT-------ELGDFEALESCARKLPD----SS-PLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~-------~~ed~d~L~~l~~~L~~----~~-~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
+++..|+.++..+++...+.+++. ..++++.+....+.+|. ++ .++...+......|..++|...+.
T Consensus 355 g~~~~Al~~a~~a~d~~~~~~ll~~~a~~l~~~g~~~~l~~~l~~lp~~~~~~~~~l~~~~a~~~~~~g~~~~a~~~l~ 433 (903)
T PRK04841 355 GFPSEAIHHALAAGDAQLLRDILLQHGWSLFNQGELSLLEECLNALPWEVLLENPRLVLLQAWLAQSQHRYSEVNTLLA 433 (903)
T ss_pred CCHHHHHHHHHHCCCHHHHHHHHHHhHHHHHhcCChHHHHHHHHhCCHHHHhcCcchHHHHHHHHHHCCCHHHHHHHHH
Confidence 345556666666666555444332 33566666777777753 22 333344555566677777665554
No 217
>KOG2002|consensus
Probab=39.55 E-value=89 Score=27.81 Aligned_cols=69 Identities=20% Similarity=0.324 Sum_probs=43.8
Q ss_pred hhHHHHHHHHHHhccHHH--------HHHHHHHhCCHHHHHH----HHhhC-CCC-CchHHHHHHHHHhCCChHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEK--------LISVYTELGDFEALES----CARKL-PDS-SPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k--------~ie~~~~~ed~d~L~~----l~~~L-~~~-~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
++|..|...|.|-+.--. +++||...++|-.-++ +.+.. +++ +.+|.-+|.-.-.+|....|+++.
T Consensus 660 g~~~~A~dIFsqVrEa~~~~~dv~lNlah~~~e~~qy~~AIqmYe~~lkkf~~~~~~~vl~~Lara~y~~~~~~eak~~l 739 (1018)
T KOG2002|consen 660 GRFSEARDIFSQVREATSDFEDVWLNLAHCYVEQGQYRLAIQMYENCLKKFYKKNRSEVLHYLARAWYEAGKLQEAKEAL 739 (1018)
T ss_pred cCchHHHHHHHHHHHHHhhCCceeeeHHHHHHHHHHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHhhhHHHHHHHH
Confidence 456666666666544333 5666666666654433 33333 444 578888888888888888888777
Q ss_pred Hhc
Q psy11102 69 DKN 71 (87)
Q Consensus 69 lk~ 71 (87)
+++
T Consensus 740 l~a 742 (1018)
T KOG2002|consen 740 LKA 742 (1018)
T ss_pred HHH
Confidence 654
No 218
>KOG0543|consensus
Probab=39.53 E-value=1.1e+02 Score=24.40 Aligned_cols=26 Identities=8% Similarity=-0.107 Sum_probs=19.3
Q ss_pred CCChHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 58 YGLCEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 58 ~G~~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
+..+-.-.-+|+|.++++.||..|-.
T Consensus 257 ~~~~lNlA~c~lKl~~~~~Ai~~c~k 282 (397)
T KOG0543|consen 257 LACHLNLAACYLKLKEYKEAIESCNK 282 (397)
T ss_pred HHHhhHHHHHHHhhhhHHHHHHHHHH
Confidence 34444556689999999999998853
No 219
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=39.13 E-value=66 Score=17.45 Aligned_cols=34 Identities=18% Similarity=0.333 Sum_probs=22.0
Q ss_pred HHHHHHHHhCCHHHHHHHHhhCCC-CCchHHHHHH
Q psy11102 20 KLISVYTELGDFEALESCARKLPD-SSPLLKPMGE 53 (87)
Q Consensus 20 k~ie~~~~~ed~d~L~~l~~~L~~-~~~lL~~ia~ 53 (87)
.++..|...||+++-.++.+.+-. +++-...=|.
T Consensus 4 dLA~ayie~Gd~e~Ar~lL~evl~~~~~~q~~eA~ 38 (44)
T TIGR03504 4 DLARAYIEMGDLEGARELLEEVIEEGDEAQRQEAR 38 (44)
T ss_pred HHHHHHHHcCChHHHHHHHHHHHHcCCHHHHHHHH
Confidence 367788888888887777776433 4544444333
No 220
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=38.51 E-value=1.1e+02 Score=19.64 Aligned_cols=19 Identities=11% Similarity=0.170 Sum_probs=9.3
Q ss_pred HHHHHHhCCChHHHHHHHH
Q psy11102 51 MGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 51 ia~~F~~~G~~~~Av~ayl 69 (87)
+|..+...|..+.|++.|.
T Consensus 54 lA~~~~~~g~~~~A~~~l~ 72 (145)
T PF09976_consen 54 LAKAAYEQGDYDEAKAALE 72 (145)
T ss_pred HHHHHHHCCCHHHHHHHHH
Confidence 4444445555555554443
No 221
>KOG0889|consensus
Probab=38.17 E-value=32 Score=34.30 Aligned_cols=58 Identities=17% Similarity=0.236 Sum_probs=42.2
Q ss_pred chhHHHHHHHHHHhccHHH-----------------HHHHHHHhCCHHHHHHHHhhCCC-CCchHHHHHHHHHhCCCh
Q psy11102 2 TTLVSQAREYYEKCHCYEK-----------------LISVYTELGDFEALESCARKLPD-SSPLLKPMGEIFVKYGLC 61 (87)
Q Consensus 2 ~~~w~~A~~yY~~~~n~~k-----------------~ie~~~~~ed~d~L~~l~~~L~~-~~~lL~~ia~~F~~~G~~ 61 (87)
-|+|++|-.+|+++.-..+ =|+|..++.+||-|-+... .+ +..++.+++-..-++..-
T Consensus 2495 ~G~~e~AQ~lyekaq~Ka~~~~~~~~~~Ey~lWed~WI~Ca~eL~QWdvl~e~~k--~~~~~~llle~aWrlsdw~~~ 2570 (3550)
T KOG0889|consen 2495 LGFWEEAQSLYEKAQVKAREGAIPYSESEYKLWEDHWIRCASELQQWDVLTEFGK--HEGNYELLLECAWRLSDWNDQ 2570 (3550)
T ss_pred hhhHHHHhhHHHHHHHHHhcccCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHh--ccCCceeeeehhccCCcchhH
Confidence 3789999999999876543 4889999999999999982 23 346666666555444433
No 222
>KOG0292|consensus
Probab=38.08 E-value=1.7e+02 Score=26.39 Aligned_cols=72 Identities=18% Similarity=0.116 Sum_probs=0.0
Q ss_pred HHhccHHHHHHHHH---------------HhCCHHHHHHHH-------------hhCCCCC-chHHHHHHHHHhCCChHH
Q psy11102 13 EKCHCYEKLISVYT---------------ELGDFEALESCA-------------RKLPDSS-PLLKPMGEIFVKYGLCEQ 63 (87)
Q Consensus 13 ~~~~n~~k~ie~~~---------------~~ed~d~L~~l~-------------~~L~~~~-~lL~~ia~~F~~~G~~~~ 63 (87)
...|.+.+|++|+- .....++|.+++ |.||+.. +...++|.+|+...+-..
T Consensus 1002 tt~gKf~eAie~Frsii~~i~l~vvd~~~e~aea~~li~i~~eYi~gL~~E~~Rr~l~~~~~~~~~ElAaYFt~~~Lqp~ 1081 (1202)
T KOG0292|consen 1002 TTEGKFGEAIEKFRSIIYSIPLLVVDSKEEEAEADELIKICREYIVGLSVELERRKLKKPNLEQQLELAAYFTHCKLQPM 1081 (1202)
T ss_pred hccCcHHHHHHHHHHHHhheeEEEecchhhHHHHHHHHHHHHHHHhhheeeeeecccCCchHHHHHHHHHHhhcCCCCcH
Q ss_pred --------HHHHHHhcCCHHHHHHHHhhc
Q psy11102 64 --------AVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 64 --------Av~aylk~gd~k~ai~~cv~~ 84 (87)
|+..|-|.+++|.|-..+-++
T Consensus 1082 H~ilalrtA~n~ffK~kN~ktAs~fa~rL 1110 (1202)
T KOG0292|consen 1082 HRILALRTAMNVFFKLKNLKTAAEFARRL 1110 (1202)
T ss_pred HHHHHHHHHHHHHHHhccHHHHHHHHHHH
No 223
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.04 E-value=1.2e+02 Score=22.13 Aligned_cols=54 Identities=15% Similarity=0.199 Sum_probs=43.2
Q ss_pred HHHHHHHHhCCHHHHHHHHhhCCCC--Cc-hHHHHHHHHHhCCChHHHHHHHHhcCC
Q psy11102 20 KLISVYTELGDFEALESCARKLPDS--SP-LLKPMGEIFVKYGLCEQAVYVFDKNKH 73 (87)
Q Consensus 20 k~ie~~~~~ed~d~L~~l~~~L~~~--~~-lL~~ia~~F~~~G~~~~Av~aylk~gd 73 (87)
|++...+..+.+|.=..+.+...+. .+ .-...|+.|.+.|.-++|..+|.+.-.
T Consensus 131 RLArvq~q~~k~D~AL~~L~t~~~~~w~~~~~elrGDill~kg~k~~Ar~ay~kAl~ 187 (207)
T COG2976 131 RLARVQLQQKKADAALKTLDTIKEESWAAIVAELRGDILLAKGDKQEARAAYEKALE 187 (207)
T ss_pred HHHHHHHHhhhHHHHHHHHhccccccHHHHHHHHhhhHHHHcCchHHHHHHHHHHHH
Confidence 5677788889999888888887663 33 456679999999999999999987643
No 224
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=37.02 E-value=26 Score=20.67 Aligned_cols=15 Identities=20% Similarity=0.552 Sum_probs=10.5
Q ss_pred HHhccHHHHHHHHHH
Q psy11102 13 EKCHCYEKLISVYTE 27 (87)
Q Consensus 13 ~~~~n~~k~ie~~~~ 27 (87)
...|+++.++++|-.
T Consensus 17 D~~g~y~eA~~~Y~~ 31 (75)
T cd02678 17 DNAGNYEEALRLYQH 31 (75)
T ss_pred HHcCCHHHHHHHHHH
Confidence 456777777777754
No 225
>KOG1126|consensus
Probab=36.99 E-value=1.3e+02 Score=25.53 Aligned_cols=69 Identities=23% Similarity=0.378 Sum_probs=50.2
Q ss_pred hhHHHHHHHHHHhccHH--------HHHHHHHHhCCHH----HHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKCHCYE--------KLISVYTELGDFE----ALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~--------k~ie~~~~~ed~d----~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+..++|.++|.+|=-.+ +-+..++-+++++ .||+|-.-.|+.+-...-+|..++..|.-+.|+-=|.=
T Consensus 537 k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il~~~~~~~eal~~LEeLk~~vP~es~v~~llgki~k~~~~~~~Al~~f~~ 616 (638)
T KOG1126|consen 537 KRKDKALQLYEKAIHLDPKNPLCKYHRASILFSLGRYVEALQELEELKELVPQESSVFALLGKIYKRLGNTDLALLHFSW 616 (638)
T ss_pred hhhhHHHHHHHHHHhcCCCCchhHHHHHHHHHhhcchHHHHHHHHHHHHhCcchHHHHHHHHHHHHHHccchHHHHhhHH
Confidence 45567777776653211 2245566667665 57888888899888889999999999999999987764
Q ss_pred c
Q psy11102 71 N 71 (87)
Q Consensus 71 ~ 71 (87)
+
T Consensus 617 A 617 (638)
T KOG1126|consen 617 A 617 (638)
T ss_pred H
Confidence 4
No 226
>PF10163 EnY2: Transcription factor e(y)2; InterPro: IPR018783 Enhancer of yellow 2 (EnY2) is a small transcription factor which is combined in a complex with the TAFII40 protein []. This protein is conserved from protozoa to humans.; PDB: 4DHX_C 3FWC_P 3M99_C 3KIK_A 3KJL_C 3FWB_C 3MHS_B 3MHH_B.
Probab=36.94 E-value=99 Score=18.84 Aligned_cols=36 Identities=19% Similarity=0.278 Sum_probs=28.3
Q ss_pred HHHHHHHHHhccHHHHHHHH----HHhCCHHHHHHHHhhC
Q psy11102 6 SQAREYYEKCHCYEKLISVY----TELGDFEALESCARKL 41 (87)
Q Consensus 6 ~~A~~yY~~~~n~~k~ie~~----~~~ed~d~L~~l~~~L 41 (87)
.+..++...+|+++++-+.+ ..-||.|.+..++++.
T Consensus 4 ~~i~~~L~~sGe~~~L~~~L~~rL~e~GW~d~vr~~~re~ 43 (86)
T PF10163_consen 4 AQIQQRLVESGEYERLKELLRQRLIECGWRDEVRQLCREI 43 (86)
T ss_dssp HHHHHHHHHCTHHHHHHHHHHHHHHHTTHHHHHHHHHHHH
T ss_pred HHHHHHHHHcCcHHHHHHHHHHHHHHCChHHHHHHHHHHH
Confidence 45567788889988877654 4679999999999874
No 227
>KOG2005|consensus
Probab=36.20 E-value=1.3e+02 Score=26.11 Aligned_cols=55 Identities=20% Similarity=0.279 Sum_probs=46.5
Q ss_pred HHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCCh
Q psy11102 6 SQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLC 61 (87)
Q Consensus 6 ~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~ 61 (87)
+-|-..|.+=+.+.+++.+..++++.+...++.++..+- -+=++++.+++..|++
T Consensus 243 ~~a~~IYlKf~~~~~al~~ai~l~~~~~v~~vf~s~~D~-~~kKQ~~ymLaR~~i~ 297 (878)
T KOG2005|consen 243 RTALKIYLKFNEYPRALVGAIRLDDMKEVKEVFTSCTDP-LLKKQMAYMLARHGIY 297 (878)
T ss_pred HHHHHHHHHHHHhHHHHHHHHhcCcHHHHHHHHHhccCH-HHHHHHHHHHHhcCCc
Confidence 457889999999999999999999999999999987652 1235788999998876
No 228
>COG1939 Ribonuclease III family protein [Replication, recombination, and repair]
Probab=36.00 E-value=85 Score=21.35 Aligned_cols=33 Identities=27% Similarity=0.236 Sum_probs=29.4
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhC
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKL 41 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L 41 (87)
+.-|..+--+|.+|--+|..++.|.|++|++..
T Consensus 94 v~tYr~sTgfEAliGyLyL~~~~eRL~ell~~~ 126 (132)
T COG1939 94 VETYRMSTGFEALIGYLYLTKQEERLEELLNKV 126 (132)
T ss_pred hHHHHHhhhHHHHHHHHHHcccHHHHHHHHHHH
Confidence 467888889999999999999999999998764
No 229
>KOG4642|consensus
Probab=35.66 E-value=1.4e+02 Score=22.78 Aligned_cols=69 Identities=16% Similarity=0.239 Sum_probs=53.3
Q ss_pred hhHHHHHHHHHHh--------ccHHHHHHHHHHhCCHHHHHHHHhh---CCCC-CchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 3 TLVSQAREYYEKC--------HCYEKLISVYTELGDFEALESCARK---LPDS-SPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 3 ~~w~~A~~yY~~~--------~n~~k~ie~~~~~ed~d~L~~l~~~---L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
..+..|+.-|+++ ..+.+=|.|+.+..+|+..+.=++. |..+ ..-.-.+|..|....-++.||....+
T Consensus 24 k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk~h~flg~~~l~s~~~~eaI~~Lqr 103 (284)
T KOG4642|consen 24 KRYDDAIDCYSRAICINPTVASYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVKAHYFLGQWLLQSKGYDEAIKVLQR 103 (284)
T ss_pred hhhchHHHHHHHHHhcCCCcchhhhhHHHHHHHhhhhhhhhhhHHHHHhcChHHHHHHHHHHHHHHhhccccHHHHHHHH
Confidence 3466788888775 6778889999999999998876665 3334 35566788888888889999988877
Q ss_pred c
Q psy11102 71 N 71 (87)
Q Consensus 71 ~ 71 (87)
+
T Consensus 104 a 104 (284)
T KOG4642|consen 104 A 104 (284)
T ss_pred H
Confidence 6
No 230
>smart00685 DM14 Repeats in fly CG4713, worm Y37H9A.3 and human FLJ20241.
Probab=35.64 E-value=93 Score=18.15 Aligned_cols=38 Identities=18% Similarity=0.017 Sum_probs=32.7
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhh
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARK 40 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~ 40 (87)
..|..|+.-=.+.||.+++-...-....|+.+.+.+++
T Consensus 9 ~~yk~Aa~~AK~~gd~~kAr~~~R~~K~~~~~I~~~~a 46 (59)
T smart00685 9 EQYKQAALQAKRAGDEEKARRHLRIAKQFDDAIKAARA 46 (59)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHhhHHHHHHHHHC
Confidence 46788888888999999999999999999888887776
No 231
>PF07443 HARP: HepA-related protein (HARP); InterPro: IPR010003 This entry represents a conserved region approximately 60 residues long within eukaryotic HepA-related protein (HARP). This exhibits single-stranded DNA-dependent ATPase activity, and is ubiquitously expressed in human and mouse tissues []. Family members may contain more than one copy of this region.; GO: 0004386 helicase activity, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0016568 chromatin modification, 0005634 nucleus
Probab=35.60 E-value=16 Score=21.17 Aligned_cols=18 Identities=33% Similarity=0.542 Sum_probs=15.8
Q ss_pred HHhCCHHHHHHHHhhCCC
Q psy11102 26 TELGDFEALESCARKLPD 43 (87)
Q Consensus 26 ~~~ed~d~L~~l~~~L~~ 43 (87)
+.++||..|++.++.||.
T Consensus 32 F~L~Dy~~L~~~~~~l~~ 49 (55)
T PF07443_consen 32 FSLEDYSTLMKKVRNLPQ 49 (55)
T ss_pred eeHHHHHHHHHHHhcCCc
Confidence 357899999999999976
No 232
>PF12816 Vps8: Golgi CORVET complex core vacuolar protein 8
Probab=35.41 E-value=1.6e+02 Score=20.70 Aligned_cols=39 Identities=10% Similarity=0.032 Sum_probs=19.3
Q ss_pred chHHHHHHHHHhCCChHHHHHHHHhcC----CHHHHHHHHhhc
Q psy11102 46 PLLKPMGEIFVKYGLCEQAVYVFDKNK----HKSSQWLTVVQD 84 (87)
Q Consensus 46 ~lL~~ia~~F~~~G~~~~Av~aylk~g----d~k~ai~~cv~~ 84 (87)
..++.+-++|.+.|..+..-++-++.. |+..++.+|-++
T Consensus 23 ~v~k~lv~~y~~~~~~~~lE~lI~~LD~~~LDidq~i~lC~~~ 65 (196)
T PF12816_consen 23 EVFKALVEHYASKGRLERLEQLILHLDPSSLDIDQVIKLCKKH 65 (196)
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHhCCHHhcCHHHHHHHHHHC
Confidence 344555555555555555544444443 455555555443
No 233
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=35.26 E-value=30 Score=20.17 Aligned_cols=25 Identities=20% Similarity=0.390 Sum_probs=16.3
Q ss_pred HHHhccHHHHHHHHHHhCCHHHHHHHHhhC
Q psy11102 12 YEKCHCYEKLISVYTELGDFEALESCARKL 41 (87)
Q Consensus 12 Y~~~~n~~k~ie~~~~~ed~d~L~~l~~~L 41 (87)
+...|+++.|+.+|-. +++.++..+
T Consensus 18 ~d~~g~~~eAl~~Y~~-----a~e~l~~~~ 42 (77)
T smart00745 18 ADEAGDYEEALELYKK-----AIEYLLEGI 42 (77)
T ss_pred HHHcCCHHHHHHHHHH-----HHHHHHHHh
Confidence 4557888888888864 444555544
No 234
>PF15448 NTS_2: N-terminal segments of P. falciparum erythrocyte membrane protein
Probab=34.79 E-value=34 Score=19.30 Aligned_cols=25 Identities=16% Similarity=0.232 Sum_probs=21.0
Q ss_pred HHHHHHHhcCCHHHHHHHHhhcCCC
Q psy11102 63 QAVYVFDKNKHKSSQWLTVVQDKPS 87 (87)
Q Consensus 63 ~Av~aylk~gd~k~ai~~cv~~~~~ 87 (87)
.-+++|++...-++-|+-.....||
T Consensus 9 ~kieayl~~ks~ds~idq~lkadps 33 (51)
T PF15448_consen 9 NKIEAYLEAKSNDSKIDQSLKADPS 33 (51)
T ss_pred HHHHHHHHhhcchhhHHHHhcCChh
Confidence 4578999999999999888888886
No 235
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=34.78 E-value=1.7e+02 Score=23.45 Aligned_cols=64 Identities=16% Similarity=0.117 Sum_probs=39.6
Q ss_pred HHHHHHHHhccHHHHHHHHH---HhCCHHHHHHHHhhC-------------------CCCCchHHHHHHHHHhCCChHHH
Q psy11102 7 QAREYYEKCHCYEKLISVYT---ELGDFEALESCARKL-------------------PDSSPLLKPMGEIFVKYGLCEQA 64 (87)
Q Consensus 7 ~A~~yY~~~~n~~k~ie~~~---~~ed~d~L~~l~~~L-------------------~~~~~lL~~ia~~F~~~G~~~~A 64 (87)
..+++|.++|..+-|.++.- +..+=+.|..++..| |++..++..+|....+.++..+|
T Consensus 268 ~~a~~li~l~~~~~A~~~i~~~Lk~~~D~~L~~~~~~l~~~d~~~l~k~~e~~l~~h~~~p~L~~tLG~L~~k~~~w~kA 347 (400)
T COG3071 268 AYAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQHPEDPLLLSTLGRLALKNKLWGKA 347 (400)
T ss_pred HHHHHHHHcCChHHHHHHHHHHHHhccChhHHHHHhhcCCCCchHHHHHHHHHHHhCCCChhHHHHHHHHHHHhhHHHHH
Confidence 45778888888887766543 333334466665555 33334556667777777777777
Q ss_pred HHHHHh
Q psy11102 65 VYVFDK 70 (87)
Q Consensus 65 v~aylk 70 (87)
-++|.+
T Consensus 348 ~~~lea 353 (400)
T COG3071 348 SEALEA 353 (400)
T ss_pred HHHHHH
Confidence 666543
No 236
>PF00244 14-3-3: 14-3-3 protein; InterPro: IPR023410 The 14-3-3 proteins are a large family of approximately 30kDa acidic proteins which exist primarily as homo- and heterodimeric within all eukaryotic cells [, ]. There is a high degree of sequence identity and conservation between all the 14-3-3 isotypes, particularly in the regions which form the dimer interface or line the central ligand binding channel of the dimeric molecule. Each 14-3-3 protein sequence can be roughly divided into three sections: a divergent amino terminus, the conserved core region and a divergent carboxyl terminus. The conserved middle core region of the 14-3-3s encodes an amphipathic groove that forms the main functional domain, a cradle for interacting with client proteins. The monomer consists of nine helices organised in an antiparallel manner, forming an L-shaped structure. The interior of the L-structure is composed of four helices: H3 and H5, which contain many charged and polar amino acids, and H7 and H9, which contain hydrophobic amino acids. These four helices form the concave amphipathic groove that interacts with target peptides. 14-3-3 proteins mainly bind proteins containing phosphothreonine or phosphoserine motifs however exceptions to this rule do exist. Extensive investigation of the 14-3-3 binding site of the mammalian serine/threonine kinase Raf-1 has produced a consensus sequence for 14-3-3-binding, RSxpSxP (in the single-letter amino-acid code, where x denotes any amino acid and p indicates that the next residue is phosphorylated). 14-3-3 proteins appear to effect intracellular signalling in one of three ways - by direct regulation of the catalytic activity of the bound protein, by regulating interactions between the bound protein and other molecules in the cell by sequestration or modification or by controlling the subcellular localisation of the bound ligand. Proteins appear to initially bind to a single dominant site and then subsequently to many, much weaker secondary interaction sites. The 14-3-3 dimer is capable of changing the conformation of its bound ligand whilst itself undergoing minimal structural alteration. This entry represents the structural domain found in 14-3-3 proteins.; PDB: 2O8P_A 3AXY_D 2C74_A 2C63_A 4DX0_A 1YWT_A 3P1O_A 3P1N_A 4DAU_A 3U9X_A ....
Probab=34.74 E-value=1.7e+02 Score=21.00 Aligned_cols=44 Identities=18% Similarity=0.265 Sum_probs=30.8
Q ss_pred HHHHHHHHhh-CCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 31 FEALESCARK-LPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 31 ~d~L~~l~~~-L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
|+.-.+++.. ||..+|+--.++- ..||-.|...|++.+|++++-
T Consensus 149 Y~~A~~~a~~~L~~~~p~rLgl~L--------N~svF~yei~~~~~~A~~ia~ 193 (236)
T PF00244_consen 149 YEEALEIAKKELPPTHPLRLGLAL--------NYSVFYYEILNDPEKAIEIAK 193 (236)
T ss_dssp HHHHHHHHHHHSCTTSHHHHHHHH--------HHHHHHHHTSS-HHHHHHHHH
T ss_pred hhhHHHHHhcccCCCCcHHHHHHH--------HHHHHHHHHcCChHHHHHHHH
Confidence 3344455555 9999998666654 457888888999999888764
No 237
>PF14669 Asp_Glu_race_2: Putative aspartate racemase
Probab=34.56 E-value=40 Score=24.85 Aligned_cols=28 Identities=21% Similarity=0.158 Sum_probs=24.7
Q ss_pred hHHHHHHHHHHhccHHHHHHHHHHhCCH
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYTELGDF 31 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~~~ed~ 31 (87)
-...|++.|.++|+++.|+..+-..+|+
T Consensus 183 ivn~AaEiFL~sgsidGA~~vLreseWi 210 (233)
T PF14669_consen 183 IVNIAAEIFLKSGSIDGALWVLRESEWI 210 (233)
T ss_pred hHHHHHHHHHHcCCchHHHHHHhcccee
Confidence 3457999999999999999999988885
No 238
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=34.35 E-value=1.4e+02 Score=19.78 Aligned_cols=35 Identities=17% Similarity=0.161 Sum_probs=27.6
Q ss_pred HHHHHHHhCCChHHHHHHHHh----cCCHHHHHHHHhhc
Q psy11102 50 PMGEIFVKYGLCEQAVYVFDK----NKHKSSQWLTVVQD 84 (87)
Q Consensus 50 ~ia~~F~~~G~~~~Av~aylk----~gd~k~ai~~cv~~ 84 (87)
++|+.+.+.|..+.|+..|-+ |++|..+|...-+.
T Consensus 68 ~lGE~L~~~G~~~~aa~hf~nAl~V~~qP~~LL~i~q~t 106 (121)
T PF02064_consen 68 QLGEQLLAQGDYEEAAEHFYNALKVCPQPAELLQIYQKT 106 (121)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHTSSSHHHHHHHHHHH
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhCCCHHHHHHHHHhh
Confidence 579999999999999986655 79999888876543
No 239
>PF04003 Utp12: Dip2/Utp12 Family; InterPro: IPR007148 A large ribonuclear protein complex is required for the processing of the small-ribosomal-subunit rRNA - the small-subunit (SSU) processome [, ]. This preribosomal complex contains the U3 snoRNA and at least 40 proteins, which have the following properties: They are nucleolar. They are able to coimmunoprecipitate with the U3 snoRNA and Mpp10 (a protein specific to the SSU processome). They are required for 18S rRNA biogenesis. There appears to be a linkage between polymerase I transcription and the formation of the SSU processome; as some, but not all, of the SSU processome components are required for pre-rRNA transcription initiation. These SSU processome components have been termed t-Utps. They form a pre-complex with pre-18S rRNA in the absence of snoRNA U3 and other SSU processome components. It has been proposed that the t-Utp complex proteins are both rDNA and rRNA binding proteins that are involved in the initiation of pre18S rRNA transcription. Initially binding to rDNA then associating with the 5' end of the nascent pre18S rRNA. The t-Utpcomplex forms the nucleus around which the rest of the SSU processome components, including snoRNA U3, assemble []. From electron microscopy the SSU processome may correspond to the terminal knobs visualized at the 5' ends of nascent 18S rRNA. This domain is found at the C terminus of proteins containing WD40 repeats. These proteins are part of the U3 ribonucleoprotein. In yeast, these proteins are called Utp5, Utp1 or Pwp2, Utp12 or DIP2 Q12220 from SWISSPROT. They interact with snoRNA U3 and with MPP10 []. Pwp2 is an essential Saccharomyces cerevisiae (Baker's yeast) protein involved in cell separation.
Probab=34.11 E-value=1.1e+02 Score=18.66 Aligned_cols=47 Identities=28% Similarity=0.162 Sum_probs=36.2
Q ss_pred cHHHHHHHHHHhCCHHHHHHHHhhCCCC--CchHHHHHHHHH--hCCChHH
Q psy11102 17 CYEKLISVYTELGDFEALESCARKLPDS--SPLLKPMGEIFV--KYGLCEQ 63 (87)
Q Consensus 17 n~~k~ie~~~~~ed~d~L~~l~~~L~~~--~~lL~~ia~~F~--~~G~~~~ 63 (87)
|-+..++-.+..=+.+.++..++.||.. .++|..++.++. +....+.
T Consensus 1 Nd~~ll~~vl~~~~~~~I~~tv~~Lp~~~~~~LL~~l~~~l~~~~~~~~e~ 51 (110)
T PF04003_consen 1 NDEELLLEVLERIPPSDIENTVRSLPFSYAERLLQFLSERLQTRKSPHVEF 51 (110)
T ss_pred CHHHHHHHHHhhCCHHHHHHHHHhCCHHHHHHHHHHHHHHhcccCCCchhH
Confidence 3456677777888999999999999985 599999999775 4344443
No 240
>KOG0495|consensus
Probab=33.86 E-value=1.1e+02 Score=26.65 Aligned_cols=46 Identities=13% Similarity=0.165 Sum_probs=34.1
Q ss_pred CCCCCchHHHHHHHHHhCCChHHHHHHHHhc---------------------C---CHHHHHHHHhhcCC
Q psy11102 41 LPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN---------------------K---HKSSQWLTVVQDKP 86 (87)
Q Consensus 41 L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~---------------------g---d~k~ai~~cv~~~~ 86 (87)
...++-+|..+|.+|-+-..++.|.+=|.|+ | |-+..++.|+...|
T Consensus 813 ce~dphVllaia~lfw~e~k~~kar~Wf~Ravk~d~d~GD~wa~fykfel~hG~eed~kev~~~c~~~EP 882 (913)
T KOG0495|consen 813 CEHDPHVLLAIAKLFWSEKKIEKAREWFERAVKKDPDNGDAWAWFYKFELRHGTEEDQKEVLKKCETAEP 882 (913)
T ss_pred ccCCchhHHHHHHHHHHHHHHHHHHHHHHHHHccCCccchHHHHHHHHHHHhCCHHHHHHHHHHHhccCC
Confidence 3456678999999999988888888777653 2 55666777776666
No 241
>PRK10564 maltose regulon periplasmic protein; Provisional
Probab=33.17 E-value=50 Score=25.35 Aligned_cols=32 Identities=19% Similarity=0.194 Sum_probs=24.9
Q ss_pred chHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 46 PLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 46 ~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
+.+.+--.+| -+|++.-+|.||+.+|+.+--+
T Consensus 251 ~v~~dTe~Yy------~~aI~~AVk~gDi~KAL~LldE 282 (303)
T PRK10564 251 PMLNDTESYF------NQAIKQAVKKGDVDKALKLLDE 282 (303)
T ss_pred ccCchHHHHH------HHHHHHHHHcCCHHHHHHHHHH
Confidence 5566655555 4789999999999999988644
No 242
>PF11768 DUF3312: Protein of unknown function (DUF3312); InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=32.20 E-value=1.6e+02 Score=24.41 Aligned_cols=77 Identities=18% Similarity=0.303 Sum_probs=51.6
Q ss_pred HHHHHHhccHHHHHHHHHHhCC-------HHHHHHHHhhC---C--C-CCchH-HHHHHHH-HhCCChHHHHHH------
Q psy11102 9 REYYEKCHCYEKLISVYTELGD-------FEALESCARKL---P--D-SSPLL-KPMGEIF-VKYGLCEQAVYV------ 67 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed-------~d~L~~l~~~L---~--~-~~~lL-~~ia~~F-~~~G~~~~Av~a------ 67 (87)
+.-|.+.++.++||..+.-.+| |-.|..|++.| | + ...+| .-+|.++ -..++++.-+.-
T Consensus 415 ~~~yl~~~qi~eAi~lL~smnW~~~g~~C~~~L~~I~n~Ll~~pl~~ere~~le~algsF~ap~rpl~~~~~~ey~d~V~ 494 (545)
T PF11768_consen 415 ISQYLRCDQIEEAINLLLSMNWNTMGEQCFHCLSAIVNHLLRQPLTPEREAQLEAALGSFYAPTRPLSDATVLEYRDPVS 494 (545)
T ss_pred HHHHHhcCCHHHHHHHHHhCCccccHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHhhccCCCcCccHHHHHHHHHHHH
Confidence 3469999999999999999888 56788888775 2 1 22333 3444444 255555543332
Q ss_pred ---------HHhcCCHHHHHHHHhhcC
Q psy11102 68 ---------FDKNKHKSSQWLTVVQDK 85 (87)
Q Consensus 68 ---------ylk~gd~k~ai~~cv~~~ 85 (87)
.+|.+.+++|.-+.|++.
T Consensus 495 ~~aRRfFhhLLR~~rfekAFlLAvdi~ 521 (545)
T PF11768_consen 495 DLARRFFHHLLRYQRFEKAFLLAVDIG 521 (545)
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHhcc
Confidence 367888888887777654
No 243
>PF06301 Lambda_Kil: Bacteriophage lambda Kil protein; InterPro: IPR010444 This family consists of several Bacteriophage lambda Kil protein like sequences. A cessation of division, followed by one or two fairly synchronous cell divisions in Escherichia coli is due to two genetically separable events: a temporary block of cell division and, at the same time, a block to the initiation of new rounds of DNA replication. The cell division block is a result of the transient expression of the lambda kil gene []. The lambda kil gene has been shown to be responsible for premature lysis on the addition of chloramphenicol between 15 and 20 min after thermal induction of a lambda prophage []. Induction of a lambda prophage causes the death of the host cell even in the absence of phage replication and lytic functions due to expression of functions from the lambda p(L) operon. The kil gene causes cell death and filamentation [].
Probab=31.58 E-value=32 Score=19.08 Aligned_cols=21 Identities=5% Similarity=0.025 Sum_probs=17.9
Q ss_pred HHHHHHHhCCChHHHHHHHHh
Q psy11102 50 PMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 50 ~ia~~F~~~G~~~~Av~aylk 70 (87)
.||.++..-+|..+|+++|-.
T Consensus 13 ~IA~flGD~~mw~eA~e~~k~ 33 (43)
T PF06301_consen 13 AIARFLGDEKMWSEANEAMKI 33 (43)
T ss_pred HHHHHHccHHHHHHHHHHHHH
Confidence 588899999999999998744
No 244
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=31.54 E-value=37 Score=19.77 Aligned_cols=16 Identities=25% Similarity=0.470 Sum_probs=12.4
Q ss_pred HHHhccHHHHHHHHHH
Q psy11102 12 YEKCHCYEKLISVYTE 27 (87)
Q Consensus 12 Y~~~~n~~k~ie~~~~ 27 (87)
....|+++.++++|-.
T Consensus 16 ~D~~g~~~~Al~~Y~~ 31 (75)
T cd02656 16 EDEDGNYEEALELYKE 31 (75)
T ss_pred HHHcCCHHHHHHHHHH
Confidence 4566999999988864
No 245
>KOG2002|consensus
Probab=31.44 E-value=80 Score=28.09 Aligned_cols=38 Identities=24% Similarity=0.410 Sum_probs=26.9
Q ss_pred HhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHhcCCHH
Q psy11102 38 ARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKS 75 (87)
Q Consensus 38 ~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k 75 (87)
.+..|.. ...-.-||.=|.+.|+.+.|.-+|.++-+..
T Consensus 191 l~inp~~~aD~rIgig~Cf~kl~~~~~a~~a~~ralqLd 229 (1018)
T KOG2002|consen 191 LRINPACKADVRIGIGHCFWKLGMSEKALLAFERALQLD 229 (1018)
T ss_pred HhcCcccCCCccchhhhHHHhccchhhHHHHHHHHHhcC
Confidence 3444553 3444568888999999999999988875544
No 246
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=31.30 E-value=1.6e+02 Score=23.14 Aligned_cols=77 Identities=16% Similarity=0.033 Sum_probs=50.9
Q ss_pred hHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHH----HhCCCh---------HHHHHHHHh
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIF----VKYGLC---------EQAVYVFDK 70 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F----~~~G~~---------~~Av~aylk 70 (87)
..++|+.+|.+|=..+ .-+|..-.+--|..+.....+++..+.+++..| ...|-- .--.++.+=
T Consensus 241 ~ldkAi~~Y~kgFe~~---~~~Y~GIN~AtLL~~~g~~~~~~~el~~i~~~l~~llg~kg~~~~~~dYWd~ATl~Ea~vL 317 (374)
T PF13281_consen 241 SLDKAIEWYRKGFEIE---PDYYSGINAATLLMLAGHDFETSEELRKIGVKLSSLLGRKGSLEKMQDYWDVATLLEASVL 317 (374)
T ss_pred HHHHHHHHHHHHHcCC---ccccchHHHHHHHHHcCCcccchHHHHHHHHHHHHHHHhhccccccccHHHHHHHHHHHHH
Confidence 3677888888775554 345555666666666666555677788888444 345532 234677888
Q ss_pred cCCHHHHHHHHhh
Q psy11102 71 NKHKSSQWLTVVQ 83 (87)
Q Consensus 71 ~gd~k~ai~~cv~ 83 (87)
.||.++|++...+
T Consensus 318 ~~d~~ka~~a~e~ 330 (374)
T PF13281_consen 318 AGDYEKAIQAAEK 330 (374)
T ss_pred cCCHHHHHHHHHH
Confidence 8999888877654
No 247
>KOG3824|consensus
Probab=31.03 E-value=1.1e+02 Score=24.48 Aligned_cols=30 Identities=20% Similarity=0.191 Sum_probs=26.3
Q ss_pred CCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 42 PDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 42 ~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
|.+...|.+.|.+..-..-.-+|=++|.|+
T Consensus 147 P~~p~~L~e~G~f~E~~~~iv~ADq~Y~~A 176 (472)
T KOG3824|consen 147 PTNPQILIEMGQFREMHNEIVEADQCYVKA 176 (472)
T ss_pred CCCHHHHHHHhHHHHhhhhhHhhhhhhhee
Confidence 445688999999999999999999999986
No 248
>PF13311 DUF4080: Protein of unknown function (DUF4080)
Probab=30.68 E-value=1.2e+02 Score=21.11 Aligned_cols=49 Identities=14% Similarity=0.260 Sum_probs=32.1
Q ss_pred HhccHHHHHHHHHHhCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChH
Q psy11102 14 KCHCYEKLISVYTELGDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCE 62 (87)
Q Consensus 14 ~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~ 62 (87)
+-+.++++.|.||..|.|....+.+-..-++ -.....+|+++.+.|...
T Consensus 29 ~Lk~~e~~le~yyNSgrF~~tl~~l~~~~~spF~ff~~l~~~~~~~g~~~ 78 (190)
T PF13311_consen 29 RLKRFEDMLEKYYNSGRFDYTLEYLLKEFGSPFDFFEALADYWEEQGYFK 78 (190)
T ss_pred HHHHHHHHHHHHhhhhhHHHHHHHHHHhcCCHHHHHHHHHHHHHHCCCcc
Confidence 3456777888888888887665555443333 366777777777776644
No 249
>PF07743 HSCB_C: HSCB C-terminal oligomerisation domain; InterPro: IPR009073 This entry represents the C-terminal oligomerisation domain found in HscB (heat shock cognate protein B), which is also known as HSC20 (20K heat shock cognate protein). HscB acts as a co-chaperone to regulate the ATPase activity and peptide-binding specificity of the molecular chaperone HscA, also known as HSC66 (HSP70 class). HscB proteins contain two domains, an N-terminal J-domain, which is involved in interactions with HscA, connected by a short loop to the C-terminal oligomerisation domain; the two domains make contact through a hydrophobic interface. The core of the oligomerisation domain is thought to bind and target proteins to HscA and consists of an open, three-helical bundle []. HscB, along with HscA, has been shown to play a role in the biogenesis of iron-sulphur proteins.; GO: 0006457 protein folding; PDB: 1FPO_C 3BVO_B 3HHO_A 3UO2_B 3UO3_B.
Probab=30.58 E-value=47 Score=19.45 Aligned_cols=41 Identities=24% Similarity=0.312 Sum_probs=24.1
Q ss_pred CCHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 29 GDFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 29 ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
++-+.|..+...+... ..+...++..| +.+..+.|...-.|
T Consensus 24 ~~~~~L~~l~~~~~~~~~~~~~~l~~~f-~~~d~~~A~~~~~k 65 (78)
T PF07743_consen 24 DDEAELEELKKEIEERIKELIKELAEAF-DAKDWEEAKEALRK 65 (78)
T ss_dssp TSHHHHHHHHHHHHHHHHHHHHHHHHHH-HTT-HHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHH-ccCcHHHHHHHHHH
Confidence 4446666776666664 56677777777 45555555554443
No 250
>PF04190 DUF410: Protein of unknown function (DUF410) ; InterPro: IPR007317 This is a family of conserved eukaryotic proteins with undetermined function.; PDB: 3LKU_E 2WPV_G.
Probab=30.52 E-value=2.1e+02 Score=20.77 Aligned_cols=36 Identities=19% Similarity=0.027 Sum_probs=26.6
Q ss_pred CC-chHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHH
Q psy11102 44 SS-PLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWL 79 (87)
Q Consensus 44 ~~-~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~ 79 (87)
++ .+=..+|..|.+.|....|..-|+..+|......
T Consensus 88 Gdp~LH~~~a~~~~~e~~~~~A~~Hfl~~~~~~~~~~ 124 (260)
T PF04190_consen 88 GDPELHHLLAEKLWKEGNYYEAERHFLLGTDPSAFAY 124 (260)
T ss_dssp --HHHHHHHHHHHHHTT-HHHHHHHHHTS-HHHHHHH
T ss_pred CCHHHHHHHHHHHHhhccHHHHHHHHHhcCChhHHHH
Confidence 45 4456789999999999999999999998887655
No 251
>COG4190 Predicted transcriptional regulator [Transcription]
Probab=30.21 E-value=30 Score=23.78 Aligned_cols=24 Identities=29% Similarity=0.325 Sum_probs=20.1
Q ss_pred CHHHHHHHHhhCCCC-CchHHHHHH
Q psy11102 30 DFEALESCARKLPDS-SPLLKPMGE 53 (87)
Q Consensus 30 d~d~L~~l~~~L~~~-~~lL~~ia~ 53 (87)
||.+++.+++-|.+. -+||+-|+.
T Consensus 51 ~F~Sye~la~vLsp~nleLl~~Ia~ 75 (144)
T COG4190 51 WFTSYEDLARVLSPRNLELLELIAQ 75 (144)
T ss_pred ccccHHHHHHHhChhHHHHHHHHHh
Confidence 789999999999875 588888765
No 252
>KOG1128|consensus
Probab=29.89 E-value=1.3e+02 Score=25.99 Aligned_cols=66 Identities=20% Similarity=0.276 Sum_probs=51.5
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC--CC-chHHHHHHHHHhCCChHHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPD--SS-PLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~--~~-~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
|--.+|...|++=+-....|.||..+|+-++-+.+.+..-+ ++ -+=.-+|+....--.+|+|-+.+
T Consensus 412 GitksAl~I~Erlemw~~vi~CY~~lg~~~kaeei~~q~lek~~d~~lyc~LGDv~~d~s~yEkawEls 480 (777)
T KOG1128|consen 412 GITKSALVIFERLEMWDPVILCYLLLGQHGKAEEINRQELEKDPDPRLYCLLGDVLHDPSLYEKAWELS 480 (777)
T ss_pred chHHHHHHHHHhHHHHHHHHHHHHHhcccchHHHHHHHHhcCCCcchhHHHhhhhccChHHHHHHHHHh
Confidence 34568999999999999999999999999999999998644 23 44556677777666677776654
No 253
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=29.39 E-value=1e+02 Score=17.49 Aligned_cols=24 Identities=17% Similarity=0.110 Sum_probs=18.8
Q ss_pred HHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 48 LKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 48 L~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+..-|..+-+.|..++|++.|.++
T Consensus 8 ~~~~Av~~D~~g~~~~A~~~Y~~a 31 (69)
T PF04212_consen 8 LIKKAVEADEAGNYEEALELYKEA 31 (69)
T ss_dssp HHHHHHHHHHTTSHHHHHHHHHHH
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHH
Confidence 455677788889999999998764
No 254
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=29.26 E-value=48 Score=14.95 Aligned_cols=11 Identities=36% Similarity=0.682 Sum_probs=5.8
Q ss_pred hHHHHHHHHHH
Q psy11102 4 LVSQAREYYEK 14 (87)
Q Consensus 4 ~w~~A~~yY~~ 14 (87)
+++.|...|.+
T Consensus 2 ~~~~~r~i~e~ 12 (33)
T smart00386 2 DIERARKIYER 12 (33)
T ss_pred cHHHHHHHHHH
Confidence 44555555554
No 255
>cd07653 F-BAR_CIP4-like The F-BAR (FES-CIP4 Homology and Bin/Amphiphysin/Rvs) domain of Cdc42-Interacting Protein 4 and similar proteins. F-BAR domains are dimerization modules that bind and bend membranes and are found in proteins involved in membrane dynamics and actin reorganization. This subfamily is composed of Cdc42-Interacting Protein 4 (CIP4), Formin Binding Protein 17 (FBP17), FormiN Binding Protein 1-Like (FNBP1L), and similar proteins. CIP4 and FNBP1L are Cdc42 effectors that bind Wiskott-Aldrich syndrome protein (WASP) and function in endocytosis. CIP4 and FBP17 bind to the Fas ligand and may be implicated in the inflammatory response. CIP4 may also play a role in phagocytosis. Members of this subfamily typically contain an N-terminal F-BAR domain and a C-terminal SH3 domain. In addition, some members such as FNBP1L contain a central Cdc42-binding HR1 domain. F-BAR domains form banana-shaped dimers with a positively-charged concave surface that binds to negatively-charged l
Probab=28.94 E-value=30 Score=24.62 Aligned_cols=34 Identities=9% Similarity=0.199 Sum_probs=26.9
Q ss_pred CCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChH
Q psy11102 29 GDFEALESCARKLPDSSPLLKPMGEIFVKYGLCE 62 (87)
Q Consensus 29 ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~ 62 (87)
+.||.+..|.+.+..+..++.++..+|..+.-.|
T Consensus 2 ~L~d~~~~l~~~~~~g~~~~~~l~~f~keRa~iE 35 (251)
T cd07653 2 ELWDQFDNLEKHTQKGIDFLERYGKFVKERAAIE 35 (251)
T ss_pred cchhhhHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence 4578888888888888888888888887765443
No 256
>PF10607 CLTH: CTLH/CRA C-terminal to LisH motif domain; InterPro: IPR019589 This entry represents the CRA (or CT11-RanBPM) domain, which is a protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi) and which is found in Ran-binding proteins such as Ran-binding protein 9 (RanBP9 or RanBPM) and RanBP10. RanBPM is a scaffolding protein important in regulating cellular function in both the immune system and the nervous system, and may act as an adapter protein to couple membrane receptors to intracellular signaling pathways. This domain is at the C terminus of the proteins and is the binding domain for the CRA motif, which is comprised of approximately 100 amino acids at the C-terminal of RanBPM. It was found to be important for the interaction of RanBPM with fragile X mental retardation protein (FMRP), but its functional significance has yet to be determined [].
Probab=28.42 E-value=71 Score=20.50 Aligned_cols=22 Identities=9% Similarity=-0.103 Sum_probs=17.6
Q ss_pred HHHHHhcCCHHHHHHHHhhcCC
Q psy11102 65 VYVFDKNKHKSSQWLTVVQDKP 86 (87)
Q Consensus 65 v~aylk~gd~k~ai~~cv~~~~ 86 (87)
+.-.++.||+..|++.|.+.+|
T Consensus 8 I~~~I~~g~i~~Ai~w~~~~~~ 29 (145)
T PF10607_consen 8 IRQAILNGDIDPAIEWLNENFP 29 (145)
T ss_pred HHHHHHcCCHHHHHHHHHHcCH
Confidence 3345688999999999988876
No 257
>PF14760 Rnk_N: Rnk N-terminus; PDB: 2PN0_D 3BMB_B.
Probab=28.20 E-value=93 Score=16.58 Aligned_cols=27 Identities=30% Similarity=0.437 Sum_probs=19.8
Q ss_pred CCHHHHHHHHhhCCCCC-chHHHHHHHH
Q psy11102 29 GDFEALESCARKLPDSS-PLLKPMGEIF 55 (87)
Q Consensus 29 ed~d~L~~l~~~L~~~~-~lL~~ia~~F 55 (87)
-|++.|+.|+.+++..+ +....+..-+
T Consensus 9 ~D~~RL~~Ll~~~~~~~~~~~~~L~~EL 36 (42)
T PF14760_consen 9 LDLERLEALLEALSDRDPPVAEALEEEL 36 (42)
T ss_dssp HHHHHHHHHHCCCCCTTSCCHHHHHHHH
T ss_pred hhHHHHHHHHhCccccCCHHHHHHHHHH
Confidence 47899999999998854 6666655433
No 258
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=28.11 E-value=2.4e+02 Score=20.51 Aligned_cols=38 Identities=16% Similarity=0.295 Sum_probs=25.6
Q ss_pred HHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhc
Q psy11102 34 LESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKN 71 (87)
Q Consensus 34 L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~ 71 (87)
+|..+...|.+..+...=.+++.+.|..+.|...|.++
T Consensus 59 fe~glk~f~~~~~~~~~Y~~~l~~~~d~~~aR~lfer~ 96 (280)
T PF05843_consen 59 FERGLKKFPSDPDFWLEYLDFLIKLNDINNARALFERA 96 (280)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHhCcHHHHHHHHHHH
Confidence 34445566777677777778888888888888777765
No 259
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=28.06 E-value=46 Score=20.19 Aligned_cols=17 Identities=18% Similarity=0.262 Sum_probs=12.2
Q ss_pred HHHhccHHHHHHHHHHh
Q psy11102 12 YEKCHCYEKLISVYTEL 28 (87)
Q Consensus 12 Y~~~~n~~k~ie~~~~~ 28 (87)
.-++|+++.|+.+|...
T Consensus 16 ~D~~g~y~eA~~~Y~~a 32 (76)
T cd02681 16 RDQEGRYSEAVFYYKEA 32 (76)
T ss_pred HHHccCHHHHHHHHHHH
Confidence 45678888888887643
No 260
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=27.69 E-value=1.6e+02 Score=22.04 Aligned_cols=62 Identities=23% Similarity=0.314 Sum_probs=48.5
Q ss_pred HHHHhccHHHHHHHHHHhCCHHHHHHHHh-hC---CCCCchHHHHHHHHHhCCChHHHHHHHHhcC
Q psy11102 11 YYEKCHCYEKLISVYTELGDFEALESCAR-KL---PDSSPLLKPMGEIFVKYGLCEQAVYVFDKNK 72 (87)
Q Consensus 11 yY~~~~n~~k~ie~~~~~ed~d~L~~l~~-~L---~~~~~lL~~ia~~F~~~G~~~~Av~aylk~g 72 (87)
||..++-++-+-=|..+.|+++.-++..+ +| |...+-+.+++......|..-+|..-|.+..
T Consensus 135 Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~l~~a~~~~~~~~y~~Ar~~~~~~~ 200 (250)
T COG3063 135 YGEPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYKAGDYAPARLYLERYQ 200 (250)
T ss_pred CCCcchhhhhhHHHHhhcCCchhHHHHHHHHHHhCcCCChHHHHHHHHHHhcccchHHHHHHHHHH
Confidence 45566778999999999999987665543 34 4456778999999999999999987776643
No 261
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=27.58 E-value=1.9e+02 Score=20.95 Aligned_cols=49 Identities=24% Similarity=0.270 Sum_probs=33.2
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC-CCchHHHHHHHHHhCCChHHH
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKLPD-SSPLLKPMGEIFVKYGLCEQA 64 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~-~~~lL~~ia~~F~~~G~~~~A 64 (87)
+.||. ..|-+|++..|++. -++...-++ +.+.|..++..+.+.|..++|
T Consensus 148 AtyY~-krD~~Kt~~ll~~~------L~l~~~~~~~n~eil~sLas~~~~~~~~e~A 197 (203)
T PF11207_consen 148 ATYYT-KRDPEKTIQLLLRA------LELSNPDDNFNPEILKSLASIYQKLKNYEQA 197 (203)
T ss_pred HHHHH-ccCHHHHHHHHHHH------HHhcCCCCCCCHHHHHHHHHHHHHhcchhhh
Confidence 34554 67888888888753 111111112 358899999999999999886
No 262
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=27.31 E-value=67 Score=25.14 Aligned_cols=31 Identities=10% Similarity=0.104 Sum_probs=24.9
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHhCCHHHHH
Q psy11102 5 VSQAREYYEKCHCYEKLISVYTELGDFEALE 35 (87)
Q Consensus 5 w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~ 35 (87)
.-+++.+|..+|++..||+...+.-.+|.|.
T Consensus 282 lgkva~~yle~g~~neAi~l~qr~ltldpL~ 312 (361)
T COG3947 282 LGKVARAYLEAGKPNEAIQLHQRALTLDPLS 312 (361)
T ss_pred HHHHHHHHHHcCChHHHHHHHHHHhhcChhh
Confidence 3578899999999999999998776655443
No 263
>PF14669 Asp_Glu_race_2: Putative aspartate racemase
Probab=27.23 E-value=55 Score=24.14 Aligned_cols=21 Identities=14% Similarity=0.109 Sum_probs=17.1
Q ss_pred hHHHHHHHHhcCCHHHHHHHH
Q psy11102 61 CEQAVYVFDKNKHKSSQWLTV 81 (87)
Q Consensus 61 ~~~Av~aylk~gd~k~ai~~c 81 (87)
..-|++.|+|.|.++.|+.+-
T Consensus 184 vn~AaEiFL~sgsidGA~~vL 204 (233)
T PF14669_consen 184 VNIAAEIFLKSGSIDGALWVL 204 (233)
T ss_pred HHHHHHHHHHcCCchHHHHHH
Confidence 346899999999999998764
No 264
>PF10475 DUF2450: Protein of unknown function N-terminal domain (DUF2450) ; InterPro: IPR019515 This entry represents Vacuolar protein sorting-associated protein 54, and is thought to be involved in retrograde transport from early and late endosomes to late Golgi found in eukaryotes, but its function is not known.
Probab=26.83 E-value=85 Score=23.13 Aligned_cols=37 Identities=5% Similarity=-0.145 Sum_probs=28.7
Q ss_pred HHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 48 LKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 48 L~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
|.+.-.....+...+..++..+..||+-+||++|.+.
T Consensus 117 ll~~L~~i~~v~~~~~~l~~ll~~~dy~~Al~li~~~ 153 (291)
T PF10475_consen 117 LLEKLEQIKTVQQTQSRLQELLEEGDYPGALDLIEEC 153 (291)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHH
Confidence 3444445577778888999999999999999998764
No 265
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=26.79 E-value=48 Score=19.96 Aligned_cols=46 Identities=13% Similarity=0.187 Sum_probs=26.7
Q ss_pred HHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC---CchHHHHHHHHHhC
Q psy11102 11 YYEKCHCYEKLISVYTELGDFEALESCARKLPDS---SPLLKPMGEIFVKY 58 (87)
Q Consensus 11 yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~---~~lL~~ia~~F~~~ 58 (87)
-.-+.|+++.|+.||... .+.|..++..-|+. .-+..++.+++..+
T Consensus 15 e~D~~g~y~eAl~~Y~~a--ie~l~~~lk~e~d~~~k~~~r~ki~eY~~RA 63 (77)
T cd02683 15 ELDQEGRFQEALVCYQEG--IDLLMQVLKGTKDEAKKKNLRQKISEYMDRA 63 (77)
T ss_pred HHHHhccHHHHHHHHHHH--HHHHHHHHhhCCCHHHHHHHHHHHHHHHHHH
Confidence 346789999999998754 34455555444432 23345555555443
No 266
>KOG1129|consensus
Probab=26.63 E-value=1.7e+02 Score=23.46 Aligned_cols=27 Identities=19% Similarity=0.167 Sum_probs=15.9
Q ss_pred CCCCchHHHHHHHHHhCCChHHHHHHH
Q psy11102 42 PDSSPLLKPMGEIFVKYGLCEQAVYVF 68 (87)
Q Consensus 42 ~~~~~lL~~ia~~F~~~G~~~~Av~ay 68 (87)
|.+...|..+|..-...|..+.|++.|
T Consensus 287 P~~VT~l~g~ARi~eam~~~~~a~~lY 313 (478)
T KOG1129|consen 287 PFDVTYLLGQARIHEAMEQQEDALQLY 313 (478)
T ss_pred CchhhhhhhhHHHHHHHHhHHHHHHHH
Confidence 444555556666666666666666655
No 267
>PF04121 Nup84_Nup100: Nuclear pore protein 84 / 107 ; InterPro: IPR007252 Nup84p forms a complex with five proteins, including Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3CQC_A 3CQG_A 3I4R_A 3IKO_I 3JRO_C.
Probab=26.61 E-value=70 Score=26.64 Aligned_cols=19 Identities=0% Similarity=-0.046 Sum_probs=8.8
Q ss_pred HHHHHhcCCHHHHHHHHhh
Q psy11102 65 VYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 65 v~aylk~gd~k~ai~~cv~ 83 (87)
+-.++++|++.+|+++|.+
T Consensus 140 i~~llR~G~~~eA~~lc~~ 158 (697)
T PF04121_consen 140 IFELLRAGRIEEAQELCRE 158 (697)
T ss_dssp HHHHHHTT-HHHHHHHHHH
T ss_pred HHHHHHCCCHHHHHHHHHH
Confidence 3444555555555555543
No 268
>PRK04841 transcriptional regulator MalT; Provisional
Probab=26.10 E-value=2e+02 Score=23.86 Aligned_cols=38 Identities=11% Similarity=0.050 Sum_probs=32.2
Q ss_pred CchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 45 SPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 45 ~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
..+....+.+|...|....|+..+...+|...+.++..
T Consensus 341 ~~lh~raa~~~~~~g~~~~Al~~a~~a~d~~~~~~ll~ 378 (903)
T PRK04841 341 PELHRAAAEAWLAQGFPSEAIHHALAAGDAQLLRDILL 378 (903)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHHHHHH
Confidence 46778889999999999999999999999988776554
No 269
>PF08822 DUF1804: Protein of unknown function (DUF1804); InterPro: IPR014926 This entry is represented by Bacteriophage D3112, Orf24. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=26.03 E-value=1.4e+02 Score=20.90 Aligned_cols=56 Identities=16% Similarity=0.142 Sum_probs=29.9
Q ss_pred hhHHHHHHHHHHhc-cHHHHHHHHH---HhCCHHHHHHHH--hhCCC-C-CchHHHHHHHHHhC
Q psy11102 3 TLVSQAREYYEKCH-CYEKLISVYT---ELGDFEALESCA--RKLPD-S-SPLLKPMGEIFVKY 58 (87)
Q Consensus 3 ~~w~~A~~yY~~~~-n~~k~ie~~~---~~ed~d~L~~l~--~~L~~-~-~~lL~~ia~~F~~~ 58 (87)
-||++|..-+.-+| ..+....-+. ...-..-|++|- ..+|. . .++|..+++-|.+.
T Consensus 48 DDWDk~RaA~~laggg~e~v~~~~l~~f~~Q~~~tmeel~~~~~~~~~~k~~~LasLaDsf~K~ 111 (165)
T PF08822_consen 48 DDWDKARAAHTLAGGGIEDVARQMLEDFVVQYQATMEELKENEDMPPQEKVELLASLADSFSKM 111 (165)
T ss_pred CcHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHHHHHH
Confidence 37888886555544 4444443332 222223333332 22443 2 58899999988653
No 270
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=25.98 E-value=57 Score=22.06 Aligned_cols=19 Identities=0% Similarity=-0.148 Sum_probs=14.5
Q ss_pred HHHHHhcCCHHHHHHHHhh
Q psy11102 65 VYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 65 v~aylk~gd~k~ai~~cv~ 83 (87)
=+.|++.|+++.|+..|-.
T Consensus 110 Wk~yl~~~~fd~Al~~~~~ 128 (147)
T PF05131_consen 110 WKIYLDKGDFDEALQYCKT 128 (147)
T ss_pred HHHHHhcCcHHHHHHHccC
Confidence 3568888888888887743
No 271
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=25.94 E-value=4.1e+02 Score=23.50 Aligned_cols=71 Identities=13% Similarity=0.089 Sum_probs=41.5
Q ss_pred hHHHHHHHHHHhccHHHHHHHHHHhCCHHHHHH----HHhhCCCC-CchHHHHHHHHHhCC------ChHHHHHHHHhcC
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYTELGDFEALES----CARKLPDS-SPLLKPMGEIFVKYG------LCEQAVYVFDKNK 72 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~----l~~~L~~~-~~lL~~ia~~F~~~G------~~~~Av~aylk~g 72 (87)
+.++|.+||. |+++.++..++|.++.+ ++..-|++ +.++.-+-....+.| ....=.+.|-+.+
T Consensus 164 dL~KA~~m~~------KAV~~~i~~kq~~~~~e~W~k~~~~~~~d~d~f~~i~~ki~~~~~~~~~~~~~~~l~~~y~~~~ 237 (906)
T PRK14720 164 DKEKAITYLK------KAIYRFIKKKQYVGIEEIWSKLVHYNSDDFDFFLRIERKVLGHREFTRLVGLLEDLYEPYKALE 237 (906)
T ss_pred hHHHHHHHHH------HHHHHHHhhhcchHHHHHHHHHHhcCcccchHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhh
Confidence 4566777765 45566777777777766 55555765 555554444555555 3333345566666
Q ss_pred CHHHHHHH
Q psy11102 73 HKSSQWLT 80 (87)
Q Consensus 73 d~k~ai~~ 80 (87)
|+...|..
T Consensus 238 ~~~~~i~i 245 (906)
T PRK14720 238 DWDEVIYI 245 (906)
T ss_pred hhhHHHHH
Confidence 66565543
No 272
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=25.79 E-value=52 Score=19.67 Aligned_cols=24 Identities=21% Similarity=0.225 Sum_probs=16.8
Q ss_pred HHhccHHHHHHHHHHhCCHHHHHHHHhhC
Q psy11102 13 EKCHCYEKLISVYTELGDFEALESCARKL 41 (87)
Q Consensus 13 ~~~~n~~k~ie~~~~~ed~d~L~~l~~~L 41 (87)
-.+|+++.|+.+|.. +|+.++..+
T Consensus 17 D~~g~y~eA~~lY~~-----ale~~~~~~ 40 (75)
T cd02684 17 DQRGDAAAALSLYCS-----ALQYFVPAL 40 (75)
T ss_pred HHhccHHHHHHHHHH-----HHHHHHHHH
Confidence 456888888888864 566666655
No 273
>PF12812 PDZ_1: PDZ-like domain
Probab=25.60 E-value=67 Score=19.39 Aligned_cols=16 Identities=25% Similarity=0.603 Sum_probs=13.5
Q ss_pred CCHHHHHHHHhhCCCC
Q psy11102 29 GDFEALESCARKLPDS 44 (87)
Q Consensus 29 ed~d~L~~l~~~L~~~ 44 (87)
.+.|.+++++++||++
T Consensus 63 ~~Ld~f~~vvk~ipd~ 78 (78)
T PF12812_consen 63 PDLDDFIKVVKKIPDN 78 (78)
T ss_pred cCHHHHHHHHHhCCCC
Confidence 4778999999999985
No 274
>PF14649 Spatacsin_C: Spatacsin C-terminus
Probab=25.45 E-value=1.3e+02 Score=22.90 Aligned_cols=33 Identities=18% Similarity=0.255 Sum_probs=29.8
Q ss_pred HHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHh
Q psy11102 25 YTELGDFEALESCARKLPDSSPLLKPMGEIFVK 57 (87)
Q Consensus 25 ~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~ 57 (87)
+...|+++-|++.....|=++.++.+++.+|+.
T Consensus 263 ~v~~g~~~YL~df~~~~~l~~~~~~dlvk~y~~ 295 (296)
T PF14649_consen 263 VVLNGNFKYLEDFLSVFPLTPNLFEDLVKKYKL 295 (296)
T ss_pred HHHcCcHHHHHHHHHhCCCChHHHHHHHHHHhc
Confidence 457899999999999999999999999999864
No 275
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=25.38 E-value=1.7e+02 Score=17.80 Aligned_cols=51 Identities=22% Similarity=0.178 Sum_probs=35.0
Q ss_pred HHHHHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCC---chHHHHHHHHHh
Q psy11102 5 VSQAREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSS---PLLKPMGEIFVK 57 (87)
Q Consensus 5 w~~A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~---~lL~~ia~~F~~ 57 (87)
+-..+--+.+.|+++.+|-||-.. -+-|.++++.-|+++ -...+|-++...
T Consensus 9 ~a~~AVe~D~~gr~~eAi~~Y~~a--Ie~L~q~~~~~pD~~~k~~yr~ki~eY~~R 62 (75)
T cd02682 9 YAINAVKAEKEGNAEDAITNYKKA--IEVLSQIVKNYPDSPTRLIYEQMINEYKRR 62 (75)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHH--HHHHHHHHHhCCChHHHHHHHHHHHHHHHH
Confidence 344455678889999999988765 568889999988853 224455555444
No 276
>PF08397 IMD: IRSp53/MIM homology domain; InterPro: IPR013606 The IMD (IRSp53 and MIM (missing in metastases) homology) domain is a BAR-like domain of approximately 250 amino acids found at the N-terminal in the insulin receptor tyrosine kinase substrate p53 (IRSp53) and in the evolutionarily related IRSp53/MIM family. In IRSp53, a ubiquitous regulator o the actin cytoskeleton, the IMD domain acts as conserved F-actin bundling domain involved in filopodium formation. Filopodium-inducing IMD activity is regulated by Cdc42 and Rac1 (Rho-family GTPases) and is SH3-independent [, , ]. The IRSp53/MIM family is a novel F-actin bundling protein family that includes invertebrate relatives: Vertebrate MIM (missing in metastasis), an actin-binding scaffold protein that may be involved in cancer metastasis. Vertebrate ABBA-1, a MIM-related protein. Vertebrate brain-specific angiogenesis inhibitor 1-associated protein 2 (BAI1-associated protein 2) or insulin receptor tyrosine kinase substrate p53 (IRSp53), a multifunctional adaptor protein that links Rac1 with a Wiskott-Aldrich syndrome family verprolin-homologous protein 2 (WAVE2) to induce lamellipodia or Cdc42 with Mena to induce filopodia []. Vertebrate brain-specific angiogenesis inhibitor 1-associated protein 2-like proteins 1 and 2 (BAI1-associated protein 2-like proteins 1 and 2). Drosophila melanogaster (Fruit fly) CG32082-PA. Caenorhabditis elegans M04F3.5 protein. The vertebrate IRSp53/MIM family is divided into two major groups: the IRSp53 subfamily and the MIM/ABBA subfamily. The putative invertebrate homologues are positioned between them. The IRSp53 subfamily members contain an SH3 domain, and the MIM/ABBA subfamily proteins contain a WH2 (WASP-homology 2) domain. The vertebrate SH3-containing subfamily is further divided into three groups according to the presence or absence of the WWB and the half-CRIB motif. The IMD domain can bind to and bundle actin filaments, bind to membranes and interact with the small GTPase Rac [, ]. The IMD domain folds as a coiled coil of three extended alpha-helices and a shorter C-terminal helix. Helix 4 packs tightly against the other three helices, and thus represents an integral part of the domain. The fold of the IMD domain closely resembles that of the BAR (Bin-Amphiphysin-RVS) domain, a functional module serving both as a sensor and inducer of membrane curvature []. The WH2 domain performs a scaffolding function [].; GO: 0008093 cytoskeletal adaptor activity, 0017124 SH3 domain binding, 0007165 signal transduction, 0046847 filopodium assembly; PDB: 2D1L_A 3OK8_B 1WDZ_B 1Y2O_A 2YKT_A.
Probab=25.15 E-value=99 Score=21.84 Aligned_cols=57 Identities=14% Similarity=0.215 Sum_probs=39.5
Q ss_pred HHHHHhccHHHHHHHHHHhC--CHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHH
Q psy11102 10 EYYEKCHCYEKLISVYTELG--DFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYV 67 (87)
Q Consensus 10 ~yY~~~~n~~k~ie~~~~~e--d~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~a 67 (87)
++-..|.+|.++........ -+|.|+++.+.-+..+. -.+||.-+.+....-..++.
T Consensus 14 ~lv~~~~kY~~al~~~~~a~~~f~dal~ki~~~A~~s~~-s~~lG~~L~~~s~~~r~i~~ 72 (219)
T PF08397_consen 14 NLVSLGKKYQKALRAMSQAAAAFFDALQKIGDMASNSRG-SKELGDALMQISEVHRRIEN 72 (219)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTSSS-HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCc-cccHHHHHHHHHHHHHHHHH
Confidence 45667778888877766543 58899999988666444 67888887776655555543
No 277
>KOG3785|consensus
Probab=25.12 E-value=2.5e+02 Score=22.86 Aligned_cols=42 Identities=21% Similarity=0.181 Sum_probs=27.1
Q ss_pred hhHHHHHHHHHHhccHH--------HHHHHHHHhCCHHHHHHHHhhCCCC
Q psy11102 3 TLVSQAREYYEKCHCYE--------KLISVYTELGDFEALESCARKLPDS 44 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~--------k~ie~~~~~ed~d~L~~l~~~L~~~ 44 (87)
++|+.|..-|.-.-|.+ -++=|++=+|.|.+-+.++...|++
T Consensus 71 gdY~~Al~~Y~~~~~~~~~~~el~vnLAcc~FyLg~Y~eA~~~~~ka~k~ 120 (557)
T KOG3785|consen 71 GDYEEALNVYTFLMNKDDAPAELGVNLACCKFYLGQYIEAKSIAEKAPKT 120 (557)
T ss_pred ccHHHHHHHHHHHhccCCCCcccchhHHHHHHHHHHHHHHHHHHhhCCCC
Confidence 57777777776655422 3566777777777777766666654
No 278
>PRK13910 DNA glycosylase MutY; Provisional
Probab=24.93 E-value=1.1e+02 Score=23.02 Aligned_cols=19 Identities=16% Similarity=0.357 Sum_probs=14.7
Q ss_pred HHHHHHhccHHHHHHHHHH
Q psy11102 9 REYYEKCHCYEKLISVYTE 27 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~ 27 (87)
+-||.+++|+.++++....
T Consensus 43 lGyy~RAr~L~~~A~~i~~ 61 (289)
T PRK13910 43 LGYYSRAKNLKKSAEICVK 61 (289)
T ss_pred CCcHHHHHHHHHHHHHHHH
Confidence 4578888888888887764
No 279
>PF06855 DUF1250: Protein of unknown function (DUF1250); InterPro: IPR023089 This entry represents the YozE-like domain found in a group of proteins of unknown function.; PDB: 2KVS_A 2FJ6_A 2O6K_B.
Probab=24.81 E-value=52 Score=17.85 Aligned_cols=30 Identities=10% Similarity=0.170 Sum_probs=23.4
Q ss_pred hCCCCCchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 40 KLPDSSPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 40 ~L~~~~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
+.|+.+.--.+|..+|.+.|.+.+++++|-
T Consensus 10 ~FPK~~~~~~eI~~Yle~~~~~~~~~~~fd 39 (46)
T PF06855_consen 10 SFPKQETDFDEISSYLESNYDYLESMEIFD 39 (46)
T ss_dssp TS-TT-SSHHHHHHHHHCHCCHHCCHHHHH
T ss_pred CCCCCCCCHHHHHHHHHHhcCchhHHHHHH
Confidence 457777778899999999999988888763
No 280
>KOG1064|consensus
Probab=24.75 E-value=2.8e+02 Score=27.08 Aligned_cols=28 Identities=11% Similarity=-0.072 Sum_probs=24.4
Q ss_pred HhCCChHHHHHHHHhcCCHHHHHHHHhh
Q psy11102 56 VKYGLCEQAVYVFDKNKHKSSQWLTVVQ 83 (87)
Q Consensus 56 ~~~G~~~~Av~aylk~gd~k~ai~~cv~ 83 (87)
.+--.+++|+.-|+-.|..+.||++|++
T Consensus 1286 LgKhRfe~AaaFFLLagslkDAI~VC~k 1313 (2439)
T KOG1064|consen 1286 LGKHRFEHAAAFFLLAGSLKDAINVCCK 1313 (2439)
T ss_pred hhhHHHHHHHHHHhhcchHHHHHHHHHH
Confidence 4445789999999999999999999986
No 281
>cd04389 RhoGAP_KIAA1688 RhoGAP_KIAA1688: GTPase-activator protein (GAP) domain for Rho-like GTPases found in KIAA1688-like proteins; KIAA1688 is a protein of unknown function that contains a RhoGAP domain and a myosin tail homology 4 (MyTH4) domain. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
Probab=24.56 E-value=84 Score=21.70 Aligned_cols=32 Identities=16% Similarity=0.097 Sum_probs=25.9
Q ss_pred hCCHHHHHHHHhhCCCC-CchHHHHHHHHHhCC
Q psy11102 28 LGDFEALESCARKLPDS-SPLLKPMGEIFVKYG 59 (87)
Q Consensus 28 ~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G 59 (87)
.++.+.+.+++..||+. ..+|.-+-.+|..+.
T Consensus 104 ~~~~~~~~~li~~LP~~n~~~L~~l~~~L~~v~ 136 (187)
T cd04389 104 SEDPDKAVEIVQKLPIINRLVLCYLINFLQVFA 136 (187)
T ss_pred hcCHHHHHHHHHhCCHHHHHHHHHHHHHHHHHH
Confidence 46788999999999995 588888888886554
No 282
>PF08820 DUF1803: Domain of unknown function (DUF1803); InterPro: IPR014924 This small protein is found in one or two copies in bacteria. The function of this is unknown.
Probab=24.55 E-value=92 Score=19.82 Aligned_cols=32 Identities=16% Similarity=-0.018 Sum_probs=28.6
Q ss_pred CCchHHHHHHHHHhCCChHHHHHHHHhcCCHH
Q psy11102 44 SSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKS 75 (87)
Q Consensus 44 ~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k 75 (87)
.+-+|..+-.-|......+..|++|+++|=+.
T Consensus 27 ~~~lLR~iKk~f~~qk~~D~fie~li~~GYI~ 58 (93)
T PF08820_consen 27 TDFLLRFIKKDFPKQKRLDIFIEALIKLGYIE 58 (93)
T ss_pred CHhhHHHHHHhhccccchhHHHHHHHHcCCeE
Confidence 56889999999999999999999999999654
No 283
>PF11464 Rbsn: Rabenosyn Rab binding domain; InterPro: IPR021565 Rabenosyn-5 (Rbsn) is a multivalent effector with interacts with the Rab family.Rsbn contains distinct Rab4 and Rab5 binding sites within residues 264-500 and 627-784 respectively []. Rab proteins are GTPases involved in the regulation of all stages of membrane trafficking []. ; PDB: 1Z0K_B 1YZM_A 1Z0J_B.
Probab=24.39 E-value=1.3e+02 Score=16.39 Aligned_cols=33 Identities=15% Similarity=0.240 Sum_probs=26.1
Q ss_pred HHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC
Q psy11102 11 YYEKCHCYEKLISVYTELGDFEALESCARKLPD 43 (87)
Q Consensus 11 yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~ 43 (87)
++.|..+.+..|.=.-..+.||+..-|-++|.+
T Consensus 1 LleQi~~I~~~I~qAk~~~r~dEV~~L~~NL~E 33 (42)
T PF11464_consen 1 LLEQINIIESYIKQAKAARRFDEVATLEENLRE 33 (42)
T ss_dssp HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH
Confidence 356777888888888899999999988887754
No 284
>PHA01748 hypothetical protein
Probab=24.36 E-value=51 Score=18.95 Aligned_cols=40 Identities=28% Similarity=0.135 Sum_probs=26.7
Q ss_pred CHHHHHHHHhhCCCC-CchHHHHHHHHHhCCChHHHHHHHH
Q psy11102 30 DFEALESCARKLPDS-SPLLKPMGEIFVKYGLCEQAVYVFD 69 (87)
Q Consensus 30 d~d~L~~l~~~L~~~-~~lL~~ia~~F~~~G~~~~Av~ayl 69 (87)
..+.|.+++....-+ ++++.+.-..|...-.-+.|++.|-
T Consensus 13 l~~eld~~a~~~g~~RSE~Ir~Ai~~~~~~~~~~~~~~~~~ 53 (60)
T PHA01748 13 LLELLDRYAIKHGLNRSEAIRKAIEKMVKDELKKETVPVAK 53 (60)
T ss_pred HHHHHHHHHHHhCCCHHHHHHHHHHHHHHHHHHhcccchhh
Confidence 355667777666654 6777777777776666677766663
No 285
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=24.06 E-value=2.4e+02 Score=21.19 Aligned_cols=49 Identities=18% Similarity=0.288 Sum_probs=37.0
Q ss_pred HHHHHHhCCHHHH----HHHHhhCCCC---CchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 22 ISVYTELGDFEAL----ESCARKLPDS---SPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 22 ie~~~~~ed~d~L----~~l~~~L~~~---~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
-|++|..|+|++- ..+++..|++ .+-|-++|.-....|..+.|-..|-.
T Consensus 185 Ge~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~~~~~l~~~d~A~atl~q 240 (262)
T COG1729 185 GESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLGNTDEACATLQQ 240 (262)
T ss_pred HHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHHHHHHHHHHhcCHHHHHHHHHH
Confidence 4678888888764 4566777874 35688899999999998888877654
No 286
>KOG1498|consensus
Probab=23.92 E-value=1.1e+02 Score=24.61 Aligned_cols=37 Identities=16% Similarity=0.273 Sum_probs=24.0
Q ss_pred HHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCC
Q psy11102 8 AREYYEKCHCYEKLISVYTELGDFEALESCARKLPDS 44 (87)
Q Consensus 8 A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~ 44 (87)
|..+=+-.+-+......++..++||.|.+.++.|...
T Consensus 45 ~~D~~s~~kv~~~i~~lc~~~~~w~~Lne~i~~Lskk 81 (439)
T KOG1498|consen 45 ASDMASNTKVLEEIMKLCFSAKDWDLLNEQIRLLSKK 81 (439)
T ss_pred HhhHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHH
Confidence 3444444455566677777888888887777777443
No 287
>PF12234 Rav1p_C: RAVE protein 1 C terminal; InterPro: IPR022033 This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits.
Probab=23.88 E-value=4.5e+02 Score=22.25 Aligned_cols=80 Identities=19% Similarity=0.136 Sum_probs=55.0
Q ss_pred hhHHHHHHHHHHhccHHHHHHHHH-HhCCHHHHHHHHhhCCCC-CchHHHHHHHHH-----hCCChHHHHHHHHhcCCHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVYT-ELGDFEALESCARKLPDS-SPLLKPMGEIFV-----KYGLCEQAVYVFDKNKHKS 75 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~~-~~ed~d~L~~l~~~L~~~-~~lL~~ia~~F~-----~~G~~~~Av~aylk~gd~k 75 (87)
+.|.-|+-+|--||....+++.+. +++|.+----++|-...+ .|.+.++-+... +.|..=-+-=+|-..|+..
T Consensus 466 ~Ry~~AAaFFLLag~l~dAv~V~~~~l~D~qLAi~i~Rl~e~d~gp~~~~ll~~~vLp~a~~~~d~wl~s~~~W~L~~~~ 545 (631)
T PF12234_consen 466 HRYEYAAAFFLLAGSLKDAVNVCLRQLNDPQLAIAIARLYEGDNGPVLKKLLEEHVLPEAIKEGDRWLASWAFWMLGDYD 545 (631)
T ss_pred ccHHHHHHHHHhcccHHHHHHHHHHHccChhHHHHHHHHHcCCCchHHHHHHHHhhhccccccCCHHHHHHHHHhcCCHH
Confidence 578899999999999999999776 569999888888887654 577766553321 2233333334566666666
Q ss_pred HHHHHHh
Q psy11102 76 SQWLTVV 82 (87)
Q Consensus 76 ~ai~~cv 82 (87)
.|+++-+
T Consensus 546 ~ai~~Li 552 (631)
T PF12234_consen 546 EAIRALI 552 (631)
T ss_pred HHHHHHh
Confidence 6665544
No 288
>COG4008 Predicted metal-binding transcription factor [Transcription]
Probab=23.83 E-value=2.4e+02 Score=19.32 Aligned_cols=22 Identities=9% Similarity=0.003 Sum_probs=19.5
Q ss_pred hHHHHHHHHhcCCHHHHHHHHh
Q psy11102 61 CEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 61 ~~~Av~aylk~gd~k~ai~~cv 82 (87)
-+.|.+|.+-.||++.|+.+..
T Consensus 128 ~eeAr~aleeagDl~~A~k~l~ 149 (153)
T COG4008 128 PEEAREALEEAGDLRTAMKILR 149 (153)
T ss_pred HHHHHHHHHHcCCHHHHHHHHH
Confidence 5789999999999999998754
No 289
>KOG2066|consensus
Probab=23.74 E-value=2.1e+02 Score=25.09 Aligned_cols=44 Identities=14% Similarity=0.109 Sum_probs=30.4
Q ss_pred hccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhC
Q psy11102 15 CHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKY 58 (87)
Q Consensus 15 ~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~ 58 (87)
.=|.++|++++...+-++++.-|........+.|.-|.+.+...
T Consensus 634 ~Y~lekA~eiC~q~~~~~E~VYlLgrmGn~k~AL~lII~el~di 677 (846)
T KOG2066|consen 634 NYNLEKALEICSQKNFYEELVYLLGRMGNAKEALKLIINELRDI 677 (846)
T ss_pred CCCHHHHHHHHHhhCcHHHHHHHHHhhcchHHHHHHHHHHhhCH
Confidence 34678888888888888877777776665556666666655443
No 290
>KOG2160|consensus
Probab=23.59 E-value=2.1e+02 Score=22.40 Aligned_cols=48 Identities=17% Similarity=0.190 Sum_probs=32.1
Q ss_pred CHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHH------HHHHHHhhcCC
Q psy11102 30 DFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKS------SQWLTVVQDKP 86 (87)
Q Consensus 30 d~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k------~ai~~cv~~~~ 86 (87)
-||.|+.++.+++ -|..|.+.|--....- |++..+.. ..|-+|++.||
T Consensus 103 ald~Le~lve~iD--------nAndl~~~ggl~~ll~-~l~~~~~~lR~~Aa~Vigt~~qNNP 156 (342)
T KOG2160|consen 103 ALDNLEELVEDID--------NANDLISLGGLVPLLG-YLENSDAELRELAARVIGTAVQNNP 156 (342)
T ss_pred HHHHHHHHHHhhh--------hHHhHhhccCHHHHHH-HhcCCcHHHHHHHHHHHHHHHhcCH
Confidence 3455555555543 3677777787777777 88777753 35778888887
No 291
>PF10668 Phage_terminase: Phage terminase small subunit; InterPro: IPR018925 This entry describes the terminase small subunit from Enterococcus phage phiFL1A, related proteins in other bacteriophage, and prophage regions of bacterial genomes. Packaging of double-stranded viral DNA concatemers requires interaction of the prohead with virus DNA. This process is mediated by a phage-encoded DNA recognition and terminase protein. The terminase enzymes described so far, which are hetero-oligomers composed of a small and a large subunit, do not have a significant level of sequence homology. The small terminase subunit is thought to form a nucleoprotein structure that helps to position the terminase large subunit at the packaging initiation site [].
Probab=23.53 E-value=90 Score=18.25 Aligned_cols=26 Identities=27% Similarity=0.284 Sum_probs=13.8
Q ss_pred hHHHHHHHHHHhccHHHHHHHHHHhC
Q psy11102 4 LVSQAREYYEKCHCYEKLISVYTELG 29 (87)
Q Consensus 4 ~w~~A~~yY~~~~n~~k~ie~~~~~e 29 (87)
+.++|-++|...+---++.++.-++|
T Consensus 8 ~rdkA~e~y~~~~g~i~lkdIA~~Lg 33 (60)
T PF10668_consen 8 NRDKAFEIYKESNGKIKLKDIAEKLG 33 (60)
T ss_pred CHHHHHHHHHHhCCCccHHHHHHHHC
Confidence 45667777766554444444443333
No 292
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=23.08 E-value=3.2e+02 Score=20.29 Aligned_cols=69 Identities=20% Similarity=0.176 Sum_probs=50.2
Q ss_pred chhHHHHHHHHHHh------cc---HHHHHHHHHHhCCHHH----HHHHHhhCCCC-Cch-HHHHHHHHHhCCChHHHHH
Q psy11102 2 TTLVSQAREYYEKC------HC---YEKLISVYTELGDFEA----LESCARKLPDS-SPL-LKPMGEIFVKYGLCEQAVY 66 (87)
Q Consensus 2 ~~~w~~A~~yY~~~------~n---~~k~ie~~~~~ed~d~----L~~l~~~L~~~-~~l-L~~ia~~F~~~G~~~~Av~ 66 (87)
.|++.+|+..|.++ ++ .-.+++..+..+++.. |+++++.-|.. +|. ..-.|.-+...|-.++|-.
T Consensus 102 lGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~pd~~Ll~aR~laa~g~~a~Aes 181 (251)
T COG4700 102 LGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRSPDGHLLFARTLAAQGKYADAES 181 (251)
T ss_pred hhhhhhhHHHHHHHhccccCCCHHHHHHHHHHHHhhccHHHHHHHHHHHhhcCCccCCCCchHHHHHHHHhcCCchhHHH
Confidence 36788999999985 33 3356778888888865 66666666763 322 4457899999999999988
Q ss_pred HHHh
Q psy11102 67 VFDK 70 (87)
Q Consensus 67 aylk 70 (87)
+|.-
T Consensus 182 afe~ 185 (251)
T COG4700 182 AFEV 185 (251)
T ss_pred HHHH
Confidence 8753
No 293
>PF08769 Spo0A_C: Sporulation initiation factor Spo0A C terminal; InterPro: IPR014879 The response regulator Spo0A is comprised of a phophoacceptor domain and a transcription activation domain. This domain corresponds to the transcription activation domain and forms an alpha helical structure comprising of 6 alpha helices. The structure contains a helix-turn-helix and binds DNA [, ]. ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0005509 calcium ion binding, 0006355 regulation of transcription, DNA-dependent, 0042173 regulation of sporulation resulting in formation of a cellular spore, 0005737 cytoplasm; PDB: 1FC3_C 1LQ1_D.
Probab=22.89 E-value=15 Score=23.58 Aligned_cols=39 Identities=21% Similarity=0.204 Sum_probs=25.0
Q ss_pred HHHHHHHHHHhCCHHHHHHHHh-------hCCCCCchHHHHHHHHH
Q psy11102 18 YEKLISVYTELGDFEALESCAR-------KLPDSSPLLKPMGEIFV 56 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~-------~L~~~~~lL~~ia~~F~ 56 (87)
..-+|+.....||.+.|+++.. .=|.+++++..++++++
T Consensus 61 IR~aI~~~w~~g~~~~l~~i~g~~~~~~~~kPTnsEFI~~~~~~lr 106 (106)
T PF08769_consen 61 IRHAIEVAWTRGNPELLEKIFGYTINEEKGKPTNSEFIAMLADYLR 106 (106)
T ss_dssp HHHHHHHHHHCS-CCCCHHCC-HHHHT-SS---HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHhCCCcccCCCCCCHHHHHHHHHHHhC
Confidence 4456777778888888888764 33556788888887764
No 294
>PF02845 CUE: CUE domain; InterPro: IPR003892 This domain may be involved in binding ubiquitin-conjugating enzymes (UBCs). CUE domains also occur in two proteins of the IL-1 signal transduction pathway, tollip and TAB2.; GO: 0005515 protein binding; PDB: 2EKF_A 1OTR_A 1P3Q_Q 1MN3_A 1WGL_A 2EJS_A 2DAE_A 2DHY_A 2DI0_A.
Probab=22.84 E-value=1.1e+02 Score=15.81 Aligned_cols=14 Identities=21% Similarity=0.152 Sum_probs=7.6
Q ss_pred HhCCChHHHHHHHH
Q psy11102 56 VKYGLCEQAVYVFD 69 (87)
Q Consensus 56 ~~~G~~~~Av~ayl 69 (87)
.+.|-.+.|+++.+
T Consensus 26 ~~~~~ve~ai~~LL 39 (42)
T PF02845_consen 26 ANNGDVEAAIDALL 39 (42)
T ss_dssp HTTTTHHHHHHHHH
T ss_pred HcCCCHHHHHHHHH
Confidence 45555555555544
No 295
>PTZ00373 60S Acidic ribosomal protein P2; Provisional
Probab=22.55 E-value=2.3e+02 Score=18.51 Aligned_cols=59 Identities=12% Similarity=0.039 Sum_probs=43.7
Q ss_pred CchhHHHHHHHHHHhcc-------HHHHHHHHHHhCCHHHHHHHHhhCCC-C-CchHHHHHHHHHhCC
Q psy11102 1 MTTLVSQAREYYEKCHC-------YEKLISVYTELGDFEALESCARKLPD-S-SPLLKPMGEIFVKYG 59 (87)
Q Consensus 1 ~~~~w~~A~~yY~~~~n-------~~k~ie~~~~~ed~d~L~~l~~~L~~-~-~~lL~~ia~~F~~~G 59 (87)
|++.|..|--+-..+|| +.+.++.-.-.=+-+.+..+++.|.. + +++|..=..++.++|
T Consensus 1 ~~MkyvaAYlL~~lgG~~~pTaddI~kIL~AaGveVd~~~~~l~~~~L~GKdI~ELIa~G~~kl~svg 68 (112)
T PTZ00373 1 MAMKYVAAYLMCVLGGNENPTKKEVKNVLSAVNADVEDDVLDNFFKSLEGKTPHELIAAGMKKLQNIG 68 (112)
T ss_pred CchHHHHHHHHHHHcCCCCCCHHHHHHHHHHcCCCccHHHHHHHHHHHcCCCHHHHHHHhHHHHhccc
Confidence 67777778777777766 45666666666677788888999876 4 477777777887775
No 296
>PF08025 Antimicrobial_3: Spider antimicrobial peptide; InterPro: IPR012522 This family includes antimicrobial peptides isolated from the crude venom of the wolf spider Oxyopes kitabensis (Wolf spider). These peptides, known as oxyopinins, are the largest linear cationic amphipathic peptides chemically characterised and exhibit disrupting activities towards biological membranes [].; GO: 0019836 hemolysis by symbiont of host erythrocytes, 0005576 extracellular region
Probab=22.15 E-value=57 Score=17.19 Aligned_cols=17 Identities=24% Similarity=0.477 Sum_probs=13.6
Q ss_pred CchHHHHHHHHHhCCCh
Q psy11102 45 SPLLKPMGEIFVKYGLC 61 (87)
Q Consensus 45 ~~lL~~ia~~F~~~G~~ 61 (87)
.+.|..|+.+|.-+|-.
T Consensus 7 ~kilrsiak~fkgvgk~ 23 (37)
T PF08025_consen 7 SKILRSIAKFFKGVGKV 23 (37)
T ss_pred HHHHHHHHHHHHHHHHH
Confidence 46788999999988843
No 297
>TIGR02875 spore_0_A sporulation transcription factor Spo0A. Spo0A, the stage 0 sporulation protein A, is a transcription factor critical for the initiation of sporulation. It contains a response regulator receiver domain (pfam00072). In Bacillus subtilis, it works together with response regulator Spo0F and the phosphotransferase Spo0B, both of which are missing from at least some sporulating species and thus not part of the endospore forming bacteria minimal gene set. Spo0A, however, is universal among endospore-forming species.
Probab=22.07 E-value=1.7e+02 Score=20.27 Aligned_cols=41 Identities=15% Similarity=0.171 Sum_probs=31.0
Q ss_pred cHHHHHHHHHHhCCHHHHHHHHhh-------CCCCCchHHHHHHHHHh
Q psy11102 17 CYEKLISVYTELGDFEALESCARK-------LPDSSPLLKPMGEIFVK 57 (87)
Q Consensus 17 n~~k~ie~~~~~ed~d~L~~l~~~-------L~~~~~lL~~ia~~F~~ 57 (87)
...-+|+.....|+.+.|.++..- =|.+++++..++++++.
T Consensus 210 ~~r~~i~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 257 (262)
T TIGR02875 210 AIRHAIEVAWSRGNIEPINKLFGYTIEGEKGKPTNSEFIAMVADKLRL 257 (262)
T ss_pred HHHHHHHHHHhcCCHHHHHHHhCcccccCCCCCChHHHHHHHHHHHHH
Confidence 344567777788888988888553 37788999999998864
No 298
>PF12816 Vps8: Golgi CORVET complex core vacuolar protein 8
Probab=21.86 E-value=2.5e+02 Score=19.65 Aligned_cols=34 Identities=21% Similarity=0.236 Sum_probs=20.6
Q ss_pred HHHHHHHHhccHHHHHHHHHHhC----CHHHHHHHHhh
Q psy11102 7 QAREYYEKCHCYEKLISVYTELG----DFEALESCARK 40 (87)
Q Consensus 7 ~A~~yY~~~~n~~k~ie~~~~~e----d~d~L~~l~~~ 40 (87)
.-++||...|+.+++=+|..+++ |.|.+.++|++
T Consensus 27 ~lv~~y~~~~~~~~lE~lI~~LD~~~LDidq~i~lC~~ 64 (196)
T PF12816_consen 27 ALVEHYASKGRLERLEQLILHLDPSSLDIDQVIKLCKK 64 (196)
T ss_pred HHHHHHHHCCCHHHHHHHHHhCCHHhcCHHHHHHHHHH
Confidence 34667777777666666665554 55555555554
No 299
>PF14649 Spatacsin_C: Spatacsin C-terminus
Probab=21.82 E-value=2e+02 Score=21.75 Aligned_cols=67 Identities=10% Similarity=0.024 Sum_probs=46.2
Q ss_pred HHHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCH
Q psy11102 8 AREYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHK 74 (87)
Q Consensus 8 A~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~ 74 (87)
-+..|...|++.-++....-.+.|.+|.-+.+.|=+++.+=.-++...-+++-...|+-.|+|--.|
T Consensus 52 ~~~~l~~~~~~~llvRLltGi~ry~em~yifd~L~~n~qfE~LL~k~~d~~~~lk~all~ylk~~~P 118 (296)
T PF14649_consen 52 LVNHLAAEGDWSLLVRLLTGIGRYREMTYIFDILIENDQFELLLRKGIDKVNGLKMALLDYLKRCCP 118 (296)
T ss_pred HHHHHhhcccHHHHHHHHHccCcHHHHHHHHHHHHHcChHHHHHhccccccchHHHHHHHHHHhcCC
Confidence 4577888888888888888888888888888888775422111222334555677777777776544
No 300
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=21.60 E-value=3.4e+02 Score=20.35 Aligned_cols=52 Identities=23% Similarity=0.321 Sum_probs=32.6
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCC-CCchHHHHHHHHHhCCChHHHHHHHHh
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKLPD-SSPLLKPMGEIFVKYGLCEQAVYVFDK 70 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~-~~~lL~~ia~~F~~~G~~~~Av~aylk 70 (87)
+.||++-|..+.+=+.|- .+-+|.. +-.+|..-|-++=+.|-+++|.+-|.+
T Consensus 76 A~~Yq~~Ge~~~A~e~Yr----------kAlsl~p~~GdVLNNYG~FLC~qg~~~eA~q~F~~ 128 (250)
T COG3063 76 AHYYQKLGENDLADESYR----------KALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFER 128 (250)
T ss_pred HHHHHHcCChhhHHHHHH----------HHHhcCCCccchhhhhhHHHHhCCChHHHHHHHHH
Confidence 445555555554444432 2233444 458899999999888888888775543
No 301
>KOG2063|consensus
Probab=21.37 E-value=4e+02 Score=23.52 Aligned_cols=41 Identities=2% Similarity=-0.006 Sum_probs=32.7
Q ss_pred CCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHhhc
Q psy11102 44 SSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVVQD 84 (87)
Q Consensus 44 ~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv~~ 84 (87)
++-.+++|-..+.+.|++..=+..|-+.|+.++|+++-.++
T Consensus 490 ~~c~vee~e~~L~k~~~y~~Li~LY~~kg~h~~AL~ll~~l 530 (877)
T KOG2063|consen 490 NHCDVEEIETVLKKSKKYRELIELYATKGMHEKALQLLRDL 530 (877)
T ss_pred CCcchHHHHHHHHhcccHHHHHHHHHhccchHHHHHHHHHH
Confidence 35677888888888888888888888888888888876553
No 302
>PF04010 DUF357: Protein of unknown function (DUF357); InterPro: IPR023140 This domain is found in a family of proteins, which have no known function.; PDB: 2OO2_A 2PMR_A.
Probab=21.09 E-value=1.1e+02 Score=18.51 Aligned_cols=34 Identities=12% Similarity=0.150 Sum_probs=21.6
Q ss_pred hhHHHHHHHHHHhccHHHHHHHH-HHhCCHHHHHH
Q psy11102 3 TLVSQAREYYEKCHCYEKLISVY-TELGDFEALES 36 (87)
Q Consensus 3 ~~w~~A~~yY~~~~n~~k~ie~~-~~~ed~d~L~~ 36 (87)
..|-+=..||...|++..+..|. |-.||.|.+.+
T Consensus 36 ~~Y~~D~~~fl~~gD~v~Ala~~sYa~GwLDAg~~ 70 (75)
T PF04010_consen 36 ESYLEDGKYFLEKGDYVNALACFSYAHGWLDAGRR 70 (75)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHH
Confidence 34555566777777777776665 57777776654
No 303
>PF11390 FdsD: NADH-dependant formate dehydrogenase delta subunit FdsD; InterPro: IPR021074 FdsD is the delta subunit of the enzyme formate dehydrogenase. This subunit may play a role in maintaining the quaternary structure by means of electrostatic interactions with the other subunits []. The delta subunit is not involved in the active centre of the enzyme [].
Probab=20.98 E-value=1.1e+02 Score=17.96 Aligned_cols=19 Identities=26% Similarity=0.279 Sum_probs=15.2
Q ss_pred HHHHHHHHhCCChHHHHHH
Q psy11102 49 KPMGEIFVKYGLCEQAVYV 67 (87)
Q Consensus 49 ~~ia~~F~~~G~~~~Av~a 67 (87)
.+||.+|.+.+--++||+.
T Consensus 7 NQIa~ff~~~p~~~~a~~~ 25 (61)
T PF11390_consen 7 NQIAAFFESYPPEEEAVEG 25 (61)
T ss_pred HHHHHHHccCCCHHHHHHH
Confidence 4788999988788888765
No 304
>smart00544 MA3 Domain in DAP-5, eIF4G, MA-3 and other proteins. Highly alpha-helical. May contain repeats and/or regions similar to MIF4G domains Ponting (TIBS) "Novel eIF4G domain homologues" in press
Probab=20.84 E-value=2.1e+02 Score=17.41 Aligned_cols=26 Identities=23% Similarity=0.261 Sum_probs=19.7
Q ss_pred HHHHHHHHHHhCCHHHHHHHHhhCCC
Q psy11102 18 YEKLISVYTELGDFEALESCARKLPD 43 (87)
Q Consensus 18 ~~k~ie~~~~~ed~d~L~~l~~~L~~ 43 (87)
...+++-|...+|+++..+.+..|+-
T Consensus 5 i~~~l~ey~~~~D~~ea~~~l~~L~~ 30 (113)
T smart00544 5 IFLIIEEYLSSGDTDEAVHCLLELKL 30 (113)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHhCC
Confidence 34567778888888888888888764
No 305
>PF07911 DUF1677: Protein of unknown function (DUF1677); InterPro: IPR012876 The sequences found in this family are all derived from hypothetical plant proteins of unknown function. The region features a number of highly conserved cysteine residues.
Probab=20.45 E-value=1.5e+02 Score=18.85 Aligned_cols=26 Identities=19% Similarity=0.116 Sum_probs=18.9
Q ss_pred CCChHHHHHHHHh----cCCHHHHHHHHhh
Q psy11102 58 YGLCEQAVYVFDK----NKHKSSQWLTVVQ 83 (87)
Q Consensus 58 ~G~~~~Av~aylk----~gd~k~ai~~cv~ 83 (87)
=|+|.+||.-=+. ...+..|++-|+.
T Consensus 34 CGLC~EAVkeE~~r~~~~~~~eeAl~~Hm~ 63 (91)
T PF07911_consen 34 CGLCSEAVKEEVGRSKPGISVEEALDRHMS 63 (91)
T ss_pred hhcCHHHHHHHHhccCCCCCHHHHHHHHHH
Confidence 4899999976553 4567788887764
No 306
>COG1194 MutY A/G-specific DNA glycosylase [DNA replication, recombination, and repair]
Probab=20.38 E-value=81 Score=24.61 Aligned_cols=62 Identities=15% Similarity=0.098 Sum_probs=39.2
Q ss_pred HHHHHHhccHHHHHHHHHHhCCHHHHHHHHhhCCCCCchHHHHHHHHHhCCChHHHHHHHHhcCCHHHHHHHHh
Q psy11102 9 REYYEKCHCYEKLISVYTELGDFEALESCARKLPDSSPLLKPMGEIFVKYGLCEQAVYVFDKNKHKSSQWLTVV 82 (87)
Q Consensus 9 ~~yY~~~~n~~k~ie~~~~~ed~d~L~~l~~~L~~~~~lL~~ia~~F~~~G~~~~Av~aylk~gd~k~ai~~cv 82 (87)
..||.+++|+.++++.....-. ..+|++-..|.. +--+|.+..+.-+-.-.|.+..++|.-|
T Consensus 84 LGYysRArnL~~~A~~v~~~~~--------G~~P~~~~~l~~----LpGiG~yTa~Ail~~a~~~~~~~lDgNV 145 (342)
T COG1194 84 LGYYSRARNLHKAAQEVVERHG--------GEFPDDEEELAA----LPGVGPYTAGAILSFAFNQPEPVLDGNV 145 (342)
T ss_pred cChHHHHHHHHHHHHHHHHHcC--------CCCCCCHHHHHh----CCCCcHHHHHHHHHHHhCCCCceeecch
Confidence 3499999999999887654311 234554332222 5556777666666667777777666655
No 307
>KOG0307|consensus
Probab=20.31 E-value=2.2e+02 Score=25.62 Aligned_cols=61 Identities=20% Similarity=0.231 Sum_probs=44.5
Q ss_pred HHHHHHHHHhccHHHHHHHHHHhCCHH-----------HHHHHHhhCCC---------C-CchHHHHHHHHHhCCChHHH
Q psy11102 6 SQAREYYEKCHCYEKLISVYTELGDFE-----------ALESCARKLPD---------S-SPLLKPMGEIFVKYGLCEQA 64 (87)
Q Consensus 6 ~~A~~yY~~~~n~~k~ie~~~~~ed~d-----------~L~~l~~~L~~---------~-~~lL~~ia~~F~~~G~~~~A 64 (87)
..|...|.-+|+.+|+++|.++.++-+ .|++++..|.- + ..++..-|+.+++-|..+-|
T Consensus 605 ~~a~lcYi~agsv~k~v~~w~~~~~~~~~~~~y~~~~e~l~~~~~~l~~~~~~~~~s~~l~~~~~~yanllasQG~~~~A 684 (1049)
T KOG0307|consen 605 SAAILCYICAGSVDKLVEIWLKALDLELAPTSYQDLAEDLMELTLKLAQFSANKTYSAGLAKKFSEYANLLASQGALAAA 684 (1049)
T ss_pred hhhhHHhhhccChhhhHHHHHHhcccccchHHHHHHHHHHHHHHhhhhhcccCccccHHHHHHHHHHHHHHHhcChHHHH
Confidence 457788999999999999999998874 45555555521 1 24566677778888888777
Q ss_pred HH
Q psy11102 65 VY 66 (87)
Q Consensus 65 v~ 66 (87)
.+
T Consensus 685 ~~ 686 (1049)
T KOG0307|consen 685 MS 686 (1049)
T ss_pred Hh
Confidence 64
No 308
>cd04409 RhoGAP_PARG1 RhoGAP_PARG1: RhoGAP (GTPase-activator protein [GAP] for Rho-like small GTPases) domain of PARG1 (PTPL1-associated RhoGAP1). PARG1 was originally cloned as an interaction partner of PTPL1, an intracellular protein-tyrosine phosphatase. PARG1 interacts with Rap2, also a member of the Ras small GTPase superfamily whose exact function is unknown, and shows strong preference for Rho. Small GTPases cluster into distinct families, and all act as molecular switches, active in their GTP-bound form but inactive when GDP-bound. The Rho family of GTPases activates effectors involved in a wide variety of developmental processes, including regulation of cytoskeleton formation, cell proliferation and the JNK signaling pathway. GTPases generally have a low intrinsic GTPase hydrolytic activity but there are family-specific groups of GAPs that enhance the rate of GTP hydrolysis by several orders of magnitude.
Probab=20.31 E-value=59 Score=22.94 Aligned_cols=32 Identities=13% Similarity=0.030 Sum_probs=26.0
Q ss_pred HHHHHHHHhhCCC-CCchHHHHHHHHHhCCChH
Q psy11102 31 FEALESCARKLPD-SSPLLKPMGEIFVKYGLCE 62 (87)
Q Consensus 31 ~d~L~~l~~~L~~-~~~lL~~ia~~F~~~G~~~ 62 (87)
+..+.++++.||. +..+|..+-.+|..+..++
T Consensus 130 ~~~l~~li~~LP~~n~~~L~~L~~~L~~V~~~s 162 (211)
T cd04409 130 LLKSKDLLRQLPAPNYNTLQFLIVHLHRVSEQA 162 (211)
T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHccc
Confidence 4678899999999 4689999999998776553
Done!