Query psy18096
Match_columns 131
No_of_seqs 179 out of 1901
Neff 9.3
Searched_HMMs 46136
Date Fri Aug 16 22:28:14 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy18096.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/18096hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0543|consensus 100.0 2.7E-36 5.8E-41 220.9 12.9 130 1-130 141-303 (397)
2 KOG0545|consensus 99.8 2E-20 4.3E-25 130.5 4.9 129 2-130 73-276 (329)
3 KOG0553|consensus 99.6 4E-15 8.7E-20 106.3 7.3 63 68-130 97-161 (304)
4 KOG0547|consensus 99.6 5.2E-15 1.1E-19 111.6 6.6 63 68-130 131-195 (606)
5 KOG0544|consensus 99.5 9.5E-14 2.1E-18 83.1 4.5 47 2-49 61-107 (108)
6 KOG0548|consensus 99.5 2.6E-13 5.7E-18 103.1 7.8 63 68-130 374-438 (539)
7 TIGR00990 3a0801s09 mitochondr 99.4 2.2E-12 4.7E-17 102.1 9.6 97 32-130 93-206 (615)
8 KOG4234|consensus 99.4 8.6E-13 1.9E-17 90.2 5.6 63 68-130 111-180 (271)
9 KOG0549|consensus 99.3 1.6E-12 3.5E-17 86.9 4.5 52 1-53 128-179 (188)
10 KOG0551|consensus 99.3 7.5E-12 1.6E-16 90.9 5.8 63 68-130 97-165 (390)
11 KOG4648|consensus 99.2 2.7E-11 5.8E-16 88.9 4.0 63 68-130 113-177 (536)
12 COG0545 FkpA FKBP-type peptidy 99.2 4.5E-11 9.8E-16 81.5 4.8 47 2-49 158-204 (205)
13 KOG0548|consensus 99.1 1.3E-10 2.9E-15 88.5 6.0 64 68-131 18-83 (539)
14 KOG0550|consensus 99.1 6.2E-11 1.3E-15 88.3 3.7 63 68-130 265-333 (486)
15 KOG4642|consensus 99.1 1.5E-10 3.3E-15 80.9 5.1 63 68-130 26-90 (284)
16 PF13414 TPR_11: TPR repeat; P 99.1 6.4E-10 1.4E-14 63.8 7.0 48 68-115 19-69 (69)
17 KOG0376|consensus 99.1 2.9E-11 6.4E-16 91.3 1.1 63 68-130 20-84 (476)
18 PF00254 FKBP_C: FKBP-type pep 99.0 2.2E-10 4.9E-15 69.8 3.9 47 1-47 48-94 (94)
19 PLN03088 SGT1, suppressor of 99.0 1.2E-09 2.5E-14 81.6 8.0 63 68-130 18-82 (356)
20 TIGR03516 ppisom_GldI peptidyl 99.0 2.9E-10 6.3E-15 77.1 3.8 49 1-50 128-176 (177)
21 PRK15363 pathogenicity island 99.0 2.3E-09 4.9E-14 71.0 7.4 63 68-130 51-115 (157)
22 PRK15359 type III secretion sy 99.0 2.4E-09 5.2E-14 70.4 7.3 61 68-128 74-136 (144)
23 PF13414 TPR_11: TPR repeat; P 99.0 1.9E-09 4.1E-14 61.8 5.5 48 83-130 2-49 (69)
24 PRK15359 type III secretion sy 98.9 5.7E-09 1.2E-13 68.6 7.0 63 68-130 40-104 (144)
25 PF00515 TPR_1: Tetratricopept 98.9 3.1E-09 6.8E-14 52.7 4.4 34 84-117 1-34 (34)
26 KOG0552|consensus 98.9 2E-09 4.4E-14 74.9 4.2 47 1-49 179-225 (226)
27 PRK11570 peptidyl-prolyl cis-t 98.8 3.7E-09 8.1E-14 73.3 4.1 48 1-49 158-205 (206)
28 PRK11189 lipoprotein NlpI; Pro 98.8 8.3E-09 1.8E-13 75.2 5.7 90 40-129 40-143 (296)
29 PF14559 TPR_19: Tetratricopep 98.7 6.4E-08 1.4E-12 55.1 6.5 59 68-126 7-67 (68)
30 PF13432 TPR_16: Tetratricopep 98.7 6.8E-08 1.5E-12 54.7 6.3 51 68-118 13-65 (65)
31 PF13371 TPR_9: Tetratricopept 98.7 1E-07 2.2E-12 55.0 7.2 59 68-126 11-71 (73)
32 PF13428 TPR_14: Tetratricopep 98.7 5.4E-08 1.2E-12 51.1 5.3 43 85-127 2-44 (44)
33 TIGR02552 LcrH_SycD type III s 98.7 1.1E-07 2.4E-12 61.0 7.2 63 68-130 33-97 (135)
34 PF07719 TPR_2: Tetratricopept 98.7 7.4E-08 1.6E-12 47.4 4.9 34 84-117 1-34 (34)
35 PF12895 Apc3: Anaphase-promot 98.6 9.1E-08 2E-12 57.0 5.3 62 68-130 5-70 (84)
36 PRK10902 FKBP-type peptidyl-pr 98.6 6.9E-08 1.5E-12 69.5 4.7 51 1-53 202-252 (269)
37 KOG0553|consensus 98.5 2.9E-07 6.2E-12 66.4 5.2 56 68-123 131-188 (304)
38 PLN03098 LPA1 LOW PSII ACCUMUL 98.5 4E-07 8.7E-12 69.2 6.0 59 56-114 71-142 (453)
39 PLN03088 SGT1, suppressor of 98.4 1.5E-06 3.3E-11 65.0 8.0 61 68-128 52-114 (356)
40 KOG1308|consensus 98.4 4E-08 8.7E-13 72.0 -0.7 63 68-130 130-194 (377)
41 TIGR00990 3a0801s09 mitochondr 98.4 1.4E-06 3E-11 69.3 7.1 61 68-128 381-443 (615)
42 PF14853 Fis1_TPR_C: Fis1 C-te 98.3 2.1E-06 4.6E-11 46.9 5.3 38 85-122 2-39 (53)
43 PLN02789 farnesyltranstransfer 98.3 2.7E-06 5.9E-11 62.9 7.5 59 70-128 126-186 (320)
44 PF13432 TPR_16: Tetratricopep 98.3 1.3E-06 2.8E-11 49.3 4.6 43 88-130 1-43 (65)
45 KOG0543|consensus 98.3 1.4E-06 3.1E-11 65.1 6.0 51 68-118 273-325 (397)
46 PRK10370 formate-dependent nit 98.3 7.7E-06 1.7E-10 56.5 8.9 63 68-130 89-156 (198)
47 cd00189 TPR Tetratricopeptide 98.3 5.7E-06 1.2E-10 47.9 7.1 62 68-129 16-79 (100)
48 PRK02603 photosystem I assembl 98.3 7.2E-06 1.6E-10 55.1 8.3 62 68-129 51-117 (172)
49 PRK12370 invasion protein regu 98.3 3.8E-06 8.3E-11 66.2 7.9 62 69-130 321-384 (553)
50 PRK10370 formate-dependent nit 98.3 6E-06 1.3E-10 57.0 8.0 61 68-128 55-117 (198)
51 PF13181 TPR_8: Tetratricopept 98.3 1.9E-06 4.1E-11 42.3 4.0 33 85-117 2-34 (34)
52 KOG0550|consensus 98.3 3E-07 6.6E-12 69.0 1.1 63 68-130 65-129 (486)
53 CHL00033 ycf3 photosystem I as 98.2 9.5E-06 2.1E-10 54.3 8.0 62 68-129 51-117 (168)
54 TIGR02552 LcrH_SycD type III s 98.2 1.3E-05 2.7E-10 51.3 7.5 54 68-121 67-122 (135)
55 PRK11189 lipoprotein NlpI; Pro 98.2 9.4E-06 2E-10 59.2 7.5 52 68-119 114-167 (296)
56 PF13424 TPR_12: Tetratricopep 98.2 9.7E-06 2.1E-10 47.3 5.9 47 68-114 21-76 (78)
57 TIGR03302 OM_YfiO outer membra 98.2 1.9E-05 4.1E-10 55.3 8.4 62 68-129 49-118 (235)
58 PLN02789 farnesyltranstransfer 98.1 1E-05 2.3E-10 59.8 7.1 61 68-128 53-116 (320)
59 KOG4234|consensus 98.1 1.7E-05 3.7E-10 54.9 7.5 59 68-128 150-210 (271)
60 KOG0624|consensus 98.1 2.1E-06 4.5E-11 63.6 3.0 63 68-130 285-353 (504)
61 KOG4626|consensus 98.1 9.1E-06 2E-10 64.2 6.5 61 68-128 370-432 (966)
62 COG4785 NlpI Lipoprotein NlpI, 98.1 9.4E-07 2E-11 61.7 0.8 95 34-128 35-143 (297)
63 PRK15331 chaperone protein Sic 98.1 8.3E-06 1.8E-10 54.5 5.2 63 68-130 53-117 (165)
64 TIGR02521 type_IV_pilW type IV 98.1 2.8E-05 6E-10 52.9 8.1 62 68-129 47-110 (234)
65 KOG4626|consensus 98.1 1.3E-05 2.8E-10 63.4 6.9 60 70-129 304-365 (966)
66 TIGR02795 tol_pal_ybgF tol-pal 98.1 3.1E-05 6.7E-10 48.0 7.5 62 68-129 18-87 (119)
67 KOG1125|consensus 98.1 4.6E-06 9.9E-11 64.7 4.2 62 68-129 446-509 (579)
68 KOG4555|consensus 98.1 2.5E-05 5.5E-10 50.6 6.8 62 68-129 59-126 (175)
69 PRK09782 bacteriophage N4 rece 98.0 2.3E-05 5E-10 65.5 8.1 62 68-129 625-688 (987)
70 PRK12370 invasion protein regu 98.0 2.5E-05 5.5E-10 61.6 7.9 61 68-128 354-416 (553)
71 PF13512 TPR_18: Tetratricopep 98.0 9.5E-05 2.1E-09 48.4 8.8 63 68-130 26-96 (142)
72 PRK15363 pathogenicity island 98.0 4.2E-05 9.2E-10 50.8 6.9 62 68-129 85-150 (157)
73 PF13176 TPR_7: Tetratricopept 98.0 1.9E-05 4.1E-10 39.5 3.9 29 86-114 1-29 (36)
74 PLN03098 LPA1 LOW PSII ACCUMUL 98.0 4.5E-05 9.7E-10 58.3 7.5 53 78-130 69-124 (453)
75 smart00028 TPR Tetratricopepti 97.9 1.9E-05 4E-10 37.0 3.8 33 85-117 2-34 (34)
76 KOG1310|consensus 97.9 2.3E-05 5E-10 60.9 5.9 63 68-130 390-457 (758)
77 PRK10866 outer membrane biogen 97.9 0.00013 2.8E-09 52.0 9.1 61 68-128 48-116 (243)
78 KOG0547|consensus 97.9 1.9E-05 4E-10 60.8 4.9 51 78-128 388-438 (606)
79 PF13371 TPR_9: Tetratricopept 97.9 2.1E-05 4.5E-10 45.2 4.0 40 91-130 2-41 (73)
80 KOG1126|consensus 97.9 9.7E-06 2.1E-10 63.8 3.1 63 68-130 505-569 (638)
81 PRK15179 Vi polysaccharide bio 97.9 6.1E-05 1.3E-09 60.9 7.7 62 68-129 102-165 (694)
82 TIGR02521 type_IV_pilW type IV 97.9 9.8E-05 2.1E-09 50.2 7.7 62 68-129 115-180 (234)
83 PF13525 YfiO: Outer membrane 97.9 0.00013 2.9E-09 50.4 8.3 62 68-129 21-90 (203)
84 PRK09782 bacteriophage N4 rece 97.9 7.7E-05 1.7E-09 62.5 8.2 62 68-130 592-655 (987)
85 PRK15174 Vi polysaccharide exp 97.8 8.4E-05 1.8E-09 59.9 7.9 63 68-130 92-156 (656)
86 PRK15179 Vi polysaccharide bio 97.8 7.1E-05 1.5E-09 60.6 7.4 63 68-130 136-200 (694)
87 PF13431 TPR_17: Tetratricopep 97.8 2.1E-05 4.6E-10 38.9 2.9 27 78-104 7-33 (34)
88 KOG1155|consensus 97.8 7E-05 1.5E-09 57.3 6.4 63 68-130 346-410 (559)
89 PF13424 TPR_12: Tetratricopep 97.8 5.2E-05 1.1E-09 44.1 4.5 49 81-129 2-57 (78)
90 PF14559 TPR_19: Tetratricopep 97.8 5.4E-05 1.2E-09 42.7 4.4 36 95-130 2-37 (68)
91 PF13174 TPR_6: Tetratricopept 97.8 6.6E-05 1.4E-09 36.2 4.0 32 86-117 2-33 (33)
92 cd00189 TPR Tetratricopeptide 97.7 0.00015 3.2E-09 41.7 5.9 44 86-129 2-45 (100)
93 PF13431 TPR_17: Tetratricopep 97.7 4.2E-05 9.1E-10 37.8 2.9 24 107-130 2-25 (34)
94 PRK15174 Vi polysaccharide exp 97.7 0.0002 4.4E-09 57.7 7.9 58 71-128 269-328 (656)
95 PRK10049 pgaA outer membrane p 97.7 0.00023 5E-09 58.3 7.8 63 68-130 375-439 (765)
96 COG5010 TadD Flp pilus assembl 97.7 0.00021 4.5E-09 50.9 6.5 63 68-130 116-180 (257)
97 PF06552 TOM20_plant: Plant sp 97.6 0.00032 6.9E-09 47.6 7.0 63 68-130 7-81 (186)
98 COG4783 Putative Zn-dependent 97.6 0.00012 2.6E-09 56.1 5.4 63 68-130 356-420 (484)
99 KOG2076|consensus 97.6 0.00018 3.8E-09 58.7 6.5 62 68-129 430-494 (895)
100 PRK11447 cellulose synthase su 97.6 0.00028 6E-09 60.2 8.0 61 68-128 367-429 (1157)
101 PF13429 TPR_15: Tetratricopep 97.6 0.00045 9.7E-09 49.7 7.7 62 68-129 126-191 (280)
102 PRK11447 cellulose synthase su 97.5 0.00043 9.3E-09 59.1 8.1 61 68-128 477-539 (1157)
103 COG3063 PilF Tfp pilus assembl 97.5 0.00027 5.9E-09 49.7 5.8 62 68-129 119-184 (250)
104 KOG1128|consensus 97.5 0.00022 4.8E-09 57.1 5.9 63 68-130 501-565 (777)
105 KOG1155|consensus 97.5 0.0004 8.6E-09 53.3 7.0 52 78-129 426-477 (559)
106 PF12895 Apc3: Anaphase-promot 97.5 0.00036 7.8E-09 41.2 5.5 43 68-110 41-84 (84)
107 KOG0624|consensus 97.5 0.00032 6.9E-09 52.3 6.1 63 68-130 54-118 (504)
108 TIGR02795 tol_pal_ybgF tol-pal 97.5 0.00065 1.4E-08 41.9 6.6 53 68-120 55-112 (119)
109 TIGR02917 PEP_TPR_lipo putativ 97.5 0.0005 1.1E-08 55.5 7.4 51 79-129 832-882 (899)
110 PF04733 Coatomer_E: Coatomer 97.5 0.00025 5.4E-09 51.8 5.1 62 68-129 183-246 (290)
111 COG3063 PilF Tfp pilus assembl 97.5 0.00084 1.8E-08 47.3 7.4 59 68-126 51-111 (250)
112 TIGR02917 PEP_TPR_lipo putativ 97.5 0.00061 1.3E-08 55.0 7.7 61 68-128 141-203 (899)
113 PRK10803 tol-pal system protei 97.4 0.0011 2.4E-08 47.8 8.0 62 68-129 159-228 (263)
114 PRK11788 tetratricopeptide rep 97.4 0.0011 2.4E-08 49.5 8.2 62 68-129 157-225 (389)
115 KOG1125|consensus 97.4 0.00017 3.8E-09 56.2 3.6 49 68-116 480-530 (579)
116 KOG1173|consensus 97.4 0.00048 1.1E-08 53.8 6.0 51 68-118 471-523 (611)
117 TIGR03302 OM_YfiO outer membra 97.4 0.0018 4E-08 45.2 8.4 44 87-130 169-215 (235)
118 PRK10049 pgaA outer membrane p 97.4 0.001 2.3E-08 54.5 7.9 59 68-127 99-159 (765)
119 PF13374 TPR_10: Tetratricopep 97.3 0.00061 1.3E-08 34.4 4.4 31 84-114 2-32 (42)
120 KOG1173|consensus 97.3 0.00088 1.9E-08 52.4 6.7 63 68-130 430-501 (611)
121 PF03704 BTAD: Bacterial trans 97.3 0.0015 3.3E-08 42.4 7.0 63 68-130 29-108 (146)
122 PRK11788 tetratricopeptide rep 97.3 0.0017 3.6E-08 48.5 7.6 61 68-128 196-259 (389)
123 PF13429 TPR_15: Tetratricopep 97.2 0.0012 2.7E-08 47.4 6.2 50 81-130 211-260 (280)
124 PF09295 ChAPs: ChAPs (Chs5p-A 97.2 0.002 4.4E-08 49.0 7.4 62 69-130 217-280 (395)
125 PRK02603 photosystem I assembl 97.1 0.002 4.2E-08 43.2 6.3 53 78-130 29-84 (172)
126 KOG1126|consensus 97.1 0.00084 1.8E-08 53.2 4.7 61 68-128 471-533 (638)
127 KOG3364|consensus 97.0 0.0022 4.8E-08 41.7 5.3 55 68-122 51-109 (149)
128 PF06552 TOM20_plant: Plant sp 97.0 0.0039 8.5E-08 42.4 6.6 57 68-126 51-120 (186)
129 KOG0546|consensus 97.0 9.7E-05 2.1E-09 54.7 -1.3 63 68-130 245-321 (372)
130 PRK14720 transcript cleavage f 97.0 0.0037 7.9E-08 52.0 7.5 50 81-130 113-162 (906)
131 KOG4162|consensus 97.0 0.0033 7.2E-08 50.8 6.8 62 68-129 700-765 (799)
132 KOG4648|consensus 96.9 0.0026 5.5E-08 47.7 5.5 42 78-119 159-200 (536)
133 PF12569 NARP1: NMDA receptor- 96.8 0.0082 1.8E-07 47.3 8.0 61 68-128 20-82 (517)
134 PRK15095 FKBP-type peptidyl-pr 96.8 0.00052 1.1E-08 45.8 1.2 28 1-28 48-75 (156)
135 PRK15331 chaperone protein Sic 96.8 0.0044 9.6E-08 41.5 5.6 60 68-129 87-148 (165)
136 PF12968 DUF3856: Domain of Un 96.8 0.01 2.2E-07 37.9 6.7 63 68-130 25-112 (144)
137 CHL00033 ycf3 photosystem I as 96.7 0.0097 2.1E-07 39.6 6.5 49 81-129 32-83 (168)
138 cd05804 StaR_like StaR_like; a 96.7 0.0093 2E-07 44.0 7.0 62 68-129 130-197 (355)
139 PRK11906 transcriptional regul 96.6 0.012 2.5E-07 45.5 7.4 53 78-130 332-384 (458)
140 KOG1840|consensus 96.6 0.0055 1.2E-07 48.1 5.6 47 68-114 257-313 (508)
141 PF12569 NARP1: NMDA receptor- 96.6 0.0078 1.7E-07 47.4 6.5 46 84-129 194-239 (517)
142 PF12688 TPR_5: Tetratrico pep 96.6 0.034 7.5E-07 35.4 8.2 49 68-116 17-70 (120)
143 KOG0376|consensus 96.5 0.0026 5.6E-08 49.0 3.3 54 68-121 54-109 (476)
144 COG4235 Cytochrome c biogenesi 96.5 0.016 3.4E-07 42.3 6.8 63 68-130 172-239 (287)
145 PRK14574 hmsH outer membrane p 96.4 0.016 3.5E-07 48.1 7.4 60 68-127 84-145 (822)
146 KOG0545|consensus 96.3 0.024 5.2E-07 40.7 7.0 57 68-124 246-304 (329)
147 KOG3785|consensus 96.3 0.038 8.2E-07 41.8 8.3 61 68-128 167-229 (557)
148 PRK10153 DNA-binding transcrip 96.3 0.016 3.4E-07 45.7 6.7 51 68-119 436-488 (517)
149 COG4105 ComL DNA uptake lipopr 96.3 0.055 1.2E-06 38.8 8.8 62 68-129 50-119 (254)
150 KOG2002|consensus 96.3 0.0031 6.7E-08 52.2 2.7 111 16-129 639-761 (1018)
151 cd05804 StaR_like StaR_like; a 96.3 0.014 3.1E-07 43.0 6.0 50 80-129 110-159 (355)
152 PF14938 SNAP: Soluble NSF att 96.2 0.035 7.6E-07 40.3 7.8 62 68-129 138-207 (282)
153 PRK11906 transcriptional regul 96.2 0.024 5.3E-07 43.8 7.0 62 68-129 274-349 (458)
154 PF15015 NYD-SP12_N: Spermatog 96.2 0.023 5E-07 43.6 6.7 61 68-128 199-272 (569)
155 PF09976 TPR_21: Tetratricopep 96.2 0.041 8.8E-07 35.8 7.0 61 68-129 64-129 (145)
156 PRK10803 tol-pal system protei 96.1 0.027 5.9E-07 40.7 6.6 53 68-120 196-253 (263)
157 KOG2002|consensus 96.1 0.021 4.6E-07 47.5 6.4 62 68-129 286-353 (1018)
158 KOG0551|consensus 96.1 0.01 2.2E-07 44.1 4.2 54 68-121 135-190 (390)
159 COG4235 Cytochrome c biogenesi 96.0 0.023 5E-07 41.4 5.6 52 78-129 150-201 (287)
160 KOG3060|consensus 95.9 0.055 1.2E-06 39.0 7.2 48 81-128 151-198 (289)
161 KOG2076|consensus 95.9 0.041 8.8E-07 45.5 7.4 49 81-129 204-252 (895)
162 KOG1129|consensus 95.9 0.015 3.2E-07 43.5 4.3 43 78-120 284-326 (478)
163 COG4783 Putative Zn-dependent 95.8 0.063 1.4E-06 41.6 7.5 63 68-130 310-386 (484)
164 PF12862 Apc5: Anaphase-promot 95.8 0.062 1.3E-06 32.5 6.2 47 68-114 14-71 (94)
165 KOG1128|consensus 95.7 0.015 3.2E-07 47.1 4.0 56 68-123 535-592 (777)
166 KOG1174|consensus 95.7 0.034 7.3E-07 42.7 5.7 54 68-121 454-508 (564)
167 KOG1840|consensus 95.5 0.044 9.6E-07 43.1 5.9 47 68-114 341-397 (508)
168 KOG3060|consensus 95.5 0.082 1.8E-06 38.1 6.7 57 68-124 170-231 (289)
169 KOG4814|consensus 95.4 0.089 1.9E-06 42.5 7.3 56 68-123 370-433 (872)
170 PRK10941 hypothetical protein; 95.4 0.087 1.9E-06 38.2 6.8 48 82-129 179-226 (269)
171 KOG2003|consensus 95.3 0.011 2.3E-07 45.9 2.1 56 68-124 253-315 (840)
172 KOG1129|consensus 95.3 0.039 8.5E-07 41.3 4.8 49 80-128 354-405 (478)
173 COG1047 SlpA FKBP-type peptidy 95.3 0.0091 2E-07 40.4 1.4 28 1-28 46-73 (174)
174 PRK10737 FKBP-type peptidyl-pr 95.3 0.0086 1.9E-07 41.4 1.2 28 1-28 45-72 (196)
175 KOG1127|consensus 95.1 0.083 1.8E-06 44.5 6.5 61 68-128 18-81 (1238)
176 PF09295 ChAPs: ChAPs (Chs5p-A 95.1 0.061 1.3E-06 41.1 5.5 42 68-109 250-293 (395)
177 TIGR00540 hemY_coli hemY prote 95.1 0.12 2.5E-06 39.5 7.1 60 69-129 316-381 (409)
178 COG2956 Predicted N-acetylgluc 95.1 0.11 2.4E-06 38.8 6.4 62 68-129 157-225 (389)
179 PRK10747 putative protoheme IX 95.1 0.14 3.1E-06 38.9 7.4 52 68-119 310-363 (398)
180 PRK10747 putative protoheme IX 95.0 0.18 3.9E-06 38.4 7.9 62 68-129 134-198 (398)
181 PF09976 TPR_21: Tetratricopep 95.0 0.24 5.3E-06 32.1 7.6 48 82-129 46-96 (145)
182 KOG3081|consensus 95.0 0.1 2.2E-06 37.8 6.0 58 68-125 189-248 (299)
183 KOG4555|consensus 95.0 0.053 1.2E-06 35.4 4.1 41 90-130 49-89 (175)
184 PRK10153 DNA-binding transcrip 94.8 0.18 3.9E-06 39.9 7.6 50 80-130 416-465 (517)
185 PF10516 SHNi-TPR: SHNi-TPR; 94.8 0.091 2E-06 26.5 4.0 30 85-114 2-31 (38)
186 KOG0549|consensus 94.8 0.015 3.2E-07 39.5 1.3 29 1-29 12-40 (188)
187 COG5010 TadD Flp pilus assembl 94.7 0.12 2.6E-06 37.1 5.7 49 81-129 97-145 (257)
188 KOG1156|consensus 94.5 0.14 3.1E-06 41.2 6.2 62 68-129 91-154 (700)
189 PF07721 TPR_4: Tetratricopept 94.4 0.067 1.5E-06 24.3 2.7 24 85-108 2-25 (26)
190 KOG4151|consensus 94.3 0.046 9.9E-07 44.6 3.3 50 80-129 87-138 (748)
191 TIGR00540 hemY_coli hemY prote 94.3 0.28 6.2E-06 37.4 7.4 62 68-129 134-198 (409)
192 TIGR03504 FimV_Cterm FimV C-te 94.3 0.11 2.4E-06 27.0 3.6 27 87-113 2-28 (44)
193 KOG2796|consensus 94.3 0.085 1.8E-06 38.4 4.1 49 68-116 268-318 (366)
194 KOG1156|consensus 94.2 0.16 3.5E-06 40.9 6.0 47 80-126 71-117 (700)
195 PF12688 TPR_5: Tetratrico pep 94.1 0.21 4.5E-06 31.8 5.4 45 85-129 2-49 (120)
196 KOG2471|consensus 94.1 0.23 4.9E-06 39.2 6.3 48 83-130 334-381 (696)
197 PF10255 Paf67: RNA polymerase 94.0 0.078 1.7E-06 40.6 3.7 45 68-112 138-192 (404)
198 PF07720 TPR_3: Tetratricopept 93.9 0.34 7.4E-06 24.0 4.8 32 86-117 3-36 (36)
199 PRK14574 hmsH outer membrane p 93.8 0.38 8.2E-06 40.3 7.6 62 68-129 432-495 (822)
200 COG3947 Response regulator con 93.5 0.52 1.1E-05 34.8 7.0 47 78-124 273-319 (361)
201 KOG4340|consensus 93.5 0.3 6.4E-06 36.4 5.9 61 68-128 26-88 (459)
202 KOG4162|consensus 93.5 0.39 8.5E-06 39.4 7.0 52 78-129 678-729 (799)
203 KOG2003|consensus 93.4 0.32 6.9E-06 38.1 6.1 61 68-128 506-568 (840)
204 PF13525 YfiO: Outer membrane 93.4 0.46 9.9E-06 32.7 6.5 48 83-130 4-54 (203)
205 PRK10941 hypothetical protein; 93.2 0.8 1.7E-05 33.3 7.7 60 68-127 197-258 (269)
206 PF10300 DUF3808: Protein of u 93.2 0.7 1.5E-05 36.2 7.9 60 68-128 283-348 (468)
207 COG4785 NlpI Lipoprotein NlpI, 93.0 0.072 1.6E-06 37.8 2.0 51 68-118 115-167 (297)
208 KOG3824|consensus 92.8 0.21 4.6E-06 37.2 4.3 60 68-127 132-193 (472)
209 PF14938 SNAP: Soluble NSF att 92.4 0.79 1.7E-05 33.2 6.9 50 68-117 171-229 (282)
210 KOG1586|consensus 92.4 1 2.3E-05 32.3 7.1 62 68-129 130-206 (288)
211 PF03704 BTAD: Bacterial trans 92.1 0.88 1.9E-05 29.3 6.3 44 68-111 78-123 (146)
212 COG1729 Uncharacterized protei 92.0 0.52 1.1E-05 34.1 5.4 62 68-129 157-226 (262)
213 KOG1127|consensus 91.9 0.43 9.3E-06 40.5 5.4 59 68-126 578-638 (1238)
214 COG1729 Uncharacterized protei 91.8 0.6 1.3E-05 33.8 5.5 44 87-130 144-190 (262)
215 TIGR00115 tig trigger factor. 91.7 0.31 6.7E-06 37.3 4.2 45 2-52 188-232 (408)
216 COG0457 NrfG FOG: TPR repeat [ 91.5 1.8 3.9E-05 27.9 7.5 61 68-128 183-246 (291)
217 PF14561 TPR_20: Tetratricopep 91.3 1.2 2.5E-05 26.8 5.7 40 78-117 16-55 (90)
218 PF14863 Alkyl_sulf_dimr: Alky 91.3 1.3 2.8E-05 29.0 6.3 45 85-129 71-115 (141)
219 KOG3785|consensus 91.0 1 2.3E-05 34.3 6.2 42 68-109 73-116 (557)
220 PRK14720 transcript cleavage f 90.6 0.65 1.4E-05 39.2 5.4 52 78-129 25-76 (906)
221 PRK10866 outer membrane biogen 90.2 1.7 3.7E-05 31.0 6.6 48 83-130 31-81 (243)
222 COG4976 Predicted methyltransf 90.0 0.8 1.7E-05 32.8 4.7 51 68-118 11-63 (287)
223 KOG4642|consensus 89.9 0.24 5.1E-06 35.6 2.0 40 91-130 17-56 (284)
224 PF10579 Rapsyn_N: Rapsyn N-te 89.4 3 6.4E-05 24.6 6.4 46 68-113 22-72 (80)
225 KOG4340|consensus 89.4 1 2.2E-05 33.7 4.9 46 68-113 160-207 (459)
226 COG2912 Uncharacterized conser 89.3 1.1 2.3E-05 32.6 5.0 50 80-129 177-226 (269)
227 PF02259 FAT: FAT domain; Int 88.8 1.8 3.8E-05 31.8 6.1 49 81-129 249-303 (352)
228 PF12968 DUF3856: Domain of Un 88.7 3 6.4E-05 26.9 6.1 45 68-112 78-128 (144)
229 cd02682 MIT_AAA_Arch MIT: doma 88.3 3.5 7.6E-05 24.0 6.3 40 83-122 5-51 (75)
230 PF09986 DUF2225: Uncharacteri 87.7 5 0.00011 28.1 7.4 59 68-126 141-208 (214)
231 PF00244 14-3-3: 14-3-3 protei 87.3 4.3 9.3E-05 28.9 7.0 45 68-112 149-197 (236)
232 PRK01490 tig trigger factor; P 87.0 1.1 2.4E-05 34.5 4.2 45 2-52 199-243 (435)
233 KOG2796|consensus 87.0 0.8 1.7E-05 33.5 3.1 48 82-129 250-297 (366)
234 KOG1130|consensus 86.3 1.9 4.1E-05 33.6 4.9 61 68-128 251-325 (639)
235 PF01535 PPR: PPR repeat; Int 86.3 1.5 3.2E-05 19.8 3.0 27 86-112 2-28 (31)
236 KOG3824|consensus 86.1 3.2 6.9E-05 31.2 5.9 44 85-128 117-160 (472)
237 PF13512 TPR_18: Tetratricopep 85.4 6.3 0.00014 25.9 6.5 53 68-120 63-135 (142)
238 KOG1130|consensus 85.1 1.8 3.8E-05 33.8 4.3 46 68-113 211-264 (639)
239 PF04184 ST7: ST7 protein; In 84.7 6.9 0.00015 31.1 7.4 46 84-129 259-306 (539)
240 PF12854 PPR_1: PPR repeat 84.5 2.9 6.2E-05 20.1 3.6 27 83-109 6-32 (34)
241 PLN03081 pentatricopeptide (PP 84.5 3.6 7.9E-05 33.6 6.2 60 68-128 478-538 (697)
242 KOG1174|consensus 84.3 4.2 9.2E-05 31.7 6.0 58 68-125 350-409 (564)
243 COG3629 DnrI DNA-binding trans 84.0 5.9 0.00013 29.1 6.5 52 78-129 147-198 (280)
244 smart00386 HAT HAT (Half-A-TPR 83.9 2.9 6.3E-05 18.9 4.0 29 98-126 1-29 (33)
245 COG0457 NrfG FOG: TPR repeat [ 83.5 9.4 0.0002 24.5 7.4 23 93-115 139-161 (291)
246 PF01239 PPTA: Protein prenylt 83.2 3.4 7.5E-05 19.2 3.9 26 103-128 2-27 (31)
247 PF04733 Coatomer_E: Coatomer 82.4 4.7 0.0001 29.5 5.5 51 68-118 217-270 (290)
248 KOG2376|consensus 81.9 6.9 0.00015 31.7 6.5 41 68-108 62-103 (652)
249 PF04212 MIT: MIT (microtubule 81.8 5.8 0.00013 22.2 4.7 28 85-112 6-33 (69)
250 cd02679 MIT_spastin MIT: domai 81.5 1.5 3.2E-05 25.9 2.1 26 87-112 11-36 (79)
251 KOG2610|consensus 81.4 5.1 0.00011 30.5 5.4 48 81-128 172-219 (491)
252 KOG2376|consensus 80.6 8.5 0.00018 31.2 6.6 51 68-118 191-258 (652)
253 KOG1308|consensus 80.4 1.5 3.2E-05 33.1 2.3 47 68-114 164-212 (377)
254 PF09122 DUF1930: Domain of un 79.9 1.9 4.2E-05 24.1 2.1 19 3-21 39-57 (68)
255 COG4700 Uncharacterized protei 79.1 19 0.00042 25.3 7.2 48 68-115 105-155 (251)
256 COG2956 Predicted N-acetylgluc 79.0 4.5 9.7E-05 30.5 4.4 50 68-117 196-247 (389)
257 KOG2053|consensus 78.0 9.5 0.0002 32.3 6.3 61 68-129 59-121 (932)
258 COG3071 HemY Uncharacterized e 77.0 13 0.00028 28.6 6.4 50 79-129 323-372 (400)
259 KOG0495|consensus 76.9 14 0.0003 30.7 6.8 44 85-128 652-695 (913)
260 COG4700 Uncharacterized protei 76.7 16 0.00035 25.6 6.3 60 68-130 76-136 (251)
261 TIGR00756 PPR pentatricopeptid 76.7 5.9 0.00013 18.0 3.8 28 86-113 2-29 (35)
262 KOG0546|consensus 76.3 6.1 0.00013 30.0 4.5 42 78-119 303-344 (372)
263 COG2912 Uncharacterized conser 75.3 12 0.00027 27.2 5.7 60 68-127 197-258 (269)
264 PF11817 Foie-gras_1: Foie gra 75.3 14 0.0003 26.3 6.0 45 68-112 161-206 (247)
265 cd02683 MIT_1 MIT: domain cont 74.9 11 0.00024 21.9 4.5 29 84-112 6-34 (77)
266 PF13812 PPR_3: Pentatricopept 74.9 6.9 0.00015 17.9 4.1 28 85-112 2-29 (34)
267 KOG4507|consensus 74.5 6 0.00013 32.3 4.2 35 86-120 678-712 (886)
268 cd02681 MIT_calpain7_1 MIT: do 73.9 12 0.00027 21.7 4.6 28 85-112 7-34 (76)
269 COG4105 ComL DNA uptake lipopr 73.1 22 0.00048 25.8 6.5 49 81-129 31-82 (254)
270 PF09613 HrpB1_HrpK: Bacterial 73.0 13 0.00028 25.0 5.0 38 80-117 40-77 (160)
271 KOG1941|consensus 72.6 20 0.00043 27.7 6.4 49 81-129 80-133 (518)
272 PLN03077 Protein ECB2; Provisi 72.5 20 0.00043 30.2 7.1 43 86-128 659-701 (857)
273 PF04781 DUF627: Protein of un 70.4 16 0.00034 23.0 4.6 47 68-114 60-108 (111)
274 smart00101 14_3_3 14-3-3 homol 70.0 30 0.00064 24.9 6.6 45 68-112 151-199 (244)
275 cd02680 MIT_calpain7_2 MIT: do 69.5 14 0.0003 21.5 4.1 28 86-113 8-35 (75)
276 KOG1310|consensus 68.8 10 0.00022 30.6 4.4 40 78-117 439-478 (758)
277 KOG1550|consensus 68.7 35 0.00075 27.4 7.5 56 69-126 345-404 (552)
278 cd02679 MIT_spastin MIT: domai 68.0 14 0.0003 21.7 3.9 32 98-129 3-38 (79)
279 COG5191 Uncharacterized conser 67.4 7.9 0.00017 29.2 3.3 46 78-123 135-181 (435)
280 smart00745 MIT Microtubule Int 66.9 20 0.00044 20.3 4.5 26 87-112 11-36 (77)
281 PF02184 HAT: HAT (Half-A-TPR) 66.4 14 0.0003 17.8 3.4 28 98-126 1-28 (32)
282 PF10602 RPN7: 26S proteasome 65.0 29 0.00063 23.4 5.6 47 68-114 52-103 (177)
283 PRK04841 transcriptional regul 64.9 27 0.00057 29.4 6.4 49 68-116 707-763 (903)
284 cd02656 MIT MIT: domain contai 64.2 24 0.00053 20.0 4.5 26 87-112 9-34 (75)
285 COG4649 Uncharacterized protei 62.8 49 0.0011 23.0 6.6 51 78-129 161-211 (221)
286 KOG1550|consensus 61.9 44 0.00096 26.9 6.9 44 83-128 324-370 (552)
287 KOG2709|consensus 61.9 9.3 0.0002 29.8 2.9 27 87-113 25-51 (560)
288 PLN03081 pentatricopeptide (PP 59.6 60 0.0013 26.7 7.4 44 68-112 275-318 (697)
289 PRK04841 transcriptional regul 59.2 37 0.0008 28.5 6.3 46 68-113 507-560 (903)
290 PF14561 TPR_20: Tetratricopep 58.8 20 0.00043 21.4 3.5 28 103-130 7-34 (90)
291 cd02678 MIT_VPS4 MIT: domain c 58.4 33 0.00071 19.6 4.5 26 87-112 9-34 (75)
292 KOG1915|consensus 58.3 11 0.00025 30.0 2.9 57 68-124 89-147 (677)
293 PF13041 PPR_2: PPR repeat fam 57.9 25 0.00053 17.9 6.2 32 83-114 2-33 (50)
294 PF11817 Foie-gras_1: Foie gra 57.3 53 0.0012 23.3 6.1 45 68-112 194-246 (247)
295 KOG0495|consensus 56.8 46 0.00099 27.9 6.0 54 68-121 667-722 (913)
296 PRK13184 pknD serine/threonine 55.4 30 0.00065 29.9 5.1 51 68-118 535-586 (932)
297 COG4455 ImpE Protein of avirul 54.1 31 0.00068 24.8 4.3 52 68-119 17-70 (273)
298 PF04781 DUF627: Protein of un 53.8 26 0.00057 22.0 3.5 31 90-120 2-32 (111)
299 PF10373 EST1_DNA_bind: Est1 D 53.7 76 0.0017 22.3 6.6 45 71-115 1-47 (278)
300 cd02684 MIT_2 MIT: domain cont 53.6 42 0.0009 19.3 4.3 26 87-112 9-34 (75)
301 KOG2396|consensus 53.5 81 0.0018 25.4 6.8 40 83-122 104-143 (568)
302 PF05131 Pep3_Vps18: Pep3/Vps1 53.3 7.3 0.00016 25.7 1.0 18 92-109 111-128 (147)
303 KOG1941|consensus 52.7 44 0.00094 26.0 5.1 33 82-114 204-236 (518)
304 COG0790 FOG: TPR repeat, SEL1 52.1 85 0.0018 22.4 7.1 62 68-129 93-159 (292)
305 PLN03218 maturation of RBCL 1; 51.7 1E+02 0.0022 27.1 7.8 29 83-111 683-711 (1060)
306 KOG1070|consensus 50.9 83 0.0018 28.7 7.0 56 68-129 1520-1575(1710)
307 COG2976 Uncharacterized protei 50.6 74 0.0016 22.3 5.6 41 68-108 105-150 (207)
308 PF05843 Suf: Suppressor of fo 50.0 96 0.0021 22.4 7.0 51 68-118 52-104 (280)
309 PLN03077 Protein ECB2; Provisi 49.6 54 0.0012 27.6 5.8 31 82-112 552-582 (857)
310 KOG1586|consensus 49.1 1E+02 0.0022 22.5 6.3 50 68-117 170-228 (288)
311 PF04184 ST7: ST7 protein; In 49.0 1.3E+02 0.0027 24.4 7.2 42 68-109 275-320 (539)
312 PF07980 SusD: SusD family; I 48.1 47 0.001 23.1 4.6 31 83-113 132-162 (266)
313 PF11846 DUF3366: Domain of un 47.7 49 0.0011 22.3 4.5 35 82-116 142-176 (193)
314 COG3118 Thioredoxin domain-con 47.4 40 0.00086 25.1 4.1 42 68-109 150-193 (304)
315 COG3947 Response regulator con 47.4 1.1E+02 0.0024 23.1 6.3 42 68-109 295-338 (361)
316 PF11207 DUF2989: Protein of u 47.4 36 0.00077 23.8 3.7 23 81-103 175-197 (203)
317 PF08631 SPO22: Meiosis protei 47.2 1.1E+02 0.0023 22.1 6.5 36 78-113 29-65 (278)
318 PF14689 SPOB_a: Sensor_kinase 46.8 49 0.0011 18.2 3.9 26 87-112 26-51 (62)
319 KOG0889|consensus 46.2 69 0.0015 31.8 6.1 61 68-128 2791-2856(3550)
320 PF13281 DUF4071: Domain of un 46.0 1.4E+02 0.003 23.1 7.5 46 83-128 178-227 (374)
321 KOG0529|consensus 45.9 53 0.0011 25.6 4.7 47 78-124 103-151 (421)
322 PF08424 NRDE-2: NRDE-2, neces 45.5 1.1E+02 0.0023 22.8 6.2 47 68-114 125-184 (321)
323 KOG1915|consensus 45.5 1.3E+02 0.0028 24.4 6.8 50 68-117 453-504 (677)
324 COG3629 DnrI DNA-binding trans 45.5 78 0.0017 23.3 5.4 45 68-112 169-215 (280)
325 KOG2053|consensus 45.0 1.1E+02 0.0024 26.4 6.6 55 68-122 206-264 (932)
326 cd02677 MIT_SNX15 MIT: domain 44.0 63 0.0014 18.6 4.2 24 89-112 11-34 (75)
327 PF08631 SPO22: Meiosis protei 43.0 48 0.001 23.9 4.0 29 83-111 245-273 (278)
328 PHA02537 M terminase endonucle 42.7 52 0.0011 23.5 4.0 43 80-122 165-217 (230)
329 PF08311 Mad3_BUB1_I: Mad3/BUB 42.6 87 0.0019 19.8 5.5 34 78-111 93-126 (126)
330 cd00280 TRFH Telomeric Repeat 42.5 1.2E+02 0.0025 21.2 7.7 52 68-120 85-146 (200)
331 PF09986 DUF2225: Uncharacteri 42.2 74 0.0016 22.2 4.7 47 68-114 100-155 (214)
332 TIGR02561 HrpB1_HrpK type III 41.1 85 0.0018 21.0 4.6 34 81-114 41-74 (153)
333 PLN03218 maturation of RBCL 1; 41.0 2.2E+02 0.0048 25.2 8.1 28 86-113 581-608 (1060)
334 COG0425 SirA Predicted redox p 40.7 25 0.00054 20.5 1.9 19 3-21 24-42 (78)
335 cd01736 LSm14_N LSm14 (also kn 40.7 43 0.00094 19.4 2.8 33 11-43 28-62 (74)
336 COG5191 Uncharacterized conser 40.6 30 0.00065 26.2 2.6 41 81-121 104-144 (435)
337 KOG1585|consensus 39.8 72 0.0016 23.4 4.3 41 87-128 34-74 (308)
338 PF09311 Rab5-bind: Rabaptin-l 39.5 84 0.0018 21.3 4.6 33 80-112 136-168 (181)
339 PF15297 CKAP2_C: Cytoskeleton 38.8 1.8E+02 0.0038 22.3 6.9 54 63-116 101-172 (353)
340 PF12753 Nro1: Nuclear pore co 38.1 49 0.0011 25.6 3.5 32 68-99 334-365 (404)
341 cd03420 SirA_RHOD_Pry_redox Si 37.6 31 0.00067 19.3 1.9 19 3-21 18-36 (69)
342 cd03422 YedF YedF is a bacteri 37.3 31 0.00068 19.3 1.9 19 3-21 18-36 (69)
343 KOG2471|consensus 36.6 1.3E+02 0.0027 24.5 5.5 51 78-129 503-553 (696)
344 smart00668 CTLH C-terminal to 36.2 40 0.00086 17.6 2.2 20 92-111 9-28 (58)
345 PF01157 Ribosomal_L21e: Ribos 35.3 38 0.00082 20.8 2.1 25 2-26 27-51 (99)
346 KOG1811|consensus 35.2 88 0.0019 26.1 4.6 46 68-115 572-618 (1141)
347 COG4941 Predicted RNA polymera 35.2 1.4E+02 0.003 23.0 5.3 44 83-126 364-407 (415)
348 PF10602 RPN7: 26S proteasome 35.1 1.4E+02 0.0031 20.1 8.2 47 68-114 19-66 (177)
349 PF04910 Tcf25: Transcriptiona 34.9 1.6E+02 0.0035 22.4 5.8 49 64-112 13-68 (360)
350 KOG2581|consensus 34.6 1.7E+02 0.0036 23.2 5.8 50 68-117 225-280 (493)
351 COG2139 RPL21A Ribosomal prote 34.2 44 0.00094 20.5 2.2 25 2-26 27-51 (98)
352 cd09243 BRO1_Brox_like Protein 34.1 1.8E+02 0.0039 22.2 6.0 45 68-112 217-276 (353)
353 PF06957 COPI_C: Coatomer (COP 34.1 1.3E+02 0.0028 23.6 5.3 43 78-120 293-336 (422)
354 KOG4151|consensus 33.7 56 0.0012 27.5 3.4 44 78-121 121-164 (748)
355 COG3071 HemY Uncharacterized e 31.9 2.5E+02 0.0054 21.9 7.3 47 83-129 152-198 (400)
356 PF12309 KBP_C: KIF-1 binding 31.9 93 0.002 23.8 4.2 40 80-119 296-344 (371)
357 PF05688 DUF824: Salmonella re 31.8 57 0.0012 17.1 2.2 21 5-25 6-26 (47)
358 COG2976 Uncharacterized protei 31.5 1.9E+02 0.004 20.4 5.3 50 68-117 142-192 (207)
359 KOG2396|consensus 31.3 2E+02 0.0042 23.4 5.8 45 78-122 133-178 (568)
360 COG0790 FOG: TPR repeat, SEL1 30.8 2E+02 0.0043 20.5 7.6 56 69-126 172-231 (292)
361 KOG0985|consensus 30.1 1.4E+02 0.003 26.8 5.1 29 82-110 1102-1130(1666)
362 KOG3677|consensus 29.8 47 0.001 26.1 2.3 26 88-113 276-301 (525)
363 PF05053 Menin: Menin; InterP 29.7 1.7E+02 0.0037 24.1 5.3 43 68-110 302-344 (618)
364 PRK15326 type III secretion sy 29.6 58 0.0013 19.2 2.2 25 98-122 21-45 (80)
365 KOG4507|consensus 29.1 1.3E+02 0.0028 25.1 4.6 63 68-130 623-688 (886)
366 KOG0890|consensus 28.3 86 0.0019 30.0 3.9 35 90-124 1455-1489(2382)
367 PF15469 Sec5: Exocyst complex 28.0 68 0.0015 21.5 2.6 21 95-115 97-117 (182)
368 KOG2460|consensus 28.0 2.2E+02 0.0047 23.2 5.6 25 89-113 427-451 (593)
369 KOG0686|consensus 27.9 3.1E+02 0.0067 21.7 6.5 46 68-113 166-216 (466)
370 KOG4563|consensus 27.7 1.3E+02 0.0029 23.2 4.3 40 91-130 48-95 (400)
371 cd03423 SirA SirA (also known 27.6 58 0.0012 18.1 1.9 19 3-21 18-36 (69)
372 PF10300 DUF3808: Protein of u 27.2 1.9E+02 0.0042 22.8 5.3 44 71-114 252-297 (468)
373 KOG0292|consensus 26.8 4.5E+02 0.0097 23.3 7.4 30 92-121 1092-1121(1202)
374 PF04910 Tcf25: Transcriptiona 26.7 1.3E+02 0.0029 22.8 4.2 34 86-119 105-139 (360)
375 PF04505 Dispanin: Interferon- 26.3 88 0.0019 18.3 2.6 30 84-113 37-66 (82)
376 smart00671 SEL1 Sel1-like repe 26.0 79 0.0017 14.3 3.5 28 86-113 3-34 (36)
377 COG3898 Uncharacterized membra 25.9 3.4E+02 0.0074 21.6 7.2 50 68-118 204-263 (531)
378 cd09240 BRO1_Alix Protein-inte 25.3 2.8E+02 0.0061 20.9 5.7 29 84-112 255-283 (346)
379 KOG2300|consensus 24.9 3.9E+02 0.0083 21.9 6.5 33 81-113 320-352 (629)
380 PF10952 DUF2753: Protein of u 24.6 2.1E+02 0.0045 18.7 6.0 50 68-117 24-87 (140)
381 KOG2047|consensus 24.5 1.2E+02 0.0027 25.4 3.8 32 82-113 246-277 (835)
382 PRK11018 hypothetical protein; 24.4 68 0.0015 18.5 1.9 19 3-21 27-45 (78)
383 COG3914 Spy Predicted O-linked 24.3 1.9E+02 0.0042 23.8 4.8 47 78-124 95-142 (620)
384 PF10938 YfdX: YfdX protein; 24.1 2.2E+02 0.0048 18.8 5.6 45 68-112 91-145 (155)
385 KOG4481|consensus 23.9 2.5E+02 0.0054 19.3 5.5 20 17-38 97-116 (194)
386 KOG2124|consensus 23.6 4.1E+02 0.0089 23.1 6.6 35 80-114 383-417 (883)
387 PF01206 TusA: Sulfurtransfera 23.0 68 0.0015 17.6 1.7 20 3-22 19-38 (70)
388 cd08977 SusD starch binding ou 22.8 1.7E+02 0.0037 21.7 4.2 32 82-113 174-210 (359)
389 KOG2041|consensus 22.7 2.9E+02 0.0064 23.7 5.6 44 80-124 792-835 (1189)
390 cd09241 BRO1_ScRim20-like Prot 22.5 3.5E+02 0.0075 20.5 5.8 45 68-112 207-265 (355)
391 COG0544 Tig FKBP-type peptidyl 22.1 4E+02 0.0087 21.0 6.7 42 2-51 199-242 (441)
392 KOG0396|consensus 22.1 3.8E+02 0.0083 20.8 6.8 46 84-129 191-236 (389)
393 PF07079 DUF1347: Protein of u 21.8 2.7E+02 0.0058 22.5 5.0 35 94-129 472-506 (549)
394 PRK04306 50S ribosomal protein 21.6 72 0.0016 19.6 1.6 23 3-25 30-52 (98)
395 PF02064 MAS20: MAS20 protein 21.3 2.1E+02 0.0046 18.2 3.8 28 89-116 68-95 (121)
396 cd04750 Commd2 COMM_Domain con 21.2 2.3E+02 0.0049 19.0 4.1 37 87-123 2-38 (166)
397 PF08324 PUL: PUL domain; Int 21.0 3.1E+02 0.0067 19.4 5.1 48 81-128 175-226 (268)
398 PRK00299 sulfur transfer prote 20.8 90 0.0019 18.1 1.9 18 3-20 28-45 (81)
399 PF06812 ImpA-rel_N: ImpA-rela 20.8 1.6E+02 0.0035 16.0 4.0 30 99-128 1-30 (62)
400 KOG3783|consensus 20.7 3.7E+02 0.008 21.9 5.6 47 78-125 297-343 (546)
401 COG1516 FliS Flagellin-specifi 20.6 2.6E+02 0.0056 18.2 4.8 24 89-112 36-59 (132)
402 KOG3807|consensus 20.3 3.7E+02 0.0081 20.9 5.4 29 87-115 278-306 (556)
403 TIGR02863 spore_sspJ small, ac 20.3 1E+02 0.0022 15.8 1.7 18 99-116 19-36 (47)
404 PF14858 DUF4486: Domain of un 20.2 2.2E+02 0.0047 23.2 4.3 35 78-112 191-225 (542)
405 cd00291 SirA_YedF_YeeD SirA, Y 20.0 1E+02 0.0022 16.8 1.9 19 3-21 18-36 (69)
No 1
>KOG0543|consensus
Probab=100.00 E-value=2.7e-36 Score=220.91 Aligned_cols=130 Identities=31% Similarity=0.462 Sum_probs=117.5
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeecccc-CCccCcCcchhhhHHHH------------
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSVL-PDFDLAELPVEKRLDFG------------ 67 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~-~~~~~~~~~~~e~~~~~------------ 67 (131)
||.||++|++||++.|+|+|+|+||+.|..++.|||++++.|+|+|++|. +....|.+...+++..+
T Consensus 141 le~al~~M~~GE~a~v~i~~~YayG~~~~~~p~IPPnA~l~yEVeL~~f~~~~~~s~~~~~~e~l~~A~~~ke~Gn~~fK 220 (397)
T KOG0543|consen 141 LEIALRMMKVGEVALVTIDPKYAYGEEGGEPPLIPPNATLLYEVELLDFELKEDESWKMFAEERLEAADRKKERGNVLFK 220 (397)
T ss_pred HHHHHHhcCccceEEEEeCcccccCCCCCCCCCCCCCceEEEEEEEEeeecCcccccccchHHHHHHHHHHHHhhhHHHh
Confidence 58899999999999999999999997777789999999999999999999 77789999888777655
Q ss_pred ----------HHHHHHHHHH----------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHh
Q psy18096 68 ----------GILASQELLD----------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKC 127 (131)
Q Consensus 68 ----------y~~Ai~~~~~----------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a 127 (131)
|++|+++++. ........+++|+|+||+|+++|.+|+..|++||++||+|+|||||||+|
T Consensus 221 ~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A 300 (397)
T KOG0543|consen 221 EGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQA 300 (397)
T ss_pred hchHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHhhHHHHHHHhhhhHHHHHHHHHHHHhcCCCchhHHHHHHHH
Confidence 6666666664 45677889999999999999999999999999999999999999999999
Q ss_pred hhc
Q psy18096 128 LIL 130 (131)
Q Consensus 128 ~~~ 130 (131)
++.
T Consensus 301 ~l~ 303 (397)
T KOG0543|consen 301 LLA 303 (397)
T ss_pred HHh
Confidence 975
No 2
>KOG0545|consensus
Probab=99.81 E-value=2e-20 Score=130.51 Aligned_cols=129 Identities=20% Similarity=0.159 Sum_probs=101.2
Q ss_pred cchhcccCCCcEEEEEEcCC----CC------------------------------CCCCCCCC--CCCCCCceEEEEEe
Q psy18096 2 DYVLPLMETGEECQIEITAR----FG------------------------------YGDKGEPS--KSIPPGAKLYYSLT 45 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i~~~----~a------------------------------yG~~G~~~--~~ip~~~~~~~~v~ 45 (131)
|.+|.+|.++|.++|+++.. |- |...|..+ ......+++.|.++
T Consensus 73 E~il~tM~v~EvaqF~~d~~~~vqYPfvsksLRdia~GK~p~e~~~H~Cg~a~m~~~~glGyedLDeL~knPqpL~Fvie 152 (329)
T KOG0545|consen 73 EIILTTMRVHEVAQFWCDTIHTVQYPFVSKSLRDIAQGKDPTEWHRHCCGLANMFAYHGLGYEDLDELQKNPQPLVFVIE 152 (329)
T ss_pred HHHHHHHhhhhHHHhhhhhhheeechhHHHHHHHHhcCCCcchhhhhhhhhHHHHHhcCCChhhHHHHhhCCCceEeehh
Confidence 57899999999999999853 21 22222111 11245578999999
Q ss_pred eccccCCc----cCcCcchhhhHHHH---------------HHHHHHHH-------HH-------------HhhhhhHHH
Q psy18096 46 LHSVLPDF----DLAELPVEKRLDFG---------------GILASQEL-------LD-------------SLIKERINC 86 (131)
Q Consensus 46 l~~~~~~~----~~~~~~~~e~~~~~---------------y~~Ai~~~-------~~-------------~~~~~~~~~ 86 (131)
|..+.... +.|.|+.++|+... |++|+..| +. +++....++
T Consensus 153 llqVe~P~qYq~e~WqlsddeKmkav~~l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpL 232 (329)
T KOG0545|consen 153 LLQVEAPSQYQRETWQLSDDEKMKAVPVLHQEGNRLFKLGRYKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPL 232 (329)
T ss_pred hhhccCchhhccccccCCchHhhhhhHHHHHhhhhhhhhccHHHHHHHHHHHHHHHHHHHhccCCCChHHHHHHHhhhHH
Confidence 99987543 68999999886432 44444444 44 788899999
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+.|.|+|+++.++|.++++.|+.+|..+|.|+|||||||+|+++
T Consensus 233 llNy~QC~L~~~e~yevleh~seiL~~~~~nvKA~frRakAhaa 276 (329)
T KOG0545|consen 233 LLNYCQCLLKKEEYYEVLEHCSEILRHHPGNVKAYFRRAKAHAA 276 (329)
T ss_pred HHhHHHHHhhHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHh
Confidence 99999999999999999999999999999999999999999874
No 3
>KOG0553|consensus
Probab=99.59 E-value=4e-15 Score=106.32 Aligned_cols=63 Identities=29% Similarity=0.268 Sum_probs=61.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+||..|+. +++|.++.+|+|||++|++||+|+.|++||..+|.+||.+.|+|-|.|.|+..
T Consensus 97 Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~ 161 (304)
T KOG0553|consen 97 YQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLA 161 (304)
T ss_pred HHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcchHHHHHHHHHHHhcChHHHHHHHHHHHHHHc
Confidence 9999999999 99999999999999999999999999999999999999999999999999864
No 4
>KOG0547|consensus
Probab=99.57 E-value=5.2e-15 Score=111.56 Aligned_cols=63 Identities=25% Similarity=0.242 Sum_probs=60.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+||++|+. ++.|..+..|.|||+||..+|+|++.+++|+++|+++|+++|||+||+.|+-.
T Consensus 131 Y~eAIkyY~~AI~l~p~epiFYsNraAcY~~lgd~~~Vied~TkALEl~P~Y~KAl~RRA~A~E~ 195 (606)
T KOG0547|consen 131 YDEAIKYYTQAIELCPDEPIFYSNRAACYESLGDWEKVIEDCTKALELNPDYVKALLRRASAHEQ 195 (606)
T ss_pred HHHHHHHHHHHHhcCCCCchhhhhHHHHHHHHhhHHHHHHHHHHHhhcCcHHHHHHHHHHHHHHh
Confidence 9999999999 88888899999999999999999999999999999999999999999999743
No 5
>KOG0544|consensus
Probab=99.46 E-value=9.5e-14 Score=83.14 Aligned_cols=47 Identities=32% Similarity=0.642 Sum_probs=44.4
Q ss_pred cchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccc
Q psy18096 2 DYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSV 49 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~ 49 (131)
|.++.+|++||++.++|.|.||||..| ++..|||++++.|++||.++
T Consensus 61 degv~qmsvGekakLti~pd~aYG~~G-~p~~IppNatL~FdVEll~v 107 (108)
T KOG0544|consen 61 DEGVAQMSVGEKAKLTISPDYAYGPRG-HPGGIPPNATLVFDVELLKV 107 (108)
T ss_pred hhcchhccccccceeeeccccccCCCC-CCCccCCCcEEEEEEEEEec
Confidence 688999999999999999999999999 56999999999999999876
No 6
>KOG0548|consensus
Probab=99.45 E-value=2.6e-13 Score=103.06 Aligned_cols=63 Identities=32% Similarity=0.325 Sum_probs=60.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|..||+.|++ ..+|....+|+|||+||++++++..|++||+.+|++||++.|+|+|+|.|+..
T Consensus 374 y~~Av~~YteAIkr~P~Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~~~kgy~RKg~al~~ 438 (539)
T KOG0548|consen 374 YPEAVKHYTEAIKRDPEDARLYSNRAACYLKLGEYPEALKDAKKCIELDPNFIKAYLRKGAALRA 438 (539)
T ss_pred HHHHHHHHHHHHhcCCchhHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCchHHHHHHHHHHHHHH
Confidence 9999999999 77899999999999999999999999999999999999999999999999864
No 7
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=99.39 E-value=2.2e-12 Score=102.07 Aligned_cols=97 Identities=25% Similarity=0.233 Sum_probs=78.7
Q ss_pred CCCCCCceEEEEEeeccccCCccCcCcchhhhHHH-------H--------HHHHHHHHHH--HhhhhhHHHHHHHHHHH
Q psy18096 32 KSIPPGAKLYYSLTLHSVLPDFDLAELPVEKRLDF-------G--------GILASQELLD--SLIKERINCYNNLAQAQ 94 (131)
Q Consensus 32 ~~ip~~~~~~~~v~l~~~~~~~~~~~~~~~e~~~~-------~--------y~~Ai~~~~~--~~~~~~~~~~~N~a~~~ 94 (131)
..+|++..+....++..++. ...|.++.+++... | |.+||..|++ ++.|. ..+|.|+|.||
T Consensus 93 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~k~~G~~~~~~~~~~~Ai~~y~~al~~~p~-~~~~~n~a~~~ 170 (615)
T TIGR00990 93 STAPKNAPVEPADELPEIDE-SSVANLSEEERKKYAAKLKEKGNKAYRNKDFNKAIKLYSKAIECKPD-PVYYSNRAACH 170 (615)
T ss_pred CCCCCCCCCCccccccccch-hhcccCCHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCc-hHHHHHHHHHH
Confidence 34566666666666665543 44678887765543 2 9999999999 66664 67899999999
Q ss_pred HHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 95 IKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 95 ~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+++|+|++|+.+|+++|+++|++.++|+|+|.++..
T Consensus 171 ~~l~~~~~Ai~~~~~al~l~p~~~~a~~~~a~a~~~ 206 (615)
T TIGR00990 171 NALGDWEKVVEDTTAALELDPDYSKALNRRANAYDG 206 (615)
T ss_pred HHhCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHH
Confidence 999999999999999999999999999999999864
No 8
>KOG4234|consensus
Probab=99.38 E-value=8.6e-13 Score=90.22 Aligned_cols=63 Identities=30% Similarity=0.322 Sum_probs=54.7
Q ss_pred HHHHHHHHHH-------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD-------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~-------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|..-|.. -.......+|.|||+|.+|++.|+.||.+|.++|+++|.+.||+-||+.+|-.
T Consensus 111 yeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAeayek 180 (271)
T KOG4234|consen 111 YEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEKALERRAEAYEK 180 (271)
T ss_pred HHHHHHHHHHHHHhCccccHHHHHHHHhhhHHHHHHhhhHHHHHHHHHhhHhcCchhHHHHHHHHHHHHh
Confidence 6666665555 33467889999999999999999999999999999999999999999999853
No 9
>KOG0549|consensus
Probab=99.33 E-value=1.6e-12 Score=86.90 Aligned_cols=52 Identities=33% Similarity=0.510 Sum_probs=48.0
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccccCCc
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSVLPDF 53 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~~~~ 53 (131)
+|++|..|++||+..++|+|+++||++|.|+. ||+++.+.|+++|..+.+.+
T Consensus 128 ~Dqgl~gMCvGEkRkl~IPp~LgYG~~G~~~~-IP~~A~LiFdiELv~i~~~~ 179 (188)
T KOG0549|consen 128 WDQGLLGMCVGEKRKLIIPPHLGYGERGAPPK-IPGDAVLIFDIELVKIERGP 179 (188)
T ss_pred HhHHhhhhCcccceEEecCccccCccCCCCCC-CCCCeeEEEEEEEEEeecCC
Confidence 58999999999999999999999999998644 99999999999999998754
No 10
>KOG0551|consensus
Probab=99.27 E-value=7.5e-12 Score=90.91 Aligned_cols=63 Identities=33% Similarity=0.388 Sum_probs=58.3
Q ss_pred HHHHHHHHHHHh------hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLDSL------IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~~~------~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|+.|+..|++.+ ...++.+|+|||+|++-+|+|-.||.||.+|+.++|.+.||+||-++|++.
T Consensus 97 yk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka~~R~Akc~~e 165 (390)
T KOG0551|consen 97 YKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKAYIRGAKCLLE 165 (390)
T ss_pred HHHHHHHHHHHHhhcCCCccHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhcCcchhhhhhhhhHHHHH
Confidence 999999999922 357889999999999999999999999999999999999999999999874
No 11
>KOG4648|consensus
Probab=99.16 E-value=2.7e-11 Score=88.89 Aligned_cols=63 Identities=21% Similarity=0.035 Sum_probs=60.4
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+||++|++ .+.|.++.++.|||.+|+|++.|..|..||+.|+.+|..++|||-|||.|...
T Consensus 113 y~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~KAYSRR~~AR~~ 177 (536)
T KOG4648|consen 113 YEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVKAYSRRMQARES 177 (536)
T ss_pred hhHHHHHhhhhhccCCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHHHHHHHHHHHH
Confidence 9999999999 78899999999999999999999999999999999999999999999998753
No 12
>COG0545 FkpA FKBP-type peptidyl-prolyl cis-trans isomerases 1 [Posttranslational modification, protein turnover, chaperones]
Probab=99.16 E-value=4.5e-11 Score=81.48 Aligned_cols=47 Identities=32% Similarity=0.615 Sum_probs=43.4
Q ss_pred cchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccc
Q psy18096 2 DYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSV 49 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~ 49 (131)
+.+|+.|++|++++++|+|.+|||.+|.| +.|||++++.|+|+|.++
T Consensus 158 ~egl~~M~vG~k~~l~IP~~laYG~~g~~-g~Ippns~LvFeVeLl~v 204 (205)
T COG0545 158 DEGLQGMKVGGKRKLTIPPELAYGERGVP-GVIPPNSTLVFEVELLDV 204 (205)
T ss_pred HHHHhhCCCCceEEEEeCchhccCcCCCC-CCCCCCCeEEEEEEEEec
Confidence 57899999999999999999999999975 459999999999999876
No 13
>KOG0548|consensus
Probab=99.11 E-value=1.3e-10 Score=88.54 Aligned_cols=64 Identities=27% Similarity=0.217 Sum_probs=61.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhcC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLILT 131 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~~ 131 (131)
|..||.+|+. .++|.+-.+|.||++||.++++|.+|+.|..+.++++|+++|+|.|+|-|++++
T Consensus 18 ~~~ai~~~t~ai~l~p~nhvlySnrsaa~a~~~~~~~al~da~k~~~l~p~w~kgy~r~Gaa~~~l 83 (539)
T KOG0548|consen 18 FETAIRLFTEAIMLSPTNHVLYSNRSAAYASLGSYEKALKDATKTRRLNPDWAKGYSRKGAALFGL 83 (539)
T ss_pred HHHHHHHHHHHHccCCCccchhcchHHHHHHHhhHHHHHHHHHHHHhcCCchhhHHHHhHHHHHhc
Confidence 9999999999 889999999999999999999999999999999999999999999999998763
No 14
>KOG0550|consensus
Probab=99.10 E-value=6.2e-11 Score=88.27 Aligned_cols=63 Identities=30% Similarity=0.417 Sum_probs=57.8
Q ss_pred HHHHHHHHHH--Hh----hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SL----IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~----~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|+.|-++|+. .+ ...++.+|.|||.+..++|+..+||.||+.|+.+||..+|||.|||+|++.
T Consensus 265 y~~A~E~Yteal~idP~n~~~naklY~nra~v~~rLgrl~eaisdc~~Al~iD~syikall~ra~c~l~ 333 (486)
T KOG0550|consen 265 YRKAYECYTEALNIDPSNKKTNAKLYGNRALVNIRLGRLREAISDCNEALKIDSSYIKALLRRANCHLA 333 (486)
T ss_pred hhHHHHHHHHhhcCCccccchhHHHHHHhHhhhcccCCchhhhhhhhhhhhcCHHHHHHHHHHHHHHHH
Confidence 8999999998 33 356788999999999999999999999999999999999999999999874
No 15
>KOG4642|consensus
Probab=99.09 E-value=1.5e-10 Score=80.94 Aligned_cols=63 Identities=25% Similarity=0.191 Sum_probs=59.4
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|..||..|.+ -+.|..+++|.|+|.||+|+++|+.+..||.+||+++|+.+|++|-.|++++.
T Consensus 26 y~~ai~~y~raI~~nP~~~~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk~h~flg~~~l~ 90 (284)
T KOG4642|consen 26 YDDAIDCYSRAICINPTVASYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVKAHYFLGQWLLQ 90 (284)
T ss_pred hchHHHHHHHHHhcCCCcchhhhhHHHHHHHhhhhhhhhhhHHHHHhcChHHHHHHHHHHHHHHh
Confidence 8999999998 67899999999999999999999999999999999999999999999998763
No 16
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=99.09 E-value=6.4e-10 Score=63.77 Aligned_cols=48 Identities=33% Similarity=0.393 Sum_probs=42.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhC-ChHHHHHHHHHHHhhCC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLG-SLEPALMSLENVLNLDP 115 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~-~~~~a~~~~~~al~~dp 115 (131)
|.+|+..|++ +++|..+.++.|+|.||.++| +|.+|+.+++++|++||
T Consensus 19 ~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~l~P 69 (69)
T PF13414_consen 19 YEEAIEYFEKAIELDPNNAEAYYNLGLAYMKLGKDYEEAIEDFEKALKLDP 69 (69)
T ss_dssp HHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhCccHHHHHHHHHHHHHcCc
Confidence 8888888888 778888888889999999888 68899999999988887
No 17
>KOG0376|consensus
Probab=99.08 E-value=2.9e-11 Score=91.26 Aligned_cols=63 Identities=25% Similarity=0.245 Sum_probs=60.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|..||..|++ +++|..+.++.|||++++|.++|..|+.|+.+||++||.+.|+|+|||.|.+.
T Consensus 20 fd~avdlysKaI~ldpnca~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m~ 84 (476)
T KOG0376|consen 20 FDVAVDLYSKAIELDPNCAIYFANRALAHLKVESFGGALHDALKAIELDPTYIKAYVRRGTAVMA 84 (476)
T ss_pred HHHHHHHHHHHHhcCCcceeeechhhhhheeechhhhHHHHHHhhhhcCchhhheeeeccHHHHh
Confidence 9999999999 89999999999999999999999999999999999999999999999998764
No 18
>PF00254 FKBP_C: FKBP-type peptidyl-prolyl cis-trans isomerase; InterPro: IPR001179 Synonym(s): Peptidylprolyl cis-trans isomerase FKBP-type peptidylprolyl isomerases (5.2.1.8 from EC) in vertebrates, are receptors for the two immunosuppressants, FK506 and rapamycin. The drugs inhibit T cell proliferation by arresting two distinct cytoplasmic signal transmission pathways. Peptidylprolyl isomerases accelerate protein folding by catalysing the cis-trans isomerisation of proline imidic peptide bonds in oligopeptides. These proteins are found in a variety of organisms.; GO: 0006457 protein folding; PDB: 1IX5_A 3JXV_A 3JYM_A 1T11_A 1PBK_A 1FD9_A 2VCD_A 3B7X_A 1Q6H_B 1Q6I_B ....
Probab=99.05 E-value=2.2e-10 Score=69.85 Aligned_cols=47 Identities=28% Similarity=0.556 Sum_probs=43.2
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeec
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLH 47 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~ 47 (131)
+|.+|..|++||++.|.++|..+||+.|.....||+++++.|+|+|.
T Consensus 48 ~e~al~~m~~Ge~~~~~vp~~~ayg~~~~~~~~ip~~~~l~f~Iell 94 (94)
T PF00254_consen 48 LEEALIGMKVGEKREFYVPPELAYGEKGLEPPKIPPNSTLVFEIELL 94 (94)
T ss_dssp HHHHHTTSBTTEEEEEEEEGGGTTTTTTBCTTTBTTTSEEEEEEEEE
T ss_pred hhhhcccccCCCEeeeEeCChhhcCccccCCCCcCCCCeEEEEEEEC
Confidence 47899999999999999999999999997556799999999999984
No 19
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=99.03 E-value=1.2e-09 Score=81.65 Aligned_cols=63 Identities=30% Similarity=0.353 Sum_probs=59.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|+.+|++ ++.|....+|.|+|.||+++|++++|+.+++++|.++|++..+|+++|.++..
T Consensus 18 ~~~Ai~~~~~Al~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a~~~lg~~~~~ 82 (356)
T PLN03088 18 FALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLRKGTACMK 82 (356)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHHHHHHHHHHHH
Confidence 9999999999 78899999999999999999999999999999999999999999999998864
No 20
>TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype.
Probab=99.01 E-value=2.9e-10 Score=77.12 Aligned_cols=49 Identities=24% Similarity=0.386 Sum_probs=45.2
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeecccc
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSVL 50 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~ 50 (131)
++.+|..|++||+++|+++|..|||..|. ...|||++++.|+++|.++.
T Consensus 128 l~e~L~~Mk~Ge~~~~~iP~~~AYG~~g~-~~~Ippns~L~f~IeL~~i~ 176 (177)
T TIGR03516 128 LRDGLKLMKEGETATFLFPSHKAYGYYGD-QNKIGPNLPIISTVTLLNIK 176 (177)
T ss_pred HHHHHcCCCCCCEEEEEECHHHcCCCCCC-CCCcCcCCcEEEEEEEEEec
Confidence 47899999999999999999999999995 47899999999999999874
No 21
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=99.00 E-value=2.3e-09 Score=71.02 Aligned_cols=63 Identities=10% Similarity=-0.005 Sum_probs=59.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|.+.|.- .++|.+...|.|+|.|+..+|+|.+||..+.+++.++|+|+.++|+.|.+++.
T Consensus 51 l~~A~~~f~~L~~~Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~ddp~~~~~ag~c~L~ 115 (157)
T PRK15363 51 FAGAARLFQLLTIYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAPWAAAECYLA 115 (157)
T ss_pred HHHHHHHHHHHHHhCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCchHHHHHHHHHHH
Confidence 7888888877 88999999999999999999999999999999999999999999999999875
No 22
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=98.99 E-value=2.4e-09 Score=70.35 Aligned_cols=61 Identities=7% Similarity=-0.097 Sum_probs=31.4
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|+..|.. .++|.....+.|+|.|+..+|++++|+..++++|.++|+++.++..+|.+.
T Consensus 74 ~~~A~~~y~~Al~l~p~~~~a~~~lg~~l~~~g~~~eAi~~~~~Al~~~p~~~~~~~~~~~~~ 136 (144)
T PRK15359 74 YTTAINFYGHALMLDASHPEPVYQTGVCLKMMGEPGLAREAFQTAIKMSYADASWSEIRQNAQ 136 (144)
T ss_pred HHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHH
Confidence 4444444444 444555555555555555555555555555555555555555555555443
No 23
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=98.97 E-value=1.9e-09 Score=61.77 Aligned_cols=48 Identities=27% Similarity=0.398 Sum_probs=45.3
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
++..|.++|.++..+|+|++|+..++++|++||+++.+|+++|.++..
T Consensus 2 ~a~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~ 49 (69)
T PF13414_consen 2 NAEAWYNLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGLAYMK 49 (69)
T ss_dssp SHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHH
Confidence 467899999999999999999999999999999999999999999864
No 24
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=98.92 E-value=5.7e-09 Score=68.56 Aligned_cols=63 Identities=16% Similarity=0.054 Sum_probs=60.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|+..|.. .++|....++.++|.++.++|+|++|+..+++++.++|+++.+++++|.++..
T Consensus 40 ~~~A~~~~~~al~~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l~p~~~~a~~~lg~~l~~ 104 (144)
T PRK15359 40 YSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYTTAINFYGHALMLDASHPEPVYQTGVCLKM 104 (144)
T ss_pred HHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHH
Confidence 9999999999 88999999999999999999999999999999999999999999999999864
No 25
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=98.91 E-value=3.1e-09 Score=52.69 Aligned_cols=34 Identities=41% Similarity=0.640 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
+.+|+|+|.+|..+|+|++|+.+|++||++||+|
T Consensus 1 a~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 34 (34)
T PF00515_consen 1 AEAYYNLGNAYFQLGDYEEALEYYQRALELDPDN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTH
T ss_pred CHHHHHHHHHHHHhCCchHHHHHHHHHHHHCcCC
Confidence 3679999999999999999999999999999985
No 26
>KOG0552|consensus
Probab=98.89 E-value=2e-09 Score=74.94 Aligned_cols=47 Identities=34% Similarity=0.588 Sum_probs=44.1
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccc
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSV 49 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~ 49 (131)
+|.+|.+|++|-+.+|+|+|.+|||..|. +.|||++++.|+|+|..+
T Consensus 179 ~d~gv~GMkvGGkRrviIPp~lgYg~~g~--~~IppnstL~fdVEL~~v 225 (226)
T KOG0552|consen 179 WDVGVEGMKVGGKRRVIIPPELGYGKKGV--PEIPPNSTLVFDVELLSV 225 (226)
T ss_pred HHHhhhhhccCCeeEEEeCccccccccCc--CcCCCCCcEEEEEEEEec
Confidence 57899999999999999999999999997 599999999999999875
No 27
>PRK11570 peptidyl-prolyl cis-trans isomerase; Provisional
Probab=98.83 E-value=3.7e-09 Score=73.33 Aligned_cols=48 Identities=29% Similarity=0.551 Sum_probs=44.2
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccc
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSV 49 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~ 49 (131)
++.+|..|++|+++.|+|+|+.|||..|.+ +.|||++++.|+|+|.++
T Consensus 158 ~~eaL~~M~~G~k~~~~IP~~lAYG~~g~~-~~Ipp~s~Lif~veLl~i 205 (206)
T PRK11570 158 WIEALTLMPVGSKWELTIPHELAYGERGAG-ASIPPFSTLVFEVELLEI 205 (206)
T ss_pred HHHHHcCCCCCCEEEEEECHHHcCCCCCCC-CCcCCCCeEEEEEEEEEE
Confidence 367899999999999999999999999963 789999999999999876
No 28
>PRK11189 lipoprotein NlpI; Provisional
Probab=98.82 E-value=8.3e-09 Score=75.22 Aligned_cols=90 Identities=22% Similarity=0.301 Sum_probs=71.3
Q ss_pred EEEEEeeccccCCccCcCcchhh----hHHHH--------HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHH
Q psy18096 40 LYYSLTLHSVLPDFDLAELPVEK----RLDFG--------GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALM 105 (131)
Q Consensus 40 ~~~~v~l~~~~~~~~~~~~~~~e----~~~~~--------y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~ 105 (131)
...++.+.++........++.+. ...+| +.+|+..|++ +++|..+.+|+|+|.++..+|+|++|+.
T Consensus 40 ~~~e~~i~~~~~~l~~~~~~~~~~a~~~~~~g~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~~~~~g~~~~A~~ 119 (296)
T PRK11189 40 LQQEVILARLNQILASRDLTDEERAQLHYERGVLYDSLGLRALARNDFSQALALRPDMADAYNYLGIYLTQAGNFDAAYE 119 (296)
T ss_pred hHHHHHHHHHHHHHccccCCcHhhHHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCCHHHHHH
Confidence 34455566665554433443333 33444 7889998888 7889999999999999999999999999
Q ss_pred HHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 106 SLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 106 ~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
.++++++++|++..+|+++|.++.
T Consensus 120 ~~~~Al~l~P~~~~a~~~lg~~l~ 143 (296)
T PRK11189 120 AFDSVLELDPTYNYAYLNRGIALY 143 (296)
T ss_pred HHHHHHHhCCCCHHHHHHHHHHHH
Confidence 999999999999999999999875
No 29
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=98.73 E-value=6.4e-08 Score=55.11 Aligned_cols=59 Identities=27% Similarity=0.139 Sum_probs=52.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
|.+|+..|++ ...|.+..++.++|.||++.|++++|...+.+++..+|++...+.=+++
T Consensus 7 ~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~~~l~a~ 67 (68)
T PF14559_consen 7 YDEAIELLEKALQRNPDNPEARLLLAQCYLKQGQYDEAEELLERLLKQDPDNPEYQQLLAQ 67 (68)
T ss_dssp HHHHHHHHHHHHHHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHHHHHHhc
Confidence 8899999999 8889999999999999999999999999999999999998766654443
No 30
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=98.72 E-value=6.8e-08 Score=54.65 Aligned_cols=51 Identities=22% Similarity=0.193 Sum_probs=46.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
|.+|+..|+. +..|....++..+|.|+..+|++++|+..++++++++|+|+
T Consensus 13 ~~~A~~~~~~~l~~~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~~P~~p 65 (65)
T PF13432_consen 13 YDEAIAAFEQALKQDPDNPEAWYLLGRILYQQGRYDEALAYYERALELDPDNP 65 (65)
T ss_dssp HHHHHHHHHHHHCCSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-H
T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCC
Confidence 8999999999 67799999999999999999999999999999999999985
No 31
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.72 E-value=1e-07 Score=55.05 Aligned_cols=59 Identities=27% Similarity=0.242 Sum_probs=54.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
|.+|+++++. .+.|..+.++.++|.|+.++|+|.+|+.+++++++.+|++..+..-+++
T Consensus 11 ~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~~~a~ 71 (73)
T PF13371_consen 11 YEEALEVLERALELDPDDPELWLQRARCLFQLGRYEEALEDLERALELSPDDPDARALRAM 71 (73)
T ss_pred HHHHHHHHHHHHHhCcccchhhHHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHHHHHh
Confidence 8899999999 8899999999999999999999999999999999999999888765543
No 32
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=98.71 E-value=5.4e-08 Score=51.14 Aligned_cols=43 Identities=30% Similarity=0.366 Sum_probs=40.1
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKC 127 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a 127 (131)
.++..+|.+|..+|++++|++.++++++.+|+|..+|+..|++
T Consensus 2 ~~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~La~l 44 (44)
T PF13428_consen 2 AAWLALARAYRRLGQPDEAERLLRRALALDPDDPEAWRALAQL 44 (44)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHHhhhC
Confidence 4688999999999999999999999999999999999998863
No 33
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=98.68 E-value=1.1e-07 Score=60.99 Aligned_cols=63 Identities=19% Similarity=0.130 Sum_probs=53.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|+..++. .+.|....++.++|.++.++|++++|+..+++++..+|++...++.+|.++..
T Consensus 33 ~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~ 97 (135)
T TIGR02552 33 YDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYFHAAECLLA 97 (135)
T ss_pred HHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChHHHHHHHHHHHH
Confidence 8888888877 66788888888888888888888888888888888888888888888887753
No 34
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=98.68 E-value=7.4e-08 Score=47.44 Aligned_cols=34 Identities=41% Similarity=0.606 Sum_probs=30.9
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
+.++.++|.++.++|+|++|+.+++++++++|+|
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~al~l~p~~ 34 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKALELDPNN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTS
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHHHHHCcCC
Confidence 3578999999999999999999999999999986
No 35
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=98.63 E-value=9.1e-08 Score=56.99 Aligned_cols=62 Identities=29% Similarity=0.269 Sum_probs=54.6
Q ss_pred HHHHHHHHHH--Hhhhh--hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKE--RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~--~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|++|+.++++ +..|. ...++.++|.||.++|+|++|+..+++ +..+|.+....+-.|++++.
T Consensus 5 y~~Ai~~~~k~~~~~~~~~~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~~~~~~~~~~l~a~~~~~ 70 (84)
T PF12895_consen 5 YENAIKYYEKLLELDPTNPNSAYLYNLAQCYFQQGKYEEAIELLQK-LKLDPSNPDIHYLLARCLLK 70 (84)
T ss_dssp HHHHHHHHHHHHHHHCGTHHHHHHHHHHHHHHHTTHHHHHHHHHHC-HTHHHCHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHCCCHHHHHHHHHH-hCCCCCCHHHHHHHHHHHHH
Confidence 8899999999 44442 667888999999999999999999999 99999999999999999864
No 36
>PRK10902 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional
Probab=98.59 E-value=6.9e-08 Score=69.48 Aligned_cols=51 Identities=24% Similarity=0.412 Sum_probs=46.0
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccccCCc
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSVLPDF 53 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~~~~ 53 (131)
++.+|..|++|+++.|+|+|.++||..|. +.|||++++.|+|+|.++....
T Consensus 202 ~~EaL~~Mk~Gek~~l~IP~~laYG~~g~--~gIppns~LvfeVeLl~V~~~~ 252 (269)
T PRK10902 202 WTEGLKNIKKGGKIKLVIPPELAYGKAGV--PGIPANSTLVFDVELLDVKPAP 252 (269)
T ss_pred HHHHHhcCCCCcEEEEEECchhhCCCCCC--CCCCCCCcEEEEEEEEEeccCc
Confidence 36789999999999999999999999995 4799999999999999987544
No 37
>KOG0553|consensus
Probab=98.47 E-value=2.9e-07 Score=66.44 Aligned_cols=56 Identities=25% Similarity=0.271 Sum_probs=50.4
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQR 123 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~r 123 (131)
|..||+.+.. .++|.+.++|..++.+|+-+|+|.+|++.|.+||++||+|.-.+-.
T Consensus 131 ~~~AVkDce~Al~iDp~yskay~RLG~A~~~~gk~~~A~~aykKaLeldP~Ne~~K~n 188 (304)
T KOG0553|consen 131 YEDAVKDCESALSIDPHYSKAYGRLGLAYLALGKYEEAIEAYKKALELDPDNESYKSN 188 (304)
T ss_pred hHHHHHHHHHHHhcChHHHHHHHHHHHHHHccCcHHHHHHHHHhhhccCCCcHHHHHH
Confidence 7788888877 8899999999999999999999999999999999999999744433
No 38
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=98.46 E-value=4e-07 Score=69.23 Aligned_cols=59 Identities=17% Similarity=0.020 Sum_probs=47.7
Q ss_pred cCcchhhhHHHH--------HHHHHHHHHH--HhhhhhHH---HHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 56 AELPVEKRLDFG--------GILASQELLD--SLIKERIN---CYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 56 ~~~~~~e~~~~~--------y~~Ai~~~~~--~~~~~~~~---~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
.+.......+.| |.+|+..|++ +++|.... +|+|+|.||.++|++++|+.++.+||++.
T Consensus 71 dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrALels 142 (453)
T PLN03098 71 DVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAYREEGKKAADCLRTALRDY 142 (453)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhc
Confidence 344445555555 9999999998 78888774 49999999999999999999999999973
No 39
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=98.41 E-value=1.5e-06 Score=65.00 Aligned_cols=61 Identities=20% Similarity=0.147 Sum_probs=57.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|+..+++ +++|....+|+++|.++..+|+|++|+..++++++++|++..+....+.+.
T Consensus 52 ~~eAl~~~~~Al~l~P~~~~a~~~lg~~~~~lg~~~eA~~~~~~al~l~P~~~~~~~~l~~~~ 114 (356)
T PLN03088 52 FTEAVADANKAIELDPSLAKAYLRKGTACMKLEEYQTAKAALEKGASLAPGDSRFTKLIKECD 114 (356)
T ss_pred HHHHHHHHHHHHHhCcCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Confidence 9999999999 888999999999999999999999999999999999999999998887763
No 40
>KOG1308|consensus
Probab=98.40 E-value=4e-08 Score=71.98 Aligned_cols=63 Identities=16% Similarity=0.112 Sum_probs=59.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+..||..++. ++.|....+|.+|+.+++|++.+..||+||+.||+++|+..+.|-.||.|+.+
T Consensus 130 ~~~ai~~~t~ai~lnp~~a~l~~kr~sv~lkl~kp~~airD~d~A~ein~Dsa~~ykfrg~A~rl 194 (377)
T KOG1308|consen 130 FDTAIELFTSAIELNPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEINPDSAKGYKFRGYAERL 194 (377)
T ss_pred hhhhhcccccccccCCchhhhcccccceeeeccCCchhhhhhhhhhccCcccccccchhhHHHHH
Confidence 8888888888 88999999999999999999999999999999999999999999999998764
No 41
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=98.36 E-value=1.4e-06 Score=69.29 Aligned_cols=61 Identities=20% Similarity=0.167 Sum_probs=33.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|+..|++ +++|....++.++|.++..+|+|++|+.++++++.++|++..+++.+|.++
T Consensus 381 ~~eA~~~~~~al~~~p~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l~P~~~~~~~~la~~~ 443 (615)
T TIGR00990 381 PDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSHIQLGVTQ 443 (615)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCccCHHHHHHHHHHH
Confidence 4445555544 444555555555555555555555555555555555555555555555544
No 42
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=98.33 E-value=2.1e-06 Score=46.87 Aligned_cols=38 Identities=32% Similarity=0.499 Sum_probs=32.6
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHH
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQ 122 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~ 122 (131)
.+++.+|.++.|+|+|.+|.+.|+.+|+.+|+|..|.-
T Consensus 2 d~lY~lAig~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~~ 39 (53)
T PF14853_consen 2 DCLYYLAIGHYKLGEYEKARRYCDALLEIEPDNRQAQS 39 (53)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHHHHHTTS-HHHHH
T ss_pred hhHHHHHHHHHHhhhHHHHHHHHHHHHhhCCCcHHHHH
Confidence 36778999999999999999999999999999987754
No 43
>PLN02789 farnesyltranstransferase
Probab=98.33 E-value=2.7e-06 Score=62.85 Aligned_cols=59 Identities=17% Similarity=0.101 Sum_probs=46.5
Q ss_pred HHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 70 LASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 70 ~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
+++.++++ +++|.+..+|++|+.+..++|+|++++.+|+++|+.||.|..||++||.++
T Consensus 126 ~el~~~~kal~~dpkNy~AW~~R~w~l~~l~~~~eeL~~~~~~I~~d~~N~sAW~~R~~vl 186 (320)
T PLN02789 126 KELEFTRKILSLDAKNYHAWSHRQWVLRTLGGWEDELEYCHQLLEEDVRNNSAWNQRYFVI 186 (320)
T ss_pred HHHHHHHHHHHhCcccHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHCCCchhHHHHHHHHH
Confidence 34444434 566778888888888888888888888888888888888888888888765
No 44
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=98.33 E-value=1.3e-06 Score=49.27 Aligned_cols=43 Identities=28% Similarity=0.366 Sum_probs=40.2
Q ss_pred HHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 88 NNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 88 ~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+++|..+++.|+|++|+..++++++.+|++..+|+..|.++..
T Consensus 1 ~~~a~~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~~lg~~~~~ 43 (65)
T PF13432_consen 1 YALARALYQQGDYDEAIAAFEQALKQDPDNPEAWYLLGRILYQ 43 (65)
T ss_dssp HHHHHHHHHCTHHHHHHHHHHHHHCCSTTHHHHHHHHHHHHHH
T ss_pred ChHHHHHHHcCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH
Confidence 3689999999999999999999999999999999999999864
No 45
>KOG0543|consensus
Probab=98.33 E-value=1.4e-06 Score=65.12 Aligned_cols=51 Identities=22% Similarity=0.219 Sum_probs=49.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
|..||..+++ +++|.++++++.++.|++-+++|+.|+.++.+|++++|+|-
T Consensus 273 ~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nk 325 (397)
T KOG0543|consen 273 YKEAIESCNKVLELDPNNVKALYRRGQALLALGEYDLARDDFQKALKLEPSNK 325 (397)
T ss_pred HHHHHHHHHHHHhcCCCchhHHHHHHHHHHhhccHHHHHHHHHHHHHhCCCcH
Confidence 9999999999 89999999999999999999999999999999999999994
No 46
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=98.30 E-value=7.7e-06 Score=56.48 Aligned_cols=63 Identities=22% Similarity=0.168 Sum_probs=53.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHH-HHhCC--hHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQ-IKLGS--LEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~-~kl~~--~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|+..|++ .+.|.+..++.++|.++ ...|+ +++|+..+++++++||++..+++..|.+++.
T Consensus 89 ~~~A~~a~~~Al~l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~dP~~~~al~~LA~~~~~ 156 (198)
T PRK10370 89 YDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALMLLASDAFM 156 (198)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCCChhHHHHHHHHHHH
Confidence 8888888888 78888899999999885 67777 4899999999999999999999988887754
No 47
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=98.29 E-value=5.7e-06 Score=47.93 Aligned_cols=62 Identities=34% Similarity=0.327 Sum_probs=55.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|+..+.. ...|....++.++|.++...+++++|+..+++++...|.+..+++..|.++.
T Consensus 16 ~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (100)
T cd00189 16 YDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYY 79 (100)
T ss_pred HHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhHHHHHHHHHH
Confidence 8888988887 5566667889999999999999999999999999999999999998888765
No 48
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=98.29 E-value=7.2e-06 Score=55.13 Aligned_cols=62 Identities=21% Similarity=0.161 Sum_probs=54.2
Q ss_pred HHHHHHHHHH--Hhhhh---hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKE---RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~---~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|+.+|.+ .+.+. ...++.|+|.++.++|+|++|+..+.+++.++|++..+++..|.++.
T Consensus 51 ~~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~lg~~~~ 117 (172)
T PRK02603 51 YAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALNNIAVIYH 117 (172)
T ss_pred HHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHH
Confidence 8999999988 33322 35789999999999999999999999999999999999999988874
No 49
>PRK12370 invasion protein regulator; Provisional
Probab=98.28 E-value=3.8e-06 Score=66.17 Aligned_cols=62 Identities=11% Similarity=0.016 Sum_probs=52.0
Q ss_pred HHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 69 ILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 69 ~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
.+|+..+++ +++|....++.++|.++..+|++++|+..++++++++|++..+|+..|.++..
T Consensus 321 ~~A~~~~~~Al~ldP~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~ 384 (553)
T PRK12370 321 IKAKEHAIKATELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFM 384 (553)
T ss_pred HHHHHHHHHHHhcCCCCHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Confidence 456666666 77888888999999999999999999999999999999999999988887753
No 50
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=98.28 E-value=6e-06 Score=57.01 Aligned_cols=61 Identities=18% Similarity=0.137 Sum_probs=55.9
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
..+++..+.. ..+|.+...|.++|.+|+.+|++++|+..+++++.++|++...++..|.++
T Consensus 55 ~~~~i~~l~~~L~~~P~~~~~w~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~aL 117 (198)
T PRK10370 55 PEAQLQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVL 117 (198)
T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Confidence 4667777776 778999999999999999999999999999999999999999999999875
No 51
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=98.28 E-value=1.9e-06 Score=42.33 Aligned_cols=33 Identities=39% Similarity=0.625 Sum_probs=30.4
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
.+|.++|.+|.++|++++|+..+.++++++|+|
T Consensus 2 ~~~~~lg~~y~~~~~~~~A~~~~~~a~~~~~~n 34 (34)
T PF13181_consen 2 EAYYNLGKIYEQLGDYEEALEYFEKALELNPDN 34 (34)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 578999999999999999999999999999965
No 52
>KOG0550|consensus
Probab=98.25 E-value=3e-07 Score=68.96 Aligned_cols=63 Identities=21% Similarity=0.219 Sum_probs=60.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|..|+..|+. ++.|.++.+|.|||++++.+++|++|+-++.+.++++|...|+..|-++++..
T Consensus 65 Y~nal~~yt~Ai~~~pd~a~yy~nRAa~~m~~~~~~~a~~dar~~~r~kd~~~k~~~r~~~c~~a 129 (486)
T KOG0550|consen 65 YGNALKNYTFAIDMCPDNASYYSNRAATLMMLGRFEEALGDARQSVRLKDGFSKGQLREGQCHLA 129 (486)
T ss_pred HHHHHHHHHHHHHhCccchhhhchhHHHHHHHHhHhhcccchhhheecCCCccccccchhhhhhh
Confidence 9999999999 78899999999999999999999999999999999999999999999998864
No 53
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=98.24 E-value=9.5e-06 Score=54.25 Aligned_cols=62 Identities=21% Similarity=0.072 Sum_probs=53.7
Q ss_pred HHHHHHHHHH--Hhhhh---hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKE---RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~---~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|+..|.+ .+.+. .+.++.|+|.++.++|++++|+..+.+++.++|.+..+++..|.++.
T Consensus 51 ~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~~~~~~~~~~~la~i~~ 117 (168)
T CHL00033 51 YAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMAVICH 117 (168)
T ss_pred HHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHH
Confidence 8899998888 33322 45689999999999999999999999999999999999988887765
No 54
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=98.18 E-value=1.3e-05 Score=51.34 Aligned_cols=54 Identities=20% Similarity=0.111 Sum_probs=50.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
|.+|+.+++. ...|.....+.++|.|+..+|++++|+..++++++++|++....
T Consensus 67 ~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~ 122 (135)
T TIGR02552 67 YEEAIDAYALAAALDPDDPRPYFHAAECLLALGEPESALKALDLAIEICGENPEYS 122 (135)
T ss_pred HHHHHHHHHHHHhcCCCChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhccccchHH
Confidence 8899999988 77889999999999999999999999999999999999998743
No 55
>PRK11189 lipoprotein NlpI; Provisional
Probab=98.18 E-value=9.4e-06 Score=59.24 Aligned_cols=52 Identities=23% Similarity=0.006 Sum_probs=49.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIK 119 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~k 119 (131)
|.+|+..|++ +++|....++.|+|.++..+|++++|+.+++++++++|++..
T Consensus 114 ~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~~~eA~~~~~~al~~~P~~~~ 167 (296)
T PRK11189 114 FDAAYEAFDSVLELDPTYNYAYLNRGIALYYGGRYELAQDDLLAFYQDDPNDPY 167 (296)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHH
Confidence 9999999999 889999999999999999999999999999999999999973
No 56
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=98.15 E-value=9.7e-06 Score=47.27 Aligned_cols=47 Identities=23% Similarity=0.163 Sum_probs=38.1
Q ss_pred HHHHHHHHHH---------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD---------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~---------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
|.+|+.+|++ +-.+..+.++.|+|.||..+|++++|+..+++++++.
T Consensus 21 ~~~A~~~~~~al~~~~~~~~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~i~ 76 (78)
T PF13424_consen 21 YDEALDYYEKALDIEEQLGDDHPDTANTLNNLGECYYRLGDYEEALEYYQKALDIF 76 (78)
T ss_dssp HHHHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhh
Confidence 5566665555 2335668999999999999999999999999999863
No 57
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=98.15 E-value=1.9e-05 Score=55.28 Aligned_cols=62 Identities=19% Similarity=0.083 Sum_probs=53.9
Q ss_pred HHHHHHHHHH--Hhhhh---hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHH---HHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKE---RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIK---ALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~---~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~k---a~~rrg~a~~ 129 (131)
|.+|+..++. ...|. ...+++++|.++.++|+|++|+..++++++.+|++.. ++|.+|.++.
T Consensus 49 ~~~A~~~~~~~~~~~p~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~a~~~~g~~~~ 118 (235)
T TIGR03302 49 YTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNY 118 (235)
T ss_pred HHHHHHHHHHHHHhCCCchhHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHCcCCCchHHHHHHHHHHHH
Confidence 9999999988 44443 3468899999999999999999999999999998887 7999998875
No 58
>PLN02789 farnesyltranstransferase
Probab=98.14 E-value=1e-05 Score=59.78 Aligned_cols=61 Identities=18% Similarity=0.082 Sum_probs=42.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhC-ChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLG-SLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~-~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
+.+|+..+++ ++.|.+.++|++|+.++.+++ .+++++..++++++.+|+|..+|+.|+.++
T Consensus 53 serAL~lt~~aI~lnP~~ytaW~~R~~iL~~L~~~l~eeL~~~~~~i~~npknyqaW~~R~~~l 116 (320)
T PLN02789 53 SPRALDLTADVIRLNPGNYTVWHFRRLCLEALDADLEEELDFAEDVAEDNPKNYQIWHHRRWLA 116 (320)
T ss_pred CHHHHHHHHHHHHHCchhHHHHHHHHHHHHHcchhHHHHHHHHHHHHHHCCcchHHhHHHHHHH
Confidence 5566666666 666777777777777777776 467777777777777777777777776554
No 59
>KOG4234|consensus
Probab=98.13 E-value=1.7e-05 Score=54.86 Aligned_cols=59 Identities=22% Similarity=0.169 Sum_probs=53.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
++.||+.+++ ++.|.+..++..||.+|-++..|++|+.|+.+++++||.+.. .|+|.+.
T Consensus 150 ~e~aI~dcsKaiel~pty~kAl~RRAeayek~ek~eealeDyKki~E~dPs~~e--ar~~i~r 210 (271)
T KOG4234|consen 150 WESAIEDCSKAIELNPTYEKALERRAEAYEKMEKYEEALEDYKKILESDPSRRE--AREAIAR 210 (271)
T ss_pred HHHHHHHHHhhHhcCchhHHHHHHHHHHHHhhhhHHHHHHHHHHHHHhCcchHH--HHHHHHh
Confidence 8889998888 899999999999999999999999999999999999999874 3666554
No 60
>KOG0624|consensus
Probab=98.11 E-value=2.1e-06 Score=63.58 Aligned_cols=63 Identities=16% Similarity=0.167 Sum_probs=51.1
Q ss_pred HHHHHHHHHH------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+++..+++ +..+.....+--++.||.+-+++.+||..|.++|..||+|+.+|..|+.||++
T Consensus 285 ~t~cle~ge~vlk~ep~~~~ir~~~~r~~c~C~~~d~~~~eAiqqC~evL~~d~~dv~~l~dRAeA~l~ 353 (504)
T KOG0624|consen 285 WTECLEAGEKVLKNEPEETMIRYNGFRVLCTCYREDEQFGEAIQQCKEVLDIDPDDVQVLCDRAEAYLG 353 (504)
T ss_pred HHHHHHHHHHHHhcCCcccceeeeeeheeeecccccCCHHHHHHHHHHHHhcCchHHHHHHHHHHHHhh
Confidence 7777777776 22344455666788999999999999999999999999999999999999875
No 61
>KOG4626|consensus
Probab=98.11 E-value=9.1e-06 Score=64.21 Aligned_cols=61 Identities=25% Similarity=0.278 Sum_probs=49.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
+..|+++|.. ++.|..+.+++|+|..|-.+|++++|+.++.+||.++|.-+.||-++|..|
T Consensus 370 ~e~A~~ly~~al~v~p~~aaa~nNLa~i~kqqgnl~~Ai~~YkealrI~P~fAda~~NmGnt~ 432 (966)
T KOG4626|consen 370 IEEATRLYLKALEVFPEFAAAHNNLASIYKQQGNLDDAIMCYKEALRIKPTFADALSNMGNTY 432 (966)
T ss_pred chHHHHHHHHHHhhChhhhhhhhhHHHHHHhcccHHHHHHHHHHHHhcCchHHHHHHhcchHH
Confidence 5566666666 667888888888888888888888888888888888888888888888765
No 62
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=98.09 E-value=9.4e-07 Score=61.68 Aligned_cols=95 Identities=18% Similarity=0.206 Sum_probs=72.6
Q ss_pred CCCCceEEEEEeeccccCCccCcCcchhhhHHHHHHHHH------------HHHHH--HhhhhhHHHHHHHHHHHHHhCC
Q psy18096 34 IPPGAKLYYSLTLHSVLPDFDLAELPVEKRLDFGGILAS------------QELLD--SLIKERINCYNNLAQAQIKLGS 99 (131)
Q Consensus 34 ip~~~~~~~~v~l~~~~~~~~~~~~~~~e~~~~~y~~Ai------------~~~~~--~~~~~~~~~~~N~a~~~~kl~~ 99 (131)
+|-.+.+.-+|.+..++.......++.+++...-|++.+ -.+++ .+.|..+.+|+-++.-++.-|+
T Consensus 35 ~~~qp~lqqEV~iarlsqlL~~~~l~~eeRA~l~fERGvlYDSlGL~~LAR~DftQaLai~P~m~~vfNyLG~Yl~~a~~ 114 (297)
T COG4785 35 VPLQPTLQQEVILARMSQILASRALTDEERAQLLFERGVLYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAGN 114 (297)
T ss_pred ccCCccHHHHHHHHHHHHHHHhccCChHHHHHHHHHhcchhhhhhHHHHHhhhhhhhhhcCCCcHHHHHHHHHHHHhccc
Confidence 444445666666777766666667777776554333333 22222 5679999999999999999999
Q ss_pred hHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 100 LEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 100 ~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|+.|.+.++.++++||.+.-|..+||.++
T Consensus 115 fdaa~eaFds~~ELDp~y~Ya~lNRgi~~ 143 (297)
T COG4785 115 FDAAYEAFDSVLELDPTYNYAHLNRGIAL 143 (297)
T ss_pred chHHHHHhhhHhccCCcchHHHhccceee
Confidence 99999999999999999999999999875
No 63
>PRK15331 chaperone protein SicA; Provisional
Probab=98.09 E-value=8.3e-06 Score=54.49 Aligned_cols=63 Identities=13% Similarity=-0.072 Sum_probs=56.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|..+|.- -+++.+..++..+|.|+..+++|.+|+..+..+..++++|+...|+-|++++.
T Consensus 53 ~~eA~~~F~~L~~~d~~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~~~dp~p~f~agqC~l~ 117 (165)
T PRK15331 53 LDEAETFFRFLCIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRPVFFTGQCQLL 117 (165)
T ss_pred HHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccCCCCccchHHHHHHH
Confidence 6666666666 66788899999999999999999999999999999999999999999999875
No 64
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=98.09 E-value=2.8e-05 Score=52.94 Aligned_cols=62 Identities=29% Similarity=0.246 Sum_probs=52.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|+..+++ +..|....++.++|.++..+|++++|+..+.++++++|++..+++..|.++.
T Consensus 47 ~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~ 110 (234)
T TIGR02521 47 LEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLC 110 (234)
T ss_pred HHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHH
Confidence 8888888887 5667778888889999999999999999999999988888888888877664
No 65
>KOG4626|consensus
Probab=98.08 E-value=1.3e-05 Score=63.37 Aligned_cols=60 Identities=28% Similarity=0.304 Sum_probs=46.2
Q ss_pred HHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 70 LASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 70 ~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
-||+.|.+ +++|....+|+|+|.+.-..|+-.+|.++++++|.+.|+.+.++++.|.++.
T Consensus 304 lAI~~Ykral~~~P~F~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~hadam~NLgni~~ 365 (966)
T KOG4626|consen 304 LAIDTYKRALELQPNFPDAYNNLANALKDKGSVTEAVDCYNKALRLCPNHADAMNNLGNIYR 365 (966)
T ss_pred HHHHHHHHHHhcCCCchHHHhHHHHHHHhccchHHHHHHHHHHHHhCCccHHHHHHHHHHHH
Confidence 34444444 6678888888888888888888888888888888888888888888887764
No 66
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=98.08 E-value=3.1e-05 Score=48.01 Aligned_cols=62 Identities=19% Similarity=0.064 Sum_probs=38.0
Q ss_pred HHHHHHHHHH--Hhhhh---hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKE---RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~---~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~ 129 (131)
|.+|++.|.. ...|. ...++.++|.++.+.|+|++|+..+.+++..+|++ ..+++..|.++.
T Consensus 18 ~~~A~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~~ 87 (119)
T TIGR02795 18 YADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQ 87 (119)
T ss_pred HHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCCCcccHHHHHHHHHHH
Confidence 6666666665 22222 24566667777777777777777777777766664 456666666553
No 67
>KOG1125|consensus
Probab=98.07 E-value=4.6e-06 Score=64.73 Aligned_cols=62 Identities=13% Similarity=0.067 Sum_probs=58.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+||++|+. ..+|....+|+.++++...-++..+||..|.+||+|.|+.+++.|+.|.+++
T Consensus 446 fdraiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~m 509 (579)
T KOG1125|consen 446 FDRAVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCM 509 (579)
T ss_pred HHHHHHHHHHHHhcCCchHHHHHHhhHHhcCCcccHHHHHHHHHHHhcCCCeeeeehhhhhhhh
Confidence 9999999999 7789999999999999999999999999999999999999999999998875
No 68
>KOG4555|consensus
Probab=98.06 E-value=2.5e-05 Score=50.61 Aligned_cols=62 Identities=29% Similarity=0.203 Sum_probs=53.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC----HHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN----IKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~----~ka~~rrg~a~~ 129 (131)
...||+.|.+ .+-|..+++|+|||+++.-+|+.++|++|.++|+++...- ..++..||..|-
T Consensus 59 Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trtacqa~vQRg~lyR 126 (175)
T KOG4555|consen 59 LDGALELFGQALCLAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRTACQAFVQRGLLYR 126 (175)
T ss_pred hHHHHHHHHHHHHhcccchHhhccHHHHHHHcCChHHHHHHHHHHHHhcCccchHHHHHHHHHHHHHH
Confidence 7888888888 7789999999999999999999999999999999997544 456777776553
No 69
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=98.04 E-value=2.3e-05 Score=65.53 Aligned_cols=62 Identities=18% Similarity=0.060 Sum_probs=43.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+.+|+..|.+ +++|....+++|+|.++..+|++++|+..+.++++++|+++.+++.+|.++.
T Consensus 625 ~deA~~~l~~AL~l~Pd~~~a~~nLG~aL~~~G~~eeAi~~l~~AL~l~P~~~~a~~nLA~al~ 688 (987)
T PRK09782 625 VPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKGLPDDPALIRQLAYVNQ 688 (987)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Confidence 5666666666 6667777777777777777777777777777777777777777777776664
No 70
>PRK12370 invasion protein regulator; Provisional
Probab=98.03 E-value=2.5e-05 Score=61.60 Aligned_cols=61 Identities=23% Similarity=0.140 Sum_probs=55.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|+..|++ +++|..+.+++++|.++..+|++++|+..++++++++|.+..+++.++.++
T Consensus 354 ~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~G~~~eAi~~~~~Al~l~P~~~~~~~~~~~~~ 416 (553)
T PRK12370 354 YIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAGITKLWIT 416 (553)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCChhhHHHHHHHH
Confidence 8899999999 889999999999999999999999999999999999999987766655443
No 71
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=98.00 E-value=9.5e-05 Score=48.37 Aligned_cols=63 Identities=19% Similarity=0.113 Sum_probs=55.3
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH---HHHHHHHHhhhc
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI---KALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~---ka~~rrg~a~~~ 130 (131)
|.+|++.++. -..+....+.++++-+|.+.++|.+|+..+++-|+++|++. -|+|++|.++..
T Consensus 26 Y~~A~~~le~L~~ryP~g~ya~qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~hp~vdYa~Y~~gL~~~~ 96 (142)
T PF13512_consen 26 YEEAIKQLEALDTRYPFGEYAEQAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTHPNVDYAYYMRGLSYYE 96 (142)
T ss_pred HHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCCCccHHHHHHHHHHHH
Confidence 9999999987 34466678899999999999999999999999999999885 689999988753
No 72
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=97.98 E-value=4.2e-05 Score=50.83 Aligned_cols=62 Identities=11% Similarity=0.009 Sum_probs=51.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC--HHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN--IKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~--~ka~~rrg~a~~ 129 (131)
|.+||..|.. .+++..+..+.|.|.|++++|+.+.|+..+..||..-..+ -..+-.|+++++
T Consensus 85 ~~~AI~aY~~A~~L~~ddp~~~~~ag~c~L~lG~~~~A~~aF~~Ai~~~~~~~~~~~l~~~A~~~L 150 (157)
T PRK15363 85 WGEAIYAYGRAAQIKIDAPQAPWAAAECYLACDNVCYAIKALKAVVRICGEVSEHQILRQRAEKML 150 (157)
T ss_pred HHHHHHHHHHHHhcCCCCchHHHHHHHHHHHcCCHHHHHHHHHHHHHHhccChhHHHHHHHHHHHH
Confidence 9999999999 8889999999999999999999999999999999876222 334455565554
No 73
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=97.95 E-value=1.9e-05 Score=39.52 Aligned_cols=29 Identities=31% Similarity=0.372 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
++.|+|.+|.++|+|++|+..+.++|.+.
T Consensus 1 al~~Lg~~~~~~g~~~~Ai~~y~~aL~l~ 29 (36)
T PF13176_consen 1 ALNNLGRIYRQQGDYEKAIEYYEQALALA 29 (36)
T ss_dssp HHHHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHHHHhc
Confidence 57899999999999999999999966543
No 74
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=97.95 E-value=4.5e-05 Score=58.31 Aligned_cols=53 Identities=26% Similarity=0.282 Sum_probs=49.3
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHH---HHHHHHhhhc
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKA---LQRKAKCLIL 130 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka---~~rrg~a~~~ 130 (131)
+.+|.....++|+|.+|.++|+|++|+..|++||+++|++..+ ||++|.+|..
T Consensus 69 ~~dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~aeA~~A~yNLAcaya~ 124 (453)
T PLN03098 69 EADVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEAQAAYYNKACCHAY 124 (453)
T ss_pred cCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCchHHHHHHHHHHHHHHH
Confidence 4578999999999999999999999999999999999999965 9999999864
No 75
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=97.95 E-value=1.9e-05 Score=36.97 Aligned_cols=33 Identities=45% Similarity=0.655 Sum_probs=30.5
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
.++.++|.++..++++++|+..+.++++++|++
T Consensus 2 ~~~~~~a~~~~~~~~~~~a~~~~~~~~~~~~~~ 34 (34)
T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34 (34)
T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHHHccCCCC
Confidence 468899999999999999999999999999864
No 76
>KOG1310|consensus
Probab=97.95 E-value=2.3e-05 Score=60.88 Aligned_cols=63 Identities=27% Similarity=0.151 Sum_probs=55.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHh---CChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKL---GSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl---~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
...||..|.+ ...+...-+|.|||.+++|. |+--.|++||..||++||...|||||.+.|+..
T Consensus 390 ~~~~i~~~s~a~q~~~~~~~~l~nraa~lmkRkW~~d~~~AlrDch~Alrln~s~~kah~~la~aL~e 457 (758)
T KOG1310|consen 390 VSGAISHYSRAIQYVPDAIYLLENRAAALMKRKWRGDSYLALRDCHVALRLNPSIQKAHFRLARALNE 457 (758)
T ss_pred HHHHHHHHHHHhhhccchhHHHHhHHHHHHhhhccccHHHHHHhHHhhccCChHHHHHHHHHHHHHHH
Confidence 5677777777 44578889999999999996 578889999999999999999999999999864
No 77
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=97.92 E-value=0.00013 Score=51.98 Aligned_cols=61 Identities=16% Similarity=0.048 Sum_probs=52.0
Q ss_pred HHHHHHHHHH--Hhh---hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLI---KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~---~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~ 128 (131)
|.+|+..|+. ... +....+..++|.+|.++++|++|+..+++.++++|++ ..++|.+|.++
T Consensus 48 y~~Ai~~f~~l~~~yP~s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~~~~~a~Y~~g~~~ 116 (243)
T PRK10866 48 WKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRLNPTHPNIDYVLYMRGLTN 116 (243)
T ss_pred HHHHHHHHHHHHHhCCCChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCCCchHHHHHHHHHhh
Confidence 9999999998 332 4455567899999999999999999999999999876 57899999874
No 78
>KOG0547|consensus
Probab=97.91 E-value=1.9e-05 Score=60.77 Aligned_cols=51 Identities=25% Similarity=0.339 Sum_probs=44.4
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
.++|.++.+|.-||+.++-+++|++|+.||++++++||+|+-++-..+.++
T Consensus 388 ~ldp~n~dvYyHRgQm~flL~q~e~A~aDF~Kai~L~pe~~~~~iQl~~a~ 438 (606)
T KOG0547|consen 388 DLDPENPDVYYHRGQMRFLLQQYEEAIADFQKAISLDPENAYAYIQLCCAL 438 (606)
T ss_pred hcCCCCCchhHhHHHHHHHHHHHHHHHHHHHHHhhcChhhhHHHHHHHHHH
Confidence 678889999999999999999999999999999999999988877776664
No 79
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=97.90 E-value=2.1e-05 Score=45.17 Aligned_cols=40 Identities=33% Similarity=0.514 Sum_probs=37.2
Q ss_pred HHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 91 AQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 91 a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
...|++.++|++|+..+++++.++|+++.+|+.+|.++..
T Consensus 2 ~~~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~ 41 (73)
T PF13371_consen 2 KQIYLQQEDYEEALEVLERALELDPDDPELWLQRARCLFQ 41 (73)
T ss_pred HHHHHhCCCHHHHHHHHHHHHHhCcccchhhHHHHHHHHH
Confidence 5678999999999999999999999999999999998864
No 80
>KOG1126|consensus
Probab=97.89 E-value=9.7e-06 Score=63.76 Aligned_cols=63 Identities=19% Similarity=0.169 Sum_probs=54.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|+.|.-.|.+ ++.|.+..+..-.+..+.++|+.++|+..+++|+.+||.|+-.-|.||..+..
T Consensus 505 ~e~Ae~~fqkA~~INP~nsvi~~~~g~~~~~~k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il~~ 569 (638)
T KOG1126|consen 505 LEFAEFHFQKAVEINPSNSVILCHIGRIQHQLKRKDKALQLYEKAIHLDPKNPLCKYHRASILFS 569 (638)
T ss_pred hhHHHHHHHhhhcCCccchhHHhhhhHHHHHhhhhhHHHHHHHHHHhcCCCCchhHHHHHHHHHh
Confidence 5555555555 77899999999999999999999999999999999999999999999988764
No 81
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=97.89 E-value=6.1e-05 Score=60.94 Aligned_cols=62 Identities=6% Similarity=-0.005 Sum_probs=38.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|...+.. ++.|.+..++.|+|.+..+++++++|+..++++|..+|+|..+++.+|.++.
T Consensus 102 ~~ea~~~l~~~~~~~Pd~~~a~~~~a~~L~~~~~~eeA~~~~~~~l~~~p~~~~~~~~~a~~l~ 165 (694)
T PRK15179 102 SDEGLAVWRGIHQRFPDSSEAFILMLRGVKRQQGIEAGRAEIELYFSGGSSSAREILLEAKSWD 165 (694)
T ss_pred cHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHhccHHHHHHHHHHHhhcCCCCHHHHHHHHHHHH
Confidence 6777776666 5556666666666666666666666666666666666666666666665553
No 82
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=97.88 E-value=9.8e-05 Score=50.22 Aligned_cols=62 Identities=19% Similarity=0.228 Sum_probs=44.6
Q ss_pred HHHHHHHHHHHh----hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLDSL----IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~~~----~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|+..+.+-+ .+.....+.++|.++..+|++++|+..+++++..+|++..+++..|.++.
T Consensus 115 ~~~A~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~ 180 (234)
T TIGR02521 115 YEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDKAEKYLTRALQIDPQRPESLLELAELYY 180 (234)
T ss_pred HHHHHHHHHHHHhccccccchHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCChHHHHHHHHHHH
Confidence 455555555411 13345577788888888888888888888888888888888888877764
No 83
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=97.87 E-value=0.00013 Score=50.43 Aligned_cols=62 Identities=23% Similarity=0.057 Sum_probs=52.3
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH---HHHHHHHHhhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI---KALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~---ka~~rrg~a~~ 129 (131)
|.+|++.|+. .-.+....+...+|.++.+.|+|.+|+..+++.+...|++. .|+|.+|.++.
T Consensus 21 y~~Ai~~f~~l~~~~P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~~~~~~A~Y~~g~~~~ 90 (203)
T PF13525_consen 21 YEEAIKLFEKLIDRYPNSPYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKLYPNSPKADYALYMLGLSYY 90 (203)
T ss_dssp HHHHHHHHHHHHHH-TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-TTHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhhHHHHHHHHHH
Confidence 9999999998 23467778999999999999999999999999999998775 69999999875
No 84
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=97.87 E-value=7.7e-05 Score=62.49 Aligned_cols=62 Identities=23% Similarity=0.198 Sum_probs=56.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|+..|.+ ++.|. ..++.|+|.++.++|++++|+..+.+++.++|++..+++..|.++..
T Consensus 592 ~~eAl~~~~~AL~l~P~-~~a~~~LA~~l~~lG~~deA~~~l~~AL~l~Pd~~~a~~nLG~aL~~ 655 (987)
T PRK09782 592 PELALNDLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWD 655 (987)
T ss_pred HHHHHHHHHHHHHhCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHH
Confidence 7888888888 67785 88999999999999999999999999999999999999999988753
No 85
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=97.85 E-value=8.4e-05 Score=59.85 Aligned_cols=63 Identities=11% Similarity=-0.072 Sum_probs=57.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+.+|+..+++ +++|....++.++|.++..+|++++|+..++++++++|++..+++.+|.++..
T Consensus 92 ~~~A~~~l~~~l~~~P~~~~a~~~la~~l~~~g~~~~Ai~~l~~Al~l~P~~~~a~~~la~~l~~ 156 (656)
T PRK15174 92 PDAVLQVVNKLLAVNVCQPEDVLLVASVLLKSKQYATVADLAEQAWLAFSGNSQIFALHLRTLVL 156 (656)
T ss_pred HHHHHHHHHHHHHhCCCChHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHH
Confidence 8888888888 78899999999999999999999999999999999999999999999888653
No 86
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=97.84 E-value=7.1e-05 Score=60.57 Aligned_cols=63 Identities=16% Similarity=-0.061 Sum_probs=59.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+.+|+..+++ ...|.+...++++|.|..++|+|++|+..|++++.-+|++.++|..+|.++..
T Consensus 136 ~eeA~~~~~~~l~~~p~~~~~~~~~a~~l~~~g~~~~A~~~y~~~~~~~p~~~~~~~~~a~~l~~ 200 (694)
T PRK15179 136 IEAGRAEIELYFSGGSSSAREILLEAKSWDEIGQSEQADACFERLSRQHPEFENGYVGWAQSLTR 200 (694)
T ss_pred HHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHhcchHHHHHHHHHHHhcCCCcHHHHHHHHHHHHH
Confidence 8888888888 77899999999999999999999999999999999999999999999999863
No 87
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=97.84 E-value=2.1e-05 Score=38.88 Aligned_cols=27 Identities=33% Similarity=0.490 Sum_probs=20.2
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPAL 104 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~ 104 (131)
+++|.++.+|+|+|.+|...|++++|+
T Consensus 7 e~~P~n~~a~~nla~~~~~~g~~~~A~ 33 (34)
T PF13431_consen 7 ELNPNNAEAYNNLANLYLNQGDYEEAI 33 (34)
T ss_pred HHCCCCHHHHHHHHHHHHHCcCHHhhc
Confidence 456777777777777777777777775
No 88
>KOG1155|consensus
Probab=97.81 E-value=7e-05 Score=57.32 Aligned_cols=63 Identities=10% Similarity=0.176 Sum_probs=60.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
.++||.+|.+ .++|....+|.=++.=|+.|++-..||..+..|+.++|.+-+|||-.|++|..
T Consensus 346 HEKAv~YFkRALkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYei 410 (559)
T KOG1155|consen 346 HEKAVMYFKRALKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEI 410 (559)
T ss_pred HHHHHHHHHHHHhcCcchhHHHHHhhHHHHHhcccHHHHHHHHHHHhcCchhHHHHhhhhHHHHH
Confidence 7899999999 88999999999999999999999999999999999999999999999999853
No 89
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=97.80 E-value=5.2e-05 Score=44.11 Aligned_cols=49 Identities=31% Similarity=0.562 Sum_probs=39.8
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC-------CCCHHHHHHHHHhhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLD-------PNNIKALQRKAKCLI 129 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d-------p~~~ka~~rrg~a~~ 129 (131)
|....+++|+|.+|..+|+|++|+..+++++.+- |.-+.++++.|.++.
T Consensus 2 ~~~a~~~~~la~~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~a~~~~~lg~~~~ 57 (78)
T PF13424_consen 2 PDTANAYNNLARVYRELGRYDEALDYYEKALDIEEQLGDDHPDTANTLNNLGECYY 57 (78)
T ss_dssp HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH
Confidence 5678899999999999999999999999999762 223677888887764
No 90
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=97.79 E-value=5.4e-05 Score=42.75 Aligned_cols=36 Identities=31% Similarity=0.469 Sum_probs=34.0
Q ss_pred HHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 95 IKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 95 ~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
++.|+|++|+..+++++..+|+|..+++..|.+++.
T Consensus 2 l~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~ 37 (68)
T PF14559_consen 2 LKQGDYDEAIELLEKALQRNPDNPEARLLLAQCYLK 37 (68)
T ss_dssp HHTTHHHHHHHHHHHHHHHTTTSHHHHHHHHHHHHH
T ss_pred hhccCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH
Confidence 678999999999999999999999999999999875
No 91
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=97.77 E-value=6.6e-05 Score=36.20 Aligned_cols=32 Identities=22% Similarity=0.400 Sum_probs=30.0
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
++.++|.|+.++|++++|+..+++++..-|++
T Consensus 2 a~~~~a~~~~~~g~~~~A~~~~~~~~~~~P~s 33 (33)
T PF13174_consen 2 ALYRLARCYYKLGDYDEAIEYFQRLIKRYPDS 33 (33)
T ss_dssp HHHHHHHHHHHHCHHHHHHHHHHHHHHHSTTS
T ss_pred HHHHHHHHHHHccCHHHHHHHHHHHHHHCcCC
Confidence 57899999999999999999999999999975
No 92
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=97.74 E-value=0.00015 Score=41.74 Aligned_cols=44 Identities=34% Similarity=0.410 Sum_probs=41.2
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
++.++|.++...|++++|+..+.++++..|++..+++..|.++.
T Consensus 2 ~~~~~a~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~ 45 (100)
T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYY 45 (100)
T ss_pred HHHHHHHHHHHHhcHHHHHHHHHHHHhcCCccHHHHHHHHHHHH
Confidence 57899999999999999999999999999999999999998875
No 93
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=97.73 E-value=4.2e-05 Score=37.82 Aligned_cols=24 Identities=25% Similarity=0.423 Sum_probs=22.1
Q ss_pred HHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 107 LENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 107 ~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|++||+++|+|+.+|++.|.+|..
T Consensus 2 y~kAie~~P~n~~a~~nla~~~~~ 25 (34)
T PF13431_consen 2 YKKAIELNPNNAEAYNNLANLYLN 25 (34)
T ss_pred hHHHHHHCCCCHHHHHHHHHHHHH
Confidence 789999999999999999998863
No 94
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=97.70 E-value=0.0002 Score=57.68 Aligned_cols=58 Identities=21% Similarity=0.204 Sum_probs=29.5
Q ss_pred HHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 71 ASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 71 Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|+..|++ ++.|....++.++|.++.++|++++|+..++++++++|++..+++..|.++
T Consensus 269 A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~~~eA~~~l~~al~l~P~~~~a~~~La~~l 328 (656)
T PRK15174 269 AAEHWRHALQFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLATHPDLPYVRAMYARAL 328 (656)
T ss_pred HHHHHHHHHhhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Confidence 4444444 444445555555555555555555555555555555555555555544444
No 95
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=97.66 E-value=0.00023 Score=58.26 Aligned_cols=63 Identities=16% Similarity=-0.034 Sum_probs=58.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+.+|+..+++ ...|.+..++.++|.++..+|++++|+..+++++.++|++...++-+|.+++.
T Consensus 375 ~~eA~~~l~~al~~~P~n~~l~~~lA~l~~~~g~~~~A~~~l~~al~l~Pd~~~l~~~~a~~al~ 439 (765)
T PRK10049 375 LPQAEMRARELAYNAPGNQGLRIDYASVLQARGWPRAAENELKKAEVLEPRNINLEVEQAWTALD 439 (765)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHhhCCCChHHHHHHHHHHHH
Confidence 8999999998 77899999999999999999999999999999999999999999988887653
No 96
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=97.65 E-value=0.00021 Score=50.91 Aligned_cols=63 Identities=24% Similarity=0.211 Sum_probs=59.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|+..+.+ .+.|....+|+-++.+|.++|++++|...+.+++++.|++..++-+.|..+..
T Consensus 116 ~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~p~~~nNlgms~~L 180 (257)
T COG5010 116 FGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLL 180 (257)
T ss_pred hHHHHHHHHHHhccCCCChhhhhHHHHHHHHccChhHHHHHHHHHHHhccCCchhhhhHHHHHHH
Confidence 9999999999 88999999999999999999999999999999999999999999888876653
No 97
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=97.65 E-value=0.00032 Score=47.61 Aligned_cols=63 Identities=19% Similarity=0.155 Sum_probs=50.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhC----------ChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLG----------SLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~----------~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|+.|.+.+.. ..+|..+..++|-+.+++.+- .+++|+.-+++||.++|+...|++..|.||..
T Consensus 7 FE~ark~aea~y~~nP~DadnL~~WG~ALLELAqfk~g~es~~miedAisK~eeAL~I~P~~hdAlw~lGnA~ts 81 (186)
T PF06552_consen 7 FEHARKKAEAAYAKNPLDADNLTNWGGALLELAQFKQGPESKKMIEDAISKFEEALKINPNKHDALWCLGNAYTS 81 (186)
T ss_dssp HHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCcHhHHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhcCCchHHHHHHHHHHHHH
Confidence 5667766666 678999999999999999873 46788999999999999999999999999864
No 98
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=97.63 E-value=0.00012 Score=56.14 Aligned_cols=63 Identities=25% Similarity=0.138 Sum_probs=59.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+++|++.+.+ .++|....+..|+|.+++++|++.+|+...+..+.-+|+|+-.|...++||..
T Consensus 356 ~~~A~e~~~kal~l~P~~~~l~~~~a~all~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~ 420 (484)
T COG4783 356 AKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAE 420 (484)
T ss_pred hHHHHHHHHHHHhcCCCccHHHHHHHHHHHhcCChHHHHHHHHHHhhcCCCCchHHHHHHHHHHH
Confidence 8888888888 78899999999999999999999999999999999999999999999999854
No 99
>KOG2076|consensus
Probab=97.63 E-value=0.00018 Score=58.66 Aligned_cols=62 Identities=26% Similarity=0.178 Sum_probs=53.4
Q ss_pred HHHHHHHHHH---HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD---SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~---~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|+.|++++.. .-.-.+..+|.++|.||+.+|.+++|+..+.+||.++|+|..+..+.+..+.
T Consensus 430 ~~~Al~~l~~i~~~~~~~~~~vw~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~~D~Ri~Lasl~~ 494 (895)
T KOG2076|consen 430 YKEALRLLSPITNREGYQNAFVWYKLARCYMELGEYEEAIEFYEKVLILAPDNLDARITLASLYQ 494 (895)
T ss_pred HHHHHHHHHHHhcCccccchhhhHHHHHHHHHHhhHHHHHHHHHHHHhcCCCchhhhhhHHHHHH
Confidence 9999999988 1112246799999999999999999999999999999999999888877654
No 100
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=97.62 E-value=0.00028 Score=60.18 Aligned_cols=61 Identities=15% Similarity=0.098 Sum_probs=55.9
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|+..|.+ .++|....++.++|.++..+|++++|+..++++++++|++..++...+..+
T Consensus 367 ~~eA~~~~~~Al~~~P~~~~a~~~Lg~~~~~~g~~~eA~~~y~~aL~~~p~~~~a~~~L~~l~ 429 (1157)
T PRK11447 367 LAQAERLYQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQALRMDPGNTNAVRGLANLY 429 (1157)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Confidence 8889999988 778889999999999999999999999999999999999999987776654
No 101
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=97.59 E-value=0.00045 Score=49.72 Aligned_cols=62 Identities=27% Similarity=0.241 Sum_probs=49.1
Q ss_pred HHHHHHHHHH--Hhh--hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLI--KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~--~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.++...++. ... +....+|.-+|.++.+.|++++|+.++.++|+++|+|..++...+..++
T Consensus 126 ~~~~~~~l~~~~~~~~~~~~~~~~~~~a~~~~~~G~~~~A~~~~~~al~~~P~~~~~~~~l~~~li 191 (280)
T PF13429_consen 126 YDEAEELLEKLEELPAAPDSARFWLALAEIYEQLGDPDKALRDYRKALELDPDDPDARNALAWLLI 191 (280)
T ss_dssp HHHHHHHHHHHHH-T---T-HHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHH
Confidence 6777777666 222 4677889999999999999999999999999999999999888887765
No 102
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=97.54 E-value=0.00043 Score=59.06 Aligned_cols=61 Identities=16% Similarity=0.047 Sum_probs=56.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|+..|++ ++.|....++.++|.+|.++|++++|+..++++++++|++..++|..|..+
T Consensus 477 ~~eA~~~~~~Al~~~P~~~~~~~~LA~~~~~~G~~~~A~~~l~~al~~~P~~~~~~~a~al~l 539 (1157)
T PRK11447 477 WAQAAELQRQRLALDPGSVWLTYRLAQDLRQAGQRSQADALMRRLAQQKPNDPEQVYAYGLYL 539 (1157)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHH
Confidence 8999999999 788999999999999999999999999999999999999999998877654
No 103
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=97.54 E-value=0.00027 Score=49.73 Aligned_cols=62 Identities=26% Similarity=0.229 Sum_probs=50.3
Q ss_pred HHHHHHHHHH----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|...|+. -.-+.....|-|++.|-+++|+++.|..++.++|++||++.-+....++-+.
T Consensus 119 ~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~l~~a~~~~ 184 (250)
T COG3063 119 PEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELARLHY 184 (250)
T ss_pred hHHHHHHHHHHHhCCCCCCcchhhhhhHHHHhhcCCchhHHHHHHHHHHhCcCCChHHHHHHHHHH
Confidence 6666666666 2235567789999999999999999999999999999999887777666554
No 104
>KOG1128|consensus
Probab=97.54 E-value=0.00022 Score=57.08 Aligned_cols=63 Identities=17% Similarity=0.230 Sum_probs=59.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|++|.++++. ++.|.....|++++.|.++++++..|..+|...+.++|+|..+|-+-+.+|+.
T Consensus 501 fs~~~~hle~sl~~nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~eaWnNls~ayi~ 565 (777)
T KOG1128|consen 501 FSEADKHLERSLEINPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEAWNNLSTAYIR 565 (777)
T ss_pred HHHHHHHHHHHhhcCccchhHHHhccHHHHHHhhhHHHHHHHHHHhhcCCCchhhhhhhhHHHHH
Confidence 8888888888 88999999999999999999999999999999999999999999999988863
No 105
>KOG1155|consensus
Probab=97.53 E-value=0.0004 Score=53.34 Aligned_cols=52 Identities=25% Similarity=0.208 Sum_probs=48.6
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
++.|....+|.-+|.||.|+++.++||.++.+|+.....+.-||+|.|++|-
T Consensus 426 ~~kPnDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~~~l~~LakLye 477 (559)
T KOG1155|consen 426 ELKPNDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEGSALVRLAKLYE 477 (559)
T ss_pred hcCCCchHHHHHHHHHHHHhccHHHHHHHHHHHHhccccchHHHHHHHHHHH
Confidence 5679999999999999999999999999999999999999999999999874
No 106
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=97.53 E-value=0.00036 Score=41.23 Aligned_cols=43 Identities=28% Similarity=0.196 Sum_probs=36.4
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHH
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENV 110 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~a 110 (131)
|.+|+.++++ +..+....+..-+|.|++++|+|++|+..+.++
T Consensus 41 y~~A~~~~~~~~~~~~~~~~~~l~a~~~~~l~~y~eAi~~l~~~ 84 (84)
T PF12895_consen 41 YEEAIELLQKLKLDPSNPDIHYLLARCLLKLGKYEEAIKALEKA 84 (84)
T ss_dssp HHHHHHHHHCHTHHHCHHHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHhCCCCCCHHHHHHHHHHHHHhCCHHHHHHHHhcC
Confidence 9999999987 566667777778899999999999999998875
No 107
>KOG0624|consensus
Probab=97.51 E-value=0.00032 Score=52.30 Aligned_cols=63 Identities=19% Similarity=0.139 Sum_probs=47.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|..|+..|.. +.+|.+-.+++.||.+|+-||+-.-|+.|.++||++.|+-.-|-..||..++.
T Consensus 54 ~sDALt~yHaAve~dp~~Y~aifrRaT~yLAmGksk~al~Dl~rVlelKpDF~~ARiQRg~vllK 118 (504)
T KOG0624|consen 54 LSDALTHYHAAVEGDPNNYQAIFRRATVYLAMGKSKAALQDLSRVLELKPDFMAARIQRGVVLLK 118 (504)
T ss_pred HHHHHHHHHHHHcCCchhHHHHHHHHHHHhhhcCCccchhhHHHHHhcCccHHHHHHHhchhhhh
Confidence 7777777766 67777777777777777777777777777777777777777777777776653
No 108
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=97.49 E-value=0.00065 Score=41.93 Aligned_cols=53 Identities=21% Similarity=0.191 Sum_probs=46.2
Q ss_pred HHHHHHHHHH--Hhhhh---hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHH
Q psy18096 68 GILASQELLD--SLIKE---RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~---~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka 120 (131)
|.+|+..|.. ...|. ...++.++|.++.+++++++|+..+++++...|++..+
T Consensus 55 ~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~ 112 (119)
T TIGR02795 55 YADAAKAFLAVVKKYPKSPKAPDALLKLGMSLQELGDKEKAKATLQQVIKRYPGSSAA 112 (119)
T ss_pred HHHHHHHHHHHHHHCCCCCcccHHHHHHHHHHHHhCChHHHHHHHHHHHHHCcCChhH
Confidence 8999999998 44443 46789999999999999999999999999999998764
No 109
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=97.48 E-value=0.0005 Score=55.52 Aligned_cols=51 Identities=20% Similarity=0.212 Sum_probs=37.6
Q ss_pred hhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 79 LIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 79 ~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+.|....++.++|.++..+|++++|+..++++++++|.+..+++..+.++.
T Consensus 832 ~~~~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~~~~~~~~l~~~~~ 882 (899)
T TIGR02917 832 LAPNIPAILDTLGWLLVEKGEADRALPLLRKAVNIAPEAAAIRYHLALALL 882 (899)
T ss_pred hCCCCcHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCChHHHHHHHHHHH
Confidence 345555666677777777788888888888888888777777777777765
No 110
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=97.47 E-value=0.00025 Score=51.83 Aligned_cols=62 Identities=26% Similarity=0.250 Sum_probs=49.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|++|.-+|+. +..+..+..++.+|.|++.+|+|++|.....++++.||+++.++.+...+..
T Consensus 183 ~~~A~y~f~El~~~~~~t~~~lng~A~~~l~~~~~~eAe~~L~~al~~~~~~~d~LaNliv~~~ 246 (290)
T PF04733_consen 183 YQDAFYIFEELSDKFGSTPKLLNGLAVCHLQLGHYEEAEELLEEALEKDPNDPDTLANLIVCSL 246 (290)
T ss_dssp CCHHHHHHHHHHCCS--SHHHHHHHHHHHHHCT-HHHHHHHHHHHCCC-CCHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHhccCCHHHHHHHHHHHH
Confidence 6777777776 3335677888999999999999999999999999999999999999876643
No 111
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=97.47 E-value=0.00084 Score=47.32 Aligned_cols=59 Identities=24% Similarity=0.068 Sum_probs=53.4
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
+..|-.-+++ +.+|.+..+|.-||..|.++|+.+.|-+.+.+||+++|+|...+.+-|.
T Consensus 51 ~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdVLNNYG~ 111 (250)
T COG3063 51 YAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGA 111 (250)
T ss_pred HHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCChhhHHHHHHHHHhcCCCccchhhhhhH
Confidence 7888888888 8899999999999999999999999999999999999999887776664
No 112
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=97.46 E-value=0.00061 Score=55.03 Aligned_cols=61 Identities=25% Similarity=0.263 Sum_probs=30.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|+..|++ +..|....++..+|.++...|+|++|+..++++++.+|.+..+++.+|.++
T Consensus 141 ~~~A~~~~~~a~~~~~~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~ 203 (899)
T TIGR02917 141 LELAQKSYEQALAIDPRSLYAKLGLAQLALAENRFDEARALIDEVLTADPGNVDALLLKGDLL 203 (899)
T ss_pred HHHHHHHHHHHHhcCCCChhhHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHH
Confidence 5555555554 334444444555555555555555555555555555555555555544443
No 113
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=97.43 E-value=0.0011 Score=47.79 Aligned_cols=62 Identities=15% Similarity=0.061 Sum_probs=51.8
Q ss_pred HHHHHHHHHH--Hhhhhh---HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC---CHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKER---INCYNNLAQAQIKLGSLEPALMSLENVLNLDPN---NIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~---~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~---~~ka~~rrg~a~~ 129 (131)
|.+|+..|.. ...|.. ..+++-+|.+|...|+|++|+..+.+++...|+ ...+||+.|.++.
T Consensus 159 y~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~ 228 (263)
T PRK10803 159 QDDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQ 228 (263)
T ss_pred HHHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHH
Confidence 8888888887 444443 578899999999999999999999999988876 5788998888774
No 114
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=97.42 E-value=0.0011 Score=49.51 Aligned_cols=62 Identities=23% Similarity=0.184 Sum_probs=43.4
Q ss_pred HHHHHHHHHH--Hhhhh-----hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKE-----RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~-----~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|+..+.. ...+. ...++.++|.++.+.|++++|+..++++++.+|++..+++..|.++.
T Consensus 157 ~~~A~~~~~~~~~~~~~~~~~~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~~p~~~~~~~~la~~~~ 225 (389)
T PRK11788 157 WQKAIDVAERLEKLGGDSLRVEIAHFYCELAQQALARGDLDAARALLKKALAADPQCVRASILLGDLAL 225 (389)
T ss_pred HHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHHHhCCCHHHHHHHHHHHHhHCcCCHHHHHHHHHHHH
Confidence 6666666665 22221 23456778888888888888888888888888888888877777664
No 115
>KOG1125|consensus
Probab=97.39 E-value=0.00017 Score=56.23 Aligned_cols=49 Identities=18% Similarity=0.177 Sum_probs=47.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPN 116 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~ 116 (131)
+.+||..|++ ++.|.++.+.+|+|.+++.+|.|++|+.++-.||.+.++
T Consensus 480 s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNlG~ykEA~~hlL~AL~mq~k 530 (579)
T KOG1125|consen 480 SEEAISAYNRALQLQPGYVRVRYNLGISCMNLGAYKEAVKHLLEALSMQRK 530 (579)
T ss_pred cHHHHHHHHHHHhcCCCeeeeehhhhhhhhhhhhHHHHHHHHHHHHHhhhc
Confidence 8999999999 999999999999999999999999999999999998865
No 116
>KOG1173|consensus
Probab=97.39 E-value=0.00048 Score=53.84 Aligned_cols=51 Identities=22% Similarity=0.199 Sum_probs=47.9
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
|.+||..|++ .+.|..+..|.-.|.+|..+|+++.|++.+.++|.++|+|.
T Consensus 471 ~~eAI~~~q~aL~l~~k~~~~~asig~iy~llgnld~Aid~fhKaL~l~p~n~ 523 (611)
T KOG1173|consen 471 YEEAIDYYQKALLLSPKDASTHASIGYIYHLLGNLDKAIDHFHKALALKPDNI 523 (611)
T ss_pred HHHHHHHHHHHHHcCCCchhHHHHHHHHHHHhcChHHHHHHHHHHHhcCCccH
Confidence 9999999999 77899999999999999999999999999999999999994
No 117
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=97.37 E-value=0.0018 Score=45.17 Aligned_cols=44 Identities=23% Similarity=0.258 Sum_probs=38.9
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhhCCC---CHHHHHHHHHhhhc
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNLDPN---NIKALQRKAKCLIL 130 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~dp~---~~ka~~rrg~a~~~ 130 (131)
..++|.+|++.|++.+|+..+.+++...|+ ..++++++|.++..
T Consensus 169 ~~~~a~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~a~~~l~~~~~~ 215 (235)
T TIGR03302 169 ELYVARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLK 215 (235)
T ss_pred HHHHHHHHHHcCChHHHHHHHHHHHHHCCCCcchHHHHHHHHHHHHH
Confidence 358899999999999999999999999765 46999999999864
No 118
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=97.35 E-value=0.001 Score=54.49 Aligned_cols=59 Identities=15% Similarity=0.016 Sum_probs=26.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKC 127 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a 127 (131)
+.+|+..+.+ +..|.... +.++|.++..+|++.+|+..++++++++|++..+++..|.+
T Consensus 99 ~~eA~~~l~~~l~~~P~~~~-~~~la~~l~~~g~~~~Al~~l~~al~~~P~~~~~~~~la~~ 159 (765)
T PRK10049 99 YDEALVKAKQLVSGAPDKAN-LLALAYVYKRAGRHWDELRAMTQALPRAPQTQQYPTEYVQA 159 (765)
T ss_pred HHHHHHHHHHHHHhCCCCHH-HHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHH
Confidence 3444444443 33344444 44444444444444444444444444444444444444443
No 119
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=97.34 E-value=0.00061 Score=34.41 Aligned_cols=31 Identities=39% Similarity=0.425 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
+.+++|+|.+|..+|+|++|+..+.+++.+-
T Consensus 2 a~~~~~la~~~~~~g~~~~A~~~~~~al~~~ 32 (42)
T PF13374_consen 2 ASALNNLANAYRAQGRYEEALELLEEALEIR 32 (42)
T ss_dssp HHHHHHHHHHHHHCT-HHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhhhcchhhHHHHHHHHHH
Confidence 4678999999999999999999999999753
No 120
>KOG1173|consensus
Probab=97.33 E-value=0.00088 Score=52.45 Aligned_cols=63 Identities=22% Similarity=0.123 Sum_probs=52.4
Q ss_pred HHHHHHHHHHHh------h---hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLDSL------I---KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~~~------~---~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|+.++...+ . +.-.+.++|+|.++.|++.|.+||..++++|.+.|+++..+---|-+|..
T Consensus 430 y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~k~~~~~asig~iy~l 501 (611)
T KOG1173|consen 430 YPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQKALLLSPKDASTHASIGYIYHL 501 (611)
T ss_pred hHHHHHHHHHHHHHhhhccccccchhHHHHhHHHHHHHHhhHHHHHHHHHHHHHcCCCchhHHHHHHHHHHH
Confidence 888888887721 1 12346799999999999999999999999999999999998887777653
No 121
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=97.31 E-value=0.0015 Score=42.44 Aligned_cols=63 Identities=25% Similarity=0.235 Sum_probs=50.0
Q ss_pred HHHHHHHHHH-----------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD-----------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~-----------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+.+|+.+|.. .+......+...++..+...|++++|+..|.+++.+||-+..+|...-++|..
T Consensus 29 ~~~al~ly~G~~l~~~~~~~W~~~~r~~l~~~~~~~~~~l~~~~~~~~~~~~a~~~~~~~l~~dP~~E~~~~~lm~~~~~ 108 (146)
T PF03704_consen 29 LEEALALYRGDFLPDLDDEEWVEPERERLRELYLDALERLAEALLEAGDYEEALRLLQRALALDPYDEEAYRLLMRALAA 108 (146)
T ss_dssp HHHHHTT--SSTTGGGTTSTTHHHHHHHHHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHhCCCCCCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHhccCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHH
Confidence 6777777765 34567778888999999999999999999999999999999999888777753
No 122
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=97.26 E-value=0.0017 Score=48.50 Aligned_cols=61 Identities=20% Similarity=0.220 Sum_probs=34.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH-HHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI-KALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~-ka~~rrg~a~ 128 (131)
+.+|+..+++ +..|....++.++|.++.+.|++++|+..++++++.+|.+. .++...+.++
T Consensus 196 ~~~A~~~~~~al~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~~p~~~~~~~~~l~~~~ 259 (389)
T PRK11788 196 LDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQDPEYLSEVLPKLMECY 259 (389)
T ss_pred HHHHHHHHHHHHhHCcCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHChhhHHHHHHHHHHHH
Confidence 5556666555 34455555666666666666666666666666666665542 3333444433
No 123
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=97.20 E-value=0.0012 Score=47.42 Aligned_cols=50 Identities=32% Similarity=0.395 Sum_probs=42.9
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+..+.++..+|.++..+|++++|+..+.+++..+|+|+..+...|.++..
T Consensus 211 ~~~~~~~~~la~~~~~lg~~~~Al~~~~~~~~~~p~d~~~~~~~a~~l~~ 260 (280)
T PF13429_consen 211 PDDPDLWDALAAAYLQLGRYEEALEYLEKALKLNPDDPLWLLAYADALEQ 260 (280)
T ss_dssp HTSCCHCHHHHHHHHHHT-HHHHHHHHHHHHHHSTT-HHHHHHHHHHHT-
T ss_pred cCHHHHHHHHHHHhcccccccccccccccccccccccccccccccccccc
Confidence 55667888999999999999999999999999999999999999988753
No 124
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=97.19 E-value=0.002 Score=49.00 Aligned_cols=62 Identities=19% Similarity=0.185 Sum_probs=51.7
Q ss_pred HHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 69 ILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 69 ~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
.+||+++.+ ...|....++.-.|..+++.++++.|+..+.++.++.|++.++|+..+++|+.
T Consensus 217 ~~AI~ll~~aL~~~p~d~~LL~~Qa~fLl~k~~~~lAL~iAk~av~lsP~~f~~W~~La~~Yi~ 280 (395)
T PF09295_consen 217 VEAIRLLNEALKENPQDSELLNLQAEFLLSKKKYELALEIAKKAVELSPSEFETWYQLAECYIQ 280 (395)
T ss_pred HHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCchhHHHHHHHHHHHHh
Confidence 444444444 33466778888899999999999999999999999999999999999999975
No 125
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=97.14 E-value=0.002 Score=43.23 Aligned_cols=53 Identities=15% Similarity=0.066 Sum_probs=44.7
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhhc
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLIL 130 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~~ 130 (131)
...+....++.|+|..+...|++++|+..+.+++.++|+. ..+++..|.++..
T Consensus 29 ~~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~ 84 (172)
T PRK02603 29 NKKAKEAFVYYRDGMSAQADGEYAEALENYEEALKLEEDPNDRSYILYNMGIIYAS 84 (172)
T ss_pred ccHhhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHH
Confidence 3456777889999999999999999999999999987653 4688888888753
No 126
>KOG1126|consensus
Probab=97.11 E-value=0.00084 Score=53.19 Aligned_cols=61 Identities=16% Similarity=0.164 Sum_probs=47.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|..+|+. ..+|..-.+|+-++.+|+|+++++.|.-.+.+|++++|.|.-.+..-|..+
T Consensus 471 ~d~a~~~fr~Al~~~~rhYnAwYGlG~vy~Kqek~e~Ae~~fqkA~~INP~nsvi~~~~g~~~ 533 (638)
T KOG1126|consen 471 FDKAMKSFRKALGVDPRHYNAWYGLGTVYLKQEKLEFAEFHFQKAVEINPSNSVILCHIGRIQ 533 (638)
T ss_pred HHhHHHHHHhhhcCCchhhHHHHhhhhheeccchhhHHHHHHHhhhcCCccchhHHhhhhHHH
Confidence 4455555554 456777888999999999999999999999999999998877666666554
No 127
>KOG3364|consensus
Probab=97.03 E-value=0.0022 Score=41.68 Aligned_cols=55 Identities=25% Similarity=0.305 Sum_probs=44.2
Q ss_pred HHHHHHHHHH----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHH
Q psy18096 68 GILASQELLD----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQ 122 (131)
Q Consensus 68 y~~Ai~~~~~----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~ 122 (131)
=++.|.+++. +-......+.+-+|..+.++++|+.+++.++..|+.+|+|..|.-
T Consensus 51 v~~GI~iLe~l~~~~~~~~rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~ 109 (149)
T KOG3364|consen 51 VQEGIVILEDLLKSAHPERRRECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQALE 109 (149)
T ss_pred HHHhHHHHHHHhhhcCcccchhhhhhhHHHHHHHhhHHHHHHHHHHHHhhCCCcHHHHH
Confidence 4566777766 222345567788999999999999999999999999999987753
No 128
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=97.00 E-value=0.0039 Score=42.40 Aligned_cols=57 Identities=23% Similarity=0.191 Sum_probs=44.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCC-----------hHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGS-----------LEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~-----------~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
|++||.-|+. .++|....++.|++.+|+.++. |++|..++.+|...||+|. +||++.
T Consensus 51 iedAisK~eeAL~I~P~~hdAlw~lGnA~ts~A~l~~d~~~A~~~F~kA~~~FqkAv~~~P~ne--~Y~ksL 120 (186)
T PF06552_consen 51 IEDAISKFEEALKINPNKHDALWCLGNAYTSLAFLTPDTAEAEEYFEKATEYFQKAVDEDPNNE--LYRKSL 120 (186)
T ss_dssp HHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH-TT-H--HHHHHH
T ss_pred HHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHhhcCChHHHHHHHHHHHHHHHHHHhcCCCcH--HHHHHH
Confidence 7788888877 8899999999999999998753 9999999999999999985 455543
No 129
>KOG0546|consensus
Probab=96.99 E-value=9.7e-05 Score=54.73 Aligned_cols=63 Identities=22% Similarity=0.165 Sum_probs=54.3
Q ss_pred HHHHHHHHHH----------H----hhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD----------S----LIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~----------~----~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|.++.++++. + +.......+.|.+.|-+|++.|..|+..|..++..|++..||+|||++++.+
T Consensus 245 ~~k~~r~~~~~s~~~~~e~~~~~~~~~~~r~~~~~n~~~~~lk~~~~~~a~~~~~~~~~~~~s~tka~~Rr~~~~~~ 321 (372)
T KOG0546|consen 245 YRKALRYLSEQSRDREKEQENRIPPLRELRFSIRRNLAAVGLKVKGRGGARFRTNEALRDERSKTKAHYRRGQAYKL 321 (372)
T ss_pred HHHHhhhhcccccccccccccccccccccccccccchHHhcccccCCCcceeccccccccChhhCcHHHHHHhHHHh
Confidence 7777777762 1 3455677889999999999999999999999999999999999999999865
No 130
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=96.98 E-value=0.0037 Score=52.03 Aligned_cols=50 Identities=20% Similarity=0.133 Sum_probs=44.3
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+.+.-++..+|.||-++|++++|+..++++|++||+|+-++-+.|-.|..
T Consensus 113 ~~~k~Al~~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~AY~~ae 162 (906)
T PRK14720 113 GENKLALRTLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKKLATSYEE 162 (906)
T ss_pred hhhhHHHHHHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHH
Confidence 33445888999999999999999999999999999999999998877653
No 131
>KOG4162|consensus
Probab=96.96 E-value=0.0033 Score=50.83 Aligned_cols=62 Identities=26% Similarity=0.229 Sum_probs=54.4
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHH--HHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALM--SLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~--~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+.+|...|.. .++|..+....-+|.|+++.|+..-|.+ ....++++||.|.+|||..|..+.
T Consensus 700 ~~EA~~af~~Al~ldP~hv~s~~Ala~~lle~G~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k 765 (799)
T KOG4162|consen 700 LEEAKEAFLVALALDPDHVPSMTALAELLLELGSPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFK 765 (799)
T ss_pred hHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCcchHHHHHHHHHHHhhCCCCHHHHHHHHHHHH
Confidence 6666666666 7789999999999999999998888877 899999999999999999998764
No 132
>KOG4648|consensus
Probab=96.91 E-value=0.0026 Score=47.67 Aligned_cols=42 Identities=36% Similarity=0.426 Sum_probs=39.2
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIK 119 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~k 119 (131)
.++..++++|..|+.+...+|...+|..||+.||.+.|++..
T Consensus 159 aLd~~Y~KAYSRR~~AR~~Lg~~~EAKkD~E~vL~LEP~~~E 200 (536)
T KOG4648|consen 159 ALDKLYVKAYSRRMQARESLGNNMEAKKDCETVLALEPKNIE 200 (536)
T ss_pred HhhHHHHHHHHHHHHHHHHHhhHHHHHHhHHHHHhhCcccHH
Confidence 567889999999999999999999999999999999998753
No 133
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=96.82 E-value=0.0082 Score=47.30 Aligned_cols=61 Identities=20% Similarity=0.225 Sum_probs=52.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|++|++++.. ..-.+...++--+|.+++++|++++|...+...|..+|+|..-|.....++
T Consensus 20 ~~~AL~~L~~~~~~I~Dk~~~~E~rA~ll~kLg~~~eA~~~y~~Li~rNPdn~~Yy~~L~~~~ 82 (517)
T PF12569_consen 20 YEEALEHLEKNEKQILDKLAVLEKRAELLLKLGRKEEAEKIYRELIDRNPDNYDYYRGLEEAL 82 (517)
T ss_pred HHHHHHHHHhhhhhCCCHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCcHHHHHHHHHHH
Confidence 9999999988 445667888899999999999999999999999999999987666555554
No 134
>PRK15095 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional
Probab=96.82 E-value=0.00052 Score=45.76 Aligned_cols=28 Identities=11% Similarity=0.272 Sum_probs=25.9
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCC
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKG 28 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G 28 (131)
+|.+|..|++||+..|+|+|+.|||...
T Consensus 48 le~aL~gm~~Ge~~~v~ipp~~ayG~~d 75 (156)
T PRK15095 48 LEQQLLGLKVGDKKTFSLEPEAAFGVPS 75 (156)
T ss_pred HHHHHcCCCCCCEEEEEEChHHhcCCCC
Confidence 4789999999999999999999999766
No 135
>PRK15331 chaperone protein SicA; Provisional
Probab=96.81 E-value=0.0044 Score=41.55 Aligned_cols=60 Identities=18% Similarity=0.047 Sum_probs=49.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|++||..|.. -+++..+.-+...|.||+.+|+...|+..+..|+. .|. -..+-+|+++++
T Consensus 87 y~~Ai~~Y~~A~~l~~~dp~p~f~agqC~l~l~~~~~A~~~f~~a~~-~~~-~~~l~~~A~~~L 148 (165)
T PRK15331 87 FQKACDLYAVAFTLLKNDYRPVFFTGQCQLLMRKAAKARQCFELVNE-RTE-DESLRAKALVYL 148 (165)
T ss_pred HHHHHHHHHHHHHcccCCCCccchHHHHHHHhCCHHHHHHHHHHHHh-Ccc-hHHHHHHHHHHH
Confidence 9999999988 55566666688899999999999999999999998 455 445566777765
No 136
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=96.78 E-value=0.01 Score=37.91 Aligned_cols=63 Identities=22% Similarity=0.118 Sum_probs=48.3
Q ss_pred HHHHHHHHHH--------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh-------hCCC----CHHHHH
Q psy18096 68 GILASQELLD--------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN-------LDPN----NIKALQ 122 (131)
Q Consensus 68 y~~Ai~~~~~--------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~-------~dp~----~~ka~~ 122 (131)
|.+|...+.+ +.+....-+|.-++.++.++|+|++++...+.+|. ++.+ ++.+-|
T Consensus 25 ~~eAa~s~r~AM~~srtiP~eEaFDh~GFDA~chA~Ls~A~~~Lgry~e~L~sA~~aL~YFNRRGEL~qdeGklWIaaVf 104 (144)
T PF12968_consen 25 YEEAAASCRKAMEVSRTIPAEEAFDHDGFDAFCHAGLSGALAGLGRYDECLQSADRALRYFNRRGELHQDEGKLWIAAVF 104 (144)
T ss_dssp HHHHHHHHHHHHHHHTTS-TTS---HHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHH--TTSTHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhccCChHhhcccccHHHHHHHHHHHHHHhhccHHHHHHHHHHHHHHHhhccccccccchhHHHHHH
Confidence 6666666655 44677888999999999999999999999999996 3333 467778
Q ss_pred HHHHhhhc
Q psy18096 123 RKAKCLIL 130 (131)
Q Consensus 123 rrg~a~~~ 130 (131)
+||.|+-.
T Consensus 105 sra~Al~~ 112 (144)
T PF12968_consen 105 SRAVALEG 112 (144)
T ss_dssp HHHHHHHH
T ss_pred HHHHHHHh
Confidence 89888754
No 137
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=96.67 E-value=0.0097 Score=39.62 Aligned_cols=49 Identities=12% Similarity=0.049 Sum_probs=41.8
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLI 129 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~ 129 (131)
.....+++++|.++..+++|++|+..+.+++.+.|+. .-+|+..|.++.
T Consensus 32 ~~~a~~~~~~g~~~~~~g~~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~ 83 (168)
T CHL00033 32 EKEAFTYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHT 83 (168)
T ss_pred hHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhccccchhhHHHHHHHHHHHH
Confidence 4467889999999999999999999999999987653 458888888775
No 138
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=96.66 E-value=0.0093 Score=43.99 Aligned_cols=62 Identities=10% Similarity=-0.055 Sum_probs=48.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH----HHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI----KALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~----ka~~rrg~a~~ 129 (131)
|.+|+..+++ ++.|....++.++|.++...|++++|+..+++++..+|.+. ..|+..|..++
T Consensus 130 ~~~A~~~~~~al~~~p~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~~~~~~~~~~~~~la~~~~ 197 (355)
T cd05804 130 YDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYL 197 (355)
T ss_pred HHHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhccCCCcchhHHHHHHHHHHHH
Confidence 8888888888 77788888888999999999999999999999998876432 34445555543
No 139
>PRK11906 transcriptional regulator; Provisional
Probab=96.63 E-value=0.012 Score=45.49 Aligned_cols=53 Identities=6% Similarity=-0.098 Sum_probs=49.6
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
++++..+.++..+|.+..-.++++.|+..+++|+.++|+.+-+||..|..++.
T Consensus 332 eld~~Da~a~~~~g~~~~~~~~~~~a~~~f~rA~~L~Pn~A~~~~~~~~~~~~ 384 (458)
T PRK11906 332 DITTVDGKILAIMGLITGLSGQAKVSHILFEQAKIHSTDIASLYYYRALVHFH 384 (458)
T ss_pred hcCCCCHHHHHHHHHHHHhhcchhhHHHHHHHHhhcCCccHHHHHHHHHHHHH
Confidence 78899999999999999999999999999999999999999999999987653
No 140
>KOG1840|consensus
Probab=96.61 E-value=0.0055 Score=48.11 Aligned_cols=47 Identities=21% Similarity=0.004 Sum_probs=40.0
Q ss_pred HHHHHHHHHH----------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD----------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~----------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
|.+|+.+|.+ +-.|..+..+.|+|..|.+.|+|.+|..+|++|+++-
T Consensus 257 ~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~ 313 (508)
T KOG1840|consen 257 YDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIY 313 (508)
T ss_pred HHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhccCChHHHHHHHHHHHHHH
Confidence 6666666666 4568888999999999999999999999999999864
No 141
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=96.60 E-value=0.0078 Score=47.42 Aligned_cols=46 Identities=26% Similarity=0.261 Sum_probs=41.6
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
..+++.+|++|..+|++++|+...++||+.+|..+..|+.+|+.|-
T Consensus 194 lw~~~~lAqhyd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilK 239 (517)
T PF12569_consen 194 LWTLYFLAQHYDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILK 239 (517)
T ss_pred HHHHHHHHHHHHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHH
Confidence 3466789999999999999999999999999999999999998863
No 142
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=96.56 E-value=0.034 Score=35.40 Aligned_cols=49 Identities=27% Similarity=0.194 Sum_probs=35.9
Q ss_pred HHHHHHHHHH----Hh-hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC
Q psy18096 68 GILASQELLD----SL-IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPN 116 (131)
Q Consensus 68 y~~Ai~~~~~----~~-~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~ 116 (131)
..+||.+|.+ .+ .+....++.++|.++..+|++++|+...++++.-.|+
T Consensus 17 ~~~Ai~~Y~~Al~~gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~p~ 70 (120)
T PF12688_consen 17 EEEAIPLYRRALAAGLSGADRRRALIQLASTLRNLGRYDEALALLEEALEEFPD 70 (120)
T ss_pred HHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCC
Confidence 5677777776 11 2345567788888888888888888888888877666
No 143
>KOG0376|consensus
Probab=96.53 E-value=0.0026 Score=48.98 Aligned_cols=54 Identities=31% Similarity=0.333 Sum_probs=50.9
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
|..|+..+.+ +++|.+..+|..+|.+.++++++.+|+.+++.+..+.|+.+++.
T Consensus 54 ~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m~l~~~~~A~~~l~~~~~l~Pnd~~~~ 109 (476)
T KOG0376|consen 54 FGGALHDALKAIELDPTYIKAYVRRGTAVMALGEFKKALLDLEKVKKLAPNDPDAT 109 (476)
T ss_pred hhhHHHHHHhhhhcCchhhheeeeccHHHHhHHHHHHHHHHHHHhhhcCcCcHHHH
Confidence 8899998888 88999999999999999999999999999999999999988764
No 144
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=96.45 E-value=0.016 Score=42.25 Aligned_cols=63 Identities=25% Similarity=0.141 Sum_probs=48.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhC---ChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLG---SLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~---~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+..|+..|.+ .+.+.++.++.-.|.++..+. .-.++...++++|.+||+|+++++-.|.+++.
T Consensus 172 ~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll~~al~~D~~~iral~lLA~~afe 239 (287)
T COG4235 172 ASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALALDPANIRALSLLAFAAFE 239 (287)
T ss_pred hhHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCcccHHHHHHHHHHHhcCCccHHHHHHHHHHHHH
Confidence 7777777777 777888888888888877764 35677788888888888888888888777654
No 145
>PRK14574 hmsH outer membrane protein; Provisional
Probab=96.41 E-value=0.016 Score=48.11 Aligned_cols=60 Identities=15% Similarity=0.022 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHhhhhhH--HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHh
Q psy18096 68 GILASQELLDSLIKERI--NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKC 127 (131)
Q Consensus 68 y~~Ai~~~~~~~~~~~~--~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a 127 (131)
+.+|+.++++-+++... ..+..+|..+..+|+|++|+..++++++.+|+|+.+++-.+.+
T Consensus 84 ~~~A~~~~eka~~p~n~~~~~llalA~ly~~~gdyd~Aiely~kaL~~dP~n~~~l~gLa~~ 145 (822)
T PRK14574 84 DQEVIDVYERYQSSMNISSRGLASAARAYRNEKRWDQALALWQSSLKKDPTNPDLISGMIMT 145 (822)
T ss_pred cHHHHHHHHHhccCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHH
Confidence 56666666653333333 3333446677777777777777777777777777777644443
No 146
>KOG0545|consensus
Probab=96.34 E-value=0.024 Score=40.72 Aligned_cols=57 Identities=19% Similarity=0.187 Sum_probs=48.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
|-++++..+. ...|.++++|..||.++...-+..+|.+|+.++|++||.-..+--|.
T Consensus 246 ~yevleh~seiL~~~~~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpslasvVsrE 304 (329)
T KOG0545|consen 246 YYEVLEHCSEILRHHPGNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSLASVVSRE 304 (329)
T ss_pred HHHHHHHHHHHHhcCCchHHHHHHHHHHHHhhcCHHHHHHHHHHHHhcChhhHHHHHHH
Confidence 6666766666 66799999999999999999999999999999999999766555443
No 147
>KOG3785|consensus
Probab=96.33 E-value=0.038 Score=41.79 Aligned_cols=61 Identities=21% Similarity=-0.015 Sum_probs=50.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|++||+.|++ .-.+....+-.++|+||.|++-|+-+-.....-|+--|++.=|.--++..+
T Consensus 167 YQeAIdvYkrvL~dn~ey~alNVy~ALCyyKlDYydvsqevl~vYL~q~pdStiA~NLkacn~ 229 (557)
T KOG3785|consen 167 YQEAIDVYKRVLQDNPEYIALNVYMALCYYKLDYYDVSQEVLKVYLRQFPDSTIAKNLKACNL 229 (557)
T ss_pred HHHHHHHHHHHHhcChhhhhhHHHHHHHHHhcchhhhHHHHHHHHHHhCCCcHHHHHHHHHHH
Confidence 9999999999 445777778889999999999999998888888888898876665554433
No 148
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=96.33 E-value=0.016 Score=45.75 Aligned_cols=51 Identities=20% Similarity=0.051 Sum_probs=36.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIK 119 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~k 119 (131)
+.+|...+++ ++.+ ...+|..+|.++...|++++|+..+.+|+.++|.+..
T Consensus 436 ~~~A~~~l~rAl~L~p-s~~a~~~lG~~~~~~G~~~eA~~~~~~A~~L~P~~pt 488 (517)
T PRK10153 436 TDEAYQAINKAIDLEM-SWLNYVLLGKVYELKGDNRLAADAYSTAFNLRPGENT 488 (517)
T ss_pred HHHHHHHHHHHHHcCC-CHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCCCch
Confidence 6777777776 5566 4567777777777777777777777777777777653
No 149
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=96.32 E-value=0.055 Score=38.78 Aligned_cols=62 Identities=19% Similarity=0.070 Sum_probs=53.4
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~ 129 (131)
|.+|++.|+. -..|....+...++.++.|-++|++|+...++=+.+.|++ .-++|=+|.+++
T Consensus 50 ~~~A~~~fe~l~~~~p~s~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~YlkgLs~~ 119 (254)
T COG4105 50 YEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADYAYYLKGLSYF 119 (254)
T ss_pred HHHHHHHHHHHHHcCCCCcccHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCCChhHHHHHHHHHHh
Confidence 9999999988 3456677889999999999999999999999999999866 467777887754
No 150
>KOG2002|consensus
Probab=96.29 E-value=0.0031 Score=52.17 Aligned_cols=111 Identities=21% Similarity=0.085 Sum_probs=72.3
Q ss_pred EEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccccCCcc-------CcCcchhhhHHHH-HHHHHHHHHH----Hhhhhh
Q psy18096 16 IEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSVLPDFD-------LAELPVEKRLDFG-GILASQELLD----SLIKER 83 (131)
Q Consensus 16 v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~~~~~-------~~~~~~~e~~~~~-y~~Ai~~~~~----~~~~~~ 83 (131)
+..+|+..|.-.|.. |--...-.|..++.-|....+ .|-+.+.--++.+ |..||+.|+. -.....
T Consensus 639 L~~dpkN~yAANGIg---iVLA~kg~~~~A~dIFsqVrEa~~~~~dv~lNlah~~~e~~qy~~AIqmYe~~lkkf~~~~~ 715 (1018)
T KOG2002|consen 639 LRNDPKNMYAANGIG---IVLAEKGRFSEARDIFSQVREATSDFEDVWLNLAHCYVEQGQYRLAIQMYENCLKKFYKKNR 715 (1018)
T ss_pred HhcCcchhhhccchh---hhhhhccCchHHHHHHHHHHHHHhhCCceeeeHHHHHHHHHHHHHHHHHHHHHHHHhcccCC
Confidence 456777777777742 111111122223322332222 3333344444555 8899998888 234556
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
..+..-+|.++...|.|.+|.+....|+.+.|.|.-..|+.|.++.
T Consensus 716 ~~vl~~Lara~y~~~~~~eak~~ll~a~~~~p~~~~v~FN~a~v~k 761 (1018)
T KOG2002|consen 716 SEVLHYLARAWYEAGKLQEAKEALLKARHLAPSNTSVKFNLALVLK 761 (1018)
T ss_pred HHHHHHHHHHHHHhhhHHHHHHHHHHHHHhCCccchHHhHHHHHHH
Confidence 7788889999999999999999999999999999888888887764
No 151
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=96.27 E-value=0.014 Score=42.99 Aligned_cols=50 Identities=20% Similarity=0.163 Sum_probs=44.4
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
.+....++.++|.++..+|++++|+..++++++++|++..++...|.++.
T Consensus 110 ~~~~~~~~~~~a~~~~~~G~~~~A~~~~~~al~~~p~~~~~~~~la~i~~ 159 (355)
T cd05804 110 NPDYWYLLGMLAFGLEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLE 159 (355)
T ss_pred CCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCCcHHHHHHHHHHH
Confidence 45556677899999999999999999999999999999999888888775
No 152
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=96.25 E-value=0.035 Score=40.25 Aligned_cols=62 Identities=18% Similarity=0.141 Sum_probs=43.8
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCC------CCHHHH-HHHHHhhh
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDP------NNIKAL-QRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp------~~~ka~-~rrg~a~~ 129 (131)
|++|+++|.. +.......++.+.|.++.++++|++|+..++++....- -+++-+ ++.+.+++
T Consensus 138 Y~~A~~~y~~e~~~~~a~~~~~~~A~l~~~l~~y~~A~~~~e~~~~~~l~~~l~~~~~~~~~l~a~l~~L 207 (282)
T PF14938_consen 138 YQKAAELYEQEGSPHSAAECLLKAADLYARLGRYEEAIEIYEEVAKKCLENNLLKYSAKEYFLKAILCHL 207 (282)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHTCCCHCTTGHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHCCChhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHH
Confidence 6777777766 43445667889999999999999999999999987431 134544 55565554
No 153
>PRK11906 transcriptional regulator; Provisional
Probab=96.21 E-value=0.024 Score=43.79 Aligned_cols=62 Identities=19% Similarity=0.116 Sum_probs=52.0
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHh---------CChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKL---------GSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl---------~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
-.+|+.++.+ +++|.++.+|.-+|.||+.+ ..-.+|.+..++|+++||.|+.|++..|.++.
T Consensus 274 ~~~Al~lf~ra~~~~~ldp~~a~a~~~lA~~h~~~~~~g~~~~~~~~~~a~~~A~rAveld~~Da~a~~~~g~~~~ 349 (458)
T PRK11906 274 IYRAMTIFDRLQNKSDIQTLKTECYCLLAECHMSLALHGKSELELAAQKALELLDYVSDITTVDGKILAIMGLITG 349 (458)
T ss_pred HHHHHHHHHHHhhcccCCcccHHHHHHHHHHHHHHHHhcCCCchHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Confidence 4566666666 67899999999999999975 23567889999999999999999999999764
No 154
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=96.21 E-value=0.023 Score=43.63 Aligned_cols=61 Identities=15% Similarity=0.105 Sum_probs=50.8
Q ss_pred HHHHHHHHHH-------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD-------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~-------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|..|++++++ ++.....-+-.-+..||+++++.+-|+...-+.|.++|.+..-++|+|-+.
T Consensus 199 F~taLelcskg~a~~k~~~~~~~di~~vaSfIetklv~CYL~~rkpdlALnh~hrsI~lnP~~frnHLrqAavf 272 (569)
T PF15015_consen 199 FRTALELCSKGAALSKPFKASAEDISSVASFIETKLVTCYLRMRKPDLALNHSHRSINLNPSYFRNHLRQAAVF 272 (569)
T ss_pred HHHHHHHHhhhhhccCCCCCChhhHHHHHHHHHHHHHHhhhhcCCCchHHHHHhhhhhcCcchhhHHHHHHHHH
Confidence 7777777777 334444555668999999999999999999999999999999999988765
No 155
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=96.15 E-value=0.041 Score=35.78 Aligned_cols=61 Identities=15% Similarity=0.047 Sum_probs=45.0
Q ss_pred HHHHHHHHHH--Hh---hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SL---IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~---~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+|+..|+. +. ......+.+++|.+++.+|+|++|+...+. +.-.+-...++..+|.++.
T Consensus 64 ~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~~~~~~d~Al~~L~~-~~~~~~~~~~~~~~Gdi~~ 129 (145)
T PF09976_consen 64 YDEAKAALEKALANAPDPELKPLARLRLARILLQQGQYDEALATLQQ-IPDEAFKALAAELLGDIYL 129 (145)
T ss_pred HHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHh-ccCcchHHHHHHHHHHHHH
Confidence 8899999988 11 234566888999999999999999999866 3333444566677777765
No 156
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=96.12 E-value=0.027 Score=40.66 Aligned_cols=53 Identities=11% Similarity=-0.017 Sum_probs=46.0
Q ss_pred HHHHHHHHHH--H---hhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHH
Q psy18096 68 GILASQELLD--S---LIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 68 y~~Ai~~~~~--~---~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka 120 (131)
|.+|+..|.. . -.+....++.++|.++..+|++++|+..++++++..|++.-+
T Consensus 196 ~~~A~~~f~~vv~~yP~s~~~~dAl~klg~~~~~~g~~~~A~~~~~~vi~~yP~s~~a 253 (263)
T PRK10803 196 KDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGDTAKAKAVYQQVIKKYPGTDGA 253 (263)
T ss_pred HHHHHHHHHHHHHHCCCCcchhHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHH
Confidence 8899998888 2 235578899999999999999999999999999999988644
No 157
>KOG2002|consensus
Probab=96.08 E-value=0.021 Score=47.48 Aligned_cols=62 Identities=15% Similarity=0.077 Sum_probs=49.7
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC-HHHHHHHHHhhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN-IKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~-~ka~~rrg~a~~ 129 (131)
|..+..++.. ...+..+..|+++|.+|-.+|+|++|..++-++++.+|++ +-+++..|+.++
T Consensus 286 y~~v~~la~~ai~~t~~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~GlgQm~i 353 (1018)
T KOG2002|consen 286 YERVWHLAEHAIKNTENKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYI 353 (1018)
T ss_pred HHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHhhccHHHHHHHHHHHHccCCCCccccccchhHHHH
Confidence 4455444443 3356777789999999999999999999999999999998 888888888775
No 158
>KOG0551|consensus
Probab=96.06 E-value=0.01 Score=44.08 Aligned_cols=54 Identities=20% Similarity=0.164 Sum_probs=48.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
|..||...+. .++|...+++..-|.|++.++++++|+..|+..+++|..+-++.
T Consensus 135 yRs~l~Dcs~al~~~P~h~Ka~~R~Akc~~eLe~~~~a~nw~ee~~~~d~e~K~~~ 190 (390)
T KOG0551|consen 135 YRSALNDCSAALKLKPTHLKAYIRGAKCLLELERFAEAVNWCEEGLQIDDEAKKAI 190 (390)
T ss_pred HHHHHHHHHHHHhcCcchhhhhhhhhHHHHHHHHHHHHHHHHhhhhhhhHHHHHHH
Confidence 8888888888 88999999999999999999999999999999999886655543
No 159
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=95.96 E-value=0.023 Score=41.42 Aligned_cols=52 Identities=25% Similarity=0.287 Sum_probs=47.6
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
..+|.++.-|.=+|.+|+.+|++..|...+.+|++++|+|+..+.-.|.++.
T Consensus 150 ~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~ 201 (287)
T COG4235 150 QQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALY 201 (287)
T ss_pred HhCCCCchhHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Confidence 4578999999999999999999999999999999999999998888877764
No 160
>KOG3060|consensus
Probab=95.94 E-value=0.055 Score=38.95 Aligned_cols=48 Identities=29% Similarity=0.396 Sum_probs=34.3
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
+....+|--+|-.|+.+++|++|.-++++++-+.|-|.--+-|.|..+
T Consensus 151 ~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~ 198 (289)
T KOG3060|consen 151 MNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVL 198 (289)
T ss_pred cCcHHHHHHHHHHHHhHhHHHHHHHHHHHHHHcCCCcHHHHHHHHHHH
Confidence 445567777777777788888888888888877777776665555543
No 161
>KOG2076|consensus
Probab=95.94 E-value=0.041 Score=45.46 Aligned_cols=49 Identities=14% Similarity=0.287 Sum_probs=37.5
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.....|.-++---.++|+|.+|+-++++||+++|.|.+-.++|+..+.
T Consensus 204 p~d~e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~~ers~L~~ 252 (895)
T KOG2076|consen 204 PKDYELWKRLADLSEQLGNINQARYCYSRAIQANPSNWELIYERSSLYQ 252 (895)
T ss_pred CCChHHHHHHHHHHHhcccHHHHHHHHHHHHhcCCcchHHHHHHHHHHH
Confidence 3344445555555555678999999999999999999999999988775
No 162
>KOG1129|consensus
Probab=95.88 E-value=0.015 Score=43.49 Aligned_cols=43 Identities=23% Similarity=0.216 Sum_probs=39.7
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka 120 (131)
+.-|..++.++-.|..|-.|+++++|++.+..|++++|.|+.|
T Consensus 284 d~fP~~VT~l~g~ARi~eam~~~~~a~~lYk~vlk~~~~nvEa 326 (478)
T KOG1129|consen 284 DSFPFDVTYLLGQARIHEAMEQQEDALQLYKLVLKLHPINVEA 326 (478)
T ss_pred hcCCchhhhhhhhHHHHHHHHhHHHHHHHHHHHHhcCCcccee
Confidence 4568899999999999999999999999999999999999876
No 163
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=95.80 E-value=0.063 Score=41.64 Aligned_cols=63 Identities=19% Similarity=0.120 Sum_probs=54.9
Q ss_pred HHHHHHHHHH--------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
|-.|+..|.. ...|.++.+..=++..+++.|+.++|++.+.+++.++|+..-.++..|++|+.
T Consensus 310 YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~all~ 386 (484)
T COG4783 310 YGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLK 386 (484)
T ss_pred HHHHHHHHHhcccchHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCChHHHHHHHHHHHhcCCCccHHHHHHHHHHHh
Confidence 7777777655 34578888888999999999999999999999999999999889999999875
No 164
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=95.76 E-value=0.062 Score=32.49 Aligned_cols=47 Identities=28% Similarity=0.206 Sum_probs=37.2
Q ss_pred HHHHHHHHHH-------Hhh----hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD-------SLI----KERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~-------~~~----~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
|..|++.+.+ +.. .....+..|+|..+..+|++++|+..+++++.+-
T Consensus 14 y~~A~d~L~~~fD~~~~~~~~~~~~~~~~all~lA~~~~~~G~~~~A~~~l~eAi~~A 71 (94)
T PF12862_consen 14 YSEALDALHRYFDYAKQSNNSSSNSGLAYALLNLAELHRRFGHYEEALQALEEAIRLA 71 (94)
T ss_pred HHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 7777776666 111 1356678899999999999999999999999874
No 165
>KOG1128|consensus
Probab=95.73 E-value=0.015 Score=47.10 Aligned_cols=56 Identities=20% Similarity=0.261 Sum_probs=50.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQR 123 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~r 123 (131)
++.|++.|.. .++|.+...|+|++.+|+++++-.+|.....+|++-+-++.+.|-+
T Consensus 535 ~q~av~aF~rcvtL~Pd~~eaWnNls~ayi~~~~k~ra~~~l~EAlKcn~~~w~iWEN 592 (777)
T KOG1128|consen 535 EQAAVKAFHRCVTLEPDNAEAWNNLSTAYIRLKKKKRAFRKLKEALKCNYQHWQIWEN 592 (777)
T ss_pred hHHHHHHHHHHhhcCCCchhhhhhhhHHHHHHhhhHHHHHHHHHHhhcCCCCCeeeec
Confidence 8999999999 8899999999999999999999999999999999998777665543
No 166
>KOG1174|consensus
Probab=95.72 E-value=0.034 Score=42.73 Aligned_cols=54 Identities=19% Similarity=0.122 Sum_probs=44.6
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
|+.+|.++++ -.+-....++.-+|...--.+++.+|++.+..||.+||+|.+++
T Consensus 454 ~~D~i~LLe~~L~~~~D~~LH~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP~~~~sl 508 (564)
T KOG1174|consen 454 TKDIIKLLEKHLIIFPDVNLHNHLGDIMRAQNEPQKAMEYYYKALRQDPKSKRTL 508 (564)
T ss_pred cchHHHHHHHHHhhccccHHHHHHHHHHHHhhhHHHHHHHHHHHHhcCccchHHH
Confidence 6777777766 22334567888899999999999999999999999999998875
No 167
>KOG1840|consensus
Probab=95.49 E-value=0.044 Score=43.14 Aligned_cols=47 Identities=19% Similarity=0.119 Sum_probs=38.0
Q ss_pred HHHHHHHHHH----------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD----------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~----------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
|++|+.+|.. +-.+..+..+.|+|.+|.++|+|++|...+.+||+..
T Consensus 341 ~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~ 397 (508)
T KOG1840|consen 341 YEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQIL 397 (508)
T ss_pred hhHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHHHHHHH
Confidence 5555555544 3345778899999999999999999999999999864
No 168
>KOG3060|consensus
Probab=95.48 E-value=0.082 Score=38.08 Aligned_cols=57 Identities=26% Similarity=0.200 Sum_probs=50.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhC---ChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLG---SLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~---~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
|.+|+=+|++ -+.|.+.-+|..+|-++.-+| ++.-|..++.++|+++|.|..+||--
T Consensus 170 f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~gg~eN~~~arkyy~~alkl~~~~~ral~GI 231 (289)
T KOG3060|consen 170 FEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQGGAENLELARKYYERALKLNPKNLRALFGI 231 (289)
T ss_pred HHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHhChHhHHHHHHH
Confidence 8999988888 668999999999999998775 68889999999999999999998854
No 169
>KOG4814|consensus
Probab=95.41 E-value=0.089 Score=42.47 Aligned_cols=56 Identities=11% Similarity=0.012 Sum_probs=46.1
Q ss_pred HHHHHHHHHH--------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHH
Q psy18096 68 GILASQELLD--------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQR 123 (131)
Q Consensus 68 y~~Ai~~~~~--------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~r 123 (131)
|..+++.|.. +.+-..++..-|++.||+++.+.+.|++...+|=+.||.++-.-+.
T Consensus 370 Y~~s~~~y~~Sl~~i~~D~~~~~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~~d~~~~l~q~~ 433 (872)
T KOG4814|consen 370 YVVSIRFYKLSLKDIISDNYSDRFAKIQRALQVCYLKLEQLDNAVEVYQEAEEVDRQSPLCQLL 433 (872)
T ss_pred HHHHHHHHHHHHHhccchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhhccccHHHHHH
Confidence 7777777766 3445568999999999999999999999999999999988644433
No 170
>PRK10941 hypothetical protein; Provisional
Probab=95.41 E-value=0.087 Score=38.24 Aligned_cols=48 Identities=19% Similarity=0.231 Sum_probs=30.5
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
.......|+=.+|++.++|+.|+++++.+|.++|+++.-+-.||.+|.
T Consensus 179 il~Rml~nLK~~~~~~~~~~~AL~~~e~ll~l~P~dp~e~RDRGll~~ 226 (269)
T PRK10941 179 VIRKLLDTLKAALMEEKQMELALRASEALLQFDPEDPYEIRDRGLIYA 226 (269)
T ss_pred HHHHHHHHHHHHHHHcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Confidence 344455666666666666666666666666666666666655665553
No 171
>KOG2003|consensus
Probab=95.34 E-value=0.011 Score=45.93 Aligned_cols=56 Identities=25% Similarity=0.217 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHh-------hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 68 GILASQELLDSL-------IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 68 y~~Ai~~~~~~~-------~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
|.+||++|...+ ....+++++|.+..+.++|+|+.||..++...+-.|+-. +-++.
T Consensus 253 fskaikfyrmaldqvpsink~~rikil~nigvtfiq~gqy~dainsfdh~m~~~pn~~-a~~nl 315 (840)
T KOG2003|consen 253 FSKAIKFYRMALDQVPSINKDMRIKILNNIGVTFIQAGQYDDAINSFDHCMEEAPNFI-AALNL 315 (840)
T ss_pred HHHHHHHHHHHHhhccccchhhHHHHHhhcCeeEEecccchhhHhhHHHHHHhCccHH-hhhhh
Confidence 999999998822 356789999999999999999999999999999999644 43443
No 172
>KOG1129|consensus
Probab=95.32 E-value=0.039 Score=41.27 Aligned_cols=49 Identities=8% Similarity=0.040 Sum_probs=39.0
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCC---CCHHHHHHHHHhh
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDP---NNIKALQRKAKCL 128 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp---~~~ka~~rrg~a~ 128 (131)
.-.++.+|+|+++|.+--++++-++..+.+|+..-. .-...||+.|.+.
T Consensus 354 G~~speLf~NigLCC~yaqQ~D~~L~sf~RAlstat~~~~aaDvWYNlg~va 405 (478)
T KOG1129|consen 354 GAQSPELFCNIGLCCLYAQQIDLVLPSFQRALSTATQPGQAADVWYNLGFVA 405 (478)
T ss_pred cCCChHHHhhHHHHHHhhcchhhhHHHHHHHHhhccCcchhhhhhhccceeE
Confidence 345778999999999999999999999999998643 3467788877653
No 173
>COG1047 SlpA FKBP-type peptidyl-prolyl cis-trans isomerases 2 [Posttranslational modification, protein turnover, chaperones]
Probab=95.32 E-value=0.0091 Score=40.37 Aligned_cols=28 Identities=29% Similarity=0.421 Sum_probs=26.2
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCC
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKG 28 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G 28 (131)
||.||..|.+||+-.|.|+|+-|||...
T Consensus 46 lE~al~g~~~Ge~~~V~IpPE~AfGe~~ 73 (174)
T COG1047 46 LEEALLGKEVGEEFTVEIPPEDAFGEYD 73 (174)
T ss_pred HHHHHhCCCCCceeEEEeCchHhcCCCC
Confidence 5889999999999999999999999876
No 174
>PRK10737 FKBP-type peptidyl-prolyl cis-trans isomerase; Provisional
Probab=95.27 E-value=0.0086 Score=41.37 Aligned_cols=28 Identities=21% Similarity=0.392 Sum_probs=26.0
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCC
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKG 28 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G 28 (131)
||.+|..|.+|++..|+|+|+-|||...
T Consensus 45 lE~aL~G~~~Gd~~~v~l~peeAyGe~d 72 (196)
T PRK10737 45 LETALEGHEVGDKFDVAVGANDAYGQYD 72 (196)
T ss_pred HHHHHcCCCCCCEEEEEEChHHhcCCCC
Confidence 5889999999999999999999999766
No 175
>KOG1127|consensus
Probab=95.13 E-value=0.083 Score=44.51 Aligned_cols=61 Identities=23% Similarity=0.109 Sum_probs=55.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCC-hHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGS-LEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~-~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|++||+...+ +.+|++-.++.-++.++..+++ .++|-..+-.|.++||+|.-||--.|..|
T Consensus 18 YeealEqskkvLk~dpdNYnA~vFLGvAl~sl~q~le~A~ehYv~AaKldpdnlLAWkGL~nLy 81 (1238)
T KOG1127|consen 18 YEEALEQSKKVLKEDPDNYNAQVFLGVALWSLGQDLEKAAEHYVLAAKLDPDNLLAWKGLGNLY 81 (1238)
T ss_pred HHHHHHHHHHHHhcCCCcchhhhHHHHHHHhccCCHHHHHHHHHHHHhcChhhhHHHHHHHHHH
Confidence 9999999988 8889999999999999999998 99999999999999999998886665544
No 176
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=95.11 E-value=0.061 Score=41.12 Aligned_cols=42 Identities=29% Similarity=0.361 Sum_probs=38.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLEN 109 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~ 109 (131)
|+.|+.+..+ ++.|.....|..+|.||.++|+|+.|+...+.
T Consensus 250 ~~lAL~iAk~av~lsP~~f~~W~~La~~Yi~~~d~e~ALlaLNs 293 (395)
T PF09295_consen 250 YELALEIAKKAVELSPSEFETWYQLAECYIQLGDFENALLALNS 293 (395)
T ss_pred HHHHHHHHHHHHHhCchhHHHHHHHHHHHHhcCCHHHHHHHHhc
Confidence 8999999988 88999999999999999999999999987663
No 177
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=95.10 E-value=0.12 Score=39.50 Aligned_cols=60 Identities=12% Similarity=0.040 Sum_probs=45.4
Q ss_pred HHHHHHHHH--HhhhhhH--HHHHHHHHHHHHhCChHHHHHHHH--HHHhhCCCCHHHHHHHHHhhh
Q psy18096 69 ILASQELLD--SLIKERI--NCYNNLAQAQIKLGSLEPALMSLE--NVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 69 ~~Ai~~~~~--~~~~~~~--~~~~N~a~~~~kl~~~~~a~~~~~--~al~~dp~~~ka~~rrg~a~~ 129 (131)
..+++.+++ +..|.++ .++..+|..++++|+|++|.+.++ .+++.+|++.- +...|.++.
T Consensus 316 ~~~~~~~e~~lk~~p~~~~~~ll~sLg~l~~~~~~~~~A~~~le~a~a~~~~p~~~~-~~~La~ll~ 381 (409)
T TIGR00540 316 EKLEKLIEKQAKNVDDKPKCCINRALGQLLMKHGEFIEAADAFKNVAACKEQLDAND-LAMAADAFD 381 (409)
T ss_pred HHHHHHHHHHHHhCCCChhHHHHHHHHHHHHHcccHHHHHHHHHHhHHhhcCCCHHH-HHHHHHHHH
Confidence 344444544 5567888 888899999999999999999999 68888887765 336665543
No 178
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=95.08 E-value=0.11 Score=38.78 Aligned_cols=62 Identities=18% Similarity=0.090 Sum_probs=50.5
Q ss_pred HHHHHHHHHH-------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD-------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~-------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|++||+.-.+ +..-..+..|+-+|..++--++.+.|+....+|++.||++++|=.-.|...+
T Consensus 157 W~KAId~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~ 225 (389)
T COG2956 157 WEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVEL 225 (389)
T ss_pred HHHHHHHHHHHHHcCCccchhHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhhCccceehhhhhhHHHH
Confidence 7888887775 3334566788889999998999999999999999999999988777776654
No 179
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=95.07 E-value=0.14 Score=38.91 Aligned_cols=52 Identities=15% Similarity=0.166 Sum_probs=44.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIK 119 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~k 119 (131)
+.+++..++. +..|.++.++.-+|..+++.++|++|.+.++++++.+|++.-
T Consensus 310 ~~~al~~~e~~lk~~P~~~~l~l~lgrl~~~~~~~~~A~~~le~al~~~P~~~~ 363 (398)
T PRK10747 310 PEQLEKVLRQQIKQHGDTPLLWSTLGQLLMKHGEWQEASLAFRAALKQRPDAYD 363 (398)
T ss_pred hHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCHHH
Confidence 5566666655 667889999999999999999999999999999999998754
No 180
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=95.04 E-value=0.18 Score=38.36 Aligned_cols=62 Identities=16% Similarity=0.161 Sum_probs=38.2
Q ss_pred HHHHHHHHHH--HhhhhhH-HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERI-NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~-~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+.+|..++.+ +..|... ....-.+..++..|++++|+..++++++.+|+|..++.-.+.++.
T Consensus 134 ~~~A~~~l~~A~~~~~~~~~~~~l~~a~l~l~~g~~~~Al~~l~~~~~~~P~~~~al~ll~~~~~ 198 (398)
T PRK10747 134 EARANQHLERAAELADNDQLPVEITRVRIQLARNENHAARHGVDKLLEVAPRHPEVLRLAEQAYI 198 (398)
T ss_pred HHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHH
Confidence 4444444444 3333332 222334667777788888888888888888887777776666654
No 181
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=95.03 E-value=0.24 Score=32.07 Aligned_cols=48 Identities=27% Similarity=0.231 Sum_probs=40.2
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhh
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLI 129 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~ 129 (131)
....+.+.+|.++...|++++|+..+++++...|+. ..+.+|.+.+++
T Consensus 46 ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~ 96 (145)
T PF09976_consen 46 YAALAALQLAKAAYEQGDYDEAKAALEKALANAPDPELKPLARLRLARILL 96 (145)
T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHH
Confidence 446678889999999999999999999999987655 457888888775
No 182
>KOG3081|consensus
Probab=94.98 E-value=0.1 Score=37.82 Aligned_cols=58 Identities=21% Similarity=0.094 Sum_probs=47.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKA 125 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg 125 (131)
++.|.-+|+. +--+......+-.|.|++.+++|++|......+|..|++++..+-+.-
T Consensus 189 ~qdAfyifeE~s~k~~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL~Nli 248 (299)
T KOG3081|consen 189 IQDAFYIFEELSEKTPPTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPETLANLI 248 (299)
T ss_pred hhhHHHHHHHHhcccCCChHHHccHHHHHHHhcCHHHHHHHHHHHHhccCCCHHHHHHHH
Confidence 5555555554 334567778889999999999999999999999999999999887654
No 183
>KOG4555|consensus
Probab=94.97 E-value=0.053 Score=35.45 Aligned_cols=41 Identities=32% Similarity=0.367 Sum_probs=36.1
Q ss_pred HHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 90 LAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 90 ~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
.+.+...-|+.+.|++-|.++|.+-|.++-+|.+|++++-+
T Consensus 49 ~~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RL 89 (175)
T KOG4555|consen 49 KAIALAEAGDLDGALELFGQALCLAPERASAYNNRAQALRL 89 (175)
T ss_pred HHHHHHhccchHHHHHHHHHHHHhcccchHhhccHHHHHHH
Confidence 35556677999999999999999999999999999999853
No 184
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=94.84 E-value=0.18 Score=39.93 Aligned_cols=50 Identities=16% Similarity=0.171 Sum_probs=42.7
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
++..+.+|.-+|..++..|++++|...+++|+.++| +..+|...|+++..
T Consensus 416 ~~~~~~~~~ala~~~~~~g~~~~A~~~l~rAl~L~p-s~~a~~~lG~~~~~ 465 (517)
T PRK10153 416 LNVLPRIYEILAVQALVKGKTDEAYQAINKAIDLEM-SWLNYVLLGKVYEL 465 (517)
T ss_pred CcCChHHHHHHHHHHHhcCCHHHHHHHHHHHHHcCC-CHHHHHHHHHHHHH
Confidence 344556777888899999999999999999999999 57899999988753
No 185
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=94.84 E-value=0.091 Score=26.50 Aligned_cols=30 Identities=20% Similarity=0.424 Sum_probs=25.9
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
.+|.-+|-+-+..++|++|+.|+.++|++.
T Consensus 2 dv~~~Lgeisle~e~f~qA~~D~~~aL~i~ 31 (38)
T PF10516_consen 2 DVYDLLGEISLENENFEQAIEDYEKALEIQ 31 (38)
T ss_pred cHHHHHHHHHHHhccHHHHHHHHHHHHHHH
Confidence 357778888888899999999999999874
No 186
>KOG0549|consensus
Probab=94.78 E-value=0.015 Score=39.54 Aligned_cols=29 Identities=34% Similarity=0.622 Sum_probs=25.9
Q ss_pred CcchhcccCCCcEEEEEEcCCCCCCCCCC
Q psy18096 1 MDYVLPLMETGEECQIEITARFGYGDKGE 29 (131)
Q Consensus 1 ld~~l~~M~~gE~~~v~i~~~~ayG~~G~ 29 (131)
+|.++..|+.||+..++++|+.+||..|.
T Consensus 12 m~~~~~g~c~ge~rkvv~pp~l~fg~~~~ 40 (188)
T KOG0549|consen 12 MDQALEGMCNGEKRKVVIPPHLGFGEGGR 40 (188)
T ss_pred HHHHhhhhhccccceeccCCccccccccc
Confidence 47899999999999999999999995554
No 187
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=94.71 E-value=0.12 Score=37.07 Aligned_cols=49 Identities=18% Similarity=0.205 Sum_probs=42.6
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+....++.-.+..+...|+|.+|+..+.++..++|+|.++|--+|-+|.
T Consensus 97 ~~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaald 145 (257)
T COG5010 97 PKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALD 145 (257)
T ss_pred cccHHHHHHHHHHHHHhcchHHHHHHHHHHhccCCCChhhhhHHHHHHH
Confidence 4444455558899999999999999999999999999999999998875
No 188
>KOG1156|consensus
Probab=94.50 E-value=0.14 Score=41.17 Aligned_cols=62 Identities=18% Similarity=0.128 Sum_probs=54.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|.+||++|.. .+.+.+..++.-+|.-+..|++|+.....=.+.|+++|.+.-.|+--+.++.
T Consensus 91 Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmRd~~~~~~tr~~LLql~~~~ra~w~~~Avs~~ 154 (700)
T KOG1156|consen 91 YDEAIKCYRNALKIEKDNLQILRDLSLLQIQMRDYEGYLETRNQLLQLRPSQRASWIGFAVAQH 154 (700)
T ss_pred HHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHHHhhhhHHHHHHHHHHhhhhhHHHHHHHHHHHH
Confidence 9999999988 7788899999999999999999999999889999999988887777766654
No 189
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=94.40 E-value=0.067 Score=24.34 Aligned_cols=24 Identities=25% Similarity=0.174 Sum_probs=21.1
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHH
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLE 108 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~ 108 (131)
.+..++|.++..+|++++|...++
T Consensus 2 ~a~~~la~~~~~~G~~~eA~~~l~ 25 (26)
T PF07721_consen 2 RARLALARALLAQGDPDEAERLLR 25 (26)
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHh
Confidence 467899999999999999998765
No 190
>KOG4151|consensus
Probab=94.35 E-value=0.046 Score=44.57 Aligned_cols=50 Identities=24% Similarity=0.210 Sum_probs=45.1
Q ss_pred hhhhHHHHHHHHHHHHHh--CChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 80 IKERINCYNNLAQAQIKL--GSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl--~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
....+.++.|++.|++.+ ++|..++.+|+-|+...|...+++++|+.+|-
T Consensus 87 ~~~~a~~~~~~~s~~m~~~l~~~~~~~~E~~la~~~~p~i~~~Ll~r~~~y~ 138 (748)
T KOG4151|consen 87 HHVVATLRSNQASCYMQLGLGEYPKAIPECELALESQPRISKALLKRARKYE 138 (748)
T ss_pred chhhhhHHHHHHHHHhhcCccchhhhcCchhhhhhccchHHHHHhhhhhHHH
Confidence 456788999999999865 79999999999999999999999999988874
No 191
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=94.28 E-value=0.28 Score=37.39 Aligned_cols=62 Identities=18% Similarity=0.211 Sum_probs=43.1
Q ss_pred HHHHHHHHHH--Hhhhhh-HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKER-INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~-~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+.+|..++.+ +..|.. ..+..-++..++..|+++.|+..++..++..|++..++.-.+.+++
T Consensus 134 ~~~A~~~l~~a~~~~p~~~l~~~~~~a~l~l~~~~~~~Al~~l~~l~~~~P~~~~~l~ll~~~~~ 198 (409)
T TIGR00540 134 EARANQHLEEAAELAGNDNILVEIARTRILLAQNELHAARHGVDKLLEMAPRHKEVLKLAEEAYI 198 (409)
T ss_pred HHHHHHHHHHHHHhCCcCchHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Confidence 4444444444 222333 2344556788888899999999999999999999888888777765
No 192
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=94.26 E-value=0.11 Score=27.04 Aligned_cols=27 Identities=41% Similarity=0.442 Sum_probs=24.5
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
.+++|.+|+.+|+++.|..-.++++.-
T Consensus 2 kLdLA~ayie~Gd~e~Ar~lL~evl~~ 28 (44)
T TIGR03504 2 KLDLARAYIEMGDLEGARELLEEVIEE 28 (44)
T ss_pred chHHHHHHHHcCChHHHHHHHHHHHHc
Confidence 368999999999999999999999953
No 193
>KOG2796|consensus
Probab=94.25 E-value=0.085 Score=38.44 Aligned_cols=49 Identities=27% Similarity=0.261 Sum_probs=41.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPN 116 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~ 116 (131)
|-.|.+.|++ +.++..+.+-+|.|+|++-+|+..+|++..+.+++.+|.
T Consensus 268 ~a~a~r~~~~i~~~D~~~~~a~NnKALcllYlg~l~DAiK~~e~~~~~~P~ 318 (366)
T KOG2796|consen 268 FAEAHRFFTEILRMDPRNAVANNNKALCLLYLGKLKDALKQLEAMVQQDPR 318 (366)
T ss_pred hHHHHHHHhhccccCCCchhhhchHHHHHHHHHHHHHHHHHHHHHhccCCc
Confidence 7788887877 667788888889999999999999999999999888884
No 194
>KOG1156|consensus
Probab=94.22 E-value=0.16 Score=40.87 Aligned_cols=47 Identities=17% Similarity=0.244 Sum_probs=40.8
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
++-...||--+|+.+-.-++|++||.++..||.++|+|...|...+.
T Consensus 71 d~~S~vCwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlsl 117 (700)
T KOG1156|consen 71 DLKSHVCWHVLGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSL 117 (700)
T ss_pred CcccchhHHHHHHHHhhhhhHHHHHHHHHHHHhcCCCcHHHHHHHHH
Confidence 45566788889999999999999999999999999999998877654
No 195
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=94.09 E-value=0.21 Score=31.79 Aligned_cols=45 Identities=27% Similarity=0.191 Sum_probs=37.3
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLI 129 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~ 129 (131)
.+.+++|-++-.+|+.++|+..+.+++....+. ..++...|.++.
T Consensus 2 ~~~~~~A~a~d~~G~~~~Ai~~Y~~Al~~gL~~~~~~~a~i~lastlr 49 (120)
T PF12688_consen 2 RALYELAWAHDSLGREEEAIPLYRRALAAGLSGADRRRALIQLASTLR 49 (120)
T ss_pred chHHHHHHHHHhcCCHHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHH
Confidence 467899999999999999999999999975443 567777777664
No 196
>KOG2471|consensus
Probab=94.06 E-value=0.23 Score=39.17 Aligned_cols=48 Identities=17% Similarity=0.212 Sum_probs=44.2
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
.-.+.+|.+..|+..|++..|..++.++.+.-..|+..|.|.+.|.+.
T Consensus 334 s~eilYNcG~~~Lh~grPl~AfqCf~~av~vfh~nPrlWLRlAEcCim 381 (696)
T KOG2471|consen 334 SMEILYNCGLLYLHSGRPLLAFQCFQKAVHVFHRNPRLWLRLAECCIM 381 (696)
T ss_pred chhhHHhhhHHHHhcCCcHHHHHHHHHHHHHHhcCcHHHHHHHHHHHH
Confidence 446788999999999999999999999999999999999999998764
No 197
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=93.95 E-value=0.078 Score=40.60 Aligned_cols=45 Identities=18% Similarity=0.141 Sum_probs=40.0
Q ss_pred HHHHHHHHHH----------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD----------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~----------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
|..|++.++. ...+-.++.++..|.||+.|++|.+|++.+..+|.
T Consensus 138 Y~~Alk~l~~idl~~~~l~~~V~~~~is~~YyvGFaylMlrRY~DAir~f~~iL~ 192 (404)
T PF10255_consen 138 YYQALKVLENIDLNKKGLYTKVPACHISTYYYVGFAYLMLRRYADAIRTFSQILL 192 (404)
T ss_pred HHHHHHHhhccCcccchhhccCcchheehHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 9999999887 33567888999999999999999999999999885
No 198
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=93.87 E-value=0.34 Score=24.02 Aligned_cols=32 Identities=22% Similarity=0.211 Sum_probs=25.2
Q ss_pred HHHHHHHHHHHhCChHHHHHHHH--HHHhhCCCC
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLE--NVLNLDPNN 117 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~--~al~~dp~~ 117 (131)
.+.-+|.++..+|+|++|++.++ -+..+||.|
T Consensus 3 ~~y~~a~~~y~~~ky~~A~~~~~y~~l~~ld~~n 36 (36)
T PF07720_consen 3 YLYGLAYNFYQKGKYDEAIHFFQYAFLCALDKYN 36 (36)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHHHHHHHTTT-
T ss_pred HHHHHHHHHHHHhhHHHHHHHHHHHHHHHhcccC
Confidence 45567899999999999999944 888888875
No 199
>PRK14574 hmsH outer membrane protein; Provisional
Probab=93.79 E-value=0.38 Score=40.28 Aligned_cols=62 Identities=21% Similarity=0.105 Sum_probs=52.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+.+|.+.++. ...|.+..+.+++|.++...|.+.+|....+.++.++|++..+.+-+|.+++
T Consensus 432 l~~Ae~~le~l~~~aP~n~~l~~~~A~v~~~Rg~p~~A~~~~k~a~~l~P~~~~~~~~~~~~al 495 (822)
T PRK14574 432 LPTAQKKLEDLSSTAPANQNLRIALASIYLARDLPRKAEQELKAVESLAPRSLILERAQAETAM 495 (822)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHhhhCCccHHHHHHHHHHHH
Confidence 7888888887 6678888888888888888899999988888888888888888887777664
No 200
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=93.51 E-value=0.52 Score=34.84 Aligned_cols=47 Identities=19% Similarity=0.214 Sum_probs=39.3
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
-+...+.+++.-.|..|++.|.+.+|+..|.+++.+||-+...++-.
T Consensus 273 rle~ly~kllgkva~~yle~g~~neAi~l~qr~ltldpL~e~~nk~l 319 (361)
T COG3947 273 RLEQLYMKLLGKVARAYLEAGKPNEAIQLHQRALTLDPLSEQDNKGL 319 (361)
T ss_pred HHHHHHHHHHHHHHHHHHHcCChHHHHHHHHHHhhcChhhhHHHHHH
Confidence 34567788888889999999999999999999999999776555433
No 201
>KOG4340|consensus
Probab=93.51 E-value=0.3 Score=36.37 Aligned_cols=61 Identities=18% Similarity=-0.001 Sum_probs=51.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|..||+++.. +-.|.....++-++-||....+|..|-.++++.-.+-|...+--+.-++.+
T Consensus 26 y~DaI~~l~s~~Er~p~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~qYrlY~AQSL 88 (459)
T KOG4340|consen 26 YADAIQLLGSELERSPRSRAGLSLLGYCYYRLQEFALAAECYEQLGQLHPELEQYRLYQAQSL 88 (459)
T ss_pred HHHHHHHHHHHHhcCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhChHHHHHHHHHHHHH
Confidence 9999999998 666777788889999999999999999999999999998876555555543
No 202
>KOG4162|consensus
Probab=93.49 E-value=0.39 Score=39.42 Aligned_cols=52 Identities=21% Similarity=0.212 Sum_probs=47.7
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
.+.+..+..|+=+|.++...|++.+|...+..|+.+||+++.+.--.|+++.
T Consensus 678 ~~~~l~~~~~~~~G~~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Ala~~ll 729 (799)
T KOG4162|consen 678 KIDPLSASVYYLRGLLLEVKGQLEEAKEAFLVALALDPDHVPSMTALAELLL 729 (799)
T ss_pred hcchhhHHHHHHhhHHHHHHHhhHHHHHHHHHHHhcCCCCcHHHHHHHHHHH
Confidence 5678889999999999999999999999999999999999988888887775
No 203
>KOG2003|consensus
Probab=93.38 E-value=0.32 Score=38.14 Aligned_cols=61 Identities=18% Similarity=0.074 Sum_probs=39.9
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.+|...|.. .-+..-..+++|.++.+-++|+.++|++++-+.-.+--+|+..++.-+..|
T Consensus 506 ~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~klh~il~nn~evl~qianiy 568 (840)
T KOG2003|consen 506 LDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKLHAILLNNAEVLVQIANIY 568 (840)
T ss_pred HHHHHHHHHHHHcCchHHHHHHHHhcccHHHhcCHHHHHHHHHHHHHHHHhhHHHHHHHHHHH
Confidence 7777777766 223445566777777777777777777777665555556666666655544
No 204
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=93.36 E-value=0.46 Score=32.74 Aligned_cols=48 Identities=19% Similarity=0.180 Sum_probs=39.4
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhhc
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLIL 130 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~~ 130 (131)
....+...|...+..|+|.+|+..+++++..-|.+ .+|++..|.++..
T Consensus 4 ~~~~lY~~a~~~~~~g~y~~Ai~~f~~l~~~~P~s~~a~~A~l~la~a~y~ 54 (203)
T PF13525_consen 4 TAEALYQKALEALQQGDYEEAIKLFEKLIDRYPNSPYAPQAQLMLAYAYYK 54 (203)
T ss_dssp -HHHHHHHHHHHHHCT-HHHHHHHHHHHHHH-TTSTTHHHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHH
Confidence 45678889999999999999999999999988765 6889999988764
No 205
>PRK10941 hypothetical protein; Provisional
Probab=93.15 E-value=0.8 Score=33.29 Aligned_cols=60 Identities=15% Similarity=-0.009 Sum_probs=53.6
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKC 127 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a 127 (131)
|.+|+++.+. .+.|..+.-+--||..|.++|.+..|+.|.+.-++.-|+++.+-.=+.+.
T Consensus 197 ~~~AL~~~e~ll~l~P~dp~e~RDRGll~~qL~c~~~A~~DL~~fl~~~P~dp~a~~ik~ql 258 (269)
T PRK10941 197 MELALRASEALLQFDPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRAQI 258 (269)
T ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHHHhCCCchhHHHHHHHH
Confidence 9999999988 88899998888999999999999999999999999999998776655443
No 206
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=93.15 E-value=0.7 Score=36.16 Aligned_cols=60 Identities=17% Similarity=0.014 Sum_probs=49.7
Q ss_pred HHHHHHHHHH------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
-.+||+.|+. +..+...-++.-++.|++-+.+|++|..++.+.++.+ ++.||+|.-..|-
T Consensus 283 ~~~Ai~~~~~a~~~q~~~~Ql~~l~~~El~w~~~~~~~w~~A~~~f~~L~~~s-~WSka~Y~Y~~a~ 348 (468)
T PF10300_consen 283 LEEAIESFERAIESQSEWKQLHHLCYFELAWCHMFQHDWEEAAEYFLRLLKES-KWSKAFYAYLAAA 348 (468)
T ss_pred HHHHHHHHHHhccchhhHHhHHHHHHHHHHHHHHHHchHHHHHHHHHHHHhcc-ccHHHHHHHHHHH
Confidence 7888888886 4556677889999999999999999999999988754 6789998875543
No 207
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=93.01 E-value=0.072 Score=37.77 Aligned_cols=51 Identities=20% Similarity=-0.061 Sum_probs=48.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
|..|.+.++. |++|.+--++.||+++..--|+|.-|.+++.+-.+-||+++
T Consensus 115 fdaa~eaFds~~ELDp~y~Ya~lNRgi~~YY~gR~~LAq~d~~~fYQ~D~~DP 167 (297)
T COG4785 115 FDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRYKLAQDDLLAFYQDDPNDP 167 (297)
T ss_pred chHHHHHhhhHhccCCcchHHHhccceeeeecCchHhhHHHHHHHHhcCCCCh
Confidence 8999999999 99999999999999999999999999999999999999886
No 208
>KOG3824|consensus
Probab=92.81 E-value=0.21 Score=37.24 Aligned_cols=60 Identities=17% Similarity=0.090 Sum_probs=49.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKC 127 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a 127 (131)
-++|.+++.. .+.|.++.++...+...-.-++.-+|-.++-+||.++|.|.+|+-.|+..
T Consensus 132 ~ekA~~lfeHAlalaP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseALvnR~RT 193 (472)
T KOG3824|consen 132 LEKAMTLFEHALALAPTNPQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEALVNRART 193 (472)
T ss_pred hHHHHHHHHHHHhcCCCCHHHHHHHhHHHHhhhhhHhhhhhhheeeeeCCCchHHHhhhhcc
Confidence 5666666666 67888888888888888877888889999999999999999999888754
No 209
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=92.37 E-value=0.79 Score=33.21 Aligned_cols=50 Identities=18% Similarity=0.161 Sum_probs=40.1
Q ss_pred HHHHHHHHHH--------HhhhhhH-HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 68 GILASQELLD--------SLIKERI-NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 68 y~~Ai~~~~~--------~~~~~~~-~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
|.+|+.+|++ .+....+ ..+++..+|++.+|++..|...+++..+.||.-
T Consensus 171 y~~A~~~~e~~~~~~l~~~l~~~~~~~~~l~a~l~~L~~~D~v~A~~~~~~~~~~~~~F 229 (282)
T PF14938_consen 171 YEEAIEIYEEVAKKCLENNLLKYSAKEYFLKAILCHLAMGDYVAARKALERYCSQDPSF 229 (282)
T ss_dssp HHHHHHHHHHHHHTCCCHCTTGHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHGTTSTTS
T ss_pred HHHHHHHHHHHHHHhhcccccchhHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 9999999998 1122233 467899999999999999999999999999843
No 210
>KOG1586|consensus
Probab=92.36 E-value=1 Score=32.31 Aligned_cols=62 Identities=19% Similarity=0.141 Sum_probs=43.3
Q ss_pred HHHHHHHHHH--------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC------HHHHHHH-HHhhh
Q psy18096 68 GILASQELLD--------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN------IKALQRK-AKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~------~ka~~rr-g~a~~ 129 (131)
+.+||.+|+. +....-..+++-.|.---.+++|.+||+.++++....-+| +|.||-+ |.+++
T Consensus 130 ~ekaI~~YE~Aae~yk~ees~ssANKC~lKvA~yaa~leqY~~Ai~iyeqva~~s~~n~LLKys~KdyflkAgLChl 206 (288)
T KOG1586|consen 130 FEKAIAHYEQAAEYYKGEESVSSANKCLLKVAQYAAQLEQYSKAIDIYEQVARSSLDNNLLKYSAKDYFLKAGLCHL 206 (288)
T ss_pred HHHHHHHHHHHHHHHcchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchHHHhHHHHHHHHHHHHhH
Confidence 5666666665 4455556677777777777899999999999998766554 5666544 55554
No 211
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=92.10 E-value=0.88 Score=29.25 Aligned_cols=44 Identities=23% Similarity=0.032 Sum_probs=38.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVL 111 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al 111 (131)
|.+|+..+.. .++|..-.+|..+-.+|..+|++.+|++.+.+..
T Consensus 78 ~~~a~~~~~~~l~~dP~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~~ 123 (146)
T PF03704_consen 78 YEEALRLLQRALALDPYDEEAYRLLMRALAAQGRRAEALRVYERYR 123 (146)
T ss_dssp HHHHHHHHHHHHHHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHCcCHHHHHHHHHHHH
Confidence 8888888888 8899999999999999999999999999888763
No 212
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=92.04 E-value=0.52 Score=34.08 Aligned_cols=62 Identities=21% Similarity=0.109 Sum_probs=43.2
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH---HHHHHHHHhhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI---KALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~---ka~~rrg~a~~ 129 (131)
|..|...|.. .-.+....+++=++.|+..+|+|++|...+..+.+--|++. .++|..|.++.
T Consensus 157 y~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~~~~ 226 (262)
T COG1729 157 YAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLG 226 (262)
T ss_pred HHHHHHHHHHHHHcCCCCcccchhHHHHHHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHHHHHHHHH
Confidence 4444444444 11344556777788999999999999999999888777664 55887777653
No 213
>KOG1127|consensus
Probab=91.93 E-value=0.43 Score=40.51 Aligned_cols=59 Identities=19% Similarity=0.039 Sum_probs=47.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
+.+||..+.. ..+|....+|.-++.+|...|+|..|++.+++|..++|.+.-+-|..+.
T Consensus 578 ~h~aV~~fQsALR~dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~y~~fk~A~ 638 (1238)
T KOG1127|consen 578 LHGAVCEFQSALRTDPKDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSKYGRFKEAV 638 (1238)
T ss_pred hhhHHHHHHHHhcCCchhHHHHHHHHHHHHhcCceehHHHhhhhhHhcCcHhHHHHHHHHH
Confidence 6777777777 6678888888888888888888888888888888888888777666553
No 214
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=91.79 E-value=0.6 Score=33.79 Aligned_cols=44 Identities=20% Similarity=0.157 Sum_probs=40.1
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhhc
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLIL 130 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~~ 130 (131)
.+|.|.-+++.|+|.+|..-|..=++.=|++ ..|+||.|+++..
T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~ 190 (262)
T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYA 190 (262)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCcccchhHHHHHHHHHh
Confidence 7899999999999999999999999998877 5799999998863
No 215
>TIGR00115 tig trigger factor. Trigger factor is a ribosome-associated molecular chaperone and is the first chaperone to interact with nascent polypeptide. Trigger factor can bind at the same time as the signal recognition particle (SRP), but is excluded by the SRP receptor (FtsY). The central domain of trigger factor has peptidyl-prolyl cis/trans isomerase activity. This protein is found in a single copy in virtually every bacterial genome.
Probab=91.66 E-value=0.31 Score=37.26 Aligned_cols=45 Identities=24% Similarity=0.302 Sum_probs=36.2
Q ss_pred cchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccccCC
Q psy18096 2 DYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSVLPD 52 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~~~ 52 (131)
+.+|.+|++|+...|.++....|+..+. .|.++.|++++.++.+.
T Consensus 188 ee~L~G~k~Gd~~~~~v~~p~~~~~~~~------~gk~~~f~v~i~~I~~~ 232 (408)
T TIGR00115 188 EEQLVGMKAGEEKEIKVTFPEDYHAEEL------AGKEATFKVTVKEVKEK 232 (408)
T ss_pred HHHhCCCCCCCeeEEEecCccccCcccC------CCCeEEEEEEEEEeccC
Confidence 5689999999999999986666665543 57788999999998653
No 216
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=91.55 E-value=1.8 Score=27.93 Aligned_cols=61 Identities=34% Similarity=0.307 Sum_probs=40.1
Q ss_pred HHHHHHHHHH--Hhhhh-hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKE-RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~-~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
+..|+..+.. ...+. ....+.+++.++...+.+.+|+..+..++...|.....++..+..+
T Consensus 183 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~ 246 (291)
T COG0457 183 YEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLL 246 (291)
T ss_pred HHHHHHHHHHHHhhCcccchHHHHHhhHHHHHcccHHHHHHHHHHHHhhCcccHHHHhhHHHHH
Confidence 5666666665 33344 4666777777777777777777777777777776555555555443
No 217
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=91.34 E-value=1.2 Score=26.81 Aligned_cols=40 Identities=23% Similarity=0.272 Sum_probs=34.4
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
+.+|....+...+|.+++..|++++|++.+-.++..|+++
T Consensus 16 a~~P~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~ 55 (90)
T PF14561_consen 16 AANPDDLDARYALADALLAAGDYEEALDQLLELVRRDRDY 55 (90)
T ss_dssp HHSTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC-TTC
T ss_pred HcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccc
Confidence 5678899999999999999999999999999999999877
No 218
>PF14863 Alkyl_sulf_dimr: Alkyl sulfatase dimerisation; PDB: 2YHE_C 2CG2_A 2CG3_A 2CFU_A 2CFZ_A.
Probab=91.31 E-value=1.3 Score=29.03 Aligned_cols=45 Identities=31% Similarity=0.268 Sum_probs=36.3
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
.....+|...+..|+|..|.+-++.++..||+|..|-.-++.+|.
T Consensus 71 d~vl~~A~~~~~~gd~~wA~~L~d~l~~adp~n~~ar~l~A~al~ 115 (141)
T PF14863_consen 71 DKVLERAQAALAAGDYQWAAELLDHLVFADPDNEEARQLKADALE 115 (141)
T ss_dssp HHHHHHHHHHHHCT-HHHHHHHHHHHHHH-TT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHH
Confidence 345567888889999999999999999999999999888888764
No 219
>KOG3785|consensus
Probab=91.01 E-value=1 Score=34.34 Aligned_cols=42 Identities=19% Similarity=-0.097 Sum_probs=24.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLEN 109 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~ 109 (131)
|++|+..|+- +.+.....+..|+|-|+.-+|.|.+|.....+
T Consensus 73 Y~~Al~~Y~~~~~~~~~~~el~vnLAcc~FyLg~Y~eA~~~~~k 116 (557)
T KOG3785|consen 73 YEEALNVYTFLMNKDDAPAELGVNLACCKFYLGQYIEAKSIAEK 116 (557)
T ss_pred HHHHHHHHHHHhccCCCCcccchhHHHHHHHHHHHHHHHHHHhh
Confidence 6666666655 33333455566666666666666666554444
No 220
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=90.65 E-value=0.65 Score=39.22 Aligned_cols=52 Identities=15% Similarity=0.043 Sum_probs=47.5
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
..+|.+..++.-+...|...+++++|+..|+.+++..|+...+||-.|..+.
T Consensus 25 ~~~p~n~~a~~~Li~~~~~~~~~deai~i~~~~l~~~P~~i~~yy~~G~l~~ 76 (906)
T PRK14720 25 NYSLSKFKELDDLIDAYKSENLTDEAKDICEEHLKEHKKSISALYISGILSL 76 (906)
T ss_pred cCCcchHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCCcceehHHHHHHHHH
Confidence 4568899999999999999999999999999999999999999999998543
No 221
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=90.18 E-value=1.7 Score=30.95 Aligned_cols=48 Identities=17% Similarity=0.033 Sum_probs=39.1
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHH---HHHHHHhhhc
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKA---LQRKAKCLIL 130 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka---~~rrg~a~~~ 130 (131)
....+...|.-.++.|+|++|+..+++++...|....+ .++.|.++..
T Consensus 31 ~~~~~Y~~A~~~~~~g~y~~Ai~~f~~l~~~yP~s~~a~~a~l~la~ayy~ 81 (243)
T PRK10866 31 PPSEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYK 81 (243)
T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHHHh
Confidence 45556678888888999999999999999999988544 4888887753
No 222
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=90.05 E-value=0.8 Score=32.80 Aligned_cols=51 Identities=25% Similarity=0.306 Sum_probs=43.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
+..|.+.|++ ++.|.-..-|..++.-..|.|+++.|.+.+.++|++||.+.
T Consensus 11 ~~aaaely~qal~lap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~ 63 (287)
T COG4976 11 AEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDH 63 (287)
T ss_pred hHHHHHHHHHHhhcCchhhhhhhhcchhhhhcccHHHHHHHHHHHHcCCcccc
Confidence 4456667776 67788888899999999999999999999999999999774
No 223
>KOG4642|consensus
Probab=89.86 E-value=0.24 Score=35.56 Aligned_cols=40 Identities=10% Similarity=0.179 Sum_probs=34.6
Q ss_pred HHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 91 AQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 91 a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
+.++..-+.|..||.++.+||.++|..+.-|-+|+.+|+.
T Consensus 17 gnk~f~~k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk 56 (284)
T KOG4642|consen 17 GNKCFIPKRYDDAIDCYSRAICINPTVASYYTNRALCHLK 56 (284)
T ss_pred cccccchhhhchHHHHHHHHHhcCCCcchhhhhHHHHHHH
Confidence 4455667889999999999999999999999999998864
No 224
>PF10579 Rapsyn_N: Rapsyn N-terminal myristoylation and linker region; InterPro: IPR019568 Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=89.44 E-value=3 Score=24.63 Aligned_cols=46 Identities=20% Similarity=0.169 Sum_probs=36.0
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
..+||...++ .-.+....++.-++.+|..+|+|.++++....-+.+
T Consensus 22 ~~~Al~~W~~aL~k~~~~~~rf~~lG~l~qA~~e~Gkyr~~L~fA~~Q~~~ 72 (80)
T PF10579_consen 22 TQQALQKWRKALEKITDREDRFRVLGYLIQAHMEWGKYREMLAFALQQLEI 72 (80)
T ss_pred HHHHHHHHHHHHhhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5677777766 113567788899999999999999999988776654
No 225
>KOG4340|consensus
Probab=89.42 E-value=1 Score=33.67 Aligned_cols=46 Identities=17% Similarity=0.054 Sum_probs=41.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
|+.|++-++. ++....+-+-+|+|+|+.+.++|..|++...+.++.
T Consensus 160 yEaAvqkFqaAlqvsGyqpllAYniALaHy~~~qyasALk~iSEIieR 207 (459)
T KOG4340|consen 160 YEAAVQKFQAALQVSGYQPLLAYNLALAHYSSRQYASALKHISEIIER 207 (459)
T ss_pred HHHHHHHHHHHHhhcCCCchhHHHHHHHHHhhhhHHHHHHHHHHHHHh
Confidence 9999999988 677778888899999999999999999999998864
No 226
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=89.34 E-value=1.1 Score=32.60 Aligned_cols=50 Identities=20% Similarity=0.189 Sum_probs=40.9
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
.........|+=..++.-++|+.|..+.++.|.++|.++.-+-.||.+|.
T Consensus 177 ~~il~rll~~lk~~~~~e~~~~~al~~~~r~l~l~P~dp~eirDrGliY~ 226 (269)
T COG2912 177 REILSRLLRNLKAALLRELQWELALRVAERLLDLNPEDPYEIRDRGLIYA 226 (269)
T ss_pred HHHHHHHHHHHHHHHHHhhchHHHHHHHHHHHhhCCCChhhccCcHHHHH
Confidence 34556677788888888899999999999999999998888888887775
No 227
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=88.85 E-value=1.8 Score=31.79 Aligned_cols=49 Identities=22% Similarity=0.097 Sum_probs=38.8
Q ss_pred hhhHHHHHHHHHHHHHh------CChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 81 KERINCYNNLAQAQIKL------GSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl------~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
.....++.-+|.....+ +..++++..+.++++++|++.|+|++-|..+.
T Consensus 249 ~~~a~~~l~~a~w~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~k~~~~~a~~~~ 303 (352)
T PF02259_consen 249 ELKAKAFLLLAKWLDELYSKLSSESSDEILKYYKEATKLDPSWEKAWHSWALFND 303 (352)
T ss_pred HHHHHHHHHHHHHHHhhccccccccHHHHHHHHHHHHHhChhHHHHHHHHHHHHH
Confidence 34455666666666666 78888999999999999999999999887653
No 228
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=88.72 E-value=3 Score=26.90 Aligned_cols=45 Identities=22% Similarity=0.013 Sum_probs=33.6
Q ss_pred HHHHHHHHHH--Hhhh----hhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD--SLIK----ERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~----~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
-.+|+.+|++ ++.. .-+.+-.|||.+.--+|+.++|+..+..+-+
T Consensus 78 A~~aL~YFNRRGEL~qdeGklWIaaVfsra~Al~~~Gr~~eA~~~fr~agE 128 (144)
T PF12968_consen 78 ADRALRYFNRRGELHQDEGKLWIAAVFSRAVALEGLGRKEEALKEFRMAGE 128 (144)
T ss_dssp HHHHHHHHHHH--TTSTHHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHhhccccccccchhHHHHHHHHHHHHHhcCChHHHHHHHHHHHH
Confidence 4566777777 5543 4455667999999999999999999988765
No 229
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=88.33 E-value=3.5 Score=24.03 Aligned_cols=40 Identities=20% Similarity=0.161 Sum_probs=28.2
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHh-------hCCCCHHHHH
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLN-------LDPNNIKALQ 122 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~-------~dp~~~ka~~ 122 (131)
.+.-+..+|.-+-+.|++.+|+.++.++++ .-|++.+...
T Consensus 5 ~A~~~a~~AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~~ 51 (75)
T cd02682 5 MARKYAINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRLI 51 (75)
T ss_pred HHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHHH
Confidence 455677888888888888888877777665 4576654433
No 230
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=87.69 E-value=5 Score=28.11 Aligned_cols=59 Identities=12% Similarity=0.022 Sum_probs=42.1
Q ss_pred HHHHHHHHHH----Hh----hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH-HHHHHHHH
Q psy18096 68 GILASQELLD----SL----IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI-KALQRKAK 126 (131)
Q Consensus 68 y~~Ai~~~~~----~~----~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~-ka~~rrg~ 126 (131)
+++|++.|.+ +- ......+.+=+|..+.++|++++|+..+.+++..-..+. ..+.++|+
T Consensus 141 l~~Al~~y~~a~~~e~~~~~~~~~~~l~YLigeL~rrlg~~~eA~~~fs~vi~~~~~s~~~~l~~~AR 208 (214)
T PF09986_consen 141 LRKALEFYEEAYENEDFPIEGMDEATLLYLIGELNRRLGNYDEAKRWFSRVIGSKKASKEPKLKDMAR 208 (214)
T ss_pred HHHHHHHHHHHHHhCcCCCCCchHHHHHHHHHHHHHHhCCHHHHHHHHHHHHcCCCCCCcHHHHHHHH
Confidence 7888888888 11 122345666789999999999999999999998653332 25555554
No 231
>PF00244 14-3-3: 14-3-3 protein; InterPro: IPR023410 The 14-3-3 proteins are a large family of approximately 30kDa acidic proteins which exist primarily as homo- and heterodimeric within all eukaryotic cells [, ]. There is a high degree of sequence identity and conservation between all the 14-3-3 isotypes, particularly in the regions which form the dimer interface or line the central ligand binding channel of the dimeric molecule. Each 14-3-3 protein sequence can be roughly divided into three sections: a divergent amino terminus, the conserved core region and a divergent carboxyl terminus. The conserved middle core region of the 14-3-3s encodes an amphipathic groove that forms the main functional domain, a cradle for interacting with client proteins. The monomer consists of nine helices organised in an antiparallel manner, forming an L-shaped structure. The interior of the L-structure is composed of four helices: H3 and H5, which contain many charged and polar amino acids, and H7 and H9, which contain hydrophobic amino acids. These four helices form the concave amphipathic groove that interacts with target peptides. 14-3-3 proteins mainly bind proteins containing phosphothreonine or phosphoserine motifs however exceptions to this rule do exist. Extensive investigation of the 14-3-3 binding site of the mammalian serine/threonine kinase Raf-1 has produced a consensus sequence for 14-3-3-binding, RSxpSxP (in the single-letter amino-acid code, where x denotes any amino acid and p indicates that the next residue is phosphorylated). 14-3-3 proteins appear to effect intracellular signalling in one of three ways - by direct regulation of the catalytic activity of the bound protein, by regulating interactions between the bound protein and other molecules in the cell by sequestration or modification or by controlling the subcellular localisation of the bound ligand. Proteins appear to initially bind to a single dominant site and then subsequently to many, much weaker secondary interaction sites. The 14-3-3 dimer is capable of changing the conformation of its bound ligand whilst itself undergoing minimal structural alteration. This entry represents the structural domain found in 14-3-3 proteins.; PDB: 2O8P_A 3AXY_D 2C74_A 2C63_A 4DX0_A 1YWT_A 3P1O_A 3P1N_A 4DAU_A 3U9X_A ....
Probab=87.30 E-value=4.3 Score=28.85 Aligned_cols=45 Identities=13% Similarity=-0.045 Sum_probs=37.0
Q ss_pred HHHHHHHHHH---HhhhhhHHHHHHHHHHHHH-hCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD---SLIKERINCYNNLAQAQIK-LGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~---~~~~~~~~~~~N~a~~~~k-l~~~~~a~~~~~~al~ 112 (131)
|++|+..... ..+|....+.+|.+..|.. +|+.++|+..+.+|+.
T Consensus 149 Y~~A~~~a~~~L~~~~p~rLgl~LN~svF~yei~~~~~~A~~ia~~afd 197 (236)
T PF00244_consen 149 YEEALEIAKKELPPTHPLRLGLALNYSVFYYEILNDPEKAIEIAKQAFD 197 (236)
T ss_dssp HHHHHHHHHHHSCTTSHHHHHHHHHHHHHHHHTSS-HHHHHHHHHHHHH
T ss_pred hhhHHHHHhcccCCCCcHHHHHHHHHHHHHHHHcCChHHHHHHHHHHHH
Confidence 9999988887 4567888889999988865 8999999998888875
No 232
>PRK01490 tig trigger factor; Provisional
Probab=87.03 E-value=1.1 Score=34.55 Aligned_cols=45 Identities=24% Similarity=0.307 Sum_probs=34.4
Q ss_pred cchhcccCCCcEEEEEEcCCCCCCCCCCCCCCCCCCceEEEEEeeccccCC
Q psy18096 2 DYVLPLMETGEECQIEITARFGYGDKGEPSKSIPPGAKLYYSLTLHSVLPD 52 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~~~ 52 (131)
+.+|.+|++||...|.+...-.|+.... .|.++.|.++++++...
T Consensus 199 ee~L~G~k~Ge~~~~~~~~p~~~~~~~l------agk~~~f~v~v~~V~~~ 243 (435)
T PRK01490 199 EEQLVGMKAGEEKTIDVTFPEDYHAEDL------AGKEATFKVTVKEVKEK 243 (435)
T ss_pred HHHhCCCCCCCeeEEEecCccccccccC------CCCeEEEEEEEEEeccC
Confidence 5689999999999999876545544332 46788999999998653
No 233
>KOG2796|consensus
Probab=86.96 E-value=0.8 Score=33.55 Aligned_cols=48 Identities=27% Similarity=0.356 Sum_probs=41.1
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
....++-|.+..|+-.++|.+|...+++.+..||.++-+-..+|.+++
T Consensus 250 ~~~~V~~n~a~i~lg~nn~a~a~r~~~~i~~~D~~~~~a~NnKALcll 297 (366)
T KOG2796|consen 250 GKIMVLMNSAFLHLGQNNFAEAHRFFTEILRMDPRNAVANNNKALCLL 297 (366)
T ss_pred hhHHHHhhhhhheecccchHHHHHHHhhccccCCCchhhhchHHHHHH
Confidence 355678899999999999999999999999999998877777776654
No 234
>KOG1130|consensus
Probab=86.30 E-value=1.9 Score=33.60 Aligned_cols=61 Identities=13% Similarity=0.059 Sum_probs=37.1
Q ss_pred HHHHHHHHHH------Hh--hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC------CCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD------SL--IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD------PNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~------~~--~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d------p~~~ka~~rrg~a~ 128 (131)
|..|+++|.. ++ ....+...+.++.+|.-+.++.+||.+..+-|.+- -.-.+|+|..|.|+
T Consensus 251 fe~A~ehYK~tl~LAielg~r~vEAQscYSLgNtytll~e~~kAI~Yh~rHLaIAqeL~DriGe~RacwSLgna~ 325 (639)
T KOG1130|consen 251 FELAIEHYKLTLNLAIELGNRTVEAQSCYSLGNTYTLLKEVQKAITYHQRHLAIAQELEDRIGELRACWSLGNAF 325 (639)
T ss_pred cHhHHHHHHHHHHHHHHhcchhHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHH
Confidence 6667776666 22 12333334457777777888888887777666543 23356666666655
No 235
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=86.30 E-value=1.5 Score=19.82 Aligned_cols=27 Identities=30% Similarity=0.498 Sum_probs=23.4
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.|+.+-.+|.+.|++++|.+.+++..+
T Consensus 2 ~y~~li~~~~~~~~~~~a~~~~~~M~~ 28 (31)
T PF01535_consen 2 TYNSLISGYCKMGQFEEALEVFDEMRE 28 (31)
T ss_pred cHHHHHHHHHccchHHHHHHHHHHHhH
Confidence 477888999999999999999988654
No 236
>KOG3824|consensus
Probab=86.09 E-value=3.2 Score=31.20 Aligned_cols=44 Identities=27% Similarity=0.218 Sum_probs=38.5
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
.+-.|.|.-..+.|+-++|..-+.-||.++|.++.++.+.|+-.
T Consensus 117 ~~Al~~A~~~~~~Gk~ekA~~lfeHAlalaP~~p~~L~e~G~f~ 160 (472)
T KOG3824|consen 117 ILALKAAGRSRKDGKLEKAMTLFEHALALAPTNPQILIEMGQFR 160 (472)
T ss_pred HHHHHHHHHHHhccchHHHHHHHHHHHhcCCCCHHHHHHHhHHH
Confidence 34557777778899999999999999999999999999998753
No 237
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=85.37 E-value=6.3 Score=25.91 Aligned_cols=53 Identities=15% Similarity=-0.012 Sum_probs=44.5
Q ss_pred HHHHHHHHHH--Hhhhh---hHHHHHHHHHHHHHhCC---------------hHHHHHHHHHHHhhCCCCHHH
Q psy18096 68 GILASQELLD--SLIKE---RINCYNNLAQAQIKLGS---------------LEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~---~~~~~~N~a~~~~kl~~---------------~~~a~~~~~~al~~dp~~~ka 120 (131)
|.+|+..+++ ++.|. ..-+++-+|++++++.. ...|+.++.+++..-|++.-|
T Consensus 63 y~~A~a~~~rFirLhP~hp~vdYa~Y~~gL~~~~~~~~~~~~~~~~drD~~~~~~A~~~f~~lv~~yP~S~ya 135 (142)
T PF13512_consen 63 YEEAIAAYDRFIRLHPTHPNVDYAYYMRGLSYYEQDEGSLQSFFRSDRDPTPARQAFRDFEQLVRRYPNSEYA 135 (142)
T ss_pred HHHHHHHHHHHHHhCCCCCCccHHHHHHHHHHHHHhhhHHhhhcccccCcHHHHHHHHHHHHHHHHCcCChhH
Confidence 8999999998 55543 44578899999999987 899999999999999988644
No 238
>KOG1130|consensus
Probab=85.09 E-value=1.8 Score=33.76 Aligned_cols=46 Identities=24% Similarity=0.119 Sum_probs=35.2
Q ss_pred HHHHHHHHHH--Hh------hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 68 GILASQELLD--SL------IKERINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 68 y~~Ai~~~~~--~~------~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
|.+||..=.. ++ ..-...++.|++.||+-+|+++.|++++..++.+
T Consensus 211 f~~ai~~H~~RL~ia~efGDrAaeRRA~sNlgN~hiflg~fe~A~ehYK~tl~L 264 (639)
T KOG1130|consen 211 FDQAIHFHKLRLEIAQEFGDRAAERRAHSNLGNCHIFLGNFELAIEHYKLTLNL 264 (639)
T ss_pred HHHHHHHHHHHHHHHHHhhhHHHHHHhhcccchhhhhhcccHhHHHHHHHHHHH
Confidence 7777764444 22 2334568999999999999999999999987765
No 239
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=84.69 E-value=6.9 Score=31.10 Aligned_cols=46 Identities=26% Similarity=0.372 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC--CHHHHHHHHHhhh
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNLDPN--NIKALQRKAKCLI 129 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~--~~ka~~rrg~a~~ 129 (131)
.-+...+|.|.-|+|+.+|||+.+.+.++..|. +.-..+..-.+|+
T Consensus 259 ~y~KrRLAmCarklGr~~EAIk~~rdLlke~p~~~~l~IrenLie~LL 306 (539)
T PF04184_consen 259 VYAKRRLAMCARKLGRLREAIKMFRDLLKEFPNLDNLNIRENLIEALL 306 (539)
T ss_pred hhhHHHHHHHHHHhCChHHHHHHHHHHHhhCCccchhhHHHHHHHHHH
Confidence 445568999999999999999999999988775 3445555555544
No 240
>PF12854 PPR_1: PPR repeat
Probab=84.50 E-value=2.9 Score=20.08 Aligned_cols=27 Identities=22% Similarity=0.400 Sum_probs=23.4
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHH
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLEN 109 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~ 109 (131)
+...|+-+-.+|.+.|+.++|++.+++
T Consensus 6 d~~ty~~lI~~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 6 DVVTYNTLIDGYCKAGRVDEAFELFDE 32 (34)
T ss_pred cHhHHHHHHHHHHHCCCHHHHHHHHHh
Confidence 556788899999999999999998765
No 241
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=84.46 E-value=3.6 Score=33.61 Aligned_cols=60 Identities=13% Similarity=0.078 Sum_probs=43.9
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
+++|.+++++ ...| ....|+.+..++.+.|+++.|...+++.++++|++...|.-.+.+|
T Consensus 478 ~~eA~~~~~~~~~~p-~~~~~~~Ll~a~~~~g~~~~a~~~~~~l~~~~p~~~~~y~~L~~~y 538 (697)
T PLN03081 478 LDEAYAMIRRAPFKP-TVNMWAALLTACRIHKNLELGRLAAEKLYGMGPEKLNNYVVLLNLY 538 (697)
T ss_pred HHHHHHHHHHCCCCC-CHHHHHHHHHHHHHcCCcHHHHHHHHHHhCCCCCCCcchHHHHHHH
Confidence 5666666655 2223 4556888888888999999999999999999998876665555444
No 242
>KOG1174|consensus
Probab=84.29 E-value=4.2 Score=31.72 Aligned_cols=58 Identities=16% Similarity=0.096 Sum_probs=46.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKA 125 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg 125 (131)
.++|+-.|.. .+.|....+|.-+-.||+-.|.+.||.-..+.++..=|++++++---|
T Consensus 350 ~~~A~IaFR~Aq~Lap~rL~~Y~GL~hsYLA~~~~kEA~~~An~~~~~~~~sA~~LtL~g 409 (564)
T KOG1174|consen 350 HTQAVIAFRTAQMLAPYRLEIYRGLFHSYLAQKRFKEANALANWTIRLFQNSARSLTLFG 409 (564)
T ss_pred hHHHHHHHHHHHhcchhhHHHHHHHHHHHHhhchHHHHHHHHHHHHHHhhcchhhhhhhc
Confidence 4555555555 566788888889999999999999999999999988888888876654
No 243
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=84.03 E-value=5.9 Score=29.06 Aligned_cols=52 Identities=27% Similarity=0.376 Sum_probs=46.3
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
-+......++.-++..+.-.++++.++...++.+..||-+.++|.+.-.+|.
T Consensus 147 ~l~e~~~~~l~~lae~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~ 198 (280)
T COG3629 147 ALEELFIKALTKLAEALIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYL 198 (280)
T ss_pred HHHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHhcCccchHHHHHHHHHHH
Confidence 4567888999999999999999999999999999999999999988766654
No 244
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=83.90 E-value=2.9 Score=18.89 Aligned_cols=29 Identities=24% Similarity=0.303 Sum_probs=24.5
Q ss_pred CChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 98 GSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 98 ~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
|+++.|...+++++...|.+...|.....
T Consensus 1 ~~~~~~r~i~e~~l~~~~~~~~~W~~y~~ 29 (33)
T smart00386 1 GDIERARKIYERALEKFPKSVELWLKYAE 29 (33)
T ss_pred CcHHHHHHHHHHHHHHCCCChHHHHHHHH
Confidence 46788999999999999999988877654
No 245
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=83.50 E-value=9.4 Score=24.45 Aligned_cols=23 Identities=48% Similarity=0.631 Sum_probs=13.4
Q ss_pred HHHHhCChHHHHHHHHHHHhhCC
Q psy18096 93 AQIKLGSLEPALMSLENVLNLDP 115 (131)
Q Consensus 93 ~~~kl~~~~~a~~~~~~al~~dp 115 (131)
++..+++++.|+..+.+++..+|
T Consensus 139 ~~~~~~~~~~a~~~~~~~~~~~~ 161 (291)
T COG0457 139 ALYELGDYEEALELYEKALELDP 161 (291)
T ss_pred HHHHcCCHHHHHHHHHHHHhcCC
Confidence 55556666666666666555544
No 246
>PF01239 PPTA: Protein prenyltransferase alpha subunit repeat; InterPro: IPR002088 Protein prenylation is the posttranslational attachment of either a farnesyl group or a geranylgeranyl group via a thioether linkage (-C-S-C-) to a cysteine at or near the carboxyl terminus of the protein. Farnesyl and geranylgeranyl groups are polyisoprenes, unsaturated hydrocarbons with a multiple of five carbons; the chain is 15 carbons long in the farnesyl moiety and 20 carbons long in the geranylgeranyl moiety. There are three different protein prenyltransferases in humans: farnesyltransferase (FT) and geranylgeranyltransferase 1 (GGT1) share the same motif (the CaaX box) around the cysteine in their substrates, and are thus called CaaX prenyltransferases, whereas geranylgeranyltransferase 2 (GGT2, also called Rab geranylgeranyltransferase) recognises a different motif and is thus called a non-CaaX prenyltransferase. Protein prenyltransferases are currently known only in eukaryotes, but they are widespread, being found in vertebrates, insects, nematodes, plants, fungi and protozoa, including several parasites. Each protein consists of two subunits, alpha and beta; the alpha subunit of FT and GGT1 is encoded by the same gene, FNTA. The alpha subunit is thought to participate in a stable complex with the isoprenyl substrate; the beta subunit binds the peptide substrate. In the alpha subunits of both types of protein prenyltransferases, seven tetratricopeptide repeats are formed by pairs of helices that are stabilised by conserved intercalating residues. The alpha subunits of GGT2 in mammals and plants also have an immunoglobulin-like domain between the fifth and sixth tetratricopeptide repeat, as well as leucine-rich repeats at the carboxyl terminus. The functions of these additional domains in GGT2 are as yet undefined, but they are apparently not directly involved in the interaction with substrates and Rab escort proteins. The tetratricopeptide repeats of the alpha subunit form a right-handed superhelix, which embraces the (alpha-alpha)6 barrel of the beta subunit []. ; GO: 0008318 protein prenyltransferase activity, 0018342 protein prenylation; PDB: 1S63_A 1LD7_A 1LD8_A 2H6G_A 1SA4_A 1MZC_A 1TN6_A 2F0Y_A 2H6H_A 2H6F_A ....
Probab=83.21 E-value=3.4 Score=19.23 Aligned_cols=26 Identities=27% Similarity=0.268 Sum_probs=21.5
Q ss_pred HHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 103 ALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 103 a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
.+..+.++|..||+|--+|..|-..+
T Consensus 2 El~~~~~~l~~~pknys~W~yR~~ll 27 (31)
T PF01239_consen 2 ELEFTKKALEKDPKNYSAWNYRRWLL 27 (31)
T ss_dssp HHHHHHHHHHHSTTCHHHHHHHHHHH
T ss_pred HHHHHHHHHHHCcccccHHHHHHHHH
Confidence 46788999999999999998875543
No 247
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=82.41 E-value=4.7 Score=29.55 Aligned_cols=51 Identities=20% Similarity=0.119 Sum_probs=40.5
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHH-HHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEP-ALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~-a~~~~~~al~~dp~~~ 118 (131)
|.+|.+.+.+ +.+|..+.++.|++.|...+|+..+ +-+...+.-..+|+++
T Consensus 217 ~~eAe~~L~~al~~~~~~~d~LaNliv~~~~~gk~~~~~~~~l~qL~~~~p~h~ 270 (290)
T PF04733_consen 217 YEEAEELLEEALEKDPNDPDTLANLIVCSLHLGKPTEAAERYLSQLKQSNPNHP 270 (290)
T ss_dssp HHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHTT-TCHHHHHHHHHCHHHTTTSH
T ss_pred HHHHHHHHHHHHHhccCCHHHHHHHHHHHHHhCCChhHHHHHHHHHHHhCCCCh
Confidence 9999999998 6778889999999999999999955 5556666666888764
No 248
>KOG2376|consensus
Probab=81.89 E-value=6.9 Score=31.69 Aligned_cols=41 Identities=17% Similarity=0.088 Sum_probs=21.5
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHH
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLE 108 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~ 108 (131)
|++|+++... .........++-.|-|..+++..++|+.+.+
T Consensus 62 y~~ALk~ikk~~~~~~~~~~~fEKAYc~Yrlnk~Dealk~~~ 103 (652)
T KOG2376|consen 62 YEDALKLIKKNGALLVINSFFFEKAYCEYRLNKLDEALKTLK 103 (652)
T ss_pred HHHHHHHHHhcchhhhcchhhHHHHHHHHHcccHHHHHHHHh
Confidence 7777755554 1112222223455666666666666666555
No 249
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=81.84 E-value=5.8 Score=22.24 Aligned_cols=28 Identities=18% Similarity=0.071 Sum_probs=21.5
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.-+.+.|.-.-+.|+|.+|+..+.++++
T Consensus 6 ~~~~~~Av~~D~~g~~~~A~~~Y~~ai~ 33 (69)
T PF04212_consen 6 IELIKKAVEADEAGNYEEALELYKEAIE 33 (69)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 3456777888888888888888887775
No 250
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=81.52 E-value=1.5 Score=25.88 Aligned_cols=26 Identities=23% Similarity=0.081 Sum_probs=18.6
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+.|.|+++-..|++++|+..|.+++.
T Consensus 11 ~I~kaL~~dE~g~~e~Al~~Y~~gi~ 36 (79)
T cd02679 11 EISKALRADEWGDKEQALAHYRKGLR 36 (79)
T ss_pred HHHHHhhhhhcCCHHHHHHHHHHHHH
Confidence 45677777777777777777777665
No 251
>KOG2610|consensus
Probab=81.40 E-value=5.1 Score=30.50 Aligned_cols=48 Identities=17% Similarity=0.034 Sum_probs=37.9
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|.++-+..=.|.|....|-|++|.+..+++++++|.+.-|.--++.++
T Consensus 172 p~~sYv~GmyaFgL~E~g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVl 219 (491)
T KOG2610|consen 172 PCYSYVHGMYAFGLEECGIYDDAEKQADRALQINRFDCWASHAKAHVL 219 (491)
T ss_pred cHHHHHHHHHHhhHHHhccchhHHHHHHhhccCCCcchHHHHHHHHHH
Confidence 444555567899999999999999999999999998876655444443
No 252
>KOG2376|consensus
Probab=80.60 E-value=8.5 Score=31.21 Aligned_cols=51 Identities=16% Similarity=0.010 Sum_probs=43.6
Q ss_pred HHHHHHHHHH-----------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD-----------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~-----------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
|.+|++++.+ ++......+..-+|-++..+|+-.+|...+...|..+|.+.
T Consensus 191 y~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~~i~~~~~D~ 258 (652)
T KOG2376|consen 191 YNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVDIIKRNPADE 258 (652)
T ss_pred HHHHHHHHHHHHHHHHHhhcccccchhhHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHhcCCCc
Confidence 8888888876 34566677888999999999999999999999999998664
No 253
>KOG1308|consensus
Probab=80.37 E-value=1.5 Score=33.09 Aligned_cols=47 Identities=21% Similarity=0.104 Sum_probs=42.9
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
...||+.|+. ++.++...-|--|+.++.-+|+|.+|-+++..|+++|
T Consensus 164 p~~airD~d~A~ein~Dsa~~ykfrg~A~rllg~~e~aa~dl~~a~kld 212 (377)
T KOG1308|consen 164 PNAAIRDCDFAIEINPDSAKGYKFRGYAERLLGNWEEAAHDLALACKLD 212 (377)
T ss_pred CchhhhhhhhhhccCcccccccchhhHHHHHhhchHHHHHHHHHHHhcc
Confidence 6788888888 7888888889999999999999999999999999987
No 254
>PF09122 DUF1930: Domain of unknown function (DUF1930); InterPro: IPR015206 This entry represents a domain found in 3-mercaptopyruvate sulphurtransferase which has no known function. This domain adopts a structure consisting of a four-stranded antiparallel beta-sheet and an alpha-helix, arranged in a beta(2)-alpha-beta(2) fashion, and bearing a remarkable structural similarity to the FK506-binding protein class of peptidylprolyl cis/trans-isomerase []. ; PDB: 1OKG_A.
Probab=79.93 E-value=1.9 Score=24.11 Aligned_cols=19 Identities=21% Similarity=0.328 Sum_probs=14.5
Q ss_pred chhcccCCCcEEEEEEcCC
Q psy18096 3 YVLPLMETGEECQIEITAR 21 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~ 21 (131)
.|+..|+.||++.|...++
T Consensus 39 sA~~HlH~GEkA~V~FkS~ 57 (68)
T PF09122_consen 39 SALVHLHIGEKAQVFFKSQ 57 (68)
T ss_dssp HHHTT-BTT-EEEEEETTS
T ss_pred HHHHHhhcCceeEEEEecC
Confidence 4788999999999998874
No 255
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=79.09 E-value=19 Score=25.26 Aligned_cols=48 Identities=21% Similarity=0.257 Sum_probs=35.1
Q ss_pred HHHHHHHHHH---HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCC
Q psy18096 68 GILASQELLD---SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDP 115 (131)
Q Consensus 68 y~~Ai~~~~~---~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp 115 (131)
|.+|...|.+ .+-......++.+|.+++.++++..|....++..+.+|
T Consensus 105 ~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~p 155 (251)
T COG4700 105 YHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNP 155 (251)
T ss_pred hhhhHHHHHHHhccccCCCHHHHHHHHHHHHhhccHHHHHHHHHHHhhcCC
Confidence 7777777776 33344566777788888888888888888888777775
No 256
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=79.01 E-value=4.5 Score=30.52 Aligned_cols=50 Identities=22% Similarity=0.212 Sum_probs=29.7
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
..+|+..+.+ +.+|..+.+-.-++..++..|+|..|++....+++-||..
T Consensus 196 ~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~eQn~~y 247 (389)
T COG2956 196 VDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNPEY 247 (389)
T ss_pred HHHHHHHHHHHHhhCccceehhhhhhHHHHhccchHHHHHHHHHHHHhChHH
Confidence 4455555555 4445555555556666666666666666666666666655
No 257
>KOG2053|consensus
Probab=78.02 E-value=9.5 Score=32.34 Aligned_cols=61 Identities=20% Similarity=0.073 Sum_probs=45.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+.+|..+++. ...+..-..+.-+-.||..++++++|...++++++.+|+ .+-++..-.||.
T Consensus 59 ~~ea~~~Le~~~~~~~~D~~tLq~l~~~y~d~~~~d~~~~~Ye~~~~~~P~-eell~~lFmayv 121 (932)
T KOG2053|consen 59 GDEALKLLEALYGLKGTDDLTLQFLQNVYRDLGKLDEAVHLYERANQKYPS-EELLYHLFMAYV 121 (932)
T ss_pred chhHHHHHhhhccCCCCchHHHHHHHHHHHHHhhhhHHHHHHHHHHhhCCc-HHHHHHHHHHHH
Confidence 4444444443 233445556677889999999999999999999999998 777777666654
No 258
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=77.05 E-value=13 Score=28.63 Aligned_cols=50 Identities=22% Similarity=0.135 Sum_probs=40.7
Q ss_pred hhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 79 LIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 79 ~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
..|..+-++.-++.-++|.+.|.+|-..++.+|...|+ ...|-..|.++.
T Consensus 323 ~h~~~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~s-~~~~~~la~~~~ 372 (400)
T COG3071 323 QHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPS-ASDYAELADALD 372 (400)
T ss_pred hCCCChhHHHHHHHHHHHhhHHHHHHHHHHHHHhcCCC-hhhHHHHHHHHH
Confidence 35777788999999999999999999999999999875 555555665553
No 259
>KOG0495|consensus
Probab=76.91 E-value=14 Score=30.74 Aligned_cols=44 Identities=23% Similarity=0.137 Sum_probs=28.3
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
.+|.--+...--+++.++|++-|+++|+.-|...|-|.-.|+.+
T Consensus 652 Rv~mKs~~~er~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~ 695 (913)
T KOG0495|consen 652 RVWMKSANLERYLDNVEEALRLLEEALKSFPDFHKLWLMLGQIE 695 (913)
T ss_pred hhhHHHhHHHHHhhhHHHHHHHHHHHHHhCCchHHHHHHHhHHH
Confidence 34444444444567777777777777777777777777777654
No 260
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=76.67 E-value=16 Score=25.64 Aligned_cols=60 Identities=20% Similarity=0.145 Sum_probs=43.1
Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh-hCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLDSLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN-LDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~-~dp~~~ka~~rrg~a~~~ 130 (131)
|.+|-+.+.. -| .+.-...+|.+...+|++.||...+.+++. +-..+..-+.-+++|.+.
T Consensus 76 ~Rea~~~~~~--Ap-Tvqnr~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa 136 (251)
T COG4700 76 LREATEELAI--AP-TVQNRYRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFA 136 (251)
T ss_pred HHHHHHHHhh--ch-hHHHHHHHHHHHHHhhhhhhhHHHHHHHhccccCCCHHHHHHHHHHHHh
Confidence 5555554432 12 333466789999999999999999999987 446677778888887764
No 261
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=76.65 E-value=5.9 Score=18.00 Aligned_cols=28 Identities=32% Similarity=0.447 Sum_probs=23.5
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
.|+.+-.+|.+.|++++|++.+.+..+.
T Consensus 2 ~~n~li~~~~~~~~~~~a~~~~~~M~~~ 29 (35)
T TIGR00756 2 TYNTLIDGLCKAGRVEEALELFKEMLER 29 (35)
T ss_pred cHHHHHHHHHHCCCHHHHHHHHHHHHHc
Confidence 3667778899999999999999987653
No 262
>KOG0546|consensus
Probab=76.34 E-value=6.1 Score=29.97 Aligned_cols=42 Identities=19% Similarity=0.144 Sum_probs=38.0
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIK 119 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~k 119 (131)
+.++...++|..++..++++.++++|+.+...+.+.+|++..
T Consensus 303 ~~~~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~~ 344 (372)
T KOG0546|consen 303 RDERSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDKA 344 (372)
T ss_pred ccChhhCcHHHHHHhHHHhhhchhhhHHHHHHhhccCcchHH
Confidence 456778899999999999999999999999999999998754
No 263
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=75.35 E-value=12 Score=27.22 Aligned_cols=60 Identities=20% Similarity=0.059 Sum_probs=50.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKC 127 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a 127 (131)
+..|.+.... .+.|..+.-..-||.+|..+|.+.-|+.|.+..++.-|+..-+-.=|++.
T Consensus 197 ~~~al~~~~r~l~l~P~dp~eirDrGliY~ql~c~~vAl~dl~~~~~~~P~~~~a~~ir~~l 258 (269)
T COG2912 197 WELALRVAERLLDLNPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHCPDDPIAEMIRAQL 258 (269)
T ss_pred hHHHHHHHHHHHhhCCCChhhccCcHHHHHhcCCchhhHHHHHHHHHhCCCchHHHHHHHHH
Confidence 6666666666 67788887788899999999999999999999999999988877666553
No 264
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=75.28 E-value=14 Score=26.30 Aligned_cols=45 Identities=18% Similarity=0.106 Sum_probs=34.0
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.++|+..|.. ........+...+|.-|+++|+|++|+..++.+..
T Consensus 161 L~~A~~~f~~~~~~R~~~~l~~~~A~ey~~~g~~~~A~~~l~~~~~ 206 (247)
T PF11817_consen 161 LEKAYEQFKKYGQNRMASYLSLEMAEEYFRLGDYDKALKLLEPAAS 206 (247)
T ss_pred HHHHHHHHHHhccchHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 6777777776 33444555666889999999999999999988854
No 265
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=74.92 E-value=11 Score=21.93 Aligned_cols=29 Identities=14% Similarity=0.091 Sum_probs=18.4
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+.-+...|.-.-+.|+|.+|+..+.++|+
T Consensus 6 a~~l~~~Ave~D~~g~y~eAl~~Y~~aie 34 (77)
T cd02683 6 AKEVLKRAVELDQEGRFQEALVCYQEGID 34 (77)
T ss_pred HHHHHHHHHHHHHhccHHHHHHHHHHHHH
Confidence 33455566666677777777766666554
No 266
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=74.90 E-value=6.9 Score=17.92 Aligned_cols=28 Identities=29% Similarity=0.413 Sum_probs=23.7
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
..|+.+-.++.+.|+++.|...++.-.+
T Consensus 2 ~ty~~ll~a~~~~g~~~~a~~~~~~M~~ 29 (34)
T PF13812_consen 2 HTYNALLRACAKAGDPDAALQLFDEMKE 29 (34)
T ss_pred cHHHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 3578889999999999999998887654
No 267
>KOG4507|consensus
Probab=74.53 E-value=6 Score=32.29 Aligned_cols=35 Identities=20% Similarity=0.303 Sum_probs=29.7
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHH
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka 120 (131)
.++-++.+|+-+.+.+.|+..+.+|+.++|+|...
T Consensus 678 ~~~~~g~~~l~l~~i~~a~~~~~~a~~~~~~~~~~ 712 (886)
T KOG4507|consen 678 TFLSLGNAYLALKNISGALEAFRQALKLTTKCPEC 712 (886)
T ss_pred HHHhcchhHHHHhhhHHHHHHHHHHHhcCCCChhh
Confidence 45667788888999999999999999999998643
No 268
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=73.90 E-value=12 Score=21.71 Aligned_cols=28 Identities=14% Similarity=0.160 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 85 NCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 85 ~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.-+.-+|.-.-+.|+|.+|+..+..+|+
T Consensus 7 i~~a~~Ave~D~~g~y~eA~~~Y~~aie 34 (76)
T cd02681 7 VQFARLAVQRDQEGRYSEAVFYYKEAAQ 34 (76)
T ss_pred HHHHHHHHHHHHccCHHHHHHHHHHHHH
Confidence 3345566666677787777777777665
No 269
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=73.14 E-value=22 Score=25.75 Aligned_cols=49 Identities=16% Similarity=0.091 Sum_probs=38.8
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC---HHHHHHHHHhhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN---IKALQRKAKCLI 129 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~---~ka~~rrg~a~~ 129 (131)
...+.-++|-|+..+.-|+|++|+..++.+....|.+ .++..-.+.|+.
T Consensus 31 ~~p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~s~~~~qa~l~l~yA~Y 82 (254)
T COG4105 31 NLPASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYY 82 (254)
T ss_pred CCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHcCCCCcccHHHHHHHHHHHH
Confidence 3456778899999999999999999999999988765 455555555543
No 270
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=72.98 E-value=13 Score=24.97 Aligned_cols=38 Identities=18% Similarity=-0.017 Sum_probs=21.5
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
.|....+-.--+.-++..|+|.+|++.+..+.+-.|..
T Consensus 40 RP~~~e~~~~~~~l~i~r~~w~dA~rlLr~l~~~~~~~ 77 (160)
T PF09613_consen 40 RPEFPELDLFDGWLHIVRGDWDDALRLLRELEERAPGF 77 (160)
T ss_pred CCCchHHHHHHHHHHHHhCCHHHHHHHHHHHhccCCCC
Confidence 45555555555555666666666666666655555533
No 271
>KOG1941|consensus
Probab=72.57 E-value=20 Score=27.74 Aligned_cols=49 Identities=16% Similarity=0.184 Sum_probs=35.2
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH-----HHHHHHHHhhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI-----KALQRKAKCLI 129 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~-----ka~~rrg~a~~ 129 (131)
-....+|.|++..+-++.+|.+++.+|...+.+-.... .++.-.|.|++
T Consensus 80 ~~~~ea~lnlar~~e~l~~f~kt~~y~k~~l~lpgt~~~~~~gq~~l~~~~Ahl 133 (518)
T KOG1941|consen 80 DFLLEAYLNLARSNEKLCEFHKTISYCKTCLGLPGTRAGQLGGQVSLSMGNAHL 133 (518)
T ss_pred HHHHHHHHHHHHHHHHHHHhhhHHHHHHHHhcCCCCCcccccchhhhhHHHHhh
Confidence 44567889999999999999999999998888753332 34444455444
No 272
>PLN03077 Protein ECB2; Provisional
Probab=72.50 E-value=20 Score=30.18 Aligned_cols=43 Identities=16% Similarity=0.210 Sum_probs=23.0
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
.|.-+-.++..-++.+.+....+++++++|++.-.|.-.+..|
T Consensus 659 ~~~aLl~ac~~~~~~e~~e~~a~~l~~l~p~~~~~y~ll~n~y 701 (857)
T PLN03077 659 VWGALLNACRIHRHVELGELAAQHIFELDPNSVGYYILLCNLY 701 (857)
T ss_pred HHHHHHHHHHHcCChHHHHHHHHHHHhhCCCCcchHHHHHHHH
Confidence 3333333444445555555555566667776666555555444
No 273
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=70.43 E-value=16 Score=23.05 Aligned_cols=47 Identities=11% Similarity=0.012 Sum_probs=38.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
..-|++.+.+ .+.|..+..+.++|.=.---..|++++.-|.++|.+.
T Consensus 60 Ll~sve~~s~a~~Lsp~~A~~L~~la~~l~s~~~Ykk~v~kak~~Lsv~ 108 (111)
T PF04781_consen 60 LLGSVECFSRAVELSPDSAHSLFELASQLGSVKYYKKAVKKAKRGLSVT 108 (111)
T ss_pred HHHhHHHHHHHhccChhHHHHHHHHHHHhhhHHHHHHHHHHHHHHhccc
Confidence 4567777777 7778888888888877666778999999999999865
No 274
>smart00101 14_3_3 14-3-3 homologues. 14-3-3 homologues mediates signal transduction by binding to phosphoserine-containing proteins. They are involved in growth factor signalling and also interact with MEK kinases.
Probab=69.98 E-value=30 Score=24.90 Aligned_cols=45 Identities=16% Similarity=-0.001 Sum_probs=34.9
Q ss_pred HHHHHHHHHH---HhhhhhHHHHHHHHHHHHH-hCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD---SLIKERINCYNNLAQAQIK-LGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~---~~~~~~~~~~~N~a~~~~k-l~~~~~a~~~~~~al~ 112 (131)
|++|...... ..+|....+.+|.+..|.. +++.++|+....+|+.
T Consensus 151 Y~~A~e~a~~~L~pt~PirLgLaLN~SVF~yEI~~~~~~A~~lAk~afd 199 (244)
T smart00101 151 YKSAQDIALAELPPTHPIRLGLALNFSVFYYEILNSPDRACNLAKQAFD 199 (244)
T ss_pred HHHHHHHHHccCCCCCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 8888888765 4567777888899988886 5999999976666553
No 275
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=69.53 E-value=14 Score=21.49 Aligned_cols=28 Identities=18% Similarity=0.127 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
.+...|.-.-..|+|++|+..+..++++
T Consensus 8 ~Lv~~A~~eD~~gny~eA~~lY~~ale~ 35 (75)
T cd02680 8 FLVTQAFDEDEKGNAEEAIELYTEAVEL 35 (75)
T ss_pred HHHHHHHHhhHhhhHHHHHHHHHHHHHH
Confidence 3444555556667777777777777763
No 276
>KOG1310|consensus
Probab=68.81 E-value=10 Score=30.60 Aligned_cols=40 Identities=20% Similarity=-0.002 Sum_probs=34.0
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
++++...+++..++.|...++++.+|+.+...+....|.+
T Consensus 439 rln~s~~kah~~la~aL~el~r~~eal~~~~alq~~~Ptd 478 (758)
T KOG1310|consen 439 RLNPSIQKAHFRLARALNELTRYLEALSCHWALQMSFPTD 478 (758)
T ss_pred cCChHHHHHHHHHHHHHHHHhhHHHhhhhHHHHhhcCchh
Confidence 6788999999999999999999999999776666666643
No 277
>KOG1550|consensus
Probab=68.71 E-value=35 Score=27.44 Aligned_cols=56 Identities=16% Similarity=-0.091 Sum_probs=27.8
Q ss_pred HHHHHHHHHHhhhhhHHHHHHHHHHHHH----hCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 69 ILASQELLDSLIKERINCYNNLAQAQIK----LGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 69 ~~Ai~~~~~~~~~~~~~~~~N~a~~~~k----l~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
.+|.++|......-.+.+..++|.||.. .-+...|...+.+|-+.. ++.|.++++.
T Consensus 345 ~~A~~yy~~Aa~~G~~~A~~~la~~y~~G~gv~r~~~~A~~~~k~aA~~g--~~~A~~~~~~ 404 (552)
T KOG1550|consen 345 RRAFEYYSLAAKAGHILAIYRLALCYELGLGVERNLELAFAYYKKAAEKG--NPSAAYLLGA 404 (552)
T ss_pred HHHHHHHHHHHHcCChHHHHHHHHHHHhCCCcCCCHHHHHHHHHHHHHcc--ChhhHHHHHH
Confidence 3444444443334445555555555553 234555555555555554 4444454443
No 278
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=68.00 E-value=14 Score=21.73 Aligned_cols=32 Identities=22% Similarity=0.221 Sum_probs=22.5
Q ss_pred CChHHHHHHHHHHHhhC----CCCHHHHHHHHHhhh
Q psy18096 98 GSLEPALMSLENVLNLD----PNNIKALQRKAKCLI 129 (131)
Q Consensus 98 ~~~~~a~~~~~~al~~d----p~~~ka~~rrg~a~~ 129 (131)
+.|+.|...-+++|..| |+++-++||+|.-.+
T Consensus 3 ~~~~~A~~~I~kaL~~dE~g~~e~Al~~Y~~gi~~l 38 (79)
T cd02679 3 GYYKQAFEEISKALRADEWGDKEQALAHYRKGLREL 38 (79)
T ss_pred hHHHHHHHHHHHHhhhhhcCCHHHHHHHHHHHHHHH
Confidence 34677777788888776 456777788776554
No 279
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=67.42 E-value=7.9 Score=29.20 Aligned_cols=46 Identities=7% Similarity=0.069 Sum_probs=37.2
Q ss_pred HhhhhhHHHHHH-HHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHH
Q psy18096 78 SLIKERINCYNN-LAQAQIKLGSLEPALMSLENVLNLDPNNIKALQR 123 (131)
Q Consensus 78 ~~~~~~~~~~~N-~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~r 123 (131)
...|.++.+|.- .+.=+...++++.|...+.++|.++|.+++.|+.
T Consensus 135 ~khP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~~p~iw~e 181 (435)
T COG5191 135 TKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWIE 181 (435)
T ss_pred hcCCCCceeeeeeccchhhhhccHHHHHHHHHhhhccCCCCchHHHH
Confidence 557888888875 4444556789999999999999999999988764
No 280
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=66.87 E-value=20 Score=20.33 Aligned_cols=26 Identities=19% Similarity=0.139 Sum_probs=16.3
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+...|..+-+.|++++|+..+..+++
T Consensus 11 li~~Av~~d~~g~~~eAl~~Y~~a~e 36 (77)
T smart00745 11 LISKALKADEAGDYEEALELYKKAIE 36 (77)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 44566666667777777666665554
No 281
>PF02184 HAT: HAT (Half-A-TPR) repeat; InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=66.36 E-value=14 Score=17.84 Aligned_cols=28 Identities=21% Similarity=0.275 Sum_probs=22.8
Q ss_pred CChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 98 GSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 98 ~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
++++.|..-+++.+...| ++|.|.+-|+
T Consensus 1 kE~dRAR~IyeR~v~~hp-~~k~WikyAk 28 (32)
T PF02184_consen 1 KEFDRARSIYERFVLVHP-EVKNWIKYAK 28 (32)
T ss_pred ChHHHHHHHHHHHHHhCC-CchHHHHHHH
Confidence 367889999999999886 4999888764
No 282
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=65.04 E-value=29 Score=23.43 Aligned_cols=47 Identities=15% Similarity=0.032 Sum_probs=35.5
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
+..|++.|.+ -........+.|+..+.+-+++|..+....++|-.+-
T Consensus 52 ~~~A~k~y~~~~~~~~~~~~~id~~l~~irv~i~~~d~~~v~~~i~ka~~~~ 103 (177)
T PF10602_consen 52 LEEALKAYSRARDYCTSPGHKIDMCLNVIRVAIFFGDWSHVEKYIEKAESLI 103 (177)
T ss_pred HHHHHHHHHHHhhhcCCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence 7788888877 2235566777888888888888888888888877654
No 283
>PRK04841 transcriptional regulator MalT; Provisional
Probab=64.94 E-value=27 Score=29.37 Aligned_cols=49 Identities=20% Similarity=0.007 Sum_probs=38.9
Q ss_pred HHHHHHHHHH--------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC
Q psy18096 68 GILASQELLD--------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPN 116 (131)
Q Consensus 68 y~~Ai~~~~~--------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~ 116 (131)
|.+|+..+.. ......+.++..+|.++.++|++.+|.....+|+.+-..
T Consensus 707 ~~~A~~~l~~al~~~~~~g~~~~~a~~~~~la~a~~~~G~~~~A~~~L~~Al~la~~ 763 (903)
T PRK04841 707 FDEAEIILEELNENARSLRLMSDLNRNLILLNQLYWQQGRKSEAQRVLLEALKLANR 763 (903)
T ss_pred HHHHHHHHHHHHHHHHHhCchHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHhCc
Confidence 7777777776 112345567889999999999999999999999997643
No 284
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=64.21 E-value=24 Score=19.97 Aligned_cols=26 Identities=23% Similarity=0.173 Sum_probs=16.8
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+.+.|.-.-+.|+|++|+..+..+++
T Consensus 9 l~~~Av~~D~~g~~~~Al~~Y~~a~e 34 (75)
T cd02656 9 LIKQAVKEDEDGNYEEALELYKEALD 34 (75)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 44556666666777777776666654
No 285
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=62.83 E-value=49 Score=23.00 Aligned_cols=51 Identities=18% Similarity=0.214 Sum_probs=43.6
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
+-.|....+.--++++-.|-|++.+|...|.++-. |.+.+...-.|+++.+
T Consensus 161 d~n~mR~sArEALglAa~kagd~a~A~~~F~qia~-Da~aprnirqRAq~ml 211 (221)
T COG4649 161 DGNPMRHSAREALGLAAYKAGDFAKAKSWFVQIAN-DAQAPRNIRQRAQIML 211 (221)
T ss_pred CCChhHHHHHHHHhHHHHhccchHHHHHHHHHHHc-cccCcHHHHHHHHHHH
Confidence 44678888888999999999999999999999777 7777888888888754
No 286
>KOG1550|consensus
Probab=61.94 E-value=44 Score=26.86 Aligned_cols=44 Identities=16% Similarity=0.043 Sum_probs=19.9
Q ss_pred hHHHHHHHHHHHHHhC---ChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 83 RINCYNNLAQAQIKLG---SLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~---~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
.+.+..+++.||..-. ++..|...+..|-. -.++.|.||.|.++
T Consensus 324 ~~~a~~~lg~~~~~g~~~~d~~~A~~yy~~Aa~--~G~~~A~~~la~~y 370 (552)
T KOG1550|consen 324 NPDAQYLLGVLYETGTKERDYRRAFEYYSLAAK--AGHILAIYRLALCY 370 (552)
T ss_pred CchHHHHHHHHHHcCCccccHHHHHHHHHHHHH--cCChHHHHHHHHHH
Confidence 3344444444544433 34445555544442 23455555555444
No 287
>KOG2709|consensus
Probab=61.88 E-value=9.3 Score=29.83 Aligned_cols=27 Identities=26% Similarity=0.153 Sum_probs=24.5
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
..|.++||-.+++|++|+..+++.|.+
T Consensus 25 ~V~~gl~~dE~~~~e~a~~~Ye~gl~~ 51 (560)
T KOG2709|consen 25 SVEQGLCYDEVNDWENALAMYEKGLNL 51 (560)
T ss_pred HHHhhcchhhhcCHHHHHHHHHHHHHH
Confidence 558999999999999999999999874
No 288
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=59.59 E-value=60 Score=26.66 Aligned_cols=44 Identities=14% Similarity=-0.013 Sum_probs=30.7
Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLDSLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+.+|.++|+. ..+.++..|+.+..+|.+.|++++|+..+++..+
T Consensus 275 ~~~A~~vf~~-m~~~~~vt~n~li~~y~~~g~~~eA~~lf~~M~~ 318 (697)
T PLN03081 275 IEDARCVFDG-MPEKTTVAWNSMLAGYALHGYSEEALCLYYEMRD 318 (697)
T ss_pred HHHHHHHHHh-CCCCChhHHHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 5566666653 2334566788888888888888888888877755
No 289
>PRK04841 transcriptional regulator MalT; Provisional
Probab=59.23 E-value=37 Score=28.55 Aligned_cols=46 Identities=13% Similarity=0.030 Sum_probs=33.9
Q ss_pred HHHHHHHHHH------Hh--hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 68 GILASQELLD------SL--IKERINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 68 y~~Ai~~~~~------~~--~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
|.+|...+.. +. ......++.|+|.++...|++++|...+.+++.+
T Consensus 507 ~~~A~~~~~~al~~~~~~g~~~~~~~~~~~la~~~~~~G~~~~A~~~~~~al~~ 560 (903)
T PRK04841 507 LARALAMMQQTEQMARQHDVYHYALWSLLQQSEILFAQGFLQAAYETQEKAFQL 560 (903)
T ss_pred HHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH
Confidence 5666666655 11 1223456789999999999999999999999886
No 290
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=58.84 E-value=20 Score=21.39 Aligned_cols=28 Identities=25% Similarity=0.419 Sum_probs=23.2
Q ss_pred HHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 103 ALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 103 a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
.+....+.+..+|+|..+.|..|.+++.
T Consensus 7 ~~~al~~~~a~~P~D~~ar~~lA~~~~~ 34 (90)
T PF14561_consen 7 DIAALEAALAANPDDLDARYALADALLA 34 (90)
T ss_dssp HHHHHHHHHHHSTT-HHHHHHHHHHHHH
T ss_pred cHHHHHHHHHcCCCCHHHHHHHHHHHHH
Confidence 4667889999999999999999988763
No 291
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=58.41 E-value=33 Score=19.58 Aligned_cols=26 Identities=23% Similarity=0.202 Sum_probs=15.5
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+...|.-.-..|+|++|+..+..+++
T Consensus 9 l~~~Av~~D~~g~y~eA~~~Y~~aie 34 (75)
T cd02678 9 LVKKAIEEDNAGNYEEALRLYQHALE 34 (75)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 44455555566777766666665554
No 292
>KOG1915|consensus
Probab=58.28 E-value=11 Score=30.02 Aligned_cols=57 Identities=11% Similarity=0.006 Sum_probs=47.4
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
+.+|=.++.. +.+..+.++++.-|-+-+|.++.+.|....++|+.+=|.=-+-||.-
T Consensus 89 ~~RARSv~ERALdvd~r~itLWlkYae~Emknk~vNhARNv~dRAvt~lPRVdqlWyKY 147 (677)
T KOG1915|consen 89 IQRARSVFERALDVDYRNITLWLKYAEFEMKNKQVNHARNVWDRAVTILPRVDQLWYKY 147 (677)
T ss_pred HHHHHHHHHHHHhcccccchHHHHHHHHHHhhhhHhHHHHHHHHHHHhcchHHHHHHHH
Confidence 6666666666 56677889999999999999999999999999999999877777653
No 293
>PF13041 PPR_2: PPR repeat family
Probab=57.88 E-value=25 Score=17.95 Aligned_cols=32 Identities=25% Similarity=0.338 Sum_probs=27.4
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
++..|+-+-.++.+.|++++|++.+++..+..
T Consensus 2 ~~~~yn~li~~~~~~~~~~~a~~l~~~M~~~g 33 (50)
T PF13041_consen 2 DVVTYNTLISGYCKAGKFEEALKLFKEMKKRG 33 (50)
T ss_pred chHHHHHHHHHHHHCcCHHHHHHHHHHHHHcC
Confidence 34568888899999999999999999988754
No 294
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=57.28 E-value=53 Score=23.32 Aligned_cols=45 Identities=18% Similarity=0.048 Sum_probs=37.1
Q ss_pred HHHHHHHHHH--------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD--------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~--------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
|.+|+++|+. .=......+...+..|+.++|+.+..+..|-+.+.
T Consensus 194 ~~~A~~~l~~~~~~yr~egW~~l~~~~l~~l~~Ca~~~~~~~~~l~~~leLls 246 (247)
T PF11817_consen 194 YDKALKLLEPAASSYRREGWWSLLTEVLWRLLECAKRLGDVEDYLTTSLELLS 246 (247)
T ss_pred HHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHhc
Confidence 8999999888 11356777888999999999999999998877653
No 295
>KOG0495|consensus
Probab=56.80 E-value=46 Score=27.87 Aligned_cols=54 Identities=17% Similarity=0.025 Sum_probs=33.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
..+|+++++. +.-|...++|.-+++.+-.+++.+.|...|..-+..-|+.+--|
T Consensus 667 ~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~cP~~ipLW 722 (913)
T KOG0495|consen 667 VEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQGTKKCPNSIPLW 722 (913)
T ss_pred HHHHHHHHHHHHHhCCchHHHHHHHhHHHHHHHHHHHHHHHHHhccccCCCCchHH
Confidence 5667776665 44456666666666666666666666666666666655554433
No 296
>PRK13184 pknD serine/threonine-protein kinase; Reviewed
Probab=55.39 E-value=30 Score=29.85 Aligned_cols=51 Identities=18% Similarity=-0.071 Sum_probs=39.0
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
|.+|+..+++ .-.+--+-=|+-.|.+|.++++|+|=++++.-|++.-|+.+
T Consensus 535 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 586 (932)
T PRK13184 535 FTQALSEFSYLHGGVGAPLEYLGKALVYQRLGEYNEEIKSLLLALKRYSQHP 586 (932)
T ss_pred HHHHHHHHHHhcCCCCCchHHHhHHHHHHHhhhHHHHHHHHHHHHHhcCCCC
Confidence 6666666665 11233334488899999999999999999999999888775
No 297
>COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only]
Probab=54.14 E-value=31 Score=24.79 Aligned_cols=52 Identities=21% Similarity=0.031 Sum_probs=39.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIK 119 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~k 119 (131)
..+||..... ..+|.+....--+=.-+.-.|+|.+|...|+-+-+++|++.+
T Consensus 17 L~dai~~a~~qVkakPtda~~RhflfqLlcvaGdw~kAl~Ql~l~a~l~p~~t~ 70 (273)
T COG4455 17 LQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKALAQLNLAATLSPQDTV 70 (273)
T ss_pred HHHHHHHHHHHHhcCCccccchhHHHHHHhhcchHHHHHHHHHHHhhcCcccch
Confidence 5677777766 445555555555556666789999999999999999998864
No 298
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=53.82 E-value=26 Score=22.02 Aligned_cols=31 Identities=16% Similarity=0.168 Sum_probs=24.4
Q ss_pred HHHHHHHhCChHHHHHHHHHHHhhCCCCHHH
Q psy18096 90 LAQAQIKLGSLEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 90 ~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka 120 (131)
+|..++..|++-+|++..++.+...+++..+
T Consensus 2 ~A~~~~~rGnhiKAL~iied~i~~h~~~~~~ 32 (111)
T PF04781_consen 2 KAKDYFARGNHIKALEIIEDLISRHGEDESS 32 (111)
T ss_pred hHHHHHHccCHHHHHHHHHHHHHHccCCCch
Confidence 5667788888888888888888888777654
No 299
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=53.65 E-value=76 Score=22.34 Aligned_cols=45 Identities=22% Similarity=0.103 Sum_probs=35.4
Q ss_pred HHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCC
Q psy18096 71 ASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDP 115 (131)
Q Consensus 71 Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp 115 (131)
|.++|.. .+.|..-..|+.+|..+.-.|+.-+|+=.|-++|....
T Consensus 1 A~~~Y~~A~~l~P~~G~p~nQLAvl~~~~~~~l~avy~y~Rsl~~~~ 47 (278)
T PF10373_consen 1 AERYYRKAIRLLPSNGNPYNQLAVLASYQGDDLDAVYYYIRSLAVRI 47 (278)
T ss_dssp HHHHHHHHHHH-TTBSHHHHHHHHHHHHTT-HHHHHHHHHHHHSSSB
T ss_pred CHHHHHHHHHhCCCCCCcccchhhhhccccchHHHHHHHHHHHhcCC
Confidence 4555655 56688889999999999999999999999999987653
No 300
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=53.65 E-value=42 Score=19.32 Aligned_cols=26 Identities=19% Similarity=0.045 Sum_probs=16.1
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+...|.-.-..|+|++|+..+..+|+
T Consensus 9 lv~~Av~~D~~g~y~eA~~lY~~ale 34 (75)
T cd02684 9 LVVQAVKKDQRGDAAAALSLYCSALQ 34 (75)
T ss_pred HHHHHHHHHHhccHHHHHHHHHHHHH
Confidence 33455556666777777766666654
No 301
>KOG2396|consensus
Probab=53.51 E-value=81 Score=25.43 Aligned_cols=40 Identities=13% Similarity=0.052 Sum_probs=21.1
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHH
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQ 122 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~ 122 (131)
.+.+|.+-..---|-+.+.+.-..|.++|...|+|+.-|-
T Consensus 104 D~~lW~~yi~f~kk~~~~~~v~ki~~~~l~~Hp~~~dLWI 143 (568)
T KOG2396|consen 104 DVKLWLSYIAFCKKKKTYGEVKKIFAAMLAKHPNNPDLWI 143 (568)
T ss_pred CHHHHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCchhHH
Confidence 4445555444444444455555566666666665554443
No 302
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=53.34 E-value=7.3 Score=25.67 Aligned_cols=18 Identities=17% Similarity=0.283 Sum_probs=15.5
Q ss_pred HHHHHhCChHHHHHHHHH
Q psy18096 92 QAQIKLGSLEPALMSLEN 109 (131)
Q Consensus 92 ~~~~kl~~~~~a~~~~~~ 109 (131)
..|+++|+|++|++.|..
T Consensus 111 k~yl~~~~fd~Al~~~~~ 128 (147)
T PF05131_consen 111 KIYLDKGDFDEALQYCKT 128 (147)
T ss_pred HHHHhcCcHHHHHHHccC
Confidence 357888999999999987
No 303
>KOG1941|consensus
Probab=52.74 E-value=44 Score=25.99 Aligned_cols=33 Identities=27% Similarity=0.259 Sum_probs=27.7
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
.....++.+|.++-++|+...|.++|+++.++.
T Consensus 204 yr~~~lyhmaValR~~G~LgdA~e~C~Ea~kla 236 (518)
T KOG1941|consen 204 YRAMSLYHMAVALRLLGRLGDAMECCEEAMKLA 236 (518)
T ss_pred HHHHHHHHHHHHHHHhcccccHHHHHHHHHHHH
Confidence 344567789999999999999999999998764
No 304
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=52.14 E-value=85 Score=22.43 Aligned_cols=62 Identities=16% Similarity=-0.005 Sum_probs=44.3
Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHH----hCChHHHHHHHHHHHhhCCCC-HHHHHHHHHhhh
Q psy18096 68 GILASQELLDSLIKERINCYNNLAQAQIK----LGSLEPALMSLENVLNLDPNN-IKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~~~~~~~~~~~~N~a~~~~k----l~~~~~a~~~~~~al~~dp~~-~ka~~rrg~a~~ 129 (131)
+.+|+.+|........+....|++..|.. -.++.+|...+.+|.+..-.. ..+.++.|..+.
T Consensus 93 ~~~A~~~~~~~a~~g~~~a~~~lg~~~~~G~gv~~d~~~A~~~~~~Aa~~g~~~a~~~~~~l~~~~~ 159 (292)
T COG0790 93 KTKAADWYRCAAADGLAEALFNLGLMYANGRGVPLDLVKALKYYEKAAKLGNVEAALAMYRLGLAYL 159 (292)
T ss_pred HHHHHHHHHHHhhcccHHHHHhHHHHHhcCCCcccCHHHHHHHHHHHHHcCChhHHHHHHHHHHHHH
Confidence 67788877776666677788888888887 458889999888888764322 233666666553
No 305
>PLN03218 maturation of RBCL 1; Provisional
Probab=51.73 E-value=1e+02 Score=27.14 Aligned_cols=29 Identities=21% Similarity=0.172 Sum_probs=15.6
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHH
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVL 111 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al 111 (131)
....|+.+..+|.+.|++++|+..+++..
T Consensus 683 d~~tynsLI~ay~k~G~~eeA~~lf~eM~ 711 (1060)
T PLN03218 683 GTVSYSSLMGACSNAKNWKKALELYEDIK 711 (1060)
T ss_pred CHHHHHHHHHHHHhCCCHHHHHHHHHHHH
Confidence 33445555555555555555555555543
No 306
>KOG1070|consensus
Probab=50.93 E-value=83 Score=28.73 Aligned_cols=56 Identities=18% Similarity=-0.138 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 68 GILASQELLDSLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 68 y~~Ai~~~~~~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
|++|.++++ -.++|.-++-.|.+-+.+++|.+.++..++.-.+-.+.|.+.|..++
T Consensus 1520 FeRAcqycd------~~~V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl 1575 (1710)
T KOG1070|consen 1520 FERACQYCD------AYTVHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLL 1575 (1710)
T ss_pred HHHHHHhcc------hHHHHHHHHHHHHHhhcchhHHHHHHHHHHHhcchhhHHHHHHHHHh
Confidence 666665554 34556667777777777777777777777766566666666666654
No 307
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=50.55 E-value=74 Score=22.32 Aligned_cols=41 Identities=22% Similarity=0.156 Sum_probs=18.8
Q ss_pred HHHHHHHHHHHh-----hhhhHHHHHHHHHHHHHhCChHHHHHHHH
Q psy18096 68 GILASQELLDSL-----IKERINCYNNLAQAQIKLGSLEPALMSLE 108 (131)
Q Consensus 68 y~~Ai~~~~~~~-----~~~~~~~~~N~a~~~~kl~~~~~a~~~~~ 108 (131)
+.+|+..++.-+ ......+-.++|.+++.+|.+++|+...+
T Consensus 105 ~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~ 150 (207)
T COG2976 105 LDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAALKTLD 150 (207)
T ss_pred HHHHHHHHHHHHccchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHh
Confidence 555555555411 12223333455555555555555554433
No 308
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=50.01 E-value=96 Score=22.44 Aligned_cols=51 Identities=14% Similarity=0.081 Sum_probs=34.3
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
.+.|.++|+. ..-+....+++.-....+++|+.+.|...+++++..-|...
T Consensus 52 ~~~A~~Ife~glk~f~~~~~~~~~Y~~~l~~~~d~~~aR~lfer~i~~l~~~~ 104 (280)
T PF05843_consen 52 PKRARKIFERGLKKFPSDPDFWLEYLDFLIKLNDINNARALFERAISSLPKEK 104 (280)
T ss_dssp HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHTT-HHHHHHHHHHHCCTSSCHH
T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHhCcHHHHHHHHHHHHHhcCchh
Confidence 3345555655 33456667777777777888999999999999988655443
No 309
>PLN03077 Protein ECB2; Provisional
Probab=49.60 E-value=54 Score=27.63 Aligned_cols=31 Identities=13% Similarity=0.186 Sum_probs=18.8
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.+...|+.+..+|.+.|+.++|+..+++..+
T Consensus 552 ~d~~s~n~lI~~~~~~G~~~~A~~lf~~M~~ 582 (857)
T PLN03077 552 KDVVSWNILLTGYVAHGKGSMAVELFNRMVE 582 (857)
T ss_pred CChhhHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 3455566666666666666666666665554
No 310
>KOG1586|consensus
Probab=49.12 E-value=1e+02 Score=22.50 Aligned_cols=50 Identities=18% Similarity=0.090 Sum_probs=35.4
Q ss_pred HHHHHHHHHH--------HhhhhhH-HHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 68 GILASQELLD--------SLIKERI-NCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 68 y~~Ai~~~~~--------~~~~~~~-~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
|.+||++|++ .+..-.+ ..++--++||+--.+.--+-...++-.++||.-
T Consensus 170 Y~~Ai~iyeqva~~s~~n~LLKys~KdyflkAgLChl~~~D~v~a~~ALeky~~~dP~F 228 (288)
T KOG1586|consen 170 YSKAIDIYEQVARSSLDNNLLKYSAKDYFLKAGLCHLCKADEVNAQRALEKYQELDPAF 228 (288)
T ss_pred HHHHHHHHHHHHHHhccchHHHhHHHHHHHHHHHHhHhcccHHHHHHHHHHHHhcCCcc
Confidence 9999999998 2222233 355678899998777666666677777788853
No 311
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=48.96 E-value=1.3e+02 Score=24.36 Aligned_cols=42 Identities=19% Similarity=0.164 Sum_probs=33.0
Q ss_pred HHHHHHHHHH---Hhhh-hhHHHHHHHHHHHHHhCChHHHHHHHHH
Q psy18096 68 GILASQELLD---SLIK-ERINCYNNLAQAQIKLGSLEPALMSLEN 109 (131)
Q Consensus 68 y~~Ai~~~~~---~~~~-~~~~~~~N~a~~~~kl~~~~~a~~~~~~ 109 (131)
.++||+.+.. +... ....++.|+..|++.++.|.++-....+
T Consensus 275 ~~EAIk~~rdLlke~p~~~~l~IrenLie~LLelq~Yad~q~lL~k 320 (539)
T PF04184_consen 275 LREAIKMFRDLLKEFPNLDNLNIRENLIEALLELQAYADVQALLAK 320 (539)
T ss_pred hHHHHHHHHHHHhhCCccchhhHHHHHHHHHHhcCCHHHHHHHHHH
Confidence 7889998887 2222 3667899999999999999988776555
No 312
>PF07980 SusD: SusD family; InterPro: IPR012944 This domain occurs in several hypothetical proteins. It also occurs in RagB, Q9ZA59 from SWISSPROT, a protein involved in signalling [] and SusD, Q8A1G2 from SWISSPROT, an outer membrane protein involved in nutrient binding [].; PDB: 3IHV_A 3LEW_A 3JQ1_A 3JQ0_A 3NQP_B 3SNX_A 3L22_A 3OTN_A 3IV0_A 3QNK_C ....
Probab=48.08 E-value=47 Score=23.10 Aligned_cols=31 Identities=29% Similarity=0.263 Sum_probs=26.8
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
.+.+|+.+|-|..++|+..+|+.+.+++.+.
T Consensus 132 ~aEvyL~~AEA~~~~g~~~~A~~~lN~vR~R 162 (266)
T PF07980_consen 132 LAEVYLIYAEALARLGNTAEALEYLNQVRKR 162 (266)
T ss_dssp HHHHHHHHHHHHHHTTSHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhCCHHHHHHHHHHHHHH
Confidence 5567899999999999999999999987753
No 313
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=47.72 E-value=49 Score=22.32 Aligned_cols=35 Identities=31% Similarity=0.256 Sum_probs=32.4
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPN 116 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~ 116 (131)
..+.++.|++.++..+|+.++|.+...++..+=|.
T Consensus 142 P~~~~~~~~a~~l~~~G~~~eA~~~~~~~~~lyP~ 176 (193)
T PF11846_consen 142 PDPNVYQRYALALALLGDPEEARQWLARARRLYPA 176 (193)
T ss_pred CCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCc
Confidence 47788999999999999999999999999999993
No 314
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=47.40 E-value=40 Score=25.11 Aligned_cols=42 Identities=21% Similarity=0.056 Sum_probs=30.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLEN 109 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~ 109 (131)
+..|...+.. ...+.+..+..-+|.||+..|+.+.|..-.+.
T Consensus 150 ~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~ 193 (304)
T COG3118 150 FGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAA 193 (304)
T ss_pred hhhHHHHHHHHHHhCcccchHHHHHHHHHHHcCChHHHHHHHHh
Confidence 6666666665 55677777888888888888888777664443
No 315
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=47.36 E-value=1.1e+02 Score=23.06 Aligned_cols=42 Identities=19% Similarity=-0.016 Sum_probs=35.2
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHH
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLEN 109 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~ 109 (131)
|.+|+.+..+ .++|.....+.-+-+.+..+|+--.|+..+++
T Consensus 295 ~neAi~l~qr~ltldpL~e~~nk~lm~~la~~gD~is~~khyer 338 (361)
T COG3947 295 PNEAIQLHQRALTLDPLSEQDNKGLMASLATLGDEISAIKHYER 338 (361)
T ss_pred hHHHHHHHHHHhhcChhhhHHHHHHHHHHHHhccchhhhhHHHH
Confidence 8889988887 77888888888888999999997777777664
No 316
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=47.36 E-value=36 Score=23.83 Aligned_cols=23 Identities=26% Similarity=0.163 Sum_probs=13.4
Q ss_pred hhhHHHHHHHHHHHHHhCChHHH
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPA 103 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a 103 (131)
..++.++.-+|..|.++++++.|
T Consensus 175 ~~n~eil~sLas~~~~~~~~e~A 197 (203)
T PF11207_consen 175 NFNPEILKSLASIYQKLKNYEQA 197 (203)
T ss_pred CCCHHHHHHHHHHHHHhcchhhh
Confidence 44555566666666666666555
No 317
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=47.21 E-value=1.1e+02 Score=22.14 Aligned_cols=36 Identities=14% Similarity=0.227 Sum_probs=30.8
Q ss_pred HhhhhhHHHHHHHHHHHHHhC-ChHHHHHHHHHHHhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLG-SLEPALMSLENVLNL 113 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~-~~~~a~~~~~~al~~ 113 (131)
+........++|.|...++.+ ++++|+...++++.+
T Consensus 29 ~~~~~La~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~ 65 (278)
T PF08631_consen 29 DMAEELARVCYNIGKSLLSKKDKYEEAVKWLQRAYDI 65 (278)
T ss_pred HHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHH
Confidence 334556778899999999999 999999999999887
No 318
>PF14689 SPOB_a: Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=46.75 E-value=49 Score=18.17 Aligned_cols=26 Identities=15% Similarity=0.176 Sum_probs=19.4
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+...-..++.+|++++|.++..+...
T Consensus 26 hLqvI~gllqlg~~~~a~eYi~~~~~ 51 (62)
T PF14689_consen 26 HLQVIYGLLQLGKYEEAKEYIKELSK 51 (62)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHH
Confidence 44556678899999999998887654
No 319
>KOG0889|consensus
Probab=46.21 E-value=69 Score=31.79 Aligned_cols=61 Identities=16% Similarity=0.074 Sum_probs=48.9
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
++.+++..+. =.+...+..+.=+|.-+.|+|+.++|=..+..|++++-.-.|||+.-|.-+
T Consensus 2791 ~~~gLevi~sTNl~yF~~~q~aeff~lkG~f~~kL~~~eeAn~~fs~AvQi~~~l~KaW~~Wg~y~ 2856 (3550)
T KOG0889|consen 2791 LKTGLEVIESTNLMYFSDRQKAEFFTLKGMFLEKLGKFEEANKAFSAAVQIDDGLGKAWAEWGKYL 2856 (3550)
T ss_pred HHHHHHHHhcccHHHHhhHHHHHHHHhhhHHHHHhcCcchhHHHHHHHHHHHhhhHHHHHHHHHHH
Confidence 4455555544 124567778888999999999999999999999999999999999888643
No 320
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=45.99 E-value=1.4e+02 Score=23.06 Aligned_cols=46 Identities=24% Similarity=0.105 Sum_probs=35.4
Q ss_pred hHHHHHHHHHHHHH---hCChHHHHHHHHHHHh-hCCCCHHHHHHHHHhh
Q psy18096 83 RINCYNNLAQAQIK---LGSLEPALMSLENVLN-LDPNNIKALQRKAKCL 128 (131)
Q Consensus 83 ~~~~~~N~a~~~~k---l~~~~~a~~~~~~al~-~dp~~~ka~~rrg~a~ 128 (131)
...+-.-.|.+..+ .|+.++|+..+..++. .++.++..+.-.|.+|
T Consensus 178 ~~~i~~~yafALnRrn~~gdre~Al~il~~~l~~~~~~~~d~~gL~GRIy 227 (374)
T PF13281_consen 178 QHNIKFQYAFALNRRNKPGDREKALQILLPVLESDENPDPDTLGLLGRIY 227 (374)
T ss_pred chHHHHHHHHHHhhcccCCCHHHHHHHHHHHHhccCCCChHHHHHHHHHH
Confidence 44455567888888 8999999999999554 5577788887777765
No 321
>KOG0529|consensus
Probab=45.91 E-value=53 Score=25.60 Aligned_cols=47 Identities=17% Similarity=0.185 Sum_probs=38.0
Q ss_pred HhhhhhHHHHHHHHHHHHHhC--ChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLG--SLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~--~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
...|..-.+|+-|+-+..+.. +|..=++-|+++|+.||.|-.+|-.|
T Consensus 103 ~~npksY~aW~hR~w~L~~~p~~~~~~EL~lcek~L~~D~RNfh~W~YR 151 (421)
T KOG0529|consen 103 KVNPKSYGAWHHRKWVLQKNPHSDWNTELQLCEKALKQDPRNFHAWHYR 151 (421)
T ss_pred HhCchhHHHHHHHHHHHHhCCCchHHHHHHHHHHHHhcCcccccchHHH
Confidence 345666778888999988765 47888999999999999999888544
No 322
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=45.54 E-value=1.1e+02 Score=22.79 Aligned_cols=47 Identities=15% Similarity=0.092 Sum_probs=39.4
Q ss_pred HHHHHHHHHH-------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD-------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~-------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
|.++++.+.. ++....+.++.+++.-....|..+.|+...+..|+++
T Consensus 125 y~~~l~~L~~~~~~~~~~~~~~~~~e~~~l~v~~r~~~fl~~aG~~E~Ava~~Qa~lE~n 184 (321)
T PF08424_consen 125 YEKCLRALSRRRSGRMTSHPDLPELEEFMLYVFLRLCRFLRQAGYTERAVALWQALLEFN 184 (321)
T ss_pred HHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHHHHHCCchHHHHHHHHHHHHHH
Confidence 7777777766 3356778888899999999999999999999999987
No 323
>KOG1915|consensus
Probab=45.49 E-value=1.3e+02 Score=24.37 Aligned_cols=50 Identities=12% Similarity=0.069 Sum_probs=41.1
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
+.++=++|.+ +..|.+...+.+-|..-..+|+++.|...++-|++-..-+
T Consensus 453 fDRcRkLYEkfle~~Pe~c~~W~kyaElE~~LgdtdRaRaifelAi~qp~ld 504 (677)
T KOG1915|consen 453 FDRCRKLYEKFLEFSPENCYAWSKYAELETSLGDTDRARAIFELAISQPALD 504 (677)
T ss_pred HHHHHHHHHHHHhcChHhhHHHHHHHHHHHHhhhHHHHHHHHHHHhcCcccc
Confidence 6666666666 7889999999999999999999999999999888754333
No 324
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=45.46 E-value=78 Score=23.32 Aligned_cols=45 Identities=16% Similarity=0.083 Sum_probs=38.0
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
+..++..++. +.+|..-..|.-+-..|++.|+...||..|.+.-.
T Consensus 169 ~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~ 215 (280)
T COG3629 169 ADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKK 215 (280)
T ss_pred HHHHHHHHHHHHhcCccchHHHHHHHHHHHHcCCchHHHHHHHHHHH
Confidence 5555666666 77899999999999999999999999999987655
No 325
>KOG2053|consensus
Probab=44.98 E-value=1.1e+02 Score=26.41 Aligned_cols=55 Identities=11% Similarity=-0.047 Sum_probs=42.5
Q ss_pred HHHHHHHHHH---HhhhhhHHHHHHHHHHHH-HhCChHHHHHHHHHHHhhCCCCHHHHH
Q psy18096 68 GILASQELLD---SLIKERINCYNNLAQAQI-KLGSLEPALMSLENVLNLDPNNIKALQ 122 (131)
Q Consensus 68 y~~Ai~~~~~---~~~~~~~~~~~N~a~~~~-kl~~~~~a~~~~~~al~~dp~~~ka~~ 122 (131)
|++|...+.. +..+......-|++.-++ ++++|.+..+-+.+++..++++-|+++
T Consensus 206 ~~eal~~l~~~la~~l~~~~~~l~~~~~dllk~l~~w~~l~~l~~~Ll~k~~Ddy~~~~ 264 (932)
T KOG2053|consen 206 YQEALEFLAITLAEKLTSANLYLENKKLDLLKLLNRWQELFELSSRLLEKGNDDYKIYT 264 (932)
T ss_pred HHHHHHHHHHHHHHhccccchHHHHHHHHHHHHhcChHHHHHHHHHHHHhCCcchHHHH
Confidence 9999999976 444445555667777666 569999999999999999999855543
No 326
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=44.02 E-value=63 Score=18.61 Aligned_cols=24 Identities=21% Similarity=0.192 Sum_probs=14.1
Q ss_pred HHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 89 NLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 89 N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.+|.-.-+.++|.+|+..+..+|.
T Consensus 11 ~~Ave~d~~~~y~eA~~~Y~~~i~ 34 (75)
T cd02677 11 RLALEKEEEGDYEAAFEFYRAGVD 34 (75)
T ss_pred HHHHHHHHHhhHHHHHHHHHHHHH
Confidence 344444455677777666666654
No 327
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=42.99 E-value=48 Score=23.94 Aligned_cols=29 Identities=17% Similarity=0.039 Sum_probs=25.0
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHH
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVL 111 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al 111 (131)
...++=|.|.-..+.++|.+|+..|+-++
T Consensus 245 ~~~LLW~~~~~~~~~k~y~~A~~w~~~al 273 (278)
T PF08631_consen 245 IHTLLWNKGKKHYKAKNYDEAIEWYELAL 273 (278)
T ss_pred HHHHHHHHHHHHHhhcCHHHHHHHHHHHH
Confidence 34556689999999999999999999877
No 328
>PHA02537 M terminase endonuclease subunit; Provisional
Probab=42.75 E-value=52 Score=23.49 Aligned_cols=43 Identities=19% Similarity=0.212 Sum_probs=30.7
Q ss_pred hhhhHHHHHHHHHHHHH---------hCChHHHHHHHHHHHhhCCCC-HHHHH
Q psy18096 80 IKERINCYNNLAQAQIK---------LGSLEPALMSLENVLNLDPNN-IKALQ 122 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~k---------l~~~~~a~~~~~~al~~dp~~-~ka~~ 122 (131)
+...+++|--.|..+++ .++...|+..+.+|+++||+. +|...
T Consensus 165 d~vrAKl~K~~G~~llr~~~g~~~~d~~~l~~Al~~L~rA~~l~~k~GVK~~i 217 (230)
T PHA02537 165 DEVRAKLYKAAGYLLLRNEKGEPIGDAETLQLALALLQRAFQLNDKCGVKKDI 217 (230)
T ss_pred hHHHHHHHHHHHHHHhhcccCCCccCcccHHHHHHHHHHHHHhCCCCChHHHH
Confidence 35566666666666643 457889999999999999864 44433
No 329
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=42.65 E-value=87 Score=19.83 Aligned_cols=34 Identities=15% Similarity=0.037 Sum_probs=26.9
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVL 111 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al 111 (131)
.+-...+..|.-.|..+.+.|++++|...+..+|
T Consensus 93 ~IG~~~A~fY~~wA~~le~~~~~~~A~~I~~~Gi 126 (126)
T PF08311_consen 93 GIGTKLALFYEEWAEFLEKRGNFKKADEIYQLGI 126 (126)
T ss_dssp TTSTTBHHHHHHHHHHHHHTT-HHHHHHHHHHHH
T ss_pred CccHHHHHHHHHHHHHHHHcCCHHHHHHHHHhhC
Confidence 4456677788889999999999999999887764
No 330
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=42.51 E-value=1.2e+02 Score=21.21 Aligned_cols=52 Identities=17% Similarity=0.089 Sum_probs=35.7
Q ss_pred HHHHHHHHHH---Hhh------hhh-HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHH
Q psy18096 68 GILASQELLD---SLI------KER-INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 68 y~~Ai~~~~~---~~~------~~~-~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka 120 (131)
.+.|+..++. +.+ ... .-+----..++++-|.|++|.+..++..+ ||++.+-
T Consensus 85 LESAl~v~~~I~~E~~~~~~lhe~i~~lik~~aV~VCm~~g~Fk~A~eiLkr~~~-d~~~~~~ 146 (200)
T cd00280 85 LESALMVLESIEKEFSLPETLHEEIRKLIKEQAVAVCMENGEFKKAEEVLKRLFS-DPESQKL 146 (200)
T ss_pred HHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhc-CCCchhH
Confidence 5677777776 211 111 11222345678899999999999999999 8887765
No 331
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=42.18 E-value=74 Score=22.24 Aligned_cols=47 Identities=21% Similarity=0.017 Sum_probs=31.8
Q ss_pred HHHHHHHHHH--HhhhhhHHHHHHHHHHHHHhCC-------hHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD--SLIKERINCYNNLAQAQIKLGS-------LEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~-------~~~a~~~~~~al~~d 114 (131)
|+-|+..+.. +-....+.+++.+|=.|--+++ +..|+..+.++++.+
T Consensus 100 YkLAll~~~~~~~~~s~~A~l~LrlAWlyR~~~~~~~E~~fl~~Al~~y~~a~~~e 155 (214)
T PF09986_consen 100 YKLALLCAQIKKEKPSKKAGLCLRLAWLYRDLGDEENEKRFLRKALEFYEEAYENE 155 (214)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHhC
Confidence 6666655544 2234567778888888888887 566777777777655
No 332
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=41.14 E-value=85 Score=20.96 Aligned_cols=34 Identities=15% Similarity=-0.030 Sum_probs=15.2
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
|....+-.--+..++..|+|.+|++.+..+.+-.
T Consensus 41 P~~~e~d~~dg~l~i~rg~w~eA~rvlr~l~~~~ 74 (153)
T TIGR02561 41 PNLKELDMFDGWLLIARGNYDEAARILRELLSSA 74 (153)
T ss_pred CCccccchhHHHHHHHcCCHHHHHHHHHhhhccC
Confidence 3333333334444444455555555554444433
No 333
>PLN03218 maturation of RBCL 1; Provisional
Probab=41.05 E-value=2.2e+02 Score=25.20 Aligned_cols=28 Identities=14% Similarity=0.138 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
.|+.+-.+|.+.|++++|++.+++..+.
T Consensus 581 TynaLI~ay~k~G~ldeA~elf~~M~e~ 608 (1060)
T PLN03218 581 TVGALMKACANAGQVDRAKEVYQMIHEY 608 (1060)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHc
Confidence 3444444444444444444444444443
No 334
>COG0425 SirA Predicted redox protein, regulator of disulfide bond formation [Posttranslational modification, protein turnover, chaperones]
Probab=40.67 E-value=25 Score=20.47 Aligned_cols=19 Identities=16% Similarity=0.184 Sum_probs=16.5
Q ss_pred chhcccCCCcEEEEEEcCC
Q psy18096 3 YVLPLMETGEECQIEITAR 21 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~ 21 (131)
.+|..|+.||.-.|.++..
T Consensus 24 k~l~~m~~Ge~LeV~~ddp 42 (78)
T COG0425 24 KALAKLKPGEILEVIADDP 42 (78)
T ss_pred HHHHcCCCCCEEEEEecCc
Confidence 4789999999999999853
No 335
>cd01736 LSm14_N LSm14 (also known as RAP55) belongs to a family of Sm-like proteins that associate with RNA to form the core domain of the ribonucleoprotein particles involved in a variety of RNA processing events including pre-mRNA splicing, telomere replication, and mRNA degradation. Members of this family share a highly conserved Sm fold, containing an N-terminal helix followed by a strongly bent five-stranded antiparallel beta-sheet, that associates with other Sm proteins to form hexameric and heptameric ring structures. In addition to the N-terminal Sm-like domain, LSm14 has an uncharacterized C-terminal domain containing a conserved DFDF box. In Xenopus laevis, LSm14 is an oocyte-specific constituent of ribonucleoprotein particles.
Probab=40.65 E-value=43 Score=19.45 Aligned_cols=33 Identities=27% Similarity=0.618 Sum_probs=23.2
Q ss_pred CcEEEEEEcCCCCCCCCCCCC--CCCCCCceEEEE
Q psy18096 11 GEECQIEITARFGYGDKGEPS--KSIPPGAKLYYS 43 (131)
Q Consensus 11 gE~~~v~i~~~~ayG~~G~~~--~~ip~~~~~~~~ 43 (131)
.|.+.+.+..-..||..|++. +.|||..++.-.
T Consensus 28 ~~~sTi~L~nVr~fGTEgR~~~~~~ipp~~~vyd~ 62 (74)
T cd01736 28 TEDSTIALKNVRSFGTEGRPTDGPEIPPSDEVYDY 62 (74)
T ss_pred cccCEEEeeeeEeecccCCCCCCCccCCCCcceeE
Confidence 356777888888999998854 237887766433
No 336
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=40.58 E-value=30 Score=26.23 Aligned_cols=41 Identities=15% Similarity=0.075 Sum_probs=34.0
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
+..++.|.--+.--.+.+.|.+.-..|.++|+..|.|+..|
T Consensus 104 f~D~k~w~~y~~Y~~k~k~y~~~~nI~~~~l~khP~nvdlW 144 (435)
T COG5191 104 FNDPKIWSQYAAYVIKKKMYGEMKNIFAECLTKHPLNVDLW 144 (435)
T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceee
Confidence 45677777777777788899999999999999999997655
No 337
>KOG1585|consensus
Probab=39.80 E-value=72 Score=23.43 Aligned_cols=41 Identities=24% Similarity=0.110 Sum_probs=19.9
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
|-.-|.||-..++|++|..+..+|.+--. |-++||..++++
T Consensus 34 yekAAvafRnAk~feKakdcLlkA~~~yE-nnrslfhAAKay 74 (308)
T KOG1585|consen 34 YEKAAVAFRNAKKFEKAKDCLLKASKGYE-NNRSLFHAAKAY 74 (308)
T ss_pred HHHHHHHHHhhccHHHHHHHHHHHHHHHH-hcccHHHHHHHH
Confidence 33344455555555555555555554432 234455544443
No 338
>PF09311 Rab5-bind: Rabaptin-like protein; InterPro: IPR015390 This domain is predominantly found in Rabaptin and allows for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes []. ; PDB: 3NF1_A 3CEQ_B 3EDT_H 1X79_C 1TU3_F.
Probab=39.54 E-value=84 Score=21.32 Aligned_cols=33 Identities=24% Similarity=0.255 Sum_probs=27.5
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.|.....+.|+-.-|.+.|+|.-|+.-|..+|+
T Consensus 136 ~~~rl~tL~nlv~q~~~q~r~evav~~~KqalE 168 (181)
T PF09311_consen 136 IPARLRTLHNLVIQYESQGRYEVAVPLCKQALE 168 (181)
T ss_dssp S-HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 355667788999999999999999999999986
No 339
>PF15297 CKAP2_C: Cytoskeleton-associated protein 2 C-terminus
Probab=38.82 E-value=1.8e+02 Score=22.29 Aligned_cols=54 Identities=11% Similarity=0.127 Sum_probs=42.5
Q ss_pred hHHHHHHHHHHHHHH-----Hh-------------hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCC
Q psy18096 63 RLDFGGILASQELLD-----SL-------------IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPN 116 (131)
Q Consensus 63 ~~~~~y~~Ai~~~~~-----~~-------------~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~ 116 (131)
+++..+.+++++++. ++ ..-.+++|..+|...-..|.+++.|..|++|+.....
T Consensus 101 kvn~tlsECl~Li~eGcp~eei~~~L~~li~~IP~A~K~aKYWIC~Arl~~~~~~~e~vi~iyEeAi~agAq 172 (353)
T PF15297_consen 101 KVNKTLSECLNLIEEGCPKEEILATLSDLIKNIPDAKKLAKYWICLARLEPRTGPIEDVIAIYEEAILAGAQ 172 (353)
T ss_pred HHHHHHHHHHHHHHcCCCHHHHHHHHHHHHhcCchHHHHHHHHHHHHHHHhhcCCHHHHHHHHHHHHHcCCC
Confidence 444458888888887 11 1346789999999999999999999999999987643
No 340
>PF12753 Nro1: Nuclear pore complex subunit Nro1; InterPro: IPR024318 In fission yeast, Nro1 is a positive regulator of the stability of Sre1N, the sterol regulatory element-binding protein, which is an ER membrane-bound transcription factor that controls adaptation to low oxygen-growth []. In addition, the fission yeast Nro1 is a direct inhibitor of a protein that inhibits SreN1 degradation, Ofd1 (an oxoglutamate deoxygenase). The outcome of this reactivity is that Ofd1 acts as an oxygen sensor that regulates the binding of Nro1 to Ofd1 to control the stability of Sre1N []. This entry also represents ETT1, an Nro1 ortholog []. ETT1 is required for correct translation termination and probably involved in regulation of hypoxic gene expression in association TPA1 []. It inhibits replication of Brome mosaic virus [].; GO: 0005515 protein binding, 0005634 nucleus; PDB: 3QTM_B 3MSV_B 3QTN_B.
Probab=38.07 E-value=49 Score=25.63 Aligned_cols=32 Identities=22% Similarity=0.184 Sum_probs=25.4
Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCC
Q psy18096 68 GILASQELLDSLIKERINCYNNLAQAQIKLGS 99 (131)
Q Consensus 68 y~~Ai~~~~~~~~~~~~~~~~N~a~~~~kl~~ 99 (131)
.++|++++.+..+...+..|.++|-+++-+|+
T Consensus 334 ~~~Al~yL~kA~d~ddPetWv~vAEa~I~LGN 365 (404)
T PF12753_consen 334 IKKALEYLKKAQDEDDPETWVDVAEAMIDLGN 365 (404)
T ss_dssp HHHHHHHHHHHHHS--TTHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhhccCChhHHHHHHHHHhhhhc
Confidence 78889988885566778889999999999876
No 341
>cd03420 SirA_RHOD_Pry_redox SirA_RHOD_Pry_redox. SirA-like domain located within a multidomain protein of unknown function. Other domains include RHOD (rhodanese homology domain), and Pry_redox (pyridine nucleotide-disulphide oxidoreductase) as well as a C-terminal domain that corresponds to COG2210. This fold is referred to as a two-layered alpha/beta sandwich, structurally similar to that of translation initiation factor 3.
Probab=37.64 E-value=31 Score=19.29 Aligned_cols=19 Identities=11% Similarity=0.289 Sum_probs=16.1
Q ss_pred chhcccCCCcEEEEEEcCC
Q psy18096 3 YVLPLMETGEECQIEITAR 21 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~ 21 (131)
.+|.+|+.||...|.++..
T Consensus 18 kal~~l~~G~~l~V~~d~~ 36 (69)
T cd03420 18 KEIDKLQDGEQLEVKASDP 36 (69)
T ss_pred HHHHcCCCCCEEEEEECCc
Confidence 5789999999999998753
No 342
>cd03422 YedF YedF is a bacterial SirA-like protein of unknown function. SirA (also known as UvrY, and YhhP) belongs to a family of a two-component response regulators that controls secondary metabolism and virulence. The other member of this two-component system is a sensor kinase called BarA which phosphorylates SirA. A variety of microorganisms have similar proteins, all of which contain a common CPxP sequence motif in the N-terminal region. YhhP is suggested to be important for normal cell division and growth in rich nutrient medium. Moreover, despite a low primary sequence similarity, the YccP structure closely resembles the non-homologous C-terminal RNA-binding domain of E. coli translation initiation factor IF3. The signature CPxP motif serves to stabilize the N-terminal helix as part of the N-capping box and might be important in mRNA-binding.
Probab=37.29 E-value=31 Score=19.30 Aligned_cols=19 Identities=21% Similarity=0.298 Sum_probs=16.2
Q ss_pred chhcccCCCcEEEEEEcCC
Q psy18096 3 YVLPLMETGEECQIEITAR 21 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~ 21 (131)
.+|.+|..||...|.++..
T Consensus 18 kal~~l~~G~~l~V~~d~~ 36 (69)
T cd03422 18 EALPSLKPGEILEVISDCP 36 (69)
T ss_pred HHHHcCCCCCEEEEEecCc
Confidence 5789999999999988753
No 343
>KOG2471|consensus
Probab=36.61 E-value=1.3e+02 Score=24.55 Aligned_cols=51 Identities=22% Similarity=0.240 Sum_probs=41.2
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
++..+...+|-|.|-+-+.+|+.-.|+...++.|+.. +-.++|--.|..|+
T Consensus 503 ~leNm~~ai~A~~ayV~L~Lgd~i~AL~~a~kLLq~~-~lS~~~kfLGHiYA 553 (696)
T KOG2471|consen 503 DLENMRQAIFANMAYVELELGDPIKALSAATKLLQLA-DLSKIYKFLGHIYA 553 (696)
T ss_pred HHHHHHHHHHHHHHHHHHHhcChhhHHHHHHHHHhhh-hhhhHHHHHHHHHH
Confidence 5566788899999999999999999999999999874 45666655555544
No 344
>smart00668 CTLH C-terminal to LisH motif. Alpha-helical motif of unknown function.
Probab=36.22 E-value=40 Score=17.62 Aligned_cols=20 Identities=25% Similarity=0.238 Sum_probs=15.5
Q ss_pred HHHHHhCChHHHHHHHHHHH
Q psy18096 92 QAQIKLGSLEPALMSLENVL 111 (131)
Q Consensus 92 ~~~~kl~~~~~a~~~~~~al 111 (131)
...+..|+|++|+..|+..-
T Consensus 9 ~~~i~~g~~~~a~~~~~~~~ 28 (58)
T smart00668 9 RELILKGDWDEALEWLSSLK 28 (58)
T ss_pred HHHHHcCCHHHHHHHHHHcC
Confidence 34456899999999998653
No 345
>PF01157 Ribosomal_L21e: Ribosomal protein L21e; InterPro: IPR001147 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. L21E family contains proteins from a number of eukaryotic and archaebacterial organisms which include; mammalian L2, Entamoeba histolytica L21, Caenorhabditis elegans L21 (C14B9.7), Saccharomyces cerevisiae (Baker's yeast) L21E (URP1) and Haloarcula marismortui HL31.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 3IZR_U 1S1I_Q 3O58_T 3IZS_U 3O5H_T 1Q82_R 1KQS_P 3CCJ_Q 3CCQ_Q 1VQ5_Q ....
Probab=35.28 E-value=38 Score=20.84 Aligned_cols=25 Identities=20% Similarity=0.274 Sum_probs=12.9
Q ss_pred cchhcccCCCcEEEEEEcCCCCCCC
Q psy18096 2 DYVLPLMETGEECQIEITARFGYGD 26 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i~~~~ayG~ 26 (131)
...|+..++|+.+.+.++|...-|.
T Consensus 27 s~~l~~yk~GD~V~I~id~sv~kGm 51 (99)
T PF01157_consen 27 STYLQEYKVGDKVDIKIDPSVHKGM 51 (99)
T ss_dssp HHHH----TT-EEEE---TTSSSSS
T ss_pred HHHHHHccCCCEEEEEecCccccCC
Confidence 4578899999999999999875543
No 346
>KOG1811|consensus
Probab=35.17 E-value=88 Score=26.12 Aligned_cols=46 Identities=15% Similarity=0.130 Sum_probs=37.9
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCC
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDP 115 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp 115 (131)
|+-|+...++ .++ ...+|--.+.+.+|++++..|..-+.+++++..
T Consensus 572 YqlaV~mckKc~iD--~f~aW~AWGlA~Lk~e~~aaAR~KFkqafklkg 618 (1141)
T KOG1811|consen 572 YQLAVEMCKKCGID--TFGAWHAWGLACLKAENLAAAREKFKQAFKLKG 618 (1141)
T ss_pred HHHHHHHHhhcCCC--cccHHHHHHHHHHHhhhHHHHHHHHHHHhCCCC
Confidence 8888877665 333 445788899999999999999999999999873
No 347
>COG4941 Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription]
Probab=35.15 E-value=1.4e+02 Score=23.02 Aligned_cols=44 Identities=16% Similarity=0.129 Sum_probs=37.0
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
+-.++.-+|--..++|+-.+|-..|++++.+-++-....|-+.+
T Consensus 364 y~~~h~~RadlL~rLgr~~eAr~aydrAi~La~~~aer~~l~~r 407 (415)
T COG4941 364 YHLYHAARADLLARLGRVEEARAAYDRAIALARNAAERAFLRQR 407 (415)
T ss_pred ccccHHHHHHHHHHhCChHHHHHHHHHHHHhcCChHHHHHHHHH
Confidence 45577789999999999999999999999999988776665544
No 348
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=35.15 E-value=1.4e+02 Score=20.07 Aligned_cols=47 Identities=17% Similarity=0.058 Sum_probs=36.4
Q ss_pred HHHHHHHHHH-HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 68 GILASQELLD-SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 68 y~~Ai~~~~~-~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
.+.-|+.+.. ..+.....++..+|.-|.+.|++++|++.+.++....
T Consensus 19 Le~elk~~~~n~~kesir~~~~~l~~~~~~~Gd~~~A~k~y~~~~~~~ 66 (177)
T PF10602_consen 19 LEAELKDAKSNLGKESIRMALEDLADHYCKIGDLEEALKAYSRARDYC 66 (177)
T ss_pred HHHHHHHHHhccchHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHhhhc
Confidence 4455665655 4455566789999999999999999999999988754
No 349
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=34.91 E-value=1.6e+02 Score=22.40 Aligned_cols=49 Identities=18% Similarity=-0.050 Sum_probs=38.9
Q ss_pred HHHHHHHHHHHHHH-------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 64 LDFGGILASQELLD-------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 64 ~~~~y~~Ai~~~~~-------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.+..|..|+...+- ...|..+..++.++.++..+|++..|-+..++||-
T Consensus 13 ~q~~F~~~v~~~Dp~~l~~ll~~~PyHidtLlqls~v~~~~gd~~~A~~lleRALf 68 (360)
T PF04910_consen 13 AQEQFYAAVQSHDPNALINLLQKNPYHIDTLLQLSEVYRQQGDHAQANDLLERALF 68 (360)
T ss_pred HHHHHHHHHHccCHHHHHHHHHHCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHH
Confidence 33336667766644 44688899999999999999999999999998874
No 350
>KOG2581|consensus
Probab=34.61 E-value=1.7e+02 Score=23.18 Aligned_cols=50 Identities=14% Similarity=0.005 Sum_probs=38.0
Q ss_pred HHHHHHHHHH------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 68 GILASQELLD------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 68 y~~Ai~~~~~------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
|.+|-++..+ ......+.+++-++....-+.+|..|.+++-+|+...|+.
T Consensus 225 ydqa~~lvsK~~~pe~~snne~ARY~yY~GrIkaiqldYssA~~~~~qa~rkapq~ 280 (493)
T KOG2581|consen 225 YDQADKLVSKSVYPEAASNNEWARYLYYLGRIKAIQLDYSSALEYFLQALRKAPQH 280 (493)
T ss_pred HHHHHHHhhcccCccccccHHHHHHHHHHhhHHHhhcchhHHHHHHHHHHHhCcch
Confidence 5666555555 2234556666778888888999999999999999999973
No 351
>COG2139 RPL21A Ribosomal protein L21E [Translation, ribosomal structure and biogenesis]
Probab=34.21 E-value=44 Score=20.48 Aligned_cols=25 Identities=20% Similarity=0.299 Sum_probs=20.6
Q ss_pred cchhcccCCCcEEEEEEcCCCCCCC
Q psy18096 2 DYVLPLMETGEECQIEITARFGYGD 26 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i~~~~ayG~ 26 (131)
-..|+..++|+.+-|.|+|...-|.
T Consensus 27 sr~l~ey~~Gd~V~I~IdpSv~kGm 51 (98)
T COG2139 27 SRYLQEYKVGDKVHIDIDPSVHKGM 51 (98)
T ss_pred hhHHhhccCCCEEEEEeCcccccCC
Confidence 3568889999999999999876553
No 352
>cd09243 BRO1_Brox_like Protein-interacting Bro1-like domain of human Brox1 and related proteins. This family contains the Bro1-like domain of a single-domain protein, human Brox, and related domains. It belongs to the BRO1_Alix_like superfamily which also includes the Bro1-like domains of mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), RhoA-binding proteins Rhophilin-1 and -2, Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, Ustilago maydis Rim23 (also known as PalC), and related domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23, interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Bro1-like domains are boomerang-shaped, and part of the domain is a tetratricopeptide repeat (TPR)-like structure. Bro1-like domains bind components of the ESCRT-III complex: CHMP4 in the case of Brox. Human Brox can bind to human immunodeficiency virus type 1 (
Probab=34.14 E-value=1.8e+02 Score=22.16 Aligned_cols=45 Identities=13% Similarity=-0.003 Sum_probs=30.5
Q ss_pred HHHHHHHHHH---------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD---------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~---------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
|++|.+.++. ...-..+.+|+..|......+++.+||..+..|..
T Consensus 217 Y~~A~~~l~~~~~~i~~~W~~~v~~K~~~f~A~A~y~~a~~l~e~~k~GeaIa~L~~A~~ 276 (353)
T cd09243 217 FQKADDSLSSLDPEYSGKWRKYLQLKSVFYLAYAYCYHGETLLAKDKCGEAIRSLQESEK 276 (353)
T ss_pred HHHHHHHHHcCCccccHHHHHHHHHHHHHHHHHHHHHHHHHhHhcchHHHHHHHHHHHHH
Confidence 8888777653 11223455566666666667889999998887775
No 353
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=34.09 E-value=1.3e+02 Score=23.59 Aligned_cols=43 Identities=23% Similarity=0.169 Sum_probs=27.7
Q ss_pred HhhhhhHHHHHHHHHHH-HHhCChHHHHHHHHHHHhhCCCCHHH
Q psy18096 78 SLIKERINCYNNLAQAQ-IKLGSLEPALMSLENVLNLDPNNIKA 120 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~-~kl~~~~~a~~~~~~al~~dp~~~ka 120 (131)
.++|....+=++.|... .|.++|.-|-..+.+.|++.|+...+
T Consensus 293 ~LQp~H~~LaLr~AM~~~~K~KNf~tAa~FArRLLel~p~~~~a 336 (422)
T PF06957_consen 293 KLQPSHLILALRSAMSQAFKLKNFITAASFARRLLELNPSPEVA 336 (422)
T ss_dssp ---HHHHHHHHHHHHHHCCCTTBHHHHHHHHHHHHCT--SCHHH
T ss_pred CCcHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHcCCCHHHH
Confidence 34455555445555444 37899999999999999999977543
No 354
>KOG4151|consensus
Probab=33.72 E-value=56 Score=27.46 Aligned_cols=44 Identities=23% Similarity=0.196 Sum_probs=38.3
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
...|....+++-||.||.-++.++-|+++..-+...+|++..+.
T Consensus 121 ~~~p~i~~~Ll~r~~~y~al~k~d~a~rdl~i~~~~~p~~~~~~ 164 (748)
T KOG4151|consen 121 ESQPRISKALLKRARKYEALNKLDLAVRDLRIVEKMDPSNVSAS 164 (748)
T ss_pred hccchHHHHHhhhhhHHHHHHHHHHHHHHHHHHhcCCCCcchHH
Confidence 45688888999999999999999999999888899999995443
No 355
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=31.90 E-value=2.5e+02 Score=21.91 Aligned_cols=47 Identities=13% Similarity=0.195 Sum_probs=36.1
Q ss_pred hHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 83 RINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 83 ~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
...++.-++.-.+..++++.|....+++++..|.+...+-=.-++|+
T Consensus 152 ~l~v~ltrarlll~~~d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~ 198 (400)
T COG3071 152 TLAVELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYI 198 (400)
T ss_pred hHHHHHHHHHHHHhCCCchhHHHHHHHHHHhCcCChHHHHHHHHHHH
Confidence 44556667777778899999999999999999999877655555543
No 356
>PF12309 KBP_C: KIF-1 binding protein C terminal; InterPro: IPR022083 This family of proteins is found in bacteria and eukaryotes. Proteins in this family are typically between 365 and 621 amino acids in length. There is a conserved LLP sequence motif. KBP is a binding partner for KIF1Balpha that is a regulator of its transport function and thus represents a type of kinesin interacting protein.
Probab=31.87 E-value=93 Score=23.84 Aligned_cols=40 Identities=20% Similarity=0.255 Sum_probs=29.5
Q ss_pred hhhhHHHHHHHHHHHHHh---------CChHHHHHHHHHHHhhCCCCHH
Q psy18096 80 IKERINCYNNLAQAQIKL---------GSLEPALMSLENVLNLDPNNIK 119 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl---------~~~~~a~~~~~~al~~dp~~~k 119 (131)
......++.++|..|.|+ +.+..++..+..+..+-..|..
T Consensus 296 ~~~~l~a~f~~arl~~K~~~~~~~~~~~~l~~sl~~y~~vv~y~~~~~~ 344 (371)
T PF12309_consen 296 LRPYLYAYFHIARLYSKLITSDPKEQLENLEKSLEYYKWVVDYCEKHPE 344 (371)
T ss_pred HHHHHHHHHHHHHHHccccCCChHHHHHHHHHHHHHHHHHHHHHHhChh
Confidence 345677888999999987 5677888888888776544444
No 357
>PF05688 DUF824: Salmonella repeat of unknown function (DUF824); InterPro: IPR008542 This family consists of a series of repeated sequences (of around 180 residues) which are found in Salmonella typhimurium, Salmonella typhi and Escherichia coli. These repeats are almost always found with this entry. The repeats are associated with RatA and RatB, the coding sequences of which are found in the pathogeneicity island of Salmonella. The sequences may be determinants of pathogenicity [, ].
Probab=31.79 E-value=57 Score=17.14 Aligned_cols=21 Identities=19% Similarity=0.297 Sum_probs=16.7
Q ss_pred hcccCCCcEEEEEEcCCCCCC
Q psy18096 5 LPLMETGEECQIEITARFGYG 25 (131)
Q Consensus 5 l~~M~~gE~~~v~i~~~~ayG 25 (131)
....++||...+++.-.-+-|
T Consensus 6 aakaK~Ge~I~ltVt~kda~G 26 (47)
T PF05688_consen 6 AAKAKVGETIPLTVTVKDANG 26 (47)
T ss_pred hhheecCCeEEEEEEEECCCC
Confidence 346789999999999887633
No 358
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.54 E-value=1.9e+02 Score=20.38 Aligned_cols=50 Identities=14% Similarity=-0.029 Sum_probs=38.2
Q ss_pred HHHHHHHHHHHhhh-hhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCC
Q psy18096 68 GILASQELLDSLIK-ERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNN 117 (131)
Q Consensus 68 y~~Ai~~~~~~~~~-~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~ 117 (131)
|.+|+..++....+ -.....-=++-.++..|+-.+|+..+.++++.++++
T Consensus 142 ~D~AL~~L~t~~~~~w~~~~~elrGDill~kg~k~~Ar~ay~kAl~~~~s~ 192 (207)
T COG2976 142 ADAALKTLDTIKEESWAAIVAELRGDILLAKGDKQEARAAYEKALESDASP 192 (207)
T ss_pred HHHHHHHHhccccccHHHHHHHHhhhHHHHcCchHHHHHHHHHHHHccCCh
Confidence 88888888763322 223334468999999999999999999999987554
No 359
>KOG2396|consensus
Probab=31.26 E-value=2e+02 Score=23.40 Aligned_cols=45 Identities=13% Similarity=0.145 Sum_probs=36.6
Q ss_pred HhhhhhHHHHHHHHHHHHHhCC-hHHHHHHHHHHHhhCCCCHHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGS-LEPALMSLENVLNLDPNNIKALQ 122 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~-~~~a~~~~~~al~~dp~~~ka~~ 122 (131)
...|.++.+|---|.-....+. .+.|...+.+.|+.+|++++.|.
T Consensus 133 ~~Hp~~~dLWI~aA~wefe~n~ni~saRalflrgLR~npdsp~Lw~ 178 (568)
T KOG2396|consen 133 AKHPNNPDLWIYAAKWEFEINLNIESARALFLRGLRFNPDSPKLWK 178 (568)
T ss_pred HhCCCCchhHHhhhhhHHhhccchHHHHHHHHHHhhcCCCChHHHH
Confidence 4567888888877777777665 89999999999999999987654
No 360
>COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only]
Probab=30.83 E-value=2e+02 Score=20.49 Aligned_cols=56 Identities=11% Similarity=-0.159 Sum_probs=43.5
Q ss_pred HHHHHHHHHHhhhhhHHHHHHHHHHHHH----hCChHHHHHHHHHHHhhCCCCHHHHHHHHH
Q psy18096 69 ILASQELLDSLIKERINCYNNLAQAQIK----LGSLEPALMSLENVLNLDPNNIKALQRKAK 126 (131)
Q Consensus 69 ~~Ai~~~~~~~~~~~~~~~~N~a~~~~k----l~~~~~a~~~~~~al~~dp~~~ka~~rrg~ 126 (131)
.+|+..|.+-.......+..+++.+|.+ -.++.+|+..+.++-+... ..+.|+.+.
T Consensus 172 ~~A~~~~~~aa~~~~~~a~~~lg~~y~~G~Gv~~d~~~A~~wy~~Aa~~g~--~~a~~~~~~ 231 (292)
T COG0790 172 KKALYLYRKAAELGNPDAQLLLGRMYEKGLGVPRDLKKAFRWYKKAAEQGD--GAACYNLGL 231 (292)
T ss_pred HhHHHHHHHHHHhcCHHHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHCCC--HHHHHHHHH
Confidence 4677777764444477888899988876 3589999999999998876 888888873
No 361
>KOG0985|consensus
Probab=30.10 E-value=1.4e+02 Score=26.75 Aligned_cols=29 Identities=24% Similarity=0.316 Sum_probs=25.6
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHH
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENV 110 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~a 110 (131)
..+.+|..+|.+++..+...+||+.+-+|
T Consensus 1102 n~p~vWsqlakAQL~~~~v~dAieSyika 1130 (1666)
T KOG0985|consen 1102 NEPAVWSQLAKAQLQGGLVKDAIESYIKA 1130 (1666)
T ss_pred CChHHHHHHHHHHHhcCchHHHHHHHHhc
Confidence 45678999999999999999999998775
No 362
>KOG3677|consensus
Probab=29.82 E-value=47 Score=26.13 Aligned_cols=26 Identities=15% Similarity=0.151 Sum_probs=22.8
Q ss_pred HHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 88 NNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 88 ~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
+-.+.+|+.|++|.+|++.+..+|..
T Consensus 276 Y~VGFayLmmrryadai~~F~niLly 301 (525)
T KOG3677|consen 276 YQVGFAYLMMRRYADAIRVFLNILLY 301 (525)
T ss_pred eehhHHHHHHHHHHHHHHHHHHHHHH
Confidence 45789999999999999999988864
No 363
>PF05053 Menin: Menin; InterPro: IPR007747 MEN1, the gene responsible for multiple endocrine neoplasia type 1, is a tumour suppressor gene that encodes a protein called Menin which may be an atypical GTPase stimulated by nm23 [].; GO: 0005634 nucleus; PDB: 3RE2_A 3U84_B 3U86_A 3U88_B 3U85_A.
Probab=29.72 E-value=1.7e+02 Score=24.05 Aligned_cols=43 Identities=14% Similarity=-0.037 Sum_probs=28.4
Q ss_pred HHHHHHHHHHHhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHH
Q psy18096 68 GILASQELLDSLIKERINCYNNLAQAQIKLGSLEPALMSLENV 110 (131)
Q Consensus 68 y~~Ai~~~~~~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~a 110 (131)
|.+||..-..-.....+-=|.-+|-+|.+.++|.+|++...+|
T Consensus 302 ~~~AI~sa~~~Y~n~HvYPYty~gg~~yR~~~~~eA~~~Wa~a 344 (618)
T PF05053_consen 302 FNEAISSARTYYNNHHVYPYTYLGGYYYRHKRYREALRSWAEA 344 (618)
T ss_dssp HHHHHHHHHHHCTT--SHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHhcCCccccceehhhHHHHHHHHHHHHHHHHHH
Confidence 7777776655222223333777899999999999999866554
No 364
>PRK15326 type III secretion system needle complex protein PrgI; Provisional
Probab=29.59 E-value=58 Score=19.22 Aligned_cols=25 Identities=16% Similarity=0.239 Sum_probs=18.8
Q ss_pred CChHHHHHHHHHHHhhCCCCHHHHH
Q psy18096 98 GSLEPALMSLENVLNLDPNNIKALQ 122 (131)
Q Consensus 98 ~~~~~a~~~~~~al~~dp~~~ka~~ 122 (131)
++..+++...-+.++.+|+|+-.+-
T Consensus 21 ~~~~~~l~~Al~~l~~~pdnP~~LA 45 (80)
T PRK15326 21 DNLQTQVTEALDKLAAKPSDPALLA 45 (80)
T ss_pred HHHHHHHHHHHHHhhcCCCCHHHHH
Confidence 5566777777788888999887654
No 365
>KOG4507|consensus
Probab=29.11 E-value=1.3e+02 Score=25.05 Aligned_cols=63 Identities=17% Similarity=0.065 Sum_probs=46.8
Q ss_pred HHHHHHHHHH--Hhhhhh-HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhhc
Q psy18096 68 GILASQELLD--SLIKER-INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLIL 130 (131)
Q Consensus 68 y~~Ai~~~~~--~~~~~~-~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~~ 130 (131)
-..|+.++.. ...|.. -.-..|+|...++-+...+|-.-..++|.+.-.-+-.+|-.|.+++.
T Consensus 623 ~~~a~~cl~~a~~~~p~~~~v~~v~la~~~~~~~~~~da~~~l~q~l~~~~sepl~~~~~g~~~l~ 688 (886)
T KOG4507|consen 623 STFAIACLQRALNLAPLQQDVPLVNLANLLIHYGLHLDATKLLLQALAINSSEPLTFLSLGNAYLA 688 (886)
T ss_pred cHHHHHHHHHHhccChhhhcccHHHHHHHHHHhhhhccHHHHHHHHHhhcccCchHHHhcchhHHH
Confidence 4677777776 222322 23456999999999999999999999999997667777777777654
No 366
>KOG0890|consensus
Probab=28.32 E-value=86 Score=29.98 Aligned_cols=35 Identities=17% Similarity=0.158 Sum_probs=27.8
Q ss_pred HHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 90 LAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 90 ~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
.-.-+-..|+|..|..+|+++++.+|++.+.+-+.
T Consensus 1455 qil~~e~~g~~~da~~Cye~~~q~~p~~~~~~~g~ 1489 (2382)
T KOG0890|consen 1455 QILEHEASGNWADAAACYERLIQKDPDKEKHHSGV 1489 (2382)
T ss_pred HHHHHHhhccHHHHHHHHHHhhcCCCccccchhhH
Confidence 33445567999999999999999999988766543
No 367
>PF15469 Sec5: Exocyst complex component Sec5
Probab=28.01 E-value=68 Score=21.52 Aligned_cols=21 Identities=24% Similarity=0.289 Sum_probs=17.5
Q ss_pred HHhCChHHHHHHHHHHHhhCC
Q psy18096 95 IKLGSLEPALMSLENVLNLDP 115 (131)
Q Consensus 95 ~kl~~~~~a~~~~~~al~~dp 115 (131)
++.|+|+.++.++.+|..+-.
T Consensus 97 i~~~dy~~~i~dY~kak~l~~ 117 (182)
T PF15469_consen 97 IKKGDYDQAINDYKKAKSLFE 117 (182)
T ss_pred HHcCcHHHHHHHHHHHHHHHH
Confidence 567999999999999887643
No 368
>KOG2460|consensus
Probab=27.98 E-value=2.2e+02 Score=23.24 Aligned_cols=25 Identities=12% Similarity=-0.011 Sum_probs=21.9
Q ss_pred HHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 89 NLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 89 N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
-+|..|.-.+.|.+|+..|.+|...
T Consensus 427 ~iA~sY~a~~K~~EAlALy~Ra~sy 451 (593)
T KOG2460|consen 427 YIAVSYQAKKKYSEALALYVRAYSY 451 (593)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5788889999999999999998863
No 369
>KOG0686|consensus
Probab=27.85 E-value=3.1e+02 Score=21.72 Aligned_cols=46 Identities=20% Similarity=0.043 Sum_probs=36.9
Q ss_pred HHHHHHHHHH-----HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 68 GILASQELLD-----SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 68 y~~Ai~~~~~-----~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
...|++.|.+ ......+..+.|.-.+-+-+++|.+....-++|.+.
T Consensus 166 l~~Alr~YsR~RdYCTs~khvInm~ln~i~VSI~~~nw~hv~sy~~~A~st 216 (466)
T KOG0686|consen 166 LDNALRCYSRARDYCTSAKHVINMCLNLILVSIYMGNWGHVLSYISKAEST 216 (466)
T ss_pred HHHHHhhhhhhhhhhcchHHHHHHHHHHHHHHHhhcchhhhhhHHHHHHhC
Confidence 5677888777 334556778889999999999999999999988876
No 370
>KOG4563|consensus
Probab=27.75 E-value=1.3e+02 Score=23.18 Aligned_cols=40 Identities=18% Similarity=0.204 Sum_probs=29.2
Q ss_pred HHHHHHhCChHHHHHHHHHHHhhC--------CCCHHHHHHHHHhhhc
Q psy18096 91 AQAQIKLGSLEPALMSLENVLNLD--------PNNIKALQRKAKCLIL 130 (131)
Q Consensus 91 a~~~~kl~~~~~a~~~~~~al~~d--------p~~~ka~~rrg~a~~~ 130 (131)
+.=.+.+++++.|...+..|..+- -.+..++|-.|++++.
T Consensus 48 G~~~~~~~d~~~Avda~s~A~~l~~ei~Ge~~~e~~eal~~YGkslLe 95 (400)
T KOG4563|consen 48 GRRALCNNDIDKAVDALSEATELSDEIYGEKHLETFEALFLYGKSLLE 95 (400)
T ss_pred hhHHHhcccHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHH
Confidence 333445788888888888887754 2557888999988864
No 371
>cd03423 SirA SirA (also known as UvrY, and YhhP) belongs to a family of two-component response regulators that controls secondary metabolism and virulence. The other member of this two-component system is a sensor kinase called BarA which phosphorylates SirA. A variety of microorganisms have similar proteins, all of which contain a common CPxP sequence motif in the N-terminal region. YhhP is thought to be important for normal cell division and growth in rich nutrient medium. Moreover, despite a low primary sequence similarity, the YccP structure closely resembles the non-homologous C-terminal RNA-binding domain of E. coli translation initiation factor IF3. The signature CPxP motif serves to stabilize the N-terminal helix as part of the N-capping box and might be important in mRNA-binding.
Probab=27.64 E-value=58 Score=18.14 Aligned_cols=19 Identities=16% Similarity=0.134 Sum_probs=15.8
Q ss_pred chhcccCCCcEEEEEEcCC
Q psy18096 3 YVLPLMETGEECQIEITAR 21 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~ 21 (131)
.++..|..||...|.++-.
T Consensus 18 ~~l~~l~~G~~l~V~~dd~ 36 (69)
T cd03423 18 KKVRKMKPGDTLLVLATDP 36 (69)
T ss_pred HHHHcCCCCCEEEEEeCCC
Confidence 5788999999999988743
No 372
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=27.23 E-value=1.9e+02 Score=22.77 Aligned_cols=44 Identities=18% Similarity=0.105 Sum_probs=34.4
Q ss_pred HHHHHHH--HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 71 ASQELLD--SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 71 Ai~~~~~--~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
|.+++.. ..-|...-.+...|..+...|+.++|++.+++++...
T Consensus 252 a~~lL~~~~~~yP~s~lfl~~~gR~~~~~g~~~~Ai~~~~~a~~~q 297 (468)
T PF10300_consen 252 AEELLEEMLKRYPNSALFLFFEGRLERLKGNLEEAIESFERAIESQ 297 (468)
T ss_pred HHHHHHHHHHhCCCcHHHHHHHHHHHHHhcCHHHHHHHHHHhccch
Confidence 4444444 3447777788899999999999999999999999543
No 373
>KOG0292|consensus
Probab=26.76 E-value=4.5e+02 Score=23.27 Aligned_cols=30 Identities=27% Similarity=0.232 Sum_probs=25.2
Q ss_pred HHHHHhCChHHHHHHHHHHHhhCCCCHHHH
Q psy18096 92 QAQIKLGSLEPALMSLENVLNLDPNNIKAL 121 (131)
Q Consensus 92 ~~~~kl~~~~~a~~~~~~al~~dp~~~ka~ 121 (131)
.++.|++++..|-.-+.+.|++-|.-..|-
T Consensus 1092 n~ffK~kN~ktAs~fa~rLlel~~~~~~A~ 1121 (1202)
T KOG0292|consen 1092 NVFFKLKNLKTAAEFARRLLELAPSPPVAE 1121 (1202)
T ss_pred HHHHHhccHHHHHHHHHHHHhhCCCChHHH
Confidence 456799999999999999999998765553
No 374
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=26.70 E-value=1.3e+02 Score=22.83 Aligned_cols=34 Identities=24% Similarity=0.218 Sum_probs=27.7
Q ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHhhCCC-CHH
Q psy18096 86 CYNNLAQAQIKLGSLEPALMSLENVLNLDPN-NIK 119 (131)
Q Consensus 86 ~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~-~~k 119 (131)
+.........+.|-|.-|++.|.-.+.+||. ++-
T Consensus 105 al~r~i~~L~~RG~~rTAlE~~KlLlsLdp~~DP~ 139 (360)
T PF04910_consen 105 ALFRYIQSLGRRGCWRTALEWCKLLLSLDPDEDPL 139 (360)
T ss_pred HHHHHHHHHHhcCcHHHHHHHHHHHHhcCCCCCcc
Confidence 3345667777889999999999999999998 543
No 375
>PF04505 Dispanin: Interferon-induced transmembrane protein; InterPro: IPR007593 This family includes the human leukocyte antigen CD225, which is an interferon inducible transmembrane protein, and is associated with interferon induced cell growth suppression [].; GO: 0009607 response to biotic stimulus, 0016021 integral to membrane
Probab=26.30 E-value=88 Score=18.31 Aligned_cols=30 Identities=13% Similarity=0.132 Sum_probs=22.1
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
+.++.+.+.-....|++++|.+...+|..+
T Consensus 37 Ai~~s~kv~~~~~~Gd~~~A~~aS~~Ak~~ 66 (82)
T PF04505_consen 37 AIVYSSKVRSRYAAGDYEGARRASRKAKKW 66 (82)
T ss_pred HheechhhHHHHHCCCHHHHHHHHHHhHHH
Confidence 344556777777889999999888877654
No 376
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=25.99 E-value=79 Score=14.28 Aligned_cols=28 Identities=29% Similarity=0.139 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHh----CChHHHHHHHHHHHhh
Q psy18096 86 CYNNLAQAQIKL----GSLEPALMSLENVLNL 113 (131)
Q Consensus 86 ~~~N~a~~~~kl----~~~~~a~~~~~~al~~ 113 (131)
+..++|.+|..= .++.+|+..+.++-+.
T Consensus 3 a~~~lg~~~~~G~g~~~d~~~A~~~~~~Aa~~ 34 (36)
T smart00671 3 AQYNLGQMYEYGLGVKKDLEKALEYYKKAAEL 34 (36)
T ss_pred HHHHHHHHHHcCCCCCcCHHHHHHHHHHHHHc
Confidence 456777777642 3777888877777653
No 377
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=25.89 E-value=3.4e+02 Score=21.56 Aligned_cols=50 Identities=12% Similarity=-0.064 Sum_probs=34.3
Q ss_pred HHHHHHHHHH----------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCH
Q psy18096 68 GILASQELLD----------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNI 118 (131)
Q Consensus 68 y~~Ai~~~~~----------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ 118 (131)
|..||++.+. ..+.....++.-.|...+. -+...|..+..++++++|+-+
T Consensus 204 Wd~AlkLvd~~~~~~vie~~~aeR~rAvLLtAkA~s~ld-adp~~Ar~~A~~a~KL~pdlv 263 (531)
T COG3898 204 WDGALKLVDAQRAAKVIEKDVAERSRAVLLTAKAMSLLD-ADPASARDDALEANKLAPDLV 263 (531)
T ss_pred hHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHhc-CChHHHHHHHHHHhhcCCccc
Confidence 8899998887 2223344444444444333 358899999999999999874
No 378
>cd09240 BRO1_Alix Protein-interacting, N-terminal, Bro1-like domain of mammalian Alix and related domains. This family contains the N-terminal, Bro1-like domain of mammalian Alix (apoptosis-linked gene-2 interacting protein X), also called apoptosis-linked gene-2 interacting protein 1 (AIP1). It belongs to the BRO1_Alix_like superfamily which also includes the Bro1-like domains of His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), RhoA-binding proteins Rhophilin-1 and -2, Brox, Bro1 and Rim20 (also known as PalA) from Saccharomyces cerevisiae, Ustilago maydis Rim23 (also known as PalC), and related domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23, interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Alix participates in membrane remodeling processes during the budding of enveloped viruses, vesicle budding inside late endosomal multivesicular bodies (MVBs), and the abscission reactions of mammalian cell division. It also f
Probab=25.35 E-value=2.8e+02 Score=20.86 Aligned_cols=29 Identities=7% Similarity=0.014 Sum_probs=20.8
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
..+++..|......++|.++|..+..|..
T Consensus 255 a~A~y~~a~~~~e~~k~GeaIa~L~~A~~ 283 (346)
T cd09240 255 ALAEYHQSLVAKAQKKFGEEIARLQHALE 283 (346)
T ss_pred HHHHHHHHHHhhhhchHHHHHHHHHHHHH
Confidence 33455566666666889999998888776
No 379
>KOG2300|consensus
Probab=24.94 E-value=3.9e+02 Score=21.86 Aligned_cols=33 Identities=12% Similarity=0.159 Sum_probs=27.0
Q ss_pred hhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 81 KERINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
......+-|++.|.+-.|++.+|++.-..+.+.
T Consensus 320 m~km~~LE~iv~c~lv~~~~~~al~~i~dm~~w 352 (629)
T KOG2300|consen 320 MFKMILLEHIVMCRLVRGDYVEALEEIVDMKNW 352 (629)
T ss_pred HHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHH
Confidence 455667779999999999999999877776654
No 380
>PF10952 DUF2753: Protein of unknown function (DUF2753); InterPro: IPR020206 This entry represents a group of uncharacterised proteins.
Probab=24.58 E-value=2.1e+02 Score=18.66 Aligned_cols=50 Identities=24% Similarity=0.150 Sum_probs=37.1
Q ss_pred HHHHHHHHHH----------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHH----HHHhhCCCC
Q psy18096 68 GILASQELLD----------SLIKERINCYNNLAQAQIKLGSLEPALMSLE----NVLNLDPNN 117 (131)
Q Consensus 68 y~~Ai~~~~~----------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~----~al~~dp~~ 117 (131)
|++|+..-.. ++-...+..--|+|.-+..+|+-+-.+++.. +++.+-|..
T Consensus 24 YQqAls~se~~~~~~~~el~dll~i~VisCHNLA~FWR~~gd~~yELkYLqlASE~VltLiPQC 87 (140)
T PF10952_consen 24 YQQALSLSEEIDESNEIELEDLLTISVISCHNLADFWRSQGDSDYELKYLQLASEKVLTLIPQC 87 (140)
T ss_pred HHHHHHHHHHhcccccccHHHHHHHHHHHHhhHHHHHHHcCChHHHHHHHHHHHHHHHHhccCC
Confidence 8888877665 3334555666799999999999888888654 677777754
No 381
>KOG2047|consensus
Probab=24.49 E-value=1.2e+02 Score=25.41 Aligned_cols=32 Identities=22% Similarity=0.149 Sum_probs=28.3
Q ss_pred hhHHHHHHHHHHHHHhCChHHHHHHHHHHHhh
Q psy18096 82 ERINCYNNLAQAQIKLGSLEPALMSLENVLNL 113 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~ 113 (131)
.-..+++.+|.-|++.|.|++|.+.+.++|+-
T Consensus 246 q~g~Lw~SLAdYYIr~g~~ekarDvyeeai~~ 277 (835)
T KOG2047|consen 246 QLGFLWCSLADYYIRSGLFEKARDVYEEAIQT 277 (835)
T ss_pred HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHh
Confidence 34568999999999999999999999999864
No 382
>PRK11018 hypothetical protein; Provisional
Probab=24.42 E-value=68 Score=18.47 Aligned_cols=19 Identities=21% Similarity=0.321 Sum_probs=16.1
Q ss_pred chhcccCCCcEEEEEEcCC
Q psy18096 3 YVLPLMETGEECQIEITAR 21 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~ 21 (131)
.+|..|+.||...|.++..
T Consensus 27 k~l~~l~~G~~L~V~~d~~ 45 (78)
T PRK11018 27 EALPQLKKGEILEVVSDCP 45 (78)
T ss_pred HHHHhCCCCCEEEEEeCCc
Confidence 5789999999999988853
No 383
>COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]
Probab=24.35 E-value=1.9e+02 Score=23.82 Aligned_cols=47 Identities=30% Similarity=0.127 Sum_probs=36.8
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHH-HHHHHhhCCCCHHHHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMS-LENVLNLDPNNIKALQRK 124 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~-~~~al~~dp~~~ka~~rr 124 (131)
..++....++.|++.+....+....++.+ +..+....|+|...+..+
T Consensus 95 ~~~~~~~~~~~~L~~ale~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~ 142 (620)
T COG3914 95 SVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHL 142 (620)
T ss_pred hcCcccchHHHHHHHHHHHhhhHHHHHHHHHHHHHhcCcchHHHHhhH
Confidence 45688889999999999888766655554 445888999998888777
No 384
>PF10938 YfdX: YfdX protein; InterPro: IPR021236 YfdX is a protein found in Proteobacteria of unknown function. The protein coding for this gene is regulated by EvgA in Escherichia coli []. ; PDB: 3DZA_C.
Probab=24.10 E-value=2.2e+02 Score=18.80 Aligned_cols=45 Identities=18% Similarity=0.109 Sum_probs=33.3
Q ss_pred HHHHHHHHHH---Hh------hh-hhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD---SL------IK-ERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~---~~------~~-~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
.+.|.+.+.. ++ -| .....-.++|..++..|+|.+|-.....|+.
T Consensus 91 ~~~A~~~L~~~~~ei~~~~~~lPL~~~~~av~~A~~ll~~~k~~eA~~aL~~A~~ 145 (155)
T PF10938_consen 91 KQAAREILKLAGSEIDITTALLPLAQTPAAVKQAAALLDEGKYYEANAALKQALD 145 (155)
T ss_dssp HHHHHHHHHHTT-EEEEEEEEEEHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHhcccceeeeeeCCHHhhHHHHHHHHHHHHCCCHHHHHHHHHHHhc
Confidence 8888888877 11 12 2345567899999999999999998888774
No 385
>KOG4481|consensus
Probab=23.90 E-value=2.5e+02 Score=19.29 Aligned_cols=20 Identities=20% Similarity=0.516 Sum_probs=10.3
Q ss_pred EEcCCCCCCCCCCCCCCCCCCc
Q psy18096 17 EITARFGYGDKGEPSKSIPPGA 38 (131)
Q Consensus 17 ~i~~~~ayG~~G~~~~~ip~~~ 38 (131)
..+.+|+|+... ...||+|.
T Consensus 97 r~p~~~~f~~~~--i~rIpkgk 116 (194)
T KOG4481|consen 97 RLPKDYHFDEIN--IKRIPKGK 116 (194)
T ss_pred CCcccccCCCcC--cccCCCCc
Confidence 344456666544 25566654
No 386
>KOG2124|consensus
Probab=23.58 E-value=4.1e+02 Score=23.10 Aligned_cols=35 Identities=17% Similarity=0.211 Sum_probs=28.4
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhC
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLD 114 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~d 114 (131)
.......|.+..--+.+.++|++|++.|.+..++-
T Consensus 383 ~~~~~~~y~~~i~~li~~~~~s~ai~~~~e~~k~a 417 (883)
T KOG2124|consen 383 SMTQIEYYLSQIDSLIKKENYSEAIELCKELMKLA 417 (883)
T ss_pred chHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHH
Confidence 34455677788888899999999999999988764
No 387
>PF01206 TusA: Sulfurtransferase TusA; InterPro: IPR001455 SirA functions as a response regulator as part of a two-component system, where BarA is the sensor kinase. This system increases the expression of virulence genes and decreases the expression of motility genes []. BarA phosphorylates SirA, thereby activating the protein. Phosphorylated SirA directly activates virulence expression by interacting with hilA and hilC promoters, while repressing the flagellar regulon indirectly by binding to the csrB promoter, which in turn affects flagellar gene expression. Orthologues of SirA from Salmonella spp. can be found throughout proteobacteria, such as GacA in Psuedomonas spp., VarA in Vibrio cholerae, ExpA in Erwinia carotovora, LetA in Legionella pneumophila, and UvrY in Escherichia coli []. A sensor kinase for SirA is present in each of these organisms as well; the sensor kinase is known as BarA in E. coli and Salmonella spp., but has different names in other genera. In different species, SirA/BarA orthologues are required for virulence gene expression, exoenzyme and antibiotic production, motility, and biofilm formation. The structure of SirA consists of an alpha/beta sandwich with a beta-alpha-beta-alpha-beta(2) fold, comprising a mixed four-stranded beta-sheet stacked against two alpha-helices, both of which are nearly parallel to the strands of the beta-sheet []. Several uncharacterised bacterial proteins (73 to 81 amino-acid residues in length) that contain a well-conserved region in their N-terminal region show structural similarity to the SirA protein, including the E. coli protein YedF (P0AA31 from SWISSPROT), and other members of the UPF0033 family.; GO: 0016783 sulfurtransferase activity, 0008033 tRNA processing, 0005737 cytoplasm; PDB: 3LVJ_D 3LVK_B 1DCJ_A 3HZ7_A 1JDQ_A 1JE3_A 1PAV_A.
Probab=23.02 E-value=68 Score=17.64 Aligned_cols=20 Identities=15% Similarity=0.159 Sum_probs=15.5
Q ss_pred chhcccCCCcEEEEEEcCCC
Q psy18096 3 YVLPLMETGEECQIEITARF 22 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~~ 22 (131)
.++..|..|+...|.++...
T Consensus 19 ~~l~~l~~G~~l~v~~d~~~ 38 (70)
T PF01206_consen 19 KALKELPPGEVLEVLVDDPA 38 (70)
T ss_dssp HHHHTSGTT-EEEEEESSTT
T ss_pred HHHHhcCCCCEEEEEECCcc
Confidence 46889999999999998653
No 388
>cd08977 SusD starch binding outer membrane protein SusD. SusD-like proteins from Bacteroidetes, members of the human distal gut microbiota, are part of the starch utilization system (Sus). Sus is one of the large clusters of glycosyl hydrolases, called polysaccharide utilization loci (PULs), which play an important role in polysaccharide recognition and uptake, and it is needed for growth on amylose, amylopectin, pullulan, and maltooligosaccharides. SusD, together with SusC, a predicted beta-barrel porin, forms the minimum outer-membrane starch-binding complex. The adult human distal gut microbiota is essential for digestion of a large variety of dietary polysaccharides, for which humans lack the necessary glycosyl hydrolases.
Probab=22.75 E-value=1.7e+02 Score=21.73 Aligned_cols=32 Identities=19% Similarity=0.111 Sum_probs=25.7
Q ss_pred hhHHHHHHHHHHHHHhC-----ChHHHHHHHHHHHhh
Q psy18096 82 ERINCYNNLAQAQIKLG-----SLEPALMSLENVLNL 113 (131)
Q Consensus 82 ~~~~~~~N~a~~~~kl~-----~~~~a~~~~~~al~~ 113 (131)
....++.=+|.+++.++ +|++|+..|++++..
T Consensus 174 ~k~aA~al~ar~~L~~~~~~~~~~~~A~~~~~~vi~~ 210 (359)
T cd08977 174 WKKAARALLARVYLYLANYTAADYAEALTAAEKSFKG 210 (359)
T ss_pred hHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHHHhc
Confidence 34456666788888888 899999999999964
No 389
>KOG2041|consensus
Probab=22.71 E-value=2.9e+02 Score=23.73 Aligned_cols=44 Identities=16% Similarity=0.190 Sum_probs=32.4
Q ss_pred hhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHH
Q psy18096 80 IKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRK 124 (131)
Q Consensus 80 ~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rr 124 (131)
++..-.++.|++-.+..+-.|++|.+++.+.-... +...+|||.
T Consensus 792 D~~~e~A~r~ig~~fa~~~~We~A~~yY~~~~~~e-~~~ecly~l 835 (1189)
T KOG2041|consen 792 DEGKEDAFRNIGETFAEMMEWEEAAKYYSYCGDTE-NQIECLYRL 835 (1189)
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchH-hHHHHHHHH
Confidence 56677889999999999999999999887654332 445666553
No 390
>cd09241 BRO1_ScRim20-like Protein-interacting, N-terminal, Bro1-like domain of Saccharomyces cerevisiae Rim20 and related proteins. This family contains the N-terminal, Bro1-like domain of Saccharomyces cerevisiae Rim20 (also known as PalA) and related proteins. It belongs to the BRO1_Alix_like superfamily which also includes the Bro1-like domains of mammalian Alix (apoptosis-linked gene-2 interacting protein X), His-Domain type N23 protein tyrosine phosphatase (HD-PTP, also known as PTPN23), RhoA-binding proteins Rhophilin-1 and -2, Brox, Saccharomyces cerevisiae Bro1, Ustilago maydis Rim23 (also known as PalC), and related domains. Alix, HD-PTP, Brox, Bro1, Rim20, and Rim23, interact with the ESCRT (Endosomal Sorting Complexes Required for Transport) system. Rim20 and Rim23 participate in the response to the external pH via the Rim101 pathway. Bro1-like domains are boomerang-shaped, and part of the domain is a tetratricopeptide repeat (TPR)-like structure. Bro1-like domains bind comp
Probab=22.51 E-value=3.5e+02 Score=20.46 Aligned_cols=45 Identities=9% Similarity=-0.075 Sum_probs=29.1
Q ss_pred HHHHHHHHHH--------------HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 68 GILASQELLD--------------SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 68 y~~Ai~~~~~--------------~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
|++|.+.+.. ...-..+.+++..|......++|.++|..+..++.
T Consensus 207 Y~~a~~~l~~~~~i~~~W~~~v~~K~~~f~A~A~y~~a~~~~e~~k~Ge~Ia~L~~A~~ 265 (355)
T cd09241 207 YQEALKYANKSDLIRSDWINHLKVKKHHFKAAAHYRMALVALEKSKYGEEVARLRVALA 265 (355)
T ss_pred HHHHHHHHhcCCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Confidence 7888777763 11123344555666666666788888888887776
No 391
>COG0544 Tig FKBP-type peptidyl-prolyl cis-trans isomerase (trigger factor) [Posttranslational modification, protein turnover, chaperones]
Probab=22.15 E-value=4e+02 Score=21.04 Aligned_cols=42 Identities=19% Similarity=0.308 Sum_probs=29.3
Q ss_pred cchhcccCCCcEEEEEE--cCCCCCCCCCCCCCCCCCCceEEEEEeeccccC
Q psy18096 2 DYVLPLMETGEECQIEI--TARFGYGDKGEPSKSIPPGAKLYYSLTLHSVLP 51 (131)
Q Consensus 2 d~~l~~M~~gE~~~v~i--~~~~ayG~~G~~~~~ip~~~~~~~~v~l~~~~~ 51 (131)
|.+|-.|+.||...|.+ +..|+-++. .|....|.|+++.+..
T Consensus 199 e~~LvG~k~Ge~k~i~vtFP~dy~a~~L--------aGK~a~F~V~vkeVk~ 242 (441)
T COG0544 199 EDQLVGMKAGEEKDIKVTFPEDYHAEEL--------AGKEATFKVKVKEVKK 242 (441)
T ss_pred HhhhccCcCCCeeEEEEEcccccchhHh--------CCCceEEEEEEEEEee
Confidence 67889999999987544 555643321 2345789999988854
No 392
>KOG0396|consensus
Probab=22.10 E-value=3.8e+02 Score=20.78 Aligned_cols=46 Identities=13% Similarity=-0.003 Sum_probs=35.1
Q ss_pred HHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 84 INCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 84 ~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
..+..-...-.++.++|.+||..+.+=+.--++..+...+.+.+.+
T Consensus 191 f~lRlQefIELi~~~~~~~Ai~~akk~f~~~~~~~~~~Lk~a~g~l 236 (389)
T KOG0396|consen 191 FQLRLQEFIELIKVDNYDKAIAFAKKHFAPWAKSHKSDLKLAMGLL 236 (389)
T ss_pred hHHHHHHHHHHHHhccHHHHHHHHHHHHhhhhhhhHHHHHHHHHhh
Confidence 3455566677788899999999999988877777777777766554
No 393
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=21.77 E-value=2.7e+02 Score=22.47 Aligned_cols=35 Identities=14% Similarity=0.248 Sum_probs=20.5
Q ss_pred HHHhCChHHHHHHHHHHHhhCCCCHHHHHHHHHhhh
Q psy18096 94 QIKLGSLEPALMSLENVLNLDPNNIKALQRKAKCLI 129 (131)
Q Consensus 94 ~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg~a~~ 129 (131)
..-.|+|.+|.-.+.-..+++| ++.++--.|.+++
T Consensus 472 Lysqgey~kc~~ys~WL~~iaP-S~~~~RLlGl~l~ 506 (549)
T PF07079_consen 472 LYSQGEYHKCYLYSSWLTKIAP-SPQAYRLLGLCLM 506 (549)
T ss_pred HHhcccHHHHHHHHHHHHHhCC-cHHHHHHHHHHHH
Confidence 3345666666666666666666 5666544455443
No 394
>PRK04306 50S ribosomal protein L21e; Reviewed
Probab=21.60 E-value=72 Score=19.61 Aligned_cols=23 Identities=26% Similarity=0.371 Sum_probs=19.4
Q ss_pred chhcccCCCcEEEEEEcCCCCCC
Q psy18096 3 YVLPLMETGEECQIEITARFGYG 25 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~~ayG 25 (131)
..|+..++|+.+.+.++|...-|
T Consensus 30 ~~l~~y~~Gd~V~I~~d~sv~kG 52 (98)
T PRK04306 30 RALQEFEEGDKVHIVIDPSVHKG 52 (98)
T ss_pred HHHHhccCCCEEEEEecCceecC
Confidence 56788999999999999986544
No 395
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=21.29 E-value=2.1e+02 Score=18.23 Aligned_cols=28 Identities=25% Similarity=0.358 Sum_probs=21.7
Q ss_pred HHHHHHHHhCChHHHHHHHHHHHhhCCC
Q psy18096 89 NLAQAQIKLGSLEPALMSLENVLNLDPN 116 (131)
Q Consensus 89 N~a~~~~kl~~~~~a~~~~~~al~~dp~ 116 (131)
.++-.++..|++++|..++-+||..-|+
T Consensus 68 ~lGE~L~~~G~~~~aa~hf~nAl~V~~q 95 (121)
T PF02064_consen 68 QLGEQLLAQGDYEEAAEHFYNALKVCPQ 95 (121)
T ss_dssp HHHHHHHHTT-HHHHHHHHHHHHHTSSS
T ss_pred HHHHHHHhCCCHHHHHHHHHHHHHhCCC
Confidence 3566667789999999999999998764
No 396
>cd04750 Commd2 COMM_Domain containing protein 2. The COMM Domain is found at the C-terminus of a variety of proteins; presumably all COMM_Domain containing proteins are located in the nucleus and the COMM domain plays a role in protein-protein interactions. Several family members have been shown to bind and inhibit NF-kappaB.
Probab=21.18 E-value=2.3e+02 Score=18.96 Aligned_cols=37 Identities=11% Similarity=0.083 Sum_probs=23.3
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHH
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQR 123 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~r 123 (131)
+++.|.-|+.-|.-.+-+..+-+.++++++.+|..+.
T Consensus 2 ~~~~a~~~l~~g~n~~~~~~~A~~l~i~~~~vk~~v~ 38 (166)
T cd04750 2 FCKLAIEFLFKGINQKKYEGAARKLEVEVETVQHGVE 38 (166)
T ss_pred HHHHHHHHHHcCCChHHHHHHHHHhCCCHHHHHHHHH
Confidence 3566666666666666666666666666666665543
No 397
>PF08324 PUL: PUL domain; InterPro: IPR013535 The PUL (after PLAP, UFD3 and lub1) domain is a predicted predominantly alpha helical globular domain found in eukaryotes. It is found in association with either WD repeats (see PDOC00574 from PROSITEDOC) and the PFU domain (see PDOC51394 from PROSITEDOC) or PPPDE and thioredoxin (see PDOC00172 from PROSITEDOC) domains. The PUL domain is a protein-protein interaction domain [, ]. Some proteins known to contain a PUL domain are listed below: Saccharomyces cerevisiae DOA1 (UFD3, ZZZ4), involved in ubiquitin conjugation pathway. DOA1 participates in the regulation of the ubiquitin conjugation pathway involving CDC48 by hindering multiubiquitination of substrates at the CDC48 chaperone. Schizosaccharomyces pombe ubiquitin homeostasis protein lub1, acts as a negative regulator of vacuole-dependent ubiquitin degradation. Mammalian phospholipase A-2-activating protein (PLA2P, PLAA), the homologue of DOA1. PLA2P plays an important role in the regulation of specific inflammatory disease processes. ; PDB: 3EBB_A 3L3F_X 3GAE_B 3PST_A 3PSP_A.
Probab=20.96 E-value=3.1e+02 Score=19.42 Aligned_cols=48 Identities=19% Similarity=0.036 Sum_probs=32.1
Q ss_pred hhhHHHHHHHHHHHHHhC-ChHHHHHHHHHHHh---hCCCCHHHHHHHHHhh
Q psy18096 81 KERINCYNNLAQAQIKLG-SLEPALMSLENVLN---LDPNNIKALQRKAKCL 128 (131)
Q Consensus 81 ~~~~~~~~N~a~~~~kl~-~~~~a~~~~~~al~---~dp~~~ka~~rrg~a~ 128 (131)
--.+++++|+|....+.. .-+........+++ ....+.++.||--.|+
T Consensus 175 ~A~ATl~~Nlsv~~~~~~~~~~~~~~ll~~i~~~~~~~~~d~Ea~~R~LvAl 226 (268)
T PF08324_consen 175 IALATLLLNLSVLLHKNRSDEEWQSELLSSIIEVLSREESDEEALYRLLVAL 226 (268)
T ss_dssp HHHHHHHHHHHHHHHHCTS-CCHHHHHHHHHHHHCHCCHTSHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHhcCCChHHHHHHHHHHHHHhccccCCHHHHHHHHHHH
Confidence 345678899999999887 44444444444444 3457999999876664
No 398
>PRK00299 sulfur transfer protein SirA; Reviewed
Probab=20.84 E-value=90 Score=18.09 Aligned_cols=18 Identities=22% Similarity=0.171 Sum_probs=15.5
Q ss_pred chhcccCCCcEEEEEEcC
Q psy18096 3 YVLPLMETGEECQIEITA 20 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~ 20 (131)
.++.+|..|+...|.++-
T Consensus 28 k~l~~l~~G~~l~V~~dd 45 (81)
T PRK00299 28 KTVRNMQPGETLLIIADD 45 (81)
T ss_pred HHHHcCCCCCEEEEEeCC
Confidence 578999999999998874
No 399
>PF06812 ImpA-rel_N: ImpA-related N-terminal; InterPro: IPR010657 This entry represents a conserved region located towards the N-terminal end of ImpA and related proteins. ImpA is an inner membrane protein, which has been suggested to be involved with proteins that are exported and associated with colony variations in Actinobacillus actinomycetemcomitans []. Note that many members are hypothetical proteins.
Probab=20.76 E-value=1.6e+02 Score=15.96 Aligned_cols=30 Identities=7% Similarity=0.066 Sum_probs=20.2
Q ss_pred ChHHHHHHHHHHHhhCCCCHHHHHHHHHhh
Q psy18096 99 SLEPALMSLENVLNLDPNNIKALQRKAKCL 128 (131)
Q Consensus 99 ~~~~a~~~~~~al~~dp~~~ka~~rrg~a~ 128 (131)
+|....+.|.+.|+-..++.+..-....|.
T Consensus 1 DW~~v~~~~~~lL~~~sKDlrv~~~l~~a~ 30 (62)
T PF06812_consen 1 DWDKVEELALELLSEQSKDLRVAVWLTEAL 30 (62)
T ss_pred CHHHHHHHHHHHHHHcCchHHHHHHHHHHH
Confidence 467777788888877777766555544443
No 400
>KOG3783|consensus
Probab=20.68 E-value=3.7e+02 Score=21.94 Aligned_cols=47 Identities=9% Similarity=-0.097 Sum_probs=37.2
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHhhCCCCHHHHHHHH
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLNLDPNNIKALQRKA 125 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~~dp~~~ka~~rrg 125 (131)
.......-++.-+|.++.-+-+|.+|-.+++...... ++.+|+|.--
T Consensus 297 ~~kQ~~~l~~fE~aw~~v~~~~~~~aad~~~~L~des-dWS~a~Y~Yf 343 (546)
T KOG3783|consen 297 RMKQVKSLMVFERAWLSVGQHQYSRAADSFDLLRDES-DWSHAFYTYF 343 (546)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhh-hhhHHHHHHH
Confidence 3456667788889999999999999999998866655 7888888753
No 401
>COG1516 FliS Flagellin-specific chaperone FliS [Cell motility and secretion / Intracellular trafficking and secretion / Posttranslational modification, protein turnover, chaperones]
Probab=20.59 E-value=2.6e+02 Score=18.22 Aligned_cols=24 Identities=17% Similarity=0.275 Sum_probs=17.5
Q ss_pred HHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 89 NLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 89 N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
-+|.+.+..+++.++....++|+.
T Consensus 36 ~~A~~aie~~~i~~k~~~i~ka~~ 59 (132)
T COG1516 36 KRAKEAIEQEDIEEKNESIDKAID 59 (132)
T ss_pred HHHHHHHHhccHHHHHHHHHHHHH
Confidence 367777777888777777777764
No 402
>KOG3807|consensus
Probab=20.32 E-value=3.7e+02 Score=20.87 Aligned_cols=29 Identities=28% Similarity=0.362 Sum_probs=23.2
Q ss_pred HHHHHHHHHHhCChHHHHHHHHHHHhhCC
Q psy18096 87 YNNLAQAQIKLGSLEPALMSLENVLNLDP 115 (131)
Q Consensus 87 ~~N~a~~~~kl~~~~~a~~~~~~al~~dp 115 (131)
--.+|.|-.|+|+..+|++.+....+--|
T Consensus 278 KRRLAMCARklGrlrEA~K~~RDL~ke~p 306 (556)
T KOG3807|consen 278 KRRLAMCARKLGRLREAVKIMRDLMKEFP 306 (556)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhhhcc
Confidence 34789999999999999998887655433
No 403
>TIGR02863 spore_sspJ small, acid-soluble spore protein, SspJ family. RL J Bacteriol. 1998 Dec;180(24):6704-12.
Probab=20.29 E-value=1e+02 Score=15.85 Aligned_cols=18 Identities=28% Similarity=0.331 Sum_probs=14.4
Q ss_pred ChHHHHHHHHHHHhhCCC
Q psy18096 99 SLEPALMSLENVLNLDPN 116 (131)
Q Consensus 99 ~~~~a~~~~~~al~~dp~ 116 (131)
....|+++..++|.-||-
T Consensus 19 ~v~gaLEdAg~aLk~DPL 36 (47)
T TIGR02863 19 AIQGALEDAGQALKDDPL 36 (47)
T ss_pred HHHHHHHHHHHHhcCChH
Confidence 356788999999988873
No 404
>PF14858 DUF4486: Domain of unknown function (DUF4486)
Probab=20.19 E-value=2.2e+02 Score=23.22 Aligned_cols=35 Identities=17% Similarity=0.099 Sum_probs=30.8
Q ss_pred HhhhhhHHHHHHHHHHHHHhCChHHHHHHHHHHHh
Q psy18096 78 SLIKERINCYNNLAQAQIKLGSLEPALMSLENVLN 112 (131)
Q Consensus 78 ~~~~~~~~~~~N~a~~~~kl~~~~~a~~~~~~al~ 112 (131)
..-|-.+.+|.-.+.||...+.+.+|...+.++|.
T Consensus 191 rYL~WR~~Ly~avc~cY~d~~~~~~A~~farraL~ 225 (542)
T PF14858_consen 191 RYLPWRVTLYTAVCQCYEDCQAGEHAEAFARRALA 225 (542)
T ss_pred chhhHHHHHHHHHHHHHHHhcchhHHHHHHHHHHH
Confidence 33577889999999999999999999999998885
No 405
>cd00291 SirA_YedF_YeeD SirA, YedF, and YeeD. Two-layered alpha/beta sandwich domain. SirA (also known as UvrY, and YhhP) belongs to a family of bacterial two-component response regulators that controls secondary metabolism and virulence. The other member of this two-component system is a sensor kinase called BarA which phosphorylates SirA. A variety of microorganisms have similar proteins, all of which contain a common CPxP sequence motif in the N-terminal region. YhhP is suggested to be important for normal cell division and growth in rich nutrient medium. Moreover, despite a low primary sequence similarity, the YccP structure closely resembles the non-homologous C-terminal RNA-binding domain of E. coli translation initiation factor IF3. The signature CPxP motif serves to stabilize the N-terminal helix as part of the N-capping box and might be important in mRNA-binding.
Probab=20.01 E-value=1e+02 Score=16.76 Aligned_cols=19 Identities=16% Similarity=0.270 Sum_probs=16.0
Q ss_pred chhcccCCCcEEEEEEcCC
Q psy18096 3 YVLPLMETGEECQIEITAR 21 (131)
Q Consensus 3 ~~l~~M~~gE~~~v~i~~~ 21 (131)
.++..|..|+...|..+..
T Consensus 18 ~~l~~l~~g~~l~v~~d~~ 36 (69)
T cd00291 18 KALEKLKSGEVLEVLLDDP 36 (69)
T ss_pred HHHhcCCCCCEEEEEecCC
Confidence 4788999999999998853
Done!