Query psy16144
Match_columns 122
No_of_seqs 102 out of 119
Neff 5.5
Searched_HMMs 46136
Date Fri Aug 16 21:57:29 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16144.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16144hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PRK15359 type III secretion sy 99.5 2.9E-13 6.2E-18 98.3 12.0 94 22-117 33-136 (144)
2 TIGR02552 LcrH_SycD type III s 99.3 5E-11 1.1E-15 82.9 12.5 95 22-118 26-130 (135)
3 PRK15363 pathogenicity island 99.3 2.8E-11 6.1E-16 91.1 11.1 106 10-120 30-150 (157)
4 PRK11189 lipoprotein NlpI; Pro 99.3 6.5E-11 1.4E-15 94.6 11.3 85 22-107 73-167 (296)
5 PLN03088 SGT1, suppressor of 99.2 1.1E-10 2.3E-15 96.3 11.4 97 22-120 11-117 (356)
6 PRK10370 formate-dependent nit 99.2 1.7E-10 3.8E-15 88.1 11.2 89 17-106 75-178 (198)
7 TIGR02795 tol_pal_ybgF tol-pal 99.2 9.3E-10 2E-14 73.7 11.4 87 22-109 11-113 (119)
8 TIGR00990 3a0801s09 mitochondr 99.1 8.6E-10 1.9E-14 95.4 12.0 89 22-111 340-438 (615)
9 PRK15179 Vi polysaccharide bio 99.1 7.4E-10 1.6E-14 99.0 11.0 88 22-110 95-192 (694)
10 cd00189 TPR Tetratricopeptide 99.1 1.8E-09 3.9E-14 65.7 9.0 82 22-104 9-100 (100)
11 KOG0553|consensus 99.1 6.1E-10 1.3E-14 91.2 8.7 98 22-121 90-197 (304)
12 PRK10370 formate-dependent nit 99.1 1.4E-09 3E-14 83.1 9.8 86 25-111 51-149 (198)
13 PRK12370 invasion protein regu 99.1 2.2E-09 4.7E-14 92.5 12.1 84 25-109 316-409 (553)
14 PRK09782 bacteriophage N4 rece 99.0 1.5E-09 3.3E-14 100.2 11.2 90 22-112 618-717 (987)
15 TIGR00990 3a0801s09 mitochondr 99.0 2.5E-09 5.3E-14 92.5 11.8 89 22-111 374-472 (615)
16 PF14559 TPR_19: Tetratricopep 99.0 7.9E-10 1.7E-14 69.2 6.0 57 53-110 7-63 (68)
17 PRK12370 invasion protein regu 99.0 2.6E-09 5.6E-14 92.0 11.0 87 22-109 347-444 (553)
18 PRK15359 type III secretion sy 99.0 2.7E-09 5.9E-14 77.4 8.7 82 28-110 8-96 (144)
19 PF13432 TPR_16: Tetratricopep 99.0 2E-09 4.3E-14 67.2 6.5 53 53-106 13-65 (65)
20 PF13429 TPR_15: Tetratricopep 99.0 3.8E-09 8.3E-14 82.5 9.3 94 18-112 149-254 (280)
21 TIGR02521 type_IV_pilW type IV 98.9 3E-08 6.5E-13 71.2 11.4 87 22-108 74-171 (234)
22 TIGR02521 type_IV_pilW type IV 98.9 2.7E-08 6E-13 71.4 11.0 79 22-101 40-128 (234)
23 TIGR03302 OM_YfiO outer membra 98.9 2.9E-08 6.4E-13 75.1 11.6 94 15-109 33-152 (235)
24 PF13414 TPR_11: TPR repeat; P 98.9 9.9E-09 2.1E-13 64.4 7.2 50 53-103 19-69 (69)
25 PRK15174 Vi polysaccharide exp 98.9 2.1E-08 4.6E-13 88.4 11.6 86 22-108 255-354 (656)
26 PRK15331 chaperone protein Sic 98.9 2.6E-08 5.6E-13 75.7 10.3 85 24-110 48-142 (165)
27 KOG4626|consensus 98.9 1.1E-08 2.4E-13 91.3 9.4 89 22-111 397-495 (966)
28 PRK09782 bacteriophage N4 rece 98.9 1.8E-08 3.9E-13 93.1 11.2 84 25-110 588-681 (987)
29 PRK15174 Vi polysaccharide exp 98.9 2.9E-08 6.2E-13 87.6 12.0 93 18-111 79-183 (656)
30 TIGR02917 PEP_TPR_lipo putativ 98.8 4.8E-08 1.1E-12 82.7 11.6 91 22-113 779-878 (899)
31 PRK11447 cellulose synthase su 98.8 4.2E-08 9E-13 91.0 11.5 92 18-110 306-423 (1157)
32 TIGR02552 LcrH_SycD type III s 98.8 4.9E-08 1.1E-12 67.7 9.0 78 33-111 3-90 (135)
33 PRK10049 pgaA outer membrane p 98.8 5.6E-08 1.2E-12 86.8 11.5 93 22-117 368-470 (765)
34 TIGR02917 PEP_TPR_lipo putativ 98.8 1.4E-07 3.1E-12 79.9 12.5 88 22-110 576-673 (899)
35 PRK10803 tol-pal system protei 98.8 1.5E-07 3.4E-12 75.3 12.0 87 22-109 152-254 (263)
36 PRK11447 cellulose synthase su 98.8 8.2E-08 1.8E-12 89.0 11.4 87 22-109 394-532 (1157)
37 COG3063 PilF Tfp pilus assembl 98.7 6.2E-08 1.3E-12 77.5 8.2 84 22-107 44-137 (250)
38 PRK10049 pgaA outer membrane p 98.7 2.4E-07 5.3E-12 82.8 13.0 92 18-111 52-155 (765)
39 KOG1125|consensus 98.7 2.4E-08 5.1E-13 87.5 6.3 82 22-104 439-530 (579)
40 PF13371 TPR_9: Tetratricopept 98.7 1.3E-07 2.8E-12 59.8 7.5 57 53-110 11-67 (73)
41 PRK11788 tetratricopeptide rep 98.7 2.7E-07 5.9E-12 73.8 11.1 86 22-109 223-319 (389)
42 TIGR03302 OM_YfiO outer membra 98.7 4.4E-07 9.6E-12 68.6 11.3 84 22-106 79-200 (235)
43 PRK15179 Vi polysaccharide bio 98.7 3E-07 6.6E-12 82.4 11.7 89 22-111 129-227 (694)
44 PF12895 Apc3: Anaphase-promot 98.6 7E-08 1.5E-12 63.3 5.6 72 25-98 1-84 (84)
45 KOG4626|consensus 98.6 1.1E-07 2.3E-12 85.2 7.9 89 22-111 363-461 (966)
46 PRK02603 photosystem I assembl 98.6 5.1E-07 1.1E-11 66.4 10.4 83 22-105 44-153 (172)
47 COG4783 Putative Zn-dependent 98.6 3.3E-07 7.1E-12 79.2 10.2 96 22-119 315-420 (484)
48 KOG1155|consensus 98.6 2.2E-07 4.8E-12 80.5 9.1 94 25-120 342-445 (559)
49 CHL00033 ycf3 photosystem I as 98.6 1.2E-06 2.5E-11 64.0 11.6 88 17-105 37-153 (168)
50 PRK11788 tetratricopeptide rep 98.6 8.2E-07 1.8E-11 71.1 11.7 85 22-107 189-284 (389)
51 PRK11189 lipoprotein NlpI; Pro 98.6 7.1E-07 1.5E-11 71.4 10.4 85 26-111 39-137 (296)
52 PF13429 TPR_15: Tetratricopep 98.5 1.3E-07 2.9E-12 73.8 5.3 78 22-100 189-276 (280)
53 KOG1126|consensus 98.5 1.3E-07 2.9E-12 83.7 5.7 87 22-109 430-526 (638)
54 COG4235 Cytochrome c biogenesi 98.5 1.2E-06 2.6E-11 71.6 10.1 90 15-105 156-260 (287)
55 PF13432 TPR_16: Tetratricopep 98.5 2.9E-07 6.3E-12 57.3 4.8 50 22-72 6-65 (65)
56 PF13428 TPR_14: Tetratricopep 98.5 6.4E-07 1.4E-11 52.9 5.6 40 72-111 1-40 (44)
57 KOG1173|consensus 98.4 1.8E-06 3.8E-11 76.1 10.0 93 22-117 423-533 (611)
58 COG5010 TadD Flp pilus assembl 98.4 4.1E-06 8.9E-11 67.6 10.8 89 22-111 109-207 (257)
59 PRK14574 hmsH outer membrane p 98.4 2E-06 4.4E-11 78.5 9.3 86 22-110 43-140 (822)
60 KOG1125|consensus 98.3 2.6E-06 5.6E-11 74.9 8.8 58 53-111 446-503 (579)
61 PLN02789 farnesyltranstransfer 98.3 1.3E-05 2.9E-10 65.8 12.0 88 22-110 80-180 (320)
62 CHL00033 ycf3 photosystem I as 98.3 5.2E-06 1.1E-10 60.6 8.6 81 29-110 15-110 (168)
63 cd05804 StaR_like StaR_like; a 98.3 1.1E-05 2.3E-10 64.0 10.7 51 53-104 130-180 (355)
64 PRK15363 pathogenicity island 98.3 3.6E-06 7.8E-11 63.5 7.4 56 53-109 51-106 (157)
65 PLN02789 farnesyltranstransfer 98.3 1.6E-05 3.5E-10 65.3 11.5 88 22-110 46-146 (320)
66 PF07719 TPR_2: Tetratricopept 98.3 3E-06 6.4E-11 46.3 5.0 34 72-105 1-34 (34)
67 cd05804 StaR_like StaR_like; a 98.2 6.5E-06 1.4E-10 65.3 8.7 82 22-104 123-218 (355)
68 KOG1126|consensus 98.2 1.7E-06 3.7E-11 76.8 5.7 92 15-107 454-558 (638)
69 PRK14720 transcript cleavage f 98.2 7E-06 1.5E-10 75.7 9.7 93 17-112 33-156 (906)
70 PF13414 TPR_11: TPR repeat; P 98.2 2.5E-06 5.5E-11 53.3 4.8 52 17-69 5-69 (69)
71 PLN03088 SGT1, suppressor of 98.2 6.9E-06 1.5E-10 67.8 8.6 58 53-111 18-75 (356)
72 KOG1155|consensus 98.2 1.2E-05 2.6E-10 70.0 10.0 87 17-104 366-464 (559)
73 PF00515 TPR_1: Tetratricopept 98.2 3.8E-06 8.3E-11 46.3 4.3 34 72-105 1-34 (34)
74 PLN03098 LPA1 LOW PSII ACCUMUL 98.2 5.7E-06 1.2E-10 71.3 7.1 66 42-108 71-149 (453)
75 cd00189 TPR Tetratricopeptide 98.2 1.8E-05 3.9E-10 47.7 7.5 57 53-110 16-72 (100)
76 PRK14574 hmsH outer membrane p 98.2 2E-05 4.3E-10 72.1 10.8 86 22-109 111-206 (822)
77 PF12895 Apc3: Anaphase-promot 98.1 7E-06 1.5E-10 53.7 5.7 56 53-110 5-62 (84)
78 PF13371 TPR_9: Tetratricopept 98.1 9.9E-06 2.1E-10 51.0 6.2 57 22-79 4-70 (73)
79 TIGR00540 hemY_coli hemY prote 98.1 3E-05 6.6E-10 64.4 10.8 86 22-108 272-373 (409)
80 KOG1840|consensus 98.1 2.8E-05 6.1E-10 67.9 10.8 94 22-115 208-331 (508)
81 TIGR02795 tol_pal_ybgF tol-pal 98.1 2.3E-05 5E-10 52.1 8.1 54 53-107 18-74 (119)
82 KOG0553|consensus 98.1 7.3E-06 1.6E-10 67.5 6.6 58 53-111 97-154 (304)
83 PRK02603 photosystem I assembl 98.1 3.1E-05 6.6E-10 56.9 8.9 57 53-110 51-110 (172)
84 PRK10747 putative protoheme IX 98.1 7E-05 1.5E-09 62.2 11.7 88 22-110 127-225 (398)
85 PF13525 YfiO: Outer membrane 98.0 5.5E-05 1.2E-09 57.5 9.9 87 22-109 14-127 (203)
86 COG3063 PilF Tfp pilus assembl 98.0 4E-05 8.6E-10 61.6 9.1 100 16-118 70-182 (250)
87 PRK10747 putative protoheme IX 98.0 0.00012 2.5E-09 60.9 12.2 85 22-107 272-363 (398)
88 PF13431 TPR_17: Tetratricopep 98.0 5.7E-06 1.2E-10 47.0 3.1 34 59-93 1-34 (34)
89 PRK10153 DNA-binding transcrip 98.0 3.9E-05 8.5E-10 66.8 9.6 53 53-107 436-488 (517)
90 COG2956 Predicted N-acetylgluc 98.0 5.9E-05 1.3E-09 63.4 9.9 101 19-120 146-262 (389)
91 COG5010 TadD Flp pilus assembl 98.0 3.9E-05 8.5E-10 62.0 8.4 90 22-112 75-174 (257)
92 PRK10866 outer membrane biogen 98.0 0.00011 2.3E-09 58.0 10.6 93 15-108 32-160 (243)
93 PF13512 TPR_18: Tetratricopep 98.0 2.3E-05 5.1E-10 58.3 6.3 42 68-109 43-84 (142)
94 PRK10866 outer membrane biogen 98.0 2.8E-05 6.1E-10 61.3 7.1 57 53-110 48-107 (243)
95 COG1729 Uncharacterized protei 97.9 6.8E-05 1.5E-09 60.8 9.1 88 22-109 150-252 (262)
96 PF09976 TPR_21: Tetratricopep 97.9 8.4E-05 1.8E-09 53.3 8.7 72 25-97 23-110 (145)
97 PF14559 TPR_19: Tetratricopep 97.9 1.1E-05 2.3E-10 50.1 3.5 56 24-80 2-67 (68)
98 PF06552 TOM20_plant: Plant sp 97.9 0.00014 3.1E-09 56.3 10.2 81 29-110 7-118 (186)
99 KOG0548|consensus 97.9 7.6E-05 1.7E-09 65.4 9.0 87 22-109 367-463 (539)
100 TIGR00540 hemY_coli hemY prote 97.9 0.00017 3.7E-09 59.9 10.8 87 22-109 93-190 (409)
101 KOG0543|consensus 97.9 0.00014 2.9E-09 62.0 10.3 91 22-113 217-332 (397)
102 KOG2076|consensus 97.9 0.0002 4.2E-09 65.9 11.7 80 22-103 149-272 (895)
103 PRK10803 tol-pal system protei 97.8 0.00012 2.6E-09 58.7 8.9 59 53-112 159-220 (263)
104 PF13174 TPR_6: Tetratricopept 97.8 5E-05 1.1E-09 40.9 4.6 32 74-105 2-33 (33)
105 PF04733 Coatomer_E: Coatomer 97.8 5.1E-05 1.1E-09 61.5 6.4 89 22-111 140-240 (290)
106 KOG4162|consensus 97.8 0.00018 3.8E-09 65.4 9.6 85 22-107 693-789 (799)
107 KOG2076|consensus 97.8 0.00047 1E-08 63.4 12.4 62 53-115 155-216 (895)
108 KOG0624|consensus 97.7 0.00043 9.4E-09 59.1 10.4 95 22-118 232-386 (504)
109 PF13424 TPR_12: Tetratricopep 97.7 8.9E-05 1.9E-09 47.3 4.9 61 22-101 14-75 (78)
110 KOG0548|consensus 97.7 0.00033 7.1E-09 61.5 9.3 88 22-110 11-108 (539)
111 PRK11906 transcriptional regul 97.6 0.00036 7.8E-09 60.4 9.3 77 28-105 319-405 (458)
112 COG4235 Cytochrome c biogenesi 97.6 0.00037 8.1E-09 57.1 9.0 81 29-110 138-231 (287)
113 PF12569 NARP1: NMDA receptor- 97.6 0.00044 9.6E-09 60.5 8.7 84 18-102 197-292 (517)
114 KOG0543|consensus 97.6 0.0013 2.7E-08 56.2 11.1 100 4-104 244-358 (397)
115 KOG4234|consensus 97.6 0.00089 1.9E-08 53.7 9.6 100 13-113 90-209 (271)
116 KOG0550|consensus 97.5 0.00016 3.5E-09 62.3 5.7 83 25-108 215-323 (486)
117 PF13428 TPR_14: Tetratricopep 97.5 0.0002 4.3E-09 42.1 3.9 35 17-51 3-40 (44)
118 KOG0547|consensus 97.5 0.00019 4.2E-09 63.0 5.5 55 53-108 131-185 (606)
119 PF14561 TPR_20: Tetratricopep 97.5 0.00089 1.9E-08 45.7 7.5 58 56-114 7-66 (90)
120 smart00028 TPR Tetratricopepti 97.5 0.0003 6.5E-09 35.2 4.1 34 72-105 1-34 (34)
121 PF09295 ChAPs: ChAPs (Chs5p-A 97.5 0.00071 1.5E-08 57.5 8.5 73 22-95 209-291 (395)
122 PRK11906 transcriptional regul 97.5 0.0007 1.5E-08 58.7 8.5 56 53-109 320-375 (458)
123 PF13525 YfiO: Outer membrane 97.4 0.0015 3.2E-08 49.7 8.8 58 53-111 21-81 (203)
124 PF09976 TPR_21: Tetratricopep 97.4 0.0042 9.2E-08 44.4 10.5 76 22-99 57-145 (145)
125 KOG0547|consensus 97.4 0.0019 4.1E-08 56.9 10.1 86 22-108 369-464 (606)
126 PF13181 TPR_8: Tetratricopept 97.3 0.00056 1.2E-08 37.3 4.1 33 73-105 2-34 (34)
127 KOG1128|consensus 97.3 0.002 4.4E-08 58.5 9.8 101 9-111 418-558 (777)
128 PF09295 ChAPs: ChAPs (Chs5p-A 97.3 0.0053 1.2E-07 52.2 11.9 96 22-119 178-280 (395)
129 KOG0495|consensus 97.3 0.0024 5.2E-08 58.1 10.2 91 18-110 587-689 (913)
130 PF04184 ST7: ST7 protein; In 97.3 0.0021 4.5E-08 56.5 9.4 90 25-114 180-303 (539)
131 PF12688 TPR_5: Tetratrico pep 97.3 0.0021 4.5E-08 46.1 7.8 88 22-112 10-113 (120)
132 KOG2003|consensus 97.2 0.0032 7E-08 55.7 10.0 89 23-112 500-598 (840)
133 KOG1840|consensus 97.2 0.003 6.6E-08 55.3 9.6 78 22-99 376-477 (508)
134 PLN03098 LPA1 LOW PSII ACCUMUL 97.2 0.0013 2.8E-08 56.9 7.2 42 67-108 70-111 (453)
135 KOG1173|consensus 97.2 0.003 6.6E-08 56.1 9.5 91 22-113 321-421 (611)
136 PRK15331 chaperone protein Sic 97.1 0.0018 3.9E-08 49.3 6.8 55 53-108 53-107 (165)
137 KOG2002|consensus 97.1 0.0023 5.1E-08 59.6 8.7 88 24-113 247-382 (1018)
138 PF13176 TPR_7: Tetratricopept 97.1 0.00061 1.3E-08 38.5 2.8 29 75-103 2-30 (36)
139 PF14938 SNAP: Soluble NSF att 97.1 0.003 6.5E-08 50.2 7.7 75 22-103 103-186 (282)
140 PF12688 TPR_5: Tetratrico pep 97.1 0.016 3.4E-07 41.6 10.7 56 53-109 17-78 (120)
141 KOG1128|consensus 97.0 0.0029 6.3E-08 57.5 8.1 90 22-112 494-593 (777)
142 COG4105 ComL DNA uptake lipopr 97.0 0.004 8.8E-08 50.4 8.1 56 53-109 50-108 (254)
143 KOG0624|consensus 97.0 0.0017 3.6E-08 55.6 5.9 83 25-108 50-142 (504)
144 KOG1174|consensus 97.0 0.0043 9.3E-08 54.0 8.2 79 29-109 420-508 (564)
145 PF14853 Fis1_TPR_C: Fis1 C-te 97.0 0.0025 5.4E-08 39.9 5.1 36 74-109 3-38 (53)
146 KOG4340|consensus 96.9 0.0049 1.1E-07 52.1 8.1 88 22-114 17-116 (459)
147 KOG0376|consensus 96.9 0.0017 3.6E-08 56.5 5.4 92 22-114 13-114 (476)
148 PF04733 Coatomer_E: Coatomer 96.8 0.01 2.2E-07 48.2 9.1 99 11-110 165-274 (290)
149 KOG3060|consensus 96.8 0.018 4E-07 47.2 10.5 86 26-112 133-231 (289)
150 PRK14720 transcript cleavage f 96.8 0.0099 2.1E-07 55.4 10.0 93 22-115 125-266 (906)
151 KOG1127|consensus 96.8 0.0061 1.3E-07 57.4 8.5 83 23-106 12-108 (1238)
152 KOG1129|consensus 96.8 0.011 2.4E-07 50.4 9.0 84 22-107 232-325 (478)
153 KOG3060|consensus 96.8 0.016 3.6E-07 47.4 9.7 90 22-112 95-194 (289)
154 COG4783 Putative Zn-dependent 96.8 0.032 6.9E-07 48.8 11.9 81 22-103 349-439 (484)
155 KOG1156|consensus 96.7 0.0061 1.3E-07 55.0 7.4 86 22-108 50-145 (700)
156 PF07719 TPR_2: Tetratricopept 96.7 0.0024 5.2E-08 34.4 3.1 24 22-45 10-34 (34)
157 KOG2002|consensus 96.7 0.0092 2E-07 55.7 8.5 94 22-115 655-759 (1018)
158 PF00515 TPR_1: Tetratricopept 96.7 0.0023 4.9E-08 34.9 2.9 28 18-45 4-34 (34)
159 COG0457 NrfG FOG: TPR repeat [ 96.6 0.044 9.6E-07 36.4 9.7 82 22-104 139-234 (291)
160 KOG1174|consensus 96.6 0.014 3E-07 51.0 8.6 53 53-106 420-472 (564)
161 KOG4162|consensus 96.6 0.037 8E-07 50.8 11.4 89 22-111 659-759 (799)
162 KOG3081|consensus 96.5 0.031 6.8E-07 46.0 9.5 61 53-114 189-249 (299)
163 PF13424 TPR_12: Tetratricopep 96.4 0.0091 2E-07 37.9 5.1 13 53-65 62-74 (78)
164 PRK10153 DNA-binding transcrip 96.4 0.013 2.8E-07 51.2 7.1 57 22-82 429-495 (517)
165 KOG0495|consensus 96.3 0.019 4.1E-07 52.5 8.2 84 26-110 664-757 (913)
166 COG3118 Thioredoxin domain-con 96.3 0.059 1.3E-06 44.7 10.3 55 59-114 224-280 (304)
167 PLN03081 pentatricopeptide (PP 96.2 0.021 4.5E-07 50.5 7.6 82 17-99 292-387 (697)
168 KOG4648|consensus 96.1 0.0055 1.2E-07 52.6 3.5 55 53-108 113-167 (536)
169 COG4785 NlpI Lipoprotein NlpI, 96.1 0.012 2.7E-07 47.7 5.4 84 22-106 74-167 (297)
170 PLN03081 pentatricopeptide (PP 96.1 0.029 6.2E-07 49.6 7.9 78 22-101 471-557 (697)
171 COG4976 Predicted methyltransf 96.0 0.011 2.4E-07 48.1 4.7 52 53-105 11-62 (287)
172 PF14938 SNAP: Soluble NSF att 96.0 0.068 1.5E-06 42.4 9.2 88 22-109 123-233 (282)
173 COG2956 Predicted N-acetylgluc 96.0 0.053 1.1E-06 46.0 8.7 83 22-105 189-282 (389)
174 PLN03077 Protein ECB2; Provisi 96.0 0.047 1E-06 49.3 9.0 82 18-99 357-451 (857)
175 KOG4642|consensus 96.0 0.014 3E-07 47.6 4.9 72 27-99 24-105 (284)
176 PLN03218 maturation of RBCL 1; 95.9 0.08 1.7E-06 50.1 10.6 78 22-100 623-712 (1060)
177 COG3071 HemY Uncharacterized e 95.9 0.056 1.2E-06 46.3 8.7 82 22-104 272-360 (400)
178 PF12569 NARP1: NMDA receptor- 95.9 0.11 2.4E-06 45.6 10.7 92 22-114 13-119 (517)
179 PLN03077 Protein ECB2; Provisi 95.9 0.11 2.5E-06 46.9 11.0 97 17-116 556-667 (857)
180 COG0457 NrfG FOG: TPR repeat [ 95.9 0.17 3.6E-06 33.6 9.2 82 22-104 176-268 (291)
181 PF13181 TPR_8: Tetratricopept 95.8 0.0099 2.1E-07 32.1 2.6 27 18-44 4-33 (34)
182 PF06552 TOM20_plant: Plant sp 95.8 0.046 9.9E-07 42.5 7.1 56 53-109 7-72 (186)
183 KOG2471|consensus 95.8 0.038 8.3E-07 49.2 7.4 111 9-119 233-382 (696)
184 PF09986 DUF2225: Uncharacteri 95.8 0.28 6.1E-06 38.2 11.5 65 53-119 141-210 (214)
185 COG3071 HemY Uncharacterized e 95.7 0.082 1.8E-06 45.3 8.8 75 24-100 305-389 (400)
186 PF03704 BTAD: Bacterial trans 95.7 0.43 9.2E-06 33.6 11.3 45 53-98 78-122 (146)
187 PF13374 TPR_10: Tetratricopep 95.6 0.034 7.4E-07 30.7 4.4 29 73-101 3-31 (42)
188 KOG2003|consensus 95.6 0.11 2.4E-06 46.3 9.5 86 22-108 601-696 (840)
189 KOG1127|consensus 95.6 0.039 8.4E-07 52.2 7.0 91 17-108 564-666 (1238)
190 KOG2396|consensus 95.5 0.13 2.8E-06 45.6 9.5 80 33-113 91-181 (568)
191 PLN03218 maturation of RBCL 1; 95.5 0.14 3.1E-06 48.5 10.4 79 22-101 551-643 (1060)
192 PF04781 DUF627: Protein of un 95.5 0.065 1.4E-06 38.5 6.3 78 23-101 6-107 (111)
193 KOG4234|consensus 95.4 0.058 1.3E-06 43.4 6.6 56 53-108 111-170 (271)
194 PF13512 TPR_18: Tetratricopep 95.4 0.14 3E-06 38.2 8.2 87 22-109 19-136 (142)
195 KOG4555|consensus 95.4 0.84 1.8E-05 34.7 12.9 100 2-102 31-145 (175)
196 PHA02537 M terminase endonucle 95.3 0.18 4E-06 40.2 9.1 96 23-121 93-225 (230)
197 KOG1129|consensus 95.2 0.061 1.3E-06 46.0 6.3 89 22-112 299-397 (478)
198 KOG1156|consensus 95.2 0.17 3.8E-06 45.9 9.4 85 17-102 77-173 (700)
199 PF08424 NRDE-2: NRDE-2, neces 95.1 0.38 8.2E-06 39.3 10.7 83 36-119 8-112 (321)
200 KOG4642|consensus 95.1 0.062 1.3E-06 43.9 5.9 56 53-109 26-81 (284)
201 KOG4507|consensus 95.0 0.18 3.8E-06 46.0 9.0 99 22-121 615-725 (886)
202 KOG0545|consensus 94.9 0.1 2.2E-06 43.1 6.7 56 53-108 194-266 (329)
203 KOG4648|consensus 94.8 0.13 2.9E-06 44.3 7.4 106 4-110 84-203 (536)
204 COG1729 Uncharacterized protei 94.4 0.19 4E-06 41.0 7.1 60 53-113 157-219 (262)
205 PF05843 Suf: Suppressor of fo 94.3 0.34 7.5E-06 38.7 8.3 90 16-106 3-104 (280)
206 PF07720 TPR_3: Tetratricopept 94.3 0.18 3.9E-06 29.0 4.9 33 72-104 1-35 (36)
207 KOG2053|consensus 94.2 0.17 3.6E-06 47.3 7.1 80 24-104 20-109 (932)
208 PF07721 TPR_4: Tetratricopept 93.9 0.092 2E-06 27.6 2.9 26 72-97 1-26 (26)
209 KOG2376|consensus 93.6 0.34 7.4E-06 43.7 7.6 49 53-105 95-143 (652)
210 PRK04841 transcriptional regul 93.6 0.74 1.6E-05 41.3 9.9 80 22-101 461-560 (903)
211 KOG2376|consensus 93.5 0.5 1.1E-05 42.7 8.5 53 53-105 191-257 (652)
212 PF10255 Paf67: RNA polymerase 93.3 0.13 2.8E-06 44.1 4.5 61 22-99 131-191 (404)
213 PF13176 TPR_7: Tetratricopept 93.2 0.078 1.7E-06 29.7 2.1 23 22-44 8-31 (36)
214 KOG1585|consensus 93.1 1.1 2.4E-05 37.0 9.3 88 22-109 40-147 (308)
215 COG4700 Uncharacterized protei 93.0 0.89 1.9E-05 36.4 8.4 92 23-116 134-241 (251)
216 TIGR03504 FimV_Cterm FimV C-te 92.9 0.19 4.1E-06 30.3 3.6 29 75-104 2-30 (44)
217 KOG0550|consensus 92.8 0.4 8.7E-06 41.8 6.7 87 22-110 258-358 (486)
218 KOG3824|consensus 92.7 0.21 4.5E-06 42.6 4.8 55 53-108 132-186 (472)
219 PF12862 Apc5: Anaphase-promot 92.6 0.67 1.4E-05 31.1 6.4 31 73-103 42-72 (94)
220 PF03704 BTAD: Bacterial trans 92.5 0.58 1.3E-05 32.9 6.3 85 23-114 16-104 (146)
221 smart00028 TPR Tetratricopepti 92.4 0.12 2.7E-06 25.0 2.0 23 22-44 10-33 (34)
222 KOG0551|consensus 92.4 0.67 1.4E-05 39.5 7.3 55 53-108 97-155 (390)
223 PF13281 DUF4071: Domain of un 92.3 0.91 2E-05 38.7 8.2 76 22-99 188-286 (374)
224 PF10300 DUF3808: Protein of u 92.3 0.7 1.5E-05 39.8 7.6 78 22-99 276-374 (468)
225 PF09613 HrpB1_HrpK: Bacterial 92.3 0.88 1.9E-05 34.5 7.3 66 22-88 19-94 (160)
226 KOG3364|consensus 91.7 0.94 2E-05 34.1 6.7 64 53-116 51-117 (149)
227 PF14853 Fis1_TPR_C: Fis1 C-te 91.4 0.35 7.5E-06 30.1 3.6 31 22-52 10-41 (53)
228 PF13174 TPR_6: Tetratricopept 91.4 0.15 3.2E-06 26.8 1.7 24 22-45 9-33 (33)
229 PF13374 TPR_10: Tetratricopep 91.1 0.34 7.5E-06 26.5 3.1 26 16-41 3-31 (42)
230 PRK04841 transcriptional regul 91.1 1.2 2.7E-05 39.9 8.2 82 22-103 500-604 (903)
231 KOG4340|consensus 91.0 0.65 1.4E-05 39.6 5.9 56 53-109 26-81 (459)
232 KOG0551|consensus 90.8 0.84 1.8E-05 38.9 6.3 79 22-101 90-182 (390)
233 KOG1586|consensus 90.7 1.8 3.8E-05 35.6 7.9 82 22-109 102-191 (288)
234 COG4785 NlpI Lipoprotein NlpI, 90.4 0.61 1.3E-05 38.1 5.0 55 53-108 81-135 (297)
235 KOG1941|consensus 90.2 0.53 1.2E-05 40.9 4.7 80 22-101 131-235 (518)
236 PRK10941 hypothetical protein; 89.5 2 4.4E-05 34.7 7.5 55 53-108 197-251 (269)
237 smart00386 HAT HAT (Half-A-TPR 89.4 1.2 2.5E-05 22.8 4.2 29 86-114 1-29 (33)
238 COG4700 Uncharacterized protei 89.4 1.4 3E-05 35.3 6.2 50 53-103 140-191 (251)
239 COG4105 ComL DNA uptake lipopr 88.8 1.5 3.2E-05 35.7 6.1 51 71-121 33-83 (254)
240 KOG3785|consensus 88.7 5.7 0.00012 34.7 9.8 24 17-40 61-85 (557)
241 PF10300 DUF3808: Protein of u 88.0 2.6 5.7E-05 36.3 7.5 80 25-105 245-338 (468)
242 PF11207 DUF2989: Protein of u 87.7 4 8.7E-05 32.1 7.7 39 53-91 156-197 (203)
243 COG3118 Thioredoxin domain-con 87.6 2.9 6.2E-05 34.9 7.2 50 53-107 150-199 (304)
244 KOG2796|consensus 87.4 2 4.2E-05 36.1 6.1 79 22-104 221-318 (366)
245 TIGR02561 HrpB1_HrpK type III 87.1 2.6 5.6E-05 31.9 6.1 41 68-108 40-80 (153)
246 KOG2053|consensus 87.1 3.8 8.3E-05 38.6 8.4 83 22-106 52-144 (932)
247 PF10373 EST1_DNA_bind: Est1 D 86.6 3.7 8.1E-05 31.4 7.0 56 56-112 1-56 (278)
248 COG3914 Spy Predicted O-linked 86.5 5 0.00011 36.4 8.5 92 22-114 76-184 (620)
249 KOG3785|consensus 86.5 3.6 7.8E-05 35.9 7.4 57 53-115 375-431 (557)
250 PF04910 Tcf25: Transcriptiona 86.3 5.3 0.00012 33.4 8.3 92 22-116 112-234 (360)
251 KOG0376|consensus 86.2 0.7 1.5E-05 40.6 3.1 55 53-108 20-74 (476)
252 KOG3807|consensus 86.2 9.7 0.00021 33.1 9.8 89 27-115 198-320 (556)
253 COG5191 Uncharacterized conser 86.1 1.1 2.4E-05 38.2 4.1 77 35-112 96-182 (435)
254 KOG1308|consensus 86.0 0.16 3.4E-06 43.2 -1.0 93 22-116 123-228 (377)
255 PF15015 NYD-SP12_N: Spermatog 85.4 2 4.3E-05 38.0 5.4 35 74-108 230-264 (569)
256 PF12862 Apc5: Anaphase-promot 85.2 2.6 5.7E-05 28.1 5.0 62 22-83 7-87 (94)
257 KOG1308|consensus 85.2 0.78 1.7E-05 39.1 2.8 61 46-107 119-183 (377)
258 KOG3824|consensus 84.8 2.4 5.2E-05 36.3 5.5 66 10-77 112-189 (472)
259 KOG1070|consensus 84.5 11 0.00024 37.6 10.3 95 15-111 1459-1569(1710)
260 PF06957 COPI_C: Coatomer (COP 83.8 6.4 0.00014 34.1 7.8 97 22-118 213-348 (422)
261 cd02682 MIT_AAA_Arch MIT: doma 83.7 8.6 0.00019 25.6 6.8 17 91-107 32-48 (75)
262 PF04184 ST7: ST7 protein; In 83.6 5.6 0.00012 35.5 7.4 82 25-108 271-382 (539)
263 KOG2422|consensus 83.5 4 8.6E-05 37.1 6.5 82 17-103 345-450 (665)
264 KOG4555|consensus 83.5 4.6 0.0001 30.7 6.0 55 53-108 59-113 (175)
265 KOG1070|consensus 82.7 12 0.00026 37.4 9.7 92 22-114 1539-1642(1710)
266 PRK13184 pknD serine/threonine 82.5 9.3 0.0002 36.2 8.8 96 24-119 486-602 (932)
267 PRK10941 hypothetical protein; 82.2 8.5 0.00018 31.2 7.5 68 6-76 174-253 (269)
268 cd02682 MIT_AAA_Arch MIT: doma 81.8 9.3 0.0002 25.5 6.4 63 30-98 4-69 (75)
269 COG4649 Uncharacterized protei 81.3 25 0.00055 27.9 9.6 91 16-107 96-201 (221)
270 COG3898 Uncharacterized membra 81.1 11 0.00024 33.2 8.2 87 22-110 197-300 (531)
271 KOG1585|consensus 81.0 21 0.00045 29.7 9.3 67 22-95 100-173 (308)
272 KOG2471|consensus 79.2 2.8 6.1E-05 37.7 4.1 68 17-85 285-382 (696)
273 COG3947 Response regulator con 79.0 3.5 7.7E-05 34.8 4.4 36 75-110 282-317 (361)
274 COG3629 DnrI DNA-binding trans 79.0 14 0.00031 30.3 7.9 71 27-100 135-215 (280)
275 KOG2796|consensus 78.9 9 0.0002 32.2 6.7 87 22-108 186-288 (366)
276 KOG1130|consensus 78.6 8.6 0.00019 34.2 6.8 79 23-101 205-304 (639)
277 PF04910 Tcf25: Transcriptiona 78.5 12 0.00026 31.4 7.5 101 7-108 25-140 (360)
278 PF01535 PPR: PPR repeat; Int 77.9 3.1 6.8E-05 21.1 2.6 19 22-40 9-28 (31)
279 smart00745 MIT Microtubule Int 77.5 16 0.00034 23.3 7.1 15 29-43 5-20 (77)
280 COG4976 Predicted methyltransf 77.4 5.8 0.00013 32.6 5.1 58 23-80 5-72 (287)
281 PF08424 NRDE-2: NRDE-2, neces 76.5 22 0.00047 29.0 8.3 59 58-117 6-76 (321)
282 PF14561 TPR_20: Tetratricopep 75.2 7 0.00015 26.4 4.4 47 22-74 31-78 (90)
283 PF14863 Alkyl_sulf_dimr: Alky 73.5 18 0.00038 26.7 6.5 40 71-110 69-108 (141)
284 PF05843 Suf: Suppressor of fo 73.3 21 0.00046 28.4 7.4 60 53-113 17-77 (280)
285 TIGR00756 PPR pentatricopeptid 73.3 5.4 0.00012 20.4 2.8 19 22-40 9-28 (35)
286 smart00299 CLH Clathrin heavy 72.9 28 0.00061 24.1 9.3 91 23-115 17-134 (140)
287 PF11817 Foie-gras_1: Foie gra 72.2 17 0.00037 28.4 6.5 65 53-119 161-227 (247)
288 PF08631 SPO22: Meiosis protei 71.9 48 0.001 26.2 9.8 56 53-109 52-124 (278)
289 KOG1586|consensus 71.1 45 0.00097 27.6 8.7 85 22-110 23-111 (288)
290 KOG1130|consensus 70.9 8.5 0.00018 34.2 4.9 80 22-102 26-125 (639)
291 PF11207 DUF2989: Protein of u 70.9 15 0.00033 28.9 5.9 60 53-113 122-181 (203)
292 COG2976 Uncharacterized protei 70.6 15 0.00033 29.1 5.8 88 22-114 98-199 (207)
293 cd00280 TRFH Telomeric Repeat 70.5 7.3 0.00016 30.7 4.0 58 53-113 85-151 (200)
294 KOG0529|consensus 69.8 15 0.00033 31.9 6.1 54 28-105 90-144 (421)
295 PF13041 PPR_2: PPR repeat fam 69.1 7.8 0.00017 22.5 3.1 26 17-42 5-33 (50)
296 PF09613 HrpB1_HrpK: Bacterial 69.0 34 0.00073 26.0 7.2 57 53-110 26-82 (160)
297 COG3629 DnrI DNA-binding trans 68.5 26 0.00056 28.8 7.0 45 22-66 162-216 (280)
298 PF11846 DUF3366: Domain of un 68.3 31 0.00068 25.5 7.0 48 54-103 128-175 (193)
299 KOG0545|consensus 67.0 47 0.001 27.8 8.1 54 53-107 246-299 (329)
300 KOG3677|consensus 66.3 4.4 9.4E-05 35.7 2.2 27 73-99 273-299 (525)
301 PF00244 14-3-3: 14-3-3 protei 66.0 59 0.0013 25.5 8.4 44 22-66 10-59 (236)
302 PF12968 DUF3856: Domain of Un 65.6 22 0.00048 26.5 5.4 48 53-100 25-83 (144)
303 KOG3364|consensus 65.2 7.7 0.00017 29.2 3.0 30 22-51 80-110 (149)
304 KOG2610|consensus 64.8 45 0.00098 29.1 7.9 40 53-93 191-230 (491)
305 KOG4014|consensus 64.0 18 0.0004 29.0 5.1 78 26-117 48-139 (248)
306 PF12854 PPR_1: PPR repeat 63.8 18 0.00039 19.7 3.8 27 71-97 6-32 (34)
307 KOG2581|consensus 62.9 24 0.00051 31.2 5.9 59 22-81 218-290 (493)
308 KOG2300|consensus 62.8 1.1E+02 0.0025 27.7 10.2 79 22-104 332-433 (629)
309 cd02656 MIT MIT: domain contai 62.7 37 0.0008 21.5 7.4 25 91-116 32-56 (75)
310 KOG2997|consensus 62.0 11 0.00025 32.0 3.8 36 53-104 16-51 (366)
311 PF10602 RPN7: 26S proteasome 61.6 66 0.0014 24.1 8.5 82 22-103 45-144 (177)
312 PF13812 PPR_3: Pentatricopept 60.6 15 0.00033 18.9 3.0 20 22-41 10-30 (34)
313 KOG4507|consensus 59.8 16 0.00034 33.8 4.5 75 30-104 196-281 (886)
314 cd02678 MIT_VPS4 MIT: domain c 59.2 45 0.00098 21.4 7.7 12 30-41 4-16 (75)
315 KOG2396|consensus 58.4 41 0.00089 30.3 6.7 57 53-111 87-144 (568)
316 PF10952 DUF2753: Protein of u 58.4 75 0.0016 23.7 7.3 59 22-112 59-122 (140)
317 cd02680 MIT_calpain7_2 MIT: do 58.0 18 0.00038 24.1 3.5 32 85-116 19-52 (75)
318 COG5159 RPN6 26S proteasome re 58.0 1.1E+02 0.0023 26.3 8.9 79 22-100 54-153 (421)
319 PF05030 SSXT: SSXT protein (N 57.8 14 0.00031 24.1 2.9 28 10-42 20-47 (65)
320 PF14689 SPOB_a: Sensor_kinase 57.5 19 0.00041 22.6 3.5 19 22-40 32-51 (62)
321 PF02184 HAT: HAT (Half-A-TPR) 56.8 17 0.00038 20.5 2.8 23 86-109 1-23 (32)
322 PF13281 DUF4071: Domain of un 56.6 43 0.00093 28.6 6.4 55 53-107 157-218 (374)
323 PF09205 DUF1955: Domain of un 55.7 23 0.00051 26.9 4.2 40 60-100 109-148 (161)
324 PF10516 SHNi-TPR: SHNi-TPR; 53.4 37 0.00081 19.6 4.0 29 73-101 2-30 (38)
325 KOG2460|consensus 53.1 80 0.0017 28.7 7.7 79 22-100 349-450 (593)
326 PF07163 Pex26: Pex26 protein; 52.8 15 0.00033 30.7 3.0 26 12-37 156-182 (309)
327 smart00101 14_3_3 14-3-3 homol 52.3 75 0.0016 25.4 6.9 79 22-101 10-102 (244)
328 PF12583 TPPII_N: Tripeptidyl 51.5 35 0.00075 25.5 4.5 44 53-114 75-118 (139)
329 PF00244 14-3-3: 14-3-3 protei 50.5 45 0.00098 26.2 5.3 43 76-118 5-47 (236)
330 KOG4814|consensus 50.4 2.3E+02 0.0049 26.8 11.6 84 29-115 346-437 (872)
331 KOG0546|consensus 49.4 15 0.00032 31.6 2.5 43 69-111 306-348 (372)
332 PF15015 NYD-SP12_N: Spermatog 47.6 50 0.0011 29.5 5.5 54 22-75 237-303 (569)
333 PF02259 FAT: FAT domain; Int 47.6 87 0.0019 24.4 6.6 51 53-104 274-341 (352)
334 KOG1310|consensus 47.5 63 0.0014 29.7 6.2 74 28-103 389-475 (758)
335 PF02064 MAS20: MAS20 protein 47.2 76 0.0016 22.9 5.6 39 76-115 67-105 (121)
336 COG2912 Uncharacterized conser 46.2 81 0.0018 25.9 6.3 65 53-118 197-264 (269)
337 smart00777 Mad3_BUB1_I Mad3/BU 46.2 1.1E+02 0.0024 22.0 7.3 76 22-97 35-124 (125)
338 PF10345 Cohesin_load: Cohesin 46.0 1.8E+02 0.004 25.7 8.9 47 53-99 377-431 (608)
339 PF09670 Cas_Cas02710: CRISPR- 45.8 1.8E+02 0.0039 24.4 9.9 48 17-65 134-197 (379)
340 KOG2034|consensus 45.5 68 0.0015 30.6 6.3 43 14-63 357-415 (911)
341 cd00280 TRFH Telomeric Repeat 45.4 52 0.0011 26.0 4.8 26 22-48 120-146 (200)
342 COG3898 Uncharacterized membra 44.8 1.9E+02 0.004 25.8 8.5 81 22-103 272-360 (531)
343 PF07079 DUF1347: Protein of u 44.5 2.4E+02 0.0052 25.4 10.3 87 3-101 454-545 (549)
344 PF09797 NatB_MDM20: N-acetylt 44.0 75 0.0016 26.1 5.9 43 53-96 199-241 (365)
345 PF04212 MIT: MIT (microtubule 43.7 79 0.0017 19.6 7.5 26 90-116 30-55 (69)
346 PF08311 Mad3_BUB1_I: Mad3/BUB 43.2 1.2E+02 0.0025 21.5 8.3 78 22-99 35-126 (126)
347 KOG2610|consensus 43.0 55 0.0012 28.6 5.0 75 23-97 185-272 (491)
348 PF02255 PTS_IIA: PTS system, 42.6 57 0.0012 22.4 4.2 40 8-50 4-49 (96)
349 PF10373 EST1_DNA_bind: Est1 D 42.5 86 0.0019 23.8 5.7 52 32-84 1-62 (278)
350 cd02684 MIT_2 MIT: domain cont 42.2 95 0.0021 20.2 7.4 25 91-116 32-56 (75)
351 KOG2300|consensus 41.7 2.3E+02 0.005 25.9 8.7 86 16-102 367-475 (629)
352 PF14852 Fis1_TPR_N: Fis1 N-te 41.3 45 0.00098 18.9 3.0 30 74-103 3-35 (35)
353 COG5191 Uncharacterized conser 41.2 30 0.00064 29.8 3.1 67 16-83 108-190 (435)
354 smart00101 14_3_3 14-3-3 homol 40.5 1.5E+02 0.0032 23.8 6.9 48 53-101 144-200 (244)
355 cd02681 MIT_calpain7_1 MIT: do 40.5 1.1E+02 0.0023 20.2 6.9 64 30-98 4-70 (76)
356 PF02259 FAT: FAT domain; Int 40.4 1.8E+02 0.0038 22.7 10.6 89 22-110 155-296 (352)
357 PF06957 COPI_C: Coatomer (COP 39.1 41 0.00088 29.3 3.7 82 22-103 127-235 (422)
358 cd02683 MIT_1 MIT: domain cont 38.7 1.1E+02 0.0024 20.0 7.1 14 30-43 4-18 (77)
359 KOG3081|consensus 38.0 2.4E+02 0.0052 23.6 10.4 86 22-109 182-279 (299)
360 PF05131 Pep3_Vps18: Pep3/Vps1 37.8 34 0.00073 25.3 2.6 29 80-115 111-139 (147)
361 PRK10454 PTS system N,N'-diace 37.7 60 0.0013 23.2 3.8 41 7-50 20-66 (115)
362 KOG1914|consensus 37.4 1.1E+02 0.0023 28.2 6.0 62 38-100 12-81 (656)
363 cd00215 PTS_IIA_lac PTS_IIA, P 37.1 70 0.0015 22.1 4.0 41 7-50 4-50 (97)
364 KOG2047|consensus 37.1 3.3E+02 0.0072 25.7 9.2 96 22-118 434-559 (835)
365 PRK09591 celC cellobiose phosp 36.8 52 0.0011 23.0 3.4 44 4-50 6-55 (104)
366 KOG3617|consensus 36.8 4.3E+02 0.0092 26.1 10.0 32 69-100 964-995 (1416)
367 PF05944 Phage_term_smal: Phag 36.6 84 0.0018 22.9 4.5 48 22-69 57-110 (132)
368 COG2976 Uncharacterized protei 36.5 2.2E+02 0.0047 22.6 8.2 47 53-99 105-153 (207)
369 PF04781 DUF627: Protein of un 35.9 73 0.0016 22.8 4.0 32 77-108 1-32 (111)
370 TIGR00823 EIIA-LAC phosphotran 35.9 75 0.0016 22.0 4.0 41 7-50 6-52 (99)
371 KOG1941|consensus 35.6 1.4E+02 0.0029 26.5 6.2 78 28-105 98-195 (518)
372 COG2912 Uncharacterized conser 35.3 2.2E+02 0.0047 23.4 7.1 54 22-76 190-253 (269)
373 PF11817 Foie-gras_1: Foie gra 34.5 1.7E+02 0.0037 22.7 6.3 47 53-99 194-245 (247)
374 cd02679 MIT_spastin MIT: domai 33.9 79 0.0017 21.1 3.7 33 27-65 3-36 (79)
375 KOG1914|consensus 33.9 1.9E+02 0.004 26.7 7.0 36 15-50 20-57 (656)
376 KOG1915|consensus 33.9 3.3E+02 0.0071 25.0 8.5 79 22-101 413-500 (677)
377 KOG1463|consensus 33.8 3.2E+02 0.007 23.8 8.5 25 76-100 132-156 (411)
378 KOG0546|consensus 32.6 84 0.0018 27.0 4.5 68 22-90 284-361 (372)
379 TIGR02561 HrpB1_HrpK type III 32.6 2.2E+02 0.0048 21.5 6.4 80 22-103 19-108 (153)
380 PF04053 Coatomer_WDAD: Coatom 32.3 3.3E+02 0.0072 23.5 9.0 27 71-97 346-372 (443)
381 PF10602 RPN7: 26S proteasome 31.7 1.5E+02 0.0032 22.1 5.4 24 76-99 40-63 (177)
382 PF07035 Mic1: Colon cancer-as 30.7 2E+02 0.0044 21.7 5.9 93 22-118 38-132 (167)
383 PF04840 Vps16_C: Vps16, C-ter 30.5 1.6E+02 0.0035 24.2 5.8 84 18-115 211-315 (319)
384 KOG3783|consensus 30.5 3.7E+02 0.0081 24.4 8.3 95 22-118 276-395 (546)
385 KOG0739|consensus 30.1 2.1E+02 0.0046 24.7 6.4 31 30-66 8-39 (439)
386 KOG0889|consensus 29.6 84 0.0018 34.3 4.6 54 16-69 2812-2886(3550)
387 KOG1550|consensus 29.2 4E+02 0.0086 23.4 9.3 44 53-99 309-355 (552)
388 KOG1920|consensus 28.9 1.3E+02 0.0027 29.9 5.4 38 76-117 1003-1040(1265)
389 PF11417 Inhibitor_G39P: Loade 28.9 1.2E+02 0.0026 19.8 3.9 28 14-43 32-60 (71)
390 PF06667 PspB: Phage shock pro 28.8 1.5E+02 0.0033 19.7 4.4 49 57-107 23-71 (75)
391 PF10345 Cohesin_load: Cohesin 28.7 4.1E+02 0.009 23.5 11.7 87 22-109 69-178 (608)
392 PF13226 DUF4034: Domain of un 28.4 1.6E+02 0.0034 24.2 5.3 20 31-50 61-81 (277)
393 TIGR03549 conserved hypothetic 28.2 3.2E+02 0.0069 25.6 7.7 32 14-45 606-638 (718)
394 PF13311 DUF4080: Protein of u 28.0 2.7E+02 0.0058 21.0 8.2 43 2-49 20-64 (190)
395 KOG3303|consensus 27.5 3.1E+02 0.0066 21.6 6.6 86 10-98 32-130 (192)
396 PF10955 DUF2757: Protein of u 26.9 41 0.00088 22.6 1.4 16 93-108 56-71 (76)
397 PF12925 APP_E2: E2 domain of 26.6 1.8E+02 0.004 22.7 5.2 62 53-115 113-177 (193)
398 smart00671 SEL1 Sel1-like repe 26.4 1E+02 0.0022 15.8 4.1 28 73-100 2-33 (36)
399 PF10151 DUF2359: Uncharacteri 26.4 54 0.0012 28.9 2.4 23 30-52 167-190 (469)
400 KOG0985|consensus 25.7 4.8E+02 0.01 26.3 8.5 87 22-117 1057-1144(1666)
401 PF09796 QCR10: Ubiquinol-cyto 25.6 42 0.0009 21.8 1.2 15 101-115 31-45 (64)
402 COG5107 RNA14 Pre-mRNA 3'-end 24.4 1.4E+02 0.003 27.1 4.6 55 59-114 30-84 (660)
403 KOG0530|consensus 24.1 4.4E+02 0.0095 22.2 9.4 83 22-104 52-145 (318)
404 PF04121 Nup84_Nup100: Nuclear 24.1 1E+02 0.0022 27.9 3.8 26 73-98 134-159 (697)
405 KOG3540|consensus 24.0 2.1E+02 0.0046 25.9 5.6 93 10-114 265-391 (615)
406 PF10776 DUF2600: Protein of u 23.9 1.1E+02 0.0023 26.0 3.6 29 89-117 117-145 (330)
407 TIGR00985 3a0801s04tom mitocho 23.7 2.7E+02 0.0058 20.8 5.4 39 76-115 94-133 (148)
408 COG5159 RPN6 26S proteasome re 23.4 1E+02 0.0022 26.5 3.3 26 76-101 7-32 (421)
409 KOG0985|consensus 23.1 5.8E+02 0.013 25.8 8.5 90 14-116 1103-1204(1666)
410 PF09311 Rab5-bind: Rabaptin-l 23.0 66 0.0014 24.2 2.1 30 11-40 136-168 (181)
411 PF04348 LppC: LppC putative l 22.2 29 0.00064 30.6 0.0 79 17-95 25-121 (536)
412 KOG1478|consensus 21.8 1.2E+02 0.0025 25.6 3.4 37 69-105 10-59 (341)
413 KOG1310|consensus 21.7 1.5E+02 0.0031 27.5 4.2 58 30-91 428-495 (758)
414 KOG0292|consensus 21.5 72 0.0016 30.9 2.3 82 22-103 915-1022(1202)
415 COG3947 Response regulator con 21.3 2.2E+02 0.0047 24.4 4.9 44 53-97 295-338 (361)
416 PF01239 PPTA: Protein prenylt 20.8 1.5E+02 0.0031 15.5 3.1 19 91-109 2-20 (31)
417 KOG2168|consensus 20.7 6.1E+02 0.013 24.3 8.1 36 6-41 619-664 (835)
418 PF09311 Rab5-bind: Rabaptin-l 20.4 1.4E+02 0.003 22.5 3.4 32 69-100 137-168 (181)
No 1
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=99.52 E-value=2.9e-13 Score=98.28 Aligned_cols=94 Identities=13% Similarity=-0.066 Sum_probs=86.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
.+.+.|++++|.+.++ ++..+|.+..+|..+ |++|...|++|+++ +|+++.+.+.||.+|...|++.+|
T Consensus 33 ~~~~~g~~~~A~~~~~~al~~~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~l-~p~~~~a~~~lg~~l~~~g~~~eA 111 (144)
T PRK15359 33 ASWQEGDYSRAVIDFSWLVMAQPWSWRAHIALAGTWMMLKEYTTAINFYGHALML-DASHPEPVYQTGVCLKMMGEPGLA 111 (144)
T ss_pred HHHHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc-CCCCcHHHHHHHHHHHHcCCHHHH
Confidence 7889999999999999 999999999999844 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ka~ 117 (122)
+..|+++++..|++|... ++...++
T Consensus 112 i~~~~~Al~~~p~~~~~~-~~~~~~~ 136 (144)
T PRK15359 112 REAFQTAIKMSYADASWS-EIRQNAQ 136 (144)
T ss_pred HHHHHHHHHhCCCChHHH-HHHHHHH
Confidence 999999999999998876 3444443
No 2
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=99.33 E-value=5e-11 Score=82.86 Aligned_cols=95 Identities=14% Similarity=0.115 Sum_probs=85.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
.+++.|++++|.+.++ ++..+|..+.+|..+ |++|...|+++.+. +|+++.+.+.+|..|...|++.+|
T Consensus 26 ~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~-~p~~~~~~~~la~~~~~~g~~~~A 104 (135)
T TIGR02552 26 NLYQQGRYDEALKLFQLLAAYDPYNSRYWLGLAACCQMLKEYEEAIDAYALAAAL-DPDDPRPYFHAAECLLALGEPESA 104 (135)
T ss_pred HHHHcccHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCChHHHHHHHHHHHHcCCHHHH
Confidence 8889999999999999 999999999998744 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHh
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFEKAMV 118 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ka~~ 118 (122)
+..++++++.+|+.+..+. +..++..
T Consensus 105 ~~~~~~al~~~p~~~~~~~-~~~~~~~ 130 (135)
T TIGR02552 105 LKALDLAIEICGENPEYSE-LKERAEA 130 (135)
T ss_pred HHHHHHHHHhccccchHHH-HHHHHHH
Confidence 9999999999999988652 4444443
No 3
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=99.31 E-value=2.8e-11 Score=91.12 Aligned_cols=106 Identities=11% Similarity=0.057 Sum_probs=90.6
Q ss_pred HHHHHHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHH
Q psy16144 10 QIIQTIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQK 77 (122)
Q Consensus 10 ~~~~~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgyk 77 (122)
+|-+++..+|-+ ..-++|++++|+.+++ ...++|.+..+|.-+ |.+|+..|..|..+ +|+||..-+.
T Consensus 30 ~~~~~l~~lY~~A~~ly~~G~l~~A~~~f~~L~~~Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L-~~ddp~~~~~ 108 (157)
T PRK15363 30 DVTQPLNTLYRYAMQLMEVKEFAGAARLFQLLTIYDAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQI-KIDAPQAPWA 108 (157)
T ss_pred ChHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhc-CCCCchHHHH
Confidence 345666777777 6667899999999999 999999999999844 99999999999999 6999999999
Q ss_pred HHHHHHhcccHHHHHHHHHHHHHhC---CCCHHHHHHHHHHHHhcc
Q psy16144 78 LASLLLRMKRYAEAIDVSQTIMKAH---PSYPKVHKQIFEKAMVHL 120 (122)
Q Consensus 78 LA~~ylk~kr~~~AI~vc~kVL~~~---P~ypkir~~il~ka~~~l 120 (122)
+|.+|++.|+...|...++.|+..- |.+..+ -++|...|
T Consensus 109 ag~c~L~lG~~~~A~~aF~~Ai~~~~~~~~~~~l----~~~A~~~L 150 (157)
T PRK15363 109 AAECYLACDNVCYAIKALKAVVRICGEVSEHQIL----RQRAEKML 150 (157)
T ss_pred HHHHHHHcCCHHHHHHHHHHHHHHhccChhHHHH----HHHHHHHH
Confidence 9999999999999999999999876 445444 45555544
No 4
>PRK11189 lipoprotein NlpI; Provisional
Probab=99.26 E-value=6.5e-11 Score=94.62 Aligned_cols=85 Identities=15% Similarity=0.065 Sum_probs=81.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|.+.|++++|...++ +++.+|+++.+|..+ |++|...|++|+++ +|+++.+.+++|.+|...|++.+|
T Consensus 73 ~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l-~P~~~~a~~~lg~~l~~~g~~~eA 151 (296)
T PRK11189 73 LYDSLGLRALARNDFSQALALRPDMADAYNYLGIYLTQAGNFDAAYEAFDSVLEL-DPTYNYAYLNRGIALYYGGRYELA 151 (296)
T ss_pred HHHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHCCCHHHH
Confidence 8999999999999999 999999999999844 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHH
Q psy16144 92 IDVSQTIMKAHPSYPK 107 (122)
Q Consensus 92 I~vc~kVL~~~P~ypk 107 (122)
+..|+++++.+|++|.
T Consensus 152 ~~~~~~al~~~P~~~~ 167 (296)
T PRK11189 152 QDDLLAFYQDDPNDPY 167 (296)
T ss_pred HHHHHHHHHhCCCCHH
Confidence 9999999999999983
No 5
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=99.23 E-value=1.1e-10 Score=96.26 Aligned_cols=97 Identities=9% Similarity=0.040 Sum_probs=87.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
-+...|+|++|.++++ +++.+|+++.+|..+ |++|...+++|+++ +|+++.+.+.||.+|...|+|.+|
T Consensus 11 ~a~~~~~~~~Ai~~~~~Al~~~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l-~P~~~~a~~~lg~~~~~lg~~~eA 89 (356)
T PLN03088 11 EAFVDDDFALAVDLYTQAIDLDPNNAELYADRAQANIKLGNFTEAVADANKAIEL-DPSLAKAYLRKGTACMKLEEYQTA 89 (356)
T ss_pred HHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh-CcCCHHHHHHHHHHHHHhCCHHHH
Confidence 4556799999999999 999999999988743 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHhcc
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFEKAMVHL 120 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ka~~~l 120 (122)
+..++++++.+|+++.++.- +.+|...|
T Consensus 90 ~~~~~~al~l~P~~~~~~~~-l~~~~~kl 117 (356)
T PLN03088 90 KAALEKGASLAPGDSRFTKL-IKECDEKI 117 (356)
T ss_pred HHHHHHHHHhCCCCHHHHHH-HHHHHHHH
Confidence 99999999999999999764 45555443
No 6
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=99.22 E-value=1.7e-10 Score=88.07 Aligned_cols=89 Identities=18% Similarity=0.275 Sum_probs=81.6
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC------------HHHHHHHHHHHHHhhCCCChHHHHHHHHH
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF------------PDDAVVQYTQACRIIGTSDLALHQKLASL 81 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l------------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ 81 (122)
-|..| +|...|++++|.+.++ +++++|+++.++..+ +++|...+++|.+. +|+++.+.+.||+.
T Consensus 75 ~w~~Lg~~~~~~g~~~~A~~a~~~Al~l~P~~~~~~~~lA~aL~~~~g~~~~~~A~~~l~~al~~-dP~~~~al~~LA~~ 153 (198)
T PRK10370 75 QWALLGEYYLWRNDYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALAL-DANEVTALMLLASD 153 (198)
T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHh-CCCChhHHHHHHHH
Confidence 47777 9999999999999999 999999988888732 48999999999999 89999999999999
Q ss_pred HHhcccHHHHHHHHHHHHHhCCCCH
Q psy16144 82 LLRMKRYAEAIDVSQTIMKAHPSYP 106 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~~P~yp 106 (122)
+.+.|+|.+|+..++++|+.+|...
T Consensus 154 ~~~~g~~~~Ai~~~~~aL~l~~~~~ 178 (198)
T PRK10370 154 AFMQADYAQAIELWQKVLDLNSPRV 178 (198)
T ss_pred HHHcCCHHHHHHHHHHHHhhCCCCc
Confidence 9999999999999999999999554
No 7
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=99.16 E-value=9.3e-10 Score=73.69 Aligned_cols=87 Identities=15% Similarity=0.077 Sum_probs=76.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchH---HHHhhC---------HHHHHHHHHHHHHhhCCCC---hHHHHHHHHHHHhc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSAT---NIYNLF---------PDDAVVQYTQACRIIGTSD---LALHQKLASLLLRM 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~---kA~e~l---------y~dAa~~ye~Aw~l~~~~~---~~vgykLA~~ylk~ 85 (122)
.+.+.|++++|.+.|+ +++.+|++. .++..+ |++|...|+.+... .|++ +.+.+.+|.+|.+.
T Consensus 11 ~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~-~p~~~~~~~~~~~~~~~~~~~ 89 (119)
T TIGR02795 11 LVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKK-YPKSPKAPDALLKLGMSLQEL 89 (119)
T ss_pred HHHHcCCHHHHHHHHHHHHHHCCCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHH-CCCCCcccHHHHHHHHHHHHh
Confidence 7788999999999999 999999863 344322 99999999999998 5765 67899999999999
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
|++.+|+..++++++.+|+++.+.
T Consensus 90 ~~~~~A~~~~~~~~~~~p~~~~~~ 113 (119)
T TIGR02795 90 GDKEKAKATLQQVIKRYPGSSAAK 113 (119)
T ss_pred CChHHHHHHHHHHHHHCcCChhHH
Confidence 999999999999999999988764
No 8
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=99.12 E-value=8.6e-10 Score=95.36 Aligned_cols=89 Identities=20% Similarity=0.169 Sum_probs=76.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|.+.|++++|.+.++ +++.+|++..+|..+ |++|...|++|++. +|+++.+.+.+|..|...|++.+|
T Consensus 340 ~~~~~g~~~eA~~~~~kal~l~P~~~~~~~~la~~~~~~g~~~eA~~~~~~al~~-~p~~~~~~~~lg~~~~~~g~~~~A 418 (615)
T TIGR00990 340 FKCLKGKHLEALADLSKSIELDPRVTQSYIKRASMNLELGDPDKAEEDFDKALKL-NSEDPDIYYHRAQLHFIKGEFAQA 418 (615)
T ss_pred HHHHcCCHHHHHHHHHHHHHcCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHcCCHHHH
Confidence 7788899999999999 999999888777633 88899999999888 788899999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
+..|+++++.+|+++.++..
T Consensus 419 ~~~~~kal~l~P~~~~~~~~ 438 (615)
T TIGR00990 419 GKDYQKSIDLDPDFIFSHIQ 438 (615)
T ss_pred HHHHHHHHHcCccCHHHHHH
Confidence 99999999999988777543
No 9
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=99.10 E-value=7.4e-10 Score=99.01 Aligned_cols=88 Identities=13% Similarity=0.024 Sum_probs=83.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+..+.|++|+|+.+++ +++..|+...|+-.+ +++|...+++++.. +|+++.+.+.+|-++.+.|+|++|
T Consensus 95 i~~~~g~~~ea~~~l~~~~~~~Pd~~~a~~~~a~~L~~~~~~eeA~~~~~~~l~~-~p~~~~~~~~~a~~l~~~g~~~~A 173 (694)
T PRK15179 95 ALEAAHRSDEGLAVWRGIHQRFPDSSEAFILMLRGVKRQQGIEAGRAEIELYFSG-GSSSAREILLEAKSWDEIGQSEQA 173 (694)
T ss_pred HHHHcCCcHHHHHHHHHHHhhCCCcHHHHHHHHHHHHHhccHHHHHHHHHHHhhc-CCCCHHHHHHHHHHHHHhcchHHH
Confidence 8899999999999999 999999999999855 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~ 110 (122)
+++|+++++.+|+++.++.
T Consensus 174 ~~~y~~~~~~~p~~~~~~~ 192 (694)
T PRK15179 174 DACFERLSRQHPEFENGYV 192 (694)
T ss_pred HHHHHHHHhcCCCcHHHHH
Confidence 9999999999999998864
No 10
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=99.08 E-value=1.8e-09 Score=65.72 Aligned_cols=82 Identities=13% Similarity=0.246 Sum_probs=75.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
.+.+.|++++|.+.++ +++.+|+...++..+ +++|...|+.+... .|.++.+.+.+|..+...|++.+|
T Consensus 9 ~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a 87 (100)
T cd00189 9 LYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALEL-DPDNAKAYYNLGLAYYKLGKYEEA 87 (100)
T ss_pred HHHHHhcHHHHHHHHHHHHhcCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC-CCcchhHHHHHHHHHHHHHhHHHH
Confidence 6778899999999999 999999987776633 99999999999999 699999999999999999999999
Q ss_pred HHHHHHHHHhCCC
Q psy16144 92 IDVSQTIMKAHPS 104 (122)
Q Consensus 92 I~vc~kVL~~~P~ 104 (122)
...+.++++.+|+
T Consensus 88 ~~~~~~~~~~~~~ 100 (100)
T cd00189 88 LEAYEKALELDPN 100 (100)
T ss_pred HHHHHHHHccCCC
Confidence 9999999999884
No 11
>KOG0553|consensus
Probab=99.07 E-value=6.1e-10 Score=91.22 Aligned_cols=98 Identities=16% Similarity=0.138 Sum_probs=89.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
=..+.++|.+|++... +++++|.++-.|+ |+ |++|++--+.|+.+ +|+...+.-+||.+|+-.|+|++|
T Consensus 90 ~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~i-Dp~yskay~RLG~A~~~~gk~~~A 168 (304)
T KOG0553|consen 90 KLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSI-DPHYSKAYGRLGLAYLALGKYEEA 168 (304)
T ss_pred HHHHhhhHHHHHHHHHHHHhcCCCcchHHHHHHHHHHHhcchHHHHHHHHHHHhc-ChHHHHHHHHHHHHHHccCcHHHH
Confidence 3567799999999999 9999999888887 33 99999999999999 899999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHhccC
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFEKAMVHLR 121 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ka~~~lr 121 (122)
|+-|+|+|.++|+++ +.++=|+.|+..+|
T Consensus 169 ~~aykKaLeldP~Ne-~~K~nL~~Ae~~l~ 197 (304)
T KOG0553|consen 169 IEAYKKALELDPDNE-SYKSNLKIAEQKLN 197 (304)
T ss_pred HHHHHhhhccCCCcH-HHHHHHHHHHHHhc
Confidence 999999999999998 66678899988776
No 12
>PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional
Probab=99.06 E-value=1.4e-09 Score=83.14 Aligned_cols=86 Identities=14% Similarity=0.137 Sum_probs=77.9
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHH-HHhccc--HHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASL-LLRMKR--YAEA 91 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~-ylk~kr--~~~A 91 (122)
..++.+++...++ +++.||+++.+|..+ |++|...|++|.++ +|+++.+.+.+|.+ |...|+ +.+|
T Consensus 51 ~~~~~~~~i~~l~~~L~~~P~~~~~w~~Lg~~~~~~g~~~~A~~a~~~Al~l-~P~~~~~~~~lA~aL~~~~g~~~~~~A 129 (198)
T PRK10370 51 SQQTPEAQLQALQDKIRANPQNSEQWALLGEYYLWRNDYDNALLAYRQALQL-RGENAELYAALATVLYYQAGQHMTPQT 129 (198)
T ss_pred CchhHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHhcCCCCcHHH
Confidence 4566789999999 999999999999955 99999999999999 79999999999997 478888 5999
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
+.+++++++.+|+++.++..
T Consensus 130 ~~~l~~al~~dP~~~~al~~ 149 (198)
T PRK10370 130 REMIDKALALDANEVTALML 149 (198)
T ss_pred HHHHHHHHHhCCCChhHHHH
Confidence 99999999999999988654
No 13
>PRK12370 invasion protein regulator; Provisional
Probab=99.06 E-value=2.2e-09 Score=92.54 Aligned_cols=84 Identities=10% Similarity=0.005 Sum_probs=68.6
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
+.+++++|.+.++ +++.+|+++.+|..+ +++|...|++|+++ +|+++.+.+.||..|...|++++|+..
T Consensus 316 ~~~~~~~A~~~~~~Al~ldP~~~~a~~~lg~~~~~~g~~~~A~~~~~~Al~l-~P~~~~a~~~lg~~l~~~G~~~eAi~~ 394 (553)
T PRK12370 316 KQNAMIKAKEHAIKATELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANLL-SPISADIKYYYGWNLFMAGQLEEALQT 394 (553)
T ss_pred cchHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHccCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHCCCHHHHHHH
Confidence 4456888888888 888888888887633 88888888888888 688888888888888888888888888
Q ss_pred HHHHHHhCCCCHHHH
Q psy16144 95 SQTIMKAHPSYPKVH 109 (122)
Q Consensus 95 c~kVL~~~P~ypkir 109 (122)
++++++.+|++|...
T Consensus 395 ~~~Al~l~P~~~~~~ 409 (553)
T PRK12370 395 INECLKLDPTRAAAG 409 (553)
T ss_pred HHHHHhcCCCChhhH
Confidence 888888888877653
No 14
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=99.05 E-value=1.5e-09 Score=100.15 Aligned_cols=90 Identities=13% Similarity=0.088 Sum_probs=77.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++.+.|++++|++.++ +++.+|+++.++..+ +++|...|++|+++ +|+++.+.+.||..|...|++++|
T Consensus 618 ~l~~lG~~deA~~~l~~AL~l~Pd~~~a~~nLG~aL~~~G~~eeAi~~l~~AL~l-~P~~~~a~~nLA~al~~lGd~~eA 696 (987)
T PRK09782 618 IYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKG-LPDDPALIRQLAYVNQRLDDMAAT 696 (987)
T ss_pred HHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHCCCHHHH
Confidence 7888888888888888 888888888877733 88899999999998 789999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~i 112 (122)
++.|+++++..|+.+.|...+
T Consensus 697 ~~~l~~Al~l~P~~a~i~~~~ 717 (987)
T PRK09782 697 QHYARLVIDDIDNQALITPLT 717 (987)
T ss_pred HHHHHHHHhcCCCCchhhhhh
Confidence 999999999999888887443
No 15
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=99.04 E-value=2.5e-09 Score=92.53 Aligned_cols=89 Identities=15% Similarity=0.095 Sum_probs=79.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++...|++++|.+.++ +++.+|+++.+|..+ |++|...|++|.++ +|++..+.+.||..|.+.|++.+|
T Consensus 374 ~~~~~g~~~eA~~~~~~al~~~p~~~~~~~~lg~~~~~~g~~~~A~~~~~kal~l-~P~~~~~~~~la~~~~~~g~~~eA 452 (615)
T TIGR00990 374 MNLELGDPDKAEEDFDKALKLNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDL-DPDFIFSHIQLGVTQYKEGSIASS 452 (615)
T ss_pred HHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc-CccCHHHHHHHHHHHHHCCCHHHH
Confidence 7888899999999999 999999998888744 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
+..++++++.+|++|.+...
T Consensus 453 ~~~~~~al~~~P~~~~~~~~ 472 (615)
T TIGR00990 453 MATFRRCKKNFPEAPDVYNY 472 (615)
T ss_pred HHHHHHHHHhCCCChHHHHH
Confidence 99999999999998877543
No 16
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=99.03 E-value=7.9e-10 Score=69.15 Aligned_cols=57 Identities=21% Similarity=0.331 Sum_probs=53.5
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
|++|...|++++.. +|+|+.+.+.||.+|++.|++.+|.+++++++..+|++|.++.
T Consensus 7 ~~~A~~~~~~~l~~-~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~~~~~~~~~~ 63 (68)
T PF14559_consen 7 YDEAIELLEKALQR-NPDNPEARLLLAQCYLKQGQYDEAEELLERLLKQDPDNPEYQQ 63 (68)
T ss_dssp HHHHHHHHHHHHHH-TTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCHGGGTTHHHHHH
T ss_pred HHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCHHHHHH
Confidence 78999999999999 7999999999999999999999999999999999999998864
No 17
>PRK12370 invasion protein regulator; Provisional
Probab=99.02 E-value=2.6e-09 Score=92.04 Aligned_cols=87 Identities=9% Similarity=0.092 Sum_probs=78.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++...|++++|.+.++ +++.||+++.+|..+ +++|...|++|.++ +|.++.+.+.++..+...|++++|
T Consensus 347 ~~~~~g~~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~G~~~eAi~~~~~Al~l-~P~~~~~~~~~~~~~~~~g~~eeA 425 (553)
T PRK12370 347 INTIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINECLKL-DPTRAAAGITKLWITYYHTGIDDA 425 (553)
T ss_pred HHHHccCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHhc-CCCChhhHHHHHHHHHhccCHHHH
Confidence 7788899999999999 999999999998854 99999999999999 799998888888888889999999
Q ss_pred HHHHHHHHHhC-CCCHHHH
Q psy16144 92 IDVSQTIMKAH-PSYPKVH 109 (122)
Q Consensus 92 I~vc~kVL~~~-P~ypkir 109 (122)
++.++++++.+ |++|...
T Consensus 426 ~~~~~~~l~~~~p~~~~~~ 444 (553)
T PRK12370 426 IRLGDELRSQHLQDNPILL 444 (553)
T ss_pred HHHHHHHHHhccccCHHHH
Confidence 99999999886 7788754
No 18
>PRK15359 type III secretion system chaperone protein SscB; Provisional
Probab=99.00 E-value=2.7e-09 Score=77.36 Aligned_cols=82 Identities=15% Similarity=0.089 Sum_probs=71.4
Q ss_pred ChHHHHHHHH-HHhhCcchHHHHh--hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 28 RWEPLSAGLR-KKQYISSATNIYN--LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 28 k~d~A~elL~-~l~~nps~~kA~e--~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
|..--+++|+ +++.+|+.....- +. |++|...|++|+.+ +|+++.+.+.||.++.+.|++.+|++.|+++++
T Consensus 8 ~~~~~~~~~~~al~~~p~~~~~~g~~~~~~g~~~~A~~~~~~al~~-~P~~~~a~~~lg~~~~~~g~~~~A~~~y~~Al~ 86 (144)
T PRK15359 8 KNKIPEDILKQLLSVDPETVYASGYASWQEGDYSRAVIDFSWLVMA-QPWSWRAHIALAGTWMMLKEYTTAINFYGHALM 86 (144)
T ss_pred ccCCHHHHHHHHHHcCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHc-CCCcHHHHHHHHHHHHHHhhHHHHHHHHHHHHh
Confidence 4445578999 9999998643221 22 99999999999999 799999999999999999999999999999999
Q ss_pred hCCCCHHHHH
Q psy16144 101 AHPSYPKVHK 110 (122)
Q Consensus 101 ~~P~ypkir~ 110 (122)
.+|++|..+.
T Consensus 87 l~p~~~~a~~ 96 (144)
T PRK15359 87 LDASHPEPVY 96 (144)
T ss_pred cCCCCcHHHH
Confidence 9999998865
No 19
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=98.98 E-value=2e-09 Score=67.17 Aligned_cols=53 Identities=23% Similarity=0.380 Sum_probs=49.0
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYP 106 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~yp 106 (122)
|++|...|+++.+. +|+++.+.+.||.++...|++.+|+..++++++.+|++|
T Consensus 13 ~~~A~~~~~~~l~~-~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~~P~~p 65 (65)
T PF13432_consen 13 YDEAIAAFEQALKQ-DPDNPEAWYLLGRILYQQGRYDEALAYYERALELDPDNP 65 (65)
T ss_dssp HHHHHHHHHHHHCC-STTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTT-H
T ss_pred HHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCC
Confidence 88999999999999 699999999999999999999999999999999999987
No 20
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=98.97 E-value=3.8e-09 Score=82.49 Aligned_cols=94 Identities=20% Similarity=0.239 Sum_probs=76.7
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhc
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRM 85 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~ 85 (122)
|..+ ++.+.|+.++|.+.++ +++.+|.++.++..+ +++|...++...+. .|+||.+...||..|+..
T Consensus 149 ~~~~a~~~~~~G~~~~A~~~~~~al~~~P~~~~~~~~l~~~li~~~~~~~~~~~l~~~~~~-~~~~~~~~~~la~~~~~l 227 (280)
T PF13429_consen 149 WLALAEIYEQLGDPDKALRDYRKALELDPDDPDARNALAWLLIDMGDYDEAREALKRLLKA-APDDPDLWDALAAAYLQL 227 (280)
T ss_dssp HHHHHHHHHHCCHHHHHHHHHHHHHHH-TT-HHHHHHHHHHHCTTCHHHHHHHHHHHHHHH--HTSCCHCHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHCCChHHHHHHHHHHHHH-CcCHHHHHHHHHHHhccc
Confidence 4444 8999999999999999 999999988877633 77766666665555 599999999999999999
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir~~i 112 (122)
|++.+|++.++++++.+|++|.+....
T Consensus 228 g~~~~Al~~~~~~~~~~p~d~~~~~~~ 254 (280)
T PF13429_consen 228 GRYEEALEYLEKALKLNPDDPLWLLAY 254 (280)
T ss_dssp T-HHHHHHHHHHHHHHSTT-HHHHHHH
T ss_pred ccccccccccccccccccccccccccc
Confidence 999999999999999999999987643
No 21
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=98.91 E-value=3e-08 Score=71.21 Aligned_cols=87 Identities=14% Similarity=0.126 Sum_probs=43.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhh-CCCChHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRII-GTSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~-~~~~~~vgykLA~~ylk~kr~~~ 90 (122)
+|...|++++|.++++ +++.+|....++..+ |++|...|+++.... .+..+...+.+|..|.+.|++.+
T Consensus 74 ~~~~~~~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~g~~~~A~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~g~~~~ 153 (234)
T TIGR02521 74 YYQQLGELEKAEDSFRRALTLNPNNGDVLNNYGTFLCQQGKYEQAMQQFEQAIEDPLYPQPARSLENAGLCALKAGDFDK 153 (234)
T ss_pred HHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHcccHHHHHHHHHHHHhccccccchHHHHHHHHHHHHcCCHHH
Confidence 4555555555555555 555555544433311 555555555554431 12334445555555555555555
Q ss_pred HHHHHHHHHHhCCCCHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypki 108 (122)
|+..+.++++.+|+++.+
T Consensus 154 A~~~~~~~~~~~~~~~~~ 171 (234)
T TIGR02521 154 AEKYLTRALQIDPQRPES 171 (234)
T ss_pred HHHHHHHHHHhCcCChHH
Confidence 555555555555555443
No 22
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=98.91 E-value=2.7e-08 Score=71.39 Aligned_cols=79 Identities=13% Similarity=0.132 Sum_probs=74.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
.|...|++++|.+.++ +++.+|+...++..+ +++|...|+++.+. .|.++.+.+.+|..|...|++.+|
T Consensus 40 ~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~~~~~~A~~~~~~al~~-~~~~~~~~~~~~~~~~~~g~~~~A 118 (234)
T TIGR02521 40 GYLEQGDLEVAKENLDKALEHDPDDYLAYLALALYYQQLGELEKAEDSFRRALTL-NPNNGDVLNNYGTFLCQQGKYEQA 118 (234)
T ss_pred HHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHhh-CCCCHHHHHHHHHHHHHcccHHHH
Confidence 8889999999999999 999999988887743 99999999999999 699999999999999999999999
Q ss_pred HHHHHHHHHh
Q psy16144 92 IDVSQTIMKA 101 (122)
Q Consensus 92 I~vc~kVL~~ 101 (122)
+..++++++.
T Consensus 119 ~~~~~~~~~~ 128 (234)
T TIGR02521 119 MQQFEQAIED 128 (234)
T ss_pred HHHHHHHHhc
Confidence 9999999975
No 23
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=98.90 E-value=2.9e-08 Score=75.08 Aligned_cols=94 Identities=9% Similarity=-0.087 Sum_probs=80.4
Q ss_pred HHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchH---HHHhhC---------HHHHHHHHHHHHHhhCCCChH---HHH
Q psy16144 15 IAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSAT---NIYNLF---------PDDAVVQYTQACRIIGTSDLA---LHQ 76 (122)
Q Consensus 15 ~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~---kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~---vgy 76 (122)
+..++.+ .+.+.|+|++|.+.++ +++.+|+++ .++..+ +++|...|+++.+. .|+++. +.|
T Consensus 33 ~~~~~~~g~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~a~~~la~~~~~~~~~~~A~~~~~~~l~~-~p~~~~~~~a~~ 111 (235)
T TIGR03302 33 AEELYEEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRL-HPNHPDADYAYY 111 (235)
T ss_pred HHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCCCchhHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHH-CcCCCchHHHHH
Confidence 3444555 7788999999999999 999999866 454422 99999999999999 688887 689
Q ss_pred HHHHHHHhc--------ccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 77 KLASLLLRM--------KRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 77 kLA~~ylk~--------kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
.+|.+|.+. |++.+|+..++++++.+|+++...
T Consensus 112 ~~g~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~ 152 (235)
T TIGR03302 112 LRGLSNYNQIDRVDRDQTAAREAFEAFQELIRRYPNSEYAP 152 (235)
T ss_pred HHHHHHHHhcccccCCHHHHHHHHHHHHHHHHHCCCChhHH
Confidence 999999987 899999999999999999998764
No 24
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=98.89 E-value=9.9e-09 Score=64.43 Aligned_cols=50 Identities=18% Similarity=0.333 Sum_probs=48.5
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcc-cHHHHHHHHHHHHHhCC
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMK-RYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~k-r~~~AI~vc~kVL~~~P 103 (122)
|++|+.+|++|.++ +|+++.+.+.+|.+|++.| ++.+|+..++++++.+|
T Consensus 19 ~~~A~~~~~~ai~~-~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~l~P 69 (69)
T PF13414_consen 19 YEEAIEYFEKAIEL-DPNNAEAYYNLGLAYMKLGKDYEEAIEDFEKALKLDP 69 (69)
T ss_dssp HHHHHHHHHHHHHH-STTHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHHST
T ss_pred HHHHHHHHHHHHHc-CCCCHHHHHHHHHHHHHhCccHHHHHHHHHHHHHcCc
Confidence 99999999999999 7999999999999999999 89999999999999998
No 25
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=98.88 E-value=2.1e-08 Score=88.37 Aligned_cols=86 Identities=17% Similarity=0.140 Sum_probs=54.1
Q ss_pred HhcCCCChHH----HHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEP----LSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~----A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr 87 (122)
+|.+.|++++ |.+.++ +++.+|++..++..+ +++|...|++|+++ +|+++.+.+.||..|.+.|+
T Consensus 255 ~l~~~G~~~eA~~~A~~~~~~Al~l~P~~~~a~~~lg~~l~~~g~~~eA~~~l~~al~l-~P~~~~a~~~La~~l~~~G~ 333 (656)
T PRK15174 255 AYYQSGRSREAKLQAAEHWRHALQFNSDNVRIVTLYADALIRTGQNEKAIPLLQQSLAT-HPDLPYVRAMYARALRQVGQ 333 (656)
T ss_pred HHHHcCCchhhHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHCCC
Confidence 5556666664 566666 666666666655533 66666666666666 46666666666666666666
Q ss_pred HHHHHHHHHHHHHhCCCCHHH
Q psy16144 88 YAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 88 ~~~AI~vc~kVL~~~P~ypki 108 (122)
+++|++.++++++.+|+++..
T Consensus 334 ~~eA~~~l~~al~~~P~~~~~ 354 (656)
T PRK15174 334 YTAASDEFVQLAREKGVTSKW 354 (656)
T ss_pred HHHHHHHHHHHHHhCccchHH
Confidence 666666666666666665543
No 26
>PRK15331 chaperone protein SicA; Provisional
Probab=98.87 E-value=2.6e-08 Score=75.68 Aligned_cols=85 Identities=13% Similarity=-0.021 Sum_probs=77.3
Q ss_pred cCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHH
Q psy16144 24 LLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 24 i~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
-++||+++|+.+|+ ..-++|.+++.|.-+ |+.|+..|..|..+ +++||..-|-.|.+|+..|+...|..
T Consensus 48 y~~Gk~~eA~~~F~~L~~~d~~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l-~~~dp~p~f~agqC~l~l~~~~~A~~ 126 (165)
T PRK15331 48 YNQGRLDEAETFFRFLCIYDFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTL-LKNDYRPVFFTGQCQLLMRKAAKARQ 126 (165)
T ss_pred HHCCCHHHHHHHHHHHHHhCcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHc-ccCCCCccchHHHHHHHhCCHHHHHH
Confidence 35699999999999 888999999988744 99999999999999 59999999999999999999999999
Q ss_pred HHHHHHHhCCCCHHHHH
Q psy16144 94 VSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 94 vc~kVL~~~P~ypkir~ 110 (122)
+++.|.. +|.++.++.
T Consensus 127 ~f~~a~~-~~~~~~l~~ 142 (165)
T PRK15331 127 CFELVNE-RTEDESLRA 142 (165)
T ss_pred HHHHHHh-CcchHHHHH
Confidence 9999996 888887764
No 27
>KOG4626|consensus
Probab=98.87 E-value=1.1e-08 Score=91.32 Aligned_cols=89 Identities=19% Similarity=0.224 Sum_probs=80.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|-|+|++++|....+ +++..|....|+.-| -+.|+.+|++|... +|++..+.-+||..|-.+|...+|
T Consensus 397 i~kqqgnl~~Ai~~YkealrI~P~fAda~~NmGnt~ke~g~v~~A~q~y~rAI~~-nPt~AeAhsNLasi~kDsGni~~A 475 (966)
T KOG4626|consen 397 IYKQQGNLDDAIMCYKEALRIKPTFADALSNMGNTYKEMGDVSAAIQCYTRAIQI-NPTFAEAHSNLASIYKDSGNIPEA 475 (966)
T ss_pred HHHhcccHHHHHHHHHHHHhcCchHHHHHHhcchHHHHhhhHHHHHHHHHHHHhc-CcHHHHHHhhHHHHhhccCCcHHH
Confidence 8888899999999888 899999888888855 77899999999999 899999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
|+-|+.+|++.||+|..--.
T Consensus 476 I~sY~~aLklkPDfpdA~cN 495 (966)
T KOG4626|consen 476 IQSYRTALKLKPDFPDAYCN 495 (966)
T ss_pred HHHHHHHHccCCCCchhhhH
Confidence 99999999999999987543
No 28
>PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional
Probab=98.87 E-value=1.8e-08 Score=93.09 Aligned_cols=84 Identities=18% Similarity=0.154 Sum_probs=76.0
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
..|++++|...++ +++.+|+ ..+|.-+ +++|...|++|+++ +|+++.+.+.||..+...|++++|+++
T Consensus 588 ~~Gr~~eAl~~~~~AL~l~P~-~~a~~~LA~~l~~lG~~deA~~~l~~AL~l-~Pd~~~a~~nLG~aL~~~G~~eeAi~~ 665 (987)
T PRK09782 588 IPGQPELALNDLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALEL-EPNNSNYQAALGYALWDSGDIAQSREM 665 (987)
T ss_pred hCCCHHHHHHHHHHHHHhCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHCCCHHHHHHH
Confidence 3499999999999 9999997 7777633 99999999999999 799999999999999999999999999
Q ss_pred HHHHHHhCCCCHHHHH
Q psy16144 95 SQTIMKAHPSYPKVHK 110 (122)
Q Consensus 95 c~kVL~~~P~ypkir~ 110 (122)
++++++.+|++|.++.
T Consensus 666 l~~AL~l~P~~~~a~~ 681 (987)
T PRK09782 666 LERAHKGLPDDPALIR 681 (987)
T ss_pred HHHHHHhCCCCHHHHH
Confidence 9999999999998864
No 29
>PRK15174 Vi polysaccharide export protein VexE; Provisional
Probab=98.87 E-value=2.9e-08 Score=87.56 Aligned_cols=93 Identities=11% Similarity=-0.056 Sum_probs=80.8
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhc
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRM 85 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~ 85 (122)
++++ .....|++++|.+.++ +++.+|+++.+|..+ +++|...|++|.++ +|+++.+...||..|...
T Consensus 79 l~~l~~~~l~~g~~~~A~~~l~~~l~~~P~~~~a~~~la~~l~~~g~~~~Ai~~l~~Al~l-~P~~~~a~~~la~~l~~~ 157 (656)
T PRK15174 79 LRRWVISPLASSQPDAVLQVVNKLLAVNVCQPEDVLLVASVLLKSKQYATVADLAEQAWLA-FSGNSQIFALHLRTLVLM 157 (656)
T ss_pred HHHHhhhHhhcCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHcCCHHHHHHHHHHHHHh-CCCcHHHHHHHHHHHHHC
Confidence 4555 6677899999999999 999999998888744 89999999999999 799999999999999999
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|++.+|+..+++++..+|+.+.....
T Consensus 158 g~~~eA~~~~~~~~~~~P~~~~a~~~ 183 (656)
T PRK15174 158 DKELQAISLARTQAQEVPPRGDMIAT 183 (656)
T ss_pred CChHHHHHHHHHHHHhCCCCHHHHHH
Confidence 99999999999999999988877644
No 30
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=98.83 E-value=4.8e-08 Score=82.74 Aligned_cols=91 Identities=14% Similarity=0.159 Sum_probs=78.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC--------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF--------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAI 92 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l--------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI 92 (122)
+|...|++++|.+.++ +++.+|+++.++..+ -.+|...|++++++ .|+++.+...+|..|...|++.+|+
T Consensus 779 ~~~~~g~~~~A~~~~~~~~~~~p~~~~~~~~l~~~~~~~~~~~A~~~~~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~A~ 857 (899)
T TIGR02917 779 LYLAQKDYDKAIKHYRTVVKKAPDNAVVLNNLAWLYLELKDPRALEYAEKALKL-APNIPAILDTLGWLLVEKGEADRAL 857 (899)
T ss_pred HHHHCcCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCcHHHHHHHHHHHhh-CCCCcHHHHHHHHHHHHcCCHHHHH
Confidence 7888899999999999 999999888877744 35689999999998 6899999999999999999999999
Q ss_pred HHHHHHHHhCCCCHHHHHHHH
Q psy16144 93 DVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 93 ~vc~kVL~~~P~ypkir~~il 113 (122)
..++++++.+|+.|.++..+.
T Consensus 858 ~~~~~a~~~~~~~~~~~~~l~ 878 (899)
T TIGR02917 858 PLLRKAVNIAPEAAAIRYHLA 878 (899)
T ss_pred HHHHHHHhhCCCChHHHHHHH
Confidence 999999999999888876543
No 31
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=98.81 E-value=4.2e-08 Score=90.95 Aligned_cols=92 Identities=17% Similarity=0.128 Sum_probs=80.3
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcchHHH------------Hh-------hC----HHHHHHHHHHHHHhhCCCC
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISSATNI------------YN-------LF----PDDAVVQYTQACRIIGTSD 71 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA------------~e-------~l----y~dAa~~ye~Aw~l~~~~~ 71 (122)
++.| +|.+.|++++|.++|+ +++.+|+.... |. ++ +++|...|++|.+. +|++
T Consensus 306 ~~~Lg~~~~~~g~~~eA~~~l~~Al~~~p~~~~~~~~~~ll~~~~~~~~~~~g~~~~~~g~~~eA~~~~~~Al~~-~P~~ 384 (1157)
T PRK11447 306 LGALGQAYSQQGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRYWLLIQQGDAALKANNLAQAERLYQQARQV-DNTD 384 (1157)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCCCccchhHHHHHHHhhhHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCC
Confidence 3444 8899999999999999 99999986531 11 11 99999999999999 7999
Q ss_pred hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 72 LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 72 ~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
+.+.+.||.+|...|++.+|++.|+++++.+|+++.+..
T Consensus 385 ~~a~~~Lg~~~~~~g~~~eA~~~y~~aL~~~p~~~~a~~ 423 (1157)
T PRK11447 385 SYAVLGLGDVAMARKDYAAAERYYQQALRMDPGNTNAVR 423 (1157)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCCHHHHH
Confidence 999999999999999999999999999999999988754
No 32
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=98.80 E-value=4.9e-08 Score=67.73 Aligned_cols=78 Identities=19% Similarity=0.141 Sum_probs=69.7
Q ss_pred HHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhC
Q psy16144 33 SAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAH 102 (122)
Q Consensus 33 ~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~ 102 (122)
.++|+ ++..+|+...+...+ |++|...|+.+... +|+++.+.+.+|..|.+.|++.+|+.+++++++.+
T Consensus 3 ~~~~~~~l~~~p~~~~~~~~~a~~~~~~~~~~~A~~~~~~~~~~-~p~~~~~~~~la~~~~~~~~~~~A~~~~~~~~~~~ 81 (135)
T TIGR02552 3 GATLKDLLGLDSEQLEQIYALAYNLYQQGRYDEALKLFQLLAAY-DPYNSRYWLGLAACCQMLKEYEEAIDAYALAAALD 81 (135)
T ss_pred chhHHHHHcCChhhHHHHHHHHHHHHHcccHHHHHHHHHHHHHh-CCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 36788 999999877665522 99999999999999 79999999999999999999999999999999999
Q ss_pred CCCHHHHHH
Q psy16144 103 PSYPKVHKQ 111 (122)
Q Consensus 103 P~ypkir~~ 111 (122)
|++|..+..
T Consensus 82 p~~~~~~~~ 90 (135)
T TIGR02552 82 PDDPRPYFH 90 (135)
T ss_pred CCChHHHHH
Confidence 999988743
No 33
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=98.80 E-value=5.6e-08 Score=86.84 Aligned_cols=93 Identities=12% Similarity=0.137 Sum_probs=85.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++..+|++++|+++|+ ++...|+++.++..+ +++|...|++|+++ +|+++.+.+.+|.++++.|++.+|
T Consensus 368 ~l~~~g~~~eA~~~l~~al~~~P~n~~l~~~lA~l~~~~g~~~~A~~~l~~al~l-~Pd~~~l~~~~a~~al~~~~~~~A 446 (765)
T PRK10049 368 VAKYSNDLPQAEMRARELAYNAPGNQGLRIDYASVLQARGWPRAAENELKKAEVL-EPRNINLEVEQAWTALDLQEWRQM 446 (765)
T ss_pred HHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHhh-CCCChHHHHHHHHHHHHhCCHHHH
Confidence 8889999999999999 999999999888844 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ka~ 117 (122)
+.+.+++++.+|++|-+.. |++++
T Consensus 447 ~~~~~~ll~~~Pd~~~~~~--~~~~~ 470 (765)
T PRK10049 447 DVLTDDVVAREPQDPGVQR--LARAR 470 (765)
T ss_pred HHHHHHHHHhCCCCHHHHH--HHHHH
Confidence 9999999999999998853 44443
No 34
>TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein. This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system, analogous to the LPXTG/sortase system common in Gram-positive bacteria. This protein occurs in a species if and only if a transmembrane histidine kinase (TIGR02916) and a DNA-binding response regulator (TIGR02915) also occur. The present of tetratricopeptide repeats (TPR) suggests protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.
Probab=98.77 E-value=1.4e-07 Score=79.92 Aligned_cols=88 Identities=17% Similarity=0.180 Sum_probs=66.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|...|++++|.++++ +++.+|..+.+|..+ +++|...|+++.+. +|+++.+.+.+|..|.+.|++.+|
T Consensus 576 ~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~-~~~~~~~~~~l~~~~~~~~~~~~A 654 (899)
T TIGR02917 576 YYLGKGQLKKALAILNEAADAAPDSPEAWLMLGRAQLAAGDLNKAVSSFKKLLAL-QPDSALALLLLADAYAVMKNYAKA 654 (899)
T ss_pred HHHHCCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh-CCCChHHHHHHHHHHHHcCCHHHH
Confidence 6667777777777777 777777777766643 77888888888777 577777788888888888888888
Q ss_pred HHHHHHHHHhCCCCHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~ 110 (122)
+.+++++++.+|+++..+.
T Consensus 655 ~~~~~~~~~~~~~~~~~~~ 673 (899)
T TIGR02917 655 ITSLKRALELKPDNTEAQI 673 (899)
T ss_pred HHHHHHHHhcCCCCHHHHH
Confidence 8888888888887766543
No 35
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=98.76 E-value=1.5e-07 Score=75.34 Aligned_cols=87 Identities=16% Similarity=0.096 Sum_probs=76.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchH---HHHhhC---------HHHHHHHHHHHHHhhCCC---ChHHHHHHHHHHHhc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSAT---NIYNLF---------PDDAVVQYTQACRIIGTS---DLALHQKLASLLLRM 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~---kA~e~l---------y~dAa~~ye~Aw~l~~~~---~~~vgykLA~~ylk~ 85 (122)
+....|+|++|.+.|+ .++..|++. .+++.+ |++|+..|+.+.+. .|+ .+.+.+++|..|...
T Consensus 152 l~~~~~~y~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~-yP~s~~~~dAl~klg~~~~~~ 230 (263)
T PRK10803 152 LVQDKSRQDDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKN-YPKSPKAADAMFKVGVIMQDK 230 (263)
T ss_pred HHHhcCCHHHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-CCCCcchhHHHHHHHHHHHHc
Confidence 5567799999999999 999999973 566533 99999999999976 455 688899999999999
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
|++.+|+.+|++|++.+|+.+-.+
T Consensus 231 g~~~~A~~~~~~vi~~yP~s~~a~ 254 (263)
T PRK10803 231 GDTAKAKAVYQQVIKKYPGTDGAK 254 (263)
T ss_pred CCHHHHHHHHHHHHHHCcCCHHHH
Confidence 999999999999999999887553
No 36
>PRK11447 cellulose synthase subunit BcsC; Provisional
Probab=98.75 E-value=8.2e-08 Score=89.03 Aligned_cols=87 Identities=13% Similarity=0.045 Sum_probs=76.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhh-------------------------------------------------
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNL------------------------------------------------- 51 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~------------------------------------------------- 51 (122)
+|.+.|++++|++.++ +++.+|++..++..
T Consensus 394 ~~~~~g~~~eA~~~y~~aL~~~p~~~~a~~~L~~l~~~~~~~~A~~~l~~l~~~~~~~~~~~~~~l~~~~~~~~a~~~~~ 473 (1157)
T PRK11447 394 VAMARKDYAAAERYYQQALRMDPGNTNAVRGLANLYRQQSPEKALAFIASLSASQRRSIDDIERSLQNDRLAQQAEALEN 473 (1157)
T ss_pred HHHHCCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCHHHHHHHHHhCCHHHHHHHHHHHHHhhhhHHHHHHHHHHH
Confidence 8889999999999999 99999997766420
Q ss_pred C--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 52 F--PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 52 l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
. +++|+..|++|.+. +|+++.+.+.||..|.+.|++.+|+..++++++.+|++|..+
T Consensus 474 ~g~~~eA~~~~~~Al~~-~P~~~~~~~~LA~~~~~~G~~~~A~~~l~~al~~~P~~~~~~ 532 (1157)
T PRK11447 474 QGKWAQAAELQRQRLAL-DPGSVWLTYRLAQDLRQAGQRSQADALMRRLAQQKPNDPEQV 532 (1157)
T ss_pred CCCHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHH
Confidence 1 78899999999999 799999999999999999999999999999999999998764
No 37
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=98.72 E-value=6.2e-08 Score=77.53 Aligned_cols=84 Identities=18% Similarity=0.133 Sum_probs=70.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
=|+++|.+..|.+-|+ +|+++||...+|.-+ .+-|-..|++|+.+ +|++..|..+-|..+...|+|++|
T Consensus 44 ~YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl-~p~~GdVLNNYG~FLC~qg~~~eA 122 (250)
T COG3063 44 GYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSL-APNNGDVLNNYGAFLCAQGRPEEA 122 (250)
T ss_pred HHHHCCCHHHHHHHHHHHHHhCcccHHHHHHHHHHHHHcCChhhHHHHHHHHHhc-CCCccchhhhhhHHHHhCCChHHH
Confidence 7899999999999999 999999999999833 88899999999999 799999977777777777777777
Q ss_pred HHHHHHHHHhCCCCHH
Q psy16144 92 IDVSQTIMKAHPSYPK 107 (122)
Q Consensus 92 I~vc~kVL~~~P~ypk 107 (122)
-.-+++++ .+|+||.
T Consensus 123 ~q~F~~Al-~~P~Y~~ 137 (250)
T COG3063 123 MQQFERAL-ADPAYGE 137 (250)
T ss_pred HHHHHHHH-hCCCCCC
Confidence 77777666 6666653
No 38
>PRK10049 pgaA outer membrane protein PgaA; Provisional
Probab=98.72 E-value=2.4e-07 Score=82.77 Aligned_cols=92 Identities=12% Similarity=0.081 Sum_probs=84.0
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhc
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRM 85 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~ 85 (122)
+..+ ++.+.|++++|.++++ +++.+|.++.++..+ +++|...++++.+. .|+++. .+.||..|...
T Consensus 52 ~~~lA~~~~~~g~~~~A~~~~~~al~~~P~~~~a~~~la~~l~~~g~~~eA~~~l~~~l~~-~P~~~~-~~~la~~l~~~ 129 (765)
T PRK10049 52 YAAVAVAYRNLKQWQNSLTLWQKALSLEPQNDDYQRGLILTLADAGQYDEALVKAKQLVSG-APDKAN-LLALAYVYKRA 129 (765)
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHh-CCCCHH-HHHHHHHHHHC
Confidence 4445 8899999999999999 999999998887633 99999999999999 799999 99999999999
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|++.+|+..++++++.+|+++.+...
T Consensus 130 g~~~~Al~~l~~al~~~P~~~~~~~~ 155 (765)
T PRK10049 130 GRHWDELRAMTQALPRAPQTQQYPTE 155 (765)
T ss_pred CCHHHHHHHHHHHHHhCCCCHHHHHH
Confidence 99999999999999999999988653
No 39
>KOG1125|consensus
Probab=98.71 E-value=2.4e-08 Score=87.51 Aligned_cols=82 Identities=15% Similarity=0.235 Sum_probs=79.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
||--.|+||.|++.|+ +|..+|++..-|..+ ..||+..|.+|+++ .|.+..+.|+||-++|-.|-|.||
T Consensus 439 Ly~ls~efdraiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqL-qP~yVR~RyNlgIS~mNlG~ykEA 517 (579)
T KOG1125|consen 439 LYNLSGEFDRAVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQL-QPGYVRVRYNLGISCMNLGAYKEA 517 (579)
T ss_pred HHhcchHHHHHHHHHHHHHhcCCchHHHHHHhhHHhcCCcccHHHHHHHHHHHhc-CCCeeeeehhhhhhhhhhhhHHHH
Confidence 7777899999999999 999999999999988 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCC
Q psy16144 92 IDVSQTIMKAHPS 104 (122)
Q Consensus 92 I~vc~kVL~~~P~ 104 (122)
+..+-.||..++.
T Consensus 518 ~~hlL~AL~mq~k 530 (579)
T KOG1125|consen 518 VKHLLEALSMQRK 530 (579)
T ss_pred HHHHHHHHHhhhc
Confidence 9999999999885
No 40
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.69 E-value=1.3e-07 Score=59.76 Aligned_cols=57 Identities=21% Similarity=0.343 Sum_probs=52.7
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
|++|...++.+.++ +|+++...+.+|..|.+.|++.+|+..++++++..|+.|.+..
T Consensus 11 ~~~A~~~~~~~l~~-~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~~p~~~~~~~ 67 (73)
T PF13371_consen 11 YEEALEVLERALEL-DPDDPELWLQRARCLFQLGRYEEALEDLERALELSPDDPDARA 67 (73)
T ss_pred HHHHHHHHHHHHHh-CcccchhhHHHHHHHHHhccHHHHHHHHHHHHHHCCCcHHHHH
Confidence 78888888889998 7999999999999999999999999999999999999988753
No 41
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=98.68 E-value=2.7e-07 Score=73.85 Aligned_cols=86 Identities=16% Similarity=0.111 Sum_probs=44.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchH-HHHh-------hC--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSAT-NIYN-------LF--PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~-kA~e-------~l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~ 90 (122)
+|.+.|++++|.++++ +++.+|++. .++. .. +++|...++++.+. +|+++.+ ..+|..|.+.|++.+
T Consensus 223 ~~~~~g~~~~A~~~~~~~~~~~p~~~~~~~~~l~~~~~~~g~~~~A~~~l~~~~~~-~p~~~~~-~~la~~~~~~g~~~~ 300 (389)
T PRK11788 223 LALAQGDYAAAIEALERVEEQDPEYLSEVLPKLMECYQALGDEAEGLEFLRRALEE-YPGADLL-LALAQLLEEQEGPEA 300 (389)
T ss_pred HHHHCCCHHHHHHHHHHHHHHChhhHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh-CCCchHH-HHHHHHHHHhCCHHH
Confidence 4555555555555555 555555431 1111 11 55555555555555 3544333 555666666666666
Q ss_pred HHHHHHHHHHhCCCCHHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypkir 109 (122)
|+.+++++++.+|+.+.++
T Consensus 301 A~~~l~~~l~~~P~~~~~~ 319 (389)
T PRK11788 301 AQALLREQLRRHPSLRGFH 319 (389)
T ss_pred HHHHHHHHHHhCcCHHHHH
Confidence 6666666666666555443
No 42
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=98.67 E-value=4.4e-07 Score=68.61 Aligned_cols=84 Identities=17% Similarity=0.178 Sum_probs=73.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHH---HHhhC-----------------HHHHHHHHHHHHHhhCCCChHHH-----
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATN---IYNLF-----------------PDDAVVQYTQACRIIGTSDLALH----- 75 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~k---A~e~l-----------------y~dAa~~ye~Aw~l~~~~~~~vg----- 75 (122)
+|.+.|++++|.+.++ +++.+|+.+. ++..+ +++|...|+++++. +|+++.+.
T Consensus 79 ~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~a~~~~g~~~~~~~~~~~~~~~~~~~A~~~~~~~~~~-~p~~~~~~~a~~~ 157 (235)
T TIGR03302 79 AYYKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYNQIDRVDRDQTAAREAFEAFQELIRR-YPNSEYAPDAKKR 157 (235)
T ss_pred HHHhcCCHHHHHHHHHHHHHHCcCCCchHHHHHHHHHHHHHhcccccCCHHHHHHHHHHHHHHHHH-CCCChhHHHHHHH
Confidence 8899999999999999 9999998776 44321 67899999999999 79987553
Q ss_pred ------------HHHHHHHHhcccHHHHHHHHHHHHHhCCCCH
Q psy16144 76 ------------QKLASLLLRMKRYAEAIDVSQTIMKAHPSYP 106 (122)
Q Consensus 76 ------------ykLA~~ylk~kr~~~AI~vc~kVL~~~P~yp 106 (122)
+.+|..|++.|++.+|++.++++++.+|+.|
T Consensus 158 ~~~~~~~~~~~~~~~a~~~~~~g~~~~A~~~~~~al~~~p~~~ 200 (235)
T TIGR03302 158 MDYLRNRLAGKELYVARFYLKRGAYVAAINRFETVVENYPDTP 200 (235)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCChHHHHHHHHHHHHHCCCCc
Confidence 5789999999999999999999999988765
No 43
>PRK15179 Vi polysaccharide biosynthesis protein TviE; Provisional
Probab=98.66 E-value=3e-07 Score=82.43 Aligned_cols=89 Identities=17% Similarity=-0.105 Sum_probs=81.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+..+.+++|+|.+.++ ++..+|+++.++..+ |++|...|+++.. .+|+++.+...+|..+++.|+.++|
T Consensus 129 ~L~~~~~~eeA~~~~~~~l~~~p~~~~~~~~~a~~l~~~g~~~~A~~~y~~~~~-~~p~~~~~~~~~a~~l~~~G~~~~A 207 (694)
T PRK15179 129 GVKRQQGIEAGRAEIELYFSGGSSSAREILLEAKSWDEIGQSEQADACFERLSR-QHPEFENGYVGWAQSLTRRGALWRA 207 (694)
T ss_pred HHHHhccHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHhcchHHHHHHHHHHHh-cCCCcHHHHHHHHHHHHHcCCHHHH
Confidence 8889999999999999 999999999988744 9999999999999 4899999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
.+.|+++++..-+-.|--.+
T Consensus 208 ~~~~~~a~~~~~~~~~~~~~ 227 (694)
T PRK15179 208 RDVLQAGLDAIGDGARKLTR 227 (694)
T ss_pred HHHHHHHHHhhCcchHHHHH
Confidence 99999999998876654333
No 44
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=98.65 E-value=7e-08 Score=63.30 Aligned_cols=72 Identities=17% Similarity=0.183 Sum_probs=58.1
Q ss_pred CCCChHHHHHHHH-HHhhCcc--hHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISS--ATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAI 92 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps--~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI 92 (122)
++|.|+.|+.+++ +++.+|+ ....+..+ |++|...+++ .+. +++++.+.+-+|.+|.+.|+|++||
T Consensus 1 ~~~~y~~Ai~~~~k~~~~~~~~~~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~-~~~~~~~~~l~a~~~~~l~~y~eAi 78 (84)
T PF12895_consen 1 DQGNYENAIKYYEKLLELDPTNPNSAYLYNLAQCYFQQGKYEEAIELLQK-LKL-DPSNPDIHYLLARCLLKLGKYEEAI 78 (84)
T ss_dssp HTT-HHHHHHHHHHHHHHHCGTHHHHHHHHHHHHHHHTTHHHHHHHHHHC-HTH-HHCHHHHHHHHHHHHHHTT-HHHHH
T ss_pred CCccHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHCCCHHHHHHHHHH-hCC-CCCCHHHHHHHHHHHHHhCCHHHHH
Confidence 3689999999999 9999995 34444422 9999999999 666 5888999999999999999999999
Q ss_pred HHHHHH
Q psy16144 93 DVSQTI 98 (122)
Q Consensus 93 ~vc~kV 98 (122)
.+++++
T Consensus 79 ~~l~~~ 84 (84)
T PF12895_consen 79 KALEKA 84 (84)
T ss_dssp HHHHHH
T ss_pred HHHhcC
Confidence 988764
No 45
>KOG4626|consensus
Probab=98.63 E-value=1.1e-07 Score=85.20 Aligned_cols=89 Identities=17% Similarity=0.207 Sum_probs=68.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|-.+|++++|..+++ +++.+|.|+.|+.-+ ++||+.+|+.|+.+ +|++..+.-++|..|-.+|+...|
T Consensus 363 i~~E~~~~e~A~~ly~~al~v~p~~aaa~nNLa~i~kqqgnl~~Ai~~YkealrI-~P~fAda~~NmGnt~ke~g~v~~A 441 (966)
T KOG4626|consen 363 IYREQGKIEEATRLYLKALEVFPEFAAAHNNLASIYKQQGNLDDAIMCYKEALRI-KPTFADALSNMGNTYKEMGDVSAA 441 (966)
T ss_pred HHHHhccchHHHHHHHHHHhhChhhhhhhhhHHHHHHhcccHHHHHHHHHHHHhc-CchHHHHHHhcchHHHHhhhHHHH
Confidence 7777788888888877 888888877777733 77788888888887 588888888888888888888888
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
|.+|.++...+|.+......
T Consensus 442 ~q~y~rAI~~nPt~AeAhsN 461 (966)
T KOG4626|consen 442 IQCYTRAIQINPTFAEAHSN 461 (966)
T ss_pred HHHHHHHHhcCcHHHHHHhh
Confidence 88888887788877776543
No 46
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=98.63 E-value=5.1e-07 Score=66.42 Aligned_cols=83 Identities=11% Similarity=0.112 Sum_probs=73.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchH---HHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc-
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSAT---NIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR- 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~---kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr- 87 (122)
+|.+.|++++|.++++ +++.+|+.. .++..+ |++|...|++|.+. .|+++...+.+|..|...|+
T Consensus 44 ~~~~~g~~~~A~~~~~~al~~~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~-~p~~~~~~~~lg~~~~~~g~~ 122 (172)
T PRK02603 44 SAQADGEYAEALENYEEALKLEEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALEL-NPKQPSALNNIAVIYHKRGEK 122 (172)
T ss_pred HHHHcCCHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh-CcccHHHHHHHHHHHHHcCCh
Confidence 7889999999999999 999887643 344322 99999999999999 79999999999999999988
Q ss_pred -------------HHHHHHHHHHHHHhCCCC
Q psy16144 88 -------------YAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 88 -------------~~~AI~vc~kVL~~~P~y 105 (122)
+.+|++++++++..+|++
T Consensus 123 ~~a~~~~~~A~~~~~~A~~~~~~a~~~~p~~ 153 (172)
T PRK02603 123 AEEAGDQDEAEALFDKAAEYWKQAIRLAPNN 153 (172)
T ss_pred HhHhhCHHHHHHHHHHHHHHHHHHHhhCchh
Confidence 788999999999999976
No 47
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=98.61 E-value=3.3e-07 Score=79.16 Aligned_cols=96 Identities=17% Similarity=0.177 Sum_probs=87.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
-+.+.|++|.|+..|+ .++..|+++-.+++. +++|.+.+++|..+ .|+.+...+.+|..|++.|++.+|
T Consensus 315 ~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l-~P~~~~l~~~~a~all~~g~~~ea 393 (484)
T COG4783 315 QTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALAL-DPNSPLLQLNLAQALLKGGKPQEA 393 (484)
T ss_pred HHHHhcccchHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCChHHHHHHHHHHHhc-CCCccHHHHHHHHHHHhcCChHHH
Confidence 5667899999999999 999999999999955 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHHhc
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFEKAMVH 119 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ka~~~ 119 (122)
|...+..+..+|++|..+. -|-.|+..
T Consensus 394 i~~L~~~~~~~p~dp~~w~-~LAqay~~ 420 (484)
T COG4783 394 IRILNRYLFNDPEDPNGWD-LLAQAYAE 420 (484)
T ss_pred HHHHHHHhhcCCCCchHHH-HHHHHHHH
Confidence 9999999999999999984 66655543
No 48
>KOG1155|consensus
Probab=98.61 E-value=2.2e-07 Score=80.50 Aligned_cols=94 Identities=16% Similarity=0.130 Sum_probs=80.2
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
-.+..+.|+.+|+ +|++||+...||-+| -..|+.+|+.|.+. +|.|-.+.|.||..|.-++-+.=|+=-
T Consensus 342 lr~eHEKAv~YFkRALkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi-~p~DyRAWYGLGQaYeim~Mh~YaLyY 420 (559)
T KOG1155|consen 342 LRSEHEKAVMYFKRALKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDI-NPRDYRAWYGLGQAYEIMKMHFYALYY 420 (559)
T ss_pred HHHhHHHHHHHHHHHHhcCcchhHHHHHhhHHHHHhcccHHHHHHHHHHHhc-CchhHHHHhhhhHHHHHhcchHHHHHH
Confidence 3467899999999 999999999988766 67789999999999 799999999999999999999999999
Q ss_pred HHHHHHhCCCCHHHHHHHHHHHHhcc
Q psy16144 95 SQTIMKAHPSYPKVHKQIFEKAMVHL 120 (122)
Q Consensus 95 c~kVL~~~P~ypkir~~il~ka~~~l 120 (122)
+++++..-|+.++++. .|-.|+..|
T Consensus 421 fqkA~~~kPnDsRlw~-aLG~CY~kl 445 (559)
T KOG1155|consen 421 FQKALELKPNDSRLWV-ALGECYEKL 445 (559)
T ss_pred HHHHHhcCCCchHHHH-HHHHHHHHh
Confidence 9999999999999885 566665543
No 49
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=98.60 E-value=1.2e-06 Score=64.05 Aligned_cols=88 Identities=9% Similarity=-0.060 Sum_probs=72.6
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcch---HHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHH
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSA---TNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASL 81 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~---~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ 81 (122)
.|+.+ ++...|++++|...++ +++.+|+. +.+|..+ +++|...|++|..+ .|.++.....+|..
T Consensus 37 ~~~~~g~~~~~~g~~~~A~~~~~~al~l~~~~~~~~~~~~~lg~~~~~~g~~~eA~~~~~~Al~~-~~~~~~~~~~la~i 115 (168)
T CHL00033 37 TYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALER-NPFLPQALNNMAVI 115 (168)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHhccccchhhHHHHHHHHHHHHHcCCHHHHHHHHHHHHHh-CcCcHHHHHHHHHH
Confidence 34444 7888999999999999 99887763 3355422 99999999999999 79999999999999
Q ss_pred HH-------hcccHH-------HHHHHHHHHHHhCCCC
Q psy16144 82 LL-------RMKRYA-------EAIDVSQTIMKAHPSY 105 (122)
Q Consensus 82 yl-------k~kr~~-------~AI~vc~kVL~~~P~y 105 (122)
|. +.|++. +|+.++++++..+|++
T Consensus 116 ~~~~~~~~~~~g~~~~A~~~~~~a~~~~~~a~~~~p~~ 153 (168)
T CHL00033 116 CHYRGEQAIEQGDSEIAEAWFDQAAEYWKQAIALAPGN 153 (168)
T ss_pred HHHhhHHHHHcccHHHHHHHHHHHHHHHHHHHHhCccc
Confidence 99 777755 7888888999999954
No 50
>PRK11788 tetratricopeptide repeat protein; Provisional
Probab=98.60 E-value=8.2e-07 Score=71.08 Aligned_cols=85 Identities=12% Similarity=0.008 Sum_probs=58.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCC-hHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSD-LALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~-~~vgykLA~~ylk~kr~~~ 90 (122)
++.+.|++++|.++++ +++.+|++..++..+ +++|...|+++++. +|++ +.+...|+..|.+.|++++
T Consensus 189 ~~~~~~~~~~A~~~~~~al~~~p~~~~~~~~la~~~~~~g~~~~A~~~~~~~~~~-~p~~~~~~~~~l~~~~~~~g~~~~ 267 (389)
T PRK11788 189 QALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQ-DPEYLSEVLPKLMECYQALGDEAE 267 (389)
T ss_pred HHHhCCCHHHHHHHHHHHHhHCcCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH-ChhhHHHHHHHHHHHHHHcCCHHH
Confidence 5666777777777777 777777766665533 77777777777776 3544 3455667777777777777
Q ss_pred HHHHHHHHHHhCCCCHH
Q psy16144 91 AIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypk 107 (122)
|+..++++++.+|+.+.
T Consensus 268 A~~~l~~~~~~~p~~~~ 284 (389)
T PRK11788 268 GLEFLRRALEEYPGADL 284 (389)
T ss_pred HHHHHHHHHHhCCCchH
Confidence 77777777777776544
No 51
>PRK11189 lipoprotein NlpI; Provisional
Probab=98.57 E-value=7.1e-07 Score=71.40 Aligned_cols=85 Identities=13% Similarity=0.080 Sum_probs=72.6
Q ss_pred CCChHHHHHHHH-HHhhCc---c-hHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 26 PWRWEPLSAGLR-KKQYIS---S-ATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 26 ~gk~d~A~elL~-~l~~np---s-~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++..+.+...+. ++..+| + .+.+|..+ +++|...|++|.++ +|+++.+.+.||..|...|++++|
T Consensus 39 ~~~~e~~i~~~~~~l~~~~~~~~~~a~~~~~~g~~~~~~g~~~~A~~~~~~Al~l-~P~~~~a~~~lg~~~~~~g~~~~A 117 (296)
T PRK11189 39 TLQQEVILARLNQILASRDLTDEERAQLHYERGVLYDSLGLRALARNDFSQALAL-RPDMADAYNYLGIYLTQAGNFDAA 117 (296)
T ss_pred chHHHHHHHHHHHHHccccCCcHhhHHHHHHHHHHHHHCCCHHHHHHHHHHHHHc-CCCCHHHHHHHHHHHHHCCCHHHH
Confidence 467788999998 886444 3 24455522 89999999999999 899999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
++.++++++.+|+++.++..
T Consensus 118 ~~~~~~Al~l~P~~~~a~~~ 137 (296)
T PRK11189 118 YEAFDSVLELDPTYNYAYLN 137 (296)
T ss_pred HHHHHHHHHhCCCCHHHHHH
Confidence 99999999999999887543
No 52
>PF13429 TPR_15: Tetratricopeptide repeat; PDB: 2VQ2_A 2PL2_B.
Probab=98.53 E-value=1.3e-07 Score=73.78 Aligned_cols=78 Identities=13% Similarity=0.193 Sum_probs=53.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+++..|+++++.++++ ..+.+|+++..|..+ +++|...|+++++. +|+||.+...+|.++...|++++|
T Consensus 189 ~li~~~~~~~~~~~l~~~~~~~~~~~~~~~~la~~~~~lg~~~~Al~~~~~~~~~-~p~d~~~~~~~a~~l~~~g~~~~A 267 (280)
T PF13429_consen 189 LLIDMGDYDEAREALKRLLKAAPDDPDLWDALAAAYLQLGRYEEALEYLEKALKL-NPDDPLWLLAYADALEQAGRKDEA 267 (280)
T ss_dssp HHCTTCHHHHHHHHHHHHHHH-HTSCCHCHHHHHHHHHHT-HHHHHHHHHHHHHH-STT-HHHHHHHHHHHT--------
T ss_pred HHHHCCChHHHHHHHHHHHHHCcCHHHHHHHHHHHhccccccccccccccccccc-cccccccccccccccccccccccc
Confidence 8899999999999999 888887777666633 99999999999998 899999999999999999999999
Q ss_pred HHHHHHHHH
Q psy16144 92 IDVSQTIMK 100 (122)
Q Consensus 92 I~vc~kVL~ 100 (122)
.++.+++++
T Consensus 268 ~~~~~~~~~ 276 (280)
T PF13429_consen 268 LRLRRQALR 276 (280)
T ss_dssp ---------
T ss_pred ccccccccc
Confidence 999999985
No 53
>KOG1126|consensus
Probab=98.53 E-value=1.3e-07 Score=83.74 Aligned_cols=87 Identities=15% Similarity=0.139 Sum_probs=77.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
.|.-++++|.|++.|+ ++++||....||=++ |+.|.++|++|+.. +|++=.+.|-||-+|+|.++++.|
T Consensus 430 cfSLQkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~-~~rhYnAwYGlG~vy~Kqek~e~A 508 (638)
T KOG1126|consen 430 CFSLQKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFDKAMKSFRKALGV-DPRHYNAWYGLGTVYLKQEKLEFA 508 (638)
T ss_pred hhhhhhHHHHHHHHHHHhhccCCccchhhhhcCChhhhhHHHHhHHHHHHhhhcC-CchhhHHHHhhhhheeccchhhHH
Confidence 4555789999999999 999999988888655 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir 109 (122)
.--+++++.+||.+--|.
T Consensus 509 e~~fqkA~~INP~nsvi~ 526 (638)
T KOG1126|consen 509 EFHFQKAVEINPSNSVIL 526 (638)
T ss_pred HHHHHhhhcCCccchhHH
Confidence 999999999999876654
No 54
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=98.50 E-value=1.2e-06 Score=71.63 Aligned_cols=90 Identities=19% Similarity=0.188 Sum_probs=83.4
Q ss_pred HHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC------------HHHHHHHHHHHHHhhCCCChHHHHHHH
Q psy16144 15 IAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF------------PDDAVVQYTQACRIIGTSDLALHQKLA 79 (122)
Q Consensus 15 ~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l------------y~dAa~~ye~Aw~l~~~~~~~vgykLA 79 (122)
+..|.+| +|.+.|+++.|...++ +.++.|+++..+--+ -.+|-..+.+|+++ +|+|+++-+-||
T Consensus 156 ~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll~~al~~-D~~~iral~lLA 234 (287)
T COG4235 156 AEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQALAL-DPANIRALSLLA 234 (287)
T ss_pred chhHHHHHHHHHHhcchhHHHHHHHHHHHhCCCCHHHHHHHHHHHHHhcCCcccHHHHHHHHHHHhc-CCccHHHHHHHH
Confidence 4569999 9999999999999999 999999999988732 77899999999999 899999999999
Q ss_pred HHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 80 SLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 80 ~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
+++...|+|.+|+...+..|+..|..
T Consensus 235 ~~afe~g~~~~A~~~Wq~lL~~lp~~ 260 (287)
T COG4235 235 FAAFEQGDYAEAAAAWQMLLDLLPAD 260 (287)
T ss_pred HHHHHcccHHHHHHHHHHHHhcCCCC
Confidence 99999999999999999999999944
No 55
>PF13432 TPR_16: Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=98.48 E-value=2.9e-07 Score=57.26 Aligned_cols=50 Identities=12% Similarity=0.169 Sum_probs=46.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCCh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDL 72 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~ 72 (122)
.+.++|+|++|.+.++ +++.+|+++.+|..+ |++|+..|+++.+. +|+||
T Consensus 6 ~~~~~g~~~~A~~~~~~~l~~~P~~~~a~~~lg~~~~~~g~~~~A~~~~~~a~~~-~P~~p 65 (65)
T PF13432_consen 6 ALYQQGDYDEAIAAFEQALKQDPDNPEAWYLLGRILYQQGRYDEALAYYERALEL-DPDNP 65 (65)
T ss_dssp HHHHCTHHHHHHHHHHHHHCCSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-STT-H
T ss_pred HHHHcCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-CcCCC
Confidence 7889999999999999 999999999999955 99999999999999 69886
No 56
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=98.45 E-value=6.4e-07 Score=52.90 Aligned_cols=40 Identities=20% Similarity=0.203 Sum_probs=37.4
Q ss_pred hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 72 LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 72 ~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|.+.+.||..|...|++++|+++++++++.+|++|.++..
T Consensus 1 p~~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~ 40 (44)
T PF13428_consen 1 PAAWLALARAYRRLGQPDEAERLLRRALALDPDDPEAWRA 40 (44)
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHH
Confidence 5788999999999999999999999999999999999853
No 57
>KOG1173|consensus
Probab=98.43 E-value=1.8e-06 Score=76.08 Aligned_cols=93 Identities=13% Similarity=0.162 Sum_probs=73.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc----hHHHHhhC-------------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS----ATNIYNLF-------------PDDAVVQYTQACRIIGTSDLALHQKLASLLL 83 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps----~~kA~e~l-------------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~yl 83 (122)
++-+.+.|.+|..+|+ ++..-|+ .+ -|+-+ |.+|+.+|++|+.+ .|.|+++.-.+||.|.
T Consensus 423 vay~~~~y~~A~~~f~~~l~~ik~~~~e~~-~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l-~~k~~~~~asig~iy~ 500 (611)
T KOG1173|consen 423 VAYTYEEYPEALKYFQKALEVIKSVLNEKI-FWEPTLNNLGHAYRKLNKYEEAIDYYQKALLL-SPKDASTHASIGYIYH 500 (611)
T ss_pred eeehHhhhHHHHHHHHHHHHHhhhcccccc-chhHHHHhHHHHHHHHhhHHHHHHHHHHHHHc-CCCchhHHHHHHHHHH
Confidence 4445567777777777 6633333 22 34411 99999999999999 5999999999999999
Q ss_pred hcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy16144 84 RMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 84 k~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~ 117 (122)
..|++..||+-+||+|.+.|++--+. ++|-+|.
T Consensus 501 llgnld~Aid~fhKaL~l~p~n~~~~-~lL~~ai 533 (611)
T KOG1173|consen 501 LLGNLDKAIDHFHKALALKPDNIFIS-ELLKLAI 533 (611)
T ss_pred HhcChHHHHHHHHHHHhcCCccHHHH-HHHHHHH
Confidence 99999999999999999999987665 4666654
No 58
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=98.40 E-value=4.1e-06 Score=67.59 Aligned_cols=89 Identities=16% Similarity=0.158 Sum_probs=81.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
..++.|.|..|+..|+ +...+|.++++|..+ +++|-.-|.+|.++ .|++|.+--+||..|+..|++.+|
T Consensus 109 ~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~L-~~~~p~~~nNlgms~~L~gd~~~A 187 (257)
T COG5010 109 NQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALEL-APNEPSIANNLGMSLLLRGDLEDA 187 (257)
T ss_pred HHHHhcchHHHHHHHHHHhccCCCChhhhhHHHHHHHHccChhHHHHHHHHHHHh-ccCCchhhhhHHHHHHHcCCHHHH
Confidence 7889999999999999 999999999999855 99999999999999 799999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~ 111 (122)
.......-..-|..++|+.+
T Consensus 188 ~~lll~a~l~~~ad~~v~~N 207 (257)
T COG5010 188 ETLLLPAYLSPAADSRVRQN 207 (257)
T ss_pred HHHHHHHHhCCCCchHHHHH
Confidence 99988888666678888764
No 59
>PRK14574 hmsH outer membrane protein; Provisional
Probab=98.36 E-value=2e-06 Score=78.46 Aligned_cols=86 Identities=13% Similarity=0.121 Sum_probs=70.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHH-Hh----hC----HHHHHHHHHHHHHhhCCCChHHHHHH--HHHHHhcccHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNI-YN----LF----PDDAVVQYTQACRIIGTSDLALHQKL--ASLLLRMKRYA 89 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA-~e----~l----y~dAa~~ye~Aw~l~~~~~~~vgykL--A~~ylk~kr~~ 89 (122)
+..++|+++.|.+.|+ +++.+|..+.+ +. +. +++|..+++++. .|.+....-.+ |..|...|+|.
T Consensus 43 i~~r~Gd~~~Al~~L~qaL~~~P~~~~av~dll~l~~~~G~~~~A~~~~eka~---~p~n~~~~~llalA~ly~~~gdyd 119 (822)
T PRK14574 43 IRARAGDTAPVLDYLQEESKAGPLQSGQVDDWLQIAGWAGRDQEVIDVYERYQ---SSMNISSRGLASAARAYRNEKRWD 119 (822)
T ss_pred HHHhCCCHHHHHHHHHHHHhhCccchhhHHHHHHHHHHcCCcHHHHHHHHHhc---cCCCCCHHHHHHHHHHHHHcCCHH
Confidence 8889999999999999 99999998533 12 11 899999999998 34444444444 77999999999
Q ss_pred HHHHHHHHHHHhCCCCHHHHH
Q psy16144 90 EAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 90 ~AI~vc~kVL~~~P~ypkir~ 110 (122)
+|+++|+++++.+|++|.+..
T Consensus 120 ~Aiely~kaL~~dP~n~~~l~ 140 (822)
T PRK14574 120 QALALWQSSLKKDPTNPDLIS 140 (822)
T ss_pred HHHHHHHHHHhhCCCCHHHHH
Confidence 999999999999999998875
No 60
>KOG1125|consensus
Probab=98.34 E-value=2.6e-06 Score=74.95 Aligned_cols=58 Identities=26% Similarity=0.351 Sum_probs=56.1
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|+.|+.||+.|+.. +|+|-..+-|||..+--..|.+|||.-|+++|.+-|.|.|+|-.
T Consensus 446 fdraiDcf~~AL~v-~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyN 503 (579)
T KOG1125|consen 446 FDRAVDCFEAALQV-KPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVRVRYN 503 (579)
T ss_pred HHHHHHHHHHHHhc-CCchHHHHHHhhHHhcCCcccHHHHHHHHHHHhcCCCeeeeehh
Confidence 99999999999999 89999999999999999999999999999999999999999864
No 61
>PLN02789 farnesyltranstransferase
Probab=98.31 E-value=1.3e-05 Score=65.81 Aligned_cols=88 Identities=3% Similarity=-0.035 Sum_probs=67.0
Q ss_pred HhcCCC-ChHHHHHHHH-HHhhCcchHHHHhhC-----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccH
Q psy16144 22 LYLLPW-RWEPLSAGLR-KKQYISSATNIYNLF-----------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRY 88 (122)
Q Consensus 22 Lyi~~g-k~d~A~elL~-~l~~nps~~kA~e~l-----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~ 88 (122)
++...| ++++|.+.+. +++.||.+..+|... ++++...++++++. +|+|-.+....++.+...|++
T Consensus 80 iL~~L~~~l~eeL~~~~~~i~~npknyqaW~~R~~~l~~l~~~~~~~el~~~~kal~~-dpkNy~AW~~R~w~l~~l~~~ 158 (320)
T PLN02789 80 CLEALDADLEEELDFAEDVAEDNPKNYQIWHHRRWLAEKLGPDAANKELEFTRKILSL-DAKNYHAWSHRQWVLRTLGGW 158 (320)
T ss_pred HHHHcchhHHHHHHHHHHHHHHCCcchHHhHHHHHHHHHcCchhhHHHHHHHHHHHHh-CcccHHHHHHHHHHHHHhhhH
Confidence 334445 4677888887 888888877777622 35677778888888 788888888888888888888
Q ss_pred HHHHHHHHHHHHhCCCCHHHHH
Q psy16144 89 AEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 89 ~~AI~vc~kVL~~~P~ypkir~ 110 (122)
.+|++.|.++++.+|.+..++.
T Consensus 159 ~eeL~~~~~~I~~d~~N~sAW~ 180 (320)
T PLN02789 159 EDELEYCHQLLEEDVRNNSAWN 180 (320)
T ss_pred HHHHHHHHHHHHHCCCchhHHH
Confidence 8888888888888888877653
No 62
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=98.30 E-value=5.2e-06 Score=60.59 Aligned_cols=81 Identities=11% Similarity=0.149 Sum_probs=64.5
Q ss_pred hHHHHHHHH-HHhhCcchHHHHhhC-----------HHHHHHHHHHHHHhhCCC---ChHHHHHHHHHHHhcccHHHHHH
Q psy16144 29 WEPLSAGLR-KKQYISSATNIYNLF-----------PDDAVVQYTQACRIIGTS---DLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 29 ~d~A~elL~-~l~~nps~~kA~e~l-----------y~dAa~~ye~Aw~l~~~~---~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
|..+.+.|. .++.+++...+..++ |++|...|++|..+. ++ .+.+.++||..|.+.|++.+|+.
T Consensus 15 ~~~~~~~l~~~~~~~~~~~~a~~~~~~g~~~~~~g~~~~A~~~~~~al~l~-~~~~~~~~~~~~lg~~~~~~g~~~eA~~ 93 (168)
T CHL00033 15 FTIVADILLRILPTTSGEKEAFTYYRDGMSAQSEGEYAEALQNYYEAMRLE-IDPYDRSYILYNIGLIHTSNGEHTKALE 93 (168)
T ss_pred cccchhhhhHhccCCchhHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcc-ccchhhHHHHHHHHHHHHHcCCHHHHHH
Confidence 555666666 656666655544322 999999999999983 54 34589999999999999999999
Q ss_pred HHHHHHHhCCCCHHHHH
Q psy16144 94 VSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 94 vc~kVL~~~P~ypkir~ 110 (122)
.|++++..+|+++....
T Consensus 94 ~~~~Al~~~~~~~~~~~ 110 (168)
T CHL00033 94 YYFQALERNPFLPQALN 110 (168)
T ss_pred HHHHHHHhCcCcHHHHH
Confidence 99999999999987654
No 63
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=98.29 E-value=1.1e-05 Score=64.01 Aligned_cols=51 Identities=18% Similarity=0.132 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCC
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ 104 (122)
+++|...+++++++ +|+++.+...||..|...|++++|++.++++++..|.
T Consensus 130 ~~~A~~~~~~al~~-~p~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~~~~ 180 (355)
T cd05804 130 YDRAEEAARRALEL-NPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDC 180 (355)
T ss_pred HHHHHHHHHHHHhh-CCCCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhccCC
Confidence 55555555555555 4555555555555555555555555555555555553
No 64
>PRK15363 pathogenicity island 2 chaperone protein SscA; Provisional
Probab=98.28 E-value=3.6e-06 Score=63.53 Aligned_cols=56 Identities=13% Similarity=0.045 Sum_probs=54.6
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
+++|...|+-+..+ +|.|+...|.||-++...|+|.+||+.|.+++..+||+|+.-
T Consensus 51 l~~A~~~f~~L~~~-Dp~~~~y~~gLG~~~Q~~g~~~~AI~aY~~A~~L~~ddp~~~ 106 (157)
T PRK15363 51 FAGAARLFQLLTIY-DAWSFDYWFRLGECCQAQKHWGEAIYAYGRAAQIKIDAPQAP 106 (157)
T ss_pred HHHHHHHHHHHHHh-CcccHHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCCCchHH
Confidence 99999999999999 899999999999999999999999999999999999999874
No 65
>PLN02789 farnesyltranstransferase
Probab=98.26 E-value=1.6e-05 Score=65.31 Aligned_cols=88 Identities=5% Similarity=0.004 Sum_probs=80.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccH--
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF----------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRY-- 88 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~-- 88 (122)
++...++++.|.++.. +++.||++..+|.+. +++|...++++.+. +|++..+.+..+.++.+.|+.
T Consensus 46 ~l~~~e~serAL~lt~~aI~lnP~~ytaW~~R~~iL~~L~~~l~eeL~~~~~~i~~-npknyqaW~~R~~~l~~l~~~~~ 124 (320)
T PLN02789 46 VYASDERSPRALDLTADVIRLNPGNYTVWHFRRLCLEALDADLEEELDFAEDVAED-NPKNYQIWHHRRWLAEKLGPDAA 124 (320)
T ss_pred HHHcCCCCHHHHHHHHHHHHHCchhHHHHHHHHHHHHHcchhHHHHHHHHHHHHHH-CCcchHHhHHHHHHHHHcCchhh
Confidence 7778899999999999 999999999999844 78999999999999 899999999999999999985
Q ss_pred HHHHHHHHHHHHhCCCCHHHHH
Q psy16144 89 AEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 89 ~~AI~vc~kVL~~~P~ypkir~ 110 (122)
.++++.|.++++.+|.+-.++.
T Consensus 125 ~~el~~~~kal~~dpkNy~AW~ 146 (320)
T PLN02789 125 NKELEFTRKILSLDAKNYHAWS 146 (320)
T ss_pred HHHHHHHHHHHHhCcccHHHHH
Confidence 6889999999999999877754
No 66
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=98.26 E-value=3e-06 Score=46.31 Aligned_cols=34 Identities=18% Similarity=0.275 Sum_probs=31.0
Q ss_pred hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 72 LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 72 ~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
|.+.+.||.+|.+.|++.+|++.|+++++.+|++
T Consensus 1 a~~~~~lg~~~~~~~~~~~A~~~~~~al~l~p~~ 34 (34)
T PF07719_consen 1 AEAWYYLGQAYYQLGNYEEAIEYFEKALELDPNN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTS
T ss_pred CHHHHHHHHHHHHhCCHHHHHHHHHHHHHHCcCC
Confidence 4678999999999999999999999999999985
No 67
>cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure.
Probab=98.25 E-value=6.5e-06 Score=65.27 Aligned_cols=82 Identities=10% Similarity=0.014 Sum_probs=70.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHH----HHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLAL----HQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~v----gykLA~~ylk~kr 87 (122)
++..+|++++|.+.++ +++.+|+++.++..+ +++|...++++.+. .|.++.. .+.+|..|+..|+
T Consensus 123 ~~~~~G~~~~A~~~~~~al~~~p~~~~~~~~la~i~~~~g~~~eA~~~l~~~l~~-~~~~~~~~~~~~~~la~~~~~~G~ 201 (355)
T cd05804 123 GLEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDT-WDCSSMLRGHNWWHLALFYLERGD 201 (355)
T ss_pred HHHHcCCHHHHHHHHHHHHhhCCCCcHHHHHHHHHHHHcCCHHHHHHHHHhhhhc-cCCCcchhHHHHHHHHHHHHHCCC
Confidence 7889999999999999 999999988776543 99999999999998 4544433 4479999999999
Q ss_pred HHHHHHHHHHHHHhCCC
Q psy16144 88 YAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 88 ~~~AI~vc~kVL~~~P~ 104 (122)
+++|++++++++...|.
T Consensus 202 ~~~A~~~~~~~~~~~~~ 218 (355)
T cd05804 202 YEAALAIYDTHIAPSAE 218 (355)
T ss_pred HHHHHHHHHHHhccccC
Confidence 99999999999877773
No 68
>KOG1126|consensus
Probab=98.24 E-value=1.7e-06 Score=76.82 Aligned_cols=92 Identities=14% Similarity=0.146 Sum_probs=68.6
Q ss_pred HHHHHHH---HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHH
Q psy16144 15 IAEWHFL---LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASL 81 (122)
Q Consensus 15 ~~~~~~L---Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ 81 (122)
.+-.|.| =++..-.||.|...+| +|..+|....||.-+ ++.|.-+|++|.+. ||.|..+-.-+|..
T Consensus 454 faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~~rhYnAwYGlG~vy~Kqek~e~Ae~~fqkA~~I-NP~nsvi~~~~g~~ 532 (638)
T KOG1126|consen 454 FAYAYTLLGHESIATEEFDKAMKSFRKALGVDPRHYNAWYGLGTVYLKQEKLEFAEFHFQKAVEI-NPSNSVILCHIGRI 532 (638)
T ss_pred cchhhhhcCChhhhhHHHHhHHHHHHhhhcCCchhhHHHHhhhhheeccchhhHHHHHHHhhhcC-CccchhHHhhhhHH
Confidence 3445555 4556667788888888 888888888888733 77788888888887 78888888888888
Q ss_pred HHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 82 LLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
|.+.|+.++|++++++++-.+|.+|-
T Consensus 533 ~~~~k~~d~AL~~~~~A~~ld~kn~l 558 (638)
T KOG1126|consen 533 QHQLKRKDKALQLYEKAIHLDPKNPL 558 (638)
T ss_pred HHHhhhhhHHHHHHHHHHhcCCCCch
Confidence 88888888888888888877777653
No 69
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=98.24 E-value=7e-06 Score=75.73 Aligned_cols=93 Identities=13% Similarity=0.095 Sum_probs=77.7
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCCh------------
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDL------------ 72 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~------------ 72 (122)
.|..| +|...|++|+|.++++ .++++|+...+|.++ +.+|+.. ++.... +.+.
T Consensus 33 a~~~Li~~~~~~~~~deai~i~~~~l~~~P~~i~~yy~~G~l~~q~~~~~~~~lv--~~l~~~-~~~~~~~~ve~~~~~i 109 (906)
T PRK14720 33 ELDDLIDAYKSENLTDEAKDICEEHLKEHKKSISALYISGILSLSRRPLNDSNLL--NLIDSF-SQNLKWAIVEHICDKI 109 (906)
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHHHHHhCCcceehHHHHHHHHHhhcchhhhhhh--hhhhhc-ccccchhHHHHHHHHH
Confidence 44445 8889999999999999 999999999998855 6666555 555553 4444
Q ss_pred -------HHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHH
Q psy16144 73 -------ALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 73 -------~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~i 112 (122)
.+.+.||.+|-+.|++.+|+++++++|+.+|++|-+-..+
T Consensus 110 ~~~~~~k~Al~~LA~~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~ 156 (906)
T PRK14720 110 LLYGENKLALRTLAEAYAKLNENKKLKGVWERLVKADRDNPEIVKKL 156 (906)
T ss_pred HhhhhhhHHHHHHHHHHHHcCChHHHHHHHHHHHhcCcccHHHHHHH
Confidence 8999999999999999999999999999999999886554
No 70
>PF13414 TPR_11: TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=98.23 E-value=2.5e-06 Score=53.29 Aligned_cols=52 Identities=13% Similarity=0.286 Sum_probs=46.3
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC----------HHHHHHHHHHHHHhhCC
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF----------PDDAVVQYTQACRIIGT 69 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l----------y~dAa~~ye~Aw~l~~~ 69 (122)
.|..+ .+.+.|+|++|++.++ +++++|+++.+|..+ |++|...|++|+++ +|
T Consensus 5 ~~~~~g~~~~~~~~~~~A~~~~~~ai~~~p~~~~~~~~~g~~~~~~~~~~~~A~~~~~~al~l-~P 69 (69)
T PF13414_consen 5 AWYNLGQIYFQQGDYEEAIEYFEKAIELDPNNAEAYYNLGLAYMKLGKDYEEAIEDFEKALKL-DP 69 (69)
T ss_dssp HHHHHHHHHHHTTHHHHHHHHHHHHHHHSTTHHHHHHHHHHHHHHTTTHHHHHHHHHHHHHHH-ST
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCHHHHHHHHHHHHHhCccHHHHHHHHHHHHHc-Cc
Confidence 45556 8899999999999999 999999999999844 89999999999998 55
No 71
>PLN03088 SGT1, suppressor of G2 allele of SKP1; Provisional
Probab=98.22 E-value=6.9e-06 Score=67.79 Aligned_cols=58 Identities=22% Similarity=0.255 Sum_probs=55.7
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|++|+..|++|+++ +|+++.+.+.+|.+|++.|++.+|+..++++++.+|+++.+...
T Consensus 18 ~~~Ai~~~~~Al~~-~P~~~~a~~~~a~~~~~~g~~~eAl~~~~~Al~l~P~~~~a~~~ 75 (356)
T PLN03088 18 FALAVDLYTQAIDL-DPNNAELYADRAQANIKLGNFTEAVADANKAIELDPSLAKAYLR 75 (356)
T ss_pred HHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcCCHHHHHH
Confidence 99999999999999 79999999999999999999999999999999999999988653
No 72
>KOG1155|consensus
Probab=98.20 E-value=1.2e-05 Score=69.97 Aligned_cols=87 Identities=14% Similarity=0.138 Sum_probs=78.4
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLR 84 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk 84 (122)
-|-++ =|+..+....|.+-.| +++.||-+.+||.-+ +-=|.-+|++|.++ +|+|+.+..-||.+|+|
T Consensus 366 aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYeim~Mh~YaLyYfqkA~~~-kPnDsRlw~aLG~CY~k 444 (559)
T KOG1155|consen 366 AWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEIMKMHFYALYYFQKALEL-KPNDSRLWVALGECYEK 444 (559)
T ss_pred HHHHhhHHHHHhcccHHHHHHHHHHHhcCchhHHHHhhhhHHHHHhcchHHHHHHHHHHHhc-CCCchHHHHHHHHHHHH
Confidence 34444 7888899999999999 999999999999844 67789999999999 79999999999999999
Q ss_pred cccHHHHHHHHHHHHHhCCC
Q psy16144 85 MKRYAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL~~~P~ 104 (122)
.++.+|||-+|.+++...-.
T Consensus 445 l~~~~eAiKCykrai~~~dt 464 (559)
T KOG1155|consen 445 LNRLEEAIKCYKRAILLGDT 464 (559)
T ss_pred hccHHHHHHHHHHHHhcccc
Confidence 99999999999999977665
No 73
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=98.18 E-value=3.8e-06 Score=46.34 Aligned_cols=34 Identities=18% Similarity=0.310 Sum_probs=30.7
Q ss_pred hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 72 LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 72 ~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
+.+.+.+|..|...|++.+|++.|+++|+++|++
T Consensus 1 a~~~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 34 (34)
T PF00515_consen 1 AEAYYNLGNAYFQLGDYEEALEYYQRALELDPDN 34 (34)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHSTTH
T ss_pred CHHHHHHHHHHHHhCCchHHHHHHHHHHHHCcCC
Confidence 4678999999999999999999999999999974
No 74
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=98.16 E-value=5.7e-06 Score=71.29 Aligned_cols=66 Identities=12% Similarity=0.058 Sum_probs=54.2
Q ss_pred CcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHH---HHHHHHHHHhcccHHHHHHHHHHHHHh-CCCCHHH
Q psy16144 42 ISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLAL---HQKLASLLLRMKRYAEAIDVSQTIMKA-HPSYPKV 108 (122)
Q Consensus 42 nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~v---gykLA~~ylk~kr~~~AI~vc~kVL~~-~P~ypki 108 (122)
+|+.+.+|.-+ |++|+.+|++|+++ +|+++.+ .|+||.+|.+.|++++|++.++++++. +|.|..+
T Consensus 71 dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL-~Pd~aeA~~A~yNLAcaya~LGr~dEAla~LrrALelsn~~f~~i 149 (453)
T PLN03098 71 DVKTAEDAVNLGLSLFSKGRVKDALAQFETALEL-NPNPDEAQAAYYNKACCHAYREEGKKAADCLRTALRDYNLKFSTI 149 (453)
T ss_pred CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhh-CCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhcchhHHHH
Confidence 45555555522 99999999999999 7999855 999999999999999999999999997 5544333
No 75
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=98.16 E-value=1.8e-05 Score=47.72 Aligned_cols=57 Identities=19% Similarity=0.314 Sum_probs=53.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
+++|...++++.+. .|+++.+.+.+|..|...|++.+|++.++++++.+|..+.+..
T Consensus 16 ~~~A~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~ 72 (100)
T cd00189 16 YDEALEYYEKALEL-DPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYY 72 (100)
T ss_pred HHHHHHHHHHHHhc-CCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcchhHHH
Confidence 89999999999999 6999999999999999999999999999999999999886543
No 76
>PRK14574 hmsH outer membrane protein; Provisional
Probab=98.15 E-value=2e-05 Score=72.05 Aligned_cols=86 Identities=15% Similarity=0.120 Sum_probs=47.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|.+.|+|+.|.++++ +++.+|+++.++..+ +++|...++++... +|++... --+++.+...+++.+|
T Consensus 111 ly~~~gdyd~Aiely~kaL~~dP~n~~~l~gLa~~y~~~~q~~eAl~~l~~l~~~-dp~~~~~-l~layL~~~~~~~~~A 188 (822)
T PRK14574 111 AYRNEKRWDQALALWQSSLKKDPTNPDLISGMIMTQADAGRGGVVLKQATELAER-DPTVQNY-MTLSYLNRATDRNYDA 188 (822)
T ss_pred HHHHcCCHHHHHHHHHHHHhhCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHhccc-CcchHHH-HHHHHHHHhcchHHHH
Confidence 4555566666666666 666666655555422 55555555555555 3444443 3334444445555556
Q ss_pred HHHHHHHHHhCCCCHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir 109 (122)
+.+++++++.+|+++++.
T Consensus 189 L~~~ekll~~~P~n~e~~ 206 (822)
T PRK14574 189 LQASSEAVRLAPTSEEVL 206 (822)
T ss_pred HHHHHHHHHhCCCCHHHH
Confidence 666666666666665553
No 77
>PF12895 Apc3: Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=98.14 E-value=7e-06 Score=53.70 Aligned_cols=56 Identities=27% Similarity=0.430 Sum_probs=50.9
Q ss_pred HHHHHHHHHHHHHhhCCC--ChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTS--DLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~--~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
|++|+..|+++++.. |+ ++.+.+.||.+|.+.|+|.+|++++++ ++.+|.++.++-
T Consensus 5 y~~Ai~~~~k~~~~~-~~~~~~~~~~~la~~~~~~~~y~~A~~~~~~-~~~~~~~~~~~~ 62 (84)
T PF12895_consen 5 YENAIKYYEKLLELD-PTNPNSAYLYNLAQCYFQQGKYEEAIELLQK-LKLDPSNPDIHY 62 (84)
T ss_dssp HHHHHHHHHHHHHHH-CGTHHHHHHHHHHHHHHHTTHHHHHHHHHHC-HTHHHCHHHHHH
T ss_pred HHHHHHHHHHHHHHC-CCChhHHHHHHHHHHHHHCCCHHHHHHHHHH-hCCCCCCHHHHH
Confidence 789999999999996 63 678899999999999999999999999 889998888876
No 78
>PF13371 TPR_9: Tetratricopeptide repeat
Probab=98.14 E-value=9.9e-06 Score=50.99 Aligned_cols=57 Identities=14% Similarity=0.123 Sum_probs=50.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLA 79 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA 79 (122)
+|+++++|++|.+.++ +++.+|+++.+|... |.+|...++++++. +|+++.+..-.+
T Consensus 4 ~~~~~~~~~~A~~~~~~~l~~~p~~~~~~~~~a~~~~~~g~~~~A~~~l~~~l~~-~p~~~~~~~~~a 70 (73)
T PF13371_consen 4 IYLQQEDYEEALEVLERALELDPDDPELWLQRARCLFQLGRYEEALEDLERALEL-SPDDPDARALRA 70 (73)
T ss_pred HHHhCCCHHHHHHHHHHHHHhCcccchhhHHHHHHHHHhccHHHHHHHHHHHHHH-CCCcHHHHHHHH
Confidence 7899999999999999 999999999999844 99999999999999 699888765443
No 79
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=98.14 E-value=3e-05 Score=64.39 Aligned_cols=86 Identities=8% Similarity=0.039 Sum_probs=74.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHH----HHhhC-------HHHHHHHHHHHHHhhCCCCh--HHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATN----IYNLF-------PDDAVVQYTQACRIIGTSDL--ALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~k----A~e~l-------y~dAa~~ye~Aw~l~~~~~~--~vgykLA~~ylk~kr 87 (122)
++++.|++++|.+.++ .++.+|++.. .+... .+++....+++.+. +|+|| .+...||..+.+.|+
T Consensus 272 ~l~~~g~~~~A~~~l~~~l~~~pd~~~~~~~~l~~~~~l~~~~~~~~~~~~e~~lk~-~p~~~~~~ll~sLg~l~~~~~~ 350 (409)
T TIGR00540 272 HLIDCDDHDSAQEIIFDGLKKLGDDRAISLPLCLPIPRLKPEDNEKLEKLIEKQAKN-VDDKPKCCINRALGQLLMKHGE 350 (409)
T ss_pred HHHHCCChHHHHHHHHHHHhhCCCcccchhHHHHHhhhcCCCChHHHHHHHHHHHHh-CCCChhHHHHHHHHHHHHHccc
Confidence 8999999999999999 9999999874 33311 77889999999988 79999 889999999999999
Q ss_pred HHHHHHHHH--HHHHhCCCCHHH
Q psy16144 88 YAEAIDVSQ--TIMKAHPSYPKV 108 (122)
Q Consensus 88 ~~~AI~vc~--kVL~~~P~ypki 108 (122)
+.+|.+.++ +.++.+|+....
T Consensus 351 ~~~A~~~le~a~a~~~~p~~~~~ 373 (409)
T TIGR00540 351 FIEAADAFKNVAACKEQLDANDL 373 (409)
T ss_pred HHHHHHHHHHhHHhhcCCCHHHH
Confidence 999999999 688899977553
No 80
>KOG1840|consensus
Probab=98.13 E-value=2.8e-05 Score=67.90 Aligned_cols=94 Identities=23% Similarity=0.221 Sum_probs=72.8
Q ss_pred HhcCCCChHHHHHHHH-HHhh--------Ccc---hHH--HHhhC----HHHHHHHHHHHHHhhC----CCChHHH---H
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQY--------ISS---ATN--IYNLF----PDDAVVQYTQACRIIG----TSDLALH---Q 76 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~--------nps---~~k--A~e~l----y~dAa~~ye~Aw~l~~----~~~~~vg---y 76 (122)
.|..+|+||.|+.+++ +++. -|- +.+ |..|+ |++|+..|+.|+.... +++|.++ .
T Consensus 208 ~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~ 287 (508)
T KOG1840|consen 208 MYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLN 287 (508)
T ss_pred HHHHhccHHHHHHHHHHHHHHHHHccCccCHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHH
Confidence 8999999999999999 9987 121 111 11133 9999999999987654 7777765 6
Q ss_pred HHHHHHHhcccHHHHHHHHHHHHHhC-----CCCHHHHHHHHHH
Q psy16144 77 KLASLLLRMKRYAEAIDVSQTIMKAH-----PSYPKVHKQIFEK 115 (122)
Q Consensus 77 kLA~~ylk~kr~~~AI~vc~kVL~~~-----P~ypkir~~il~k 115 (122)
+||..|.+.|||.+|...|..++++. +++|++.....+-
T Consensus 288 nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~ 331 (508)
T KOG1840|consen 288 NLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSEL 331 (508)
T ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHhhccChHHHHHHHHHH
Confidence 89999999999999999999998764 4567776655543
No 81
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=98.13 E-value=2.3e-05 Score=52.15 Aligned_cols=54 Identities=20% Similarity=0.231 Sum_probs=49.5
Q ss_pred HHHHHHHHHHHHHhhCCCC---hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 53 PDDAVVQYTQACRIIGTSD---LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~---~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
|++|...|+++... +|++ +.+.+.+|..|.+.|++.+|+..+++++..+|++|.
T Consensus 18 ~~~A~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~p~~~~ 74 (119)
T TIGR02795 18 YADAIQAFQAFLKK-YPKSTYAPNAHYWLGEAYYAQGKYADAAKAFLAVVKKYPKSPK 74 (119)
T ss_pred HHHHHHHHHHHHHH-CCCccccHHHHHHHHHHHHhhccHHHHHHHHHHHHHHCCCCCc
Confidence 99999999999998 5766 578999999999999999999999999999998753
No 82
>KOG0553|consensus
Probab=98.12 E-value=7.3e-06 Score=67.48 Aligned_cols=58 Identities=19% Similarity=0.340 Sum_probs=55.7
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|.+|+..|.+|+++ +|+|+...++=|.+|.+.|.|++||.=|+.++.++|.|-|..-.
T Consensus 97 Y~eAv~kY~~AI~l-~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~R 154 (304)
T KOG0553|consen 97 YQEAVDKYTEAIEL-DPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGR 154 (304)
T ss_pred HHHHHHHHHHHHhc-CCCcchHHHHHHHHHHHhcchHHHHHHHHHHHhcChHHHHHHHH
Confidence 99999999999999 79999999999999999999999999999999999999998653
No 83
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=98.09 E-value=3.1e-05 Score=56.93 Aligned_cols=57 Identities=11% Similarity=0.197 Sum_probs=51.2
Q ss_pred HHHHHHHHHHHHHhhCCCC---hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSD---LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~---~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
|++|..+|++|++. .|+. +.+.+.||..|.+.|++.+|+..++++++.+|+++....
T Consensus 51 ~~~A~~~~~~al~~-~~~~~~~~~~~~~la~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~ 110 (172)
T PRK02603 51 YAEALENYEEALKL-EEDPNDRSYILYNMGIIYASNGEHDKALEYYHQALELNPKQPSALN 110 (172)
T ss_pred HHHHHHHHHHHHHH-hhccchHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCcccHHHHH
Confidence 99999999999998 3543 478999999999999999999999999999999987653
No 84
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=98.07 E-value=7e-05 Score=62.19 Aligned_cols=88 Identities=8% Similarity=0.010 Sum_probs=74.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh------hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN------LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e------~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~ 90 (122)
...+.|+++.|.++|. +.+.+|+..-+.. ++ |++|...++++.+. +|++|.+..-++..|++.|++.+
T Consensus 127 aA~~~g~~~~A~~~l~~A~~~~~~~~~~~~l~~a~l~l~~g~~~~Al~~l~~~~~~-~P~~~~al~ll~~~~~~~gdw~~ 205 (398)
T PRK10747 127 AAQQRGDEARANQHLERAAELADNDQLPVEITRVRIQLARNENHAARHGVDKLLEV-APRHPEVLRLAEQAYIRTGAWSS 205 (398)
T ss_pred HHHHCCCHHHHHHHHHHHHhcCCcchHHHHHHHHHHHHHCCCHHHHHHHHHHHHhc-CCCCHHHHHHHHHHHHHHHhHHH
Confidence 4478899999999999 9999988754432 22 99999999999999 69999999999999999999999
Q ss_pred HHHHHHHHHHhCCCCHHHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypkir~ 110 (122)
|++++.+..+..+..|....
T Consensus 206 a~~~l~~l~k~~~~~~~~~~ 225 (398)
T PRK10747 206 LLDILPSMAKAHVGDEEHRA 225 (398)
T ss_pred HHHHHHHHHHcCCCCHHHHH
Confidence 99999999988887666544
No 85
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=98.05 E-value=5.5e-05 Score=57.51 Aligned_cols=87 Identities=10% Similarity=0.044 Sum_probs=68.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh---hC---------HHHHHHHHHHHHHhhCCCChH---HHHHHHHHHHhc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN---LF---------PDDAVVQYTQACRIIGTSDLA---LHQKLASLLLRM 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e---~l---------y~dAa~~ye~Aw~l~~~~~~~---vgykLA~~ylk~ 85 (122)
-+.+.|+|++|.+.|+ ++...|.+..+=. .+ |.+|+..|++-.+. .|+++. +.|.+|.++.+.
T Consensus 14 ~~~~~g~y~~Ai~~f~~l~~~~P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~-yP~~~~~~~A~Y~~g~~~~~~ 92 (203)
T PF13525_consen 14 EALQQGDYEEAIKLFEKLIDRYPNSPYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKL-YPNSPKADYALYMLGLSYYKQ 92 (203)
T ss_dssp HHHHCT-HHHHHHHHHHHHHH-TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH--TT-TTHHHHHHHHHHHHHHH
T ss_pred HHHHCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-CCCCcchhhHHHHHHHHHHHh
Confidence 6778899999999999 9999998554433 11 99999999999998 688775 788999998776
Q ss_pred ccHH-----------HHHHHHHHHHHhCCCCHHHH
Q psy16144 86 KRYA-----------EAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 86 kr~~-----------~AI~vc~kVL~~~P~ypkir 109 (122)
.+-. +|+..++.+++.+|+.+.+.
T Consensus 93 ~~~~~~~~~D~~~~~~A~~~~~~li~~yP~S~y~~ 127 (203)
T PF13525_consen 93 IPGILRSDRDQTSTRKAIEEFEELIKRYPNSEYAE 127 (203)
T ss_dssp HHHHH-TT---HHHHHHHHHHHHHHHH-TTSTTHH
T ss_pred CccchhcccChHHHHHHHHHHHHHHHHCcCchHHH
Confidence 5544 99999999999999987664
No 86
>COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=98.03 E-value=4e-05 Score=61.58 Aligned_cols=100 Identities=14% Similarity=0.187 Sum_probs=84.1
Q ss_pred HHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhC-CCChHHHHHHHHHH
Q psy16144 16 AEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIG-TSDLALHQKLASLL 82 (122)
Q Consensus 16 ~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~-~~~~~vgykLA~~y 82 (122)
--|..+ +|-+.|..|.|.+.++ +++.+|.+...++-. |++|-..|++|..... +..+...-++|.+-
T Consensus 70 ~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdVLNNYG~FLC~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Ca 149 (250)
T COG3063 70 LAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERALADPAYGEPSDTLENLGLCA 149 (250)
T ss_pred HHHHHHHHHHHHcCChhhHHHHHHHHHhcCCCccchhhhhhHHHHhCCChHHHHHHHHHHHhCCCCCCcchhhhhhHHHH
Confidence 346666 8999999999999999 999999999887722 9999999999998622 23456678999999
Q ss_pred HhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHh
Q psy16144 83 LRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMV 118 (122)
Q Consensus 83 lk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~ 118 (122)
+|+|++..|.+.+++.|+.+|++|-.. +++|+.
T Consensus 150 l~~gq~~~A~~~l~raL~~dp~~~~~~---l~~a~~ 182 (250)
T COG3063 150 LKAGQFDQAEEYLKRALELDPQFPPAL---LELARL 182 (250)
T ss_pred hhcCCchhHHHHHHHHHHhCcCCChHH---HHHHHH
Confidence 999999999999999999999999874 444443
No 87
>PRK10747 putative protoheme IX biogenesis protein; Provisional
Probab=98.03 E-value=0.00012 Score=60.89 Aligned_cols=85 Identities=12% Similarity=0.049 Sum_probs=73.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHH--hhC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIY--NLF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~--e~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
.+++.|+.++|.+.++ .++.+|+..-+. -.+ +++|....++..+. +|+||...+-+|..+++.|++.+|.+.
T Consensus 272 ~l~~~g~~~~A~~~L~~~l~~~~~~~l~~l~~~l~~~~~~~al~~~e~~lk~-~P~~~~l~l~lgrl~~~~~~~~~A~~~ 350 (398)
T PRK10747 272 HLIECDDHDTAQQIILDGLKRQYDERLVLLIPRLKTNNPEQLEKVLRQQIKQ-HGDTPLLWSTLGQLLMKHGEWQEASLA 350 (398)
T ss_pred HHHHCCCHHHHHHHHHHHHhcCCCHHHHHHHhhccCCChHHHHHHHHHHHhh-CCCCHHHHHHHHHHHHHCCCHHHHHHH
Confidence 8889999999999999 999766543222 222 88898888887777 899999999999999999999999999
Q ss_pred HHHHHHhCCCCHH
Q psy16144 95 SQTIMKAHPSYPK 107 (122)
Q Consensus 95 c~kVL~~~P~ypk 107 (122)
++++++.+|+++.
T Consensus 351 le~al~~~P~~~~ 363 (398)
T PRK10747 351 FRAALKQRPDAYD 363 (398)
T ss_pred HHHHHhcCCCHHH
Confidence 9999999998876
No 88
>PF13431 TPR_17: Tetratricopeptide repeat
Probab=98.02 E-value=5.7e-06 Score=46.98 Aligned_cols=34 Identities=29% Similarity=0.420 Sum_probs=31.8
Q ss_pred HHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHH
Q psy16144 59 QYTQACRIIGTSDLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 59 ~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
+|++|+++ +|+|+.+.++||..|...|++++|+.
T Consensus 1 ~y~kAie~-~P~n~~a~~nla~~~~~~g~~~~A~~ 34 (34)
T PF13431_consen 1 CYKKAIEL-NPNNAEAYNNLANLYLNQGDYEEAIA 34 (34)
T ss_pred ChHHHHHH-CCCCHHHHHHHHHHHHHCcCHHhhcC
Confidence 48999999 89999999999999999999999973
No 89
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=98.02 E-value=3.9e-05 Score=66.78 Aligned_cols=53 Identities=15% Similarity=0.047 Sum_probs=48.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
+++|...|++|.++ +| +..+.+-||..|...|++.+|++.|++++..+|.+|.
T Consensus 436 ~~~A~~~l~rAl~L-~p-s~~a~~~lG~~~~~~G~~~eA~~~~~~A~~L~P~~pt 488 (517)
T PRK10153 436 TDEAYQAINKAIDL-EM-SWLNYVLLGKVYELKGDNRLAADAYSTAFNLRPGENT 488 (517)
T ss_pred HHHHHHHHHHHHHc-CC-CHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCCCCch
Confidence 99999999999999 68 4778888899999999999999999999999998874
No 90
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=98.00 E-value=5.9e-05 Score=63.37 Aligned_cols=101 Identities=14% Similarity=0.186 Sum_probs=83.8
Q ss_pred HHH-HhcCCCChHHHHHHHH-HHhhCcc----------hHHHHhhC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHH
Q psy16144 19 HFL-LYLLPWRWEPLSAGLR-KKQYISS----------ATNIYNLF----PDDAVVQYTQACRIIGTSDLALHQKLASLL 82 (122)
Q Consensus 19 ~~L-Lyi~~gk~d~A~elL~-~l~~nps----------~~kA~e~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~y 82 (122)
+++ ||-+...+++|++.=+ ..++.|. |--|-.++ .+.|...+++|..- +|+...++.-||..+
T Consensus 146 qLl~IYQ~treW~KAId~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~~~d~A~~~l~kAlqa-~~~cvRAsi~lG~v~ 224 (389)
T COG2956 146 QLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQA-DKKCVRASIILGRVE 224 (389)
T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHcCCccchhHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHhh-CccceehhhhhhHHH
Confidence 444 8889999999999888 7777776 22222333 88899999999999 899999999999999
Q ss_pred HhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcc
Q psy16144 83 LRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMVHL 120 (122)
Q Consensus 83 lk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~~l 120 (122)
+..|+|..|++.++.|+.+||+|.--=-+-|-.|+..+
T Consensus 225 ~~~g~y~~AV~~~e~v~eQn~~yl~evl~~L~~~Y~~l 262 (389)
T COG2956 225 LAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQL 262 (389)
T ss_pred HhccchHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHh
Confidence 99999999999999999999999665556677777654
No 91
>COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion]
Probab=97.99 E-value=3.9e-05 Score=61.96 Aligned_cols=90 Identities=17% Similarity=0.187 Sum_probs=79.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
-+...|+-|.+.-++. .+..+|++.....-. |.+|...+++|..+ +|+|..+..-||..|.+.||+++|
T Consensus 75 a~~~~G~a~~~l~~~~~~~~~~~~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l-~p~d~~~~~~lgaaldq~Gr~~~A 153 (257)
T COG5010 75 ALYLRGDADSSLAVLQKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARL-APTDWEAWNLLGAALDQLGRFDEA 153 (257)
T ss_pred HHHhcccccchHHHHhhhhccCcccHHHHHHHHHHHHHhcchHHHHHHHHHHhcc-CCCChhhhhHHHHHHHHccChhHH
Confidence 4555689999999999 888888876555211 99999999999999 699999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~i 112 (122)
-.-|++.++..|+.|.+...+
T Consensus 154 r~ay~qAl~L~~~~p~~~nNl 174 (257)
T COG5010 154 RRAYRQALELAPNEPSIANNL 174 (257)
T ss_pred HHHHHHHHHhccCCchhhhhH
Confidence 999999999999999998776
No 92
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=97.98 E-value=0.00011 Score=58.01 Aligned_cols=93 Identities=9% Similarity=0.001 Sum_probs=70.6
Q ss_pred HHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHh--------hC----HHHHHHHHHHHHHhhCCCChHH---HH
Q psy16144 15 IAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYN--------LF----PDDAVVQYTQACRIIGTSDLAL---HQ 76 (122)
Q Consensus 15 ~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--------~l----y~dAa~~ye~Aw~l~~~~~~~v---gy 76 (122)
..+++-. -+.+.|+|++|.+.|+ ++...|..+.+-. +. |++|+..|++..+. .|++|.+ .|
T Consensus 32 ~~~~Y~~A~~~~~~g~y~~Ai~~f~~l~~~yP~s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi~~-~P~~~~~~~a~Y 110 (243)
T PRK10866 32 PSEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRL-NPTHPNIDYVLY 110 (243)
T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCCCChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHh-CcCCCchHHHHH
Confidence 3344444 6677899999999999 9999998766642 11 99999999999999 7888765 57
Q ss_pred HHHHHHHhcc------------------cHHHHHHHHHHHHHhCCCCHHH
Q psy16144 77 KLASLLLRMK------------------RYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 77 kLA~~ylk~k------------------r~~~AI~vc~kVL~~~P~ypki 108 (122)
.+|.++...+ .-.+|+..++++++.+|+.+-+
T Consensus 111 ~~g~~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~~~~li~~yP~S~ya 160 (243)
T PRK10866 111 MRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFRDFSKLVRGYPNSQYT 160 (243)
T ss_pred HHHHhhhhcchhhhhhccCCCccccCHHHHHHHHHHHHHHHHHCcCChhH
Confidence 7787654443 1357999999999999987544
No 93
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=97.97 E-value=2.3e-05 Score=58.26 Aligned_cols=42 Identities=26% Similarity=0.400 Sum_probs=38.2
Q ss_pred CCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 68 GTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 68 ~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
++-.+.+-+.||++|.+.|+|++|++.+++.+++||++|+|-
T Consensus 43 g~ya~qAqL~l~yayy~~~~y~~A~a~~~rFirLhP~hp~vd 84 (142)
T PF13512_consen 43 GEYAEQAQLDLAYAYYKQGDYEEAIAAYDRFIRLHPTHPNVD 84 (142)
T ss_pred CcccHHHHHHHHHHHHHccCHHHHHHHHHHHHHhCCCCCCcc
Confidence 456678899999999999999999999999999999999874
No 94
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=97.97 E-value=2.8e-05 Score=61.32 Aligned_cols=57 Identities=11% Similarity=0.027 Sum_probs=53.8
Q ss_pred HHHHHHHHHHHHHhhCCCChHHH---HHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALH---QKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vg---ykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
|++|+..|++.... .|..+.+- +.||.+|.+.|+|.+|+..++++++.+|++|++--
T Consensus 48 y~~Ai~~f~~l~~~-yP~s~~a~~a~l~la~ayy~~~~y~~A~~~~e~fi~~~P~~~~~~~ 107 (243)
T PRK10866 48 WKQAITQLEALDNR-YPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFIRLNPTHPNIDY 107 (243)
T ss_pred HHHHHHHHHHHHHh-CCCChHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHhCcCCCchHH
Confidence 99999999999998 79988887 99999999999999999999999999999999853
No 95
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=97.95 E-value=6.8e-05 Score=60.76 Aligned_cols=88 Identities=16% Similarity=0.128 Sum_probs=71.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchH---HHHhhC---------HHHHHHHHHHHHHhh--CCCChHHHHHHHHHHHhcc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSAT---NIYNLF---------PDDAVVQYTQACRII--GTSDLALHQKLASLLLRMK 86 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~---kA~e~l---------y~dAa~~ye~Aw~l~--~~~~~~vgykLA~~ylk~k 86 (122)
=++.+|+|+.|++.|+ =++-=|+++ .|+..+ |++|+..|..+.+-- .|.-|...+|||-+....|
T Consensus 150 ~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~~~~~l~ 229 (262)
T COG1729 150 DLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGVSLGRLG 229 (262)
T ss_pred HHHHcCCHHHHHHHHHHHHHcCCCCcccchhHHHHHHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHHHHHHHHHHhc
Confidence 4556799999999999 777777743 333322 999999999987751 2445678999999999999
Q ss_pred cHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 87 RYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 87 r~~~AI~vc~kVL~~~P~ypkir 109 (122)
+.++|-.++++|.+.+|+.+..+
T Consensus 230 ~~d~A~atl~qv~k~YP~t~aA~ 252 (262)
T COG1729 230 NTDEACATLQQVIKRYPGTDAAK 252 (262)
T ss_pred CHHHHHHHHHHHHHHCCCCHHHH
Confidence 99999999999999999887664
No 96
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=97.94 E-value=8.4e-05 Score=53.25 Aligned_cols=72 Identities=18% Similarity=0.079 Sum_probs=45.4
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHh---hC---------HHHHHHHHHHHHHhhCCCC---hHHHHHHHHHHHhcccH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYN---LF---------PDDAVVQYTQACRIIGTSD---LALHQKLASLLLRMKRY 88 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e---~l---------y~dAa~~ye~Aw~l~~~~~---~~vgykLA~~ylk~kr~ 88 (122)
+.|+.+.+.+.+. .++-+|+.+.+.. .+ |++|...|+.+.+-. |++ +.+.++||.+++..|+|
T Consensus 23 ~~~~~~~~~~~~~~l~~~~~~s~ya~~A~l~lA~~~~~~g~~~~A~~~l~~~~~~~-~d~~l~~~a~l~LA~~~~~~~~~ 101 (145)
T PF09976_consen 23 QAGDPAKAEAAAEQLAKDYPSSPYAALAALQLAKAAYEQGDYDEAKAALEKALANA-PDPELKPLARLRLARILLQQGQY 101 (145)
T ss_pred HCCCHHHHHHHHHHHHHHCCCChHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHhhC-CCHHHHHHHHHHHHHHHHHcCCH
Confidence 5667777777777 6666776633322 11 777777777777663 433 34566777777777777
Q ss_pred HHHHHHHHH
Q psy16144 89 AEAIDVSQT 97 (122)
Q Consensus 89 ~~AI~vc~k 97 (122)
++|+++++.
T Consensus 102 d~Al~~L~~ 110 (145)
T PF09976_consen 102 DEALATLQQ 110 (145)
T ss_pred HHHHHHHHh
Confidence 777776655
No 97
>PF14559 TPR_19: Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=97.94 E-value=1.1e-05 Score=50.14 Aligned_cols=56 Identities=16% Similarity=0.051 Sum_probs=49.6
Q ss_pred cCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHH
Q psy16144 24 LLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLAS 80 (122)
Q Consensus 24 i~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~ 80 (122)
++.|+|++|.++|+ +++.+|++..++..+ +++|...++++... +|+++.+..-+|.
T Consensus 2 l~~~~~~~A~~~~~~~l~~~p~~~~~~~~la~~~~~~g~~~~A~~~l~~~~~~-~~~~~~~~~l~a~ 67 (68)
T PF14559_consen 2 LKQGDYDEAIELLEKALQRNPDNPEARLLLAQCYLKQGQYDEAEELLERLLKQ-DPDNPEYQQLLAQ 67 (68)
T ss_dssp HHTTHHHHHHHHHHHHHHHTTTSHHHHHHHHHHHHHTT-HHHHHHHHHCCHGG-GTTHHHHHHHHHH
T ss_pred hhccCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-CcCHHHHHHHHhc
Confidence 56899999999999 999999999998854 99999999999999 6988887766653
No 98
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=97.92 E-value=0.00014 Score=56.26 Aligned_cols=81 Identities=14% Similarity=0.150 Sum_probs=66.3
Q ss_pred hHHHHHHHH-HHhhCcchHHHHh-----hC--------------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc-
Q psy16144 29 WEPLSAGLR-KKQYISSATNIYN-----LF--------------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR- 87 (122)
Q Consensus 29 ~d~A~elL~-~l~~nps~~kA~e-----~l--------------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr- 87 (122)
|+.|.+... ....||.++.++. ++ ++||+.-|+.|+.+ +|+...+.+.||.+|.-.++
T Consensus 7 FE~ark~aea~y~~nP~DadnL~~WG~ALLELAqfk~g~es~~miedAisK~eeAL~I-~P~~hdAlw~lGnA~ts~A~l 85 (186)
T PF06552_consen 7 FEHARKKAEAAYAKNPLDADNLTNWGGALLELAQFKQGPESKKMIEDAISKFEEALKI-NPNKHDALWCLGNAYTSLAFL 85 (186)
T ss_dssp HHHHHHHHHHHHHH-TT-HHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHH--TT-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCcHhHHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhc-CCchHHHHHHHHHHHHHHHhh
Confidence 678888888 8899999888776 11 99999999999999 89999999999999988776
Q ss_pred ----------HHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 88 ----------YAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 88 ----------~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
|.+|.+.++++.+.+|++.--|+
T Consensus 86 ~~d~~~A~~~F~kA~~~FqkAv~~~P~ne~Y~k 118 (186)
T PF06552_consen 86 TPDTAEAEEYFEKATEYFQKAVDEDPNNELYRK 118 (186)
T ss_dssp ---HHHHHHHHHHHHHHHHHHHHH-TT-HHHHH
T ss_pred cCChHHHHHHHHHHHHHHHHHHhcCCCcHHHHH
Confidence 88899999999999999987776
No 99
>KOG0548|consensus
Probab=97.89 E-value=7.6e-05 Score=65.39 Aligned_cols=87 Identities=11% Similarity=0.128 Sum_probs=71.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
-+.+.|+|.+|+..+. +++.||+++++|- |+ |..|...-+.+.++ +|+++...++-|-++--+++|.+|
T Consensus 367 e~Fk~gdy~~Av~~YteAIkr~P~Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL-~p~~~kgy~RKg~al~~mk~ydkA 445 (539)
T KOG0548|consen 367 EAFKKGDYPEAVKHYTEAIKRDPEDARLYSNRAACYLKLGEYPEALKDAKKCIEL-DPNFIKAYLRKGAALRAMKEYDKA 445 (539)
T ss_pred HHHhccCHHHHHHHHHHHHhcCCchhHHHHHHHHHHHHHhhHHHHHHHHHHHHhc-CchHHHHHHHHHHHHHHHHHHHHH
Confidence 4567788888888888 8888888888776 22 88888888888888 788888888888888888888888
Q ss_pred HHHHHHHHHhCCCCHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir 109 (122)
.++|++.|..+|+...+-
T Consensus 446 leay~eale~dp~~~e~~ 463 (539)
T KOG0548|consen 446 LEAYQEALELDPSNAEAI 463 (539)
T ss_pred HHHHHHHHhcCchhHHHH
Confidence 888888888888766553
No 100
>TIGR00540 hemY_coli hemY protein. This is an uncharacterized protein encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (E. coli and H. influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis.
Probab=97.89 E-value=0.00017 Score=59.92 Aligned_cols=87 Identities=11% Similarity=0.041 Sum_probs=47.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCCh-HHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDL-ALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~-~vgykLA~~ylk~kr~~~ 90 (122)
+....|+++.|++.+. ..++.|+..-++... ++.|...|+++.+.. |++. .+...+|..++..|++++
T Consensus 93 la~~~g~~~~A~~~l~~~~~~~~~~~~~~llaA~aa~~~g~~~~A~~~l~~a~~~~-p~~~l~~~~~~a~l~l~~~~~~~ 171 (409)
T TIGR00540 93 LKLAEGDYAKAEKLIAKNADHAAEPVLNLIKAAEAAQQRGDEARANQHLEEAAELA-GNDNILVEIARTRILLAQNELHA 171 (409)
T ss_pred HHHhCCCHHHHHHHHHHHhhcCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhC-CcCchHHHHHHHHHHHHCCCHHH
Confidence 4555566666666666 555555533333211 566666666665552 4432 455555666666666666
Q ss_pred HHHHHHHHHHhCCCCHHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypkir 109 (122)
|.+..++.++.+|++|.+.
T Consensus 172 Al~~l~~l~~~~P~~~~~l 190 (409)
T TIGR00540 172 ARHGVDKLLEMAPRHKEVL 190 (409)
T ss_pred HHHHHHHHHHhCCCCHHHH
Confidence 6666666666666666554
No 101
>KOG0543|consensus
Probab=97.89 E-value=0.00014 Score=61.98 Aligned_cols=91 Identities=11% Similarity=0.163 Sum_probs=76.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcch-------HHHHh-------------hC----HHHHHHHHHHHHHhhCCCChHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSA-------TNIYN-------------LF----PDDAVVQYTQACRIIGTSDLALHQ 76 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~-------~kA~e-------------~l----y~dAa~~ye~Aw~l~~~~~~~vgy 76 (122)
.|.+.|+|..|...++ +++.-..+ .+..+ ++ |++|+.+=.+++++ +|+|+-+.|
T Consensus 217 ~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~-~~~N~KALy 295 (397)
T KOG0543|consen 217 VLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKLKEYKEAIESCNKVLEL-DPNNVKALY 295 (397)
T ss_pred HHHhhchHHHHHHHHHHHHHHhhccccCCHHHHHHHHHHHHHHhhHHHHHHHhhhhHHHHHHHHHHHHhc-CCCchhHHH
Confidence 7888899999988877 65543321 11111 11 99999999999999 799999999
Q ss_pred HHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy16144 77 KLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 77 kLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il 113 (122)
+=|.+|+-.+.|.+|+..+++|++..|++-.|+.+|.
T Consensus 296 RrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el~ 332 (397)
T KOG0543|consen 296 RRGQALLALGEYDLARDDFQKALKLEPSNKAARAELI 332 (397)
T ss_pred HHHHHHHhhccHHHHHHHHHHHHHhCCCcHHHHHHHH
Confidence 9999999999999999999999999999998988875
No 102
>KOG2076|consensus
Probab=97.87 E-value=0.0002 Score=65.89 Aligned_cols=80 Identities=16% Similarity=0.141 Sum_probs=61.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC-------------------------------------------HHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF-------------------------------------------PDDAV 57 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l-------------------------------------------y~dAa 57 (122)
++++ |++++|+++|. +++++|-.+.||+.+ ++.|.
T Consensus 149 lfar-g~~eeA~~i~~EvIkqdp~~~~ay~tL~~IyEqrGd~eK~l~~~llAAHL~p~d~e~W~~ladls~~~~~i~qA~ 227 (895)
T KOG2076|consen 149 LFAR-GDLEEAEEILMEVIKQDPRNPIAYYTLGEIYEQRGDIEKALNFWLLAAHLNPKDYELWKRLADLSEQLGNINQAR 227 (895)
T ss_pred HHHh-CCHHHHHHHHHHHHHhCccchhhHHHHHHHHHHcccHHHHHHHHHHHHhcCCCChHHHHHHHHHHHhcccHHHHH
Confidence 3444 77777777777 777777777777633 77888
Q ss_pred HHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 58 VQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 58 ~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
-+|-+|+.+ +|.+-..-++-+..|.++|.+..|.+.+.+++...|
T Consensus 228 ~cy~rAI~~-~p~n~~~~~ers~L~~~~G~~~~Am~~f~~l~~~~p 272 (895)
T KOG2076|consen 228 YCYSRAIQA-NPSNWELIYERSSLYQKTGDLKRAMETFLQLLQLDP 272 (895)
T ss_pred HHHHHHHhc-CCcchHHHHHHHHHHHHhChHHHHHHHHHHHHhhCC
Confidence 888888888 688888888888888888888888888888888888
No 103
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=97.85 E-value=0.00012 Score=58.72 Aligned_cols=59 Identities=19% Similarity=0.217 Sum_probs=54.2
Q ss_pred HHHHHHHHHHHHHhhCCCC---hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSD---LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~---~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~i 112 (122)
|++|+..|+...+. .|++ |.+.|.||..|...|+|.+|+..+++|++.+|++|+.-.-.
T Consensus 159 y~~Ai~af~~fl~~-yP~s~~a~~A~y~LG~~y~~~g~~~~A~~~f~~vv~~yP~s~~~~dAl 220 (263)
T PRK10803 159 QDDAIVAFQNFVKK-YPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAM 220 (263)
T ss_pred HHHHHHHHHHHHHH-CcCCcchHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhHHH
Confidence 99999999999999 6988 58999999999999999999999999999999988775433
No 104
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=97.83 E-value=5e-05 Score=40.93 Aligned_cols=32 Identities=28% Similarity=0.391 Sum_probs=30.4
Q ss_pred HHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 74 LHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 74 vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
+.|++|.+|.+.|++.+|++.++++++.+|++
T Consensus 2 a~~~~a~~~~~~g~~~~A~~~~~~~~~~~P~s 33 (33)
T PF13174_consen 2 ALYRLARCYYKLGDYDEAIEYFQRLIKRYPDS 33 (33)
T ss_dssp HHHHHHHHHHHHCHHHHHHHHHHHHHHHSTTS
T ss_pred HHHHHHHHHHHccCHHHHHHHHHHHHHHCcCC
Confidence 67999999999999999999999999999975
No 105
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=97.82 E-value=5.1e-05 Score=61.46 Aligned_cols=89 Identities=18% Similarity=0.153 Sum_probs=69.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHH-----HHhhC------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATN-----IYNLF------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYA 89 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~k-----A~e~l------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~ 89 (122)
+|+..+|+|.|.+.++ .-+.+.+..- ||-.+ |.+|.-.|+..-+- -+.+|.+...+|.+.+..|+|.
T Consensus 140 i~L~~~R~dlA~k~l~~~~~~~eD~~l~qLa~awv~l~~g~e~~~~A~y~f~El~~~-~~~t~~~lng~A~~~l~~~~~~ 218 (290)
T PF04733_consen 140 ILLKMNRPDLAEKELKNMQQIDEDSILTQLAEAWVNLATGGEKYQDAFYIFEELSDK-FGSTPKLLNGLAVCHLQLGHYE 218 (290)
T ss_dssp HHHHTT-HHHHHHHHHHHHCCSCCHHHHHHHHHHHHHHHTTTCCCHHHHHHHHHHCC-S--SHHHHHHHHHHHHHCT-HH
T ss_pred HHHHcCCHHHHHHHHHHHHhcCCcHHHHHHHHHHHHHHhCchhHHHHHHHHHHHHhc-cCCCHHHHHHHHHHHHHhCCHH
Confidence 8889999999999999 8777776443 33222 89999999997665 3678899999999999999999
Q ss_pred HHHHHHHHHHHhCCCCHHHHHH
Q psy16144 90 EAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 90 ~AI~vc~kVL~~~P~ypkir~~ 111 (122)
+|+++..++|..+|++|..-..
T Consensus 219 eAe~~L~~al~~~~~~~d~LaN 240 (290)
T PF04733_consen 219 EAEELLEEALEKDPNDPDTLAN 240 (290)
T ss_dssp HHHHHHHHHCCC-CCHHHHHHH
T ss_pred HHHHHHHHHHHhccCCHHHHHH
Confidence 9999999999999999886443
No 106
>KOG4162|consensus
Probab=97.77 E-value=0.00018 Score=65.40 Aligned_cols=85 Identities=12% Similarity=0.084 Sum_probs=73.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYA 89 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~ 89 (122)
++...|+.++|.+.|. ++..||+.+...- ++ -.++......|.++ +|+||.++|.||-++.+.|+..
T Consensus 693 ~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Ala~~lle~G~~~la~~~~~L~dalr~-dp~n~eaW~~LG~v~k~~Gd~~ 771 (799)
T KOG4162|consen 693 LLEVKGQLEEAKEAFLVALALDPDHVPSMTALAELLLELGSPRLAEKRSLLSDALRL-DPLNHEAWYYLGEVFKKLGDSK 771 (799)
T ss_pred HHHHHHhhHHHHHHHHHHHhcCCCCcHHHHHHHHHHHHhCCcchHHHHHHHHHHHhh-CCCCHHHHHHHHHHHHHccchH
Confidence 7888999999999999 9999999665443 12 44444599999999 7999999999999999999999
Q ss_pred HHHHHHHHHHHhCCCCHH
Q psy16144 90 EAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 90 ~AI~vc~kVL~~~P~ypk 107 (122)
+|.++|..++...+..|=
T Consensus 772 ~Aaecf~aa~qLe~S~PV 789 (799)
T KOG4162|consen 772 QAAECFQAALQLEESNPV 789 (799)
T ss_pred HHHHHHHHHHhhccCCCc
Confidence 999999999999998874
No 107
>KOG2076|consensus
Probab=97.77 E-value=0.00047 Score=63.43 Aligned_cols=62 Identities=8% Similarity=0.049 Sum_probs=52.4
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEK 115 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~k 115 (122)
+++|......+.+. +|.++.+.+.||-.|...|+.++|....-.+--.+|..+..|+.+=|-
T Consensus 155 ~eeA~~i~~EvIkq-dp~~~~ay~tL~~IyEqrGd~eK~l~~~llAAHL~p~d~e~W~~ladl 216 (895)
T KOG2076|consen 155 LEEAEEILMEVIKQ-DPRNPIAYYTLGEIYEQRGDIEKALNFWLLAAHLNPKDYELWKRLADL 216 (895)
T ss_pred HHHHHHHHHHHHHh-CccchhhHHHHHHHHHHcccHHHHHHHHHHHHhcCCCChHHHHHHHHH
Confidence 88999999999999 799999999999999999998888887777777888877777765443
No 108
>KOG0624|consensus
Probab=97.70 E-value=0.00043 Score=59.11 Aligned_cols=95 Identities=17% Similarity=0.132 Sum_probs=64.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc---hHHHHhhC---------------------------------------------
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS---ATNIYNLF--------------------------------------------- 52 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps---~~kA~e~l--------------------------------------------- 52 (122)
|+-+.|+.+.+..-+| ||+.||+ |.-.|--+
T Consensus 232 L~Y~vgd~~~sL~~iRECLKldpdHK~Cf~~YKklkKv~K~les~e~~ie~~~~t~cle~ge~vlk~ep~~~~ir~~~~r 311 (504)
T KOG0624|consen 232 LLYTVGDAENSLKEIRECLKLDPDHKLCFPFYKKLKKVVKSLESAEQAIEEKHWTECLEAGEKVLKNEPEETMIRYNGFR 311 (504)
T ss_pred HHHhhhhHHHHHHHHHHHHccCcchhhHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhcCCcccceeeeeeh
Confidence 6777899999999999 9999998 33333210
Q ss_pred -----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHh
Q psy16144 53 -----------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMV 118 (122)
Q Consensus 53 -----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~ 118 (122)
+.+|+..-..++.. .|+|..+.+.=|-.|+-.-.|++||.-|+++++.+|++.++| +-+++|.+
T Consensus 312 ~~c~C~~~d~~~~eAiqqC~evL~~-d~~dv~~l~dRAeA~l~dE~YD~AI~dye~A~e~n~sn~~~r-eGle~Akr 386 (504)
T KOG0624|consen 312 VLCTCYREDEQFGEAIQQCKEVLDI-DPDDVQVLCDRAEAYLGDEMYDDAIHDYEKALELNESNTRAR-EGLERAKR 386 (504)
T ss_pred eeeecccccCCHHHHHHHHHHHHhc-CchHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhcCcccHHHH-HHHHHHHH
Confidence 55566666666666 466666666666666666666666666666666666666666 35565543
No 109
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=97.69 E-value=8.9e-05 Score=47.34 Aligned_cols=61 Identities=13% Similarity=0.110 Sum_probs=42.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
+|...|+|++|.++++ +++.....+. .+|.-..+.++||.+|...|++.+|+..++++++
T Consensus 14 ~~~~~~~~~~A~~~~~~al~~~~~~~~-------------------~~~~~a~~~~~lg~~~~~~g~~~~A~~~~~~al~ 74 (78)
T PF13424_consen 14 VYRELGRYDEALDYYEKALDIEEQLGD-------------------DHPDTANTLNNLGECYYRLGDYEEALEYYQKALD 74 (78)
T ss_dssp HHHHTT-HHHHHHHHHHHHHHHHHTTT-------------------HHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHH
T ss_pred HHHHcCCHHHHHHHHHHHHHHHHHHCC-------------------CCHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHh
Confidence 6667777777777777 6654221110 0112245678999999999999999999999997
Q ss_pred h
Q psy16144 101 A 101 (122)
Q Consensus 101 ~ 101 (122)
+
T Consensus 75 i 75 (78)
T PF13424_consen 75 I 75 (78)
T ss_dssp H
T ss_pred h
Confidence 5
No 110
>KOG0548|consensus
Probab=97.66 E-value=0.00033 Score=61.49 Aligned_cols=88 Identities=14% Similarity=0.119 Sum_probs=78.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----h--C--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----L--F--PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~--l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
...++|+|+.|+.++. ++.+.|.+..-|. | + |.+|..-=.+..++ +|+=|-...++|..+.-.|+|++|
T Consensus 11 aa~s~~d~~~ai~~~t~ai~l~p~nhvlySnrsaa~a~~~~~~~al~da~k~~~l-~p~w~kgy~r~Gaa~~~lg~~~eA 89 (539)
T KOG0548|consen 11 AAFSSGDFETAIRLFTEAIMLSPTNHVLYSNRSAAYASLGSYEKALKDATKTRRL-NPDWAKGYSRKGAALFGLGDYEEA 89 (539)
T ss_pred hhcccccHHHHHHHHHHHHccCCCccchhcchHHHHHHHhhHHHHHHHHHHHHhc-CCchhhHHHHhHHHHHhcccHHHH
Confidence 5678999999999999 9999998554443 1 1 99999999999999 899999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~ 110 (122)
+.-|.+-|+.+|+++-++.
T Consensus 90 ~~ay~~GL~~d~~n~~L~~ 108 (539)
T KOG0548|consen 90 ILAYSEGLEKDPSNKQLKT 108 (539)
T ss_pred HHHHHHHhhcCCchHHHHH
Confidence 9999999999999887764
No 111
>PRK11906 transcriptional regulator; Provisional
Probab=97.65 E-value=0.00036 Score=60.42 Aligned_cols=77 Identities=13% Similarity=0.026 Sum_probs=54.7
Q ss_pred ChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHH
Q psy16144 28 RWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQT 97 (122)
Q Consensus 28 k~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~k 97 (122)
.-.+|.++-+ +++++|+++.|+-.+ ++.|...+++|..+ +|+++.+.|-+|.+..-.|+.++|++..++
T Consensus 319 ~~~~a~~~A~rAveld~~Da~a~~~~g~~~~~~~~~~~a~~~f~rA~~L-~Pn~A~~~~~~~~~~~~~G~~~~a~~~i~~ 397 (458)
T PRK11906 319 AAQKALELLDYVSDITTVDGKILAIMGLITGLSGQAKVSHILFEQAKIH-STDIASLYYYRALVHFHNEKIEEARICIDK 397 (458)
T ss_pred HHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHhhcchhhHHHHHHHHhhc-CCccHHHHHHHHHHHHHcCCHHHHHHHHHH
Confidence 3456666666 777777777766633 77777777777777 577777777777777777777777777777
Q ss_pred HHHhCCCC
Q psy16144 98 IMKAHPSY 105 (122)
Q Consensus 98 VL~~~P~y 105 (122)
.++.+|.-
T Consensus 398 alrLsP~~ 405 (458)
T PRK11906 398 SLQLEPRR 405 (458)
T ss_pred HhccCchh
Confidence 77777743
No 112
>COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones]
Probab=97.65 E-value=0.00037 Score=57.15 Aligned_cols=81 Identities=21% Similarity=0.236 Sum_probs=69.6
Q ss_pred hHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcc---cHHHHHHHH
Q psy16144 29 WEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMK---RYAEAIDVS 95 (122)
Q Consensus 29 ~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~k---r~~~AI~vc 95 (122)
.+.-..-++ -|+.||++++.|-.+ +.+|...|.+|.++ .|+||.+.--+|-.+.-+. .-.++.++.
T Consensus 138 ~~~l~a~Le~~L~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL-~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll 216 (287)
T COG4235 138 MEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRL-AGDNPEILLGLAEALYYQAGQQMTAKARALL 216 (287)
T ss_pred HHHHHHHHHHHHHhCCCCchhHHHHHHHHHHhcchhHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHhcCCcccHHHHHHH
Confidence 444566788 999999999999855 99999999999999 5999999999998887654 356899999
Q ss_pred HHHHHhCCCCHHHHH
Q psy16144 96 QTIMKAHPSYPKVHK 110 (122)
Q Consensus 96 ~kVL~~~P~ypkir~ 110 (122)
+++|+.+|++++.+.
T Consensus 217 ~~al~~D~~~iral~ 231 (287)
T COG4235 217 RQALALDPANIRALS 231 (287)
T ss_pred HHHHhcCCccHHHHH
Confidence 999999999998864
No 113
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=97.55 E-value=0.00044 Score=60.45 Aligned_cols=84 Identities=19% Similarity=0.178 Sum_probs=75.6
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhc
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRM 85 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~ 85 (122)
++++ .|...|.+++|.++++ +++++|.++..|..- +++|+...+.|-.+ +..|--+-.+.+--+++.
T Consensus 197 ~~~lAqhyd~~g~~~~Al~~Id~aI~htPt~~ely~~KarilKh~G~~~~Aa~~~~~Ar~L-D~~DRyiNsK~aKy~LRa 275 (517)
T PF12569_consen 197 LYFLAQHYDYLGDYEKALEYIDKAIEHTPTLVELYMTKARILKHAGDLKEAAEAMDEAREL-DLADRYINSKCAKYLLRA 275 (517)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHCCCHHHHHHHHHHHHhC-ChhhHHHHHHHHHHHHHC
Confidence 3566 7889999999999999 999999999877622 99999999999999 899999999999999999
Q ss_pred ccHHHHHHHHHHHHHhC
Q psy16144 86 KRYAEAIDVSQTIMKAH 102 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~ 102 (122)
|+.++|+.++-..-+.+
T Consensus 276 ~~~e~A~~~~~~Ftr~~ 292 (517)
T PF12569_consen 276 GRIEEAEKTASLFTRED 292 (517)
T ss_pred CCHHHHHHHHHhhcCCC
Confidence 99999999998776544
No 114
>KOG0543|consensus
Probab=97.55 E-value=0.0013 Score=56.16 Aligned_cols=100 Identities=11% Similarity=0.122 Sum_probs=83.3
Q ss_pred cchhHHHHHHHHH---HHHHH-HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCC
Q psy16144 4 LSVDTMQIIQTIA---EWHFL-LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGT 69 (122)
Q Consensus 4 ~~~~~~~~~~~~~---~~~~L-Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~ 69 (122)
.+.+..+..+.+- -+++- -|+..++|+.|.+.+. +|...|.+.||..+- |++|..-|++|.++ .|
T Consensus 244 ~~~ee~~~~~~~k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~-~P 322 (397)
T KOG0543|consen 244 FDEEEQKKAEALKLACHLNLAACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQALLALGEYDLARDDFQKALKL-EP 322 (397)
T ss_pred CCHHHHHHHHHHHHHHhhHHHHHHHhhhhHHHHHHHHHHHHhcCCCchhHHHHHHHHHHhhccHHHHHHHHHHHHHh-CC
Confidence 4445555554432 22332 8999999999999999 999999999999833 99999999999999 79
Q ss_pred CChHHHHHHHHHHHhcccHHHH-HHHHHHHHHhCCC
Q psy16144 70 SDLALHQKLASLLLRMKRYAEA-IDVSQTIMKAHPS 104 (122)
Q Consensus 70 ~~~~vgykLA~~ylk~kr~~~A-I~vc~kVL~~~P~ 104 (122)
+|-+|--.|..+-.|.+++.+. -++|++.++.-+.
T Consensus 323 ~Nka~~~el~~l~~k~~~~~~kekk~y~~mF~k~~~ 358 (397)
T KOG0543|consen 323 SNKAARAELIKLKQKIREYEEKEKKMYANMFAKLAE 358 (397)
T ss_pred CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc
Confidence 9999999999999999999987 7889998876653
No 115
>KOG4234|consensus
Probab=97.55 E-value=0.00089 Score=53.65 Aligned_cols=100 Identities=15% Similarity=0.171 Sum_probs=82.2
Q ss_pred HHHHHHHHH-----HhcCCCChHHHHHHHH-HHhhCcchHHHHh---hC-----------HHHHHHHHHHHHHhhCCCCh
Q psy16144 13 QTIAEWHFL-----LYLLPWRWEPLSAGLR-KKQYISSATNIYN---LF-----------PDDAVVQYTQACRIIGTSDL 72 (122)
Q Consensus 13 ~~~~~~~~L-----Lyi~~gk~d~A~elL~-~l~~nps~~kA~e---~l-----------y~dAa~~ye~Aw~l~~~~~~ 72 (122)
+.+++..-| =....|.|++|..-.. +|..-|+|+--.. |. ++.|+.---+|+++ +|+..
T Consensus 90 k~~~kad~lK~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel-~pty~ 168 (271)
T KOG4234|consen 90 KAIEKADSLKKEGNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIEL-NPTYE 168 (271)
T ss_pred HHHHHHHHHHHHHHHhhhcccHHHHHHHHHHHHHhCccccHHHHHHHHhhhHHHHHHhhhHHHHHHHHHhhHhc-CchhH
Confidence 345555445 2344599999999999 9999998765211 22 88999999999999 89999
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il 113 (122)
.+.-+=|+.|.+..+|++|++=|.++|..+|..-.+|+-|+
T Consensus 169 kAl~RRAeayek~ek~eealeDyKki~E~dPs~~ear~~i~ 209 (271)
T KOG4234|consen 169 KALERRAEAYEKMEKYEEALEDYKKILESDPSRREAREAIA 209 (271)
T ss_pred HHHHHHHHHHHhhhhHHHHHHHHHHHHHhCcchHHHHHHHH
Confidence 99999999999999999999999999999998877776664
No 116
>KOG0550|consensus
Probab=97.55 E-value=0.00016 Score=62.30 Aligned_cols=83 Identities=19% Similarity=0.101 Sum_probs=71.9
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHhhC---------------------HHHHHHHHHHHHHhhCCCChHH----HHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYNLF---------------------PDDAVVQYTQACRIIGTSDLAL----HQKL 78 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e~l---------------------y~dAa~~ye~Aw~l~~~~~~~v----gykL 78 (122)
-.+..|+|...++ .|...|++.++--.. |.+|..+|..|+.+ +|+|... .++.
T Consensus 215 y~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le~~k~~gN~~fk~G~y~~A~E~Yteal~i-dP~n~~~naklY~nr 293 (486)
T KOG0550|consen 215 YNDNADKAINHFQQALRLDPDHQKSKSASMMPKKLEVKKERGNDAFKNGNYRKAYECYTEALNI-DPSNKKTNAKLYGNR 293 (486)
T ss_pred cccchHHHHHHHhhhhccChhhhhHHhHhhhHHHHHHHHhhhhhHhhccchhHHHHHHHHhhcC-CccccchhHHHHHHh
Confidence 4477899999999 999999987765511 99999999999999 6887655 6778
Q ss_pred HHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 79 ASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|-+..+.||..|||..|..+|+++|.|-+.
T Consensus 294 a~v~~rLgrl~eaisdc~~Al~iD~syika 323 (486)
T KOG0550|consen 294 ALVNIRLGRLREAISDCNEALKIDSSYIKA 323 (486)
T ss_pred HhhhcccCCchhhhhhhhhhhhcCHHHHHH
Confidence 999999999999999999999999999776
No 117
>PF13428 TPR_14: Tetratricopeptide repeat
Probab=97.48 E-value=0.0002 Score=42.07 Aligned_cols=35 Identities=14% Similarity=0.086 Sum_probs=30.2
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhh
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNL 51 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~ 51 (122)
-|..+ .|.+.|++|+|++.++ +++.+|+++.+|..
T Consensus 3 ~~~~la~~~~~~G~~~~A~~~~~~~l~~~P~~~~a~~~ 40 (44)
T PF13428_consen 3 AWLALARAYRRLGQPDEAERLLRRALALDPDDPEAWRA 40 (44)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHCcCCHHHHHH
Confidence 35566 8999999999999999 99999999998864
No 118
>KOG0547|consensus
Probab=97.48 E-value=0.00019 Score=62.98 Aligned_cols=55 Identities=20% Similarity=0.317 Sum_probs=53.1
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|++|+.+|.+|++++ |+-|...-+.|.+|.-.|++.+-|+-|-++|+++|+|.|+
T Consensus 131 Y~eAIkyY~~AI~l~-p~epiFYsNraAcY~~lgd~~~Vied~TkALEl~P~Y~KA 185 (606)
T KOG0547|consen 131 YDEAIKYYTQAIELC-PDEPIFYSNRAACYESLGDWEKVIEDCTKALELNPDYVKA 185 (606)
T ss_pred HHHHHHHHHHHHhcC-CCCchhhhhHHHHHHHHhhHHHHHHHHHHHhhcCcHHHHH
Confidence 999999999999995 9889999999999999999999999999999999999987
No 119
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=97.46 E-value=0.00089 Score=45.72 Aligned_cols=58 Identities=22% Similarity=0.356 Sum_probs=48.4
Q ss_pred HHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC--HHHHHHHHH
Q psy16144 56 AVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY--PKVHKQIFE 114 (122)
Q Consensus 56 Aa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y--pkir~~il~ 114 (122)
....++++.+- +|+|+.+-|.||-.++..|++++|++.+-.+++.+|++ -..|+..++
T Consensus 7 ~~~al~~~~a~-~P~D~~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~~~~~ar~~ll~ 66 (90)
T PF14561_consen 7 DIAALEAALAA-NPDDLDARYALADALLAAGDYEEALDQLLELVRRDRDYEDDAARKRLLD 66 (90)
T ss_dssp HHHHHHHHHHH-STT-HHHHHHHHHHHHHTT-HHHHHHHHHHHHCC-TTCCCCHHHHHHHH
T ss_pred cHHHHHHHHHc-CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccccccHHHHHHHH
Confidence 35678889998 89999999999999999999999999999999999988 666666554
No 120
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=97.46 E-value=0.0003 Score=35.16 Aligned_cols=34 Identities=18% Similarity=0.339 Sum_probs=30.8
Q ss_pred hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 72 LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 72 ~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
+.+.+.+|..|...|++.+|+..++++++.+|++
T Consensus 1 ~~~~~~~a~~~~~~~~~~~a~~~~~~~~~~~~~~ 34 (34)
T smart00028 1 AEALYNLGNAYLKLGDYDEALEYYEKALELDPNN 34 (34)
T ss_pred ChHHHHHHHHHHHHhhHHHHHHHHHHHHccCCCC
Confidence 3577899999999999999999999999999864
No 121
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=97.46 E-value=0.00071 Score=57.46 Aligned_cols=73 Identities=16% Similarity=0.150 Sum_probs=57.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+++..++-.+|..+++ +++.+|.++..+. ++ |+.|....++|.++ .|++....+.||.+|.+.|+|+.|
T Consensus 209 v~l~~~~E~~AI~ll~~aL~~~p~d~~LL~~Qa~fLl~k~~~~lAL~iAk~av~l-sP~~f~~W~~La~~Yi~~~d~e~A 287 (395)
T PF09295_consen 209 VYLLMNEEVEAIRLLNEALKENPQDSELLNLQAEFLLSKKKYELALEIAKKAVEL-SPSEFETWYQLAECYIQLGDFENA 287 (395)
T ss_pred HHHhcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHh-CchhHHHHHHHHHHHHhcCCHHHH
Confidence 6777778888888888 8888887655443 22 88888888888888 488888888888888888888888
Q ss_pred HHHH
Q psy16144 92 IDVS 95 (122)
Q Consensus 92 I~vc 95 (122)
+.+.
T Consensus 288 LlaL 291 (395)
T PF09295_consen 288 LLAL 291 (395)
T ss_pred HHHH
Confidence 8743
No 122
>PRK11906 transcriptional regulator; Provisional
Probab=97.45 E-value=0.0007 Score=58.67 Aligned_cols=56 Identities=7% Similarity=-0.089 Sum_probs=41.8
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
-.+|...-++|.++ +|+||.+.+.||+.+.-.+++..|+..+++++..+||++.++
T Consensus 320 ~~~a~~~A~rAvel-d~~Da~a~~~~g~~~~~~~~~~~a~~~f~rA~~L~Pn~A~~~ 375 (458)
T PRK11906 320 AQKALELLDYVSDI-TTVDGKILAIMGLITGLSGQAKVSHILFEQAKIHSTDIASLY 375 (458)
T ss_pred HHHHHHHHHHHHhc-CCCCHHHHHHHHHHHHhhcchhhHHHHHHHHhhcCCccHHHH
Confidence 56677777777777 677777777777777777777777777777777777777654
No 123
>PF13525 YfiO: Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=97.39 E-value=0.0015 Score=49.69 Aligned_cols=58 Identities=19% Similarity=0.214 Sum_probs=48.5
Q ss_pred HHHHHHHHHHHHHhhCCC---ChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTS---DLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~---~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|.+|+..|+..... .|. -+.+.+.||.+|.+.|+|.+|+..+++.++.+|++|.+-.-
T Consensus 21 y~~Ai~~f~~l~~~-~P~s~~a~~A~l~la~a~y~~~~y~~A~~~~~~fi~~yP~~~~~~~A 81 (203)
T PF13525_consen 21 YEEAIKLFEKLIDR-YPNSPYAPQAQLMLAYAYYKQGDYEEAIAAYERFIKLYPNSPKADYA 81 (203)
T ss_dssp HHHHHHHHHHHHHH--TTSTTHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHH-TT-TTHHHH
T ss_pred HHHHHHHHHHHHHH-CCCChHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCCcchhhH
Confidence 99999999999987 354 45778999999999999999999999999999999987543
No 124
>PF09976 TPR_21: Tetratricopeptide repeat; InterPro: IPR018704 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=97.37 E-value=0.0042 Score=44.37 Aligned_cols=76 Identities=14% Similarity=0.048 Sum_probs=62.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchH---HHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSAT---NIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRY 88 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~---kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~ 88 (122)
.+...|++++|.+.|+ ++...|+.. .+...+ |++|...++.. . .++-.+.+...+|..|++.|++
T Consensus 57 ~~~~~g~~~~A~~~l~~~~~~~~d~~l~~~a~l~LA~~~~~~~~~d~Al~~L~~~-~-~~~~~~~~~~~~Gdi~~~~g~~ 134 (145)
T PF09976_consen 57 AAYEQGDYDEAKAALEKALANAPDPELKPLARLRLARILLQQGQYDEALATLQQI-P-DEAFKALAAELLGDIYLAQGDY 134 (145)
T ss_pred HHHHCCCHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHHcCCHHHHHHHHHhc-c-CcchHHHHHHHHHHHHHHCCCH
Confidence 8889999999999999 999876642 233322 99999999773 2 2466788899999999999999
Q ss_pred HHHHHHHHHHH
Q psy16144 89 AEAIDVSQTIM 99 (122)
Q Consensus 89 ~~AI~vc~kVL 99 (122)
.+|+..|+++|
T Consensus 135 ~~A~~~y~~Al 145 (145)
T PF09976_consen 135 DEARAAYQKAL 145 (145)
T ss_pred HHHHHHHHHhC
Confidence 99999999874
No 125
>KOG0547|consensus
Probab=97.37 E-value=0.0019 Score=56.93 Aligned_cols=86 Identities=17% Similarity=0.285 Sum_probs=79.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhh-----C----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNL-----F----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~-----l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|++....++-...|. +..+||.++..|.- + |++|..-|++|..+ +|.|.-...-|++..-|.+++.++
T Consensus 369 ~y~d~~~~~~~~~~F~~A~~ldp~n~dvYyHRgQm~flL~q~e~A~aDF~Kai~L-~pe~~~~~iQl~~a~Yr~~k~~~~ 447 (606)
T KOG0547|consen 369 AYADENQSEKMWKDFNKAEDLDPENPDVYYHRGQMRFLLQQYEEAIADFQKAISL-DPENAYAYIQLCCALYRQHKIAES 447 (606)
T ss_pred HHhhhhccHHHHHHHHHHHhcCCCCCchhHhHHHHHHHHHHHHHHHHHHHHHhhc-ChhhhHHHHHHHHHHHHHHHHHHH
Confidence 8999999999999999 99999999988871 1 99999999999999 899999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHH
Q psy16144 92 IDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 92 I~vc~kVL~~~P~ypki 108 (122)
...++.+-+..|+-|.+
T Consensus 448 m~~Fee~kkkFP~~~Ev 464 (606)
T KOG0547|consen 448 MKTFEEAKKKFPNCPEV 464 (606)
T ss_pred HHHHHHHHHhCCCCchH
Confidence 99999999999998876
No 126
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=97.30 E-value=0.00056 Score=37.27 Aligned_cols=33 Identities=15% Similarity=0.308 Sum_probs=30.2
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
.+.+.+|..|.+.|++.+|+..++++++.+|+|
T Consensus 2 ~~~~~lg~~y~~~~~~~~A~~~~~~a~~~~~~n 34 (34)
T PF13181_consen 2 EAYYNLGKIYEQLGDYEEALEYFEKALELNPDN 34 (34)
T ss_dssp HHHHHHHHHHHHTTSHHHHHHHHHHHHHHHTT-
T ss_pred HHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCC
Confidence 467899999999999999999999999999975
No 127
>KOG1128|consensus
Probab=97.30 E-value=0.002 Score=58.54 Aligned_cols=101 Identities=14% Similarity=0.189 Sum_probs=78.0
Q ss_pred HHHHHHHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchH-----------------HHHhhC----------------
Q psy16144 9 MQIIQTIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSAT-----------------NIYNLF---------------- 52 (122)
Q Consensus 9 ~~~~~~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~-----------------kA~e~l---------------- 52 (122)
.+|...+.-|--. -|...|+.+.|++.++ -++ +|+.+ +|||+.
T Consensus 418 l~I~Erlemw~~vi~CY~~lg~~~kaeei~~q~le-k~~d~~lyc~LGDv~~d~s~yEkawElsn~~sarA~r~~~~~~~ 496 (777)
T KOG1128|consen 418 LVIFERLEMWDPVILCYLLLGQHGKAEEINRQELE-KDPDPRLYCLLGDVLHDPSLYEKAWELSNYISARAQRSLALLIL 496 (777)
T ss_pred HHHHHhHHHHHHHHHHHHHhcccchHHHHHHHHhc-CCCcchhHHHhhhhccChHHHHHHHHHhhhhhHHHHHhhccccc
Confidence 3455555555544 6788899999999999 888 44433 344411
Q ss_pred ----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 53 ----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 53 ----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
|.+|-.+++...++ +|-.+...|.+|.+.++.+++..|...|+..+...||+...+..
T Consensus 497 ~~~~fs~~~~hle~sl~~-nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~eaWnN 558 (777)
T KOG1128|consen 497 SNKDFSEADKHLERSLEI-NPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEAWNN 558 (777)
T ss_pred cchhHHHHHHHHHHHhhc-CccchhHHHhccHHHHHHhhhHHHHHHHHHHhhcCCCchhhhhh
Confidence 78888888888888 78888889999999999999999999999999999988877764
No 128
>PF09295 ChAPs: ChAPs (Chs5p-Arf1p-binding proteins); InterPro: IPR015374 ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialised cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they also form interactions with each other [].
Probab=97.30 E-value=0.0053 Score=52.21 Aligned_cols=96 Identities=17% Similarity=0.219 Sum_probs=80.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh--hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN--LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
+.-..++++.|.++|+ ..+.+|+.....- ++ -.+|+....++++. +|.+.......|..+++.|+++.|+.+
T Consensus 178 ~l~~t~~~~~ai~lle~L~~~~pev~~~LA~v~l~~~~E~~AI~ll~~aL~~-~p~d~~LL~~Qa~fLl~k~~~~lAL~i 256 (395)
T PF09295_consen 178 YLSLTQRYDEAIELLEKLRERDPEVAVLLARVYLLMNEEVEAIRLLNEALKE-NPQDSELLNLQAEFLLSKKKYELALEI 256 (395)
T ss_pred HHhhcccHHHHHHHHHHHHhcCCcHHHHHHHHHHhcCcHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHhcCCHHHHHHH
Confidence 4445689999999999 9999998433322 11 88999999999988 799999999999999999999999999
Q ss_pred HHHHHHhCCCCHHHHHHHHHHHHhc
Q psy16144 95 SQTIMKAHPSYPKVHKQIFEKAMVH 119 (122)
Q Consensus 95 c~kVL~~~P~ypkir~~il~ka~~~ 119 (122)
.+++....|+..+.|.. |-+++..
T Consensus 257 Ak~av~lsP~~f~~W~~-La~~Yi~ 280 (395)
T PF09295_consen 257 AKKAVELSPSEFETWYQ-LAECYIQ 280 (395)
T ss_pred HHHHHHhCchhHHHHHH-HHHHHHh
Confidence 99999999999999864 6665543
No 129
>KOG0495|consensus
Probab=97.29 E-value=0.0024 Score=58.08 Aligned_cols=91 Identities=15% Similarity=0.100 Sum_probs=81.1
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhc
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRM 85 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~ 85 (122)
|.|. .+-..|+...|..+|. +.+.||.+-+.|..- |+.|-..+.+|-..+ |+ +.+.||-+..---.
T Consensus 587 wlM~ake~w~agdv~~ar~il~~af~~~pnseeiwlaavKle~en~e~eraR~llakar~~s-gT-eRv~mKs~~~er~l 664 (913)
T KOG0495|consen 587 WLMYAKEKWKAGDVPAARVILDQAFEANPNSEEIWLAAVKLEFENDELERARDLLAKARSIS-GT-ERVWMKSANLERYL 664 (913)
T ss_pred HHHHHHHHHhcCCcHHHHHHHHHHHHhCCCcHHHHHHHHHHhhccccHHHHHHHHHHHhccC-Cc-chhhHHHhHHHHHh
Confidence 5555 6777899999999999 999999999999832 999999999999984 54 89999999998899
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
+..++|+.+|+..|+.+|+++|++.
T Consensus 665 d~~eeA~rllEe~lk~fp~f~Kl~l 689 (913)
T KOG0495|consen 665 DNVEEALRLLEEALKSFPDFHKLWL 689 (913)
T ss_pred hhHHHHHHHHHHHHHhCCchHHHHH
Confidence 9999999999999999999999864
No 130
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=97.28 E-value=0.0021 Score=56.52 Aligned_cols=90 Identities=17% Similarity=0.189 Sum_probs=69.6
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHhhC-------HHHHHHHHHHHHHhhC--------------------CCC----h
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYNLF-------PDDAVVQYTQACRIIG--------------------TSD----L 72 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e~l-------y~dAa~~ye~Aw~l~~--------------------~~~----~ 72 (122)
+......-.++=+ +|+.||+|+.||-.+ ..||...|++|++.+. .++ +
T Consensus 180 RERnp~aRIkaA~eALei~pdCAdAYILLAEEeA~Ti~Eae~l~rqAvkAgE~~lg~s~~~~~~g~~~e~~~~Rdt~~~~ 259 (539)
T PF04184_consen 180 RERNPQARIKAAKEALEINPDCADAYILLAEEEASTIVEAEELLRQAVKAGEASLGKSQFLQHHGHFWEAWHRRDTNVLV 259 (539)
T ss_pred hcCCHHHHHHHHHHHHHhhhhhhHHHhhcccccccCHHHHHHHHHHHHHHHHHhhchhhhhhcccchhhhhhccccchhh
Confidence 4556666677777 999999999999877 7788888888876521 011 3
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHhCCC--CHHHHHHHHH
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKAHPS--YPKVHKQIFE 114 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~--ypkir~~il~ 114 (122)
-+-.+||-+.-|.||..|||+.++..++.+|. .--||...++
T Consensus 260 y~KrRLAmCarklGr~~EAIk~~rdLlke~p~~~~l~IrenLie 303 (539)
T PF04184_consen 260 YAKRRLAMCARKLGRLREAIKMFRDLLKEFPNLDNLNIRENLIE 303 (539)
T ss_pred hhHHHHHHHHHHhCChHHHHHHHHHHHhhCCccchhhHHHHHHH
Confidence 35678999999999999999999999999984 5667765444
No 131
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=97.26 E-value=0.0021 Score=46.15 Aligned_cols=88 Identities=20% Similarity=0.059 Sum_probs=61.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh--------hC----HHHHHHHHHHHHHhhCCC---ChHHHHHHHHHHHhc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN--------LF----PDDAVVQYTQACRIIGTS---DLALHQKLASLLLRM 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--------~l----y~dAa~~ye~Aw~l~~~~---~~~vgykLA~~ylk~ 85 (122)
.+-..|+.++|+.+++ ++..-.+.....+ +. +++|...++++..- .|+ +..+-.-+|-.+...
T Consensus 10 a~d~~G~~~~Ai~~Y~~Al~~gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~-~p~~~~~~~l~~f~Al~L~~~ 88 (120)
T PF12688_consen 10 AHDSLGREEEAIPLYRRALAAGLSGADRRRALIQLASTLRNLGRYDEALALLEEALEE-FPDDELNAALRVFLALALYNL 88 (120)
T ss_pred HHHhcCCHHHHHHHHHHHHHcCCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-CCCccccHHHHHHHHHHHHHC
Confidence 5667788888888888 8876555432222 11 88888888888776 577 777777788888888
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir~~i 112 (122)
||+.+|++.+-.+|..+ -|+-++-|
T Consensus 89 gr~~eAl~~~l~~la~~--~~~y~ra~ 113 (120)
T PF12688_consen 89 GRPKEALEWLLEALAET--LPRYRRAI 113 (120)
T ss_pred CCHHHHHHHHHHHHHHH--HHHHHHHH
Confidence 88888888887777633 33444444
No 132
>KOG2003|consensus
Probab=97.22 E-value=0.0032 Score=55.66 Aligned_cols=89 Identities=12% Similarity=0.109 Sum_probs=79.3
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAI 92 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI 92 (122)
..-.|++|.|.+..+ +|.-+.||..|..-. .++|..+|-+.-.+. -++..|.+.+|..|........||
T Consensus 500 ~f~ngd~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~klh~il-~nn~evl~qianiye~led~aqai 578 (840)
T KOG2003|consen 500 AFANGDLDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKLHAIL-LNNAEVLVQIANIYELLEDPAQAI 578 (840)
T ss_pred eeecCcHHHHHHHHHHHHcCchHHHHHHHHhcccHHHhcCHHHHHHHHHHHHHHH-HhhHHHHHHHHHHHHHhhCHHHHH
Confidence 334599999999999 999999999998833 899999999888875 789999999999999999999999
Q ss_pred HHHHHHHHhCCCCHHHHHHH
Q psy16144 93 DVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 93 ~vc~kVL~~~P~ypkir~~i 112 (122)
+.+-++-..-|+.|.|-...
T Consensus 579 e~~~q~~slip~dp~ilskl 598 (840)
T KOG2003|consen 579 ELLMQANSLIPNDPAILSKL 598 (840)
T ss_pred HHHHHhcccCCCCHHHHHHH
Confidence 99999999999999885443
No 133
>KOG1840|consensus
Probab=97.19 E-value=0.003 Score=55.33 Aligned_cols=78 Identities=17% Similarity=0.234 Sum_probs=58.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHH---------------HHhhC--HHHHHHHHHHHHH---hhCCCChH---HHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATN---------------IYNLF--PDDAVVQYTQACR---IIGTSDLA---LHQK 77 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~k---------------A~e~l--y~dAa~~ye~Aw~---l~~~~~~~---vgyk 77 (122)
+|..+|||++|+++++ +++.+.+... +++.+ |.+|...|+.+.. .++|+.|. ..-+
T Consensus 376 l~~~~gk~~ea~~~~k~ai~~~~~~~~~~~~~~~~~l~~la~~~~~~k~~~~a~~l~~~~~~i~~~~g~~~~~~~~~~~n 455 (508)
T KOG1840|consen 376 LYLKMGKYKEAEELYKKAIQILRELLGKKDYGVGKPLNQLAEAYEELKKYEEAEQLFEEAKDIMKLCGPDHPDVTYTYLN 455 (508)
T ss_pred HHHHhcchhHHHHHHHHHHHHHHhcccCcChhhhHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHhCCCCCchHHHHHH
Confidence 8999999999999999 9988754221 12222 6666666665543 23555554 4568
Q ss_pred HHHHHHhcccHHHHHHHHHHHH
Q psy16144 78 LASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 78 LA~~ylk~kr~~~AI~vc~kVL 99 (122)
||..|-.+|||++|+++-++|+
T Consensus 456 L~~~Y~~~g~~e~a~~~~~~~~ 477 (508)
T KOG1840|consen 456 LAALYRAQGNYEAAEELEEKVL 477 (508)
T ss_pred HHHHHHHcccHHHHHHHHHHHH
Confidence 9999999999999999999998
No 134
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=97.19 E-value=0.0013 Score=56.94 Aligned_cols=42 Identities=14% Similarity=0.131 Sum_probs=39.9
Q ss_pred hCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 67 IGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 67 ~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
.+|+++...++||..|.+.|+|++|++.|+++|+++|+++..
T Consensus 70 ~dP~~a~a~~NLG~AL~~lGryeEAIa~f~rALeL~Pd~aeA 111 (453)
T PLN03098 70 ADVKTAEDAVNLGLSLFSKGRVKDALAQFETALELNPNPDEA 111 (453)
T ss_pred CCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHhhCCCchHH
Confidence 489999999999999999999999999999999999999854
No 135
>KOG1173|consensus
Probab=97.18 E-value=0.003 Score=56.11 Aligned_cols=91 Identities=16% Similarity=0.159 Sum_probs=76.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
-|.-.|++++|..+|- +...||+.+.||-.. .++|..+|-.|-++- |...-..+=||-=|++.+.+.-|
T Consensus 321 YYl~i~k~seARry~SKat~lD~~fgpaWl~fghsfa~e~EhdQAmaaY~tAarl~-~G~hlP~LYlgmey~~t~n~kLA 399 (611)
T KOG1173|consen 321 YYLMIGKYSEARRYFSKATTLDPTFGPAWLAFGHSFAGEGEHDQAMAAYFTAARLM-PGCHLPSLYLGMEYMRTNNLKLA 399 (611)
T ss_pred HHHHhcCcHHHHHHHHHHhhcCccccHHHHHHhHHhhhcchHHHHHHHHHHHHHhc-cCCcchHHHHHHHHHHhccHHHH
Confidence 4556699999999999 999999999999955 889999999998885 55555556678888999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il 113 (122)
..-+.+++.+.|+.|-+..|+=
T Consensus 400 e~Ff~~A~ai~P~Dplv~~Elg 421 (611)
T KOG1173|consen 400 EKFFKQALAIAPSDPLVLHELG 421 (611)
T ss_pred HHHHHHHHhcCCCcchhhhhhh
Confidence 9999999999999998887763
No 136
>PRK15331 chaperone protein SicA; Provisional
Probab=97.15 E-value=0.0018 Score=49.33 Aligned_cols=55 Identities=15% Similarity=0.111 Sum_probs=52.6
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|++|...|+-.+-+ +|-|+...+-||.++...|+|.+||+.|--.....++.|+.
T Consensus 53 ~~eA~~~F~~L~~~-d~~n~~Y~~GLaa~~Q~~k~y~~Ai~~Y~~A~~l~~~dp~p 107 (165)
T PRK15331 53 LDEAETFFRFLCIY-DFYNPDYTMGLAAVCQLKKQFQKACDLYAVAFTLLKNDYRP 107 (165)
T ss_pred HHHHHHHHHHHHHh-CcCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccCCCCc
Confidence 99999999999999 89999999999999999999999999999999999988874
No 137
>KOG2002|consensus
Probab=97.14 E-value=0.0023 Score=59.56 Aligned_cols=88 Identities=11% Similarity=0.065 Sum_probs=71.7
Q ss_pred cCCCChHHHHHHHH-HHhhCcchHHHHhhC----------------------------------------------HHHH
Q psy16144 24 LLPWRWEPLSAGLR-KKQYISSATNIYNLF----------------------------------------------PDDA 56 (122)
Q Consensus 24 i~~gk~d~A~elL~-~l~~nps~~kA~e~l----------------------------------------------y~dA 56 (122)
-....++.|..++. +-..|+.||.+...+ |+.|
T Consensus 247 ~d~~s~~~~~~ll~~ay~~n~~nP~~l~~LAn~fyfK~dy~~v~~la~~ai~~t~~~~~~aes~Y~~gRs~Ha~Gd~ekA 326 (1018)
T KOG2002|consen 247 NDSDSYKKGVQLLQRAYKENNENPVALNHLANHFYFKKDYERVWHLAEHAIKNTENKSIKAESFYQLGRSYHAQGDFEKA 326 (1018)
T ss_pred cchHHHHHHHHHHHHHHhhcCCCcHHHHHHHHHHhhcccHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHhhccHHHH
Confidence 34456777777777 777777777665522 9999
Q ss_pred HHHHHHHHHhhCCCC-hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy16144 57 VVQYTQACRIIGTSD-LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 57 a~~ye~Aw~l~~~~~-~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il 113 (122)
-.+|.+|... ++++ .=..+.||..|++.|.+..|+-++++|++++|+.+...+ ||
T Consensus 327 ~~yY~~s~k~-~~d~~~l~~~GlgQm~i~~~dle~s~~~fEkv~k~~p~~~etm~-iL 382 (1018)
T KOG2002|consen 327 FKYYMESLKA-DNDNFVLPLVGLGQMYIKRGDLEESKFCFEKVLKQLPNNYETMK-IL 382 (1018)
T ss_pred HHHHHHHHcc-CCCCccccccchhHHHHHhchHHHHHHHHHHHHHhCcchHHHHH-HH
Confidence 9999999999 5666 777889999999999999999999999999999988764 44
No 138
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=97.07 E-value=0.00061 Score=38.55 Aligned_cols=29 Identities=14% Similarity=0.221 Sum_probs=23.7
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 75 HQKLASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 75 gykLA~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
...||..|.+.|+|.+||++|+++|...+
T Consensus 2 l~~Lg~~~~~~g~~~~Ai~~y~~aL~l~~ 30 (36)
T PF13176_consen 2 LNNLGRIYRQQGDYEKAIEYYEQALALAR 30 (36)
T ss_dssp HHHHHHHHHHCT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHHhcc
Confidence 46899999999999999999999664443
No 139
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=97.06 E-value=0.003 Score=50.18 Aligned_cols=75 Identities=24% Similarity=0.334 Sum_probs=53.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---HHHHHHHHHHHHHhhCCCC-----hHHHHHHHHHHHhcccHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---PDDAVVQYTQACRIIGTSD-----LALHQKLASLLLRMKRYAEAI 92 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---y~dAa~~ye~Aw~l~~~~~-----~~vgykLA~~ylk~kr~~~AI 92 (122)
+|...|+++.|-+.+. + ++.++-. +++|..+|++|.++-...+ ..+..++|..+.+.|+|.+|+
T Consensus 103 ~y~~~G~~~~aA~~~~~l-------A~~ye~~~~d~e~Ai~~Y~~A~~~y~~e~~~~~a~~~~~~~A~l~~~l~~y~~A~ 175 (282)
T PF14938_consen 103 IYREAGRFSQAAKCLKEL-------AEIYEEQLGDYEKAIEYYQKAAELYEQEGSPHSAAECLLKAADLYARLGRYEEAI 175 (282)
T ss_dssp HHHHCT-HHHHHHHHHHH-------HHHHCCTT--HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTT-HHHHH
T ss_pred HHHhcCcHHHHHHHHHHH-------HHHHHHHcCCHHHHHHHHHHHHHHHHHCCChhhHHHHHHHHHHHHHHhCCHHHHH
Confidence 6666666666666655 3 2333322 8999999999998743333 234578999999999999999
Q ss_pred HHHHHHHHhCC
Q psy16144 93 DVSQTIMKAHP 103 (122)
Q Consensus 93 ~vc~kVL~~~P 103 (122)
++|+++....-
T Consensus 176 ~~~e~~~~~~l 186 (282)
T PF14938_consen 176 EIYEEVAKKCL 186 (282)
T ss_dssp HHHHHHHHTCC
T ss_pred HHHHHHHHHhh
Confidence 99999997644
No 140
>PF12688 TPR_5: Tetratrico peptide repeat
Probab=97.06 E-value=0.016 Score=41.60 Aligned_cols=56 Identities=25% Similarity=0.371 Sum_probs=49.6
Q ss_pred HHHHHHHHHHHHHhhCCCC---hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCC---CHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSD---LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPS---YPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~---~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~---ypkir 109 (122)
.++|+..|++|.+. +.+. ..+...||..|...|++++|+.+.++.+..+|+ ++.++
T Consensus 17 ~~~Ai~~Y~~Al~~-gL~~~~~~~a~i~lastlr~LG~~deA~~~L~~~~~~~p~~~~~~~l~ 78 (120)
T PF12688_consen 17 EEEAIPLYRRALAA-GLSGADRRRALIQLASTLRNLGRYDEALALLEEALEEFPDDELNAALR 78 (120)
T ss_pred HHHHHHHHHHHHHc-CCCchHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHCCCccccHHHH
Confidence 99999999999997 3444 567889999999999999999999999999998 66665
No 141
>KOG1128|consensus
Probab=97.03 E-value=0.0029 Score=57.52 Aligned_cols=90 Identities=11% Similarity=0.124 Sum_probs=78.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+..++++|.+|.+.|+ .++.||-....|... ++.|+..|..+..+ .|++....-+|+-.|++.|+-.+|
T Consensus 494 ~~~~~~~fs~~~~hle~sl~~nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL-~Pd~~eaWnNls~ayi~~~~k~ra 572 (777)
T KOG1128|consen 494 LILSNKDFSEADKHLERSLEINPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTL-EPDNAEAWNNLSTAYIRLKKKKRA 572 (777)
T ss_pred ccccchhHHHHHHHHHHHhhcCccchhHHHhccHHHHHHhhhHHHHHHHHHHhhc-CCCchhhhhhhhHHHHHHhhhHHH
Confidence 4455799999999999 999999988888844 99999999999999 699999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~i 112 (122)
-.+..++|+-+-++++|+.+=
T Consensus 573 ~~~l~EAlKcn~~~w~iWENy 593 (777)
T KOG1128|consen 573 FRKLKEALKCNYQHWQIWENY 593 (777)
T ss_pred HHHHHHHhhcCCCCCeeeech
Confidence 999999999888888887653
No 142
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=97.02 E-value=0.004 Score=50.35 Aligned_cols=56 Identities=13% Similarity=0.142 Sum_probs=46.1
Q ss_pred HHHHHHHHHHHHHhhCCC---ChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTS---DLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~---~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
|++|+..|+..-.- .|. .+.+-..||+++.|.++|++|+..-.+-+++||++|.+-
T Consensus 50 ~~~A~~~fe~l~~~-~p~s~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~d 108 (254)
T COG4105 50 YEEAIKYFEALDSR-HPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNAD 108 (254)
T ss_pred HHHHHHHHHHHHHc-CCCCcccHHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCCChh
Confidence 66777777666543 344 357889999999999999999999999999999999873
No 143
>KOG0624|consensus
Probab=96.99 E-value=0.0017 Score=55.60 Aligned_cols=83 Identities=10% Similarity=0.032 Sum_probs=73.3
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
-.|.+..|...+. +++.||.+..|+. |+ -+.|..-+.+.+++ +|++-.+-.-=|.++||+|+++.|++=
T Consensus 50 a~~Q~sDALt~yHaAve~dp~~Y~aifrRaT~yLAmGksk~al~Dl~rVlel-KpDF~~ARiQRg~vllK~Gele~A~~D 128 (504)
T KOG0624|consen 50 ARGQLSDALTHYHAAVEGDPNNYQAIFRRATVYLAMGKSKAALQDLSRVLEL-KPDFMAARIQRGVVLLKQGELEQAEAD 128 (504)
T ss_pred HhhhHHHHHHHHHHHHcCCchhHHHHHHHHHHHhhhcCCccchhhHHHHHhc-CccHHHHHHHhchhhhhcccHHHHHHH
Confidence 3488999999999 9999999988887 33 67888899999999 799999999999999999999999999
Q ss_pred HHHHHHhCCCCHHH
Q psy16144 95 SQTIMKAHPSYPKV 108 (122)
Q Consensus 95 c~kVL~~~P~ypki 108 (122)
+++||..+|++--.
T Consensus 129 F~~vl~~~~s~~~~ 142 (504)
T KOG0624|consen 129 FDQVLQHEPSNGLV 142 (504)
T ss_pred HHHHHhcCCCcchh
Confidence 99999999966443
No 144
>KOG1174|consensus
Probab=96.97 E-value=0.0043 Score=54.04 Aligned_cols=79 Identities=15% Similarity=0.151 Sum_probs=59.2
Q ss_pred hHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHH
Q psy16144 29 WEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 29 ~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kV 98 (122)
-++|.+++. .|++||.-..|-... ++|++..+|+++.. ..|....-+||..+--.+.+.+|.+-|.++
T Consensus 420 rEKAKkf~ek~L~~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~--~~D~~LH~~Lgd~~~A~Ne~Q~am~~y~~A 497 (564)
T KOG1174|consen 420 REKAKKFAEKSLKINPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLII--FPDVNLHNHLGDIMRAQNEPQKAMEYYYKA 497 (564)
T ss_pred HHHHHHHHHhhhccCCccHHHHHHHHHHHHhhCccchHHHHHHHHHhh--ccccHHHHHHHHHHHHhhhHHHHHHHHHHH
Confidence 466777777 777777766665522 88888888888776 456778888888888888888888888888
Q ss_pred HHhCCCCHHHH
Q psy16144 99 MKAHPSYPKVH 109 (122)
Q Consensus 99 L~~~P~ypkir 109 (122)
|+++|++-+..
T Consensus 498 Lr~dP~~~~sl 508 (564)
T KOG1174|consen 498 LRQDPKSKRTL 508 (564)
T ss_pred HhcCccchHHH
Confidence 88888776654
No 145
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=96.96 E-value=0.0025 Score=39.88 Aligned_cols=36 Identities=14% Similarity=0.188 Sum_probs=31.4
Q ss_pred HHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 74 LHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 74 vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
-.|-||..+.|.|+|.+|...|+.+|+..|+++.++
T Consensus 3 ~lY~lAig~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~ 38 (53)
T PF14853_consen 3 CLYYLAIGHYKLGEYEKARRYCDALLEIEPDNRQAQ 38 (53)
T ss_dssp HHHHHHHHHHHTT-HHHHHHHHHHHHHHTTS-HHHH
T ss_pred hHHHHHHHHHHhhhHHHHHHHHHHHHhhCCCcHHHH
Confidence 468899999999999999999999999999998874
No 146
>KOG4340|consensus
Probab=96.93 E-value=0.0049 Score=52.08 Aligned_cols=88 Identities=17% Similarity=0.158 Sum_probs=70.3
Q ss_pred Hh--cCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHH
Q psy16144 22 LY--LLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYA 89 (122)
Q Consensus 22 Ly--i~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~ 89 (122)
+| |+..||++|.+++. -.+.||.+..+...+ |..|+.||++...+ .|.-..--+--|..+-++++|.
T Consensus 17 iy~lI~d~ry~DaI~~l~s~~Er~p~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL~ql-~P~~~qYrlY~AQSLY~A~i~A 95 (459)
T KOG4340|consen 17 VYRLIRDARYADAIQLLGSELERSPRSRAGLSLLGYCYYRLQEFALAAECYEQLGQL-HPELEQYRLYQAQSLYKACIYA 95 (459)
T ss_pred HHHHHHHhhHHHHHHHHHHHHhcCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhh-ChHHHHHHHHHHHHHHHhcccH
Confidence 55 78899999999999 999999656655533 99999999999999 5766665566689999999999
Q ss_pred HHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 90 EAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 90 ~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
||+.|-..+ -|+|..+.+.+.
T Consensus 96 DALrV~~~~----~D~~~L~~~~lq 116 (459)
T KOG4340|consen 96 DALRVAFLL----LDNPALHSRVLQ 116 (459)
T ss_pred HHHHHHHHh----cCCHHHHHHHHH
Confidence 999955444 466778777764
No 147
>KOG0376|consensus
Probab=96.92 E-value=0.0017 Score=56.51 Aligned_cols=92 Identities=10% Similarity=0.078 Sum_probs=82.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
-....++||.|+.+.. +++.+|+|+..|. ++ |..|..=..+|.++ .|+..-..++=|...|+.++|-+|
T Consensus 13 ~~l~~~~fd~avdlysKaI~ldpnca~~~anRa~a~lK~e~~~~Al~Da~kaie~-dP~~~K~Y~rrg~a~m~l~~~~~A 91 (476)
T KOG0376|consen 13 EALKDKVFDVAVDLYSKAIELDPNCAIYFANRALAHLKVESFGGALHDALKAIEL-DPTYIKAYVRRGTAVMALGEFKKA 91 (476)
T ss_pred hhcccchHHHHHHHHHHHHhcCCcceeeechhhhhheeechhhhHHHHHHhhhhc-CchhhheeeeccHHHHhHHHHHHH
Confidence 3456689999999999 9999999987665 22 99999999999999 599999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ 114 (122)
...++++.+..|+-|++++-|-|
T Consensus 92 ~~~l~~~~~l~Pnd~~~~r~~~E 114 (476)
T KOG0376|consen 92 LLDLEKVKKLAPNDPDATRKIDE 114 (476)
T ss_pred HHHHHHhhhcCcCcHHHHHHHHH
Confidence 99999999999999999987654
No 148
>PF04733 Coatomer_E: Coatomer epsilon subunit; InterPro: IPR006822 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006890 retrograde vesicle-mediated transport, Golgi to ER, 0030126 COPI vesicle coat; PDB: 3MV2_B 3MV3_F 3MKR_A.
Probab=96.85 E-value=0.01 Score=48.16 Aligned_cols=99 Identities=16% Similarity=0.108 Sum_probs=75.2
Q ss_pred HHHHHHHHHHHHhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHH
Q psy16144 11 IIQTIAEWHFLLYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLAS 80 (122)
Q Consensus 11 ~~~~~~~~~~LLyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~ 80 (122)
++-.+++...-++....++.+|--.++ ....-++.+.....+ |++|...+++|.+. +|++|.+..+|+.
T Consensus 165 ~l~qLa~awv~l~~g~e~~~~A~y~f~El~~~~~~t~~~lng~A~~~l~~~~~~eAe~~L~~al~~-~~~~~d~LaNliv 243 (290)
T PF04733_consen 165 ILTQLAEAWVNLATGGEKYQDAFYIFEELSDKFGSTPKLLNGLAVCHLQLGHYEEAEELLEEALEK-DPNDPDTLANLIV 243 (290)
T ss_dssp HHHHHHHHHHHHHHTTTCCCHHHHHHHHHHCCS--SHHHHHHHHHHHHHCT-HHHHHHHHHHHCCC--CCHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHhCchhHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHh-ccCCHHHHHHHHH
Confidence 444456655667777789999999999 777656656655532 99999999999998 8999999999999
Q ss_pred HHHhcccHHH-HHHHHHHHHHhCCCCHHHHH
Q psy16144 81 LLLRMKRYAE-AIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 81 ~ylk~kr~~~-AI~vc~kVL~~~P~ypkir~ 110 (122)
+..-.|+..+ +.....+.-..+|++|-+..
T Consensus 244 ~~~~~gk~~~~~~~~l~qL~~~~p~h~~~~~ 274 (290)
T PF04733_consen 244 CSLHLGKPTEAAERYLSQLKQSNPNHPLVKD 274 (290)
T ss_dssp HHHHTT-TCHHHHHHHHHCHHHTTTSHHHHH
T ss_pred HHHHhCCChhHHHHHHHHHHHhCCCChHHHH
Confidence 9999999844 44556666668999998864
No 149
>KOG3060|consensus
Probab=96.84 E-value=0.018 Score=47.17 Aligned_cols=86 Identities=13% Similarity=0.034 Sum_probs=41.7
Q ss_pred CCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcc---cHHHHH
Q psy16144 26 PWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMK---RYAEAI 92 (122)
Q Consensus 26 ~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~k---r~~~AI 92 (122)
+||.-+|.+.|. -++..+++..||-++ |+.|+-|||...-. +|.+|...-+||-.+--.| ++.-|-
T Consensus 133 ~GK~l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~-~P~n~l~f~rlae~~Yt~gg~eN~~~ar 211 (289)
T KOG3060|consen 133 QGKNLEAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLEELLLI-QPFNPLYFQRLAEVLYTQGGAENLELAR 211 (289)
T ss_pred cCCcHHHHHHHHHHHHHhcCcHHHHHHHHHHHHhHhHHHHHHHHHHHHHHc-CCCcHHHHHHHHHHHHHHhhHHHHHHHH
Confidence 344445555555 555555555555422 55555555555544 3555555555554443322 233344
Q ss_pred HHHHHHHHhCCCCHHHHHHH
Q psy16144 93 DVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 93 ~vc~kVL~~~P~ypkir~~i 112 (122)
..|.++|+.+|.+-+..-.|
T Consensus 212 kyy~~alkl~~~~~ral~GI 231 (289)
T KOG3060|consen 212 KYYERALKLNPKNLRALFGI 231 (289)
T ss_pred HHHHHHHHhChHhHHHHHHH
Confidence 45555555555444444433
No 150
>PRK14720 transcript cleavage factor/unknown domain fusion protein; Provisional
Probab=96.83 E-value=0.0099 Score=55.38 Aligned_cols=93 Identities=10% Similarity=0.083 Sum_probs=75.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC----------------------------HHHHHHHHHHHHHhhCCCCh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF----------------------------PDDAVVQYTQACRIIGTSDL 72 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l----------------------------y~dAa~~ye~Aw~l~~~~~~ 72 (122)
.|-+.|++++|....+ +|+++|+++.+...+ |.++...+++-+++ +|.+.
T Consensus 125 ~Ydk~g~~~ka~~~yer~L~~D~~n~~aLNn~AY~~ae~dL~KA~~m~~KAV~~~i~~kq~~~~~e~W~k~~~~-~~~d~ 203 (906)
T PRK14720 125 AYAKLNENKKLKGVWERLVKADRDNPEIVKKLATSYEEEDKEKAITYLKKAIYRFIKKKQYVGIEEIWSKLVHY-NSDDF 203 (906)
T ss_pred HHHHcCChHHHHHHHHHHHhcCcccHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHhhhcchHHHHHHHHHHhc-Ccccc
Confidence 7778899999999999 999999998887733 55566666666666 67766
Q ss_pred HHH--------------------HHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy16144 73 ALH--------------------QKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEK 115 (122)
Q Consensus 73 ~vg--------------------ykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~k 115 (122)
... +-|=..|-+.+++.++|.+.+.+|+.+|++-+.+.+|.+-
T Consensus 204 d~f~~i~~ki~~~~~~~~~~~~~~~l~~~y~~~~~~~~~i~iLK~iL~~~~~n~~a~~~l~~~ 266 (906)
T PRK14720 204 DFFLRIERKVLGHREFTRLVGLLEDLYEPYKALEDWDEVIYILKKILEHDNKNNKAREELIRF 266 (906)
T ss_pred hHHHHHHHHHHhhhccchhHHHHHHHHHHHhhhhhhhHHHHHHHHHHhcCCcchhhHHHHHHH
Confidence 662 2233778889999999999999999999999999998763
No 151
>KOG1127|consensus
Probab=96.83 E-value=0.0061 Score=57.38 Aligned_cols=83 Identities=14% Similarity=0.094 Sum_probs=75.0
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHhhC----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh---cccH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYNLF----------PDDAVVQYTQACRIIGTSDLALHQKLASLLLR---MKRY 88 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e~l----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk---~kr~ 88 (122)
.++...|++|.|.++ +|+.||++..||-.+ .++|..+|..|-++ .|++.-+.--|+..|.+ .-.+
T Consensus 12 al~nk~YeealEqskkvLk~dpdNYnA~vFLGvAl~sl~q~le~A~ehYv~AaKl-dpdnlLAWkGL~nLye~~~dIl~l 90 (1238)
T KOG1127|consen 12 ALRNKEYEEALEQSKKVLKEDPDNYNAQVFLGVALWSLGQDLEKAAEHYVLAAKL-DPDNLLAWKGLGNLYERYNDILDL 90 (1238)
T ss_pred HHhhccHHHHHHHHHHHHhcCCCcchhhhHHHHHHHhccCCHHHHHHHHHHHHhc-ChhhhHHHHHHHHHHHccchhhhh
Confidence 356688999999999 999999999998865 89999999999999 79999999999999998 6678
Q ss_pred HHHHHHHHHHHHhCCCCH
Q psy16144 89 AEAIDVSQTIMKAHPSYP 106 (122)
Q Consensus 89 ~~AI~vc~kVL~~~P~yp 106 (122)
.++-.+|.+.+...|+.-
T Consensus 91 d~~~~~yq~~~l~le~q~ 108 (1238)
T KOG1127|consen 91 DRAAKCYQRAVLILENQS 108 (1238)
T ss_pred hHhHHHHHHHHHhhhhhh
Confidence 899999999999999765
No 152
>KOG1129|consensus
Probab=96.77 E-value=0.011 Score=50.41 Aligned_cols=84 Identities=15% Similarity=0.164 Sum_probs=74.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
-|.+.|-+-+|++-|+ .|+..|- ++-|.++ +.-|...|..-++- -|-+.+.....|..+.-++++++|
T Consensus 232 CylrLgm~r~AekqlqssL~q~~~-~dTfllLskvY~ridQP~~AL~~~~~gld~-fP~~VT~l~g~ARi~eam~~~~~a 309 (478)
T KOG1129|consen 232 CYLRLGMPRRAEKQLQSSLTQFPH-PDTFLLLSKVYQRIDQPERALLVIGEGLDS-FPFDVTYLLGQARIHEAMEQQEDA 309 (478)
T ss_pred HHHHhcChhhhHHHHHHHhhcCCc-hhHHHHHHHHHHHhccHHHHHHHHhhhhhc-CCchhhhhhhhHHHHHHHHhHHHH
Confidence 7888999999999999 9998885 5555544 88899999999998 699999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHH
Q psy16144 92 IDVSQTIMKAHPSYPK 107 (122)
Q Consensus 92 I~vc~kVL~~~P~ypk 107 (122)
...|+.||++||.+..
T Consensus 310 ~~lYk~vlk~~~~nvE 325 (478)
T KOG1129|consen 310 LQLYKLVLKLHPINVE 325 (478)
T ss_pred HHHHHHHHhcCCccce
Confidence 9999999999998753
No 153
>KOG3060|consensus
Probab=96.77 E-value=0.016 Score=47.44 Aligned_cols=90 Identities=12% Similarity=0.050 Sum_probs=71.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++=-.|.+++|.+++. .|+-||.+.-.+-.- -.+|++....-++. -++|+.++-.||-.|+-.|.|..|
T Consensus 95 ~lEa~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAilka~GK~l~aIk~ln~YL~~-F~~D~EAW~eLaeiY~~~~~f~kA 173 (289)
T KOG3060|consen 95 LLEATGNYKEAIEYYESLLEDDPTDTVIRKRKLAILKAQGKNLEAIKELNEYLDK-FMNDQEAWHELAEIYLSEGDFEKA 173 (289)
T ss_pred HHHHhhchhhHHHHHHHHhccCcchhHHHHHHHHHHHHcCCcHHHHHHHHHHHHH-hcCcHHHHHHHHHHHHhHhHHHHH
Confidence 3334577888888888 888888776666522 66777777777777 488889999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~i 112 (122)
+=|+++++=..|.+|......
T Consensus 174 ~fClEE~ll~~P~n~l~f~rl 194 (289)
T KOG3060|consen 174 AFCLEELLLIQPFNPLYFQRL 194 (289)
T ss_pred HHHHHHHHHcCCCcHHHHHHH
Confidence 999999999999998876543
No 154
>COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only]
Probab=96.75 E-value=0.032 Score=48.80 Aligned_cols=81 Identities=14% Similarity=0.021 Sum_probs=68.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
|++..+|.++|.+.++ ++...|+.+--+- ++ +++|+........- +|+||....-||..|-+.|+-.+|
T Consensus 349 i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~all~~g~~~eai~~L~~~~~~-~p~dp~~w~~LAqay~~~g~~~~a 427 (484)
T COG4783 349 ILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALLKGGKPQEAIRILNRYLFN-DPEDPNGWDLLAQAYAELGNRAEA 427 (484)
T ss_pred HHHHcCChHHHHHHHHHHHhcCCCccHHHHHHHHHHHhcCChHHHHHHHHHHhhc-CCCCchHHHHHHHHHHHhCchHHH
Confidence 9999999999999999 9999999644333 22 99999999998888 899999999999999999888777
Q ss_pred HHHHHHHHHhCC
Q psy16144 92 IDVSQTIMKAHP 103 (122)
Q Consensus 92 I~vc~kVL~~~P 103 (122)
..-+-+.+...-
T Consensus 428 ~~A~AE~~~~~G 439 (484)
T COG4783 428 LLARAEGYALAG 439 (484)
T ss_pred HHHHHHHHHhCC
Confidence 776666665544
No 155
>KOG1156|consensus
Probab=96.71 E-value=0.0061 Score=54.96 Aligned_cols=86 Identities=17% Similarity=0.230 Sum_probs=72.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh---hC------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN---LF------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e---~l------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
..-..|+-++|-++.+ .++-++.+.--|- .+ |++|+++|++|+.+ .|+|.++...|+..-..+++|.-.
T Consensus 50 ~L~~lg~~~ea~~~vr~glr~d~~S~vCwHv~gl~~R~dK~Y~eaiKcy~nAl~~-~~dN~qilrDlslLQ~QmRd~~~~ 128 (700)
T KOG1156|consen 50 TLNCLGKKEEAYELVRLGLRNDLKSHVCWHVLGLLQRSDKKYDEAIKCYRNALKI-EKDNLQILRDLSLLQIQMRDYEGY 128 (700)
T ss_pred hhhcccchHHHHHHHHHHhccCcccchhHHHHHHHHhhhhhHHHHHHHHHHHHhc-CCCcHHHHHHHHHHHHHHHhhhhH
Confidence 4446699999999999 9987777555444 22 99999999999999 699999999999999999999999
Q ss_pred HHHHHHHHHhCCCCHHH
Q psy16144 92 IDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 92 I~vc~kVL~~~P~ypki 108 (122)
.++=.+.|...|.+-.-
T Consensus 129 ~~tr~~LLql~~~~ra~ 145 (700)
T KOG1156|consen 129 LETRNQLLQLRPSQRAS 145 (700)
T ss_pred HHHHHHHHHhhhhhHHH
Confidence 99999999998876443
No 156
>PF07719 TPR_2: Tetratricopeptide repeat; InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=96.69 E-value=0.0024 Score=34.40 Aligned_cols=24 Identities=8% Similarity=-0.025 Sum_probs=21.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcch
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSA 45 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~ 45 (122)
+|.+.|++++|.+.++ +++++|++
T Consensus 10 ~~~~~~~~~~A~~~~~~al~l~p~~ 34 (34)
T PF07719_consen 10 AYYQLGNYEEAIEYFEKALELDPNN 34 (34)
T ss_dssp HHHHTT-HHHHHHHHHHHHHHSTTS
T ss_pred HHHHhCCHHHHHHHHHHHHHHCcCC
Confidence 8889999999999999 99999974
No 157
>KOG2002|consensus
Probab=96.67 E-value=0.0092 Score=55.74 Aligned_cols=94 Identities=12% Similarity=0.046 Sum_probs=73.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhC-CCChHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIG-TSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~-~~~~~vgykLA~~ylk~kr~~~ 90 (122)
+....|+|++|.+.|. +.+--..|+..|- |+ |..|+..|+.+.+... .+++.|.--||.+|...|++.+
T Consensus 655 VLA~kg~~~~A~dIFsqVrEa~~~~~dv~lNlah~~~e~~qy~~AIqmYe~~lkkf~~~~~~~vl~~Lara~y~~~~~~e 734 (1018)
T KOG2002|consen 655 VLAEKGRFSEARDIFSQVREATSDFEDVWLNLAHCYVEQGQYRLAIQMYENCLKKFYKKNRSEVLHYLARAWYEAGKLQE 734 (1018)
T ss_pred hhhhccCchHHHHHHHHHHHHHhhCCceeeeHHHHHHHHHHHHHHHHHHHHHHHHhcccCCHHHHHHHHHHHHHhhhHHH
Confidence 5567788888888888 6665545555555 33 8899999999887644 4578888888999999999999
Q ss_pred HHHHHHHHHHhCCCCHHHHHHHHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKVHKQIFEK 115 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypkir~~il~k 115 (122)
|.+...+++...|.+|-++.+|...
T Consensus 735 ak~~ll~a~~~~p~~~~v~FN~a~v 759 (1018)
T KOG2002|consen 735 AKEALLKARHLAPSNTSVKFNLALV 759 (1018)
T ss_pred HHHHHHHHHHhCCccchHHhHHHHH
Confidence 9998999999999888887776543
No 158
>PF00515 TPR_1: Tetratricopeptide repeat; InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=96.65 E-value=0.0023 Score=34.92 Aligned_cols=28 Identities=11% Similarity=0.010 Sum_probs=23.1
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcch
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISSA 45 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps~ 45 (122)
|..+ +|...|++++|++.++ +++++|++
T Consensus 4 ~~~~g~~~~~~~~~~~A~~~~~~al~~~p~~ 34 (34)
T PF00515_consen 4 YYNLGNAYFQLGDYEEALEYYQRALELDPDN 34 (34)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHSTTH
T ss_pred HHHHHHHHHHhCCchHHHHHHHHHHHHCcCC
Confidence 4445 8889999999999999 99999974
No 159
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=96.65 E-value=0.044 Score=36.41 Aligned_cols=82 Identities=13% Similarity=0.165 Sum_probs=54.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc---hHHHHhhC---------HHHHHHHHHHHHHhhCCC-ChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS---ATNIYNLF---------PDDAVVQYTQACRIIGTS-DLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps---~~kA~e~l---------y~dAa~~ye~Aw~l~~~~-~~~vgykLA~~ylk~kr 87 (122)
++...|+++.|.+.+. ++..+|. ....+... +.+|...+.++.+. .+. +..+...++..|...++
T Consensus 139 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~ 217 (291)
T COG0457 139 ALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKL-NPDDDAEALLNLGLLYLKLGK 217 (291)
T ss_pred HHHHcCCHHHHHHHHHHHHhcCCCccchHHHHHHhhhHHHHhcCHHHHHHHHHHHHhh-CcccchHHHHHhhHHHHHccc
Confidence 4566777777777777 6665552 12211110 77777777777777 355 57777777777777777
Q ss_pred HHHHHHHHHHHHHhCCC
Q psy16144 88 YAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 88 ~~~AI~vc~kVL~~~P~ 104 (122)
+.+|+..+.+.+...|+
T Consensus 218 ~~~a~~~~~~~~~~~~~ 234 (291)
T COG0457 218 YEEALEYYEKALELDPD 234 (291)
T ss_pred HHHHHHHHHHHHhhCcc
Confidence 77777777777777775
No 160
>KOG1174|consensus
Probab=96.60 E-value=0.014 Score=50.97 Aligned_cols=53 Identities=11% Similarity=0.028 Sum_probs=49.3
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYP 106 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~yp 106 (122)
-+.|-+.|++++++ +|.-..+-.++|-..+.-|++.|+|.+.++.|+..||-.
T Consensus 420 rEKAKkf~ek~L~~-~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~~D~~ 472 (564)
T KOG1174|consen 420 REKAKKFAEKSLKI-NPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLIIFPDVN 472 (564)
T ss_pred HHHHHHHHHhhhcc-CCccHHHHHHHHHHHHhhCccchHHHHHHHHHhhccccH
Confidence 67899999999999 799999999999999999999999999999999999753
No 161
>KOG4162|consensus
Probab=96.56 E-value=0.037 Score=50.79 Aligned_cols=89 Identities=17% Similarity=0.161 Sum_probs=81.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++...+.-|+|.-.|. +-+..|-|+..|... ..||-..|..|..+ +|+++.+--.||-.|++.|+..-|
T Consensus 659 ~~~~~~~~~~a~~CL~Ea~~~~~l~~~~~~~~G~~~~~~~~~~EA~~af~~Al~l-dP~hv~s~~Ala~~lle~G~~~la 737 (799)
T KOG4162|consen 659 LFLLSGNDDEARSCLLEASKIDPLSASVYYLRGLLLEVKGQLEEAKEAFLVALAL-DPDHVPSMTALAELLLELGSPRLA 737 (799)
T ss_pred HHHhcCCchHHHHHHHHHHhcchhhHHHHHHhhHHHHHHHhhHHHHHHHHHHHhc-CCCCcHHHHHHHHHHHHhCCcchH
Confidence 7888888999998888 999999998888733 99999999999999 899999999999999999999888
Q ss_pred HH--HHHHHHHhCCCCHHHHHH
Q psy16144 92 ID--VSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 92 I~--vc~kVL~~~P~ypkir~~ 111 (122)
.+ +.+.+++.+|++|++|.-
T Consensus 738 ~~~~~L~dalr~dp~n~eaW~~ 759 (799)
T KOG4162|consen 738 EKRSLLSDALRLDPLNHEAWYY 759 (799)
T ss_pred HHHHHHHHHHhhCCCCHHHHHH
Confidence 88 999999999999999864
No 162
>KOG3081|consensus
Probab=96.47 E-value=0.031 Score=46.05 Aligned_cols=61 Identities=16% Similarity=0.193 Sum_probs=54.1
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
+.||.-.|+.--+.+ +..|....-+|.+.+..+||++|..+.+.+|..+|++|..-...+-
T Consensus 189 ~qdAfyifeE~s~k~-~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL~Nliv 249 (299)
T KOG3081|consen 189 IQDAFYIFEELSEKT-PPTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPETLANLIV 249 (299)
T ss_pred hhhHHHHHHHHhccc-CCChHHHccHHHHHHHhcCHHHHHHHHHHHHhccCCCHHHHHHHHH
Confidence 999999999999875 4468999999999999999999999999999999999987655443
No 163
>PF13424 TPR_12: Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=96.43 E-value=0.0091 Score=37.86 Aligned_cols=13 Identities=23% Similarity=0.329 Sum_probs=5.5
Q ss_pred HHHHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACR 65 (122)
Q Consensus 53 y~dAa~~ye~Aw~ 65 (122)
|++|..+|++|.+
T Consensus 62 ~~~A~~~~~~al~ 74 (78)
T PF13424_consen 62 YEEALEYYQKALD 74 (78)
T ss_dssp HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHh
Confidence 4444444444443
No 164
>PRK10153 DNA-binding transcriptional activator CadC; Provisional
Probab=96.36 E-value=0.013 Score=51.23 Aligned_cols=57 Identities=14% Similarity=0.014 Sum_probs=47.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLL 82 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~y 82 (122)
.+...|++++|...++ ++..||+ ..+|.++ +++|+..|++|..+ +|.+|. |-|.-|+
T Consensus 429 ~~~~~g~~~~A~~~l~rAl~L~ps-~~a~~~lG~~~~~~G~~~eA~~~~~~A~~L-~P~~pt--~~~~~~~ 495 (517)
T PRK10153 429 QALVKGKTDEAYQAINKAIDLEMS-WLNYVLLGKVYELKGDNRLAADAYSTAFNL-RPGENT--LYWIENL 495 (517)
T ss_pred HHHhcCCHHHHHHHHHHHHHcCCC-HHHHHHHHHHHHHcCCHHHHHHHHHHHHhc-CCCCch--HHHHHhc
Confidence 6667799999999999 9999997 6677754 99999999999999 799985 5444443
No 165
>KOG0495|consensus
Probab=96.34 E-value=0.019 Score=52.50 Aligned_cols=84 Identities=13% Similarity=0.045 Sum_probs=74.6
Q ss_pred CCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHH
Q psy16144 26 PWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVS 95 (122)
Q Consensus 26 ~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc 95 (122)
++..++|..+|. ||+.-|.|.|.|..+ .+.|...|.+-.+.+ |+.+-....|+..=.|.|..+.|-.+.
T Consensus 664 ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~c-P~~ipLWllLakleEk~~~~~rAR~il 742 (913)
T KOG0495|consen 664 LDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQGTKKC-PNSIPLWLLLAKLEEKDGQLVRARSIL 742 (913)
T ss_pred hhhHHHHHHHHHHHHHhCCchHHHHHHHhHHHHHHHHHHHHHHHHHhccccC-CCCchHHHHHHHHHHHhcchhhHHHHH
Confidence 488999999999 999999999998755 778889999999995 999999999999999999999999999
Q ss_pred HHHHHhCCCCHHHHH
Q psy16144 96 QTIMKAHPSYPKVHK 110 (122)
Q Consensus 96 ~kVL~~~P~ypkir~ 110 (122)
.+..-.||+++-.+-
T Consensus 743 drarlkNPk~~~lwl 757 (913)
T KOG0495|consen 743 DRARLKNPKNALLWL 757 (913)
T ss_pred HHHHhcCCCcchhHH
Confidence 999999999887654
No 166
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=96.29 E-value=0.059 Score=44.69 Aligned_cols=55 Identities=15% Similarity=0.235 Sum_probs=47.5
Q ss_pred HHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC--HHHHHHHHH
Q psy16144 59 QYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY--PKVHKQIFE 114 (122)
Q Consensus 59 ~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y--pkir~~il~ 114 (122)
-+++.|.- +|+|+.+.+.||..|.-.||+++|.+..-.+|+.+-++ ..+|+.+|+
T Consensus 224 ~l~~~~aa-dPdd~~aa~~lA~~~~~~g~~e~Ale~Ll~~l~~d~~~~d~~~Rk~lle 280 (304)
T COG3118 224 DLQRRLAA-DPDDVEAALALADQLHLVGRNEAALEHLLALLRRDRGFEDGEARKTLLE 280 (304)
T ss_pred HHHHHHHh-CCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhcccccCcHHHHHHHH
Confidence 34555666 89999999999999999999999999999999998744 778888776
No 167
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=96.19 E-value=0.021 Score=50.46 Aligned_cols=82 Identities=15% Similarity=0.091 Sum_probs=58.8
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhh--CcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHH
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQY--ISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLL 82 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~--nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~y 82 (122)
-|+.+ .|.+.|++++|.++|+ ..+. .|+ ...|.-+ +++|...++...+..-+.|..++..|...|
T Consensus 292 t~n~li~~y~~~g~~~eA~~lf~~M~~~g~~pd-~~t~~~ll~a~~~~g~~~~a~~i~~~m~~~g~~~d~~~~~~Li~~y 370 (697)
T PLN03081 292 AWNSMLAGYALHGYSEEALCLYYEMRDSGVSID-QFTFSIMIRIFSRLALLEHAKQAHAGLIRTGFPLDIVANTALVDLY 370 (697)
T ss_pred HHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCCC-HHHHHHHHHHHHhccchHHHHHHHHHHHHhCCCCCeeehHHHHHHH
Confidence 46666 8888899999999888 6543 233 2233311 778888888888776577777788888888
Q ss_pred HhcccHHHHHHHHHHHH
Q psy16144 83 LRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 83 lk~kr~~~AI~vc~kVL 99 (122)
.|.|+.++|..++++..
T Consensus 371 ~k~G~~~~A~~vf~~m~ 387 (697)
T PLN03081 371 SKWGRMEDARNVFDRMP 387 (697)
T ss_pred HHCCCHHHHHHHHHhCC
Confidence 88888888888777664
No 168
>KOG4648|consensus
Probab=96.13 E-value=0.0055 Score=52.61 Aligned_cols=55 Identities=20% Similarity=0.245 Sum_probs=42.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|++|+.||-++..+ .|.||....+=|..|+|.++|..|..=|..++.++--|.|+
T Consensus 113 y~EAIDCYs~~ia~-~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~KA 167 (536)
T KOG4648|consen 113 YEEAIDCYSTAIAV-YPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVKA 167 (536)
T ss_pred hhHHHHHhhhhhcc-CCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHhhHHHHHH
Confidence 78888888888888 47777777777888888888887777777777776666555
No 169
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=96.12 E-value=0.012 Score=47.72 Aligned_cols=84 Identities=15% Similarity=0.100 Sum_probs=78.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|-+.|-.+.|.--|. +|..+|.-+.+++|+ |+.|-..|...+++ +|+..-+..+=|-++--.|||.-|
T Consensus 74 lYDSlGL~~LAR~DftQaLai~P~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~EL-Dp~y~Ya~lNRgi~~YY~gR~~LA 152 (297)
T COG4785 74 LYDSLGLRALARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLEL-DPTYNYAHLNRGIALYYGGRYKLA 152 (297)
T ss_pred hhhhhhHHHHHhhhhhhhhhcCCCcHHHHHHHHHHHHhcccchHHHHHhhhHhcc-CCcchHHHhccceeeeecCchHhh
Confidence 9999999999999999 999999999999977 99999999999999 799988888889999999999999
Q ss_pred HHHHHHHHHhCCCCH
Q psy16144 92 IDVSQTIMKAHPSYP 106 (122)
Q Consensus 92 I~vc~kVL~~~P~yp 106 (122)
.+=..+--..+|+.|
T Consensus 153 q~d~~~fYQ~D~~DP 167 (297)
T COG4785 153 QDDLLAFYQDDPNDP 167 (297)
T ss_pred HHHHHHHHhcCCCCh
Confidence 998888888888777
No 170
>PLN03081 pentatricopeptide (PPR) repeat-containing protein; Provisional
Probab=96.06 E-value=0.029 Score=49.58 Aligned_cols=78 Identities=18% Similarity=0.152 Sum_probs=61.1
Q ss_pred HhcCCCChHHHHHHHHHHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLRKKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAI 92 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI 92 (122)
+|.+.|++++|.++++-....|+ ...|.-+ ++.|...+++..++ .|+++....-|...|.+.|+.++|.
T Consensus 471 ~l~r~G~~~eA~~~~~~~~~~p~-~~~~~~Ll~a~~~~g~~~~a~~~~~~l~~~-~p~~~~~y~~L~~~y~~~G~~~~A~ 548 (697)
T PLN03081 471 LLGREGLLDEAYAMIRRAPFKPT-VNMWAALLTACRIHKNLELGRLAAEKLYGM-GPEKLNNYVVLLNLYNSSGRQAEAA 548 (697)
T ss_pred HHHhcCCHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHcCCcHHHHHHHHHHhCC-CCCCCcchHHHHHHHHhCCCHHHHH
Confidence 88888889988888882234454 4455533 78888888998888 6888888889999999999999999
Q ss_pred HHHHHHHHh
Q psy16144 93 DVSQTIMKA 101 (122)
Q Consensus 93 ~vc~kVL~~ 101 (122)
.+.+..-+.
T Consensus 549 ~v~~~m~~~ 557 (697)
T PLN03081 549 KVVETLKRK 557 (697)
T ss_pred HHHHHHHHc
Confidence 988876644
No 171
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=96.05 E-value=0.011 Score=48.12 Aligned_cols=52 Identities=15% Similarity=0.179 Sum_probs=49.4
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
++.|+..|.+|+++ .|....-.|+||+.-.|+|++..|..-|++||+++|+.
T Consensus 11 ~~aaaely~qal~l-ap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D 62 (287)
T COG4976 11 AEAAAELYNQALEL-APEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPED 62 (287)
T ss_pred hHHHHHHHHHHhhc-CchhhhhhhhcchhhhhcccHHHHHHHHHHHHcCCccc
Confidence 67899999999999 69999999999999999999999999999999999965
No 172
>PF14938 SNAP: Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=96.03 E-value=0.068 Score=42.40 Aligned_cols=88 Identities=13% Similarity=0.124 Sum_probs=63.2
Q ss_pred HhcCC-CChHHHHHHHH-HHhhCcchHHHHh-------------hC--HHHHHHHHHHHHHhhCCCC---hHH---HHHH
Q psy16144 22 LYLLP-WRWEPLSAGLR-KKQYISSATNIYN-------------LF--PDDAVVQYTQACRIIGTSD---LAL---HQKL 78 (122)
Q Consensus 22 Lyi~~-gk~d~A~elL~-~l~~nps~~kA~e-------------~l--y~dAa~~ye~Aw~l~~~~~---~~v---gykL 78 (122)
+|-+. |++++|.++++ ++.+-........ .+ |++|+..|++.....-.++ .++ .++.
T Consensus 123 ~ye~~~~d~e~Ai~~Y~~A~~~y~~e~~~~~a~~~~~~~A~l~~~l~~y~~A~~~~e~~~~~~l~~~l~~~~~~~~~l~a 202 (282)
T PF14938_consen 123 IYEEQLGDYEKAIEYYQKAAELYEQEGSPHSAAECLLKAADLYARLGRYEEAIEIYEEVAKKCLENNLLKYSAKEYFLKA 202 (282)
T ss_dssp HHCCTT--HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHTCCCHCTTGHHHHHHHHHH
T ss_pred HHHHHcCCHHHHHHHHHHHHHHHHHCCChhhHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHhhcccccchhHHHHHHHH
Confidence 88888 99999999999 8876443222221 11 9999999999998642222 223 3567
Q ss_pred HHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 79 ASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
.-+++..|+++.|-..+.+....+|.+..-|
T Consensus 203 ~l~~L~~~D~v~A~~~~~~~~~~~~~F~~s~ 233 (282)
T PF14938_consen 203 ILCHLAMGDYVAARKALERYCSQDPSFASSR 233 (282)
T ss_dssp HHHHHHTT-HHHHHHHHHHHGTTSTTSTTSH
T ss_pred HHHHHHcCCHHHHHHHHHHHHhhCCCCCCcH
Confidence 7799999999999999999999999875543
No 173
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=96.01 E-value=0.053 Score=45.98 Aligned_cols=83 Identities=11% Similarity=0.021 Sum_probs=73.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCC-hHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSD-LALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~-~~vgykLA~~ylk~kr~~~ 90 (122)
.+....++|.|.++++ +++-||.|.+|=-.+ |..|+..|+...+- ||+. +.|+-.|.-+|...|+.++
T Consensus 189 ~~~~~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~eQ-n~~yl~evl~~L~~~Y~~lg~~~~ 267 (389)
T COG2956 189 QALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQ-NPEYLSEVLEMLYECYAQLGKPAE 267 (389)
T ss_pred HHhhhhhHHHHHHHHHHHHhhCccceehhhhhhHHHHhccchHHHHHHHHHHHHh-ChHHHHHHHHHHHHHHHHhCCHHH
Confidence 5667789999999999 999999999987644 99999999999998 6665 6789999999999999999
Q ss_pred HHHHHHHHHHhCCCC
Q psy16144 91 AIDVSQTIMKAHPSY 105 (122)
Q Consensus 91 AI~vc~kVL~~~P~y 105 (122)
.+.-.++..+.+|.-
T Consensus 268 ~~~fL~~~~~~~~g~ 282 (389)
T COG2956 268 GLNFLRRAMETNTGA 282 (389)
T ss_pred HHHHHHHHHHccCCc
Confidence 999999988888743
No 174
>PLN03077 Protein ECB2; Provisional
Probab=95.98 E-value=0.047 Score=49.27 Aligned_cols=82 Identities=17% Similarity=0.052 Sum_probs=55.5
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhC--cchHHHHhh------C--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYI--SSATNIYNL------F--PDDAVVQYTQACRIIGTSDLALHQKLASLLLR 84 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~n--ps~~kA~e~------l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk 84 (122)
|..+ .|.+.|++++|.++|+ ..+.+ |+..-.-.. . +++|...++.+.+.+-..+..++..|-..|.|
T Consensus 357 ~n~li~~~~~~g~~~~A~~lf~~M~~~g~~Pd~~t~~~ll~a~~~~g~~~~a~~l~~~~~~~g~~~~~~~~n~Li~~y~k 436 (857)
T PLN03077 357 WTAMISGYEKNGLPDKALETYALMEQDNVSPDEITIASVLSACACLGDLDVGVKLHELAERKGLISYVVVANALIEMYSK 436 (857)
T ss_pred HHHHHHHHHhCCCHHHHHHHHHHHHHhCCCCCceeHHHHHHHHhccchHHHHHHHHHHHHHhCCCcchHHHHHHHHHHHH
Confidence 5555 6777777777777777 54433 332111111 1 67777777777777666677788888888999
Q ss_pred cccHHHHHHHHHHHH
Q psy16144 85 MKRYAEAIDVSQTIM 99 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL 99 (122)
.|++++|.+++++..
T Consensus 437 ~g~~~~A~~vf~~m~ 451 (857)
T PLN03077 437 CKCIDKALEVFHNIP 451 (857)
T ss_pred cCCHHHHHHHHHhCC
Confidence 999888888888764
No 175
>KOG4642|consensus
Probab=95.96 E-value=0.014 Score=47.64 Aligned_cols=72 Identities=19% Similarity=0.179 Sum_probs=56.9
Q ss_pred CChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHH
Q psy16144 27 WRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQ 96 (122)
Q Consensus 27 gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~ 96 (122)
.+|+.|...+- ++..||..+.+|- ++ ++....-=.+|.++ .|+.....|-||...+..+.|.+||.+..
T Consensus 24 k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk~~~~~~v~~dcrralql-~~N~vk~h~flg~~~l~s~~~~eaI~~Lq 102 (284)
T KOG4642|consen 24 KRYDDAIDCYSRAICINPTVASYYTNRALCHLKLKHWEPVEEDCRRALQL-DPNLVKAHYFLGQWLLQSKGYDEAIKVLQ 102 (284)
T ss_pred hhhchHHHHHHHHHhcCCCcchhhhhHHHHHHHhhhhhhhhhhHHHHHhc-ChHHHHHHHHHHHHHHhhccccHHHHHHH
Confidence 67888888888 8888888655443 33 77777777888888 58888888888888888888888888888
Q ss_pred HHH
Q psy16144 97 TIM 99 (122)
Q Consensus 97 kVL 99 (122)
+++
T Consensus 103 ra~ 105 (284)
T KOG4642|consen 103 RAY 105 (284)
T ss_pred HHH
Confidence 873
No 176
>PLN03218 maturation of RBCL 1; Provisional
Probab=95.95 E-value=0.08 Score=50.10 Aligned_cols=78 Identities=12% Similarity=0.032 Sum_probs=51.8
Q ss_pred HhcCCCChHHHHHHHH-HHhh--CcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQY--ISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYA 89 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~--nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~ 89 (122)
.|.+.|++++|.++++ ..+. .|+ ...|.-+ +++|...++...+..-+-+..+...|...|.+.|+++
T Consensus 623 ay~k~G~~deAl~lf~eM~~~Gv~PD-~~TynsLI~a~~k~G~~eeA~~l~~eM~k~G~~pd~~tynsLI~ay~k~G~~e 701 (1060)
T PLN03218 623 SCSQKGDWDFALSIYDDMKKKGVKPD-EVFFSALVDVAGHAGDLDKAFEILQDARKQGIKLGTVSYSSLMGACSNAKNWK 701 (1060)
T ss_pred HHHhcCCHHHHHHHHHHHHHcCCCCC-HHHHHHHHHHHHhCCCHHHHHHHHHHHHHcCCCCCHHHHHHHHHHHHhCCCHH
Confidence 6667777777777777 5544 333 2233311 7777777777777655566777777777777777777
Q ss_pred HHHHHHHHHHH
Q psy16144 90 EAIDVSQTIMK 100 (122)
Q Consensus 90 ~AI~vc~kVL~ 100 (122)
+|+.++++..+
T Consensus 702 eA~~lf~eM~~ 712 (1060)
T PLN03218 702 KALELYEDIKS 712 (1060)
T ss_pred HHHHHHHHHHH
Confidence 77777777654
No 177
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=95.94 E-value=0.056 Score=46.30 Aligned_cols=82 Identities=13% Similarity=0.117 Sum_probs=48.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchH--HHHhhC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSAT--NIYNLF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~--kA~e~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
=+++-|..|+|.+.++ .++++-+.. ..+..+ +.+=.+.-|+..+. .|++|...+.||..|+|.|.+.+|-+.
T Consensus 272 ~li~l~~~~~A~~~i~~~Lk~~~D~~L~~~~~~l~~~d~~~l~k~~e~~l~~-h~~~p~L~~tLG~L~~k~~~w~kA~~~ 350 (400)
T COG3071 272 RLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQ-HPEDPLLLSTLGRLALKNKLWGKASEA 350 (400)
T ss_pred HHHHcCChHHHHHHHHHHHHhccChhHHHHHhhcCCCCchHHHHHHHHHHHh-CCCChhHHHHHHHHHHHhhHHHHHHHH
Confidence 3566777788887777 777655422 111111 44444445555555 566666666666666666666666666
Q ss_pred HHHHHHhCCC
Q psy16144 95 SQTIMKAHPS 104 (122)
Q Consensus 95 c~kVL~~~P~ 104 (122)
.+.+++.-|+
T Consensus 351 leaAl~~~~s 360 (400)
T COG3071 351 LEAALKLRPS 360 (400)
T ss_pred HHHHHhcCCC
Confidence 6666666553
No 178
>PF12569 NARP1: NMDA receptor-regulated protein 1 ; InterPro: IPR021183 This group represents N-terminal acetyltransferase A (NatA) auxiliary subunit and represents a non-catalytic component of the NatA N-terminal acetyltransferase, which catalyzes acetylation of proteins beginning with Met-Ser, Met-Gly and Met-Ala. N-terminal acetylation plays a role in normal eukaryotic translation and processing, protect against proteolytic degradation and protein turnover. NAT1 anchors ARD1 and NAT5 to the ribosome and may present the N- terminal of nascent polypeptides for acetylation [], [].
Probab=95.91 E-value=0.11 Score=45.63 Aligned_cols=92 Identities=9% Similarity=0.065 Sum_probs=72.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcc-----
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMK----- 86 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~k----- 86 (122)
|+..+|++++|.+.|. .-+...+-....|.. +++|...|+..++. ||+|..-...|..+..-..
T Consensus 13 il~e~g~~~~AL~~L~~~~~~I~Dk~~~~E~rA~ll~kLg~~~eA~~~y~~Li~r-NPdn~~Yy~~L~~~~g~~~~~~~~ 91 (517)
T PF12569_consen 13 ILEEAGDYEEALEHLEKNEKQILDKLAVLEKRAELLLKLGRKEEAEKIYRELIDR-NPDNYDYYRGLEEALGLQLQLSDE 91 (517)
T ss_pred HHHHCCCHHHHHHHHHhhhhhCCCHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH-CCCcHHHHHHHHHHHhhhcccccc
Confidence 8889999999999999 888788766666622 89999999999998 8998888888888874333
Q ss_pred cHHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 87 RYAEAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 87 r~~~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
....-.++|.++-..+|...-+|.-.|+
T Consensus 92 ~~~~~~~~y~~l~~~yp~s~~~~rl~L~ 119 (517)
T PF12569_consen 92 DVEKLLELYDELAEKYPRSDAPRRLPLD 119 (517)
T ss_pred cHHHHHHHHHHHHHhCccccchhHhhcc
Confidence 4566688899998888876666655554
No 179
>PLN03077 Protein ECB2; Provisional
Probab=95.89 E-value=0.11 Score=46.86 Aligned_cols=97 Identities=13% Similarity=0.157 Sum_probs=69.4
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHh--hCcchHHHHhh-C--------HHHHHHHHHHHH-HhhCCCChHHHHHHHHH
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQ--YISSATNIYNL-F--------PDDAVVQYTQAC-RIIGTSDLALHQKLASL 81 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~--~nps~~kA~e~-l--------y~dAa~~ye~Aw-~l~~~~~~~vgykLA~~ 81 (122)
-|..+ -|.+.|+.++|.++|+ ..+ ..|+. ..|.. + .++|...|+... +.+-.-+.....-|...
T Consensus 556 s~n~lI~~~~~~G~~~~A~~lf~~M~~~g~~Pd~-~T~~~ll~a~~~~g~v~ea~~~f~~M~~~~gi~P~~~~y~~lv~~ 634 (857)
T PLN03077 556 SWNILLTGYVAHGKGSMAVELFNRMVESGVNPDE-VTFISLLCACSRSGMVTQGLEYFHSMEEKYSITPNLKHYACVVDL 634 (857)
T ss_pred hHHHHHHHHHHcCCHHHHHHHHHHHHHcCCCCCc-ccHHHHHHHHhhcChHHHHHHHHHHHHHHhCCCCchHHHHHHHHH
Confidence 46666 8889999999999999 766 34542 22221 1 889999999887 44434455677889999
Q ss_pred HHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy16144 82 LLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKA 116 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka 116 (122)
|.+.|++++|.++.++. ...|| +.++.-+|.-+
T Consensus 635 l~r~G~~~eA~~~~~~m-~~~pd-~~~~~aLl~ac 667 (857)
T PLN03077 635 LGRAGKLTEAYNFINKM-PITPD-PAVWGALLNAC 667 (857)
T ss_pred HHhCCCHHHHHHHHHHC-CCCCC-HHHHHHHHHHH
Confidence 99999999999988775 35664 66666666544
No 180
>COG0457 NrfG FOG: TPR repeat [General function prediction only]
Probab=95.86 E-value=0.17 Score=33.60 Aligned_cols=82 Identities=21% Similarity=0.255 Sum_probs=71.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc-hHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS-ATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps-~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~ 90 (122)
.+...|.++.|...+. .++.++. ...++..+ +.+|...+..+... .|........++..+...+.+.+
T Consensus 176 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 254 (291)
T COG0457 176 LLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALEL-DPDNAEALYNLALLLLELGRYEE 254 (291)
T ss_pred HHHHhcCHHHHHHHHHHHHhhCcccchHHHHHhhHHHHHcccHHHHHHHHHHHHhh-CcccHHHHhhHHHHHHHcCCHHH
Confidence 4567789999999999 9999999 56666533 79999999999999 57778889999999998888999
Q ss_pred HHHHHHHHHHhCCC
Q psy16144 91 AIDVSQTIMKAHPS 104 (122)
Q Consensus 91 AI~vc~kVL~~~P~ 104 (122)
++...++++..+|+
T Consensus 255 ~~~~~~~~~~~~~~ 268 (291)
T COG0457 255 ALEALEKALELDPD 268 (291)
T ss_pred HHHHHHHHHHhCcc
Confidence 99999999999996
No 181
>PF13181 TPR_8: Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=95.83 E-value=0.0099 Score=32.13 Aligned_cols=27 Identities=11% Similarity=0.027 Sum_probs=23.5
Q ss_pred HHHH--HhcCCCChHHHHHHHH-HHhhCcc
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLR-KKQYISS 44 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~-~l~~nps 44 (122)
|+.+ +|.+.|++++|.+.++ +++.+|+
T Consensus 4 ~~~lg~~y~~~~~~~~A~~~~~~a~~~~~~ 33 (34)
T PF13181_consen 4 YYNLGKIYEQLGDYEEALEYFEKALELNPD 33 (34)
T ss_dssp HHHHHHHHHHTTSHHHHHHHHHHHHHHHTT
T ss_pred HHHHHHHHHHcCCHHHHHHHHHHHHhhCCC
Confidence 4445 9999999999999999 9999984
No 182
>PF06552 TOM20_plant: Plant specific mitochondrial import receptor subunit TOM20; InterPro: IPR010547 This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesised in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria [].; GO: 0045040 protein import into mitochondrial outer membrane, 0005742 mitochondrial outer membrane translocase complex; PDB: 1ZU2_A.
Probab=95.83 E-value=0.046 Score=42.51 Aligned_cols=56 Identities=18% Similarity=0.127 Sum_probs=46.1
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhc----------ccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRM----------KRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~----------kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
|+.|.+.|+.+... +|.|+...++=|..++.. +-+.|||.-++++|.++|++.++.
T Consensus 7 FE~ark~aea~y~~-nP~DadnL~~WG~ALLELAqfk~g~es~~miedAisK~eeAL~I~P~~hdAl 72 (186)
T PF06552_consen 7 FEHARKKAEAAYAK-NPLDADNLTNWGGALLELAQFKQGPESKKMIEDAISKFEEALKINPNKHDAL 72 (186)
T ss_dssp HHHHHHHHHHHHHH--TT-HHHHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHH-TT-HHHH
T ss_pred HHHHHHHHHHHHHh-CcHhHHHHHHHHHHHHHHHhccCcchHHHHHHHHHHHHHHHHhcCCchHHHH
Confidence 88999999999999 899999999999998876 446789999999999999998764
No 183
>KOG2471|consensus
Probab=95.81 E-value=0.038 Score=49.19 Aligned_cols=111 Identities=15% Similarity=0.083 Sum_probs=84.2
Q ss_pred HHHHHHHHHHHHH---HhcCCCChHHHHHHHH--HHhhCcc--------hHHHHhhC---------HHHHHHHHHHHHH-
Q psy16144 9 MQIIQTIAEWHFL---LYLLPWRWEPLSAGLR--KKQYISS--------ATNIYNLF---------PDDAVVQYTQACR- 65 (122)
Q Consensus 9 ~~~~~~~~~~~~L---Lyi~~gk~d~A~elL~--~l~~nps--------~~kA~e~l---------y~dAa~~ye~Aw~- 65 (122)
|-|-|.-++.-+| ++--.|.|.+|.++|- -++-.|. .+.+|.-+ |.-++..|.+|+.
T Consensus 233 mn~a~~s~~~l~LKsq~eY~~gn~~kA~KlL~~sni~~~~g~~~T~q~~~cif~NNlGcIh~~~~~y~~~~~~F~kAL~N 312 (696)
T KOG2471|consen 233 MNIAQDSSMALLLKSQLEYAHGNHPKAMKLLLVSNIHKEAGGTITPQLSSCIFNNNLGCIHYQLGCYQASSVLFLKALRN 312 (696)
T ss_pred hhhcCCCcHHHHHHHHHHHHhcchHHHHHHHHhcccccccCccccchhhhheeecCcceEeeehhhHHHHHHHHHHHHHH
Confidence 4444545556666 5555699999999987 4444444 44555533 9999999999996
Q ss_pred ----------------hhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhc
Q psy16144 66 ----------------IIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMVH 119 (122)
Q Consensus 66 ----------------l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~~ 119 (122)
++.-+.-.+.|++|-.|+-.||.-.|-.++.+...++-.+|++|-+.-|-|...
T Consensus 313 ~c~qL~~g~~~~~~~tls~nks~eilYNcG~~~Lh~grPl~AfqCf~~av~vfh~nPrlWLRlAEcCima 382 (696)
T KOG2471|consen 313 SCSQLRNGLKPAKTFTLSQNKSMEILYNCGLLYLHSGRPLLAFQCFQKAVHVFHRNPRLWLRLAECCIMA 382 (696)
T ss_pred HHHHHhccCCCCcceehhcccchhhHHhhhHHHHhcCCcHHHHHHHHHHHHHHhcCcHHHHHHHHHHHHH
Confidence 122245678999999999999999999999999999999999999887776554
No 184
>PF09986 DUF2225: Uncharacterized protein conserved in bacteria (DUF2225); InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=95.77 E-value=0.28 Score=38.25 Aligned_cols=65 Identities=17% Similarity=0.131 Sum_probs=54.1
Q ss_pred HHHHHHHHHHHHHhhCC-----CChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhc
Q psy16144 53 PDDAVVQYTQACRIIGT-----SDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMVH 119 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~-----~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~~ 119 (122)
+..|...|++|.+.... +...+.|-+|-.+.+.|++++|+.-+.+|+ .+|+..+ ..-|+++||..
T Consensus 141 l~~Al~~y~~a~~~e~~~~~~~~~~~l~YLigeL~rrlg~~~eA~~~fs~vi-~~~~~s~-~~~l~~~AR~~ 210 (214)
T PF09986_consen 141 LRKALEFYEEAYENEDFPIEGMDEATLLYLIGELNRRLGNYDEAKRWFSRVI-GSKKASK-EPKLKDMARDQ 210 (214)
T ss_pred HHHHHHHHHHHHHhCcCCCCCchHHHHHHHHHHHHHHhCCHHHHHHHHHHHH-cCCCCCC-cHHHHHHHHHH
Confidence 89999999999987532 346889999999999999999999999999 6775555 45688888753
No 185
>COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism]
Probab=95.70 E-value=0.082 Score=45.29 Aligned_cols=75 Identities=17% Similarity=0.082 Sum_probs=66.4
Q ss_pred cCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHH
Q psy16144 24 LLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 24 i~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
++.++...=++.++ .++.-|..+..+..+ |.+|..+++.|+++ ..+.+-+--||.+|.+.|+.++|.+
T Consensus 305 l~~~d~~~l~k~~e~~l~~h~~~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~--~~s~~~~~~la~~~~~~g~~~~A~~ 382 (400)
T COG3071 305 LRPGDPEPLIKAAEKWLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKL--RPSASDYAELADALDQLGEPEEAEQ 382 (400)
T ss_pred cCCCCchHHHHHHHHHHHhCCCChhHHHHHHHHHHHhhHHHHHHHHHHHHHhc--CCChhhHHHHHHHHHHcCChHHHHH
Confidence 47788888888999 999999999877744 99999999999999 3457888899999999999999999
Q ss_pred HHHHHHH
Q psy16144 94 VSQTIMK 100 (122)
Q Consensus 94 vc~kVL~ 100 (122)
+.+++|.
T Consensus 383 ~r~e~L~ 389 (400)
T COG3071 383 VRREALL 389 (400)
T ss_pred HHHHHHH
Confidence 9999984
No 186
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=95.69 E-value=0.43 Score=33.57 Aligned_cols=45 Identities=22% Similarity=0.163 Sum_probs=36.3
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kV 98 (122)
|++|...++++... +|-|..+...|-..|...|++.+|+.+|++.
T Consensus 78 ~~~a~~~~~~~l~~-dP~~E~~~~~lm~~~~~~g~~~~A~~~Y~~~ 122 (146)
T PF03704_consen 78 YEEALRLLQRALAL-DPYDEEAYRLLMRALAAQGRRAEALRVYERY 122 (146)
T ss_dssp HHHHHHHHHHHHHH-STT-HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhc-CCCCHHHHHHHHHHHHHCcCHHHHHHHHHHH
Confidence 88888888888888 7888888888888888888888888888765
No 187
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=95.64 E-value=0.034 Score=30.72 Aligned_cols=29 Identities=21% Similarity=0.328 Sum_probs=25.1
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHh
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~ 101 (122)
.+..+||..|...|++.+|+.+++++++.
T Consensus 3 ~~~~~la~~~~~~g~~~~A~~~~~~al~~ 31 (42)
T PF13374_consen 3 SALNNLANAYRAQGRYEEALELLEEALEI 31 (42)
T ss_dssp HHHHHHHHHHHHCT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhhhcchhhHHHHHHHHH
Confidence 45678999999999999999999999965
No 188
>KOG2003|consensus
Probab=95.63 E-value=0.11 Score=46.28 Aligned_cols=86 Identities=16% Similarity=0.175 Sum_probs=74.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
||-|.|+-.+|-++-- .-+|.|++....|.+ .+.|+.+||+|--+ .|+-.---.-.|.++-+.|.|..|
T Consensus 601 lydqegdksqafq~~ydsyryfp~nie~iewl~ayyidtqf~ekai~y~ekaali-qp~~~kwqlmiasc~rrsgnyqka 679 (840)
T KOG2003|consen 601 LYDQEGDKSQAFQCHYDSYRYFPCNIETIEWLAAYYIDTQFSEKAINYFEKAALI-QPNQSKWQLMIASCFRRSGNYQKA 679 (840)
T ss_pred HhhcccchhhhhhhhhhcccccCcchHHHHHHHHHHHhhHHHHHHHHHHHHHHhc-CccHHHHHHHHHHHHHhcccHHHH
Confidence 8899999999988777 888999988777633 88999999999888 588777778889999999999999
Q ss_pred HHHHHHHHHhCCCCHHH
Q psy16144 92 IDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 92 I~vc~kVL~~~P~ypki 108 (122)
.++|+.+.+..|....-
T Consensus 680 ~d~yk~~hrkfpedldc 696 (840)
T KOG2003|consen 680 FDLYKDIHRKFPEDLDC 696 (840)
T ss_pred HHHHHHHHHhCccchHH
Confidence 99999999999987543
No 189
>KOG1127|consensus
Probab=95.60 E-value=0.039 Score=52.21 Aligned_cols=91 Identities=16% Similarity=0.094 Sum_probs=58.2
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLR 84 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk 84 (122)
.|+.. .|...+.+-.|+.-|+ +++.+|.+...|+-+ |.-|.+.|.+|-.+ +|.+--+-|+-|-.-.-
T Consensus 564 nW~~rG~yyLea~n~h~aV~~fQsALR~dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~L-rP~s~y~~fk~A~~ecd 642 (1238)
T KOG1127|consen 564 NWVQRGPYYLEAHNLHGAVCEFQSALRTDPKDYNLWLGLGEAYPESGRYSHALKVFTKASLL-RPLSKYGRFKEAVMECD 642 (1238)
T ss_pred hhhhccccccCccchhhHHHHHHHHhcCCchhHHHHHHHHHHHHhcCceehHHHhhhhhHhc-CcHhHHHHHHHHHHHHH
Confidence 35555 5666666666666666 666666666666633 66666666666666 56666666666666666
Q ss_pred cccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 85 MKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
.|+|.+|.+.+..++....++.-.
T Consensus 643 ~GkYkeald~l~~ii~~~s~e~~~ 666 (1238)
T KOG1127|consen 643 NGKYKEALDALGLIIYAFSLERTG 666 (1238)
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHh
Confidence 666666666666666555544433
No 190
>KOG2396|consensus
Probab=95.51 E-value=0.13 Score=45.62 Aligned_cols=80 Identities=11% Similarity=0.100 Sum_probs=66.6
Q ss_pred HHHHH-HHhhCcchHHHHh-hC--------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc-HHHHHHHHHHHHHh
Q psy16144 33 SAGLR-KKQYISSATNIYN-LF--------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR-YAEAIDVSQTIMKA 101 (122)
Q Consensus 33 ~elL~-~l~~nps~~kA~e-~l--------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr-~~~AI~vc~kVL~~ 101 (122)
..+.+ ++..-+.+.+-|. |+ |.+=...|.+++.+ +|++|.++.-=|+-..-.+. ..-|-++..+.|+.
T Consensus 91 v~lyr~at~rf~~D~~lW~~yi~f~kk~~~~~~v~ki~~~~l~~-Hp~~~dLWI~aA~wefe~n~ni~saRalflrgLR~ 169 (568)
T KOG2396|consen 91 VFLYRRATNRFNGDVKLWLSYIAFCKKKKTYGEVKKIFAAMLAK-HPNNPDLWIYAAKWEFEINLNIESARALFLRGLRF 169 (568)
T ss_pred HHHHHHHHHhcCCCHHHHHHHHHHHHHhcchhHHHHHHHHHHHh-CCCCchhHHhhhhhHHhhccchHHHHHHHHHHhhc
Confidence 34666 7777777788887 22 78888999999999 89999999988877777776 77788999999999
Q ss_pred CCCCHHHHHHHH
Q psy16144 102 HPSYPKVHKQIF 113 (122)
Q Consensus 102 ~P~ypkir~~il 113 (122)
||+.|+++++-+
T Consensus 170 npdsp~Lw~eyf 181 (568)
T KOG2396|consen 170 NPDSPKLWKEYF 181 (568)
T ss_pred CCCChHHHHHHH
Confidence 999999998754
No 191
>PLN03218 maturation of RBCL 1; Provisional
Probab=95.51 E-value=0.14 Score=48.47 Aligned_cols=79 Identities=9% Similarity=-0.104 Sum_probs=44.4
Q ss_pred HhcCCCChHHHHHHHH-HHh----hCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQ----YISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~----~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr 87 (122)
.|.+.|++++|.+++. ..+ ..|+ ...|..+ +++|...|+...+..-+.++.+...|...|.+.|+
T Consensus 551 a~~k~G~~deA~~lf~eM~~~~~gi~PD-~vTynaLI~ay~k~G~ldeA~elf~~M~e~gi~p~~~tynsLI~ay~k~G~ 629 (1060)
T PLN03218 551 ACGQSGAVDRAFDVLAEMKAETHPIDPD-HITVGALMKACANAGQVDRAKEVYQMIHEYNIKGTPEVYTIAVNSCSQKGD 629 (1060)
T ss_pred HHHHCCCHHHHHHHHHHHHHhcCCCCCc-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCCChHHHHHHHHHHHhcCC
Confidence 5555666666666666 433 1333 2222211 66666666666666444555666666666666666
Q ss_pred HHHHHHHHHHHHHh
Q psy16144 88 YAEAIDVSQTIMKA 101 (122)
Q Consensus 88 ~~~AI~vc~kVL~~ 101 (122)
+++|+.++++..+.
T Consensus 630 ~deAl~lf~eM~~~ 643 (1060)
T PLN03218 630 WDFALSIYDDMKKK 643 (1060)
T ss_pred HHHHHHHHHHHHHc
Confidence 66666666666554
No 192
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=95.46 E-value=0.065 Score=38.48 Aligned_cols=78 Identities=14% Similarity=0.064 Sum_probs=65.5
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHhhC-----------------------HHHHHHHHHHHHHhhCCCChHHHHHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYNLF-----------------------PDDAVVQYTQACRIIGTSDLALHQKL 78 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e~l-----------------------y~dAa~~ye~Aw~l~~~~~~~vgykL 78 (122)
+.+.|.+-+|.++++ .+..+++...+|... ...|+.+|.++..++ |..+...|.|
T Consensus 6 ~~~rGnhiKAL~iied~i~~h~~~~~~~~lh~~QG~if~~lA~~ten~d~k~~yLl~sve~~s~a~~Ls-p~~A~~L~~l 84 (111)
T PF04781_consen 6 YFARGNHIKALEIIEDLISRHGEDESSWLLHRLQGTIFYKLAKKTENPDVKFRYLLGSVECFSRAVELS-PDSAHSLFEL 84 (111)
T ss_pred HHHccCHHHHHHHHHHHHHHccCCCchHHHHHHHhHHHHHHHHhccCchHHHHHHHHhHHHHHHHhccC-hhHHHHHHHH
Confidence 456799999999999 777777766555411 678899999999995 8889999999
Q ss_pred HHHHHhcccHHHHHHHHHHHHHh
Q psy16144 79 ASLLLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc~kVL~~ 101 (122)
|..+--..-|.+|..-|++.|..
T Consensus 85 a~~l~s~~~Ykk~v~kak~~Lsv 107 (111)
T PF04781_consen 85 ASQLGSVKYYKKAVKKAKRGLSV 107 (111)
T ss_pred HHHhhhHHHHHHHHHHHHHHhcc
Confidence 99998899999999999999864
No 193
>KOG4234|consensus
Probab=95.44 E-value=0.058 Score=43.43 Aligned_cols=56 Identities=18% Similarity=0.386 Sum_probs=47.2
Q ss_pred HHHHHHHHHHHHHhhCCCChH---HH-HHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLA---LH-QKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~---vg-ykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|.+|...|..|++.+-+.+.. |. -+=|.+++|.++.+.||+-|-+++.++|.|-|+
T Consensus 111 yeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~kA 170 (271)
T KOG4234|consen 111 YEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEKA 170 (271)
T ss_pred HHHHHHHHHHHHHhCccccHHHHHHHHhhhHHHHHHhhhHHHHHHHHHhhHhcCchhHHH
Confidence 999999999999997443332 22 245889999999999999999999999999886
No 194
>PF13512 TPR_18: Tetratricopeptide repeat
Probab=95.42 E-value=0.14 Score=38.15 Aligned_cols=87 Identities=9% Similarity=0.043 Sum_probs=72.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh--------hC----HHHHHHHHHHHHHhhCCCChHH---HHHHHHHHHhc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN--------LF----PDDAVVQYTQACRIIGTSDLAL---HQKLASLLLRM 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--------~l----y~dAa~~ye~Aw~l~~~~~~~v---gykLA~~ylk~ 85 (122)
-.++.|+|++|++.|+ +...-|....+-. |. |.+|+..|++-.++ +|++|.| .|.-|..++..
T Consensus 19 ~~l~~~~Y~~A~~~le~L~~ryP~g~ya~qAqL~l~yayy~~~~y~~A~a~~~rFirL-hP~hp~vdYa~Y~~gL~~~~~ 97 (142)
T PF13512_consen 19 EALQKGNYEEAIKQLEALDTRYPFGEYAEQAQLDLAYAYYKQGDYEEAIAAYDRFIRL-HPTHPNVDYAYYMRGLSYYEQ 97 (142)
T ss_pred HHHHhCCHHHHHHHHHHHHhcCCCCcccHHHHHHHHHHHHHccCHHHHHHHHHHHHHh-CCCCCCccHHHHHHHHHHHHH
Confidence 4568899999999999 8888887554433 11 99999999999999 8999876 56778888888
Q ss_pred cc---------------HHHHHHHHHHHHHhCCCCHHHH
Q psy16144 86 KR---------------YAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 86 kr---------------~~~AI~vc~kVL~~~P~ypkir 109 (122)
.. ..+|..-++++++.+|+++-+.
T Consensus 98 ~~~~~~~~~~~drD~~~~~~A~~~f~~lv~~yP~S~ya~ 136 (142)
T PF13512_consen 98 DEGSLQSFFRSDRDPTPARQAFRDFEQLVRRYPNSEYAA 136 (142)
T ss_pred hhhHHhhhcccccCcHHHHHHHHHHHHHHHHCcCChhHH
Confidence 77 8899999999999999987553
No 195
>KOG4555|consensus
Probab=95.42 E-value=0.84 Score=34.69 Aligned_cols=100 Identities=11% Similarity=0.006 Sum_probs=74.4
Q ss_pred cccchhHHHHHHHHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHh------hC---HHHHHHHHHHHHHhhCC
Q psy16144 2 KSLSVDTMQIIQTIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYN------LF---PDDAVVQYTQACRIIGT 69 (122)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e------~l---y~dAa~~ye~Aw~l~~~ 69 (122)
++=+.||-+|-+..+ +-.- .....|+.|.|.|.|. ++..-|..+-+|. .+ ..+|..-+++|++++++
T Consensus 31 ede~~~~~~~e~S~~-LEl~~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~ 109 (175)
T KOG4555|consen 31 KDEEPDTQAIKASRE-LELKAIALAEAGDLDGALELFGQALCLAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGD 109 (175)
T ss_pred cccCCchHHHHHHHH-HHHHHHHHHhccchHHHHHHHHHHHHhcccchHhhccHHHHHHHcCChHHHHHHHHHHHHhcCc
Confidence 344567776665543 2222 5678899999999999 9999999888887 22 89999999999999887
Q ss_pred CChH---HHHHHHHHHHhcccHHHHHHHHHHHHHhC
Q psy16144 70 SDLA---LHQKLASLLLRMKRYAEAIDVSQTIMKAH 102 (122)
Q Consensus 70 ~~~~---vgykLA~~ylk~kr~~~AI~vc~kVL~~~ 102 (122)
.--+ +..-=|..|-+.|+-++|-.=++.+-..-
T Consensus 110 ~trtacqa~vQRg~lyRl~g~dd~AR~DFe~AA~LG 145 (175)
T KOG4555|consen 110 QTRTACQAFVQRGLLYRLLGNDDAARADFEAAAQLG 145 (175)
T ss_pred cchHHHHHHHHHHHHHHHhCchHHHHHhHHHHHHhC
Confidence 7443 34455788999999888877776544433
No 196
>PHA02537 M terminase endonuclease subunit; Provisional
Probab=95.33 E-value=0.18 Score=40.20 Aligned_cols=96 Identities=14% Similarity=0.000 Sum_probs=63.4
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHhhC--------------------------HHHHHHHHHHHHHhhCCCChHHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYNLF--------------------------PDDAVVQYTQACRIIGTSDLALH 75 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e~l--------------------------y~dAa~~ye~Aw~l~~~~~~~vg 75 (122)
.+..|+|+.|.+..+ ++++|-..|.-|..- +-........-|.+ +..-.+=.
T Consensus 93 ~~D~Gd~~~AL~ia~yAI~~~l~~Pd~f~R~~~t~vaeev~~~A~~~~~ag~~~e~~~~~~~~~l~~~~dm-pd~vrAKl 171 (230)
T PHA02537 93 RFDIGDFDGALEIAEYALEHGLTMPDQFRRTLANFVAEEVANAALKAASAGESVEPYFLRVFLDLTTEWDM-PDEVRAKL 171 (230)
T ss_pred eeeccCHHHHHHHHHHHHHcCCCCCccccCCchHHHHHHHHHHHHHHHHcCCCCChHHHHHHHHHHhcCCC-ChHHHHHH
Confidence 457899999999999 999998888776622 22222222233444 12112225
Q ss_pred HHHHHHHH----------hcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhccC
Q psy16144 76 QKLASLLL----------RMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMVHLR 121 (122)
Q Consensus 76 ykLA~~yl----------k~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~~lr 121 (122)
||.+-..+ ..++...|++.+++++..|| ..-++++| ++....||
T Consensus 172 ~K~~G~~llr~~~g~~~~d~~~l~~Al~~L~rA~~l~~-k~GVK~~i-~~l~~~lr 225 (230)
T PHA02537 172 YKAAGYLLLRNEKGEPIGDAETLQLALALLQRAFQLND-KCGVKKDI-ERLERRLK 225 (230)
T ss_pred HHHHHHHHhhcccCCCccCcccHHHHHHHHHHHHHhCC-CCChHHHH-HHHHHHHh
Confidence 56555444 45688899999999999998 66677766 55555554
No 197
>KOG1129|consensus
Probab=95.19 E-value=0.061 Score=46.02 Aligned_cols=89 Identities=11% Similarity=0.080 Sum_probs=65.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
+|-..+.+++|.++.+ +++..|.+..|.- |. ++=|..+|++.+.+ +-.+|....++|-+-+-.++++-+
T Consensus 299 i~eam~~~~~a~~lYk~vlk~~~~nvEaiAcia~~yfY~~~PE~AlryYRRiLqm-G~~speLf~NigLCC~yaqQ~D~~ 377 (478)
T KOG1129|consen 299 IHEAMEQQEDALQLYKLVLKLHPINVEAIACIAVGYFYDNNPEMALRYYRRILQM-GAQSPELFCNIGLCCLYAQQIDLV 377 (478)
T ss_pred HHHHHHhHHHHHHHHHHHHhcCCccceeeeeeeeccccCCChHHHHHHHHHHHHh-cCCChHHHhhHHHHHHhhcchhhh
Confidence 5666788999999999 9999998776654 11 77788888888888 467788888888888888888887
Q ss_pred HHHHHHHHHhCCCCHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~i 112 (122)
+..++++|..-- +|....||
T Consensus 378 L~sf~RAlstat-~~~~aaDv 397 (478)
T KOG1129|consen 378 LPSFQRALSTAT-QPGQAADV 397 (478)
T ss_pred HHHHHHHHhhcc-Ccchhhhh
Confidence 777777775443 44444444
No 198
>KOG1156|consensus
Probab=95.18 E-value=0.17 Score=45.86 Aligned_cols=85 Identities=15% Similarity=0.145 Sum_probs=75.9
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLR 84 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk 84 (122)
-||.+ ++-...+|++|.+..+ +++..|++...|.-+ |+-.+..-.+.|++ .|+..+-.+.+|.++-.
T Consensus 77 CwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmRd~~~~~~tr~~LLql-~~~~ra~w~~~Avs~~L 155 (700)
T KOG1156|consen 77 CWHVLGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMRDYEGYLETRNQLLQL-RPSQRASWIGFAVAQHL 155 (700)
T ss_pred hHHHHHHHHhhhhhHHHHHHHHHHHHhcCCCcHHHHHHHHHHHHHHHhhhhHHHHHHHHHHh-hhhhHHHHHHHHHHHHH
Confidence 49999 6777788999999999 999999999988833 89999999999999 79999999999999999
Q ss_pred cccHHHHHHHHHHHHHhC
Q psy16144 85 MKRYAEAIDVSQTIMKAH 102 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL~~~ 102 (122)
.|.|..|+.+-...-+..
T Consensus 156 ~g~y~~A~~il~ef~~t~ 173 (700)
T KOG1156|consen 156 LGEYKMALEILEEFEKTQ 173 (700)
T ss_pred HHHHHHHHHHHHHHHHhh
Confidence 999999999887766655
No 199
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=95.15 E-value=0.38 Score=39.29 Aligned_cols=83 Identities=6% Similarity=0.039 Sum_probs=68.5
Q ss_pred HH-HHhhCcchHHHHhhC---------------------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHH
Q psy16144 36 LR-KKQYISSATNIYNLF---------------------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 36 L~-~l~~nps~~kA~e~l---------------------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
|. .++-||++..+|..+ .+--...|++|++. +|++......+=....+.-.-.+...
T Consensus 8 l~~~v~~~P~di~~Wl~li~~Qd~~~~~~~~~~~~~~a~~E~klsilerAL~~-np~~~~L~l~~l~~~~~~~~~~~l~~ 86 (321)
T PF08424_consen 8 LNRRVRENPHDIEAWLELIEFQDELFRLQSSSKAERRALAERKLSILERALKH-NPDSERLLLGYLEEGEKVWDSEKLAK 86 (321)
T ss_pred HHHHHHhCcccHHHHHHHHHHHHHhccccccchhhHHHHHHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHhCCHHHHHH
Confidence 45 788899999999844 23455689999999 89999888887777778888888888
Q ss_pred HHHHHHHhCCCCHHHHHHHHHHHHhc
Q psy16144 94 VSQTIMKAHPSYPKVHKQIFEKAMVH 119 (122)
Q Consensus 94 vc~kVL~~~P~ypkir~~il~ka~~~ 119 (122)
-.+++|..||+++.++..-|+-.+..
T Consensus 87 ~we~~l~~~~~~~~LW~~yL~~~q~~ 112 (321)
T PF08424_consen 87 KWEELLFKNPGSPELWREYLDFRQSN 112 (321)
T ss_pred HHHHHHHHCCCChHHHHHHHHHHHHH
Confidence 99999999999999999988876553
No 200
>KOG4642|consensus
Probab=95.13 E-value=0.062 Score=43.93 Aligned_cols=56 Identities=21% Similarity=0.281 Sum_probs=51.4
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
|.+|+.+|-+|+.. +|+.++-..+=|-+|+|.++++.+..=|+++|...||-.+..
T Consensus 26 y~~ai~~y~raI~~-nP~~~~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk~h 81 (284)
T KOG4642|consen 26 YDDAIDCYSRAICI-NPTVASYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVKAH 81 (284)
T ss_pred hchHHHHHHHHHhc-CCCcchhhhhHHHHHHHhhhhhhhhhhHHHHHhcChHHHHHH
Confidence 89999999999999 799999999999999999999999999999999999877653
No 201
>KOG4507|consensus
Probab=95.04 E-value=0.18 Score=46.00 Aligned_cols=99 Identities=16% Similarity=0.099 Sum_probs=81.8
Q ss_pred Hhc-CCCChHHHHHHHH-HHhhCcchHHH------Hh----hCHHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHH
Q psy16144 22 LYL-LPWRWEPLSAGLR-KKQYISSATNI------YN----LFPDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYA 89 (122)
Q Consensus 22 Lyi-~~gk~d~A~elL~-~l~~nps~~kA------~e----~ly~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~ 89 (122)
||- -+|..-.|...|. ++-..|..... -- -|.-||-..+.+++.+ +-+-|-..|.+|..|+..++..
T Consensus 615 lywr~~gn~~~a~~cl~~a~~~~p~~~~v~~v~la~~~~~~~~~~da~~~l~q~l~~-~~sepl~~~~~g~~~l~l~~i~ 693 (886)
T KOG4507|consen 615 LYWRAVGNSTFAIACLQRALNLAPLQQDVPLVNLANLLIHYGLHLDATKLLLQALAI-NSSEPLTFLSLGNAYLALKNIS 693 (886)
T ss_pred ceeeecCCcHHHHHHHHHHhccChhhhcccHHHHHHHHHHhhhhccHHHHHHHHHhh-cccCchHHHhcchhHHHHhhhH
Confidence 444 3588889999998 88877763221 11 1288999999999999 6888999999999999999999
Q ss_pred HHHHHHHHHHHhCCCCHHHHHHHHHHHHhccC
Q psy16144 90 EAIDVSQTIMKAHPSYPKVHKQIFEKAMVHLR 121 (122)
Q Consensus 90 ~AI~vc~kVL~~~P~ypkir~~il~ka~~~lr 121 (122)
.||+.++++++.+|+.|-+..+.+--++.+..
T Consensus 694 ~a~~~~~~a~~~~~~~~~~~~~l~~i~c~~~~ 725 (886)
T KOG4507|consen 694 GALEAFRQALKLTTKCPECENSLKLIRCMQFY 725 (886)
T ss_pred HHHHHHHHHHhcCCCChhhHHHHHHHHHhhhh
Confidence 99999999999999999999998877765543
No 202
>KOG0545|consensus
Probab=94.90 E-value=0.1 Score=43.07 Aligned_cols=56 Identities=21% Similarity=0.313 Sum_probs=44.1
Q ss_pred HHHHHHHHHHHHH-------hhCCCChH----------HHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACR-------IIGTSDLA----------LHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~-------l~~~~~~~----------vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|++|+..|+.|+. -.+|.+|. ...+.+.++++.|.|-++++.|-.||..+|++.|+
T Consensus 194 ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~e~yevleh~seiL~~~~~nvKA 266 (329)
T KOG0545|consen 194 YKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKEEYYEVLEHCSEILRHHPGNVKA 266 (329)
T ss_pred HHHHHHHHHHHHHHHHHHHhccCCCChHHHHHHHhhhHHHHhHHHHHhhHHHHHHHHHHHHHHHhcCCchHHH
Confidence 7777777776643 23444443 46778899999999999999999999999999876
No 203
>KOG4648|consensus
Probab=94.83 E-value=0.13 Score=44.31 Aligned_cols=106 Identities=13% Similarity=0.095 Sum_probs=68.7
Q ss_pred cchhHH--HHHHHHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHH-----HHhhC----HHHHHHHHHHHHHhhCC
Q psy16144 4 LSVDTM--QIIQTIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATN-----IYNLF----PDDAVVQYTQACRIIGT 69 (122)
Q Consensus 4 ~~~~~~--~~~~~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~k-----A~e~l----y~dAa~~ye~Aw~l~~~ 69 (122)
|+||.. +++..-.|+--- -|..+|||++|.+... .+...|-++- |.-|+ |.-|..--+.|+.+ +.
T Consensus 84 L~vd~I~~~LL~~~SEiKE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaL-d~ 162 (536)
T KOG4648|consen 84 LPVDPIAQQLLKKASEIKERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIAL-DK 162 (536)
T ss_pred CCccHHHHHHHHhhHHHHHhhhhhhhccchhHHHHHhhhhhccCCCCccchhhHHHHHHHHHHHHHHHHhHHHHHHh-hH
Confidence 555542 333333442222 7899999999999999 8888774332 22233 66666655666665 34
Q ss_pred CChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 70 SDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 70 ~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
.-.-+.-+=|..-.-.|+..+|-.=|+.||++.|+.-.+|+
T Consensus 163 ~Y~KAYSRR~~AR~~Lg~~~EAKkD~E~vL~LEP~~~ELkK 203 (536)
T KOG4648|consen 163 LYVKAYSRRMQARESLGNNMEAKKDCETVLALEPKNIELKK 203 (536)
T ss_pred HHHHHHHHHHHHHHHHhhHHHHHHhHHHHHhhCcccHHHHH
Confidence 33333334444445567888999999999999999887775
No 204
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=94.45 E-value=0.19 Score=40.96 Aligned_cols=60 Identities=15% Similarity=0.143 Sum_probs=52.2
Q ss_pred HHHHHHHHHHHHHhhCCCC---hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSD---LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~---~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il 113 (122)
|.+|...|..=.+. -|++ |.+.|=||-++...|+|.+|..++..+.+.+|++||.=.-.|
T Consensus 157 y~~A~~~F~~fi~~-YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdall 219 (262)
T COG1729 157 YAEAEQAFQAFIKK-YPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALL 219 (262)
T ss_pred HHHHHHHHHHHHHc-CCCCcccchhHHHHHHHHHhcccchHHHHHHHHHHHhCCCCCCChHHHH
Confidence 88999988888887 4665 678999999999999999999999999999999999843333
No 205
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=94.27 E-value=0.34 Score=38.66 Aligned_cols=90 Identities=10% Similarity=0.168 Sum_probs=63.9
Q ss_pred HHHHHH-HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHH
Q psy16144 16 AEWHFL-LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF----------PDDAVVQYTQACRIIGTSDLALHQKLASLLL 83 (122)
Q Consensus 16 ~~~~~L-Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~yl 83 (122)
...+++ ..-+.+..+.|..+|. |++-++.+...|... .+-|...||.+++.- |+++.+...-...+.
T Consensus 3 v~i~~m~~~~r~~g~~~aR~vF~~a~~~~~~~~~vy~~~A~~E~~~~~d~~~A~~Ife~glk~f-~~~~~~~~~Y~~~l~ 81 (280)
T PF05843_consen 3 VWIQYMRFMRRTEGIEAARKVFKRARKDKRCTYHVYVAYALMEYYCNKDPKRARKIFERGLKKF-PSDPDFWLEYLDFLI 81 (280)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCS-THHHHHHHHHHHHTCS-HHHHHHHHHHHHHHH-TT-HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhCChHHHHHHHHHHHcCCCCCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHH
Confidence 333444 4445555888889998 887666656555522 455889999999884 888888888888888
Q ss_pred hcccHHHHHHHHHHHHHhCCCCH
Q psy16144 84 RMKRYAEAIDVSQTIMKAHPSYP 106 (122)
Q Consensus 84 k~kr~~~AI~vc~kVL~~~P~yp 106 (122)
+.++...|-.++++++..-|...
T Consensus 82 ~~~d~~~aR~lfer~i~~l~~~~ 104 (280)
T PF05843_consen 82 KLNDINNARALFERAISSLPKEK 104 (280)
T ss_dssp HTT-HHHHHHHHHHHCCTSSCHH
T ss_pred HhCcHHHHHHHHHHHHHhcCchh
Confidence 88998888888888887766444
No 206
>PF07720 TPR_3: Tetratricopeptide repeat; InterPro: IPR011716 This entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones [].; PDB: 3KS2_O 3GZ2_A 3GZ1_A 3GYZ_A 4AM9_A 2VGX_A 2VGY_A.
Probab=94.27 E-value=0.18 Score=29.03 Aligned_cols=33 Identities=18% Similarity=0.104 Sum_probs=25.6
Q ss_pred hHHHHHHHHHHHhcccHHHHHHH--HHHHHHhCCC
Q psy16144 72 LALHQKLASLLLRMKRYAEAIDV--SQTIMKAHPS 104 (122)
Q Consensus 72 ~~vgykLA~~ylk~kr~~~AI~v--c~kVL~~~P~ 104 (122)
+...|.||+++-..|+|.+|+++ ++-....+|.
T Consensus 1 ~e~~y~~a~~~y~~~ky~~A~~~~~y~~l~~ld~~ 35 (36)
T PF07720_consen 1 PEYLYGLAYNFYQKGKYDEAIHFFQYAFLCALDKY 35 (36)
T ss_dssp HHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHTTT
T ss_pred CcHHHHHHHHHHHHhhHHHHHHHHHHHHHHHhccc
Confidence 35678999999999999999999 4466666664
No 207
>KOG2053|consensus
Probab=94.21 E-value=0.17 Score=47.31 Aligned_cols=80 Identities=15% Similarity=0.120 Sum_probs=65.9
Q ss_pred cCCCChHHHHHHHH-HHhhCcc--hHHHHhhC-------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHH
Q psy16144 24 LLPWRWEPLSAGLR-KKQYISS--ATNIYNLF-------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 24 i~~gk~d~A~elL~-~l~~nps--~~kA~e~l-------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
+.++.|..|...+. .++.-|+ |++++..+ ++||-.+.|-.-.. +++|....--|-++|-..+++++|..
T Consensus 20 ld~~qfkkal~~~~kllkk~Pn~~~a~vLkaLsl~r~gk~~ea~~~Le~~~~~-~~~D~~tLq~l~~~y~d~~~~d~~~~ 98 (932)
T KOG2053|consen 20 LDSSQFKKALAKLGKLLKKHPNALYAKVLKALSLFRLGKGDEALKLLEALYGL-KGTDDLTLQFLQNVYRDLGKLDEAVH 98 (932)
T ss_pred hhhHHHHHHHHHHHHHHHHCCCcHHHHHHHHHHHHHhcCchhHHHHHhhhccC-CCCchHHHHHHHHHHHHHhhhhHHHH
Confidence 46788999999999 8888777 56666633 89998766655555 78888888899999999999999999
Q ss_pred HHHHHHHhCCC
Q psy16144 94 VSQTIMKAHPS 104 (122)
Q Consensus 94 vc~kVL~~~P~ 104 (122)
+|+++...+|+
T Consensus 99 ~Ye~~~~~~P~ 109 (932)
T KOG2053|consen 99 LYERANQKYPS 109 (932)
T ss_pred HHHHHHhhCCc
Confidence 99999999997
No 208
>PF07721 TPR_4: Tetratricopeptide repeat; InterPro: IPR011717 This entry includes tetratricopeptide-like repeats not detected by the IPR001440 from INTERPRO, IPR013105 from INTERPRO and IPR011716 from INTERPRO models. The tetratricopeptide repeat (TPR) motif is a protein-protein interaction module found in multiple copies in a number of functionally different proteins that facilitates specific interactions with a partner protein(s) [].; GO: 0042802 identical protein binding
Probab=93.86 E-value=0.092 Score=27.60 Aligned_cols=26 Identities=23% Similarity=0.068 Sum_probs=23.0
Q ss_pred hHHHHHHHHHHHhcccHHHHHHHHHH
Q psy16144 72 LALHQKLASLLLRMKRYAEAIDVSQT 97 (122)
Q Consensus 72 ~~vgykLA~~ylk~kr~~~AI~vc~k 97 (122)
|.+.+.||..+...|++++|..++++
T Consensus 1 ~~a~~~la~~~~~~G~~~eA~~~l~~ 26 (26)
T PF07721_consen 1 PRARLALARALLAQGDPDEAERLLRR 26 (26)
T ss_pred CHHHHHHHHHHHHcCCHHHHHHHHhC
Confidence 56788999999999999999998763
No 209
>KOG2376|consensus
Probab=93.60 E-value=0.34 Score=43.71 Aligned_cols=49 Identities=29% Similarity=0.394 Sum_probs=29.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
.++|.+.++ .+ ++.++-..--=|..+-+.++|.+|.+||++..+-+-+.
T Consensus 95 ~Dealk~~~---~~-~~~~~~ll~L~AQvlYrl~~ydealdiY~~L~kn~~dd 143 (652)
T KOG2376|consen 95 LDEALKTLK---GL-DRLDDKLLELRAQVLYRLERYDEALDIYQHLAKNNSDD 143 (652)
T ss_pred HHHHHHHHh---cc-cccchHHHHHHHHHHHHHhhHHHHHHHHHHHHhcCCch
Confidence 555555555 11 33333333333677777788888888888887666543
No 210
>PRK04841 transcriptional regulator MalT; Provisional
Probab=93.58 E-value=0.74 Score=41.31 Aligned_cols=80 Identities=9% Similarity=-0.066 Sum_probs=61.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHH-----HHh-----h--C--HHHHHHHHHHHHHhhCCCC-----hHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATN-----IYN-----L--F--PDDAVVQYTQACRIIGTSD-----LALHQKLASL 81 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~k-----A~e-----~--l--y~dAa~~ye~Aw~l~~~~~-----~~vgykLA~~ 81 (122)
++...|++++|..+++ ++...+.... +.. + . +++|...+++|........ ..+...||..
T Consensus 461 ~~~~~g~~~~A~~~~~~al~~~~~~~~~~~~~a~~~lg~~~~~~G~~~~A~~~~~~al~~~~~~g~~~~~~~~~~~la~~ 540 (903)
T PRK04841 461 VAINDGDPEEAERLAELALAELPLTWYYSRIVATSVLGEVHHCKGELARALAMMQQTEQMARQHDVYHYALWSLLQQSEI 540 (903)
T ss_pred HHHhCCCHHHHHHHHHHHHhcCCCccHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHH
Confidence 7788999999999999 9886554221 111 1 1 9999999999998743221 2355788999
Q ss_pred HHhcccHHHHHHHHHHHHHh
Q psy16144 82 LLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~ 101 (122)
+...|++.+|.+.++++++.
T Consensus 541 ~~~~G~~~~A~~~~~~al~~ 560 (903)
T PRK04841 541 LFAQGFLQAAYETQEKAFQL 560 (903)
T ss_pred HHHCCCHHHHHHHHHHHHHH
Confidence 99999999999999999875
No 211
>KOG2376|consensus
Probab=93.47 E-value=0.5 Score=42.68 Aligned_cols=53 Identities=25% Similarity=0.254 Sum_probs=39.5
Q ss_pred HHHHHHHHHHHHHhh-----CCCC---------hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 53 PDDAVVQYTQACRII-----GTSD---------LALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~-----~~~~---------~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
|.+|...+++|.+++ +.+. ..+..-|||+|...|+-.+|-.+|..+++.+|..
T Consensus 191 y~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~~i~~~~~D 257 (652)
T KOG2376|consen 191 YNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVDIIKRNPAD 257 (652)
T ss_pred HHHHHHHHHHHHHHHHHhhcccccchhhHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHhcCCC
Confidence 888888888884332 2222 2356778999999999999999999988888843
No 212
>PF10255 Paf67: RNA polymerase I-associated factor PAF67; InterPro: IPR019382 RNA polymerase I is a multi-subunit enzyme and its transcription competence is dependent on the presence of PAF67 [].
Probab=93.29 E-value=0.13 Score=44.13 Aligned_cols=61 Identities=16% Similarity=0.183 Sum_probs=48.1
Q ss_pred HhcCCCChHHHHHHHHHHhhCcchHHHHhhCHHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLRKKQYISSATNIYNLFPDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL 99 (122)
+++-.|+|..|.+.+.-+..|+.- .| |+- -+...++.|-+||+||-.+||.|||.++..+|
T Consensus 131 vh~LLGDY~~Alk~l~~idl~~~~-------------l~---~~V-~~~~is~~YyvGFaylMlrRY~DAir~f~~iL 191 (404)
T PF10255_consen 131 VHCLLGDYYQALKVLENIDLNKKG-------------LY---TKV-PACHISTYYYVGFAYLMLRRYADAIRTFSQIL 191 (404)
T ss_pred HHHhccCHHHHHHHhhccCcccch-------------hh---ccC-cchheehHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 788889999999988844545431 11 333 36678899999999999999999999999887
No 213
>PF13176 TPR_7: Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=93.20 E-value=0.078 Score=29.68 Aligned_cols=23 Identities=9% Similarity=-0.003 Sum_probs=17.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS 44 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps 44 (122)
+|.+.|+|++|+++++ +|...++
T Consensus 8 ~~~~~g~~~~Ai~~y~~aL~l~~~ 31 (36)
T PF13176_consen 8 IYRQQGDYEKAIEYYEQALALARD 31 (36)
T ss_dssp HHHHCT-HHHHHHHHHHHHHHHHH
T ss_pred HHHHcCCHHHHHHHHHHHHHhccc
Confidence 8888899999998888 7765544
No 214
>KOG1585|consensus
Probab=93.07 E-value=1.1 Score=37.02 Aligned_cols=88 Identities=15% Similarity=0.135 Sum_probs=67.4
Q ss_pred HhcCCCChHHHHHHHH-HH---hhCcc---hHHHHh---hC------HHHHHHHHHHHHH----hhCCCChHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KK---QYISS---ATNIYN---LF------PDDAVVQYTQACR----IIGTSDLALHQKLASL 81 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l---~~nps---~~kA~e---~l------y~dAa~~ye~Aw~----l~~~~~~~vgykLA~~ 81 (122)
-|-.+++|++|...|. +. +.|+| -.|++| .| +.|++..|++|.. .+.|+-.+....-|.-
T Consensus 40 afRnAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~eKAs~lY~E~GspdtAAmaleKAak 119 (308)
T KOG1585|consen 40 AFRNAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVECGSPDTAAMALEKAAK 119 (308)
T ss_pred HHHhhccHHHHHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhCCcchHHHHHHHHHH
Confidence 4557788999999888 76 45666 456677 22 8999999999965 4556666677777888
Q ss_pred HHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 82 LLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
.++..+.++|+.+|++.+.+-...-+.+
T Consensus 120 ~lenv~Pd~AlqlYqralavve~~dr~~ 147 (308)
T KOG1585|consen 120 ALENVKPDDALQLYQRALAVVEEDDRDQ 147 (308)
T ss_pred HhhcCCHHHHHHHHHHHHHHHhccchHH
Confidence 8999999999999999998766554444
No 215
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=92.98 E-value=0.89 Score=36.38 Aligned_cols=92 Identities=21% Similarity=0.204 Sum_probs=68.7
Q ss_pred hcCCCChHHHHHHHH-HHhhCcc--hHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISS--ATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps--~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~ 90 (122)
.+..+++.+|...|. ..++||. .+.....+ |.||...||.|..+ -| .|..-...+--+.++||..+
T Consensus 134 qfa~~~~A~a~~tLe~l~e~~pa~r~pd~~Ll~aR~laa~g~~a~Aesafe~a~~~-yp-g~~ar~~Y~e~La~qgr~~e 211 (251)
T COG4700 134 QFAIQEFAAAQQTLEDLMEYNPAFRSPDGHLLFARTLAAQGKYADAESAFEVAISY-YP-GPQARIYYAEMLAKQGRLRE 211 (251)
T ss_pred HHhhccHHHHHHHHHHHhhcCCccCCCCchHHHHHHHHhcCCchhHHHHHHHHHHh-CC-CHHHHHHHHHHHHHhcchhH
Confidence 445688889999999 9998886 23333222 89999999999988 35 46777777888999998776
Q ss_pred HH----HHHHHHHHhCCCCHHHHHHHHHHH
Q psy16144 91 AI----DVSQTIMKAHPSYPKVHKQIFEKA 116 (122)
Q Consensus 91 AI----~vc~kVL~~~P~ypkir~~il~ka 116 (122)
|- ++...+.+.+|-|.|=.++-++-|
T Consensus 212 a~aq~~~v~d~~~r~~~H~rkh~reW~~~A 241 (251)
T COG4700 212 ANAQYVAVVDTAKRSRPHYRKHHREWIKTA 241 (251)
T ss_pred HHHHHHHHHHHHHhcchhHHHHHHHHHHHH
Confidence 54 556667778888888777777776
No 216
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=92.95 E-value=0.19 Score=30.29 Aligned_cols=29 Identities=14% Similarity=0.231 Sum_probs=25.7
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHhCCC
Q psy16144 75 HQKLASLLLRMKRYAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 75 gykLA~~ylk~kr~~~AI~vc~kVL~~~P~ 104 (122)
-+.||..|..+|+++.|.++.++|+. .++
T Consensus 2 kLdLA~ayie~Gd~e~Ar~lL~evl~-~~~ 30 (44)
T TIGR03504 2 KLDLARAYIEMGDLEGARELLEEVIE-EGD 30 (44)
T ss_pred chHHHHHHHHcCChHHHHHHHHHHHH-cCC
Confidence 37899999999999999999999994 554
No 217
>KOG0550|consensus
Probab=92.82 E-value=0.4 Score=41.84 Aligned_cols=87 Identities=11% Similarity=0.089 Sum_probs=65.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh--hC-----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN--LF-----------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--~l-----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr 87 (122)
-..+.|+|..|.|... +|..+|++.+-.- |+ .+||+.--+.|.++ ++.-..+...=|.+++-.++
T Consensus 258 ~~fk~G~y~~A~E~Yteal~idP~n~~~naklY~nra~v~~rLgrl~eaisdc~~Al~i-D~syikall~ra~c~l~le~ 336 (486)
T KOG0550|consen 258 DAFKNGNYRKAYECYTEALNIDPSNKKTNAKLYGNRALVNIRLGRLREAISDCNEALKI-DSSYIKALLRRANCHLALEK 336 (486)
T ss_pred hHhhccchhHHHHHHHHhhcCCccccchhHHHHHHhHhhhcccCCchhhhhhhhhhhhc-CHHHHHHHHHHHHHHHHHHH
Confidence 3456688888888888 8888888554332 22 78888888888888 67777777777888888888
Q ss_pred HHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 88 YAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 88 ~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
+++|.+=+++++...-+ +.+|.
T Consensus 337 ~e~AV~d~~~a~q~~~s-~e~r~ 358 (486)
T KOG0550|consen 337 WEEAVEDYEKAMQLEKD-CEIRR 358 (486)
T ss_pred HHHHHHHHHHHHhhccc-cchHH
Confidence 88888888888877666 66664
No 218
>KOG3824|consensus
Probab=92.72 E-value=0.21 Score=42.60 Aligned_cols=55 Identities=9% Similarity=0.135 Sum_probs=45.8
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
-+.|.+.|+.|+.+ .|++|.+..+.|....-.+..++|=.+|-++|...|.+-++
T Consensus 132 ~ekA~~lfeHAlal-aP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseA 186 (472)
T KOG3824|consen 132 LEKAMTLFEHALAL-APTNPQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEA 186 (472)
T ss_pred hHHHHHHHHHHHhc-CCCCHHHHHHHhHHHHhhhhhHhhhhhhheeeeeCCCchHH
Confidence 67788888888888 68888888888888888888888888888888888877654
No 219
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=92.58 E-value=0.67 Score=31.11 Aligned_cols=31 Identities=23% Similarity=0.268 Sum_probs=27.0
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
.+...+|..+...|.+.+|+..+++++++-.
T Consensus 42 ~all~lA~~~~~~G~~~~A~~~l~eAi~~Ar 72 (94)
T PF12862_consen 42 YALLNLAELHRRFGHYEEALQALEEAIRLAR 72 (94)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence 4567899999999999999999999987654
No 220
>PF03704 BTAD: Bacterial transcriptional activator domain; InterPro: IPR005158 Found in the DNRI/REDD/AFSR family of regulators, this region of AFSR (P25941 from SWISSPROT) along with the C-terminal region is capable of independently directing actinorhodin production. It is important for the formation of secondary metabolites.; PDB: 2FF4_B 2FEZ_A.
Probab=92.49 E-value=0.58 Score=32.86 Aligned_cols=85 Identities=14% Similarity=0.008 Sum_probs=57.0
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHh-h-C-HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYN-L-F-PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e-~-l-y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kV 98 (122)
..+.|..+.+.+.++ ++..-.+..-.-. . - .......++..+ ..+.-+|+..+...|++.+|+..|++.
T Consensus 16 ~~~~~~~~~~~~~~~~al~ly~G~~l~~~~~~~W~~~~r~~l~~~~-------~~~~~~l~~~~~~~~~~~~a~~~~~~~ 88 (146)
T PF03704_consen 16 AARAGDPEEAIELLEEALALYRGDFLPDLDDEEWVEPERERLRELY-------LDALERLAEALLEAGDYEEALRLLQRA 88 (146)
T ss_dssp HHHTT-HHHHHHHHHHHHTT--SSTTGGGTTSTTHHHHHHHHHHHH-------HHHHHHHHHHHHHTT-HHHHHHHHHHH
T ss_pred HHHCCCHHHHHHHHHHHHHHhCCCCCCCCCccHHHHHHHHHHHHHH-------HHHHHHHHHHHHhccCHHHHHHHHHHH
Confidence 345678889999999 8887766432221 1 1 333333344332 467888999999999999999999999
Q ss_pred HHhCCCCHHHHHHHHH
Q psy16144 99 MKAHPSYPKVHKQIFE 114 (122)
Q Consensus 99 L~~~P~ypkir~~il~ 114 (122)
+..+|.+-.+....|.
T Consensus 89 l~~dP~~E~~~~~lm~ 104 (146)
T PF03704_consen 89 LALDPYDEEAYRLLMR 104 (146)
T ss_dssp HHHSTT-HHHHHHHHH
T ss_pred HhcCCCCHHHHHHHHH
Confidence 9999999888765543
No 221
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=92.39 E-value=0.12 Score=25.02 Aligned_cols=23 Identities=9% Similarity=-0.026 Sum_probs=19.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS 44 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps 44 (122)
+|...|++++|...++ +++.+|+
T Consensus 10 ~~~~~~~~~~a~~~~~~~~~~~~~ 33 (34)
T smart00028 10 AYLKLGDYDEALEYYEKALELDPN 33 (34)
T ss_pred HHHHHhhHHHHHHHHHHHHccCCC
Confidence 7777899999999999 8888876
No 222
>KOG0551|consensus
Probab=92.37 E-value=0.67 Score=39.53 Aligned_cols=55 Identities=25% Similarity=0.325 Sum_probs=31.5
Q ss_pred HHHHHHHHHHHHHhhCCCChHHH----HHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALH----QKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vg----ykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|++|..+|-+.++- +-.||.+- .+=|.+-+-.|+|.-||.=|.++|..+|+|.|.
T Consensus 97 yk~A~~~Yt~Glk~-kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka 155 (390)
T KOG0551|consen 97 YKDAVESYTEGLKK-KCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKA 155 (390)
T ss_pred HHHHHHHHHHHHhh-cCCCccHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhcCcchhhh
Confidence 66666666666655 34444432 233445555666666666666666666666554
No 223
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=92.30 E-value=0.91 Score=38.66 Aligned_cols=76 Identities=18% Similarity=0.078 Sum_probs=54.7
Q ss_pred HhcC---CCChHHHHHHHH-HHhh-CcchHHHHhhC------------------HHHHHHHHHHHHHhhCCCChHHHHHH
Q psy16144 22 LYLL---PWRWEPLSAGLR-KKQY-ISSATNIYNLF------------------PDDAVVQYTQACRIIGTSDLALHQKL 78 (122)
Q Consensus 22 Lyi~---~gk~d~A~elL~-~l~~-nps~~kA~e~l------------------y~dAa~~ye~Aw~l~~~~~~~vgykL 78 (122)
-..+ .|+.+.|.+.+. ++.- .+..+..+-.. .+.|+..|.++|+. + .+--.|-++
T Consensus 188 ALnRrn~~gdre~Al~il~~~l~~~~~~~~d~~gL~GRIyKD~~~~s~~~d~~~ldkAi~~Y~kgFe~-~-~~~Y~GIN~ 265 (374)
T PF13281_consen 188 ALNRRNKPGDREKALQILLPVLESDENPDPDTLGLLGRIYKDLFLESNFTDRESLDKAIEWYRKGFEI-E-PDYYSGINA 265 (374)
T ss_pred HHhhcccCCCHHHHHHHHHHHHhccCCCChHHHHHHHHHHHHHHHHcCccchHHHHHHHHHHHHHHcC-C-ccccchHHH
Confidence 4456 899999999998 5443 33344443311 88899999999999 4 578999999
Q ss_pred HHHHHhcccHHHHHHHHHHHH
Q psy16144 79 ASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc~kVL 99 (122)
|..+...|.-.+...-.++|-
T Consensus 266 AtLL~~~g~~~~~~~el~~i~ 286 (374)
T PF13281_consen 266 ATLLMLAGHDFETSEELRKIG 286 (374)
T ss_pred HHHHHHcCCcccchHHHHHHH
Confidence 999999997555443344443
No 224
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=92.28 E-value=0.7 Score=39.83 Aligned_cols=78 Identities=17% Similarity=0.044 Sum_probs=49.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc-----hHHHHhhC--------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS-----ATNIYNLF--------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps-----~~kA~e~l--------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr 87 (122)
++...|+.++|.+.++ ++.-..+ ..-.||+. |++|+.+|....+.++=+..-..|-.|-+|...++
T Consensus 276 ~~~~~g~~~~Ai~~~~~a~~~q~~~~Ql~~l~~~El~w~~~~~~~w~~A~~~f~~L~~~s~WSka~Y~Y~~a~c~~~l~~ 355 (468)
T PF10300_consen 276 LERLKGNLEEAIESFERAIESQSEWKQLHHLCYFELAWCHMFQHDWEEAAEYFLRLLKESKWSKAFYAYLAAACLLMLGR 355 (468)
T ss_pred HHHHhcCHHHHHHHHHHhccchhhHHhHHHHHHHHHHHHHHHHchHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcc
Confidence 6667777888888777 6632111 11122211 77787777777777655556667777777777777
Q ss_pred H-------HHHHHHHHHHH
Q psy16144 88 Y-------AEAIDVSQTIM 99 (122)
Q Consensus 88 ~-------~~AI~vc~kVL 99 (122)
. .+|.+..++|=
T Consensus 356 ~~~~~~~~~~a~~l~~~vp 374 (468)
T PF10300_consen 356 EEEAKEHKKEAEELFRKVP 374 (468)
T ss_pred chhhhhhHHHHHHHHHHHH
Confidence 7 66666666554
No 225
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=92.25 E-value=0.88 Score=34.54 Aligned_cols=66 Identities=14% Similarity=-0.071 Sum_probs=37.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHH-----HhhC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNI-----YNLF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRY 88 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA-----~e~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~ 88 (122)
+.++.++.++++.+|. .-...|+.+.. |..+ |.||...++..-+- .|.+|.+.=-||+++.-.|+.
T Consensus 19 ~al~~~~~~D~e~lL~ALrvLRP~~~e~~~~~~~l~i~r~~w~dA~rlLr~l~~~-~~~~p~~kALlA~CL~~~~D~ 94 (160)
T PF09613_consen 19 VALRLGDPDDAEALLDALRVLRPEFPELDLFDGWLHIVRGDWDDALRLLRELEER-APGFPYAKALLALCLYALGDP 94 (160)
T ss_pred HHHccCChHHHHHHHHHHHHhCCCchHHHHHHHHHHHHhCCHHHHHHHHHHHhcc-CCCChHHHHHHHHHHHHcCCh
Confidence 5556667777777777 55555553322 2222 66666666665544 466666666666666555553
No 226
>KOG3364|consensus
Probab=91.65 E-value=0.94 Score=34.09 Aligned_cols=64 Identities=14% Similarity=0.205 Sum_probs=51.4
Q ss_pred HHHHHHHHHHHHH-hhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH--HHHHHHH
Q psy16144 53 PDDAVVQYTQACR-IIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH--KQIFEKA 116 (122)
Q Consensus 53 y~dAa~~ye~Aw~-l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir--~~il~ka 116 (122)
-++-+..+|...+ -...+.-.--|=||+.+-|.|+|..++..|+..|+..||++.++ ++-++++
T Consensus 51 v~~GI~iLe~l~~~~~~~~rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~Lk~~ied~ 117 (149)
T KOG3364|consen 51 VQEGIVILEDLLKSAHPERRRECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQALELKETIEDK 117 (149)
T ss_pred HHHhHHHHHHHhhhcCcccchhhhhhhHHHHHHHhhHHHHHHHHHHHHhhCCCcHHHHHHHHHHHHH
Confidence 7788888888886 33344556788999999999999999999999999999998774 3444444
No 227
>PF14853 Fis1_TPR_C: Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=91.43 E-value=0.35 Score=30.12 Aligned_cols=31 Identities=6% Similarity=-0.223 Sum_probs=25.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF 52 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l 52 (122)
=+.+.|+|++|..+++ +|+..|++..|-++.
T Consensus 10 g~ykl~~Y~~A~~~~~~lL~~eP~N~Qa~~L~ 41 (53)
T PF14853_consen 10 GHYKLGEYEKARRYCDALLEIEPDNRQAQSLK 41 (53)
T ss_dssp HHHHTT-HHHHHHHHHHHHHHTTS-HHHHHHH
T ss_pred HHHHhhhHHHHHHHHHHHHhhCCCcHHHHHHH
Confidence 5668899999999999 999999998877643
No 228
>PF13174 TPR_6: Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=91.36 E-value=0.15 Score=26.83 Aligned_cols=24 Identities=4% Similarity=-0.036 Sum_probs=21.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcch
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSA 45 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~ 45 (122)
+|.+.|++++|.+.++ +++..|++
T Consensus 9 ~~~~~g~~~~A~~~~~~~~~~~P~s 33 (33)
T PF13174_consen 9 CYYKLGDYDEAIEYFQRLIKRYPDS 33 (33)
T ss_dssp HHHHHCHHHHHHHHHHHHHHHSTTS
T ss_pred HHHHccCHHHHHHHHHHHHHHCcCC
Confidence 7778899999999999 99988874
No 229
>PF13374 TPR_10: Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=91.14 E-value=0.34 Score=26.48 Aligned_cols=26 Identities=19% Similarity=0.112 Sum_probs=19.1
Q ss_pred HHHHHH--HhcCCCChHHHHHHHH-HHhh
Q psy16144 16 AEWHFL--LYLLPWRWEPLSAGLR-KKQY 41 (122)
Q Consensus 16 ~~~~~L--Lyi~~gk~d~A~elL~-~l~~ 41 (122)
.-++.| .|...|++++|.++++ ++..
T Consensus 3 ~~~~~la~~~~~~g~~~~A~~~~~~al~~ 31 (42)
T PF13374_consen 3 SALNNLANAYRAQGRYEEALELLEEALEI 31 (42)
T ss_dssp HHHHHHHHHHHHCT-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhhhcchhhHHHHHHHHH
Confidence 344555 7888899999999999 7764
No 230
>PRK04841 transcriptional regulator MalT; Provisional
Probab=91.13 E-value=1.2 Score=39.88 Aligned_cols=82 Identities=17% Similarity=0.106 Sum_probs=60.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHH----Hh-------hC----HHHHHHHHHHHHHhhCCCC----h---HHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNI----YN-------LF----PDDAVVQYTQACRIIGTSD----L---ALHQKL 78 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA----~e-------~l----y~dAa~~ye~Aw~l~~~~~----~---~vgykL 78 (122)
.+...|++++|...++ ++....+.... +- ++ +++|...++++..+..... + .+...+
T Consensus 500 ~~~~~G~~~~A~~~~~~al~~~~~~g~~~~~~~~~~~la~~~~~~G~~~~A~~~~~~al~~~~~~~~~~~~~~~~~~~~l 579 (903)
T PRK04841 500 VHHCKGELARALAMMQQTEQMARQHDVYHYALWSLLQQSEILFAQGFLQAAYETQEKAFQLIEEQHLEQLPMHEFLLRIR 579 (903)
T ss_pred HHHHcCCHHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHhccccccHHHHHHHHH
Confidence 6778899999999999 88764432111 11 11 9999999999998754321 2 234577
Q ss_pred HHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 79 ASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
|..+...|++.+|.+.+++++...+
T Consensus 580 a~~~~~~G~~~~A~~~~~~al~~~~ 604 (903)
T PRK04841 580 AQLLWEWARLDEAEQCARKGLEVLS 604 (903)
T ss_pred HHHHHHhcCHHHHHHHHHHhHHhhh
Confidence 8888999999999999999987644
No 231
>KOG4340|consensus
Probab=91.01 E-value=0.65 Score=39.57 Aligned_cols=56 Identities=16% Similarity=0.153 Sum_probs=50.8
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
|.||+.+...-.+- +|++.+..-.||++|-..++|..|.++|.+.=..+|.+.+-|
T Consensus 26 y~DaI~~l~s~~Er-~p~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~qYr 81 (459)
T KOG4340|consen 26 YADAIQLLGSELER-SPRSRAGLSLLGYCYYRLQEFALAAECYEQLGQLHPELEQYR 81 (459)
T ss_pred HHHHHHHHHHHHhc-CccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhChHHHHHH
Confidence 89999999998888 688888888999999999999999999999999999776654
No 232
>KOG0551|consensus
Probab=90.81 E-value=0.84 Score=38.95 Aligned_cols=79 Identities=15% Similarity=0.188 Sum_probs=66.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh--hC-----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN--LF-----------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--~l-----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr 87 (122)
=|...+||..|++.+. -|+.+..+++.-. |. |+.|..=-.+|..+ +|++.-+-++=|-+++-.++
T Consensus 90 ~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~-~P~h~Ka~~R~Akc~~eLe~ 168 (390)
T KOG0551|consen 90 EYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKL-KPTHLKAYIRGAKCLLELER 168 (390)
T ss_pred HHHHhhhHHHHHHHHHHHHhhcCCCccHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHhc-CcchhhhhhhhhHHHHHHHH
Confidence 5778899999999999 9988777655443 22 78888878889999 89999999999999999999
Q ss_pred HHHHHHHHHHHHHh
Q psy16144 88 YAEAIDVSQTIMKA 101 (122)
Q Consensus 88 ~~~AI~vc~kVL~~ 101 (122)
+.+|...|+.-|.+
T Consensus 169 ~~~a~nw~ee~~~~ 182 (390)
T KOG0551|consen 169 FAEAVNWCEEGLQI 182 (390)
T ss_pred HHHHHHHHhhhhhh
Confidence 99999999865543
No 233
>KOG1586|consensus
Probab=90.72 E-value=1.8 Score=35.58 Aligned_cols=82 Identities=17% Similarity=0.247 Sum_probs=53.5
Q ss_pred HhcCCCChHHHHHHHHHHhhCcchHHHHh-hC--HHHHHHHHHHHHHhhCCCCh-----HHHHHHHHHHHhcccHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLRKKQYISSATNIYN-LF--PDDAVVQYTQACRIIGTSDL-----ALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~~l~~nps~~kA~e-~l--y~dAa~~ye~Aw~l~~~~~~-----~vgykLA~~ylk~kr~~~AI~ 93 (122)
||...|+|..|-..-. +-...|| -+ ++.|+.+||+|=++-...-. --..|-|.---..++|.+||+
T Consensus 102 Iyt~~Grf~~aAk~~~------~iaEiyEsdl~d~ekaI~~YE~Aae~yk~ees~ssANKC~lKvA~yaa~leqY~~Ai~ 175 (288)
T KOG1586|consen 102 IYTDMGRFTMAAKHHI------EIAEIYESDLQDFEKAIAHYEQAAEYYKGEESVSSANKCLLKVAQYAAQLEQYSKAID 175 (288)
T ss_pred HHHhhhHHHHHHhhhh------hHHHHHhhhHHHHHHHHHHHHHHHHHHcchhhhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence 6666666666544322 2234444 22 88999999999776322111 123455666667899999999
Q ss_pred HHHHHHHhCCCCHHHH
Q psy16144 94 VSQTIMKAHPSYPKVH 109 (122)
Q Consensus 94 vc~kVL~~~P~ypkir 109 (122)
+|++|-+..-++|-.+
T Consensus 176 iyeqva~~s~~n~LLK 191 (288)
T KOG1586|consen 176 IYEQVARSSLDNNLLK 191 (288)
T ss_pred HHHHHHHHhccchHHH
Confidence 9999998887877654
No 234
>COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only]
Probab=90.40 E-value=0.61 Score=38.08 Aligned_cols=55 Identities=16% Similarity=0.224 Sum_probs=49.6
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
..=|..-|.+++.+ +|+-|+|..-||--|...|+|+.|.+.+.-|++.+|.|--.
T Consensus 81 ~~LAR~DftQaLai-~P~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya 135 (297)
T COG4785 81 RALARNDFSQALAI-RPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYA 135 (297)
T ss_pred HHHHhhhhhhhhhc-CCCcHHHHHHHHHHHHhcccchHHHHHhhhHhccCCcchHH
Confidence 55577778899999 79999999999999999999999999999999999987544
No 235
>KOG1941|consensus
Probab=90.18 E-value=0.53 Score=40.86 Aligned_cols=80 Identities=13% Similarity=0.107 Sum_probs=61.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------------HHHHHHHHHHHHHhhCCC---------ChHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------------PDDAVVQYTQACRIIGTS---------DLALHQ 76 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------------y~dAa~~ye~Aw~l~~~~---------~~~vgy 76 (122)
-++..|-|+.+.+.|+ ++++.-...++...+ |+.|..+-.+|.++.+-= ...+.|
T Consensus 131 Ahlgls~fq~~Lesfe~A~~~A~~~~D~~LElqvcv~Lgslf~~l~D~~Kal~f~~kA~~lv~s~~l~d~~~kyr~~~ly 210 (518)
T KOG1941|consen 131 AHLGLSVFQKALESFEKALRYAHNNDDAMLELQVCVSLGSLFAQLKDYEKALFFPCKAAELVNSYGLKDWSLKYRAMSLY 210 (518)
T ss_pred HhhhHHHHHHHHHHHHHHHHHhhccCCceeeeehhhhHHHHHHHHHhhhHHhhhhHhHHHHHHhcCcCchhHHHHHHHHH
Confidence 3455677899999999 888776655554422 899999889998885321 135679
Q ss_pred HHHHHHHhcccHHHHHHHHHHHHHh
Q psy16144 77 KLASLLLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 77 kLA~~ylk~kr~~~AI~vc~kVL~~ 101 (122)
.+|..|-+.|+..||-+.|+++.++
T Consensus 211 hmaValR~~G~LgdA~e~C~Ea~kl 235 (518)
T KOG1941|consen 211 HMAVALRLLGRLGDAMECCEEAMKL 235 (518)
T ss_pred HHHHHHHHhcccccHHHHHHHHHHH
Confidence 9999999999999999999988764
No 236
>PRK10941 hypothetical protein; Provisional
Probab=89.53 E-value=2 Score=34.74 Aligned_cols=55 Identities=9% Similarity=-0.002 Sum_probs=37.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
+..|..+-+..+.+ .|++|.--=.-|+.|...|.+..|.+=++.-+++.|+.|.+
T Consensus 197 ~~~AL~~~e~ll~l-~P~dp~e~RDRGll~~qL~c~~~A~~DL~~fl~~~P~dp~a 251 (269)
T PRK10941 197 MELALRASEALLQF-DPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPIS 251 (269)
T ss_pred HHHHHHHHHHHHHh-CCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHHHhCCCchhH
Confidence 55555555555555 46666555555777777777777777777777777777765
No 237
>smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs.
Probab=89.44 E-value=1.2 Score=22.82 Aligned_cols=29 Identities=10% Similarity=0.122 Sum_probs=23.5
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
|....|-++|++.++..|.++.++..=++
T Consensus 1 ~~~~~~r~i~e~~l~~~~~~~~~W~~y~~ 29 (33)
T smart00386 1 GDIERARKIYERALEKFPKSVELWLKYAE 29 (33)
T ss_pred CcHHHHHHHHHHHHHHCCCChHHHHHHHH
Confidence 45678889999999999999999876543
No 238
>COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown]
Probab=89.37 E-value=1.4 Score=35.34 Aligned_cols=50 Identities=16% Similarity=0.060 Sum_probs=23.3
Q ss_pred HHHHHHHHHHHHHhhCCCC--hHHHHHHHHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 53 PDDAVVQYTQACRIIGTSD--LALHQKLASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~--~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
+.+|...+|...+. +|.. |.-..-+|..|--.|++++|..-++.+++-.|
T Consensus 140 ~A~a~~tLe~l~e~-~pa~r~pd~~Ll~aR~laa~g~~a~Aesafe~a~~~yp 191 (251)
T COG4700 140 FAAAQQTLEDLMEY-NPAFRSPDGHLLFARTLAAQGKYADAESAFEVAISYYP 191 (251)
T ss_pred HHHHHHHHHHHhhc-CCccCCCCchHHHHHHHHhcCCchhHHHHHHHHHHhCC
Confidence 44444444444444 3322 23334444455555555555555555555444
No 239
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=88.78 E-value=1.5 Score=35.66 Aligned_cols=51 Identities=18% Similarity=0.152 Sum_probs=45.7
Q ss_pred ChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHhccC
Q psy16144 71 DLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMVHLR 121 (122)
Q Consensus 71 ~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~~lr 121 (122)
.+...|+=|.--+..|++.+|++-++++..+||..|-.++-.|+-+.+.-|
T Consensus 33 p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~s~~~~qa~l~l~yA~Yk 83 (254)
T COG4105 33 PASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYK 83 (254)
T ss_pred CHHHHHHHHHHHHhcCCHHHHHHHHHHHHHcCCCCcccHHHHHHHHHHHHh
Confidence 577899999999999999999999999999999999998888888876543
No 240
>KOG3785|consensus
Probab=88.68 E-value=5.7 Score=34.70 Aligned_cols=24 Identities=13% Similarity=0.032 Sum_probs=15.7
Q ss_pred HHHHHHhcCCCChHHHHHHHH-HHh
Q psy16144 17 EWHFLLYLLPWRWEPLSAGLR-KKQ 40 (122)
Q Consensus 17 ~~~~LLyi~~gk~d~A~elL~-~l~ 40 (122)
-|.-.-|...|+|++|.+..+ ..+
T Consensus 61 lWia~C~fhLgdY~~Al~~Y~~~~~ 85 (557)
T KOG3785|consen 61 LWIAHCYFHLGDYEEALNVYTFLMN 85 (557)
T ss_pred HHHHHHHHhhccHHHHHHHHHHHhc
Confidence 344445666788888888777 543
No 241
>PF10300 DUF3808: Protein of unknown function (DUF3808); InterPro: IPR019412 This entry represents a family of proteins conserved from fungi to humans. In humans this protein is expressed in primary breast carcinomas but not in normal breast tissue, and has a putative eukaryotic RNP-1 RNA binding region and a candidate anchoring transmembrane domain. The human protein is coordinately regulated with oestrogen receptor, but is not necessarily oestradiol-responsive []. Members of this family carry a tetratricopeptide repeat (IPR013105 from INTERPRO) at their C terminus.
Probab=87.96 E-value=2.6 Score=36.30 Aligned_cols=80 Identities=11% Similarity=-0.026 Sum_probs=62.2
Q ss_pred CCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCCh----HHHHHHHHHHHhcccHHH
Q psy16144 25 LPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDL----ALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~----~vgykLA~~ylk~kr~~~ 90 (122)
.....+.|+++|. ..+.-|+++-....- .++|+..|++|++-. ..-+ -..|.||.+++-..++.+
T Consensus 245 ~~~~~~~a~~lL~~~~~~yP~s~lfl~~~gR~~~~~g~~~~Ai~~~~~a~~~q-~~~~Ql~~l~~~El~w~~~~~~~w~~ 323 (468)
T PF10300_consen 245 EDVPLEEAEELLEEMLKRYPNSALFLFFEGRLERLKGNLEEAIESFERAIESQ-SEWKQLHHLCYFELAWCHMFQHDWEE 323 (468)
T ss_pred cCCCHHHHHHHHHHHHHhCCCcHHHHHHHHHHHHHhcCHHHHHHHHHHhccch-hhHHhHHHHHHHHHHHHHHHHchHHH
Confidence 4566889999999 999999855433311 999999999998632 3333 347999999999999999
Q ss_pred HHHHHHHHHHhCCCC
Q psy16144 91 AIDVSQTIMKAHPSY 105 (122)
Q Consensus 91 AI~vc~kVL~~~P~y 105 (122)
|.+.+.+.++.+.-.
T Consensus 324 A~~~f~~L~~~s~WS 338 (468)
T PF10300_consen 324 AAEYFLRLLKESKWS 338 (468)
T ss_pred HHHHHHHHHhccccH
Confidence 999988888776543
No 242
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=87.65 E-value=4 Score=32.14 Aligned_cols=39 Identities=21% Similarity=0.411 Sum_probs=36.5
Q ss_pred HHHHHHHHHHHHHhhCCC---ChHHHHHHHHHHHhcccHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTS---DLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~---~~~vgykLA~~ylk~kr~~~A 91 (122)
...|...|.+++++++++ ||.+...||..|.+.|+++.|
T Consensus 156 ~~Kt~~ll~~~L~l~~~~~~~n~eil~sLas~~~~~~~~e~A 197 (203)
T PF11207_consen 156 PEKTIQLLLRALELSNPDDNFNPEILKSLASIYQKLKNYEQA 197 (203)
T ss_pred HHHHHHHHHHHHHhcCCCCCCCHHHHHHHHHHHHHhcchhhh
Confidence 789999999999999888 899999999999999999886
No 243
>COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones]
Probab=87.56 E-value=2.9 Score=34.87 Aligned_cols=50 Identities=22% Similarity=0.143 Sum_probs=42.6
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
+.+|...+..||.. .|.+..+..-||-+|+..|+.++|.+ +|+.-|...+
T Consensus 150 ~~~a~~~~~~al~~-~~~~~~~~~~la~~~l~~g~~e~A~~----iL~~lP~~~~ 199 (304)
T COG3118 150 FGEAAPLLKQALQA-APENSEAKLLLAECLLAAGDVEAAQA----ILAALPLQAQ 199 (304)
T ss_pred hhhHHHHHHHHHHh-CcccchHHHHHHHHHHHcCChHHHHH----HHHhCcccch
Confidence 88999999999999 58889999999999999999999766 6777775443
No 244
>KOG2796|consensus
Probab=87.42 E-value=2 Score=36.09 Aligned_cols=79 Identities=10% Similarity=0.142 Sum_probs=57.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh--------------hC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN--------------LF----PDDAVVQYTQACRIIGTSDLALHQKLASLL 82 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--------------~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~y 82 (122)
+..|.|+-+-|..+++ +=+ +..+--+ ++ |.+|...|.+..+- ++.++.+-.+-|-++
T Consensus 221 ~~MQ~GD~k~a~~yf~~vek---~~~kL~~~q~~~~V~~n~a~i~lg~nn~a~a~r~~~~i~~~-D~~~~~a~NnKALcl 296 (366)
T KOG2796|consen 221 ISMQIGDIKTAEKYFQDVEK---VTQKLDGLQGKIMVLMNSAFLHLGQNNFAEAHRFFTEILRM-DPRNAVANNNKALCL 296 (366)
T ss_pred HHHhcccHHHHHHHHHHHHH---HHhhhhccchhHHHHhhhhhheecccchHHHHHHHhhcccc-CCCchhhhchHHHHH
Confidence 6677777777777777 532 2222111 22 88888888888877 688888888888888
Q ss_pred HhcccHHHHHHHHHHHHHhCCC
Q psy16144 83 LRMKRYAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 83 lk~kr~~~AI~vc~kVL~~~P~ 104 (122)
|-.|+..||+.+-..++.+.|.
T Consensus 297 lYlg~l~DAiK~~e~~~~~~P~ 318 (366)
T KOG2796|consen 297 LYLGKLKDALKQLEAMVQQDPR 318 (366)
T ss_pred HHHHHHHHHHHHHHHHhccCCc
Confidence 8888888888888888888884
No 245
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=87.09 E-value=2.6 Score=31.89 Aligned_cols=41 Identities=17% Similarity=0.118 Sum_probs=24.1
Q ss_pred CCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 68 GTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 68 ~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
.|+.+.+..=-|..++..|++.||+.+.+.+-..-|..|..
T Consensus 40 rP~~~e~d~~dg~l~i~rg~w~eA~rvlr~l~~~~~~~p~~ 80 (153)
T TIGR02561 40 RPNLKELDMFDGWLLIARGNYDEAARILRELLSSAGAPPYG 80 (153)
T ss_pred CCCccccchhHHHHHHHcCCHHHHHHHHHhhhccCCCchHH
Confidence 45556666656666666666666666666666555555533
No 246
>KOG2053|consensus
Probab=87.07 E-value=3.8 Score=38.63 Aligned_cols=83 Identities=17% Similarity=0.078 Sum_probs=61.1
Q ss_pred HhcCCCChHHHHHHHHHHhh-CcchHHHHhh-------C--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLRKKQY-ISSATNIYNL-------F--PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~~l~~-nps~~kA~e~-------l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
..++.||+|+|.++|+++.. -+.+-.-... + +++|...||+|..- .|+ ..-.+-|.-+|.+-+.|.+=
T Consensus 52 sl~r~gk~~ea~~~Le~~~~~~~~D~~tLq~l~~~y~d~~~~d~~~~~Ye~~~~~-~P~-eell~~lFmayvR~~~yk~q 129 (932)
T KOG2053|consen 52 SLFRLGKGDEALKLLEALYGLKGTDDLTLQFLQNVYRDLGKLDEAVHLYERANQK-YPS-EELLYHLFMAYVREKSYKKQ 129 (932)
T ss_pred HHHHhcCchhHHHHHhhhccCCCCchHHHHHHHHHHHHHhhhhHHHHHHHHHHhh-CCc-HHHHHHHHHHHHHHHHHHHH
Confidence 78899999999999983332 2222332221 1 99999999999999 798 99999999999999999875
Q ss_pred HHHHHHHHHhCCCCH
Q psy16144 92 IDVSQTIMKAHPSYP 106 (122)
Q Consensus 92 I~vc~kVL~~~P~yp 106 (122)
-.+--+.-+..|..|
T Consensus 130 Qkaa~~LyK~~pk~~ 144 (932)
T KOG2053|consen 130 QKAALQLYKNFPKRA 144 (932)
T ss_pred HHHHHHHHHhCCccc
Confidence 544444455556554
No 247
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=86.59 E-value=3.7 Score=31.40 Aligned_cols=56 Identities=18% Similarity=0.141 Sum_probs=38.2
Q ss_pred HHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHH
Q psy16144 56 AVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQI 112 (122)
Q Consensus 56 Aa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~i 112 (122)
|..+|.+|+.+ .|++-....-||-.+.-.|+..+|+=-|-+.|...--+|.++..+
T Consensus 1 A~~~Y~~A~~l-~P~~G~p~nQLAvl~~~~~~~l~avy~y~Rsl~~~~Pf~~A~~NL 56 (278)
T PF10373_consen 1 AERYYRKAIRL-LPSNGNPYNQLAVLASYQGDDLDAVYYYIRSLAVRIPFPSARENL 56 (278)
T ss_dssp HHHHHHHHHHH--TTBSHHHHHHHHHHHHTT-HHHHHHHHHHHHSSSB--HHHHHHH
T ss_pred CHHHHHHHHHh-CCCCCCcccchhhhhccccchHHHHHHHHHHHhcCCCcHHHHHHH
Confidence 66788888888 588888888888888888888888766666665444457665543
No 248
>COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones]
Probab=86.47 E-value=5 Score=36.36 Aligned_cols=92 Identities=14% Similarity=0.000 Sum_probs=70.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC----------HHHHHHHHHHHHHhhCCCChHHHHHH------HHHHHh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF----------PDDAVVQYTQACRIIGTSDLALHQKL------ASLLLR 84 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l----------y~dAa~~ye~Aw~l~~~~~~~vgykL------A~~ylk 84 (122)
++...+..+.|.-.++ ++..||..+.++--+ ..-++..-+.|... .|+|..+-.-+ +...-.
T Consensus 76 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~L~~ale~~~~~~~~~~~~~~~a~~~-~~~~~~~~~~~~~~~~~~~~~~~ 154 (620)
T COG3914 76 LLAPLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWL-SPDNAEFLGHLIRFYQLGRYLKL 154 (620)
T ss_pred hccccccchhHHHHHhhhHhcCcccchHHHHHHHHHHHhhhHHHHHHHHHHHHHhc-CcchHHHHhhHHHHHHHHHHHHH
Confidence 5667788888888888 999999988887743 44444444557777 48887765555 777778
Q ss_pred cccHHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 85 MKRYAEAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
.|+.+++..--+++.+..|.||++-..-+-
T Consensus 155 l~~~~~~~~~l~~~~d~~p~~~~~~~~~~~ 184 (620)
T COG3914 155 LGRTAEAELALERAVDLLPKYPRVLGALMT 184 (620)
T ss_pred hccHHHHHHHHHHHHHhhhhhhhhHhHHHH
Confidence 888899999999999999999988655443
No 249
>KOG3785|consensus
Probab=86.46 E-value=3.6 Score=35.91 Aligned_cols=57 Identities=25% Similarity=0.229 Sum_probs=45.6
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEK 115 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~k 115 (122)
|+|-..+...--.+ --+|....|++|..++-.|.|++|++.+-+|- .|+||.+|+.+
T Consensus 375 FddVl~YlnSi~sY-F~NdD~Fn~N~AQAk~atgny~eaEelf~~is-----~~~ikn~~~Y~ 431 (557)
T KOG3785|consen 375 FDDVLTYLNSIESY-FTNDDDFNLNLAQAKLATGNYVEAEELFIRIS-----GPEIKNKILYK 431 (557)
T ss_pred HHHHHHHHHHHHHH-hcCcchhhhHHHHHHHHhcChHHHHHHHhhhc-----ChhhhhhHHHH
Confidence 67766655555555 47888899999999999999999999877664 67888888766
No 250
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=86.26 E-value=5.3 Score=33.43 Aligned_cols=92 Identities=11% Similarity=-0.069 Sum_probs=63.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc-hHHHHhhC----------HHHHHHHHHHHHHhhCC----CChHHHHHHHHHHHhc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS-ATNIYNLF----------PDDAVVQYTQACRIIGT----SDLALHQKLASLLLRM 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps-~~kA~e~l----------y~dAa~~ye~Aw~l~~~----~~~~vgykLA~~ylk~ 85 (122)
...+.|-+..|.|.+| .+..+|+ ++-+-.++ |+-=+..++........ .-|...|..|-++...
T Consensus 112 ~L~~RG~~rTAlE~~KlLlsLdp~~DP~g~ll~ID~~ALrs~~y~~Li~~~~~~~~~~~~~~~~~lPn~a~S~aLA~~~l 191 (360)
T PF04910_consen 112 SLGRRGCWRTALEWCKLLLSLDPDEDPLGVLLFIDYYALRSRQYQWLIDFSESPLAKCYRNWLSLLPNFAFSIALAYFRL 191 (360)
T ss_pred HHHhcCcHHHHHHHHHHHHhcCCCCCcchhHHHHHHHHHhcCCHHHHHHHHHhHhhhhhhhhhhhCccHHHHHHHHHHHh
Confidence 7778899999999999 9999999 66555433 33333333322220001 1467888888888888
Q ss_pred ccH---------------HHHHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy16144 86 KRY---------------AEAIDVSQTIMKAHPSYPKVHKQIFEKA 116 (122)
Q Consensus 86 kr~---------------~~AI~vc~kVL~~~P~ypkir~~il~ka 116 (122)
++- .+|-....+++..+| .+-..++++.
T Consensus 192 ~~~~~~~~~~~~~~~~~~~~A~~~L~~Ai~~fP---~vl~~Ll~~l 234 (360)
T PF04910_consen 192 EKEESSQSSAQSGRSENSESADEALQKAILRFP---WVLVPLLDKL 234 (360)
T ss_pred cCccccccccccccccchhHHHHHHHHHHHHhH---HHHHHHHHHh
Confidence 887 888888899998876 4444566654
No 251
>KOG0376|consensus
Probab=86.25 E-value=0.7 Score=40.57 Aligned_cols=55 Identities=20% Similarity=0.257 Sum_probs=49.8
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
|+.|+..|-+|+++ +|+.....-+=|++++|.+.|-.|+.=+.++++.+|.|.|.
T Consensus 20 fd~avdlysKaI~l-dpnca~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~~K~ 74 (476)
T KOG0376|consen 20 FDVAVDLYSKAIEL-DPNCAIYFANRALAHLKVESFGGALHDALKAIELDPTYIKA 74 (476)
T ss_pred HHHHHHHHHHHHhc-CCcceeeechhhhhheeechhhhHHHHHHhhhhcCchhhhe
Confidence 99999999999999 78776666666899999999999999999999999998775
No 252
>KOG3807|consensus
Probab=86.19 E-value=9.7 Score=33.13 Aligned_cols=89 Identities=19% Similarity=0.247 Sum_probs=63.8
Q ss_pred CChHHHHHHHH-HHhhCcchHHHHhhC-------HHHHHHHHHHHHHhhC--------------------CCChH----H
Q psy16144 27 WRWEPLSAGLR-KKQYISSATNIYNLF-------PDDAVVQYTQACRIIG--------------------TSDLA----L 74 (122)
Q Consensus 27 gk~d~A~elL~-~l~~nps~~kA~e~l-------y~dAa~~ye~Aw~l~~--------------------~~~~~----v 74 (122)
....+-.+.-+ +|+.||.|.-||-++ ..+|.+.+.+|++... .+|.. +
T Consensus 198 Rnp~~RI~~A~~ALeIN~eCA~AyvLLAEEEa~Ti~~AE~l~k~ALka~e~~yr~sqq~qh~~~~~da~~rRDtnvl~YI 277 (556)
T KOG3807|consen 198 RNPPARIKAAYQALEINNECATAYVLLAEEEATTIVDAERLFKQALKAGETIYRQSQQCQHQSPQHEAQLRRDTNVLVYI 277 (556)
T ss_pred cCcHHHHHHHHHHHhcCchhhhHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHhhHHHHhhhccchhhhhhcccchhhHH
Confidence 33444445556 999999999999766 7778888888876531 11111 2
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHhCC--CCHHHHHHHHHH
Q psy16144 75 HQKLASLLLRMKRYAEAIDVSQTIMKAHP--SYPKVHKQIFEK 115 (122)
Q Consensus 75 gykLA~~ylk~kr~~~AI~vc~kVL~~~P--~ypkir~~il~k 115 (122)
--+||-+--|.||..||+.+.|...+..| .--.|+...+|-
T Consensus 278 KRRLAMCARklGrlrEA~K~~RDL~ke~pl~t~lniheNLiEa 320 (556)
T KOG3807|consen 278 KRRLAMCARKLGRLREAVKIMRDLMKEFPLLTMLNIHENLLEA 320 (556)
T ss_pred HHHHHHHHHHhhhHHHHHHHHHHHhhhccHHHHHHHHHHHHHH
Confidence 45799999999999999999999999888 224455555553
No 253
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=86.13 E-value=1.1 Score=38.22 Aligned_cols=77 Identities=8% Similarity=0.083 Sum_probs=61.4
Q ss_pred HHHHHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHH-HHHHHHhcccHHHHHHHHHHHHHhCCC
Q psy16144 35 GLRKKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQK-LASLLLRMKRYAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 35 lL~~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgyk-LA~~ylk~kr~~~AI~vc~kVL~~~P~ 104 (122)
++|....-|++++.|... |.+--..|-.++.. +|.|...+.- -+|-|.-.+++.-|-+++.+.|+.||+
T Consensus 96 ~~R~tnkff~D~k~w~~y~~Y~~k~k~y~~~~nI~~~~l~k-hP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~ 174 (435)
T COG5191 96 LYRSTNKFFNDPKIWSQYAAYVIKKKMYGEMKNIFAECLTK-HPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSR 174 (435)
T ss_pred eehhhhcCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-CCCCceeeeeeccchhhhhccHHHHHHHHHhhhccCCC
Confidence 344455577788888722 66666777888888 8999888765 678888899999999999999999999
Q ss_pred CHHHHHHH
Q psy16144 105 YPKVHKQI 112 (122)
Q Consensus 105 ypkir~~i 112 (122)
.|+||-+-
T Consensus 175 ~p~iw~ey 182 (435)
T COG5191 175 SPRIWIEY 182 (435)
T ss_pred CchHHHHH
Confidence 99998764
No 254
>KOG1308|consensus
Probab=85.97 E-value=0.16 Score=43.21 Aligned_cols=93 Identities=16% Similarity=0.123 Sum_probs=69.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHHHH-HHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQKL-ASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgykL-A~~ylk~kr~~~ 90 (122)
=..+.|.++.|++++. ++..||.....|- ++ ...|+..|..|+++ +| |.+.||++ ++.--..|..++
T Consensus 123 eAln~G~~~~ai~~~t~ai~lnp~~a~l~~kr~sv~lkl~kp~~airD~d~A~ei-n~-Dsa~~ykfrg~A~rllg~~e~ 200 (377)
T KOG1308|consen 123 EALNDGEFDTAIELFTSAIELNPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEI-NP-DSAKGYKFRGYAERLLGNWEE 200 (377)
T ss_pred HHhcCcchhhhhcccccccccCCchhhhcccccceeeeccCCchhhhhhhhhhcc-Cc-ccccccchhhHHHHHhhchHH
Confidence 4567899999999999 9999998766554 11 88999999999999 56 46777775 566677788888
Q ss_pred HHHHHHHHHHhC--CCCHHHHHHHHHHH
Q psy16144 91 AIDVSQTIMKAH--PSYPKVHKQIFEKA 116 (122)
Q Consensus 91 AI~vc~kVL~~~--P~ypkir~~il~ka 116 (122)
|-.-.+.+++.+ |..-...++|-..|
T Consensus 201 aa~dl~~a~kld~dE~~~a~lKeV~p~a 228 (377)
T KOG1308|consen 201 AAHDLALACKLDYDEANSATLKEVFPNA 228 (377)
T ss_pred HHHHHHHHHhccccHHHHHHHHHhccch
Confidence 888888887664 34445555555444
No 255
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=85.38 E-value=2 Score=37.96 Aligned_cols=35 Identities=29% Similarity=0.449 Sum_probs=31.9
Q ss_pred HHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 74 LHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 74 vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
+--+|.-+|+++++..-|+..-|+-.-.+|.|++=
T Consensus 230 Ietklv~CYL~~rkpdlALnh~hrsI~lnP~~frn 264 (569)
T PF15015_consen 230 IETKLVTCYLRMRKPDLALNHSHRSINLNPSYFRN 264 (569)
T ss_pred HHHHHHHhhhhcCCCchHHHHHhhhhhcCcchhhH
Confidence 45689999999999999999999999999999764
No 256
>PF12862 Apc5: Anaphase-promoting complex subunit 5
Probab=85.23 E-value=2.6 Score=28.15 Aligned_cols=62 Identities=18% Similarity=0.134 Sum_probs=44.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHH-------HHhhC-----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATN-------IYNLF-----------PDDAVVQYTQACRIIGTSDLALHQKLASLL 82 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~k-------A~e~l-----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~y 82 (122)
-.++.|+|.+|.+.|. ...+-+.... .+-.+ +++|...++.|.++..+.+...+...+-.+
T Consensus 7 ~~~~~~dy~~A~d~L~~~fD~~~~~~~~~~~~~~~~all~lA~~~~~~G~~~~A~~~l~eAi~~Are~~D~~~l~~al~~ 86 (94)
T PF12862_consen 7 NALRSGDYSEALDALHRYFDYAKQSNNSSSNSGLAYALLNLAELHRRFGHYEEALQALEEAIRLARENGDRRCLAYALSW 86 (94)
T ss_pred HHHHcCCHHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH
Confidence 6678999999988887 6655444221 11111 999999999999998888877777666554
Q ss_pred H
Q psy16144 83 L 83 (122)
Q Consensus 83 l 83 (122)
+
T Consensus 87 ~ 87 (94)
T PF12862_consen 87 L 87 (94)
T ss_pred H
Confidence 3
No 257
>KOG1308|consensus
Probab=85.21 E-value=0.78 Score=39.09 Aligned_cols=61 Identities=23% Similarity=0.263 Sum_probs=50.1
Q ss_pred HHHHhhC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 46 TNIYNLF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 46 ~kA~e~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
.+|-|.+ ++.|+..|-.|.++ +|.+..+.-+=|.+++|.++...||.=|..++.++||..+
T Consensus 119 ~~A~eAln~G~~~~ai~~~t~ai~l-np~~a~l~~kr~sv~lkl~kp~~airD~d~A~ein~Dsa~ 183 (377)
T KOG1308|consen 119 VQASEALNDGEFDTAIELFTSAIEL-NPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEINPDSAK 183 (377)
T ss_pred HHHHHHhcCcchhhhhccccccccc-CCchhhhcccccceeeeccCCchhhhhhhhhhccCccccc
Confidence 4455544 88888999999998 7888888888889999999999999889999988887654
No 258
>KOG3824|consensus
Probab=84.75 E-value=2.4 Score=36.33 Aligned_cols=66 Identities=11% Similarity=-0.023 Sum_probs=53.3
Q ss_pred HHHHHHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHH
Q psy16144 10 QIIQTIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQK 77 (122)
Q Consensus 10 ~~~~~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgyk 77 (122)
.+...|.-+.+- ++. .||-|.|..+|. +++..|.++.+.-.+ ..+|-.+|-+|+.++ |.|..+.-+
T Consensus 112 ~~kEA~~Al~~A~~~~~-~Gk~ekA~~lfeHAlalaP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtis-P~nseALvn 189 (472)
T KOG3824|consen 112 KVKEAILALKAAGRSRK-DGKLEKAMTLFEHALALAPTNPQILIEMGQFREMHNEIVEADQCYVKALTIS-PGNSEALVN 189 (472)
T ss_pred hhHHHHHHHHHHHHHHh-ccchHHHHHHHHHHHhcCCCCHHHHHHHhHHHHhhhhhHhhhhhhheeeeeC-CCchHHHhh
Confidence 345556555555 554 599999999999 999999999998765 778999999999995 988887654
No 259
>KOG1070|consensus
Probab=84.46 E-value=11 Score=37.59 Aligned_cols=95 Identities=16% Similarity=0.090 Sum_probs=69.8
Q ss_pred HHHHHHH-HhcCCCChHHHHHHHH-HHh-hCcch-------HHHHh-hC--HHH---HHHHHHHHHHhhCCCChHHHHHH
Q psy16144 15 IAEWHFL-LYLLPWRWEPLSAGLR-KKQ-YISSA-------TNIYN-LF--PDD---AVVQYTQACRIIGTSDLALHQKL 78 (122)
Q Consensus 15 ~~~~~~L-Lyi~~gk~d~A~elL~-~l~-~nps~-------~kA~e-~l--y~d---Aa~~ye~Aw~l~~~~~~~vgykL 78 (122)
|-++..+ -.++.+..+.|.+..+ +|. .|+.- +-||. +- |.. =-+.||+|+.+++| -.|..+|
T Consensus 1459 i~WI~YMaf~LelsEiekAR~iaerAL~tIN~REeeEKLNiWiA~lNlEn~yG~eesl~kVFeRAcqycd~--~~V~~~L 1536 (1710)
T KOG1070|consen 1459 ILWIRYMAFHLELSEIEKARKIAERALKTINFREEEEKLNIWIAYLNLENAYGTEESLKKVFERACQYCDA--YTVHLKL 1536 (1710)
T ss_pred hHHHHHHHHHhhhhhhHHHHHHHHHHhhhCCcchhHHHHHHHHHHHhHHHhhCcHHHHHHHHHHHHHhcch--HHHHHHH
Confidence 3344445 6788899999999998 664 44431 11222 11 552 23579999999744 8999999
Q ss_pred HHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 79 ASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
.-.|.|..++.+|.++++.-++..-+.+++|..
T Consensus 1537 ~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~ 1569 (1710)
T KOG1070|consen 1537 LGIYEKSEKNDEADELLRLMLKKFGQTRKVWIM 1569 (1710)
T ss_pred HHHHHHhhcchhHHHHHHHHHHHhcchhhHHHH
Confidence 999999999999999999999988877777743
No 260
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=83.76 E-value=6.4 Score=34.11 Aligned_cols=97 Identities=13% Similarity=0.122 Sum_probs=59.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcch--------HHHHh-------hC----------------HHHHHHHHHHHHHhhCC
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSA--------TNIYN-------LF----------------PDDAVVQYTQACRIIGT 69 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~--------~kA~e-------~l----------------y~dAa~~ye~Aw~l~~~ 69 (122)
=+.+.|||.+|.+.|+ .|..-|=. .++.+ |+ .++.-++.|.|-=++..
T Consensus 213 k~~t~gKF~eA~~~Fr~iL~~i~l~vv~~~~E~~e~~eli~icrEYilgl~iEl~Rr~l~~~~~~~~kR~lELAAYFThc 292 (422)
T PF06957_consen 213 KLFTAGKFEEAIEIFRSILHSIPLLVVESREEEDEAKELIEICREYILGLSIELERRELPKDPVEDQKRNLELAAYFTHC 292 (422)
T ss_dssp HHHHTT-HHHHHHHHHHHHHHHHC--BSSCHHHHHHHHHHHHHHHHHHHHHHHHHHCTS-TTTHHHHHHHHHHHHHHCCS
T ss_pred HHHhcCCHHHHHHHHHHHHHHhheeeecCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccchhhHHHHHHHHHHHhcC
Confidence 4578899999999999 88764421 12222 22 34555677777666655
Q ss_pred CChHHHHHHHH-----HHHhcccHHHHHHHHHHHHHhCCCCHHH--HHHHHHHHHh
Q psy16144 70 SDLALHQKLAS-----LLLRMKRYAEAIDVSQTIMKAHPSYPKV--HKQIFEKAMV 118 (122)
Q Consensus 70 ~~~~vgykLA~-----~ylk~kr~~~AI~vc~kVL~~~P~ypki--r~~il~ka~~ 118 (122)
+--.++-.|+- ...|.|+|.-|-..+++.|+++|.-+.+ -+.|+.+|.+
T Consensus 293 ~LQp~H~~LaLr~AM~~~~K~KNf~tAa~FArRLLel~p~~~~a~qArKil~~~e~ 348 (422)
T PF06957_consen 293 KLQPSHLILALRSAMSQAFKLKNFITAASFARRLLELNPSPEVAEQARKILQACER 348 (422)
T ss_dssp ---HHHHHHHHHHHHHHCCCTTBHHHHHHHHHHHHCT--SCHHHHHHHHHHHHHCC
T ss_pred CCcHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHcCCCHHHHHHHHHHHHHHhc
Confidence 54445544443 2359999999999999999999966532 2246666544
No 261
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=83.67 E-value=8.6 Score=25.63 Aligned_cols=17 Identities=41% Similarity=0.604 Sum_probs=8.4
Q ss_pred HHHHHHHHHHhCCCCHH
Q psy16144 91 AIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypk 107 (122)
||++..++++..||.|+
T Consensus 32 aIe~L~q~~~~~pD~~~ 48 (75)
T cd02682 32 AIEVLSQIVKNYPDSPT 48 (75)
T ss_pred HHHHHHHHHHhCCChHH
Confidence 33444455555555554
No 262
>PF04184 ST7: ST7 protein; InterPro: IPR007311 The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain.
Probab=83.60 E-value=5.6 Score=35.49 Aligned_cols=82 Identities=15% Similarity=0.093 Sum_probs=54.8
Q ss_pred CCCChHHHHHHHH-HHhhCcc--hHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccH----
Q psy16144 25 LPWRWEPLSAGLR-KKQYISS--ATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRY---- 88 (122)
Q Consensus 25 ~~gk~d~A~elL~-~l~~nps--~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~---- 88 (122)
+.|+.++|++.++ .++.+|. ....+|-+ |.|+.....+==+.+-|++.++.|.= +++|.+.-
T Consensus 271 klGr~~EAIk~~rdLlke~p~~~~l~IrenLie~LLelq~Yad~q~lL~kYdDi~lpkSAti~YTa--ALLkaRav~d~f 348 (539)
T PF04184_consen 271 KLGRLREAIKMFRDLLKEFPNLDNLNIRENLIEALLELQAYADVQALLAKYDDISLPKSATICYTA--ALLKARAVGDKF 348 (539)
T ss_pred HhCChHHHHHHHHHHHhhCCccchhhHHHHHHHHHHhcCCHHHHHHHHHHhccccCCchHHHHHHH--HHHHHHhhcccc
Confidence 6799999999999 9988775 33344411 77777766664445456777777764 34443322
Q ss_pred --------------HHHHHHHHHHHHhCCCCHHH
Q psy16144 89 --------------AEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 89 --------------~~AI~vc~kVL~~~P~ypki 108 (122)
..|.+..|++.+.||--|+-
T Consensus 349 s~e~a~rRGls~ae~~aveAi~RAvefNPHVp~Y 382 (539)
T PF04184_consen 349 SPEAASRRGLSPAEMNAVEAIHRAVEFNPHVPKY 382 (539)
T ss_pred CchhhhhcCCChhHHHHHHHHHHHHHhCCCCchh
Confidence 23567788999999976754
No 263
>KOG2422|consensus
Probab=83.49 E-value=4 Score=37.09 Aligned_cols=82 Identities=12% Similarity=0.119 Sum_probs=55.0
Q ss_pred HHHHH-HhcCCCChHHHHHHHH-HHhhCcc-hHHHHhhC----------------HHHHHHHHHHHHHhhCCCChHHHHH
Q psy16144 17 EWHFL-LYLLPWRWEPLSAGLR-KKQYISS-ATNIYNLF----------------PDDAVVQYTQACRIIGTSDLALHQK 77 (122)
Q Consensus 17 ~~~~L-Lyi~~gk~d~A~elL~-~l~~nps-~~kA~e~l----------------y~dAa~~ye~Aw~l~~~~~~~vgyk 77 (122)
-|+.+ -..+.|=+.-|.+.++ .++.+|+ ++-+..++ +-++.+...+.|.+ |..||.
T Consensus 345 l~r~m~~l~~RGC~rTA~E~cKlllsLdp~eDPl~~l~~ID~~ALrareYqwiI~~~~~~e~~n~l~~~-----PN~~yS 419 (665)
T KOG2422|consen 345 LFRYMQSLAQRGCWRTALEWCKLLLSLDPSEDPLGILYLIDIYALRAREYQWIIELSNEPENMNKLSQL-----PNFGYS 419 (665)
T ss_pred HHHHHHHHHhcCChHHHHHHHHHHhhcCCcCCchhHHHHHHHHHHHHHhHHHHHHHHHHHHhhccHhhc-----CCchHH
Confidence 34444 4556799999999999 9999999 55544433 33444444555555 556666
Q ss_pred HH--HHHHhccc---HHHHHHHHHHHHHhCC
Q psy16144 78 LA--SLLLRMKR---YAEAIDVSQTIMKAHP 103 (122)
Q Consensus 78 LA--~~ylk~kr---~~~AI~vc~kVL~~~P 103 (122)
+| ..|+.... -.-|.....++++.||
T Consensus 420 ~AlA~f~l~~~~~~~rqsa~~~l~qAl~~~P 450 (665)
T KOG2422|consen 420 LALARFFLRKNEEDDRQSALNALLQALKHHP 450 (665)
T ss_pred HHHHHHHHhcCChhhHHHHHHHHHHHHHhCc
Confidence 55 55555554 4568888889999988
No 264
>KOG4555|consensus
Probab=83.48 E-value=4.6 Score=30.74 Aligned_cols=55 Identities=16% Similarity=0.079 Sum_probs=50.5
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
.+.|..-|.+|+.+. |.++++..+=|..|-..|+-++|++=..++|...-+..|.
T Consensus 59 Ld~AlE~F~qal~l~-P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trt 113 (175)
T KOG4555|consen 59 LDGALELFGQALCLA-PERASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRT 113 (175)
T ss_pred hHHHHHHHHHHHHhc-ccchHhhccHHHHHHHcCChHHHHHHHHHHHHhcCccchH
Confidence 899999999999995 9999999999999999999999999999999987766543
No 265
>KOG1070|consensus
Probab=82.67 E-value=12 Score=37.41 Aligned_cols=92 Identities=11% Similarity=0.179 Sum_probs=73.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCC--ChHHHHHHHHHHHhcccHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTS--DLALHQKLASLLLRMKRYA 89 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~--~~~vgykLA~~ylk~kr~~ 89 (122)
+|...+++++|.++|+ .++--....+.|-.. -++|-...++|++.- |. ...+-=+-|..=-|.|..+
T Consensus 1539 iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~l-Pk~eHv~~IskfAqLEFk~GDae 1617 (1710)
T KOG1070|consen 1539 IYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSL-PKQEHVEFISKFAQLEFKYGDAE 1617 (1710)
T ss_pred HHHHhhcchhHHHHHHHHHHHhcchhhHHHHHHHHHhcccHHHHHHHHHHHHHhhc-chhhhHHHHHHHHHHHhhcCCch
Confidence 9999999999999999 888888777777632 456667788999884 77 5566667777888888888
Q ss_pred HHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 90 EAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 90 ~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
.+-.+++..|..+|---++|.--+|
T Consensus 1618 RGRtlfEgll~ayPKRtDlW~VYid 1642 (1710)
T KOG1070|consen 1618 RGRTLFEGLLSAYPKRTDLWSVYID 1642 (1710)
T ss_pred hhHHHHHHHHhhCccchhHHHHHHH
Confidence 8888999999999987777754444
No 266
>PRK13184 pknD serine/threonine-protein kinase; Reviewed
Probab=82.45 E-value=9.3 Score=36.25 Aligned_cols=96 Identities=11% Similarity=0.040 Sum_probs=67.9
Q ss_pred cCCCChHHHHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHHhhC-----------------CCChHHHHHHHHHHHhc
Q psy16144 24 LLPWRWEPLSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACRIIG-----------------TSDLALHQKLASLLLRM 85 (122)
Q Consensus 24 i~~gk~d~A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~~-----------------~~~~~vgykLA~~ylk~ 85 (122)
+..+.||.|....| ...--|+-...+|.+|..-++..|+|-+.++ +.-|==..--|-+|.+.
T Consensus 486 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 565 (932)
T PRK13184 486 LAEKLYDQALIFYRRIRESFPGRKEGYEAQFRLGITLLEKASEQGDPRDFTQALSEFSYLHGGVGAPLEYLGKALVYQRL 565 (932)
T ss_pred HhhHHHHHHHHHHHHHhhcCCCcccchHHHHHhhHHHHHHHHhcCChHHHHHHHHHHHHhcCCCCCchHHHhHHHHHHHh
Confidence 34467899999999 8887888888888665545555555544433 22233334458899999
Q ss_pred ccHHHHHHHHHHHHHhCCCCH---HHHHHHHHHHHhc
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYP---KVHKQIFEKAMVH 119 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~yp---kir~~il~ka~~~ 119 (122)
|.|+|=|.++.-+|+.+|++| +++..|...-...
T Consensus 566 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 602 (932)
T PRK13184 566 GEYNEEIKSLLLALKRYSQHPEISRLRDHLVYRLHES 602 (932)
T ss_pred hhHHHHHHHHHHHHHhcCCCCccHHHHHHHHHHHHHH
Confidence 999999999999999988886 4566666554433
No 267
>PRK10941 hypothetical protein; Provisional
Probab=82.17 E-value=8.5 Score=31.16 Aligned_cols=68 Identities=10% Similarity=0.015 Sum_probs=51.7
Q ss_pred hhHHHHHHHHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChH
Q psy16144 6 VDTMQIIQTIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLA 73 (122)
Q Consensus 6 ~~~~~~~~~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~ 73 (122)
++..+|++.+ ++.| .|++.++++.|..... ++..+|+++.-+. |. +..|..=++.-++.+ |++|.
T Consensus 174 a~~~~il~Rm--l~nLK~~~~~~~~~~~AL~~~e~ll~l~P~dp~e~RDRGll~~qL~c~~~A~~DL~~fl~~~-P~dp~ 250 (269)
T PRK10941 174 ADNIEVIRKL--LDTLKAALMEEKQMELALRASEALLQFDPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQC-PEDPI 250 (269)
T ss_pred CCHHHHHHHH--HHHHHHHHHHcCcHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHHcCCcHHHHHHHHHHHHhC-CCchh
Confidence 3455666443 3344 8999999999999999 9999999876554 11 888989899999995 88887
Q ss_pred HHH
Q psy16144 74 LHQ 76 (122)
Q Consensus 74 vgy 76 (122)
+..
T Consensus 251 a~~ 253 (269)
T PRK10941 251 SEM 253 (269)
T ss_pred HHH
Confidence 654
No 268
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=81.83 E-value=9.3 Score=25.49 Aligned_cols=63 Identities=11% Similarity=-0.012 Sum_probs=42.5
Q ss_pred HHHHHHHH-HHhhCcchHHHHhhC--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHH
Q psy16144 30 EPLSAGLR-KKQYISSATNIYNLF--PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 30 d~A~elL~-~l~~nps~~kA~e~l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kV 98 (122)
++|.+++. +++.+.+ .+.-|-+ |++|+..+.++.... |+++.- -+--.|.+.|-+=++..++.
T Consensus 4 ~~A~~~a~~AVe~D~~-gr~~eAi~~Y~~aIe~L~q~~~~~-pD~~~k----~~yr~ki~eY~~Rae~Lk~~ 69 (75)
T cd02682 4 EMARKYAINAVKAEKE-GNAEDAITNYKKAIEVLSQIVKNY-PDSPTR----LIYEQMINEYKRRIEVLEKQ 69 (75)
T ss_pred HHHHHHHHHHHHHHhc-CCHHHHHHHHHHHHHHHHHHHHhC-CChHHH----HHHHHHHHHHHHHHHHHHHH
Confidence 57888888 8888886 6666555 889999999999984 777762 22334555555545544443
No 269
>COG4649 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=81.27 E-value=25 Score=27.88 Aligned_cols=91 Identities=14% Similarity=0.016 Sum_probs=63.7
Q ss_pred HHHHHH-HhcCCCChHHHHHHHH-HHhhCcchHH---------HHhhC----HHHHHHHHHHHHHhhCCCChHHHHHHHH
Q psy16144 16 AEWHFL-LYLLPWRWEPLSAGLR-KKQYISSATN---------IYNLF----PDDAVVQYTQACRIIGTSDLALHQKLAS 80 (122)
Q Consensus 16 ~~~~~L-Lyi~~gk~d~A~elL~-~l~~nps~~k---------A~e~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~ 80 (122)
+..++- |..+.|+...|+..|. +-+-+|+.-. ||.++ |+|=+...|..-.-++|=-.++-=-||+
T Consensus 96 A~mr~at~~a~kgdta~AV~aFdeia~dt~~P~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d~n~mR~sArEALgl 175 (221)
T COG4649 96 ARMRAATLLAQKGDTAAAVAAFDEIAADTSIPQIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGDGNPMRHSAREALGL 175 (221)
T ss_pred HHHHHHHHHhhcccHHHHHHHHHHHhccCCCcchhhHHHHHHHHHHHhccccHHHHHHHhhhccCCCChhHHHHHHHHhH
Confidence 334444 8889999999999999 7776666222 22233 8887776665433333333455567999
Q ss_pred HHHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 81 LLLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 81 ~ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
.-.|.|+|.+|...+.++.. +.+-|+
T Consensus 176 Aa~kagd~a~A~~~F~qia~-Da~apr 201 (221)
T COG4649 176 AAYKAGDFAKAKSWFVQIAN-DAQAPR 201 (221)
T ss_pred HHHhccchHHHHHHHHHHHc-cccCcH
Confidence 99999999999999988875 554554
No 270
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=81.15 E-value=11 Score=33.17 Aligned_cols=87 Identities=15% Similarity=0.162 Sum_probs=67.9
Q ss_pred HhcCCCChHHHHHHHH-HHh---hCcch----------HHHHhhC---HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQ---YISSA----------TNIYNLF---PDDAVVQYTQACRIIGTSDLALHQKLASLLLR 84 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~---~nps~----------~kA~e~l---y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk 84 (122)
--++.|++|.|.++++ ..+ +.|+- .+|-+.+ ...|-..-.+|.++ .|++.-...-=|..|.+
T Consensus 197 ~r~~~gdWd~AlkLvd~~~~~~vie~~~aeR~rAvLLtAkA~s~ldadp~~Ar~~A~~a~KL-~pdlvPaav~AAralf~ 275 (531)
T COG3898 197 ARCAAGDWDGALKLVDAQRAAKVIEKDVAERSRAVLLTAKAMSLLDADPASARDDALEANKL-APDLVPAAVVAARALFR 275 (531)
T ss_pred HHHhcCChHHHHHHHHHHHHHHhhchhhHHHHHHHHHHHHHHHHhcCChHHHHHHHHHHhhc-CCccchHHHHHHHHHHh
Confidence 4567899999999988 432 23321 1222222 77888888899999 69988888888999999
Q ss_pred cccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 85 MKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
.|+..++-.+.+.+-+..| +|.|..
T Consensus 276 d~~~rKg~~ilE~aWK~eP-HP~ia~ 300 (531)
T COG3898 276 DGNLRKGSKILETAWKAEP-HPDIAL 300 (531)
T ss_pred ccchhhhhhHHHHHHhcCC-ChHHHH
Confidence 9999999999999999999 998864
No 271
>KOG1585|consensus
Probab=80.98 E-value=21 Score=29.71 Aligned_cols=67 Identities=24% Similarity=0.351 Sum_probs=47.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC-HHHHHHHHHHHHHhhCCCCh-----HHHHHHHHHHHhcccHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF-PDDAVVQYTQACRIIGTSDL-----ALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l-y~dAa~~ye~Aw~l~~~~~~-----~vgykLA~~ylk~kr~~~AI~v 94 (122)
+|.+.|..|.|...|. + .++.|-. +++|...|++++..--.++. ...=+++..|.+.++|.||-.-
T Consensus 100 lY~E~GspdtAAmaleKA-------ak~lenv~Pd~AlqlYqralavve~~dr~~ma~el~gk~sr~lVrl~kf~Eaa~a 172 (308)
T KOG1585|consen 100 LYVECGSPDTAAMALEKA-------AKALENVKPDDALQLYQRALAVVEEDDRDQMAFELYGKCSRVLVRLEKFTEAATA 172 (308)
T ss_pred HHHHhCCcchHHHHHHHH-------HHHhhcCCHHHHHHHHHHHHHHHhccchHHHHHHHHHHhhhHhhhhHHhhHHHHH
Confidence 8888888888888777 6 4555555 89999999999987533221 1223567788888999887654
Q ss_pred H
Q psy16144 95 S 95 (122)
Q Consensus 95 c 95 (122)
.
T Consensus 173 ~ 173 (308)
T KOG1585|consen 173 F 173 (308)
T ss_pred H
Confidence 3
No 272
>KOG2471|consensus
Probab=79.25 E-value=2.8 Score=37.74 Aligned_cols=68 Identities=16% Similarity=0.047 Sum_probs=54.1
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHh-------------------hCcchHHHHh----hC----HHHHHHHHHHHHHh
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQ-------------------YISSATNIYN----LF----PDDAVVQYTQACRI 66 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~-------------------~nps~~kA~e----~l----y~dAa~~ye~Aw~l 66 (122)
.|+.| |+.+.|+|..+..+|. +|+ .|+||-.-|. |+ +..|..||.+|..-
T Consensus 285 f~NNlGcIh~~~~~y~~~~~~F~kAL~N~c~qL~~g~~~~~~~tls~nks~eilYNcG~~~Lh~grPl~AfqCf~~av~v 364 (696)
T KOG2471|consen 285 FNNNLGCIHYQLGCYQASSVLFLKALRNSCSQLRNGLKPAKTFTLSQNKSMEILYNCGLLYLHSGRPLLAFQCFQKAVHV 364 (696)
T ss_pred eecCcceEeeehhhHHHHHHHHHHHHHHHHHHHhccCCCCcceehhcccchhhHHhhhHHHHhcCCcHHHHHHHHHHHHH
Confidence 36777 9999999999999999 885 3444444443 22 99999999999999
Q ss_pred hCCCChHHHHHHHHHHHhc
Q psy16144 67 IGTSDLALHQKLASLLLRM 85 (122)
Q Consensus 67 ~~~~~~~vgykLA~~ylk~ 85 (122)
-.+||..+.+||-+-+-.
T Consensus 365 -fh~nPrlWLRlAEcCima 382 (696)
T KOG2471|consen 365 -FHRNPRLWLRLAECCIMA 382 (696)
T ss_pred -HhcCcHHHHHHHHHHHHH
Confidence 489999999999776543
No 273
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=78.97 E-value=3.5 Score=34.81 Aligned_cols=36 Identities=25% Similarity=0.255 Sum_probs=29.1
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 75 HQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 75 gykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
..+-|..|++.|.+-|||.+|+++|..||=.-...+
T Consensus 282 lgkva~~yle~g~~neAi~l~qr~ltldpL~e~~nk 317 (361)
T COG3947 282 LGKVARAYLEAGKPNEAIQLHQRALTLDPLSEQDNK 317 (361)
T ss_pred HHHHHHHHHHcCChHHHHHHHHHHhhcChhhhHHHH
Confidence 456688999999999999999999999995444433
No 274
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=78.95 E-value=14 Score=30.30 Aligned_cols=71 Identities=14% Similarity=0.116 Sum_probs=52.4
Q ss_pred CChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHH
Q psy16144 27 WRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQ 96 (122)
Q Consensus 27 gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~ 96 (122)
.+|++=+...+ +++... .++++-+ ++.+....+..... +|-+..+...|=-.|++.|+...||..|+
T Consensus 135 ~~f~~WV~~~R~~l~e~~--~~~l~~lae~~~~~~~~~~~~~~l~~Li~~-dp~~E~~~~~lm~~y~~~g~~~~ai~~y~ 211 (280)
T COG3629 135 DRFDEWVLEQRRALEELF--IKALTKLAEALIACGRADAVIEHLERLIEL-DPYDEPAYLRLMEAYLVNGRQSAAIRAYR 211 (280)
T ss_pred chHHHHHHHHHHHHHHHH--HHHHHHHHHHHHhcccHHHHHHHHHHHHhc-CccchHHHHHHHHHHHHcCCchHHHHHHH
Confidence 44666666666 555332 2222211 78888888888888 79999999999999999999999999999
Q ss_pred HHHH
Q psy16144 97 TIMK 100 (122)
Q Consensus 97 kVL~ 100 (122)
+.=+
T Consensus 212 ~l~~ 215 (280)
T COG3629 212 QLKK 215 (280)
T ss_pred HHHH
Confidence 7755
No 275
>KOG2796|consensus
Probab=78.94 E-value=9 Score=32.23 Aligned_cols=87 Identities=13% Similarity=0.080 Sum_probs=65.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcch-HHHHh---hC------HHHHHHHHH----HHHHhh-CCCChHHHHHHHHHHHhc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSA-TNIYN---LF------PDDAVVQYT----QACRII-GTSDLALHQKLASLLLRM 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~-~kA~e---~l------y~dAa~~ye----~Aw~l~-~~~~~~vgykLA~~ylk~ 85 (122)
.+...|.|--..++++ ++++||.. +.-.. .+ -+.|..+++ .+=+++ .+.+-.||.+.|+.|+.+
T Consensus 186 ~llG~kEy~iS~d~~~~vi~~~~e~~p~L~s~Lgr~~MQ~GD~k~a~~yf~~vek~~~kL~~~q~~~~V~~n~a~i~lg~ 265 (366)
T KOG2796|consen 186 CLLGMKEYVLSVDAYHSVIKYYPEQEPQLLSGLGRISMQIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMNSAFLHLGQ 265 (366)
T ss_pred HHhcchhhhhhHHHHHHHHHhCCcccHHHHHHHHHHHHhcccHHHHHHHHHHHHHHHhhhhccchhHHHHhhhhhheecc
Confidence 6677899999999999 99999553 32222 11 555656666 232332 356778999999999999
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypki 108 (122)
++|+.|-.-+.+++..+|..|-.
T Consensus 266 nn~a~a~r~~~~i~~~D~~~~~a 288 (366)
T KOG2796|consen 266 NNFAEAHRFFTEILRMDPRNAVA 288 (366)
T ss_pred cchHHHHHHHhhccccCCCchhh
Confidence 99999999999999999977754
No 276
>KOG1130|consensus
Probab=78.56 E-value=8.6 Score=34.22 Aligned_cols=79 Identities=11% Similarity=0.081 Sum_probs=54.6
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchH------HHHh-------hC--HHHHHHHHHHHHHh----hCCC-ChHHHHHHHHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSAT------NIYN-------LF--PDDAVVQYTQACRI----IGTS-DLALHQKLASL 81 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~------kA~e-------~l--y~dAa~~ye~Aw~l----~~~~-~~~vgykLA~~ 81 (122)
|--.|.|+.|+..=+ -|.+.+... +|+. .+ +..|+.+|..++.+ ++.. -....|.||..
T Consensus 205 yYlLGdf~~ai~~H~~RL~ia~efGDrAaeRRA~sNlgN~hiflg~fe~A~ehYK~tl~LAielg~r~vEAQscYSLgNt 284 (639)
T KOG1130|consen 205 YYLLGDFDQAIHFHKLRLEIAQEFGDRAAERRAHSNLGNCHIFLGNFELAIEHYKLTLNLAIELGNRTVEAQSCYSLGNT 284 (639)
T ss_pred eeeeccHHHHHHHHHHHHHHHHHhhhHHHHHHhhcccchhhhhhcccHhHHHHHHHHHHHHHHhcchhHHHHHHHHhhhH
Confidence 344588999988766 555544422 2222 22 99999999987654 2211 23458999999
Q ss_pred HHhcccHHHHHHHHHHHHHh
Q psy16144 82 LLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~ 101 (122)
|-..+.+..||+-..+-|++
T Consensus 285 ytll~e~~kAI~Yh~rHLaI 304 (639)
T KOG1130|consen 285 YTLLKEVQKAITYHQRHLAI 304 (639)
T ss_pred HHHHHHHHHHHHHHHHHHHH
Confidence 99999999999977776654
No 277
>PF04910 Tcf25: Transcriptional repressor TCF25; InterPro: IPR006994 This entry appears to represent a novel family of basic helix-loop-helix (bHLH) proteins that control differentiation and development of a variety of organs [, ]. Human Nulp1 (Q2MK75 from SWISSPROT) is a basic helix-loop-helix protein expressed broadly during early embryonic organogenesis. Over expression of human Nulp1 in COS-7 cells inhibits the transcriptional activity of serum response factor (SRF), suggesting that Nulp1 may act as a novel bHLH transcriptional repressor in the SRF signalling pathway to mediate cellular functions [].
Probab=78.50 E-value=12 Score=31.37 Aligned_cols=101 Identities=11% Similarity=0.015 Sum_probs=52.5
Q ss_pred hHHHHHHHHHH-------HHHH--HhcCCCChHHHHHHHH-HHhhCcchH-HHHhhCHHHHHHHHHHHHHh---hCCCCh
Q psy16144 7 DTMQIIQTIAE-------WHFL--LYLLPWRWEPLSAGLR-KKQYISSAT-NIYNLFPDDAVVQYTQACRI---IGTSDL 72 (122)
Q Consensus 7 ~~~~~~~~~~~-------~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~-kA~e~ly~dAa~~ye~Aw~l---~~~~~~ 72 (122)
|.++|++-+.. +..+ ++.++|+++.|.++++ ||=.--++. ..+.- +.+....=..=+.+ .|..+-
T Consensus 25 Dp~~l~~ll~~~PyHidtLlqls~v~~~~gd~~~A~~lleRALf~~e~~~~~~F~~-~~~~~~~g~~rL~~~~~eNR~ff 103 (360)
T PF04910_consen 25 DPNALINLLQKNPYHIDTLLQLSEVYRQQGDHAQANDLLERALFAFERAFHPSFSP-FRSNLTSGNCRLDYRRPENRQFF 103 (360)
T ss_pred CHHHHHHHHHHCCCcHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHhhh-hhcccccCccccCCccccchHHH
Confidence 66667765543 3334 8899999999999999 754221111 11100 00000000000000 123333
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHhCCC-CHHH
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKAHPS-YPKV 108 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~-ypki 108 (122)
-++++...++.+-|-+.-|.++|+-.|..+|+ .|-.
T Consensus 104 lal~r~i~~L~~RG~~rTAlE~~KlLlsLdp~~DP~g 140 (360)
T PF04910_consen 104 LALFRYIQSLGRRGCWRTALEWCKLLLSLDPDEDPLG 140 (360)
T ss_pred HHHHHHHHHHHhcCcHHHHHHHHHHHHhcCCCCCcch
Confidence 45566666666667777777777777777776 5543
No 278
>PF01535 PPR: PPR repeat; InterPro: IPR002885 This entry represents the PPR repeat. Pentatricopeptide repeat (PPR) proteins are characterised by tandem repeats of a degenerate 35 amino acid motif []. Most of PPR proteins have roles in mitochondria or plastid []. PPR repeats were discovered while screening Arabidopsis proteins for those predicted to be targeted to mitochondria or chloroplast [, ]. Some of these proteins have been shown to play a role in post-transcriptional processes within organelles and they are thought to be sequence-specific RNA-binding proteins [, , ]. Plant genomes have between one hundred to five hundred PPR genes per genome whereas non-plant genomes encode two to six PPR proteins. Although no PPR structures are yet known, the motif is predicted to fold into a helix-turn-helix structure similar to those found in the tetratricopeptide repeat (TPR) family (see PDOC50005 from PROSITEDOC) []. The plant PPR protein family has been divided in two subfamilies on the basis of their motif content and organisation [, ]. Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation [], and crp1, which is involved in RNA processing []. The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organisation similar to the human BRCA1 protein.
Probab=77.87 E-value=3.1 Score=21.11 Aligned_cols=19 Identities=11% Similarity=-0.003 Sum_probs=17.0
Q ss_pred HhcCCCChHHHHHHHH-HHh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQ 40 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~ 40 (122)
.|.+.|++++|.++++ ..+
T Consensus 9 ~~~~~~~~~~a~~~~~~M~~ 28 (31)
T PF01535_consen 9 GYCKMGQFEEALEVFDEMRE 28 (31)
T ss_pred HHHccchHHHHHHHHHHHhH
Confidence 8999999999999998 654
No 279
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=77.46 E-value=16 Score=23.28 Aligned_cols=15 Identities=0% Similarity=-0.237 Sum_probs=7.7
Q ss_pred hHHHHHHHH-HHhhCc
Q psy16144 29 WEPLSAGLR-KKQYIS 43 (122)
Q Consensus 29 ~d~A~elL~-~l~~np 43 (122)
++.|.+++. +++.+.
T Consensus 5 ~~~A~~li~~Av~~d~ 20 (77)
T smart00745 5 LSKAKELISKALKADE 20 (77)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 445555555 554444
No 280
>COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only]
Probab=77.36 E-value=5.8 Score=32.56 Aligned_cols=58 Identities=12% Similarity=-0.050 Sum_probs=48.6
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLAS 80 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~ 80 (122)
..++|+.+.|.++.. ++...|.-...|..+ .+.|+..|++.+++--++...+..+|+.
T Consensus 5 ~~~~~D~~aaaely~qal~lap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~gaa~kLa~ 72 (287)
T COG4976 5 LAESGDAEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHGGAALKLAV 72 (287)
T ss_pred hcccCChHHHHHHHHHHhhcCchhhhhhhhcchhhhhcccHHHHHHHHHHHHcCCcccccchhhhHHh
Confidence 357899999999999 999999999999866 8899999999999954455556677754
No 281
>PF08424 NRDE-2: NRDE-2, necessary for RNA interference; InterPro: IPR013633 This is domain is found in eukaryotic proteins of unknown function.
Probab=76.52 E-value=22 Score=28.98 Aligned_cols=59 Identities=12% Similarity=0.194 Sum_probs=47.3
Q ss_pred HHHHHHHHhhCCCChHHHHHHHHHHHhccc------------HHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy16144 58 VQYTQACRIIGTSDLALHQKLASLLLRMKR------------YAEAIDVSQTIMKAHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 58 ~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr------------~~~AI~vc~kVL~~~P~ypkir~~il~ka~ 117 (122)
..|++..+- +|+|......|...-.+.-. .+.-+.+++++|+.||++++++..-|+-+.
T Consensus 6 ~el~~~v~~-~P~di~~Wl~li~~Qd~~~~~~~~~~~~~~a~~E~klsilerAL~~np~~~~L~l~~l~~~~ 76 (321)
T PF08424_consen 6 AELNRRVRE-NPHDIEAWLELIEFQDELFRLQSSSKAERRALAERKLSILERALKHNPDSERLLLGYLEEGE 76 (321)
T ss_pred HHHHHHHHh-CcccHHHHHHHHHHHHHhccccccchhhHHHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Confidence 457777777 79999999999876655543 456789999999999999999887776554
No 282
>PF14561 TPR_20: Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=75.21 E-value=7 Score=26.37 Aligned_cols=47 Identities=15% Similarity=0.094 Sum_probs=27.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHHhhCCCChHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACRIIGTSDLAL 74 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~~~~~~~v 74 (122)
.++..|++++|.+.|- +++.+|+... +.|-+.+-..++..++++|-+
T Consensus 31 ~~~~~g~~e~Al~~Ll~~v~~dr~~~~------~~ar~~ll~~f~~lg~~~plv 78 (90)
T PF14561_consen 31 ALLAAGDYEEALDQLLELVRRDRDYED------DAARKRLLDIFELLGPGDPLV 78 (90)
T ss_dssp HHHHTT-HHHHHHHHHHHHCC-TTCCC------CHHHHHHHHHHHHH-TT-HHH
T ss_pred HHHHCCCHHHHHHHHHHHHHhCccccc------cHHHHHHHHHHHHcCCCChHH
Confidence 6667777888777777 7777776322 334455666666666666644
No 283
>PF14863 Alkyl_sulf_dimr: Alkyl sulfatase dimerisation; PDB: 2YHE_C 2CG2_A 2CG3_A 2CFU_A 2CFZ_A.
Probab=73.46 E-value=18 Score=26.68 Aligned_cols=40 Identities=15% Similarity=0.126 Sum_probs=33.4
Q ss_pred ChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 71 DLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 71 ~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
....-.+-|...+..|+|.-|.+++..++..+|++..+|.
T Consensus 69 G~d~vl~~A~~~~~~gd~~wA~~L~d~l~~adp~n~~ar~ 108 (141)
T PF14863_consen 69 GADKVLERAQAALAAGDYQWAAELLDHLVFADPDNEEARQ 108 (141)
T ss_dssp CHHHHHHHHHHHHHCT-HHHHHHHHHHHHHH-TT-HHHHH
T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHcCCCcHHHHH
Confidence 3677788899999999999999999999999999998875
No 284
>PF05843 Suf: Suppressor of forked protein (Suf); InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=73.27 E-value=21 Score=28.36 Aligned_cols=60 Identities=15% Similarity=0.076 Sum_probs=46.9
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh-cccHHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLR-MKRYAEAIDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk-~kr~~~AI~vc~kVL~~~P~ypkir~~il 113 (122)
.+.|-..|++|.+- ++....+....|..-.. .++..-|..|++..|+..|..+.++..=+
T Consensus 17 ~~~aR~vF~~a~~~-~~~~~~vy~~~A~~E~~~~~d~~~A~~Ife~glk~f~~~~~~~~~Y~ 77 (280)
T PF05843_consen 17 IEAARKVFKRARKD-KRCTYHVYVAYALMEYYCNKDPKRARKIFERGLKKFPSDPDFWLEYL 77 (280)
T ss_dssp HHHHHHHHHHHHCC-CCS-THHHHHHHHHHHHTCS-HHHHHHHHHHHHHHHTT-HHHHHHHH
T ss_pred hHHHHHHHHHHHcC-CCCCHHHHHHHHHHHHHhCCCHHHHHHHHHHHHHHCCCCHHHHHHHH
Confidence 67888999999855 56678899988888566 56666699999999999999999876544
No 285
>TIGR00756 PPR pentatricopeptide repeat domain (PPR motif). This family has a similar consensus to the TPR domain (tetratricopeptide), pfam pfam00515, a 33-residue repeat. It is predicted to form a pair of antiparallel helices similar to that of TPR.
Probab=73.26 E-value=5.4 Score=20.36 Aligned_cols=19 Identities=11% Similarity=-0.195 Sum_probs=16.8
Q ss_pred HhcCCCChHHHHHHHH-HHh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQ 40 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~ 40 (122)
-|.+.|++++|.++|+ ..+
T Consensus 9 ~~~~~~~~~~a~~~~~~M~~ 28 (35)
T TIGR00756 9 GLCKAGRVEEALELFKEMLE 28 (35)
T ss_pred HHHHCCCHHHHHHHHHHHHH
Confidence 7899999999999999 654
No 286
>smart00299 CLH Clathrin heavy chain repeat homology.
Probab=72.87 E-value=28 Score=24.05 Aligned_cols=91 Identities=11% Similarity=0.151 Sum_probs=54.6
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHhhC--------HHHHHHHHH---------HHHHhhCCCChHHHHHHHHHHHh
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYNLF--------PDDAVVQYT---------QACRIIGTSDLALHQKLASLLLR 84 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e~l--------y~dAa~~ye---------~Aw~l~~~~~~~vgykLA~~ylk 84 (122)
+.+.|.......+|+ +++.|+..+..+..+ ..+....++ .|++++...+. .-...+.|.|
T Consensus 17 ~~~~~~~~~l~~yLe~~~~~~~~~~~~~~~li~ly~~~~~~~ll~~l~~~~~~yd~~~~~~~c~~~~l--~~~~~~l~~k 94 (140)
T smart00299 17 FEKRNLLEELIPYLESALKLNSENPALQTKLIELYAKYDPQKEIERLDNKSNHYDIEKVGKLCEKAKL--YEEAVELYKK 94 (140)
T ss_pred HHhCCcHHHHHHHHHHHHccCccchhHHHHHHHHHHHHCHHHHHHHHHhccccCCHHHHHHHHHHcCc--HHHHHHHHHh
Confidence 345678888888999 888887666555533 233333333 35666544333 2356778888
Q ss_pred cccHHHHHHHHHHHH---------HhCCCCHHHHHHHHHH
Q psy16144 85 MKRYAEAIDVSQTIM---------KAHPSYPKVHKQIFEK 115 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL---------~~~P~ypkir~~il~k 115 (122)
.|.+.+|++++-+-+ ...++.|+.+..++.-
T Consensus 95 ~~~~~~Al~~~l~~~~d~~~a~~~~~~~~~~~lw~~~~~~ 134 (140)
T smart00299 95 DGNFKDAIVTLIEHLGNYEKAIEYFVKQNNPELWAEVLKA 134 (140)
T ss_pred hcCHHHHHHHHHHcccCHHHHHHHHHhCCCHHHHHHHHHH
Confidence 888888888764321 1123456666666543
No 287
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=72.15 E-value=17 Score=28.39 Aligned_cols=65 Identities=11% Similarity=0.145 Sum_probs=45.1
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCC--CCHHHHHHHHHHHHhc
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHP--SYPKVHKQIFEKAMVH 119 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P--~ypkir~~il~ka~~~ 119 (122)
...|..+|.+--.. .--..++..+|.-|.+.|+|.+|+..++.++..+. .=..+-.+|+-+.+.+
T Consensus 161 L~~A~~~f~~~~~~--R~~~~l~~~~A~ey~~~g~~~~A~~~l~~~~~~yr~egW~~l~~~~l~~l~~C 227 (247)
T PF11817_consen 161 LEKAYEQFKKYGQN--RMASYLSLEMAEEYFRLGDYDKALKLLEPAASSYRREGWWSLLTEVLWRLLEC 227 (247)
T ss_pred HHHHHHHHHHhccc--hHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHH
Confidence 55566666554442 33467788999999999999999999999976644 2255666666655543
No 288
>PF08631 SPO22: Meiosis protein SPO22/ZIP4 like; InterPro: IPR013940 SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=71.93 E-value=48 Score=26.23 Aligned_cols=56 Identities=20% Similarity=0.221 Sum_probs=29.4
Q ss_pred HHHHHHHHHHHHHhhC------CCC-------hHHHHHHHHHHHhcccHHHHHHHHHHH----HHhCCCCHHHH
Q psy16144 53 PDDAVVQYTQACRIIG------TSD-------LALHQKLASLLLRMKRYAEAIDVSQTI----MKAHPSYPKVH 109 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~------~~~-------~~vgykLA~~ylk~kr~~~AI~vc~kV----L~~~P~ypkir 109 (122)
|++|+.-+++|.++.+ +.. ..+.--||.+|+..+.++.... |..+ -..+|+.|-+.
T Consensus 52 ~~~a~~wL~~a~~~l~~~~~~~~~~~~~~elr~~iL~~La~~~l~~~~~~~~~k-a~~~l~~l~~e~~~~~~~~ 124 (278)
T PF08631_consen 52 YEEAVKWLQRAYDILEKPGKMDKLSPDGSELRLSILRLLANAYLEWDTYESVEK-ALNALRLLESEYGNKPEVF 124 (278)
T ss_pred hHHHHHHHHHHHHHHHhhhhccccCCcHHHHHHHHHHHHHHHHHcCCChHHHHH-HHHHHHHHHHhCCCCcHHH
Confidence 6666666777766621 111 1345566777777666543222 3333 34456655543
No 289
>KOG1586|consensus
Probab=71.10 E-value=45 Score=27.57 Aligned_cols=85 Identities=11% Similarity=-0.029 Sum_probs=52.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQT 97 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~k 97 (122)
|.-.++||++|.+++. +-..-+ -.|-|..- |-+|+.+.-++ .+.++..--|-=|++--|..+.++|+.+.++
T Consensus 23 lfgg~~k~eeAadl~~~Aan~yk-laK~w~~AG~aflkaA~~h~k~---~skhDaat~YveA~~cykk~~~~eAv~cL~~ 98 (288)
T KOG1586|consen 23 LFGGSNKYEEAAELYERAANMYK-LAKNWSAAGDAFLKAADLHLKA---GSKHDAATTYVEAANCYKKVDPEEAVNCLEK 98 (288)
T ss_pred ccCCCcchHHHHHHHHHHHHHHH-HHHhHHHHHHHHHHHHHHHHhc---CCchhHHHHHHHHHHHhhccChHHHHHHHHH
Confidence 5566789999999887 532211 12222211 44444444333 3455566667777777777788888888888
Q ss_pred HHHhCCCCHHHHH
Q psy16144 98 IMKAHPSYPKVHK 110 (122)
Q Consensus 98 VL~~~P~ypkir~ 110 (122)
+..+.-+--+.+.
T Consensus 99 aieIyt~~Grf~~ 111 (288)
T KOG1586|consen 99 AIEIYTDMGRFTM 111 (288)
T ss_pred HHHHHHhhhHHHH
Confidence 8777765555443
No 290
>KOG1130|consensus
Probab=70.94 E-value=8.5 Score=34.24 Aligned_cols=80 Identities=15% Similarity=0.065 Sum_probs=52.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhCH---HHH---HHHHHHHHHhhCCCChHH-------------HHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLFP---DDA---VVQYTQACRIIGTSDLAL-------------HQKLASL 81 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~ly---~dA---a~~ye~Aw~l~~~~~~~v-------------gykLA~~ 81 (122)
=..+.|++...+.+|+ +++.--+..+-...+| ..| ..-|++|+++ +..|..+ .=+||..
T Consensus 26 RLck~gdcraGv~ff~aA~qvGTeDl~tLSAIYsQLGNAyfyL~DY~kAl~y-H~hDltlar~lgdklGEAKssgNLGNt 104 (639)
T KOG1130|consen 26 RLCKMGDCRAGVDFFKAALQVGTEDLSTLSAIYSQLGNAYFYLKDYEKALKY-HTHDLTLARLLGDKLGEAKSSGNLGNT 104 (639)
T ss_pred HHHhccchhhhHHHHHHHHHhcchHHHHHHHHHHHhcchhhhHhhHHHHHhh-hhhhHHHHHHhcchhccccccccccch
Confidence 4678899999999999 9987766554433110 000 1234555555 3434333 2356778
Q ss_pred HHhcccHHHHHHHHHHHHHhC
Q psy16144 82 LLRMKRYAEAIDVSQTIMKAH 102 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~~ 102 (122)
+--.|+|.||+.+|.+-|+..
T Consensus 105 lKv~G~fdeA~~cc~rhLd~a 125 (639)
T KOG1130|consen 105 LKVKGAFDEALTCCFRHLDFA 125 (639)
T ss_pred hhhhcccchHHHHHHHHhHHH
Confidence 888999999999999988643
No 291
>PF11207 DUF2989: Protein of unknown function (DUF2989); InterPro: IPR021372 Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed.
Probab=70.90 E-value=15 Score=28.91 Aligned_cols=60 Identities=12% Similarity=0.140 Sum_probs=49.3
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il 113 (122)
=++|...|.++-.-..=++|+.-|.||.-|.| .+..+|+.++.++|+..+...++--+|+
T Consensus 122 d~~A~~~fL~~E~~~~l~t~elq~aLAtyY~k-rD~~Kt~~ll~~~L~l~~~~~~~n~eil 181 (203)
T PF11207_consen 122 DQEALRRFLQLEGTPELETAELQYALATYYTK-RDPEKTIQLLLRALELSNPDDNFNPEIL 181 (203)
T ss_pred cHHHHHHHHHHcCCCCCCCHHHHHHHHHHHHc-cCHHHHHHHHHHHHHhcCCCCCCCHHHH
Confidence 56788888888777667889999999999995 5678999999999999987756665665
No 292
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=70.60 E-value=15 Score=29.10 Aligned_cols=88 Identities=15% Similarity=0.110 Sum_probs=55.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc-hHHHHh--hC---------HHHHHHHHHHHHHhhCCCChH-HHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS-ATNIYN--LF---------PDDAVVQYTQACRIIGTSDLA-LHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps-~~kA~e--~l---------y~dAa~~ye~Aw~l~~~~~~~-vgykLA~~ylk~kr 87 (122)
-++..|++|+|+.-|+ ++.+-++ ..++.- ++ +++|...+..--+ ++..+ +.-.=|-.++-.|+
T Consensus 98 ~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~t~~~---~~w~~~~~elrGDill~kg~ 174 (207)
T COG2976 98 AEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAALKTLDTIKE---ESWAAIVAELRGDILLAKGD 174 (207)
T ss_pred HHHhhccHHHHHHHHHHHHccchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHhcccc---ccHHHHHHHHhhhHHHHcCc
Confidence 6788899999999999 8875444 222222 11 6666554443211 11122 23334889999999
Q ss_pred HHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 88 YAEAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 88 ~~~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
-++|..-|.+.|...++ |-.+ +||.
T Consensus 175 k~~Ar~ay~kAl~~~~s-~~~~-~~lq 199 (207)
T COG2976 175 KQEARAAYEKALESDAS-PAAR-EILQ 199 (207)
T ss_pred hHHHHHHHHHHHHccCC-hHHH-HHHH
Confidence 99999999999988854 3333 4554
No 293
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=70.46 E-value=7.3 Score=30.69 Aligned_cols=58 Identities=9% Similarity=0.201 Sum_probs=41.1
Q ss_pred HHHHHHHHHHHHHhhCCCCh---------HHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDL---------ALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIF 113 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~---------~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il 113 (122)
.+.|...++.--+-.. .| -+---.-.++++.|+|.+|++|..+... +|+.++-|.+.+
T Consensus 85 LESAl~v~~~I~~E~~--~~~~lhe~i~~lik~~aV~VCm~~g~Fk~A~eiLkr~~~-d~~~~~~r~kL~ 151 (200)
T cd00280 85 LESALMVLESIEKEFS--LPETLHEEIRKLIKEQAVAVCMENGEFKKAEEVLKRLFS-DPESQKLRMKLL 151 (200)
T ss_pred HHHHHHHHHHHHHhcC--CcHHHHHHHHHHHHHHHHHHHHhcCchHHHHHHHHHHhc-CCCchhHHHHHH
Confidence 5667666666554421 22 1223345678999999999999999997 999988877654
No 294
>KOG0529|consensus
Probab=69.81 E-value=15 Score=31.90 Aligned_cols=54 Identities=4% Similarity=0.036 Sum_probs=39.0
Q ss_pred ChHHHHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCC
Q psy16144 28 RWEPLSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 28 k~d~A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~y 105 (122)
-.|+-.++.. ||+.||.+.-+| +.+.|-+++...+..+.. ..+|.++|+++|-+
T Consensus 90 ~ld~eL~~~~~~L~~npksY~aW----------~hR~w~L~~~p~~~~~~E--------------L~lcek~L~~D~RN 144 (421)
T KOG0529|consen 90 LLDEELKYVESALKVNPKSYGAW----------HHRKWVLQKNPHSDWNTE--------------LQLCEKALKQDPRN 144 (421)
T ss_pred hhHHHHHHHHHHHHhCchhHHHH----------HHHHHHHHhCCCchHHHH--------------HHHHHHHHhcCccc
Confidence 4566677788 888888877777 677788864444456666 56789999998833
No 295
>PF13041 PPR_2: PPR repeat family
Probab=69.07 E-value=7.8 Score=22.46 Aligned_cols=26 Identities=15% Similarity=0.080 Sum_probs=20.6
Q ss_pred HHHHH--HhcCCCChHHHHHHHH-HHhhC
Q psy16144 17 EWHFL--LYLLPWRWEPLSAGLR-KKQYI 42 (122)
Q Consensus 17 ~~~~L--Lyi~~gk~d~A~elL~-~l~~n 42 (122)
.|+.+ -|.+.|++++|.++++ ..+.+
T Consensus 5 ~yn~li~~~~~~~~~~~a~~l~~~M~~~g 33 (50)
T PF13041_consen 5 TYNTLISGYCKAGKFEEALKLFKEMKKRG 33 (50)
T ss_pred HHHHHHHHHHHCcCHHHHHHHHHHHHHcC
Confidence 35555 8899999999999999 66543
No 296
>PF09613 HrpB1_HrpK: Bacterial type III secretion protein (HrpB1_HrpK); InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=69.05 E-value=34 Score=25.97 Aligned_cols=57 Identities=16% Similarity=0.179 Sum_probs=46.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
.+|+...+.-.--+ .|+++++..--|..++..|++.||+.+.+.+....|..|-.+.
T Consensus 26 ~~D~e~lL~ALrvL-RP~~~e~~~~~~~l~i~r~~w~dA~rlLr~l~~~~~~~p~~kA 82 (160)
T PF09613_consen 26 PDDAEALLDALRVL-RPEFPELDLFDGWLHIVRGDWDDALRLLRELEERAPGFPYAKA 82 (160)
T ss_pred hHHHHHHHHHHHHh-CCCchHHHHHHHHHHHHhCCHHHHHHHHHHHhccCCCChHHHH
Confidence 56776666555555 7999999999999999999999999999999988888886653
No 297
>COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms]
Probab=68.50 E-value=26 Score=28.81 Aligned_cols=45 Identities=13% Similarity=0.053 Sum_probs=41.6
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRI 66 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l 66 (122)
.++..|++|.+++.++ .+...|=.-.+|+++ ..+|+..|++.|.+
T Consensus 162 ~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~~ 216 (280)
T COG3629 162 ALIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKT 216 (280)
T ss_pred HHHhcccHHHHHHHHHHHHhcCccchHHHHHHHHHHHHcCCchHHHHHHHHHHHH
Confidence 7888999999999999 999999999999855 89999999999995
No 298
>PF11846 DUF3366: Domain of unknown function (DUF3366); InterPro: IPR021797 This domain is functionally uncharacterised. This domain is found in bacteria. This presumed domain is about 200 amino acids in length.
Probab=68.32 E-value=31 Score=25.53 Aligned_cols=48 Identities=17% Similarity=0.041 Sum_probs=40.3
Q ss_pred HHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 54 DDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 54 ~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
.+.....++..+. .| +|.+..+++..+...|+.++|.++-++....+|
T Consensus 128 ~~~~~~a~~~l~~-~P-~~~~~~~~a~~l~~~G~~~eA~~~~~~~~~lyP 175 (193)
T PF11846_consen 128 EAYIEWAERLLRR-RP-DPNVYQRYALALALLGDPEEARQWLARARRLYP 175 (193)
T ss_pred HHHHHHHHHHHHh-CC-CHHHHHHHHHHHHHcCCHHHHHHHHHHHHHhCC
Confidence 3444555666666 46 699999999999999999999999999999999
No 299
>KOG0545|consensus
Probab=66.98 E-value=47 Score=27.78 Aligned_cols=54 Identities=15% Similarity=0.075 Sum_probs=47.2
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
|-++..+-...+.. .|+|.-+.|+=|.+..-.=+..+|.+=+.+||+.+|.-..
T Consensus 246 ~yevleh~seiL~~-~~~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpslas 299 (329)
T KOG0545|consen 246 YYEVLEHCSEILRH-HPGNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSLAS 299 (329)
T ss_pred HHHHHHHHHHHHhc-CCchHHHHHHHHHHHHhhcCHHHHHHHHHHHHhcChhhHH
Confidence 77777777888888 7999999999999999999999999999999999996543
No 300
>KOG3677|consensus
Probab=66.34 E-value=4.4 Score=35.72 Aligned_cols=27 Identities=33% Similarity=0.474 Sum_probs=24.4
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHH
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL 99 (122)
.|-|-.||+||-++||.|||+++-.+|
T Consensus 273 ~VTY~VGFayLmmrryadai~~F~niL 299 (525)
T KOG3677|consen 273 RVTYQVGFAYLMMRRYADAIRVFLNIL 299 (525)
T ss_pred eEeeehhHHHHHHHHHHHHHHHHHHHH
Confidence 345889999999999999999999888
No 301
>PF00244 14-3-3: 14-3-3 protein; InterPro: IPR023410 The 14-3-3 proteins are a large family of approximately 30kDa acidic proteins which exist primarily as homo- and heterodimeric within all eukaryotic cells [, ]. There is a high degree of sequence identity and conservation between all the 14-3-3 isotypes, particularly in the regions which form the dimer interface or line the central ligand binding channel of the dimeric molecule. Each 14-3-3 protein sequence can be roughly divided into three sections: a divergent amino terminus, the conserved core region and a divergent carboxyl terminus. The conserved middle core region of the 14-3-3s encodes an amphipathic groove that forms the main functional domain, a cradle for interacting with client proteins. The monomer consists of nine helices organised in an antiparallel manner, forming an L-shaped structure. The interior of the L-structure is composed of four helices: H3 and H5, which contain many charged and polar amino acids, and H7 and H9, which contain hydrophobic amino acids. These four helices form the concave amphipathic groove that interacts with target peptides. 14-3-3 proteins mainly bind proteins containing phosphothreonine or phosphoserine motifs however exceptions to this rule do exist. Extensive investigation of the 14-3-3 binding site of the mammalian serine/threonine kinase Raf-1 has produced a consensus sequence for 14-3-3-binding, RSxpSxP (in the single-letter amino-acid code, where x denotes any amino acid and p indicates that the next residue is phosphorylated). 14-3-3 proteins appear to effect intracellular signalling in one of three ways - by direct regulation of the catalytic activity of the bound protein, by regulating interactions between the bound protein and other molecules in the cell by sequestration or modification or by controlling the subcellular localisation of the bound ligand. Proteins appear to initially bind to a single dominant site and then subsequently to many, much weaker secondary interaction sites. The 14-3-3 dimer is capable of changing the conformation of its bound ligand whilst itself undergoing minimal structural alteration. This entry represents the structural domain found in 14-3-3 proteins.; PDB: 2O8P_A 3AXY_D 2C74_A 2C63_A 4DX0_A 1YWT_A 3P1O_A 3P1N_A 4DAU_A 3U9X_A ....
Probab=65.96 E-value=59 Score=25.52 Aligned_cols=44 Identities=14% Similarity=0.118 Sum_probs=34.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh--hC---HHHHHHHHHHHHHh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN--LF---PDDAVVQYTQACRI 66 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--~l---y~dAa~~ye~Aw~l 66 (122)
|+-+.|+|++..++++ ++..++.-.. -| .+ |+..+..++.+|+.
T Consensus 10 laeq~eRy~dmv~~mk~~~~~~~eLt~-eERnLlsvayKn~i~~~R~s~R~ 59 (236)
T PF00244_consen 10 LAEQAERYDDMVEYMKQLIEMNPELTE-EERNLLSVAYKNVIGSRRASWRI 59 (236)
T ss_dssp HHHHTTHHHHHHHHHHHHHHTSS---H-HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHhcCHHHHHHHHHHHHccCCCCCH-HHHHHHHHHHHhccccchHHHHh
Confidence 8889999999999999 9998886322 22 22 99999999999954
No 302
>PF12968 DUF3856: Domain of Unknown Function (DUF3856); InterPro: IPR024552 This domain of unknown function is found in a small group of tetratricopeptide-like proteins, which includes the uncharacterised protein Q8KAL8 from SWISSPROT. The structure of Q8KAL8 is known and belongs to the SCOP all alpha class, TPR-like superfamily, CT2138-like family.; PDB: 2HR2_D.
Probab=65.60 E-value=22 Score=26.52 Aligned_cols=48 Identities=17% Similarity=0.212 Sum_probs=35.9
Q ss_pred HHHHHHHHHHHHHhhCCCCh-----HHHH------HHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDL-----ALHQ------KLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~-----~vgy------kLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
|.+|+..|.+|.+.+..-.+ -.|| -|+..+-+.|+|.+++.--.+.|.
T Consensus 25 ~~eAa~s~r~AM~~srtiP~eEaFDh~GFDA~chA~Ls~A~~~Lgry~e~L~sA~~aL~ 83 (144)
T PF12968_consen 25 YEEAAASCRKAMEVSRTIPAEEAFDHDGFDAFCHAGLSGALAGLGRYDECLQSADRALR 83 (144)
T ss_dssp HHHHHHHHHHHHHHHTTS-TTS---HHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHHhccCChHhhcccccHHHHHHHHHHHHHHhhccHHHHHHHHHHHHH
Confidence 99999999999998743222 2344 478888999999999988777773
No 303
>KOG3364|consensus
Probab=65.18 E-value=7.7 Score=29.25 Aligned_cols=30 Identities=3% Similarity=-0.211 Sum_probs=22.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNL 51 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~ 51 (122)
=+.+.|.|++|..+++ +|+..|++..|.++
T Consensus 80 g~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~L 110 (149)
T KOG3364|consen 80 GHYRLKEYSKSLRYVDALLETEPNNRQALEL 110 (149)
T ss_pred HHHHHhhHHHHHHHHHHHHhhCCCcHHHHHH
Confidence 4457778888888888 88888887776653
No 304
>KOG2610|consensus
Probab=64.85 E-value=45 Score=29.08 Aligned_cols=40 Identities=28% Similarity=0.125 Sum_probs=19.9
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
|+||.+.-.+|+++ |+.|..+.-.+|.++.--+|+.|+.+
T Consensus 191 y~dAEk~A~ralqi-N~~D~Wa~Ha~aHVlem~~r~Keg~e 230 (491)
T KOG2610|consen 191 YDDAEKQADRALQI-NRFDCWASHAKAHVLEMNGRHKEGKE 230 (491)
T ss_pred chhHHHHHHhhccC-CCcchHHHHHHHHHHHhcchhhhHHH
Confidence 55555555555555 45555555555555444444444443
No 305
>KOG4014|consensus
Probab=63.97 E-value=18 Score=28.95 Aligned_cols=78 Identities=15% Similarity=0.121 Sum_probs=53.8
Q ss_pred CCChHHHHHHHH--HH--hhCcchHHHHhhC----------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 26 PWRWEPLSAGLR--KK--QYISSATNIYNLF----------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 26 ~gk~d~A~elL~--~l--~~nps~~kA~e~l----------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
++.|+.|...|+ |= .|-+||.+.=.|+ ...|++.|+.|++. ++|.+.-.++...--.....
T Consensus 48 ~knF~~A~kv~K~nCden~y~kSCyKyG~y~~~GKgG~~~~l~~a~r~~~~aC~~---n~~~aC~~~gLl~~~g~~~r-- 122 (248)
T KOG4014|consen 48 QKNFQAAVKVFKKNCDENSYPKSCYKYGMYMLAGKGGDDASLSKAIRPMKIACDA---NIPQACRYLGLLHWNGEKDR-- 122 (248)
T ss_pred HHHHHHHHHHHHhcccccCCcHHHHHhhhhhhcccCCCccCHHHHHHHHHHHhcc---CCHHHHhhhhhhhccCcCCc--
Confidence 378999999998 43 3556787765555 89999999999985 45666666665544333322
Q ss_pred HHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy16144 92 IDVSQTIMKAHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 92 I~vc~kVL~~~P~ypkir~~il~ka~ 117 (122)
+.+||.||.+ +-|-+|+
T Consensus 123 --------~~dpd~~Ka~-~y~traC 139 (248)
T KOG4014|consen 123 --------KADPDSEKAE-RYMTRAC 139 (248)
T ss_pred --------cCCCCcHHHH-HHHHHhc
Confidence 3678888876 3555553
No 306
>PF12854 PPR_1: PPR repeat
Probab=63.79 E-value=18 Score=19.75 Aligned_cols=27 Identities=19% Similarity=0.192 Sum_probs=22.4
Q ss_pred ChHHHHHHHHHHHhcccHHHHHHHHHH
Q psy16144 71 DLALHQKLASLLLRMKRYAEAIDVSQT 97 (122)
Q Consensus 71 ~~~vgykLA~~ylk~kr~~~AI~vc~k 97 (122)
|....-.|=..|.|.|+.++|+++.++
T Consensus 6 d~~ty~~lI~~~Ck~G~~~~A~~l~~~ 32 (34)
T PF12854_consen 6 DVVTYNTLIDGYCKAGRVDEAFELFDE 32 (34)
T ss_pred cHhHHHHHHHHHHHCCCHHHHHHHHHh
Confidence 456667788899999999999997764
No 307
>KOG2581|consensus
Probab=62.90 E-value=24 Score=31.16 Aligned_cols=59 Identities=14% Similarity=0.010 Sum_probs=44.8
Q ss_pred HhcCCCChHHHHHHHH-HH----hhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KK----QYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASL 81 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l----~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ 81 (122)
-|+.-+.||.|.++.. .. .-|-.-++...|+ |.+|..++-+|...+ |.+.++||+-..+
T Consensus 218 ~yL~n~lydqa~~lvsK~~~pe~~snne~ARY~yY~GrIkaiqldYssA~~~~~qa~rka-pq~~alGf~q~v~ 290 (493)
T KOG2581|consen 218 NYLHNKLYDQADKLVSKSVYPEAASNNEWARYLYYLGRIKAIQLDYSSALEYFLQALRKA-PQHAALGFRQQVN 290 (493)
T ss_pred HHhhhHHHHHHHHHhhcccCccccccHHHHHHHHHHhhHHHhhcchhHHHHHHHHHHHhC-cchhhhhHHHHHH
Confidence 7888899999999888 43 1122334444454 999999999999995 8899999986554
No 308
>KOG2300|consensus
Probab=62.77 E-value=1.1e+02 Score=27.71 Aligned_cols=79 Identities=14% Similarity=0.136 Sum_probs=56.6
Q ss_pred HhcCCCChHHHHHHHH-HHh---hCcc--hHHHHhh----C----------HHHHHHHHHHHHHhhCCCChHHHH--HHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQ---YISS--ATNIYNL----F----------PDDAVVQYTQACRIIGTSDLALHQ--KLA 79 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~---~nps--~~kA~e~----l----------y~dAa~~ye~Aw~l~~~~~~~vgy--kLA 79 (122)
.-+-.|++.+|++-+. +.. .-|. -..+++- + |+.|+.+|..|.++++..+..+.. +||
T Consensus 332 c~lv~~~~~~al~~i~dm~~w~~r~p~~~Llr~~~~~ih~LlGlys~sv~~~enAe~hf~~a~k~t~~~dl~a~~nlnlA 411 (629)
T KOG2300|consen 332 CRLVRGDYVEALEEIVDMKNWCTRFPTPLLLRAHEAQIHMLLGLYSHSVNCYENAEFHFIEATKLTESIDLQAFCNLNLA 411 (629)
T ss_pred HHHHhCCHHHHHHHHHHHHHHHHhCCchHHHHHhHHHHHHHHhhHhhhcchHHHHHHHHHHHHHhhhHHHHHHHHHHhHH
Confidence 5556799999998877 443 3343 4445551 1 999999999999999888877754 689
Q ss_pred HHHHhcccHHHHHHHHHHHHH-hCCC
Q psy16144 80 SLLLRMKRYAEAIDVSQTIMK-AHPS 104 (122)
Q Consensus 80 ~~ylk~kr~~~AI~vc~kVL~-~~P~ 104 (122)
-.|++.|+-++ +-++|+ +.|.
T Consensus 412 i~YL~~~~~ed----~y~~ld~i~p~ 433 (629)
T KOG2300|consen 412 ISYLRIGDAED----LYKALDLIGPL 433 (629)
T ss_pred HHHHHhccHHH----HHHHHHhcCCC
Confidence 99999888665 344443 4554
No 309
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=62.68 E-value=37 Score=21.54 Aligned_cols=25 Identities=32% Similarity=0.422 Sum_probs=10.9
Q ss_pred HHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKVHKQIFEKA 116 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypkir~~il~ka 116 (122)
|++.|.++++..|+ |..+..+-.|+
T Consensus 32 a~e~l~~~~~~~~~-~~~k~~l~~k~ 56 (75)
T cd02656 32 ALDYLLQALKAEKE-PKLRKLLRKKV 56 (75)
T ss_pred HHHHHHHHhccCCC-HHHHHHHHHHH
Confidence 33334444544442 44444444443
No 310
>KOG2997|consensus
Probab=62.01 E-value=11 Score=31.99 Aligned_cols=36 Identities=17% Similarity=0.230 Sum_probs=28.7
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCC
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ 104 (122)
-++|...|++|.+.. +-|..-|||..|+.+++++||
T Consensus 16 ~kkA~~l~~~av~~E----------------q~G~l~dai~fYR~AlqI~~d 51 (366)
T KOG2997|consen 16 AKKAIALYEKAVLKE----------------QDGSLYDAINFYRDALQIVPD 51 (366)
T ss_pred HHHHHHHHHHHHHHh----------------hcCcHHHHHHHHHhhhcCCch
Confidence 467888888887773 446677999999999999995
No 311
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=61.60 E-value=66 Score=24.06 Aligned_cols=82 Identities=13% Similarity=0.113 Sum_probs=58.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC------------HHHHHHHHHHHHHhhCC-CChHHHHHH----HHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF------------PDDAVVQYTQACRIIGT-SDLALHQKL----ASLLL 83 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l------------y~dAa~~ye~Aw~l~~~-~~~~vgykL----A~~yl 83 (122)
.|.+.|+.++|.+.+. +..+.-+...-.... +.....+.++|-.+... .+.....+| |-.++
T Consensus 45 ~~~~~Gd~~~A~k~y~~~~~~~~~~~~~id~~l~~irv~i~~~d~~~v~~~i~ka~~~~~~~~d~~~~nrlk~~~gL~~l 124 (177)
T PF10602_consen 45 HYCKIGDLEEALKAYSRARDYCTSPGHKIDMCLNVIRVAIFFGDWSHVEKYIEKAESLIEKGGDWERRNRLKVYEGLANL 124 (177)
T ss_pred HHHHhhhHHHHHHHHHHHhhhcCCHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHhccchHHHHHHHHHHHHHHHH
Confidence 8999999999999999 988876644444411 77777777777666433 223333332 67788
Q ss_pred hcccHHHHHHHHHHHHHhCC
Q psy16144 84 RMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 84 k~kr~~~AI~vc~kVL~~~P 103 (122)
..++|.+|...+-.++..+.
T Consensus 125 ~~r~f~~AA~~fl~~~~t~~ 144 (177)
T PF10602_consen 125 AQRDFKEAAELFLDSLSTFT 144 (177)
T ss_pred HhchHHHHHHHHHccCcCCC
Confidence 89999999888777765443
No 312
>PF13812 PPR_3: Pentatricopeptide repeat domain
Probab=60.63 E-value=15 Score=18.87 Aligned_cols=20 Identities=5% Similarity=-0.253 Sum_probs=17.3
Q ss_pred HhcCCCChHHHHHHHH-HHhh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQY 41 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~ 41 (122)
.+.+.|+++.|.++++ ..+.
T Consensus 10 a~~~~g~~~~a~~~~~~M~~~ 30 (34)
T PF13812_consen 10 ACAKAGDPDAALQLFDEMKEQ 30 (34)
T ss_pred HHHHCCCHHHHHHHHHHHHHh
Confidence 7889999999999999 6553
No 313
>KOG4507|consensus
Probab=59.84 E-value=16 Score=33.83 Aligned_cols=75 Identities=17% Similarity=0.080 Sum_probs=59.4
Q ss_pred HHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCCh-HHHHHHHHHHHhcccHHHHHHHHHHH
Q psy16144 30 EPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDL-ALHQKLASLLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 30 d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~-~vgykLA~~ylk~kr~~~AI~vc~kV 98 (122)
++-..++- .+++||++.+.+..- ..||..||..|..++++.+. .+...||-.+.++|+-+||--+.+.+
T Consensus 196 ~~~~~~~~~glq~~~~sw~lH~~as~YWR~~G~~~~A~~Ca~~a~hf~~~h~kdi~lLSlaTiL~RaG~sadA~iILhAA 275 (886)
T KOG4507|consen 196 DDIGHLIHEGLQKNTSSWVLHNMASFYWRIKGEPYQAVECAMRALHFSSRHNKDIALLSLATVLHRAGFSADAAVILHAA 275 (886)
T ss_pred HHHHHHHHHhhhcCchhHHHHHHHHHHHHHcCChhhhhHHHHHHhhhCCcccccchhhhHHHHHHHcccccchhheeehh
Confidence 33344555 889999987766522 88999999999999877665 45789999999999999999999888
Q ss_pred HHhCCC
Q psy16144 99 MKAHPS 104 (122)
Q Consensus 99 L~~~P~ 104 (122)
+..-|+
T Consensus 276 ~~dA~~ 281 (886)
T KOG4507|consen 276 LDDADF 281 (886)
T ss_pred ccCCcc
Confidence 866553
No 314
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=59.17 E-value=45 Score=21.40 Aligned_cols=12 Identities=0% Similarity=-0.144 Sum_probs=5.8
Q ss_pred HHHHHHHH-HHhh
Q psy16144 30 EPLSAGLR-KKQY 41 (122)
Q Consensus 30 d~A~elL~-~l~~ 41 (122)
+.|.++++ +++.
T Consensus 4 ~~A~~l~~~Av~~ 16 (75)
T cd02678 4 QKAIELVKKAIEE 16 (75)
T ss_pred HHHHHHHHHHHHH
Confidence 44555555 5443
No 315
>KOG2396|consensus
Probab=58.38 E-value=41 Score=30.33 Aligned_cols=57 Identities=21% Similarity=0.404 Sum_probs=42.9
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc-HHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR-YAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr-~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
..-=...|+.|..-- +.|+.....-. +|.|.++ +.+---||.+.|..||++|.+|-.
T Consensus 87 ~~rIv~lyr~at~rf-~~D~~lW~~yi-~f~kk~~~~~~v~ki~~~~l~~Hp~~~dLWI~ 144 (568)
T KOG2396|consen 87 PNRIVFLYRRATNRF-NGDVKLWLSYI-AFCKKKKTYGEVKKIFAAMLAKHPNNPDLWIY 144 (568)
T ss_pred HHHHHHHHHHHHHhc-CCCHHHHHHHH-HHHHHhcchhHHHHHHHHHHHhCCCCchhHHh
Confidence 334455688887773 55788877754 4555555 999999999999999999999853
No 316
>PF10952 DUF2753: Protein of unknown function (DUF2753); InterPro: IPR020206 This entry represents a group of uncharacterised proteins.
Probab=58.36 E-value=75 Score=23.70 Aligned_cols=59 Identities=12% Similarity=0.195 Sum_probs=34.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHH--
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTI-- 98 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kV-- 98 (122)
.+-.+|+.+-..+||+ +- |+...+. |.=|..++. .|.+.+-+|+++
T Consensus 59 FWR~~gd~~yELkYLqlAS---------------------E~VltLi-PQCp~~~C~---------afi~sLGCCk~ALl 107 (140)
T PF10952_consen 59 FWRSQGDSDYELKYLQLAS---------------------EKVLTLI-PQCPNTECE---------AFIDSLGCCKKALL 107 (140)
T ss_pred HHHHcCChHHHHHHHHHHH---------------------HHHHHhc-cCCCCcchH---------HHHHhhhccHHHHH
Confidence 6666777777777766 41 2333342 433443333 344556666654
Q ss_pred --HHhCCCCHHHHHHH
Q psy16144 99 --MKAHPSYPKVHKQI 112 (122)
Q Consensus 99 --L~~~P~ypkir~~i 112 (122)
++.|| +|+|-+.|
T Consensus 108 ~F~KRHP-NP~iA~~v 122 (140)
T PF10952_consen 108 DFMKRHP-NPEIARLV 122 (140)
T ss_pred HHHHhCC-CHHHHHHH
Confidence 58899 89987665
No 317
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=58.03 E-value=18 Score=24.07 Aligned_cols=32 Identities=16% Similarity=0.143 Sum_probs=16.0
Q ss_pred cccHHHHHHHHHHHHHhC--CCCHHHHHHHHHHH
Q psy16144 85 MKRYAEAIDVSQTIMKAH--PSYPKVHKQIFEKA 116 (122)
Q Consensus 85 ~kr~~~AI~vc~kVL~~~--P~ypkir~~il~ka 116 (122)
.|+|++|+.+|..+++.. -.+|+++..|..|.
T Consensus 19 ~gny~eA~~lY~~ale~~~~ekn~~~k~~i~~K~ 52 (75)
T cd02680 19 KGNAEEAIELYTEAVELCINTSNETMDQALQTKL 52 (75)
T ss_pred hhhHHHHHHHHHHHHHHHHHhcChhhHHHHHHHH
Confidence 344444444444443321 24666666666665
No 318
>COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=57.98 E-value=1.1e+02 Score=26.28 Aligned_cols=79 Identities=18% Similarity=0.250 Sum_probs=47.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh------------hC---HHHHHHHHHHHHHhhCCCC-----hHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN------------LF---PDDAVVQYTQACRIIGTSD-----LALHQKLAS 80 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e------------~l---y~dAa~~ye~Aw~l~~~~~-----~~vgykLA~ 80 (122)
||++.|+++.-.+.+. ...+.-+.+++.- +. +++-+...+...+.....+ .+.-+||++
T Consensus 54 lyv~~g~~~~l~~~i~~sre~m~~ftk~k~~KiirtLiekf~~~~dsl~dqi~v~~~~iewA~rEkr~fLr~~Le~Kli~ 133 (421)
T COG5159 54 LYVSKGDYCSLGDTITSSREAMEDFTKPKITKIIRTLIEKFPYSSDSLEDQIKVLTALIEWADREKRKFLRLELECKLIY 133 (421)
T ss_pred HHHhcCCcchHHHHHHhhHHHHHHhcchhHHHHHHHHHHhcCCCCccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 9999999987766554 3322222221111 11 4444444444443322222 234679999
Q ss_pred HHHhcccHHHHHHHHHHHHH
Q psy16144 81 LLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 81 ~ylk~kr~~~AI~vc~kVL~ 100 (122)
.|.+.|+|.||+..-.-+|.
T Consensus 134 l~y~~~~YsdalalIn~ll~ 153 (421)
T COG5159 134 LLYKTGKYSDALALINPLLH 153 (421)
T ss_pred HHHhcccHHHHHHHHHHHHH
Confidence 99999999999998776654
No 319
>PF05030 SSXT: SSXT protein (N-terminal region); InterPro: IPR007726 SSXT (also known as SYT or SS18) appears to function synergistically with RBM14 as a transcriptional coactivator []. The SSXT protein is involved in synovial sarcoma in humans. A SYT-SSX fusion gene resulting from the chromosomal translocation t(X;18) (p11;q11) is characteristic of synovial sarcomas. This translocation fuses the SSXT (SYT) gene from chromosome 18 to either of two homologous genes at Xp11, SSX1 or SSX2 []. This entry also includes SS18-like protein 1, a transcriptional activator which is required for calcium-dependent dendritic growth and branching in cortical neurons [],[].
Probab=57.83 E-value=14 Score=24.12 Aligned_cols=28 Identities=25% Similarity=0.069 Sum_probs=20.5
Q ss_pred HHHHHHHHHHHHHhcCCCChHHHHHHHHHHhhC
Q psy16144 10 QIIQTIAEWHFLLYLLPWRWEPLSAGLRKKQYI 42 (122)
Q Consensus 10 ~~~~~~~~~~~LLyi~~gk~d~A~elL~~l~~n 42 (122)
++|++|.| |-+.|+..++..+.+.|+.|
T Consensus 20 ~LI~~I~e-----~qn~Gr~~Ec~qyq~~LhrN 47 (65)
T PF05030_consen 20 QLIQCIQE-----YQNKGRAQECVQYQQILHRN 47 (65)
T ss_pred HHHHHHHH-----HHHcCCHHHHHHHHHHHHHH
Confidence 56677766 44689999999877766655
No 320
>PF14689 SPOB_a: Sensor_kinase_SpoOB-type, alpha-helical domain; PDB: 1F51_C 2FTK_B 1IXM_B.
Probab=57.47 E-value=19 Score=22.60 Aligned_cols=19 Identities=11% Similarity=0.029 Sum_probs=15.7
Q ss_pred HhcCCCChHHHHHHHH-HHh
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQ 40 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~ 40 (122)
-|++.|++|+|.++++ ..+
T Consensus 32 gllqlg~~~~a~eYi~~~~~ 51 (62)
T PF14689_consen 32 GLLQLGKYEEAKEYIKELSK 51 (62)
T ss_dssp HHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHCCCHHHHHHHHHHHHH
Confidence 6789999999999999 654
No 321
>PF02184 HAT: HAT (Half-A-TPR) repeat; InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=56.80 E-value=17 Score=20.54 Aligned_cols=23 Identities=22% Similarity=0.319 Sum_probs=18.5
Q ss_pred ccHHHHHHHHHHHHHhCCCCHHHH
Q psy16144 86 KRYAEAIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 86 kr~~~AI~vc~kVL~~~P~ypkir 109 (122)
+.++.|-.||++.+..|| .|+.+
T Consensus 1 kE~dRAR~IyeR~v~~hp-~~k~W 23 (32)
T PF02184_consen 1 KEFDRARSIYERFVLVHP-EVKNW 23 (32)
T ss_pred ChHHHHHHHHHHHHHhCC-CchHH
Confidence 356788999999999998 46665
No 322
>PF13281 DUF4071: Domain of unknown function (DUF4071)
Probab=56.62 E-value=43 Score=28.64 Aligned_cols=55 Identities=11% Similarity=0.025 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHh---hCCCChHHHHHHHHHHHh---cccHHHHHHHHHH-HHHhCCCCHH
Q psy16144 53 PDDAVVQYTQACRI---IGTSDLALHQKLASLLLR---MKRYAEAIDVSQT-IMKAHPSYPK 107 (122)
Q Consensus 53 y~dAa~~ye~Aw~l---~~~~~~~vgykLA~~ylk---~kr~~~AI~vc~k-VL~~~P~ypk 107 (122)
|+.=+...+..=.+ .-++.+.|-+-+||++.+ .|+.++|++++.. +....+..|.
T Consensus 157 ydamI~Lve~l~~~p~~~~~~~~~i~~~yafALnRrn~~gdre~Al~il~~~l~~~~~~~~d 218 (374)
T PF13281_consen 157 YDAMIKLVETLEALPTCDVANQHNIKFQYAFALNRRNKPGDREKALQILLPVLESDENPDPD 218 (374)
T ss_pred HHHHHHHHHHhhccCccchhcchHHHHHHHHHHhhcccCCCHHHHHHHHHHHHhccCCCChH
Confidence 55444444444333 135678899999999999 9999999999999 5555556665
No 323
>PF09205 DUF1955: Domain of unknown function (DUF1955); InterPro: IPR015288 Members of this family are found in hypothetical proteins synthesised by the Archaeal organism Sulfolobus. Their exact function has not, as yet, been determined. ; PDB: 1WY6_A.
Probab=55.72 E-value=23 Score=26.85 Aligned_cols=40 Identities=13% Similarity=0.139 Sum_probs=27.5
Q ss_pred HHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 60 YTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 60 ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
+....+ .++.+|++.++||.+|-|.|.-.+|-++.+++=+
T Consensus 109 ~~~l~k-n~~~~p~~L~kia~Ay~klg~~r~~~ell~~ACe 148 (161)
T PF09205_consen 109 YNELKK-NEEINPEFLVKIANAYKKLGNTREANELLKEACE 148 (161)
T ss_dssp HHHH------S-HHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred HHHHhh-ccCCCHHHHHHHHHHHHHhcchhhHHHHHHHHHH
Confidence 333343 3678999999999999999999999998777643
No 324
>PF10516 SHNi-TPR: SHNi-TPR; InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat [].
Probab=53.41 E-value=37 Score=19.55 Aligned_cols=29 Identities=14% Similarity=0.260 Sum_probs=24.8
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHh
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~ 101 (122)
.+..+||-.-+-..+|.+|++=+++.|++
T Consensus 2 dv~~~Lgeisle~e~f~qA~~D~~~aL~i 30 (38)
T PF10516_consen 2 DVYDLLGEISLENENFEQAIEDYEKALEI 30 (38)
T ss_pred cHHHHHHHHHHHhccHHHHHHHHHHHHHH
Confidence 46678999999999999999999998853
No 325
>KOG2460|consensus
Probab=53.08 E-value=80 Score=28.66 Aligned_cols=79 Identities=15% Similarity=0.048 Sum_probs=54.0
Q ss_pred HhcCCCChHHHHHHHH--HHhhCcchHHH-----HhhCHHHHHHHHHHHHHhhCCCChH----------------HHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR--KKQYISSATNI-----YNLFPDDAVVQYTQACRIIGTSDLA----------------LHQKL 78 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~--~l~~nps~~kA-----~e~ly~dAa~~ye~Aw~l~~~~~~~----------------vgykL 78 (122)
+.....+.+.|.+-+. .++++||.++= .+.+|+-=..++..+-++-+-.+-. -.|-+
T Consensus 349 l~~tisR~~~~~~n~~~~~ls~~~~~~~k~rpqdl~RLYd~iiknl~e~~elPG~~~D~~l~sqle~~~~~fkafRC~~i 428 (593)
T KOG2460|consen 349 LLTTISRNEDAFTNLWNQWLSQQTSDPKKLRPQDLERLYDSIIKNLSEIMELPGLESDKELQSQLELKKLYFKAFRCFYI 428 (593)
T ss_pred HHHHHhHHHHHHhhhHHHHHHhCcccccccCHHHHHHHHHHHHHHHHHHHhCCCcccchHHHHHHHHHHHHHHHHHHHHH
Confidence 5566678867766666 99999876654 2222766667776666663221111 14567
Q ss_pred HHHHHhcccHHHHHHHHHHHHH
Q psy16144 79 ASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc~kVL~ 100 (122)
|..|.-.++|.+|.++|.++..
T Consensus 429 A~sY~a~~K~~EAlALy~Ra~s 450 (593)
T KOG2460|consen 429 AVSYQAKKKYSEALALYVRAYS 450 (593)
T ss_pred HHHHHHHHHHHHHHHHHHHHHH
Confidence 9999999999999999998763
No 326
>PF07163 Pex26: Pex26 protein; InterPro: IPR010797 This family consists of Pex26 and related mammalian proteins. Pex26 is a type II peroxisomal membrane protein that recruits Pex6-Pex1 complexes to peroxisomes []. Mutations in Pex26 can lead to human disorders [].; GO: 0032403 protein complex binding, 0045046 protein import into peroxisome membrane, 0005779 integral to peroxisomal membrane
Probab=52.82 E-value=15 Score=30.71 Aligned_cols=26 Identities=19% Similarity=0.111 Sum_probs=23.7
Q ss_pred HHHHHHHHHH-HhcCCCChHHHHHHHH
Q psy16144 12 IQTIAEWHFL-LYLLPWRWEPLSAGLR 37 (122)
Q Consensus 12 ~~~~~~~~~L-Lyi~~gk~d~A~elL~ 37 (122)
.-|++|+|++ +++=.|.+++|++++.
T Consensus 156 y~~vaELyLl~VLlPLG~~~eAeelv~ 182 (309)
T PF07163_consen 156 YGTVAELYLLHVLLPLGHFSEAEELVV 182 (309)
T ss_pred hHHHHHHHHHHHHhccccHHHHHHHHh
Confidence 4589999999 9999999999999886
No 327
>smart00101 14_3_3 14-3-3 homologues. 14-3-3 homologues mediates signal transduction by binding to phosphoserine-containing proteins. They are involved in growth factor signalling and also interact with MEK kinases.
Probab=52.29 E-value=75 Score=25.45 Aligned_cols=79 Identities=16% Similarity=0.074 Sum_probs=49.5
Q ss_pred HhcCCCChHHHHHHHH-HHhh-CcchHHHHh--hC---HHHHHHHHHHHHHhhCC---C----ChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQY-ISSATNIYN--LF---PDDAVVQYTQACRIIGT---S----DLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~-nps~~kA~e--~l---y~dAa~~ye~Aw~l~~~---~----~~~vgykLA~~ylk~kr 87 (122)
|.-|.|+||+..+..+ +++. +|.....=| ++ |+..+...+.+|+.-.. + ...-..++.-.|. .+=
T Consensus 10 laeq~eRyddm~~~mk~~~~~~~~~eLt~EERnLLSvayKn~i~~~R~s~R~i~sie~ke~~~~~~~~~~~~~~yr-~ki 88 (244)
T smart00101 10 LAEQAERYEEMVEFMEKVAKTVDSEELTVEERNLLSVAYKNVIGARRASWRIISSIEQKEESRGNEDHVASIKEYR-GKI 88 (244)
T ss_pred HHHHhcCHHHHHHHHHHHHhhcCCccCCHHHHHHHHHHHhhhhcccHHHHHHHhHHHHhhhccCchHHHHHHHHHH-HHH
Confidence 7889999999999999 8886 543222222 33 99999999999987311 0 0111223333333 222
Q ss_pred HHHHHHHHHHHHHh
Q psy16144 88 YAEAIDVSQTIMKA 101 (122)
Q Consensus 88 ~~~AI~vc~kVL~~ 101 (122)
-.+-..+|..|+.+
T Consensus 89 e~EL~~iC~eil~l 102 (244)
T smart00101 89 ETELSKICDGILKL 102 (244)
T ss_pred HHHHHHHHHHHHHH
Confidence 44567788888754
No 328
>PF12583 TPPII_N: Tripeptidyl peptidase II N terminal; InterPro: IPR022232 This entry represents a region of approximately 190 amino acids in length and is found in association with PF00082 from PFAM. The members are serine peptidases belonging to MEROPS peptidase family S8A, tripeptidyl peptidase II (TPPII), clan SB. They are a crucial component of the proteolytic cascade acting downstream of the 26S proteasome in the ubiquitin-proteasome pathway. It is an amino peptidase belonging to the subtilase family removing tripeptides from the free N terminus of oligopeptides. ; PDB: 3LXU_X.
Probab=51.50 E-value=35 Score=25.50 Aligned_cols=44 Identities=16% Similarity=0.218 Sum_probs=32.0
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
|.|+.+-|+-.|--. - ..+.|..||..+++.||++--++...|+
T Consensus 75 Y~EaLRDfq~~~iaK-l-----------------e~e~Ae~vY~el~~~~P~HLpaHla~i~ 118 (139)
T PF12583_consen 75 YSEALRDFQCSWIAK-L-----------------EPENAEQVYEELLEAHPDHLPAHLAMIQ 118 (139)
T ss_dssp HHHHHHHHHHHHHTT-S------------------HHHHHHHHHHHHHH-TT-THHHHHHHH
T ss_pred HHHHHHHHHHHHHHh-h-----------------CHHHHHHHHHHHHHHCcchHHHHHHHHH
Confidence 888888888888652 2 2357789999999999999888776654
No 329
>PF00244 14-3-3: 14-3-3 protein; InterPro: IPR023410 The 14-3-3 proteins are a large family of approximately 30kDa acidic proteins which exist primarily as homo- and heterodimeric within all eukaryotic cells [, ]. There is a high degree of sequence identity and conservation between all the 14-3-3 isotypes, particularly in the regions which form the dimer interface or line the central ligand binding channel of the dimeric molecule. Each 14-3-3 protein sequence can be roughly divided into three sections: a divergent amino terminus, the conserved core region and a divergent carboxyl terminus. The conserved middle core region of the 14-3-3s encodes an amphipathic groove that forms the main functional domain, a cradle for interacting with client proteins. The monomer consists of nine helices organised in an antiparallel manner, forming an L-shaped structure. The interior of the L-structure is composed of four helices: H3 and H5, which contain many charged and polar amino acids, and H7 and H9, which contain hydrophobic amino acids. These four helices form the concave amphipathic groove that interacts with target peptides. 14-3-3 proteins mainly bind proteins containing phosphothreonine or phosphoserine motifs however exceptions to this rule do exist. Extensive investigation of the 14-3-3 binding site of the mammalian serine/threonine kinase Raf-1 has produced a consensus sequence for 14-3-3-binding, RSxpSxP (in the single-letter amino-acid code, where x denotes any amino acid and p indicates that the next residue is phosphorylated). 14-3-3 proteins appear to effect intracellular signalling in one of three ways - by direct regulation of the catalytic activity of the bound protein, by regulating interactions between the bound protein and other molecules in the cell by sequestration or modification or by controlling the subcellular localisation of the bound ligand. Proteins appear to initially bind to a single dominant site and then subsequently to many, much weaker secondary interaction sites. The 14-3-3 dimer is capable of changing the conformation of its bound ligand whilst itself undergoing minimal structural alteration. This entry represents the structural domain found in 14-3-3 proteins.; PDB: 2O8P_A 3AXY_D 2C74_A 2C63_A 4DX0_A 1YWT_A 3P1O_A 3P1N_A 4DAU_A 3U9X_A ....
Probab=50.51 E-value=45 Score=26.19 Aligned_cols=43 Identities=16% Similarity=0.353 Sum_probs=32.9
Q ss_pred HHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHHh
Q psy16144 76 QKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAMV 118 (122)
Q Consensus 76 ykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~~ 118 (122)
.-+|......+||.|.++.-+++...+|+...=....|..|+.
T Consensus 5 i~~Aklaeq~eRy~dmv~~mk~~~~~~~eLt~eERnLlsvayK 47 (236)
T PF00244_consen 5 IYLAKLAEQAERYDDMVEYMKQLIEMNPELTEEERNLLSVAYK 47 (236)
T ss_dssp HHHHHHHHHTTHHHHHHHHHHHHHHTSS---HHHHHHHHHHHH
T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHccCCCCCHHHHHHHHHHHH
Confidence 4578899999999999999999999999877666666666654
No 330
>KOG4814|consensus
Probab=50.42 E-value=2.3e+02 Score=26.81 Aligned_cols=84 Identities=13% Similarity=0.171 Sum_probs=61.0
Q ss_pred hHHHHHHHH-HHhhCcchHHHHhhC-HHHHHHHHHHHHHhhCCCCh------HHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 29 WEPLSAGLR-KKQYISSATNIYNLF-PDDAVVQYTQACRIIGTSDL------ALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 29 ~d~A~elL~-~l~~nps~~kA~e~l-y~dAa~~ye~Aw~l~~~~~~------~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
..++...+. .|= .+-.+-++.- |.+++..|+.....- |+|. -.-=.|+-+|++..+.+.|.++++++-+
T Consensus 346 TkE~~~~iH~iLW--n~A~~~F~~~~Y~~s~~~y~~Sl~~i-~~D~~~~~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~ 422 (872)
T KOG4814|consen 346 TKEAISCIHTLLW--NTAKKLFKMEKYVVSIRFYKLSLKDI-ISDNYSDRFAKIQRALQVCYLKLEQLDNAVEVYQEAEE 422 (872)
T ss_pred HHHHHHHHHHHHH--HhhHHHHHHHHHHHHHHHHHHHHHhc-cchhhhhHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHh
Confidence 566666566 321 2223333223 999999999999885 4432 2334688999999999999999999999
Q ss_pred hCCCCHHHHHHHHHH
Q psy16144 101 AHPSYPKVHKQIFEK 115 (122)
Q Consensus 101 ~~P~ypkir~~il~k 115 (122)
.+|..|--+..++.-
T Consensus 423 ~d~~~~l~q~~~~~~ 437 (872)
T KOG4814|consen 423 VDRQSPLCQLLMLQS 437 (872)
T ss_pred hccccHHHHHHHHHH
Confidence 999998877777643
No 331
>KOG0546|consensus
Probab=49.40 E-value=15 Score=31.55 Aligned_cols=43 Identities=16% Similarity=0.240 Sum_probs=22.5
Q ss_pred CCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHH
Q psy16144 69 TSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQ 111 (122)
Q Consensus 69 ~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~ 111 (122)
++..-+.|+.+..|++.+++.+|+.-........|+++.|.++
T Consensus 306 ~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~~i~~~ 348 (372)
T KOG0546|consen 306 RSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDKAIEEE 348 (372)
T ss_pred hhhCcHHHHHHhHHHhhhchhhhHHHHHHhhccCcchHHHHHH
Confidence 4444455555555555555555555555555555555555443
No 332
>PF15015 NYD-SP12_N: Spermatogenesis-associated, N-terminal
Probab=47.58 E-value=50 Score=29.47 Aligned_cols=54 Identities=13% Similarity=-0.087 Sum_probs=40.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHH---HHhhCCCChHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQA---CRIIGTSDLALH 75 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~A---w~l~~~~~~~vg 75 (122)
-|++.++.|.|.+... ++.+||+...-+... |.+|++..--| .-+++.+...+.
T Consensus 237 CYL~~rkpdlALnh~hrsI~lnP~~frnHLrqAavfR~LeRy~eAarSamia~ymywl~g~~~q~~S 303 (569)
T PF15015_consen 237 CYLRMRKPDLALNHSHRSINLNPSYFRNHLRQAAVFRRLERYSEAARSAMIADYMYWLSGGSEQRIS 303 (569)
T ss_pred hhhhcCCCchHHHHHhhhhhcCcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCchHhHH
Confidence 8999999999999999 999999987777633 88888776544 344444444443
No 333
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=47.55 E-value=87 Score=24.42 Aligned_cols=51 Identities=14% Similarity=0.121 Sum_probs=37.6
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc-----------------HHHHHHHHHHHHHhCCC
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR-----------------YAEAIDVSQTIMKAHPS 104 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr-----------------~~~AI~vc~kVL~~~P~ 104 (122)
++++...|..|.++ .|+...+.+.+|..+.+.=. ...||..|-+.+...|.
T Consensus 274 ~~~~~~~~~~a~~~-~~~~~k~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~ai~~y~~al~~~~~ 341 (352)
T PF02259_consen 274 SDEILKYYKEATKL-DPSWEKAWHSWALFNDKLLESDPREKEESSQEDRSEYLEQAIEGYLKALSLGSK 341 (352)
T ss_pred HHHHHHHHHHHHHh-ChhHHHHHHHHHHHHHHHHHhhhhcccccchhHHHHHHHHHHHHHHHHHhhCCC
Confidence 46678888888888 58788888888776655422 13488888888888887
No 334
>KOG1310|consensus
Probab=47.53 E-value=63 Score=29.73 Aligned_cols=74 Identities=22% Similarity=0.195 Sum_probs=52.5
Q ss_pred ChHHHHHHHH-HHhhCcchHHHHh-----hC-------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHH
Q psy16144 28 RWEPLSAGLR-KKQYISSATNIYN-----LF-------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDV 94 (122)
Q Consensus 28 k~d~A~elL~-~l~~nps~~kA~e-----~l-------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~v 94 (122)
....|+..+. ++++-|....-++ +| .-.|..--..|+++ +|....+.|.|+.++...+|+.+|++
T Consensus 389 ~~~~~i~~~s~a~q~~~~~~~~l~nraa~lmkRkW~~d~~~AlrDch~Alrl-n~s~~kah~~la~aL~el~r~~eal~- 466 (758)
T KOG1310|consen 389 IVSGAISHYSRAIQYVPDAIYLLENRAAALMKRKWRGDSYLALRDCHVALRL-NPSIQKAHFRLARALNELTRYLEALS- 466 (758)
T ss_pred HHHHHHHHHHHHhhhccchhHHHHhHHHHHHhhhccccHHHHHHhHHhhccC-ChHHHHHHHHHHHHHHHHhhHHHhhh-
Confidence 4455666666 7776666333333 22 33455556678888 89899999999999999999999999
Q ss_pred HHHHHHhCC
Q psy16144 95 SQTIMKAHP 103 (122)
Q Consensus 95 c~kVL~~~P 103 (122)
|+.+|..-|
T Consensus 467 ~~~alq~~~ 475 (758)
T KOG1310|consen 467 CHWALQMSF 475 (758)
T ss_pred hHHHHhhcC
Confidence 777776543
No 335
>PF02064 MAS20: MAS20 protein import receptor; InterPro: IPR002056 Virtually all mitochondrial precursors are imported via the same mechanism []: precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 []. MAS20 is also known as TOM20.; GO: 0006605 protein targeting, 0006886 intracellular protein transport, 0005742 mitochondrial outer membrane translocase complex; PDB: 3AX3_A 3AWR_B 2V1S_A 3AX5_C 3AX2_C 1OM2_A 2V1T_B.
Probab=47.21 E-value=76 Score=22.89 Aligned_cols=39 Identities=31% Similarity=0.392 Sum_probs=29.2
Q ss_pred HHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy16144 76 QKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEK 115 (122)
Q Consensus 76 ykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~k 115 (122)
..||-.++..|++++|+..+-+++...| +|.---.|+++
T Consensus 67 V~lGE~L~~~G~~~~aa~hf~nAl~V~~-qP~~LL~i~q~ 105 (121)
T PF02064_consen 67 VQLGEQLLAQGDYEEAAEHFYNALKVCP-QPAELLQIYQK 105 (121)
T ss_dssp HHHHHHHHHTT-HHHHHHHHHHHHHTSS-SHHHHHHHHHH
T ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHhCC-CHHHHHHHHHh
Confidence 3567777778888888888888888888 77766667665
No 336
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=46.24 E-value=81 Score=25.89 Aligned_cols=65 Identities=12% Similarity=0.097 Sum_probs=43.8
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCH---HHHHHHHHHHHh
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYP---KVHKQIFEKAMV 118 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~yp---kir~~il~ka~~ 118 (122)
++-|..+-++...+ +|.+|.---.=|..|...|-+.-|+.-....+++-|+-| .||..+.+-...
T Consensus 197 ~~~al~~~~r~l~l-~P~dp~eirDrGliY~ql~c~~vAl~dl~~~~~~~P~~~~a~~ir~~l~~l~~~ 264 (269)
T COG2912 197 WELALRVAERLLDL-NPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHCPDDPIAEMIRAQLLELRQE 264 (269)
T ss_pred hHHHHHHHHHHHhh-CCCChhhccCcHHHHHhcCCchhhHHHHHHHHHhCCCchHHHHHHHHHHHHHHH
Confidence 56666666666666 566665444457888888888888888888777777664 456666665533
No 337
>smart00777 Mad3_BUB1_I Mad3/BUB1 hoMad3/BUB1 homology region 1. Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of the binding of BUB1 and MAD3 to CDC20p.
Probab=46.23 E-value=1.1e+02 Score=22.04 Aligned_cols=76 Identities=22% Similarity=0.113 Sum_probs=45.4
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh---hC---------HHHHHHHHHHHHHhhCCCChHHHH-HHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN---LF---------PDDAVVQYTQACRIIGTSDLALHQ-KLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e---~l---------y~dAa~~ye~Aw~l~~~~~~~vgy-kLA~~ylk~kr 87 (122)
-|-+.|+-..=..+|. |++.-.++....+ |+ ..++...|...-..+=....+..| .-|..+...|+
T Consensus 35 ~~p~g~~~s~L~~lLerc~~~f~~~~~YknD~RyLkiWi~ya~~~~dp~~if~~L~~~~IG~~~AlfYe~~A~~lE~~g~ 114 (125)
T smart00777 35 NYPQGGKESGLLTLLERCIRYFEDDERYKNDPRYLKIWLKYADNCDEPRELFQFLYSKGIGTKLALFYEEWAQLLEAAGR 114 (125)
T ss_pred hCCCCCchhhHHHHHHHHHHHhhhhhhhcCCHHHHHHHHHHHHhcCCHHHHHHHHHHCCcchhhHHHHHHHHHHHHHcCC
Confidence 3444345444566666 7766554444333 11 456667777766654333333333 45777789999
Q ss_pred HHHHHHHHHH
Q psy16144 88 YAEAIDVSQT 97 (122)
Q Consensus 88 ~~~AI~vc~k 97 (122)
+.+|.+|++.
T Consensus 115 ~~~A~~iy~~ 124 (125)
T smart00777 115 YKKADEVYQL 124 (125)
T ss_pred HHHHHHHHHc
Confidence 9999998863
No 338
>PF10345 Cohesin_load: Cohesin loading factor; InterPro: IPR019440 Cohesin loading factor is a conserved protein that has been characterised in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes, but is dispensable in G2 when cohesion has been established. It is often referred to as Ssl3 in Schizosaccharomyces pombe (Fission yeast), and Scc4 in Saccharomyces cerevisiae (Baker's yeast). It complexes with Mis4 [].
Probab=45.97 E-value=1.8e+02 Score=25.67 Aligned_cols=47 Identities=13% Similarity=-0.076 Sum_probs=38.4
Q ss_pred HHHHHHHHHHHHHhhCC--------CChHHHHHHHHHHHhcccHHHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGT--------SDLALHQKLASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~--------~~~~vgykLA~~ylk~kr~~~AI~vc~kVL 99 (122)
+..|....+.+-....+ -.|-+.|-+|-.++..|+...|...|.++.
T Consensus 377 ~~~a~~~l~~~~~~~~~~~~~~~~~~~~~~~yL~gl~~q~~g~l~~A~~~y~~~~ 431 (608)
T PF10345_consen 377 WSKATQELEFMRQLCQRSPSKLYESLYPLLHYLLGLYYQSTGDLEAALYQYQKPR 431 (608)
T ss_pred HHHHHHHHHHHHHHHhcCccchhhhhhHHHHHHHHHHHHHcCCHHHHHHHHhhhH
Confidence 88888888887776432 247889999999999999999999998444
No 339
>PF09670 Cas_Cas02710: CRISPR-associated protein (Cas_Cas02710)
Probab=45.83 E-value=1.8e+02 Score=24.41 Aligned_cols=48 Identities=13% Similarity=0.228 Sum_probs=31.0
Q ss_pred HHHHH-HhcCCCChHHHHHHHH-HHhhCcchHHHHh-------------hC-HHHHHHHHHHHHH
Q psy16144 17 EWHFL-LYLLPWRWEPLSAGLR-KKQYISSATNIYN-------------LF-PDDAVVQYTQACR 65 (122)
Q Consensus 17 ~~~~L-Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-------------~l-y~dAa~~ye~Aw~ 65 (122)
+|+.- -..+.+.|..|.++|. +...-|+... ++ .+ |++|...++....
T Consensus 134 ~~~~a~~l~n~~~y~aA~~~l~~l~~rl~~~~~-~~~~~~l~~~y~~WD~fd~~~A~~~l~~~~~ 197 (379)
T PF09670_consen 134 EWRRAKELFNRYDYGAAARILEELLRRLPGREE-YQRYKDLCEGYDAWDRFDHKEALEYLEKLLK 197 (379)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHhCCchhh-HHHHHHHHHHHHHHHccCHHHHHHHHHHHHH
Confidence 34444 4448899999999999 7764233232 22 22 8888888886554
No 340
>KOG2034|consensus
Probab=45.54 E-value=68 Score=30.63 Aligned_cols=43 Identities=19% Similarity=0.131 Sum_probs=30.1
Q ss_pred HHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHh---------hC----HHHHHHHHHHH
Q psy16144 14 TIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYN---------LF----PDDAVVQYTQA 63 (122)
Q Consensus 14 ~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e---------~l----y~dAa~~ye~A 63 (122)
+..|.+.. +|+..|+||.|.+..+ - +++.+ ++ |..||..|-+-
T Consensus 357 vndE~R~vWk~yLd~g~y~kAL~~ar~~-------p~~le~Vl~~qAdf~f~~k~y~~AA~~yA~t 415 (911)
T KOG2034|consen 357 VNDEARDVWKTYLDKGEFDKALEIARTR-------PDALETVLLKQADFLFQDKEYLRAAEIYAET 415 (911)
T ss_pred eccchHHHHHHHHhcchHHHHHHhccCC-------HHHHHHHHHHHHHHHHhhhHHHHHHHHHHHh
Confidence 34566666 9999999999999777 4 33444 22 77777777665
No 341
>cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains.
Probab=45.37 E-value=52 Score=25.96 Aligned_cols=26 Identities=4% Similarity=-0.001 Sum_probs=22.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNI 48 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA 48 (122)
+++..|+|++|.+.|+ ... +|++.+-
T Consensus 120 VCm~~g~Fk~A~eiLkr~~~-d~~~~~~ 146 (200)
T cd00280 120 VCMENGEFKKAEEVLKRLFS-DPESQKL 146 (200)
T ss_pred HHHhcCchHHHHHHHHHHhc-CCCchhH
Confidence 8999999999999999 777 8887664
No 342
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=44.78 E-value=1.9e+02 Score=25.83 Aligned_cols=81 Identities=9% Similarity=-0.024 Sum_probs=58.7
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC-------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF-------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAID 93 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l-------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~ 93 (122)
.+...|+.-++..+|. +-+..|-.-.+..|. ..|=.+--++.-.+ +|+|....+..+..-+-.|.|..|-+
T Consensus 272 alf~d~~~rKg~~ilE~aWK~ePHP~ia~lY~~ar~gdta~dRlkRa~~L~sl-k~nnaes~~~va~aAlda~e~~~ARa 350 (531)
T COG3898 272 ALFRDGNLRKGSKILETAWKAEPHPDIALLYVRARSGDTALDRLKRAKKLESL-KPNNAESSLAVAEAALDAGEFSAARA 350 (531)
T ss_pred HHHhccchhhhhhHHHHHHhcCCChHHHHHHHHhcCCCcHHHHHHHHHHHHhc-CccchHHHHHHHHHHHhccchHHHHH
Confidence 6677788888888888 888888644444444 33334444555556 68888888888888888888888888
Q ss_pred HHHHHHHhCC
Q psy16144 94 VSQTIMKAHP 103 (122)
Q Consensus 94 vc~kVL~~~P 103 (122)
--+.+.+..|
T Consensus 351 ~Aeaa~r~~p 360 (531)
T COG3898 351 KAEAAAREAP 360 (531)
T ss_pred HHHHHhhhCc
Confidence 8888887777
No 343
>PF07079 DUF1347: Protein of unknown function (DUF1347); InterPro: IPR010764 This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown.
Probab=44.45 E-value=2.4e+02 Score=25.44 Aligned_cols=87 Identities=14% Similarity=0.241 Sum_probs=53.0
Q ss_pred ccchhHHHHHHHHHHHHHHHhcCCCChHHHHHHHH-HHhhCcchHHHHhhC----HHHHHHHHHHHHHhhCCCChHHHHH
Q psy16144 3 SLSVDTMQIIQTIAEWHFLLYLLPWRWEPLSAGLR-KKQYISSATNIYNLF----PDDAVVQYTQACRIIGTSDLALHQK 77 (122)
Q Consensus 3 ~~~~~~~~~~~~~~~~~~LLyi~~gk~d~A~elL~-~l~~nps~~kA~e~l----y~dAa~~ye~Aw~l~~~~~~~vgyk 77 (122)
+.+|..-+|-+-.|+.-+ ||.+ |.|.++.-+-- ..+..|| +.++.++ +.. +.|+.||++-. +-|. +-+
T Consensus 454 ~i~i~e~eian~LaDAEy-Lysq-gey~kc~~ys~WL~~iaPS-~~~~RLlGl~l~e~--k~Y~eA~~~l~-~LP~-n~~ 526 (549)
T PF07079_consen 454 PITISEEEIANFLADAEY-LYSQ-GEYHKCYLYSSWLTKIAPS-PQAYRLLGLCLMEN--KRYQEAWEYLQ-KLPP-NER 526 (549)
T ss_pred cccccHHHHHHHHHHHHH-HHhc-ccHHHHHHHHHHHHHhCCc-HHHHHHHHHHHHHH--hhHHHHHHHHH-hCCC-chh
Confidence 344455555555554443 5555 99999999999 9999995 8888755 433 35777776642 2222 222
Q ss_pred HHHHHHhcccHHHHHHHHHHHHHh
Q psy16144 78 LASLLLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 78 LA~~ylk~kr~~~AI~vc~kVL~~ 101 (122)
+..-+--+|..+|+|-+..
T Consensus 527 -----~~dskvqKAl~lCqKh~~k 545 (549)
T PF07079_consen 527 -----MRDSKVQKALALCQKHLPK 545 (549)
T ss_pred -----hHHHHHHHHHHHHHHhhhh
Confidence 2223445778888887643
No 344
>PF09797 NatB_MDM20: N-acetyltransferase B complex (NatB) non catalytic subunit; InterPro: IPR019183 This is the non-catalytic subunit of the N-terminal acetyltransferase B complex (NatB). The NatB complex catalyses the acetylation of the amino-terminal methionine residue of all proteins beginning with Met-Asp or Met-Glu and of some proteins beginning with Met-Asn or Met-Met. In Saccharomyces cerevisiae (Baker's yeast) this subunit is called MDM20 and in Schizosaccharomyces pombe (Fission yeast) it is called Arm1. NatB acetylates the Tpm1 protein and regulates and tropomyocin-actin interactions. This subunit is required by the NatB complex for the N-terminal acetylation of Tpm1 [].
Probab=44.05 E-value=75 Score=26.05 Aligned_cols=43 Identities=14% Similarity=0.073 Sum_probs=39.8
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQ 96 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~ 96 (122)
.-+|+...|.++.. .|.|+..-.-|...|...|-...|.++|+
T Consensus 199 l~~Ai~lLE~~l~~-s~~n~~~~LlLvrlY~~LG~~~~A~~~~~ 241 (365)
T PF09797_consen 199 LLQAIALLEHALKK-SPHNYQLKLLLVRLYSLLGAGSLALEHYE 241 (365)
T ss_pred HHHHHHHHHHHHHc-CCCcHHHHHHHHHHHHHcCCHHHHHHHHH
Confidence 66888999999999 59999999999999999999999999886
No 345
>PF04212 MIT: MIT (microtubule interacting and transport) domain; InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=43.69 E-value=79 Score=19.62 Aligned_cols=26 Identities=23% Similarity=0.344 Sum_probs=12.4
Q ss_pred HHHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy16144 90 EAIDVSQTIMKAHPSYPKVHKQIFEKA 116 (122)
Q Consensus 90 ~AI~vc~kVL~~~P~ypkir~~il~ka 116 (122)
+|++.+.++++..| .|..+.-+.+|+
T Consensus 30 ~ai~~l~~~~~~~~-~~~~~~~l~~k~ 55 (69)
T PF04212_consen 30 EAIEYLMQALKSES-NPERRQALRQKM 55 (69)
T ss_dssp HHHHHHHHHHHHST-THHHHHHHHHHH
T ss_pred HHHHHHHHHhccCC-CHHHHHHHHHHH
Confidence 33334445555554 455544454444
No 346
>PF08311 Mad3_BUB1_I: Mad3/BUB1 homology region 1; InterPro: IPR013212 Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of BUB1 and MAD3 to CDC20p [].; PDB: 3ESL_B 4AEZ_I 4A1G_B 2LAH_A 2WVI_A 3SI5_B.
Probab=43.24 E-value=1.2e+02 Score=21.47 Aligned_cols=78 Identities=14% Similarity=0.033 Sum_probs=53.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh---hC---------HHHHHHHHHHHHHhh-CCCChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN---LF---------PDDAVVQYTQACRII-GTSDLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e---~l---------y~dAa~~ye~Aw~l~-~~~~~~vgykLA~~ylk~kr 87 (122)
-+-+.|.-..-.++|. |++.-+++.+... |+ ..++...|...-... +...+...-.-|..+.+.|+
T Consensus 35 ~~p~~~~~~~L~~lLer~~~~f~~~~~Y~nD~RylkiWi~ya~~~~~~~~if~~l~~~~IG~~~A~fY~~wA~~le~~~~ 114 (126)
T PF08311_consen 35 NYPSGGKQSGLLELLERCIRKFKDDERYKNDERYLKIWIKYADLSSDPREIFKFLYSKGIGTKLALFYEEWAEFLEKRGN 114 (126)
T ss_dssp HCTTCCCCHHHHHHHHHHHHHHTTSGGGTT-HHHHHHHHHHHTTBSHHHHHHHHHHHHTTSTTBHHHHHHHHHHHHHTT-
T ss_pred HCCCCCchhHHHHHHHHHHHHHhhhHhhcCCHHHHHHHHHHHHHccCHHHHHHHHHHcCccHHHHHHHHHHHHHHHHcCC
Confidence 4444566666677888 8877766555443 11 567778888777643 45555566678899999999
Q ss_pred HHHHHHHHHHHH
Q psy16144 88 YAEAIDVSQTIM 99 (122)
Q Consensus 88 ~~~AI~vc~kVL 99 (122)
+.+|.+|+++.+
T Consensus 115 ~~~A~~I~~~Gi 126 (126)
T PF08311_consen 115 FKKADEIYQLGI 126 (126)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHhhC
Confidence 999999998753
No 347
>KOG2610|consensus
Probab=43.02 E-value=55 Score=28.59 Aligned_cols=75 Identities=7% Similarity=-0.128 Sum_probs=49.8
Q ss_pred hcCCCChHHHHHHHH-HHhhCcchHHHHh------hC---HHHHHHHHHHH---HHhhCCCChHHHHHHHHHHHhcccHH
Q psy16144 23 YLLPWRWEPLSAGLR-KKQYISSATNIYN------LF---PDDAVVQYTQA---CRIIGTSDLALHQKLASLLLRMKRYA 89 (122)
Q Consensus 23 yi~~gk~d~A~elL~-~l~~nps~~kA~e------~l---y~dAa~~ye~A---w~l~~~~~~~vgykLA~~ylk~kr~~ 89 (122)
....|-|++|++.-+ +++.||.++=|.- .| ++|+.+.-++- |+-+.-...--..--|..|...+.|+
T Consensus 185 L~E~g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVlem~~r~Keg~eFM~~ted~Wr~s~mlasHNyWH~Al~~iE~aeye 264 (491)
T KOG2610|consen 185 LEECGIYDDAEKQADRALQINRFDCWASHAKAHVLEMNGRHKEGKEFMYKTEDDWRQSWMLASHNYWHTALFHIEGAEYE 264 (491)
T ss_pred HHHhccchhHHHHHHhhccCCCcchHHHHHHHHHHHhcchhhhHHHHHHhcccchhhhhHHHhhhhHHHHHhhhcccchh
Confidence 457799999999999 9999997544333 22 78887765543 33221111111233466778889999
Q ss_pred HHHHHHHH
Q psy16144 90 EAIDVSQT 97 (122)
Q Consensus 90 ~AI~vc~k 97 (122)
.|++||.+
T Consensus 265 ~aleIyD~ 272 (491)
T KOG2610|consen 265 KALEIYDR 272 (491)
T ss_pred HHHHHHHH
Confidence 99999964
No 348
>PF02255 PTS_IIA: PTS system, Lactose/Cellobiose specific IIA subunit; InterPro: IPR003188 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) [, ] is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC []. These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose [, , , ]. The lactose/cellobiose-specific family are one of four structurally and functionally distinct group IIA PTS system enzymes. This family of proteins normally function as a homotrimer, stabilised by a centrally located metal ion []. Separation into subunits is thought to occur after phosphorylation.; GO: 0005351 sugar:hydrogen symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane; PDB: 3L8R_D 2E2A_B 1E2A_C 3K1S_C 2LRK_C 2LRL_A 2WY2_A 1WCR_A 2WWV_C.
Probab=42.62 E-value=57 Score=22.38 Aligned_cols=40 Identities=18% Similarity=0.035 Sum_probs=28.7
Q ss_pred HHHHHHHHHHHHHH-----HhcCCCChHHHHHHHH-HHhhCcchHHHHh
Q psy16144 8 TMQIIQTIAEWHFL-----LYLLPWRWEPLSAGLR-KKQYISSATNIYN 50 (122)
Q Consensus 8 ~~~~~~~~~~~~~L-----Lyi~~gk~d~A~elL~-~l~~nps~~kA~e 50 (122)
.||||-.-.+.+-+ =+.+.|+|++|.++++ + +.+-.+|+.
T Consensus 4 ~m~iI~~aG~Ars~~~eAl~~a~~~~fe~A~~~l~~a---~~~l~~AH~ 49 (96)
T PF02255_consen 4 AMQIISHAGDARSLAMEALKAAREGDFEEAEELLKEA---DEELLKAHK 49 (96)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHTT-HHHHHHHHHHH---HHHHHHHHH
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHH---HHHHHHHHH
Confidence 47888777666655 5668899999999999 7 335556664
No 349
>PF10373 EST1_DNA_bind: Est1 DNA/RNA binding domain; InterPro: IPR018834 Est1 is a protein which recruits or activates telomerase at the site of polymerisation [, ]. This is the DNA/RNA binding domain of EST1 []. ; PDB: 1YA0_B.
Probab=42.52 E-value=86 Score=23.80 Aligned_cols=52 Identities=21% Similarity=0.219 Sum_probs=42.0
Q ss_pred HHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh
Q psy16144 32 LSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLR 84 (122)
Q Consensus 32 A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk 84 (122)
|..+.+ |....|+.+..|.=+ .=+|+=+|-+++-- ..-++.+.-+|...+.+
T Consensus 1 A~~~Y~~A~~l~P~~G~p~nQLAvl~~~~~~~l~avy~y~Rsl~~-~~Pf~~A~~NL~~lf~~ 62 (278)
T PF10373_consen 1 AERYYRKAIRLLPSNGNPYNQLAVLASYQGDDLDAVYYYIRSLAV-RIPFPSARENLQKLFEK 62 (278)
T ss_dssp HHHHHHHHHHH-TTBSHHHHHHHHHHHHTT-HHHHHHHHHHHHSS-SB--HHHHHHHHHHHHH
T ss_pred CHHHHHHHHHhCCCCCCcccchhhhhccccchHHHHHHHHHHHhc-CCCcHHHHHHHHHHHHH
Confidence 678888 999999999999944 67899999999966 45579999999999988
No 350
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=42.23 E-value=95 Score=20.16 Aligned_cols=25 Identities=8% Similarity=0.068 Sum_probs=11.4
Q ss_pred HHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKVHKQIFEKA 116 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypkir~~il~ka 116 (122)
|++.|..+++..| +|..+.-|-.|+
T Consensus 32 ale~~~~~~k~e~-~~~~k~~lr~k~ 56 (75)
T cd02684 32 ALQYFVPALHYET-DAQRKEALRQKV 56 (75)
T ss_pred HHHHHHHHHhhCC-CHHHHHHHHHHH
Confidence 3334444554443 555554444443
No 351
>KOG2300|consensus
Probab=41.70 E-value=2.3e+02 Score=25.86 Aligned_cols=86 Identities=19% Similarity=0.195 Sum_probs=61.3
Q ss_pred HHHHHH--Hhc-CCCChHHHHHHHH-HHhhCcc-hHHHHh-------hC-HHHHHHHHHHHHHhhCCCC----------h
Q psy16144 16 AEWHFL--LYL-LPWRWEPLSAGLR-KKQYISS-ATNIYN-------LF-PDDAVVQYTQACRIIGTSD----------L 72 (122)
Q Consensus 16 ~~~~~L--Lyi-~~gk~d~A~elL~-~l~~nps-~~kA~e-------~l-y~dAa~~ye~Aw~l~~~~~----------~ 72 (122)
+.+|++ +|. ..|-||.|+.-|. |++.--+ +..|+- |+ ..+++..|+ +.+..+|.| .
T Consensus 367 ~~ih~LlGlys~sv~~~enAe~hf~~a~k~t~~~dl~a~~nlnlAi~YL~~~~~ed~y~-~ld~i~p~nt~s~ssq~l~a 445 (629)
T KOG2300|consen 367 AQIHMLLGLYSHSVNCYENAEFHFIEATKLTESIDLQAFCNLNLAISYLRIGDAEDLYK-ALDLIGPLNTNSLSSQRLEA 445 (629)
T ss_pred HHHHHHHhhHhhhcchHHHHHHHHHHHHHhhhHHHHHHHHHHhHHHHHHHhccHHHHHH-HHHhcCCCCCCcchHHHHHH
Confidence 446666 776 6678999999999 7765433 444443 33 677766665 455545653 3
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHHHHhC
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTIMKAH 102 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kVL~~~ 102 (122)
++.|--|+...+.+++-||-...|++|+..
T Consensus 446 ~~~~v~glfaf~qn~lnEaK~~l~e~Lkma 475 (629)
T KOG2300|consen 446 SILYVYGLFAFKQNDLNEAKRFLRETLKMA 475 (629)
T ss_pred HHHHHHHHHHHHhccHHHHHHHHHHHHhhc
Confidence 456777888899999999999999999765
No 352
>PF14852 Fis1_TPR_N: Fis1 N-terminal tetratricopeptide repeat; PDB: 1IYG_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A 1PC2_A 1NZN_A.
Probab=41.32 E-value=45 Score=18.90 Aligned_cols=30 Identities=17% Similarity=0.236 Sum_probs=17.9
Q ss_pred HHHHHHHHHHhcccH---HHHHHHHHHHHHhCC
Q psy16144 74 LHQKLASLLLRMKRY---AEAIDVSQTIMKAHP 103 (122)
Q Consensus 74 vgykLA~~ylk~kr~---~~AI~vc~kVL~~~P 103 (122)
.-|++|..+.+++.. .+.|.+++.+++.+|
T Consensus 3 t~FnyAw~Lv~S~~~~d~~~Gi~lLe~l~~~~p 35 (35)
T PF14852_consen 3 TQFNYAWGLVKSNNREDQQEGIALLEELYRDEP 35 (35)
T ss_dssp HHHHHHHHHHHSSSHHHHHHHHHHHHHHCCCS-
T ss_pred chhHHHHHHhcCCCHHHHHHHHHHHHHHHhccC
Confidence 445666666666544 456777777766554
No 353
>COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only]
Probab=41.17 E-value=30 Score=29.83 Aligned_cols=67 Identities=6% Similarity=0.161 Sum_probs=49.9
Q ss_pred HHHHHH-Hhc-CCCChHHHHHHHH-HHhhCcchHHHHhh------C----HHHHHHHHHHHHHhhCCCChHHHH---HHH
Q psy16144 16 AEWHFL-LYL-LPWRWEPLSAGLR-KKQYISSATNIYNL------F----PDDAVVQYTQACRIIGTSDLALHQ---KLA 79 (122)
Q Consensus 16 ~~~~~L-Lyi-~~gk~d~A~elL~-~l~~nps~~kA~e~------l----y~dAa~~ye~Aw~l~~~~~~~vgy---kLA 79 (122)
.-|-+. -|. ..|-|.+-...+. |++.+|.+..-|=+ . .+.+-..+.+++.+ ||++|.+++ ++-
T Consensus 108 k~w~~y~~Y~~k~k~y~~~~nI~~~~l~khP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~-N~~~p~iw~eyfr~E 186 (435)
T COG5191 108 KIWSQYAAYVIKKKMYGEMKNIFAECLTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRM-NSRSPRIWIEYFRME 186 (435)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceeeeeeccchhhhhccHHHHHHHHHhhhcc-CCCCchHHHHHHHHH
Confidence 345555 444 3345888888999 99999997766653 2 88999999999999 899999875 444
Q ss_pred HHHH
Q psy16144 80 SLLL 83 (122)
Q Consensus 80 ~~yl 83 (122)
-+|+
T Consensus 187 l~yi 190 (435)
T COG5191 187 LMYI 190 (435)
T ss_pred HHHH
Confidence 4554
No 354
>smart00101 14_3_3 14-3-3 homologues. 14-3-3 homologues mediates signal transduction by binding to phosphoserine-containing proteins. They are involved in growth factor signalling and also interact with MEK kinases.
Probab=40.54 E-value=1.5e+02 Score=23.76 Aligned_cols=48 Identities=15% Similarity=0.156 Sum_probs=33.5
Q ss_pred HHHHHHHHHHHHHhhC----CCChHHHHHHH-----HHHHhcccHHHHHHHHHHHHHh
Q psy16144 53 PDDAVVQYTQACRIIG----TSDLALHQKLA-----SLLLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~----~~~~~vgykLA-----~~ylk~kr~~~AI~vc~kVL~~ 101 (122)
-+.|...|+.|+++.+ |.+|- ...|| |.|.-.+...+|+.+-+++++.
T Consensus 144 ~~~a~~aY~~A~e~a~~~L~pt~Pi-rLgLaLN~SVF~yEI~~~~~~A~~lAk~afd~ 200 (244)
T smart00101 144 AENTLVAYKSAQDIALAELPPTHPI-RLGLALNFSVFYYEILNSPDRACNLAKQAFDE 200 (244)
T ss_pred HHHHHHHHHHHHHHHHccCCCCCHH-HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence 4588999999998632 55553 22334 4566789999999888877643
No 355
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=40.50 E-value=1.1e+02 Score=20.23 Aligned_cols=64 Identities=11% Similarity=-0.041 Sum_probs=31.6
Q ss_pred HHHHHHHH-HHhhCcchHHHHhhC--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHH
Q psy16144 30 EPLSAGLR-KKQYISSATNIYNLF--PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 30 d~A~elL~-~l~~nps~~kA~e~l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kV 98 (122)
..|.++.+ +++.+.. .+.-+-+ |.+|+..+-.+.... -.||.. +...-.|...|-+=.+..++.
T Consensus 4 ~~Ai~~a~~Ave~D~~-g~y~eA~~~Y~~aie~l~~~~~~~-~~n~~~---k~~ir~K~~eYl~RAE~Lk~~ 70 (76)
T cd02681 4 RDAVQFARLAVQRDQE-GRYSEAVFYYKEAAQLLIYAEMAG-TLNDSH---LKTIQEKSNEYLDRAQALHQL 70 (76)
T ss_pred HHHHHHHHHHHHHHHc-cCHHHHHHHHHHHHHHHHHHHHhc-CCChHH---HHHHHHHHHHHHHHHHHHHHH
Confidence 35777777 7776664 3322222 555555555554442 122222 233355555555544434433
No 356
>PF02259 FAT: FAT domain; InterPro: IPR003151 The FAT domain is a domain present in the PIK-related kinases. Members of the family of PIK-related kinases may act as intracellular sensors that govern radial and horizontal pathways [].; GO: 0005515 protein binding
Probab=40.40 E-value=1.8e+02 Score=22.69 Aligned_cols=89 Identities=15% Similarity=0.125 Sum_probs=59.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcch-----HHHHhhC--------HHHHHHHHHHHHH-hhCCC----------------
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSA-----TNIYNLF--------PDDAVVQYTQACR-IIGTS---------------- 70 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~-----~kA~e~l--------y~dAa~~ye~Aw~-l~~~~---------------- 70 (122)
+.-++|.++.|...|. +.++++++ .-.+|+. ..+|....+...+ .....
T Consensus 155 ~aRk~g~~~~A~~~l~~~~~~~~~~~~~~~~v~~e~akllw~~g~~~~Ai~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~ 234 (352)
T PF02259_consen 155 LARKAGNFQLALSALNRLFQLNPSSESLLPRVFLEYAKLLWAQGEQEEAIQKLRELLKCRLSKNIDSISNAELKSGLLES 234 (352)
T ss_pred HHHHCCCcHHHHHHHHHHhccCCcccCCCcchHHHHHHHHHHcCCHHHHHHHHHHHHHHHhhhccccccHHHHhhccccc
Confidence 8889999999999999 98877322 2222222 6788888877777 11111
Q ss_pred ----------------ChHHHHHHHHHHHhc------ccHHHHHHHHHHHHHhCCCCHHHHH
Q psy16144 71 ----------------DLALHQKLASLLLRM------KRYAEAIDVSQTIMKAHPSYPKVHK 110 (122)
Q Consensus 71 ----------------~~~vgykLA~~ylk~------kr~~~AI~vc~kVL~~~P~ypkir~ 110 (122)
-..+.+.+|--.... +.+.+++..|+++.+.+|+..+.+.
T Consensus 235 ~~~~~~~~~~~~~~~~~a~~~l~~a~w~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~k~~~ 296 (352)
T PF02259_consen 235 LEVISSTNLDKESKELKAKAFLLLAKWLDELYSKLSSESSDEILKYYKEATKLDPSWEKAWH 296 (352)
T ss_pred cccccccchhhhhHHHHHHHHHHHHHHHHhhccccccccHHHHHHHHHHHHHhChhHHHHHH
Confidence 012233334333344 7778888999999999998888654
No 357
>PF06957 COPI_C: Coatomer (COPI) alpha subunit C-terminus; InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=39.13 E-value=41 Score=29.26 Aligned_cols=82 Identities=13% Similarity=0.004 Sum_probs=45.5
Q ss_pred HhcCCCChHHHHHHHH--HHhhCcchHHHHh---hC----HHHH------H-HHHHHHHHhhCC--CChHH---------
Q psy16144 22 LYLLPWRWEPLSAGLR--KKQYISSATNIYN---LF----PDDA------V-VQYTQACRIIGT--SDLAL--------- 74 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~--~l~~nps~~kA~e---~l----y~dA------a-~~ye~Aw~l~~~--~~~~v--------- 74 (122)
-|+-.|.||.|..+|+ +=-.|....+.+. |. |-.+ . .+.++-|+-+++ ..|.+
T Consensus 127 dhvAAGsFetAm~LLnrQiGivnF~PLk~~Fl~~y~~s~~~l~~~~~~p~l~~~~~r~~~~~~~~~~lP~i~~~l~~L~~ 206 (422)
T PF06957_consen 127 DHVAAGSFETAMQLLNRQIGIVNFEPLKPLFLEVYQASRTYLPALPSLPPLPSYIRRNWDESNPKNGLPAIPLSLSSLEE 206 (422)
T ss_dssp HHHHCT-HHHHHHHHHHHC-B---GGGHHHHHHHHCCTEEEE-SSTTTS-EEEEEBCTTTTSSSCCG-BB----HHHHHH
T ss_pred HHHHhCCHHHHHHHHHHHhCccccHHHHHHHHHHHHhhceecccCCCCCCccccccCCccccccccCCCcCcCCHHHHHH
Confidence 6888999999999999 4344666566554 11 0000 0 000112221111 11222
Q ss_pred HHHHHHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 75 HQKLASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 75 gykLA~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
..+.||-+.+.|+|.+|+++++.+|-.=|
T Consensus 207 ~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~ 235 (422)
T PF06957_consen 207 RLKEGYKLFTAGKFEEAIEIFRSILHSIP 235 (422)
T ss_dssp HHHHHHHHHHTT-HHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHhh
Confidence 34578888999999999999999997644
No 358
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=38.69 E-value=1.1e+02 Score=19.96 Aligned_cols=14 Identities=7% Similarity=-0.097 Sum_probs=6.8
Q ss_pred HHHHHHHH-HHhhCc
Q psy16144 30 EPLSAGLR-KKQYIS 43 (122)
Q Consensus 30 d~A~elL~-~l~~np 43 (122)
..|.++++ +++.+.
T Consensus 4 ~~a~~l~~~Ave~D~ 18 (77)
T cd02683 4 LAAKEVLKRAVELDQ 18 (77)
T ss_pred HHHHHHHHHHHHHHH
Confidence 34555555 544433
No 359
>KOG3081|consensus
Probab=37.98 E-value=2.4e+02 Score=23.60 Aligned_cols=86 Identities=16% Similarity=0.087 Sum_probs=61.3
Q ss_pred HhcCCCChHHHHHHHH-HHh-hCcchHHHHhh-------C--HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQ-YISSATNIYNL-------F--PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~-~nps~~kA~e~-------l--y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~ 90 (122)
+....+++.+|-=++. .-. +-|. +.-..- + |++|....+.|+.- .+++|.+.-+|=.+-+-.|+-.+
T Consensus 182 la~ggek~qdAfyifeE~s~k~~~T-~~llnG~Av~~l~~~~~eeAe~lL~eaL~k-d~~dpetL~Nliv~a~~~Gkd~~ 259 (299)
T KOG3081|consen 182 LATGGEKIQDAFYIFEELSEKTPPT-PLLLNGQAVCHLQLGRYEEAESLLEEALDK-DAKDPETLANLIVLALHLGKDAE 259 (299)
T ss_pred HhccchhhhhHHHHHHHHhcccCCC-hHHHccHHHHHHHhcCHHHHHHHHHHHHhc-cCCCHHHHHHHHHHHHHhCCChH
Confidence 5566666777777776 333 3343 222221 1 99999999999999 79999999999999888898877
Q ss_pred HHHHH-HHHHHhCCCCHHHH
Q psy16144 91 AIDVS-QTIMKAHPSYPKVH 109 (122)
Q Consensus 91 AI~vc-~kVL~~~P~ypkir 109 (122)
+.+=+ .+.-..||.+|=|+
T Consensus 260 ~~~r~l~QLk~~~p~h~~vk 279 (299)
T KOG3081|consen 260 VTERNLSQLKLSHPEHPFVK 279 (299)
T ss_pred HHHHHHHHHHhcCCcchHHH
Confidence 65533 34555789998775
No 360
>PF05131 Pep3_Vps18: Pep3/Vps18/deep orange family; InterPro: IPR007810 This region is found in a number of proteins identified as being involved in Golgi function and vacuolar sorting. The molecular function of this region is unknown. Proteins containing this domain also contain a C-terminal ring finger domain.
Probab=37.81 E-value=34 Score=25.27 Aligned_cols=29 Identities=24% Similarity=0.387 Sum_probs=20.6
Q ss_pred HHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy16144 80 SLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEK 115 (122)
Q Consensus 80 ~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~k 115 (122)
..|++.|+|++|..+|+.- |.-|..|+.+
T Consensus 111 k~yl~~~~fd~Al~~~~~~-------~~~~d~V~~~ 139 (147)
T PF05131_consen 111 KIYLDKGDFDEALQYCKTN-------PAQRDQVLIK 139 (147)
T ss_pred HHHHhcCcHHHHHHHccCC-------HHHHHHHHHH
Confidence 4589999999999988752 4445555543
No 361
>PRK10454 PTS system N,N'-diacetylchitobiose-specific transporter subunit IIA; Provisional
Probab=37.67 E-value=60 Score=23.23 Aligned_cols=41 Identities=10% Similarity=-0.045 Sum_probs=28.1
Q ss_pred hHHHHHHHHHHHHHH-----HhcCCCChHHHHHHHH-HHhhCcchHHHHh
Q psy16144 7 DTMQIIQTIAEWHFL-----LYLLPWRWEPLSAGLR-KKQYISSATNIYN 50 (122)
Q Consensus 7 ~~~~~~~~~~~~~~L-----Lyi~~gk~d~A~elL~-~l~~nps~~kA~e 50 (122)
..|+||-.-.+.+-+ =..+.|+|++|.++++ +=+ +-.+|+.
T Consensus 20 ~~~~II~~aG~ArS~~~eAl~~Ak~gdfe~A~~~l~eA~e---~l~~AH~ 66 (115)
T PRK10454 20 VVMGLIINSGQARSLAYAALKQAKQGDFAAAKAMMDQSRM---ALNEAHL 66 (115)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH---HHHHHHH
Confidence 456777655555544 5678999999999999 733 4555554
No 362
>KOG1914|consensus
Probab=37.41 E-value=1.1e+02 Score=28.18 Aligned_cols=62 Identities=10% Similarity=0.030 Sum_probs=50.8
Q ss_pred HHhhCcchHHHHhhC--------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 38 KKQYISSATNIYNLF--------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 38 ~l~~nps~~kA~e~l--------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
.++-||-+..+|.-+ +++.-..||+-..- -|..|.+...-...-++.+.|+.-+.++.+-|.
T Consensus 12 rie~nP~di~sw~~lire~qt~~~~~~R~~YEq~~~~-FP~s~r~W~~yi~~El~skdfe~VEkLF~RCLv 81 (656)
T KOG1914|consen 12 RIEENPYDIDSWSQLIREAQTQPIDKVRETYEQLVNV-FPSSPRAWKLYIERELASKDFESVEKLFSRCLV 81 (656)
T ss_pred HHhcCCccHHHHHHHHHHHccCCHHHHHHHHHHHhcc-CCCCcHHHHHHHHHHHHhhhHHHHHHHHHHHHH
Confidence 788899999999855 88888888888888 588888888888888888888887777766553
No 363
>cd00215 PTS_IIA_lac PTS_IIA, PTS system, lactose/cellobiose specific IIA subunit. The bacterial phosphoenolpyruvate: sugar phosphotransferase system (PTS) is a multi-protein system involved in the regulation of a variety of metabolic and transcriptional processes. This family is one of four structurally and functionally distinct group IIA PTS system cytoplasmic enzymes, necessary for the uptake of carbohydrates across the cytoplasmic membrane and their phosphorylation. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion. Separation into subunits is thought to occur after phosphorylation.
Probab=37.11 E-value=70 Score=22.06 Aligned_cols=41 Identities=12% Similarity=-0.059 Sum_probs=28.5
Q ss_pred hHHHHHHHHHHHHHH-----HhcCCCChHHHHHHHH-HHhhCcchHHHHh
Q psy16144 7 DTMQIIQTIAEWHFL-----LYLLPWRWEPLSAGLR-KKQYISSATNIYN 50 (122)
Q Consensus 7 ~~~~~~~~~~~~~~L-----Lyi~~gk~d~A~elL~-~l~~nps~~kA~e 50 (122)
..|+||-.-.+.+-. =+.+.|+|++|.++++ +=+ .-.+|+.
T Consensus 4 ~~~~iI~~aG~Ars~~~eAl~~a~~g~fe~A~~~l~ea~~---~l~~AH~ 50 (97)
T cd00215 4 ISFQIILHAGNARSKALEALKAAKEGDFAEAEELLEEAND---SLNEAHH 50 (97)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH---HHHHHHH
Confidence 357787766555544 5678999999999999 733 4455554
No 364
>KOG2047|consensus
Probab=37.08 E-value=3.3e+02 Score=25.72 Aligned_cols=96 Identities=10% Similarity=0.138 Sum_probs=73.3
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc------------------hHHHHhhC---------HHHHHHHHHHHHHhhCCCChH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS------------------ATNIYNLF---------PDDAVVQYTQACRIIGTSDLA 73 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps------------------~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~ 73 (122)
.-++...++.|..+++ ++..=.. +.+.|.+. ++.--..|++-+++ .-.-|.
T Consensus 434 mElrh~~~~~Al~lm~~A~~vP~~~~~~~yd~~~pvQ~rlhrSlkiWs~y~DleEs~gtfestk~vYdriidL-riaTPq 512 (835)
T KOG2047|consen 434 MELRHENFEAALKLMRRATHVPTNPELEYYDNSEPVQARLHRSLKIWSMYADLEESLGTFESTKAVYDRIIDL-RIATPQ 512 (835)
T ss_pred HHHhhhhHHHHHHHHHhhhcCCCchhhhhhcCCCcHHHHHHHhHHHHHHHHHHHHHhccHHHHHHHHHHHHHH-hcCCHH
Confidence 6677889999999999 6653222 22333322 44455678888999 688899
Q ss_pred HHHHHHHHHHhcccHHHHHHHHHHHHHhCC--CCHHHHHHHHHHHHh
Q psy16144 74 LHQKLASLLLRMKRYAEAIDVSQTIMKAHP--SYPKVHKQIFEKAMV 118 (122)
Q Consensus 74 vgykLA~~ylk~kr~~~AI~vc~kVL~~~P--~ypkir~~il~ka~~ 118 (122)
+-.+.|..+.-++-|++|-.+|++-..+.| +--.|+..-|.|...
T Consensus 513 ii~NyAmfLEeh~yfeesFk~YErgI~LFk~p~v~diW~tYLtkfi~ 559 (835)
T KOG2047|consen 513 IIINYAMFLEEHKYFEESFKAYERGISLFKWPNVYDIWNTYLTKFIK 559 (835)
T ss_pred HHHHHHHHHHhhHHHHHHHHHHHcCCccCCCccHHHHHHHHHHHHHH
Confidence 999999999999999999999999888765 556778888887754
No 365
>PRK09591 celC cellobiose phosphotransferase system IIA component; Reviewed
Probab=36.79 E-value=52 Score=23.02 Aligned_cols=44 Identities=11% Similarity=0.058 Sum_probs=30.8
Q ss_pred cchhHHHHHHHHHHHHHH-----HhcCCCChHHHHHHHH-HHhhCcchHHHHh
Q psy16144 4 LSVDTMQIIQTIAEWHFL-----LYLLPWRWEPLSAGLR-KKQYISSATNIYN 50 (122)
Q Consensus 4 ~~~~~~~~~~~~~~~~~L-----Lyi~~gk~d~A~elL~-~l~~nps~~kA~e 50 (122)
+....|+||-.-...+-. =+.+.|+|++|.++|+ +=+ .-.+|+.
T Consensus 6 le~~~~~II~~aG~Ars~~~eAl~~ak~gdf~~A~~~l~eA~~---~l~~AH~ 55 (104)
T PRK09591 6 LQVAAFEIILHSGNARTEVHEAFAAMREGNFDLAEQKLNQSNE---ELLEAHH 55 (104)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH---HHHHHHH
Confidence 345568888766665555 5678999999999999 733 4455554
No 366
>KOG3617|consensus
Probab=36.78 E-value=4.3e+02 Score=26.10 Aligned_cols=32 Identities=16% Similarity=0.070 Sum_probs=27.1
Q ss_pred CCChHHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 69 TSDLALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 69 ~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
..|.++.|-||.-|.-.|++++|+--+-++-.
T Consensus 964 sgd~AAcYhlaR~YEn~g~v~~Av~FfTrAqa 995 (1416)
T KOG3617|consen 964 SGDKAACYHLARMYENDGDVVKAVKFFTRAQA 995 (1416)
T ss_pred cccHHHHHHHHHHhhhhHHHHHHHHHHHHHHH
Confidence 35778899999999999999999988877653
No 367
>PF05944 Phage_term_smal: Phage small terminase subunit; InterPro: IPR010270 This entry is represented by Bacteriophage P2, GpM. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several phage small terminase subunit proteins as well as some related bacterial sequences []. M protein is probably an endonuclease which directs cos cleavage. The Q, P and M proteins are needed to package DNA into proheads and for the conversion of proheads to capsids.; GO: 0003677 DNA binding, 0004519 endonuclease activity, 0019069 viral capsid assembly
Probab=36.57 E-value=84 Score=22.94 Aligned_cols=48 Identities=2% Similarity=-0.153 Sum_probs=35.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC-----HHHHHHHHHHHHHhhCC
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF-----PDDAVVQYTQACRIIGT 69 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l-----y~dAa~~ye~Aw~l~~~ 69 (122)
=.+..|.|+.|.++-+ +++++-..|.-|... -++-+...+++.+.+.+
T Consensus 57 W~~D~Gd~~~AL~~a~yAi~~~l~~P~~f~R~~~t~vaeev~~~a~~~~~~g~~ 110 (132)
T PF05944_consen 57 WLFDVGDFDGALDIAEYAIEHGLPMPDRFKRTLPTFVAEEVADWALRAAKAGQS 110 (132)
T ss_pred hhhcccCHHHHHHHHHHHHHcCCCccccccCcchHHHHHHHHHHHHHHHHcCCC
Confidence 3468899999999999 999999999888755 44444444555555333
No 368
>COG2976 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=36.46 E-value=2.2e+02 Score=22.64 Aligned_cols=47 Identities=28% Similarity=0.242 Sum_probs=38.0
Q ss_pred HHHHHHHHHHHHHhhCCCCh--HHHHHHHHHHHhcccHHHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDL--ALHQKLASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~--~vgykLA~~ylk~kr~~~AI~vc~kVL 99 (122)
+++|...++.+...+...+. -+-.+||.+....|++++|..+...+-
T Consensus 105 ~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~t~~ 153 (207)
T COG2976 105 LDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAALKTLDTIK 153 (207)
T ss_pred HHHHHHHHHHHHccchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHhccc
Confidence 99999999999877543332 357899999999999999999766654
No 369
>PF04781 DUF627: Protein of unknown function (DUF627); InterPro: IPR006866 This domain represents the N-terminal region of several plant proteins of unknown function.
Probab=35.86 E-value=73 Score=22.80 Aligned_cols=32 Identities=9% Similarity=0.115 Sum_probs=27.4
Q ss_pred HHHHHHHhcccHHHHHHHHHHHHHhCCCCHHH
Q psy16144 77 KLASLLLRMKRYAEAIDVSQTIMKAHPSYPKV 108 (122)
Q Consensus 77 kLA~~ylk~kr~~~AI~vc~kVL~~~P~ypki 108 (122)
++|..+...|++.+|+++-++....|++....
T Consensus 1 e~A~~~~~rGnhiKAL~iied~i~~h~~~~~~ 32 (111)
T PF04781_consen 1 EKAKDYFARGNHIKALEIIEDLISRHGEDESS 32 (111)
T ss_pred ChHHHHHHccCHHHHHHHHHHHHHHccCCCch
Confidence 36788899999999999999999999977654
No 370
>TIGR00823 EIIA-LAC phosphotransferase system enzyme II, lactose-specific, factor III. operon. While the Lac permeases consist of two polypeptide chains (IIA and IICB), the Chb permease of E. coli consists of three (IIA, IIB and IIC). In B. subtilis, a PTS permease similar to the Chb permease of E. coli is believed to transport lichenan (a b-1,3;1,4 glucan) degradation products, oligosaccharides of 2-4 glucose units. This model is specific for the IIA subunit of the Lac PTS family.
Probab=35.86 E-value=75 Score=21.98 Aligned_cols=41 Identities=5% Similarity=-0.155 Sum_probs=28.9
Q ss_pred hHHHHHHHHHHHHHH-----HhcCCCChHHHHHHHH-HHhhCcchHHHHh
Q psy16144 7 DTMQIIQTIAEWHFL-----LYLLPWRWEPLSAGLR-KKQYISSATNIYN 50 (122)
Q Consensus 7 ~~~~~~~~~~~~~~L-----Lyi~~gk~d~A~elL~-~l~~nps~~kA~e 50 (122)
..|+||-.-.+.+-+ =..+.|+|++|.++++ +=+ .-..|+.
T Consensus 6 ~~~~iI~~aG~Ars~~~eAl~~a~~gdfe~A~~~l~eA~~---~l~~AH~ 52 (99)
T TIGR00823 6 VGFELIAYAGDARSKALEALKAAKAGDFAKARALVEQAGM---CLNEAHL 52 (99)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHH---HHHHHHH
Confidence 467888766665555 5678999999999999 733 4445554
No 371
>KOG1941|consensus
Probab=35.56 E-value=1.4e+02 Score=26.46 Aligned_cols=78 Identities=17% Similarity=0.132 Sum_probs=57.4
Q ss_pred ChHHHHHHHH-HHhhCcchH------------HHHhhC--HHHHHHHHHHHHHhhCCCCh-----HHHHHHHHHHHhccc
Q psy16144 28 RWEPLSAGLR-KKQYISSAT------------NIYNLF--PDDAVVQYTQACRIIGTSDL-----ALHQKLASLLLRMKR 87 (122)
Q Consensus 28 k~d~A~elL~-~l~~nps~~------------kA~e~l--y~dAa~~ye~Aw~l~~~~~~-----~vgykLA~~ylk~kr 87 (122)
.|.+++.+.+ ++.+--.|+ .|+.-+ ++.+..+||+|.++.+.++. .|...|+..|-..++
T Consensus 98 ~f~kt~~y~k~~l~lpgt~~~~~~gq~~l~~~~Ahlgls~fq~~Lesfe~A~~~A~~~~D~~LElqvcv~Lgslf~~l~D 177 (518)
T KOG1941|consen 98 EFHKTISYCKTCLGLPGTRAGQLGGQVSLSMGNAHLGLSVFQKALESFEKALRYAHNNDDAMLELQVCVSLGSLFAQLKD 177 (518)
T ss_pred HhhhHHHHHHHHhcCCCCCcccccchhhhhHHHHhhhHHHHHHHHHHHHHHHHHhhccCCceeeeehhhhHHHHHHHHHh
Confidence 3666777777 776533333 233333 99999999999999655442 478899999999999
Q ss_pred HHHHHHHHHHHHHhCCCC
Q psy16144 88 YAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 88 ~~~AI~vc~kVL~~~P~y 105 (122)
+++|..--.+++++-.++
T Consensus 178 ~~Kal~f~~kA~~lv~s~ 195 (518)
T KOG1941|consen 178 YEKALFFPCKAAELVNSY 195 (518)
T ss_pred hhHHhhhhHhHHHHHHhc
Confidence 999998888888765544
No 372
>COG2912 Uncharacterized conserved protein [Function unknown]
Probab=35.31 E-value=2.2e+02 Score=23.42 Aligned_cols=54 Identities=9% Similarity=-0.068 Sum_probs=44.9
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHh-----hC----HHHHHHHHHHHHHhhCCCChHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYN-----LF----PDDAVVQYTQACRIIGTSDLALHQ 76 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e-----~l----y~dAa~~ye~Aw~l~~~~~~~vgy 76 (122)
.|.+.++++.|..... .+..||..+..+. |. +.-|..-++...+.+ |++|.+-+
T Consensus 190 ~~~~e~~~~~al~~~~r~l~l~P~dp~eirDrGliY~ql~c~~vAl~dl~~~~~~~-P~~~~a~~ 253 (269)
T COG2912 190 ALLRELQWELALRVAERLLDLNPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHC-PDDPIAEM 253 (269)
T ss_pred HHHHhhchHHHHHHHHHHHhhCCCChhhccCcHHHHHhcCCchhhHHHHHHHHHhC-CCchHHHH
Confidence 8999999999999999 9999999885444 33 889999999999995 88876543
No 373
>PF11817 Foie-gras_1: Foie gras liver health family 1; InterPro: IPR021773 Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats.
Probab=34.54 E-value=1.7e+02 Score=22.74 Aligned_cols=47 Identities=13% Similarity=0.069 Sum_probs=36.2
Q ss_pred HHHHHHHHHHHHHhhC-CC----ChHHHHHHHHHHHhcccHHHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIG-TS----DLALHQKLASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~-~~----~~~vgykLA~~ylk~kr~~~AI~vc~kVL 99 (122)
|++|...|+.+...-. +. ...+.-+|..++.+.|+.++.+.+|-+.+
T Consensus 194 ~~~A~~~l~~~~~~yr~egW~~l~~~~l~~l~~Ca~~~~~~~~~l~~~leLl 245 (247)
T PF11817_consen 194 YDKALKLLEPAASSYRREGWWSLLTEVLWRLLECAKRLGDVEDYLTTSLELL 245 (247)
T ss_pred HHHHHHHHHHHHHHHHhCCcHHHHHHHHHHHHHHHHHhCCHHHHHHHHHHHh
Confidence 8888888888843211 11 25678899999999999999999988766
No 374
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=33.93 E-value=79 Score=21.11 Aligned_cols=33 Identities=9% Similarity=0.046 Sum_probs=19.8
Q ss_pred CChHHHHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHH
Q psy16144 27 WRWEPLSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACR 65 (122)
Q Consensus 27 gk~d~A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~ 65 (122)
+-|+.|-+.+. +|+++-.- ..++|+.+|+++..
T Consensus 3 ~~~~~A~~~I~kaL~~dE~g------~~e~Al~~Y~~gi~ 36 (79)
T cd02679 3 GYYKQAFEEISKALRADEWG------DKEQALAHYRKGLR 36 (79)
T ss_pred hHHHHHHHHHHHHhhhhhcC------CHHHHHHHHHHHHH
Confidence 45788888888 87766541 13445555555544
No 375
>KOG1914|consensus
Probab=33.90 E-value=1.9e+02 Score=26.67 Aligned_cols=36 Identities=11% Similarity=0.135 Sum_probs=32.7
Q ss_pred HHHHHHH-HhcCCCChHHHHHHHH-HHhhCcchHHHHh
Q psy16144 15 IAEWHFL-LYLLPWRWEPLSAGLR-KKQYISSATNIYN 50 (122)
Q Consensus 15 ~~~~~~L-Lyi~~gk~d~A~elL~-~l~~nps~~kA~e 50 (122)
|..|..| =+.++..+|+..+... .+...|+.+.+|-
T Consensus 20 i~sw~~lire~qt~~~~~~R~~YEq~~~~FP~s~r~W~ 57 (656)
T KOG1914|consen 20 IDSWSQLIREAQTQPIDKVRETYEQLVNVFPSSPRAWK 57 (656)
T ss_pred HHHHHHHHHHHccCCHHHHHHHHHHHhccCCCCcHHHH
Confidence 7789999 7788889999999999 9999999999996
No 376
>KOG1915|consensus
Probab=33.88 E-value=3.3e+02 Score=24.96 Aligned_cols=79 Identities=9% Similarity=-0.033 Sum_probs=51.1
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC--------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF--------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAI 92 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l--------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI 92 (122)
--|++-+.+.|.+.|. ++-.-|-.-..--|. ++---..|++=++. +|.|..+..+.|-.=...|.-+.|-
T Consensus 413 feIRq~~l~~ARkiLG~AIG~cPK~KlFk~YIelElqL~efDRcRkLYEkfle~-~Pe~c~~W~kyaElE~~LgdtdRaR 491 (677)
T KOG1915|consen 413 FEIRQLNLTGARKILGNAIGKCPKDKLFKGYIELELQLREFDRCRKLYEKFLEF-SPENCYAWSKYAELETSLGDTDRAR 491 (677)
T ss_pred HHHHHcccHHHHHHHHHHhccCCchhHHHHHHHHHHHHhhHHHHHHHHHHHHhc-ChHhhHHHHHHHHHHHHhhhHHHHH
Confidence 5566667777777777 665444321111122 66677777777777 4888888777777777777777777
Q ss_pred HHHHHHHHh
Q psy16144 93 DVSQTIMKA 101 (122)
Q Consensus 93 ~vc~kVL~~ 101 (122)
.++.-++.+
T Consensus 492 aifelAi~q 500 (677)
T KOG1915|consen 492 AIFELAISQ 500 (677)
T ss_pred HHHHHHhcC
Confidence 776666643
No 377
>KOG1463|consensus
Probab=33.79 E-value=3.2e+02 Score=23.78 Aligned_cols=25 Identities=28% Similarity=0.526 Sum_probs=21.7
Q ss_pred HHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 76 QKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 76 ykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
.+|...|++.++|.+|+..-..+++
T Consensus 132 arli~Ly~d~~~YteAlaL~~~L~r 156 (411)
T KOG1463|consen 132 ARLIRLYNDTKRYTEALALINDLLR 156 (411)
T ss_pred HHHHHHHHhhHHHHHHHHHHHHHHH
Confidence 4799999999999999998776664
No 378
>KOG0546|consensus
Probab=32.59 E-value=84 Score=27.04 Aligned_cols=68 Identities=16% Similarity=0.050 Sum_probs=57.8
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAE 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~ 90 (122)
..+..+.+..|..... ++..++|+.+|+... +++|...++.|... .|++..+--.+...-.+.+.+.+
T Consensus 284 ~~lk~~~~~~a~~~~~~~~~~~~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~-~p~d~~i~~~~~~~~~~~~~~~~ 361 (372)
T KOG0546|consen 284 VGLKVKGRGGARFRTNEALRDERSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQK-APNDKAIEEELENVRQKKKQYNR 361 (372)
T ss_pred hcccccCCCcceeccccccccChhhCcHHHHHHhHHHhhhchhhhHHHHHHhhcc-CcchHHHHHHHHHhhhHHHHHHH
Confidence 4566788888888888 899999999999844 99999999999999 79999999988888777776654
No 379
>TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK. This gene is found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.
Probab=32.59 E-value=2.2e+02 Score=21.53 Aligned_cols=80 Identities=11% Similarity=-0.114 Sum_probs=50.5
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHH-----HHhhC----HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATN-----IYNLF----PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~k-----A~e~l----y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
..+..++.++++++|. .--..|..+. +|-.+ |.||+..++..-+= .+.+|-.-=-+|+++.-.|+..=
T Consensus 19 ~aL~~~d~~D~e~lLdALrvLrP~~~e~d~~dg~l~i~rg~w~eA~rvlr~l~~~-~~~~p~~kAL~A~CL~al~Dp~W- 96 (153)
T TIGR02561 19 YALRSADPYDAQAMLDALRVLRPNLKELDMFDGWLLIARGNYDEAARILRELLSS-AGAPPYGKALLALCLNAKGDAEW- 96 (153)
T ss_pred HHHhcCCHHHHHHHHHHHHHhCCCccccchhHHHHHHHcCCHHHHHHHHHhhhcc-CCCchHHHHHHHHHHHhcCChHH-
Confidence 4556888888988888 6667777443 33333 88898888887776 46666555566676666665432
Q ss_pred HHHHHHHHHhCC
Q psy16144 92 IDVSQTIMKAHP 103 (122)
Q Consensus 92 I~vc~kVL~~~P 103 (122)
-..-.+||...+
T Consensus 97 r~~A~~~le~~~ 108 (153)
T TIGR02561 97 HVHADEVLARDA 108 (153)
T ss_pred HHHHHHHHHhCC
Confidence 222334554444
No 380
>PF04053 Coatomer_WDAD: Coatomer WD associated region ; InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits. This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=32.29 E-value=3.3e+02 Score=23.54 Aligned_cols=27 Identities=22% Similarity=0.128 Sum_probs=18.4
Q ss_pred ChHHHHHHHHHHHhcccHHHHHHHHHH
Q psy16144 71 DLALHQKLASLLLRMKRYAEAIDVSQT 97 (122)
Q Consensus 71 ~~~vgykLA~~ylk~kr~~~AI~vc~k 97 (122)
++..+-+||...|+.|++.-|+.++++
T Consensus 346 ~~~~W~~Lg~~AL~~g~~~lAe~c~~k 372 (443)
T PF04053_consen 346 DPEKWKQLGDEALRQGNIELAEECYQK 372 (443)
T ss_dssp THHHHHHHHHHHHHTTBHHHHHHHHHH
T ss_pred cHHHHHHHHHHHHHcCCHHHHHHHHHh
Confidence 466677777777777777777665554
No 381
>PF10602 RPN7: 26S proteasome subunit RPN7; InterPro: IPR019585 This entry represents the regulatory subunit RPN7 (known as the non-ATPase regulatory subunit 6 in higher eukaryotes) of the 26S proteasome. This entry also matches the evolutionarily related subunit 1 of the COP9 signalosome complex (CSN) from Arabidopsis []. The 26S proteasome plays a major role in ATP-dependent degradation of ubiquitinated proteins. Substrate specificity is conferred by the regulatory particle (RP), which can dissociate into stable lid and base subcomplexes. The regulatory subunit RPN7 is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae (Baker's yeast) to be required for structural integrity []. The COP9 signalosome is a conserved protein complex composed of eight subunits, where Individual subunits of the complex have been linked to various signal transduction pathways leading to gene expression and cell cycle control []. The overall organisation and the amino acid sequences of the COP9 signalosome subunits resemble the lid subcomplex of the 19 S regulatory particle for the 26 S proteasome []. COP9 subunit 1 (CSN1 or GPS1) of the COP9 complex is an essential subunit of the complex with regard to both structural integrity and functionality. The N-terminal region of subunit 1 (CSN1-N) can inhibit c-fos expression from either a transfected template or a chromosomal transgene (fos-lacZ), and may contain the activity domain that confers most of the repression functions of CSN1. The C-terminal region of subunit 1 (CSN1-C) allows integration of the protein into the COP9 signalosome.
Probab=31.74 E-value=1.5e+02 Score=22.11 Aligned_cols=24 Identities=17% Similarity=0.203 Sum_probs=14.1
Q ss_pred HHHHHHHHhcccHHHHHHHHHHHH
Q psy16144 76 QKLASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 76 ykLA~~ylk~kr~~~AI~vc~kVL 99 (122)
..||..|.+.|+..+|+..|.++.
T Consensus 40 ~~l~~~~~~~Gd~~~A~k~y~~~~ 63 (177)
T PF10602_consen 40 EDLADHYCKIGDLEEALKAYSRAR 63 (177)
T ss_pred HHHHHHHHHhhhHHHHHHHHHHHh
Confidence 355666666666666666665544
No 382
>PF07035 Mic1: Colon cancer-associated protein Mic1-like; InterPro: IPR009755 This entry represents the C terminus (approximately 160 residues) of a number of proteins that resemble colon cancer-associated protein Mic1.
Probab=30.71 E-value=2e+02 Score=21.72 Aligned_cols=93 Identities=14% Similarity=0.112 Sum_probs=54.3
Q ss_pred HhcCCCChHHHHHHHHHHhhCcchHHHHhhC-HHHH-HHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLRKKQYISSATNIYNLF-PDDA-VVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIM 99 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~~l~~nps~~kA~e~l-y~dA-a~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL 99 (122)
+.++.|++..-..+++.=-..+|-+-|...+ +... ...++-|.+.... -....-..-..++..|++.+|+...++.-
T Consensus 38 lLi~~~~~~~L~qllq~~Vi~DSk~lA~~LLs~~~~~~~~~Ql~lDMLkR-L~~~~~~iievLL~~g~vl~ALr~ar~~~ 116 (167)
T PF07035_consen 38 LLIRNGQFSQLHQLLQYHVIPDSKPLACQLLSLGNQYPPAYQLGLDMLKR-LGTAYEEIIEVLLSKGQVLEALRYARQYH 116 (167)
T ss_pred HHHHcCCHHHHHHHHhhcccCCcHHHHHHHHHhHccChHHHHHHHHHHHH-hhhhHHHHHHHHHhCCCHHHHHHHHHHcC
Confidence 8889999999999999222355656666544 1110 1124444443211 01122223368999999999999888764
Q ss_pred HhCCCCHHHHHHHHHHHHh
Q psy16144 100 KAHPSYPKVHKQIFEKAMV 118 (122)
Q Consensus 100 ~~~P~ypkir~~il~ka~~ 118 (122)
..+. +.++ .+||.|..
T Consensus 117 ~~~~--~~~~-~fLeAA~~ 132 (167)
T PF07035_consen 117 KVDS--VPAR-KFLEAAAN 132 (167)
T ss_pred Cccc--CCHH-HHHHHHHH
Confidence 4433 2233 37777754
No 383
>PF04840 Vps16_C: Vps16, C-terminal region; InterPro: IPR006925 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=30.52 E-value=1.6e+02 Score=24.22 Aligned_cols=84 Identities=18% Similarity=0.277 Sum_probs=51.5
Q ss_pred HHHH--HhcCCCChHHHHHHHHHHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcc
Q psy16144 18 WHFL--LYLLPWRWEPLSAGLRKKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMK 86 (122)
Q Consensus 18 ~~~L--Lyi~~gk~d~A~elL~~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~k 86 (122)
|++. -++..|+||+=.+.-+. -+ +|.+|+-. ..+|..+..+ + +. -+-...|++.|
T Consensus 211 w~lki~aLa~~~~w~eL~~fa~s---kK-sPIGyepFv~~~~~~~~~~eA~~yI~k---~--~~-----~~rv~~y~~~~ 276 (319)
T PF04840_consen 211 WWLKIKALAENKDWDELEKFAKS---KK-SPIGYEPFVEACLKYGNKKEASKYIPK---I--PD-----EERVEMYLKCG 276 (319)
T ss_pred HHHHHHHHHhcCCHHHHHHHHhC---CC-CCCChHHHHHHHHHCCCHHHHHHHHHh---C--Ch-----HHHHHHHHHCC
Confidence 4444 77888999866553222 23 47777733 6666666665 2 11 23467789999
Q ss_pred cHHHHHHHHH---------HHHHhCCCC-HHHHHHHHHH
Q psy16144 87 RYAEAIDVSQ---------TIMKAHPSY-PKVHKQIFEK 115 (122)
Q Consensus 87 r~~~AI~vc~---------kVL~~~P~y-pkir~~il~k 115 (122)
.+.+|++.|. .+++.-|++ +....+.++.
T Consensus 277 ~~~~A~~~A~~~kd~~~L~~i~~~~~~~~~~~~~~~i~~ 315 (319)
T PF04840_consen 277 DYKEAAQEAFKEKDIDLLKQILKRCPGNNDQLIADKIEQ 315 (319)
T ss_pred CHHHHHHHHHHcCCHHHHHHHHHHCCCCChHHHHHHHHH
Confidence 9999998865 566666643 4443344443
No 384
>KOG3783|consensus
Probab=30.47 E-value=3.7e+02 Score=24.38 Aligned_cols=95 Identities=9% Similarity=-0.049 Sum_probs=59.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc-----hHHHHhhC--------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhccc
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS-----ATNIYNLF--------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKR 87 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps-----~~kA~e~l--------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr 87 (122)
++...|+-+.|.+.+. ++. ++ ....||.. |.+|+..+....+.++=+..---|-.|-+|+...+
T Consensus 276 ~l~~~g~~eaa~~~~~~~v~--~~~kQ~~~l~~fE~aw~~v~~~~~~~aad~~~~L~desdWS~a~Y~Yfa~cc~l~~~~ 353 (546)
T KOG3783|consen 276 ILSIKGNSEAAIDMESLSIP--IRMKQVKSLMVFERAWLSVGQHQYSRAADSFDLLRDESDWSHAFYTYFAGCCLLQNWE 353 (546)
T ss_pred HHHHcccHHHHHHHHHhccc--HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHhhhhhhHHHHHHHHHHHHhccHH
Confidence 6666677888888888 776 22 23344422 99999999999888755545455555566665554
Q ss_pred HHHH-----------HHHHHHHHHhCCCCHHHHHHHHHHHHh
Q psy16144 88 YAEA-----------IDVSQTIMKAHPSYPKVHKQIFEKAMV 118 (122)
Q Consensus 88 ~~~A-----------I~vc~kVL~~~P~ypkir~~il~ka~~ 118 (122)
-... .++-+..++..|-+..+.+=+.-|+++
T Consensus 354 ~~q~~~~ne~~a~~~~k~~~~l~~~a~K~~P~E~f~~RKver 395 (546)
T KOG3783|consen 354 VNQGAGGNEEKAQLYFKVGEELLANAGKNLPLEKFIVRKVER 395 (546)
T ss_pred HHHhcccchhHHHHHHHHHHHHHHhccccCchhHHHHHHHHH
Confidence 4433 224445566656555555556666654
No 385
>KOG0739|consensus
Probab=30.08 E-value=2.1e+02 Score=24.75 Aligned_cols=31 Identities=10% Similarity=0.022 Sum_probs=14.9
Q ss_pred HHHHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHHh
Q psy16144 30 EPLSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACRI 66 (122)
Q Consensus 30 d~A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~l 66 (122)
.+|.++.+ +++.+.. .- |.+|...|++|+++
T Consensus 8 ~kaI~lv~kA~~eD~a--~n----Y~eA~~lY~~aleY 39 (439)
T KOG0739|consen 8 QKAIDLVKKAIDEDNA--KN----YEEALRLYQNALEY 39 (439)
T ss_pred HHHHHHHHHHhhhcch--hc----hHHHHHHHHHHHHH
Confidence 45666666 6553321 11 44444455444444
No 386
>KOG0889|consensus
Probab=29.57 E-value=84 Score=34.33 Aligned_cols=54 Identities=20% Similarity=0.225 Sum_probs=43.3
Q ss_pred HHHHHH---HhcCCCChHHHHHHHH-HHhhCcchHHHHh----hC-------------HHHHHHHHHHHHHhhCC
Q psy16144 16 AEWHFL---LYLLPWRWEPLSAGLR-KKQYISSATNIYN----LF-------------PDDAVVQYTQACRIIGT 69 (122)
Q Consensus 16 ~~~~~L---Lyi~~gk~d~A~elL~-~l~~nps~~kA~e----~l-------------y~dAa~~ye~Aw~l~~~ 69 (122)
||...+ +..+.|++++|...+. +++..-.|+|||. |+ -..|..||-+|..+.+.
T Consensus 2812 aeff~lkG~f~~kL~~~eeAn~~fs~AvQi~~~l~KaW~~Wg~y~~~~f~~e~~ni~~a~~avsCyLqA~~~~~~ 2886 (3550)
T KOG0889|consen 2812 AEFFTLKGMFLEKLGKFEEANKAFSAAVQIDDGLGKAWAEWGKYLDNRFNKEPVNISFACNAVSCYLQAARLYNS 2886 (3550)
T ss_pred HHHHHhhhHHHHHhcCcchhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHhccCcccHHHHHHHHHHHHHhccccc
Confidence 444444 8889999999999999 9999999999998 22 45677888888877533
No 387
>KOG1550|consensus
Probab=29.21 E-value=4e+02 Score=23.45 Aligned_cols=44 Identities=18% Similarity=0.147 Sum_probs=27.4
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcc---cHHHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMK---RYAEAIDVSQTIM 99 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~k---r~~~AI~vc~kVL 99 (122)
+..|...|.+|=+++ +|..-|.||.+|+..- ++..|-..|+.+-
T Consensus 309 ~~~A~~~~~~aA~~g---~~~a~~~lg~~~~~g~~~~d~~~A~~yy~~Aa 355 (552)
T KOG1550|consen 309 YEKALKLYTKAAELG---NPDAQYLLGVLYETGTKERDYRRAFEYYSLAA 355 (552)
T ss_pred HHHHHHHHHHHHhcC---CchHHHHHHHHHHcCCccccHHHHHHHHHHHH
Confidence 456777777776664 4666677777777655 3456666555544
No 388
>KOG1920|consensus
Probab=28.93 E-value=1.3e+02 Score=29.90 Aligned_cols=38 Identities=24% Similarity=0.245 Sum_probs=22.2
Q ss_pred HHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy16144 76 QKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 76 ykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka~ 117 (122)
..|+.-+.-.+|+.||-.+ +..+=++|+=--..|=||+
T Consensus 1003 ~~L~s~L~e~~kh~eAa~i----l~e~~sd~~~av~ll~ka~ 1040 (1265)
T KOG1920|consen 1003 EELVSRLVEQRKHYEAAKI----LLEYLSDPEEAVALLCKAK 1040 (1265)
T ss_pred HHHHHHHHHcccchhHHHH----HHHHhcCHHHHHHHHhhHh
Confidence 7777777777777777763 3334344444444444443
No 389
>PF11417 Inhibitor_G39P: Loader and inhibitor of phage G40P; InterPro: IPR024424 G39P inhibits the initiation of DNA replication by blocking G40P replicative helicase. G39P has a bipartite stricture consisting of a folded N-terminal domain and an unfolded C-terminal domain. The C-terminal is essential for helicase interaction [].; PDB: 1NO1_B.
Probab=28.89 E-value=1.2e+02 Score=19.84 Aligned_cols=28 Identities=18% Similarity=0.285 Sum_probs=16.0
Q ss_pred HHHHHHHHHhcCCCChHHHHHHHH-HHhhCc
Q psy16144 14 TIAEWHFLLYLLPWRWEPLSAGLR-KKQYIS 43 (122)
Q Consensus 14 ~~~~~~~LLyi~~gk~d~A~elL~-~l~~np 43 (122)
+|.-||.+| ....|+.+..-|+ .+.-||
T Consensus 32 ~v~~W~~~L--~d~~ye~v~~al~~~i~~~k 60 (71)
T PF11417_consen 32 TVDLWYDML--KDYDYEIVMKALKKHIATNK 60 (71)
T ss_dssp HHHHHHHHH--TTS-HHHHHHHHHHHHHH-S
T ss_pred HHHHHHHHH--HhCCHHHHHHHHHHHHHhCC
Confidence 455666665 5566777777666 555444
No 390
>PF06667 PspB: Phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages []. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one [].; GO: 0006355 regulation of transcription, DNA-dependent, 0009271 phage shock
Probab=28.75 E-value=1.5e+02 Score=19.66 Aligned_cols=49 Identities=18% Similarity=0.285 Sum_probs=39.9
Q ss_pred HHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHH
Q psy16144 57 VVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPK 107 (122)
Q Consensus 57 a~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypk 107 (122)
.-+|..-|+-++.-+..=--.|...+.++.|..+=|+..+++|+ .++|.
T Consensus 23 ~lHY~sk~~~~~gLs~~d~~~L~~L~~~a~rm~eRI~tLE~ILd--ae~P~ 71 (75)
T PF06667_consen 23 ILHYRSKWKSSQGLSEEDEQRLQELYEQAERMEERIETLERILD--AEHPN 71 (75)
T ss_pred HHHHHHhcccCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHc--CCCCC
Confidence 46788888887676777778899999999999999999999994 34443
No 391
>PF10345 Cohesin_load: Cohesin loading factor; InterPro: IPR019440 Cohesin loading factor is a conserved protein that has been characterised in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes, but is dispensable in G2 when cohesion has been established. It is often referred to as Ssl3 in Schizosaccharomyces pombe (Fission yeast), and Scc4 in Saccharomyces cerevisiae (Baker's yeast). It complexes with Mis4 [].
Probab=28.69 E-value=4.1e+02 Score=23.47 Aligned_cols=87 Identities=18% Similarity=0.080 Sum_probs=57.2
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcc--hHHHHhhC------------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhc-
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISS--ATNIYNLF------------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRM- 85 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps--~~kA~e~l------------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~- 85 (122)
|+-.+..+|+|+.+|. ++....+ ..+....+ +..|.....++.+.+ .+.+......+|-+++.
T Consensus 69 L~~eT~n~~~Ae~~L~k~~~l~~~~~~~d~k~~~~~ll~~i~~~~~~~~a~~~l~~~I~~~-~~~~~~~w~~~frll~~~ 147 (608)
T PF10345_consen 69 LLEETENLDLAETYLEKAILLCERHRLTDLKFRCQFLLARIYFKTNPKAALKNLDKAIEDS-ETYGHSAWYYAFRLLKIQ 147 (608)
T ss_pred HHHHcCCHHHHHHHHHHHHHhccccchHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHH-hccCchhHHHHHHHHHHH
Confidence 4446778999999999 7666643 33221111 666888999998886 44444544445555533
Q ss_pred -----ccHHHHHHHHHHHHHhC--CCCHHHH
Q psy16144 86 -----KRYAEAIDVSQTIMKAH--PSYPKVH 109 (122)
Q Consensus 86 -----kr~~~AI~vc~kVL~~~--P~ypkir 109 (122)
+++.-|++.++.+.... +..|.++
T Consensus 148 l~~~~~d~~~Al~~L~~~~~~a~~~~d~~~~ 178 (608)
T PF10345_consen 148 LALQHKDYNAALENLQSIAQLANQRGDPAVF 178 (608)
T ss_pred HHHhcccHHHHHHHHHHHHHHhhhcCCHHHH
Confidence 78999999998887655 4555553
No 392
>PF13226 DUF4034: Domain of unknown function (DUF4034)
Probab=28.43 E-value=1.6e+02 Score=24.17 Aligned_cols=20 Identities=10% Similarity=-0.197 Sum_probs=16.4
Q ss_pred HHHHHHH-HHhhCcchHHHHh
Q psy16144 31 PLSAGLR-KKQYISSATNIYN 50 (122)
Q Consensus 31 ~A~elL~-~l~~nps~~kA~e 50 (122)
.-.+.|+ .++.+|.+.-||.
T Consensus 61 ~~~~~LkaWv~a~P~Sy~A~L 81 (277)
T PF13226_consen 61 ARLAVLKAWVAACPKSYHAHL 81 (277)
T ss_pred hHHHHHHHHHHHCCCChHHHH
Confidence 3678899 9999998787775
No 393
>TIGR03549 conserved hypothetical protein TIGR03549. This family consists of remarkably well-conserved proteins from gamma and beta Proteobacteria, heavily skewed towards organisms of marine environments. This family has an OsmC-like N-terminal domain. It shares a central domain, modeled by pfam02624 and TIGR00702, with other families of smaller proteins. The function is unknown. Fifteen of the first sixteen members of this family are from selenouridine-positive genomes, but this correlation may be fortuitous.
Probab=28.21 E-value=3.2e+02 Score=25.63 Aligned_cols=32 Identities=28% Similarity=0.259 Sum_probs=28.7
Q ss_pred HHHHHHHHHhcCCCChHHHHHHHH-HHhhCcch
Q psy16144 14 TIAEWHFLLYLLPWRWEPLSAGLR-KKQYISSA 45 (122)
Q Consensus 14 ~~~~~~~LLyi~~gk~d~A~elL~-~l~~nps~ 45 (122)
+|.|+-.||++..|..|+|.+.+. ++++|-+.
T Consensus 606 ~~~elk~~~~la~g~~~~al~~~~~~~~~~~~~ 638 (718)
T TIGR03549 606 TILELKLLIYLALQRHEDALELVGMFLQFNDNT 638 (718)
T ss_pred ehHHHHHHHHHhcCCHHHHHHHHHHHHHhCCCc
Confidence 577888889999999999999999 99999883
No 394
>PF13311 DUF4080: Protein of unknown function (DUF4080)
Probab=27.97 E-value=2.7e+02 Score=21.05 Aligned_cols=43 Identities=23% Similarity=0.275 Sum_probs=33.0
Q ss_pred cccchhHHHHHHHHHHHHHH-HhcCCCChHHHHHHHH-HHhhCcchHHHH
Q psy16144 2 KSLSVDTMQIIQTIAEWHFL-LYLLPWRWEPLSAGLR-KKQYISSATNIY 49 (122)
Q Consensus 2 ~~~~~~~~~~~~~~~~~~~L-Lyi~~gk~d~A~elL~-~l~~nps~~kA~ 49 (122)
+.||.+.|+-+..+++ ++ +|-++|+|....++|- .. +|....+
T Consensus 20 ~~Ls~~ei~~Lk~~e~--~le~yyNSgrF~~tl~~l~~~~---~spF~ff 64 (190)
T PF13311_consen 20 KWLSFDEIQRLKRFED--MLEKYYNSGRFDYTLEYLLKEF---GSPFDFF 64 (190)
T ss_pred CCCCHHHHHHHHHHHH--HHHHHhhhhhHHHHHHHHHHhc---CCHHHHH
Confidence 4689999999998876 55 8999999999988876 43 4544444
No 395
>KOG3303|consensus
Probab=27.51 E-value=3.1e+02 Score=21.58 Aligned_cols=86 Identities=10% Similarity=-0.113 Sum_probs=50.6
Q ss_pred HHHHHHHHHHHH-----HhcCCCChHH-----HHHHHH-HHhhCcchHHHHhhCHHHHHHHHHHHHHhhCCCChHHHHH-
Q psy16144 10 QIIQTIAEWHFL-----LYLLPWRWEP-----LSAGLR-KKQYISSATNIYNLFPDDAVVQYTQACRIIGTSDLALHQK- 77 (122)
Q Consensus 10 ~~~~~~~~~~~L-----Lyi~~gk~d~-----A~elL~-~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~~~~~~~vgyk- 77 (122)
||++.|..++-. =-+++|-.++ ..-... ++..|+.|..||- |.-+......+|+. +-+-|+.+..
T Consensus 32 ~i~~E~~~l~~~~~~i~~~~~~g~~~d~~~~~~~~m~h~~~~RNkRClmAY~--~~R~~~lr~~~W~~-g~~lps~i~~s 108 (192)
T KOG3303|consen 32 QILEEIDALDARNESIKQKAQSGGSDDNLLPVNYLMMHLSLERNKRCLMAYV--YLRLARLRDLAWSY-GLDLPSSIRFS 108 (192)
T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCccchhHHHHHHHHHHHHHhHHHHHHHH--HHHHHHHHHHHHhc-CCCccHHHHHh
Confidence 566666665554 3345666555 233445 8999999999875 44555667789999 5555555433
Q ss_pred HHH-HHHhcccHHHHHHHHHHH
Q psy16144 78 LAS-LLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 78 LA~-~ylk~kr~~~AI~vc~kV 98 (122)
|+- =-.-.++|-.+++-|..-
T Consensus 109 ls~~E~eyf~~Ys~~La~y~~~ 130 (192)
T KOG3303|consen 109 LSHEEEEYFKNYSNLLAEYMGP 130 (192)
T ss_pred hhHHHHHHHHHHHHHHHHHhcC
Confidence 221 112245666666655543
No 396
>PF10955 DUF2757: Protein of unknown function (DUF2757); InterPro: IPR020115 This entry contains proteins with no known function.
Probab=26.94 E-value=41 Score=22.61 Aligned_cols=16 Identities=19% Similarity=0.436 Sum_probs=12.7
Q ss_pred HHHHHHHHhCCCCHHH
Q psy16144 93 DVSQTIMKAHPSYPKV 108 (122)
Q Consensus 93 ~vc~kVL~~~P~ypki 108 (122)
+-|++.|+.||++-..
T Consensus 56 e~C~eaL~~nPe~~~~ 71 (76)
T PF10955_consen 56 EDCQEALERNPEYHEY 71 (76)
T ss_pred HHHHHHHHhCcchhhh
Confidence 3499999999987544
No 397
>PF12925 APP_E2: E2 domain of amyloid precursor protein; InterPro: IPR024329 Amyloid-beta precursor protein (APP, or A4) is associated with Alzheimer's disease (AD), because one of its breakdown products, amyloid-beta (A-beta), aggregates to form amyloid or senile plaques [, ]. Mutations in APP or in proteins that process APP have been linked with early-onset, familial AD. Individuals with Down's syndrome carry an extra copy of chromosome 21, which contains the APP gene, and almost invariably develop amyloid plaques and Alzheimer's symptoms. APP is important for the neurogenesis and neuronal regeneration, either through the intact protein, or through its many breakdown products []. APP consists of a large N-terminal extracellular region containing heparin-binding and copper-binding sites, a short hydrophobic transmembrane domain, and a short C-terminal intracellular domain. The N-terminal region is similar in structure to cysteine-rich growth factors and appears to function as a cell surface receptor, contributing to neurite growth, neuronal adhesion, axonogenesis and cell mobility []. APP acts as a kinesin I membrane receptor to mediate the axonal transport of beta-secretase and presenilin 1. The N-terminal domain can regulate neurite outgrowth through its binding to heparin and collagen I and IV, which are components of the extracellular matrix. APP is also coupled to apoptosis-inducing pathways, and is involved in copper homeostasis/oxidative stress through copper ion reduction, where copper-metallated APP induces neuronal death []. The C-terminal intracellular domain appears to be involved in transcription regulation through protein-protein interactions. APP can promote transcription activation through binding to APBB1/Tip60, and may bind to the adaptor protein FE65 to transactivate a wide variety of different promoters. APP can be processed by different sets of enzymes: In the non-amyloidogenic (non-plaque-forming) pathway, APP is cleaved by alpha-secretase to yield a soluble N-terminal sAPP-alpha (neuroprotective) and a membrane-bound CTF-alpha. CTF-alpha is broken-down by presenilin-containing gamma-secretase to yield soluble p3 and membrane-bound AICD (nuclear signalling). In the amyloidogenic pathway (plaque-forming), APP is broken down by beta-secretase to yield soluble sAPP-beta and membrane-bound CTF-beta. CTF-beta is broken down by gamma-secretase to yield soluble amyloid-beta and membrane-bound AICD. Amyloid-beta is required for neuronal function, but can aggregate to form amyloid plaques that seem to disrupt brain cells by clogging points of cell-cell contact. The E2 domain is the largest of the conserved domains in the amyloidogenic glycoproteins. The structure of E2 consists of two coiled-coil sub-structures connected through a continuous helix, and bears an unexpected resemblance to the spectrin family of protein structures. E2 can reversibly dimerise in solution, and the dimerisation occurs along the longest dimension of the molecule in an antiparallel orientation, which enables the N-terminal substructure of one monomer to pack against the C-terminal substructure of a second monomer. The high degree of conservation of residues at the putative dimer interface suggests that the E2 dimer observed in the crystal could be physiologically relevant. Heparin sulphate proteoglycans, the putative ligands for the precursor present in extracellular matrix, bind to E2 at a conserved and positively charged site near the dimer interface [].; PDB: 3K6B_A 3K66_A 1TKN_A 3NYL_A 3NYJ_A 3UMH_A 3UMK_A 3UMI_A 3QMK_B 3PMR_B ....
Probab=26.57 E-value=1.8e+02 Score=22.71 Aligned_cols=62 Identities=15% Similarity=0.165 Sum_probs=41.7
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHh--cccHHHHHHHHHHHHHhCCC-CHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLR--MKRYAEAIDVSQTIMKAHPS-YPKVHKQIFEK 115 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk--~kr~~~AI~vc~kVL~~~P~-ypkir~~il~k 115 (122)
-..|..+|..|+.-..|+...|. +.-.-|++ .++-.-.|.-|+.++..+|. -++++..+|..
T Consensus 113 kr~al~~y~~al~~~ppn~~~vl-~~Lk~yiRa~~KDR~Htl~h~~H~~~~dp~~A~~~k~~vl~h 177 (193)
T PF12925_consen 113 KRAALENYTAALQADPPNPHKVL-KALKKYIRAEEKDRQHTLRHFEHLRMVDPEEAAQIKPQVLTH 177 (193)
T ss_dssp HHHHHHHHHHHHTCSS--HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhcCCCChHHHH-HHHHHHHHHHHHhHHHHHHHHHHHHhcCHHHHHHhHHHHHHH
Confidence 78899999999998544444444 44334443 45556678889999999994 46777777654
No 398
>smart00671 SEL1 Sel1-like repeats. These represent a subfamily of TPR (tetratricopeptide repeat) sequences.
Probab=26.42 E-value=1e+02 Score=15.75 Aligned_cols=28 Identities=11% Similarity=0.107 Sum_probs=18.5
Q ss_pred HHHHHHHHHHHhc----ccHHHHHHHHHHHHH
Q psy16144 73 ALHQKLASLLLRM----KRYAEAIDVSQTIMK 100 (122)
Q Consensus 73 ~vgykLA~~ylk~----kr~~~AI~vc~kVL~ 100 (122)
.+-+.||..|+.. ++..+|+..++++-+
T Consensus 2 ~a~~~lg~~~~~G~g~~~d~~~A~~~~~~Aa~ 33 (36)
T smart00671 2 EAQYNLGQMYEYGLGVKKDLEKALEYYKKAAE 33 (36)
T ss_pred HHHHHHHHHHHcCCCCCcCHHHHHHHHHHHHH
Confidence 4567777777754 366777777766543
No 399
>PF10151 DUF2359: Uncharacterised conserved protein (DUF2359); InterPro: IPR019308 This is a 450 amino acid region of a family of proteins conserved from insects to humans. The function is not known.
Probab=26.40 E-value=54 Score=28.87 Aligned_cols=23 Identities=9% Similarity=-0.018 Sum_probs=19.4
Q ss_pred HHHHHHHH-HHhhCcchHHHHhhC
Q psy16144 30 EPLSAGLR-KKQYISSATNIYNLF 52 (122)
Q Consensus 30 d~A~elL~-~l~~nps~~kA~e~l 52 (122)
+|+.+.|- ||..||+|.+.|+.+
T Consensus 167 ~E~~~~li~CLt~d~~c~~~Wr~l 190 (469)
T PF10151_consen 167 KELISILIWCLTQDPDCFKVWRQL 190 (469)
T ss_pred HHHHHHHHHHhcCChHHHHHHHHH
Confidence 45777777 999999999999955
No 400
>KOG0985|consensus
Probab=25.73 E-value=4.8e+02 Score=26.34 Aligned_cols=87 Identities=16% Similarity=0.108 Sum_probs=58.4
Q ss_pred HhcCCCChHHHHHHHHHHhhCcchHHHHhhCHHHHHHHHHHHHHhh-CCCChHHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 22 LYLLPWRWEPLSAGLRKKQYISSATNIYNLFPDDAVVQYTQACRII-GTSDLALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~~l~~nps~~kA~e~ly~dAa~~ye~Aw~l~-~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
+.+..+-|++|-..++-...|-|-.+.. -|-+.+.++|.++. .-+-|+++-.||-+-+..|+-.|||+-|-+
T Consensus 1057 iai~~~LyEEAF~ifkkf~~n~~A~~VL----ie~i~~ldRA~efAe~~n~p~vWsqlakAQL~~~~v~dAieSyik--- 1129 (1666)
T KOG0985|consen 1057 IAIENQLYEEAFAIFKKFDMNVSAIQVL----IENIGSLDRAYEFAERCNEPAVWSQLAKAQLQGGLVKDAIESYIK--- 1129 (1666)
T ss_pred HHhhhhHHHHHHHHHHHhcccHHHHHHH----HHHhhhHHHHHHHHHhhCChHHHHHHHHHHHhcCchHHHHHHHHh---
Confidence 7788888899888887434444322221 23334444444432 124589999999999999999999998755
Q ss_pred hCCCCHHHHHHHHHHHH
Q psy16144 101 AHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 101 ~~P~ypkir~~il~ka~ 117 (122)
.|.|---.++.+.|-
T Consensus 1130 --adDps~y~eVi~~a~ 1144 (1666)
T KOG0985|consen 1130 --ADDPSNYLEVIDVAS 1144 (1666)
T ss_pred --cCCcHHHHHHHHHHH
Confidence 356666677777764
No 401
>PF09796 QCR10: Ubiquinol-cytochrome-c reductase complex subunit (QCR10); InterPro: IPR019182 This entry represents subunit 10 of the cytochrome b-c1 complex (also known as the ubiquinol-cytochrome c reductase complex or complex III). This complex is located on the inner mitochondrial membrane and it couples electron transfer from ubiquinol to cytochrome. Subunit 10 is required for stable association of the iron-sulphur protein with the complex [].
Probab=25.61 E-value=42 Score=21.78 Aligned_cols=15 Identities=20% Similarity=0.516 Sum_probs=12.1
Q ss_pred hCCCCHHHHHHHHHH
Q psy16144 101 AHPSYPKVHKQIFEK 115 (122)
Q Consensus 101 ~~P~ypkir~~il~k 115 (122)
.-..=|++|+|||.|
T Consensus 31 f~~~vPr~q~dil~K 45 (64)
T PF09796_consen 31 FTSGVPRFQRDILQK 45 (64)
T ss_pred HhcCCcHHHHHHHHh
Confidence 445579999999987
No 402
>COG5107 RNA14 Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor [RNA processing and modification]
Probab=24.43 E-value=1.4e+02 Score=27.11 Aligned_cols=55 Identities=5% Similarity=0.094 Sum_probs=46.0
Q ss_pred HHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHH
Q psy16144 59 QYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFE 114 (122)
Q Consensus 59 ~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ 114 (122)
.++.-.+- ||+|....|-|-.-|.-.+.+++--++|++.+.-.|-+|.+|.+-|.
T Consensus 30 rLRerIkd-NPtnI~S~fqLiq~~~tq~s~~~~re~yeq~~~pfp~~~~aw~ly~s 84 (660)
T COG5107 30 RLRERIKD-NPTNILSYFQLIQYLETQESMDAEREMYEQLSSPFPIMEHAWRLYMS 84 (660)
T ss_pred HHHHHhhc-CchhHHHHHHHHHHHhhhhhHHHHHHHHHHhcCCCccccHHHHHHhc
Confidence 34444455 79999999999999999999999999999999999999999877653
No 403
>KOG0530|consensus
Probab=24.11 E-value=4.4e+02 Score=22.22 Aligned_cols=83 Identities=1% Similarity=-0.021 Sum_probs=60.0
Q ss_pred HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHH-H
Q psy16144 22 LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYA-E 90 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~-~ 90 (122)
|+.+.-+.+-|..+.. ++..||.+--.|.|- .=...-.|-......+|.|=.|.--=-+.....|.+. .
T Consensus 52 I~~~~E~S~RAl~LT~d~i~lNpAnYTVW~yRr~iL~~l~~dL~~El~~l~eI~e~npKNYQvWHHRr~ive~l~d~s~r 131 (318)
T KOG0530|consen 52 IIAKNEKSPRALQLTEDAIRLNPANYTVWQYRRVILRHLMSDLNKELEYLDEIIEDNPKNYQVWHHRRVIVELLGDPSFR 131 (318)
T ss_pred HHhccccCHHHHHHHHHHHHhCcccchHHHHHHHHHHHhHHHHHHHHHHHHHHHHhCccchhHHHHHHHHHHHhcCcccc
Confidence 7888889999999999 999999988888854 2223333444444447888777666566777777776 6
Q ss_pred HHHHHHHHHHhCCC
Q psy16144 91 AIDVSQTIMKAHPS 104 (122)
Q Consensus 91 AI~vc~kVL~~~P~ 104 (122)
=++.|+.+|..+..
T Consensus 132 ELef~~~~l~~DaK 145 (318)
T KOG0530|consen 132 ELEFTKLMLDDDAK 145 (318)
T ss_pred hHHHHHHHHhcccc
Confidence 67888888876653
No 404
>PF04121 Nup84_Nup100: Nuclear pore protein 84 / 107 ; InterPro: IPR007252 Nup84p forms a complex with five proteins, including Nup120p, Nup85p, Sec13p, and a Sec13p homolog. This Nup84p complex in conjunction with Sec13-type proteins is required for correct nuclear pore biogenesis [].; GO: 0006810 transport, 0005643 nuclear pore; PDB: 3CQC_A 3CQG_A 3I4R_A 3IKO_I 3JRO_C.
Probab=24.08 E-value=1e+02 Score=27.92 Aligned_cols=26 Identities=27% Similarity=0.183 Sum_probs=21.4
Q ss_pred HHHHHHHHHHHhcccHHHHHHHHHHH
Q psy16144 73 ALHQKLASLLLRMKRYAEAIDVSQTI 98 (122)
Q Consensus 73 ~vgykLA~~ylk~kr~~~AI~vc~kV 98 (122)
...++--|.+++.|++++|++.|++.
T Consensus 134 ~~~~~~i~~llR~G~~~eA~~lc~~~ 159 (697)
T PF04121_consen 134 RALLKYIFELLRAGRIEEAQELCRER 159 (697)
T ss_dssp HHHHHHHHHHHHTT-HHHHHHHHHHT
T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHC
Confidence 45677789999999999999999953
No 405
>KOG3540|consensus
Probab=24.01 E-value=2.1e+02 Score=25.94 Aligned_cols=93 Identities=16% Similarity=0.275 Sum_probs=58.1
Q ss_pred HHHHHHHHHHHH-HhcCCCChHHHHHHHH-HHhhCcchHHHHh--------------------hC---HHHHHHHHHHHH
Q psy16144 10 QIIQTIAEWHFL-LYLLPWRWEPLSAGLR-KKQYISSATNIYN--------------------LF---PDDAVVQYTQAC 64 (122)
Q Consensus 10 ~~~~~~~~~~~L-Lyi~~gk~d~A~elL~-~l~~nps~~kA~e--------------------~l---y~dAa~~ye~Aw 64 (122)
++-+..-||-.. . |++. .+|.. + ..++..+-..+.| ++ =.+|...|..|+
T Consensus 265 rmd~VmkEW~~ae~--qaKn-PKAek--qalnqhFQ~~v~sLEee~a~erqqlvetH~~RV~AmlNdrrR~Ale~ylaAL 339 (615)
T KOG3540|consen 265 RMDKVMKEWEEAET--QAKN-PKAEK--QALNQHFQKTVSSLEEEAARERQQLVETHEARVEAMLNDRRRDALENYLAAL 339 (615)
T ss_pred HHHHHHHHHHHHHh--cccC-chhhH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHH
Confidence 344566778777 5 3333 45655 4 5555555455554 11 788999999999
Q ss_pred HhhCCCChHHHHHHHHHHHhcccHHHH--------HHHHHHHHHhCCCC-HHHHHHHHH
Q psy16144 65 RIIGTSDLALHQKLASLLLRMKRYAEA--------IDVSQTIMKAHPSY-PKVHKQIFE 114 (122)
Q Consensus 65 ~l~~~~~~~vgykLA~~ylk~kr~~~A--------I~vc~kVL~~~P~y-pkir~~il~ 114 (122)
.- .|.-| -.+++-.+||+.| +..|+.||+.+|.- ..+|..|+-
T Consensus 340 qa-~pprp------~~Vl~aLkrYvRAEqKdr~HTlrhyqHv~~vDpkkAaqmk~qV~t 391 (615)
T KOG3540|consen 340 QA-DPPRP------HRVLQALKRYVRAEQKDRMHTLRHYQHVLAVDPKKAAQMKSQVMT 391 (615)
T ss_pred hc-CCCCh------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHH
Confidence 99 56555 3344455566655 56899999999943 445555543
No 406
>PF10776 DUF2600: Protein of unknown function (DUF2600); InterPro: IPR019712 This is a bacterial family of proteins. Some members in the family are annotated as YtpB, however no function is currently known.
Probab=23.86 E-value=1.1e+02 Score=25.96 Aligned_cols=29 Identities=24% Similarity=0.356 Sum_probs=25.5
Q ss_pred HHHHHHHHHHHHhCCCCHHHHHHHHHHHH
Q psy16144 89 AEAIDVSQTIMKAHPSYPKVHKQIFEKAM 117 (122)
Q Consensus 89 ~~AI~vc~kVL~~~P~ypkir~~il~ka~ 117 (122)
..=+..|++++..-|.|+.|+..+++-|.
T Consensus 117 ~~LV~tCq~~l~~lP~y~~V~~~l~~la~ 145 (330)
T PF10776_consen 117 RALVRTCQQVLAQLPSYELVRPYLLELAR 145 (330)
T ss_pred HHHHHHHHHHHHhCCCHHHHHHHHHHHHH
Confidence 44578899999999999999999998775
No 407
>TIGR00985 3a0801s04tom mitochondrial import receptor subunit translocase of outer membrane 20 kDa subunit.
Probab=23.74 E-value=2.7e+02 Score=20.84 Aligned_cols=39 Identities=18% Similarity=0.210 Sum_probs=32.7
Q ss_pred HHHHHHHHhcc-cHHHHHHHHHHHHHhCCCCHHHHHHHHHH
Q psy16144 76 QKLASLLLRMK-RYAEAIDVSQTIMKAHPSYPKVHKQIFEK 115 (122)
Q Consensus 76 ykLA~~ylk~k-r~~~AI~vc~kVL~~~P~ypkir~~il~k 115 (122)
..+|-.+...| +..++...+=++|...| +|.---.|+++
T Consensus 94 V~~GE~L~~~g~~~~ega~hf~nAl~Vc~-qP~~LL~iyq~ 133 (148)
T TIGR00985 94 VQLGEELMAQGTNVDEGAVHFYNALKVYP-QPQQLLSIYQQ 133 (148)
T ss_pred HHHHHHHHhCCCchHHHHHHHHHHHHhCC-CHHHHHHHHHh
Confidence 36788888888 99999999999999999 78776777765
No 408
>COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones]
Probab=23.42 E-value=1e+02 Score=26.48 Aligned_cols=26 Identities=19% Similarity=0.208 Sum_probs=24.5
Q ss_pred HHHHHHHHhcccHHHHHHHHHHHHHh
Q psy16144 76 QKLASLLLRMKRYAEAIDVSQTIMKA 101 (122)
Q Consensus 76 ykLA~~ylk~kr~~~AI~vc~kVL~~ 101 (122)
..||.|..+.+++.+||.+|.++|..
T Consensus 7 le~a~~~v~~~~~~~ai~~yk~iL~k 32 (421)
T COG5159 7 LELANNAVKSNDIEKAIGEYKRILGK 32 (421)
T ss_pred HHHHHHhhhhhhHHHHHHHHHHHhcC
Confidence 78999999999999999999999965
No 409
>KOG0985|consensus
Probab=23.07 E-value=5.8e+02 Score=25.81 Aligned_cols=90 Identities=14% Similarity=0.134 Sum_probs=67.0
Q ss_pred HHHHHHHH--HhcCCCChHHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHH
Q psy16144 14 TIAEWHFL--LYLLPWRWEPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASL 81 (122)
Q Consensus 14 ~~~~~~~L--Lyi~~gk~d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ 81 (122)
+-+-|-.+ -.++.|...+|.+-+= + +++-.|... |+|=++++..|-+..+ -|-+--.|-|+
T Consensus 1103 ~p~vWsqlakAQL~~~~v~dAieSyika-----dDps~y~eVi~~a~~~~~~edLv~yL~MaRkk~~--E~~id~eLi~A 1175 (1666)
T KOG0985|consen 1103 EPAVWSQLAKAQLQGGLVKDAIESYIKA-----DDPSNYLEVIDVASRTGKYEDLVKYLLMARKKVR--EPYIDSELIFA 1175 (1666)
T ss_pred ChHHHHHHHHHHHhcCchHHHHHHHHhc-----CCcHHHHHHHHHHHhcCcHHHHHHHHHHHHHhhc--CccchHHHHHH
Confidence 44567777 7788899999988654 4 344445433 9999999999998854 47888999999
Q ss_pred HHhcccHHHHHHHHHHHHHhCCCCHHHHHHHHHHH
Q psy16144 82 LLRMKRYAEAIDVSQTIMKAHPSYPKVHKQIFEKA 116 (122)
Q Consensus 82 ylk~kr~~~AI~vc~kVL~~~P~ypkir~~il~ka 116 (122)
|-|.+|..|-++ .+ .-||...|.. |=|+|
T Consensus 1176 yAkt~rl~elE~----fi-~gpN~A~i~~-vGdrc 1204 (1666)
T KOG0985|consen 1176 YAKTNRLTELEE----FI-AGPNVANIQQ-VGDRC 1204 (1666)
T ss_pred HHHhchHHHHHH----Hh-cCCCchhHHH-HhHHH
Confidence 999999998444 44 7888888764 44444
No 410
>PF09311 Rab5-bind: Rabaptin-like protein; InterPro: IPR015390 This domain is predominantly found in Rabaptin and allows for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes []. ; PDB: 3NF1_A 3CEQ_B 3EDT_H 1X79_C 1TU3_F.
Probab=23.02 E-value=66 Score=24.24 Aligned_cols=30 Identities=20% Similarity=0.125 Sum_probs=22.7
Q ss_pred HHHHHHHHHHH--HhcCCCChHHHHHHHH-HHh
Q psy16144 11 IIQTIAEWHFL--LYLLPWRWEPLSAGLR-KKQ 40 (122)
Q Consensus 11 ~~~~~~~~~~L--Lyi~~gk~d~A~elL~-~l~ 40 (122)
|-..+..+|.| -|.++|+|+.|+.+++ +|+
T Consensus 136 ~~~rl~tL~nlv~q~~~q~r~evav~~~KqalE 168 (181)
T PF09311_consen 136 IPARLRTLHNLVIQYESQGRYEVAVPLCKQALE 168 (181)
T ss_dssp S-HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 34556677777 8889999999999998 765
No 411
>PF04348 LppC: LppC putative lipoprotein; InterPro: IPR007443 This entry includes several bacterial outer membrane antigens, whose molecular function is unknown.; PDB: 3CKM_A.
Probab=22.18 E-value=29 Score=30.58 Aligned_cols=79 Identities=19% Similarity=0.165 Sum_probs=0.0
Q ss_pred HHHHH---HhcCCCChHHHHHHHH-HH--hhCcchHHHHhhC----------HHHHHHHHH--HHHHhhCCCChHHHHHH
Q psy16144 17 EWHFL---LYLLPWRWEPLSAGLR-KK--QYISSATNIYNLF----------PDDAVVQYT--QACRIIGTSDLALHQKL 78 (122)
Q Consensus 17 ~~~~L---Lyi~~gk~d~A~elL~-~l--~~nps~~kA~e~l----------y~dAa~~ye--~Aw~l~~~~~~~vgykL 78 (122)
.|.++ .+++.|+++.|..++. .- ..+++...-+.++ +.+|...+. .-|.++...-....-..
T Consensus 25 ~~~L~Aa~a~l~~g~~~~A~~ll~~l~~~~L~~~q~~~~~Ll~A~lal~~~~~~~Al~~L~~~~~~~l~~~~~~~~~~l~ 104 (536)
T PF04348_consen 25 QLLLLAARALLQEGDWAQAQALLNQLDPQQLSPSQQARYQLLRARLALAQGDPEQALSLLNAQDLWQLPPEQQARYHQLR 104 (536)
T ss_dssp --------------------------------------------------------------------------------
T ss_pred HHHHHHHHHHHhCCCHHHHHHHHHhcccccCChHHHHHHHHHHHHHHHhcCCHHHHHHHhccCCcccCCHHHHHHHHHHH
Confidence 34455 8999999999999999 54 4555544333322 778887776 56667544444555567
Q ss_pred HHHHHhcccHHHHHHHH
Q psy16144 79 ASLLLRMKRYAEAIDVS 95 (122)
Q Consensus 79 A~~ylk~kr~~~AI~vc 95 (122)
|..|...|+..+|+.-.
T Consensus 105 A~a~~~~~~~l~Aa~~~ 121 (536)
T PF04348_consen 105 AQAYEQQGDPLAAARER 121 (536)
T ss_dssp -----------------
T ss_pred HHHHHhcCCHHHHHHHH
Confidence 88999999988887743
No 412
>KOG1478|consensus
Probab=21.79 E-value=1.2e+02 Score=25.60 Aligned_cols=37 Identities=19% Similarity=0.217 Sum_probs=28.8
Q ss_pred CCChHHHHHHHHHHHhccc-------------HHHHHHHHHHHHHhCCCC
Q psy16144 69 TSDLALHQKLASLLLRMKR-------------YAEAIDVSQTIMKAHPSY 105 (122)
Q Consensus 69 ~~~~~vgykLA~~ylk~kr-------------~~~AI~vc~kVL~~~P~y 105 (122)
..|..+|+.+++=++...+ ..+|.++|.+.++-||+.
T Consensus 10 GanSglGl~i~~RLl~~~De~~~ltl~ltcR~~~kae~vc~~lk~f~p~~ 59 (341)
T KOG1478|consen 10 GANSGLGLAICKRLLAEDDENVRLTLCLTCRNMSKAEAVCAALKAFHPKS 59 (341)
T ss_pred cCCCcccHHHHHHHHhccCCceeEEEEEEeCChhHHHHHHHHHHHhCCCc
Confidence 4566778888777776543 478999999999999954
No 413
>KOG1310|consensus
Probab=21.69 E-value=1.5e+02 Score=27.48 Aligned_cols=58 Identities=9% Similarity=-0.165 Sum_probs=38.7
Q ss_pred HHHHHHHH-HHhhCcchHHHHhhC---------HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHH
Q psy16144 30 EPLSAGLR-KKQYISSATNIYNLF---------PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEA 91 (122)
Q Consensus 30 d~A~elL~-~l~~nps~~kA~e~l---------y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~A 91 (122)
=+|..-+. +++.||+..|||.+| |.+|.. -+|.+ ...+|+=.-.-+++...-++...|
T Consensus 428 ~~AlrDch~Alrln~s~~kah~~la~aL~el~r~~eal~---~~~al-q~~~Ptd~a~~~~v~~l~rDi~aa 495 (758)
T KOG1310|consen 428 YLALRDCHVALRLNPSIQKAHFRLARALNELTRYLEALS---CHWAL-QMSFPTDVARQNFVLCLPRDISAA 495 (758)
T ss_pred HHHHHhHHhhccCChHHHHHHHHHHHHHHHHhhHHHhhh---hHHHH-hhcCchhhhhhhhhhccccchHHH
Confidence 35555566 899999999999977 777744 46666 366664444446666665555543
No 414
>KOG0292|consensus
Probab=21.45 E-value=72 Score=30.93 Aligned_cols=82 Identities=13% Similarity=-0.059 Sum_probs=51.5
Q ss_pred HhcCCCChHHHHHHHH--HHhhCcchHHHHhhC-------HHH------HHHHHHHHHHhh--CCCChHHHHH-------
Q psy16144 22 LYLLPWRWEPLSAGLR--KKQYISSATNIYNLF-------PDD------AVVQYTQACRII--GTSDLALHQK------- 77 (122)
Q Consensus 22 Lyi~~gk~d~A~elL~--~l~~nps~~kA~e~l-------y~d------Aa~~ye~Aw~l~--~~~~~~vgyk------- 77 (122)
-++..|-|+.|..+|. +=-.|=+..+-+..- |-. +.-+|.+.|.=+ +..-|.+|++
T Consensus 915 dhvaAGsf~tA~~lL~dqvgvv~f~p~Kt~fl~iy~~sR~~l~~~~~~~~~~~~~R~~se~~~~~~~P~v~~~l~~l~~k 994 (1202)
T KOG0292|consen 915 DHVAAGSFETAMRLLHDQVGVVNFGPLKTHFLKIYAGSRTYLRATPCLPVSLYPVRNWSETSSKQGLPAVGFKLSQLNKK 994 (1202)
T ss_pred hhhhcCchHHHHHHHHhhhcceecccHHhhhhhhccccceeccCCCCcccccccccccccchhhccCCcccccHHHHHHH
Confidence 4567899999999998 222232323332200 222 334444555443 4556777774
Q ss_pred --HHHHHHhcccHHHHHHHHHHHHHhCC
Q psy16144 78 --LASLLLRMKRYAEAIDVSQTIMKAHP 103 (122)
Q Consensus 78 --LA~~ylk~kr~~~AI~vc~kVL~~~P 103 (122)
.|+-+-+.|+|.+||+.+|.++=.=|
T Consensus 995 l~~gy~ltt~gKf~eAie~Frsii~~i~ 1022 (1202)
T KOG0292|consen 995 LQKGYKLTTEGKFGEAIEKFRSIIYSIP 1022 (1202)
T ss_pred HHHHHhhhccCcHHHHHHHHHHHHhhee
Confidence 46667788999999999999886555
No 415
>COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms]
Probab=21.32 E-value=2.2e+02 Score=24.36 Aligned_cols=44 Identities=16% Similarity=0.141 Sum_probs=40.4
Q ss_pred HHHHHHHHHHHHHhhCCCChHHHHHHHHHHHhcccHHHHHHHHHH
Q psy16144 53 PDDAVVQYTQACRIIGTSDLALHQKLASLLLRMKRYAEAIDVSQT 97 (122)
Q Consensus 53 y~dAa~~ye~Aw~l~~~~~~~vgykLA~~ylk~kr~~~AI~vc~k 97 (122)
|.||+..-+++..+ +|=+.+...-|-..|++.|+--.|+..|.+
T Consensus 295 ~neAi~l~qr~ltl-dpL~e~~nk~lm~~la~~gD~is~~khyer 338 (361)
T COG3947 295 PNEAIQLHQRALTL-DPLSEQDNKGLMASLATLGDEISAIKHYER 338 (361)
T ss_pred hHHHHHHHHHHhhc-ChhhhHHHHHHHHHHHHhccchhhhhHHHH
Confidence 99999999999999 899999999999999999998888877664
No 416
>PF01239 PPTA: Protein prenyltransferase alpha subunit repeat; InterPro: IPR002088 Protein prenylation is the posttranslational attachment of either a farnesyl group or a geranylgeranyl group via a thioether linkage (-C-S-C-) to a cysteine at or near the carboxyl terminus of the protein. Farnesyl and geranylgeranyl groups are polyisoprenes, unsaturated hydrocarbons with a multiple of five carbons; the chain is 15 carbons long in the farnesyl moiety and 20 carbons long in the geranylgeranyl moiety. There are three different protein prenyltransferases in humans: farnesyltransferase (FT) and geranylgeranyltransferase 1 (GGT1) share the same motif (the CaaX box) around the cysteine in their substrates, and are thus called CaaX prenyltransferases, whereas geranylgeranyltransferase 2 (GGT2, also called Rab geranylgeranyltransferase) recognises a different motif and is thus called a non-CaaX prenyltransferase. Protein prenyltransferases are currently known only in eukaryotes, but they are widespread, being found in vertebrates, insects, nematodes, plants, fungi and protozoa, including several parasites. Each protein consists of two subunits, alpha and beta; the alpha subunit of FT and GGT1 is encoded by the same gene, FNTA. The alpha subunit is thought to participate in a stable complex with the isoprenyl substrate; the beta subunit binds the peptide substrate. In the alpha subunits of both types of protein prenyltransferases, seven tetratricopeptide repeats are formed by pairs of helices that are stabilised by conserved intercalating residues. The alpha subunits of GGT2 in mammals and plants also have an immunoglobulin-like domain between the fifth and sixth tetratricopeptide repeat, as well as leucine-rich repeats at the carboxyl terminus. The functions of these additional domains in GGT2 are as yet undefined, but they are apparently not directly involved in the interaction with substrates and Rab escort proteins. The tetratricopeptide repeats of the alpha subunit form a right-handed superhelix, which embraces the (alpha-alpha)6 barrel of the beta subunit []. ; GO: 0008318 protein prenyltransferase activity, 0018342 protein prenylation; PDB: 1S63_A 1LD7_A 1LD8_A 2H6G_A 1SA4_A 1MZC_A 1TN6_A 2F0Y_A 2H6H_A 2H6F_A ....
Probab=20.84 E-value=1.5e+02 Score=15.47 Aligned_cols=19 Identities=5% Similarity=0.256 Sum_probs=15.7
Q ss_pred HHHHHHHHHHhCCCCHHHH
Q psy16144 91 AIDVSQTIMKAHPSYPKVH 109 (122)
Q Consensus 91 AI~vc~kVL~~~P~ypkir 109 (122)
.++.+.+++..+|.+.-++
T Consensus 2 El~~~~~~l~~~pknys~W 20 (31)
T PF01239_consen 2 ELEFTKKALEKDPKNYSAW 20 (31)
T ss_dssp HHHHHHHHHHHSTTCHHHH
T ss_pred HHHHHHHHHHHCcccccHH
Confidence 4778899999999888773
No 417
>KOG2168|consensus
Probab=20.69 E-value=6.1e+02 Score=24.29 Aligned_cols=36 Identities=22% Similarity=0.217 Sum_probs=24.8
Q ss_pred hhHHHHHHHHHH-------HHH--HHhcCCCChHHHHHHHH-HHhh
Q psy16144 6 VDTMQIIQTIAE-------WHF--LLYLLPWRWEPLSAGLR-KKQY 41 (122)
Q Consensus 6 ~~~~~~~~~~~~-------~~~--LLyi~~gk~d~A~elL~-~l~~ 41 (122)
-|+-+|+-+.|. |.. +||-..|+||.|.+++. +|..
T Consensus 619 ~~~~~i~~~vA~~a~~~G~~~~sI~LY~lag~yd~al~link~LS~ 664 (835)
T KOG2168|consen 619 EDLQKIILEVASEADEDGLFEDAILLYHLAGDYDKALELINKLLSQ 664 (835)
T ss_pred hhHHHHHHHHHHHHHhcCCHHHHHHHHHHhhhhhHHHHHHHHHHHH
Confidence 456666655442 222 29999999999999998 7753
No 418
>PF09311 Rab5-bind: Rabaptin-like protein; InterPro: IPR015390 This domain is predominantly found in Rabaptin and allows for binding to the GTPase Rab5. This interaction is necessary and sufficient for Rab5-dependent recruitment of Rabaptin5 to early endosomal membranes []. ; PDB: 3NF1_A 3CEQ_B 3EDT_H 1X79_C 1TU3_F.
Probab=20.38 E-value=1.4e+02 Score=22.47 Aligned_cols=32 Identities=16% Similarity=0.217 Sum_probs=25.4
Q ss_pred CCChHHHHHHHHHHHhcccHHHHHHHHHHHHH
Q psy16144 69 TSDLALHQKLASLLLRMKRYAEAIDVSQTIMK 100 (122)
Q Consensus 69 ~~~~~vgykLA~~ylk~kr~~~AI~vc~kVL~ 100 (122)
|......-+|---|.+.|||..|+..|.+.|.
T Consensus 137 ~~rl~tL~nlv~q~~~q~r~evav~~~KqalE 168 (181)
T PF09311_consen 137 PARLRTLHNLVIQYESQGRYEVAVPLCKQALE 168 (181)
T ss_dssp -HHHHHHHHHHHHHHHTT-HHHHHHHHHHHHH
T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 44455577888899999999999999999884
Done!