Query gi|254780592|ref|YP_003065005.1| TPR repeat-containing protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 271 No_of_seqs 173 out of 8782 Neff 10.1 Searched_HMMs 39220 Date Mon May 30 01:32:28 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780592.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 KOG4626 consensus 99.9 1.9E-26 4.9E-31 179.7 14.8 195 48-242 230-424 (966) 2 KOG4626 consensus 99.9 1E-25 2.5E-30 175.3 15.3 199 49-247 129-361 (966) 3 TIGR02917 PEP_TPR_lipo putativ 99.9 1.2E-24 3E-29 168.8 20.0 232 23-254 14-284 (924) 4 COG5010 TadD Flp pilus assembl 99.9 1.2E-24 3.1E-29 168.7 17.8 245 15-260 4-256 (257) 5 TIGR02917 PEP_TPR_lipo putativ 99.9 4.9E-24 1.2E-28 165.0 19.3 222 42-263 389-612 (924) 6 TIGR02521 type_IV_pilW type IV 99.9 3.2E-24 8.2E-29 166.1 17.3 226 19-261 1-237 (247) 7 KOG1126 consensus 99.9 2.3E-23 5.9E-28 160.8 16.0 216 41-256 358-607 (638) 8 KOG1125 consensus 99.9 4.1E-22 1E-26 153.2 15.0 201 37-237 286-529 (579) 9 PRK11447 cellulose synthase su 99.9 8.1E-21 2.1E-25 145.2 20.5 222 41-262 275-553 (1158) 10 KOG1126 consensus 99.9 1.4E-21 3.6E-26 149.9 16.1 205 46-250 329-567 (638) 11 TIGR02521 type_IV_pilW type IV 99.9 9.5E-22 2.4E-26 150.9 15.0 215 17-231 2-241 (247) 12 COG3063 PilF Tfp pilus assembl 99.9 9E-21 2.3E-25 145.0 19.5 216 17-232 6-233 (250) 13 KOG1155 consensus 99.9 4.3E-19 1.1E-23 134.7 23.3 194 41-234 267-494 (559) 14 KOG0547 consensus 99.9 7.8E-20 2E-24 139.2 17.8 199 43-241 367-572 (606) 15 KOG0547 consensus 99.8 3.3E-19 8.3E-24 135.4 16.9 209 46-254 336-552 (606) 16 KOG1155 consensus 99.8 1.9E-18 4.9E-23 130.7 19.9 217 50-266 241-493 (559) 17 KOG2002 consensus 99.8 2.4E-18 6.2E-23 130.1 16.0 248 3-250 124-386 (1018) 18 COG3063 PilF Tfp pilus assembl 99.8 1.9E-18 4.9E-23 130.7 15.4 194 70-263 35-230 (250) 19 PRK11788 hypothetical protein; 99.8 6.2E-17 1.6E-21 121.5 21.8 205 36-240 35-248 (389) 20 TIGR00990 3a0801s09 mitochondr 99.8 1.7E-18 4.4E-23 131.0 13.0 232 33-264 393-641 (649) 21 KOG1129 consensus 99.8 3.4E-18 8.7E-23 129.2 14.0 219 45-266 232-453 (478) 22 KOG0624 consensus 99.8 1.2E-16 3.1E-21 119.7 21.6 207 35-241 37-258 (504) 23 KOG1173 consensus 99.8 1.1E-17 2.7E-22 126.1 15.8 210 42-251 318-535 (611) 24 TIGR00990 3a0801s09 mitochondr 99.8 1.3E-17 3.3E-22 125.6 15.6 214 47-260 374-597 (649) 25 KOG1125 consensus 99.7 1.6E-16 4.1E-21 118.9 15.2 200 29-228 311-564 (579) 26 KOG2076 consensus 99.7 1.1E-15 2.7E-20 113.9 19.4 225 40-264 143-508 (895) 27 PRK11447 cellulose synthase su 99.7 2.2E-15 5.7E-20 111.9 19.6 131 113-243 279-423 (1158) 28 KOG2002 consensus 99.7 1.6E-15 4E-20 112.8 18.4 229 25-257 192-432 (1018) 29 KOG1129 consensus 99.7 1.4E-16 3.7E-21 119.2 13.0 200 43-242 263-465 (478) 30 KOG1173 consensus 99.7 1.4E-15 3.6E-20 113.1 17.7 205 46-250 288-499 (611) 31 KOG2003 consensus 99.7 1.5E-15 3.9E-20 112.9 15.2 192 48-239 502-693 (840) 32 TIGR03302 OM_YfiO outer membra 99.7 4.9E-14 1.2E-18 103.7 21.0 197 16-232 3-229 (235) 33 KOG1840 consensus 99.7 3.3E-14 8.3E-19 104.8 19.6 218 42-259 205-470 (508) 34 COG2956 Predicted N-acetylgluc 99.6 4.4E-14 1.1E-18 104.0 18.1 201 35-236 34-244 (389) 35 KOG1840 consensus 99.6 1.2E-14 3E-19 107.5 15.1 193 42-234 247-478 (508) 36 KOG0495 consensus 99.6 2E-13 5E-18 100.0 21.1 215 28-243 575-790 (913) 37 KOG2003 consensus 99.6 7.5E-15 1.9E-19 108.7 12.7 212 51-262 471-683 (840) 38 PRK11788 hypothetical protein; 99.6 2E-13 5.2E-18 99.9 19.8 209 48-256 81-299 (389) 39 KOG0495 consensus 99.6 3.9E-13 1E-17 98.1 20.9 214 40-254 554-767 (913) 40 PRK10747 putative protoheme IX 99.6 4.9E-13 1.3E-17 97.5 20.4 217 44-263 126-385 (398) 41 KOG1174 consensus 99.6 3.3E-14 8.4E-19 104.7 14.3 170 87-256 317-488 (564) 42 KOG4162 consensus 99.6 1.1E-13 2.7E-18 101.6 16.9 198 43-240 485-788 (799) 43 PRK12370 invasion protein regu 99.6 2.3E-13 5.9E-18 99.5 18.4 189 47-236 272-470 (553) 44 PRK11189 lipoprotein NlpI; Pro 99.6 2.2E-14 5.5E-19 105.9 13.1 177 55-237 84-268 (297) 45 KOG0550 consensus 99.6 2.3E-14 5.9E-19 105.7 13.2 194 45-238 58-319 (486) 46 KOG0548 consensus 99.6 6.8E-14 1.7E-18 102.8 14.7 205 44-249 232-471 (539) 47 KOG1174 consensus 99.6 1.5E-13 3.7E-18 100.8 16.1 179 56-234 218-396 (564) 48 PRK02603 photosystem I assembl 99.6 3.9E-14 1E-18 104.3 10.9 150 43-212 6-160 (172) 49 KOG2076 consensus 99.6 3.1E-13 8E-18 98.7 15.5 205 31-236 201-513 (895) 50 KOG0548 consensus 99.6 3.1E-13 8E-18 98.7 14.5 177 50-234 312-488 (539) 51 PRK10049 pgaA outer membrane p 99.5 1.5E-12 3.8E-17 94.6 17.8 100 44-144 107-206 (818) 52 COG5010 TadD Flp pilus assembl 99.5 1.2E-12 2.9E-17 95.3 16.1 152 46-197 76-227 (257) 53 KOG1127 consensus 99.5 2E-13 5.2E-18 99.9 11.9 186 49-234 471-658 (1238) 54 KOG1128 consensus 99.5 1.9E-13 5E-18 100.0 11.8 182 46-234 434-615 (777) 55 PRK10866 outer membrane protei 99.5 6.7E-12 1.7E-16 90.6 18.7 197 18-230 8-236 (243) 56 KOG0553 consensus 99.5 1.7E-13 4.4E-18 100.3 9.7 113 43-155 88-200 (304) 57 PRK02603 photosystem I assembl 99.5 1.3E-13 3.3E-18 101.1 9.0 120 120-239 15-153 (172) 58 PRK12370 invasion protein regu 99.5 3.4E-12 8.7E-17 92.4 16.1 199 66-264 252-465 (553) 59 COG3071 HemY Uncharacterized e 99.5 3.9E-11 9.9E-16 85.9 21.4 224 40-263 88-385 (400) 60 PRK10747 putative protoheme IX 99.5 5.2E-12 1.3E-16 91.3 16.9 217 42-261 90-350 (398) 61 PRK10049 pgaA outer membrane p 99.5 2.3E-11 5.9E-16 87.3 19.9 194 36-230 31-224 (818) 62 KOG4162 consensus 99.5 4.9E-11 1.3E-15 85.3 21.1 212 52-263 460-778 (799) 63 KOG1156 consensus 99.5 1.4E-11 3.7E-16 88.6 18.2 201 40-240 11-253 (700) 64 KOG1128 consensus 99.5 2.7E-12 6.9E-17 93.0 14.4 203 46-263 408-611 (777) 65 KOG0624 consensus 99.5 4.9E-12 1.2E-16 91.4 15.4 165 42-206 78-257 (504) 66 KOG1127 consensus 99.5 6.9E-12 1.8E-16 90.5 15.8 208 48-255 431-645 (1238) 67 KOG3060 consensus 99.5 2.2E-11 5.6E-16 87.4 17.9 162 76-237 58-222 (289) 68 KOG0553 consensus 99.5 2.3E-12 5.9E-17 93.4 12.5 117 73-189 84-200 (304) 69 KOG2376 consensus 99.4 7.9E-12 2E-16 90.2 15.0 193 45-241 21-259 (652) 70 COG4783 Putative Zn-dependent 99.4 1.6E-11 4.1E-16 88.2 15.5 135 101-235 303-437 (484) 71 KOG3060 consensus 99.4 1.6E-11 4.1E-16 88.3 15.2 180 41-220 57-239 (289) 72 PRK10370 formate-dependent nit 99.4 3E-12 7.6E-17 92.7 11.3 121 50-170 61-184 (206) 73 cd00189 TPR Tetratricopeptide 99.4 1E-12 2.6E-17 95.6 8.8 98 140-237 2-99 (100) 74 KOG0550 consensus 99.4 2.1E-12 5.5E-17 93.6 10.4 167 72-238 171-353 (486) 75 KOG1156 consensus 99.4 8.7E-12 2.2E-16 89.9 13.4 177 41-217 46-264 (700) 76 COG4783 Putative Zn-dependent 99.4 1.9E-11 4.9E-16 87.8 14.9 156 62-234 298-453 (484) 77 COG2956 Predicted N-acetylgluc 99.4 1.1E-10 2.9E-15 83.0 17.0 212 47-259 80-302 (389) 78 cd00189 TPR Tetratricopeptide 99.4 9.6E-13 2.4E-17 95.8 6.3 97 73-169 3-99 (100) 79 pfam04733 Coatomer_E Coatomer 99.3 8.5E-10 2.2E-14 77.7 19.1 201 51-259 81-286 (290) 80 PRK10370 formate-dependent nit 99.3 4.4E-11 1.1E-15 85.6 11.9 119 87-205 64-185 (206) 81 KOG4340 consensus 99.3 3.6E-10 9.1E-15 80.0 14.1 187 45-233 19-205 (459) 82 COG3071 HemY Uncharacterized e 99.2 1.3E-09 3.3E-14 76.6 16.4 187 47-234 129-389 (400) 83 COG4235 Cytochrome c biogenesi 99.2 1.6E-10 4.1E-15 82.1 11.8 129 120-248 138-270 (287) 84 KOG2053 consensus 99.2 2.7E-09 6.9E-14 74.6 18.0 214 43-258 16-245 (932) 85 CHL00033 ycf3 photosystem I as 99.2 7.7E-11 2E-15 84.1 7.9 76 44-119 39-117 (165) 86 COG4105 ComL DNA uptake lipopr 99.2 6.6E-09 1.7E-13 72.2 17.3 196 27-240 17-237 (254) 87 pfam04733 Coatomer_E Coatomer 99.2 8.1E-09 2.1E-13 71.7 17.4 205 42-254 41-249 (290) 88 PRK11189 lipoprotein NlpI; Pro 99.2 7.5E-10 1.9E-14 78.0 11.5 216 15-234 5-227 (297) 89 PRK10803 hypothetical protein; 99.2 1.3E-08 3.4E-13 70.3 17.9 102 148-249 152-259 (262) 90 COG4235 Cytochrome c biogenesi 99.1 7.4E-10 1.9E-14 78.1 10.5 121 89-209 141-264 (287) 91 KOG4648 consensus 99.1 1.1E-10 2.8E-15 83.1 5.7 201 40-241 101-336 (536) 92 COG4785 NlpI Lipoprotein NlpI, 99.1 2.4E-09 6.2E-14 74.9 12.3 178 53-236 82-267 (297) 93 KOG1915 consensus 99.1 2.3E-08 5.8E-13 69.0 16.2 207 58-265 310-530 (677) 94 KOG1070 consensus 99.1 6E-08 1.5E-12 66.4 17.8 190 53-243 1441-1637(1710) 95 CHL00033 ycf3 photosystem I as 99.1 6.1E-10 1.6E-14 78.6 7.5 120 71-210 32-155 (165) 96 KOG1070 consensus 99.0 4.9E-08 1.2E-12 66.9 16.2 204 32-236 1453-1664(1710) 97 KOG2376 consensus 99.0 3.9E-08 1E-12 67.5 15.4 56 40-98 83-138 (652) 98 KOG3785 consensus 99.0 4E-08 1E-12 67.4 15.3 132 45-177 66-224 (557) 99 COG0457 NrfG FOG: TPR repeat [ 99.0 1.2E-07 3.1E-12 64.4 17.3 198 41-238 64-268 (291) 100 COG1729 Uncharacterized protei 99.0 4E-08 1E-12 67.4 14.7 107 141-247 144-256 (262) 101 TIGR02552 LcrH_SycD type III s 99.0 1.6E-09 4E-14 76.0 6.8 109 95-203 9-117 (137) 102 KOG1915 consensus 99.0 2.8E-07 7.2E-12 62.2 18.2 192 47-240 84-278 (677) 103 TIGR02795 tol_pal_ybgF tol-pal 99.0 8.7E-09 2.2E-13 71.5 10.0 104 139-242 3-112 (119) 104 cd05804 StaR_like StaR_like; a 98.9 1.4E-07 3.6E-12 64.1 15.9 165 39-203 46-217 (355) 105 PRK09782 bacteriophage N4 rece 98.9 8.4E-08 2.1E-12 65.5 14.2 190 45-236 518-707 (987) 106 KOG1130 consensus 98.9 1.1E-08 2.7E-13 70.9 9.1 188 46-233 27-262 (639) 107 TIGR02552 LcrH_SycD type III s 98.9 2.9E-09 7.3E-14 74.5 5.9 93 43-135 25-117 (137) 108 PRK10803 hypothetical protein; 98.9 1.9E-08 4.9E-13 69.4 10.1 97 114-210 152-254 (262) 109 cd05804 StaR_like StaR_like; a 98.9 2E-07 5E-12 63.2 15.2 168 69-237 42-217 (355) 110 KOG0543 consensus 98.9 2.1E-08 5.3E-13 69.2 9.9 101 143-243 213-328 (397) 111 TIGR02795 tol_pal_ybgF tol-pal 98.9 1.4E-08 3.5E-13 70.3 8.9 103 39-141 5-113 (119) 112 COG0457 NrfG FOG: TPR repeat [ 98.8 2.9E-06 7.4E-11 56.1 19.2 192 49-240 36-236 (291) 113 KOG0543 consensus 98.8 7.5E-08 1.9E-12 65.8 10.7 143 73-234 211-354 (397) 114 KOG3785 consensus 98.8 3.3E-07 8.4E-12 61.8 13.1 187 43-237 29-216 (557) 115 COG4700 Uncharacterized protei 98.7 1.3E-06 3.3E-11 58.2 15.3 147 53-233 73-220 (251) 116 TIGR03302 OM_YfiO outer membra 98.7 1.4E-06 3.5E-11 58.0 14.7 166 68-259 31-222 (235) 117 KOG2047 consensus 98.7 3.1E-06 7.8E-11 55.9 15.9 153 49-201 360-540 (835) 118 PRK09782 bacteriophage N4 rece 98.7 6.4E-07 1.6E-11 60.1 11.7 186 40-230 48-271 (987) 119 KOG4648 consensus 98.6 9.6E-08 2.4E-12 65.1 7.1 102 142-243 101-202 (536) 120 pfam09295 ChAPs ChAPs (Chs5p-A 98.6 3E-07 7.5E-12 62.1 9.4 113 49-164 182-294 (395) 121 pfam09295 ChAPs ChAPs (Chs5p-A 98.6 4.4E-07 1.1E-11 61.1 9.3 118 78-198 177-294 (395) 122 KOG2053 consensus 98.6 3.3E-06 8.3E-11 55.7 13.5 199 40-239 47-259 (932) 123 COG1729 Uncharacterized protei 98.6 9.2E-07 2.3E-11 59.1 10.2 104 107-210 144-253 (262) 124 KOG2047 consensus 98.6 3.4E-05 8.6E-10 49.5 18.1 205 46-250 397-630 (835) 125 KOG4340 consensus 98.6 3.5E-06 9E-11 55.5 13.0 157 43-201 51-207 (459) 126 KOG1130 consensus 98.5 1.9E-07 4.8E-12 63.3 5.6 127 106-232 197-341 (639) 127 KOG3081 consensus 98.5 9.7E-06 2.5E-10 52.8 14.2 166 69-241 107-277 (299) 128 COG4700 Uncharacterized protei 98.5 7.5E-06 1.9E-10 53.5 13.5 134 129-263 81-218 (251) 129 KOG3081 consensus 98.5 3.2E-05 8.1E-10 49.7 16.7 179 70-255 72-256 (299) 130 COG4785 NlpI Lipoprotein NlpI, 98.4 1.3E-06 3.3E-11 58.2 7.8 183 19-204 8-199 (297) 131 KOG3617 consensus 98.4 2.1E-05 5.4E-10 50.8 13.8 143 46-198 810-993 (1416) 132 KOG1941 consensus 98.4 8.2E-05 2.1E-09 47.2 16.7 165 70-234 83-274 (518) 133 KOG4234 consensus 98.4 3.1E-06 7.8E-11 55.9 9.1 104 110-213 101-209 (271) 134 KOG1941 consensus 98.4 8E-06 2E-10 53.4 11.1 192 40-231 126-356 (518) 135 COG3118 Thioredoxin domain-con 98.4 2.2E-05 5.6E-10 50.7 13.0 163 69-233 133-299 (304) 136 KOG2610 consensus 98.3 5.1E-05 1.3E-09 48.4 14.6 152 48-199 115-274 (491) 137 KOG0376 consensus 98.3 1E-06 2.6E-11 58.9 5.6 121 38-158 6-128 (476) 138 PRK10866 outer membrane protei 98.3 0.00011 2.8E-09 46.4 15.7 157 68-240 30-209 (243) 139 KOG4234 consensus 98.3 3.9E-06 9.8E-11 55.3 8.0 102 141-242 98-204 (271) 140 KOG2796 consensus 98.2 3.5E-05 8.8E-10 49.4 11.9 168 72-239 124-319 (366) 141 KOG4555 consensus 98.2 8.8E-06 2.2E-10 53.1 8.8 87 48-134 55-145 (175) 142 KOG2796 consensus 98.2 1.3E-05 3.4E-10 52.0 9.5 146 71-216 178-333 (366) 143 COG3898 Uncharacterized membra 98.2 0.00034 8.8E-09 43.3 17.3 157 46-203 130-294 (531) 144 COG3118 Thioredoxin domain-con 98.2 2.9E-05 7.3E-10 49.9 11.1 126 108-234 138-264 (304) 145 KOG2610 consensus 98.2 2.8E-05 7.2E-10 50.0 10.8 182 75-257 108-304 (491) 146 KOG0551 consensus 98.1 3.9E-05 9.8E-10 49.2 9.4 87 44-130 89-179 (390) 147 KOG3617 consensus 98.1 0.00039 9.8E-09 43.0 14.1 147 76-232 806-993 (1416) 148 KOG4555 consensus 98.1 2.4E-05 6.2E-10 50.4 7.9 90 143-232 48-141 (175) 149 KOG0376 consensus 98.0 6.3E-06 1.6E-10 54.0 4.2 109 73-181 7-115 (476) 150 PRK04841 transcriptional regul 98.0 0.00095 2.4E-08 40.6 15.7 57 178-234 697-759 (903) 151 KOG4642 consensus 98.0 7.1E-06 1.8E-10 53.7 4.2 88 45-132 19-106 (284) 152 KOG1914 consensus 98.0 0.0011 2.7E-08 40.3 15.8 197 58-254 267-484 (656) 153 KOG1308 consensus 98.0 1.8E-06 4.5E-11 57.3 0.8 94 43-136 121-214 (377) 154 KOG0545 consensus 98.0 0.00015 3.9E-09 45.5 10.6 103 141-243 181-301 (329) 155 COG2976 Uncharacterized protei 98.0 0.00016 4.1E-09 45.4 10.6 93 109-202 94-189 (207) 156 KOG2471 consensus 97.9 4E-05 1E-09 49.1 6.1 26 209-234 622-647 (696) 157 PRK06330 transcript cleavage f 97.8 0.002 5.2E-08 38.6 16.2 120 47-168 43-180 (906) 158 KOG1585 consensus 97.8 0.0021 5.5E-08 38.5 14.4 189 42-231 37-252 (308) 159 COG3898 Uncharacterized membra 97.8 0.0022 5.7E-08 38.4 21.3 195 48-245 96-301 (531) 160 PRK10153 DNA-binding transcrip 97.8 0.0012 3E-08 40.1 12.4 81 169-250 409-489 (512) 161 PRK11906 transcriptional regul 97.8 8.6E-05 2.2E-09 47.0 6.3 145 87-231 275-432 (458) 162 COG3107 LppC Putative lipoprot 97.7 0.0021 5.2E-08 38.6 13.3 137 16-157 13-155 (604) 163 pfam08424 DUF1740 Protein of u 97.7 0.00082 2.1E-08 41.0 11.2 89 56-144 5-108 (231) 164 KOG0545 consensus 97.7 0.00038 9.6E-09 43.1 9.4 106 104-209 178-301 (329) 165 COG2976 Uncharacterized protei 97.7 0.0022 5.6E-08 38.4 13.1 129 40-170 57-191 (207) 166 KOG4642 consensus 97.7 6.9E-05 1.8E-09 47.6 5.3 92 76-167 16-107 (284) 167 PRK06330 transcript cleavage f 97.7 0.0037 9.4E-08 37.0 17.6 158 59-219 107-271 (906) 168 PRK04841 transcriptional regul 97.6 0.004 1E-07 36.8 16.3 84 47-130 463-557 (903) 169 pfam06552 TOM20_plant Plant sp 97.6 0.00025 6.3E-09 44.2 7.1 87 52-138 7-114 (186) 170 PRK11906 transcriptional regul 97.6 0.00052 1.3E-08 42.2 8.4 128 120-247 274-413 (458) 171 PRK10153 DNA-binding transcrip 97.6 0.0047 1.2E-07 36.4 14.5 108 155-262 351-468 (512) 172 COG4105 ComL DNA uptake lipopr 97.6 0.0048 1.2E-07 36.3 14.9 144 69-238 33-199 (254) 173 KOG0530 consensus 97.6 0.0023 5.8E-08 38.3 11.5 172 49-220 56-236 (318) 174 pfam02259 FAT FAT domain. The 97.6 0.0014 3.6E-08 39.6 10.3 33 186-218 272-304 (351) 175 KOG1308 consensus 97.5 0.00021 5.4E-09 44.6 5.3 86 82-167 126-211 (377) 176 KOG1585 consensus 97.5 0.0067 1.7E-07 35.4 13.0 162 72-233 33-217 (308) 177 KOG0530 consensus 97.5 0.0014 3.7E-08 39.5 9.2 167 50-216 92-269 (318) 178 pfam03704 BTAD Bacterial trans 97.5 0.0017 4.3E-08 39.1 9.4 61 172-232 62-122 (146) 179 KOG1258 consensus 97.4 0.0082 2.1E-07 34.9 13.4 173 48-220 309-489 (577) 180 pfam10300 IML2 Putative mitoch 97.4 0.0047 1.2E-07 36.3 11.0 105 118-222 230-339 (446) 181 PRK10941 putative transcriptio 97.3 0.0029 7.4E-08 37.7 9.3 81 173-253 182-262 (269) 182 KOG0551 consensus 97.3 0.0013 3.3E-08 39.8 7.4 106 137-242 80-189 (390) 183 KOG1586 consensus 97.3 0.012 3E-07 34.0 14.8 161 49-210 47-233 (288) 184 KOG3824 consensus 97.3 0.00072 1.8E-08 41.4 5.7 66 48-113 128-193 (472) 185 COG4649 Uncharacterized protei 97.2 0.013 3.4E-07 33.6 11.7 93 35-127 57-155 (221) 186 KOG1914 consensus 97.2 0.013 3.2E-07 33.7 11.5 146 86-231 347-497 (656) 187 KOG2471 consensus 97.2 0.0038 9.7E-08 36.9 8.7 43 173-215 336-378 (696) 188 KOG4507 consensus 97.2 0.004 1E-07 36.8 8.6 110 130-239 205-316 (886) 189 KOG4507 consensus 97.1 0.0069 1.8E-07 35.3 9.7 46 110-155 255-300 (886) 190 pfam03704 BTAD Bacterial trans 97.1 0.0038 9.6E-08 37.0 8.3 114 73-198 9-122 (146) 191 pfam06552 TOM20_plant Plant sp 97.1 0.0013 3.2E-08 39.9 5.6 104 85-212 6-120 (186) 192 KOG3824 consensus 97.1 0.0018 4.6E-08 38.9 6.3 144 2-146 35-192 (472) 193 KOG1550 consensus 97.1 0.019 4.9E-07 32.6 15.7 173 51-231 227-422 (552) 194 COG3914 Spy Predicted O-linked 97.1 0.019 4.8E-07 32.7 11.3 98 47-144 78-182 (620) 195 pfam10300 IML2 Putative mitoch 97.0 0.02 5.2E-07 32.5 12.1 107 49-155 229-340 (446) 196 PRK10941 putative transcriptio 97.0 0.0036 9.2E-08 37.1 6.9 77 139-215 182-258 (269) 197 COG3914 Spy Predicted O-linked 96.9 0.01 2.5E-07 34.4 9.0 143 44-186 38-190 (620) 198 pfam04910 DUF654 Protein of un 96.9 0.027 6.9E-07 31.7 11.7 148 58-205 28-209 (343) 199 KOG2396 consensus 96.9 0.0099 2.5E-07 34.4 8.5 62 52-113 121-183 (568) 200 pfam05843 Suf Suppressor of fo 96.8 0.032 8.1E-07 31.3 11.1 81 52-132 17-98 (275) 201 COG2909 MalT ATP-dependent tra 96.8 0.033 8.3E-07 31.2 15.1 154 46-199 468-645 (894) 202 COG2909 MalT ATP-dependent tra 96.8 0.033 8.4E-07 31.2 18.4 195 39-233 418-645 (894) 203 pfam05843 Suf Suppressor of fo 96.8 0.031 8E-07 31.3 10.3 115 25-139 22-142 (275) 204 COG0790 FOG: TPR repeat, SEL1 96.7 0.039 9.9E-07 30.7 14.7 166 46-220 51-236 (292) 205 KOG1258 consensus 96.7 0.039 1E-06 30.7 11.4 120 107-226 300-420 (577) 206 KOG3616 consensus 96.6 0.041 1E-06 30.6 10.7 22 176-197 769-790 (1636) 207 KOG0985 consensus 96.6 0.046 1.2E-06 30.3 16.5 167 68-240 1102-1313(1666) 208 pfam08424 DUF1740 Protein of u 96.5 0.049 1.3E-06 30.1 10.6 138 53-207 50-192 (231) 209 pfam02259 FAT FAT domain. The 96.5 0.05 1.3E-06 30.1 16.6 109 135-243 147-295 (351) 210 COG0790 FOG: TPR repeat, SEL1 96.5 0.053 1.3E-06 29.9 12.8 148 81-234 52-219 (292) 211 KOG1839 consensus 96.5 0.048 1.2E-06 30.2 9.7 155 76-230 938-1123(1236) 212 KOG1839 consensus 96.4 0.047 1.2E-06 30.2 9.6 102 133-234 968-1085(1236) 213 pfam00515 TPR_1 Tetratricopept 96.3 0.0034 8.5E-08 37.3 3.0 31 72-102 3-33 (34) 214 TIGR02561 HrpB1_HrpK type III 96.2 0.06 1.5E-06 29.6 9.0 102 152-255 30-135 (190) 215 KOG2396 consensus 96.1 0.03 7.8E-07 31.4 7.2 93 54-146 89-182 (568) 216 pfam00515 TPR_1 Tetratricopept 96.1 0.0049 1.3E-07 36.3 3.1 31 174-204 3-33 (34) 217 pfam04053 Coatomer_WDAD Coatom 96.1 0.084 2.1E-06 28.7 14.4 158 44-232 263-420 (435) 218 COG4976 Predicted methyltransf 96.0 0.0072 1.8E-07 35.2 3.6 61 46-106 5-65 (287) 219 smart00299 CLH Clathrin heavy 96.0 0.094 2.4E-06 28.4 10.4 130 70-218 7-137 (140) 220 KOG2300 consensus 96.0 0.096 2.5E-06 28.3 16.0 188 47-234 286-513 (629) 221 KOG0985 consensus 95.9 0.1 2.6E-06 28.2 13.0 175 38-218 1106-1325(1666) 222 KOG1310 consensus 95.8 0.022 5.7E-07 32.2 5.5 93 46-138 384-479 (758) 223 KOG1586 consensus 95.8 0.11 2.9E-06 27.9 14.5 186 50-250 28-242 (288) 224 KOG1310 consensus 95.8 0.023 5.8E-07 32.2 5.3 85 85-169 389-476 (758) 225 pfam04348 LppC LppC putative l 95.8 0.12 3E-06 27.8 13.6 84 77-160 31-120 (535) 226 pfam04053 Coatomer_WDAD Coatom 95.7 0.13 3.2E-06 27.6 11.4 129 79-233 264-392 (435) 227 smart00028 TPR Tetratricopepti 95.6 0.0097 2.5E-07 34.4 3.0 31 72-102 3-33 (34) 228 KOG1550 consensus 95.6 0.14 3.5E-06 27.4 11.1 115 50-168 263-394 (552) 229 COG5107 RNA14 Pre-mRNA 3'-end 95.5 0.15 3.8E-06 27.1 10.2 174 58-234 290-494 (660) 230 KOG2041 consensus 95.5 0.15 3.9E-06 27.1 13.6 13 69-81 691-703 (1189) 231 COG4259 Uncharacterized protei 95.4 0.16 4E-06 27.0 9.1 94 12-105 4-107 (121) 232 smart00028 TPR Tetratricopepti 95.3 0.017 4.2E-07 33.0 3.1 31 174-204 3-33 (34) 233 COG5191 Uncharacterized conser 95.2 0.043 1.1E-06 30.5 5.0 65 49-113 120-185 (435) 234 KOG3364 consensus 95.1 0.14 3.6E-06 27.3 7.5 79 173-251 33-116 (149) 235 COG5107 RNA14 Pre-mRNA 3'-end 95.1 0.2 5.1E-06 26.4 13.7 75 58-132 30-104 (660) 236 COG4649 Uncharacterized protei 95.0 0.21 5.3E-06 26.3 10.4 97 145-241 101-202 (221) 237 pfam07719 TPR_2 Tetratricopept 95.0 0.013 3.4E-07 33.6 2.0 31 72-102 3-33 (34) 238 pfam04781 DUF627 Protein of un 95.0 0.16 4.2E-06 26.9 7.5 27 77-103 3-29 (112) 239 pfam04781 DUF627 Protein of un 94.9 0.093 2.4E-06 28.4 6.1 47 190-236 62-108 (112) 240 pfam09613 HrpB1_HrpK Bacterial 94.9 0.23 5.8E-06 26.0 10.0 72 49-120 23-94 (156) 241 pfam09613 HrpB1_HrpK Bacterial 94.7 0.13 3.3E-06 27.6 6.4 102 151-253 23-124 (156) 242 pfam07719 TPR_2 Tetratricopept 94.7 0.024 6E-07 32.1 2.6 31 174-204 3-33 (34) 243 COG5191 Uncharacterized conser 94.7 0.069 1.8E-06 29.2 4.9 88 59-146 96-184 (435) 244 COG4976 Predicted methyltransf 94.7 0.051 1.3E-06 30.0 4.3 65 77-141 2-66 (287) 245 KOG3616 consensus 94.5 0.29 7.4E-06 25.4 11.1 14 179-192 713-726 (1636) 246 COG2912 Uncharacterized conser 94.0 0.36 9.2E-06 24.8 8.5 80 172-251 181-260 (269) 247 COG3629 DnrI DNA-binding trans 93.9 0.38 9.6E-06 24.7 10.9 79 170-248 151-236 (280) 248 pfam04184 ST7 ST7 protein. The 93.5 0.44 1.1E-05 24.3 11.9 153 84-249 182-339 (540) 249 KOG0546 consensus 93.3 0.17 4.3E-06 26.8 4.8 73 171-243 274-346 (372) 250 KOG0276 consensus 93.3 0.48 1.2E-05 24.1 7.1 48 114-166 647-694 (794) 251 KOG4814 consensus 93.1 0.52 1.3E-05 23.9 8.7 87 75-161 359-451 (872) 252 TIGR00540 hemY_coli hemY prote 92.7 0.59 1.5E-05 23.5 17.0 79 184-262 311-393 (409) 253 pfam11207 DUF2989 Protein of u 92.4 0.65 1.6E-05 23.3 10.1 77 148-226 117-198 (203) 254 KOG3783 consensus 92.2 0.68 1.7E-05 23.1 12.9 90 53-144 250-343 (546) 255 KOG1464 consensus 92.1 0.7 1.8E-05 23.0 12.6 52 47-98 38-93 (440) 256 KOG1538 consensus 91.8 0.75 1.9E-05 22.9 13.8 23 74-96 636-658 (1081) 257 pfam09797 NatB_MDM20 N-acetylt 91.5 0.8 2E-05 22.7 7.7 47 185-231 192-238 (365) 258 PRK13184 pknD serine/threonine 91.2 0.87 2.2E-05 22.5 8.0 94 46-140 485-588 (933) 259 KOG0529 consensus 91.1 0.89 2.3E-05 22.4 10.9 129 49-177 88-234 (421) 260 pfam00637 Clathrin Region in C 91.1 0.89 2.3E-05 22.4 9.5 39 79-117 16-55 (139) 261 pfam10255 Paf67 RNA polymerase 90.8 0.94 2.4E-05 22.3 7.8 96 69-166 74-192 (402) 262 pfam09670 Cas_Cas02710 CRISPR- 90.6 0.99 2.5E-05 22.1 10.4 58 43-100 138-199 (379) 263 COG3014 Uncharacterized protei 90.5 1 2.6E-05 22.1 13.5 74 18-98 8-86 (449) 264 pfam09986 DUF2225 Uncharacteri 90.2 1.1 2.7E-05 21.9 9.3 64 175-238 121-197 (214) 265 pfam10602 RPN7 26S proteasome 90.1 1.1 2.8E-05 21.9 11.3 96 71-166 37-141 (174) 266 COG3947 Response regulator con 90.0 0.53 1.3E-05 23.8 4.2 56 176-231 283-338 (361) 267 PRK11619 lytic murein transgly 89.7 1.2 3E-05 21.7 21.0 13 42-54 39-51 (645) 268 KOG2063 consensus 89.2 1.3 3.2E-05 21.5 14.2 162 73-234 507-712 (877) 269 COG1747 Uncharacterized N-term 89.0 1.3 3.4E-05 21.4 18.8 41 56-97 118-158 (711) 270 COG4455 ImpE Protein of avirul 88.4 1.4 3.7E-05 21.1 6.7 64 42-105 7-70 (273) 271 KOG2422 consensus 88.4 1.5 3.7E-05 21.1 12.0 154 49-203 251-450 (665) 272 pfam11846 DUF3366 Domain of un 87.7 1.6 4.1E-05 20.9 6.5 44 193-237 132-175 (193) 273 PRK10802 peptidoglycan-associa 85.9 0.82 2.1E-05 22.6 3.1 27 11-37 3-29 (173) 274 KOG0890 consensus 85.8 2 5.2E-05 20.2 16.3 60 43-102 1456-1515(2382) 275 KOG0128 consensus 83.1 2.7 6.8E-05 19.5 12.4 204 56-259 298-554 (881) 276 PRK10598 hypothetical protein; 82.4 1.6 4.1E-05 20.8 3.4 38 12-49 1-38 (186) 277 TIGR01004 PulS_OutS lipoprotei 81.1 3.2 8.1E-05 19.0 7.5 73 10-84 4-87 (136) 278 KOG1920 consensus 79.1 3.7 9.4E-05 18.6 17.3 57 178-234 958-1027(1265) 279 pfam06474 MLTD_N MLTD_N. 78.8 0.48 1.2E-05 24.0 -0.3 24 17-40 3-26 (93) 280 PRK09967 putative outer membra 78.2 2.8 7E-05 19.4 3.4 27 14-40 3-29 (160) 281 KOG4056 consensus 77.8 4 0.0001 18.4 5.4 38 73-110 84-121 (143) 282 KOG3807 consensus 76.9 4.3 0.00011 18.3 12.2 137 82-231 196-336 (556) 283 KOG2114 consensus 75.8 4.6 0.00012 18.1 10.8 61 37-98 334-396 (933) 284 pfam10345 Cohesin_load Cohesin 74.9 4.8 0.00012 17.9 18.7 65 178-242 362-435 (593) 285 COG5187 RPN7 26S proteasome re 74.4 5 0.00013 17.8 12.3 27 105-131 116-142 (412) 286 PRK11627 hypothetical protein; 74.2 2.9 7.4E-05 19.3 2.7 23 12-36 2-24 (192) 287 COG5159 RPN6 26S proteasome re 73.6 5.2 0.00013 17.7 16.3 191 40-230 7-270 (421) 288 KOG4279 consensus 73.6 5.2 0.00013 17.7 6.5 117 69-187 200-335 (1226) 289 cd07625 BAR_Vps17p The Bin/Amp 73.5 5.2 0.00013 17.7 9.9 127 71-202 46-173 (230) 290 PRK00022 lolB outer membrane l 72.4 3.2 8.3E-05 19.0 2.6 19 18-36 6-24 (203) 291 pfam11359 gpUL132 Glycoprotein 70.2 2.5 6.3E-05 19.7 1.6 32 2-36 53-84 (235) 292 pfam06957 COPI_C Coatomer (COP 69.5 6.4 0.00016 17.2 6.1 24 42-65 123-146 (421) 293 pfam07720 TPR_3 Tetratricopept 68.5 5 0.00013 17.8 2.8 25 73-97 4-28 (34) 294 KOG1463 consensus 68.1 6.9 0.00018 17.0 15.8 40 198-237 277-318 (411) 295 pfam04190 DUF410 Protein of un 67.8 7 0.00018 16.9 11.9 26 68-93 88-113 (260) 296 TIGR03659 IsdE heme ABC transp 67.2 5.4 0.00014 17.6 2.8 23 14-36 3-25 (289) 297 KOG0687 consensus 66.5 7.4 0.00019 16.8 9.2 114 56-170 84-213 (393) 298 TIGR03362 VI_chp_7 type VI sec 66.3 7.5 0.00019 16.8 13.7 62 172-233 213-277 (301) 299 COG3317 NlpB Uncharacterized l 66.2 5.4 0.00014 17.6 2.7 29 11-41 3-31 (342) 300 cd02684 MIT_2 MIT: domain cont 65.9 7.6 0.00019 16.7 3.3 14 53-66 4-17 (75) 301 PRK11063 metQ DL-methionine tr 65.3 7.1 0.00018 16.9 3.1 25 10-34 3-27 (271) 302 smart00386 HAT HAT (Half-A-TPR 65.1 7.9 0.0002 16.6 3.6 28 51-78 2-29 (33) 303 PRK09915 putative outer membra 64.8 7.1 0.00018 16.9 3.0 24 12-35 6-29 (488) 304 KOG4151 consensus 64.6 8 0.00021 16.6 7.2 95 45-139 62-162 (748) 305 PRK13731 conjugal transfer sur 64.3 3.7 9.5E-05 18.6 1.5 24 10-33 1-24 (243) 306 KOG2758 consensus 63.9 8.3 0.00021 16.5 11.1 176 57-232 116-314 (432) 307 smart00745 MIT Microtubule Int 63.2 8.5 0.00022 16.4 3.2 13 84-96 22-34 (77) 308 PRK11443 hypothetical protein; 62.7 5 0.00013 17.8 1.9 26 17-42 3-28 (120) 309 cd02679 MIT_spastin MIT: domai 62.2 8.2 0.00021 16.5 3.0 14 84-97 22-35 (79) 310 COG3056 Uncharacterized lipopr 62.1 8.9 0.00023 16.3 4.2 25 12-36 13-37 (204) 311 pfam10579 Rapsyn_N Rapsyn N-te 61.5 9.2 0.00023 16.2 5.8 20 46-65 16-35 (80) 312 TIGR03516 ppisom_GldI peptidyl 61.4 9.2 0.00023 16.2 5.8 51 18-68 5-61 (177) 313 PRK04405 prsA peptidylprolyl i 61.2 9.3 0.00024 16.2 3.4 25 11-35 4-29 (298) 314 PRK10449 heat-inducible protei 61.0 7.1 0.00018 16.9 2.5 24 12-37 1-24 (140) 315 PRK12450 foldase protein PrsA; 60.3 9.6 0.00025 16.1 3.5 22 12-33 5-26 (309) 316 PRK10783 mltD membrane-bound l 60.1 1.7 4.3E-05 20.7 -0.8 23 16-38 2-24 (449) 317 pfam07273 DUF1439 Protein of u 59.7 9.9 0.00025 16.0 3.4 29 20-48 8-36 (177) 318 pfam07721 TPR_4 Tetratricopept 59.6 8.7 0.00022 16.3 2.7 21 209-229 4-24 (26) 319 cd02683 MIT_1 MIT: domain cont 59.2 9.7 0.00025 16.1 2.9 14 83-96 19-32 (77) 320 pfam10516 SHNi-TPR SHNi-TPR. S 58.9 6.8 0.00017 17.0 2.1 25 209-233 4-28 (38) 321 PRK12700 flgH flagellar basal 58.9 10 0.00026 16.0 2.9 18 18-35 10-27 (230) 322 pfam02402 Lysis_col Lysis prot 58.7 2.9 7.4E-05 19.3 0.2 22 13-34 2-23 (46) 323 pfam04212 MIT MIT (microtubule 58.7 10 0.00026 15.9 3.0 11 220-230 19-29 (69) 324 PRK09859 multidrug efflux syst 58.6 6.6 0.00017 17.1 2.0 21 13-33 4-24 (385) 325 PRK03002 prsA peptidylprolyl i 58.6 8.4 0.00021 16.5 2.5 22 12-33 3-24 (285) 326 PRK03095 prsA peptidylprolyl i 58.5 7.9 0.0002 16.6 2.3 22 13-34 2-23 (287) 327 COG3065 Slp Starvation-inducib 58.4 10 0.00025 16.0 2.8 24 13-36 6-29 (191) 328 PRK10510 putative outer membra 58.2 7.7 0.0002 16.7 2.2 25 13-37 4-28 (219) 329 TIGR02722 lp_ uncharacterized 58.1 8.6 0.00022 16.4 2.5 18 16-33 5-22 (215) 330 TIGR03352 VI_chp_3 type VI sec 58.0 7.3 0.00019 16.8 2.1 20 18-37 4-23 (146) 331 KOG2581 consensus 56.4 11 0.00029 15.7 14.0 61 144-204 215-279 (493) 332 PRK12697 flgH flagellar basal 56.1 7.7 0.0002 16.7 2.0 24 12-35 6-29 (227) 333 pfam05590 DUF769 Xylella fasti 55.5 4.3 0.00011 18.2 0.6 23 15-37 6-28 (279) 334 PRK11023 hypothetical protein; 55.5 6.6 0.00017 17.1 1.5 21 12-32 1-21 (191) 335 PRK13883 conjugal transfer pro 55.4 11 0.00029 15.6 2.8 24 13-38 2-25 (156) 336 PRK12701 flgH flagellar basal 54.8 9.7 0.00025 16.1 2.3 21 14-34 3-23 (230) 337 pfam07219 HemY_N HemY protein 54.5 12 0.00031 15.5 8.4 29 76-104 91-119 (134) 338 pfam07901 DUF1672 Protein of u 54.4 7.7 0.0002 16.7 1.7 22 16-37 4-25 (304) 339 TIGR00752 slp outer membrane l 53.8 7.4 0.00019 16.8 1.6 25 14-38 2-26 (190) 340 KOG1497 consensus 53.2 13 0.00032 15.4 12.0 57 73-130 106-170 (399) 341 PRK01326 prsA foldase protein 52.8 13 0.00033 15.3 2.7 22 13-34 4-25 (310) 342 PRK01622 OxaA-like protein pre 51.5 13 0.00034 15.2 4.4 24 13-36 6-29 (266) 343 PRK00249 flgH flagellar basal 51.1 11 0.00029 15.6 2.2 22 15-36 4-25 (231) 344 TIGR02269 TIGR02269 Myxococcus 50.0 11 0.00029 15.6 2.0 24 18-41 2-25 (228) 345 pfam07079 DUF1347 Protein of u 49.3 15 0.00037 15.0 16.3 70 179-250 469-538 (550) 346 TIGR02887 spore_ger_x_C germin 48.8 11 0.00027 15.8 1.7 21 14-34 2-22 (400) 347 COG4941 Predicted RNA polymera 47.7 15 0.00039 14.8 15.7 132 118-250 270-409 (415) 348 TIGR02747 TraV type IV conjuga 47.4 9.8 0.00025 16.0 1.4 18 15-32 4-22 (174) 349 TIGR02508 type_III_yscG type I 47.4 16 0.0004 14.8 5.4 94 141-237 9-103 (118) 350 PRK10760 murein hydrolase B; P 47.1 16 0.0004 14.8 4.7 20 18-37 5-24 (357) 351 TIGR03525 GldK gliding motilit 46.9 12 0.00031 15.4 1.8 21 12-34 1-21 (449) 352 COG5633 Predicted periplasmic 46.9 8.6 0.00022 16.4 1.0 21 20-40 7-27 (123) 353 PRK02998 prsA peptidylprolyl i 46.8 16 0.00041 14.7 3.0 20 15-34 7-26 (283) 354 pfam12583 TPPII_N Tripeptidyl 46.6 16 0.00041 14.7 5.2 21 87-107 87-107 (134) 355 pfam03304 Mlp Mlp lipoprotein 46.4 9 0.00023 16.3 1.0 22 13-34 1-22 (150) 356 pfam00938 Lipoprotein_3 Lipopr 46.3 16 0.00041 14.7 3.7 56 19-74 2-64 (101) 357 PRK12698 flgH flagellar basal 45.7 15 0.00038 14.9 2.1 19 18-36 4-22 (224) 358 COG2015 Alkyl sulfatase and re 45.6 17 0.00042 14.6 4.3 51 175-225 455-505 (655) 359 PRK09578 periplasmic multidrug 45.5 17 0.00042 14.6 3.0 21 13-33 6-26 (385) 360 PRK09837 copper/silver efflux 45.4 16 0.0004 14.8 2.2 20 18-37 6-25 (460) 361 pfam06085 Rz1 Lipoprotein Rz1 44.7 13 0.00034 15.2 1.7 24 14-37 3-27 (59) 362 PRK02463 OxaA-like protein pre 44.1 18 0.00045 14.5 3.5 27 13-39 7-33 (307) 363 COG5536 BET4 Protein prenyltra 44.0 18 0.00045 14.5 10.6 128 52-179 90-234 (328) 364 PRK12788 flgH flagellar basal 43.9 15 0.00037 15.0 1.8 19 16-34 2-20 (231) 365 COG4669 EscJ Type III secretor 42.1 19 0.00048 14.3 3.8 27 17-43 7-33 (246) 366 COG3417 FlgN Collagen-binding 41.8 19 0.00049 14.3 2.9 17 17-33 7-23 (200) 367 KOG0567 consensus 41.7 19 0.00049 14.3 14.5 183 46-234 59-261 (289) 368 COG4764 Uncharacterized protei 41.6 19 0.00049 14.2 2.5 78 18-101 8-88 (197) 369 pfam10938 YfdX YfdX protein. Y 41.6 19 0.00049 14.2 7.7 62 173-234 76-145 (155) 370 PRK02944 OxaA-like protein pre 41.3 19 0.00049 14.2 2.7 22 13-34 4-25 (255) 371 TIGR03504 FimV_Cterm FimV C-te 40.8 20 0.0005 14.2 4.6 23 211-233 4-26 (44) 372 PRK11548 hypothetical protein; 40.8 20 0.0005 14.2 4.3 22 12-34 3-24 (113) 373 PRK11679 lipoprotein; Provisio 40.6 20 0.00051 14.1 2.4 21 14-34 9-29 (345) 374 PRK12696 flgH flagellar basal 40.2 20 0.00051 14.1 2.5 20 17-36 5-24 (238) 375 pfam10368 YkyA Putative cell-w 40.2 20 0.00051 14.1 5.7 16 21-36 2-17 (205) 376 pfam08139 LPAM_1 Prokaryotic m 39.7 21 0.00052 14.1 2.2 18 18-35 5-22 (26) 377 TIGR02996 rpt_mate_G_obs repea 39.4 19 0.00048 14.3 1.8 30 58-87 4-33 (48) 378 pfam11153 DUF2931 Protein of u 39.0 20 0.00051 14.1 1.9 16 17-32 3-18 (202) 379 COG3017 LolB Outer membrane li 38.6 21 0.00054 14.0 2.9 24 10-33 2-25 (206) 380 PHA00407 phage lambda Rz1-like 37.3 22 0.00057 13.8 2.8 24 13-36 32-55 (84) 381 PRK12407 flgH flagellar basal 37.1 22 0.00056 13.9 1.9 17 18-34 4-20 (220) 382 pfam06604 consensus 36.6 20 0.0005 14.2 1.5 22 12-33 3-24 (181) 383 PRK10796 LPS-assembly lipoprot 36.2 23 0.00059 13.7 1.9 29 13-41 2-30 (196) 384 COG5461 Type IV pili component 35.9 24 0.0006 13.7 2.4 24 14-37 7-30 (224) 385 pfam07980 SusD SusD family. Th 35.6 24 0.00061 13.7 3.3 29 70-98 83-111 (211) 386 PRK13616 lipoprotein LpqB; Pro 35.6 24 0.00061 13.7 3.5 25 16-40 8-32 (590) 387 pfam08631 SPO22 Meiosis protei 35.4 24 0.00061 13.6 17.7 123 47-169 4-153 (280) 388 KOG0292 consensus 35.0 24 0.00062 13.6 4.9 47 46-97 653-699 (1202) 389 KOG0686 consensus 35.0 24 0.00062 13.6 11.7 173 54-231 128-329 (466) 390 TIGR00756 PPR pentatricopeptid 34.8 25 0.00063 13.6 3.3 21 211-231 5-25 (35) 391 PRK12699 flgH flagellar basal 34.5 25 0.00063 13.6 2.5 22 15-36 18-39 (246) 392 pfam12262 Lipase_bact_N Bacter 33.3 26 0.00066 13.4 1.9 24 12-35 1-24 (269) 393 KOG2066 consensus 32.1 27 0.00069 13.3 8.8 33 64-96 499-531 (846) 394 PRK10175 hypothetical protein; 31.7 16 0.0004 14.8 0.4 21 17-37 3-23 (75) 395 TIGR03042 PS_II_psbQ_bact phot 31.4 28 0.00071 13.2 9.1 22 13-34 2-23 (142) 396 COG2913 OlmA Outer membrane li 31.3 28 0.00072 13.2 4.4 21 13-33 6-26 (147) 397 pfam03317 ELF ELF protein. Thi 30.3 29 0.00074 13.1 10.1 42 23-70 84-128 (284) 398 KOG1600 consensus 30.1 29 0.00074 13.1 1.5 34 187-224 274-309 (321) 399 PRK09861 cytoplasmic membrane 30.0 29 0.00075 13.1 3.2 27 6-34 2-28 (272) 400 TIGR01781 Trep_dent_lipo Trepo 29.7 30 0.00076 13.1 2.0 22 13-34 4-25 (463) 401 pfam07269 consensus 29.3 30 0.00077 13.0 1.8 32 22-54 7-38 (55) 402 COG3133 SlyB Outer membrane li 29.3 30 0.00077 13.0 1.6 25 13-37 2-26 (154) 403 pfam09680 Tiny_TM_bacill Prote 28.8 29 0.00073 13.2 1.3 19 14-32 5-23 (26) 404 PRK11162 mltA murein transglyc 28.7 31 0.00079 13.0 4.2 21 16-36 4-24 (362) 405 PRK13614 lipoprotein LpqB; Pro 28.4 31 0.0008 12.9 2.8 23 15-37 9-31 (573) 406 TIGR03524 GldJ gliding motilit 28.4 31 0.0008 12.9 2.7 19 15-33 7-25 (559) 407 pfam07312 DUF1459 Protein of u 27.8 30 0.00076 13.1 1.3 22 10-33 1-22 (84) 408 PRK04168 hypothetical protein; 27.6 18 0.00047 14.4 0.2 52 13-66 6-57 (336) 409 pfam06291 Lambda_Bor Bor prote 27.6 32 0.00081 12.9 1.4 21 12-34 1-21 (97) 410 pfam03032 Brevenin Brevenin/es 27.6 29 0.00074 13.2 1.1 24 10-33 1-24 (46) 411 pfam09533 DUF2380 Predicted li 27.3 33 0.00084 12.8 2.5 24 17-40 6-29 (220) 412 TIGR03044 PS_II_psb27 photosys 27.0 33 0.00085 12.8 6.9 51 17-73 12-62 (135) 413 PRK11459 multidrug resistance 26.8 34 0.00086 12.7 2.8 20 17-36 9-28 (478) 414 TIGR00380 cobD cobalamin biosy 26.8 34 0.00086 12.7 1.8 49 22-70 89-137 (322) 415 TIGR02522 pilus_cpaD pilus (Ca 26.7 34 0.00086 12.7 2.2 26 18-43 2-27 (211) 416 PRK11653 hypothetical protein; 26.0 35 0.00089 12.7 3.2 20 18-37 25-44 (223) 417 PRK10781 rcsF outer membrane l 25.6 35 0.0009 12.6 1.6 20 17-36 3-22 (133) 418 TIGR00926 2A1704 Oligopeptide 25.5 24 0.00061 13.7 0.4 26 14-39 193-218 (781) 419 PRK08878 adenosylcobinamide-ph 25.2 36 0.00092 12.6 5.7 17 215-231 220-236 (317) 420 pfam10140 essB Predicted membr 24.8 37 0.00093 12.5 10.4 80 149-232 233-313 (359) 421 PRK11616 hypothetical protein; 24.5 37 0.00095 12.5 1.7 19 16-34 6-24 (109) 422 TIGR00985 3a0801s04tom mitocho 24.3 37 0.00095 12.5 8.2 63 51-113 65-138 (155) 423 KOG2997 consensus 24.3 37 0.00095 12.5 3.3 24 82-105 31-54 (366) 424 PRK10974 glycerol-3-phosphate 24.2 38 0.00096 12.4 1.5 24 10-33 1-25 (439) 425 PRK13859 type IV secretion sys 24.1 38 0.00096 12.4 1.9 36 20-56 5-40 (55) 426 smart00777 Mad3_BUB1_I Mad3/BU 24.1 38 0.00096 12.4 3.2 43 189-231 80-124 (125) 427 PRK13684 Ycf48-like protein; P 24.0 38 0.00097 12.4 3.2 17 16-32 10-26 (333) 428 PRK10523 lipoprotein involved 23.7 38 0.00098 12.4 1.9 25 12-36 3-27 (234) 429 COG4851 CamS Protein involved 23.4 39 0.00099 12.4 1.6 21 13-33 2-22 (382) 430 COG5510 Predicted small secret 23.0 40 0.001 12.3 2.2 17 17-33 9-25 (44) 431 pfam05643 DUF799 Putative bact 23.0 40 0.001 12.3 2.4 19 17-35 4-22 (215) 432 TIGR03502 lipase_Pla1_cef extr 22.6 40 0.001 12.3 1.4 20 12-31 1-20 (792) 433 PRK09857 hypothetical protein; 22.5 41 0.001 12.3 7.7 56 181-236 215-270 (292) 434 pfam01535 PPR PPR repeat. This 22.2 41 0.0011 12.2 3.5 20 211-230 5-24 (31) 435 COG4808 Uncharacterized protei 21.3 43 0.0011 12.1 2.8 24 11-34 4-27 (152) 436 PRK10477 outer membrane lipopr 20.9 44 0.0011 12.0 2.3 20 15-34 4-23 (177) 437 PRK08476 F0F1 ATP synthase sub 20.4 45 0.0011 12.0 5.0 21 1-21 1-21 (141) 438 pfam00244 14-3-3 14-3-3 protei 20.3 45 0.0011 12.0 12.7 46 188-234 142-197 (236) 439 cd02680 MIT_calpain7_2 MIT: do 20.3 45 0.0011 12.0 2.9 15 217-231 17-31 (75) 440 pfam04090 RNA_pol_I_TF RNA pol 20.2 45 0.0012 12.0 8.7 137 73-219 44-184 (192) No 1 >KOG4626 consensus Probab=99.95 E-value=1.9e-26 Score=179.71 Aligned_cols=195 Identities=17% Similarity=0.146 Sum_probs=110.9 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 42019999999999999575787999999999987410012446665566327521001355553210000011258876 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAI 127 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~ 127 (271) .+|+...|+..|.++++.||+-+++++++|.+|...+.+++|+..+.+++.+.|.+..++.+++-+|..+|..+-|...| T Consensus 230 ~~Gei~~aiq~y~eAvkldP~f~dAYiNLGnV~ke~~~~d~Avs~Y~rAl~lrpn~A~a~gNla~iYyeqG~ldlAI~~Y 309 (966) T KOG4626 230 AQGEIWLAIQHYEEAVKLDPNFLDAYINLGNVYKEARIFDRAVSCYLRALNLRPNHAVAHGNLACIYYEQGLLDLAIDTY 309 (966) T ss_pred HCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCHHHCCCEEEEEECCCCHHHHHHHH T ss_conf 04369999999987653797652778658899998722037899999997429730312043479973144278889999 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 42211000023356699999997421111100000000000000012457776543322355435999999953898547 Q gi|254780592|r 128 NRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSR 207 (271) Q Consensus 128 ~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~ 207 (271) ++++..+|..++++.++|+.+...|+..+|...|.+++...|+..++..|+|.+|..+|.+++|...|.++++..|..+. T Consensus 310 kral~~~P~F~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~hadam~NLgni~~E~~~~e~A~~ly~~al~v~p~~aa 389 (966) T KOG4626 310 KRALELQPNFPDAYNNLANALKDKGSVTEAVDCYNKALRLCPNHADAMNNLGNIYREQGKIEEATRLYLKALEVFPEFAA 389 (966) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCHHHHH T ss_conf 99870399956777579999775054679999999998758763778887989998861221889999999863802455 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHH Q ss_conf 88899999998699899999999862917286659 Q gi|254780592|r 208 IRQNLALVVGLQGRMKEAYSIASQELSPEEATRNI 242 (271) Q Consensus 208 ~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~ 242 (271) +..|||.+|.++|++++|+.+|+.+|..+|.-++. T Consensus 390 a~nNLa~i~kqqgnl~~Ai~~YkealrI~P~fAda 424 (966) T KOG4626 390 AHNNLASIYKQQGNLDDAIMCYKEALRIKPTFADA 424 (966) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHH T ss_conf 64229999996054778999999887149407899 No 2 >KOG4626 consensus Probab=99.94 E-value=1e-25 Score=175.34 Aligned_cols=199 Identities=19% Similarity=0.112 Sum_probs=89.4 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHH---------------------- Q ss_conf 2019999999999999575787999999999987410012446665566327521001---------------------- Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEV---------------------- 106 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~---------------------- 106 (271) .|++++|...|+.+++++|++.++|+++|.+|...|+.+.|...|..+++++|....+ T Consensus 129 rg~~~~al~~y~~aiel~p~fida~inla~al~~~~~~~~a~~~~~~alqlnP~l~ca~s~lgnLlka~Grl~ea~~cYl 208 (966) T KOG4626 129 RGQLQDALALYRAAIELKPKFIDAYINLAAALVTQGDLELAVQCFFEALQLNPDLYCARSDLGNLLKAEGRLEEAKACYL 208 (966) T ss_pred HCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCHHHHHCCHHHHHHHHCCCCHHHHHHH T ss_conf 22288999999999852821367775378898745777613799998885095045542244678876336133689999 Q ss_pred ------------HHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf ------------35555321000001125887642211000023356699999997421111100000000000000012 Q gi|254780592|r 107 ------------LAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSI 174 (271) Q Consensus 107 ------------~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~ 174 (271) |..+|.++...|+...|+..|.+++..+|....+|.++|.+|...+.++.|+..|.+++..+|+...+ T Consensus 209 kAi~~qp~fAiawsnLg~~f~~~Gei~~aiq~y~eAvkldP~f~dAYiNLGnV~ke~~~~d~Avs~Y~rAl~lrpn~A~a 288 (966) T KOG4626 209 KAIETQPCFAIAWSNLGCVFNAQGEIWLAIQHYEEAVKLDPNFLDAYINLGNVYKEARIFDRAVSCYLRALNLRPNHAVA 288 (966) T ss_pred HHHHHCCCEEEEEHHCCHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCHHH T ss_conf 99862975045312000477604369999999987653797652778658899998722037899999997429730312 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 4577765433223554359999999538985478889999999869989999999986291728665999999 Q gi|254780592|r 175 VSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKS 247 (271) Q Consensus 175 ~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~ 247 (271) +.|+|.+|..+|+.+.|+..|++++++.|...+++.|||.++...|+.+||+.+|+++|...|.+++..+..+ T Consensus 289 ~gNla~iYyeqG~ldlAI~~Ykral~~~P~F~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~hadam~NLg 361 (966) T KOG4626 289 HGNLACIYYEQGLLDLAIDTYKRALELQPNFPDAYNNLANALKDKGSVTEAVDCYNKALRLCPNHADAMNNLG 361 (966) T ss_pred CCCEEEEEECCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 0434799731442788899999987039995677757999977505467999999999875876377888798 No 3 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=99.94 E-value=1.2e-24 Score=168.77 Aligned_cols=232 Identities=16% Similarity=0.137 Sum_probs=181.3 Q ss_pred HHHHHHHH--CCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC Q ss_conf 99998540--3685433117999999842019999999999999575787999999999987410012446665566327 Q gi|254780592|r 23 PLLLFSSC--HLNPRISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILY 100 (271) Q Consensus 23 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~ 100 (271) ..++++|| +.....+.....+..+++.+++.+|+-.++.+++.+|++.++.+.+|.+|...|+...|..-|+|+.++. T Consensus 14 a~~lLsaCsGg~~~~~~~l~~~Ak~yl~~~~~~aAiI~LKNAL~~~p~~~EARflLg~~Y~~~Gd~~~AEKEL~kA~~lg 93 (924) T TIGR02917 14 AILLLSACSGGDQLSPEELIEEAKKYLQKNKYKAAIIQLKNALQKDPNDAEARFLLGKIYLAQGDAAAAEKELRKALSLG 93 (924) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCC T ss_conf 99998630578888968999999999861899722677578774086747889999999998459888889999998568 Q ss_pred CCCCHHHHHH-----------------------------------HHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHH Q ss_conf 5210013555-----------------------------------53210000011258876422110000233566999 Q gi|254780592|r 101 PEDQEVLAAY-----------------------------------GKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKG 145 (271) Q Consensus 101 p~~~~~~~~~-----------------------------------~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg 145 (271) +....+...+ |..+...|+.++|...|..+...+|....+...++ T Consensus 94 ~~~~~~~p~LAra~l~~~~~~~~~~~~~~~~~~~~~~~a~ll~~~G~A~L~~g~~~~A~~~~~~Al~~~p~~~~A~lglA 173 (924) T TIGR02917 94 YPKEQVLPLLARAYLLQGKFQQVLDELEAKTSLDDEEAAELLALRGLAYLGLGELEEAQKSYEQALAIAPRSLDAKLGLA 173 (924) T ss_pred CCHHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 99865539999999855957899985478877870669999999999988507878999999998763402735899999 Q ss_pred HHHHHHH--HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHH Q ss_conf 9999742--11111000000000000000124577765433223554359999999538985478889999999869989 Q gi|254780592|r 146 SVLAQMG--KHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMK 223 (271) Q Consensus 146 ~~~~~~~--~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~ 223 (271) .+....+ ++++|.....+++..+|...++|.-.|.+..-+|++++|...|++++++.|.+...+..++.+....|+++ T Consensus 174 ~~~~~~~ss~~~~a~~~~d~vL~~~P~~~dA~~L~g~l~~~~~~~~~A~~~y~ka~~~~P~~~~~~~~~~~~~l~~~~~e 253 (924) T TIGR02917 174 RLALAENSSKLDEARALIDEVLTADPKNVDALLLKGDLLLSSGNIEQALAAYRKAIELRPNNIAVLLALATILLEAGEFE 253 (924) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHH T ss_conf 99997337787889999999973287522279999999997277689999999999638887478999999999736667 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 9999999862917286659999999983212 Q gi|254780592|r 224 EAYSIASQELSPEEATRNIKYIKSILSQRDP 254 (271) Q Consensus 224 eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~ 254 (271) +|...++..+..-|.++...|+++++.-+.+ T Consensus 254 ~A~~~~d~~~k~~p~~p~a~y~kAlv~~q~~ 284 (924) T TIGR02917 254 EAEKDADALLKKAPNSPQAAYLKALVDFQKK 284 (924) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC T ss_conf 8899999999984778468999999999833 No 4 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=99.93 E-value=1.2e-24 Score=168.67 Aligned_cols=245 Identities=36% Similarity=0.504 Sum_probs=223.1 Q ss_pred HHHHHHHHHHHHHHHHCCCC--CCC------CHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCC Q ss_conf 99999999999985403685--433------1179999998420199999999999995757879999999999874100 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNP--RIS------IPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRT 86 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~--~~~------~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~ 86 (271) +..+++.-+++.++||.+.. +.. .........+...+...+...+-+...++|++..+ .+++..++..|+- T Consensus 4 ~~~~~~a~l~~~~~~c~~~~~~~~~~~g~~~~a~~~~~~~~~~~q~~~a~~al~~~~~~~p~d~~i-~~~a~a~~~~G~a 82 (257) T COG5010 4 LGGAVLAALALSLAGCAGQAGQREMSTGASAKATGAPESSLAMRQTQGAAAALGAAVLRNPEDLSI-AKLATALYLRGDA 82 (257) T ss_pred HHHHHHHHHHHHHCCCCCCCCCHHHCCCCCHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH-HHHHHHHHHCCCC T ss_conf 877999999999742421157400002221001144789999876668999999998068201778-8877777740555 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 12446665566327521001355553210000011258876422110000233566999999974211111000000000 Q gi|254780592|r 87 AQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALE 166 (271) Q Consensus 87 ~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~ 166 (271) +++..+..+....+|.+..++..+|......|++..|...+.++....|++++.|..+|.+|.+.|+.+.|...|.++++ T Consensus 83 ~~~l~~~~~~~~~~~~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ay~qAl~ 162 (257) T COG5010 83 DSSLAVLQKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARRAYRQALE 162 (257) T ss_pred CCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 50078875540147454999999999999804568899999998436999746666778999973675677899999998 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHH Q ss_conf 00000012457776543322355435999999953898547888999999986998999999998629172866599999 Q gi|254780592|r 167 LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIK 246 (271) Q Consensus 167 ~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~ 246 (271) +.|+.+.+..|+|..+...||++.|..++..+....+.++.+..||+++...+|++++|+.+..+.+.++....++.+++ T Consensus 163 L~~~~p~~~nNlgms~~L~gd~~~A~~lll~a~l~~~ad~~v~~NLAl~~~~~g~~~~A~~i~~~e~~~~~~~~~~~~l~ 242 (257) T COG5010 163 LAPNEPSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVGLQGDFREAEDIAVQELLSEQAANNVAALR 242 (257) T ss_pred HCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHCCCCCCCHHHHHHHHHHH T ss_conf 42588446665899999727798999999999867887638999899998404886777762420036444766999999 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99983212999999 Q gi|254780592|r 247 SILSQRDPWKKIAK 260 (271) Q Consensus 247 ~~l~~~~~~~~~a~ 260 (271) ....+.+.|..+.+ T Consensus 243 ~~~~~~~~~~~~~~ 256 (257) T COG5010 243 AAASQSGAWTQLAK 256 (257) T ss_pred HHHCCCCHHHHHHC T ss_conf 86142431677740 No 5 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=99.93 E-value=4.9e-24 Score=164.98 Aligned_cols=222 Identities=16% Similarity=0.123 Sum_probs=140.0 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHH Q ss_conf 99999842019999999999999575787999999999987410012446665566327521001355553210000011 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLD 121 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~ 121 (271) -...++..|++++|..+|.++-+.+|+|+.+...+|...+..|+..+|++.++++++++|+...+-..+...|++.|+++ T Consensus 389 lG~~~L~~g~~~~A~~yl~ka~~~~P~~a~~~t~lG~~~L~~Gd~~~Ai~dL~~A~~l~~~~~~a~~~L~~~yL~~~~~d 468 (924) T TIGR02917 389 LGEAYLAKGDFEKAAEYLAKASELDPENAAARTKLGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGEFD 468 (924) T ss_pred HHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 99999863896789999987610589808899999988620688347899999999727775347899999887420489 Q ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 25887642211000023356699999997421111100000000000000012457776543322355435999999953 Q gi|254780592|r 122 EGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQM 201 (271) Q Consensus 122 ~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~ 201 (271) +|+...++.....|+++..|+.+|.+|...|+..+|...|+++++++|+++.+..+++.+-...|+++.|...|++++.. T Consensus 469 KAla~~~~l~~~~pd~p~~~nl~G~i~l~~~~~~kA~~~f~~AL~i~p~~~~A~~nLA~~d~~~~~~~~A~~r~~~~l~~ 548 (924) T TIGR02917 469 KALAAAKKLEKKQPDNPSLHNLLGAIYLGKKDLAKAREAFEKALKIEPDFFPAAANLARIDLQEGNPDDAKQRFEKVLTI 548 (924) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99999999975288982689999999985178789999999875318778478999999887428968999999999875 Q ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHH--HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8985478889999999869989999999986291728665--9999999983212999999997 Q gi|254780592|r 202 IGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRN--IKYIKSILSQRDPWKKIAKARS 263 (271) Q Consensus 202 ~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~--~~~l~~~l~~~~~~~~~a~a~~ 263 (271) +|.+.++...|+.++...|+.+++...++++...+|.+-. +.+.+.++.+.+..+.++-++. T Consensus 549 ~P~~~~a~~~La~L~~~~g~~~~~~~~l~~A~~~~~~~~~p~l~L~~~yl~~~~l~kAl~~l~~ 612 (924) T TIGR02917 549 DPKNLRALLALAELAARTGNEEEAVAWLEKAAEKNPKEIEPALALAQYYLSKGELKKALAILNE 612 (924) T ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 7433799999999998518968999999999862817777899999999865898789999999 No 6 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=99.93 E-value=3.2e-24 Score=166.10 Aligned_cols=226 Identities=20% Similarity=0.210 Sum_probs=117.0 Q ss_pred HHHHHHHHHHHHCCCC-CCCCHH-HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 9999999985403685-433117-99999984201999999999999957578799999999998741001244666556 Q gi|254780592|r 19 VFLIPLLLFSSCHLNP-RISIPD-PESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQV 96 (271) Q Consensus 19 ~~l~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~a 96 (271) ++++.+++++||++.+ +..... .......+.. .-..+++.+|..|+..|+++.|..-+++| T Consensus 1 ~~l~~~~~l~GC~~~~P~~q~~~~~~~~~~~~~~-----------------~~a~~r~qLal~Yl~~G~~~~Ak~~l~~A 63 (247) T TIGR02521 1 LALVLLLALTGCVTTPPTVQEQEVPESSRTTDRE-----------------KAAKIRVQLALGYLEQGDLEKAKENLDKA 63 (247) T ss_pred CHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHH-----------------HHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 3887846785157887897787888875510279-----------------99999999999998607878999999999 Q ss_pred HHCCCCCCHHHHHHHHHHHHHC--CHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH--HHHHHHHHHHHCCCC--CCH Q ss_conf 6327521001355553210000--0112588764221100002335669999999742--111110000000000--000 Q gi|254780592|r 97 AILYPEDQEVLAAYGKSLANAG--YLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG--KHSEALIEYERALEL--SPN 170 (271) Q Consensus 97 l~~~p~~~~~~~~~~~~~~~~g--~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~--~~~~A~~~~~~~~~~--~p~ 170 (271) ++.+|++..++..++.+|...| +.+.|...|++++...|++.+.+++.|..+.++| ++++|+..|+.+++. -+. T Consensus 64 L~~dp~~~~a~~~lA~~Yq~~grP~~~~A~~~f~~Al~~~p~~g~~~NNYG~FLC~~g~~~~~~A~~~F~~A~~~p~Y~~ 143 (247) T TIGR02521 64 LEHDPDDYDAYLALALYYQQLGRPELEKAEQSFRKALTLKPNNGDVLNNYGTFLCQQGNDKYEQAEQQFEAAIEDPLYPQ 143 (247) T ss_pred HHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCEECHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCC T ss_conf 87376661289999999987179955789999999862489985232014577640489706899999999986688973 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC---CCCCHHHHHHHH Q ss_conf 00124577765433223554359999999538985478889999999869989999999986291---728665999999 Q gi|254780592|r 171 ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSP---EEATRNIKYIKS 247 (271) Q Consensus 171 ~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~---~~~~~~~~~l~~ 247 (271) ....+.|+|.+....|+.+.|..+|++|+..+|+.+.....|+.+....|++.+|...+++-... .+..+...+|.. T Consensus 144 ~~~~~eN~G~C~~~~g~~~~A~~~l~~Al~~dp~~p~~lleLa~l~~~~g~~~~A~~~l~ry~~~~G~~~~~a~sL~Lg~ 223 (247) T TIGR02521 144 PARSYENAGLCALKAGDTEKAEKYLTRALKIDPQRPESLLELAELYYERGQYAQARALLERYQQTVGVKNQTAESLWLGI 223 (247) T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 30245549999986178688999999999858788855799999987507728999999999987367888873478999 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99832129999999 Q gi|254780592|r 248 ILSQRDPWKKIAKA 261 (271) Q Consensus 248 ~l~~~~~~~~~a~a 261 (271) -+++..|-...++. T Consensus 224 riA~~~Gd~~~~~~ 237 (247) T TIGR02521 224 RIARALGDVAAAKR 237 (247) T ss_pred HHHHHHCCHHHHHH T ss_conf 99998442999999 No 7 >KOG1126 consensus Probab=99.91 E-value=2.3e-23 Score=160.81 Aligned_cols=216 Identities=12% Similarity=0.060 Sum_probs=196.2 Q ss_pred HHHHHHHHHCCHHHHHHHHH----------------------------------HHHHHCCCCHHHHHHHHHHHHHHHCC Q ss_conf 99999984201999999999----------------------------------99995757879999999999874100 Q gi|254780592|r 41 PESLENMNHEQLLEVTSTIG----------------------------------LQYQSHTKNKMIGIVYADVLRRVGRT 86 (271) Q Consensus 41 ~~~~~~~~~~~~~~A~~~~~----------------------------------~al~~~P~~~~~~~~~a~~l~~~g~~ 86 (271) +....+.+.++|++|...|+ ..++.+|+.|+.|...|.+|.-++++ T Consensus 358 q~GrayFEl~~Y~~a~~~F~~~r~~~p~rv~~meiyST~LWHLq~~v~Ls~Laq~Li~~~~~sPesWca~GNcfSLQkdh 437 (638) T KOG1126 358 QLGRAYFELIEYDQAERIFSLVRRIEPYRVKGMEIYSTTLWHLQDEVALSYLAQDLIDTDPNSPESWCALGNCFSLQKDH 437 (638) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCHHHHHHHH T ss_conf 98899998987899999999998614541132467888888877567788998988750878707787740256665679 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 12446665566327521001355553210000011258876422110000233566999999974211111000000000 Q gi|254780592|r 87 AQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALE 166 (271) Q Consensus 87 ~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~ 166 (271) ++|+..|+++++++|...-++..+|.=+....++|+|...|++++..+|.+..+|+.+|.+|.++++++.|.-.|.++++ T Consensus 438 ~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~~rhYnAwYGlG~vy~Kqek~e~Ae~~fqkA~~ 517 (638) T KOG1126 438 DTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFDKAMKSFRKALGVDPRHYNAWYGLGTVYLKQEKLEFAEFHFQKAVE 517 (638) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHEECCCHHHHHHHHHHHHHC T ss_conf 99999887765468763212353077345457777677877765157805667888620302204104678998876422 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHH Q ss_conf 00000012457776543322355435999999953898547888999999986998999999998629172866599999 Q gi|254780592|r 167 LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIK 246 (271) Q Consensus 167 ~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~ 246 (271) ++|.+......+|..+.+.|+.++|+..+++|+-++|.++-..+..|.++..+++++||...++..-..-|.++.+.++. T Consensus 518 INP~nsvi~~~~g~~~~~~k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il~~~~~~~eal~~LEeLk~~vP~es~v~~ll 597 (638) T KOG1126 518 INPSNSVILCHIGRIQHQLKRKDKALQLYEKAIHLDPKNPLCKYHRASILFSLGRYVEALQELEELKELVPQESSVFALL 597 (638) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 79610357765259999860132799999998816878815678877888762236999999999998575057899999 Q ss_pred HHHHHHHHHH Q ss_conf 9998321299 Q gi|254780592|r 247 SILSQRDPWK 256 (271) Q Consensus 247 ~~l~~~~~~~ 256 (271) +.+.++=+.. T Consensus 598 gki~k~~~~~ 607 (638) T KOG1126 598 GKIYKRLGNT 607 (638) T ss_pred HHHHHHHCCC T ss_conf 9999997330 No 8 >KOG1125 consensus Probab=99.89 E-value=4.1e-22 Score=153.19 Aligned_cols=201 Identities=20% Similarity=0.171 Sum_probs=184.3 Q ss_pred CCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 31179999998420199999999999995757879999999999874100124466655663275210013555532100 Q gi|254780592|r 37 SIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN 116 (271) Q Consensus 37 ~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~ 116 (271) ..|..+....+..|++.+|+..|+.+++.||++.++|..+|.+....++-..|+.-++++++++|++.+++..++..|.. T Consensus 286 pdPf~eG~~lm~nG~L~~A~LafEAAVkqdP~haeAW~~LG~~qaENE~E~~ai~AL~rcl~LdP~NleaLmaLAVSytN 365 (579) T KOG1125 286 PDPFKEGCNLMKNGDLSEAALAFEAAVKQDPQHAEAWQKLGITQAENENEQNAISALRRCLELDPTNLEALMALAVSYTN 365 (579) T ss_pred CCHHHHHHHHHHCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 87489999998638905999999999861928789999961576503420889999999984699519999998888861 Q ss_pred HCCHHHHHHHHHHHHH-----------------------------------------HHC--HHHHHHHHHHHHHHHHHH Q ss_conf 0001125887642211-----------------------------------------000--023356699999997421 Q gi|254780592|r 117 AGYLDEGLDAINRAQR-----------------------------------------PDI--PDWQLISAKGSVLAQMGK 153 (271) Q Consensus 117 ~g~~~~A~~~~~~a~~-----------------------------------------~~p--~~~~~~~~lg~~~~~~~~ 153 (271) .|.-..|+..+.+.+. ..| .+++....||.+|...|+ T Consensus 366 eg~q~~Al~~L~~Wi~~~p~y~~l~~a~~~~~~~~~~s~~~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~e 445 (579) T KOG1125 366 EGLQNQALKMLDKWIRNKPKYVHLVSAGENEDFENTKSFLDSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGE 445 (579) T ss_pred HHHHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHH T ss_conf 46688899999999984862111146676656667767788789999999999999857788884277626789740147 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 11110000000000000001245777654332235543599999995389854788899999998699899999999862 Q gi|254780592|r 154 HSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 154 ~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) ++.|+++|+.++..+|++...|+.+|.++....+.++|+..|++|+++.|+..++++|||..+..+|.|.||...+=.+| T Consensus 446 fdraiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNlG~ykEA~~hlL~AL 525 (579) T KOG1125 446 FDRAVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCMNLGAYKEAVKHLLEAL 525 (579) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88899999999704984177898866886077553999999999984188726530101456620566999999999999 Q ss_pred CCCC Q ss_conf 9172 Q gi|254780592|r 234 SPEE 237 (271) Q Consensus 234 ~~~~ 237 (271) +..+ T Consensus 526 ~mq~ 529 (579) T KOG1125 526 SMQR 529 (579) T ss_pred HHHH T ss_conf 8653 No 9 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=99.89 E-value=8.1e-21 Score=145.24 Aligned_cols=222 Identities=15% Similarity=0.130 Sum_probs=189.3 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC--CH------------H Q ss_conf 999999842019999999999999575787999999999987410012446665566327521--00------------1 Q gi|254780592|r 41 PESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPED--QE------------V 106 (271) Q Consensus 41 ~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~--~~------------~ 106 (271) .+....++.|+...|+..|.+++..+|+|++++-.+|.++.+.|++.+|..+|+++.+.+|+. .. . T Consensus 275 ~~Gl~aL~~G~l~aAe~~l~~AL~~~PnD~daLGgLG~v~lRqG~~~eA~~yf~rA~~~d~~~~~~~kW~sll~ta~yW~ 354 (1158) T PRK11447 275 QQGLAAVDSGMGGKAIAELQQAVRANPKDSEALGALGQAYSQRGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRYWL 354 (1158) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 98999997799788999999999719998789988899999658889999999999970899854688999998878999 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH-- Q ss_conf 355553210000011258876422110000233566999999974211111000000000000000124577765433-- Q gi|254780592|r 107 LAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLL-- 184 (271) Q Consensus 107 ~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~-- 184 (271) +...++.....|++.+|...++.+...+|.++.+...+|.++...+++..|+..|+.++..+|.+..++..+..+|.. T Consensus 355 ~l~qa~~A~~~g~~~~A~~l~~~A~~~~p~~~~A~l~LaD~~~a~~d~~~AE~~Yr~~L~~~~~N~~AlrGL~~l~~~q~ 434 (1158) T PRK11447 355 LIQQGDAALKANNLAQAERLYQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQTLRMDSGNTNAVRGLANIYRQQS 434 (1158) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC T ss_conf 99999999987998999999999985599985599999999998399799999999999748998789999999998639 Q ss_pred ----------------------------------------HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHH Q ss_conf ----------------------------------------2235543599999995389854788899999998699899 Q gi|254780592|r 185 ----------------------------------------MGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKE 224 (271) Q Consensus 185 ----------------------------------------~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~e 224 (271) +|+...|+..+++++...|+++..++.||.+|..+|+..+ T Consensus 435 ~~~A~~ll~~l~~~q~~~~~~~~~~l~~~~l~~qA~~~~~~Gd~~~A~~~l~qa~~~~P~dpWlR~~LAr~~~~~G~~~~ 514 (1158) T PRK11447 435 PEKALAFIASLSASQRRSIDDIERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPGSVWITYRLAQDLWQAGQRSQ 514 (1158) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHH T ss_conf 99999999837998887763144454679999999999978899999999999997189970499999999997799889 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHH-HHHHHHHHHHHH Q ss_conf 99999986291728665999999998-321299999999 Q gi|254780592|r 225 AYSIASQELSPEEATRNIKYIKSILS-QRDPWKKIAKAR 262 (271) Q Consensus 225 A~~~~~~~L~~~~~~~~~~~l~~~l~-~~~~~~~~a~a~ 262 (271) |..+++..+...|.+++..|..+++. ..+.|....... T Consensus 515 A~~lM~~l~~~~p~~pe~~yA~ALyls~~~~~~~A~a~L 553 (1158) T PRK11447 515 ADALMRRLAQQKPNDPEQVYAYGLYLSGSDRDRAALAHL 553 (1158) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 999999998449999899999999987678879999999 No 10 >KOG1126 consensus Probab=99.89 E-value=1.4e-21 Score=149.90 Aligned_cols=205 Identities=15% Similarity=0.110 Sum_probs=183.2 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHH---------------------------- Q ss_conf 9842019999999999999575787999999999987410012446665566---------------------------- Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVA---------------------------- 97 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al---------------------------- 97 (271) .+..-+..+|+..|.+.-+..++-.+++..+|.+|..++++++|..+|+.+- T Consensus 329 ~~s~y~~~~A~~~~~klp~h~~nt~wvl~q~GrayFEl~~Y~~a~~~F~~~r~~~p~rv~~meiyST~LWHLq~~v~Ls~ 408 (638) T KOG1126 329 SLSQYNCREALNLFEKLPSHHYNTGWVLSQLGRAYFELIEYDQAERIFSLVRRIEPYRVKGMEIYSTTLWHLQDEVALSY 408 (638) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 98888799999999864775378447999988999989878999999999986145411324678888888775677889 Q ss_pred ------HCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH Q ss_conf ------32752100135555321000001125887642211000023356699999997421111100000000000000 Q gi|254780592|r 98 ------ILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNE 171 (271) Q Consensus 98 ------~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~ 171 (271) ..+|+.++.|+.+|.|+.-+++.+.|++.|++++..+|...-++..+|+=+.....++.|..+|+.++..+|.+ T Consensus 409 Laq~Li~~~~~sPesWca~GNcfSLQkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~~rh 488 (638) T KOG1126 409 LAQDLIDTDPNSPESWCALGNCFSLQKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFDKAMKSFRKALGVDPRH 488 (638) T ss_pred HHHHHHHHCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 98988750878707787740256665679999998877654687632123530773454577776778777651578056 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 0124577765433223554359999999538985478889999999869989999999986291728665999999998 Q gi|254780592|r 172 SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILS 250 (271) Q Consensus 172 ~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~ 250 (271) ..+|+.+|.+|.++++++.|+-+|++|++++|.++.+...+|.++...|+.|+|...+++++-.+|.+....+-++.+. T Consensus 489 YnAwYGlG~vy~Kqek~e~Ae~~fqkA~~INP~nsvi~~~~g~~~~~~k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il 567 (638) T KOG1126 489 YNAWYGLGTVYLKQEKLEFAEFHFQKAVEINPSNSVILCHIGRIQHQLKRKDKALQLYEKAIHLDPKNPLCKYHRASIL 567 (638) T ss_pred HHHHHHHHHHEECCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 6788862030220410467899887642279610357765259999860132799999998816878815678877888 No 11 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=99.89 E-value=9.5e-22 Score=150.95 Aligned_cols=215 Identities=17% Similarity=0.178 Sum_probs=191.4 Q ss_pred HHHHHHHHHHHHHHC------CCC-CCCCHH---------HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 999999999985403------685-433117---------9999998420199999999999995757879999999999 Q gi|254780592|r 17 CIVFLIPLLLFSSCH------LNP-RISIPD---------PESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVL 80 (271) Q Consensus 17 ~~~~l~~~~~~~~~~------~~~-~~~~~~---------~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l 80 (271) ++++++++.+|++.. ..+ ...... +.+..++++|+++.|...+++||+.||++..++..+|..| T Consensus 2 ~l~~~~~l~GC~~~~P~~q~~~~~~~~~~~~~~~~a~~r~qLal~Yl~~G~~~~Ak~~l~~AL~~dp~~~~a~~~lA~~Y 81 (247) T TIGR02521 2 ALVLLLALTGCVTTPPTVQEQEVPESSRTTDREKAAKIRVQLALGYLEQGDLEKAKENLDKALEHDPDDYDAYLALALYY 81 (247) T ss_pred HHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 88784678515788789778788887551027999999999999998607878999999999873766612899999999 Q ss_pred HHHH--CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC--CHHHHHHHHHHHHHH--HCHHHHHHHHHHHHHHHHHHH Q ss_conf 8741--0012446665566327521001355553210000--011258876422110--000233566999999974211 Q gi|254780592|r 81 RRVG--RTAQALAVMRQVAILYPEDQEVLAAYGKSLANAG--YLDEGLDAINRAQRP--DIPDWQLISAKGSVLAQMGKH 154 (271) Q Consensus 81 ~~~g--~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g--~~~~A~~~~~~a~~~--~p~~~~~~~~lg~~~~~~~~~ 154 (271) .+.| +.++|...|++++.++|++..+++++|..+..+| ++++|...|.+|+.. -+.-+.++.++|.|..+.|+. T Consensus 82 q~~grP~~~~A~~~f~~Al~~~p~~g~~~NNYG~FLC~~g~~~~~~A~~~F~~A~~~p~Y~~~~~~~eN~G~C~~~~g~~ 161 (247) T TIGR02521 82 QQLGRPELEKAEQSFRKALTLKPNNGDVLNNYGTFLCQQGNDKYEQAEQQFEAAIEDPLYPQPARSYENAGLCALKAGDT 161 (247) T ss_pred HHHCCCCHHHHHHHHHHHHHCCCCCCCEECHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCH T ss_conf 87179955789999999862489985232014577640489706899999999986688973302455499999861786 Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---CCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 111000000000000000124577765433223554359999999538---98547888999999986998999999998 Q gi|254780592|r 155 SEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMI---GADSRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 155 ~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~---p~~~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) ++|...|.+++..+|..+..+..++.+++..|++..|...+++..+.. +..++.+..-..+-...|+.+.+.++-.+ T Consensus 162 ~~A~~~l~~Al~~dp~~p~~lleLa~l~~~~g~~~~A~~~l~ry~~~~G~~~~~a~sL~Lg~riA~~~Gd~~~~~~y~~q 241 (247) T TIGR02521 162 EKAEKYLTRALKIDPQRPESLLELAELYYERGQYAQARALLERYQQTVGVKNQTAESLWLGIRIARALGDVAAAKRYGAQ 241 (247) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 88999999999858788855799999987507728999999999987367888873478999999984429999999998 No 12 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=99.88 E-value=9e-21 Score=144.96 Aligned_cols=216 Identities=19% Similarity=0.180 Sum_probs=188.5 Q ss_pred HHHHHHHHHHHHHHCCCCCCC--CH--------HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCC Q ss_conf 999999999985403685433--11--------79999998420199999999999995757879999999999874100 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRIS--IP--------DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRT 86 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~~--~~--------~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~ 86 (271) ..|.++..+.+++|++.+... +- ...++.+++.||+..|...++++++.||++..+|..+|.+|...|+. T Consensus 6 ~~i~lv~~La~~~cvt~~~~~~~~~~~~aa~arlqLal~YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~ 85 (250) T COG3063 6 TRISLVFSLALGACVTDPAPSRQTDRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGEN 85 (250) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCH T ss_conf 88999998763200589975332258899999999999999778999999889999884952088999999999990775 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHH--HCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 12446665566327521001355553210000011258876422110--0002335669999999742111110000000 Q gi|254780592|r 87 AQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRP--DIPDWQLISAKGSVLAQMGKHSEALIEYERA 164 (271) Q Consensus 87 ~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~--~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~ 164 (271) +.|-+.|++++.++|++.+++.++|..+..+|++++|...|++++.. -+..+..+.++|.|..+.|+.+.|...++++ T Consensus 86 ~~A~e~YrkAlsl~p~~GdVLNNYG~FLC~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~ra 165 (250) T COG3063 86 DLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRA 165 (250) T ss_pred HHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 56799999998529886405521069997089868999999998718887775304544578886437704479999999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 00000000124577765433223554359999999538985478889999999869989999999986 Q gi|254780592|r 165 LELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQE 232 (271) Q Consensus 165 ~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~ 232 (271) ++.+|+.+.....++......|++..|..+++...+..+...+.......+-...|+-+.+-++-.+- T Consensus 166 L~~dp~~~~~~l~~a~~~~~~~~y~~Ar~~~~~~~~~~~~~A~sL~L~iriak~~gd~~~a~~Y~~qL 233 (250) T COG3063 166 LELDPQFPPALLELARLHYKAGDYAPARLYLERYQQRGGAQAESLLLGIRIAKRLGDRAAAQRYQAQL 233 (250) T ss_pred HHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 88375787678998798875313018999999997025665899998999999965488999999999 No 13 >KOG1155 consensus Probab=99.87 E-value=4.3e-19 Score=134.67 Aligned_cols=194 Identities=16% Similarity=0.148 Sum_probs=117.0 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH----------------------------------HHHHHCC Q ss_conf 999999842019999999999999575787999999999----------------------------------9874100 Q gi|254780592|r 41 PESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADV----------------------------------LRRVGRT 86 (271) Q Consensus 41 ~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~----------------------------------l~~~g~~ 86 (271) ..+....++.|+++|+..|+...+.||-..+-.-.+..+ |...+++ T Consensus 267 ~~A~~~y~~rDfD~a~s~Feei~knDPYRl~dmdlySN~LYv~~~~skLs~LA~~v~~idKyR~ETCCiIaNYYSlr~eH 346 (559) T KOG1155 267 QIAAASYNQRDFDQAESVFEEIRKNDPYRLDDMDLYSNVLYVKNDKSKLSYLAQNVSNIDKYRPETCCIIANYYSLRSEH 346 (559) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEHHHHHHHHHH T ss_conf 99998861220788999999998549773215878767799972117899999999875147752324431378887768 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 12446665566327521001355553210000011258876422110000233566999999974211111000000000 Q gi|254780592|r 87 AQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALE 166 (271) Q Consensus 87 ~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~ 166 (271) ++|+.+|+++++++|....+|..+|.-|..+.+...|.+.|+++++.+|.+.++|+.+|..|.-.+.+.-|.=.|+++.. T Consensus 347 EKAv~YFkRALkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRAWYGLGQaYeim~Mh~YaLyYfqkA~~ 426 (559) T KOG1155 347 EKAVMYFKRALKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRAWYGLGQAYEIMKMHFYALYYFQKALE 426 (559) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99999999998328431378998607989835618899999988814925678885206899981356999999999872 Q ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 00000012457776543322355435999999953898547888999999986998999999998629 Q gi|254780592|r 167 LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 167 ~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) ..|+++..|..+|.+|...++.++|+.+|++++..+..+..++..||..+..+++.++|...+++.+. T Consensus 427 ~kPnDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~~~l~~LakLye~l~d~~eAa~~yek~v~ 494 (559) T KOG1155 427 LKPNDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEGSALVRLAKLYEELKDLNEAAQYYEKYVE 494 (559) T ss_pred CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 49996689999999999855689999999888852464338999999999998747999999999999 No 14 >KOG0547 consensus Probab=99.86 E-value=7.8e-20 Score=139.22 Aligned_cols=199 Identities=15% Similarity=0.147 Sum_probs=143.9 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 99998420199999999999995757879999999999874100124466655663275210013555532100000112 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) +..++...+-.+-...|+++...||+|++++...|.++.-.+++++|++-|++++.++|++...+..++..+++++++++ T Consensus 367 a~~y~d~~~~~~~~~~F~~A~~ldp~n~dvYyHRgQm~flL~q~e~A~aDF~Kai~L~pe~~~~~iQl~~a~Yr~~k~~~ 446 (606) T KOG0547 367 AAAYADENQSEKMWKDFNKAEDLDPENPDVYYHRGQMRFLLQQYEEAIADFQKAISLDPENAYAYIQLCCALYRQHKIAE 446 (606) T ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99986011308899998998744988983257677899999989999987998760582533799999999999878999 Q ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH------HHHHHHHHHHHHH-HHHHHHHHHHHH Q ss_conf 588764221100002335669999999742111110000000000000------0012457776543-322355435999 Q gi|254780592|r 123 GLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN------ESSIVSNIAMSYL-LMGDLKTAEEKL 195 (271) Q Consensus 123 A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~------~~~~~~~~~~~~~-~~g~~~~A~~~~ 195 (271) ++..|+.+...-|..++.+...|.++..++++++|.+.|.+++++.|. +...+.+.|.+.. ..+++.+|+..+ T Consensus 447 ~m~~Fee~kkkFP~~~Evy~~fAeiLtDqqqFd~A~k~YD~ai~LE~~~~~~~v~~~plV~Ka~l~~qwk~d~~~a~~Ll 526 (606) T KOG0547 447 SMKTFEEAKKKFPNCPEVYNLFAEILTDQQQFDKAVKQYDKAIELEPREHLIIVNAAPLVHKALLVLQWKEDINQAENLL 526 (606) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCHHHHHHHHHHHH T ss_conf 99999999986889823778999998317767789998888885064201132460544304676435165489999999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHH Q ss_conf 9999538985478889999999869989999999986291728665 Q gi|254780592|r 196 RFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRN 241 (271) Q Consensus 196 ~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~ 241 (271) ++|++++|....++..||.+..++|+.++|+++|++++........ T Consensus 527 ~KA~e~Dpkce~A~~tlaq~~lQ~~~i~eAielFEksa~lArt~~E 572 (606) T KOG0547 527 RKAIELDPKCEQAYETLAQFELQRGKIDEAIELFEKSAQLARTESE 572 (606) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9987259447899999989999886599999999999999885999 No 15 >KOG0547 consensus Probab=99.84 E-value=3.3e-19 Score=135.40 Aligned_cols=209 Identities=12% Similarity=0.070 Sum_probs=187.5 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 98420199999999999995757879999999999874100124466655663275210013555532100000112588 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLD 125 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~ 125 (271) ++-.|+.-.|...|+++++.+|.+...++.+|.+|....+.++....|+++..++|+++.++..+|..+.-++++++|.. T Consensus 336 ~fL~g~~~~a~~d~~~~I~l~~~~~~lyI~~a~~y~d~~~~~~~~~~F~~A~~ldp~n~dvYyHRgQm~flL~q~e~A~a 415 (606) T KOG0547 336 HFLKGDSLGAQEDFDAAIKLDPAFNSLYIKRAAAYADENQSEKMWKDFNKAEDLDPENPDVYYHRGQMRFLLQQYEEAIA 415 (606) T ss_pred HHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 23148810116669999845765523899999998601130889999899874498898325767789999998999998 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC- Q ss_conf 7642211000023356699999997421111100000000000000012457776543322355435999999953898- Q gi|254780592|r 126 AINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA- 204 (271) Q Consensus 126 ~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~- 204 (271) .|++++.++|.+.-.+..++-...++++.++++..|+.+....|+.++.+...|.++..+++++.|+..|.+|+.+.|. T Consensus 416 DF~Kai~L~pe~~~~~iQl~~a~Yr~~k~~~~m~~Fee~kkkFP~~~Evy~~fAeiLtDqqqFd~A~k~YD~ai~LE~~~ 495 (606) T KOG0547 416 DFQKAISLDPENAYAYIQLCCALYRQHKIAESMKTFEEAKKKFPNCPEVYNLFAEILTDQQQFDKAVKQYDKAIELEPRE 495 (606) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 79987605825337999999999998789999999999998688982377899999831776778999888888506420 Q ss_pred -----CHHHHHHHHHH-HHHCCCHHHHHHHHHHHCCCCCCCHHHHH-HHHHHHHHHH Q ss_conf -----54788899999-99869989999999986291728665999-9999983212 Q gi|254780592|r 205 -----DSRIRQNLALV-VGLQGRMKEAYSIASQELSPEEATRNIKY-IKSILSQRDP 254 (271) Q Consensus 205 -----~~~~~~~La~~-~~~~g~~~eA~~~~~~~L~~~~~~~~~~~-l~~~l~~~~~ 254 (271) +...+.+-|.+ +...+++++|+.++++++..+|......- +..+.-|+.. T Consensus 496 ~~~~v~~~plV~Ka~l~~qwk~d~~~a~~Ll~KA~e~Dpkce~A~~tlaq~~lQ~~~ 552 (606) T KOG0547 496 HLIIVNAAPLVHKALLVLQWKEDINQAENLLRKAIELDPKCEQAYETLAQFELQRGK 552 (606) T ss_pred CCCCCCCHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 113246054430467643516548999999999872594478999999899998865 No 16 >KOG1155 consensus Probab=99.83 E-value=1.9e-18 Score=130.68 Aligned_cols=217 Identities=13% Similarity=0.070 Sum_probs=186.2 Q ss_pred CCHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHH------------------- Q ss_conf 0199999999999995-75787999999999987410012446665566327521001355------------------- Q gi|254780592|r 50 EQLLEVTSTIGLQYQS-HTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAA------------------- 109 (271) Q Consensus 50 ~~~~~A~~~~~~al~~-~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~------------------- 109 (271) .+.+++...+...+.. -|+++......|.+.+..-++++|++.|+...+.+|....-.-. T Consensus 241 ~q~~e~~~k~e~l~~~gf~~~~~i~~~~A~~~y~~rDfD~a~s~Feei~knDPYRl~dmdlySN~LYv~~~~skLs~LA~ 320 (559) T KOG1155 241 HQHEEALQKKERLSSVGFPNSMYIKTQIAAASYNQRDFDQAESVFEEIRKNDPYRLDDMDLYSNVLYVKNDKSKLSYLAQ 320 (559) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88889999999887516874177999999988612207889999999985497732158787677999721178999999 Q ss_pred ---------------HHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf ---------------55321000001125887642211000023356699999997421111100000000000000012 Q gi|254780592|r 110 ---------------YGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSI 174 (271) Q Consensus 110 ---------------~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~ 174 (271) +|..|.-.++-++|...|++++.++|+...+|..+|+-|...++...|+++|+.++++.|.+..+ T Consensus 321 ~v~~idKyR~ETCCiIaNYYSlr~eHEKAv~YFkRALkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRA 400 (559) T KOG1155 321 NVSNIDKYRPETCCIIANYYSLRSEHEKAVMYFKRALKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRA 400 (559) T ss_pred HHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHH T ss_conf 99875147752324431378887768999999999983284313789986079898356188999999888149256788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH-HHHHHHHHHH Q ss_conf 457776543322355435999999953898547888999999986998999999998629172866599-9999998321 Q gi|254780592|r 175 VSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK-YIKSILSQRD 253 (271) Q Consensus 175 ~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~-~l~~~l~~~~ 253 (271) |+.+|++|.-.+-..=|+-+|++|.+..|.+++.|..||.+|...++.+||+.+|.+++...+.+.+.. .+..+..+-. T Consensus 401 WYGLGQaYeim~Mh~YaLyYfqkA~~~kPnDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~~~l~~LakLye~l~ 480 (559) T KOG1155 401 WYGLGQAYEIMKMHFYALYYFQKALELKPNDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEGSALVRLAKLYEELK 480 (559) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 85206899981356999999999872499966899999999998556899999998888524643389999999999987 Q ss_pred HHHHHHHHHHHCC Q ss_conf 2999999997114 Q gi|254780592|r 254 PWKKIAKARSNHN 266 (271) Q Consensus 254 ~~~~~a~a~~~~~ 266 (271) ...+.+++...+- T Consensus 481 d~~eAa~~yek~v 493 (559) T KOG1155 481 DLNEAAQYYEKYV 493 (559) T ss_pred HHHHHHHHHHHHH T ss_conf 4799999999999 No 17 >KOG2002 consensus Probab=99.81 E-value=2.4e-18 Score=130.08 Aligned_cols=248 Identities=13% Similarity=0.041 Sum_probs=171.3 Q ss_pred HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC-HHHHH Q ss_conf 7875699999899999999-------999998540368543311799999984201999999999999957578-79999 Q gi|254780592|r 3 ILFCPVILFLKKIKCIVFL-------IPLLLFSSCHLNPRISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKN-KMIGI 74 (271) Q Consensus 3 ~~~~~~~~f~~~~~~~~~l-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~-~~~~~ 74 (271) +|+-++...+++..++... ...+.++.-.+.+.+.....++.-.++.++|..|..+|++++..+|.. ++..+ T Consensus 124 ~m~~~~~l~~~~~~~l~~~~~~~~~A~a~F~~Vl~~sp~Nil~LlGkA~i~ynkkdY~~al~yyk~al~inp~~~aD~rI 203 (1018) T KOG2002 124 DMYEDSHLLVQRGFLLLEGDKSMDDADAQFHFVLKQSPDNILALLGKARIAYNKKDYRGALKYYKKALRINPACKADVRI 203 (1018) T ss_pred HCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 50575033123232220477637779999999996088535889989999961213899999999998508666787631 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH---HCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH Q ss_conf 999999874100124466655663275210013555532100---00011258876422110000233566999999974 Q gi|254780592|r 75 VYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN---AGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQM 151 (271) Q Consensus 75 ~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~---~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~ 151 (271) ..|.++...|+.+.|+..|.++++++|.+..++..+|.+-.. ...+..+...+.++...++.+|.....++..+... T Consensus 204 gig~Cf~kl~~~~~a~~a~~ralqLdp~~v~alv~L~~~~l~~~d~~s~~~~~~ll~~ay~~n~~nP~~l~~LAn~fyfK 283 (1018) T KOG2002 204 GIGHCFWKLGMSEKALLAFERALQLDPTCVSALVALGEVDLNFNDSDSYKKGVQLLQRAYKENNENPVALNHLANHFYFK 283 (1018) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHC T ss_conf 14667875034366799999998507155999999999998701567788899998888760689957999997777413 Q ss_pred HHHHHHHHHHHHCCCCCCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHCCCHHHHHH Q ss_conf 2111110000000000000---00124577765433223554359999999538985-4788899999998699899999 Q gi|254780592|r 152 GKHSEALIEYERALELSPN---ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD-SRIRQNLALVVGLQGRMKEAYS 227 (271) Q Consensus 152 ~~~~~A~~~~~~~~~~~p~---~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~-~~~~~~La~~~~~~g~~~eA~~ 227 (271) |+++.+......++..... -....+++|.+|..+|++++|..+|.++++.+|++ .-.++.+|+++...|.+.+|.. T Consensus 284 ~dy~~v~~la~~ai~~t~~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~GlgQm~i~~~dle~s~~ 363 (1018) T KOG2002 284 KDYERVWHLAEHAIKNTENKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYIKRGDLEESKF 363 (1018) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCHHHHHHH T ss_conf 53789999999999861014777788999999988622689999999999724888762001212578987132778899 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 99986291728665999999998 Q gi|254780592|r 228 IASQELSPEEATRNIKYIKSILS 250 (271) Q Consensus 228 ~~~~~L~~~~~~~~~~~l~~~l~ 250 (271) ++++++...|++.....+.+.+. T Consensus 364 ~fEkv~k~~p~~~etm~iLG~Ly 386 (1018) T KOG2002 364 CFEKVLKQLPNNYETMKILGCLY 386 (1018) T ss_pred HHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99999986765088999988678 No 18 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=99.81 E-value=1.9e-18 Score=130.70 Aligned_cols=194 Identities=18% Similarity=0.087 Sum_probs=177.5 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHH Q ss_conf 79999999999874100124466655663275210013555532100000112588764221100002335669999999 Q gi|254780592|r 70 KMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLA 149 (271) Q Consensus 70 ~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~ 149 (271) ..+.+.+|.-|.+.|++.+|...++++++.+|++..++..++.+|...|+.+.|.+.|++++..+|++.+.+++.|..+. T Consensus 35 a~arlqLal~YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdVLNNYG~FLC 114 (250) T COG3063 35 AKARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLC 114 (250) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 99999999999977899999988999988495208899999999999077556799999998529886405521069997 Q ss_pred HHHHHHHHHHHHHHCCCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 742111110000000000--000001245777654332235543599999995389854788899999998699899999 Q gi|254780592|r 150 QMGKHSEALIEYERALEL--SPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYS 227 (271) Q Consensus 150 ~~~~~~~A~~~~~~~~~~--~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~ 227 (271) .+|++++|...|++++.. .+.....+.|+|.+..++|+.+.|..+|+++++.+|+++.....++......|++.+|.. T Consensus 115 ~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~l~~a~~~~~~~~y~~Ar~ 194 (250) T COG3063 115 AQGRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYKAGDYAPARL 194 (250) T ss_pred HCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHH T ss_conf 08986899999999871888777530454457888643770447999999988375787678998798875313018999 Q ss_pred HHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999862917286659999999983212999999997 Q gi|254780592|r 228 IASQELSPEEATRNIKYIKSILSQRDPWKKIAKARS 263 (271) Q Consensus 228 ~~~~~L~~~~~~~~~~~l~~~l~~~~~~~~~a~a~~ 263 (271) .+++....-+..+...++..-+.++-|-...+..+. T Consensus 195 ~~~~~~~~~~~~A~sL~L~iriak~~gd~~~a~~Y~ 230 (250) T COG3063 195 YLERYQQRGGAQAESLLLGIRIAKRLGDRAAAQRYQ 230 (250) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 999997025665899998999999965488999999 No 19 >PRK11788 hypothetical protein; Provisional Probab=99.80 E-value=6.2e-17 Score=121.46 Aligned_cols=205 Identities=17% Similarity=0.091 Sum_probs=177.6 Q ss_pred CCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC---C-CCHHHHHHH Q ss_conf 331179999998420199999999999995757879999999999874100124466655663275---2-100135555 Q gi|254780592|r 36 ISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYP---E-DQEVLAAYG 111 (271) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p---~-~~~~~~~~~ 111 (271) .+....+..+++-+++.++|++.|-.+++.||+..++++.+|..+++.|+.++|+.+-+.++.... . ...++..+| T Consensus 35 l~~~Y~~Gln~Ll~~q~dkAid~f~~~~~~~~~t~e~hlaLG~LfRrrGE~dRAIriHq~Ll~rp~L~~~~r~~a~~ELa 114 (389) T PRK11788 35 LSRDYFKGLNFLLNEQPDKAIDLFIEMLKVDPETVELHLALGNLFRRRGEVDRAIRIHQNLLSRPDLTREQRLLALQELG 114 (389) T ss_pred CCHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 66999989988725894599999999986481149999999999996560889999999997287989999999999999 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH-----HHHHHHHHHHHHHH Q ss_conf 3210000011258876422110000233566999999974211111000000000000000-----12457776543322 Q gi|254780592|r 112 KSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNES-----SIVSNIAMSYLLMG 186 (271) Q Consensus 112 ~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~-----~~~~~~~~~~~~~g 186 (271) .-|...|-+|.|...|.+....++....+...+..+|...+++++|++..++.......+. -.+..++.-....| T Consensus 115 ~Dy~~aGllDRAE~~f~~l~~~~~~~~~al~~Ll~iye~~~dW~~Ai~~a~~L~~~~~~~~~~~iAhy~CElA~~~~~~~ 194 (389) T PRK11788 115 QDYLKAGLLDRAEELFLQLVDEDDFAEGALRQLLTIYQQEKDWQKAIEVAERLEKLGGDSLRKEIAHFYCELAQQALARG 194 (389) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCC T ss_conf 99986774999999999998285426999999999999864599999999999870476506999999999999998748 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf 355435999999953898547888999999986998999999998629172866 Q gi|254780592|r 187 DLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 187 ~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) +.++|...+++|++.+|.+.+++..+|.++...|++++|...++++...+|.-. T Consensus 195 ~~~~A~~~L~kAL~~dp~cvRA~lllg~l~~~~g~~~~Ai~~l~~i~~q~p~~l 248 (389) T PRK11788 195 DLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQDPEYL 248 (389) T ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHH T ss_conf 999999999999874824341899999999987999999999999997395779 No 20 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=99.80 E-value=1.7e-18 Score=130.95 Aligned_cols=232 Identities=17% Similarity=0.136 Sum_probs=185.2 Q ss_pred CCCCCCHHHH-HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHH Q ss_conf 8543311799-999984201999999999999957578799999999998741001244666556632752100135555 Q gi|254780592|r 33 NPRISIPDPE-SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYG 111 (271) Q Consensus 33 ~~~~~~~~~~-~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~ 111 (271) .|......+. +.-.+..++-+++.+.|++|++.||+|+++++.+|....-.|++++|-.-|+++++++|+....++.+| T Consensus 393 ~P~~~qSYI~~A~~~~~~~~~~~~~~~F~~Al~~~~~~~~~YYHRaQl~FI~~~f~~A~KDy~K~~~LDp~~IF~~IQL~ 472 (649) T TIGR00990 393 DPRVTQSYIKRASINLELGDPDKAEEDFDKALKKNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSYIQLG 472 (649) T ss_pred CCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 78731789999999741578422588999998618898751101778998873077675457766432711022689999 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH------HHHHHHHHHHH- Q ss_conf 32100000112588764221100002335669999999742111110000000000000001------24577765433- Q gi|254780592|r 112 KSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESS------IVSNIAMSYLL- 184 (271) Q Consensus 112 ~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~------~~~~~~~~~~~- 184 (271) .+.++.|.++.....|.++...-|..|+.++..|-++..+.++++|++-|..++++...... -+.+.+.++.+ T Consensus 473 ~~~Y~~~~~~~~~~~F~~~~~~Fp~~P~V~Nyy~ElL~D~~~F~~A~~~fD~AI~LE~~~~~~~~~v~PL~~Ka~~L~~n 552 (649) T TIGR00990 473 VLQYKLGSIASSMALFRRCKKNFPEAPDVYNYYGELLLDKNKFDEAIEKFDLAIELEKELKPIYVNVLPLINKALALFQN 552 (649) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHC T ss_conf 98863034434789999998617788885305788861436799999985689998750598076313367789997626 Q ss_pred --HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC--CCCCH--HHHH---HHHHHHHHHHH Q ss_conf --223554359999999538985478889999999869989999999986291--72866--5999---99999832129 Q gi|254780592|r 185 --MGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSP--EEATR--NIKY---IKSILSQRDPW 255 (271) Q Consensus 185 --~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~--~~~~~--~~~~---l~~~l~~~~~~ 255 (271) ..+|.+|...+++|+.++|....+...||++..++|+.+||+.+|+++-.. ...+. -+.| .+..+.-+..| T Consensus 553 P~~~~f~EA~~L~EKA~~~DP~~~~A~~~lAQl~LQQ~~v~EAl~~FE~~A~LART~~E~~~Ai~yAEA~~~Q~~v~~~Y 632 (649) T TIGR00990 553 PWKQDFIEAENLLEKALIIDPECDIAVATLAQLLLQQGDVDEALKLFERAAELARTEGELVQAISYAEATRVQIQVQEDY 632 (649) T ss_pred CCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 52100789999988863048841478999999876234689999999877876301677898988888988889987314 Q ss_pred HHHHHHHHH Q ss_conf 999999971 Q gi|254780592|r 256 KKIAKARSN 264 (271) Q Consensus 256 ~~~a~a~~~ 264 (271) +.+|+-.+- T Consensus 633 P~LA~Kl~~ 641 (649) T TIGR00990 633 PELASKLQG 641 (649) T ss_pred HHHHHHHHH T ss_conf 278999875 No 21 >KOG1129 consensus Probab=99.79 E-value=3.4e-18 Score=129.15 Aligned_cols=219 Identities=15% Similarity=0.115 Sum_probs=200.7 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 99842019999999999999575787999999999987410012446665566327521001355553210000011258 Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) -++..|.+.+|+..++..++..| .++.+.-++.+|.+..++..|+.++.+.++..|.+...+...++++..+++.++|. T Consensus 232 CylrLgm~r~AekqlqssL~q~~-~~dTfllLskvY~ridQP~~AL~~~~~gld~fP~~VT~l~g~ARi~eam~~~~~a~ 310 (478) T KOG1129 232 CYLRLGMPRRAEKQLQSSLTQFP-HPDTFLLLSKVYQRIDQPERALLVIGEGLDSFPFDVTYLLGQARIHEAMEQQEDAL 310 (478) T ss_pred HHHHHCCHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99981682666899998764377-61699999999987235889999986465238832566663699999997078899 Q ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC- Q ss_conf 8764221100002335669999999742111110000000000000001245777654332235543599999995389- Q gi|254780592|r 125 DAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG- 203 (271) Q Consensus 125 ~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p- 203 (271) +.|+.+++.+|.+.++..-+|.-|..-++++-|+..|++.+.....+++.+.|+|.+-+..++++-++..|++|++.-. T Consensus 311 ~lYk~vlk~~~~nvEaiAcia~~yfY~~~PE~AlryYRRiLqmG~~speLf~NigLCC~yaqQ~D~~L~sf~RAlstat~ 390 (478) T KOG1129 311 QLYKLVLKLHPINVEAIACIAVGYFYDNNPEMALRYYRRILQMGAQSPELFCNIGLCCLYAQQIDLVLPSFQRALSTATQ 390 (478) T ss_pred HHHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC T ss_conf 99999984087650035665530013898689999999999832777677765888987540030317889999863368 Q ss_pred --CCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf --854788899999998699899999999862917286659999999983212999999997114 Q gi|254780592|r 204 --ADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDPWKKIAKARSNHN 266 (271) Q Consensus 204 --~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~~~~a~a~~~~~ 266 (271) .-+++|+|||.+....|++.-|..+++-+|..++.+.......+++..+.| ++..+|+..+ T Consensus 391 ~~~aaDvWYNlg~vaV~iGD~nlA~rcfrlaL~~d~~h~ealnNLavL~~r~G--~i~~Arsll~ 453 (478) T KOG1129 391 PGQAADVWYNLGFVAVTIGDFNLAKRCFRLALTSDAQHGEALNNLAVLAARSG--DILGARSLLN 453 (478) T ss_pred CCHHHHHHHCCCEEEEECCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC--CHHHHHHHHH T ss_conf 64156666400116770342488888889875058523888876899885127--6387899998 No 22 >KOG0624 consensus Probab=99.79 E-value=1.2e-16 Score=119.69 Aligned_cols=207 Identities=16% Similarity=0.134 Sum_probs=185.4 Q ss_pred CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 43311799999984201999999999999957578799999999998741001244666556632752100135555321 Q gi|254780592|r 35 RISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSL 114 (271) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~ 114 (271) .+...+.-..+.+..+++..|+..|..+++.||+|..++++.|.+|..+|+..-|+.-+.+++++.|+-..+....|.++ T Consensus 37 dvekhlElGk~lla~~Q~sDALt~yHaAve~dp~~Y~aifrRaT~yLAmGksk~al~Dl~rVlelKpDF~~ARiQRg~vl 116 (504) T KOG0624 37 DVEKHLELGKELLARGQLSDALTHYHAAVEGDPNNYQAIFRRATVYLAMGKSKAALQDLSRVLELKPDFMAARIQRGVVL 116 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCHHHHHHHHCHHH T ss_conf 99999998899997346778999988887078066999998777776525774003308999861831888888722035 Q ss_pred HHHCCHHHHHHHHHHHHHHHCHHHH---HHHHH------------HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 0000011258876422110000233---56699------------99999742111110000000000000001245777 Q gi|254780592|r 115 ANAGYLDEGLDAINRAQRPDIPDWQ---LISAK------------GSVLAQMGKHSEALIEYERALELSPNESSIVSNIA 179 (271) Q Consensus 115 ~~~g~~~~A~~~~~~a~~~~p~~~~---~~~~l------------g~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~ 179 (271) +.+|+++.|...|+..+..+|+... ....+ ...+...|++..++......+++.|=+.+.+...+ T Consensus 117 lK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~ql~s~~~~GD~~~ai~~i~~llEi~~Wda~l~~~Ra 196 (504) T KOG0624 117 LKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQQLKSASGSGDCQNAIEMITHLLEIQPWDASLRQARA 196 (504) T ss_pred HHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 53252888987999997259873026889988776899999999999884387465599999888853713368999998 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHH Q ss_conf 65433223554359999999538985478889999999869989999999986291728665 Q gi|254780592|r 180 MSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRN 241 (271) Q Consensus 180 ~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~ 241 (271) .+|+..|++..|+..++.+..++.++++..+.....++..|+..++....+.+|..+|.+.. T Consensus 197 kc~i~~~e~k~AI~Dlk~askLs~DnTe~~ykis~L~Y~vgd~~~sL~~iRECLKldpdHK~ 258 (504) T KOG0624 197 KCYIAEGEPKKAIHDLKQASKLSQDNTEGHYKISQLLYTVGDAENSLKEIRECLKLDPDHKL 258 (504) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 88885386888888899987404665388898998997661188789999999806953336 No 23 >KOG1173 consensus Probab=99.79 E-value=1.1e-17 Score=126.10 Aligned_cols=210 Identities=15% Similarity=0.101 Sum_probs=188.2 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHH Q ss_conf 99999842019999999999999575787999999999987410012446665566327521001355553210000011 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLD 121 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~ 121 (271) -+.=++..+.+.+|..+|.|+...||.-..+|+.+|.++.-.|+.+||++.+..|-++.|..-.-..-+|.-|.+++++. T Consensus 318 Vg~YYl~i~k~seARry~SKat~lD~~fgpaWl~fghsfa~e~EhdQAmaaY~tAarl~~G~hlP~LYlgmey~~t~n~k 397 (611) T KOG1173 318 VGCYYLMIGKYSEARRYFSKATTLDPTFGPAWLAFGHSFAGEGEHDQAMAAYFTAARLMPGCHLPSLYLGMEYMRTNNLK 397 (611) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHH T ss_conf 77899981486799999888760376532799987677650105788999999999866677631789999998742398 Q ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-------HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 258876422110000233566999999974211111000000000000-------0001245777654332235543599 Q gi|254780592|r 122 EGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSP-------NESSIVSNIAMSYLLMGDLKTAEEK 194 (271) Q Consensus 122 ~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p-------~~~~~~~~~~~~~~~~g~~~~A~~~ 194 (271) -|...|..+....|.++-.+..+|.+....+.+.+|...|..++...+ ...+.+.|+|.++.+.+.+++|+.+ T Consensus 398 LAe~Ff~~A~ai~P~Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~ 477 (611) T KOG1173 398 LAEKFFKQALAIAPSDPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDY 477 (611) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999871189860443314420332754688999999998876630543221157787678999997667999999 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH-HHHHHHHH Q ss_conf 9999953898547888999999986998999999998629172866599-99999983 Q gi|254780592|r 195 LRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK-YIKSILSQ 251 (271) Q Consensus 195 ~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~-~l~~~l~~ 251 (271) +++++...|.+++++..+|.++..+|+++.|+..+.++|..+|.+.... .++..+.. T Consensus 478 ~q~aL~l~~k~~~~~asig~iy~llgnld~Aid~fhKaL~l~p~n~~~~~lL~~aie~ 535 (611) T KOG1173 478 YQKALLLSPKDASTHASIGYIYHLLGNLDKAIDHFHKALALKPDNIFISELLKLAIED 535 (611) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 9999871877656888899999996580779999999871487528999999999975 No 24 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=99.78 E-value=1.3e-17 Score=125.60 Aligned_cols=214 Identities=13% Similarity=0.044 Sum_probs=188.3 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 84201999999999999957578799999999998741001244666556632752100135555321000001125887 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDA 126 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~ 126 (271) .-.++-.+|...+.++++.+|.-...++..|.+....|..+++.+.|++++..+++++.+|..+|....=.|+|++|.+. T Consensus 374 ~L~~~~~~A~~Dl~K~IEL~P~~~qSYI~~A~~~~~~~~~~~~~~~F~~Al~~~~~~~~~YYHRaQl~FI~~~f~~A~KD 453 (649) T TIGR00990 374 CLKGKHLEALADLSKSIELDPRVTQSYIKRASINLELGDPDKAEEDFDKALKKNSEDPDIYYHRAQLHFIKGEFAQAGKD 453 (649) T ss_pred HHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCEEHHHHHHHHHHHHHHHHHH T ss_conf 31288157899877654227873178999999974157842258899999861889875110177899887307767545 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC- Q ss_conf 6422110000233566999999974211111000000000000000124577765433223554359999999538985- Q gi|254780592|r 127 INRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD- 205 (271) Q Consensus 127 ~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~- 205 (271) |+++..+||.....+..||..-.+.|..+.....|+++....|..|++++..|.++..++++++|+.-|..|+++.... T Consensus 454 y~K~~~LDp~~IF~~IQL~~~~Y~~~~~~~~~~~F~~~~~~Fp~~P~V~Nyy~ElL~D~~~F~~A~~~fD~AI~LE~~~~ 533 (649) T TIGR00990 454 YQKSIDLDPDFIFSYIQLGVLQYKLGSIASSMALFRRCKKNFPEAPDVYNYYGELLLDKNKFDEAIEKFDLAIELEKELK 533 (649) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHCHHHHHHHHHHHHHHHHHHHCCC T ss_conf 77664327110226899999886303443478999999861778888530578886143679999998568999875059 Q ss_pred -----HHHHHHHHHHHHH---CCCHHHHHHHHHHHCCCCCCCHHHHHHHH-HHHHHHHHHHHHH Q ss_conf -----4788899999998---69989999999986291728665999999-9983212999999 Q gi|254780592|r 206 -----SRIRQNLALVVGL---QGRMKEAYSIASQELSPEEATRNIKYIKS-ILSQRDPWKKIAK 260 (271) Q Consensus 206 -----~~~~~~La~~~~~---~g~~~eA~~~~~~~L~~~~~~~~~~~l~~-~l~~~~~~~~~a~ 260 (271) .-.+.+-+.++.+ ..+|.||..++++++..+|.......-.+ ++-|++.-.++.+ T Consensus 534 ~~~~~v~PL~~Ka~~L~~nP~~~~f~EA~~L~EKA~~~DP~~~~A~~~lAQl~LQQ~~v~EAl~ 597 (649) T TIGR00990 534 PIYVNVLPLINKALALFQNPWKQDFIEAENLLEKALIIDPECDIAVATLAQLLLQQGDVDEALK 597 (649) T ss_pred CEEEECHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8076313367789997626521007899999888630488414789999998762346899999 No 25 >KOG1125 consensus Probab=99.74 E-value=1.6e-16 Score=118.92 Aligned_cols=200 Identities=22% Similarity=0.232 Sum_probs=160.6 Q ss_pred HHCCCCC-CCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC------ Q ss_conf 4036854-331179999998420199999999999995757879999999999874100124466655663275------ Q gi|254780592|r 29 SCHLNPR-ISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYP------ 101 (271) Q Consensus 29 ~~~~~~~-~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p------ 101 (271) +...+|. ......-.+.+...++=..|+..++++++.+|+|.++++.+|..|...|.-.+|+.++++++..+| T Consensus 311 AVkqdP~haeAW~~LG~~qaENE~E~~ai~AL~rcl~LdP~NleaLmaLAVSytNeg~q~~Al~~L~~Wi~~~p~y~~l~ 390 (579) T KOG1125 311 AVKQDPQHAEAWQKLGITQAENENEQNAISALRRCLELDPTNLEALMALAVSYTNEGLQNQALKMLDKWIRNKPKYVHLV 390 (579) T ss_pred HHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCC T ss_conf 98619287899999615765034208899999999846995199999988888614668889999999998486211114 Q ss_pred -------------------------------------CCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHH Q ss_conf -------------------------------------2100135555321000001125887642211000023356699 Q gi|254780592|r 102 -------------------------------------EDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAK 144 (271) Q Consensus 102 -------------------------------------~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~l 144 (271) .++.++..+|.+|.-.|+|+.|...|+.++...|++...|+.+ T Consensus 391 ~a~~~~~~~~~~s~~~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~efdraiDcf~~AL~v~Pnd~~lWNRL 470 (579) T KOG1125 391 SAGENEDFENTKSFLDSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGEFDRAVDCFEAALQVKPNDYLLWNRL 470 (579) T ss_pred CCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 66766566677677887899999999999998577888842776267897401478889999999970498417789886 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----------CHHHHHHHHH Q ss_conf 999997421111100000000000000012457776543322355435999999953898----------5478889999 Q gi|254780592|r 145 GSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA----------DSRIRQNLAL 214 (271) Q Consensus 145 g~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~----------~~~~~~~La~ 214 (271) |..+.--.+.++|+..|.+++++.|....+++|+|..++..|.|++|+.+|-.|+.+.+. +..+|..|-. T Consensus 471 GAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNlG~ykEA~~hlL~AL~mq~ks~~~~~~~~~se~iw~tLR~ 550 (579) T KOG1125 471 GATLANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCMNLGAYKEAVKHLLEALSMQRKSRNHNKAPMASENIWQTLRL 550 (579) T ss_pred HHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHH T ss_conf 68860775539999999999841887265301014566205669999999999998653155544477525789999999 Q ss_pred HHHHCCCHHHHHHH Q ss_conf 99986998999999 Q gi|254780592|r 215 VVGLQGRMKEAYSI 228 (271) Q Consensus 215 ~~~~~g~~~eA~~~ 228 (271) ++..+++.|-+... T Consensus 551 als~~~~~D~l~~a 564 (579) T KOG1125 551 ALSAMNRSDLLQEA 564 (579) T ss_pred HHHHCCCCHHHHHH T ss_conf 99970881678874 No 26 >KOG2076 consensus Probab=99.74 E-value=1.1e-15 Score=113.89 Aligned_cols=225 Identities=17% Similarity=0.210 Sum_probs=168.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC Q ss_conf 79999998420199999999999995757879999999999874100124466655663275210013555532100000 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~ 119 (271) ..++......|++++|+..+..+++.+|.++.++..+|.+|.+.|+.+++...+-.+..++|.+...|..++....++|+ T Consensus 143 l~eAN~lfarg~~eeA~~i~~EvIkqdp~~~~ay~tL~~IyEqrGd~eK~l~~~llAAHL~p~d~e~W~~ladls~~~~~ 222 (895) T KOG2076 143 LGEANNLFARGDLEEAEEILMEVIKQDPRNPIAYYTLGEIYEQRGDIEKALNFWLLAAHLNPKDYELWKRLADLSEQLGN 222 (895) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC T ss_conf 99988999807789999999999972736513699999999980329999999999996699886899999999986012 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC------------------------------ Q ss_conf 11258876422110000233566999999974211111000000000000------------------------------ Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSP------------------------------ 169 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p------------------------------ 169 (271) +..|.-.|.+++..+|.+++....++.+|.+.|+...|+..|.+.+...| T Consensus 223 i~qA~~cy~rAI~~~p~n~~~~~ers~L~~~~G~~~~Am~~f~~l~~~~p~~d~er~~d~i~~~~~~~~~~~~~e~a~~~ 302 (895) T KOG2076 223 INQARYCYSRAIQANPSNWELIYERSSLYQKTGDLKRAMETFLQLLQLDPPVDIERIEDLIRRVAHYFITHNERERAAKA 302 (895) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999860886228999999999984707888999999986289015899999999999999971078999999 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q gi|254780592|r 170 -------------------------------------------------------------------------------- 169 (271) Q Consensus 170 -------------------------------------------------------------------------------- 169 (271) T Consensus 303 le~~~s~~~~~~~~ed~ni~ael~l~~~q~d~~~~~i~~~~~r~~e~d~~e~~~~~~~~~~~~~~~~~~~~~s~~l~v~r 382 (895) T KOG2076 303 LEGALSKEKDEASLEDLNILAELFLKNKQSDKALMKIVDDRNRESEKDDSEWDTDERRREEPNALCEVGKELSYDLRVIR 382 (895) T ss_pred HHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHH T ss_conf 99777410340356278999999998678887669999986013678716540022034555320147877775630576 Q ss_pred -----------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHC Q ss_conf -----------------------------00012457776543322355435999999953898-547888999999986 Q gi|254780592|r 170 -----------------------------NESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA-DSRIRQNLALVVGLQ 219 (271) Q Consensus 170 -----------------------------~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~-~~~~~~~La~~~~~~ 219 (271) +.++.+..++.++..-|++.+|+.+|..++...+. +..+|+.+|.+|..+ T Consensus 383 l~icL~~L~~~e~~e~ll~~l~~~n~~~~d~~dL~~d~a~al~~~~~~~~Al~~l~~i~~~~~~~~~~vw~~~a~c~~~l 462 (895) T KOG2076 383 LMICLVHLKERELLEALLHFLVEDNVWVSDDVDLYLDLADALTNIGKYKEALRLLSPITNREGYQNAFVWYKLARCYMEL 462 (895) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH T ss_conf 75432002443458889888888337845557899999999983655888999999986483335513469899999987 Q ss_pred CCHHHHHHHHHHHCCCCCCCHHHHH-HHHHHHHHHHHHHHHHHHHH Q ss_conf 9989999999986291728665999-99999832129999999971 Q gi|254780592|r 220 GRMKEAYSIASQELSPEEATRNIKY-IKSILSQRDPWKKIAKARSN 264 (271) Q Consensus 220 g~~~eA~~~~~~~L~~~~~~~~~~~-l~~~l~~~~~~~~~a~a~~~ 264 (271) |.+++|...|+++|...|.+-++.. +-.+..+.+...++..+..+ T Consensus 463 ~e~e~A~e~y~kvl~~~p~~~D~Ri~Lasl~~~~g~~EkalEtL~~ 508 (895) T KOG2076 463 GEYEEAIEFYEKVLILAPDNLDARITLASLYQQLGNHEKALETLEQ 508 (895) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 4289999999999952898601663199999763888899998713 No 27 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=99.73 E-value=2.2e-15 Score=111.88 Aligned_cols=131 Identities=22% Similarity=0.258 Sum_probs=79.0 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH--------------HHHHH Q ss_conf 2100000112588764221100002335669999999742111110000000000000001--------------24577 Q gi|254780592|r 113 SLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESS--------------IVSNI 178 (271) Q Consensus 113 ~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~--------------~~~~~ 178 (271) .....|+...|...|.+++...|++++++-.+|.++.+.|++.+|...|+++....|+... ..... T Consensus 279 ~aL~~G~l~aAe~~l~~AL~~~PnD~daLGgLG~v~lRqG~~~eA~~yf~rA~~~d~~~~~~~kW~sll~ta~yW~~l~q 358 (1158) T PRK11447 279 AAVDSGMGGKAIAELQQAVRANPKDSEALGALGQAYSQRGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRYWLLIQQ 358 (1158) T ss_pred HHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99977997889999999997199987899888999996588899999999999708998546889999988789999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 76543322355435999999953898547888999999986998999999998629172866599 Q gi|254780592|r 179 AMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 179 ~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~ 243 (271) |......|++.+|...++++...+|+++.+...||.++...|++.+|+..|+.+|..++.+.... T Consensus 359 a~~A~~~g~~~~A~~l~~~A~~~~p~~~~A~l~LaD~~~a~~d~~~AE~~Yr~~L~~~~~N~~Al 423 (1158) T PRK11447 359 GDAALKANNLAQAERLYQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQTLRMDSGNTNAV 423 (1158) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 99999879989999999999855999855999999999983997999999999997489987899 No 28 >KOG2002 consensus Probab=99.72 E-value=1.6e-15 Score=112.82 Aligned_cols=229 Identities=11% Similarity=0.096 Sum_probs=141.4 Q ss_pred HHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH---CCHHHHHHHHHHHHCCC Q ss_conf 998540368543311799999984201999999999999957578799999999998741---00124466655663275 Q gi|254780592|r 25 LLFSSCHLNPRISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVG---RTAQALAVMRQVAILYP 101 (271) Q Consensus 25 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g---~~~~A~~~~~~al~~~p 101 (271) .+...|.+.+++... .--+.-++.+.|...|.++++.||.+..++..+|.+-.... .+..++..+.++-..+| T Consensus 192 ~inp~~~aD~rIgig----~Cf~kl~~~~~a~~a~~ralqLdp~~v~alv~L~~~~l~~~d~~s~~~~~~ll~~ay~~n~ 267 (1018) T KOG2002 192 RINPACKADVRIGIG----HCFWKLGMSEKALLAFERALQLDPTCVSALVALGEVDLNFNDSDSYKKGVQLLQRAYKENN 267 (1018) T ss_pred HCCCCCCCCCCCHHH----HHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC T ss_conf 508666787631146----6787503436679999999850715599999999999870156778889999888876068 Q ss_pred CCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHH---HCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH-HHHHHH Q ss_conf 21001355553210000011258876422110---00023356699999997421111100000000000000-012457 Q gi|254780592|r 102 EDQEVLAAYGKSLANAGYLDEGLDAINRAQRP---DIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNE-SSIVSN 177 (271) Q Consensus 102 ~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~---~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~-~~~~~~ 177 (271) ++|.++..++.-+...|+++.+......++.. .+..++.++++|.++..+|++++|..+|..++..+++. .-.+.. T Consensus 268 ~nP~~l~~LAn~fyfK~dy~~v~~la~~ai~~t~~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~G 347 (1018) T KOG2002 268 ENPVALNHLANHFYFKKDYERVWHLAEHAIKNTENKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVG 347 (1018) T ss_pred CCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCC T ss_conf 99579999977774135378999999999986101477778899999998862268999999999972488876200121 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC----CHHHHHHHHHHHCCCCCCCHHHHHHHH-HHHHH Q ss_conf 7765433223554359999999538985478889999999869----989999999986291728665999999-99832 Q gi|254780592|r 178 IAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQG----RMKEAYSIASQELSPEEATRNIKYIKS-ILSQR 252 (271) Q Consensus 178 ~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g----~~~eA~~~~~~~L~~~~~~~~~~~l~~-~l~~~ 252 (271) +|+.++..|+++.|+.+|++.+...|++.+...-||-.|...+ ..+.|..++.+++.+.|.+.+...-.+ ++.+. T Consensus 348 lgQm~i~~~dle~s~~~fEkv~k~~p~~~etm~iLG~Lya~~~~~~~~~d~a~~~l~K~~~~~~~d~~a~l~laql~e~~ 427 (1018) T KOG2002 348 LGQMYIKRGDLEESKFCFEKVLKQLPNNYETMKILGCLYAHSAKKQEKRDKASNVLGKVLEQTPVDSEAWLELAQLLEQT 427 (1018) T ss_pred HHHHHHHHCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 25789871327788999999998676508899998867886401257789999999887750534088899999999812 Q ss_pred HHHHH Q ss_conf 12999 Q gi|254780592|r 253 DPWKK 257 (271) Q Consensus 253 ~~~~~ 257 (271) +.|.. T Consensus 428 d~~~s 432 (1018) T KOG2002 428 DPWAS 432 (1018) T ss_pred CHHHH T ss_conf 76889 No 29 >KOG1129 consensus Probab=99.72 E-value=1.4e-16 Score=119.21 Aligned_cols=200 Identities=17% Similarity=0.135 Sum_probs=183.1 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 99998420199999999999995757879999999999874100124466655663275210013555532100000112 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) +..+.+..+...|...+..-+++-|.|...+...|.++..+++.+.|+++++.+++.+|.+.++..-++..|.--++.+- T Consensus 263 skvY~ridQP~~AL~~~~~gld~fP~~VT~l~g~ARi~eam~~~~~a~~lYk~vlk~~~~nvEaiAcia~~yfY~~~PE~ 342 (478) T KOG1129 263 SKVYQRIDQPERALLVIGEGLDSFPFDVTYLLGQARIHEAMEQQEDALQLYKLVLKLHPINVEAIACIAVGYFYDNNPEM 342 (478) T ss_pred HHHHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCHHH T ss_conf 99998723588999998646523883256666369999999707889999999984087650035665530013898689 Q ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 588764221100002335669999999742111110000000000000---00124577765433223554359999999 Q gi|254780592|r 123 GLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN---ESSIVSNIAMSYLLMGDLKTAEEKLRFAS 199 (271) Q Consensus 123 A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~---~~~~~~~~~~~~~~~g~~~~A~~~~~~al 199 (271) |+.+|++.+...-..++.+.++|.|...-++++-+..+|.+++....+ ..++|+|+|...+.-||+..|.++|+-++ T Consensus 343 AlryYRRiLqmG~~speLf~NigLCC~yaqQ~D~~L~sf~RAlstat~~~~aaDvWYNlg~vaV~iGD~nlA~rcfrlaL 422 (478) T KOG1129 343 ALRYYRRILQMGAQSPELFCNIGLCCLYAQQIDLVLPSFQRALSTATQPGQAADVWYNLGFVAVTIGDFNLAKRCFRLAL 422 (478) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCHHHHHHHCCCEEEEECCCHHHHHHHHHHHH T ss_conf 99999999983277767776588898754003031788999986336864156666400116770342488888889875 Q ss_pred HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHH Q ss_conf 5389854788899999998699899999999862917286659 Q gi|254780592|r 200 QMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNI 242 (271) Q Consensus 200 ~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~ 242 (271) ..+|++.+++.|||..-...|+.++|..+++.+-+..|.-.+. T Consensus 423 ~~d~~h~ealnNLavL~~r~G~i~~Arsll~~A~s~~P~m~E~ 465 (478) T KOG1129 423 TSDAQHGEALNNLAVLAARSGDILGARSLLNAAKSVMPDMAEV 465 (478) T ss_pred CCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCC T ss_conf 0585238888768998851276387899998765418530035 No 30 >KOG1173 consensus Probab=99.72 E-value=1.4e-15 Score=113.09 Aligned_cols=205 Identities=13% Similarity=0.053 Sum_probs=145.0 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 98420199999999999995757879999999999874100124466655663275210013555532100000112588 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLD 125 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~ 125 (271) ....++..+--..-.+.++.+|+.+..|+..|.-|...|++.+|..+|-|+..++|....+|..+|..+...|+-+.|+. T Consensus 288 l~el~~~n~Lf~lsh~LV~~yP~~a~sW~aVg~YYl~i~k~seARry~SKat~lD~~fgpaWl~fghsfa~e~EhdQAma 367 (611) T KOG1173 288 LYELGKSNKLFLLSHKLVDLYPSKALSWFAVGCYYLMIGKYSEARRYFSKATTLDPTFGPAWLAFGHSFAGEGEHDQAMA 367 (611) T ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHH T ss_conf 99816521699988879872888773146577899981486799999888760376532799987677650105788999 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC- Q ss_conf 7642211000023356699999997421111100000000000000012457776543322355435999999953898- Q gi|254780592|r 126 AINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA- 204 (271) Q Consensus 126 ~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~- 204 (271) .|..+.+.-|....-...+|.-|.+.++...|...|..++.+.|+++-++..+|......+.+.+|..+|+++++..++ T Consensus 368 aY~tAarl~~G~hlP~LYlgmey~~t~n~kLAe~Ff~~A~ai~P~Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~ 447 (611) T KOG1173 368 AYFTAARLMPGCHLPSLYLGMEYMRTNNLKLAEKFFKQALAIAPSDPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSV 447 (611) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHEEEHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999986667763178999999874239899999999871189860443314420332754688999999998876630 Q ss_pred ---C---HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf ---5---478889999999869989999999986291728665999999998 Q gi|254780592|r 205 ---D---SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILS 250 (271) Q Consensus 205 ---~---~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~ 250 (271) . ...+.|||.+++.++++++|+..++++|...|.+.+..---+++. T Consensus 448 ~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~k~~~~~asig~iy 499 (611) T KOG1173 448 LNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQKALLLSPKDASTHASIGYIY 499 (611) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 5432211577876789999976679999999999871877656888899999 No 31 >KOG2003 consensus Probab=99.70 E-value=1.5e-15 Score=112.91 Aligned_cols=192 Identities=12% Similarity=0.058 Sum_probs=146.3 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 42019999999999999575787999999999987410012446665566327521001355553210000011258876 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAI 127 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~ 127 (271) .+|+++.|.+.|+.++..|....+++++.|..+...|+.++|+.+|-++-.+--++.+++..++.+|..+.+...|.+++ T Consensus 502 ~ngd~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~klh~il~nn~evl~qianiye~led~aqaie~~ 581 (840) T KOG2003 502 ANGDLDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKLHAILLNNAEVLVQIANIYELLEDPAQAIELL 581 (840) T ss_pred ECCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHH T ss_conf 15868999999999971742889999971123987447899999999999999702999999999999854879999999 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 42211000023356699999997421111100000000000000012457776543322355435999999953898547 Q gi|254780592|r 128 NRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSR 207 (271) Q Consensus 128 ~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~ 207 (271) -++....|+++..+..+|.+|.+.|+..+|.+++-.+..-.|-+.+...++|..|+...-.+.|+.+|++|.-+.|+-+. T Consensus 582 ~q~~slip~dp~ilskl~dlydqegdksqafq~~ydsyryfp~nie~iewl~ayyidtqf~ekai~y~ekaaliqp~~~k 661 (840) T KOG2003 582 MQANSLIPNDPAILSKLADLYDQEGDKSQAFQCHYDSYRYFPCNIETIEWLAAYYIDTQFSEKAINYFEKAALIQPNQSK 661 (840) T ss_pred HHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 98501588988999999988610364445456643010236762379999998887647789999999888734853789 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf 88899999998699899999999862917286 Q gi|254780592|r 208 IRQNLALVVGLQGRMKEAYSIASQELSPEEAT 239 (271) Q Consensus 208 ~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~ 239 (271) .....|.++...|++..|...|+..-..-|.+ T Consensus 662 wqlmiasc~rrsgnyqka~d~yk~~hrkfped 693 (840) T KOG2003 662 WQLMIASCFRRSGNYQKAFDLYKDIHRKFPED 693 (840) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC T ss_conf 99999999875364888999999988758550 No 32 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=99.68 E-value=4.9e-14 Score=103.68 Aligned_cols=197 Identities=15% Similarity=0.053 Sum_probs=146.9 Q ss_pred HHHHHHHHHHHHHHHCCCCCCC----------CHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHH Q ss_conf 9999999999985403685433----------117999999842019999999999999575787---999999999987 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNPRIS----------IPDPESLENMNHEQLLEVTSTIGLQYQSHTKNK---MIGIVYADVLRR 82 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~---~~~~~~a~~l~~ 82 (271) ..+++++++++++||++.+... .....+.+.++.|++.+|...|+......|..+ .+.+.+|.++.. T Consensus 3 ~~~~~~~~~~~l~sCs~~~~~~~~~~~~~~~~~lY~~a~~~~~~~~y~~A~~~fe~l~~~yp~s~~a~~A~l~~a~a~y~ 82 (235) T TIGR03302 3 LLILLLALLLLLAGCSSKKKKEADPVEERPAEELYNEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYK 82 (235) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 99999999999987389976655555578999999999999987899999999999998689931379999999999997 Q ss_pred HHCCHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHH--------CCHHHHHHHHHHHHHHHCHHHHHH---HHHHHHH Q ss_conf 4100124466655663275210---0135555321000--------001125887642211000023356---6999999 Q gi|254780592|r 83 VGRTAQALAVMRQVAILYPEDQ---EVLAAYGKSLANA--------GYLDEGLDAINRAQRPDIPDWQLI---SAKGSVL 148 (271) Q Consensus 83 ~g~~~~A~~~~~~al~~~p~~~---~~~~~~~~~~~~~--------g~~~~A~~~~~~a~~~~p~~~~~~---~~lg~~~ 148 (271) .|++++|+..+++.+..+|+++ .++...|.++..+ +...+|...++..+...|+..-.- ..+..+. T Consensus 83 ~~~y~~A~~~~~~Fi~~yP~~~~~~ya~y~~a~s~~~~~~~~~rDq~~~~~A~~~f~~~i~~yP~S~ya~dA~~~i~~l~ 162 (235) T TIGR03302 83 SGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYKQIDDSDRDQTAAREAFEAFQELIRRYPNSEYAPDAKKRMDYLR 162 (235) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 64399999999999998877434999999987998873665445808999999999999988899657999999999999 Q ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCHHHH Q ss_conf 974211111000000000000000124577765433223554359999999538985---47888999999986998999 Q gi|254780592|r 149 AQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD---SRIRQNLALVVGLQGRMKEA 225 (271) Q Consensus 149 ~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~---~~~~~~La~~~~~~g~~~eA 225 (271) ..+. .--...|..|...|.+..|+..++.+++.-|+. .+++..++.+|..+|..++| T Consensus 163 ~~LA--------------------~~e~~ia~~Y~k~~~y~aAi~r~~~vl~~Yp~t~~~~eAl~~l~eay~~lg~~~~A 222 (235) T TIGR03302 163 NRLA--------------------GKELYVARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLALGLTDLA 222 (235) T ss_pred HHHH--------------------HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHH T ss_conf 9999--------------------99999999999841709999999999986899851399999999999986998999 Q ss_pred HHHHHHH Q ss_conf 9999986 Q gi|254780592|r 226 YSIASQE 232 (271) Q Consensus 226 ~~~~~~~ 232 (271) ....+.. T Consensus 223 ~~~~~~l 229 (235) T TIGR03302 223 QDAAAVL 229 (235) T ss_pred HHHHHHH T ss_conf 9999999 No 33 >KOG1840 consensus Probab=99.67 E-value=3.3e-14 Score=104.77 Aligned_cols=218 Identities=18% Similarity=0.161 Sum_probs=160.7 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHH--------CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC--------CCCCCH Q ss_conf 999998420199999999999995--------7578799999999998741001244666556632--------752100 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQS--------HTKNKMIGIVYADVLRRVGRTAQALAVMRQVAIL--------YPEDQE 105 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~--------~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~--------~p~~~~ 105 (271) -+..+..+++|+.|+..++.+++. +|.-......+|..|...+++++|+.+|++++.+ +|.... T Consensus 205 La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~ 284 (508) T KOG1840 205 LAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAA 284 (508) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 99999985207789999999999998703766689999999999999974428999999999999999853898789999 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHH--------HCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-------- Q ss_conf 1355553210000011258876422110--------000233566999999974211111000000000000-------- Q gi|254780592|r 106 VLAAYGKSLANAGYLDEGLDAINRAQRP--------DIPDWQLISAKGSVLAQMGKHSEALIEYERALELSP-------- 169 (271) Q Consensus 106 ~~~~~~~~~~~~g~~~~A~~~~~~a~~~--------~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p-------- 169 (271) ++..++..|...|++.+|...+++++.. .|.-...+..++.++...+++++|...+.+++++-- T Consensus 285 ~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~~~~~~~~~~~Eea~~l~q~al~i~~~~~g~~~~ 364 (508) T KOG1840 285 TLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSELAAILQSMNEYEEAKKLLQKALKIYLDAPGEDNV 364 (508) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCH T ss_conf 99889999860477577899999999999985036858999999899999988520568999999999999861464105 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--------CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC------C Q ss_conf 000124577765433223554359999999538--------98547888999999986998999999998629------1 Q gi|254780592|r 170 NESSIVSNIAMSYLLMGDLKTAEEKLRFASQMI--------GADSRIRQNLALVVGLQGRMKEAYSIASQELS------P 235 (271) Q Consensus 170 ~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~--------p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~------~ 235 (271) .....+.++|..|..+|++.+|.+.+++|+++. +....+..+||..+.+.+++.+|...|..+.. + T Consensus 365 ~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~~~~~~~~~~~~~~~l~~la~~~~~~k~~~~a~~l~~~~~~i~~~~g~ 444 (508) T KOG1840 365 NLAKIYANLAELYLKMGKYKEAEELYKKAIQILRELLGKKDYGVGKPLNQLAEAYEELKKYEEAEQLFEEAKDIMKLCGP 444 (508) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC T ss_conf 78989989999999944656799999999999886156767145478999999999834602289999999889998588 Q ss_pred CCCCHHHH--HHHHHHHHHHHHHHHH Q ss_conf 72866599--9999998321299999 Q gi|254780592|r 236 EEATRNIK--YIKSILSQRDPWKKIA 259 (271) Q Consensus 236 ~~~~~~~~--~l~~~l~~~~~~~~~a 259 (271) +..+-... -|.+.+.+.+.++++- T Consensus 445 ~~~~~~~~~~nL~~~Y~~~g~~e~a~ 470 (508) T KOG1840 445 DHPDVTYTYLNLAALYRAQGNYEAAE 470 (508) T ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHH T ss_conf 99746899988999999735688899 No 34 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=99.65 E-value=4.4e-14 Score=103.99 Aligned_cols=201 Identities=17% Similarity=0.124 Sum_probs=136.6 Q ss_pred CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC-----CHHHHH Q ss_conf 433117999999842019999999999999575787999999999987410012446665566327521-----001355 Q gi|254780592|r 35 RISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPED-----QEVLAA 109 (271) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~-----~~~~~~ 109 (271) +.+...-+..+.+-+.+.++|++.|-..++.+|...++++.+|..+++.|+.|+|+.+-+.++ ..|+. ..++.. T Consensus 34 ~Lsr~Yv~GlNfLLs~Q~dKAvdlF~e~l~~d~~t~e~~ltLGnLfRsRGEvDRAIRiHQ~L~-~spdlT~~qr~lAl~q 112 (389) T COG2956 34 RLSRDYVKGLNFLLSNQPDKAVDLFLEMLQEDPETFEAHLTLGNLFRSRGEVDRAIRIHQTLL-ESPDLTFEQRLLALQQ 112 (389) T ss_pred HCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH-CCCCCCHHHHHHHHHH T ss_conf 553888867889861696369999999983581567898898889886350889999999885-2899866899999999 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH-----HHHHHHHHHHH Q ss_conf 5532100000112588764221100002335669999999742111110000000000000001-----24577765433 Q gi|254780592|r 110 YGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESS-----IVSNIAMSYLL 184 (271) Q Consensus 110 ~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~-----~~~~~~~~~~~ 184 (271) +|.-|+..|-+|.|...|........-...+...+-.+|....++++|++..++..++.+.+.. .+..++..+.. T Consensus 113 L~~Dym~aGl~DRAE~~f~~L~de~efa~~AlqqLl~IYQ~treW~KAIe~A~~L~k~~~q~~~~eIAqfyCELAq~~~~ 192 (389) T COG2956 113 LGRDYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALA 192 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 98999985646689999999734634328999999999998368999999999999718852125899999999999740 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 2235543599999995389854788899999998699899999999862917 Q gi|254780592|r 185 MGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 185 ~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) ..+.+.|...+++|++.+|.+.++-..+|.++...|+++.|.+.++.++..+ T Consensus 193 ~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~~Qn 244 (389) T COG2956 193 SSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQN 244 (389) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 3118999999999986282001035454388875266699999999999849 No 35 >KOG1840 consensus Probab=99.65 E-value=1.2e-14 Score=107.51 Aligned_cols=193 Identities=17% Similarity=0.149 Sum_probs=157.9 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHH--------CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC--------CCCCH Q ss_conf 999998420199999999999995--------75787999999999987410012446665566327--------52100 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQS--------HTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILY--------PEDQE 105 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~--------~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~--------p~~~~ 105 (271) -+.-+++.+++.+|+..|++++.. +|.-..++.++|..|...|++++|..++++++++. |+-.. T Consensus 247 ~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~ 326 (508) T KOG1840 247 LALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAA 326 (508) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 99999974428999999999999999853898789999998899998604775778999999999999850368589999 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHH--------HCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC-------- Q ss_conf 1355553210000011258876422110--------000233566999999974211111000000000000-------- Q gi|254780592|r 106 VLAAYGKSLANAGYLDEGLDAINRAQRP--------DIPDWQLISAKGSVLAQMGKHSEALIEYERALELSP-------- 169 (271) Q Consensus 106 ~~~~~~~~~~~~g~~~~A~~~~~~a~~~--------~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p-------- 169 (271) .+..++.++..++++++|...+++++.. ++..+..+.++|.+|...|++++|.+.+++++.... T Consensus 327 ~l~~~~~~~~~~~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~~~~~~~~~~ 406 (508) T KOG1840 327 QLSELAAILQSMNEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQILRELLGKKDY 406 (508) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCH T ss_conf 99899999988520568999999999999861464105789899899999999446567999999999998861567671 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC----C---CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 00012457776543322355435999999953----8---98547888999999986998999999998629 Q gi|254780592|r 170 NESSIVSNIAMSYLLMGDLKTAEEKLRFASQM----I---GADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 170 ~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~----~---p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) .....++++|..+.+.+++.+|...|.++..+ . |+-...+.|||.+|..+|++++|+++.++++. T Consensus 407 ~~~~~l~~la~~~~~~k~~~~a~~l~~~~~~i~~~~g~~~~~~~~~~~nL~~~Y~~~g~~e~a~~~~~~~~~ 478 (508) T KOG1840 407 GVGKPLNQLAEAYEELKKYEEAEQLFEEAKDIMKLCGPDHPDVTYTYLNLAALYRAQGNYEAAEELEEKVLN 478 (508) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 454789999999998346022899999998899985889974689998899999973568889999999989 No 36 >KOG0495 consensus Probab=99.65 E-value=2e-13 Score=99.97 Aligned_cols=215 Identities=12% Similarity=0.036 Sum_probs=180.1 Q ss_pred HHHCCCCCCCCH-HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHH Q ss_conf 540368543311-7999999842019999999999999575787999999999987410012446665566327521001 Q gi|254780592|r 28 SSCHLNPRISIP-DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEV 106 (271) Q Consensus 28 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~ 106 (271) -.|.+.|..... +.-+-+....||..+|...+..+++.+|++.++|+..-.+.....++++|..+|.++-...| ..++ T Consensus 575 kav~~~pkae~lwlM~ake~w~agdv~~ar~il~~af~~~pnseeiwlaavKle~en~e~eraR~llakar~~sg-TeRv 653 (913) T KOG0495 575 KAVEQCPKAEILWLMYAKEKWKAGDVPAARVILDQAFEANPNSEEIWLAAVKLEFENDELERARDLLAKARSISG-TERV 653 (913) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC-CCHH T ss_conf 999738862268998888887517848899999999983898188898988875224648999999999750288-5126 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 35555321000001125887642211000023356699999997421111100000000000000012457776543322 Q gi|254780592|r 107 LAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMG 186 (271) Q Consensus 107 ~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g 186 (271) +.-.+....-+|+.++|...++++++.-|.....|..+|.++.+.++.+.|...|...+..-|.....|..++.+-...| T Consensus 654 ~mKs~~~er~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~cP~~ipLWllLakleEk~~ 733 (913) T KOG0495 654 WMKSANLERYLDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQGTKKCPNSIPLWLLLAKLEEKDG 733 (913) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHC T ss_conf 78776899986407999999999988487558999998689999877999999997025648898129999977888726 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 355435999999953898547888999999986998999999998629172866599 Q gi|254780592|r 187 DLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 187 ~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~ 243 (271) ....|...++++.-.+|.+...|.....+-...|+.++|...+.++|..-|+...+. T Consensus 734 ~~~rAR~ildrarlkNPk~~~lwle~Ir~ElR~Gn~~~a~~lmakALQecp~sg~LW 790 (913) T KOG0495 734 QLVRARSILDRARLKNPKNALLWLESIRMELRAGNKEQAELLMAKALQECPSSGLLW 790 (913) T ss_pred CHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHH T ss_conf 526699998888734998626499999999974889999999999998687522268 No 37 >KOG2003 consensus Probab=99.63 E-value=7.5e-15 Score=108.68 Aligned_cols=212 Identities=14% Similarity=0.081 Sum_probs=120.9 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 19999999999999575787999999999987410012446665566327521001355553210000011258876422 Q gi|254780592|r 51 QLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRA 130 (271) Q Consensus 51 ~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 130 (271) ++..|..+.+.++..|.-|+.++.+.|.+-...|++++|.+.+++++..+....++++.+|..+..+|+.++|+..|-+. T Consensus 471 ~~~~aqqyad~aln~dryn~~a~~nkgn~~f~ngd~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~kl 550 (840) T KOG2003 471 DFADAQQYADIALNIDRYNAAALTNKGNIAFANGDLDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKL 550 (840) T ss_pred CHHHHHHHHHHHHCCCCCCHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHCCHHHHHHHHHHH T ss_conf 06677877787742022477775067754441586899999999997174288999997112398744789999999999 Q ss_pred HHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 11000023356699999997421111100000000000000012457776543322355435999999953898547888 Q gi|254780592|r 131 QRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQ 210 (271) Q Consensus 131 ~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~ 210 (271) ...-.++.+.+..++.+|....+..+|++.+.++..+-|+++.++..+|.+|-+.|+..+|..++-......|.+.+... T Consensus 551 h~il~nn~evl~qianiye~led~aqaie~~~q~~slip~dp~ilskl~dlydqegdksqafq~~ydsyryfp~nie~ie 630 (840) T KOG2003 551 HAILLNNAEVLVQIANIYELLEDPAQAIELLMQANSLIPNDPAILSKLADLYDQEGDKSQAFQCHYDSYRYFPCNIETIE 630 (840) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHH T ss_conf 99997029999999999998548799999999850158898899999998861036444545664301023676237999 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHH-HHHHHHHHHHHHHHHHHH Q ss_conf 9999999869989999999986291728665999-999998321299999999 Q gi|254780592|r 211 NLALVVGLQGRMKEAYSIASQELSPEEATRNIKY-IKSILSQRDPWKKIAKAR 262 (271) Q Consensus 211 ~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~-l~~~l~~~~~~~~~a~a~ 262 (271) -||.-|....=++.|+.+++++--..|....... +..-+++.++|++.-.-+ T Consensus 631 wl~ayyidtqf~ekai~y~ekaaliqp~~~kwqlmiasc~rrsgnyqka~d~y 683 (840) T KOG2003 631 WLAAYYIDTQFSEKAINYFEKAALIQPNQSKWQLMIASCFRRSGNYQKAFDLY 683 (840) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99988876477899999998887348537899999999987536488899999 No 38 >PRK11788 hypothetical protein; Provisional Probab=99.63 E-value=2e-13 Score=99.91 Aligned_cols=209 Identities=10% Similarity=-0.008 Sum_probs=124.0 Q ss_pred HHCCHHHHHHHHHHHHHHCCCC----HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHH Q ss_conf 4201999999999999957578----799999999998741001244666556632752100135555321000001125 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKN----KMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEG 123 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~----~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A 123 (271) +.|.++.|+..=+..+.+..-+ ..+++.+|.-|.+.|-+|+|...|.++++..+....++..+-.+|...++|++| T Consensus 81 rrGE~dRAIriHq~Ll~rp~L~~~~r~~a~~ELa~Dy~~aGllDRAE~~f~~l~~~~~~~~~al~~Ll~iye~~~dW~~A 160 (389) T PRK11788 81 RRGEVDRAIRIHQNLLSRPDLTREQRLLALQELGQDYLKAGLLDRAEELFLQLVDEDDFAEGALRQLLTIYQQEKDWQKA 160 (389) T ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 65608899999999972879899999999999999998677499999999999828542699999999999986459999 Q ss_pred HHHHHHHHHHHC-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 887642211000-----023356699999997421111100000000000000012457776543322355435999999 Q gi|254780592|r 124 LDAINRAQRPDI-----PDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFA 198 (271) Q Consensus 124 ~~~~~~a~~~~p-----~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 198 (271) .+..++...... ..+-.+..++.-....|+.+.|...+.+++..+|+.+.++..+|.++...|+++.|+..++++ T Consensus 161 i~~a~~L~~~~~~~~~~~iAhy~CElA~~~~~~~~~~~A~~~L~kAL~~dp~cvRA~lllg~l~~~~g~~~~Ai~~l~~i 240 (389) T PRK11788 161 IEVAERLEKLGGDSLRKEIAHFYCELAQQALARGDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERV 240 (389) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999998704765069999999999999987489999999999998748243418999999999879999999999999 Q ss_pred HHCCCCCH-HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 95389854-78889999999869989999999986291728665999999998321299 Q gi|254780592|r 199 SQMIGADS-RIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDPWK 256 (271) Q Consensus 199 l~~~p~~~-~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~~ 256 (271) .+.+|... .+...|-.+|...|+.++....+++.+...|.......+-..+.+.++.. T Consensus 241 ~~q~p~~l~~vl~~L~~~y~~l~~~~~~~~~L~~~~~~~~~~~l~l~l~~~l~~~~g~~ 299 (389) T PRK11788 241 EEQDPEYLPEVLPKLMECYQALGDEAEGLEFLRRAVEEYPGADLLLALAQLLEEREGPE 299 (389) T ss_pred HHHCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCHH T ss_conf 97395779999999999999828888999999999985899359999999999769999 No 39 >KOG0495 consensus Probab=99.63 E-value=3.9e-13 Score=98.15 Aligned_cols=214 Identities=14% Similarity=0.085 Sum_probs=173.7 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC Q ss_conf 79999998420199999999999995757879999999999874100124466655663275210013555532100000 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~ 119 (271) +.-+...-+.|..++-...+++++..-|+.+..|+.++..+...|+..+|..++.++.+.+|++.+.|..-..+.....+ T Consensus 554 lra~~~ek~hgt~Esl~Allqkav~~~pkae~lwlM~ake~w~agdv~~ar~il~~af~~~pnseeiwlaavKle~en~e 633 (913) T KOG0495 554 LRAAMFEKSHGTRESLEALLQKAVEQCPKAEILWLMYAKEKWKAGDVPAARVILDQAFEANPNSEEIWLAAVKLEFENDE 633 (913) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC T ss_conf 99999988318699999999999973886226899888888751784889999999998389818889898887522464 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 11258876422110000233566999999974211111000000000000000124577765433223554359999999 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFAS 199 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al 199 (271) ++.|...+.++....| ....|.-.+.+...+++.++|+..++.+++..|.....|.++|+++..+++.+.|...|..-+ T Consensus 634 ~eraR~llakar~~sg-TeRv~mKs~~~er~ld~~eeA~rllEe~lk~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~ 712 (913) T KOG0495 634 LERARDLLAKARSISG-TERVWMKSANLERYLDNVEEALRLLEEALKSFPDFHKLWLMLGQIEEQMENIEMAREAYLQGT 712 (913) T ss_pred HHHHHHHHHHHHCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 8999999999750288-512678776899986407999999999988487558999998689999877999999997025 Q ss_pred HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 5389854788899999998699899999999862917286659999999983212 Q gi|254780592|r 200 QMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDP 254 (271) Q Consensus 200 ~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~ 254 (271) ..-|.....|..|+.+-...|+...|..+++++.-.+|.+..+..-..-.+.+-| T Consensus 713 k~cP~~ipLWllLakleEk~~~~~rAR~ildrarlkNPk~~~lwle~Ir~ElR~G 767 (913) T KOG0495 713 KKCPNSIPLWLLLAKLEEKDGQLVRARSILDRARLKNPKNALLWLESIRMELRAG 767 (913) T ss_pred CCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCC T ss_conf 6488981299999778887265266999988887349986264999999999748 No 40 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=99.61 E-value=4.9e-13 Score=97.54 Aligned_cols=217 Identities=13% Similarity=0.086 Sum_probs=161.7 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 999842019999999999999575787-9999999999874100124466655663275210013555532100000112 Q gi|254780592|r 44 LENMNHEQLLEVTSTIGLQYQSHTKNK-MIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 44 ~~~~~~~~~~~A~~~~~~al~~~P~~~-~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) ......|+++....++.++.+..|++. .+.+..++.....|++++|.+.++++.+..|.++.++..+..+|...++|+. T Consensus 126 rAA~~~g~~~~rd~yL~~A~e~~p~a~lav~lt~Ael~l~~~q~e~Ala~L~~L~~~~p~~~~VL~Ll~~~y~~~~dW~a 205 (398) T PRK10747 126 EAAQQRGDEARANQHLERAAELAGNDTIPVEITRVRLQLARNENHAARHGVDKLLEVTPRHPEVLRLAEQAYIRTGAWSS 205 (398) T ss_pred HHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99998889899999999988858984889999999999977998999999999986199999999999999998745999 Q ss_pred HHHHHHHHHHH------------------------------------------HCHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 58876422110------------------------------------------000233566999999974211111000 Q gi|254780592|r 123 GLDAINRAQRP------------------------------------------DIPDWQLISAKGSVLAQMGKHSEALIE 160 (271) Q Consensus 123 A~~~~~~a~~~------------------------------------------~p~~~~~~~~lg~~~~~~~~~~~A~~~ 160 (271) ....+....+. ...++.....++..+...|++++|... T Consensus 206 L~~LLp~L~K~~~l~~~e~~~L~~~a~~~ll~~~~~~~~~~~L~~~W~~lp~~~r~~~~l~~~ya~~Li~~g~~~~A~~l 285 (398) T PRK10747 206 LLDIIPSMAKAHVGDEEHRAMLEQQAWIGLMDQARADNGSEGLRNWWKNQSRKTRHQVALQVAMAEHLIECDDHDTAQQI 285 (398) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 99999999875799999999999999999999987323479999999868998756999999999999868996899999 Q ss_pred HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf 00000000000012457776543322355435999999953898547888999999986998999999998629172866 Q gi|254780592|r 161 YERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 161 ~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) ..+.++..+++.-+ ..+|.+ ..++....+...++-++..|+++..+..||.++...+.+.+|..+++++++.+|... T Consensus 286 l~~~Lkk~wd~~Lv-~ly~~l--~~~d~~~~l~~~e~wlk~~~~~~~Ll~aLGrL~~~~~lwgkA~~yle~sl~l~p~~~ 362 (398) T PRK10747 286 IIDGLKRQYDDRLV-LPIPRL--KTNNPEQLEKVLRQQIKQVGDRPLLWSTLGQSLMKHGEWQEASLAFRAALKQRPDAY 362 (398) T ss_pred HHHHHHHCCCHHHH-HHHHHC--CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHH T ss_conf 99998618998999-998756--779989999999998731899989999999999973509999999999986599999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 59999999983212999999997 Q gi|254780592|r 241 NIKYIKSILSQRDPWKKIAKARS 263 (271) Q Consensus 241 ~~~~l~~~l~~~~~~~~~a~a~~ 263 (271) ....+..++.+.+.-.+.++.+. T Consensus 363 ~~~~La~l~e~lg~~~~A~~~yr 385 (398) T PRK10747 363 DYAWLADALDRLHKPEEAAAMRR 385 (398) T ss_pred HHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99999999998599799999999 No 41 >KOG1174 consensus Probab=99.61 E-value=3.3e-14 Score=104.74 Aligned_cols=170 Identities=12% Similarity=0.028 Sum_probs=80.5 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 12446665566327521001355553210000011258876422110000233566999999974211111000000000 Q gi|254780592|r 87 AQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALE 166 (271) Q Consensus 87 ~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~ 166 (271) ..|+.+.+++++.+|.+..++...|..+..+|+.++|...|+.+....|.+.+.|..+-.+|...|++.+|....++.+. T Consensus 317 ~rAL~~~eK~I~~~~r~~~alilKG~lL~~~~R~~~A~IaFR~Aq~Lap~rL~~Y~GL~hsYLA~~~~kEA~~~An~~~~ 396 (564) T KOG1174 317 ERALNFVEKCIDSEPRNHEALILKGRLLIALERHTQAVIAFRTAQMLAPYRLEIYRGLFHSYLAQKRFKEANALANWTIR 396 (564) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHCCHHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99999998876158553367886139988545628899999998742514578998899999961408999999999999 Q ss_pred CCCHHHHHHHHHH-HHH-HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHH Q ss_conf 0000001245777-654-33223554359999999538985478889999999869989999999986291728665999 Q gi|254780592|r 167 LSPNESSIVSNIA-MSY-LLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKY 244 (271) Q Consensus 167 ~~p~~~~~~~~~~-~~~-~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~ 244 (271) ..|.+...+...| ... ....--+.|..++++++.+.|....+...+|......|++.++++++++.|...+....-.. T Consensus 397 ~~~~sA~~LtL~g~~V~~~dp~~rEKAKkf~ek~L~~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~~D~~LH~~ 476 (564) T KOG1174 397 LFQNSARSLTLFGTLVLFPDPRMREKAKKFAEKSLKINPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLIIFPDVNLHNH 476 (564) T ss_pred HHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHH T ss_conf 76531446766300011568506799999998420358860899999999998627654189999998742562089999 Q ss_pred HHHHHHHHHHHH Q ss_conf 999998321299 Q gi|254780592|r 245 IKSILSQRDPWK 256 (271) Q Consensus 245 l~~~l~~~~~~~ 256 (271) +.-.+.-.|-++ T Consensus 477 Lgd~~~A~Ne~Q 488 (564) T KOG1174 477 LGDIMRAQNEPQ 488 (564) T ss_pred HHHHHHHHHHHH T ss_conf 999999861079 No 42 >KOG4162 consensus Probab=99.61 E-value=1.1e-13 Score=101.59 Aligned_cols=198 Identities=20% Similarity=0.143 Sum_probs=139.2 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH---------------- Q ss_conf 99998420199999999999995-7578799999999998741001244666556632752100---------------- Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQS-HTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE---------------- 105 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~-~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~---------------- 105 (271) +.++..+++.++|....+.+++. ...++.+|..+|.++...+++.+|+.+.+-+++-.|+|-. T Consensus 485 alq~A~~R~l~sAl~~~~eaL~l~~~~~~~~whLLALvlSa~kr~~~Al~vvd~al~E~~~N~~l~~~~~~i~~~~~~~e 564 (799) T KOG4162 485 ALQYAEQRQLTSALDYAREALALNRGDSAKAWHLLALVLSAQKRLKEALDVVDAALEEFGDNHVLMDGKIHIELTFNDRE 564 (799) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCHH T ss_conf 99999997699999999999871577648899999999962241677899999999974122200101110122116588 Q ss_pred -------------------------------------------------------------------------------- Q ss_conf -------------------------------------------------------------------------------- Q gi|254780592|r 106 -------------------------------------------------------------------------------- 105 (271) Q Consensus 106 -------------------------------------------------------------------------------- 105 (271) T Consensus 565 ~~l~t~~~~L~~we~~~~~q~~~~~g~~~~lk~~l~la~~q~~~a~s~sr~ls~l~a~~~~~~~se~~Lp~s~~~~~~~~ 644 (799) T KOG4162 565 EALDTCIHKLALWEAEYGVQQTLDEGKLLRLKAGLHLALSQPTDAISTSRYLSSLVASQLKSAGSELKLPSSTVLPGPDS 644 (799) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCC T ss_conf 88888999999987311676654134566541211437656545303468988999865542156101674101578873 Q ss_pred -------HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf -------1355553210000011258876422110000233566999999974211111000000000000000124577 Q gi|254780592|r 106 -------VLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNI 178 (271) Q Consensus 106 -------~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~ 178 (271) .|...+..+...++.+++...+.++...+|..+..|+..|.++...|..++|.+.|..++.++|+++.....+ T Consensus 645 ~~~~~~~lwllaa~~~~~~~~~~~a~~CL~Ea~~~~~l~~~~~~~~G~~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Al 724 (799) T KOG4162 645 LWYLLQKLWLLAADLFLLSGNDDEARSCLLEASKIDPLSASVYYLRGLLLEVKGQLEEAKEAFLVALALDPDHVPSMTAL 724 (799) T ss_pred HHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 27899999999999988538832889999998750313389999864899987746899999999873288874778999 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf 765433223554359--99999953898547888999999986998999999998629172866 Q gi|254780592|r 179 AMSYLLMGDLKTAEE--KLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 179 ~~~~~~~g~~~~A~~--~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) |.++...|+...|.. .+..+++.+|.+.++|+.||.++..+|+.++|..+|..++..++.++ T Consensus 725 a~~lle~G~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k~~Gd~~~Aaecf~aa~qLe~S~P 788 (799) T KOG4162 725 AELLLELGSPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFKKLGDSKQAAECFQAALQLEESNP 788 (799) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC T ss_conf 9999971884207789999999860887779999999999974666789999999986444798 No 43 >PRK12370 invasion protein regulator; Provisional Probab=99.61 E-value=2.3e-13 Score=99.53 Aligned_cols=189 Identities=12% Similarity=0.014 Sum_probs=169.5 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHC---------CHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH Q ss_conf 842019999999999999575787999999999987410---------01244666556632752100135555321000 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGR---------TAQALAVMRQVAILYPEDQEVLAAYGKSLANA 117 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~---------~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~ 117 (271) .+.++...|...|.+++..+|+...++-.++.+|.+.+. ...|....+++++++|.++.++..+|.+..-. T Consensus 272 ft~edn~~A~~LF~~Av~~dP~fA~AyA~la~cy~q~a~l~~~d~~~~~~~A~~~AeravEldp~dp~a~~~lg~~~~L~ 351 (553) T PRK12370 272 YTPYSLQQALKLLTQCVNMSPNSIAPYCALAECYLSMAQMGIFDKQNAMIKAKEHAIKATELDHNNPQALGLLGLINTIH 351 (553) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 18877999999999987339774466787899999763403688288999999999998743878878889999998723 Q ss_pred CCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00112588764221100002335669999999742111110000000000000001245777654332235543599999 Q gi|254780592|r 118 GYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRF 197 (271) Q Consensus 118 g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~ 197 (271) |+++.+..+|+++...+|+.++.++..|..+...|+..+|.+..+.+++++|..+..+...+.+.......++|+....+ T Consensus 352 ~d~~~a~~~f~rA~~L~Pnsa~~~Y~~~W~~~maGr~~ea~~~i~~a~~LdP~~~~~~~~~l~~~~~~~~~d~a~~l~~~ 431 (553) T PRK12370 352 SEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAGITKLWITYYHTGIDDAIRLGDE 431 (553) T ss_pred CCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 55224479998764259984257999999999758689999999998835986520279999999984231079999998 Q ss_pred HHHCCCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 99538985-4788899999998699899999999862917 Q gi|254780592|r 198 ASQMIGAD-SRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 198 al~~~p~~-~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) .....|.+ +-+..-++.++..+|+.+.|.+...+ ++++ T Consensus 432 ~~~q~~~~~~~~~s~~~m~~~~~g~~~~a~~~~~~-~~~~ 470 (553) T PRK12370 432 LRSQHLQDNPILLSMQVMFLSLKGKHELARKLTKE-ISTQ 470 (553) T ss_pred HHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH-CCHH T ss_conf 62507787836999999999733838999999985-1803 No 44 >PRK11189 lipoprotein NlpI; Provisional Probab=99.61 E-value=2.2e-14 Score=105.87 Aligned_cols=177 Identities=15% Similarity=0.007 Sum_probs=86.7 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 99999999995757879999999999874100124466655663275210013555532100000112588764221100 Q gi|254780592|r 55 VTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPD 134 (271) Q Consensus 55 A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~ 134 (271) |.-.|.+++..+|+-++++..+|.-+.+.|+++.|.+.|+.+++++|++.-++.++|..+.--|+++.|...+.+....+ T Consensus 84 AR~DF~qAL~l~P~~adayN~LGiy~tq~~~fD~AyEAFDs~LEL~P~y~YA~lNRGIAlYYg~R~~LA~~D~~~fy~~d 163 (297) T PRK11189 84 ARNDFSQALALRPDMADAYNYLGIYLTQEGEFDAAYEAFDSVLELDPTYDYAYLNRGIALYYGGRYDLAQDDLLAFYQDD 163 (297) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC T ss_conf 98649989753998589999999999997119999999877772698734899830278874463788999999999609 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-------H Q ss_conf 002335669999999742111110000000000-00000124577765433223554359999999538985-------4 Q gi|254780592|r 135 IPDWQLISAKGSVLAQMGKHSEALIEYERALEL-SPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD-------S 206 (271) Q Consensus 135 p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~-~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~-------~ 206 (271) |+++---.++-.+-. .-+..+|...+..-.+. +.+.+ -+++...| .|+..+ ...++++..-..++ . T Consensus 164 p~DpYR~LWLYl~e~-~~d~~~A~~~L~~r~~~~~~~~W--~~~iV~~y--lg~ise-~~~~~~~~~~~~~n~~lAErLc 237 (297) T PRK11189 164 PNDPYRALWLYLIEQ-KLDPKQAKENLKQRYEKLDKEQW--GWNIVDFY--LGKISE-EQLFERLKAGATDNTELAERLC 237 (297) T ss_pred CCCCHHHHHHHHHHC-CCCHHHHHHHHHHHHHHCCCHHH--HHHHHHHH--HCCCCH-HHHHHHHHHHCCCCHHHHHHHH T ss_conf 998279999999981-16999999999999983571226--89999999--458999-9999999872737099999999 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 7888999999986998999999998629172 Q gi|254780592|r 207 RIRQNLALVVGLQGRMKEAYSIASQELSPEE 237 (271) Q Consensus 207 ~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~ 237 (271) ++++.||..+..+|+.++|..+|+-+++.+- T Consensus 238 EaYFYLaK~~~~~G~~~~A~~~fKLAla~NV 268 (297) T PRK11189 238 ETYFYLAKYYLSLGDLDEAANLFKLALANNV 268 (297) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 9999999999987998999999999870482 No 45 >KOG0550 consensus Probab=99.61 E-value=2.3e-14 Score=105.68 Aligned_cols=194 Identities=18% Similarity=0.138 Sum_probs=136.9 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 99842019999999999999575787999999999987410012446665566327521001355553210000011258 Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) +.+.+..|.+|+..+..|++..|++...|.+.+.+++..|++++|.-..++.+.++|.....+...+.++..+++..+|. T Consensus 58 ~~yk~k~Y~nal~~yt~Ai~~~pd~a~yy~nRAa~~m~~~~~~~a~~dar~~~r~kd~~~k~~~r~~~c~~a~~~~i~A~ 137 (486) T KOG0550 58 AFYKQKTYGNALKNYTFAIDMCPDNASYYSNRAATLMMLGRFEEALGDARQSVRLKDGFSKGQLREGQCHLALSDLIEAE 137 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHCHHHHHHHHHHHHHHCCCCHHHHEECCCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 17778668889988899998584003654416889998776752421166530057874101245224455567889899 Q ss_pred HHH----------------------------------------------------HHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 876----------------------------------------------------4221100002335669999999742 Q gi|254780592|r 125 DAI----------------------------------------------------NRAQRPDIPDWQLISAKGSVLAQMG 152 (271) Q Consensus 125 ~~~----------------------------------------------------~~a~~~~p~~~~~~~~lg~~~~~~~ 152 (271) +.+ -..+..++.+.+.+..+|.++...+ T Consensus 138 ~~~~~~~~~~~anal~~~~~~~~s~s~~pac~~a~~lka~cl~~~~~~~~a~~ea~~ilkld~~n~~al~vrg~~~yy~~ 217 (486) T KOG0550 138 EKLKSKQAYKAANALPTLEKLAPSHSREPACFKAKLLKAECLAFLGDYDEAQSEAIDILKLDATNAEALYVRGLCLYYND 217 (486) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCC T ss_conf 88621016677653122200242346773256788741056543034316778888887516301678886153310232 Q ss_pred HHHHHHHHHHHCCCCCCHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH----HHHHHHHHHH Q ss_conf 111110000000000000001------------245777654332235543599999995389854----7888999999 Q gi|254780592|r 153 KHSEALIEYERALELSPNESS------------IVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS----RIRQNLALVV 216 (271) Q Consensus 153 ~~~~A~~~~~~~~~~~p~~~~------------~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~----~~~~~La~~~ 216 (271) +.+.++..|.+++.+.|+... .+..-|.-..+.|++..|.++|..++.++|++. ..|.+.|.+. T Consensus 218 ~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le~~k~~gN~~fk~G~y~~A~E~Yteal~idP~n~~~naklY~nra~v~ 297 (486) T KOG0550 218 NADKAINHFQQALRLDPDHQKSKSASMMPKKLEVKKERGNDAFKNGNYRKAYECYTEALNIDPSNKKTNAKLYGNRALVN 297 (486) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH T ss_conf 06888888765202680256677676407899988750566763442467899887763379322224689998767664 Q ss_pred HHCCCHHHHHHHHHHHCCCCCC Q ss_conf 9869989999999986291728 Q gi|254780592|r 217 GLQGRMKEAYSIASQELSPEEA 238 (271) Q Consensus 217 ~~~g~~~eA~~~~~~~L~~~~~ 238 (271) ..+|+..||+...+.+++.++. T Consensus 298 ~rLgrl~eaisdc~~Al~iD~s 319 (486) T KOG0550 298 IRLGRLREAISDCNEALKIDSS 319 (486) T ss_pred CCCCCCHHHHHHHHHHHHCCHH T ss_conf 2067716566544666634798 No 46 >KOG0548 consensus Probab=99.60 E-value=6.8e-14 Score=102.80 Aligned_cols=205 Identities=13% Similarity=0.125 Sum_probs=174.6 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH-------HHHHHHHHHHH Q ss_conf 99984201999999999999957578799999999998741001244666556632752100-------13555532100 Q gi|254780592|r 44 LENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE-------VLAAYGKSLAN 116 (271) Q Consensus 44 ~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~-------~~~~~~~~~~~ 116 (271) .......++..|+..+.++++.+ .+...+.+.+-+|...|++.+.+....++++..-+... ....+|..+.. T Consensus 232 naaykkk~f~~a~q~y~~a~el~-~~it~~~n~aA~~~e~~~~~~c~~~c~~a~E~gre~rad~klIak~~~r~g~a~~k 310 (539) T KOG0548 232 NAAYKKKDFETAIQHYAKALELA-TDITYLNNIAAVYLERGKYAECIELCEKAVEVGRELRADYKLIAKALARLGNAYTK 310 (539) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99887454899999999998675-66678888988997313798744220889987578999999999999986311355 Q ss_pred HCCHHHHHHHHHHHHH--------------------------HHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 0001125887642211--------------------------00002335669999999742111110000000000000 Q gi|254780592|r 117 AGYLDEGLDAINRAQR--------------------------PDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN 170 (271) Q Consensus 117 ~g~~~~A~~~~~~a~~--------------------------~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~ 170 (271) .++++.+...|.+++. .+|.....-..-|+.+.+.|++..|+..|.+++..+|+ T Consensus 311 ~~~~~~ai~~~~kaLte~Rt~~~ls~lk~~Ek~~k~~e~~a~~~pe~A~e~r~kGne~Fk~gdy~~Av~~YteAIkr~P~ 390 (539) T KOG0548 311 REDYEGAIKYYQKALTEHRTPDLLSKLKEAEKALKEAERKAYINPEKAEEEREKGNEAFKKGDYPEAVKHYTEAIKRDPE 390 (539) T ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 77678899999998663258887877789999999988887318467899998778877535889999999999842984 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH--HHHHH Q ss_conf 0012457776543322355435999999953898547888999999986998999999998629172866599--99999 Q gi|254780592|r 171 ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK--YIKSI 248 (271) Q Consensus 171 ~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~--~l~~~ 248 (271) +...+.|.+.+|...|.+..|+...+++++++|+....|.+-|.++..+.+|+.|...|+.++..+|.+..+. +-+++ T Consensus 391 Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~~~kgy~RKg~al~~mk~ydkAleay~eale~dp~~~e~~~~~~rc~ 470 (539) T KOG0548 391 DARLYSNRAACYLKLGEYPEALKDAKKCIELDPNFIKAYLRKGAALRAMKEYDKALEAYQEALELDPSNAEAIDGYRRCV 470 (539) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 15788889999998731798998899987509318999999999999999899999999998734915589999999999 Q ss_pred H Q ss_conf 9 Q gi|254780592|r 249 L 249 (271) Q Consensus 249 l 249 (271) - T Consensus 471 ~ 471 (539) T KOG0548 471 E 471 (539) T ss_pred H T ss_conf 9 No 47 >KOG1174 consensus Probab=99.60 E-value=1.5e-13 Score=100.78 Aligned_cols=179 Identities=16% Similarity=0.081 Sum_probs=90.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC Q ss_conf 99999999957578799999999998741001244666556632752100135555321000001125887642211000 Q gi|254780592|r 56 TSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDI 135 (271) Q Consensus 56 ~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p 135 (271) ...+-.....-|+|...+..+|.++...|++++|+-.|+++.-++|+....--.+|..+...|+++.-..........+. T Consensus 218 t~l~le~~~~lr~NvhLl~~lak~~~~~Gdn~~a~~~Fe~~~~~dpy~i~~MD~Ya~LL~~eg~~e~~~~L~~~Lf~~~~ 297 (564) T KOG1174 218 TFLMLHDNTTLRCNEHLMMALGKCLYYNGDYFQAEDIFSSTLCANPDNVEAMDLYAVLLGQEGGCEQDSALMDYLFAKVK 297 (564) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCC T ss_conf 88878765406760889988755653416756899999887507815555689999999860487678999999985220 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 02335669999999742111110000000000000001245777654332235543599999995389854788899999 Q gi|254780592|r 136 PDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALV 215 (271) Q Consensus 136 ~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~ 215 (271) ....-|+.-+..++..++++.|...-++++..+|.+..++...|.++...++.++|+-.|+.|..+.|.+-++|..|.-. T Consensus 298 ~ta~~wfV~~~~l~~~K~~~rAL~~~eK~I~~~~r~~~alilKG~lL~~~~R~~~A~IaFR~Aq~Lap~rL~~Y~GL~hs 377 (564) T KOG1174 298 YTASHWFVHAQLLYDEKKFERALNFVEKCIDSEPRNHEALILKGRLLIALERHTQAVIAFRTAQMLAPYRLEIYRGLFHS 377 (564) T ss_pred CCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCHHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 23240234310145776689999999887615855336788613998854562889999999874251457899889999 Q ss_pred HHHCCCHHHHHHHHHHHCC Q ss_conf 9986998999999998629 Q gi|254780592|r 216 VGLQGRMKEAYSIASQELS 234 (271) Q Consensus 216 ~~~~g~~~eA~~~~~~~L~ 234 (271) |..+|++.||....+.++. T Consensus 378 YLA~~~~kEA~~~An~~~~ 396 (564) T KOG1174 378 YLAQKRFKEANALANWTIR 396 (564) T ss_pred HHHHCCHHHHHHHHHHHHH T ss_conf 9961408999999999999 No 48 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=99.56 E-value=3.9e-14 Score=104.26 Aligned_cols=150 Identities=15% Similarity=0.060 Sum_probs=122.9 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHH--HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC---CHHHHHHHHHHHHH Q ss_conf 99998420199999999999995757879--99999999987410012446665566327521---00135555321000 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKM--IGIVYADVLRRVGRTAQALAVMRQVAILYPED---QEVLAAYGKSLANA 117 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~--~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~---~~~~~~~~~~~~~~ 117 (271) ..++.-...|.-+.+.+.+++..+|+..+ ++++.|.++...|++++|++.+++++++.|+. ..++..+|.++... T Consensus 6 ~ndnfidk~ftv~ad~~~k~~p~~~~~k~A~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le~dp~D~s~ilyNiGlIY~~~ 85 (172) T PRK02603 6 RNDNFIDKSFTVMADLILKILPINKKAKEAFVYYRDGMSAQNDGDYAEALENYEEALKLEEDPNDRSYILYNMAIIYASN 85 (172) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHC T ss_conf 46411442489999999998158863368999997225777457999999999998727789620899997799999986 Q ss_pred CCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00112588764221100002335669999999742111110000000000000001245777654332235543599999 Q gi|254780592|r 118 GYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRF 197 (271) Q Consensus 118 g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~ 197 (271) |++++|++.|.+++..+|+.+.+++++|.+|..+|+..... |..-.....+++|..++.+ T Consensus 86 G~~~kAie~Y~kALeiNP~~~~AlNNlgvIY~k~G~~a~~~--------------------~~~~~~~~~fdkAa~yw~k 145 (172) T PRK02603 86 GEHDKALETYHQALELNPKQPSALNNIAVIYHKRGRKAEEA--------------------GDQDEAEAWFDKAAEYWKQ 145 (172) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHC--------------------CCHHHHHHHHHHHHHHHHH T ss_conf 99889999999999868997899999999999985999984--------------------5877899999999999999 Q ss_pred HHHCCCCCHHHHHHH Q ss_conf 995389854788899 Q gi|254780592|r 198 ASQMIGADSRIRQNL 212 (271) Q Consensus 198 al~~~p~~~~~~~~L 212 (271) |+++.|++--...++ T Consensus 146 AI~l~P~nyi~aqn~ 160 (172) T PRK02603 146 AIRLAPNNYIEAQNW 160 (172) T ss_pred HHHHCCCCHHHHHHH T ss_conf 885099870999999 No 49 >KOG2076 consensus Probab=99.56 E-value=3.1e-13 Score=98.75 Aligned_cols=205 Identities=17% Similarity=0.144 Sum_probs=147.9 Q ss_pred CCCCCCCCHHHHHHH--HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH--- Q ss_conf 368543311799999--984201999999999999957578799999999998741001244666556632752100--- Q gi|254780592|r 31 HLNPRISIPDPESLE--NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE--- 105 (271) Q Consensus 31 ~~~~~~~~~~~~~~~--~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~--- 105 (271) +.+|..- ..+.... ..+.|.+.+|.-+|.+|++.+|.|-......+..|.+.|+...|...+.++++..|.... T Consensus 201 HL~p~d~-e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~~ers~L~~~~G~~~~Am~~f~~l~~~~p~~d~er~ 279 (895) T KOG2076 201 HLNPKDY-ELWKRLADLSEQLGNINQARYCYSRAIQANPSNWELIYERSSLYQKTGDLKRAMETFLQLLQLDPPVDIERI 279 (895) T ss_pred HCCCCCH-HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 6699886-899999999986012999999999998608862289999999999847078889999999862890158999 Q ss_pred ---HH-----------------------------------HHHHHHHHHHCCHHHHHHHHHHHH---------------- Q ss_conf ---13-----------------------------------555532100000112588764221---------------- Q gi|254780592|r 106 ---VL-----------------------------------AAYGKSLANAGYLDEGLDAINRAQ---------------- 131 (271) Q Consensus 106 ---~~-----------------------------------~~~~~~~~~~g~~~~A~~~~~~a~---------------- 131 (271) ++ ..++.++.....++.+......-. T Consensus 280 ~d~i~~~~~~~~~~~~~e~a~~~le~~~s~~~~~~~~ed~ni~ael~l~~~q~d~~~~~i~~~~~r~~e~d~~e~~~~~~ 359 (895) T KOG2076 280 EDLIRRVAHYFITHNERERAAKALEGALSKEKDEASLEDLNILAELFLKNKQSDKALMKIVDDRNRESEKDDSEWDTDER 359 (895) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 99999999999971078999999997774103403562789999999986788876699999860136787165400220 Q ss_pred ---------------------------------------------HH---HCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf ---------------------------------------------10---000233566999999974211111000000 Q gi|254780592|r 132 ---------------------------------------------RP---DIPDWQLISAKGSVLAQMGKHSEALIEYER 163 (271) Q Consensus 132 ---------------------------------------------~~---~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~ 163 (271) .. -...++.+..++..+...|++..|+..+.. T Consensus 360 ~~~~~~~~~~~~~~~s~~l~v~rl~icL~~L~~~e~~e~ll~~l~~~n~~~~d~~dL~~d~a~al~~~~~~~~Al~~l~~ 439 (895) T KOG2076 360 RREEPNALCEVGKELSYDLRVIRLMICLVHLKERELLEALLHFLVEDNVWVSDDVDLYLDLADALTNIGKYKEALRLLSP 439 (895) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 34555320147877775630576754320024434588898888883378455578999999999836558889999999 Q ss_pred CCCCCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 00000000-01245777654332235543599999995389854788899999998699899999999862917 Q gi|254780592|r 164 ALELSPNE-SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 164 ~~~~~p~~-~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) .....+.. ..+|..+|.+|...|.+++|+.+|++++...|++.+++..|+.++..+|++++|.+.+.+...++ T Consensus 440 i~~~~~~~~~~vw~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~~D~Ri~Lasl~~~~g~~EkalEtL~~~~~~D 513 (895) T KOG2076 440 ITNREGYQNAFVWYKLARCYMELGEYEEAIEFYEKVLILAPDNLDARITLASLYQQLGNHEKALETLEQIINPD 513 (895) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCC T ss_conf 86483335513469899999987428999999999995289860166319999976388889999871345899 No 50 >KOG0548 consensus Probab=99.55 E-value=3.1e-13 Score=98.75 Aligned_cols=177 Identities=18% Similarity=0.190 Sum_probs=121.9 Q ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 01999999999999957578799999999998741001244666556632752100135555321000001125887642 Q gi|254780592|r 50 EQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINR 129 (271) Q Consensus 50 ~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~ 129 (271) +++..|+.+|.+++...-. ..++.+....++++...+...-.+|+-..-...-|..+...|+|..|+..|.+ T Consensus 312 ~~~~~ai~~~~kaLte~Rt--------~~~ls~lk~~Ek~~k~~e~~a~~~pe~A~e~r~kGne~Fk~gdy~~Av~~Yte 383 (539) T KOG0548 312 EDYEGAIKYYQKALTEHRT--------PDLLSKLKEAEKALKEAERKAYINPEKAEEEREKGNEAFKKGDYPEAVKHYTE 383 (539) T ss_pred HHHHHHHHHHHHHHHHHCC--------HHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 7678899999998663258--------88787778999999998888731846789999877887753588999999999 Q ss_pred HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 21100002335669999999742111110000000000000001245777654332235543599999995389854788 Q gi|254780592|r 130 AQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIR 209 (271) Q Consensus 130 a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~ 209 (271) ++..+|+++..+.+++.||..++.+..|+......++++|+....+...|.++....+|+.|...|+++++.+|.+.++. T Consensus 384 AIkr~P~Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~~~kgy~RKg~al~~mk~ydkAleay~eale~dp~~~e~~ 463 (539) T KOG0548 384 AIKRDPEDARLYSNRAACYLKLGEYPEALKDAKKCIELDPNFIKAYLRKGAALRAMKEYDKALEAYQEALELDPSNAEAI 463 (539) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 98429841578888999999873179899889998750931899999999999999989999999999873491558999 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 8999999986998999999998629 Q gi|254780592|r 210 QNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 210 ~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) ..+..+...+......+++.++++. T Consensus 464 ~~~~rc~~a~~~~~~~ee~~~r~~~ 488 (539) T KOG0548 464 DGYRRCVEAQRGDETPEETKRRAMA 488 (539) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHCC T ss_conf 9999999986357988999876306 No 51 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=99.55 E-value=1.5e-12 Score=94.57 Aligned_cols=100 Identities=18% Similarity=0.081 Sum_probs=63.1 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHH Q ss_conf 99984201999999999999957578799999999998741001244666556632752100135555321000001125 Q gi|254780592|r 44 LENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEG 123 (271) Q Consensus 44 ~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A 123 (271) ...-+.+++..|++.+++++.++|+|++.++.++.++...|++++|+..+++.....|+... +..++.++...|+.-.+ T Consensus 107 ka~Rn~kq~~~Al~ly~~~~~~~P~n~d~~lGlalt~Ada~~~~~Al~~~~~~~~~~p~~~~-~~~~aY~~~~~~~~~~a 185 (818) T PRK10049 107 RAYRNLKQWQNSLTLWQKALSLEPQNKDYQSGQILTLADAGHYDEALVKAKQLVAGAPDVAN-LLALAYIYRLAGRHYDA 185 (818) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH-HHHHHHHHHHCCCCHHH T ss_conf 99863554677999999985029999407899999998638869999999998762897678-99999999862742999 Q ss_pred HHHHHHHHHHHCHHHHHHHHH Q ss_conf 887642211000023356699 Q gi|254780592|r 124 LDAINRAQRPDIPDWQLISAK 144 (271) Q Consensus 124 ~~~~~~a~~~~p~~~~~~~~l 144 (271) +..+.+++..+|++.+....+ T Consensus 186 L~~~~~al~~~P~~~~~~~~y 206 (818) T PRK10049 186 LRAMTEALRRAPSSQEVLTEY 206 (818) T ss_pred HHHHHHHHHHCCCCHHHHHHH T ss_conf 999999997689998999999 No 52 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=99.53 E-value=1.2e-12 Score=95.27 Aligned_cols=152 Identities=20% Similarity=0.164 Sum_probs=117.6 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 98420199999999999995757879999999999874100124466655663275210013555532100000112588 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLD 125 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~ 125 (271) ....|+-+++.....+....+|.+.+.+..+|....+.|++.+|+..+.++..+.|.+++++..+|.+|.+.|+++.|.. T Consensus 76 ~~~~G~a~~~l~~~~~~~~~~~~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr~~~Ar~ 155 (257) T COG5010 76 LYLRGDADSSLAVLQKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGRFDEARR 155 (257) T ss_pred HHHCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHH T ss_conf 77405555007887554014745499999999999980456889999999843699974666677899997367567789 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 764221100002335669999999742111110000000000000001245777654332235543599999 Q gi|254780592|r 126 AINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRF 197 (271) Q Consensus 126 ~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~ 197 (271) .|.++++..|.++....++|..+.-.|+.+.|...+..+....+.+..+..|++.....+|++++|.....+ T Consensus 156 ay~qAl~L~~~~p~~~nNlgms~~L~gd~~~A~~lll~a~l~~~ad~~v~~NLAl~~~~~g~~~~A~~i~~~ 227 (257) T COG5010 156 AYRQALELAPNEPSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVGLQGDFREAEDIAVQ 227 (257) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHCCC T ss_conf 999999842588446665899999727798999999999867887638999899998404886777762420 No 53 >KOG1127 consensus Probab=99.53 E-value=2e-13 Score=99.88 Aligned_cols=186 Identities=15% Similarity=0.086 Sum_probs=170.2 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 20199999999999995757879999999999874100124466655663275210013555532100000112588764 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAIN 128 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~ 128 (271) ..+...|...|-+++..||.-.-++-.+|.+|+..-+..+|..+|+++-+++|.+.++....++.+.+..+++.|..... T Consensus 471 rK~~~~al~ali~alrld~~~apaf~~LG~iYrd~~Dm~RA~kCf~KAFeLDatdaeaaaa~adtyae~~~we~a~~I~l 550 (1238) T KOG1127 471 RKNSALALHALIRALRLDVSLAPAFAFLGQIYRDSDDMKRAKKCFDKAFELDATDAEAAAASADTYAEESTWEEAFEICL 550 (1238) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 41078899999999854642238999988999878989999999988863791234467888887631541889999998 Q ss_pred HHHHHHCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 2211000023--35669999999742111110000000000000001245777654332235543599999995389854 Q gi|254780592|r 129 RAQRPDIPDW--QLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS 206 (271) Q Consensus 129 ~a~~~~p~~~--~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~ 206 (271) ++.+..|... ..|..+|..|...++..+|+..|+.++..+|++...|..+|.+|...|++..|+..|.+|..++|.+. T Consensus 551 ~~~qka~a~~~k~nW~~rG~yyLea~n~h~aV~~fQsALR~dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~ 630 (1238) T KOG1127 551 RAAQKAPAFACKENWVQRGPYYLEAHNLHGAVCEFQSALRTDPKDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSK 630 (1238) T ss_pred HHHHHCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHCCCHHH T ss_conf 87421547787752354043436852155589999998638906578899877887743762007776424574482768 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 7888999999986998999999998629 Q gi|254780592|r 207 RIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 207 ~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) -..+--|.+.-..|++.+|...+...+. T Consensus 631 y~~fk~A~~ecd~GkYkeald~l~~ii~ 658 (1238) T KOG1127 631 YGRFKEAVMECDNGKYKEALDALGLIIY 658 (1238) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999876668999999999999 No 54 >KOG1128 consensus Probab=99.53 E-value=1.9e-13 Score=100.00 Aligned_cols=182 Identities=15% Similarity=0.133 Sum_probs=146.5 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 98420199999999999995757879999999999874100124466655663275210013555532100000112588 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLD 125 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~ 125 (271) +...|+-..|.....+.++ +|.++..|..+|++.....=+++|.++.+.. +..+...+|......++++++.. T Consensus 434 Y~~lg~~~kaeei~~q~le-k~~d~~lyc~LGDv~~d~s~yEkawElsn~~------sarA~r~~~~~~~~~~~fs~~~~ 506 (777) T KOG1128 434 YLLLGQHGKAEEINRQELE-KDPDPRLYCLLGDVLHDPSLYEKAWELSNYI------SARAQRSLALLILSNKDFSEADK 506 (777) T ss_pred HHHHCCCCHHHHHHHHHHC-CCCCCHHHHHHHHHCCCHHHHHHHHHHHHHH------HHHHHHHHCCCCCCCCCHHHHHH T ss_conf 9984230047899999826-9985126997531203848999999986565------57787752543335212899999 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 76422110000233566999999974211111000000000000000124577765433223554359999999538985 Q gi|254780592|r 126 AINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD 205 (271) Q Consensus 126 ~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~ 205 (271) +++..+..+|-....|+.+|.+..+.++.+.|...|...+...|++..+|+|+..+|+..++..+|-..+++|++-+-++ T Consensus 507 hle~sl~~nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~eaWnNls~ayi~~~~k~ra~~~l~EAlKcn~~~ 586 (777) T KOG1128 507 HLERSLEINPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEAWNNLSTAYIRLKKKKRAFRKLKEALKCNYQH 586 (777) T ss_pred HHHHHHHCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 99987650663255787414899987636789999998962488820555236689999751488899999986147788 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 47888999999986998999999998629 Q gi|254780592|r 206 SRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 206 ~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) ..+|.|.-.+....|.+++|.+.+.+.+. T Consensus 587 w~iWENymlvsvdvge~eda~~A~~rll~ 615 (777) T KOG1128 587 WQIWENYMLVSVDVGEFEDAIKAYHRLLD 615 (777) T ss_pred CEEEECHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 73220312345531408999999999987 No 55 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=99.51 E-value=6.7e-12 Score=90.58 Aligned_cols=197 Identities=10% Similarity=0.052 Sum_probs=136.6 Q ss_pred HHHHHHHHHHHHHCCCCCC------CCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHHHCCHH Q ss_conf 9999999998540368543------31179999998420199999999999995757879---99999999987410012 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRI------SIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKM---IGIVYADVLRRVGRTAQ 88 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~---~~~~~a~~l~~~g~~~~ 88 (271) +++++++++++||++.... ......+.+.++.|++.+|+..|+......|..+. +.+.+|.++...|++++ T Consensus 8 ~~~~~l~l~l~gCs~~~~~~~~~~~~~~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~~A~qA~L~~ayayy~~~~y~~ 87 (243) T PRK10866 8 VAAATLSLFLAGCSGSKEEVPDNPPNEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPL 87 (243) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHH T ss_conf 99999999998768998888899999999999999987899999999999998689976789999999999873077899 Q ss_pred HHHHHHHHHHCCCCCCHH---HHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 446665566327521001---35555321000001125887642211000023356699999997421111100000000 Q gi|254780592|r 89 ALAVMRQVAILYPEDQEV---LAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERAL 165 (271) Q Consensus 89 A~~~~~~al~~~p~~~~~---~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~ 165 (271) |+..+++.++++|.++.+ +...|.++...+ +.....+... ++.+ ..++...+|...|...+ T Consensus 88 A~~~~~rFi~lyP~~~~v~Ya~Yl~gl~~~~~~--~~~~~~~~~~---~~~~-----------rD~~~~~~A~~~f~~li 151 (243) T PRK10866 88 AQAAIDRFMRLNPTHPNIDYVMYMRGLTNMALD--DSALQGFFGV---DRSD-----------RDPQHARAAFSDFSKLV 151 (243) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC--HHHHHHHHCC---CHHH-----------CCHHHHHHHHHHHHHHH T ss_conf 999999999867899973899999889877401--3588887432---5643-----------07699999999999999 Q ss_pred CCCCHHH-----------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCHHHH Q ss_conf 0000000-----------------124577765433223554359999999538985---47888999999986998999 Q gi|254780592|r 166 ELSPNES-----------------SIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD---SRIRQNLALVVGLQGRMKEA 225 (271) Q Consensus 166 ~~~p~~~-----------------~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~---~~~~~~La~~~~~~g~~~eA 225 (271) ...|++. .--...|..|++.|.+..|+.-++..++.-|+. .++...|+.+|..+|..++| T Consensus 152 ~~yP~S~Ya~dA~~r~~~lrn~LA~~El~Va~fY~kr~~y~AAi~R~~~vi~~Yp~t~~~~eAL~~l~eaY~~Lgl~~~A 231 (243) T PRK10866 152 RGYPNSQYTTDATKRLVFLKDRLAKYEYSVAEYYTKRGAWVAVVNRVEGMLRDYPDTQATRDALPLMENAYRQMQLNAQA 231 (243) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHH T ss_conf 87899845899999999999999999999999999926569999999999976899963399999999999986997899 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780592|r 226 YSIAS 230 (271) Q Consensus 226 ~~~~~ 230 (271) ....+ T Consensus 232 ~~~~~ 236 (243) T PRK10866 232 DKVAK 236 (243) T ss_pred HHHHH T ss_conf 99999 No 56 >KOG0553 consensus Probab=99.50 E-value=1.7e-13 Score=100.34 Aligned_cols=113 Identities=12% Similarity=0.101 Sum_probs=80.0 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 99998420199999999999995757879999999999874100124466655663275210013555532100000112 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) ....+..++|.+|+..|.+||+.+|+|+..+.+.|.+|.+.|+++.|+.-++.++.++|....+|..+|.+|..+|++++ T Consensus 88 GN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~~gk~~~ 167 (304) T KOG0553 88 GNKLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLALGKYEE 167 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHH T ss_conf 89999862599999999999845887626777899999986011789999999776290899999998799980673799 Q ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 588764221100002335669999999742111 Q gi|254780592|r 123 GLDAINRAQRPDIPDWQLISAKGSVLAQMGKHS 155 (271) Q Consensus 123 A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~ 155 (271) |.+.|++++.++|.+...+.++...-..++... T Consensus 168 A~~aykKaLeldP~Ne~~K~nL~~Ae~~l~e~~ 200 (304) T KOG0553 168 AIEAYKKALELDPDNESYKSNLKIAEQKLNEPK 200 (304) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 999997643168884899988999998724787 No 57 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=99.50 E-value=1.3e-13 Score=101.09 Aligned_cols=120 Identities=22% Similarity=0.218 Sum_probs=78.8 Q ss_pred HHHHHHHHHHHHHHHCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH---HHHHHHHHHHHHHHHHHHHHH Q ss_conf 112588764221100002--335669999999742111110000000000000001---245777654332235543599 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPD--WQLISAKGSVLAQMGKHSEALIEYERALELSPNESS---IVSNIAMSYLLMGDLKTAEEK 194 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~--~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~---~~~~~~~~~~~~g~~~~A~~~ 194 (271) +..+...+.+++...|.. +..+...|..+...|++++|++.|.+++++.|++.+ ++.++|.+|...|++++|+.+ T Consensus 15 ftv~ad~~~k~~p~~~~~k~A~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le~dp~D~s~ilyNiGlIY~~~G~~~kAie~ 94 (172) T PRK02603 15 FTVMADLILKILPINKKAKEAFVYYRDGMSAQNDGDYAEALENYEEALKLEEDPNDRSYILYNMAIIYASNGEHDKALET 94 (172) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 89999999998158863368999997225777457999999999998727789620899997799999986998899999 Q ss_pred HHHHHHCCCCCHHHHHHHHHHHHHCCC--------------HHHHHHHHHHHCCCCCCC Q ss_conf 999995389854788899999998699--------------899999999862917286 Q gi|254780592|r 195 LRFASQMIGADSRIRQNLALVVGLQGR--------------MKEAYSIASQELSPEEAT 239 (271) Q Consensus 195 ~~~al~~~p~~~~~~~~La~~~~~~g~--------------~~eA~~~~~~~L~~~~~~ 239 (271) |++|++++|..+.++.++|.+|..+|+ +++|.+++.+++...|.+ T Consensus 95 Y~kALeiNP~~~~AlNNlgvIY~k~G~~a~~~~~~~~~~~~fdkAa~yw~kAI~l~P~n 153 (172) T PRK02603 95 YHQALELNPKQPSALNNIAVIYHKRGRKAEEAGDQDEAEAWFDKAAEYWKQAIRLAPNN 153 (172) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999868997899999999999985999984587789999999999999988509987 No 58 >PRK12370 invasion protein regulator; Provisional Probab=99.50 E-value=3.4e-12 Score=92.38 Aligned_cols=199 Identities=11% Similarity=0.062 Sum_probs=161.9 Q ss_pred CCCCHHHHHH--HHHHHHHHH---CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC---------HHHHHHHHHHHH Q ss_conf 7578799999--999998741---00124466655663275210013555532100000---------112588764221 Q gi|254780592|r 66 HTKNKMIGIV--YADVLRRVG---RTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY---------LDEGLDAINRAQ 131 (271) Q Consensus 66 ~P~~~~~~~~--~a~~l~~~g---~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~---------~~~A~~~~~~a~ 131 (271) .|.+.++|.. +|.-++... +...|+.+|++++.++|+...++..++.||.+... ...|.....+++ T Consensus 252 ~~~sLDA~~~Yl~Gl~hl~rft~edn~~A~~LF~~Av~~dP~fA~AyA~la~cy~q~a~l~~~d~~~~~~~A~~~Aerav 331 (553) T PRK12370 252 ELNSIDSTMVYLRGKHELNQYTPYSLQQALKLLTQCVNMSPNSIAPYCALAECYLSMAQMGIFDKQNAMIKAKEHAIKAT 331 (553) T ss_pred CCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 89997489999978999973188779999999999873397744667878999997634036882889999999999987 Q ss_pred HHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 10000233566999999974211111000000000000000124577765433223554359999999538985478889 Q gi|254780592|r 132 RPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQN 211 (271) Q Consensus 132 ~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~ 211 (271) +.+|.++.+...+|.++.-.++++-+...|+++..++|+....++..|..+.-.|+..+|...++++++++|..+..+.. T Consensus 332 Eldp~dp~a~~~lg~~~~L~~d~~~a~~~f~rA~~L~Pnsa~~~Y~~~W~~~maGr~~ea~~~i~~a~~LdP~~~~~~~~ 411 (553) T PRK12370 332 ELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAGIT 411 (553) T ss_pred HCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 43878878889999998723552244799987642599842579999999997586899999999988359865202799 Q ss_pred HHHHHHHCCCHHHHHHHHHHHCCCCCC-CHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999869989999999986291728-66599999999832129999999971 Q gi|254780592|r 212 LALVVGLQGRMKEAYSIASQELSPEEA-TRNIKYIKSILSQRDPWKKIAKARSN 264 (271) Q Consensus 212 La~~~~~~g~~~eA~~~~~~~L~~~~~-~~~~~~l~~~l~~~~~~~~~a~a~~~ 264 (271) ...+..-....|+|+.....-.+..+. +.-+.-..+|.-.-.|..+.|+.... T Consensus 412 ~l~~~~~~~~~d~a~~l~~~~~~q~~~~~~~~~s~~~m~~~~~g~~~~a~~~~~ 465 (553) T PRK12370 412 KLWITYYHTGIDDAIRLGDELRSQHLQDNPILLSMQVMFLSLKGKHELARKLTK 465 (553) T ss_pred HHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 999999842310799999986250778783699999999973383899999998 No 59 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=99.49 E-value=3.9e-11 Score=85.91 Aligned_cols=224 Identities=12% Similarity=0.072 Sum_probs=142.2 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC-CHHHHHHHHHHHHHC Q ss_conf 7999999842019999999999999575787999999999987410012446665566327521-001355553210000 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPED-QEVLAAYGKSLANAG 118 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~-~~~~~~~~~~~~~~g 118 (271) ..++...+..|+|.+|+..+.++-+-.+.-.-+++.-+++-.++|+.+.+-.++.++.+..+++ ..+....++.+...| T Consensus 88 ~~egl~~l~eG~~~qAEkl~~rnae~~e~p~l~~l~aA~AA~qrgd~~~an~yL~eaae~~~~~~l~v~ltrarlll~~~ 167 (400) T COG3071 88 LNEGLLKLFEGDFQQAEKLLRRNAEHGEQPVLAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAVELTRARLLLNRR 167 (400) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC T ss_conf 99999987067389999999976534734089999999998863057789899999862589940889999999998678 Q ss_pred CHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC------------------------------- Q ss_conf 0112588764221100002335669999999742111110000000000------------------------------- Q gi|254780592|r 119 YLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALEL------------------------------- 167 (271) Q Consensus 119 ~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~------------------------------- 167 (271) +++.|...........|.+++.......+|.+.|++........+..+. T Consensus 168 d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~~~g~~~~ll~~l~~L~ka~~l~~~e~~~le~~a~~glL~q~~~~~~~~g 247 (400) T COG3071 168 DYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSDEEAARLEQQAWEGLLQQARDDNGSEG 247 (400) T ss_pred CCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 86568988999987286886999999999998511899999889998735788299999999999999998736200147 Q ss_pred -----------CCHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------------------------HHHCCCCC Q ss_conf -----------000001245777654332235543599999-------------------------------99538985 Q gi|254780592|r 168 -----------SPNESSIVSNIAMSYLLMGDLKTAEEKLRF-------------------------------ASQMIGAD 205 (271) Q Consensus 168 -----------~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~-------------------------------al~~~p~~ 205 (271) .-+++.+...++.-+...|+.++|....+. .+...|++ T Consensus 248 L~~~W~~~pr~lr~~p~l~~~~a~~li~l~~~~~A~~~i~~~Lk~~~D~~L~~~~~~l~~~d~~~l~k~~e~~l~~h~~~ 327 (400) T COG3071 248 LKTWWKNQPRKLRNDPELVVAYAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQHPED 327 (400) T ss_pred HHHHHHHCCHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 89999866497605834899999999975876889999999987445856999886518899367999999999839998 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4788899999998699899999999862917286659999999983212999999997 Q gi|254780592|r 206 SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDPWKKIAKARS 263 (271) Q Consensus 206 ~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~~~~a~a~~ 263 (271) +..+..||..+...+.|.+|...++.+++..++..+...+...+.+.+.-.+..+.+. T Consensus 328 p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~s~~~~~~la~~~~~~g~~~~A~~~r~ 385 (400) T COG3071 328 PLLLSTLGRLALKNKLWGKASEALEAALKLRPSASDYAELADALDQLGEPEEAEQVRR 385 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 1499999999998417899999999997258974369999999998188688999999 No 60 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=99.49 E-value=5.2e-12 Score=91.26 Aligned_cols=217 Identities=10% Similarity=0.046 Sum_probs=142.0 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHCCH Q ss_conf 999998420199999999999995757879999999999874100124466655663275210-0135555321000001 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ-EVLAAYGKSLANAGYL 120 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~-~~~~~~~~~~~~~g~~ 120 (271) ++.-.+..|++..|...+.+..+....-.-.++.-|.+-.++|++++.-.+++++.+..|+.. .+....+......|++ T Consensus 90 ~Gl~~l~EG~~~~AeK~l~k~a~~~~~pllnyL~AArAA~~~g~~~~rd~yL~~A~e~~p~a~lav~lt~Ael~l~~~q~ 169 (398) T PRK10747 90 QALLKLAEGDYQQVEKLMAKNADHAEQPVVNYLLAAEAAQQRGDEARANQHLERAAELAGNDTIPVEITRVRLQLARNEN 169 (398) T ss_pred HHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCH T ss_conf 99999877799999999986343567748899999999998889899999999988858984889999999999977998 Q ss_pred HHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC---CC---------------------------- Q ss_conf 12588764221100002335669999999742111110000000000---00---------------------------- Q gi|254780592|r 121 DEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALEL---SP---------------------------- 169 (271) Q Consensus 121 ~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~---~p---------------------------- 169 (271) +.|...+.+.....|.++.....+..+|...++++.....+....+. .+ T Consensus 170 e~Ala~L~~L~~~~p~~~~VL~Ll~~~y~~~~dW~aL~~LLp~L~K~~~l~~~e~~~L~~~a~~~ll~~~~~~~~~~~L~ 249 (398) T PRK10747 170 HAARHGVDKLLEVTPRHPEVLRLAEQAYIRTGAWSSLLDIIPSMAKAHVGDEEHRAMLEQQAWIGLMDQARADNGSEGLR 249 (398) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 99999999998619999999999999999874599999999999875799999999999999999999987323479999 Q ss_pred -----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf -----------000124577765433223554359999999538985478889999999869989999999986291728 Q gi|254780592|r 170 -----------NESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEA 238 (271) Q Consensus 170 -----------~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~ 238 (271) .++.....++..+...|+.++|...+.+.+...++..-+ ..++.+ ..++..+.....++-+...|. T Consensus 250 ~~W~~lp~~~r~~~~l~~~ya~~Li~~g~~~~A~~ll~~~Lkk~wd~~Lv-~ly~~l--~~~d~~~~l~~~e~wlk~~~~ 326 (398) T PRK10747 250 NWWKNQSRKTRHQVALQVAMAEHLIECDDHDTAQQIIIDGLKRQYDDRLV-LPIPRL--KTNNPEQLEKVLRQQIKQVGD 326 (398) T ss_pred HHHHHCCHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHH-HHHHHC--CCCCHHHHHHHHHHHHHCCCC T ss_conf 99986899875699999999999986899689999999998618998999-998756--779989999999998731899 Q ss_pred CHHHHHHHHHH-HHHHHHHHHHHH Q ss_conf 66599999999-832129999999 Q gi|254780592|r 239 TRNIKYIKSIL-SQRDPWKKIAKA 261 (271) Q Consensus 239 ~~~~~~l~~~l-~~~~~~~~~a~a 261 (271) ++.+....+.+ .+.+.|.++..+ T Consensus 327 ~~~Ll~aLGrL~~~~~lwgkA~~y 350 (398) T PRK10747 327 RPLLWSTLGQSLMKHGEWQEASLA 350 (398) T ss_pred CHHHHHHHHHHHHHHCHHHHHHHH T ss_conf 989999999999973509999999 No 61 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=99.49 E-value=2.3e-11 Score=87.31 Aligned_cols=194 Identities=13% Similarity=0.076 Sum_probs=121.9 Q ss_pred CCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 33117999999842019999999999999575787999999999987410012446665566327521001355553210 Q gi|254780592|r 36 ISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLA 115 (271) Q Consensus 36 ~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~ 115 (271) ........+-....|++..|+.++++..+..|.+.......-.++.=.|+-.+++.++++....-|-+..++..++..+. T Consensus 31 ~d~~yd~~Ii~AR~G~~~~al~~L~~~~~~~~~~~~~v~D~l~i~sWag~d~~v~~~~~~~~~~~~lp~~~l~~~Aka~R 110 (818) T PRK10049 31 ADTAYDALIIEARAGDTAPALSWLQQKSKAGALSNGQVADWLQIALWAGQDQEVITVYERYQSHMQLPARGLAAAARAYR 110 (818) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf 25689999999864882368999998753387667889889999853477779999999618888899799999999986 Q ss_pred HHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00001125887642211000023356699999997421111100000000000000012457776543322355435999 Q gi|254780592|r 116 NAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKL 195 (271) Q Consensus 116 ~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~ 195 (271) ..++|+.|++.+++++...|.++..+..++..+...|+.+.|...+.+.....|+... +..++.++...|+.-.++..+ T Consensus 111 n~kq~~~Al~ly~~~~~~~P~n~d~~lGlalt~Ada~~~~~Al~~~~~~~~~~p~~~~-~~~~aY~~~~~~~~~~aL~~~ 189 (818) T PRK10049 111 NLKQWQNSLTLWQKALSLEPQNKDYQSGQILTLADAGHYDEALVKAKQLVAGAPDVAN-LLALAYIYRLAGRHYDALRAM 189 (818) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH-HHHHHHHHHHCCCCHHHHHHH T ss_conf 3554677999999985029999407899999998638869999999998762897678-999999998627429999999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99995389854788899999998699899999999 Q gi|254780592|r 196 RFASQMIGADSRIRQNLALVVGLQGRMKEAYSIAS 230 (271) Q Consensus 196 ~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~ 230 (271) ++++...|++.++...+-.++...+-...|..... T Consensus 190 ~~al~~~P~~~~~~~~y~~al~~~~i~~pA~~l~~ 224 (818) T PRK10049 190 TEALRRAPSSQEVLTEYVQALQRNRIAAPALRLAK 224 (818) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 99997689998999999999986699718999765 No 62 >KOG4162 consensus Probab=99.48 E-value=4.9e-11 Score=85.28 Aligned_cols=212 Identities=16% Similarity=0.096 Sum_probs=160.5 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 99999999999995757879999999999874100124466655663275-21001355553210000011258876422 Q gi|254780592|r 52 LLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYP-EDQEVLAAYGKSLANAGYLDEGLDAINRA 130 (271) Q Consensus 52 ~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p-~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 130 (271) -.+++..++++++.+|.||.+.+.++.-|...++.+.|..+.++++++++ ++..+|..++.++...+++.+|+...+.+ T Consensus 460 h~kslqale~av~~d~~dp~~if~lalq~A~~R~l~sAl~~~~eaL~l~~~~~~~~whLLALvlSa~kr~~~Al~vvd~a 539 (799) T KOG4162 460 HKKSLQALEEAVQFDPTDPLVIFYLALQYAEQRQLTSALDYAREALALNRGDSAKAWHLLALVLSAQKRLKEALDVVDAA 539 (799) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999861899824899999999999769999999999987157764889999999996224167789999999 Q ss_pred HHHHCHHH------------------------------------------------------------------------ Q ss_conf 11000023------------------------------------------------------------------------ Q gi|254780592|r 131 QRPDIPDW------------------------------------------------------------------------ 138 (271) Q Consensus 131 ~~~~p~~~------------------------------------------------------------------------ 138 (271) +...|.+. T Consensus 540 l~E~~~N~~l~~~~~~i~~~~~~~e~~l~t~~~~L~~we~~~~~q~~~~~g~~~~lk~~l~la~~q~~~a~s~sr~ls~l 619 (799) T KOG4162 540 LEEFGDNHVLMDGKIHIELTFNDREEALDTCIHKLALWEAEYGVQQTLDEGKLLRLKAGLHLALSQPTDAISTSRYLSSL 619 (799) T ss_pred HHHHHHHHHHCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99741222001011101221165888888899999998731167665413456654121143765654530346898899 Q ss_pred -------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf -------------------------------3566999999974211111000000000000000124577765433223 Q gi|254780592|r 139 -------------------------------QLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGD 187 (271) Q Consensus 139 -------------------------------~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~ 187 (271) ..|...+..+...++.+++..++.++....|.....++..|..+..+|+ T Consensus 620 ~a~~~~~~~se~~Lp~s~~~~~~~~~~~~~~~lwllaa~~~~~~~~~~~a~~CL~Ea~~~~~l~~~~~~~~G~~~~~~~~ 699 (799) T KOG4162 620 VASQLKSAGSELKLPSSTVLPGPDSLWYLLQKLWLLAADLFLLSGNDDEARSCLLEASKIDPLSASVYYLRGLLLEVKGQ 699 (799) T ss_pred HHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 98655421561016741015788732789999999999998853883288999999875031338999986489998774 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHH--HHHHHCCCCCCCHHHHHH-HHHHHHHHHHHHHHHHHH Q ss_conf 5543599999995389854788899999998699899999--999862917286659999-999983212999999997 Q gi|254780592|r 188 LKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYS--IASQELSPEEATRNIKYI-KSILSQRDPWKKIAKARS 263 (271) Q Consensus 188 ~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~--~~~~~L~~~~~~~~~~~l-~~~l~~~~~~~~~a~a~~ 263 (271) ..+|...|..|+.++|++..+...+|.++...|+..-|.. .+..++..+|.+...+|. +.+..+++..++.+..+. T Consensus 700 ~~EA~~af~~Al~ldP~hv~s~~Ala~~lle~G~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k~~Gd~~~Aaecf~ 778 (799) T KOG4162 700 LEEAKEAFLVALALDPDHVPSMTALAELLLELGSPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFKKLGDSKQAAECFQ 778 (799) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 6899999999873288874778999999997188420778999999986088777999999999997466678999999 No 63 >KOG1156 consensus Probab=99.48 E-value=1.4e-11 Score=88.55 Aligned_cols=201 Identities=15% Similarity=0.100 Sum_probs=144.8 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC Q ss_conf 79999998420199999999999995757879999999999874100124466655663275210013555532100000 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~ 119 (271) ..++..-.+.+||..-....++.+++.|++++.+-..|..+...|+.++|..+...++..++...-.|..+|.++....+ T Consensus 11 F~~~lk~yE~kQYkkgLK~~~~iL~k~~eHgeslAmkGL~L~~lg~~~ea~~~vr~glr~d~~S~vCwHv~gl~~R~dK~ 90 (700) T KOG1156 11 FRRALKCYETKQYKKGLKLIKQILKKFPEHGESLAMKGLTLNCLGKKEEAYELVRLGLRNDLKSHVCWHVLGLLQRSDKK 90 (700) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99999998877888689999999873776422578604323024661889999987740276655107888888741175 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 11258876422110000233566999999974211111000000000000000124577765433223554359999999 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFAS 199 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al 199 (271) |++|..+|+.++..+|++.+.+..++.+-.+.++++.....-...++..|.....|...+.++...|++..|...++.-. T Consensus 91 Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmRd~~~~~~tr~~LLql~~~~ra~w~~~Avs~~L~g~y~~A~~il~ef~ 170 (700) T KOG1156 91 YDEAIKCYRNALKIEKDNLQILRDLSLLQIQMRDYEGYLETRNQLLQLRPSQRASWIGFAVAQHLLGEYKMALEILEEFE 170 (700) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999999833997299999999999999861618888999977224537999999999999877887789999999 Q ss_pred HCC---CCC---------------------------------------HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 538---985---------------------------------------47888999999986998999999998629172 Q gi|254780592|r 200 QMI---GAD---------------------------------------SRIRQNLALVVGLQGRMKEAYSIASQELSPEE 237 (271) Q Consensus 200 ~~~---p~~---------------------------------------~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~ 237 (271) +.. |+. ......-|.++..+|+.++|..+++.-+...| T Consensus 171 ~t~~~~~s~~~~e~se~~Ly~n~i~~E~g~~q~ale~L~~~e~~i~Dkla~~e~ka~l~~kl~~lEeA~~~y~~Ll~rnP 250 (700) T KOG1156 171 KTQNTSPSKEDYEHSELLLYQNQILIEAGSLQKALEHLLDNEKQIVDKLAFEETKADLLMKLGQLEEAVKVYRRLLERNP 250 (700) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 86204998888999999899999999724688999999852067777778864288899987418768999999986293 Q ss_pred CCH Q ss_conf 866 Q gi|254780592|r 238 ATR 240 (271) Q Consensus 238 ~~~ 240 (271) .+- T Consensus 251 dn~ 253 (700) T KOG1156 251 DNL 253 (700) T ss_pred HHH T ss_conf 367 No 64 >KOG1128 consensus Probab=99.48 E-value=2.7e-12 Score=92.99 Aligned_cols=203 Identities=13% Similarity=0.052 Sum_probs=165.1 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 98420199999999999995757879999999999874100124466655663275210013555532100000112588 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLD 125 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~ 125 (271) .++-|=..+|+..+.+ ++.|-....+|...|+..+|..+..+-++ .|.++..|..+|++....--|++|.+ T Consensus 408 l~slGitksAl~I~Er--------lemw~~vi~CY~~lg~~~kaeei~~q~le-k~~d~~lyc~LGDv~~d~s~yEkawE 478 (777) T KOG1128 408 LLSLGITKSALVIFER--------LEMWDPVILCYLLLGQHGKAEEINRQELE-KDPDPRLYCLLGDVLHDPSLYEKAWE 478 (777) T ss_pred HHHHCHHHHHHHHHHH--------HHHHHHHHHHHHHHCCCCHHHHHHHHHHC-CCCCCHHHHHHHHHCCCHHHHHHHHH T ss_conf 9970107899999984--------87778789999984230047899999826-99851269975312038489999999 Q ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 76422110000233566999999974211111000000000000000124577765433223554359999999538985 Q gi|254780592|r 126 AINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD 205 (271) Q Consensus 126 ~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~ 205 (271) ..+.... .+...+|......++++++...++.+++.+|-....|++.|.+.++.+++..|..+|...+...|++ T Consensus 479 lsn~~sa------rA~r~~~~~~~~~~~fs~~~~hle~sl~~nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~ 552 (777) T KOG1128 479 LSNYISA------RAQRSLALLILSNKDFSEADKHLERSLEINPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDN 552 (777) T ss_pred HHHHHHH------HHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 8656557------7877525433352128999999998765066325578741489998763678999999896248882 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHH-HHHHHHHHHHHHHH Q ss_conf 47888999999986998999999998629172866599999999-83212999999997 Q gi|254780592|r 206 SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSIL-SQRDPWKKIAKARS 263 (271) Q Consensus 206 ~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l-~~~~~~~~~a~a~~ 263 (271) .++|.|++..|...|+..+|...++.+++-+..+-.+.-.-.++ ..-+.|+++-+++. T Consensus 553 ~eaWnNls~ayi~~~~k~ra~~~l~EAlKcn~~~w~iWENymlvsvdvge~eda~~A~~ 611 (777) T KOG1128 553 AEAWNNLSTAYIRLKKKKRAFRKLKEALKCNYQHWQIWENYMLVSVDVGEFEDAIKAYH 611 (777) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEECHHHHHHHCCCHHHHHHHHH T ss_conf 05552366899997514888999999861477887322031234553140899999999 No 65 >KOG0624 consensus Probab=99.47 E-value=4.9e-12 Score=91.45 Aligned_cols=165 Identities=17% Similarity=0.123 Sum_probs=111.2 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHH---H---------- Q ss_conf 999998420199999999999995757879999999999874100124466655663275210013---5---------- Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVL---A---------- 108 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~---~---------- 108 (271) .+..++.-|....|+..+.+.++..|+-..+.+..|.++..+|++++|..-|+++++.+|++.... . T Consensus 78 RaT~yLAmGksk~al~Dl~rVlelKpDF~~ARiQRg~vllK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~ 157 (504) T KOG0624 78 RATVYLAMGKSKAALQDLSRVLELKPDFMAARIQRGVVLLKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWV 157 (504) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHCCCCHHHHHHHHCHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 77777652577400330899986183188888872203553252888987999997259873026889988776899999 Q ss_pred --HHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf --555321000001125887642211000023356699999997421111100000000000000012457776543322 Q gi|254780592|r 109 --AYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMG 186 (271) Q Consensus 109 --~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g 186 (271) .....+...|+...+.+.....++..|.++..+..++.||...|+...|+...+.+-++..++.+.++.+..++...| T Consensus 158 l~~ql~s~~~~GD~~~ai~~i~~llEi~~Wda~l~~~Rakc~i~~~e~k~AI~Dlk~askLs~DnTe~~ykis~L~Y~vg 237 (504) T KOG0624 158 LVQQLKSASGSGDCQNAIEMITHLLEIQPWDASLRQARAKCYIAEGEPKKAIHDLKQASKLSQDNTEGHYKISQLLYTVG 237 (504) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 99999988438746559999988885371336899999888885386888888899987404665388898998997661 Q ss_pred HHHHHHHHHHHHHHCCCCCH Q ss_conf 35543599999995389854 Q gi|254780592|r 187 DLKTAEEKLRFASQMIGADS 206 (271) Q Consensus 187 ~~~~A~~~~~~al~~~p~~~ 206 (271) +.+.++...++.++++|++. T Consensus 238 d~~~sL~~iRECLKldpdHK 257 (504) T KOG0624 238 DAENSLKEIRECLKLDPDHK 257 (504) T ss_pred HHHHHHHHHHHHHCCCCCHH T ss_conf 18878999999980695333 No 66 >KOG1127 consensus Probab=99.47 E-value=6.9e-12 Score=90.50 Aligned_cols=208 Identities=14% Similarity=0.080 Sum_probs=159.9 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH-----HHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 4201999999999999957578799999999998-----74100124466655663275210013555532100000112 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLR-----RVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~-----~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) +...+.++.-.+.+++++.-+-....+..+..+. .+.+...|...|-+++.++|....++..+|.+|....+-.. T Consensus 431 nd~slselswc~~~~~ek~mdva~~~~~e~~~~w~a~~~~rK~~~~al~ali~alrld~~~apaf~~LG~iYrd~~Dm~R 510 (1238) T KOG1127 431 NDDSLSELSWCLPRALEKMMDVALLLECENSEFWVALGCMRKNSALALHALIRALRLDVSLAPAFAFLGQIYRDSDDMKR 510 (1238) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 76266676677777677642499999987779999999964107889999999985464223899998899987898999 Q ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5887642211000023356699999997421111100000000000000--01245777654332235543599999995 Q gi|254780592|r 123 GLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNE--SSIVSNIAMSYLLMGDLKTAEEKLRFASQ 200 (271) Q Consensus 123 A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~--~~~~~~~~~~~~~~g~~~~A~~~~~~al~ 200 (271) |...|+++...|+.+.+.+...+..|....+.+.|........+..|.. ...|...|..|...++...|+..|+.|+. T Consensus 511 A~kCf~KAFeLDatdaeaaaa~adtyae~~~we~a~~I~l~~~qka~a~~~k~nW~~rG~yyLea~n~h~aV~~fQsALR 590 (1238) T KOG1127 511 AKKCFDKAFELDATDAEAAAASADTYAEESTWEEAFEICLRAAQKAPAFACKENWVQRGPYYLEAHNLHGAVCEFQSALR 590 (1238) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHC T ss_conf 99999888637912344678888876315418899999988742154778775235404343685215558999999863 Q ss_pred CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 3898547888999999986998999999998629172866599999999832129 Q gi|254780592|r 201 MIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDPW 255 (271) Q Consensus 201 ~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~ 255 (271) .+|.+.+.|..||.+|...|++.-|.+++.++.-.+|.+-...+-.+.+.+..|. T Consensus 591 ~dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~y~~fk~A~~ecd~Gk 645 (1238) T KOG1127 591 TDPKDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSKYGRFKEAVMECDNGK 645 (1238) T ss_pred CCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 8906578899877887743762007776424574482768999999999987666 No 67 >KOG3060 consensus Probab=99.46 E-value=2.2e-11 Score=87.43 Aligned_cols=162 Identities=22% Similarity=0.150 Sum_probs=90.5 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999874100124466655663275210013555532100000112588764221100002335669999999742111 Q gi|254780592|r 76 YADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHS 155 (271) Q Consensus 76 ~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~ 155 (271) ...+....|+.+-|-.+++++....|+..++....|..+...|++++|.+.|+..+..+|.+...+-..-.+...+|+.- T Consensus 58 V~IAAld~~~~~lAq~C~~~L~~~fp~S~RV~~lkam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAilka~GK~l 137 (289) T KOG3060 58 VFIAALDTGRDDLAQKCINQLRDRFPGSKRVGKLKAMLLEATGNYKEAIEYYESLLEDDPTDTVIRKRKLAILKAQGKNL 137 (289) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCH T ss_conf 99999983313789999999998678970679899999998613545999999984468631489999999999718857 Q ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCH---HHHHHHHHHH Q ss_conf 1100000000000000012457776543322355435999999953898547888999999986998---9999999986 Q gi|254780592|r 156 EALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRM---KEAYSIASQE 232 (271) Q Consensus 156 ~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~---~eA~~~~~~~ 232 (271) +|++.+...++..+++.++|..++.+|+..|+++.|.-++++.+=..|.++..+..||.++.-+|-. .-|.++|.++ T Consensus 138 ~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~gg~eN~~~arkyy~~a 217 (289) T KOG3060 138 EAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQGGAENLELARKYYERA 217 (289) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999835769999999999976768999999999999759871899999999999984188899999999999 Q ss_pred CCCCC Q ss_conf 29172 Q gi|254780592|r 233 LSPEE 237 (271) Q Consensus 233 L~~~~ 237 (271) +...+ T Consensus 218 lkl~~ 222 (289) T KOG3060 218 LKLNP 222 (289) T ss_pred HHHCH T ss_conf 87274 No 68 >KOG0553 consensus Probab=99.45 E-value=2.3e-12 Score=93.41 Aligned_cols=117 Identities=19% Similarity=0.200 Sum_probs=66.1 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 99999999874100124466655663275210013555532100000112588764221100002335669999999742 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG 152 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~ 152 (271) +..-|.-++..++|.+|+..|.++++++|.++..+.+.+.+|.++|.++.|.+.++.++..||..+.+|..+|..|..+| T Consensus 84 LK~eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~~g 163 (304) T KOG0553 84 LKNEGNKLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLALG 163 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC T ss_conf 99988999986259999999999984588762677789999998601178999999977629089999999879998067 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 1111100000000000000012457776543322355 Q gi|254780592|r 153 KHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLK 189 (271) Q Consensus 153 ~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~ 189 (271) ++++|++.|+++++++|++.....++..+-...+... T Consensus 164 k~~~A~~aykKaLeldP~Ne~~K~nL~~Ae~~l~e~~ 200 (304) T KOG0553 164 KYEEAIEAYKKALELDPDNESYKSNLKIAEQKLNEPK 200 (304) T ss_pred CHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 3799999997643168884899988999998724787 No 69 >KOG2376 consensus Probab=99.45 E-value=7.9e-12 Score=90.17 Aligned_cols=193 Identities=21% Similarity=0.190 Sum_probs=145.6 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 99842019999999999999575787999999999987410012446665566327521001355553210000011258 Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) -+...++|++|+....+.+...|+++.++.-.-.++.+.++|++|+.+.++-....-.+. ..+..+.|.++++..++|+ T Consensus 21 ~~~~~~e~e~a~k~~~Kil~~~pdd~~a~~cKvValIq~~ky~~ALk~ikk~~~~~~~~~-~~fEKAYc~Yrlnk~Deal 99 (652) T KOG2376 21 RHGKNGEYEEAVKTANKILSIVPDDEDAIRCKVVALIQLDKYEDALKLIKKNGALLVINS-FFFEKAYCEYRLNKLDEAL 99 (652) T ss_pred HHCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHCCH-HHHHHHHHHHHCCCHHHHH T ss_conf 733533799999999888725899476686667642036679999999985111431111-3578888998705678899 Q ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC------------------------------CC-HHHH Q ss_conf 8764221100002335669999999742111110000000000------------------------------00-0001 Q gi|254780592|r 125 DAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALEL------------------------------SP-NESS 173 (271) Q Consensus 125 ~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~------------------------------~p-~~~~ 173 (271) ..++ ..++.+.......+.++.+++++++|.+.|+...+. .| ++.+ T Consensus 100 k~~~---~~~~~~~~ll~L~AQvlYrl~~ydealdiY~~L~kn~~dd~d~~~r~nl~a~~a~l~~~~~q~v~~v~e~sye 176 (652) T KOG2376 100 KTLK---GLDRLDDKLLELRAQVLYRLERYDEALDIYQHLAKNNSDDQDEERRANLLAVAAALQVQLLQSVPEVPEDSYE 176 (652) T ss_pred HHHH---CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHH T ss_conf 9875---0355661889999999998753899999999998558705789999789998874047798745588854299 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHC-------CCCC--------HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 2457776543322355435999999953-------8985--------478889999999869989999999986291728 Q gi|254780592|r 174 IVSNIAMSYLLMGDLKTAEEKLRFASQM-------IGAD--------SRIRQNLALVVGLQGRMKEAYSIASQELSPEEA 238 (271) Q Consensus 174 ~~~~~~~~~~~~g~~~~A~~~~~~al~~-------~p~~--------~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~ 238 (271) .++|.+.++...|+|.+|++.+++++.+ +..+ ..++..|+.++..+|+..||.+.|...+..++. T Consensus 177 l~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~~i~~~~~ 256 (652) T KOG2376 177 LLYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVDIIKRNPA 256 (652) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC T ss_conf 99999999985034899999999999988876314665306688887789999999999943308789999999872688 Q ss_pred CHH Q ss_conf 665 Q gi|254780592|r 239 TRN 241 (271) Q Consensus 239 ~~~ 241 (271) +.. T Consensus 257 D~~ 259 (652) T KOG2376 257 DEP 259 (652) T ss_pred CCH T ss_conf 822 No 70 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=99.43 E-value=1.6e-11 Score=88.25 Aligned_cols=135 Identities=20% Similarity=0.120 Sum_probs=114.4 Q ss_pred CCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 52100135555321000001125887642211000023356699999997421111100000000000000012457776 Q gi|254780592|r 101 PEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAM 180 (271) Q Consensus 101 p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~ 180 (271) |....++...+..++..|.+++|+..++..+...|+++-.+...+.++...++..+|.+.+++++...|+....+.++|. T Consensus 303 ~~~~aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~ 382 (484) T COG4783 303 RGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQ 382 (484) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 36537788899999983270157899999998489987999999999987588677999999998408985079999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC Q ss_conf 5433223554359999999538985478889999999869989999999986291 Q gi|254780592|r 181 SYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSP 235 (271) Q Consensus 181 ~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~ 235 (271) +|+..|++.+|+..++..+..+|+++..|..||..|..+|+..+|...+...+.. T Consensus 383 all~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~~g~~~~a~~A~AE~~~~ 437 (484) T COG4783 383 ALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARAEGYAL 437 (484) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 9995699789999999986448777369999999999847617799999999986 No 71 >KOG3060 consensus Probab=99.42 E-value=1.6e-11 Score=88.27 Aligned_cols=180 Identities=14% Similarity=0.084 Sum_probs=163.3 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCH Q ss_conf 99999984201999999999999957578799999999998741001244666556632752100135555321000001 Q gi|254780592|r 41 PESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYL 120 (271) Q Consensus 41 ~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~ 120 (271) .-.+..+..+...-|...++..-..-|+++.+....|..+...|++++|+++++..++-+|.+.-.+...-.+...+|+. T Consensus 57 qV~IAAld~~~~~lAq~C~~~L~~~fp~S~RV~~lkam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAilka~GK~ 136 (289) T KOG3060 57 QVFIAALDTGRDDLAQKCINQLRDRFPGSKRVGKLKAMLLEATGNYKEAIEYYESLLEDDPTDTVIRKRKLAILKAQGKN 136 (289) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 99999998331378999999999867897067989999999861354599999998446863148999999999971885 Q ss_pred HHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH---HHHHHHHHHHH Q ss_conf 125887642211000023356699999997421111100000000000000012457776543322---35543599999 Q gi|254780592|r 121 DEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMG---DLKTAEEKLRF 197 (271) Q Consensus 121 ~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g---~~~~A~~~~~~ 197 (271) .+|++.+...++.-+.+.++|..++.+|...|++++|.-+++..+-..|.++..+..++..+..+| ++..|..+|.+ T Consensus 137 l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~gg~eN~~~arkyy~~ 216 (289) T KOG3060 137 LEAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQGGAENLELARKYYER 216 (289) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 79999999999983576999999999997676899999999999975987189999999999998418889999999999 Q ss_pred HHHCCCCCHHHHHHHHHHHHHCC Q ss_conf 99538985478889999999869 Q gi|254780592|r 198 ASQMIGADSRIRQNLALVVGLQG 220 (271) Q Consensus 198 al~~~p~~~~~~~~La~~~~~~g 220 (271) ++++.|.+.+.++.+-.+-...- T Consensus 217 alkl~~~~~ral~GI~lc~~~la 239 (289) T KOG3060 217 ALKLNPKNLRALFGIYLCGSALA 239 (289) T ss_pred HHHHCHHHHHHHHHHHHHHHHHH T ss_conf 98727473899999999899999 No 72 >PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional Probab=99.42 E-value=3e-12 Score=92.73 Aligned_cols=121 Identities=15% Similarity=0.089 Sum_probs=82.9 Q ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC---HHHHHHH Q ss_conf 0199999999999995757879999999999874100124466655663275210013555532100000---1125887 Q gi|254780592|r 50 EQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY---LDEGLDA 126 (271) Q Consensus 50 ~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~---~~~A~~~ 126 (271) ...+..+..++..++.+|+|.+.|+.+|.+|...|+++.|+..|++++++.|+++.++..++.++...++ .+++... T Consensus 61 q~~e~~~~~L~~rL~~~Pdd~~gW~~LGr~y~~~~~~~~A~~Ay~kA~~L~p~~~~il~~yA~aL~~~~~~~~t~~~~~l 140 (206) T PRK10370 61 QTPEAQLQALQDKIRANPQNSEQWALLGEYYLWRNAYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREM 140 (206) T ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 41799999999999839988899999999999768989999999999975799878999999999987598776899999 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 64221100002335669999999742111110000000000000 Q gi|254780592|r 127 INRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN 170 (271) Q Consensus 127 ~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~ 170 (271) +++++..+|.+..++..+|......++|..|+...++.++..|. T Consensus 141 L~~AL~lDp~~~~AL~Lla~~AFe~~dY~~AI~~Wq~lL~~~~p 184 (206) T PRK10370 141 IDKALALDANEVTALMLLASDAFMQADYAQAIELWQKLLDLNSP 184 (206) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCC T ss_conf 99998749897889999999999826499999999999845799 No 73 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=99.42 E-value=1e-12 Score=95.61 Aligned_cols=98 Identities=27% Similarity=0.363 Sum_probs=68.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC Q ss_conf 56699999997421111100000000000000012457776543322355435999999953898547888999999986 Q gi|254780592|r 140 LISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQ 219 (271) Q Consensus 140 ~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~ 219 (271) .+..+|.++...|++++|+..|.++++..|+++.++..+|.++...|++++|+.+|+++++.+|.+..++.++|.++..+ T Consensus 2 ~~~~lg~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~~~ 81 (100) T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKL 81 (100) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC T ss_conf 89999999998499999999999999859998999999999999859858999999999986989899999999999994 Q ss_pred CCHHHHHHHHHHHCCCCC Q ss_conf 998999999998629172 Q gi|254780592|r 220 GRMKEAYSIASQELSPEE 237 (271) Q Consensus 220 g~~~eA~~~~~~~L~~~~ 237 (271) |++++|...+++++..+| T Consensus 82 ~~~~~A~~~~~~al~~~P 99 (100) T cd00189 82 GKYEEALEAYEKALELDP 99 (100) T ss_pred CCHHHHHHHHHHHHHHCC T ss_conf 999999999999997490 No 74 >KOG0550 consensus Probab=99.42 E-value=2.1e-12 Score=93.62 Aligned_cols=167 Identities=18% Similarity=0.166 Sum_probs=131.9 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHH------------ Q ss_conf 99999999987410012446665566327521001355553210000011258876422110000233------------ Q gi|254780592|r 72 IGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQ------------ 139 (271) Q Consensus 72 ~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~------------ 139 (271) +-+.-+.++...|++++|....-..+++++.+..++...|.++...++.+.+..++++++..+|.... T Consensus 171 a~~lka~cl~~~~~~~~a~~ea~~ilkld~~n~~al~vrg~~~yy~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le 250 (486) T KOG0550 171 AKLLKAECLAFLGDYDEAQSEAIDILKLDATNAEALYVRGLCLYYNDNADKAINHFQQALRLDPDHQKSKSASMMPKKLE 250 (486) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 88741056543034316778888887516301678886153310232068888887652026802566776764078999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 5669999999742111110000000000000001----245777654332235543599999995389854788899999 Q gi|254780592|r 140 LISAKGSVLAQMGKHSEALIEYERALELSPNESS----IVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALV 215 (271) Q Consensus 140 ~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~----~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~ 215 (271) .+-..|+...+.|++..|.+.|..++.++|.+.. .+.+.+......|+.++|+..+..|++++|....++...|.+ T Consensus 251 ~~k~~gN~~fk~G~y~~A~E~Yteal~idP~n~~~naklY~nra~v~~rLgrl~eaisdc~~Al~iD~syikall~ra~c 330 (486) T KOG0550 251 VKKERGNDAFKNGNYRKAYECYTEALNIDPSNKKTNAKLYGNRALVNIRLGRLREAISDCNEALKIDSSYIKALLRRANC 330 (486) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 88750566763442467899887763379322224689998767664206771656654466663479888999998898 Q ss_pred HHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 99869989999999986291728 Q gi|254780592|r 216 VGLQGRMKEAYSIASQELSPEEA 238 (271) Q Consensus 216 ~~~~g~~~eA~~~~~~~L~~~~~ 238 (271) +..+++|.+|.+.+++++..+.. T Consensus 331 ~l~le~~e~AV~d~~~a~q~~~s 353 (486) T KOG0550 331 HLALEKWEEAVEDYEKAMQLEKD 353 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 88998999999999999863056 No 75 >KOG1156 consensus Probab=99.42 E-value=8.7e-12 Score=89.91 Aligned_cols=177 Identities=13% Similarity=0.051 Sum_probs=126.2 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCH Q ss_conf 99999984201999999999999957578799999999998741001244666556632752100135555321000001 Q gi|254780592|r 41 PESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYL 120 (271) Q Consensus 41 ~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~ 120 (271) ......-.-|+-++|.+..+..+..||.+...|-.+|.+++...++++|+.+|..|+...|++...+..++....+++++ T Consensus 46 mkGL~L~~lg~~~ea~~~vr~glr~d~~S~vCwHv~gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmRd~ 125 (700) T KOG1156 46 MKGLTLNCLGKKEEAYELVRLGLRNDLKSHVCWHVLGLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMRDY 125 (700) T ss_pred HCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 60432302466188999998774027665510788888874117589999999999833997299999999999999861 Q ss_pred HHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---C---------------------------- Q ss_conf 125887642211000023356699999997421111100000000000---0---------------------------- Q gi|254780592|r 121 DEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELS---P---------------------------- 169 (271) Q Consensus 121 ~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~---p---------------------------- 169 (271) +.....-.+.+...|.....|...+....-.|++..|....+...... | T Consensus 126 ~~~~~tr~~LLql~~~~ra~w~~~Avs~~L~g~y~~A~~il~ef~~t~~~~~s~~~~e~se~~Ly~n~i~~E~g~~q~al 205 (700) T KOG1156 126 EGYLETRNQLLQLRPSQRASWIGFAVAQHLLGEYKMALEILEEFEKTQNTSPSKEDYEHSELLLYQNQILIEAGSLQKAL 205 (700) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 61888899997722453799999999999987788778999999986204998888999999899999999724688999 Q ss_pred -----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf -----------000124577765433223554359999999538985478889999999 Q gi|254780592|r 170 -----------NESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVG 217 (271) Q Consensus 170 -----------~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~ 217 (271) +........+.++...+++++|...+...+..+|++...+..+-.++. T Consensus 206 e~L~~~e~~i~Dkla~~e~ka~l~~kl~~lEeA~~~y~~Ll~rnPdn~~Yy~~l~~~lg 264 (700) T KOG1156 206 EHLLDNEKQIVDKLAFEETKADLLMKLGQLEEAVKVYRRLLERNPDNLDYYEGLEKALG 264 (700) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 99985206777777886428889998741876899999998629336789999999999 No 76 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=99.41 E-value=1.9e-11 Score=87.76 Aligned_cols=156 Identities=20% Similarity=0.147 Sum_probs=138.1 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHH Q ss_conf 99957578799999999998741001244666556632752100135555321000001125887642211000023356 Q gi|254780592|r 62 QYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLI 141 (271) Q Consensus 62 al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~ 141 (271) ....+|.-+.+++..+..++..|++++|...++.++...|+++..+...+.++...++..+|.+.+++++..+|...-.+ T Consensus 298 ~~~~~~~~~aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~ 377 (484) T COG4783 298 AKRSKRGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQ 377 (484) T ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHH T ss_conf 97617365377888999999832701578999999984899879999999999875886779999999984089850799 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC Q ss_conf 69999999742111110000000000000001245777654332235543599999995389854788899999998699 Q gi|254780592|r 142 SAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGR 221 (271) Q Consensus 142 ~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~ 221 (271) .++|..+...|++.+|+..+.......|+++..|..++.+|..+|+..++.. ..|..+...|+ T Consensus 378 ~~~a~all~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~~g~~~~a~~-----------------A~AE~~~~~G~ 440 (484) T COG4783 378 LNLAQALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALL-----------------ARAEGYALAGR 440 (484) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH-----------------HHHHHHHHCCC T ss_conf 9999999956997899999999864487773699999999998476177999-----------------99999986788 Q ss_pred HHHHHHHHHHHCC Q ss_conf 8999999998629 Q gi|254780592|r 222 MKEAYSIASQELS 234 (271) Q Consensus 222 ~~eA~~~~~~~L~ 234 (271) +++|+..+.++.. T Consensus 441 ~~~A~~~l~~A~~ 453 (484) T COG4783 441 LEQAIIFLMRASQ 453 (484) T ss_pred HHHHHHHHHHHHH T ss_conf 7899999999998 No 77 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=99.38 E-value=1.1e-10 Score=83.04 Aligned_cols=212 Identities=9% Similarity=-0.002 Sum_probs=173.9 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCH-----HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHH Q ss_conf 842019999999999999575787-----999999999987410012446665566327521001355553210000011 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNK-----MIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLD 121 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~-----~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~ 121 (271) -+.|..+.|+..=+ .+-..|+-+ -++..+|.-|+..|-+|.|..+|..+++....-..++..+-.+|....+|+ T Consensus 80 RsRGEvDRAIRiHQ-~L~~spdlT~~qr~lAl~qL~~Dym~aGl~DRAE~~f~~L~de~efa~~AlqqLl~IYQ~treW~ 158 (389) T COG2956 80 RSRGEVDRAIRIHQ-TLLESPDLTFEQRLLALQQLGRDYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWE 158 (389) T ss_pred HHCCHHHHHHHHHH-HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 86350889999999-88528998668999999999899998564668999999973463432899999999999836899 Q ss_pred HHHHHHHHHHHHHCHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 2588764221100002-----33566999999974211111000000000000000124577765433223554359999 Q gi|254780592|r 122 EGLDAINRAQRPDIPD-----WQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLR 196 (271) Q Consensus 122 ~A~~~~~~a~~~~p~~-----~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~ 196 (271) +|.+..++.....+.. +..+..++..+....+.+.|...+.+++..+|+...+-..+|.++...|+|..|++.++ T Consensus 159 KAIe~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~~~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e 238 (389) T COG2956 159 KAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALE 238 (389) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99999999997188521258999999999997403118999999999986282001035454388875266699999999 Q ss_pred HHHHCCCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 999538985-478889999999869989999999986291728665999999998321299999 Q gi|254780592|r 197 FASQMIGAD-SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDPWKKIA 259 (271) Q Consensus 197 ~al~~~p~~-~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~~~~a 259 (271) .+++.+|+. +++...|..+|..+|+.++....+.++....+...-...+.-++...+|..++. T Consensus 239 ~v~~Qn~~yl~evl~~L~~~Y~~lg~~~~~~~fL~~~~~~~~g~~~~l~l~~l~e~~eG~~~Aq 302 (389) T COG2956 239 RVLEQNPEYLSEVLEMLYECYAQLGKPAEGLNFLRRAMETNTGADAELMLADLIELQEGIDAAQ 302 (389) T ss_pred HHHHHCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCHHHHH T ss_conf 9998495779999999999999947877899999999864478519999999999710779999 No 78 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=99.37 E-value=9.6e-13 Score=95.76 Aligned_cols=97 Identities=24% Similarity=0.337 Sum_probs=40.5 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 99999999874100124466655663275210013555532100000112588764221100002335669999999742 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG 152 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~ 152 (271) ++++|.++...|++++|+..|+++++++|.++.++..+|.++...|++++|...+++++..+|.++..+..+|.++..+| T Consensus 3 ~~~lg~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~~~~ 82 (100) T cd00189 3 LLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLG 82 (100) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC T ss_conf 99999999984999999999999998599989999999999998598589999999999869898999999999999949 Q ss_pred HHHHHHHHHHHCCCCCC Q ss_conf 11111000000000000 Q gi|254780592|r 153 KHSEALIEYERALELSP 169 (271) Q Consensus 153 ~~~~A~~~~~~~~~~~p 169 (271) ++++|+..|+++++..| T Consensus 83 ~~~~A~~~~~~al~~~P 99 (100) T cd00189 83 KYEEALEAYEKALELDP 99 (100) T ss_pred CHHHHHHHHHHHHHHCC T ss_conf 99999999999997490 No 79 >pfam04733 Coatomer_E Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex. Probab=99.33 E-value=8.5e-10 Score=77.68 Aligned_cols=201 Identities=16% Similarity=0.115 Sum_probs=149.7 Q ss_pred CHHHHHHHHHHHHH--HCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 19999999999999--5757879999999999874100124466655663275210013555532100000112588764 Q gi|254780592|r 51 QLLEVTSTIGLQYQ--SHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAIN 128 (271) Q Consensus 51 ~~~~A~~~~~~al~--~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~ 128 (271) +.+.+...+...+. ..+.|+...+..|.+|...|++++|+.++.+. ++.+.......++..++|.+-|.+.++ T Consensus 81 ~~~~~~~~l~~~~~~~~~~~~~~~~~~aati~~~~g~~e~AL~~l~~~-----~~lE~~al~VQi~L~~nR~DLA~ke~~ 155 (290) T pfam04733 81 RKESILASLKEWVADSHIGSNSTLRLLAAIIFIHEGDFDDALKHLHKG-----ENLEAMALNVQILLKMHRIDLAEQQLK 155 (290) T ss_pred CHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCC-----CCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 145699999999874015667899999999999779999999998558-----966799999999997288367999999 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHH--HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 221100002335669999999742--111110000000000000001245777654332235543599999995389854 Q gi|254780592|r 129 RAQRPDIPDWQLISAKGSVLAQMG--KHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS 206 (271) Q Consensus 129 ~a~~~~p~~~~~~~~lg~~~~~~~--~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~ 206 (271) .....+.+..-.....+.+....| .+..|.-.|.......+.++..++..+.+.+.+|++++|...+++++..+|.++ T Consensus 156 ~~q~~~eDs~l~qLa~awv~l~~Ggek~q~A~yif~EL~~~~~~t~lllng~Av~~m~~~~~~eAe~~L~eAl~~d~~d~ 235 (290) T pfam04733 156 KMQQIDEDATLTQLANAWVKLAVGGEKIQDAYYIFQEFSEKYDSTPLLLNGQAVCCMCLGRYEEAESLLKEALDKDAKDP 235 (290) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH T ss_conf 98843510599999999999972647789899999999852289789999999999874787999999999987288988 Q ss_pred HHHHHHHHHHHHCCCHHHH-HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 7888999999986998999-9999986291728665999999998321299999 Q gi|254780592|r 207 RIRQNLALVVGLQGRMKEA-YSIASQELSPEEATRNIKYIKSILSQRDPWKKIA 259 (271) Q Consensus 207 ~~~~~La~~~~~~g~~~eA-~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~~~~a 259 (271) +...|+..+....|+..|. .+++.+.-...|.++.+ ..+..+...+..++ T Consensus 236 dtlaNliv~s~~~gK~~e~~~ryl~qL~~~~p~H~~v---~~~~~k~~~FD~~~ 286 (290) T pfam04733 236 ETLINLVVCALHLGKPAEVSNRNLSQLKLSHPTHPLV---KDLNEKEAEFDRAV 286 (290) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH---HHHHHHHHHHHHHH T ss_conf 9999999999980998789999999998629998599---99999999999999 No 80 >PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional Probab=99.32 E-value=4.4e-11 Score=85.59 Aligned_cols=119 Identities=18% Similarity=0.148 Sum_probs=81.1 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH---HHHHHHHHH Q ss_conf 12446665566327521001355553210000011258876422110000233566999999974211---111000000 Q gi|254780592|r 87 AQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKH---SEALIEYER 163 (271) Q Consensus 87 ~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~---~~A~~~~~~ 163 (271) ++.+.-++.-++.+|++...|..+|++|...|+++.|...|+++....|++++.+..++..+...++. .++...+++ T Consensus 64 e~~~~~L~~rL~~~Pdd~~gW~~LGr~y~~~~~~~~A~~Ay~kA~~L~p~~~~il~~yA~aL~~~~~~~~t~~~~~lL~~ 143 (206) T PRK10370 64 EAQLQALQDKIRANPQNSEQWALLGEYYLWRNAYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDK 143 (206) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 99999999999839988899999999999768989999999999975799878999999999987598776899999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 000000000124577765433223554359999999538985 Q gi|254780592|r 164 ALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD 205 (271) Q Consensus 164 ~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~ 205 (271) +++.+|++..++..+|...+..+||.+|+.++++.+...|.+ T Consensus 144 AL~lDp~~~~AL~Lla~~AFe~~dY~~AI~~Wq~lL~~~~p~ 185 (206) T PRK10370 144 ALALDANEVTALMLLASDAFMQADYAQAIELWQKLLDLNSPR 185 (206) T ss_pred HHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCC T ss_conf 987498978899999999998264999999999998457999 No 81 >KOG4340 consensus Probab=99.26 E-value=3.6e-10 Score=80.01 Aligned_cols=187 Identities=16% Similarity=0.096 Sum_probs=154.8 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 99842019999999999999575787999999999987410012446665566327521001355553210000011258 Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) ..+....|..|+..+....+.+|++...+..+|.+|+...++..|..+++++-.+.|.......-.+..+...+.+.+|+ T Consensus 19 ~lI~d~ry~DaI~~l~s~~Er~p~~rAgLSlLgyCYY~~Q~f~~AA~CYeQL~ql~P~~~qYrlY~AQSLY~A~i~ADAL 98 (459) T KOG4340 19 RLIRDARYADAIQLLGSELERSPRSRAGLSLLGYCYYRLQEFALAAECYEQLGQLHPELEQYRLYQAQSLYKACIYADAL 98 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 99987667889999988874396006788899999999999988889999998637188889999999999703528899 Q ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 87642211000023356699999997421111100000000000000012457776543322355435999999953898 Q gi|254780592|r 125 DAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA 204 (271) Q Consensus 125 ~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~ 204 (271) .........+.-..+....-+.+....++...+....++.-. .++.+...+.|-+..+.|+++.|+.-|+.|++.+.- T Consensus 99 rV~~~~~D~~~L~~~~lqLqaAIkYse~Dl~g~rsLveQlp~--en~Ad~~in~gCllykegqyEaAvqkFqaAlqvsGy 176 (459) T KOG4340 99 RVAFLLLDNPALHSRVLQLQAAIKYSEGDLPGSRSLVEQLPS--ENEADGQINLGCLLYKEGQYEAAVQKFQAALQVSGY 176 (459) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCC--CCCHHHHHCCHHHEECCCCHHHHHHHHHHHHHHCCC T ss_conf 999995684989999999999986054555354999985778--885023303012000165489999999998730576 Q ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 54788899999998699899999999862 Q gi|254780592|r 205 DSRIRQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 205 ~~~~~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) ++..-+++|++....|+++.|.......+ T Consensus 177 qpllAYniALaHy~~~qyasALk~iSEIi 205 (459) T KOG4340 177 QPLLAYNLALAHYSSRQYASALKHISEII 205 (459) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 72789999999873120778999999999 No 82 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=99.25 E-value=1.3e-09 Score=76.61 Aligned_cols=187 Identities=18% Similarity=0.121 Sum_probs=145.3 Q ss_pred HHHCCHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 842019999999999999575-7879999999999874100124466655663275210013555532100000112588 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHT-KNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLD 125 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P-~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~ 125 (271) -.-||++.|-.++.++-+..| ++....+..+.++...|+++.|.+-..++++..|.+++++....++|...|+|..... T Consensus 129 ~qrgd~~~an~yL~eaae~~~~~~l~v~ltrarlll~~~d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~~~g~~~~ll~ 208 (400) T COG3071 129 QQRGDEDRANRYLAEAAELAGDDTLAVELTRARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLA 208 (400) T ss_pred HHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHH T ss_conf 86305778989999986258994088999999999867886568988999987286886999999999998511899999 Q ss_pred HHHHHHH------------------------------------------HHCHHHHHHHHHHHHHHHHHHHHHHHHHHH- Q ss_conf 7642211------------------------------------------000023356699999997421111100000- Q gi|254780592|r 126 AINRAQR------------------------------------------PDIPDWQLISAKGSVLAQMGKHSEALIEYE- 162 (271) Q Consensus 126 ~~~~a~~------------------------------------------~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~- 162 (271) .+.+..+ .-..+++.-..++.-+...|++++|.+... T Consensus 209 ~l~~L~ka~~l~~~e~~~le~~a~~glL~q~~~~~~~~gL~~~W~~~pr~lr~~p~l~~~~a~~li~l~~~~~A~~~i~~ 288 (400) T COG3071 209 ILPKLRKAGLLSDEEAARLEQQAWEGLLQQARDDNGSEGLKTWWKNQPRKLRNDPELVVAYAERLIRLGDHDEAQEIIED 288 (400) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 88999873578829999999999999999873620014789999866497605834899999999975876889999999 Q ss_pred ------------------------------HCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf ------------------------------00000000001245777654332235543599999995389854788899 Q gi|254780592|r 163 ------------------------------RALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNL 212 (271) Q Consensus 163 ------------------------------~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~L 212 (271) +.++..|+++..+..+|..+...+.+.+|..+|+.+++..|+. ..+..+ T Consensus 289 ~Lk~~~D~~L~~~~~~l~~~d~~~l~k~~e~~l~~h~~~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~s~-~~~~~l 367 (400) T COG3071 289 ALKRQWDPRLCRLIPRLRPGDPEPLIKAAEKWLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSA-SDYAEL 367 (400) T ss_pred HHHHCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH-HHHHHH T ss_conf 9874458569998865188993679999999998399981499999999998417899999999997258974-369999 Q ss_pred HHHHHHCCCHHHHHHHHHHHCC Q ss_conf 9999986998999999998629 Q gi|254780592|r 213 ALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 213 a~~~~~~g~~~eA~~~~~~~L~ 234 (271) |.++..+|+..+|..+.+.+|. T Consensus 368 a~~~~~~g~~~~A~~~r~e~L~ 389 (400) T COG3071 368 ADALDQLGEPEEAEQVRREALL 389 (400) T ss_pred HHHHHHCCCHHHHHHHHHHHHH T ss_conf 9999981886889999999998 No 83 >COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones] Probab=99.25 E-value=1.6e-10 Score=82.13 Aligned_cols=129 Identities=21% Similarity=0.183 Sum_probs=111.0 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHH Q ss_conf 11258876422110000233566999999974211111000000000000000124577765433223---554359999 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGD---LKTAEEKLR 196 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~---~~~A~~~~~ 196 (271) .+.....++..+..+|++.+-|..+|.+|..+|+.+.|...|.+++.+.|++++.+..+|.++..+.+ ..++...|+ T Consensus 138 ~~~l~a~Le~~L~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll~ 217 (287) T COG4235 138 MEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLR 217 (287) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 99999999999984988724499999999984443479999999987379988999999999999538844499999999 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHH-HHHHHHH Q ss_conf 999538985478889999999869989999999986291728665-9999999 Q gi|254780592|r 197 FASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRN-IKYIKSI 248 (271) Q Consensus 197 ~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~-~~~l~~~ 248 (271) +++..+|++.++.+.||..+...|++.+|...++.-|+..|.+.. ...+... T Consensus 218 ~al~~D~~~iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~~~~rr~~ie~~ 270 (287) T COG4235 218 QALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRRSLIERS 270 (287) T ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 99843973288999999999981459999999999985189998237899999 No 84 >KOG2053 consensus Probab=99.25 E-value=2.7e-09 Score=74.62 Aligned_cols=214 Identities=17% Similarity=0.167 Sum_probs=161.1 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 99998420199999999999995757879999999999874100124466655663275210013555532100000112 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) -.+.+..+++..|....++.++++|+.+.+..--|.++.++|+.++|..+++..-...+.+...+..+..+|..+|+.++ T Consensus 16 i~d~ld~~qfkkal~~~~kllkk~Pn~~~a~vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~tLq~l~~~y~d~~~~d~ 95 (932) T KOG2053 16 IYDLLDSSQFKKALAKLGKLLKKHPNALYAKVLKALSLFRLGKGDEALKLLEALYGLKGTDDLTLQFLQNVYRDLGKLDE 95 (932) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 88775267799999999999987899179999999999981586467888751304799856899999999998742568 Q ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH-H---------HHHHH Q ss_conf 5887642211000023356699999997421111100000000000000012457776543322-3---------55435 Q gi|254780592|r 123 GLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMG-D---------LKTAE 192 (271) Q Consensus 123 A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g-~---------~~~A~ 192 (271) +...|+++...+|+ .+....+-.+|.+.+.+.+-.+..-+..+..|+.+-..+....+.++.- . ..-|. T Consensus 96 ~~~~Ye~~~~~~P~-eell~~lFmayvR~~~yk~qQkaa~~LyK~~pk~~yyfWsV~Slilqs~~~~~~~~~~i~l~LA~ 174 (932) T KOG2053 96 AVHLYERANQKYPS-EELLYHLFMAYVREKSYKKQQKAALQLYKNFPKRAYYFWSVISLILQSIFSENELLDPILLALAE 174 (932) T ss_pred HHHHHHHHHHHCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHH T ss_conf 99999998710884-88999999999998889999999999998587664058999999998225886321103278899 Q ss_pred HHHHHHHHCC-CCCH--HHHHHHHHHHHHCCCHHHHHHHHHHHCCCC--CCCHHHH-HHHHHHHHHHHHHHH Q ss_conf 9999999538-9854--788899999998699899999999862917--2866599-999999832129999 Q gi|254780592|r 193 EKLRFASQMI-GADS--RIRQNLALVVGLQGRMKEAYSIASQELSPE--EATRNIK-YIKSILSQRDPWKKI 258 (271) Q Consensus 193 ~~~~~al~~~-p~~~--~~~~~La~~~~~~g~~~eA~~~~~~~L~~~--~~~~~~~-~l~~~l~~~~~~~~~ 258 (271) ..+++.++.+ +-.+ ++.. .-.++..+|.++||..++..-+... +.+..+. .-..++..-+.|+++ T Consensus 175 ~m~~~~l~~~gk~~s~aE~~L-yl~iL~~~~k~~eal~~l~~~la~~l~~~~~~l~~~~~dllk~l~~w~~l 245 (932) T KOG2053 175 KMVQKLLEKKGKIESEAEIIL-YLLILELQGKYQEALEFLAITLAEKLTSANLYLENKKLDLLKLLNRWQEL 245 (932) T ss_pred HHHHHHHCCCCCCCHHHHHHH-HHHHHHHCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHH T ss_conf 999998614775103899999-99999862467889999978888742654468999888999984676889 No 85 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=99.19 E-value=7.7e-11 Score=84.10 Aligned_cols=76 Identities=12% Similarity=-0.015 Sum_probs=41.9 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCC---CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC Q ss_conf 9998420199999999999995757---879999999999874100124466655663275210013555532100000 Q gi|254780592|r 44 LENMNHEQLLEVTSTIGLQYQSHTK---NKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 44 ~~~~~~~~~~~A~~~~~~al~~~P~---~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~ 119 (271) +...+.|++.+|++.|.++++..++ ...++.++|.++...|++++|+++|.++++++|+.+.++.++|.+|..+|+ T Consensus 39 msaQs~GeyaEALe~Y~eAL~Le~D~~Drs~iLYNIGlIy~~nGe~~kALeyY~qALeiNP~~pqA~NNlGvIyh~qG~ 117 (165) T CHL00033 39 MSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMAVICHYRGE 117 (165) T ss_pred CHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 1010135389999999986234568512668998999999985888899999999998599968999789999999879 No 86 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=99.18 E-value=6.6e-09 Score=72.25 Aligned_cols=196 Identities=17% Similarity=0.070 Sum_probs=127.5 Q ss_pred HHHHCCCCC--------CCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 854036854--------33117999999842019999999999999575787---9999999999874100124466655 Q gi|254780592|r 27 FSSCHLNPR--------ISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNK---MIGIVYADVLRRVGRTAQALAVMRQ 95 (271) Q Consensus 27 ~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~---~~~~~~a~~l~~~g~~~~A~~~~~~ 95 (271) .++|+.... ......++...++.|++.+|+..|++....+|.++ .+.+.++.++++.|++++|+..+++ T Consensus 17 ~~~cs~~~~~~~~~~~p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~~~~~~qa~l~l~yA~Yk~~~y~~A~~~~dr 96 (254) T COG4105 17 STGCSGDKDKNGVYNLPASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDR 96 (254) T ss_pred HCCCCCCHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 12577623330555788999999999998637889999999999870999811289999999998813558999999999 Q ss_pred HHHCCCCCCHH---HHHHHHHHHH--------HCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 66327521001---3555532100--------000112588764221100002335669999999742111110000000 Q gi|254780592|r 96 VAILYPEDQEV---LAAYGKSLAN--------AGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERA 164 (271) Q Consensus 96 al~~~p~~~~~---~~~~~~~~~~--------~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~ 164 (271) .+.++|.++.+ +...|.++.. ......|...++..+..-|+.. =+...-.+. T Consensus 97 Fi~lyP~~~n~dY~~YlkgLs~~~~i~~~~rDq~~~~~A~~~f~~~i~ryPnS~-----------------Ya~dA~~~i 159 (254) T COG4105 97 FIRLYPTHPNADYAYYLKGLSYFFQIDDVTRDQSAARAAFAAFKELVQRYPNSR-----------------YAPDAKARI 159 (254) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCC-----------------CHHHHHHHH T ss_conf 998789998716999999998752677544477999999999999998788970-----------------235599999 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf 000000001245777654332235543599999995389854---7888999999986998999999998629172866 Q gi|254780592|r 165 LELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS---RIRQNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 165 ~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~---~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) ......-..-=...|..|...|.+..|+.-++..++..|+.+ +++..+..+|..+|-.++|...- +.|..+..+. T Consensus 160 ~~~~d~LA~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~t~~~~eaL~~l~eaY~~lgl~~~a~~~~-~vl~~N~p~s 237 (254) T COG4105 160 VKLNDALAGHEMAIARYYLKRGAYVAAINRFEEVLENYPDTSAVREALARLEEAYYALGLTDEAKKTA-KVLGANYPDS 237 (254) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHH-HHHHHCCCCC T ss_conf 99999988999999999999057699999999998714555204778999999999828727899999-9997368988 No 87 >pfam04733 Coatomer_E Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex. Probab=99.17 E-value=8.1e-09 Score=71.70 Aligned_cols=205 Identities=13% Similarity=0.089 Sum_probs=153.4 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH--CCCCCCHHHHHHHHHHHHHCC Q ss_conf 999998420199999999999995757879999999999874100124466655663--275210013555532100000 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAI--LYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~--~~p~~~~~~~~~~~~~~~~g~ 119 (271) ....++..|++...+..+.. ..+-...+...++..+....+.+.+++.+.+.+. ..+.++.+....|.++...|+ T Consensus 41 ~~Ra~IAlg~~~~vl~ei~~---~~~p~L~av~~lA~y~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~aati~~~~g~ 117 (290) T pfam04733 41 MYRSYLALGSYQIVISEIKE---SAATPLQAVRLLAEYLNSPSRKESILASLKEWVADSHIGSNSTLRLLAAIIFIHEGD 117 (290) T ss_pred HHHHHHHCCCHHHHHHHCCC---CCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC T ss_conf 99999973770378886267---898317999999998748411456999999998740156678999999999997799 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH--HHHHHHHHHHH Q ss_conf 1125887642211000023356699999997421111100000000000000012457776543322--35543599999 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMG--DLKTAEEKLRF 197 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g--~~~~A~~~~~~ 197 (271) +++|+..+.+. .+.+.......++.++++.+.|.+.+..+.+.+.+..-.....+++.+..| ++.+|.-.|++ T Consensus 118 ~e~AL~~l~~~-----~~lE~~al~VQi~L~~nR~DLA~ke~~~~q~~~eDs~l~qLa~awv~l~~Ggek~q~A~yif~E 192 (290) T pfam04733 118 FDDALKHLHKG-----ENLEAMALNVQILLKMHRIDLAEQQLKKMQQIDEDATLTQLANAWVKLAVGGEKIQDAYYIFQE 192 (290) T ss_pred HHHHHHHHHCC-----CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 99999998558-----9667999999999972883679999999884351059999999999997264778989999999 Q ss_pred HHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH Q ss_conf 995389854788899999998699899999999862917286659999999983212 Q gi|254780592|r 198 ASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDP 254 (271) Q Consensus 198 al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~ 254 (271) .-+..+..+-.+..+|.+...+|+++||+..+..++..++.++++......+.+.-| T Consensus 193 L~~~~~~t~lllng~Av~~m~~~~~~eAe~~L~eAl~~d~~d~dtlaNliv~s~~~g 249 (290) T pfam04733 193 FSEKYDSTPLLLNGQAVCCMCLGRYEEAESLLKEALDKDAKDPETLINLVVCALHLG 249 (290) T ss_pred HHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC T ss_conf 985228978999999999987478799999999998728898899999999999809 No 88 >PRK11189 lipoprotein NlpI; Provisional Probab=99.16 E-value=7.5e-10 Score=78.02 Aligned_cols=216 Identities=14% Similarity=0.078 Sum_probs=146.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCC-CHHHHHHHHHHHCCHHHHHHHHHHHHHHCC----CCHHHHHHHHHHHHHHHCCHHH Q ss_conf 99999999999985403685433-117999999842019999999999999575----7879999999999874100124 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNPRIS-IPDPESLENMNHEQLLEVTSTIGLQYQSHT----KNKMIGIVYADVLRRVGRTAQA 89 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~A~~~~~~al~~~P----~~~~~~~~~a~~l~~~g~~~~A 89 (271) +...+++.++++++||++++... ....-+....-.=+.+=++.-+.+.+.... .-...++..|.+|-..|-..-| T Consensus 5 ~r~~~~~~~~l~LsGC~s~~~~~~~~~~~a~plqp~~q~Ev~laRl~qlL~~~~Lt~e~rA~l~yeRGvlyDSlGLr~LA 84 (297) T PRK11189 5 LRWCFVATAALLLAGCASSNTSWRKSEVLAVPLQPSYQQEVILARLNQILASRDLTDEERAQLHYERGVLYDSLGLRALA 84 (297) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 99999999999998640588764448651446787799999999999997434589899999999831477641209999 Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 46665566327521001355553210000011258876422110000233566999999974211111000000000000 Q gi|254780592|r 90 LAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSP 169 (271) Q Consensus 90 ~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p 169 (271) .--|.+++.+.|+-+.++..+|..+...|+++.|.+.|+.+++.+|+..-++.++|..+...|+++.|.+.+.+....+| T Consensus 85 R~DF~qAL~l~P~~adayN~LGiy~tq~~~fD~AyEAFDs~LEL~P~y~YA~lNRGIAlYYg~R~~LA~~D~~~fy~~dp 164 (297) T PRK11189 85 RNDFSQALALRPDMADAYNYLGIYLTQEGEFDAAYEAFDSVLELDPTYDYAYLNRGIALYYGGRYDLAQDDLLAFYQDDP 164 (297) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 86499897539985899999999999971199999998777726987348998302788744637889999999996099 Q ss_pred HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 000124577765433--22355435999999953898547888999999986998999999998629 Q gi|254780592|r 170 NESSIVSNIAMSYLL--MGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 170 ~~~~~~~~~~~~~~~--~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) +++-=.. +.|+. .-+..+|...+++-.+...++... .++...|...-..++........+. T Consensus 165 ~DpYR~L---WLYl~e~~~d~~~A~~~L~~r~~~~~~~~W~-~~iV~~ylg~ise~~~~~~~~~~~~ 227 (297) T PRK11189 165 NDPYRAL---WLYLIEQKLDPKQAKENLKQRYEKLDKEQWG-WNIVDFYLGKISEEQLFERLKAGAT 227 (297) T ss_pred CCCHHHH---HHHHHHCCCCHHHHHHHHHHHHHHCCCHHHH-HHHHHHHHCCCCHHHHHHHHHHHCC T ss_conf 9827999---9999981169999999999999835712268-9999999458999999999987273 No 89 >PRK10803 hypothetical protein; Provisional Probab=99.16 E-value=1.3e-08 Score=70.34 Aligned_cols=102 Identities=10% Similarity=0.112 Sum_probs=89.1 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCC Q ss_conf 997421111100000000000000---0124577765433223554359999999538985---4788899999998699 Q gi|254780592|r 148 LAQMGKHSEALIEYERALELSPNE---SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD---SRIRQNLALVVGLQGR 221 (271) Q Consensus 148 ~~~~~~~~~A~~~~~~~~~~~p~~---~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~---~~~~~~La~~~~~~g~ 221 (271) ....+++++|+..|..-++..|++ +.+++++|.+|..+|++++|+..|.+.++..|++ +++++.+|.++...|+ T Consensus 152 vl~~k~y~~Ai~aF~~Fi~~yP~s~y~~nA~YWLGe~~y~~~~~~~A~~~F~~vv~~yP~s~K~pdAllklg~~~~~lg~ 231 (262) T PRK10803 152 VQDKSRQDDAIVAFQNFIKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGD 231 (262) T ss_pred HHHHCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 99735699999999999987879987743099999999974378999999999998677998678999999999998599 Q ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHH Q ss_conf 8999999998629172866599999999 Q gi|254780592|r 222 MKEAYSIASQELSPEEATRNIKYIKSIL 249 (271) Q Consensus 222 ~~eA~~~~~~~L~~~~~~~~~~~l~~~l 249 (271) .++|...|++.+...|........+..+ T Consensus 232 ~~~Ak~~l~~vi~~YP~S~aA~lAk~~L 259 (262) T PRK10803 232 TAKAKAVYQQVIKKYPGTDGAKQAQKRL 259 (262) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 9999999999999885998999999998 No 90 >COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones] Probab=99.13 E-value=7.4e-10 Score=78.08 Aligned_cols=121 Identities=20% Similarity=0.193 Sum_probs=85.3 Q ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH---HHHHHHHHHHHCC Q ss_conf 4466655663275210013555532100000112588764221100002335669999999742---1111100000000 Q gi|254780592|r 89 ALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG---KHSEALIEYERAL 165 (271) Q Consensus 89 A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~---~~~~A~~~~~~~~ 165 (271) -+.-++.-+..+|++.+-|..+|.+|..+|+++.|...|.++....|++++.+..+|.++.... ...++...+++++ T Consensus 141 l~a~Le~~L~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll~~al 220 (287) T COG4235 141 LIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALLRQAL 220 (287) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 99999999984988724499999999984443479999999987379988999999999999538844499999999998 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 00000001245777654332235543599999995389854788 Q gi|254780592|r 166 ELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIR 209 (271) Q Consensus 166 ~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~ 209 (271) ..+|++..++..+|..++.+|+|.+|...++..+...|.+..-+ T Consensus 221 ~~D~~~iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~~~~rr 264 (287) T COG4235 221 ALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADDPRR 264 (287) T ss_pred HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHH T ss_conf 43973288999999999981459999999999985189998237 No 91 >KOG4648 consensus Probab=99.11 E-value=1.1e-10 Score=83.10 Aligned_cols=201 Identities=7% Similarity=-0.076 Sum_probs=153.3 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC Q ss_conf 79999998420199999999999995757879999999999874100124466655663275210013555532100000 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~ 119 (271) ..+...+..+|+|++|+.+|.+.+..+|-|+..+.+.|.+|.+..++..|..-++.++.++.....++...+.+-..+|. T Consensus 101 KE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~KAYSRR~~AR~~Lg~ 180 (536) T KOG4648 101 KERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVKAYSRRMQARESLGN 180 (536) T ss_pred HHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87633676425413566665301245788864200588999988877754302889876328899998777789998775 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHH-----------------------------------HHHHHHHHHHHHHHHC Q ss_conf 11258876422110000233566999999-----------------------------------9742111110000000 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAKGSVL-----------------------------------AQMGKHSEALIEYERA 164 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~lg~~~-----------------------------------~~~~~~~~A~~~~~~~ 164 (271) ..+|.+.++.++.+.|+..+..-.++.+- ...|.++.++.++... T Consensus 181 ~~EAKkD~E~vL~LEP~~~ELkK~~a~i~Sl~E~~I~~KsT~G~~~A~Q~~~Q~l~~K~~G~~Fsk~~~~~~~i~~~~~~ 260 (536) T KOG4648 181 NMEAKKDCETVLALEPKNIELKKSLARINSLRERKIATKSTPGFTPARQGMIQILPIKKPGYKFSKKAMRSVPVVDVVSP 260 (536) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHCCHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCCHHHHHHCCCCCEEEEECC T ss_conf 78777778899860856178999999861636544665257885667760555304257763132321214432675146 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHH Q ss_conf 00000000124577765433223554359999999538985478889999999869989999999986291728665 Q gi|254780592|r 165 LELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRN 241 (271) Q Consensus 165 ~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~ 241 (271) +.....+.....+ +..+..-.+++.++....+++-.+|........-+.+..-.|...|+...++.++...|.... T Consensus 261 ~A~~~~~~~L~~~-~~~~~KI~~~~~~~~~~~~~~~~~~s~~~~~s~~~~A~T~~~~~~E~K~~~~T~~~~~P~~~~ 336 (536) T KOG4648 261 RATIDDSNQLRIS-DEDIDKIFNSNCGIIEEVKKTNPKPTPMPDTSGPPKAETIAKTSKEVKPTKQTAVKVAPAVET 336 (536) T ss_pred CCCCCCCCCCCCC-HHHHHHHHHCCHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHCCCCHHHEEEECCCCCC T ss_conf 4446841237664-778998862122478888750789986822369950678875544227341213551456556 No 92 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=99.11 E-value=2.4e-09 Score=74.91 Aligned_cols=178 Identities=13% Similarity=-0.009 Sum_probs=110.1 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 99999999999957578799999999998741001244666556632752100135555321000001125887642211 Q gi|254780592|r 53 LEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQR 132 (271) Q Consensus 53 ~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~ 132 (271) .-|.-.|.+++..+|+-|+++.-+|.-+...|+++.|.+.|+.+++++|.+.-+..+.|..+.-.|++.-|.+.+.+--. T Consensus 82 ~LAR~DftQaLai~P~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya~lNRgi~~YY~gR~~LAq~d~~~fYQ 161 (297) T COG4785 82 ALARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRYKLAQDDLLAFYQ 161 (297) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHCCCEEEEECCCCHHHHHHHHHHHH T ss_conf 99860145666349675899989999998602205799986667104775107776044023444751555999999984 Q ss_pred HHCHHHHHHHHHHHHHHHHHHHHHHHHH-HHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC------ Q ss_conf 0000233566999999974211111000-000000000000124577765433223554359999999538985------ Q gi|254780592|r 133 PDIPDWQLISAKGSVLAQMGKHSEALIE-YERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD------ 205 (271) Q Consensus 133 ~~p~~~~~~~~lg~~~~~~~~~~~A~~~-~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~------ 205 (271) .||+++.--.++-.. ...-++.+|... .+++...+...+... +. -...|+..+ ...++++.+...++ T Consensus 162 ~D~~DPfR~LWLYl~-E~k~dP~~A~tnL~qR~~~~d~e~WG~~--iV--~~yLgkiS~-e~l~~~~~a~a~~n~~~Ae~ 235 (297) T COG4785 162 DDPNDPFRSLWLYLN-EQKLDPKQAKTNLKQRAEKSDKEQWGWN--IV--EFYLGKISE-ETLMERLKADATDNTSLAEH 235 (297) T ss_pred CCCCCHHHHHHHHHH-HHCCCHHHHHHHHHHHHHCCCHHHHHHH--HH--HHHHHHCCH-HHHHHHHHHHHHHHHHHHHH T ss_conf 498980789999987-6107868999999999870567655187--99--999704039-99999998631212899999 Q ss_pred -HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf -4788899999998699899999999862917 Q gi|254780592|r 206 -SRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 206 -~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) ++.++.||.-+...|..++|..+++-+++.+ T Consensus 236 LTEtyFYL~K~~l~~G~~~~A~~LfKLaiann 267 (297) T COG4785 236 LTETYFYLGKYYLSLGDLDEATALFKLAVANN 267 (297) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 99999999999861311788999999999887 No 93 >KOG1915 consensus Probab=99.08 E-value=2.3e-08 Score=68.96 Aligned_cols=207 Identities=10% Similarity=-0.031 Sum_probs=141.5 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH--H-------HHHHHHH-HHHHCCHHHHHHHH Q ss_conf 999999957578799999999998741001244666556632752100--1-------3555532-10000011258876 Q gi|254780592|r 58 TIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE--V-------LAAYGKS-LANAGYLDEGLDAI 127 (271) Q Consensus 58 ~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~--~-------~~~~~~~-~~~~g~~~~A~~~~ 127 (271) .|.+.+..||-|.++|+.+-......|+.++-.+.|++++...|...+ . |.+++.. .....+.+.+.+.| T Consensus 310 qYE~~v~~np~nYDsWfdylrL~e~~g~~~~Ire~yErAIanvpp~~ekr~W~RYIYLWinYalyeEle~ed~ertr~vy 389 (677) T KOG1915 310 QYEKEVSKNPYNYDSWFDYLRLEESVGDKDRIRETYERAIANVPPASEKRYWRRYIYLWINYALYEELEAEDVERTRQVY 389 (677) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 79999872987706899999999722778899999999870599446778899999999999999999831278899999 Q ss_pred HHHHHHHCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 422110000----2335669999999742111110000000000000001245777654332235543599999995389 Q gi|254780592|r 128 NRAQRPDIP----DWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG 203 (271) Q Consensus 128 ~~a~~~~p~----~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p 203 (271) +..+.+.|. .+..|...+....++.+...|...+..++..-|.+... ..+..+-++.++++.-...|++-++.+| T Consensus 390 q~~l~lIPHkkFtFaKiWlmyA~feIRq~~l~~ARkiLG~AIG~cPK~KlF-k~YIelElqL~efDRcRkLYEkfle~~P 468 (677) T KOG1915 390 QACLDLIPHKKFTFAKIWLMYAQFEIRQLNLTGARKILGNAIGKCPKDKLF-KGYIELELQLREFDRCRKLYEKFLEFSP 468 (677) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 999863674311299999999999998703278999988874349716677-7787899987657899999999981692 Q ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 85478889999999869989999999986291728665999999998321299999999711 Q gi|254780592|r 204 ADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRDPWKKIAKARSNH 265 (271) Q Consensus 204 ~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~~~~a~a~~~~ 265 (271) .+..+|...|..-..+|+++.|..+|+-+++....+---.++++.+--.-.-.+..++|+.| T Consensus 469 e~c~~W~kyaElE~~LgdtdRaRaifelAi~qp~ldmpellwkaYIdFEi~~~E~ekaR~LY 530 (677) T KOG1915 469 ENCYAWSKYAELETSLGDTDRARAIFELAISQPALDMPELLWKAYIDFEIEEGEFEKARALY 530 (677) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 76589998999998740078888999998338532037899987600454002578899999 No 94 >KOG1070 consensus Probab=99.06 E-value=6e-08 Score=66.39 Aligned_cols=190 Identities=13% Similarity=0.019 Sum_probs=136.8 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC-CCCC----CHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 99999999999957578799999999998741001244666556632-7521----001355553210000011258876 Q gi|254780592|r 53 LEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAIL-YPED----QEVLAAYGKSLANAGYLDEGLDAI 127 (271) Q Consensus 53 ~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~-~p~~----~~~~~~~~~~~~~~g~~~~A~~~~ 127 (271) .+..+.|.+.+..+|+..-.|++|-.-..+.++.++|.+++++++.. ++.. ..+|..+-.....-|.-+...+.| T Consensus 1441 pesaeDferlvrssPNSSi~WI~YMaf~LelsEiekAR~iaerAL~tIN~REeeEKLNiWiA~lNlEn~yG~eesl~kVF 1520 (1710) T KOG1070 1441 PESAEDFERLVRSSPNSSILWIRYMAFHLELSEIEKARKIAERALKTINFREEEEKLNIWIAYLNLENAYGTEESLKKVF 1520 (1710) T ss_pred CCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 76778899998609984028999999885024567789999998611786416777778999986787527689999999 Q ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--C Q ss_conf 42211000023356699999997421111100000000000000012457776543322355435999999953898--5 Q gi|254780592|r 128 NRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA--D 205 (271) Q Consensus 128 ~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~--~ 205 (271) +++.+.... ...+..+..+|.....+++|.+.++.+++...+....|..++..++.+++-+.|...+++|++.-|. + T Consensus 1521 eRAcqycd~-~~V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~eH 1599 (1710) T KOG1070 1521 ERACQYCDA-YTVHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLPKQEH 1599 (1710) T ss_pred HHHHHHCCH-HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHH T ss_conf 999874336-8899998879987623104899999999986532568999999985144788999999999864650236 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 47888999999986998999999998629172866599 Q gi|254780592|r 206 SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 206 ~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~ 243 (271) .++...-|++....|+.+.+..+++..|...|..-+++ T Consensus 1600 v~~IskfAqLEFk~GDaeRGRtlfEgll~ayPKRtDlW 1637 (1710) T KOG1070 1600 VEFISKFAQLEFKYGDAERGRTLFEGLLSAYPKRTDLW 1637 (1710) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH T ss_conf 88999999987651884356789887776385430279 No 95 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=99.06 E-value=6.1e-10 Score=78.58 Aligned_cols=120 Identities=13% Similarity=-0.004 Sum_probs=93.1 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC---CCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHH Q ss_conf 99999999998741001244666556632752---100135555321000001125887642211000023356699999 Q gi|254780592|r 71 MIGIVYADVLRRVGRTAQALAVMRQVAILYPE---DQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSV 147 (271) Q Consensus 71 ~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~---~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~ 147 (271) -++++-|..-...|++.+|++.+.+++++.++ ...++.++|.++...|++++|++.|.+++..+|+.+++++++|.+ T Consensus 32 f~yyr~GmsaQs~GeyaEALe~Y~eAL~Le~D~~Drs~iLYNIGlIy~~nGe~~kALeyY~qALeiNP~~pqA~NNlGvI 111 (165) T CHL00033 32 FTYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMAVI 111 (165) T ss_pred HHHHHCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 99998151010135389999999986234568512668998999999985888899999999998599968999789999 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-HHHHH Q ss_conf 9974211111000000000000000124577765433223554359999999538985-47888 Q gi|254780592|r 148 LAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD-SRIRQ 210 (271) Q Consensus 148 ~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~-~~~~~ 210 (271) |..+|......... -.....+++|..++.+|+.+.|++ .++.. T Consensus 112 yh~qG~~a~~~~~~--------------------~~~~~~fd~aa~~w~~ai~lap~ny~eaqn 155 (165) T CHL00033 112 CHYRGEQAIEQGDS--------------------EIAEAWFDQAAEYWKQAIALAPGNYIEAQN 155 (165) T ss_pred HHHHHHHHHHCCCH--------------------HHHHHHHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 99987999984586--------------------789999999999999987038997099999 No 96 >KOG1070 consensus Probab=99.03 E-value=4.9e-08 Score=66.93 Aligned_cols=204 Identities=11% Similarity=0.054 Sum_probs=172.0 Q ss_pred CCCCCCCHHHHHHH-HHHHCCHHHHHHHHHHHHHH-CCCCH----HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH Q ss_conf 68543311799999-98420199999999999995-75787----99999999998741001244666556632752100 Q gi|254780592|r 32 LNPRISIPDPESLE-NMNHEQLLEVTSTIGLQYQS-HTKNK----MIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE 105 (271) Q Consensus 32 ~~~~~~~~~~~~~~-~~~~~~~~~A~~~~~~al~~-~P~~~----~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~ 105 (271) ++|..+..+.+-++ +++.++.++|.....+|+.. |+... ..|+.+-.....-|.-+.-..+|+++.+.. +.-. T Consensus 1453 ssPNSSi~WI~YMaf~LelsEiekAR~iaerAL~tIN~REeeEKLNiWiA~lNlEn~yG~eesl~kVFeRAcqyc-d~~~ 1531 (1710) T KOG1070 1453 SSPNSSILWIRYMAFHLELSEIEKARKIAERALKTINFREEEEKLNIWIAYLNLENAYGTEESLKKVFERACQYC-DAYT 1531 (1710) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHC-CHHH T ss_conf 099840289999998850245677899999986117864167777789999867875276899999999998743-3688 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH--HHHHHHHHHHHHH Q ss_conf 13555532100000112588764221100002335669999999742111110000000000000--0012457776543 Q gi|254780592|r 106 VLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN--ESSIVSNIAMSYL 183 (271) Q Consensus 106 ~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~--~~~~~~~~~~~~~ 183 (271) ++..+..+|...+.+++|.+.++..++.-......|..++..+.++++.+.|...+.+++..-|. ........+++-+ T Consensus 1532 V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~eHv~~IskfAqLEF 1611 (1710) T KOG1070 1532 VHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLPKQEHVEFISKFAQLEF 1611 (1710) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 99998879987623104899999999986532568999999985144788999999999864650236889999999876 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 32235543599999995389854788899999998699899999999862917 Q gi|254780592|r 184 LMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 184 ~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) +.||.+.+...|+..+...|...+.|..+..+-...|..+....+|++++... T Consensus 1612 k~GDaeRGRtlfEgll~ayPKRtDlW~VYid~eik~~~~~~vR~lfeRvi~l~ 1664 (1710) T KOG1070 1612 KYGDAERGRTLFEGLLSAYPKRTDLWSVYIDMEIKHGDIKYVRDLFERVIELK 1664 (1710) T ss_pred HCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC T ss_conf 51884356789887776385430279999998870477799999999998647 No 97 >KOG2376 consensus Probab=99.02 E-value=3.9e-08 Score=67.49 Aligned_cols=56 Identities=20% Similarity=0.163 Sum_probs=27.7 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 79999998420199999999999995757879999999999874100124466655663 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAI 98 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~ 98 (271) ..++.-.++.+..++|...++ -.++.+..++...|.++++.|++++|+.+|+++++ T Consensus 83 fEKAYc~Yrlnk~Dealk~~~---~~~~~~~~ll~L~AQvlYrl~~ydealdiY~~L~k 138 (652) T KOG2376 83 FEKAYCEYRLNKLDEALKTLK---GLDRLDDKLLELRAQVLYRLERYDEALDIYQHLAK 138 (652) T ss_pred HHHHHHHHHCCCHHHHHHHHH---CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 788889987056788999875---03556618899999999987538999999999985 No 98 >KOG3785 consensus Probab=99.01 E-value=4e-08 Score=67.43 Aligned_cols=132 Identities=9% Similarity=-0.027 Sum_probs=53.7 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH------------------- Q ss_conf 9984201999999999999957578799999999998741001244666556632752100------------------- Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE------------------- 105 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~------------------- 105 (271) -+...|+|++|...|.-+.+++.-+.+.+.++|.+..-.|++.+|.+...++-+ .|-..+ T Consensus 66 C~fhLgdY~~Al~~Y~~~~~~~~~~~el~vnLAcc~FyLg~Y~eA~~~~~ka~k-~pL~~RLlfhlahklndEk~~~~fh 144 (557) T KOG3785 66 CYFHLGDYEEALNVYTFLMNKDDAPAELGVNLACCKFYLGQYIEAKSIAEKAPK-TPLCIRLLFHLAHKLNDEKRILTFH 144 (557) T ss_pred HHHHHCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 887521399999999988536799701025289999999989999999861898-7599999999998717688999999 Q ss_pred --------HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH Q ss_conf --------135555321000001125887642211000023356699999997421111100000000000000012457 Q gi|254780592|r 106 --------VLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSN 177 (271) Q Consensus 106 --------~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~ 177 (271) -...++-+....-.|++|...|.+.+..+|.....-..++.||.+...++-+.+.+...+...|++.-+.+. T Consensus 145 ~~LqD~~EdqLSLAsvhYmR~HYQeAIdvYkrvL~dn~ey~alNVy~ALCyyKlDYydvsqevl~vYL~q~pdStiA~NL 224 (557) T KOG3785 145 SSLQDTLEDQLSLASVHYMRMHYQEAIDVYKRVLQDNPEYIALNVYMALCYYKLDYYDVSQEVLKVYLRQFPDSTIAKNL 224 (557) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 88754678777677779999989999999999970486364567889999985561666899999999968995888889 No 99 >COG0457 NrfG FOG: TPR repeat [General function prediction only] Probab=99.00 E-value=1.2e-07 Score=64.45 Aligned_cols=198 Identities=22% Similarity=0.256 Sum_probs=152.8 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHH--HCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHH-HHHHH Q ss_conf 999999842019999999999999--575787999999999987410012446665566327521001355553-21000 Q gi|254780592|r 41 PESLENMNHEQLLEVTSTIGLQYQ--SHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGK-SLANA 117 (271) Q Consensus 41 ~~~~~~~~~~~~~~A~~~~~~al~--~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~-~~~~~ 117 (271) .........+++..+...+...+. ..+.....+...+..+...+++.+++..+.+++...+.+.......+. ++... T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (291) T COG0457 64 LLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEALELLEKALALDPDPDLAEALLALGALYEL 143 (291) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 98889998663799999899988775352359999999999999843999999999987336564269999999999986 Q ss_pred CCHHHHHHHHHHHHHHHC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH-HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 001125887642211000---02335669999999742111110000000000000-00124577765433223554359 Q gi|254780592|r 118 GYLDEGLDAINRAQRPDI---PDWQLISAKGSVLAQMGKHSEALIEYERALELSPN-ESSIVSNIAMSYLLMGDLKTAEE 193 (271) Q Consensus 118 g~~~~A~~~~~~a~~~~p---~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~-~~~~~~~~~~~~~~~g~~~~A~~ 193 (271) |+++.+...+.++....| .........+..+...++++.+...+...+...+. ....+...+..+...++++.|.. T Consensus 144 ~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 223 (291) T COG0457 144 GDYEEALELYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALE 223 (291) T ss_pred HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHH T ss_conf 40999999999998749444678999999998988703399999999999862801227999999999987525999999 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 999999538985478889999999869989999999986291728 Q gi|254780592|r 194 KLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEA 238 (271) Q Consensus 194 ~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~ 238 (271) .+..++...|.....+..++..+...|..+++...+.+.+...+. T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (291) T COG0457 224 YYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPD 268 (291) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHH T ss_conf 999998729101799999999988388699999999999987988 No 100 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.99 E-value=4e-08 Score=67.43 Aligned_cols=107 Identities=18% Similarity=0.136 Sum_probs=94.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHH Q ss_conf 6699999997421111100000000000000---0124577765433223554359999999538985---478889999 Q gi|254780592|r 141 ISAKGSVLAQMGKHSEALIEYERALELSPNE---SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD---SRIRQNLAL 214 (271) Q Consensus 141 ~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~---~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~---~~~~~~La~ 214 (271) .+..+.-+...|++..|...|...+...|++ +.+++++|.++..+|+++.|...|..+++-.|.+ ++.++.||. T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~ 223 (262) T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGV 223 (262) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 99999999983878899999999997099975560249998899986246167999999999838999988289999999 Q ss_pred HHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 999869989999999986291728665999999 Q gi|254780592|r 215 VVGLQGRMKEAYSIASQELSPEEATRNIKYIKS 247 (271) Q Consensus 215 ~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~ 247 (271) +...+|+.++|-..++++++..|..+.....+. T Consensus 224 ~~~~l~~~d~A~atl~qv~k~YP~t~aA~~Ak~ 256 (262) T COG1729 224 SLGRLGNTDEACATLQQVIKRYPGTDAAKLAKV 256 (262) T ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 999821779999999999987899779999999 No 101 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=98.97 E-value=1.6e-09 Score=76.04 Aligned_cols=109 Identities=22% Similarity=0.232 Sum_probs=47.5 Q ss_pred HHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH Q ss_conf 56632752100135555321000001125887642211000023356699999997421111100000000000000012 Q gi|254780592|r 95 QVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSI 174 (271) Q Consensus 95 ~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~ 174 (271) .+..++++.-+.....|......|++++|...|.-....+|.+...|..||.|+..++++++|+..|..+..+++++|.. T Consensus 9 ~~~~l~~~~L~~~Y~~AY~~y~~G~Y~~A~~~F~~L~~~~~~~~~Y~~gLa~c~q~~~~Y~~A~~~Y~~a~~l~~~~P~p 88 (137) T TIGR02552 9 SLLGLDSEQLEAIYALAYNLYQQGRYDEALKLFQLLAAYDPWNADYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRP 88 (137) T ss_pred HHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCC T ss_conf 77289998999999999999853667999999999999743036899999999998616789999999997527686951 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 45777654332235543599999995389 Q gi|254780592|r 175 VSNIAMSYLLMGDLKTAEEKLRFASQMIG 203 (271) Q Consensus 175 ~~~~~~~~~~~g~~~~A~~~~~~al~~~p 203 (271) +...|.+++..|+.+.|...|+-++++.+ T Consensus 89 ~~~~a~C~l~~g~~~~A~~aL~~a~~~~~ 117 (137) T TIGR02552 89 YYHAAECYLALGEPESALKALDLAIEIAG 117 (137) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHC T ss_conf 26998999872897899999999999848 No 102 >KOG1915 consensus Probab=98.97 E-value=2.8e-07 Score=62.23 Aligned_cols=192 Identities=15% Similarity=0.069 Sum_probs=131.4 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 84201999999999999957578799999999998741001244666556632752100135555321000001125887 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDA 126 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~ 126 (271) .+++++..|.+.+.+|++.+..|...|+.++++-++......|..++++++.+-|.-...|.-+......+|+...|.+. T Consensus 84 esq~e~~RARSv~ERALdvd~r~itLWlkYae~Emknk~vNhARNv~dRAvt~lPRVdqlWyKY~ymEE~LgNi~gaRqi 163 (677) T KOG1915 84 ESQKEIQRARSVFERALDVDYRNITLWLKYAEFEMKNKQVNHARNVWDRAVTILPRVDQLWYKYIYMEEMLGNIAGARQI 163 (677) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 76789999999999987034223359999999998640576788789888885330899999999999996541879999 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH Q ss_conf 64221100002335669999999742111110000000000000001245777654332235543599999995389854 Q gi|254780592|r 127 INRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS 206 (271) Q Consensus 127 ~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~ 206 (271) |.+.+...|.. .+|......-.+.+..+.|...|++-+-..|+ ...|...+..-..-|+...+...|+.|++.-.++- T Consensus 164 ferW~~w~P~e-qaW~sfI~fElRykeieraR~IYerfV~~HP~-v~~wikyarFE~k~g~~~~aR~VyerAie~~~~d~ 241 (677) T KOG1915 164 FERWMEWEPDE-QAWLSFIKFELRYKEIERARSIYERFVLVHPK-VSNWIKYARFEEKHGNVALARSVYERAIEFLGDDE 241 (677) T ss_pred HHHHHCCCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHEECCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHH T ss_conf 99997079727-89999999999865787788999888503602-89899999998752718889999999999861378 Q ss_pred ---HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf ---7888999999986998999999998629172866 Q gi|254780592|r 207 ---RIRQNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 207 ---~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) ......|..-..+..++.|..+|+-+|+.-|.+. T Consensus 242 ~~e~lfvaFA~fEe~qkE~ERar~iykyAld~~pk~r 278 (677) T KOG1915 242 EAEILFVAFAEFEERQKEYERARFIYKYALDHIPKGR 278 (677) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 9999999999999988899999999999987468650 No 103 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=98.96 E-value=8.7e-09 Score=71.52 Aligned_cols=104 Identities=12% Similarity=0.132 Sum_probs=73.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHH Q ss_conf 356699999997421111100000000000000---0124577765433223554359999999538985---4788899 Q gi|254780592|r 139 QLISAKGSVLAQMGKHSEALIEYERALELSPNE---SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD---SRIRQNL 212 (271) Q Consensus 139 ~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~---~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~---~~~~~~L 212 (271) +.....+....+.|+++.|+..|..-++..|++ +.+++++|.++..+|+++.|...|..+++..|.+ +++++.| T Consensus 3 ~~~y~aa~~~lk~gdY~~A~~~F~~F~~~YP~S~y~pnA~YWLGe~~y~~~~~~~A~~~f~~v~~~yp~s~KaPdALLKl 82 (119) T TIGR02795 3 EAYYDAALLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKL 82 (119) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 67899999998538878999999999886879876710547999999997339999999999986458998606899999 Q ss_pred HHHHHHCCCHHHHHHHHHHHCCCCCCCHHH Q ss_conf 999998699899999999862917286659 Q gi|254780592|r 213 ALVVGLQGRMKEAYSIASQELSPEEATRNI 242 (271) Q Consensus 213 a~~~~~~g~~~eA~~~~~~~L~~~~~~~~~ 242 (271) |.++..+|+.++|..+|++++...|..... T Consensus 83 g~~~~~~g~~~~A~~~l~~V~~~YP~s~aA 112 (119) T TIGR02795 83 GMSLQELGDKEKAKATLQQVIKRYPGSSAA 112 (119) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHH T ss_conf 999987289889999999998566998788 No 104 >cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure. Probab=98.94 E-value=1.4e-07 Score=64.12 Aligned_cols=165 Identities=17% Similarity=0.078 Sum_probs=113.7 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHH---HHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 1799999984201999999999999957578799999---9999987410012446665566327521001355553210 Q gi|254780592|r 39 PDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIV---YADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLA 115 (271) Q Consensus 39 ~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~---~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~ 115 (271) .-.++....-.|++.+|...+.+.++.+|.|.-++.. .--.-...|..+....++......+|.+.-.+-.++..+. T Consensus 46 ~hv~a~~~~~~g~~~~A~~~~~~~l~~~P~D~lalk~~~~~f~lg~~~g~~~~~~~~l~~~~p~~~~~~~~~gm~AF~le 125 (355) T cd05804 46 AHVEALSAWIAGDLPKALALLEQLLDDYPRDLLALKLHLGAFGLGDFSGMRDHVARVLPLWAPENPDYWYLLGMLAFGLE 125 (355) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHH T ss_conf 99999999987997899999999998698359999987674102541324056999987257348983699999999999 Q ss_pred HHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH----HHHHHHHHHHHHHHHHHHHHH Q ss_conf 0000112588764221100002335669999999742111110000000000000----001245777654332235543 Q gi|254780592|r 116 NAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN----ESSIVSNIAMSYLLMGDLKTA 191 (271) Q Consensus 116 ~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~----~~~~~~~~~~~~~~~g~~~~A 191 (271) +.|++++|.+.-++++..+|.+.=+...+++++..+|+.++++...+.....-.. ..-.++-.+..++..|+++++ T Consensus 126 E~g~~~~Ae~~a~~al~~np~d~wa~HAvAHv~~~~g~~~eG~~~l~~~~~~w~~~~~l~~H~wWH~AL~~Le~g~~~~~ 205 (355) T cd05804 126 EAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLERGDYEAA 205 (355) T ss_pred HCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHH T ss_conf 85889999999999996399993799999999998388788999998517525778858989999999998877898899 Q ss_pred HHHHHHHHHCCC Q ss_conf 599999995389 Q gi|254780592|r 192 EEKLRFASQMIG 203 (271) Q Consensus 192 ~~~~~~al~~~p 203 (271) ...|..-+...+ T Consensus 206 l~iyD~~i~~~~ 217 (355) T cd05804 206 LAIYDTHIAPSA 217 (355) T ss_pred HHHHHHHHCCCC T ss_conf 999888744566 No 105 >PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional Probab=98.93 E-value=8.4e-08 Score=65.48 Aligned_cols=190 Identities=16% Similarity=0.094 Sum_probs=109.3 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 99842019999999999999575787999999999987410012446665566327521001355553210000011258 Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) .....||+..|...+.+.-... -+....+.-+......|+...+..+++.+.+..-++....-.+...+...|+.+.|+ T Consensus 518 ~~~~~Gd~aaa~~aw~~l~~~~-l~~~~RLtaa~~alaAGd~~aa~r~~~~a~~~~a~~~A~~a~i~~~~~~~g~~~~Al 596 (987) T PRK09782 518 QAYQVEDYATALAAWQKISLHD-MSNEDLLAAANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLHAQRYIPGQPELAL 596 (987) T ss_pred HHHHHHHHHHHHHHHHHCCHHH-CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHH T ss_conf 9997103789999998489543-465888888887775489789999999987526732778999998763038738788 Q ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 87642211000023356699999997421111100000000000000012457776543322355435999999953898 Q gi|254780592|r 125 DAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA 204 (271) Q Consensus 125 ~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~ 204 (271) ...++++...|. ...+...+..+.+.|+...++..+..++...|+.+.....+|..+...|+..++...++++-.-.|. T Consensus 597 ~~~r~al~~ap~-a~~~~A~~~~~~Q~~d~~qa~~~LAeAv~lepd~~~y~A~lGy~la~~~D~Aq~~~~lerAh~~~p~ 675 (987) T PRK09782 597 NDLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKDLPD 675 (987) T ss_pred HHHHHHHHCCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHCCCCC T ss_conf 889999860800-4668999999997368078999999998179998541156777861399899999999975215775 Q ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 54788899999998699899999999862917 Q gi|254780592|r 205 DSRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 205 ~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) +..+...|+.++.+.|+.+.|...++++|+.. T Consensus 676 d~~~~~~lAy~~~rl~D~a~a~~~~r~vID~~ 707 (987) T PRK09782 676 DPALIRQLAYVNQRLDDMAATQHYARLVIDDI 707 (987) T ss_pred CCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 52324788899876134699999999998654 No 106 >KOG1130 consensus Probab=98.91 E-value=1.1e-08 Score=70.94 Aligned_cols=188 Identities=19% Similarity=0.158 Sum_probs=116.0 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHH----HHHHHHHHHHHHHCCHHHHHHHHHHHHC------CCCCCHHHHHHHHHHH Q ss_conf 98420199999999999995757879----9999999998741001244666556632------7521001355553210 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKM----IGIVYADVLRRVGRTAQALAVMRQVAIL------YPEDQEVLAAYGKSLA 115 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~----~~~~~a~~l~~~g~~~~A~~~~~~al~~------~p~~~~~~~~~~~~~~ 115 (271) .-..|+.+.-+..|+.+++...++.. ++..+|.+|...++|++|+++-..-+.+ .-....+--++|.++- T Consensus 27 Lck~gdcraGv~ff~aA~qvGTeDl~tLSAIYsQLGNAyfyL~DY~kAl~yH~hDltlar~lgdklGEAKssgNLGNtlK 106 (639) T KOG1130 27 LCKMGDCRAGVDFFKAALQVGTEDLSTLSAIYSQLGNAYFYLKDYEKALKYHTHDLTLARLLGDKLGEAKSSGNLGNTLK 106 (639) T ss_pred HHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCCCCCHHH T ss_conf 87514624568999999983612788999999974114446763999976422357799874442012200455201456 Q ss_pred HHCCHHHHHHHHHHHHHHH------CHHHHHHHHHHHHHHHHHHHHH--------------------HHHHHHHCCCCCC Q ss_conf 0000112588764221100------0023356699999997421111--------------------1000000000000 Q gi|254780592|r 116 NAGYLDEGLDAINRAQRPD------IPDWQLISAKGSVLAQMGKHSE--------------------ALIEYERALELSP 169 (271) Q Consensus 116 ~~g~~~~A~~~~~~a~~~~------p~~~~~~~~lg~~~~~~~~~~~--------------------A~~~~~~~~~~~p 169 (271) -+|.|++|.....+.+... -....+++++|++|...|+... |.+.|..-+++.. T Consensus 107 v~G~fdeA~~cc~rhLd~areLgDrv~e~RAlYNlgnvYhakGk~~g~~~pee~g~f~~ev~~al~~Av~fy~eNL~l~~ 186 (639) T KOG1130 107 VKGAFDEALTCCFRHLDFARELGDRVLESRALYNLGNVYHAKGKCTGLEAPEEKGAFNAEVTSALENAVKFYMENLELSE 186 (639) T ss_pred HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 30232068899998867999974777665777411111221264158888143065547889999999999999899998 Q ss_pred H------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC------CHHHHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 0------0012457776543322355435999999953898------54788899999998699899999999862 Q gi|254780592|r 170 N------ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA------DSRIRQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 170 ~------~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~------~~~~~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) . .-.++-++|..|...|+|++|+.+-+.-+.+... ..+++.|||.++.-+|+++.|.+.|+..+ T Consensus 187 ~lgDr~aqGRa~GnLGNTyYlLGdf~~ai~~H~~RL~ia~efGDrAaeRRA~sNlgN~hiflg~fe~A~ehYK~tl 262 (639) T KOG1130 187 KLGDRLAQGRAYGNLGNTYYLLGDFDQAIHFHKLRLEIAQEFGDRAAERRAHSNLGNCHIFLGNFELAIEHYKLTL 262 (639) T ss_pred HHHHHHHHCCHHCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHH T ss_conf 7446776331102457635650457999999999999999863188887764043103555304276999999999 No 107 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=98.90 E-value=2.9e-09 Score=74.47 Aligned_cols=93 Identities=14% Similarity=0.043 Sum_probs=46.7 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 99998420199999999999995757879999999999874100124466655663275210013555532100000112 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) +....++|+|.+|...|.-..-.||.+.+.++.+|.+....|++++|+..|..+..++|++|......+.|+..+|+... T Consensus 25 AY~~y~~G~Y~~A~~~F~~L~~~~~~~~~Y~~gLa~c~q~~~~Y~~A~~~Y~~a~~l~~~~P~p~~~~a~C~l~~g~~~~ 104 (137) T TIGR02552 25 AYNLYQQGRYDEALKLFQLLAAYDPWNADYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYYHAAECYLALGEPES 104 (137) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHH T ss_conf 99998536679999999999997430368999999999986167899999999975276869512699899987289789 Q ss_pred HHHHHHHHHHHHC Q ss_conf 5887642211000 Q gi|254780592|r 123 GLDAINRAQRPDI 135 (271) Q Consensus 123 A~~~~~~a~~~~p 135 (271) |...++.++.... T Consensus 105 A~~aL~~a~~~~~ 117 (137) T TIGR02552 105 ALKALDLAIEIAG 117 (137) T ss_pred HHHHHHHHHHHHC T ss_conf 9999999999848 No 108 >PRK10803 hypothetical protein; Provisional Probab=98.90 E-value=1.9e-08 Score=69.41 Aligned_cols=97 Identities=11% Similarity=0.081 Sum_probs=85.0 Q ss_pred HHHHCCHHHHHHHHHHHHHHHCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH---HHHHHHHHHHHHHHHH Q ss_conf 10000011258876422110000---23356699999997421111100000000000000---0124577765433223 Q gi|254780592|r 114 LANAGYLDEGLDAINRAQRPDIP---DWQLISAKGSVLAQMGKHSEALIEYERALELSPNE---SSIVSNIAMSYLLMGD 187 (271) Q Consensus 114 ~~~~g~~~~A~~~~~~a~~~~p~---~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~---~~~~~~~~~~~~~~g~ 187 (271) ....++|++|...|..-+...|+ .+.+++++|.+|...|++++|...|...+...|++ ++++..+|.++...|+ T Consensus 152 vl~~k~y~~Ai~aF~~Fi~~yP~s~y~~nA~YWLGe~~y~~~~~~~A~~~F~~vv~~yP~s~K~pdAllklg~~~~~lg~ 231 (262) T PRK10803 152 VQDKSRQDDAIVAFQNFIKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQDKGD 231 (262) T ss_pred HHHHCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCC T ss_conf 99735699999999999987879987743099999999974378999999999998677998678999999999998599 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 55435999999953898547888 Q gi|254780592|r 188 LKTAEEKLRFASQMIGADSRIRQ 210 (271) Q Consensus 188 ~~~A~~~~~~al~~~p~~~~~~~ 210 (271) .++|..+|++.++..|++..+.. T Consensus 232 ~~~Ak~~l~~vi~~YP~S~aA~l 254 (262) T PRK10803 232 TAKAKAVYQQVIKKYPGTDGAKQ 254 (262) T ss_pred HHHHHHHHHHHHHHCCCCHHHHH T ss_conf 99999999999998859989999 No 109 >cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure. Probab=98.90 E-value=2e-07 Score=63.19 Aligned_cols=168 Identities=21% Similarity=0.188 Sum_probs=112.5 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH----HCCHHHHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 879999999999874100124466655663275210013555532100----0001125887642211000023356699 Q gi|254780592|r 69 NKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN----AGYLDEGLDAINRAQRPDIPDWQLISAK 144 (271) Q Consensus 69 ~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~----~g~~~~A~~~~~~a~~~~p~~~~~~~~l 144 (271) ..+-...-+..+...|++.+|...+++.+..+|.+.-++.. -..+.- .|..+.....+......+|.++-.+..+ T Consensus 42 ~RE~~hv~a~~~~~~g~~~~A~~~~~~~l~~~P~D~lalk~-~~~~f~lg~~~g~~~~~~~~l~~~~p~~~~~~~~~gm~ 120 (355) T cd05804 42 ERERAHVEALSAWIAGDLPKALALLEQLLDDYPRDLLALKL-HLGAFGLGDFSGMRDHVARVLPLWAPENPDYWYLLGML 120 (355) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHH-HHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHH T ss_conf 89999999999998799789999999999869835999998-76741025413240569999872573489836999999 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC----HHHHHHHHHHHHHCC Q ss_conf 9999974211111000000000000000124577765433223554359999999538985----478889999999869 Q gi|254780592|r 145 GSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD----SRIRQNLALVVGLQG 220 (271) Q Consensus 145 g~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~----~~~~~~La~~~~~~g 220 (271) +..+...|++++|.+.-+++++++|++.=+...++-.+..+|+.++++.+++.....-..+ .-.|..+|+.+...| T Consensus 121 AF~leE~g~~~~Ae~~a~~al~~np~d~wa~HAvAHv~~~~g~~~eG~~~l~~~~~~w~~~~~l~~H~wWH~AL~~Le~g 200 (355) T cd05804 121 AFGLEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLERG 200 (355) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCC T ss_conf 99999858899999999999963999937999999999983887889999985175257788589899999999988778 Q ss_pred CHHHHHHHHHHHCCCCC Q ss_conf 98999999998629172 Q gi|254780592|r 221 RMKEAYSIASQELSPEE 237 (271) Q Consensus 221 ~~~eA~~~~~~~L~~~~ 237 (271) +++++..+|...+.+.. T Consensus 201 ~~~~~l~iyD~~i~~~~ 217 (355) T cd05804 201 DYEAALAIYDTHIAPSA 217 (355) T ss_pred CHHHHHHHHHHHHCCCC T ss_conf 98899999888744566 No 110 >KOG0543 consensus Probab=98.89 E-value=2.1e-08 Score=69.21 Aligned_cols=101 Identities=21% Similarity=0.194 Sum_probs=52.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCH---------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 9999999742111110000000000000---------------0012457776543322355435999999953898547 Q gi|254780592|r 143 AKGSVLAQMGKHSEALIEYERALELSPN---------------ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSR 207 (271) Q Consensus 143 ~lg~~~~~~~~~~~A~~~~~~~~~~~p~---------------~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~ 207 (271) ..|+.+.+.|++..|...|++++..-+. -...+.|++.++++.++|..|+..+.++++.+|.|.. T Consensus 213 e~Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~K 292 (397) T KOG0543 213 ERGNVLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKLKEYKEAIESCNKVLELDPNNVK 292 (397) T ss_pred HHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 75158875121999999999999875113469978999998888777647998887412188999999998845899626 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 888999999986998999999998629172866599 Q gi|254780592|r 208 IRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 208 ~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~ 243 (271) +.++-|.++..+|.++.|...+++++..+|.+..+. T Consensus 293 ALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~ 328 (397) T KOG0543 293 ALYRRGQALLALGEYDLARDDFQKALKLEPSNKAAR 328 (397) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 777878898864209899999999997488868999 No 111 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=98.88 E-value=1.4e-08 Score=70.29 Aligned_cols=103 Identities=13% Similarity=0.084 Sum_probs=88.5 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC---CHHHHHHHH Q ss_conf 1799999984201999999999999957578---7999999999987410012446665566327521---001355553 Q gi|254780592|r 39 PDPESLENMNHEQLLEVTSTIGLQYQSHTKN---KMIGIVYADVLRRVGRTAQALAVMRQVAILYPED---QEVLAAYGK 112 (271) Q Consensus 39 ~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~---~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~---~~~~~~~~~ 112 (271) ....+...+..|||+.|+..|+.-++..|++ |.+++.+|++++..|+++.|...|..+++.+|++ ++++..+|. T Consensus 5 ~y~aa~~~lk~gdY~~A~~~F~~F~~~YP~S~y~pnA~YWLGe~~y~~~~~~~A~~~f~~v~~~yp~s~KaPdALLKlg~ 84 (119) T TIGR02795 5 YYDAALLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGM 84 (119) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 89999999853887899999999988687987671054799999999733999999999998645899860689999999 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHCHHHHHH Q ss_conf 21000001125887642211000023356 Q gi|254780592|r 113 SLANAGYLDEGLDAINRAQRPDIPDWQLI 141 (271) Q Consensus 113 ~~~~~g~~~~A~~~~~~a~~~~p~~~~~~ 141 (271) ++..+|+.++|...|++.+..-|....+. T Consensus 85 ~~~~~g~~~~A~~~l~~V~~~YP~s~aA~ 113 (119) T TIGR02795 85 SLQELGDKEKAKATLQQVIKRYPGSSAAK 113 (119) T ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 99872898899999999985669987889 No 112 >COG0457 NrfG FOG: TPR repeat [General function prediction only] Probab=98.83 E-value=2.9e-06 Score=56.06 Aligned_cols=192 Identities=23% Similarity=0.234 Sum_probs=165.7 Q ss_pred HCCHHHHHHHHHHHHHHCCC--CHHHHHHHHHHHHHHHCCHHHHHHHHHHHH--CCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 20199999999999995757--879999999999874100124466655663--27521001355553210000011258 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTK--NKMIGIVYADVLRRVGRTAQALAVMRQVAI--LYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~--~~~~~~~~a~~l~~~g~~~~A~~~~~~al~--~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) .+.+..+...+...+...+. ........+..+...++...+...+..... ..+.....+...+......+++..+. T Consensus 36 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 115 (291) T COG0457 36 LGELAEALELLEEALELLPNSDLAGLLLLLALALLKLGRLEEALELLEKALELELLPNLAEALLNLGLLLEALGKYEEAL 115 (291) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 44699999999999876331111689999888999866379999989998877535235999999999999984399999 Q ss_pred HHHHHHHHHHCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8764221100002335669999-99974211111000000000000---0001245777654332235543599999995 Q gi|254780592|r 125 DAINRAQRPDIPDWQLISAKGS-VLAQMGKHSEALIEYERALELSP---NESSIVSNIAMSYLLMGDLKTAEEKLRFASQ 200 (271) Q Consensus 125 ~~~~~a~~~~p~~~~~~~~lg~-~~~~~~~~~~A~~~~~~~~~~~p---~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~ 200 (271) ..+.......+.........+. ++...|+++.+...+.+++...| .........+..+...++++.++..+.+++. T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~ 195 (291) T COG0457 116 ELLEKALALDPDPDLAEALLALGALYELGDYEEALELYEKALELDPELNELAEALLALGALLEALGRYEEALELLEKALK 195 (291) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999873365642699999999999864099999999999874944467899999999898870339999999999986 Q ss_pred CCCC-CHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf 3898-547888999999986998999999998629172866 Q gi|254780592|r 201 MIGA-DSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 201 ~~p~-~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) ..+. ....+..++..+...|.+++|...+..++...+... T Consensus 196 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~ 236 (291) T COG0457 196 LNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNA 236 (291) T ss_pred HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH T ss_conf 28012279999999999875259999999999987291017 No 113 >KOG0543 consensus Probab=98.81 E-value=7.5e-08 Score=65.78 Aligned_cols=143 Identities=13% Similarity=0.141 Sum_probs=108.2 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 99999999874100124466655663275210013555532100000112588764221100002335669999999742 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG 152 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~ 152 (271) .-..|..|.+.|++..|...|++++..-+.....-. ++.. .........+.+++.|+.+++ T Consensus 211 ~ke~Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~------------ee~~-------~~~~~k~~~~lNlA~c~lKl~ 271 (397) T KOG0543 211 KKERGNVLFKEGKFKLAKKRYERAVSFLEYRRSFDE------------EEQK-------KAEALKLACHLNLAACYLKLK 271 (397) T ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCCCH------------HHHH-------HHHHHHHHHHHHHHHHHHHHH T ss_conf 877515887512199999999999987511346997------------8999-------998888777647998887412 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHH-HHHHHH Q ss_conf 1111100000000000000012457776543322355435999999953898547888999999986998999-999998 Q gi|254780592|r 153 KHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEA-YSIASQ 231 (271) Q Consensus 153 ~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA-~~~~~~ 231 (271) ++..|+..-.+.++..|++..+++..|.++...|+++.|...|+++++..|.|..+...|..+-....++.+. .+.|.+ T Consensus 272 ~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el~~l~~k~~~~~~kekk~y~~ 351 (397) T KOG0543 272 EYKEAIESCNKVLELDPNNVKALYRRGQALLALGEYDLARDDFQKALKLEPSNKAARAELIKLKQKIREYEEKEKKMYAN 351 (397) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 18899999999884589962677787889886420989999999999748886899999999999999999889999999 Q ss_pred HCC Q ss_conf 629 Q gi|254780592|r 232 ELS 234 (271) Q Consensus 232 ~L~ 234 (271) .+. T Consensus 352 mF~ 354 (397) T KOG0543 352 MFA 354 (397) T ss_pred HHH T ss_conf 850 No 114 >KOG3785 consensus Probab=98.78 E-value=3.3e-07 Score=61.85 Aligned_cols=187 Identities=10% Similarity=0.015 Sum_probs=98.0 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHH Q ss_conf 9999842019999999999999575787-999999999987410012446665566327521001355553210000011 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNK-MIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLD 121 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~-~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~ 121 (271) -.+-+...||..|+..++-....+.... ..-..+|.++...|++++|++.++-+.+.+.-+.+.+..++-+..-+|.|. T Consensus 29 Ledfls~rDytGAislLefk~~~~~EEE~~~~lWia~C~fhLgdY~~Al~~Y~~~~~~~~~~~el~vnLAcc~FyLg~Y~ 108 (557) T KOG3785 29 LEDFLSNRDYTGAISLLEFKLNLDREEEDSLQLWIAHCYFHLGDYEEALNVYTFLMNKDDAPAELGVNLACCKFYLGQYI 108 (557) T ss_pred HHHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 88887525514589999886044414467899999998875213999999999885367997010252899999999899 Q ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 25887642211000023356699999997421111100000000000000012457776543322355435999999953 Q gi|254780592|r 122 EGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQM 201 (271) Q Consensus 122 ~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~ 201 (271) +|.....++.+ .|-..+.++.++ .+.++... +-.|...+. +..+....++....-...|.+|+..|++.+.- T Consensus 109 eA~~~~~ka~k-~pL~~RLlfhla---hklndEk~-~~~fh~~Lq---D~~EdqLSLAsvhYmR~HYQeAIdvYkrvL~d 180 (557) T KOG3785 109 EAKSIAEKAPK-TPLCIRLLFHLA---HKLNDEKR-ILTFHSSLQ---DTLEDQLSLASVHYMRMHYQEAIDVYKRVLQD 180 (557) T ss_pred HHHHHHHHCCC-CHHHHHHHHHHH---HHHCCHHH-HHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999861898-759999999999---87176889-999998875---46787776777799999899999999999704 Q ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 898547888999999986998999999998629172 Q gi|254780592|r 202 IGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEE 237 (271) Q Consensus 202 ~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~ 237 (271) +|.....-..+|++|..+.-++-+.+++.--|..-| T Consensus 181 n~ey~alNVy~ALCyyKlDYydvsqevl~vYL~q~p 216 (557) T KOG3785 181 NPEYIALNVYMALCYYKLDYYDVSQEVLKVYLRQFP 216 (557) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC T ss_conf 863645678899999855616668999999999689 No 115 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=98.75 E-value=1.3e-06 Score=58.21 Aligned_cols=147 Identities=14% Similarity=0.089 Sum_probs=91.9 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 9999999999995757879999999999874100124466655663-275210013555532100000112588764221 Q gi|254780592|r 53 LEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAI-LYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQ 131 (271) Q Consensus 53 ~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~-~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~ 131 (271) +.......+.++.-|.+ .-.+.+|..+.+.|++.+|...|++++. +.-+++..+..++......+++-.+...+++.. T Consensus 73 ~R~~Rea~~~~~~ApTv-qnr~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~ 151 (251) T COG4700 73 ERHLREATEELAIAPTV-QNRYRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLM 151 (251) T ss_pred HHHHHHHHHHHHHCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 58999999887646137-88999999999841056668899998546567988999888988886165888888999986 Q ss_pred HHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 10000233566999999974211111000000000000000124577765433223554359999999538985478889 Q gi|254780592|r 132 RPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQN 211 (271) Q Consensus 132 ~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~ 211 (271) +.+|.. ..++....++..+..+|.+.+|...|+.++...|+ ...+-. T Consensus 152 e~~pa~--------------------------------r~pd~~Ll~aR~laa~g~~a~Aesafe~a~~~ypg-~~ar~~ 198 (251) T COG4700 152 EYNPAF--------------------------------RSPDGHLLFARTLAAQGKYADAESAFEVAISYYPG-PQARIY 198 (251) T ss_pred HCCCCC--------------------------------CCCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC-HHHHHH T ss_conf 059765--------------------------------89972589999998458805589999999985887-778989 Q ss_pred HHHHHHHCCCHHHHHHHHHHHC Q ss_conf 9999998699899999999862 Q gi|254780592|r 212 LALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 212 La~~~~~~g~~~eA~~~~~~~L 233 (271) ++.++..+|+.+||...+.... T Consensus 199 Y~e~La~qgr~~ea~aq~~~v~ 220 (251) T COG4700 199 YAEMLAKQGRLREANAQYVAVV 220 (251) T ss_pred HHHHHHHHCCHHHHHHHHHHHH T ss_conf 9999997122167888799999 No 116 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=98.72 E-value=1.4e-06 Score=58.00 Aligned_cols=166 Identities=11% Similarity=-0.067 Sum_probs=116.2 Q ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHH---HHH Q ss_conf 7879999999999874100124466655663275210---0135555321000001125887642211000023---356 Q gi|254780592|r 68 KNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ---EVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDW---QLI 141 (271) Q Consensus 68 ~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~---~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~---~~~ 141 (271) ..+..++..|..+...|++.+|+..|+++...+|..+ .+...++.++...|++++|...+++-+...|.+. .++ T Consensus 31 ~~~~~lY~~a~~~~~~~~y~~A~~~fe~l~~~yp~s~~a~~A~l~~a~a~y~~~~y~~A~~~~~~Fi~~yP~~~~~~ya~ 110 (235) T TIGR03302 31 RPAEELYNEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNHPDADYAY 110 (235) T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 89999999999999878999999999999986899313799999999999976439999999999999887743499999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH------------- Q ss_conf 6999999974211111000000000000000124577765433223554359999999538985478------------- Q gi|254780592|r 142 SAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRI------------- 208 (271) Q Consensus 142 ~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~------------- 208 (271) ...|.++..+-. . ....++....|+..|+..+...|++.-+ T Consensus 111 y~~a~s~~~~~~----------------~----------~~rDq~~~~~A~~~f~~~i~~yP~S~ya~dA~~~i~~l~~~ 164 (235) T TIGR03302 111 YLRGLSNYKQID----------------D----------SDRDQTAAREAFEAFQELIRRYPNSEYAPDAKKRMDYLRNR 164 (235) T ss_pred HHHHHHHHHHCC----------------C----------CCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 998799887366----------------5----------44580899999999999998889965799999999999999 Q ss_pred ----HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH---HHHHHHHHHHHHHHHHHHH Q ss_conf ----88999999986998999999998629172866---5999999998321299999 Q gi|254780592|r 209 ----RQNLALVVGLQGRMKEAYSIASQELSPEEATR---NIKYIKSILSQRDPWKKIA 259 (271) Q Consensus 209 ----~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~---~~~~l~~~l~~~~~~~~~a 259 (271) -...|..|...|++..|...++..+..-|... ...+...-....-|..+.| T Consensus 165 LA~~e~~ia~~Y~k~~~y~aAi~r~~~vl~~Yp~t~~~~eAl~~l~eay~~lg~~~~A 222 (235) T TIGR03302 165 LAGKELYVARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLALGLTDLA 222 (235) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHH T ss_conf 9999999999999841709999999999986899851399999999999986998999 No 117 >KOG2047 consensus Probab=98.70 E-value=3.1e-06 Score=55.90 Aligned_cols=153 Identities=14% Similarity=0.152 Sum_probs=60.8 Q ss_pred HCCHHHHHHHHHHHHHH-CCCC-----HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC----CHHHHHHHHHHHHHC Q ss_conf 20199999999999995-7578-----7999999999987410012446665566327521----001355553210000 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQS-HTKN-----KMIGIVYADVLRRVGRTAQALAVMRQVAILYPED----QEVLAAYGKSLANAG 118 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~-~P~~-----~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~----~~~~~~~~~~~~~~g 118 (271) .++..+-+..|..|++. ||.- ...|..+|..|...|+.+.|..+|+++.+..-.. ..+|..++....+.. T Consensus 360 e~~~~~~i~tyteAv~~vdP~ka~Gs~~~Lw~~faklYe~~~~l~~aRvifeka~~V~y~~v~dLa~vw~~waemElrh~ 439 (835) T KOG2047 360 EGNAAEQINTYTEAVKTVDPKKAVGSPGTLWVEFAKLYENNGDLDDARVIFEKATKVPYKTVEDLAEVWCAWAEMELRHE 439 (835) T ss_pred CCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 47767787889999971695437897026799999999833748889999988414786326789999999999887665 Q ss_pred CHHHHHHHHHHHHHHH------------C------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 0112588764221100------------0------023356699999997421111100000000000000012457776 Q gi|254780592|r 119 YLDEGLDAINRAQRPD------------I------PDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAM 180 (271) Q Consensus 119 ~~~~A~~~~~~a~~~~------------p------~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~ 180 (271) +++.|+...+++...- | .....|..++......|-++.....|.+.++++--.+....|+|. T Consensus 440 ~~~~Al~lm~~A~~vP~~~~~~~yd~~~pvQ~rlhrSlkiWs~y~DleEs~gtfestk~vYdriidLriaTPqii~NyAm 519 (835) T KOG2047 440 NFEAALKLMRRATHVPTNPELEYYDNSEPVQARLHRSLKIWSMYADLEESLGTFESTKAVYDRIIDLRIATPQIIINYAM 519 (835) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 28999999876304898515555058985899998719999999889997252799999999888876187999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHC Q ss_conf 543322355435999999953 Q gi|254780592|r 181 SYLLMGDLKTAEEKLRFASQM 201 (271) Q Consensus 181 ~~~~~g~~~~A~~~~~~al~~ 201 (271) .+..-.-++++...|++-+.+ T Consensus 520 fLEeh~yfeesFk~YErgI~L 540 (835) T KOG2047 520 FLEEHKYFEESFKAYERGISL 540 (835) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 987558999999999737756 No 118 >PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional Probab=98.67 E-value=6.4e-07 Score=60.05 Aligned_cols=186 Identities=12% Similarity=0.020 Sum_probs=102.0 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH--- Q ss_conf 79999998420199999999999995757879999999999874100124466655663275210013555532100--- Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN--- 116 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~--- 116 (271) ..++.....++++.+|+..++-++..-|+|..+...++++|.+.|++.+|..++++..+.+|.+...-..+..+... T Consensus 48 iDkAy~e~k~~~~~~a~re~~h~l~~vP~n~~~~~~Laea~~h~g~~~eA~all~~~~~~~~gDA~L~l~L~wie~~pp~ 127 (987) T PRK09782 48 LDKALKAQKNNDEATAIREFEYIHQQVPDNIPLTLYLAEAYRHFGHDDRARLLLEDQLKRHPGDARLERSLAAIPVEVKS 127 (987) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCC T ss_conf 67999998715488899999999986889619999999999983893899999986134898257999888764148998 Q ss_pred --------------------------------HCCHHHHHHHHHH-HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf --------------------------------0001125887642-2110000233566999999974211111000000 Q gi|254780592|r 117 --------------------------------AGYLDEGLDAINR-AQRPDIPDWQLISAKGSVLAQMGKHSEALIEYER 163 (271) Q Consensus 117 --------------------------------~g~~~~A~~~~~~-a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~ 163 (271) +..+..+...++- .....+.-......++.......++..++..+.. T Consensus 128 a~~~Ee~la~~k~~~~v~~~q~~S~~~~~al~LAk~~ga~~aL~~~~~pp~~~g~~lr~~r~~~a~~lk~w~~a~~~l~~ 207 (987) T PRK09782 128 VTTVEELLAQQKACDAVPTLRCRSEVGQNALRLAQLPVARAQLNDATFAASPEGKTLRTDLLQRAIYLKQWSQADTLYNE 207 (987) T ss_pred CCHHHHHHHHCCHHHCCCHHHHHHHHHCCCCCHHHCCCCHHHCCHHHCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 42699999750543020067786553003346554058253250321899755636478999999870377888766578 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--CHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 00000000012457776543322355435999999953898--54788899999998699899999999 Q gi|254780592|r 164 ALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA--DSRIRQNLALVVGLQGRMKEAYSIAS 230 (271) Q Consensus 164 ~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~--~~~~~~~La~~~~~~g~~~eA~~~~~ 230 (271) +.....-+..-+..++.+++..++-+....+ +..+- +...+..++.-+...|.+++|...+. T Consensus 208 ~~~~~~l~a~d~~~la~a~l~~~~e~~l~al-----qa~~~~~~~Q~rl~ya~~li~~G~~~~aq~~L~ 271 (987) T PRK09782 208 ARQQNTLSAAERRQWFDVLLAGQLDDRLLAL-----QSQGIFTDPQSRITYATALAYRGEKARLQHYLI 271 (987) T ss_pred HHHCCCCCHHHHHHHHHHHHHCCCHHHHHHH-----CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8735688889999999888730237889873-----279877677789999999998306999999987 No 119 >KOG4648 consensus Probab=98.65 E-value=9.6e-08 Score=65.12 Aligned_cols=102 Identities=22% Similarity=0.184 Sum_probs=60.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC Q ss_conf 69999999742111110000000000000001245777654332235543599999995389854788899999998699 Q gi|254780592|r 142 SAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGR 221 (271) Q Consensus 142 ~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~ 221 (271) -..|+-|.++|.+++|+++|.+++...|.++..+.|.+.+|+....+..|...+..|+.++.....+|.+.+.+...+|. T Consensus 101 KE~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~KAYSRR~~AR~~Lg~ 180 (536) T KOG4648 101 KERGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVKAYSRRMQARESLGN 180 (536) T ss_pred HHHHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 87633676425413566665301245788864200588999988877754302889876328899998777789998775 Q ss_pred HHHHHHHHHHHCCCCCCCHHHH Q ss_conf 8999999998629172866599 Q gi|254780592|r 222 MKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 222 ~~eA~~~~~~~L~~~~~~~~~~ 243 (271) ..||.+.++.+|..+|....+. T Consensus 181 ~~EAKkD~E~vL~LEP~~~ELk 202 (536) T KOG4648 181 NMEAKKDCETVLALEPKNIELK 202 (536) T ss_pred HHHHHHHHHHHHHCCCCCHHHH T ss_conf 7877777889986085617899 No 120 >pfam09295 ChAPs ChAPs (Chs5p-Arf1p-binding proteins). ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialized cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they form also interactions with each other. Probab=98.64 E-value=3e-07 Score=62.13 Aligned_cols=113 Identities=14% Similarity=0.131 Sum_probs=91.7 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 20199999999999995757879999999999874100124466655663275210013555532100000112588764 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAIN 128 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~ 128 (271) .+.++.++..+++..+.+| ++..-++.++...++--+|+..+.+.+...|.+...+...+..+...++++-|+...+ T Consensus 182 t~~~~~~~~~le~L~~~~p---~v~~llakv~~~~~~E~~av~~i~~~L~~~p~d~~LL~~Qa~FLl~K~~yelAL~~A~ 258 (395) T pfam09295 182 TKRYDRALNLLEKLREEEP---EVVSLLARVLLLMDEEIDAVKLMHDALQENPMDYELLCLQAEFLLSKKKDELALECAK 258 (395) T ss_pred CCCHHHHHHHHHHHHHCCC---HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 4438989999999875184---4999999999965608999999999986188767899999999981798899999999 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 221100002335669999999742111110000000 Q gi|254780592|r 129 RAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERA 164 (271) Q Consensus 129 ~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~ 164 (271) +++...|.+...|..++.+|...|+++.|.-.++.. T Consensus 259 ~av~~~P~~F~~W~~La~~Yi~l~d~e~ALl~LNS~ 294 (395) T pfam09295 259 RAVNSAPSEFKTWALLVKVYIKLEDWENALLTLNSC 294 (395) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 987317866188999999999863299999999568 No 121 >pfam09295 ChAPs ChAPs (Chs5p-Arf1p-binding proteins). ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialized cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they form also interactions with each other. Probab=98.60 E-value=4.4e-07 Score=61.07 Aligned_cols=118 Identities=19% Similarity=0.216 Sum_probs=92.2 Q ss_pred HHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99987410012446665566327521001355553210000011258876422110000233566999999974211111 Q gi|254780592|r 78 DVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEA 157 (271) Q Consensus 78 ~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A 157 (271) ......++++.++.+++++...+|+ +...+++++...++..+|...+.+++...|.+...+...+..+...++++.| T Consensus 177 ~~~~~t~~~~~~~~~le~L~~~~p~---v~~llakv~~~~~~E~~av~~i~~~L~~~p~d~~LL~~Qa~FLl~K~~yelA 253 (395) T pfam09295 177 KYVRLTKRYDRALNLLEKLREEEPE---VVSLLARVLLLMDEEIDAVKLMHDALQENPMDYELLCLQAEFLLSKKKDELA 253 (395) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCH---HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHH T ss_conf 9994244389899999998751844---9999999999656089999999999861887678999999999817988999 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 00000000000000012457776543322355435999999 Q gi|254780592|r 158 LIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFA 198 (271) Q Consensus 158 ~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 198 (271) ....+++....|.+...|..++.+|+..|+++.|+-.+... T Consensus 254 L~~A~~av~~~P~~F~~W~~La~~Yi~l~d~e~ALl~LNS~ 294 (395) T pfam09295 254 LECAKRAVNSAPSEFKTWALLVKVYIKLEDWENALLTLNSC 294 (395) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999987317866188999999999863299999999568 No 122 >KOG2053 consensus Probab=98.59 E-value=3.3e-06 Score=55.72 Aligned_cols=199 Identities=15% Similarity=0.097 Sum_probs=131.7 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC Q ss_conf 79999998420199999999999995757879999999999874100124466655663275210013555532100000 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~ 119 (271) --++....+.|..++|...++......|+|...+-.+-.+|...|+.++|..+|+++++.+|+ .+.+..+-.+|.+.+. T Consensus 47 vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~tLq~l~~~y~d~~~~d~~~~~Ye~~~~~~P~-eell~~lFmayvR~~~ 125 (932) T KOG2053 47 VLKALSLFRLGKGDEALKLLEALYGLKGTDDLTLQFLQNVYRDLGKLDEAVHLYERANQKYPS-EELLYHLFMAYVREKS 125 (932) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-HHHHHHHHHHHHHHHH T ss_conf 999999998158646788875130479985689999999999874256899999998710884-8899999999999888 Q ss_pred HHHHHHHHHHHHHHHCHHHHH-HHHHHHHHHHHHHHHHHHH---------HHHHCCCCCCH-H-HHHHHHHHHHHHHHHH Q ss_conf 112588764221100002335-6699999997421111100---------00000000000-0-0124577765433223 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQL-ISAKGSVLAQMGKHSEALI---------EYERALELSPN-E-SSIVSNIAMSYLLMGD 187 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~-~~~lg~~~~~~~~~~~A~~---------~~~~~~~~~p~-~-~~~~~~~~~~~~~~g~ 187 (271) |.+-.+.--+.-+..|+.+-. |.....+...+...+.... ...+.++..+. . ..=...+-.++..+|+ T Consensus 126 yk~qQkaa~~LyK~~pk~~yyfWsV~Slilqs~~~~~~~~~~i~l~LA~~m~~~~l~~~gk~~s~aE~~Lyl~iL~~~~k 205 (932) T KOG2053 126 YKKQQKAALQLYKNFPKRAYYFWSVISLILQSIFSENELLDPILLALAEKMVQKLLEKKGKIESEAEIILYLLILELQGK 205 (932) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 99999999999985876640589999999982258863211032788999999986147751038999999999986246 Q ss_pred HHHHHHHH-HHHHHCCCCCHHHHH-HHHHHHHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf 55435999-999953898547888-99999998699899999999862917286 Q gi|254780592|r 188 LKTAEEKL-RFASQMIGADSRIRQ-NLALVVGLQGRMKEAYSIASQELSPEEAT 239 (271) Q Consensus 188 ~~~A~~~~-~~al~~~p~~~~~~~-~La~~~~~~g~~~eA~~~~~~~L~~~~~~ 239 (271) +++|...+ .+..+..+....... .-...+...+++.+=.++..+.+...+.+ T Consensus 206 ~~eal~~l~~~la~~l~~~~~~l~~~~~dllk~l~~w~~l~~l~~~Ll~k~~Dd 259 (932) T KOG2053 206 YQEALEFLAITLAEKLTSANLYLENKKLDLLKLLNRWQELFELSSRLLEKGNDD 259 (932) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC T ss_conf 788999997888874265446899988899998467688999999999847841 No 123 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.57 E-value=9.2e-07 Score=59.10 Aligned_cols=104 Identities=14% Similarity=0.089 Sum_probs=90.7 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH---HHHHHHHHH Q ss_conf 355553210000011258876422110000---23356699999997421111100000000000000---012457776 Q gi|254780592|r 107 LAAYGKSLANAGYLDEGLDAINRAQRPDIP---DWQLISAKGSVLAQMGKHSEALIEYERALELSPNE---SSIVSNIAM 180 (271) Q Consensus 107 ~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~---~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~---~~~~~~~~~ 180 (271) .+..+.-+...|+|..|...|...+...|+ .+.+++++|.++..+|++++|...|.......|++ ++.+..+|. T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg~ 223 (262) T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLGV 223 (262) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 99999999983878899999999997099975560249998899986246167999999999838999988289999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 543322355435999999953898547888 Q gi|254780592|r 181 SYLLMGDLKTAEEKLRFASQMIGADSRIRQ 210 (271) Q Consensus 181 ~~~~~g~~~~A~~~~~~al~~~p~~~~~~~ 210 (271) +....|+.++|...|+++++..|+.+.+.. T Consensus 224 ~~~~l~~~d~A~atl~qv~k~YP~t~aA~~ 253 (262) T COG1729 224 SLGRLGNTDEACATLQQVIKRYPGTDAAKL 253 (262) T ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 999821779999999999987899779999 No 124 >KOG2047 consensus Probab=98.56 E-value=3.4e-05 Score=49.50 Aligned_cols=205 Identities=16% Similarity=0.113 Sum_probs=140.7 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCC----HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC------------------C Q ss_conf 984201999999999999957578----799999999998741001244666556632752------------------1 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKN----KMIGIVYADVLRRVGRTAQALAVMRQVAILYPE------------------D 103 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~----~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~------------------~ 103 (271) +.+.++++.|...|+++.+.+=.- ..+|...|+.-.+..+++.|+.+++++...-.. . T Consensus 397 Ye~~~~l~~aRvifeka~~V~y~~v~dLa~vw~~waemElrh~~~~~Al~lm~~A~~vP~~~~~~~yd~~~pvQ~rlhrS 476 (835) T KOG2047 397 YENNGDLDDARVIFEKATKVPYKTVEDLAEVWCAWAEMELRHENFEAALKLMRRATHVPTNPELEYYDNSEPVQARLHRS 476 (835) T ss_pred HHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCHHHHHHHH T ss_conf 98337488899999884147863267899999999998876652899999987630489851555505898589999871 Q ss_pred CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH--HHHHHHHHH- Q ss_conf 00135555321000001125887642211000023356699999997421111100000000000000--012457776- Q gi|254780592|r 104 QEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNE--SSIVSNIAM- 180 (271) Q Consensus 104 ~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~--~~~~~~~~~- 180 (271) ..+|..+++.....|-++.....|++.+.+.--.++...+.|..+.....++++.+.|++.+.+.+-+ .++|..+-. T Consensus 477 lkiWs~y~DleEs~gtfestk~vYdriidLriaTPqii~NyAmfLEeh~yfeesFk~YErgI~LFk~p~v~diW~tYLtk 556 (835) T KOG2047 477 LKIWSMYADLEESLGTFESTKAVYDRIIDLRIATPQIIINYAMFLEEHKYFEESFKAYERGISLFKWPNVYDIWNTYLTK 556 (835) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH T ss_conf 99999998899972527999999998888761879999999999875589999999997377567883289999999999 Q ss_pred --HHHHHHHHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf --5433223554359999999538985--478889999999869989999999986291728665999999998 Q gi|254780592|r 181 --SYLLMGDLKTAEEKLRFASQMIGAD--SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILS 250 (271) Q Consensus 181 --~~~~~g~~~~A~~~~~~al~~~p~~--~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~ 250 (271) .....-..+.|...|++|++..|.. ..++...|..-..-|--..|.++|+++-+.-+...........+. T Consensus 557 fi~rygg~klEraRdLFEqaL~~Cpp~~aKtiyLlYA~lEEe~GLar~amsiyerat~~v~~a~~l~myni~I~ 630 (835) T KOG2047 557 FIKRYGGTKLERARDLFEQALDGCPPEHAKTIYLLYAKLEEEHGLARHAMSIYERATSAVKEAQRLDMYNIYIK 630 (835) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 99996688778899999999835998877899999999999850799899999998732798999999999999 No 125 >KOG4340 consensus Probab=98.56 E-value=3.5e-06 Score=55.52 Aligned_cols=157 Identities=14% Similarity=0.049 Sum_probs=105.0 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 99998420199999999999995757879999999999874100124466655663275210013555532100000112 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) +.-+....+|..|...|.+.-...|+.....+-.|..+++.+.+..|+.+.........-..+.+..-+.+....+++.. T Consensus 51 gyCYY~~Q~f~~AA~CYeQL~ql~P~~~qYrlY~AQSLY~A~i~ADALrV~~~~~D~~~L~~~~lqLqaAIkYse~Dl~g 130 (459) T KOG4340 51 GYCYYRLQEFALAAECYEQLGQLHPELEQYRLYQAQSLYKACIYADALRVAFLLLDNPALHSRVLQLQAAIKYSEGDLPG 130 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 99999999998888999999863718888999999999970352889999999568498999999999998605455535 Q ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 5887642211000023356699999997421111100000000000000012457776543322355435999999953 Q gi|254780592|r 123 GLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQM 201 (271) Q Consensus 123 A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~ 201 (271) +....++... .+++....+.|-+..+.|+++.|.+-|..+++..+-.+..-++++.+....++++.|+.+..+.++. T Consensus 131 ~rsLveQlp~--en~Ad~~in~gCllykegqyEaAvqkFqaAlqvsGyqpllAYniALaHy~~~qyasALk~iSEIieR 207 (459) T KOG4340 131 SRSLVEQLPS--ENEADGQINLGCLLYKEGQYEAAVQKFQAALQVSGYQPLLAYNLALAHYSSRQYASALKHISEIIER 207 (459) T ss_pred HHHHHHHCCC--CCCHHHHHCCHHHEECCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4999985778--8850233030120001654899999999987305767278999999987312077899999999975 No 126 >KOG1130 consensus Probab=98.52 E-value=1.9e-07 Score=63.33 Aligned_cols=127 Identities=15% Similarity=0.163 Sum_probs=58.5 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHC------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----CCH--HHH Q ss_conf 135555321000001125887642211000------02335669999999742111110000000000----000--001 Q gi|254780592|r 106 VLAAYGKSLANAGYLDEGLDAINRAQRPDI------PDWQLISAKGSVLAQMGKHSEALIEYERALEL----SPN--ESS 173 (271) Q Consensus 106 ~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p------~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~----~p~--~~~ 173 (271) ++-.+|..|+-+|+|+.|...-+.-+.... ....++.++|+++.-.|+++.|.+.|+..+.+ ... ... T Consensus 197 a~GnLGNTyYlLGdf~~ai~~H~~RL~ia~efGDrAaeRRA~sNlgN~hiflg~fe~A~ehYK~tl~LAielg~r~vEAQ 276 (639) T KOG1130 197 AYGNLGNTYYLLGDFDQAIHFHKLRLEIAQEFGDRAAERRAHSNLGNCHIFLGNFELAIEHYKLTLNLAIELGNRTVEAQ 276 (639) T ss_pred HHCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 10245763565045799999999999999986318888776404310355530427699999999999999631368889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCC------CCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 245777654332235543599999995389------85478889999999869989999999986 Q gi|254780592|r 174 IVSNIAMSYLLMGDLKTAEEKLRFASQMIG------ADSRIRQNLALVVGLQGRMKEAYSIASQE 232 (271) Q Consensus 174 ~~~~~~~~~~~~g~~~~A~~~~~~al~~~p------~~~~~~~~La~~~~~~g~~~eA~~~~~~~ 232 (271) ..+.+|..|....++..|+.++++-+.+.. ...++++.||..+...|..+.|....++. T Consensus 277 scYSLgNtytll~e~~kAI~Yh~rHLaIAqeL~DriGe~RacwSLgna~~alg~h~kAl~fae~h 341 (639) T KOG1130 277 SCYSLGNTYTLLKEVQKAITYHQRHLAIAQELEDRIGELRACWSLGNAFNALGEHRKALYFAELH 341 (639) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89876448999999999999999999999998876414899999878887642178889999999 No 127 >KOG3081 consensus Probab=98.51 E-value=9.7e-06 Score=52.83 Aligned_cols=166 Identities=18% Similarity=0.129 Sum_probs=108.2 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHH Q ss_conf 87999999999987410012446665566327521001355553210000011258876422110000233566999999 Q gi|254780592|r 69 NKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVL 148 (271) Q Consensus 69 ~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~ 148 (271) |...+..-|.++...|++++|.....+. .+.++...-..++.++.+.+-|...+++....+.. .....++..| T Consensus 107 n~i~~l~aa~i~~~~~~~deAl~~~~~~-----~~lE~~Al~VqI~lk~~r~d~A~~~lk~mq~ided--~tLtQLA~aw 179 (299) T KOG3081 107 NLIDLLLAAIIYMHDGDFDEALKALHLG-----ENLEAAALNVQILLKMHRFDLAEKELKKMQQIDED--ATLTQLAQAW 179 (299) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHCC-----CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH--HHHHHHHHHH T ss_conf 3899997668754278868999998521-----24889998779999878999999999999711357--8999999999 Q ss_pred HH----HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHH Q ss_conf 97----42111110000000000000001245777654332235543599999995389854788899999998699899 Q gi|254780592|r 149 AQ----MGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKE 224 (271) Q Consensus 149 ~~----~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~e 224 (271) .. .+...+|.=.|+..-+..|..+..+...+.+.+.+|++++|...++.++..++.+++...|+-..-...|..++ T Consensus 180 v~la~ggek~qdAfyifeE~s~k~~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL~Nliv~a~~~Gkd~~ 259 (299) T KOG3081 180 VKLATGGEKIQDAFYIFEELSEKTPPTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPETLANLIVLALHLGKDAE 259 (299) T ss_pred HHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHH T ss_conf 99960632454689999998353599768870699998971699999999999985347878999999999998189917 Q ss_pred HHHHHHHHC-CCCCCCHH Q ss_conf 999999862-91728665 Q gi|254780592|r 225 AYSIASQEL-SPEEATRN 241 (271) Q Consensus 225 A~~~~~~~L-~~~~~~~~ 241 (271) +..-+-.-| ...|.+.. T Consensus 260 ~~~r~l~QLk~~~p~h~~ 277 (299) T KOG3081 260 VTERNLSQLKLSHPEHPF 277 (299) T ss_pred HHHHHHHHHHHCCCCCHH T ss_conf 788899998740876537 No 128 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=98.51 E-value=7.5e-06 Score=53.53 Aligned_cols=134 Identities=16% Similarity=0.147 Sum_probs=100.9 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-- Q ss_conf 22110000233566999999974211111000000000-000000124577765433223554359999999538985-- Q gi|254780592|r 129 RAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALE-LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD-- 205 (271) Q Consensus 129 ~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~-~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~-- 205 (271) +.+...|.. .....+|..+...|++.+|...|.+++. ....+...+..++.+.+..+++..|...+++..+.+|.. T Consensus 81 ~~~~~ApTv-qnr~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~ 159 (251) T COG4700 81 EELAIAPTV-QNRYRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRS 159 (251) T ss_pred HHHHHCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCC T ss_conf 887646137-8899999999984105666889999854656798899988898888616588888899998605976589 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHH-HHHHHHHHHH Q ss_conf 478889999999869989999999986291728665999999998321-2999999997 Q gi|254780592|r 206 SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQRD-PWKKIAKARS 263 (271) Q Consensus 206 ~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~-~~~~~a~a~~ 263 (271) ++....+|.++..+|++.+|+..++.+++..|...-..+...++..++ ....-++++. T Consensus 160 pd~~Ll~aR~laa~g~~a~Aesafe~a~~~ypg~~ar~~Y~e~La~qgr~~ea~aq~~~ 218 (251) T COG4700 160 PDGHLLFARTLAAQGKYADAESAFEVAISYYPGPQARIYYAEMLAKQGRLREANAQYVA 218 (251) T ss_pred CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 97258999999845880558999999998588777898999999971221678887999 No 129 >KOG3081 consensus Probab=98.51 E-value=3.2e-05 Score=49.66 Aligned_cols=179 Identities=14% Similarity=0.089 Sum_probs=128.2 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC--CCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHH Q ss_conf 79999999999874100124466655663275--2100135555321000001125887642211000023356699999 Q gi|254780592|r 70 KMIGIVYADVLRRVGRTAQALAVMRQVAILYP--EDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSV 147 (271) Q Consensus 70 ~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p--~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~ 147 (271) ..+...++..+..-++.+.-++-+.+.+.... .+...+..-+.++...|++++|+....+. ...+....-..+ T Consensus 72 lqAvr~~a~~~~~e~~~~~~~~~l~E~~a~~~~~sn~i~~l~aa~i~~~~~~~deAl~~~~~~-----~~lE~~Al~VqI 146 (299) T KOG3081 72 LQAVRLLAEYLELESNKKSILASLYELVADSTDGSNLIDLLLAAIIYMHDGDFDEALKALHLG-----ENLEAAALNVQI 146 (299) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHCC-----CHHHHHHHHHHH T ss_conf 899999998860800267788879999875014413899997668754278868999998521-----248899987799 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHH Q ss_conf 9974211111000000000000000124577765433----223554359999999538985478889999999869989 Q gi|254780592|r 148 LAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLL----MGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMK 223 (271) Q Consensus 148 ~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~----~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~ 223 (271) +.+..+.+.|....+++...+.+.. +..++.+++. .+.+..|.-+|+..-+..|..+......|.+...+||++ T Consensus 147 ~lk~~r~d~A~~~lk~mq~ided~t--LtQLA~awv~la~ggek~qdAfyifeE~s~k~~~T~~llnG~Av~~l~~~~~e 224 (299) T KOG3081 147 LLKMHRFDLAEKELKKMQQIDEDAT--LTQLAQAWVKLATGGEKIQDAFYIFEELSEKTPPTPLLLNGQAVCHLQLGRYE 224 (299) T ss_pred HHHHHHHHHHHHHHHHHHCCCHHHH--HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHCCHH T ss_conf 9987899999999999971135789--99999999999606324546899999983535997688706999989716999 Q ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99999998629172866599999999832129 Q gi|254780592|r 224 EAYSIASQELSPEEATRNIKYIKSILSQRDPW 255 (271) Q Consensus 224 eA~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~ 255 (271) ||+.+++.+|..++.+++.......+.+..|. T Consensus 225 eAe~lL~eaL~kd~~dpetL~Nliv~a~~~Gk 256 (299) T KOG3081 225 EAESLLEEALDKDAKDPETLANLIVLALHLGK 256 (299) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 99999999985347878999999999998189 No 130 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=98.42 E-value=1.3e-06 Score=58.17 Aligned_cols=183 Identities=19% Similarity=0.136 Sum_probs=124.8 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHH--HHHHHHHHHHHC----CCCHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 999999998540368543311799999984201999--999999999957----57879999999999874100124466 Q gi|254780592|r 19 VFLIPLLLFSSCHLNPRISIPDPESLENMNHEQLLE--VTSTIGLQYQSH----TKNKMIGIVYADVLRRVGRTAQALAV 92 (271) Q Consensus 19 ~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--A~~~~~~al~~~----P~~~~~~~~~a~~l~~~g~~~~A~~~ 92 (271) +.+++++.++||+......-...+.+..-.+..++. -+.-+.+.+... ......++..|..|-..|-..-|.-- T Consensus 8 ~~~~~~~~LagC~~~~~~~~~k~~~~~~~~qp~lqqEV~iarlsqlL~~~~l~~eeRA~l~fERGvlYDSlGL~~LAR~D 87 (297) T COG4785 8 CFVATALTLAGCSNQNETFWRKSEVLAVPLQPTLQQEVILARMSQILASRALTDEERAQLLFERGVLYDSLGLRALARND 87 (297) T ss_pred HHHHHHHHHHHHHCCCCCEECCCCEEECCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 99999998775304667410356203346787488899999999999860488788889988612034224079998601 Q ss_pred HHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH Q ss_conf 65566327521001355553210000011258876422110000233566999999974211111000000000000000 Q gi|254780592|r 93 MRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNES 172 (271) Q Consensus 93 ~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~ 172 (271) |.+++.+.|+-+.+...+|..+...|+++.|.+.|+...+.||...-+..++|..+..-|++..|.+.+.+-...+|+++ T Consensus 88 ftQaLai~P~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya~lNRgi~~YY~gR~~LAq~d~~~fYQ~D~~DP 167 (297) T COG4785 88 FSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAHLNRGIALYYGGRYKLAQDDLLAFYQDDPNDP 167 (297) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHCCCEEEEECCCCHHHHHHHHHHHHCCCCCH T ss_conf 45666349675899989999998602205799986667104775107776044023444751555999999984498980 Q ss_pred HHHHHHHHHHHH--HHHHHHHHHHH-HHHHHCCCC Q ss_conf 124577765433--22355435999-999953898 Q gi|254780592|r 173 SIVSNIAMSYLL--MGDLKTAEEKL-RFASQMIGA 204 (271) Q Consensus 173 ~~~~~~~~~~~~--~g~~~~A~~~~-~~al~~~p~ 204 (271) --- -+.|+. .-++.+|...+ +++.+.+.+ T Consensus 168 fR~---LWLYl~E~k~dP~~A~tnL~qR~~~~d~e 199 (297) T COG4785 168 FRS---LWLYLNEQKLDPKQAKTNLKQRAEKSDKE 199 (297) T ss_pred HHH---HHHHHHHHCCCHHHHHHHHHHHHHCCCHH T ss_conf 789---99998761078689999999998705676 No 131 >KOG3617 consensus Probab=98.41 E-value=2.1e-05 Score=50.78 Aligned_cols=143 Identities=17% Similarity=0.138 Sum_probs=63.2 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHH Q ss_conf 98420199999999999995757879999999999874100124466655663275210013555532100000112588 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLD 125 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~ 125 (271) .++.|.+++|...|++.-.-| .+-..|...|.+++|.++.+.--.++ -...+..++.-+...++.+.|++ T Consensus 810 AieLgMlEeA~~lYr~ckR~D--------LlNKlyQs~g~w~eA~eiAE~~DRiH--Lr~Tyy~yA~~Lear~Di~~Ale 879 (1416) T KOG3617 810 AIELGMLEEALILYRQCKRYD--------LLNKLYQSQGMWSEAFEIAETKDRIH--LRNTYYNYAKYLEARRDIEAALE 879 (1416) T ss_pred HHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHCCCHHHHHHHHHHCCCEE--HHHHHHHHHHHHHHHCCHHHHHH T ss_conf 998734889999999888887--------99999986100888888775104031--24568778788886212889998 Q ss_pred HHHHHHH----------HHC----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC------------------ Q ss_conf 7642211----------000----------02335669999999742111110000000000------------------ Q gi|254780592|r 126 AINRAQR----------PDI----------PDWQLISAKGSVLAQMGKHSEALIEYERALEL------------------ 167 (271) Q Consensus 126 ~~~~a~~----------~~p----------~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~------------------ 167 (271) +|+++-. .+| .+...|.+.|..+...|+.+.|+..|..+... T Consensus 880 yyEK~~~hafev~rmL~e~p~~~e~Yv~~~~d~~L~~WWgqYlES~GemdaAl~~Y~~A~D~fs~VrI~C~qGk~~kAa~ 959 (1416) T KOG3617 880 YYEKAGVHAFEVFRMLKEYPKQIEQYVRRKRDESLYSWWGQYLESVGEMDAALSFYSSAKDYFSMVRIKCIQGKTDKAAR 959 (1416) T ss_pred HHHHCCCHHHHHHHHHHHCHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHEEEEEECCCCHHHHH T ss_conf 88751873878899988583889999985065289999999975243157889999876555542366763167358899 Q ss_pred ---CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf ---0000012457776543322355435999999 Q gi|254780592|r 168 ---SPNESSIVSNIAMSYLLMGDLKTAEEKLRFA 198 (271) Q Consensus 168 ---~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 198 (271) ...+..+-+.+|..|...|++.+|+.+|.+| T Consensus 960 iA~esgd~AAcYhlaR~YEn~g~v~~Av~FfTrA 993 (1416) T KOG3617 960 IAEESGDKAACYHLARMYENDGDVVKAVKFFTRA 993 (1416) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9886141788999999862247788899998998 No 132 >KOG1941 consensus Probab=98.40 E-value=8.2e-05 Score=47.16 Aligned_cols=165 Identities=18% Similarity=0.154 Sum_probs=115.1 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC-----CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHH------H Q ss_conf 7999999999987410012446665566327521-----0013555532100000112588764221100002------3 Q gi|254780592|r 70 KMIGIVYADVLRRVGRTAQALAVMRQVAILYPED-----QEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPD------W 138 (271) Q Consensus 70 ~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~-----~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~------~ 138 (271) .+++.+++..+.+.-++.+++.+-+--+.+.-.. ..+...++.....++.++++++.|+++....... . T Consensus 83 ~ea~lnlar~~e~l~~f~kt~~y~k~~l~lpgt~~~~~~gq~~l~~~~Ahlgls~fq~~Lesfe~A~~~A~~~~D~~LEl 162 (518) T KOG1941 83 LEAYLNLARSNEKLCEFHKTISYCKTCLGLPGTRAGQLGGQVSLSMGNAHLGLSVFQKALESFEKALRYAHNNDDAMLEL 162 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEE T ss_conf 99999899889999985668998878742898785556503565777775137889999999999998765069840354 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH----H------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCC------ Q ss_conf 356699999997421111100000000000000----0------124577765433223554359999999538------ Q gi|254780592|r 139 QLISAKGSVLAQMGKHSEALIEYERALELSPNE----S------SIVSNIAMSYLLMGDLKTAEEKLRFASQMI------ 202 (271) Q Consensus 139 ~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~----~------~~~~~~~~~~~~~g~~~~A~~~~~~al~~~------ 202 (271) +....+|.++....++++|.-+..++.++.... + .+++.++.++-.+|..-.|.++.+++.++. T Consensus 163 qvcv~Lgslf~~l~D~~Kal~f~~kA~~lv~s~~l~d~~~kyr~~~lyhmaValR~~G~LgdA~e~C~Ea~klal~~Gdr 242 (518) T KOG1941 163 QVCVSLGSLFAQLKDYEKALFFPCKAAELVNSYGLKDWSLKYRAMSLYHMAVALRLLGRLGDAMECCEEAMKLALQHGDR 242 (518) T ss_pred EHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCH T ss_conf 13533889999987446777614769999875275605689999999999999998412113999999999999984777 Q ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 98547888999999986998999999998629 Q gi|254780592|r 203 GADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 203 p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) +...+....+|.||...|+.+.|..-|+++.. T Consensus 243 a~~arc~~~~aDIyR~~gd~e~af~rYe~Am~ 274 (518) T KOG1941 243 ALQARCLLCFADIYRSRGDLERAFRRYEQAMG 274 (518) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 98899999999998861307668999999998 No 133 >KOG4234 consensus Probab=98.39 E-value=3.1e-06 Score=55.89 Aligned_cols=104 Identities=20% Similarity=0.218 Sum_probs=49.9 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHCHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH Q ss_conf 55321000001125887642211000023-----3566999999974211111000000000000000124577765433 Q gi|254780592|r 110 YGKSLANAGYLDEGLDAINRAQRPDIPDW-----QLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLL 184 (271) Q Consensus 110 ~~~~~~~~g~~~~A~~~~~~a~~~~p~~~-----~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~ 184 (271) -|.-+...|+|++|...|..++..-|..+ -.+.++|.++.+++.++.|+....+++++.|....++...+.+|.+ T Consensus 101 EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAeayek 180 (271) T KOG4234 101 EGNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEKALERRAEAYEK 180 (271) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 77776513407789999999998686630888999972058899875668989999876575084067999999999885 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 22355435999999953898547888999 Q gi|254780592|r 185 MGDLKTAEEKLRFASQMIGADSRIRQNLA 213 (271) Q Consensus 185 ~g~~~~A~~~~~~al~~~p~~~~~~~~La 213 (271) ...+++|+..|++.++.+|....++...+ T Consensus 181 ~ek~eealeDyKki~E~dPs~~ear~~i~ 209 (271) T KOG4234 181 MEKYEEALEDYKKILESDPSRREAREAIA 209 (271) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 55178889999999871820689999998 No 134 >KOG1941 consensus Probab=98.38 E-value=8e-06 Score=53.36 Aligned_cols=192 Identities=11% Similarity=0.056 Sum_probs=100.9 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCH------HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC---------- Q ss_conf 7999999842019999999999999575787------999999999987410012446665566327521---------- Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNK------MIGIVYADVLRRVGRTAQALAVMRQVAILYPED---------- 103 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~------~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~---------- 103 (271) ..-+..++.-+.|+++.+.|+++++...++. .+...+|..+.+..++++|..+..++.++.... T Consensus 126 l~~~~Ahlgls~fq~~Lesfe~A~~~A~~~~D~~LElqvcv~Lgslf~~l~D~~Kal~f~~kA~~lv~s~~l~d~~~kyr 205 (518) T KOG1941 126 LSMGNAHLGLSVFQKALESFEKALRYAHNNDDAMLELQVCVSLGSLFAQLKDYEKALFFPCKAAELVNSYGLKDWSLKYR 205 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 57777751378899999999999987650698403541353388999998744677761476999987527560568999 Q ss_pred CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH------CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH------H Q ss_conf 0013555532100000112588764221100------002335669999999742111110000000000000------0 Q gi|254780592|r 104 QEVLAAYGKSLANAGYLDEGLDAINRAQRPD------IPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN------E 171 (271) Q Consensus 104 ~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~------p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~------~ 171 (271) .-++..++..+..+|+.-+|.+..+.+.+.. +.......-+|.+|...|+.+.|..-|+.+...... . T Consensus 206 ~~~lyhmaValR~~G~LgdA~e~C~Ea~klal~~Gdra~~arc~~~~aDIyR~~gd~e~af~rYe~Am~~m~~~gdrmgq 285 (518) T KOG1941 206 AMSLYHMAVALRLLGRLGDAMECCEEAMKLALQHGDRALQARCLLCFADIYRSRGDLERAFRRYEQAMGTMASLGDRMGQ 285 (518) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999984121139999999999999847779889999999999886130766899999999899875057788 Q ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHCCC----CC--HHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 01245777654332235-----543599999995389----85--47888999999986998999999998 Q gi|254780592|r 172 SSIVSNIAMSYLLMGDL-----KTAEEKLRFASQMIG----AD--SRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 172 ~~~~~~~~~~~~~~g~~-----~~A~~~~~~al~~~p----~~--~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) ..++...+..+....-. =+|+++-++++++.. .. ...+..++.+|..+|.-++=...+.+ T Consensus 286 v~al~g~Akc~~~~r~~~k~~~Crale~n~r~levA~~IG~K~~vlK~hcrla~iYrs~gl~d~~~~h~~r 356 (518) T KOG1941 286 VEALDGAAKCLETLRLQNKICNCRALEFNTRLLEVASSIGAKLSVLKLHCRLASIYRSKGLQDELRAHVVR 356 (518) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99987689899999876124454054688899999998535788899999799999850430668999999 No 135 >COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones] Probab=98.36 E-value=2.2e-05 Score=50.66 Aligned_cols=163 Identities=17% Similarity=0.144 Sum_probs=98.9 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHH--H Q ss_conf 87999999999987410012446665566327521001355553210000011258876422110000233566999--9 Q gi|254780592|r 69 NKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKG--S 146 (271) Q Consensus 69 ~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg--~ 146 (271) ..+.-+..+.-....|++.+|...++.++...|++..+...++.++...|+.+.|...+...-.......... ..+ . T Consensus 133 ~~e~~~~~~~~~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~~~~~~~~-l~a~i~ 211 (304) T COG3118 133 EEEEALAEAKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHG-LQAQIE 211 (304) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHH-HHHHHH T ss_conf 7788898766654165445589999999985845663777899999981885899999972850010457888-999999 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHCCCHHH Q ss_conf 999742111110000000000000001245777654332235543599999995389--854788899999998699899 Q gi|254780592|r 147 VLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG--ADSRIRQNLALVVGLQGRMKE 224 (271) Q Consensus 147 ~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p--~~~~~~~~La~~~~~~g~~~e 224 (271) .+.+.....+ ...+......+|++..+...++..+...|+.+.|...+-..+..+- .+..++..|-.++...|..|. T Consensus 212 ll~qaa~~~~-~~~l~~~~aadPdd~~aa~~lA~~~~~~g~~e~Ale~Ll~~l~~d~~~~d~~~Rk~lle~f~~~g~~Dp 290 (304) T COG3118 212 LLEQAAATPE-IQDLQRRLAADPDDVEAALALADQLHLVGRNEAALEHLLALLRRDRGFEDGEARKTLLELFEAFGPADP 290 (304) T ss_pred HHHHHHCCCC-HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCH T ss_conf 9999745997-789999987099988999999999997388799999999998750344376899999999996388987 Q ss_pred HHHHHHHHC Q ss_conf 999999862 Q gi|254780592|r 225 AYSIASQEL 233 (271) Q Consensus 225 A~~~~~~~L 233 (271) +...+++-| T Consensus 291 ~~~~~RRkL 299 (304) T COG3118 291 LVLAYRRKL 299 (304) T ss_pred HHHHHHHHH T ss_conf 899999999 No 136 >KOG2610 consensus Probab=98.35 E-value=5.1e-05 Score=48.44 Aligned_cols=152 Identities=14% Similarity=0.031 Sum_probs=97.7 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC-CCCC---CHHHHHHHHHHHHHCCHHHH Q ss_conf 4201999999999999957578799999999998741001244666556632-7521---00135555321000001125 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAIL-YPED---QEVLAAYGKSLANAGYLDEG 123 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~-~p~~---~~~~~~~~~~~~~~g~~~~A 123 (271) ..|++.+|...+++.++..|.|.-++..--.++.-+|+.+.....+++.+.. +|+- .-+.-.++..+...|-+++| T Consensus 115 ~~g~~h~a~~~wdklL~d~PtDlla~kfsh~a~fy~G~~~~~k~ai~kIip~wn~dlp~~sYv~GmyaFgL~E~g~y~dA 194 (491) T KOG2610 115 GRGKHHEAAIEWDKLLDDYPTDLLAVKFSHDAHFYNGNQIGKKNAIEKIIPKWNADLPCYSYVHGMYAFGLEECGIYDDA 194 (491) T ss_pred CCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHH T ss_conf 35664177888999987585035544544467874141442145799843345899857899888998667872550269 Q ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 887642211000023356699999997421111100000000000000----0124577765433223554359999999 Q gi|254780592|r 124 LDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNE----SSIVSNIAMSYLLMGDLKTAEEKLRFAS 199 (271) Q Consensus 124 ~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~----~~~~~~~~~~~~~~g~~~~A~~~~~~al 199 (271) .+..++++..++.+.=+....++++...+++.++.+.-.+.-..=... .--+...+..++..+.|+.|++.|.+-+ T Consensus 195 Ek~A~ralqiN~~D~Wa~Ha~aHVlem~~r~Keg~eFM~~ted~Wr~s~mlasHNyWH~Al~~iE~aeye~aleIyD~ei 274 (491) T KOG2610 195 EKQADRALQINRFDCWASHAKAHVLEMNGRHKEGKEFMYKTEDDWRQSWMLASHNYWHTALFHIEGAEYEKALEIYDREI 274 (491) T ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 99998631478741688988999998402056579999853141212368876506888886410111357888887999 No 137 >KOG0376 consensus Probab=98.33 E-value=1e-06 Score=58.86 Aligned_cols=121 Identities=12% Similarity=0.014 Sum_probs=103.8 Q ss_pred CHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH Q ss_conf 11799999984201999999999999957578799999999998741001244666556632752100135555321000 Q gi|254780592|r 38 IPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANA 117 (271) Q Consensus 38 ~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~ 117 (271) .+..++.+.+...+|+.|+..|.++++++|++...+-+.+.++...+.+..|+.-+.++++.+|.....+...|.+...+ T Consensus 6 e~k~ean~~l~~~~fd~avdlysKaI~ldpnca~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m~l 85 (476) T KOG0376 6 ELKNEANEALKDKVFDVAVDLYSKAIELDPNCAIYFANRALAHLKVESFGGALHDALKAIELDPTYIKAYVRRGTAVMAL 85 (476) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEECHHHHHHEEECHHHHHHHHHHHHHHCCCHHHHEEEECCHHHHHH T ss_conf 41467765412001999999999998528863244230212343200033677877763540801233034311778757 Q ss_pred CCHHHHHHHHHHHHHHHCHHHHHHHHHHHHH--HHHHHHHHHH Q ss_conf 0011258876422110000233566999999--9742111110 Q gi|254780592|r 118 GYLDEGLDAINRAQRPDIPDWQLISAKGSVL--AQMGKHSEAL 158 (271) Q Consensus 118 g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~--~~~~~~~~A~ 158 (271) +++.+|...|+......|+++.....+-.|- .....++.++ T Consensus 86 ~~~~~A~~~l~~~~~l~Pnd~~~~r~~~Ec~~~vs~~~fe~ai 128 (476) T KOG0376 86 GEFKKALLDLEKVKKLAPNDPDATRKIDECNKIVSEEKFEKAI 128 (476) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 7799999999876623867688998789999999887641124 No 138 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=98.32 E-value=0.00011 Score=46.37 Aligned_cols=157 Identities=12% Similarity=-0.008 Sum_probs=113.0 Q ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHH---H Q ss_conf 7879999999999874100124466655663275210---013555532100000112588764221100002335---6 Q gi|254780592|r 68 KNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ---EVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQL---I 141 (271) Q Consensus 68 ~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~---~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~---~ 141 (271) .++..++..|......|++.+|+..|+++....|..+ .+...++.++...+++++|...+++-+...|.++.. + T Consensus 30 ~~~~~~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~~A~qA~L~~ayayy~~~~y~~A~~~~~rFi~lyP~~~~v~Ya~ 109 (243) T PRK10866 30 NPPNEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFMRLNPTHPNIDYVM 109 (243) T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 99999999999999878999999999999986899767899999999998730778999999999998678999738999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH---H-------H-- Q ss_conf 699999997421111100000000000000012457776543322355435999999953898547---8-------8-- Q gi|254780592|r 142 SAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSR---I-------R-- 209 (271) Q Consensus 142 ~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~---~-------~-- 209 (271) ...|.++...++ ..+..... .-..-..++...+|...|++.+...|++.- + + T Consensus 110 Yl~gl~~~~~~~-----~~~~~~~~-----------~~~~~rD~~~~~~A~~~f~~li~~yP~S~Ya~dA~~r~~~lrn~ 173 (243) T PRK10866 110 YMRGLTNMALDD-----SALQGFFG-----------VDRSDRDPQHARAAFSDFSKLVRGYPNSQYTTDATKRLVFLKDR 173 (243) T ss_pred HHHHHHHHHCCH-----HHHHHHHC-----------CCHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH T ss_conf 998898774013-----58888743-----------25643076999999999999998789984589999999999999 Q ss_pred -----HHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf -----8999999986998999999998629172866 Q gi|254780592|r 210 -----QNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 210 -----~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) ...|..|...|.+.-|..-++.++..-|... T Consensus 174 LA~~El~Va~fY~kr~~y~AAi~R~~~vi~~Yp~t~ 209 (243) T PRK10866 174 LAKYEYSVAEYYTKRGAWVAVVNRVEGMLRDYPDTQ 209 (243) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC T ss_conf 999999999999992656999999999997689996 No 139 >KOG4234 consensus Probab=98.30 E-value=3.9e-06 Score=55.28 Aligned_cols=102 Identities=22% Similarity=0.221 Sum_probs=71.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 669999999742111110000000000000001-----245777654332235543599999995389854788899999 Q gi|254780592|r 141 ISAKGSVLAQMGKHSEALIEYERALELSPNESS-----IVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALV 215 (271) Q Consensus 141 ~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~-----~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~ 215 (271) .-.-|+-+...|++++|...|..+++.-|..+. .+.+.|.+++.++..+.|+..+.++++++|.+..+..+.|.+ T Consensus 98 lK~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAea 177 (271) T KOG4234 98 LKKEGNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEKALERRAEA 177 (271) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 98877776513407789999999998686630888999972058899875668989999876575084067999999999 Q ss_pred HHHCCCHHHHHHHHHHHCCCCCCCHHH Q ss_conf 998699899999999862917286659 Q gi|254780592|r 216 VGLQGRMKEAYSIASQELSPEEATRNI 242 (271) Q Consensus 216 ~~~~g~~~eA~~~~~~~L~~~~~~~~~ 242 (271) |..+.++++|+.-|++.+..+|..... T Consensus 178 yek~ek~eealeDyKki~E~dPs~~ea 204 (271) T KOG4234 178 YEKMEKYEEALEDYKKILESDPSRREA 204 (271) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 885551788899999998718206899 No 140 >KOG2796 consensus Probab=98.25 E-value=3.5e-05 Score=49.44 Aligned_cols=168 Identities=17% Similarity=0.175 Sum_probs=131.8 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHH------------CCCCC---------CHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 999999999874100124466655663------------27521---------001355553210000011258876422 Q gi|254780592|r 72 IGIVYADVLRRVGRTAQALAVMRQVAI------------LYPED---------QEVLAAYGKSLANAGYLDEGLDAINRA 130 (271) Q Consensus 72 ~~~~~a~~l~~~g~~~~A~~~~~~al~------------~~p~~---------~~~~~~~~~~~~~~g~~~~A~~~~~~a 130 (271) ..+-.|++....|.+.+.+.-+..+.. ..|+. ..+-..+..++...|+|.-.+..+.+. T Consensus 124 mR~lhAe~~~~lgnpqesLdRl~~L~~~V~~ii~~~e~~~~~ESsv~lW~KRl~~Vmy~~~~~llG~kEy~iS~d~~~~v 203 (366) T KOG2796 124 MRILHAELQQYLGNPQESLDRLHKLKTVVSKILANLEQGLAEESSIRLWRKRLGRVMYSMANCLLGMKEYVLSVDAYHSV 203 (366) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHEEHHHHHHH T ss_conf 99999998865699267799999999999999999882566042799999999999999999983630451109999999 Q ss_pred HHHH-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 1100-00233566999999974211111000000000000------0001245777654332235543599999995389 Q gi|254780592|r 131 QRPD-IPDWQLISAKGSVLAQMGKHSEALIEYERALELSP------NESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG 203 (271) Q Consensus 131 ~~~~-p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p------~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p 203 (271) ++.+ |.++.....+|.+-.+.|+.+.|...|...-+... ....+..+.+.+++.++++..|...+.+.+..+| T Consensus 204 i~~~~e~~p~L~s~Lgr~~MQ~GD~k~a~~yf~~vek~~~kL~~~q~~~~V~~n~a~i~lg~nn~a~a~r~~~~i~~~D~ 283 (366) T KOG2796 204 IKYYPEQEPQLLSGLGRISMQIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMNSAFLHLGQNNFAEAHRFFTEILRMDP 283 (366) T ss_pred HHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHEEECCCCHHHHHHHHHHCCCCCC T ss_conf 86387306799999878888604388899899999987614323404678876344202213335999998850422489 Q ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf 854788899999998699899999999862917286 Q gi|254780592|r 204 ADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEAT 239 (271) Q Consensus 204 ~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~ 239 (271) .++.+..+-|+++.-.|+..+|.++++..+...|.. T Consensus 284 ~~~~a~NnKALcllYlg~l~DAiK~~e~~~~~~P~~ 319 (366) T KOG2796 284 RNAVANNNKALCLLYLGKLKDALKQLEAMVQQDPRH 319 (366) T ss_pred CCHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 870220519999999877889999999876118862 No 141 >KOG4555 consensus Probab=98.25 E-value=8.8e-06 Score=53.09 Aligned_cols=87 Identities=15% Similarity=0.063 Sum_probs=38.1 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC----HHHHHHHHHHHHHCCHHHH Q ss_conf 420199999999999995757879999999999874100124466655663275210----0135555321000001125 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ----EVLAAYGKSLANAGYLDEG 123 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~----~~~~~~~~~~~~~g~~~~A 123 (271) ..|+++.|++.|.+++..-|+++.++.+.+.+++-.|+.++|+.-+++++++.-... .++...|.+|..+|+.+.| T Consensus 55 E~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trtacqa~vQRg~lyRl~g~dd~A 134 (175) T KOG4555 55 EAGDLDGALELFGQALCLAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRTACQAFVQRGLLYRLLGNDDAA 134 (175) T ss_pred HCCCHHHHHHHHHHHHHHCCCCHHHHCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 50456789999999998543353866139999987288187899899999862862069999999999999993761888 Q ss_pred HHHHHHHHHHH Q ss_conf 88764221100 Q gi|254780592|r 124 LDAINRAQRPD 134 (271) Q Consensus 124 ~~~~~~a~~~~ 134 (271) ...|..+-... T Consensus 135 R~DFe~AA~LG 145 (175) T KOG4555 135 RADFEAAAQLG 145 (175) T ss_pred HHHHHHHHHHC T ss_conf 88699999867 No 142 >KOG2796 consensus Probab=98.24 E-value=1.3e-05 Score=51.95 Aligned_cols=146 Identities=15% Similarity=0.179 Sum_probs=117.8 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCH------HHHHHHH Q ss_conf 999999999987410012446665566327-521001355553210000011258876422110000------2335669 Q gi|254780592|r 71 MIGIVYADVLRRVGRTAQALAVMRQVAILY-PEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIP------DWQLISA 143 (271) Q Consensus 71 ~~~~~~a~~l~~~g~~~~A~~~~~~al~~~-p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~------~~~~~~~ 143 (271) .+-+-+..++.-.|++.-++..++++++.+ |.++.....+|++.++.|+.+.+...+++..+.... ..-...+ T Consensus 178 ~Vmy~~~~~llG~kEy~iS~d~~~~vi~~~~e~~p~L~s~Lgr~~MQ~GD~k~a~~yf~~vek~~~kL~~~q~~~~V~~n 257 (366) T KOG2796 178 RVMYSMANCLLGMKEYVLSVDAYHSVIKYYPEQEPQLLSGLGRISMQIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMN 257 (366) T ss_pred HHHHHHHHHHHCCHHHHEEHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99999999983630451109999999863873067999998788886043888998999999876143234046788763 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHH Q ss_conf 999999742111110000000000000001245777654332235543599999995389854---7888999999 Q gi|254780592|r 144 KGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS---RIRQNLALVV 216 (271) Q Consensus 144 lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~---~~~~~La~~~ 216 (271) .+.++...+++..|...+.+.+..+|.++.+.++.+.+++..|+...|+..++.++++.|... ....||-.+| T Consensus 258 ~a~i~lg~nn~a~a~r~~~~i~~~D~~~~~a~NnKALcllYlg~l~DAiK~~e~~~~~~P~~~l~es~~~nL~tmy 333 (366) T KOG2796 258 SAFLHLGQNNFAEAHRFFTEILRMDPRNAVANNNKALCLLYLGKLKDALKQLEAMVQQDPRHYLHESVLFNLTTMY 333 (366) T ss_pred HHHEEECCCCHHHHHHHHHHCCCCCCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHH T ss_conf 4420221333599999885042248987022051999999987788999999987611886210145888789899 No 143 >COG3898 Uncharacterized membrane-bound protein [Function unknown] Probab=98.23 E-value=0.00034 Score=43.33 Aligned_cols=157 Identities=13% Similarity=0.067 Sum_probs=91.6 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHH-HHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 98420199999999999995757879999-99999987410012446665566327521001355553210000011258 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGI-VYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~-~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) .+-.|+++.|...|+..+ .+|.-...-+ .+-..-.+.|..+-|..+.+++..+-|.-+-+....-......|+|+.|+ T Consensus 130 al~eG~~~~Ar~kfeAMl-~dPEtRllGLRgLyleAqr~GareaAr~yAe~Aa~~Ap~l~WA~~AtLe~r~~~gdWd~Al 208 (531) T COG3898 130 ALLEGDYEDARKKFEAML-DDPETRLLGLRGLYLEAQRLGAREAARHYAERAAEKAPQLPWAARATLEARCAAGDWDGAL 208 (531) T ss_pred HHHCCCHHHHHHHHHHHH-CCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHH T ss_conf 886476187999999873-6838877767889999986021899999999987636677167999999998667868899 Q ss_pred HHHHHHHHH---HCHHHH---HHHHHHHH-HHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 876422110---000233---56699999-99742111110000000000000001245777654332235543599999 Q gi|254780592|r 125 DAINRAQRP---DIPDWQ---LISAKGSV-LAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRF 197 (271) Q Consensus 125 ~~~~~a~~~---~p~~~~---~~~~lg~~-~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~ 197 (271) +..+..... .+...+ .-..-+.. -.-..+...|.+....+.++.|+-...-..-+.++...|+..++-..++. T Consensus 209 kLvd~~~~~~vie~~~aeR~rAvLLtAkA~s~ldadp~~Ar~~A~~a~KL~pdlvPaav~AAralf~d~~~rKg~~ilE~ 288 (531) T COG3898 209 KLVDAQRAAKVIEKDVAERSRAVLLTAKAMSLLDADPASARDDALEANKLAPDLVPAAVVAARALFRDGNLRKGSKILET 288 (531) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 99998888775261167789899998888877248867788999987503876336999999998742462433248899 Q ss_pred HHHCCC Q ss_conf 995389 Q gi|254780592|r 198 ASQMIG 203 (271) Q Consensus 198 al~~~p 203 (271) +-+..| T Consensus 289 aWK~eP 294 (531) T COG3898 289 AWKAEP 294 (531) T ss_pred HHHCCC T ss_conf 873499 No 144 >COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones] Probab=98.23 E-value=2.9e-05 Score=49.93 Aligned_cols=126 Identities=14% Similarity=-0.010 Sum_probs=63.4 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH-HHHHHHHHHHHHHH Q ss_conf 55553210000011258876422110000233566999999974211111000000000000000-12457776543322 Q gi|254780592|r 108 AAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNES-SIVSNIAMSYLLMG 186 (271) Q Consensus 108 ~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~-~~~~~~~~~~~~~g 186 (271) ..-+.-....|++.++...+..++...|...+....++.++...|+.+.|...+...-....+.. ..+......+.+.. T Consensus 138 ~~~~~~~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~~~~~~~~l~a~i~ll~qaa 217 (304) T COG3118 138 LAEAKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHGLQAQIELLEQAA 217 (304) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 98766654165445589999999985845663777899999981885899999972850010457888999999999974 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 355435999999953898547888999999986998999999998629 Q gi|254780592|r 187 DLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 187 ~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) +..+ ...++..+..+|++.++.+.||..+...|+.++|.+.+=..+. T Consensus 218 ~~~~-~~~l~~~~aadPdd~~aa~~lA~~~~~~g~~e~Ale~Ll~~l~ 264 (304) T COG3118 218 ATPE-IQDLQRRLAADPDDVEAALALADQLHLVGRNEAALEHLLALLR 264 (304) T ss_pred CCCC-HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 5997-7899999870999889999999999973887999999999987 No 145 >KOG2610 consensus Probab=98.21 E-value=2.8e-05 Score=50.00 Aligned_cols=182 Identities=17% Similarity=0.099 Sum_probs=139.7 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHH-HCHH---HHHHHHHHHHHHH Q ss_conf 99999987410012446665566327521001355553210000011258876422110-0002---3356699999997 Q gi|254780592|r 75 VYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRP-DIPD---WQLISAKGSVLAQ 150 (271) Q Consensus 75 ~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~-~p~~---~~~~~~lg~~~~~ 150 (271) .-+.++.-.|++.+|-...++.++.+|.+-.++..--+.+..+|+.+.-...+++.+.. ++.. .-..-.++..+.. T Consensus 108 ~~aai~~~~g~~h~a~~~wdklL~d~PtDlla~kfsh~a~fy~G~~~~~k~ai~kIip~wn~dlp~~sYv~GmyaFgL~E 187 (491) T KOG2610 108 AKAAILWGRGKHHEAAIEWDKLLDDYPTDLLAVKFSHDAHFYNGNQIGKKNAIEKIIPKWNADLPCYSYVHGMYAFGLEE 187 (491) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHH T ss_conf 26787633566417788899998758503554454446787414144214579984334589985789988899866787 Q ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-----HHHHHHHHHHHHHCCCHHHH Q ss_conf 4211111000000000000000124577765433223554359999999538985-----47888999999986998999 Q gi|254780592|r 151 MGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD-----SRIRQNLALVVGLQGRMKEA 225 (271) Q Consensus 151 ~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~-----~~~~~~La~~~~~~g~~~eA 225 (271) .|-+++|.+...+++++++.+.=+....+-.+...|+..++.++.++--. +-.. +.-|...|+.+.+.+.++.| T Consensus 188 ~g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVlem~~r~Keg~eFM~~ted-~Wr~s~mlasHNyWH~Al~~iE~aeye~a 266 (491) T KOG2610 188 CGIYDDAEKQADRALQINRFDCWASHAKAHVLEMNGRHKEGKEFMYKTED-DWRQSWMLASHNYWHTALFHIEGAEYEKA 266 (491) T ss_pred HCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 25502699999863147874168898899999840205657999985314-12123688765068888864101113578 Q ss_pred HHHHHHHCC----CCCCCHHHHHH--HHHHHHHHHHHH Q ss_conf 999998629----17286659999--999983212999 Q gi|254780592|r 226 YSIASQELS----PEEATRNIKYI--KSILSQRDPWKK 257 (271) Q Consensus 226 ~~~~~~~L~----~~~~~~~~~~l--~~~l~~~~~~~~ 257 (271) +++|.+.+- .+.+.+...++ .++....+-|.+ T Consensus 267 leIyD~ei~k~l~k~Da~a~~~~ld~dgv~~~~d~~~k 304 (491) T KOG2610 267 LEIYDREIWKRLEKDDAVARDVYLDLDGVDLRSDLWRK 304 (491) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88887999987511331455544335567767888887 No 146 >KOG0551 consensus Probab=98.09 E-value=3.9e-05 Score=49.16 Aligned_cols=87 Identities=11% Similarity=0.048 Sum_probs=42.5 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCHH----HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC Q ss_conf 9998420199999999999995757879----999999999874100124466655663275210013555532100000 Q gi|254780592|r 44 LENMNHEQLLEVTSTIGLQYQSHTKNKM----IGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY 119 (271) Q Consensus 44 ~~~~~~~~~~~A~~~~~~al~~~P~~~~----~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~ 119 (271) ..++...+|..|+..|.+.++..-.|++ .+.+.|-+....|++..|+.-..+++..+|.+..++..-+.|+..+.+ T Consensus 89 N~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka~~R~Akc~~eLe~ 168 (390) T KOG0551 89 NEYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKAYIRGAKCLLELER 168 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 89877756768999998887641899237899884299999999989999999999861487304434356689999988 Q ss_pred HHHHHHHHHHH Q ss_conf 11258876422 Q gi|254780592|r 120 LDEGLDAINRA 130 (271) Q Consensus 120 ~~~A~~~~~~a 130 (271) +.+|..+.+.. T Consensus 169 ~~~a~nw~ee~ 179 (390) T KOG0551 169 FAEAVNWCEEG 179 (390) T ss_pred HHHHHHHHHHH T ss_conf 88877777653 No 147 >KOG3617 consensus Probab=98.06 E-value=0.00039 Score=43.03 Aligned_cols=147 Identities=18% Similarity=0.119 Sum_probs=82.0 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999874100124466655663275210013555532100000112588764221100002335669999999742111 Q gi|254780592|r 76 YADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHS 155 (271) Q Consensus 76 ~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~ 155 (271) .|..-...|-.++|..+|.+....+ .+...|...|.|++|.+..+.--... ....+++.+..+...++.+ T Consensus 806 vAvLAieLgMlEeA~~lYr~ckR~D--------LlNKlyQs~g~w~eA~eiAE~~DRiH--Lr~Tyy~yA~~Lear~Di~ 875 (1416) T KOG3617 806 VAVLAIELGMLEEALILYRQCKRYD--------LLNKLYQSQGMWSEAFEIAETKDRIH--LRNTYYNYAKYLEARRDIE 875 (1416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHCCCHHHHHHHHHHCCCEE--HHHHHHHHHHHHHHHCCHH T ss_conf 7899998734889999999888887--------99999986100888888775104031--2456877878888621288 Q ss_pred HHHHHHHHCCCC----------CC----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------------- Q ss_conf 110000000000----------00----------0001245777654332235543599999995--------------- Q gi|254780592|r 156 EALIEYERALEL----------SP----------NESSIVSNIAMSYLLMGDLKTAEEKLRFASQ--------------- 200 (271) Q Consensus 156 ~A~~~~~~~~~~----------~p----------~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~--------------- 200 (271) .|++.|+++-.. .| .+...|.++|..+.-.|+.+.|+.+|..|-. T Consensus 876 ~AleyyEK~~~hafev~rmL~e~p~~~e~Yv~~~~d~~L~~WWgqYlES~GemdaAl~~Y~~A~D~fs~VrI~C~qGk~~ 955 (1416) T KOG3617 876 AALEYYEKAGVHAFEVFRMLKEYPKQIEQYVRRKRDESLYSWWGQYLESVGEMDAALSFYSSAKDYFSMVRIKCIQGKTD 955 (1416) T ss_pred HHHHHHHHCCCHHHHHHHHHHHCHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHEEEEEECCCCH T ss_conf 99988875187387889998858388999998506528999999997524315788999987655554236676316735 Q ss_pred ------CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf ------38985478889999999869989999999986 Q gi|254780592|r 201 ------MIGADSRIRQNLALVVGLQGRMKEAYSIASQE 232 (271) Q Consensus 201 ------~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~ 232 (271) ....+..+-+.||..|...|+..+|...|.++ T Consensus 956 kAa~iA~esgd~AAcYhlaR~YEn~g~v~~Av~FfTrA 993 (1416) T KOG3617 956 KAARIAEESGDKAACYHLARMYENDGDVVKAVKFFTRA 993 (1416) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999886141788999999862247788899998998 No 148 >KOG4555 consensus Probab=98.06 E-value=2.4e-05 Score=50.40 Aligned_cols=90 Identities=27% Similarity=0.262 Sum_probs=57.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH----HHHHHHHHHHHH Q ss_conf 9999999742111110000000000000001245777654332235543599999995389854----788899999998 Q gi|254780592|r 143 AKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS----RIRQNLALVVGL 218 (271) Q Consensus 143 ~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~----~~~~~La~~~~~ 218 (271) .-|..+...|+.+.|++.|.+++.+-|...++++|.++++-.+|+.++|+..+.+|+++..+.. .++...|.+|.. T Consensus 48 l~~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trtacqa~vQRg~lyRl 127 (175) T KOG4555 48 LKAIALAEAGDLDGALELFGQALCLAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRTACQAFVQRGLLYRL 127 (175) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 99889885045678999999999854335386613999998728818789989999986286206999999999999999 Q ss_pred CCCHHHHHHHHHHH Q ss_conf 69989999999986 Q gi|254780592|r 219 QGRMKEAYSIASQE 232 (271) Q Consensus 219 ~g~~~eA~~~~~~~ 232 (271) +|+.+.|..-|+.+ T Consensus 128 ~g~dd~AR~DFe~A 141 (175) T KOG4555 128 LGNDDAARADFEAA 141 (175) T ss_pred HCCHHHHHHHHHHH T ss_conf 37618888869999 No 149 >KOG0376 consensus Probab=98.02 E-value=6.3e-06 Score=53.96 Aligned_cols=109 Identities=16% Similarity=0.140 Sum_probs=86.9 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 99999999874100124466655663275210013555532100000112588764221100002335669999999742 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG 152 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~ 152 (271) +-+.+......+.++.|+..+-++++++|++...+...+..+...+++-.|+..+.++++.+|.....|...|......+ T Consensus 7 ~k~ean~~l~~~~fd~avdlysKaI~ldpnca~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m~l~ 86 (476) T KOG0376 7 LKNEANEALKDKVFDVAVDLYSKAIELDPNCAIYFANRALAHLKVESFGGALHDALKAIELDPTYIKAYVRRGTAVMALG 86 (476) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCEEEECHHHHHHEEECHHHHHHHHHHHHHHCCCHHHHEEEECCHHHHHHH T ss_conf 14677654120019999999999985288632442302123432000336778777635408012330343117787577 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHH Q ss_conf 11111000000000000000124577765 Q gi|254780592|r 153 KHSEALIEYERALELSPNESSIVSNIAMS 181 (271) Q Consensus 153 ~~~~A~~~~~~~~~~~p~~~~~~~~~~~~ 181 (271) ++.+|...|+....+.|+++.+...+..+ T Consensus 87 ~~~~A~~~l~~~~~l~Pnd~~~~r~~~Ec 115 (476) T KOG0376 87 EFKKALLDLEKVKKLAPNDPDATRKIDEC 115 (476) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 79999999987662386768899878999 No 150 >PRK04841 transcriptional regulator MalT; Provisional Probab=98.01 E-value=0.00095 Score=40.63 Aligned_cols=57 Identities=25% Similarity=0.181 Sum_probs=33.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC------CHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 776543322355435999999953898------547888999999986998999999998629 Q gi|254780592|r 178 IAMSYLLMGDLKTAEEKLRFASQMIGA------DSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 178 ~~~~~~~~g~~~~A~~~~~~al~~~p~------~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) .+.+++..|+++.|...++........ ..++...++.++...|+.++|...+.+++. T Consensus 697 ~ar~~l~~g~~~~A~~~l~~l~~~a~~~~~~~~~~~~~ll~A~a~~~~g~~~~A~~~l~~AL~ 759 (903) T PRK04841 697 IARAQILLGQFDEAEIVLEELNENARSLRLMSDLNRNLILLNQLYWQQGRKSDAQRALLEALK 759 (903) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 999999829979999999999999987275346999999999999987999999999999999 No 151 >KOG4642 consensus Probab=98.00 E-value=7.1e-06 Score=53.67 Aligned_cols=88 Identities=11% Similarity=-0.020 Sum_probs=81.7 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH Q ss_conf 99842019999999999999575787999999999987410012446665566327521001355553210000011258 Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~ 124 (271) ...-...|..|+..|.+++..+|..+..+-+.|.++.+..+++....-..++++++|+.......+|..+.....+++|. T Consensus 19 k~f~~k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk~h~flg~~~l~s~~~~eaI 98 (284) T KOG4642 19 KCFIPKRYDDAIDCYSRAICINPTVASYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVKAHYFLGQWLLQSKGYDEAI 98 (284) T ss_pred CCCCHHHHCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHH T ss_conf 02530111468999999885489730244547888887620026666178898607287899999888887523660789 Q ss_pred HHHHHHHH Q ss_conf 87642211 Q gi|254780592|r 125 DAINRAQR 132 (271) Q Consensus 125 ~~~~~a~~ 132 (271) ..+.++.. T Consensus 99 ~~Lqra~s 106 (284) T KOG4642 99 KVLQRAYS 106 (284) T ss_pred HHHHHHHH T ss_conf 99999998 No 152 >KOG1914 consensus Probab=97.99 E-value=0.0011 Score=40.34 Aligned_cols=197 Identities=11% Similarity=0.024 Sum_probs=126.6 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHC--------------CHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHCC--- Q ss_conf 9999999575787999999999987410--------------0124466655663275-210013555532100000--- Q gi|254780592|r 58 TIGLQYQSHTKNKMIGIVYADVLRRVGR--------------TAQALAVMRQVAILYP-EDQEVLAAYGKSLANAGY--- 119 (271) Q Consensus 58 ~~~~al~~~P~~~~~~~~~a~~l~~~g~--------------~~~A~~~~~~al~~~p-~~~~~~~~~~~~~~~~g~--- 119 (271) .+++++..-+-.|++|+..+.-+...++ .+++.+++++++...- .+...+..++......-+ T Consensus 267 ayeQ~ll~l~~~peiWy~~s~yl~~~s~l~~~~~d~~~a~~~t~e~~~~yEr~I~~l~~~~~~Ly~~~a~~eE~~~~~n~ 346 (656) T KOG1914 267 AYEQCLLYLGYHPEIWYDYSMYLIEISDLLTEKGDVPDAKSLTDEAASIYERAIEGLLKENKLLYFALADYEESRYDDNK 346 (656) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 99999999743799999999999986689887423521134589999999998788777889999998766887514305 Q ss_pred HHHHHHHHHHHHHHHCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Q ss_conf 1125887642211000023-356699999997421111100000000000000012457776-54332235543599999 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDW-QLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAM-SYLLMGDLKTAEEKLRF 197 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~-~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~-~~~~~g~~~~A~~~~~~ 197 (271) .+.....+++.+.....+. -.+..+-+.-.+..-...|...|.++.+..-....++..-|. -|..++|...|...|+- T Consensus 347 ~~~~~~~~~~ll~~~~~~~tLv~~~~mn~irR~eGlkaaR~iF~kaR~~~r~~hhVfVa~A~mEy~cskD~~~AfrIFeL 426 (656) T KOG1914 347 EKKVHEIYNKLLKIEDIDLTLVYCQYMNFIRRAEGLKAARKIFKKAREDKRTRHHVFVAAALMEYYCSKDKETAFRIFEL 426 (656) T ss_pred HHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 44307999998732214874302699999998640788999999985356774135678899999862781488999998 Q ss_pred HHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC-CCCCHHHHHHHHHHHHHHH Q ss_conf 99538985478889999999869989999999986291-7286659999999983212 Q gi|254780592|r 198 ASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSP-EEATRNIKYIKSILSQRDP 254 (271) Q Consensus 198 al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~-~~~~~~~~~l~~~l~~~~~ 254 (271) -+...++.+.........+...|+-..+..++++++.. -+.+........++.-... T Consensus 427 GLkkf~d~p~yv~~YldfL~~lNdd~N~R~LFEr~l~s~l~~~ks~~Iw~r~l~yES~ 484 (656) T KOG1914 427 GLKKFGDSPEYVLKYLDFLSHLNDDNNARALFERVLTSVLSADKSKEIWDRMLEYESN 484 (656) T ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH T ss_conf 8875599769999999999983862108999999872158866668999999999872 No 153 >KOG1308 consensus Probab=97.98 E-value=1.8e-06 Score=57.35 Aligned_cols=94 Identities=12% Similarity=0.077 Sum_probs=67.4 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 99998420199999999999995757879999999999874100124466655663275210013555532100000112 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) +.+.++.|.++.|++.+.++++++|.....+...+.++...++...|+.-+..+++++|+...-+...+.....+|+|++ T Consensus 121 A~ealn~G~~~~ai~~~t~ai~lnp~~A~l~~kR~sv~lkl~kp~aairD~d~A~ein~Dsa~~ykfrg~A~rllg~~e~ 200 (377) T KOG1308 121 ASEALNDGEFDTAIELFTSAIELNPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEINPDSAKGYKFRGYAERLLGNWEE 200 (377) T ss_pred HHHHHCCCCHHHHHCCCCCCCCCCCCHHHHCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCHHH T ss_conf 89873276433331155312036994344105534100223687346540245541683102355303489987202287 Q ss_pred HHHHHHHHHHHHCH Q ss_conf 58876422110000 Q gi|254780592|r 123 GLDAINRAQRPDIP 136 (271) Q Consensus 123 A~~~~~~a~~~~p~ 136 (271) +...+..+...+.. T Consensus 201 aa~dl~~a~kld~d 214 (377) T KOG1308 201 AAHDLALACKLDYD 214 (377) T ss_pred HHHHHHHHHHCCCC T ss_conf 77799999861654 No 154 >KOG0545 consensus Probab=97.97 E-value=0.00015 Score=45.46 Aligned_cols=103 Identities=16% Similarity=0.104 Sum_probs=80.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC------------------CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 6699999997421111100000000------------------0000000124577765433223554359999999538 Q gi|254780592|r 141 ISAKGSVLAQMGKHSEALIEYERAL------------------ELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMI 202 (271) Q Consensus 141 ~~~lg~~~~~~~~~~~A~~~~~~~~------------------~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~ 202 (271) ....|+-+...|++.+|...|..++ +++......+.|+.++++..|+|-+++......+... T Consensus 181 l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~e~yevleh~seiL~~~ 260 (329) T KOG0545 181 LHQEGNRLFKLGRYKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKEEYYEVLEHCSEILRHH 260 (329) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 98750146662217789999999999999887404899818999987633778867778744878999999889998508 Q ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 98547888999999986998999999998629172866599 Q gi|254780592|r 203 GADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 203 p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~ 243 (271) |++..+++..|.+....=+.+||.+-+.++|..+|+-+.+. T Consensus 261 ~~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpslasvV 301 (329) T KOG0545 261 PGNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSLASVV 301 (329) T ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHH T ss_conf 85089999987787753688888889999986195667899 No 155 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.97 E-value=0.00016 Score=45.36 Aligned_cols=93 Identities=18% Similarity=0.037 Sum_probs=33.4 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf 5553210000011258876422110000---2335669999999742111110000000000000001245777654332 Q gi|254780592|r 109 AYGKSLANAGYLDEGLDAINRAQRPDIP---DWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLM 185 (271) Q Consensus 109 ~~~~~~~~~g~~~~A~~~~~~a~~~~p~---~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 185 (271) .++..+...+++++|...++.++....+ ..-.-..++.+...+|.++.|+..+.......=. ...-...|.+++.. T Consensus 94 ~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~t~~~~~w~-~~~~elrGDill~k 172 (207) T COG2976 94 ELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAALKTLDTIKEESWA-AIVAELRGDILLAK 172 (207) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH-HHHHHHHHHHHHHC T ss_conf 99999986342999999999987145257899999999999999860388899987230111078-89999742699972 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 23554359999999538 Q gi|254780592|r 186 GDLKTAEEKLRFASQMI 202 (271) Q Consensus 186 g~~~~A~~~~~~al~~~ 202 (271) |+.++|...|+++++.. T Consensus 173 g~k~~Ar~ay~kAl~~~ 189 (207) T COG2976 173 GDKQEARAAYEKALESD 189 (207) T ss_pred CCHHHHHHHHHHHHHCC T ss_conf 86599999999998715 No 156 >KOG2471 consensus Probab=97.87 E-value=4e-05 Score=49.08 Aligned_cols=26 Identities=31% Similarity=0.385 Sum_probs=14.4 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 88999999986998999999998629 Q gi|254780592|r 209 RQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 209 ~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) +++||..+..+|.+|+|.+++.++.. T Consensus 622 ~~nLa~a~alq~~~dqAk~ll~~aat 647 (696) T KOG2471 622 FANLAAALALQGHHDQAKSLLTHAAT 647 (696) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99899999872441789999998887 No 157 >PRK06330 transcript cleavage factor/unknown domain fusion protein; Validated Probab=97.83 E-value=0.002 Score=38.60 Aligned_cols=120 Identities=10% Similarity=0.039 Sum_probs=73.5 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHH------------------HHHHHHHHHCCCCCCHHHH Q ss_conf 8420199999999999995757879999999999874100124------------------4666556632752100135 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQA------------------LAVMRQVAILYPEDQEVLA 108 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A------------------~~~~~~al~~~p~~~~~~~ 108 (271) -..+..++|....+.-+..+|++..+++..|.+-...+.+..+ ..+.++.+ ..-++.-++. T Consensus 43 ~~~n~~~ea~~~~~~hl~~~~~sI~alY~~gil~l~~~~~~d~~l~~ll~~F~~~~kw~ivE~i~~~iL-~y~e~~~AL~ 121 (906) T PRK06330 43 KSENLTDEAKDICEEHLKEHKKSISALYISGILSLSRRPLNDSNLLNLLDSFKQNLKWAIVEHICDKIL-LYGENKLALY 121 (906) T ss_pred HHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH-HHCCCHHHHH T ss_conf 732417999999999985080036999998889861045632659999999986130069999999999-8531078999 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 555321000001125887642211000023356699999997421111100000000000 Q gi|254780592|r 109 AYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELS 168 (271) Q Consensus 109 ~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~ 168 (271) .++.+|....+.+++...+++....|-.+++..-.+|..+... +.+.|+.+|++++... T Consensus 122 ~la~~y~~~~~n~el~~iwerlv~~D~~~~e~~~~~a~~~e~~-d~~kai~y~k~A~~rf 180 (906) T PRK06330 122 TLAEAYAKLKKNKELKVIWERLVKADRDNPEIVKKLATSYEEE-DKEKSITYLKKAIYRF 180 (906) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHH T ss_conf 9999999835774389999999971344668999998888740-2688999999999999 No 158 >KOG1585 consensus Probab=97.81 E-value=0.0021 Score=38.46 Aligned_cols=189 Identities=17% Similarity=0.141 Sum_probs=126.7 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHH------HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH-----HHHHH Q ss_conf 999998420199999999999995757879------9999999998741001244666556632752100-----13555 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQSHTKNKM------IGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE-----VLAAY 110 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~~P~~~~------~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~-----~~~~~ 110 (271) -+...-+..+|++|...+.++.+-.-+|.. ++-..|..+....++.++..+++++..++-++.. .-... T Consensus 37 AAvafRnAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~eKAs~lY~E~GspdtAAmaleK 116 (308) T KOG1585 37 AAVAFRNAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVECGSPDTAAMALEK 116 (308) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 99999851357888999999999988321387888889999999999887279999999999999970995469999999 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCH------HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC------CCHHHHHHHHH Q ss_conf 53210000011258876422110000------2335669999999742111110000000000------00000124577 Q gi|254780592|r 111 GKSLANAGYLDEGLDAINRAQRPDIP------DWQLISAKGSVLAQMGKHSEALIEYERALEL------SPNESSIVSNI 178 (271) Q Consensus 111 ~~~~~~~g~~~~A~~~~~~a~~~~p~------~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~------~p~~~~~~~~~ 178 (271) +.-.....++++|++.|+++...-.. ..+.+...+.++.+...+++|-..+.+-... .+.....+... T Consensus 117 Aak~lenv~Pd~AlqlYqralavve~~dr~~ma~el~gk~sr~lVrl~kf~Eaa~a~lKe~~~~~~~~~y~~~~k~~va~ 196 (308) T KOG1585 117 AAKALENVKPDDALQLYQRALAVVEEDDRDQMAFELYGKCSRVLVRLEKFTEAATAFLKEGVAADKCDAYNSQCKAYVAA 196 (308) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99886238888999999999999850534878889998754573335775688999998635999986112389999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHC----CCCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 76543322355435999999953----898547888999999986998999999998 Q gi|254780592|r 179 AMSYLLMGDLKTAEEKLRFASQM----IGADSRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 179 ~~~~~~~g~~~~A~~~~~~al~~----~p~~~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) ...++...||..|..+++...+. .|++.+...+|-.. ...|+.++...++.. T Consensus 197 ilv~L~~~Dyv~aekc~r~~~qip~f~~sed~r~lenLL~a-yd~gD~E~~~kvl~s 252 (308) T KOG1585 197 ILVYLYAHDYVQAEKCYRDCSQIPAFLKSEDSRSLENLLTA-YDEGDIEEIKKVLSS 252 (308) T ss_pred HHHHHHHHHHHHHHHHHCCHHCCCCCCCHHHHHHHHHHHHH-HCCCCHHHHHHHHCC T ss_conf 99972088899999983523027664674778999999987-533788999999727 No 159 >COG3898 Uncharacterized membrane-bound protein [Function unknown] Probab=97.80 E-value=0.0022 Score=38.36 Aligned_cols=195 Identities=14% Similarity=0.035 Sum_probs=122.9 Q ss_pred HHCCHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH-HHHHHHHHHHHHCCHHHHH Q ss_conf 4201999999999999957578--799999999998741001244666556632752100-1355553210000011258 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKN--KMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE-VLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~--~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~-~~~~~~~~~~~~g~~~~A~ 124 (271) ..||-..|...-.+.-+.-..| |-+++.-+..-.-.|+++.|..-|+-- --+|+... .+..+..--...|..+.|. T Consensus 96 gAGda~lARkmt~~~~~llssDqepLIhlLeAQaal~eG~~~~Ar~kfeAM-l~dPEtRllGLRgLyleAqr~GareaAr 174 (531) T COG3898 96 GAGDASLARKMTARASKLLSSDQEPLIHLLEAQAALLEGDYEDARKKFEAM-LDDPETRLLGLRGLYLEAQRLGAREAAR 174 (531) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHH-HCCHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 167568999999998765424645899999988988647618799999987-3683887776788999998602189999 Q ss_pred HHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH--------HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 8764221100002335669999999742111110000000000000001--------24577765433223554359999 Q gi|254780592|r 125 DAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESS--------IVSNIAMSYLLMGDLKTAEEKLR 196 (271) Q Consensus 125 ~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~--------~~~~~~~~~~~~g~~~~A~~~~~ 196 (271) .+..++-...|..+=.+...-......|+++.|++......+..--... .+.-.+...+ ..|...|...-. T Consensus 175 ~yAe~Aa~~Ap~l~WA~~AtLe~r~~~gdWd~AlkLvd~~~~~~vie~~~aeR~rAvLLtAkA~s~l-dadp~~Ar~~A~ 253 (531) T COG3898 175 HYAERAAEKAPQLPWAARATLEARCAAGDWDGALKLVDAQRAAKVIEKDVAERSRAVLLTAKAMSLL-DADPASARDDAL 253 (531) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHH T ss_conf 9999987636677167999999998667868899999988887752611677898999988888772-488677889999 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHH Q ss_conf 9995389854788899999998699899999999862917286659999 Q gi|254780592|r 197 FASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYI 245 (271) Q Consensus 197 ~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l 245 (271) +++++.|+....-..-+..+...|+..++-.+++.+...+| +..+..+ T Consensus 254 ~a~KL~pdlvPaav~AAralf~d~~~rKg~~ilE~aWK~eP-HP~ia~l 301 (531) T COG3898 254 EANKLAPDLVPAAVVAARALFRDGNLRKGSKILETAWKAEP-HPDIALL 301 (531) T ss_pred HHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CHHHHHH T ss_conf 87503876336999999998742462433248899873499-8489999 No 160 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=97.77 E-value=0.0012 Score=40.09 Aligned_cols=81 Identities=20% Similarity=0.123 Sum_probs=53.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHH Q ss_conf 00001245777654332235543599999995389854788899999998699899999999862917286659999999 Q gi|254780592|r 169 PNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSI 248 (271) Q Consensus 169 p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~ 248 (271) +..+.++..++...+.+|+.++|..++.++.... .+.-.|.-+|.++...|+.+.|.+.|-++.-.++......+...+ T Consensus 409 ~~~~riyeAlAl~al~~~~~~~A~~~L~~~~~~~-~Svl~YILlgK~aEl~G~~~~A~e~Y~~Af~~~~s~~tl~l~enL 487 (512) T PRK10153 409 NVLSRIYEALAVQALVKGDTDEAYQALNKAIELE-MSWLNYVLLGKVAELKGDNRLAADAYSTAFNLRPGENTYYLIENL 487 (512) T ss_pred CCCCHHHHHHHHHHHHCCCHHHHHHHHHHCHHHH-HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 8872889999999984799999999998675645-569999999999996499668899999998038838999998625 Q ss_pred HH Q ss_conf 98 Q gi|254780592|r 249 LS 250 (271) Q Consensus 249 l~ 250 (271) .. T Consensus 488 vF 489 (512) T PRK10153 488 VF 489 (512) T ss_pred CC T ss_conf 30 No 161 >PRK11906 transcriptional regulator; Provisional Probab=97.76 E-value=8.6e-05 Score=47.01 Aligned_cols=145 Identities=12% Similarity=0.085 Sum_probs=99.5 Q ss_pred HHHHHHHHHHHHCCC---CCCHHHHHHHHHHHHH---CCH------HHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH Q ss_conf 124466655663275---2100135555321000---001------1258876422110000233566999999974211 Q gi|254780592|r 87 AQALAVMRQVAILYP---EDQEVLAAYGKSLANA---GYL------DEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKH 154 (271) Q Consensus 87 ~~A~~~~~~al~~~p---~~~~~~~~~~~~~~~~---g~~------~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~ 154 (271) .+|+..|.+.....| .+..-+..++.|+..+ |.+ ++|.+..+++.+.++.++.+...+|.+-.-.|++ T Consensus 275 ~rAl~if~~~~n~s~~q~~~t~~Yc~LAEChlSlAl~G~se~e~A~~kA~e~~d~a~~i~~~n~qaLgllgLIsgL~~q~ 354 (458) T PRK11906 275 YRAMTIFDRLQNKSDIQTLKTECYCLLAECHMSLALHGKSELELAAQKALELLDYVSDITTVDGKILAIMGLITGLSGQA 354 (458) T ss_pred HHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHH T ss_conf 99999999874167887566665011999999999846227889999999999998836777767889999986134236 Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH-HHHCCCHHHHHHHHHH Q ss_conf 1110000000000000001245777654332235543599999995389854788899999-9986998999999998 Q gi|254780592|r 155 SEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALV-VGLQGRMKEAYSIASQ 231 (271) Q Consensus 155 ~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~-~~~~g~~~eA~~~~~~ 231 (271) .-+...|+++--+.|+..+.++..+...+..|+.++|.....+.++++|....+-..---+ ..--.-.+.+++.+=+ T Consensus 355 ~Vs~~LFeQA~ll~p~s~di~yy~~~~~F~~g~leea~~~I~ksL~leP~r~~a~i~k~~v~~y~~~p~~n~i~~~~~ 432 (458) T PRK11906 355 KVSHILFEQAKIHSTDIASLYYYRALVHFHNEKIEEARICIDKSLQLEPRRRKAVVIKECVDMYVPNPLKNNIKLYYK 432 (458) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHCCCHHHHHHHHCCCCCCCCHHHHHHH T ss_conf 698999987414699863312133353563374088999999986149401002456777651269985212678763 No 162 >COG3107 LppC Putative lipoprotein [General function prediction only] Probab=97.75 E-value=0.0021 Score=38.57 Aligned_cols=137 Identities=15% Similarity=0.103 Sum_probs=87.3 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHH-HHHHHHHHCCHHHHHHHH Q ss_conf 9999999999985403685433117999999842019999999999999575787999999-999987410012446665 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNPRISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVY-ADVLRRVGRTAQALAVMR 94 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~-a~~l~~~g~~~~A~~~~~ 94 (271) ..+.+++.+++++||++..+..... .+...-...+.-++.++-+..+++-..|..+ +.++.+.|+.++|...+. T Consensus 13 r~l~~illa~vla~C~t~~~~~~~~-----~lq~~a~a~s~~yl~qa~qs~~~~~~~~~llAa~al~~e~k~~qA~~Ll~ 87 (604) T COG3107 13 RLLTPILLALVLAGCSTFLPSGSVV-----LLQGTANASSQFYLQQAQQSSGEQQNDWLLLAARALVEEGKTAQAQALLN 87 (604) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCHH-----HCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 0038999999997604678998601-----11678211199999997633701566499999999997478288999998 Q ss_pred HHH-HCCCCC-CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH-CH--HHHHHHHHHHHHHHHHHHHHH Q ss_conf 566-327521-0013555532100000112588764221100-00--233566999999974211111 Q gi|254780592|r 95 QVA-ILYPED-QEVLAAYGKSLANAGYLDEGLDAINRAQRPD-IP--DWQLISAKGSVLAQMGKHSEA 157 (271) Q Consensus 95 ~al-~~~p~~-~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~-p~--~~~~~~~lg~~~~~~~~~~~A 157 (271) ++- .+.|.. .+.-...+.+.....++..|...+.+....+ |. ..+.|...+.+....|+.-++ T Consensus 88 ql~~~Ltd~Q~~~~~LL~ael~la~~q~~~Al~~L~~~~~~~ls~~Qq~Ry~q~~a~a~ea~~~~~~a 155 (604) T COG3107 88 QLPQELTDAQRAEKSLLAAELALAQKQPAAALQQLAKLLPADLSQNQQARYYQARADALEARGDSIDA 155 (604) T ss_pred HCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 65623798999999999999998422729999998520321167788999999999998613215899 No 163 >pfam08424 DUF1740 Protein of unknown function (DUF1740). This is a family of eukaryotic proteins of unknown function. Probab=97.74 E-value=0.00082 Score=41.01 Aligned_cols=89 Identities=9% Similarity=0.008 Sum_probs=63.3 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHH--------------CCHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHHHCCH Q ss_conf 99999999957578799999999998741--------------00124466655663-2752100135555321000001 Q gi|254780592|r 56 TSTIGLQYQSHTKNKMIGIVYADVLRRVG--------------RTAQALAVMRQVAI-LYPEDQEVLAAYGKSLANAGYL 120 (271) Q Consensus 56 ~~~~~~al~~~P~~~~~~~~~a~~l~~~g--------------~~~~A~~~~~~al~-~~p~~~~~~~~~~~~~~~~g~~ 120 (271) ...+.+.++.+|.|.+.|+.+.......- -.+.=++++++|++ .+|++...+..+-.......+. T Consensus 5 ~~eln~~v~e~P~di~~Wleli~~Qd~~~~~~~~~~~~~~~~~laE~KlsilEkAL~~~np~~~~Lll~~l~~~~~~w~~ 84 (231) T pfam08424 5 NAELNRLVRENPEDIDAWIELIRHQEELLRRGSGLLTKAERKQLAEIKLSILEKALKEHNPDSERLLLGLLEEGEKVWDT 84 (231) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCH T ss_conf 89999998719625999999999999886413322336789999999999999999985899789999999999985787 Q ss_pred HHHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 125887642211000023356699 Q gi|254780592|r 121 DEGLDAINRAQRPDIPDWQLISAK 144 (271) Q Consensus 121 ~~A~~~~~~a~~~~p~~~~~~~~l 144 (271) ++....+++.+..+|.+...|..+ T Consensus 85 ~~l~~~We~~l~~~p~~~~LW~~y 108 (231) T pfam08424 85 DELLSRWEKVLKQNPGSPSLWRKY 108 (231) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 999999999999889975999999 No 164 >KOG0545 consensus Probab=97.74 E-value=0.00038 Score=43.09 Aligned_cols=106 Identities=16% Similarity=0.044 Sum_probs=88.1 Q ss_pred CHHHHHHHHHHHHHCCHHHHHHHHHHHH------------------HHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 0013555532100000112588764221------------------1000023356699999997421111100000000 Q gi|254780592|r 104 QEVLAAYGKSLANAGYLDEGLDAINRAQ------------------RPDIPDWQLISAKGSVLAQMGKHSEALIEYERAL 165 (271) Q Consensus 104 ~~~~~~~~~~~~~~g~~~~A~~~~~~a~------------------~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~ 165 (271) ..++...|.-++..|+|.+|...|..++ +++......+.++..|+...|++-++++.-...+ T Consensus 178 v~~l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~e~yevleh~seiL 257 (329) T KOG0545 178 VPVLHQEGNRLFKLGRYKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKEEYYEVLEHCSEIL 257 (329) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 28898750146662217789999999999999887404899818999987633778867778744878999999889998 Q ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 00000001245777654332235543599999995389854788 Q gi|254780592|r 166 ELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIR 209 (271) Q Consensus 166 ~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~ 209 (271) ...|.+..+++..|.+....=+..+|.+.|.++++.+|.-..+. T Consensus 258 ~~~~~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpslasvV 301 (329) T KOG0545 258 RHHPGNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSLASVV 301 (329) T ss_pred HCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHH T ss_conf 50885089999987787753688888889999986195667899 No 165 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.72 E-value=0.0022 Score=38.42 Aligned_cols=129 Identities=14% Similarity=0.146 Sum_probs=80.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC---CHHHHHHHHH Q ss_conf 799999984201999999999999957578799---9999999987410012446665566327521---0013555532 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHTKNKMI---GIVYADVLRRVGRTAQALAVMRQVAILYPED---QEVLAAYGKS 113 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~---~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~---~~~~~~~~~~ 113 (271) ..+.....+.+.. +......+....+|.+..+ .+.+|..+...|++++|...++.++....+. .-+-..++++ T Consensus 57 Y~~~i~~~~ak~~-~~~~~~ekf~~~n~~t~Ya~laaL~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLArv 135 (207) T COG2976 57 YQNAIKAVQAKKP-KSIAAAEKFVQANGKTIYAVLAALELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARV 135 (207) T ss_pred HHHHHHHHHCCCC-HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 9999999855770-259999999861563589999999999999863429999999999871452578999999999999 Q ss_pred HHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 100000112588764221100002335669999999742111110000000000000 Q gi|254780592|r 114 LANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN 170 (271) Q Consensus 114 ~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~ 170 (271) +...|.+|+|+..++...... -.+.....+|.++..+|+..+|...|.+++...+. T Consensus 136 q~q~~k~D~AL~~L~t~~~~~-w~~~~~elrGDill~kg~k~~Ar~ay~kAl~~~~s 191 (207) T COG2976 136 QLQQKKADAALKTLDTIKEES-WAAIVAELRGDILLAKGDKQEARAAYEKALESDAS 191 (207) T ss_pred HHHHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 998603888999872301110-78899997426999728659999999999871577 No 166 >KOG4642 consensus Probab=97.71 E-value=6.9e-05 Score=47.61 Aligned_cols=92 Identities=15% Similarity=0.011 Sum_probs=70.9 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999874100124466655663275210013555532100000112588764221100002335669999999742111 Q gi|254780592|r 76 YADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHS 155 (271) Q Consensus 76 ~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~ 155 (271) -|..+....+++.|+..|-+++.++|..+.++...+.+++++.+|+.+.....++++++|+.....+.+|........++ T Consensus 16 ~gnk~f~~k~y~~ai~~y~raI~~nP~~~~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk~h~flg~~~l~s~~~~ 95 (284) T KOG4642 16 QGNKCFIPKRYDDAIDCYSRAICINPTVASYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVKAHYFLGQWLLQSKGYD 95 (284) T ss_pred CCCCCCCHHHHCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 24602530111468999999885489730244547888887620026666178898607287899999888887523660 Q ss_pred HHHHHHHHCCCC Q ss_conf 110000000000 Q gi|254780592|r 156 EALIEYERALEL 167 (271) Q Consensus 156 ~A~~~~~~~~~~ 167 (271) .|+..+.++..+ T Consensus 96 eaI~~Lqra~sl 107 (284) T KOG4642 96 EAIKVLQRAYSL 107 (284) T ss_pred HHHHHHHHHHHH T ss_conf 789999999988 No 167 >PRK06330 transcript cleavage factor/unknown domain fusion protein; Validated Probab=97.66 E-value=0.0037 Score=37.03 Aligned_cols=158 Identities=9% Similarity=0.061 Sum_probs=101.0 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHH Q ss_conf 99999957578799999999998741001244666556632752100135555321000001125887642211000023 Q gi|254780592|r 59 IGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDW 138 (271) Q Consensus 59 ~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~ 138 (271) -++.+ ...++..++..+|.+|.+..+.+++..+++++++.+-+++.....++..+..- +.++|..+|++|+...-..- T Consensus 107 ~~~iL-~y~e~~~AL~~la~~y~~~~~n~el~~iwerlv~~D~~~~e~~~~~a~~~e~~-d~~kai~y~k~A~~rf~~~k 184 (906) T PRK06330 107 CDKIL-LYGENKLALYTLAEAYAKLKKNKELKVIWERLVKADRDNPEIVKKLATSYEEE-DKEKSITYLKKAIYRFIKKK 184 (906) T ss_pred HHHHH-HHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CHHHHHHHHHHHHHHHHHHH T ss_conf 99999-85310789999999999835774389999999971344668999998888740-26889999999999998775 Q ss_pred HHHHHHHHHHHHH-----HHHHHHHHHHHHCCCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 3566999999974-----2111110000000000--00000124577765433223554359999999538985478889 Q gi|254780592|r 139 QLISAKGSVLAQM-----GKHSEALIEYERALEL--SPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQN 211 (271) Q Consensus 139 ~~~~~lg~~~~~~-----~~~~~A~~~~~~~~~~--~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~ 211 (271) + +...-.++... .+++--....+..... .......+..+-..|....+++.++..++..++.++.+..++.+ T Consensus 185 ~-~~~~~eiw~klv~~~~dd~~~f~~i~~~~~~~~~~~~~~~l~~~i~~~y~~~e~~~~~i~~lK~iL~~d~~~~kaR~~ 263 (906) T PRK06330 185 Q-YVGIEEIWSKLVHNNSDDFDFFLRIERKILGHREFTRLVGLLYDIVEPYKQLENWDEVIYILKKILEHDNKSNKAREE 263 (906) T ss_pred H-HCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCEEEEEHHHHHHHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 4-122889999998608631789999999998764347036641676788761276889999999997335010688999 Q ss_pred HHHHHHHC Q ss_conf 99999986 Q gi|254780592|r 212 LALVVGLQ 219 (271) Q Consensus 212 La~~~~~~ 219 (271) |..+|... T Consensus 264 li~~~k~k 271 (906) T PRK06330 264 LIRFYKEK 271 (906) T ss_pred HHHHHHHH T ss_conf 99999986 No 168 >PRK04841 transcriptional regulator MalT; Provisional Probab=97.63 E-value=0.004 Score=36.78 Aligned_cols=84 Identities=20% Similarity=0.201 Sum_probs=37.0 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCHH-----HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC------HHHHHHHHHHH Q ss_conf 8420199999999999995757879-----999999999874100124466655663275210------01355553210 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNKM-----IGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ------EVLAAYGKSLA 115 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~~-----~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~------~~~~~~~~~~~ 115 (271) ...++++.+......++..-|.+.. +...++..+...|++++|...+..+........ ......+.+.. T Consensus 463 ~~~~~~~~a~~~a~~al~~l~~~~~~~~~~~~~~l~~~~~~~G~l~~a~~~~~~a~~~~~~~~~~~~~~~~~~~~a~i~~ 542 (903) T PRK04841 463 INDGDPEEAERLAKLALAELPLTWYYSRIVATSVLGEVHHCKGELARALAMMQQTEQMARQHDVYHYALWSLLQQSEILF 542 (903) T ss_pred HHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 85899999999999997536763479899999999999997799999999999999999983853579999999999999 Q ss_pred HHCCHHHHHHHHHHH Q ss_conf 000011258876422 Q gi|254780592|r 116 NAGYLDEGLDAINRA 130 (271) Q Consensus 116 ~~g~~~~A~~~~~~a 130 (271) ..|+...|...++++ T Consensus 543 ~~G~l~~A~~~~~~a 557 (903) T PRK04841 543 AQGFLQAAYETQEKA 557 (903) T ss_pred HCCCHHHHHHHHHHH T ss_conf 787899999999999 No 169 >pfam06552 TOM20_plant Plant specific mitochondrial import receptor subunit TOM20. This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesized in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria. Probab=97.62 E-value=0.00025 Score=44.20 Aligned_cols=87 Identities=9% Similarity=0.168 Sum_probs=63.8 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCH----------HHHHHHHHHHHCCCCCCHHHHHHHHHHHH----- Q ss_conf 999999999999957578799999999998741001----------24466655663275210013555532100----- Q gi|254780592|r 52 LLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTA----------QALAVMRQVAILYPEDQEVLAAYGKSLAN----- 116 (271) Q Consensus 52 ~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~----------~A~~~~~~al~~~p~~~~~~~~~~~~~~~----- 116 (271) |+.|...-+..+++||.|++.+.+-|-++....++. .|++-+++++.++|....++--+|.++-. T Consensus 7 FE~aRk~AE~~y~~nP~DadnLtRWGGALLELsqFq~~~ds~~Mi~dAisKlEeAL~InP~khdalWclGNA~TS~~Fl~ 86 (186) T pfam06552 7 FEHARKSAEATYKKNPLDADNLTRWGGALLELSQFQSVPDAKKMIQDAISKLEEALVINPKKHDALWCIGNAYTSHAFLT 86 (186) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCCCCHHHEECCCCCCCCCCC T ss_conf 99999999998852984588898888999999861264479999999999999987139887533330054110151317 Q ss_pred ------HCCHHHHHHHHHHHHHHHCHHH Q ss_conf ------0001125887642211000023 Q gi|254780592|r 117 ------AGYLDEGLDAINRAQRPDIPDW 138 (271) Q Consensus 117 ------~g~~~~A~~~~~~a~~~~p~~~ 138 (271) .+.+++|..+|+++...+|.+. T Consensus 87 pD~~~A~~~F~kA~~~Fq~AvdeeP~Ne 114 (186) T pfam06552 87 PDQTEAKDNFDKATQFFQQAVDEQPDND 114 (186) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 8789999889999999999863498618 No 170 >PRK11906 transcriptional regulator; Provisional Probab=97.60 E-value=0.00052 Score=42.22 Aligned_cols=128 Identities=15% Similarity=0.082 Sum_probs=95.1 Q ss_pred HHHHHHHHHHHHHH---HCHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 11258876422110---000233566999999974---------211111000000000000000124577765433223 Q gi|254780592|r 120 LDEGLDAINRAQRP---DIPDWQLISAKGSVLAQM---------GKHSEALIEYERALELSPNESSIVSNIAMSYLLMGD 187 (271) Q Consensus 120 ~~~A~~~~~~a~~~---~p~~~~~~~~lg~~~~~~---------~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~ 187 (271) ...|+..|.+.... .|....-|..++.|+... ....+|.+...++.++++.++.++.-+|.+-...|+ T Consensus 274 l~rAl~if~~~~n~s~~q~~~t~~Yc~LAEChlSlAl~G~se~e~A~~kA~e~~d~a~~i~~~n~qaLgllgLIsgL~~q 353 (458) T PRK11906 274 IYRAMTIFDRLQNKSDIQTLKTECYCLLAECHMSLALHGKSELELAAQKALELLDYVSDITTVDGKILAIMGLITGLSGQ 353 (458) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCEEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCH T ss_conf 99999999987416788756666501199999999984622788999999999999883677776788999998613423 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 554359999999538985478889999999869989999999986291728665999999 Q gi|254780592|r 188 LKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKS 247 (271) Q Consensus 188 ~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~ 247 (271) ..-+...|++|--+.|...++++..+......|+.++|..+.+++|..+|...-....+. T Consensus 354 ~~Vs~~LFeQA~ll~p~s~di~yy~~~~~F~~g~leea~~~I~ksL~leP~r~~a~i~k~ 413 (458) T PRK11906 354 AKVSHILFEQAKIHSTDIASLYYYRALVHFHNEKIEEARICIDKSLQLEPRRRKAVVIKE 413 (458) T ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHCCCHHHH T ss_conf 669899998741469986331213335356337408899999998614940100245677 No 171 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=97.59 E-value=0.0047 Score=36.37 Aligned_cols=108 Identities=18% Similarity=0.151 Sum_probs=45.0 Q ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHH----HH----HHHHHHHHHHHHHH--CCCCCHHHHHHHHHHHHHCCCHHH Q ss_conf 111000000000000000124577765433----22----35543599999995--389854788899999998699899 Q gi|254780592|r 155 SEALIEYERALELSPNESSIVSNIAMSYLL----MG----DLKTAEEKLRFASQ--MIGADSRIRQNLALVVGLQGRMKE 224 (271) Q Consensus 155 ~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~----~g----~~~~A~~~~~~al~--~~p~~~~~~~~La~~~~~~g~~~e 224 (271) ..+++.++++++..|++.-+.......+.. .+ +...--..+++... ..+..+++|.-+|......|+.++ T Consensus 351 ~~~idLl~~~~~~ePdn~Yv~Ae~yi~~~~~~~l~~~~~~~l~~Ln~~l~~~~~~~~~~~~~riyeAlAl~al~~~~~~~ 430 (512) T PRK10153 351 NKGSDLLEEILKSEPDFTYALAEKALADHVRQSLQPLDEKQLAALNTELDNIVALPGVNVLSRIYEALAVQALVKGDTDE 430 (512) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCHHH T ss_conf 99999999999729983899999999999998549888999999999999987332468872889999999984799999 Q ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999986291728665999999998321299999999 Q gi|254780592|r 225 AYSIASQELSPEEATRNIKYIKSILSQRDPWKKIAKAR 262 (271) Q Consensus 225 A~~~~~~~L~~~~~~~~~~~l~~~l~~~~~~~~~a~a~ 262 (271) |..++.++...+.+--....++...+-++.-..++.++ T Consensus 431 A~~~L~~~~~~~~Svl~YILlgK~aEl~G~~~~A~e~Y 468 (512) T PRK10153 431 AYQALNKAIELEMSWLNYVLLGKVAELKGDNRLAADAY 468 (512) T ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99999867564556999999999999649966889999 No 172 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=97.58 E-value=0.0048 Score=36.34 Aligned_cols=144 Identities=13% Similarity=-0.067 Sum_probs=97.8 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC---CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHH--- Q ss_conf 87999999999987410012446665566327521---001355553210000011258876422110000233566--- Q gi|254780592|r 69 NKMIGIVYADVLRRVGRTAQALAVMRQVAILYPED---QEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLIS--- 142 (271) Q Consensus 69 ~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~---~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~--- 142 (271) -+..+++-|....+.|++++|+..|+.+...+|.. ..+...++.++...++++.|....++-+...|.++..-. T Consensus 33 p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~~~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~Y 112 (254) T COG4105 33 PASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADYAYY 112 (254) T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 89999999999986378899999999998709998112899999999988135589999999999987899987169999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH---HH----------- Q ss_conf 9999999742111110000000000000001245777654332235543599999995389854---78----------- Q gi|254780592|r 143 AKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS---RI----------- 208 (271) Q Consensus 143 ~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~---~~----------- 208 (271) ..|.++... +-.....+.-..+|...|++.++.-|++. ++ T Consensus 113 lkgLs~~~~--------------------------i~~~~rDq~~~~~A~~~f~~~i~ryPnS~Ya~dA~~~i~~~~d~L 166 (254) T COG4105 113 LKGLSYFFQ--------------------------IDDVTRDQSAARAAFAAFKELVQRYPNSRYAPDAKARIVKLNDAL 166 (254) T ss_pred HHHHHHHCC--------------------------CCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 999987526--------------------------775444779999999999999987889702355999999999998 Q ss_pred ---HHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf ---889999999869989999999986291728 Q gi|254780592|r 209 ---RQNLALVVGLQGRMKEAYSIASQELSPEEA 238 (271) Q Consensus 209 ---~~~La~~~~~~g~~~eA~~~~~~~L~~~~~ 238 (271) =...|..|..-|.+--|..-++..+..-|. T Consensus 167 A~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~ 199 (254) T COG4105 167 AGHEMAIARYYLKRGAYVAAINRFEEVLENYPD 199 (254) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 899999999999905769999999999871455 No 173 >KOG0530 consensus Probab=97.58 E-value=0.0023 Score=38.30 Aligned_cols=172 Identities=13% Similarity=0.059 Sum_probs=134.9 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHH-HHHHH Q ss_conf 201999999999999957578799999999998741-0012446665566327521001355553210000011-25887 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVG-RTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLD-EGLDA 126 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g-~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~-~A~~~ 126 (271) .+.-..|......++..+|.|-.+|--.-.++...+ +..+-+.++.+.++.+|.+-++|...-.+....|++. .-++. T Consensus 56 ~E~S~RAl~LT~d~i~lNpAnYTVW~yRr~iL~~l~~dL~~El~~l~eI~e~npKNYQvWHHRr~ive~l~d~s~rELef 135 (318) T KOG0530 56 NEKSPRALQLTEDAIRLNPANYTVWQYRRVILRHLMSDLNKELEYLDEIIEDNPKNYQVWHHRRVIVELLGDPSFRELEF 135 (318) T ss_pred CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 65687899989999972854214899999999986888999999999998738652147799999999806941103789 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH-HH-----HHHHHHHHHHHHHH Q ss_conf 6422110000233566999999974211111000000000000000124577765433-22-----35543599999995 Q gi|254780592|r 127 INRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLL-MG-----DLKTAEEKLRFASQ 200 (271) Q Consensus 127 ~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~-~g-----~~~~A~~~~~~al~ 200 (271) ....+..+.++.-+|..+-.+....+.++.-.......++.+-.+.++|+..-..... .| ..+.-+.+..+.+. T Consensus 136 ~~~~l~~DaKNYHaWshRqW~~r~F~~~~~EL~y~~~Lle~Di~NNSAWN~Ryfvi~~~~~~~~~~~le~El~yt~~~I~ 215 (318) T KOG0530 136 TKLMLDDDAKNYHAWSHRQWVLRFFKDYEDELAYADELLEEDIRNNSAWNQRYFVITNTKGVISKAELERELNYTKDKIL 215 (318) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHEEEEEEEECCCCCCHHHHHHHHHHHHHHHH T ss_conf 99997111110026689999999875078899999999987620220222025888824677659999999999999997 Q ss_pred CCCCCHHHHHHHHHHHHH-CC Q ss_conf 389854788899999998-69 Q gi|254780592|r 201 MIGADSRIRQNLALVVGL-QG 220 (271) Q Consensus 201 ~~p~~~~~~~~La~~~~~-~g 220 (271) ..|++..+|..|..++.. .| T Consensus 216 ~vP~NeSaWnYL~G~l~~d~g 236 (318) T KOG0530 216 LVPNNESAWNYLKGLLELDSG 236 (318) T ss_pred HCCCCCCHHHHHHHHHHHCCC T ss_conf 288874189999999996147 No 174 >pfam02259 FAT FAT domain. The FAT domain is named after FRAP, ATM and TRRAP. Probab=97.57 E-value=0.0014 Score=39.56 Aligned_cols=33 Identities=18% Similarity=0.006 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 235543599999995389854788899999998 Q gi|254780592|r 186 GDLKTAEEKLRFASQMIGADSRIRQNLALVVGL 218 (271) Q Consensus 186 g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~ 218 (271) ++.++.+..|+.|++.+|+...+|..+|..+.. T Consensus 272 ~~~~~il~~y~~A~~~~~~w~Kaw~~wa~~~~~ 304 (351) T pfam02259 272 GKKDEILKAYRTATQFDDQWYKAWHSWALANFE 304 (351) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 579999999999998786648999999999999 No 175 >KOG1308 consensus Probab=97.49 E-value=0.00021 Score=44.60 Aligned_cols=86 Identities=14% Similarity=0.075 Sum_probs=39.7 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 74100124466655663275210013555532100000112588764221100002335669999999742111110000 Q gi|254780592|r 82 RVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEY 161 (271) Q Consensus 82 ~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~ 161 (271) ..|.++.|++.+-.+++++|.....+...+.++..++++..|...+..++..+|+...-+-.+|.....+|++++|...+ T Consensus 126 n~G~~~~ai~~~t~ai~lnp~~A~l~~kR~sv~lkl~kp~aairD~d~A~ein~Dsa~~ykfrg~A~rllg~~e~aa~dl 205 (377) T KOG1308 126 NDGEFDTAIELFTSAIELNPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEINPDSAKGYKFRGYAERLLGNWEEAAHDL 205 (377) T ss_pred CCCCHHHHHCCCCCCCCCCCCHHHHCCCCCCEEEECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCHHHHHHHH T ss_conf 27643333115531203699434410553410022368734654024554168310235530348998720228777799 Q ss_pred HHCCCC Q ss_conf 000000 Q gi|254780592|r 162 ERALEL 167 (271) Q Consensus 162 ~~~~~~ 167 (271) ..+.++ T Consensus 206 ~~a~kl 211 (377) T KOG1308 206 ALACKL 211 (377) T ss_pred HHHHHC T ss_conf 999861 No 176 >KOG1585 consensus Probab=97.47 E-value=0.0067 Score=35.43 Aligned_cols=162 Identities=12% Similarity=0.040 Sum_probs=113.2 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH------HHHHHHHHHHHHCCHHHHHHHHHHHHHHH-----CHHHHH Q ss_conf 9999999998741001244666556632752100------13555532100000112588764221100-----002335 Q gi|254780592|r 72 IGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE------VLAAYGKSLANAGYLDEGLDAINRAQRPD-----IPDWQL 140 (271) Q Consensus 72 ~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~------~~~~~~~~~~~~g~~~~A~~~~~~a~~~~-----p~~~~~ 140 (271) .+..-+.+++...++++|...+.++.+-+..+.. ++...+.....+..+.++...++++.... |..+.. T Consensus 33 ~yekAAvafRnAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~eKAs~lY~E~GspdtAAm 112 (308) T KOG1585 33 LYEKAAVAFRNAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVECGSPDTAAM 112 (308) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 99999999985135788899999999998832138788888999999999988727999999999999997099546999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH------HHHHHHHHHHHHHHHHHHHHHHHHHHH----HHCCCCC--HHH Q ss_conf 6699999997421111100000000000000------012457776543322355435999999----9538985--478 Q gi|254780592|r 141 ISAKGSVLAQMGKHSEALIEYERALELSPNE------SSIVSNIAMSYLLMGDLKTAEEKLRFA----SQMIGAD--SRI 208 (271) Q Consensus 141 ~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~------~~~~~~~~~~~~~~g~~~~A~~~~~~a----l~~~p~~--~~~ 208 (271) -...+.-.....++++|+..|.+++.+...+ .+.+...+..++....+++|...+.+- .+...-+ -.. T Consensus 113 aleKAak~lenv~Pd~AlqlYqralavve~~dr~~ma~el~gk~sr~lVrl~kf~Eaa~a~lKe~~~~~~~~~y~~~~k~ 192 (308) T KOG1585 113 ALEKAAKALENVKPDDALQLYQRALAVVEEDDRDQMAFELYGKCSRVLVRLEKFTEAATAFLKEGVAADKCDAYNSQCKA 192 (308) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 99999988623888899999999999985053487888999875457333577568899999863599998611238999 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 8899999998699899999999862 Q gi|254780592|r 209 RQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 209 ~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) +..+.+++....++..|+.+++..- T Consensus 193 ~va~ilv~L~~~Dyv~aekc~r~~~ 217 (308) T KOG1585 193 YVAAILVYLYAHDYVQAEKCYRDCS 217 (308) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCHH T ss_conf 9999999720888999999835230 No 177 >KOG0530 consensus Probab=97.46 E-value=0.0014 Score=39.53 Aligned_cols=167 Identities=11% Similarity=0.001 Sum_probs=108.7 Q ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCH-HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 01999999999999957578799999999998741001-24466655663275210013555532100000112588764 Q gi|254780592|r 50 EQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTA-QALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAIN 128 (271) Q Consensus 50 ~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~-~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~ 128 (271) .++.+-.++++..++.+|+|-.+|-..-.+....|++. +-+++.+..+..+..+-.+|...-.+...-+.++.-+.... T Consensus 92 ~dL~~El~~l~eI~e~npKNYQvWHHRr~ive~l~d~s~rELef~~~~l~~DaKNYHaWshRqW~~r~F~~~~~EL~y~~ 171 (318) T KOG0530 92 SDLNKELEYLDEIIEDNPKNYQVWHHRRVIVELLGDPSFRELEFTKLMLDDDAKNYHAWSHRQWVLRFFKDYEDELAYAD 171 (318) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999999999998738652147799999999806941103789999971111100266899999998750788999999 Q ss_pred HHHHHHCHHHHHHHHHHHHHHH-H-----HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH-HH--HHHHHHHHHHHHH Q ss_conf 2211000023356699999997-4-----211111000000000000000124577765433-22--3554359999999 Q gi|254780592|r 129 RAQRPDIPDWQLISAKGSVLAQ-M-----GKHSEALIEYERALELSPNESSIVSNIAMSYLL-MG--DLKTAEEKLRFAS 199 (271) Q Consensus 129 ~a~~~~p~~~~~~~~lg~~~~~-~-----~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~-~g--~~~~A~~~~~~al 199 (271) ..+..+-.+-.+|+.+-.+... . -..+.-+....+.+...|++.++|..+...+.. .| -...-........ T Consensus 172 ~Lle~Di~NNSAWN~Ryfvi~~~~~~~~~~~le~El~yt~~~I~~vP~NeSaWnYL~G~l~~d~gl~s~s~vv~f~~~l~ 251 (318) T KOG0530 172 ELLEEDIRNNSAWNQRYFVITNTKGVISKAELERELNYTKDKILLVPNNESAWNYLKGLLELDSGLSSDSKVVSFVENLY 251 (318) T ss_pred HHHHHHHHCCCHHHEEEEEEEECCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 99987620220222025888824677659999999999999997288874189999999996147768804999999986 Q ss_pred -HCCCCCHHHHHHHHHHH Q ss_conf -53898547888999999 Q gi|254780592|r 200 -QMIGADSRIRQNLALVV 216 (271) Q Consensus 200 -~~~p~~~~~~~~La~~~ 216 (271) +.+-.++...--+..+| T Consensus 252 ~~~~~~sP~lla~l~d~~ 269 (318) T KOG0530 252 LQLPKRSPFLLAFLLDLY 269 (318) T ss_pred HCCCCCCHHHHHHHHHHH T ss_conf 145778865799999999 No 178 >pfam03704 BTAD Bacterial transcriptional activator domain. Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats. Probab=97.45 E-value=0.0017 Score=39.13 Aligned_cols=61 Identities=18% Similarity=0.151 Sum_probs=50.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 0124577765433223554359999999538985478889999999869989999999986 Q gi|254780592|r 172 SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQE 232 (271) Q Consensus 172 ~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~ 232 (271) ..+...++..+...|++++|+..+++++..+|-+-.++..|-.++...|+..+|...|++. T Consensus 62 ~~a~~~l~~~~~~~g~~~~a~~~~~~~l~~dPl~E~~~~~Lm~a~~~~G~~~~Al~~Y~~~ 122 (146) T pfam03704 62 LRALERRIEADLRLGRHEEALAELRALVALHPLRERLHRQLMRALYRSGRQAEALRVYRRL 122 (146) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 9999999999988598302199999999868874299999999999889999999999999 No 179 >KOG1258 consensus Probab=97.40 E-value=0.0082 Score=34.89 Aligned_cols=173 Identities=14% Similarity=0.036 Sum_probs=78.4 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC-CCCCHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 42019999999999999575787999999999987410012446665566327-52100135555321000001125887 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILY-PEDQEVLAAYGKSLANAGYLDEGLDA 126 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~-p~~~~~~~~~~~~~~~~g~~~~A~~~ 126 (271) ..|++......+++++.-=..-++.|++++.-....|+.+-|..++..+.+.+ |..+......+.+....|+++.|... T Consensus 309 ~~g~~~~~~~l~ercli~cA~Y~efWiky~~~m~~~~~~~~~~~~~~~~~~i~~k~~~~i~L~~a~f~e~~~n~~~A~~~ 388 (577) T KOG1258 309 TLGDFSRVFILFERCLIPCALYDEFWIKYARWMESSGDVSLANNVLARACKIHVKKTPIIHLLEARFEESNGNFDDAKVI 388 (577) T ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHH T ss_conf 40517889999999873775238899999999997176267779997432110778707999999999962469999999 Q ss_pred HHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH---HHHHHCCCCCCH---HHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Q ss_conf 64221100002335669999999742111110---000000000000---0012457776-5433223554359999999 Q gi|254780592|r 127 INRAQRPDIPDWQLISAKGSVLAQMGKHSEAL---IEYERALELSPN---ESSIVSNIAM-SYLLMGDLKTAEEKLRFAS 199 (271) Q Consensus 127 ~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~---~~~~~~~~~~p~---~~~~~~~~~~-~~~~~g~~~~A~~~~~~al 199 (271) +++....-|...+.-.....+..++|+.+.+. ..+....+-..+ ....+...+. .+.-.++.+.|...+.+++ T Consensus 389 lq~i~~e~pg~v~~~l~~~~~e~r~~~~~~~~~~~~l~s~~~~~~~~~~i~~~l~~~~~r~~~~i~~d~~~a~~~l~~~~ 468 (577) T KOG1258 389 LQRIESEYPGLVEVVLRKINWERRKGNLEDANYKNELYSSIYEGKENNGILEKLYVKFARLRYKIREDADLARIILLEAN 468 (577) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 99988657763556778876999834245566899999886045447544688999999999998227889999999752 Q ss_pred HCCCCCHHHHHHHHHHHHHCC Q ss_conf 538985478889999999869 Q gi|254780592|r 200 QMIGADSRIRQNLALVVGLQG 220 (271) Q Consensus 200 ~~~p~~~~~~~~La~~~~~~g 220 (271) +..|.+...+..+-.+....+ T Consensus 469 ~~~~~~k~~~~~~~~~~~~~~ 489 (577) T KOG1258 469 DILPDCKVLYLELIRFELIQP 489 (577) T ss_pred HCCCCCHHHHHHHHHHHHHCC T ss_conf 017720799999989998377 No 180 >pfam10300 IML2 Putative mitochondrial outer membrane protein. This is a family of proteins conserved from fungi to humans. Members of this family also carry a TPR_2 domain pfam07719 at their C-terminus. Many of them are annotated as being mitochondrial outer membrane protein IML2. Probab=97.38 E-value=0.0047 Score=36.35 Aligned_cols=105 Identities=15% Similarity=0.074 Sum_probs=55.9 Q ss_pred CCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH----HHHHHHHHHHHHHHHHHHHHH Q ss_conf 0011258876422110000233566999999974211111000000000000000----124577765433223554359 Q gi|254780592|r 118 GYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNES----SIVSNIAMSYLLMGDLKTAEE 193 (271) Q Consensus 118 g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~----~~~~~~~~~~~~~g~~~~A~~ 193 (271) +..+.+.+.+......-|+..-.....|.+....|+.++|+..+..+.+....-. -.++.++..+..+.++++|.. T Consensus 230 ~~~~~a~~lL~~~~~~yP~s~l~l~~~gR~~~~~g~l~~Ai~~~~~~~~~~~~~kQ~~~lc~~El~w~~~~~~~w~~A~~ 309 (446) T pfam10300 230 GPLEEAEALLEPSRKRFPNSALWLFFEARIESLKGNLDEALELFEECIESQSEWKQIHHLCYWELAWCHVFLHRWKQAAN 309 (446) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88999999999988538994689999999999737999999999986744654499999999999999999978999999 Q ss_pred HHHHHHHCCCCCH-HHHHHHHHHHHHCCCH Q ss_conf 9999995389854-7888999999986998 Q gi|254780592|r 194 KLRFASQMIGADS-RIRQNLALVVGLQGRM 222 (271) Q Consensus 194 ~~~~al~~~p~~~-~~~~~La~~~~~~g~~ 222 (271) ++....+.+.=+. -..+..|.++...++. T Consensus 310 ~~~~L~~~s~WSka~Y~Y~~a~c~~~~~~~ 339 (446) T pfam10300 310 YFLLLSDESKWSKALYAYLAAACLLMLGRE 339 (446) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHHHCCC T ss_conf 999999845418999999999999865330 No 181 >PRK10941 putative transcriptional regulator; Provisional Probab=97.32 E-value=0.0029 Score=37.66 Aligned_cols=81 Identities=10% Similarity=-0.004 Sum_probs=72.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH Q ss_conf 12457776543322355435999999953898547888999999986998999999998629172866599999999832 Q gi|254780592|r 173 SIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQR 252 (271) Q Consensus 173 ~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~~ 252 (271) ..+.++-.+|....+++.|+.+.+..+...|+++.-+...|.+|.++|.+..|...++.-+...|.+.....++..+..- T Consensus 182 RmL~nLK~~~~~e~~~~~AL~~~e~lL~l~Pddp~EiRDRGli~~qL~c~~~Al~DLe~fle~~P~dp~ae~ik~qi~~l 261 (269) T PRK10941 182 KLLDTLKAALMEEKQMELALRASEALLQFNPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRAQIHSI 261 (269) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 99999999998511499999999999862969979999999999984994899999999999785972899999999998 Q ss_pred H Q ss_conf 1 Q gi|254780592|r 253 D 253 (271) Q Consensus 253 ~ 253 (271) + T Consensus 262 ~ 262 (269) T PRK10941 262 E 262 (269) T ss_pred H T ss_conf 6 No 182 >KOG0551 consensus Probab=97.31 E-value=0.0013 Score=39.82 Aligned_cols=106 Identities=14% Similarity=0.147 Sum_probs=81.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 2335669999999742111110000000000000001----245777654332235543599999995389854788899 Q gi|254780592|r 137 DWQLISAKGSVLAQMGKHSEALIEYERALELSPNESS----IVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNL 212 (271) Q Consensus 137 ~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~----~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~L 212 (271) .++.+-.-|+-|.+.+++..|..+|.+.++..-.+++ .+.|.+.+....|+|+.++..+.+++.++|.+..++++- T Consensus 80 ~Aen~KeeGN~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka~~R~ 159 (390) T KOG0551 80 QAENYKEEGNEYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKAYIRG 159 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 99988887689877756768999998887641899237899884299999999989999999999861487304434356 Q ss_pred HHHHHHCCCHHHHHHHHHHHCCCCCCCHHH Q ss_conf 999998699899999999862917286659 Q gi|254780592|r 213 ALVVGLQGRMKEAYSIASQELSPEEATRNI 242 (271) Q Consensus 213 a~~~~~~g~~~eA~~~~~~~L~~~~~~~~~ 242 (271) |.++..+.++++|..-.+..+..+...... T Consensus 160 Akc~~eLe~~~~a~nw~ee~~~~d~e~K~~ 189 (390) T KOG0551 160 AKCLLELERFAEAVNWCEEGLQIDDEAKKA 189 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 689999988888777776531101778999 No 183 >KOG1586 consensus Probab=97.27 E-value=0.012 Score=33.97 Aligned_cols=161 Identities=12% Similarity=0.069 Sum_probs=106.6 Q ss_pred HCCHHHHHHHHHHHHHHCCC-----CH-HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHH------HHHHHHHHHH Q ss_conf 20199999999999995757-----87-999999999987410012446665566327521001------3555532100 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTK-----NK-MIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEV------LAAYGKSLAN 116 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~-----~~-~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~------~~~~~~~~~~ 116 (271) ..++..|-..|-++-+.+-+ +. ..+...+.+ ++.+++.+|+.++++++++..+..+. +..++.+|.. T Consensus 47 aK~w~~AG~aflkaA~~h~k~~skhDaat~YveA~~c-ykk~~~~eAv~cL~~aieIyt~~Grf~~aAk~~~~iaEiyEs 125 (288) T KOG1586 47 AKNWSAAGDAFLKAADLHLKAGSKHDAATTYVEAANC-YKKVDPEEAVNCLEKAIEIYTDMGRFTMAAKHHIEIAEIYES 125 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-HHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8868887899999999988608811378899999887-621584889999999999998522777877511139999862 Q ss_pred H-CCHHHHHHHHHHHHHHHCHHH------HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHH-------HHHHHHHH Q ss_conf 0-001125887642211000023------356699999997421111100000000000000012-------45777654 Q gi|254780592|r 117 A-GYLDEGLDAINRAQRPDIPDW------QLISAKGSVLAQMGKHSEALIEYERALELSPNESSI-------VSNIAMSY 182 (271) Q Consensus 117 ~-g~~~~A~~~~~~a~~~~p~~~------~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~-------~~~~~~~~ 182 (271) - .++++|+.+|+.+-+.-..+. ..+.--+..-...+++.+|+..|++.....-++... ....|.++ T Consensus 126 dl~d~ekaI~~YE~Aae~yk~ees~ssANKC~lKvA~yaa~leqY~~Ai~iyeqva~~s~~n~LLKys~KdyflkAgLCh 205 (288) T KOG1586 126 DLQDFEKAIAHYEQAAEYYKGEESVSSANKCLLKVAQYAAQLEQYSKAIDIYEQVARSSLDNNLLKYSAKDYFLKAGLCH 205 (288) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 27779999999999999970015552688999999999999999999999999999986036487767888999999886 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 3322355435999999953898547888 Q gi|254780592|r 183 LLMGDLKTAEEKLRFASQMIGADSRIRQ 210 (271) Q Consensus 183 ~~~g~~~~A~~~~~~al~~~p~~~~~~~ 210 (271) +...|.-.+...+++..+.+|..++.+. T Consensus 206 l~~~D~v~a~~ALeky~~~dP~F~dsRE 233 (288) T KOG1586 206 LCKADEVNAQRALEKYQELDPAFTDSRE 233 (288) T ss_pred HHCCCHHHHHHHHHHHHHCCCCCCCCHH T ss_conf 7302077899999988753986464088 No 184 >KOG3824 consensus Probab=97.26 E-value=0.00072 Score=41.35 Aligned_cols=66 Identities=12% Similarity=0.089 Sum_probs=39.4 Q ss_pred HHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 420199999999999995757879999999999874100124466655663275210013555532 Q gi|254780592|r 48 NHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKS 113 (271) Q Consensus 48 ~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~ 113 (271) ..|+.+.|...|.-++..+|.++++++.+|.......+.-+|-.+|-+++..+|.+.+++.+..+. T Consensus 128 ~~Gk~ekA~~lfeHAlalaP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseALvnR~RT 193 (472) T KOG3824 128 KDGKLEKAMTLFEHALALAPTNPQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEALVNRART 193 (472) T ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCHHHHHHHHCC T ss_conf 643468899999999830899779999976888851446766556420232179855877503115 No 185 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.21 E-value=0.013 Score=33.62 Aligned_cols=93 Identities=14% Similarity=0.049 Sum_probs=47.7 Q ss_pred CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCH--HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC----HHHH Q ss_conf 433117999999842019999999999999575787--9999999999874100124466655663275210----0135 Q gi|254780592|r 35 RISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNK--MIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ----EVLA 108 (271) Q Consensus 35 ~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~--~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~----~~~~ 108 (271) ........++...+.+..++|...|...-+.+-.+. -+.++.|.++.+.|+...|+..|.++-...|-.. .+.. T Consensus 57 ~sgd~flaAL~lA~~~k~d~Alaaf~~lektg~g~YpvLA~mr~at~~a~kgdta~AV~aFdeia~dt~~P~~~rd~ARl 136 (221) T COG4649 57 KSGDAFLAALKLAQENKTDDALAAFTDLEKTGYGSYPVLARMRAATLLAQKGDTAAAVAAFDEIAADTSIPQIGRDLARL 136 (221) T ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 32589999999987079668999999998648885569999999999861264788999899885257876013679999 Q ss_pred HHHHHHHHHCCHHHHHHHH Q ss_conf 5553210000011258876 Q gi|254780592|r 109 AYGKSLANAGYLDEGLDAI 127 (271) Q Consensus 109 ~~~~~~~~~g~~~~A~~~~ 127 (271) .-+..+...|-|++..... T Consensus 137 raa~lLvD~gsy~dV~srv 155 (221) T COG4649 137 RAAYLLVDNGSYDDVSSRV 155 (221) T ss_pred HHHHHHHCCCCHHHHHHHH T ss_conf 8789874356488899874 No 186 >KOG1914 consensus Probab=97.19 E-value=0.013 Score=33.73 Aligned_cols=146 Identities=10% Similarity=0.052 Sum_probs=78.7 Q ss_pred CHHHHHHHHHHHHCCCCC-CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Q ss_conf 012446665566327521-00135555321000001125887642211000023356699999-9974211111000000 Q gi|254780592|r 86 TAQALAVMRQVAILYPED-QEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSV-LAQMGKHSEALIEYER 163 (271) Q Consensus 86 ~~~A~~~~~~al~~~p~~-~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~-~~~~~~~~~A~~~~~~ 163 (271) .+..-+++++++.....+ .-+++.+-..-.+..-...|...|.++.+........+..-+.+ |.-.++.+-|...|+- T Consensus 347 ~~~~~~~~~~ll~~~~~~~tLv~~~~mn~irR~eGlkaaR~iF~kaR~~~r~~hhVfVa~A~mEy~cskD~~~AfrIFeL 426 (656) T KOG1914 347 EKKVHEIYNKLLKIEDIDLTLVYCQYMNFIRRAEGLKAARKIFKKAREDKRTRHHVFVAAALMEYYCSKDKETAFRIFEL 426 (656) T ss_pred HHHHHHHHHHHHHHHCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 44307999998732214874302699999998640788999999985356774135678899999862781488999998 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCC-HHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 00000000012457776543322355435999999953--8985-47888999999986998999999998 Q gi|254780592|r 164 ALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQM--IGAD-SRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 164 ~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~--~p~~-~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) .+...++++....-+...+...|+-..+...|++++.. +|+. .++|...-.--..-|+..-..++-++ T Consensus 427 GLkkf~d~p~yv~~YldfL~~lNdd~N~R~LFEr~l~s~l~~~ks~~Iw~r~l~yES~vGdL~si~~lekR 497 (656) T KOG1914 427 GLKKFGDSPEYVLKYLDFLSHLNDDNNARALFERVLTSVLSADKSKEIWDRMLEYESNVGDLNSILKLEKR 497 (656) T ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 88755997699999999999838621089999998721588666689999999998722549999999999 No 187 >KOG2471 consensus Probab=97.18 E-value=0.0038 Score=36.94 Aligned_cols=43 Identities=23% Similarity=0.224 Sum_probs=20.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 1245777654332235543599999995389854788899999 Q gi|254780592|r 173 SIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALV 215 (271) Q Consensus 173 ~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~ 215 (271) .+++|.|..|+..|++-.|..+|.+++...-.+++.|.+||.+ T Consensus 336 eilYNcG~~~Lh~grPl~AfqCf~~av~vfh~nPrlWLRlAEc 378 (696) T KOG2471 336 EILYNCGLLYLHSGRPLLAFQCFQKAVHVFHRNPRLWLRLAEC 378 (696) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 4677423799863890799999999999982183899999999 No 188 >KOG4507 consensus Probab=97.15 E-value=0.004 Score=36.84 Aligned_cols=110 Identities=15% Similarity=0.070 Sum_probs=70.7 Q ss_pred HHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 2110000233566999999974211111000000000000000--12457776543322355435999999953898547 Q gi|254780592|r 130 AQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNES--SIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSR 207 (271) Q Consensus 130 a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~--~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~ 207 (271) +.+..+.++.....-+..+...|+..+|..++..++-..+... .++..+|.++.+.|...+|.-.+..|+.-.|.-.. T Consensus 205 glq~~~~sw~lH~~as~YWR~~G~~~~A~~Ca~~a~hf~~~h~kdi~lLSlaTiL~RaG~sadA~iILhAA~~dA~~~t~ 284 (886) T KOG4507 205 GLQKNTSSWVLHNMASFYWRIKGEPYQAVECAMRALHFSSRHNKDIALLSLATVLHRAGFSADAAVILHAALDDADFFTS 284 (886) T ss_pred HHHCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCCCCC T ss_conf 65418436889988999999738854445789987651783001005767999998712421113331134057753356 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCC Q ss_conf 88899999998699899999999862917286 Q gi|254780592|r 208 IRQNLALVVGLQGRMKEAYSIASQELSPEEAT 239 (271) Q Consensus 208 ~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~ 239 (271) -++.++.++..+|.+....-+|..++..+|.- T Consensus 285 n~y~l~~i~aml~~~N~S~~~ydha~k~~p~f 316 (886) T KOG4507 285 NYYTLGNIYAMLGEYNHSVLCYDHALQARPGF 316 (886) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 62018999999866202665356664038530 No 189 >KOG4507 consensus Probab=97.14 E-value=0.0069 Score=35.33 Aligned_cols=46 Identities=15% Similarity=0.109 Sum_probs=14.3 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHH Q ss_conf 5532100000112588764221100002335669999999742111 Q gi|254780592|r 110 YGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHS 155 (271) Q Consensus 110 ~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~ 155 (271) +|.++.+.|.--+|--++--++...|.....+..++.++...+.+. T Consensus 255 laTiL~RaG~sadA~iILhAA~~dA~~~t~n~y~l~~i~aml~~~N 300 (886) T KOG4507 255 LATVLHRAGFSADAAVILHAALDDADFFTSNYYTLGNIYAMLGEYN 300 (886) T ss_pred HHHHHHHCCCCCCHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHH T ss_conf 9999987124211133311340577533566201899999986620 No 190 >pfam03704 BTAD Bacterial transcriptional activator domain. Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats. Probab=97.14 E-value=0.0038 Score=36.96 Aligned_cols=114 Identities=18% Similarity=0.154 Sum_probs=67.0 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH Q ss_conf 99999999874100124466655663275210013555532100000112588764221100002335669999999742 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG 152 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~ 152 (271) +...|......|++.+|+..+.+++.+.-.+.-.-.. .+.|-.. ...............++..+...| T Consensus 9 ~~~~a~~a~~~gd~~~a~~~~~~Al~LyrG~~L~~~~-------~~~W~~~-----~r~~l~~~~~~a~~~l~~~~~~~g 76 (146) T pfam03704 9 LVRAGRRALAAGDPAEAARLLRAALALWRGPALADVP-------AGPWLEA-----ERERLEELRLRALERRIEADLRLG 76 (146) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCC-------CCHHHHH-----HHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999887899999999999999855898877898-------5168999-----999999999999999999998859 Q ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1111100000000000000012457776543322355435999999 Q gi|254780592|r 153 KHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFA 198 (271) Q Consensus 153 ~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 198 (271) +++.++......+..+|-+..++..+-.++...|+..+|+..|++. T Consensus 77 ~~~~a~~~~~~~l~~dPl~E~~~~~Lm~a~~~~G~~~~Al~~Y~~~ 122 (146) T pfam03704 77 RHEEALAELRALVALHPLRERLHRQLMRALYRSGRQAEALRVYRRL 122 (146) T ss_pred CCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 8302199999999868874299999999999889999999999999 No 191 >pfam06552 TOM20_plant Plant specific mitochondrial import receptor subunit TOM20. This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesized in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria. Probab=97.10 E-value=0.0013 Score=39.88 Aligned_cols=104 Identities=17% Similarity=0.174 Sum_probs=55.9 Q ss_pred CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 00124466655663275210013555532100000112588764221100002335669999999742111110000000 Q gi|254780592|r 85 RTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERA 164 (271) Q Consensus 85 ~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~ 164 (271) -++.|....+.....+|.+..-+..+|-++..+-++...-+ .....++|+.-++.+ T Consensus 6 fFE~aRk~AE~~y~~nP~DadnLtRWGGALLELsqFq~~~d------------------------s~~Mi~dAisKlEeA 61 (186) T pfam06552 6 FFEHARKSAEATYKKNPLDADNLTRWGGALLELSQFQSVPD------------------------AKKMIQDAISKLEEA 61 (186) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHH------------------------HHHHHHHHHHHHHHH T ss_conf 99999999999885298458889888899999986126447------------------------999999999999998 Q ss_pred CCCCCHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 00000000124577765433-----------2235543599999995389854788899 Q gi|254780592|r 165 LELSPNESSIVSNIAMSYLL-----------MGDLKTAEEKLRFASQMIGADSRIRQNL 212 (271) Q Consensus 165 ~~~~p~~~~~~~~~~~~~~~-----------~g~~~~A~~~~~~al~~~p~~~~~~~~L 212 (271) +.++|.-.++++-+|.++.- ...+++|..+|++|+..+|+|.-.+..| T Consensus 62 L~InP~khdalWclGNA~TS~~Fl~pD~~~A~~~F~kA~~~Fq~AvdeeP~NelYrKsL 120 (186) T pfam06552 62 LVINPKKHDALWCIGNAYTSHAFLTPDQTEAKDNFDKATQFFQQAVDEQPDNDLYRKSL 120 (186) T ss_pred HCCCCCCCCHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 71398875333300541101513178789999889999999999863498618999999 No 192 >KOG3824 consensus Probab=97.09 E-value=0.0018 Score=38.95 Aligned_cols=144 Identities=13% Similarity=0.135 Sum_probs=90.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH--HHHHHHHHCCHHH---------HHHHHHHHHHHCCCC- Q ss_conf 2787569999989999999999999854036854331179--9999984201999---------999999999957578- Q gi|254780592|r 2 IILFCPVILFLKKIKCIVFLIPLLLFSSCHLNPRISIPDP--ESLENMNHEQLLE---------VTSTIGLQYQSHTKN- 69 (271) Q Consensus 2 ~~~~~~~~~f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~---------A~~~~~~al~~~P~~- 69 (271) +|.|+-+..-++.+.++..+..++.+-.-+ .++...... .......+.+... +...-....+-+|.+ T Consensus 35 iii~v~~sl~~~~f~a~s~~~tv~k~~q~~-~~p~~~~l~~~~~ya~~~~~~~~~~~~~yd~~~~~~v~at~te~~pa~~ 113 (472) T KOG3824 35 IIISVLVSLICQHFVAVSTLHTVIKPKQKS-SPPPSNRLNSPEKYAPAVQKPTFLVDPIYDEQTAVPVMSTQTENDPAKV 113 (472) T ss_pred EHHHHHHHHHHHHHHHHHHHHHHHCHHHHC-CCCCCCCCCCHHHCCCCCCCCHHHCCCHHCCCCCCCCCHHHCCCCCHHH T ss_conf 325788876546778777888861545306-8998334567022030336814320430022013420012123672206 Q ss_pred --HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHH Q ss_conf --79999999999874100124466655663275210013555532100000112588764221100002335669999 Q gi|254780592|r 70 --KMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGS 146 (271) Q Consensus 70 --~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~ 146 (271) ...-++.|.-..+.|+.++|..+|+.++.+.|.+++++..+|.......+.-+|-..|-+++...|.+.+++.++.. T Consensus 114 kEA~~Al~~A~~~~~~Gk~ekA~~lfeHAlalaP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseALvnR~R 192 (472) T KOG3824 114 KEAILALKAAGRSRKDGKLEKAMTLFEHALALAPTNPQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEALVNRAR 192 (472) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCHHHHHHHHC T ss_conf 9999999998898864346889999999983089977999997688885144676655642023217985587750311 No 193 >KOG1550 consensus Probab=97.06 E-value=0.019 Score=32.65 Aligned_cols=173 Identities=12% Similarity=0.010 Sum_probs=116.8 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH-----HCCHHHHHHHHHHHH-----CCCCCCHHHHHHHHHHHHHC-- Q ss_conf 199999999999995757879999999999874-----100124466655663-----27521001355553210000-- Q gi|254780592|r 51 QLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRV-----GRTAQALAVMRQVAI-----LYPEDQEVLAAYGKSLANAG-- 118 (271) Q Consensus 51 ~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~-----g~~~~A~~~~~~al~-----~~p~~~~~~~~~~~~~~~~g-- 118 (271) +...|..+++.+-+.. +..+...+|.++..- .++++|+.+++.+.. ..-..+.+...+|.+|.... T Consensus 227 ~~~~a~~~~~~~a~~g--~~~a~~~~g~~y~~G~~g~~~d~e~a~~~l~~aa~~~~~~a~~~~~~a~~~lg~~Y~~g~~~ 304 (552) T KOG1550 227 ELSEAFKYYREAAKLG--HSEAQYALGICYLAGTYGVTQDLESAIEYLKLAAESFKKAATKGLPPAQYGLGRLYLQGLGV 304 (552) T ss_pred HHHHHHHHHHHHHHCC--CHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC T ss_conf 0678899999987627--88999998999986255766689999999999988625666504742247999999818987 Q ss_pred ---CHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH---HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH----HHHH Q ss_conf ---0112588764221100002335669999999742---11111000000000000000124577765433----2235 Q gi|254780592|r 119 ---YLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMG---KHSEALIEYERALELSPNESSIVSNIAMSYLL----MGDL 188 (271) Q Consensus 119 ---~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~---~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~----~g~~ 188 (271) +++.|...+.++.... .+.....+|.++.... +...|...|..+... ....+..+++.+|.. .-+. T Consensus 305 ~~~d~~~A~~~~~~aA~~g--~~~a~~~lg~~~~~g~~~~d~~~A~~yy~~Aa~~--G~~~A~~~la~~y~~G~gv~r~~ 380 (552) T KOG1550 305 EKIDYEKALKLYTKAAELG--NPDAQYLLGVLYETGTKERDYRRAFEYYSLAAKA--GHILAIYRLALCYELGLGVERNL 380 (552) T ss_pred CCCCHHHHHHHHHHHHHCC--CHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHC--CCHHHHHHHHHHHHHCCCCCCCH T ss_conf 5124999999999998559--9048999889876276321388999999999975--99899999999997477868877 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC-CCHHHHHHHHHH Q ss_conf 5435999999953898547888999999986-998999999998 Q gi|254780592|r 189 KTAEEKLRFASQMIGADSRIRQNLALVVGLQ-GRMKEAYSIASQ 231 (271) Q Consensus 189 ~~A~~~~~~al~~~p~~~~~~~~La~~~~~~-g~~~eA~~~~~~ 231 (271) ..|..+++++.+.+ ++.+.+.++..+..- ++++.+...+.. T Consensus 381 ~~A~~~~k~aA~~g--~~~A~~~~~~~~~~g~~~~~~~~~~~~~ 422 (552) T KOG1550 381 ELAFAYYKKAAEKG--NPSAAYLLGAFYEYGVGRYDTALALYLY 422 (552) T ss_pred HHHHHHHHHHHHCC--CHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 89999999999827--8867999999998535652278999999 No 194 >COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones] Probab=97.06 E-value=0.019 Score=32.65 Aligned_cols=98 Identities=19% Similarity=0.068 Sum_probs=62.3 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHH-HHHCCCCCCHHHHHH------HHHHHHHCC Q ss_conf 8420199999999999995757879999999999874100124466655-663275210013555------532100000 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQ-VAILYPEDQEVLAAY------GKSLANAGY 119 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~-al~~~p~~~~~~~~~------~~~~~~~g~ 119 (271) ...++...+...+..++..||.++.+..+++.++...|....+...+.+ +....|.+....... ++.+..+|+ T Consensus 78 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~L~~ale~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 157 (620) T COG3914 78 APLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHLIRFYQLGRYLKLLGR 157 (620) T ss_pred CCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 42466266899876567549552669998888899842478889999988876074007787667778999899998524 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHH Q ss_conf 1125887642211000023356699 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAK 144 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~l 144 (271) ..++...+.+.....|..++....+ T Consensus 158 ~~~~~~~l~~~~d~~p~~~~~~~~~ 182 (620) T COG3914 158 TAEAELALERAVDLLPKYPRVLGAL 182 (620) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999999999988543306667678 No 195 >pfam10300 IML2 Putative mitochondrial outer membrane protein. This is a family of proteins conserved from fungi to humans. Members of this family also carry a TPR_2 domain pfam07719 at their C-terminus. Many of them are annotated as being mitochondrial outer membrane protein IML2. Probab=97.03 E-value=0.02 Score=32.47 Aligned_cols=107 Identities=9% Similarity=-0.023 Sum_probs=73.3 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC----CHHHHHHHHHHHHHCCHHHHH Q ss_conf 2019999999999999575787999999999987410012446665566327521----001355553210000011258 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPED----QEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~----~~~~~~~~~~~~~~g~~~~A~ 124 (271) .++...|...+....++.|+..-.++.-|.++...|+.++|+..+++++.....- .-.+..++.++.-+.+|++|. T Consensus 229 ~~~~~~a~~lL~~~~~~yP~s~l~l~~~gR~~~~~g~l~~Ai~~~~~~~~~~~~~kQ~~~lc~~El~w~~~~~~~w~~A~ 308 (446) T pfam10300 229 EGPLEEAEALLEPSRKRFPNSALWLFFEARIESLKGNLDEALELFEECIESQSEWKQIHHLCYWELAWCHVFLHRWKQAA 308 (446) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98899999999998853899468999999999973799999999998674465449999999999999999997899999 Q ss_pred HHHHHHHHHHCHHHHHH-HHHHHHHHHHHHHH Q ss_conf 87642211000023356-69999999742111 Q gi|254780592|r 125 DAINRAQRPDIPDWQLI-SAKGSVLAQMGKHS 155 (271) Q Consensus 125 ~~~~~a~~~~p~~~~~~-~~lg~~~~~~~~~~ 155 (271) ..+.+..+........| +..|.++...++.+ T Consensus 309 ~~~~~L~~~s~WSka~Y~Y~~a~c~~~~~~~~ 340 (446) T pfam10300 309 NYFLLLSDESKWSKALYAYLAAACLLMLGREE 340 (446) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCH T ss_conf 99999998454189999999999998653301 No 196 >PRK10941 putative transcriptional regulator; Provisional Probab=96.96 E-value=0.0036 Score=37.07 Aligned_cols=77 Identities=16% Similarity=0.159 Sum_probs=66.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 35669999999742111110000000000000001245777654332235543599999995389854788899999 Q gi|254780592|r 139 QLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALV 215 (271) Q Consensus 139 ~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~ 215 (271) ....++=.+|...++++.|..+.+..+...|+++.-+...|.+|.+.|.+..|...++..++..|+++.+-.-..++ T Consensus 182 RmL~nLK~~~~~e~~~~~AL~~~e~lL~l~Pddp~EiRDRGli~~qL~c~~~Al~DLe~fle~~P~dp~ae~ik~qi 258 (269) T PRK10941 182 KLLDTLKAALMEEKQMELALRASEALLQFNPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRAQI 258 (269) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 99999999998511499999999999862969979999999999984994899999999999785972899999999 No 197 >COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones] Probab=96.94 E-value=0.01 Score=34.37 Aligned_cols=143 Identities=19% Similarity=0.198 Sum_probs=101.7 Q ss_pred HHHHHHCCHHH-HHHHHHHHHHHCCCCHHHHHHH--HHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCH Q ss_conf 99984201999-9999999999575787999999--99998741001244666556632752100135555321000001 Q gi|254780592|r 44 LENMNHEQLLE-VTSTIGLQYQSHTKNKMIGIVY--ADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYL 120 (271) Q Consensus 44 ~~~~~~~~~~~-A~~~~~~al~~~P~~~~~~~~~--a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~ 120 (271) .+.++.+.... ++..+..-+..+|.++..++.. ...+...+....+.-.+..++..+|+...+..+++..+...|.. T Consensus 38 ~~~l~~~~~~~~~~~a~~~~~~~~~~~~~llla~~lsi~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~L~~ale~~~~~ 117 (620) T COG3914 38 LEWLNAEGLQALAIYALLLGIAINDVNPELLLAAFLSILLAPLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQ 117 (620) T ss_pred HHHHCCCCCHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99851567356788998713705787778999999875124246626689987656754955266999888889984247 Q ss_pred HHHHHHHHH-HHHHHCHHHHHHHHH------HHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHH Q ss_conf 125887642-211000023356699------999997421111100000000000000012457776543322 Q gi|254780592|r 121 DEGLDAINR-AQRPDIPDWQLISAK------GSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMG 186 (271) Q Consensus 121 ~~A~~~~~~-a~~~~p~~~~~~~~l------g~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g 186 (271) ..+...+.. +....|.+.+....+ +......++..++.....+..+..|..+.+...+......+. T Consensus 118 ~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~l~~~~d~~p~~~~~~~~~~~~r~~~c 190 (620) T COG3914 118 FLALADISEIAEWLSPDNAEFLGHLIRFYQLGRYLKLLGRTAEAELALERAVDLLPKYPRVLGALMTARQEQC 190 (620) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 8889999988876074007787667778999899998524899999999998854330666767889899754 No 198 >pfam04910 DUF654 Protein of unknown function, DUF654. This family includes a number of poorly characterized eukaryotic proteins. Probab=96.89 E-value=0.027 Score=31.73 Aligned_cols=148 Identities=15% Similarity=0.038 Sum_probs=94.9 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC-----CC------------------CCCH---HHHHHH Q ss_conf 999999957578799999999998741001244666556632-----75------------------2100---135555 Q gi|254780592|r 58 TIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAIL-----YP------------------EDQE---VLAAYG 111 (271) Q Consensus 58 ~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~-----~p------------------~~~~---~~~~~~ 111 (271) .+-..+..+|-+.+.++.+++++.++|+...|.+.+++++-. +| ++.. ++.... T Consensus 28 ~l~~lL~~~PYHvdtLLq~s~v~~~qgd~~~a~dlieRALf~~e~~~h~~F~~~~~~g~~rL~~~~~eNR~fflal~r~i 107 (343) T pfam04910 28 ALISLLQKYPYHVDTLLQVSEIFRRQGDHSLANDLIERALFAFERALHPSFSPSLTSGNCRLPYRRPENRQFFLALFRYI 107 (343) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCHHHHHHHHHHHHHH T ss_conf 99999985998378999999999984778889999999999977662254542567760227678341299999999999 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHCH-HHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCCCH--HHHHHHHHHHHHHHHHH Q ss_conf 3210000011258876422110000-23356699999-99742111110000000000000--00124577765433223 Q gi|254780592|r 112 KSLANAGYLDEGLDAINRAQRPDIP-DWQLISAKGSV-LAQMGKHSEALIEYERALELSPN--ESSIVSNIAMSYLLMGD 187 (271) Q Consensus 112 ~~~~~~g~~~~A~~~~~~a~~~~p~-~~~~~~~lg~~-~~~~~~~~~A~~~~~~~~~~~p~--~~~~~~~~~~~~~~~g~ 187 (271) ..+.+.|-+-.|.++.+-.+..+|. |+-.....-.. ..+.++++--+...+.......- -+..-...+.++...++ T Consensus 108 ~~L~~Rgc~rTAlE~~KLLLsLdp~~DP~g~ll~ID~~ALrs~ey~~li~f~~~~~~~~~~~~lPn~afS~aLA~~~l~~ 187 (343) T pfam04910 108 QSLGRRGCWRTALEWCKLLLSLDPEEDPLGALLLIDYYALRAKEYEWLIDFSESPLASRNLSLLPNFAYSVALALFLLEK 187 (343) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHCCCHHHHHHHHHHHHHC T ss_conf 99886654899999999998449888813679999999996353779999997353034065584878999999999748 Q ss_pred ----HHHHHHHHHHHHHCCCCC Q ss_conf ----554359999999538985 Q gi|254780592|r 188 ----LKTAEEKLRFASQMIGAD 205 (271) Q Consensus 188 ----~~~A~~~~~~al~~~p~~ 205 (271) .+.|...+++|+..-|.- T Consensus 188 ~~~~~~~A~~~L~~Ai~~fP~v 209 (343) T pfam04910 188 DDGLSESARALLLRAILRFPWV 209 (343) T ss_pred CCCCCHHHHHHHHHHHHHHHHH T ss_conf 6666078999999999984999 No 199 >KOG2396 consensus Probab=96.87 E-value=0.0099 Score=34.39 Aligned_cols=62 Identities=16% Similarity=0.104 Sum_probs=40.9 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHC-CHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 9999999999999575787999999999987410-0124466655663275210013555532 Q gi|254780592|r 52 LLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGR-TAQALAVMRQVAILYPEDQEVLAAYGKS 113 (271) Q Consensus 52 ~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~-~~~A~~~~~~al~~~p~~~~~~~~~~~~ 113 (271) +.+-...|.+++..+|++++.|+.-|.-....+. .+.|..++.+++..+|+.+..|..+-+. T Consensus 121 ~~~v~ki~~~~l~~Hp~~~dLWI~aA~wefe~n~ni~saRalflrgLR~npdsp~Lw~eyfrm 183 (568) T KOG2396 121 YGEVKKIFAAMLAKHPNNPDLWIYAAKWEFEINLNIESARALFLRGLRFNPDSPKLWKEYFRM 183 (568) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 468999999999858998306886655687503235899999997764189976899999999 No 200 >pfam05843 Suf Suppressor of forked protein (Suf). This family consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster Suppressor of forked [Su(f)] protein shares homology with the yeast RNA14 protein and the 77-kDa subunit of human cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and an important role of the GU-rich sequence for this regulation to occur. Probab=96.80 E-value=0.032 Score=31.30 Aligned_cols=81 Identities=11% Similarity=0.021 Sum_probs=34.3 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-HHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 9999999999999575787999999999987-410012446665566327521001355553210000011258876422 Q gi|254780592|r 52 LLEVTSTIGLQYQSHTKNKMIGIVYADVLRR-VGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRA 130 (271) Q Consensus 52 ~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~-~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a 130 (271) ..+|...|.+|-+...-.-+++..-|.+-+. .++..-|..+|+.+++..++++.....+-+.+...++...+...|+++ T Consensus 17 i~~aR~vF~~aRk~~~~t~~vyva~Al~E~~~~~d~~~a~kIFelGlK~f~~~~~y~~~Yld~Li~~nd~~n~R~lFE~~ 96 (275) T pfam05843 17 IKAARKVFKKARKKGRITYHVYVASALMEYYCSKDPKTAFKIFELGLKLFPEDGEFVLKYLDFLISLNDDTNARVLFETV 96 (275) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 28899999998607898559999999999987378079999999998765897899999999998717750189999999 Q ss_pred HH Q ss_conf 11 Q gi|254780592|r 131 QR 132 (271) Q Consensus 131 ~~ 132 (271) +. T Consensus 97 ~~ 98 (275) T pfam05843 97 VT 98 (275) T ss_pred HH T ss_conf 85 No 201 >COG2909 MalT ATP-dependent transcriptional regulator [Transcription] Probab=96.78 E-value=0.033 Score=31.22 Aligned_cols=154 Identities=20% Similarity=0.132 Sum_probs=67.6 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCH-----HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC------CHHHHHHHHHH Q ss_conf 9842019999999999999575787-----999999999987410012446665566327521------00135555321 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNK-----MIGIVYADVLRRVGRTAQALAVMRQVAILYPED------QEVLAAYGKSL 114 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~-----~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~------~~~~~~~~~~~ 114 (271) .++.++++.|+...+.+++.-|.+. .+....|.+..-.|++++|..+.+++.+..... ..+....+.++ T Consensus 468 al~~~~~e~a~~lar~al~~L~~~~~~~r~~~~sv~~~a~~~~G~~~~Al~~~~~a~~~a~~~~~~~l~~~~~~~~s~il 547 (894) T COG2909 468 ALNRGDPEEAEDLARLALVQLPEAAYRSRIVALSVLGEAAHIRGELTQALALMQQAEQMARQHDVYHLALWSLLQQSEIL 547 (894) T ss_pred HHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 98458778999999999986612254000256540568888734399999988879999998044889999999999999 Q ss_pred HHHCC--HHHHHHHHHHH----HHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC----CHHH---HHHHHHHHH Q ss_conf 00000--11258876422----11000023356699999997421111100000000000----0000---124577765 Q gi|254780592|r 115 ANAGY--LDEGLDAINRA----QRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELS----PNES---SIVSNIAMS 181 (271) Q Consensus 115 ~~~g~--~~~A~~~~~~a----~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~----p~~~---~~~~~~~~~ 181 (271) ...|+ +.+....+... ....|........++.++...-+.+.+......+++.. |... ....+++.+ T Consensus 548 ~~qGq~~~a~~~~~~~~~~~q~l~q~~~~~f~~~~r~~ll~~~~r~~~~~~ear~~~~~~~~~~~~~~~~~~~~~~LA~l 627 (894) T COG2909 548 EAQGQVARAEQEKAFNLIREQHLEQKPRHEFLVRIRAQLLRAWLRLDLAEAEARLGIEVGSVYTPQPLLSRLALSMLAEL 627 (894) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 98307889988888777998876542532157889999999999886666776433004220363236799999999999 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 433223554359999999 Q gi|254780592|r 182 YLLMGDLKTAEEKLRFAS 199 (271) Q Consensus 182 ~~~~g~~~~A~~~~~~al 199 (271) +...|++++|...+.... T Consensus 628 ~~~~Gdl~~A~~~l~~~~ 645 (894) T COG2909 628 EFLRGDLDKALAQLDELE 645 (894) T ss_pred HHHCCCHHHHHHHHHHHH T ss_conf 875388888998999999 No 202 >COG2909 MalT ATP-dependent transcriptional regulator [Transcription] Probab=96.78 E-value=0.033 Score=31.19 Aligned_cols=195 Identities=18% Similarity=0.126 Sum_probs=127.7 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCC--C-------HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC----- Q ss_conf 179999998420199999999999995757--8-------79999999999874100124466655663275210----- Q gi|254780592|r 39 PDPESLENMNHEQLLEVTSTIGLQYQSHTK--N-------KMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ----- 104 (271) Q Consensus 39 ~~~~~~~~~~~~~~~~A~~~~~~al~~~P~--~-------~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~----- 104 (271) ...++........+.+|...+.++...-|. . .+..--.|.+....|+++.|..+.+.++..-|.+. T Consensus 418 vll~aW~~~s~~r~~ea~~li~~l~~~l~~~~~~~~~~l~ae~~aL~a~val~~~~~e~a~~lar~al~~L~~~~~~~r~ 497 (894) T COG2909 418 VLLQAWLLASQHRLAEAETLIARLEHFLKAPMHSRQGDLLAEFQALRAQVALNRGDPEEAEDLARLALVQLPEAAYRSRI 497 (894) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 99999999973691789999999998707776631244889999999999984587789999999999866122540002 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHH----CHHHHHHHH--HHHHHHHHH--HHHHHHHHHHHCCC----CCCHHH Q ss_conf 013555532100000112588764221100----002335669--999999742--11111000000000----000000 Q gi|254780592|r 105 EVLAAYGKSLANAGYLDEGLDAINRAQRPD----IPDWQLISA--KGSVLAQMG--KHSEALIEYERALE----LSPNES 172 (271) Q Consensus 105 ~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~----p~~~~~~~~--lg~~~~~~~--~~~~A~~~~~~~~~----~~p~~~ 172 (271) ..+...+.+..-.|++++|......+.+.. ......|.. .+.++..+| .++.....|..... ..|... T Consensus 498 ~~~sv~~~a~~~~G~~~~Al~~~~~a~~~a~~~~~~~l~~~~~~~~s~il~~qGq~~~a~~~~~~~~~~~q~l~q~~~~~ 577 (894) T COG2909 498 VALSVLGEAAHIRGELTQALALMQQAEQMARQHDVYHLALWSLLQQSEILEAQGQVARAEQEKAFNLIREQHLEQKPRHE 577 (894) T ss_pred HHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 56540568888734399999988879999998044889999999999999983078899888887779988765425321 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCC----CC--CH-HHHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 124577765433223554359999999538----98--54-788899999998699899999999862 Q gi|254780592|r 173 SIVSNIAMSYLLMGDLKTAEEKLRFASQMI----GA--DS-RIRQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 173 ~~~~~~~~~~~~~g~~~~A~~~~~~al~~~----p~--~~-~~~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) ......+.++...-+++.+.....+.++.. |. .+ -+...||.++...|+.++|...+.... T Consensus 578 f~~~~r~~ll~~~~r~~~~~~ear~~~~~~~~~~~~~~~~~~~~~~LA~l~~~~Gdl~~A~~~l~~~~ 645 (894) T COG2909 578 FLVRIRAQLLRAWLRLDLAEAEARLGIEVGSVYTPQPLLSRLALSMLAELEFLRGDLDKALAQLDELE 645 (894) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 57889999999999886666776433004220363236799999999999875388888998999999 No 203 >pfam05843 Suf Suppressor of forked protein (Suf). This family consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster Suppressor of forked [Su(f)] protein shares homology with the yeast RNA14 protein and the 77-kDa subunit of human cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and an important role of the GU-rich sequence for this regulation to occur. Probab=96.75 E-value=0.031 Score=31.33 Aligned_cols=115 Identities=14% Similarity=0.064 Sum_probs=78.1 Q ss_pred HHHHHHCCCCCCCCHHHH---HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC Q ss_conf 998540368543311799---99998420199999999999995757879999999999874100124466655663275 Q gi|254780592|r 25 LLFSSCHLNPRISIPDPE---SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYP 101 (271) Q Consensus 25 ~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p 101 (271) .+|.-|-..+...-.-.. .+++...+|-.=|...|+.-++.-|++++..+.+.+-+...++...+..+|++++..-+ T Consensus 22 ~vF~~aRk~~~~t~~vyva~Al~E~~~~~d~~~a~kIFelGlK~f~~~~~y~~~Yld~Li~~nd~~n~R~lFE~~~~~l~ 101 (275) T pfam05843 22 KVFKKARKKGRITYHVYVASALMEYYCSKDPKTAFKIFELGLKLFPEDGEFVLKYLDFLISLNDDTNARVLFETVVTKLT 101 (275) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC T ss_conf 99999860789855999999999998737807999999999876589789999999999871775018999999985048 Q ss_pred CCC---HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHH Q ss_conf 210---01355553210000011258876422110000233 Q gi|254780592|r 102 EDQ---EVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQ 139 (271) Q Consensus 102 ~~~---~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~ 139 (271) ... ..|..+......-|+.......-++-.+.-|.+.. T Consensus 102 ~~~e~~~i~~~~~~fEs~~gdL~~i~klE~R~~e~fPed~~ 142 (275) T pfam05843 102 PEPEAKPLWKKFIKYESKYGDLSSILKLEKRMFELFPEDPP 142 (275) T ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCH T ss_conf 73777999999999988508699999999999986789727 No 204 >COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only] Probab=96.68 E-value=0.039 Score=30.75 Aligned_cols=166 Identities=17% Similarity=0.053 Sum_probs=114.0 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH----HCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH----H Q ss_conf 98420199999999999995757879999999999874----100124466655663275210013555532100----0 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRV----GRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN----A 117 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~----g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~----~ 117 (271) ....+++..|...+.++-.. .++.....++..+..- .+..+|...++.+. ....+.+...+|..+.. . T Consensus 51 ~~~~~~~~~a~~~~~~a~~~--~~~~a~~~l~~~y~~g~gv~~~~~~A~~~~~~~a--~~g~~~a~~~lg~~~~~G~gv~ 126 (292) T COG0790 51 SAYPPDYAKALKSYEKAAEL--GDAAALALLGQMYGAGKGVSRDKTKAADWYRCAA--ADGLAEALFNLGLMYANGRGVP 126 (292) T ss_pred HCCCCCHHHHHHHHHHHHHC--CCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH--HCCCHHHHHHHHHHHHCCCCCC T ss_conf 00221489889999988643--5476789999998635552013899999999987--3697999999999985589974 Q ss_pred CCHHHHHHHHHHHHHHHCHH-HHHHHHHHHHHHHHH-------HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH----H Q ss_conf 00112588764221100002-335669999999742-------11111000000000000000124577765433----2 Q gi|254780592|r 118 GYLDEGLDAINRAQRPDIPD-WQLISAKGSVLAQMG-------KHSEALIEYERALELSPNESSIVSNIAMSYLL----M 185 (271) Q Consensus 118 g~~~~A~~~~~~a~~~~p~~-~~~~~~lg~~~~~~~-------~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~----~ 185 (271) .+..+|..++.++....... ......+|.++..-. +...|...+.++-... +..+...+|..|.. . T Consensus 127 ~d~~~A~~~~~~Aa~~g~~~a~~~~~~l~~~y~~g~~~~~~~~~~~~A~~~~~~aa~~~--~~~a~~~lg~~y~~G~Gv~ 204 (292) T COG0790 127 LDLVKALKYYEKAAKLGNVEAALAMYRLGLAYLSGLQALAVAYDDKKALYLYRKAAELG--NPDAQLLLGRMYEKGLGVP 204 (292) T ss_pred CCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHC--CHHHHHHHHHHHHCCCCCC T ss_conf 58999999999999847930599999999986188632121514899999999999836--7999999999997588876 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC Q ss_conf 23554359999999538985478889999999869 Q gi|254780592|r 186 GDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQG 220 (271) Q Consensus 186 g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g 220 (271) .++++|..+|+++.+... ....+.++ ++...| T Consensus 205 ~d~~~A~~wy~~Aa~~g~--~~a~~~~~-~~~~~g 236 (292) T COG0790 205 RDLKKAFRWYKKAAEQGD--GAACYNLG-LMYLNG 236 (292) T ss_pred CCHHHHHHHHHHHHHCCC--HHHHHHHH-HHHHCC T ss_conf 689999999999997698--99999999-998668 No 205 >KOG1258 consensus Probab=96.67 E-value=0.039 Score=30.72 Aligned_cols=120 Identities=13% Similarity=-0.046 Sum_probs=59.0 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-CCHHHHHHHHHHHHHHHH Q ss_conf 3555532100000112588764221100002335669999999742111110000000000-000001245777654332 Q gi|254780592|r 107 LAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALEL-SPNESSIVSNIAMSYLLM 185 (271) Q Consensus 107 ~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~-~p~~~~~~~~~~~~~~~~ 185 (271) |..+-......|+++...-.+++.+..-....+.|...+......|+.+.+...+..+.+. .+..+.++...+..-... T Consensus 300 w~~yLdf~i~~g~~~~~~~l~ercli~cA~Y~efWiky~~~m~~~~~~~~~~~~~~~~~~i~~k~~~~i~L~~a~f~e~~ 379 (577) T KOG1258 300 WRYYLDFEITLGDFSRVFILFERCLIPCALYDEFWIKYARWMESSGDVSLANNVLARACKIHVKKTPIIHLLEARFEESN 379 (577) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH T ss_conf 99876455540517889999999873775238899999999997176267779997432110778707999999999962 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHH Q ss_conf 23554359999999538985478889999999869989999 Q gi|254780592|r 186 GDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAY 226 (271) Q Consensus 186 g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~ 226 (271) |+++.|...+++..+..|+...+-..-+.+....|+.+.+. T Consensus 380 ~n~~~A~~~lq~i~~e~pg~v~~~l~~~~~e~r~~~~~~~~ 420 (577) T KOG1258 380 GNFDDAKVILQRIESEYPGLVEVVLRKINWERRKGNLEDAN 420 (577) T ss_pred CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHH T ss_conf 46999999999988657763556778876999834245566 No 206 >KOG3616 consensus Probab=96.65 E-value=0.041 Score=30.61 Aligned_cols=22 Identities=36% Similarity=0.395 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5777654332235543599999 Q gi|254780592|r 176 SNIAMSYLLMGDLKTAEEKLRF 197 (271) Q Consensus 176 ~~~~~~~~~~g~~~~A~~~~~~ 197 (271) ...+..|...|+++.|.+.|.+ T Consensus 769 ~~iadhyan~~dfe~ae~lf~e 790 (1636) T KOG3616 769 GEIADHYANKGDFEIAEELFTE 790 (1636) T ss_pred HHHHHHHCCCHHHHHHHHHHHH T ss_conf 6788875253037899999873 No 207 >KOG0985 consensus Probab=96.58 E-value=0.046 Score=30.33 Aligned_cols=167 Identities=15% Similarity=0.068 Sum_probs=98.4 Q ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHH Q ss_conf 78799999999998741001244666556632752100135555321000001125887642211000023356699999 Q gi|254780592|r 68 KNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSV 147 (271) Q Consensus 68 ~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~ 147 (271) +.|..|..+|.+-.+.|...+|++-|-++ +++..+....+...+.|.|++-..++..+..... .+..-..+-.. T Consensus 1102 n~p~vWsqlakAQL~~~~v~dAieSyika-----dDps~y~eVi~~a~~~~~~edLv~yL~MaRkk~~-E~~id~eLi~A 1175 (1666) T KOG0985 1102 NEPAVWSQLAKAQLQGGLVKDAIESYIKA-----DDPSNYLEVIDVASRTGKYEDLVKYLLMARKKVR-EPYIDSELIFA 1175 (1666) T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHC-----CCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC-CCCCHHHHHHH T ss_conf 88579999999988458407899978615-----8817789999998753769999999999997404-76412899999 Q ss_pred HHHHHHHHHHHHHH----------------HH----CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC---- Q ss_conf 99742111110000----------------00----0000000001245777654332235543599999995389---- Q gi|254780592|r 148 LAQMGKHSEALIEY----------------ER----ALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG---- 203 (271) Q Consensus 148 ~~~~~~~~~A~~~~----------------~~----~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p---- 203 (271) |.+.++..+-.+.. +. +.++--++.+-+..++..++..|+|..|...-++|-+... T Consensus 1176 yAkt~rl~elE~fi~gpN~A~i~~vGdrcf~~~~y~aAkl~y~~vSN~a~La~TLV~LgeyQ~AVD~aRKAns~ktWK~V 1255 (1666) T KOG0985 1176 YAKTNRLTELEEFIAGPNVANIQQVGDRCFEEKMYEAAKLLYSNVSNFAKLASTLVYLGEYQGAVDAARKANSTKTWKEV 1255 (1666) T ss_pred HHHHCCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 99752077899986498702288875887540235999999997646999999999988777787776640330179999 Q ss_pred -----C----------------CHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCH Q ss_conf -----8----------------547888999999986998999999998629172866 Q gi|254780592|r 204 -----A----------------DSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATR 240 (271) Q Consensus 204 -----~----------------~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~ 240 (271) + +.+-...|...|...|-++|-+.+++.+|..+.++- T Consensus 1256 cfaCvd~~EFrlAQiCGL~iivhadeLeeli~~Yq~rGyFeElIsl~Ea~LGLERAHM 1313 (1666) T KOG0985 1256 CFACVDKEEFRLAQICGLNIIVHADELEELIEYYQDRGYFEELISLLEAGLGLERAHM 1313 (1666) T ss_pred HHHHHCHHHHHHHHHCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHCHHHHHH T ss_conf 9997062454498763744898677689999999825709999999986525047777 No 208 >pfam08424 DUF1740 Protein of unknown function (DUF1740). This is a family of eukaryotic proteins of unknown function. Probab=96.53 E-value=0.049 Score=30.13 Aligned_cols=138 Identities=9% Similarity=-0.039 Sum_probs=85.5 Q ss_pred HHHHHHHHHHHH-HCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-HC--CHHHHHHHHH Q ss_conf 999999999999-5757879999999999874100124466655663275210013555532100-00--0112588764 Q gi|254780592|r 53 LEVTSTIGLQYQ-SHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN-AG--YLDEGLDAIN 128 (271) Q Consensus 53 ~~A~~~~~~al~-~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~-~g--~~~~A~~~~~ 128 (271) +.-+..|++|++ .+|++...++.+-......-+.++-..-+++++..+|.+...|..+-...+. .. .+++....|. T Consensus 50 E~KlsilEkAL~~~np~~~~Lll~~l~~~~~~w~~~~l~~~We~~l~~~p~~~~LW~~yL~~~q~~~~~f~~~~v~~~y~ 129 (231) T pfam08424 50 EIKLSILEKALKEHNPDSERLLLGLLEEGEKVWDTDELLSRWEKVLKQNPGSPSLWRKYLDFRQGDFANFSYSKVRKTYE 129 (231) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 99999999999985899789999999999985787999999999999889975999999999860654670888999999 Q ss_pred HHHHHHCHHHHHHHHHHHHHHHHHHH-HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH Q ss_conf 22110000233566999999974211-11100000000000000012457776543322355435999999953898547 Q gi|254780592|r 129 RAQRPDIPDWQLISAKGSVLAQMGKH-SEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSR 207 (271) Q Consensus 129 ~a~~~~p~~~~~~~~lg~~~~~~~~~-~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~ 207 (271) +.+. .+... ......-............+..++.....+.|..+.|...++-.++.+--.+. T Consensus 130 ~cl~-----------------~l~~~~~~~~~~~~~~~~~e~~~l~lflr~~~flrqaG~~E~Aval~QA~lE~n~f~P~ 192 (231) T pfam08424 130 KCLR-----------------ALKAAKLKEITSVPAVPELEEVMLYLFLRLCRFLRQAGYTELAVALWQALLEFNFFRPD 192 (231) T ss_pred HHHH-----------------HHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCH T ss_conf 9999-----------------98750024432455530278999999999999999859689999999999988268836 No 209 >pfam02259 FAT FAT domain. The FAT domain is named after FRAP, ATM and TRRAP. Probab=96.51 E-value=0.05 Score=30.07 Aligned_cols=109 Identities=15% Similarity=0.019 Sum_probs=70.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-------- Q ss_conf 002335669999999742111110000000000000--0012457776543322355435999999953898-------- Q gi|254780592|r 135 IPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN--ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA-------- 204 (271) Q Consensus 135 p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~--~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~-------- 204 (271) ......|...+.+..+.|+++.|...+.++....+. .+.+....+..+..+|+..+|+..++..+..... T Consensus 147 ~e~~~~wlk~a~laRK~g~~~~A~~~l~~l~~~~~~~~~p~v~~e~aKllW~~g~~~~Ai~~L~~~~~~~~~~~~~~~~~ 226 (351) T pfam02259 147 HELAEMWLKFANLARKSGRFSLAEKALLKLLTYDTREDLPNVVIAYAKYLWAKGEQQEAFQKLREFTSCYLSKPVGSSSD 226 (351) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 55899999999999888899899999999862588789954699999999974889999999999999875212122443 Q ss_pred ------------------------CHHHHHHHHHHHHH------CCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf ------------------------54788899999998------6998999999998629172866599 Q gi|254780592|r 205 ------------------------DSRIRQNLALVVGL------QGRMKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 205 ------------------------~~~~~~~La~~~~~------~g~~~eA~~~~~~~L~~~~~~~~~~ 243 (271) ..+++..+|.-... .+..+++...|..+...++...... T Consensus 227 ~~~~~~~~~~~~~~~~~~~~~~~~~Ar~~l~lg~W~~~~~~~~~~~~~~~il~~y~~A~~~~~~w~Kaw 295 (351) T pfam02259 227 SELLLGLTYEVISSTNLEYFEAKLLARCFLKLGEWLDKLQMNWGQGKKDEILKAYRTATQFDDQWYKAW 295 (351) T ss_pred HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 023210122345560144453799999999998999985654364579999999999998786648999 No 210 >COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only] Probab=96.48 E-value=0.053 Score=29.95 Aligned_cols=148 Identities=20% Similarity=0.101 Sum_probs=59.4 Q ss_pred HHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH----HCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH----HH Q ss_conf 874100124466655663275210013555532100----0001125887642211000023356699999997----42 Q gi|254780592|r 81 RRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN----AGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQ----MG 152 (271) Q Consensus 81 ~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~----~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~----~~ 152 (271) ...+.+..+...+.++.... +......++..+.. ..+...|..+++.+ .+...+.....+|.++.. .. T Consensus 52 ~~~~~~~~a~~~~~~a~~~~--~~~a~~~l~~~y~~g~gv~~~~~~A~~~~~~~--a~~g~~~a~~~lg~~~~~G~gv~~ 127 (292) T COG0790 52 AYPPDYAKALKSYEKAAELG--DAAALALLGQMYGAGKGVSRDKTKAADWYRCA--AADGLAEALFNLGLMYANGRGVPL 127 (292) T ss_pred CCCCCHHHHHHHHHHHHHCC--CHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH--HHCCCHHHHHHHHHHHHCCCCCCC T ss_conf 02214898899999886435--47678999999863555201389999999998--736979999999999855899745 Q ss_pred HHHHHHHHHHHCCCCCCHH-HHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH----CC Q ss_conf 1111100000000000000-012457776543322-------35543599999995389854788899999998----69 Q gi|254780592|r 153 KHSEALIEYERALELSPNE-SSIVSNIAMSYLLMG-------DLKTAEEKLRFASQMIGADSRIRQNLALVVGL----QG 220 (271) Q Consensus 153 ~~~~A~~~~~~~~~~~p~~-~~~~~~~~~~~~~~g-------~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~----~g 220 (271) +..+|...+.++....... ......++..|..-. +...|...|.++-... +..+..++|.+|.. .. T Consensus 128 d~~~A~~~~~~Aa~~g~~~a~~~~~~l~~~y~~g~~~~~~~~~~~~A~~~~~~aa~~~--~~~a~~~lg~~y~~G~Gv~~ 205 (292) T COG0790 128 DLVKALKYYEKAAKLGNVEAALAMYRLGLAYLSGLQALAVAYDDKKALYLYRKAAELG--NPDAQLLLGRMYEKGLGVPR 205 (292) T ss_pred CHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHC--CHHHHHHHHHHHHCCCCCCC T ss_conf 8999999999999847930599999999986188632121514899999999999836--79999999999975888766 Q ss_pred CHHHHHHHHHHHCC Q ss_conf 98999999998629 Q gi|254780592|r 221 RMKEAYSIASQELS 234 (271) Q Consensus 221 ~~~eA~~~~~~~L~ 234 (271) ++.+|...|+++-. T Consensus 206 d~~~A~~wy~~Aa~ 219 (292) T COG0790 206 DLKKAFRWYKKAAE 219 (292) T ss_pred CHHHHHHHHHHHHH T ss_conf 89999999999997 No 211 >KOG1839 consensus Probab=96.47 E-value=0.048 Score=30.17 Aligned_cols=155 Identities=16% Similarity=0.053 Sum_probs=88.7 Q ss_pred HHHHHHHHHCCHHHHH------HHHHH-HHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH--------HHCHHHHH Q ss_conf 9999987410012446------66556-632752100135555321000001125887642211--------00002335 Q gi|254780592|r 76 YADVLRRVGRTAQALA------VMRQV-AILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQR--------PDIPDWQL 140 (271) Q Consensus 76 ~a~~l~~~g~~~~A~~------~~~~a-l~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~--------~~p~~~~~ 140 (271) .+..-...|.+.++.+ ++... -.++|+....+..++.++..+|+.++|..+-.++.- ..|+.... T Consensus 938 ~gq~~~~e~~~~~~~~~~~slnl~~~v~~~~h~~~~~~~~~La~l~~~~~d~~~Ai~~~~ka~ii~eR~~g~ds~~t~~~ 1017 (1236) T KOG1839 938 QGQEALLEDGFSEAYELPESLNLLNNVMGVLHPEVASKYRSLAKLSNRLGDNQEAIAQQRKACIISERVLGKDSPNTKLA 1017 (1236) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCEEEECHHCCCCCHHHHHH T ss_conf 65666412201455420024567777764005568899999998876533358899840121554001116798788877 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCC--------CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-----C--- Q ss_conf 66999999974211111000000000--------0000001245777654332235543599999995389-----8--- Q gi|254780592|r 141 ISAKGSVLAQMGKHSEALIEYERALE--------LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG-----A--- 204 (271) Q Consensus 141 ~~~lg~~~~~~~~~~~A~~~~~~~~~--------~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p-----~--- 204 (271) +.+++......+....|...+..+.. ..|.......+++.++...++++.|+.+++.|+..+. . T Consensus 1018 y~nlal~~f~~~~~~~al~~~~ra~~l~~Ls~ge~hP~~a~~~~nle~l~~~v~e~d~al~~le~A~a~~~~v~g~~~l~ 1097 (1236) T KOG1839 1018 YGNLALYEFAVKNLSGALKSLNRALKLKLLSSGEDHPPTALSFINLELLLLGVEEADTALRYLESALAKNKKVLGPKELE 1097 (1236) T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 66789998741575511466888898630345788996223264799998558888778999999998776624852103 Q ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 54788899999998699899999999 Q gi|254780592|r 205 DSRIRQNLALVVGLQGRMKEAYSIAS 230 (271) Q Consensus 205 ~~~~~~~La~~~~~~g~~~eA~~~~~ 230 (271) ....+..++......|.+..|....+ T Consensus 1098 ~~~~~~~~a~l~~s~~dfr~al~~ek 1123 (1236) T KOG1839 1098 TALSYHALARLFESMKDFRNALEHEK 1123 (1236) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 45678899998752688999998876 No 212 >KOG1839 consensus Probab=96.44 E-value=0.047 Score=30.22 Aligned_cols=102 Identities=18% Similarity=0.128 Sum_probs=47.4 Q ss_pred HHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--------CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC--- Q ss_conf 00002335669999999742111110000000000--------0000012457776543322355435999999953--- Q gi|254780592|r 133 PDIPDWQLISAKGSVLAQMGKHSEALIEYERALEL--------SPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQM--- 201 (271) Q Consensus 133 ~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~--------~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~--- 201 (271) ..|.....+..++.++...++.++|+..-.++.-. .|+....+.+++......++...|...+.++... T Consensus 968 ~h~~~~~~~~~La~l~~~~~d~~~Ai~~~~ka~ii~eR~~g~ds~~t~~~y~nlal~~f~~~~~~~al~~~~ra~~l~~L 1047 (1236) T KOG1839 968 LHPEVASKYRSLAKLSNRLGDNQEAIAQQRKACIISERVLGKDSPNTKLAYGNLALYEFAVKNLSGALKSLNRALKLKLL 1047 (1236) T ss_pred CCHHHHHHHHHHHHHHHHHCCHHHHHHHCCCCEEEECHHCCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCC T ss_conf 05568899999998876533358899840121554001116798788877667899987415755114668888986303 Q ss_pred -----CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf -----898547888999999986998999999998629 Q gi|254780592|r 202 -----IGADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 202 -----~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) .|.-.-+..++.+++...++++-|..+++.+++ T Consensus 1048 s~ge~hP~~a~~~~nle~l~~~v~e~d~al~~le~A~a 1085 (1236) T KOG1839 1048 SSGEDHPPTALSFINLELLLLGVEEADTALRYLESALA 1085 (1236) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 45788996223264799998558888778999999998 No 213 >pfam00515 TPR_1 Tetratricopeptide repeat. Probab=96.30 E-value=0.0034 Score=37.27 Aligned_cols=31 Identities=13% Similarity=0.194 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC Q ss_conf 9999999998741001244666556632752 Q gi|254780592|r 72 IGIVYADVLRRVGRTAQALAVMRQVAILYPE 102 (271) Q Consensus 72 ~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~ 102 (271) ++.++|.+|..+|++++|+..|+++++++|+ T Consensus 3 ~~~~lG~~y~~~~~~~~A~~~~~~Al~l~p~ 33 (34) T pfam00515 3 ALYNLGNAYFKLGKYDEAIEYYEKALELNPN 33 (34) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 9999899999917899999999998843989 No 214 >TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK; InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.. Probab=96.21 E-value=0.06 Score=29.61 Aligned_cols=102 Identities=17% Similarity=0.197 Sum_probs=51.2 Q ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 211111000000000000000124577765433223554359999999538985---47888999999986998999999 Q gi|254780592|r 152 GKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD---SRIRQNLALVVGLQGRMKEAYSI 228 (271) Q Consensus 152 ~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~---~~~~~~La~~~~~~g~~~eA~~~ 228 (271) +++.++...+....-++|+-..+-..=|.+++..|+|++|.+.|.+.....|.. +...-.||.++..+|+ +.+.. T Consensus 30 ~~~~d~~l~L~AlRvLrP~l~~~d~~dG~L~i~~G~y~dAaR~L~E~~~~~~~~svm~yg~ALlAlcL~~~~D--~~W~~ 107 (190) T TIGR02561 30 ADPYDIELLLDALRVLRPNLKELDMFDGWLLIARGNYDDAARILRELLASAGAASVMPYGKALLALCLSAKGD--DEWHL 107 (190) T ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCC--HHHHH T ss_conf 7888899999998502765068899999998615875899999998724899720437899999998743288--88899 Q ss_pred -HHHHCCCCCCCHHHHHHHHHHHHHHHH Q ss_conf -998629172866599999999832129 Q gi|254780592|r 229 -ASQELSPEEATRNIKYIKSILSQRDPW 255 (271) Q Consensus 229 -~~~~L~~~~~~~~~~~l~~~l~~~~~~ 255 (271) ...+|..++....+.+.+++..+.+-- T Consensus 108 ~A~e~L~~~a~~dav~LVkAL~~~~D~~ 135 (190) T TIGR02561 108 HADEVLEEDADADAVALVKALEAREDVL 135 (190) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 9999874487850679999886532489 No 215 >KOG2396 consensus Probab=96.14 E-value=0.03 Score=31.40 Aligned_cols=93 Identities=6% Similarity=-0.004 Sum_probs=78.6 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC-HHHHHHHHHHHHH Q ss_conf 999999999995757879999999999874100124466655663275210013555532100000-1125887642211 Q gi|254780592|r 54 EVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGY-LDEGLDAINRAQR 132 (271) Q Consensus 54 ~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~-~~~A~~~~~~a~~ 132 (271) .-...|+.+..+-|.|+..|.++..-....+.+.+--.+|.+++..+|+++..|..-+.-....+. .+.|...+.+++. T Consensus 89 rIv~lyr~at~rf~~D~~lW~~yi~f~kk~~~~~~v~ki~~~~l~~Hp~~~dLWI~aA~wefe~n~ni~saRalflrgLR 168 (568) T KOG2396 89 RIVFLYRRATNRFNGDVKLWLSYIAFCKKKKTYGEVKKIFAAMLAKHPNNPDLWIYAAKWEFEINLNIESARALFLRGLR 168 (568) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 99999999997567777899999999998422468999999999858998306886655687503235899999997764 Q ss_pred HHCHHHHHHHHHHH Q ss_conf 00002335669999 Q gi|254780592|r 133 PDIPDWQLISAKGS 146 (271) Q Consensus 133 ~~p~~~~~~~~lg~ 146 (271) .+|..+..|...-. T Consensus 169 ~npdsp~Lw~eyfr 182 (568) T KOG2396 169 FNPDSPKLWKEYFR 182 (568) T ss_pred CCCCCHHHHHHHHH T ss_conf 18997689999999 No 216 >pfam00515 TPR_1 Tetratricopeptide repeat. Probab=96.12 E-value=0.0049 Score=36.25 Aligned_cols=31 Identities=19% Similarity=0.233 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 2457776543322355435999999953898 Q gi|254780592|r 174 IVSNIAMSYLLMGDLKTAEEKLRFASQMIGA 204 (271) Q Consensus 174 ~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~ 204 (271) ++.++|.+|..+|++++|+.+|++|++++|+ T Consensus 3 ~~~~lG~~y~~~~~~~~A~~~~~~Al~l~p~ 33 (34) T pfam00515 3 ALYNLGNAYFKLGKYDEAIEYYEKALELNPN 33 (34) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 9999899999917899999999998843989 No 217 >pfam04053 Coatomer_WDAD Coatomer WD associated region. This region is composed of WD40 repeats. Probab=96.10 E-value=0.084 Score=28.70 Aligned_cols=158 Identities=17% Similarity=0.157 Sum_probs=101.0 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHH Q ss_conf 99984201999999999999957578799999999998741001244666556632752100135555321000001125 Q gi|254780592|r 44 LENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEG 123 (271) Q Consensus 44 ~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A 123 (271) ...+-.+++++|...+...--.-|. +-....+.-+...|-++.|+.+.. +| ...+. ...+.|+.+.| T Consensus 263 k~av~r~d~~~~~~~~~~~~l~~p~--~~~~~i~~fLe~~G~~e~AL~~~~-----D~---~~rFe---LAl~lG~L~~A 329 (435) T pfam04053 263 KTALLRKDYEEVLRIIANSNLLPPK--DQGQKIIRFLEKKGYPELALQFVT-----DP---DTRFD---LALELGNLDVA 329 (435) T ss_pred HHHHHCCCHHHHHHHHHHCCCCCCH--HHHHHHHHHHHHCCCHHHHHHHCC-----CH---HHHHH---HHHHCCCHHHH T ss_conf 9999757788999876421336873--466799999997797888876608-----90---47616---76861999999 Q ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 88764221100002335669999999742111110000000000000001245777654332235543599999995389 Q gi|254780592|r 124 LDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG 203 (271) Q Consensus 124 ~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p 203 (271) .+...+. ++...|..+|.....+|+++.|..+|.++-.. ..+..+|.-.|+.+.=....+.|..... T Consensus 330 ~~~A~~~-----~~~~~Wk~Lg~~AL~~g~~~lAe~~~~k~~d~--------~~LllLy~~tG~~~~L~kl~~~a~~~g~ 396 (435) T pfam04053 330 LEIAKEL-----DDEHKWKRLGDAALSQGNIKLAEEAYQKAKDF--------DKLLLLYLSTGNKEKLKKLAKIAEERGD 396 (435) T ss_pred HHHHHHH-----CCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCH--------HHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 9999873-----67999999999999749958999999974253--------4559999984999999999999998476 Q ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 85478889999999869989999999986 Q gi|254780592|r 204 ADSRIRQNLALVVGLQGRMKEAYSIASQE 232 (271) Q Consensus 204 ~~~~~~~~La~~~~~~g~~~eA~~~~~~~ 232 (271) .+.... ++...|+.++-..++.+. T Consensus 397 ~n~aF~-----~~~~lgd~~~cv~lL~~t 420 (435) T pfam04053 397 YNSAFQ-----NALYLGDVEKCVDILIKT 420 (435) T ss_pred CCHHHH-----HHHHHCCHHHHHHHHHHC T ss_conf 229999-----998808999999999985 No 218 >COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only] Probab=96.04 E-value=0.0072 Score=35.23 Aligned_cols=61 Identities=13% Similarity=0.025 Sum_probs=56.7 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHH Q ss_conf 9842019999999999999575787999999999987410012446665566327521001 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEV 106 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~ 106 (271) ....+|.+.|.+.|.++++.-|+....|+++|......|+.+.|..-++++++++|++... T Consensus 5 ~~~~~D~~aaaely~qal~lap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~g 65 (287) T COG4976 5 LAESGDAEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHGG 65 (287) T ss_pred HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHCCCHHHHHHHHHHHHCCCCCCCCC T ss_conf 1036784899999987750470444345540236663041899999999887079621222 No 219 >smart00299 CLH Clathrin heavy chain repeat homology. Probab=96.00 E-value=0.094 Score=28.41 Aligned_cols=130 Identities=8% Similarity=0.089 Sum_probs=60.9 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHH Q ss_conf 79999999999874100124466655663275210013555532100000112588764221100002335669999999 Q gi|254780592|r 70 KMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLA 149 (271) Q Consensus 70 ~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~ 149 (271) +.-......++...|.+.+-+.+++.++..++.++.....++.++...+. ...+..+.+- .+... ....|..+. T Consensus 7 ~~~~~~vi~~~e~~~~~~~L~~~L~~~~~~~~~~~~l~~~Li~~yak~~~-~~~le~~~~~----~~~yd-~~~i~~~C~ 80 (140) T smart00299 7 PIDVSEVVELFEKRNLLEELIPYLESALKLNSENPALQTKLIELYAKYDP-QKEIERLDNK----SNHYD-IEKVGKLCE 80 (140) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC-HHHHHHHHHC----CCCCC-HHHHHHHHH T ss_conf 54799999999987999999999999984576670799999999998293-8689998703----67588-999999999 Q ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 7421111100000000000000012457776543-32235543599999995389854788899999998 Q gi|254780592|r 150 QMGKHSEALIEYERALELSPNESSIVSNIAMSYL-LMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGL 218 (271) Q Consensus 150 ~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~-~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~ 218 (271) ..+-+++|...+.+.-. ... ....++ ..++++.|+.++.++ ++++.|..++..... T Consensus 81 ~~~l~~~a~~ly~~~~~----~~~----ai~~~i~~~~~~~~aie~~~~~-----~~~e~w~~l~~~~ld 137 (140) T smart00299 81 KAKLYEEAVELYKKDGN----FKD----AIVTLIEHLGNYEKAIEYFVKQ-----NNPELWAEVLKALLD 137 (140) T ss_pred HHCCHHHHHHHHHHCCC----HHH----HHHHHHHHHHHHHHHHHHHHHC-----CCHHHHHHHHHHHHC T ss_conf 81339999999987064----999----9999999867499999999984-----999999999999855 No 220 >KOG2300 consensus Probab=95.98 E-value=0.096 Score=28.34 Aligned_cols=188 Identities=13% Similarity=0.053 Sum_probs=123.5 Q ss_pred HHHCCHHHHHHHHHHHHHHCCC---CH-----------HHHHHHHHHHHHHHCCHHHHHHHHHHHH---CCCC------- Q ss_conf 8420199999999999995757---87-----------9999999999874100124466655663---2752------- Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTK---NK-----------MIGIVYADVLRRVGRTAQALAVMRQVAI---LYPE------- 102 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~---~~-----------~~~~~~a~~l~~~g~~~~A~~~~~~al~---~~p~------- 102 (271) +..|.+.+|..+-++++..-.+ -+ ..+.....+-.-.|++.+|+.-+....+ ..|. T Consensus 286 m~~gy~~~~~K~tDe~i~q~eklkq~d~~srilsm~km~~LE~iv~c~lv~~~~~~al~~i~dm~~w~~r~p~~~Llr~~ 365 (629) T KOG2300 286 MPAGYFKKAQKYTDEAIKQTEKLKQADLMSRILSMFKMILLEHIVMCRLVRGDYVEALEEIVDMKNWCTRFPTPLLLRAH 365 (629) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 14689999999999999987640124321489999999999999999987078899999999999998718945777773 Q ss_pred CCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCH---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH-------H Q ss_conf 1001355553210000011258876422110000---23356699999997421111100000000000000-------0 Q gi|254780592|r 103 DQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIP---DWQLISAKGSVLAQMGKHSEALIEYERALELSPNE-------S 172 (271) Q Consensus 103 ~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~---~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~-------~ 172 (271) .......+|......+-++.|...|..++..... .+....+++..|.++++.+.--+..+..-..+..+ . T Consensus 366 ~~~ih~LlGlys~sv~~~enAe~hf~~a~k~t~~~dl~a~~nlnlAi~YL~~~~~ed~y~~ld~i~p~nt~s~ssq~l~a 445 (629) T KOG2300 366 EAQIHMLLGLYSHSVNCYENAEFHFIEATKLTESIDLQAFCNLNLAISYLRIGDAEDLYKALDLIGPLNTNSLSSQRLEA 445 (629) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCHHHHHHH T ss_conf 78999997167631024778999999999862077899999986999999712087899999862898777624999987 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 124577765433223554359999999538985------47888999999986998999999998629 Q gi|254780592|r 173 SIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD------SRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 173 ~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~------~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) .+++..|...+.++++.+|...+++.++..... .-....|+.+....|+..|+....+-++. T Consensus 446 ~~~~v~glfaf~qn~lnEaK~~l~e~Lkmanaed~~rL~a~~LvLLs~v~lslgn~~es~nmvrpamq 513 (629) T KOG2300 446 SILYVYGLFAFKQNDLNEAKRFLRETLKMANAEDLNRLTACSLVLLSHVFLSLGNTVESRNMVRPAMQ 513 (629) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHCCCHHHH T ss_conf 79999999999855289999999998754040257788878999999999981452887725545888 No 221 >KOG0985 consensus Probab=95.91 E-value=0.1 Score=28.16 Aligned_cols=175 Identities=12% Similarity=0.022 Sum_probs=80.3 Q ss_pred CHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH Q ss_conf 11799999984201999999999999957578799999999998741001244666556632752100135555321000 Q gi|254780592|r 38 IPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANA 117 (271) Q Consensus 38 ~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~ 117 (271) ...+.+...+..+-..+|++.|-++ +||..+.....+..+.|++++-+.++.-+.+...+ +.+-..+-.+|... T Consensus 1106 vWsqlakAQL~~~~v~dAieSyika-----dDps~y~eVi~~a~~~~~~edLv~yL~MaRkk~~E-~~id~eLi~AyAkt 1179 (1666) T KOG0985 1106 VWSQLAKAQLQGGLVKDAIESYIKA-----DDPSNYLEVIDVASRTGKYEDLVKYLLMARKKVRE-PYIDSELIFAYAKT 1179 (1666) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHC-----CCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHH T ss_conf 9999999988458407899978615-----88177899999987537699999999999974047-64128999999975 Q ss_pred CCHHHHHHHHH--------------------HHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH------- Q ss_conf 00112588764--------------------221100002335669999999742111110000000000000------- Q gi|254780592|r 118 GYLDEGLDAIN--------------------RAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN------- 170 (271) Q Consensus 118 g~~~~A~~~~~--------------------~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~------- 170 (271) ++..+-.+.+. .+-+.-......|..++..+..+|++..|.+..+++-....= T Consensus 1180 ~rl~elE~fi~gpN~A~i~~vGdrcf~~~~y~aAkl~y~~vSN~a~La~TLV~LgeyQ~AVD~aRKAns~ktWK~VcfaC 1259 (1666) T KOG0985 1180 NRLTELEEFIAGPNVANIQQVGDRCFEEKMYEAAKLLYSNVSNFAKLASTLVYLGEYQGAVDAARKANSTKTWKEVCFAC 1259 (1666) T ss_pred CCHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 20778999864987022888758875402359999999976469999999999887777877766403301799999997 Q ss_pred ------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf ------------------001245777654332235543599999995389854788899999998 Q gi|254780592|r 171 ------------------ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGL 218 (271) Q Consensus 171 ------------------~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~ 218 (271) ..+-+..+...|...|-+++-+..++.++.+...+-..+..||..|.. T Consensus 1260 vd~~EFrlAQiCGL~iivhadeLeeli~~Yq~rGyFeElIsl~Ea~LGLERAHMgmfTELaiLYsk 1325 (1666) T KOG0985 1260 VDKEEFRLAQICGLNIIVHADELEELIEYYQDRGYFEELISLLEAGLGLERAHMGMFTELAILYSK 1325 (1666) T ss_pred HCHHHHHHHHHCCCEEEEEHHHHHHHHHHHHHCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHH T ss_conf 062454498763744898677689999999825709999999986525047777889999999985 No 222 >KOG1310 consensus Probab=95.85 E-value=0.022 Score=32.23 Aligned_cols=93 Identities=15% Similarity=-0.035 Sum_probs=68.5 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH---CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHH Q ss_conf 984201999999999999957578799999999998741---00124466655663275210013555532100000112 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVG---RTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDE 122 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g---~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~ 122 (271) -+..+....|+..|.++++..|.....+.++|.++++.+ +.--|+.-.-.++.++|....++..+++++.+++++.+ T Consensus 384 ~ly~~~~~~~i~~~s~a~q~~~~~~~~l~nraa~lmkRkW~~d~~~AlrDch~Alrln~s~~kah~~la~aL~el~r~~e 463 (758) T KOG1310 384 GLYESIVSGAISHYSRAIQYVPDAIYLLENRAAALMKRKWRGDSYLALRDCHVALRLNPSIQKAHFRLARALNELTRYLE 463 (758) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 10368887789998877664550267887689999864114307899876776256783888889999999999766877 Q ss_pred HHHHHHHHHHHHCHHH Q ss_conf 5887642211000023 Q gi|254780592|r 123 GLDAINRAQRPDIPDW 138 (271) Q Consensus 123 A~~~~~~a~~~~p~~~ 138 (271) |++...-+....|.+. T Consensus 464 al~~~~alq~~~Ptd~ 479 (758) T KOG1310 464 ALSCHWALQMSFPTDV 479 (758) T ss_pred HHHHHHHHHHCCCHHH T ss_conf 5312898751693566 No 223 >KOG1586 consensus Probab=95.82 E-value=0.11 Score=27.92 Aligned_cols=186 Identities=11% Similarity=0.040 Sum_probs=101.9 Q ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC------CCCHHHHHHHHHHHHHCCHHHH Q ss_conf 0199999999999995757879999999999874100124466655663275------2100135555321000001125 Q gi|254780592|r 50 EQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYP------EDQEVLAAYGKSLANAGYLDEG 123 (271) Q Consensus 50 ~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p------~~~~~~~~~~~~~~~~g~~~~A 123 (271) +.+++|.+.+.++- ..|....++..|=..|-++.+.+- +-...+..-+.+|.. ++.++| T Consensus 28 ~k~eeAadl~~~Aa--------------n~yklaK~w~~AG~aflkaA~~h~k~~skhDaat~YveA~~cykk-~~~~eA 92 (288) T KOG1586 28 NKYEEAAELYERAA--------------NMYKLAKNWSAAGDAFLKAADLHLKAGSKHDAATTYVEAANCYKK-VDPEEA 92 (288) T ss_pred CCHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHC-CCHHHH T ss_conf 50689999999987--------------899988868887899999999988608811378899999887621-584889 Q ss_pred HHHHHHHHHHHCHH------HHHHHHHHHHHHHH-HHHHHHHHHHHHCCCCCCHHH------HHHHHHHHHHHHHHHHHH Q ss_conf 88764221100002------33566999999974-211111000000000000000------124577765433223554 Q gi|254780592|r 124 LDAINRAQRPDIPD------WQLISAKGSVLAQM-GKHSEALIEYERALELSPNES------SIVSNIAMSYLLMGDLKT 190 (271) Q Consensus 124 ~~~~~~a~~~~p~~------~~~~~~lg~~~~~~-~~~~~A~~~~~~~~~~~p~~~------~~~~~~~~~~~~~g~~~~ 190 (271) ...++++++.-..- +..+..+|.+|..- .++++|+..|+++-+-..... .-+.-.+..--..++|.+ T Consensus 93 v~cL~~aieIyt~~Grf~~aAk~~~~iaEiyEsdl~d~ekaI~~YE~Aae~yk~ees~ssANKC~lKvA~yaa~leqY~~ 172 (288) T KOG1586 93 VNCLEKAIEIYTDMGRFTMAAKHHIEIAEIYESDLQDFEKAIAHYEQAAEYYKGEESVSSANKCLLKVAQYAAQLEQYSK 172 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999852277787751113999986227779999999999999970015552688999999999999999999 Q ss_pred HHHHHHHHHHCCCCCH-------HHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCC---HHHHHHHHHHH Q ss_conf 3599999995389854-------788899999998699899999999862917286---65999999998 Q gi|254780592|r 191 AEEKLRFASQMIGADS-------RIRQNLALVVGLQGRMKEAYSIASQELSPEEAT---RNIKYIKSILS 250 (271) Q Consensus 191 A~~~~~~al~~~p~~~-------~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~---~~~~~l~~~l~ 250 (271) |+..|++....+-+++ +..+.-|++..-.++.-.+...+++-...+|+- +.-.+++.++. T Consensus 173 Ai~iyeqva~~s~~n~LLKys~KdyflkAgLChl~~~D~v~a~~ALeky~~~dP~F~dsREckflk~L~~ 242 (288) T KOG1586 173 AIDIYEQVARSSLDNNLLKYSAKDYFLKAGLCHLCKADEVNAQRALEKYQELDPAFTDSRECKFLKDLLD 242 (288) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH T ss_conf 9999999999860364877678889999998867302077899999988753986464088799999999 No 224 >KOG1310 consensus Probab=95.78 E-value=0.023 Score=32.18 Aligned_cols=85 Identities=14% Similarity=-0.018 Sum_probs=35.9 Q ss_pred CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC---CHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0012446665566327521001355553210000---0112588764221100002335669999999742111110000 Q gi|254780592|r 85 RTAQALAVMRQVAILYPEDQEVLAAYGKSLANAG---YLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEY 161 (271) Q Consensus 85 ~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g---~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~ 161 (271) +...++..+-++++..|.....+..++.+++..+ +--.|+...-.++.++|....+++.++.++..++++.+|++.. T Consensus 389 ~~~~~i~~~s~a~q~~~~~~~~l~nraa~lmkRkW~~d~~~AlrDch~Alrln~s~~kah~~la~aL~el~r~~eal~~~ 468 (758) T KOG1310 389 IVSGAISHYSRAIQYVPDAIYLLENRAAALMKRKWRGDSYLALRDCHVALRLNPSIQKAHFRLARALNELTRYLEALSCH 468 (758) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 88778999887766455026788768999986411430789987677625678388888999999999976687753128 Q ss_pred HHCCCCCC Q ss_conf 00000000 Q gi|254780592|r 162 ERALELSP 169 (271) Q Consensus 162 ~~~~~~~p 169 (271) .......| T Consensus 469 ~alq~~~P 476 (758) T KOG1310 469 WALQMSFP 476 (758) T ss_pred HHHHHCCC T ss_conf 98751693 No 225 >pfam04348 LppC LppC putative lipoprotein. This family includes several bacterial outer membrane antigens, whose molecular function is unknown. Probab=95.78 E-value=0.12 Score=27.82 Aligned_cols=84 Identities=18% Similarity=0.141 Sum_probs=32.1 Q ss_pred HHHHHHHHCCHHHHHHHHHHH--HCCCCC-CHHHHHHHHHHHHHCCHHHHHHHHHHH--HHHHC-HHHHHHHHHHHHHHH Q ss_conf 999987410012446665566--327521-001355553210000011258876422--11000-023356699999997 Q gi|254780592|r 77 ADVLRRVGRTAQALAVMRQVA--ILYPED-QEVLAAYGKSLANAGYLDEGLDAINRA--QRPDI-PDWQLISAKGSVLAQ 150 (271) Q Consensus 77 a~~l~~~g~~~~A~~~~~~al--~~~p~~-~~~~~~~~~~~~~~g~~~~A~~~~~~a--~~~~p-~~~~~~~~lg~~~~~ 150 (271) +.++...|++.+|..++.++. .++|.. .+.....+.+....|+++.|...+... ....+ ...+.+..++.++.. T Consensus 31 a~a~i~eg~~~qA~~ll~~l~~~~L~~~q~~e~~Ll~A~l~~~~~~~~~Al~~L~~~~~~~L~~~q~~r~~~l~A~~~~~ 110 (535) T pfam04348 31 ARVLIEENKIAQAQALLRELSVQQLTDEQKLDKSLLEAELAAAKRQNEVALQQLSHQHLSGLSPSQQSRYYEGRARIAEA 110 (535) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 99999779999999999974041199999999999999999975999999998557981107999999999999999997 Q ss_pred HHHHHHHHHH Q ss_conf 4211111000 Q gi|254780592|r 151 MGKHSEALIE 160 (271) Q Consensus 151 ~~~~~~A~~~ 160 (271) .|++-+|... T Consensus 111 ~~~~l~a~r~ 120 (535) T pfam04348 111 RGDAIEAAKA 120 (535) T ss_pred CCCHHHHHHH T ss_conf 6986999999 No 226 >pfam04053 Coatomer_WDAD Coatomer WD associated region. This region is composed of WD40 repeats. Probab=95.68 E-value=0.13 Score=27.59 Aligned_cols=129 Identities=17% Similarity=0.134 Sum_probs=71.5 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99874100124466655663275210013555532100000112588764221100002335669999999742111110 Q gi|254780592|r 79 VLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEAL 158 (271) Q Consensus 79 ~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~ 158 (271) .....|+++++...+...--+.|. .-...+...+...|..+.|+... .+++..+. +....|+.+.|. T Consensus 264 ~av~r~d~~~~~~~~~~~~l~~p~--~~~~~i~~fLe~~G~~e~AL~~~--------~D~~~rFe---LAl~lG~L~~A~ 330 (435) T pfam04053 264 TALLRKDYEEVLRIIANSNLLPPK--DQGQKIIRFLEKKGYPELALQFV--------TDPDTRFD---LALELGNLDVAL 330 (435) T ss_pred HHHHCCCHHHHHHHHHHCCCCCCH--HHHHHHHHHHHHCCCHHHHHHHC--------CCHHHHHH---HHHHCCCHHHHH T ss_conf 999757788999876421336873--46679999999779788887660--------89047616---768619999999 Q ss_pred HHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 000000000000001245777654332235543599999995389854788899999998699899999999862 Q gi|254780592|r 159 IEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 159 ~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) +...+. ++..-|..+|...+.+|+++-|+.+|+++- -+..|..+|...|+.+.=.++.+.+- T Consensus 331 ~~A~~~-----~~~~~Wk~Lg~~AL~~g~~~lAe~~~~k~~--------d~~~LllLy~~tG~~~~L~kl~~~a~ 392 (435) T pfam04053 331 EIAKEL-----DDEHKWKRLGDAALSQGNIKLAEEAYQKAK--------DFDKLLLLYLSTGNKEKLKKLAKIAE 392 (435) T ss_pred HHHHHH-----CCHHHHHHHHHHHHHCCCHHHHHHHHHHHC--------CHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 999873-----679999999999997499589999999742--------53455999998499999999999999 No 227 >smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism. Probab=95.65 E-value=0.0097 Score=34.44 Aligned_cols=31 Identities=16% Similarity=0.171 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC Q ss_conf 9999999998741001244666556632752 Q gi|254780592|r 72 IGIVYADVLRRVGRTAQALAVMRQVAILYPE 102 (271) Q Consensus 72 ~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~ 102 (271) ++.++|.+|...|++++|+..++++++++|+ T Consensus 3 ~~~~lg~~y~~~~~~~~Ai~~~~~al~~~p~ 33 (34) T smart00028 3 ALYNLGNAYLKLGDYDEALEYYEKALELDPN 33 (34) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf 8999899999823699999999997735999 No 228 >KOG1550 consensus Probab=95.61 E-value=0.14 Score=27.42 Aligned_cols=115 Identities=12% Similarity=0.011 Sum_probs=43.6 Q ss_pred CCHHHHHHHHHHHHHH-----CCCCHHHHHHHHHHHHHHH-----CCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC- Q ss_conf 0199999999999995-----7578799999999998741-----0012446665566327521001355553210000- Q gi|254780592|r 50 EQLLEVTSTIGLQYQS-----HTKNKMIGIVYADVLRRVG-----RTAQALAVMRQVAILYPEDQEVLAAYGKSLANAG- 118 (271) Q Consensus 50 ~~~~~A~~~~~~al~~-----~P~~~~~~~~~a~~l~~~g-----~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g- 118 (271) +|.+.|+.++..+.+. .-.++.+...+|.+|.+.. +...|..++.++.... ++.+...+|.++..-. T Consensus 263 ~d~e~a~~~l~~aa~~~~~~a~~~~~~a~~~lg~~Y~~g~~~~~~d~~~A~~~~~~aA~~g--~~~a~~~lg~~~~~g~~ 340 (552) T KOG1550 263 QDLESAIEYLKLAAESFKKAATKGLPPAQYGLGRLYLQGLGVEKIDYEKALKLYTKAAELG--NPDAQYLLGVLYETGTK 340 (552) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCC--CHHHHHHHHHHHHCCCC T ss_conf 6899999999999886256665047422479999998189875124999999999998559--90489998898762763 Q ss_pred --CHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH----HHHHHHHHHHHHCCCCC Q ss_conf --011258876422110000233566999999974----21111100000000000 Q gi|254780592|r 119 --YLDEGLDAINRAQRPDIPDWQLISAKGSVLAQM----GKHSEALIEYERALELS 168 (271) Q Consensus 119 --~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~----~~~~~A~~~~~~~~~~~ 168 (271) +...|..+|..+.... +..+...++.|+..- .+...|...+.++.+.. T Consensus 341 ~~d~~~A~~yy~~Aa~~G--~~~A~~~la~~y~~G~gv~r~~~~A~~~~k~aA~~g 394 (552) T KOG1550 341 ERDYRRAFEYYSLAAKAG--HILAIYRLALCYELGLGVERNLELAFAYYKKAAEKG 394 (552) T ss_pred CCCHHHHHHHHHHHHHCC--CHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCC T ss_conf 213889999999999759--989999999999747786887789999999999827 No 229 >COG5107 RNA14 Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor [RNA processing and modification] Probab=95.48 E-value=0.15 Score=27.15 Aligned_cols=174 Identities=9% Similarity=0.033 Sum_probs=99.8 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH----- Q ss_conf 999999957578799999999998741001244666556632752100135555321000001125887642211----- Q gi|254780592|r 58 TIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQR----- 132 (271) Q Consensus 58 ~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~----- 132 (271) .+.+++..-|-.++.|+.+..-+...++-..|+...+++....|. ....++.++....+-+.....|++.+. T Consensus 290 ~~~q~~~y~~~~~evw~dys~Y~~~isd~q~al~tv~rg~~~sps---L~~~lse~yel~nd~e~v~~~fdk~~q~L~r~ 366 (660) T COG5107 290 IHNQILDYFYYAEEVWFDYSEYLIGISDKQKALKTVERGIEMSPS---LTMFLSEYYELVNDEEAVYGCFDKCTQDLKRK 366 (660) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCCC---HHEEHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 999888885512999887899986234888899999824668972---32009987753066888763288899999999 Q ss_pred -------------HHCHH-HHH-----------HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH-HHHHHH Q ss_conf -------------00002-335-----------6699999997421111100000000000000012457776-543322 Q gi|254780592|r 133 -------------PDIPD-WQL-----------ISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAM-SYLLMG 186 (271) Q Consensus 133 -------------~~p~~-~~~-----------~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~-~~~~~g 186 (271) .+|.. .+. +..+-+...+..-.+.|...|.++.+..-....++..-+. -+..+| T Consensus 367 ys~~~s~~~s~~D~N~e~~~Ell~kr~~k~t~v~C~~~N~v~r~~Gl~aaR~~F~k~rk~~~~~h~vyi~~A~~E~~~~~ 446 (660) T COG5107 367 YSMGESESASKVDNNFEYSKELLLKRINKLTFVFCVHLNYVLRKRGLEAARKLFIKLRKEGIVGHHVYIYCAFIEYYATG 446 (660) T ss_pred HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEHHHHHHHHCC T ss_conf 86220031015668710148999998753236788999999987558999999999712577775234317788988618 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 355435999999953898547888999999986998999999998629 Q gi|254780592|r 187 DLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 187 ~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) ++..|-..|+--+...|+++-..+..-..+...++-..|..++++++. T Consensus 447 d~~ta~~ifelGl~~f~d~~~y~~kyl~fLi~inde~naraLFetsv~ 494 (660) T COG5107 447 DRATAYNIFELGLLKFPDSTLYKEKYLLFLIRINDEENARALFETSVE 494 (660) T ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 953178999877773799538999999999870847778999987688 No 230 >KOG2041 consensus Probab=95.47 E-value=0.15 Score=27.11 Aligned_cols=13 Identities=8% Similarity=-0.056 Sum_probs=4.9 Q ss_pred CHHHHHHHHHHHH Q ss_conf 8799999999998 Q gi|254780592|r 69 NKMIGIVYADVLR 81 (271) Q Consensus 69 ~~~~~~~~a~~l~ 81 (271) +|..|-.+|+.-. T Consensus 691 HprLWrllAe~Al 703 (1189) T KOG2041 691 HPRLWRLLAEYAL 703 (1189) T ss_pred CHHHHHHHHHHHH T ss_conf 6689999999999 No 231 >COG4259 Uncharacterized protein conserved in bacteria [Function unknown] Probab=95.42 E-value=0.16 Score=27.01 Aligned_cols=94 Identities=14% Similarity=0.152 Sum_probs=60.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCC----CHHHHHHHHHHH-----CCHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHH Q ss_conf 98999999999999985403685433----117999999842-----01999999999999957-578799999999998 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPRIS----IPDPESLENMNH-----EQLLEVTSTIGLQYQSH-TKNKMIGIVYADVLR 81 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~~~----~~~~~~~~~~~~-----~~~~~A~~~~~~al~~~-P~~~~~~~~~a~~l~ 81 (271) ++.+.-+-+++.++.++||...|.+. ....+-.+.+.. .+..+-...+.+.-.++ |--|..+-.+|..|. T Consensus 4 ~ktlsr~al~~av~~LagC~~gpKslY~w~gYq~tvyEy~K~~es~e~Q~~~le~~~ek~~ak~~~vpPG~HAhLGlLys 83 (121) T COG4259 4 LKTLSRLALLLAVAALAGCGGGPKSLYQWEGYQDTVYEYFKGDESKEAQTAALEKYLEKIGAKNGAVPPGYHAHLGLLYS 83 (121) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 48899999999999998805798641013786389999981788788999999999999751489999727888777886 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCH Q ss_conf 741001244666556632752100 Q gi|254780592|r 82 RVGRTAQALAVMRQVAILYPEDQE 105 (271) Q Consensus 82 ~~g~~~~A~~~~~~al~~~p~~~~ 105 (271) +.|+.++|+..|+.--.+.|+... T Consensus 84 ~~G~~e~a~~eFetEKalFPES~~ 107 (121) T COG4259 84 NSGKDEQAVREFETEKALFPESGV 107 (121) T ss_pred HCCCHHHHHHHHHHHHHHCCCCHH T ss_conf 049828999999876640766016 No 232 >smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism. Probab=95.28 E-value=0.017 Score=33.01 Aligned_cols=31 Identities=26% Similarity=0.288 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 2457776543322355435999999953898 Q gi|254780592|r 174 IVSNIAMSYLLMGDLKTAEEKLRFASQMIGA 204 (271) Q Consensus 174 ~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~ 204 (271) ++.++|.+|..+|++++|+.+|+++++++|+ T Consensus 3 ~~~~lg~~y~~~~~~~~Ai~~~~~al~~~p~ 33 (34) T smart00028 3 ALYNLGNAYLKLGDYDEALEYYEKALELDPN 33 (34) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf 8999899999823699999999997735999 No 233 >COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only] Probab=95.20 E-value=0.043 Score=30.50 Aligned_cols=65 Identities=12% Similarity=0.047 Sum_probs=38.7 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHH-HHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 201999999999999957578799999-99999874100124466655663275210013555532 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIV-YADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKS 113 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~-~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~ 113 (271) .+-+.+--..+..+++++|.|.+.|+. -+.-+...++.+.+.+.+.+++..+|+.+..|..+-+. T Consensus 120 ~k~y~~~~nI~~~~l~khP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~~p~iw~eyfr~ 185 (435) T COG5191 120 KKMYGEMKNIFAECLTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWIEYFRM 185 (435) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 878989999999999619987405656300035441567989999986531488984299999999 No 234 >KOG3364 consensus Probab=95.12 E-value=0.14 Score=27.32 Aligned_cols=79 Identities=13% Similarity=0.000 Sum_probs=49.1 Q ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHH-CCCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHH Q ss_conf 12457776543322---35543599999995-38985-478889999999869989999999986291728665999999 Q gi|254780592|r 173 SIVSNIAMSYLLMG---DLKTAEEKLRFASQ-MIGAD-SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKS 247 (271) Q Consensus 173 ~~~~~~~~~~~~~g---~~~~A~~~~~~al~-~~p~~-~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~ 247 (271) ....+++++++... |..+.+..++..++ -.|.. .+..+.||.-+...|+|+.+..+.+..|..+|++....-++. T Consensus 33 ~s~f~lAwaLV~S~~~~dv~~GI~iLe~l~~~~~~~~rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~Lk~ 112 (149) T KOG3364 33 QSQFNLAWALVRSRDTEDVQEGIVILEDLLKSAHPERRRECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQALELKE 112 (149) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 88899999997464258999769999998610684000132135677788886589999999999960877078999999 Q ss_pred HHHH Q ss_conf 9983 Q gi|254780592|r 248 ILSQ 251 (271) Q Consensus 248 ~l~~ 251 (271) .+.. T Consensus 113 ~ied 116 (149) T KOG3364 113 TIED 116 (149) T ss_pred HHHH T ss_conf 9999 No 235 >COG5107 RNA14 Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor [RNA processing and modification] Probab=95.08 E-value=0.2 Score=26.36 Aligned_cols=75 Identities=8% Similarity=0.035 Sum_probs=49.7 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 999999957578799999999998741001244666556632752100135555321000001125887642211 Q gi|254780592|r 58 TIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQR 132 (271) Q Consensus 58 ~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~ 132 (271) -++.-++.+|.|...|+.+..-|..+|.+++-.+.+++...-.|-.+.+|..+-.......++......|.+.+. T Consensus 30 rLRerIkdNPtnI~S~fqLiq~~~tq~s~~~~re~yeq~~~pfp~~~~aw~ly~s~ELA~~df~svE~lf~rCL~ 104 (660) T COG5107 30 RLRERIKDNPTNILSYFQLIQYLETQESMDAEREMYEQLSSPFPIMEHAWRLYMSGELARKDFRSVESLFGRCLK 104 (660) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 899986139431899999999984020489999999985488766437899986015545358899999999976 No 236 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=95.04 E-value=0.21 Score=26.30 Aligned_cols=97 Identities=18% Similarity=0.131 Sum_probs=45.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHH-HHCCCCCHHHHHHHHHHHHHC Q ss_conf 9999974211111000000000000000----12457776543322355435999999-953898547888999999986 Q gi|254780592|r 145 GSVLAQMGKHSEALIEYERALELSPNES----SIVSNIAMSYLLMGDLKTAEEKLRFA-SQMIGADSRIRQNLALVVGLQ 219 (271) Q Consensus 145 g~~~~~~~~~~~A~~~~~~~~~~~p~~~----~~~~~~~~~~~~~g~~~~A~~~~~~a-l~~~p~~~~~~~~La~~~~~~ 219 (271) +.+....|+...|...|...-...+-+. .++..-+.+++..|-|++-..-.+.. ...+|-...++..||+.-... T Consensus 101 at~~a~kgdta~AV~aFdeia~dt~~P~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d~n~mR~sArEALglAa~ka 180 (221) T COG4649 101 ATLLAQKGDTAAAVAAFDEIAADTSIPQIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGDGNPMRHSAREALGLAAYKA 180 (221) T ss_pred HHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHC T ss_conf 99986126478899989988525787601367999987898743564888998741304899701788999876888732 Q ss_pred CCHHHHHHHHHHHCCCCCCCHH Q ss_conf 9989999999986291728665 Q gi|254780592|r 220 GRMKEAYSIASQELSPEEATRN 241 (271) Q Consensus 220 g~~~eA~~~~~~~L~~~~~~~~ 241 (271) |++..|.+.|.+..+..+...+ T Consensus 181 gd~a~A~~~F~qia~Da~aprn 202 (221) T COG4649 181 GDFAKAKSWFVQIANDAQAPRN 202 (221) T ss_pred CCHHHHHHHHHHHHCCCCCCHH T ss_conf 2467799999999701469677 No 237 >pfam07719 TPR_2 Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515. Probab=95.03 E-value=0.013 Score=33.64 Aligned_cols=31 Identities=16% Similarity=0.125 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC Q ss_conf 9999999998741001244666556632752 Q gi|254780592|r 72 IGIVYADVLRRVGRTAQALAVMRQVAILYPE 102 (271) Q Consensus 72 ~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~ 102 (271) ++..+|.++...|++++|+..|+++++++|+ T Consensus 3 ~~~~~g~~~~~~g~~~~A~~~~~~al~l~p~ 33 (34) T pfam07719 3 ALYNLGLAYYKLGDYEEALEAYEKALELDPN 33 (34) T ss_pred HHHHCCHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 6875014876216699999999997726999 No 238 >pfam04781 DUF627 Protein of unknown function (DUF627). This family represents the N-terminal region of several plant proteins of unknown function. Probab=94.97 E-value=0.16 Score=26.92 Aligned_cols=27 Identities=22% Similarity=0.133 Sum_probs=12.4 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCC Q ss_conf 999987410012446665566327521 Q gi|254780592|r 77 ADVLRRVGRTAQALAVMRQVAILYPED 103 (271) Q Consensus 77 a~~l~~~g~~~~A~~~~~~al~~~p~~ 103 (271) |.-+.+.|++-+|+++.++.+..++++ T Consensus 3 A~~~~~~GnhiKAL~iied~is~h~~~ 29 (112) T pfam04781 3 ARDLFAKGDYIKALEIIEDSISDHGKD 29 (112) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCC T ss_conf 589987657599999999999872676 No 239 >pfam04781 DUF627 Protein of unknown function (DUF627). This family represents the N-terminal region of several plant proteins of unknown function. Probab=94.92 E-value=0.093 Score=28.43 Aligned_cols=47 Identities=19% Similarity=0.019 Sum_probs=29.1 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 43599999995389854788899999998699899999999862917 Q gi|254780592|r 190 TAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 190 ~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) .++.++.++..+.|......+.||.-+...--++++....+++|+.. T Consensus 62 ~sve~~s~~~~Lsp~~A~sL~~la~~l~s~~~Ykk~v~kak~aLsv~ 108 (112) T pfam04781 62 GSVECFSEDVLLSPDAANSLFELGQQLGSVMYYKKAVKKAKQALSVT 108 (112) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999988750187899999999999702889999999999871666 No 240 >pfam09613 HrpB1_HrpK Bacterial type III secretion protein (HrpB1_HrpK). This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia. Probab=94.88 E-value=0.23 Score=26.03 Aligned_cols=72 Identities=7% Similarity=-0.058 Sum_probs=55.0 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCH Q ss_conf 201999999999999957578799999999998741001244666556632752100135555321000001 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYL 120 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~ 120 (271) .++...+...++-.--..|+++++-+.-|-.+...|++++|+.++.++....|..+.....++.|+..+|+. T Consensus 23 ~~~~~D~e~lL~AlRvLRP~~~e~d~~dg~l~i~rg~w~dA~r~lr~v~~~~~~~~~~kAL~A~CL~a~~D~ 94 (156) T pfam09613 23 LADIADAEELLEALRILRPRLAELDLFDGWLYIGRGQWDDAARLLRELEGRASHLPLCKALMAQCLFALQDP 94 (156) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCH T ss_conf 488778999999999858983266587877877606589999999998626778489999999999874796 No 241 >pfam09613 HrpB1_HrpK Bacterial type III secretion protein (HrpB1_HrpK). This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia. Probab=94.73 E-value=0.13 Score=27.59 Aligned_cols=102 Identities=14% Similarity=0.045 Sum_probs=51.8 Q ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 42111110000000000000001245777654332235543599999995389854788899999998699899999999 Q gi|254780592|r 151 MGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIAS 230 (271) Q Consensus 151 ~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~ 230 (271) .++++++...+....-+.|+.+.+-.--|.+++..|++++|+..|+......+..+...-.++.++..+|+.+= ..+.+ T Consensus 23 ~~~~~D~e~lL~AlRvLRP~~~e~d~~dg~l~i~rg~w~dA~r~lr~v~~~~~~~~~~kAL~A~CL~a~~D~~W-r~~A~ 101 (156) T pfam09613 23 LADIADAEELLEALRILRPRLAELDLFDGWLYIGRGQWDDAARLLRELEGRASHLPLCKALMAQCLFALQDPAW-REYAD 101 (156) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHH-HHHHH T ss_conf 48877899999999985898326658787787760658999999999862677848999999999987479679-99999 Q ss_pred HHCCCCCCCHHHHHHHHHHHHHH Q ss_conf 86291728665999999998321 Q gi|254780592|r 231 QELSPEEATRNIKYIKSILSQRD 253 (271) Q Consensus 231 ~~L~~~~~~~~~~~l~~~l~~~~ 253 (271) .+|...+........+.++.+.+ T Consensus 102 evLe~~~~~~~~~LV~aL~~~~~ 124 (156) T pfam09613 102 EVLEEGGDPEALTLVKALLARAE 124 (156) T ss_pred HHHHHCCCCHHHHHHHHHHHHHH T ss_conf 99861899018999999998899 No 242 >pfam07719 TPR_2 Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515. Probab=94.69 E-value=0.024 Score=32.08 Aligned_cols=31 Identities=23% Similarity=0.324 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 2457776543322355435999999953898 Q gi|254780592|r 174 IVSNIAMSYLLMGDLKTAEEKLRFASQMIGA 204 (271) Q Consensus 174 ~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~ 204 (271) .+..+|.++..+|++++|+.+|+++++++|+ T Consensus 3 ~~~~~g~~~~~~g~~~~A~~~~~~al~l~p~ 33 (34) T pfam07719 3 ALYNLGLAYYKLGDYEEALEAYEKALELDPN 33 (34) T ss_pred HHHHCCHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 6875014876216699999999997726999 No 243 >COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only] Probab=94.68 E-value=0.069 Score=29.21 Aligned_cols=88 Identities=7% Similarity=-0.030 Sum_probs=71.2 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHH-HHHHHHHHCCHHHHHHHHHHHHHHHCHH Q ss_conf 999999575787999999999987410012446665566327521001355-5532100000112588764221100002 Q gi|254780592|r 59 IGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAA-YGKSLANAGYLDEGLDAINRAQRPDIPD 137 (271) Q Consensus 59 ~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~-~~~~~~~~g~~~~A~~~~~~a~~~~p~~ 137 (271) |.+.-.+-|+|+..|..++.--...|-+.+--.++.++++.+|.+.+.|.. -..-+...++++.+...+.+++..+|.. T Consensus 96 ~~R~tnkff~D~k~w~~y~~Y~~k~k~y~~~~nI~~~~l~khP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~~ 175 (435) T COG5191 96 LYRSTNKFFNDPKIWSQYAAYVIKKKMYGEMKNIFAECLTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSRS 175 (435) T ss_pred EEHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHCCHHHHHHHHHHHHCCCCCC T ss_conf 65512027887089999999999987898999999999961998740565630003544156798999998653148898 Q ss_pred HHHHHHHHH Q ss_conf 335669999 Q gi|254780592|r 138 WQLISAKGS 146 (271) Q Consensus 138 ~~~~~~lg~ 146 (271) +..|...-. T Consensus 176 p~iw~eyfr 184 (435) T COG5191 176 PRIWIEYFR 184 (435) T ss_pred CHHHHHHHH T ss_conf 429999999 No 244 >COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only] Probab=94.67 E-value=0.051 Score=30.00 Aligned_cols=65 Identities=23% Similarity=0.154 Sum_probs=55.5 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHH Q ss_conf 99998741001244666556632752100135555321000001125887642211000023356 Q gi|254780592|r 77 ADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLI 141 (271) Q Consensus 77 a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~ 141 (271) +..+...|+.+-|.+.+.+++.+-|+....|..+|......|+++.|...|++.++.+|.+.... T Consensus 2 a~~~~~~~D~~aaaely~qal~lap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~ga 66 (287) T COG4976 2 AYMLAESGDAEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHGGA 66 (287) T ss_pred CCHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHCCCHHHHHHHHHHHHCCCCCCCCCH T ss_conf 30010367848999999877504704443455402366630418999999998870796212223 No 245 >KOG3616 consensus Probab=94.45 E-value=0.29 Score=25.39 Aligned_cols=14 Identities=21% Similarity=0.185 Sum_probs=4.8 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 76543322355435 Q gi|254780592|r 179 AMSYLLMGDLKTAE 192 (271) Q Consensus 179 ~~~~~~~g~~~~A~ 192 (271) |..+.+.|+++.|+ T Consensus 713 g~hl~~~~q~daai 726 (1636) T KOG3616 713 GDHLEQIGQLDAAI 726 (1636) T ss_pred HHHHHHHHHHHHHH T ss_conf 58999987677998 No 246 >COG2912 Uncharacterized conserved protein [Function unknown] Probab=94.00 E-value=0.36 Score=24.82 Aligned_cols=80 Identities=14% Similarity=0.013 Sum_probs=66.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH Q ss_conf 01245777654332235543599999995389854788899999998699899999999862917286659999999983 Q gi|254780592|r 172 SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILSQ 251 (271) Q Consensus 172 ~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~~ 251 (271) .....++-..+...++++.|..+.++.+.++|.++.-+.--|.+|.++|.+.-|...++..+..-|.+.....++..+.+ T Consensus 181 ~rll~~lk~~~~~e~~~~~al~~~~r~l~l~P~dp~eirDrGliY~ql~c~~vAl~dl~~~~~~~P~~~~a~~ir~~l~~ 260 (269) T COG2912 181 SRLLRNLKAALLRELQWELALRVAERLLDLNPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHCPDDPIAEMIRAQLLE 260 (269) T ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 99999999999984056999999999875189984431670899986488445699999999849993578999999999 No 247 >COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms] Probab=93.91 E-value=0.38 Score=24.72 Aligned_cols=79 Identities=16% Similarity=0.064 Sum_probs=61.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH-------CCCCCCCHHH Q ss_conf 000124577765433223554359999999538985478889999999869989999999986-------2917286659 Q gi|254780592|r 170 NESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQE-------LSPEEATRNI 242 (271) Q Consensus 170 ~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~-------L~~~~~~~~~ 242 (271) ....++..++..+...|+++.+++.+++.+..+|-+...|..+-..|...|+...|+..|++. +..+|..... T Consensus 151 ~~~~~l~~lae~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~~~~edlgi~P~~~~~ 230 (280) T COG3629 151 LFIKALTKLAEALIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKKTLAEELGIDPAPELR 230 (280) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 99999999999987312188999999999862964649999999999975880689999999999754304899408999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254780592|r 243 KYIKSI 248 (271) Q Consensus 243 ~~l~~~ 248 (271) ..+... T Consensus 231 ~~y~~~ 236 (280) T COG3629 231 ALYEEI 236 (280) T ss_pred HHHHHH T ss_conf 989998 No 248 >pfam04184 ST7 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain. Probab=93.55 E-value=0.44 Score=24.32 Aligned_cols=153 Identities=14% Similarity=0.118 Sum_probs=68.8 Q ss_pred HCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 10012446665566327521001355553210000011258876422110000233566999999974211111000000 Q gi|254780592|r 84 GRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYER 163 (271) Q Consensus 84 g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~ 163 (271) -++..-+....++++++|+...++..++.- +..-.-+|...++++++...... ..+.-....+...++ T Consensus 182 Rnp~~RIkaA~~ALe~n~dCatAyiLLAEE--EatTI~eaE~~~kqALK~~E~~y----r~sq~~qh~~~~~ea------ 249 (540) T pfam04184 182 RNPQARIQAAYQALEINPECATAYVLLAEE--EATTITEAERLFKQALKAGESTY----RSSQQIASYGQQGDA------ 249 (540) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHH--HCCHHHHHHHHHHHHHHHHHHHH----HHHHHHCCCCCCHHH------ T ss_conf 191989999999987197418889998642--10469999999999999888887----655553055751024------ Q ss_pred CCCCCCHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHHHCCC-CCC Q ss_conf 00000000--0124577765433223554359999999538985--478889999999869989999999986291-728 Q gi|254780592|r 164 ALELSPNE--SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGAD--SRIRQNLALVVGLQGRMKEAYSIASQELSP-EEA 238 (271) Q Consensus 164 ~~~~~p~~--~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~--~~~~~~La~~~~~~g~~~eA~~~~~~~L~~-~~~ 238 (271) +-.+..+ .-+...++.+--++|+.++|++.++..++--|-- -.+..||..++.+...+.+...++.+-=+. -|. T Consensus 250 -~~rrd~nv~vyikRRlAMCaRklgr~kEAvKm~rDL~ke~p~~~~lniheNLiEalLe~QaYadvQavLakYDdislPk 328 (540) T pfam04184 250 -QHRRDTNVLVYIKRRLAMCARKLGRTREAVKMMRDLMKEFPLLSMLNIHENLLEALLELQAYADVQAVLAKYDDISLPK 328 (540) T ss_pred -HHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf -5523573143108899999987511899999999998757236776788999999999888899999997602356863 Q ss_pred CHHHHHHHHHH Q ss_conf 66599999999 Q gi|254780592|r 239 TRNIKYIKSIL 249 (271) Q Consensus 239 ~~~~~~l~~~l 249 (271) .+.+-|-.+++ T Consensus 329 SAticyTaALL 339 (540) T pfam04184 329 SATICYTAALL 339 (540) T ss_pred HHHHHHHHHHH T ss_conf 47898999999 No 249 >KOG0546 consensus Probab=93.32 E-value=0.17 Score=26.83 Aligned_cols=73 Identities=15% Similarity=0.014 Sum_probs=44.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHH Q ss_conf 0012457776543322355435999999953898547888999999986998999999998629172866599 Q gi|254780592|r 171 ESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIK 243 (271) Q Consensus 171 ~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~ 243 (271) -.....+.+..-+..+++..|......++..++..+.+++..+..+....++++|..-++.+....|.+..+. T Consensus 274 r~~~~~n~~~~~lk~~~~~~a~~~~~~~~~~~~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~~i~ 346 (372) T KOG0546 274 RFSIRRNLAAVGLKVKGRGGARFRTNEALRDERSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDKAIE 346 (372) T ss_pred CCCCCCCHHHHCCCCCCCCCCEECCCCCCCCCHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHH T ss_conf 0000023677512136777623216400236811182888877688764345666778887542285449999 No 250 >KOG0276 consensus Probab=93.29 E-value=0.48 Score=24.07 Aligned_cols=48 Identities=21% Similarity=0.096 Sum_probs=21.7 Q ss_pred HHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 10000011258876422110000233566999999974211111000000000 Q gi|254780592|r 114 LANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALE 166 (271) Q Consensus 114 ~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~ 166 (271) ..++|+++.|.+...++ +...-|-.+|......++...|.+++.++.. T Consensus 647 al~lgrl~iA~~la~e~-----~s~~Kw~~Lg~~al~~~~l~lA~EC~~~a~d 694 (794) T KOG0276 647 ALKLGRLDIAFDLAVEA-----NSEVKWRQLGDAALSAGELPLASECFLRARD 694 (794) T ss_pred HHHCCCHHHHHHHHHHH-----CCHHHHHHHHHHHHHCCCCHHHHHHHHHHCC T ss_conf 45527188899987752-----3667799999998641663458999975304 No 251 >KOG4814 consensus Probab=93.07 E-value=0.52 Score=23.86 Aligned_cols=87 Identities=14% Similarity=0.021 Sum_probs=37.3 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCCCCC------CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHH Q ss_conf 99999987410012446665566327521------001355553210000011258876422110000233566999999 Q gi|254780592|r 75 VYADVLRRVGRTAQALAVMRQVAILYPED------QEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVL 148 (271) Q Consensus 75 ~~a~~l~~~g~~~~A~~~~~~al~~~p~~------~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~ 148 (271) +-|.-+.+..+|.-++..|+.-+.-.|.+ ......+..||..+.+.|.|.+.++.+.+.+|..+-......... T Consensus 359 n~A~~~F~~~~Y~~s~~~y~~Sl~~i~~D~~~~~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~~d~~~~l~q~~~~~~~ 438 (872) T KOG4814 359 NTAKKLFKMEKYVVSIRFYKLSLKDIISDNYSDRFAKIQRALQVCYLKLEQLDNAVEVYQEAEEVDRQSPLCQLLMLQSF 438 (872) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 88887753776899999999988744750465789999999999982599887799999999860732388999999999 Q ss_pred HHHHHHHHHHHHH Q ss_conf 9742111110000 Q gi|254780592|r 149 AQMGKHSEALIEY 161 (271) Q Consensus 149 ~~~~~~~~A~~~~ 161 (271) ..-+..++|.... T Consensus 439 ~~E~~Se~AL~~~ 451 (872) T KOG4814 439 LAEDKSEEALTCL 451 (872) T ss_pred HHHCCHHHHHHHH T ss_conf 9832217789999 No 252 >TIGR00540 hemY_coli hemY protein; InterPro: IPR005254 This is a family of uncharacterised proteins encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (Escherichia coli and Haemophilus influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis. . Probab=92.69 E-value=0.59 Score=23.53 Aligned_cols=79 Identities=13% Similarity=0.113 Sum_probs=44.9 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHHCCCHHHHHHHHHHH--CCCCCCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 32235543599999995389854--78889999999869989999999986--291728665999999998321299999 Q gi|254780592|r 184 LMGDLKTAEEKLRFASQMIGADS--RIRQNLALVVGLQGRMKEAYSIASQE--LSPEEATRNIKYIKSILSQRDPWKKIA 259 (271) Q Consensus 184 ~~g~~~~A~~~~~~al~~~p~~~--~~~~~La~~~~~~g~~~eA~~~~~~~--L~~~~~~~~~~~l~~~l~~~~~~~~~a 259 (271) ...+...-...+++-.+.-.+.. -.+..+|..+...|.+.++...++.. +...+...+.....-.+.+........ T Consensus 311 ~~~~~~~~~~~~~~~~~~~~~~~~c~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 390 (409) T TIGR00540 311 KPEDPEKLLKLLEKRAKNVGDKPKCLLWRALGYLLVKNGEWIEAADAFKNVAALPERPDAYDLAWLADALDRLGDPELAE 390 (409) T ss_pred CCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHH T ss_conf 66765789999998874035203677777766677621204578888776664010010346889999987405656888 Q ss_pred HHH Q ss_conf 999 Q gi|254780592|r 260 KAR 262 (271) Q Consensus 260 ~a~ 262 (271) ..+ T Consensus 391 ~~~ 393 (409) T TIGR00540 391 AVR 393 (409) T ss_pred HHH T ss_conf 887 No 253 >pfam11207 DUF2989 Protein of unknown function (DUF2989). Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed. Probab=92.36 E-value=0.65 Score=23.27 Aligned_cols=77 Identities=18% Similarity=0.192 Sum_probs=54.3 Q ss_pred HHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC----CHHHHHHHHHHHHHCCCH Q ss_conf 9974211111000000000-00000012457776543322355435999999953898----547888999999986998 Q gi|254780592|r 148 LAQMGKHSEALIEYERALE-LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA----DSRIRQNLALVVGLQGRM 222 (271) Q Consensus 148 ~~~~~~~~~A~~~~~~~~~-~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~----~~~~~~~La~~~~~~g~~ 222 (271) +.+.|+ +.|...|.+.-. -.-+++.....+|..| ...|.+.++..+.+++++... ++++...||.++..+|++ T Consensus 117 Wsr~~d-~~A~~~fL~~E~~~~l~t~elq~aLAtyY-~k~d~~Kt~~Ll~~aL~l~~~~d~~n~eil~sLas~~~~~~~~ 194 (203) T pfam11207 117 WSRFGD-QAALRRFLALEGQPELETPELQYALATYY-IKRDPDKTIQLLYRALELYTPGDKFNPEILLSLASIYQKQKNY 194 (203) T ss_pred HHCCCC-HHHHHHHHHHCCCCCCCCHHHHHHHHHHH-HCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCC T ss_conf 621597-99999999863899889899999999999-8048899999999999805876757979999999999871421 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780592|r 223 KEAY 226 (271) Q Consensus 223 ~eA~ 226 (271) +.|. T Consensus 195 ~~AY 198 (203) T pfam11207 195 EQAY 198 (203) T ss_pred HHHE T ss_conf 3430 No 254 >KOG3783 consensus Probab=92.17 E-value=0.68 Score=23.13 Aligned_cols=90 Identities=8% Similarity=-0.027 Sum_probs=63.7 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH----HHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 99999999999957578799999999998741001244666556632752100----13555532100000112588764 Q gi|254780592|r 53 LEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE----VLAAYGKSLANAGYLDEGLDAIN 128 (271) Q Consensus 53 ~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~----~~~~~~~~~~~~g~~~~A~~~~~ 128 (271) ....+.+.+..+..|+++...+..+..+...|+.+.|+..+...+. +.-.+ -+..++.++.-+.+|..|...+. T Consensus 250 ~~~~~~Ll~~~~~~p~ga~wll~~ar~l~~~g~~eaa~~~~~~~v~--~~~kQ~~~l~~fE~aw~~v~~~~~~~aad~~~ 327 (546) T KOG3783 250 EECEKALKKYRKRYPKGALWLLMEARILSIKGNSEAAIDMESLSIP--IRMKQVKSLMVFERAWLSVGQHQYSRAADSFD 327 (546) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999986889974899850999998899970647989999982162--89999999999999999999999887761899 Q ss_pred HHHHHHCHHHHHHHHH Q ss_conf 2211000023356699 Q gi|254780592|r 129 RAQRPDIPDWQLISAK 144 (271) Q Consensus 129 ~a~~~~p~~~~~~~~l 144 (271) ..........-.|..+ T Consensus 328 ~L~desdWS~a~Y~Yf 343 (546) T KOG3783 328 LLRDESDWSHAFYTYF 343 (546) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9875544558889999 No 255 >KOG1464 consensus Probab=92.05 E-value=0.7 Score=23.05 Aligned_cols=52 Identities=10% Similarity=0.066 Sum_probs=41.4 Q ss_pred HHHCCHHHHHHHHHHHHHHCCCCHH----HHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 8420199999999999995757879----999999999874100124466655663 Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSHTKNKM----IGIVYADVLRRVGRTAQALAVMRQVAI 98 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~P~~~~----~~~~~a~~l~~~g~~~~A~~~~~~al~ 98 (271) +...+..+|+..|++.++..|...+ ++.....+..+.|++++-.+.+++.+. T Consensus 38 l~e~~p~~Al~sF~kVlelEgEKgeWGFKALKQmiKI~f~l~~~~eMm~~Y~qlLT 93 (440) T KOG1464 38 LKEDEPKEALSSFQKVLELEGEKGEWGFKALKQMIKINFRLGNYKEMMERYKQLLT 93 (440) T ss_pred CCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 44368799999999999416643426799999888777504667999999999999 No 256 >KOG1538 consensus Probab=91.81 E-value=0.75 Score=22.87 Aligned_cols=23 Identities=22% Similarity=0.416 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 99999998741001244666556 Q gi|254780592|r 74 IVYADVLRRVGRTAQALAVMRQV 96 (271) Q Consensus 74 ~~~a~~l~~~g~~~~A~~~~~~a 96 (271) +.+|.++.-.|++.+|..+|++. T Consensus 636 iLlA~~~Ay~gKF~EAAklFk~~ 658 (1081) T KOG1538 636 LLLADVFAYQGKFHEAAKLFKRS 658 (1081) T ss_pred HHHHHHHHHHHHHHHHHHHHHHC T ss_conf 89898987755689999999974 No 257 >pfam09797 NatB_MDM20 N-acetyltransferase B complex (NatB) non catalytic subunit. This is the non-catalytic subunit of the N-terminal acetyltransferase B complex (NatB). The NatB complex catalyses the acetylation of the amino-terminal methionine residue of all proteins beginning with Met-Asp or Met-Glu and of some proteins beginning with Met-Asn or Met-Met. In Saccharomyces cerevisiae this subunit is called MDM20 and in Schizosaccharomyces pombe it is called Arm1. NatB acetylates the Tpm1 protein and regulates and tropomyocin-actin interactions. This subunit is required by the NatB complex for the N-terminal acetylation of Tpm1. Probab=91.53 E-value=0.8 Score=22.69 Aligned_cols=47 Identities=17% Similarity=0.039 Sum_probs=37.4 Q ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 22355435999999953898547888999999986998999999998 Q gi|254780592|r 185 MGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 185 ~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) .+..-+|+..++.++..+|.+.....-|..+|..+|-.+-|...+.. T Consensus 192 ~~~l~~ai~lLE~~l~~sp~n~~~kLlLvrlY~~LG~~~~A~~~~~~ 238 (365) T pfam09797 192 LEYLLEAIALLENGLKKSPHNYDLKLLLIRLYLLLGAASLAYEHYEK 238 (365) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 68999999999999857985689999999999986991799999986 No 258 >PRK13184 pknD serine/threonine-protein kinase; Reviewed Probab=91.20 E-value=0.87 Score=22.49 Aligned_cols=94 Identities=11% Similarity=-0.059 Sum_probs=50.7 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHH-----HC--CHHHHHHHHHHHHCCCCCCHHHHHHHHHHH Q ss_conf 9842019999999999999575787---9999999999874-----10--012446665566327521001355553210 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNK---MIGIVYADVLRRV-----GR--TAQALAVMRQVAILYPEDQEVLAAYGKSLA 115 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~---~~~~~~a~~l~~~-----g~--~~~A~~~~~~al~~~p~~~~~~~~~~~~~~ 115 (271) -++...|++|...|++.-+.-|... ++.++.|..+..+ |+ +.+|++-|+++ .-.|.-+--+...+.+|. T Consensus 485 fl~~k~y~~a~~~yrri~~sfpgr~eg~ea~fr~git~le~a~~~~~~~~~~~al~~f~~l-h~~~~apleylgkalvyq 563 (933) T PRK13184 485 FLAEKLYDRAAIFYRRIRESFPGRKEGYEAQFRLGITLLEKASEQGGDQEFTQALSEFSYL-HGGVAAPLEYLGKALVYQ 563 (933) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCCCHHHHHHHHHHH T ss_conf 7736668999999999986289953007889986389999887604725799999999885-389887578888899998 Q ss_pred HHCCHHHHHHHHHHHHHHHCHHHHH Q ss_conf 0000112588764221100002335 Q gi|254780592|r 116 NAGYLDEGLDAINRAQRPDIPDWQL 140 (271) Q Consensus 116 ~~g~~~~A~~~~~~a~~~~p~~~~~ 140 (271) ++|++++-.+.+.-+++..|..++. T Consensus 564 ~~~~~~eeik~~~la~kry~~hp~i 588 (933) T PRK13184 564 RLGEYNEEIKSLLLALKRYSQHPEI 588 (933) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 7300789999999999756799873 No 259 >KOG0529 consensus Probab=91.09 E-value=0.89 Score=22.42 Aligned_cols=129 Identities=12% Similarity=0.045 Sum_probs=82.4 Q ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCC--HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH----CCHHH Q ss_conf 20199999999999995757879999999999874100--1244666556632752100135555321000----00112 Q gi|254780592|r 49 HEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRT--AQALAVMRQVAILYPEDQEVLAAYGKSLANA----GYLDE 122 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~--~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~----g~~~~ 122 (271) +..+++-+.+...+++.+|++..+|..+.-++.+.+.. ..-+.+.+++++.+|.+-..|.-.-.+.... ....+ T Consensus 88 ~~~ld~eL~~~~~~L~~npksY~aW~hR~w~L~~~p~~~~~~EL~lcek~L~~D~RNfh~W~YRRfV~~~~~~~~~~~~~ 167 (421) T KOG0529 88 QALLDEELKYVESALKVNPKSYGAWHHRKWVLQKNPHSDWNTELQLCEKALKQDPRNFHAWHYRRFVVEQAERSRNLEKE 167 (421) T ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCCCHH T ss_conf 87628889999999985732688999999999748982388899999998713855454168999999987313445356 Q ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH------------HHHHHHHHHCCCCCCHHHHHHHH Q ss_conf 58876422110000233566999999974211------------11100000000000000012457 Q gi|254780592|r 123 GLDAINRAQRPDIPDWQLISAKGSVLAQMGKH------------SEALIEYERALELSPNESSIVSN 177 (271) Q Consensus 123 A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~------------~~A~~~~~~~~~~~p~~~~~~~~ 177 (271) -+++..+.+..++.+..+|..+..++...-.. ..-.+.-..++-.+|++...|.. T Consensus 168 El~ftt~~I~~nfSNYsaWhyRs~lL~~l~~~~~~g~~~~~~~l~sEle~v~saiFTdp~DqS~WfY 234 (421) T KOG0529 168 ELEFTTKLINDNFSNYSAWHYRSLLLSTLHPKEADGNFMPKELLQSELEMVHSAIFTDPEDQSCWFY 234 (421) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCEEEE T ss_conf 7888999986263323578889999987513334676577798999999999988458554650242 No 260 >pfam00637 Clathrin Region in Clathrin and VPS. Each region is about 140 amino acids long. The regions are composed of multiple alpha helical repeats. They occur in the arm region of the Clathrin heavy chain. Probab=91.07 E-value=0.89 Score=22.41 Aligned_cols=39 Identities=10% Similarity=0.177 Sum_probs=14.6 Q ss_pred HHHHHHCCHHHHHHHHHHHHC-CCCCCHHHHHHHHHHHHH Q ss_conf 998741001244666556632-752100135555321000 Q gi|254780592|r 79 VLRRVGRTAQALAVMRQVAIL-YPEDQEVLAAYGKSLANA 117 (271) Q Consensus 79 ~l~~~g~~~~A~~~~~~al~~-~p~~~~~~~~~~~~~~~~ 117 (271) .+...|.+.+-..+++.++.. ...++.....+..++... T Consensus 16 ~~~~~~~~~~l~~yLe~~~~~~~~~~~~~~t~L~~ly~~~ 55 (139) T pfam00637 16 LFEKAGLLEELIPYLESALKSNFSNDPALQTALLELYAKY 55 (139) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 9988799389999999998416210337999999999983 No 261 >pfam10255 Paf67 RNA polymerase I-associated factor PAF67. RNA polymerase I is a multisubunit enzyme and its transcription competence is dependent on the presence of PAF67. This family of proteins is conserved from worms to humans. Probab=90.84 E-value=0.94 Score=22.28 Aligned_cols=96 Identities=10% Similarity=0.024 Sum_probs=51.1 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC-------CH--------HHHHHHHHHHHHCCHHHHHHHHHHHH-- Q ss_conf 87999999999987410012446665566327521-------00--------13555532100000112588764221-- Q gi|254780592|r 69 NKMIGIVYADVLRRVGRTAQALAVMRQVAILYPED-------QE--------VLAAYGKSLANAGYLDEGLDAINRAQ-- 131 (271) Q Consensus 69 ~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~-------~~--------~~~~~~~~~~~~g~~~~A~~~~~~a~-- 131 (271) +...-++.-..+....+..+-++..+.. .+|.. .. +...+.++..-+|+|..|++.++..- T Consensus 74 ~~~~VLn~L~sLv~kS~I~e~l~~~~~g--~~~~~~~~~~g~~~l~~~LGYFSlIgLlRlhvLLGDY~~Alk~L~~Idl~ 151 (402) T pfam10255 74 NVYSVLNVLYSLVDKSQIFEQLEAEKAG--EDPSEVADEYGSSNLYKMLGYFSIIGLLRVHVLLGDYYLALKVLEPIDLS 151 (402) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCC--CCCHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCC T ss_conf 3488999999999987389999987447--99355533556641999989999999999987005689999975241446 Q ss_pred ------HHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf ------10000233566999999974211111000000000 Q gi|254780592|r 132 ------RPDIPDWQLISAKGSVLAQMGKHSEALIEYERALE 166 (271) Q Consensus 132 ------~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~ 166 (271) ..-+-....+...|-+|...+++.+|+..|...+- T Consensus 152 ~~~l~~kV~~~~vs~~YyvGFaYlMlrRY~DAir~f~~iL~ 192 (402) T pfam10255 152 KKGLYSKVPAAYVSLYYYVGFAYLMLRRYADAIRVFSQILI 192 (402) T ss_pred CCHHHCCCCHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 32010247404021245889999998658999999999999 No 262 >pfam09670 Cas_Cas02710 CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia). Probab=90.57 E-value=0.99 Score=22.13 Aligned_cols=58 Identities=9% Similarity=-0.058 Sum_probs=32.8 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHH--HHHHHHHHHHHH--HCCHHHHHHHHHHHHCC Q ss_conf 99998420199999999999995757879--999999999874--10012446665566327 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKM--IGIVYADVLRRV--GRTAQALAVMRQVAILY 100 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~--~~~~~a~~l~~~--g~~~~A~~~~~~al~~~ 100 (271) +...++.++|..|...++.+...-|.+.. .+..+...|... -++.+|...+++.+... T Consensus 138 a~~lfn~~~y~~A~~~l~~l~~rl~~~~~~~~~~~l~~~y~~WD~f~~~~A~~~L~~~~~~~ 199 (379) T pfam09670 138 AIRLFNKGDYGSALRILEDLARRLPGRKDYAAYKLLSEGYLAWDSFDHEEAADYLSKVLALL 199 (379) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 99988456889999999999873797457999999999999986558999999999988754 No 263 >COG3014 Uncharacterized protein conserved in bacteria [Function unknown] Probab=90.48 E-value=1 Score=22.08 Aligned_cols=74 Identities=18% Similarity=0.057 Sum_probs=42.3 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCH--HHHHHHHHHHHHHCCCCHHHHH---HHHHHHHHHHCCHHHHHH Q ss_conf 99999999985403685433117999999842019--9999999999995757879999---999999874100124466 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRISIPDPESLENMNHEQL--LEVTSTIGLQYQSHTKNKMIGI---VYADVLRRVGRTAQALAV 92 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~A~~~~~~al~~~P~~~~~~~---~~a~~l~~~g~~~~A~~~ 92 (271) +...|..+.++||..... ..++.+..++ .-+...|++..+.-|++....+ +-|.++...|++.++... T Consensus 8 I~s~I~~L~LTgC~N~~~-------~~~hf~atl~y~~~~~~~y~Q~~q~~kk~~~~il~~L~~Gl~a~~~~dya~S~~~ 80 (449) T COG3014 8 IMSVILGLSLTGCINYRM-------DLEHFNATLYYEESPKKAYEQSKQFTKKKKNALLWDLQNGLSALYARDYATSLGV 80 (449) T ss_pred HHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 344566763598776787-------7653236788875205589999886011268888756400789876667765305 Q ss_pred HHHHHH Q ss_conf 655663 Q gi|254780592|r 93 MRQVAI 98 (271) Q Consensus 93 ~~~al~ 98 (271) +.-+-+ T Consensus 81 ldAae~ 86 (449) T COG3014 81 LDAAEQ 86 (449) T ss_pred HHHHHH T ss_conf 899999 No 264 >pfam09986 DUF2225 Uncharacterized protein conserved in bacteria (DUF2225). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=90.21 E-value=1.1 Score=21.94 Aligned_cols=64 Identities=23% Similarity=0.221 Sum_probs=41.8 Q ss_pred HHHHHHHHHHHHH-------HHHHHHHHHHHHHCCC--C----CHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC Q ss_conf 4577765433223-------5543599999995389--8----5478889999999869989999999986291728 Q gi|254780592|r 175 VSNIAMSYLLMGD-------LKTAEEKLRFASQMIG--A----DSRIRQNLALVVGLQGRMKEAYSIASQELSPEEA 238 (271) Q Consensus 175 ~~~~~~~~~~~g~-------~~~A~~~~~~al~~~p--~----~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~ 238 (271) ...++++|-.+|+ +..|...|+++.+... . ...+.+.+|.+....|++++|...+.+.+..... T Consensus 121 ~LriAWlyR~~~~~e~E~~fl~~Al~~y~~ay~~e~~~~~~~~e~~l~YLigeL~rRlG~~~eA~~wfs~vi~~~~~ 197 (214) T pfam09986 121 CLRLAWLYRLLGDEEKELLFLKKALELYKEAYEKEDFPIEGIDEATLMYLIGELSRRLGNKEEALRWFSKVIGSPKA 197 (214) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf 99999986225885899999999999999999606688777118899999999999949999999999999639666 No 265 >pfam10602 RPN7 26S proteasome subunit RPN7. RPN7 (known as the non ATPase regulatory subunit 6 in higher eukaryotes) is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae to be required for structural integrity. The 26S proteasome is is involved in the ATP-dependent degradation of ubiquitinated proteins. Probab=90.09 E-value=1.1 Score=21.88 Aligned_cols=96 Identities=15% Similarity=0.099 Sum_probs=49.2 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC---CHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH--CHHHHHHH--- Q ss_conf 999999999987410012446665566327521---0013555532100000112588764221100--00233566--- Q gi|254780592|r 71 MIGIVYADVLRRVGRTAQALAVMRQVAILYPED---QEVLAAYGKSLANAGYLDEGLDAINRAQRPD--IPDWQLIS--- 142 (271) Q Consensus 71 ~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~---~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~--p~~~~~~~--- 142 (271) .++..+|.-|.+.|+.+.|+..|.++.+..... ......+-++-...++|........++.... +.++...+ T Consensus 37 ~~~~~l~~~y~~iGd~~~A~k~y~~~~~~~ts~~~~id~~l~~irv~i~~~d~~~v~~~i~ka~~~~e~ggDw~r~nrlk 116 (174) T pfam10602 37 MANLDLGEHYAKIGDLENALKAYSRAREYCTSLGHKIDMCLNLIRVGIFFNDWDHVSKYIEKAKSLIEKGGDWERRNRLK 116 (174) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 99999899999966799999999998831788459999999999999997009999999999985651445587898899 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf -999999974211111000000000 Q gi|254780592|r 143 -AKGSVLAQMGKHSEALIEYERALE 166 (271) Q Consensus 143 -~lg~~~~~~~~~~~A~~~~~~~~~ 166 (271) .-|......+++..|-..|..... T Consensus 117 ~y~Gl~~l~~r~f~~AA~~fl~~~~ 141 (174) T pfam10602 117 VYEGLALLAIRDFKEAAKLFLDSLS 141 (174) T ss_pred HHHHHHHHHHCHHHHHHHHHHCCCC T ss_conf 9999999971649999999963767 No 266 >COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms] Probab=89.96 E-value=0.53 Score=23.80 Aligned_cols=56 Identities=16% Similarity=0.090 Sum_probs=35.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 57776543322355435999999953898547888999999986998999999998 Q gi|254780592|r 176 SNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 176 ~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) .-.+..|...|.+.+|+...++++..+|-+...+..|-.++...|+--++.+.|++ T Consensus 283 gkva~~yle~g~~neAi~l~qr~ltldpL~e~~nk~lm~~la~~gD~is~~khyer 338 (361) T COG3947 283 GKVARAYLEAGKPNEAIQLHQRALTLDPLSEQDNKGLMASLATLGDEISAIKHYER 338 (361) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99999999759927899999877202704467899999999980650434238999 No 267 >PRK11619 lytic murein transglycosylase; Provisional Probab=89.67 E-value=1.2 Score=21.67 Aligned_cols=13 Identities=8% Similarity=0.044 Sum_probs=5.6 Q ss_pred HHHHHHHHCCHHH Q ss_conf 9999984201999 Q gi|254780592|r 42 ESLENMNHEQLLE 54 (271) Q Consensus 42 ~~~~~~~~~~~~~ 54 (271) ++...+..+|... T Consensus 39 ~A~~Al~~~d~~~ 51 (645) T PRK11619 39 QIKQAWDNRQMDV 51 (645) T ss_pred HHHHHHHCCCHHH T ss_conf 9999998889999 No 268 >KOG2063 consensus Probab=89.22 E-value=1.3 Score=21.47 Aligned_cols=162 Identities=16% Similarity=0.061 Sum_probs=87.3 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCC----CCCCHHHHHHHHHHHHHCCH--HHHHHHHHHHHHHHCHHH-------- Q ss_conf 9999999987410012446665566327----52100135555321000001--125887642211000023-------- Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILY----PEDQEVLAAYGKSLANAGYL--DEGLDAINRAQRPDIPDW-------- 138 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~----p~~~~~~~~~~~~~~~~g~~--~~A~~~~~~a~~~~p~~~-------- 138 (271) +..++..|...|++++|+.++.+...-. +.....+...-..+...+.. +-..++..-....+|... T Consensus 507 y~~Li~LY~~kg~h~~AL~ll~~l~d~~~~~d~~~~~~~e~ii~YL~~l~~~~~~Li~~y~~wvl~~~p~~gi~Ift~~~ 586 (877) T KOG2063 507 YRELIELYATKGMHEKALQLLRDLVDEDSDTDSFQLDGLEKIIEYLKKLGAENLDLILEYADWVLNKNPEAGIQIFTSED 586 (877) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHEEEEECCC T ss_conf 99999999862655999999999723644232114443788999998745653668898766543149412205530267 Q ss_pred ----HHHHH-HHHHHHHHHHHHHHHHHHHHCCCCCCHHH-HHHHHHHHHHHH--------HHHHHHHHHH--HHHHHH-- Q ss_conf ----35669-99999974211111000000000000000-124577765433--------2235543599--999995-- Q gi|254780592|r 139 ----QLISA-KGSVLAQMGKHSEALIEYERALELSPNES-SIVSNIAMSYLL--------MGDLKTAEEK--LRFASQ-- 200 (271) Q Consensus 139 ----~~~~~-lg~~~~~~~~~~~A~~~~~~~~~~~p~~~-~~~~~~~~~~~~--------~g~~~~A~~~--~~~al~-- 200 (271) ..... ...-+........++.++++.+....... ..+..+...|.. .++-+++.+. .++... T Consensus 587 ~~~~~sis~~~Vl~~l~~~~~~l~I~YLE~li~~~~~~~~~lht~ll~ly~e~v~~~~~~~~kg~e~~E~~~rekl~~~l 666 (877) T KOG2063 587 KQEAESISRDDVLNYLKSKEPKLLIPYLEHLISDNRLTSTLLHTVLLKLYLEKVLEQASTDGKGEEAPETTVREKLLDFL 666 (877) T ss_pred HHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHCCCCCHHHHHHHHHHHHH T ss_conf 66654689899998765518015689999986740155337899999999999851357111046320345889999876 Q ss_pred -----CCCC-------CHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf -----3898-------547888999999986998999999998629 Q gi|254780592|r 201 -----MIGA-------DSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 201 -----~~p~-------~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) .+|+ ....|...|.++.++|+.++|..++-..|. T Consensus 667 ~~s~~Y~p~~~L~~~~~~~l~ee~aill~rl~khe~aL~Iyv~~L~ 712 (877) T KOG2063 667 ESSDLYDPQLLLERLNGDELYEERAILLGRLGKHEEALHIYVHELD 712 (877) T ss_pred HHHCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 4412558422555454011899998887442017899999999962 No 269 >COG1747 Uncharacterized N-terminal domain of the transcription elongation factor GreA [Function unknown] Probab=88.99 E-value=1.3 Score=21.37 Aligned_cols=41 Identities=15% Similarity=0.087 Sum_probs=17.6 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 999999999575787999999999987410012446665566 Q gi|254780592|r 56 TSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVA 97 (271) Q Consensus 56 ~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al 97 (271) ...+.+.++-+-++...--.++..|.. ++...+..+|.+++ T Consensus 118 ~~lWer~ve~dfnDvv~~ReLa~~yEk-ik~sk~a~~f~Ka~ 158 (711) T COG1747 118 YSLWERLVEYDFNDVVIGRELADKYEK-IKKSKAAEFFGKAL 158 (711) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHH-HCHHHHHHHHHHHH T ss_conf 789999999511257879999999997-22054799999999 No 270 >COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only] Probab=88.43 E-value=1.4 Score=21.14 Aligned_cols=64 Identities=19% Similarity=0.139 Sum_probs=57.0 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH Q ss_conf 9999984201999999999999957578799999999998741001244666556632752100 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQE 105 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~ 105 (271) ...+.+....+.+|+...+.-++.+|.|......+-..|--.|++++|...++-+-++.|+... T Consensus 7 t~seLL~~~sL~dai~~a~~qVkakPtda~~RhflfqLlcvaGdw~kAl~Ql~l~a~l~p~~t~ 70 (273) T COG4455 7 TISELLDDNSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKALAQLNLAATLSPQDTV 70 (273) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCH T ss_conf 8999987313999999999998649866630567787886466499999998888652901026 No 271 >KOG2422 consensus Probab=88.36 E-value=1.5 Score=21.11 Aligned_cols=154 Identities=17% Similarity=0.080 Sum_probs=96.1 Q ss_pred HCCHHHHHHHHHHHHHH------------CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHH------------------ Q ss_conf 20199999999999995------------757879999999999874100124466655663------------------ Q gi|254780592|r 49 HEQLLEVTSTIGLQYQS------------HTKNKMIGIVYADVLRRVGRTAQALAVMRQVAI------------------ 98 (271) Q Consensus 49 ~~~~~~A~~~~~~al~~------------~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~------------------ 98 (271) ...|.+|...|.-+.+. +|-+.+.++.++.+...+|+.+-|...+++++- T Consensus 251 s~sYeqaq~~F~~av~~~d~n~v~~lL~ssPYHvdsLLqva~~~r~qgD~e~aadLieR~Ly~~d~a~hp~F~~~sg~cR 330 (665) T KOG2422 251 SNSYEQAQRDFYLAVIVHDPNNVLILLISSPYHVDSLLQVADIFRFQGDREMAADLIERGLYVFDRALHPNFIPFSGNCR 330 (665) T ss_pred CHHHHHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 60789999999989860077642565316884066899999999871224567889998999998874334565655536 Q ss_pred ---CCCCCCHHHH---HHHHHHHHHCCHHHHHHHHHHHHHHHCH-HHHHHHHHHHHH-HHHHHHHHHHHHHHHC-----C Q ss_conf ---2752100135---5553210000011258876422110000-233566999999-9742111110000000-----0 Q gi|254780592|r 99 ---LYPEDQEVLA---AYGKSLANAGYLDEGLDAINRAQRPDIP-DWQLISAKGSVL-AQMGKHSEALIEYERA-----L 165 (271) Q Consensus 99 ---~~p~~~~~~~---~~~~~~~~~g~~~~A~~~~~~a~~~~p~-~~~~~~~lg~~~-~~~~~~~~A~~~~~~~-----~ 165 (271) .+|++...+. ..-..+...|=|..|.++.+-.+.++|. ++-....+-.+| .+..+|+--++.++.. + T Consensus 331 L~y~~~eNR~FyL~l~r~m~~l~~RGC~rTA~E~cKlllsLdp~eDPl~~l~~ID~~ALrareYqwiI~~~~~~e~~n~l 410 (665) T KOG2422 331 LPYIYPENRQFYLALFRYMQSLAQRGCWRTALEWCKLLLSLDPSEDPLGILYLIDIYALRAREYQWIIELSNEPENMNKL 410 (665) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 76456026899999999999888667717899999999523776784157899999999877689999998889861538 Q ss_pred CCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHCCC Q ss_conf 0000000124577765433223---5543599999995389 Q gi|254780592|r 166 ELSPNESSIVSNIAMSYLLMGD---LKTAEEKLRFASQMIG 203 (271) Q Consensus 166 ~~~p~~~~~~~~~~~~~~~~g~---~~~A~~~~~~al~~~p 203 (271) ..-|+.... ..++..|+.... ...|...+.+|+..-| T Consensus 411 ~~~PN~~yS-~AlA~f~l~~~~~~~rqsa~~~l~qAl~~~P 450 (665) T KOG2422 411 SQLPNFGYS-LALARFFLRKNEEDDRQSALNALLQALKHHP 450 (665) T ss_pred HHCCCCHHH-HHHHHHHHHCCCHHHHHHHHHHHHHHHHHCC T ss_conf 556771478-9999999963872568999999999988571 No 272 >pfam11846 DUF3366 Domain of unknown function (DUF3366). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is about 200 amino acids in length. Probab=87.73 E-value=1.6 Score=20.87 Aligned_cols=44 Identities=20% Similarity=0.104 Sum_probs=25.5 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCC Q ss_conf 999999953898547888999999986998999999998629172 Q gi|254780592|r 193 EKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEE 237 (271) Q Consensus 193 ~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~ 237 (271) ..-++.++..| .+.++.+++.++..+|+.++|....+++...-| T Consensus 132 ~wa~~~l~~~P-~p~~y~~la~~~~~~G~~~~A~~~~~~a~~lyP 175 (193) T pfam11846 132 EQALKLLRYKP-RPAVYRRLALALALLGKPAEARDWMAQARYLYP 175 (193) T ss_pred HHHHHHHHCCC-CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC T ss_conf 99999987499-759999999999986999999999999998399 No 273 >PRK10802 peptidoglycan-associated outer membrane lipoprotein; Provisional Probab=85.86 E-value=0.82 Score=22.63 Aligned_cols=27 Identities=22% Similarity=0.492 Sum_probs=19.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 998999999999999985403685433 Q gi|254780592|r 11 FLKKIKCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 11 f~~~~~~~~~l~~~~~~~~~~~~~~~~ 37 (271) |.|-++.+++++.++.+++|++++... T Consensus 3 ~nk~~K~l~ia~~~l~LaACSS~~~~~ 29 (173) T PRK10802 3 LNKVLKGLMIALPVMAIAACSSNKNAS 29 (173) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 078999999999999998557999888 No 274 >KOG0890 consensus Probab=85.82 E-value=2 Score=20.23 Aligned_cols=60 Identities=8% Similarity=0.009 Sum_probs=40.3 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC Q ss_conf 999984201999999999999957578799999999998741001244666556632752 Q gi|254780592|r 43 SLENMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPE 102 (271) Q Consensus 43 ~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~ 102 (271) -..+-..|+++.|...|+++++.+|++...+...-...+..|++...+...+-.....++ T Consensus 1456 il~~e~~g~~~da~~Cye~~~q~~p~~~~~~~g~l~sml~~~~l~t~i~~~dg~~~~~se 1515 (2382) T KOG0890 1456 ILEHEASGNWADAAACYERLIQKDPDKEKHHSGVLKSMLAIQHLSTEILHLDGLIINRSE 1515 (2382) T ss_pred HHHHHHHCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCHHHHHHHHCCHHHCCCH T ss_conf 999874035889999999862589965320456777654213255787653012102678 No 275 >KOG0128 consensus Probab=83.11 E-value=2.7 Score=19.50 Aligned_cols=204 Identities=11% Similarity=0.064 Sum_probs=104.0 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-HC---------------- Q ss_conf 9999999995757879999999999874100124466655663275210013555532100-00---------------- Q gi|254780592|r 56 TSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLAN-AG---------------- 118 (271) Q Consensus 56 ~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~-~g---------------- 118 (271) ...+.+.++..|.-..-|+.+..-....|.+-.-...+++++.-.+.+...|..++...-. ++ T Consensus 298 ~~~~e~~~q~~~~~~q~~~~yidfe~~~G~p~ri~l~~eR~~~E~~~~~~~wi~y~~~~d~eLkv~~~~~~~~~ra~R~c 377 (881) T KOG0128 298 LFKFERLVQKEPIKDQEWMSYIDFEKKSGDPVRIQLIEERAVAEMVLDRALWIGYGVYLDTELKVPQRGVSVHPRAVRSC 377 (881) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCHHHCCC T ss_conf 87788776552788899999988887448853788888888886044688876554513520356543232151432178 Q ss_pred -----CHHHHHHHHHH----------HHHHHCHHHHHHHHHHHHHH-H------H-------HHHHHHHHHHHHCCCC-C Q ss_conf -----01125887642----------21100002335669999999-7------4-------2111110000000000-0 Q gi|254780592|r 119 -----YLDEGLDAINR----------AQRPDIPDWQLISAKGSVLA-Q------M-------GKHSEALIEYERALEL-S 168 (271) Q Consensus 119 -----~~~~A~~~~~~----------a~~~~p~~~~~~~~lg~~~~-~------~-------~~~~~A~~~~~~~~~~-~ 168 (271) -+..+.-.+.+ .+..+-+......+....+. . . ..+..|...+...... - T Consensus 378 p~tgdL~~rallAleR~re~~~vI~~~l~~~ls~~~~l~~~~~~~rr~~~~~~~s~~~s~lr~~F~~A~~eLt~~~~~~~ 457 (881) T KOG0128 378 PWTGDLWKRALLALERNREEITVIVQNLEKDLSMTVELHNDYLAYRRRCTNIIDSQDYSSLRAAFNHAWEELTELYGDQL 457 (881) T ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 52777899989999726752202998778888777888888999998613643166679999999999999999864312 Q ss_pred CHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHCCCCCHH-HHHHHHHHHHHCCCHHHHHHHHHHHCCCC--CC--CHHH Q ss_conf 000012457776543-322355435999999953898547-88899999998699899999999862917--28--6659 Q gi|254780592|r 169 PNESSIVSNIAMSYL-LMGDLKTAEEKLRFASQMIGADSR-IRQNLALVVGLQGRMKEAYSIASQELSPE--EA--TRNI 242 (271) Q Consensus 169 p~~~~~~~~~~~~~~-~~g~~~~A~~~~~~al~~~p~~~~-~~~~La~~~~~~g~~~eA~~~~~~~L~~~--~~--~~~~ 242 (271) ......+.+++..+. ..++.+.+...+...+.....+.. .|.....+-...|+...+..++++++..- +. ..-. T Consensus 458 Dt~~~~~q~wA~~E~sl~~nmd~~R~iWn~imty~~~~iag~Wle~~~lE~~~g~~~~~R~~~R~ay~~~~~~~~~~ev~ 537 (881) T KOG0128 458 DTRTEVLQLWAQVEASLLKNMDKAREIWNFIMTYGGGSIAGKWLEAINLEREYGDGPSARKVLRKAYSQVVDPEDALEVL 537 (881) T ss_pred HHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 25789999999999998513354367660544577304789999997679984876517999998872476832689999 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999998321299999 Q gi|254780592|r 243 KYIKSILSQRDPWKKIA 259 (271) Q Consensus 243 ~~l~~~l~~~~~~~~~a 259 (271) .+.+.+......|..+. T Consensus 538 ~~~~r~Ere~gtl~~~~ 554 (881) T KOG0128 538 EFFRRFEREYGTLESFD 554 (881) T ss_pred HHHHHHHHCCCCHHHHH T ss_conf 99999875056599986 No 276 >PRK10598 hypothetical protein; Provisional Probab=82.38 E-value=1.6 Score=20.85 Aligned_cols=38 Identities=18% Similarity=0.239 Sum_probs=27.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH Q ss_conf 98999999999999985403685433117999999842 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPRISIPDPESLENMNH 49 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 49 (271) +|++.....++.+..++||.+-...+....+..++++. T Consensus 1 mkk~~l~~~l~l~~llsGC~~l~qYsISEqEin~YL~~ 38 (186) T PRK10598 1 MKKFLLAAALLVSGLLVGCNQLTQYTISEQEINQYLAK 38 (186) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCEEECHHHHHHHHHH T ss_conf 91269999999999971435336440079999999987 No 277 >TIGR01004 PulS_OutS lipoprotein, PulS/OutS family; InterPro: IPR005699 This family comprises lipoproteins from four gamma proteobacterial species: PulS protein of Klebsiella pneumoniae (P20440 from SWISSPROT), the OutS protein of Erwinia chrysanthemi (Q01567 from SWISSPROT) and Pectobacterium chrysanthemi, and the functionally uncharacterised E. coli protein EtpO. PulS and OutS have been shown to interact with and facilitate insertion of secretins into the outer membrane, suggesting a chaperone-like, or piloting function for members of this family.; GO: 0008565 protein transporter activity, 0006886 intracellular protein transport. Probab=81.06 E-value=3.2 Score=19.04 Aligned_cols=73 Identities=21% Similarity=0.409 Sum_probs=41.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCC---------CHHHHHHHHHHHCCHHHHHHHHHHHHHH-C-CCCHHHHHHHHH Q ss_conf 9998999999999999985403685433---------1179999998420199999999999995-7-578799999999 Q gi|254780592|r 10 LFLKKIKCIVFLIPLLLFSSCHLNPRIS---------IPDPESLENMNHEQLLEVTSTIGLQYQS-H-TKNKMIGIVYAD 78 (271) Q Consensus 10 ~f~~~~~~~~~l~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~A~~~~~~al~~-~-P~~~~~~~~~a~ 78 (271) =|+|++-+++++++.+.++||++-.++. .+-.+.++.+.. .-.|..|++--=++ | |+|..++..--. T Consensus 4 n~lKkln~~~~~~~~v~lsgCqtPaPvqn~~~~~~a~Vp~~eQleQlAS--~vaa~rYLk~~CnRSDLPdd~~Il~~A~n 81 (136) T TIGR01004 4 NILKKLNVVLSLLVVVLLSGCQTPAPVQNRAIRKAAAVPASEQLEQLAS--VVAAARYLKVKCNRSDLPDDEAILKVAVN 81 (136) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHH--HHHHHHHHHHCCCCCCCCCHHHHHHHHHH T ss_conf 3789999999999999864047899888875555787887778999999--99998886420677899700579999999 Q ss_pred HHHHHH Q ss_conf 998741 Q gi|254780592|r 79 VLRRVG 84 (271) Q Consensus 79 ~l~~~g 84 (271) +-...| T Consensus 82 va~~kG 87 (136) T TIGR01004 82 VAVKKG 87 (136) T ss_pred HHHHHC T ss_conf 875631 No 278 >KOG1920 consensus Probab=79.11 E-value=3.7 Score=18.65 Aligned_cols=57 Identities=23% Similarity=0.194 Sum_probs=28.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHH----------HHCCCCCHHHH---HHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 776543322355435999999----------95389854788---8999999986998999999998629 Q gi|254780592|r 178 IAMSYLLMGDLKTAEEKLRFA----------SQMIGADSRIR---QNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 178 ~~~~~~~~g~~~~A~~~~~~a----------l~~~p~~~~~~---~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) -+..|...|+.++|+..++.+ .+..++..++. ..|+.-+..++++-||-.+....++ T Consensus 958 Aal~Ye~~GklekAl~a~~~~~dWr~~l~~a~ql~~~~de~~~~a~~L~s~L~e~~kh~eAa~il~e~~s 1027 (1265) T KOG1920 958 AALMYERCGKLEKALKAYKECGDWREALSLAAQLSEGKDELVILAEELVSRLVEQRKHYEAAKILLEYLS 1027 (1265) T ss_pred HHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC T ss_conf 7999997134899999999836699999998752477778999999999999874163359999999715 No 279 >pfam06474 MLTD_N MLTD_N. Probab=78.77 E-value=0.48 Score=24.05 Aligned_cols=24 Identities=21% Similarity=0.337 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHH Q ss_conf 999999999985403685433117 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRISIPD 40 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~~~~~ 40 (271) ...++++++.|+||++.+...... T Consensus 3 ~kail~~alLLaGCQst~~~~~~t 26 (93) T pfam06474 3 AKAVLAAALLLVGCQSTQAVSSTT 26 (93) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 899999999999813799999987 No 280 >PRK09967 putative outer membrane lipoprotein; Provisional Probab=78.15 E-value=2.8 Score=19.41 Aligned_cols=27 Identities=15% Similarity=0.299 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCHH Q ss_conf 999999999999985403685433117 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNPRISIPD 40 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~~~~~~~ 40 (271) |.....+++.+++++||++.+...++. T Consensus 3 k~~~~~~l~~~l~lsgC~s~~~~~t~~ 29 (160) T PRK09967 3 KHLVAPLIFTSLILTGCQSPQGKFTPE 29 (160) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCHH T ss_conf 789999999999998458999998989 No 281 >KOG4056 consensus Probab=77.84 E-value=4 Score=18.41 Aligned_cols=38 Identities=8% Similarity=0.183 Sum_probs=27.6 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHH Q ss_conf 99999999874100124466655663275210013555 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAY 110 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~ 110 (271) -+.+|+.|...|+.+++..++-.++...|.....+.-+ T Consensus 84 qv~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqLL~vl 121 (143) T KOG4056 84 QVQLGEELLAQGNEEEGAEHLANAIVVCGQPAQLLQVL 121 (143) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99868999976578888999988886168779999999 No 282 >KOG3807 consensus Probab=76.91 E-value=4.3 Score=18.25 Aligned_cols=137 Identities=12% Similarity=0.058 Sum_probs=57.3 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 74100124466655663275210013555532100000112588764221100002335669999999742111110000 Q gi|254780592|r 82 RVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEY 161 (271) Q Consensus 82 ~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~ 161 (271) +.-++.+-+.....+++++|+...++..++.-. ..-..+|...++++++...... .........|...+|. T Consensus 196 RERnp~~RI~~A~~ALeIN~eCA~AyvLLAEEE--a~Ti~~AE~l~k~ALka~e~~y----r~sqq~qh~~~~~da~--- 266 (556) T KOG3807 196 RERNPPARIKAAYQALEINNECATAYVLLAEEE--ATTIVDAERLFKQALKAGETIY----RQSQQCQHQSPQHEAQ--- 266 (556) T ss_pred HHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHCCCHHHH--- T ss_conf 731938889989998752801335787663675--6479999999999999878676----5678876414411356--- Q ss_pred HHCCCCCCHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 000000000001--245777654332235543599999995389854--7888999999986998999999998 Q gi|254780592|r 162 ERALELSPNESS--IVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS--RIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 162 ~~~~~~~p~~~~--~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~--~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) ..+..+.. +...++.+--++|+.++|+..++...+-.|-.+ .+..||-..+.+..-+.+-..++.+ T Consensus 267 ----~rRDtnvl~YIKRRLAMCARklGrlrEA~K~~RDL~ke~pl~t~lniheNLiEalLE~QAYADvqavLak 336 (556) T KOG3807 267 ----LRRDTNVLVYIKRRLAMCARKLGRLREAVKIMRDLMKEFPLLTMLNIHENLLEALLELQAYADVQAVLAK 336 (556) T ss_pred ----HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf ----6145615688899999999876249999999998764052899988889999999998988899999975 No 283 >KOG2114 consensus Probab=75.82 E-value=4.6 Score=18.07 Aligned_cols=61 Identities=13% Similarity=-0.015 Sum_probs=33.6 Q ss_pred CCHHHHHHHHHH-HCCHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 311799999984-201999999999999957578-79999999999874100124466655663 Q gi|254780592|r 37 SIPDPESLENMN-HEQLLEVTSTIGLQYQSHTKN-KMIGIVYADVLRRVGRTAQALAVMRQVAI 98 (271) Q Consensus 37 ~~~~~~~~~~~~-~~~~~~A~~~~~~al~~~P~~-~~~~~~~a~~l~~~g~~~~A~~~~~~al~ 98 (271) ..+.....+.+- +.-|.-|+...++. ..+|+. .+.+..+|.-++..|++++|...|-+.+. T Consensus 334 ek~le~kL~iL~kK~ly~~Ai~LAk~~-~~d~d~~~~i~~kYgd~Ly~Kgdf~~A~~qYI~tI~ 396 (933) T KOG2114 334 EKDLETKLDILFKKNLYKVAINLAKSQ-HLDEDTLAEIHRKYGDYLYGKGDFDEATDQYIETIG 396 (933) T ss_pred ECCHHHHHHHHHHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCC T ss_conf 165888999999764179999999863-898789999999987887744777889999998706 No 284 >pfam10345 Cohesin_load Cohesin loading factor. Cohesin_load is a common cohesin loading factor protein that is conserved in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes but dispensable in G2 when cohesion has been established. It is referred to as both Ssl3, in pombe, and Scc4, in S.cerevisiae. It complexes with Mis4. Probab=74.87 E-value=4.8 Score=17.92 Aligned_cols=65 Identities=14% Similarity=0.156 Sum_probs=28.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC---CCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHH Q ss_conf 7765433223554359999999538---985------4788899999998699899999999862917286659 Q gi|254780592|r 178 IAMSYLLMGDLKTAEEKLRFASQMI---GAD------SRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNI 242 (271) Q Consensus 178 ~~~~~~~~g~~~~A~~~~~~al~~~---p~~------~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~ 242 (271) ++....-.+++..|......+.... |+. +-.++..|..+...|+.+.|...|.+...+.....++ T Consensus 362 ~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~yL~gl~~q~~gdl~~A~~~y~~~~~~~~~~~el 435 (593) T pfam10345 362 IIWCAFVLGDWAKADRLNEFLRSLNQKLFEKLLKDLQPLLHYLEGLYAQSKGDLEAALYYYTKVAARKGTFREL 435 (593) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHH T ss_conf 99999865379889988999998865043014565679999999999987167999999986043168973289 No 285 >COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones] Probab=74.41 E-value=5 Score=17.85 Aligned_cols=27 Identities=11% Similarity=0.121 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 013555532100000112588764221 Q gi|254780592|r 105 EVLAAYGKSLANAGYLDEGLDAINRAQ 131 (271) Q Consensus 105 ~~~~~~~~~~~~~g~~~~A~~~~~~a~ 131 (271) +++.+++..|.+.++.+.+.++..+.+ T Consensus 116 ea~~n~aeyY~qi~D~~ng~~~~~~~~ 142 (412) T COG5187 116 EADRNIAEYYCQIMDIQNGFEWMRRLM 142 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999889999998740664899999999 No 286 >PRK11627 hypothetical protein; Provisional Probab=74.22 E-value=2.9 Score=19.29 Aligned_cols=23 Identities=35% Similarity=0.703 Sum_probs=15.9 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9899999999999998540368543 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) +|++ ++.+++++.|+||++.|.. T Consensus 2 mkk~--~~~l~a~~lL~gCas~p~~ 24 (192) T PRK11627 2 LKKI--LFPLVALFMLAGCATPPTT 24 (192) T ss_pred HHHH--HHHHHHHHHHHHHCCCCCE T ss_conf 2879--9999999999860699763 No 287 >COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones] Probab=73.59 E-value=5.2 Score=17.73 Aligned_cols=191 Identities=12% Similarity=0.033 Sum_probs=101.0 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHCC--------CCHHHHHHHHHHHHHHHCCHHHHH-------------------H Q ss_conf 7999999842019999999999999575--------787999999999987410012446-------------------6 Q gi|254780592|r 40 DPESLENMNHEQLLEVTSTIGLQYQSHT--------KNKMIGIVYADVLRRVGRTAQALA-------------------V 92 (271) Q Consensus 40 ~~~~~~~~~~~~~~~A~~~~~~al~~~P--------~~~~~~~~~a~~l~~~g~~~~A~~-------------------~ 92 (271) +..+....+..|+++|+..|.+.+.+.- ....+...++.+|...|++..--. + T Consensus 7 le~a~~~v~~~~~~~ai~~yk~iL~kg~s~dek~~nEqE~tvlel~~lyv~~g~~~~l~~~i~~sre~m~~ftk~k~~Ki 86 (421) T COG5159 7 LELANNAVKSNDIEKAIGEYKRILGKGVSKDEKTLNEQEATVLELFKLYVSKGDYCSLGDTITSSREAMEDFTKPKITKI 86 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCHHHHHH T ss_conf 99998764044599999999999657777126566478889999998997167734288887743899997340568999 Q ss_pred HHHHHHCCCCCCHHH---------------------------HHHHHHHHHHCCHHHHHHHHHHHH------HHHCHHHH Q ss_conf 655663275210013---------------------------555532100000112588764221------10000233 Q gi|254780592|r 93 MRQVAILYPEDQEVL---------------------------AAYGKSLANAGYLDEGLDAINRAQ------RPDIPDWQ 139 (271) Q Consensus 93 ~~~al~~~p~~~~~~---------------------------~~~~~~~~~~g~~~~A~~~~~~a~------~~~p~~~~ 139 (271) .+.+++..|..+..+ ..+...+...|.|.+|+....... ...+.... T Consensus 87 irtLiekf~~~~dsl~dqi~v~~~~iewA~rEkr~fLr~~Le~Kli~l~y~~~~YsdalalIn~ll~ElKk~DDK~~Li~ 166 (421) T COG5159 87 IRTLIEKFPYSSDSLEDQIKVLTALIEWADREKRKFLRLELECKLIYLLYKTGKYSDALALINPLLHELKKYDDKINLIT 166 (421) T ss_pred HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCEEE T ss_conf 99999966789854788999999999999888889999999999999998135277799988799999986167642010 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH--HHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCC---CCHHHH Q ss_conf 5669999999742111110000000000000--001-----245777654332235543599999995389---854788 Q gi|254780592|r 140 LISAKGSVLAQMGKHSEALIEYERALELSPN--ESS-----IVSNIAMSYLLMGDLKTAEEKLRFASQMIG---ADSRIR 209 (271) Q Consensus 140 ~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~--~~~-----~~~~~~~~~~~~g~~~~A~~~~~~al~~~p---~~~~~~ 209 (271) .+..-..+|....+..++...+..+...... .|+ .-..-|.......+|..|-.+|-++++-.. .+..+. T Consensus 167 vhllESKvyh~irnv~KskaSLTaArt~Ans~YCPpqlqa~lDL~sGIlhcdd~dyktA~SYF~Ea~Egft~l~~d~kAc 246 (421) T COG5159 167 VHLLESKVYHEIRNVSKSKASLTAARTLANSAYCPPQLQAQLDLLSGILHCDDRDYKTASSYFIEALEGFTLLKMDVKAC 246 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCCEEECCCCCHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 12566889999971024355778999885114788888878888636212045551058999999982402342429999 Q ss_pred HHHHHH---HHHCCCHHHHHHHHH Q ss_conf 899999---998699899999999 Q gi|254780592|r 210 QNLALV---VGLQGRMKEAYSIAS 230 (271) Q Consensus 210 ~~La~~---~~~~g~~~eA~~~~~ 230 (271) ..|-.+ -...++.+|-..+++ T Consensus 247 ~sLkYmlLSkIMlN~~~evk~vl~ 270 (421) T COG5159 247 VSLKYMLLSKIMLNRREEVKAVLR 270 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999999999999854999999981 No 288 >KOG4279 consensus Probab=73.59 E-value=5.2 Score=17.72 Aligned_cols=117 Identities=16% Similarity=0.152 Sum_probs=59.1 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC------CHHHHHHHHHHHH---HCCHHHHHHHHHHHHHHH-CHHH Q ss_conf 87999999999987410012446665566327521------0013555532100---000112588764221100-0023 Q gi|254780592|r 69 NKMIGIVYADVLRRVGRTAQALAVMRQVAILYPED------QEVLAAYGKSLAN---AGYLDEGLDAINRAQRPD-IPDW 138 (271) Q Consensus 69 ~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~------~~~~~~~~~~~~~---~g~~~~A~~~~~~a~~~~-p~~~ 138 (271) .++.-.++-..|+...+|+.-+.+.+.+.. -|+. ..+.+.++.++-+ -|+-++|+...-..++.. |-.+ T Consensus 200 ~~d~V~nlmlSyRDvQdY~amirLVe~Lk~-iP~t~~vve~~nv~f~YaFALNRRNr~GDRakAL~~~l~lve~eg~vap 278 (1226) T KOG4279 200 HPDTVSNLMLSYRDVQDYDAMIRLVEDLKR-IPDTLKVVETHNVRFHYAFALNRRNRPGDRAKALNTVLPLVEKEGPVAP 278 (1226) T ss_pred CHHHHHHHHHHHCCCCCHHHHHHHHHHHHH-CCCHHHHHCCCCEEEEEEEHHCCCCCCCCHHHHHHHHHHHHHHCCCCCC T ss_conf 788999988642130105889999999985-7205545416844798641111567986188899988898874589788 Q ss_pred HHHHHHHHHHHH---------HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 356699999997---------4211111000000000000000124577765433223 Q gi|254780592|r 139 QLISAKGSVLAQ---------MGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGD 187 (271) Q Consensus 139 ~~~~~lg~~~~~---------~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~ 187 (271) +.+..-|.+|.. .+..+.|+..|+++.+..|..... .|++.++...|. T Consensus 279 Dm~Cl~GRIYKDmF~~S~ytDa~s~~~a~~WyrkaFeveP~~~sG-IN~atLL~aaG~ 335 (1226) T KOG4279 279 DMYCLCGRIYKDMFIASNYTDAESLNHAIEWYRKAFEVEPLEYSG-INLATLLRAAGE 335 (1226) T ss_pred CEEEEECHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHCC-CCHHHHHHHHHH T ss_conf 625450135454543168765466788899999876169104226-029999997313 No 289 >cd07625 BAR_Vps17p The Bin/Amphiphysin/Rvs (BAR) domain of yeast Sorting Nexin Vps17p. BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions. Sorting nexins (SNXs) are Phox homology (PX) domain containing proteins that are involved in regulating membrane traffic and protein sorting in the endosomal system. SNXs differ from each other in their lipid-binding specificity, subcellular localization and specific function in the endocytic pathway. A subset of SNXs also contain BAR domains. The PX-BAR structural unit determines the specific membrane targeting of SNXs. Vsp17p forms a dimer with Vps5p, the yeast counterpart of human SNX1, and is part of the retromer complex that mediates the transport of the carboxypeptidase Y receptor Vps10p from endosomes to Golgi. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. Probab=73.52 E-value=5.2 Score=17.71 Aligned_cols=127 Identities=16% Similarity=0.047 Sum_probs=64.9 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHH Q ss_conf 99999999998741001244666556632752100135555321000001125887642211000023356699999997 Q gi|254780592|r 71 MIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQ 150 (271) Q Consensus 71 ~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~ 150 (271) .+.-.++......|..-..+...+ .+|....+|..+|.+....|+++.+....+.+.-.||-.+.....+..--.- T Consensus 46 K~rr~L~~~~~dlg~kl~~ls~~e----~h~gl~n~~~Klgkv~q~vgD~~~~qat~e~~TLgD~L~~~~~day~vKEtL 121 (230) T cd07625 46 KARKQLSLEEADFGQKLIQLSVEE----THHGLGNLYEKFGKVLTAVGDIDSIQATVDMATLYDGLEWISRDAYVVKEAL 121 (230) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCC----CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH T ss_conf 999998788899998987524322----6874678999999999996019999999878886142999874089999999 Q ss_pred HHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 4211111000000000-000000124577765433223554359999999538 Q gi|254780592|r 151 MGKHSEALIEYERALE-LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMI 202 (271) Q Consensus 151 ~~~~~~A~~~~~~~~~-~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~ 202 (271) .+++- -+.-+.++-. .......+-...+..-+.--+.|+|+..++.|.... T Consensus 122 TNRh~-lmREL~qAq~~t~sK~~~A~rlK~ss~i~p~KVDEAi~~LeeA~~~E 173 (230) T cd07625 122 TNRHL-LMRELIQAQQNTKSKQEAARRLKAKRDINPLKVDEAIRQLEEATKHE 173 (230) T ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 98899-99999999999999999999872567898777999999999999999 No 290 >PRK00022 lolB outer membrane lipoprotein LolB; Provisional Probab=72.41 E-value=3.2 Score=18.98 Aligned_cols=19 Identities=21% Similarity=0.391 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 9999999998540368543 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~ 36 (271) .++++++++++||++.|.. T Consensus 6 ~~l~~~~llLsgCat~p~~ 24 (203) T PRK00022 6 RLLPLAALLLAGCASLPPP 24 (203) T ss_pred HHHHHHHHHHHHCCCCCCC T ss_conf 9999999998661489998 No 291 >pfam11359 gpUL132 Glycoprotein UL132. Glycoprotein UL132 is a low-abundance structural component of Human cytomegalovirus (HCMV). The function of this protein is not fully understood. Probab=70.16 E-value=2.5 Score=19.70 Aligned_cols=32 Identities=31% Similarity=0.692 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 27875699999899999999999998540368543 Q gi|254780592|r 2 IILFCPVILFLKKIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 2 ~~~~~~~~~f~~~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) ||+||..-- .|.+.++++.+.+.++|...|.. T Consensus 53 Ii~Ycv~g~---sils~llVl~a~ly~sc~~r~gr 84 (235) T pfam11359 53 IIIYCVSGV---SILSFLLVLVAVLYSSCHTRPGR 84 (235) T ss_pred HHEEEECCH---HHHHHHHHHHHHHHHHHHCCCCC T ss_conf 311341026---99999999998777766417886 No 292 >pfam06957 COPI_C Coatomer (COPI) alpha subunit C-terminus. This family represents the C-terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit. Coatomer (COPI) is a large cytosolic protein complex which forms a coat around vesicles budding from the Golgi apparatus. Such coatomer-coated vesicles have been proposed to play a role in many distinct steps of intracellular transport. Note that many family members also contain the pfam04053 domain. Probab=69.49 E-value=6.4 Score=17.16 Aligned_cols=24 Identities=4% Similarity=-0.051 Sum_probs=18.5 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 999998420199999999999995 Q gi|254780592|r 42 ESLENMNHEQLLEVTSTIGLQYQS 65 (271) Q Consensus 42 ~~~~~~~~~~~~~A~~~~~~al~~ 65 (271) .+.+|...|.|+.|...+.+.+-. T Consensus 123 laadHvAAGsFesAm~LLnrQvGi 146 (421) T pfam06957 123 LAADHVAAGSFETAMRLLHDQLGV 146 (421) T ss_pred CHHHHHHCCCHHHHHHHHHHHHCC T ss_conf 678888617799999999986484 No 293 >pfam07720 TPR_3 Tetratricopeptide repeat. This Pfam entry includes tetratricopeptide-like repeats found in the LcrH/SycD-like chaperones. Probab=68.49 E-value=5 Score=17.84 Aligned_cols=25 Identities=20% Similarity=0.095 Sum_probs=10.1 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 9999999987410012446665566 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVA 97 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al 97 (271) ++.+|-.+...|++++|..++.-+. T Consensus 4 l~~~Aa~~~~k~ky~~A~~~y~~l~ 28 (34) T pfam07720 4 LYSLAAVFYLKGKYQEAEHLYALLA 28 (34) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 9998999987021347999999999 No 294 >KOG1463 consensus Probab=68.07 E-value=6.9 Score=16.97 Aligned_cols=40 Identities=18% Similarity=0.145 Sum_probs=15.2 Q ss_pred HHHCCCCCHHHHHHHHHHH--HHCCCHHHHHHHHHHHCCCCC Q ss_conf 9953898547888999999--986998999999998629172 Q gi|254780592|r 198 ASQMIGADSRIRQNLALVV--GLQGRMKEAYSIASQELSPEE 237 (271) Q Consensus 198 al~~~p~~~~~~~~La~~~--~~~g~~~eA~~~~~~~L~~~~ 237 (271) +++....+.++....|.++ ..+..|..|...|+..|..+| T Consensus 277 ~l~y~g~~i~AmkavAeA~~nRSLkdF~~AL~~yk~eL~~D~ 318 (411) T KOG1463 277 ALKYAGRDIDAMKAVAEAFGNRSLKDFEKALADYKKELAEDP 318 (411) T ss_pred HHHCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCH T ss_conf 886047335999999998637868999999999689871284 No 295 >pfam04190 DUF410 Protein of unknown function (DUF410). This family of proteins is from Caenorhabditis elegans and has no known function. The protein has some GO references indicating that the protein has a positive regulation of growth rate and is involved in nematode larval development. Probab=67.80 E-value=7 Score=16.94 Aligned_cols=26 Identities=12% Similarity=0.069 Sum_probs=12.2 Q ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHH Q ss_conf 78799999999998741001244666 Q gi|254780592|r 68 KNKMIGIVYADVLRRVGRTAQALAVM 93 (271) Q Consensus 68 ~~~~~~~~~a~~l~~~g~~~~A~~~~ 93 (271) .+|+.+..+|..+.+.|++.+|..+| T Consensus 88 Gdp~LH~~ig~~~~~e~~~~~Ae~Hf 113 (260) T pfam04190 88 GDPALHTLIAKKLVKEDNVESAENHF 113 (260) T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99999999999998758889998885 No 296 >TIGR03659 IsdE heme ABC transporter, heme-binding protein isdE. This family of ABC substrate-binding proteins is observed primarily in close proximity with proteins localized to the cell wall and bearing the NEAT (NEAr Transporter, pfam05031) heme-binding domain. IsdE has been shown to bind heme and is involved in the process of scavenging heme for the purpose of obtaining iron. Probab=67.22 E-value=5.4 Score=17.62 Aligned_cols=23 Identities=13% Similarity=0.205 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999999998540368543 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) .+.++++++++++++||+++... T Consensus 3 ~~~~~~~~~~~~~l~~C~~~~~~ 25 (289) T TIGR03659 3 ILSLILLALLSLGLTGCSSSKAE 25 (289) T ss_pred HHHHHHHHHHHHHHHHCCCCCCC T ss_conf 29999999999998432899877 No 297 >KOG0687 consensus Probab=66.45 E-value=7.4 Score=16.78 Aligned_cols=114 Identities=11% Similarity=0.054 Sum_probs=72.2 Q ss_pred HHHHHHHHHHCCCC------HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC------CCHHHHHHHHHHHHHC----C Q ss_conf 99999999957578------799999999998741001244666556632752------1001355553210000----0 Q gi|254780592|r 56 TSTIGLQYQSHTKN------KMIGIVYADVLRRVGRTAQALAVMRQVAILYPE------DQEVLAAYGKSLANAG----Y 119 (271) Q Consensus 56 ~~~~~~al~~~P~~------~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~------~~~~~~~~~~~~~~~g----~ 119 (271) ++.++..++..-+| .+++++.|+-|-+.|+.+.|.+.+.+..+..-. -......+|..|.... . T Consensus 84 i~eld~~iedaeenlGE~ev~ea~~~kaeYycqigDkena~~~~~~t~~ktvs~g~kiDVvf~~iRlglfy~D~~lV~~~ 163 (393) T KOG0687 84 IKELDEKIEDAEENLGESEVREAMLRKAEYYCQIGDKENALEALRKTYEKTVSLGHKIDVVFYKIRLGLFYLDHDLVTES 163 (393) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 99999999999873122789999999999999860387799999999887764053035688888778862108899988 Q ss_pred HHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH Q ss_conf 112588764221100002335669999999742111110000000000000 Q gi|254780592|r 120 LDEGLDAINRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPN 170 (271) Q Consensus 120 ~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~ 170 (271) .++|...+++.-..+..+ ..-...|.......++.+|...|..++..... T Consensus 164 iekak~liE~GgDWeRrN-RlKvY~Gly~msvR~Fk~Aa~Lfld~vsTFtS 213 (393) T KOG0687 164 IEKAKSLIEEGGDWERRN-RLKVYQGLYCMSVRNFKEAADLFLDSVSTFTS 213 (393) T ss_pred HHHHHHHHHHCCCHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 999899998278866665-68898889999997378899999987045451 No 298 >TIGR03362 VI_chp_7 type VI secretion-associated protein, VC_A0119 family. This protein family is one of two related families in type VI secretion systems that contain an ImpA-related N-terminal domain (pfam06812). Probab=66.28 E-value=7.5 Score=16.76 Aligned_cols=62 Identities=16% Similarity=0.061 Sum_probs=42.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 01245777654332235543599999995389854---788899999998699899999999862 Q gi|254780592|r 172 SSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADS---RIRQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 172 ~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~---~~~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) .......+......|..+.|+..++..+...++.. ..+..++.++...|+++-|...++... T Consensus 213 ~~~~~~eA~~l~~~~gl~~Al~~L~~~l~~~~s~R~rf~~rL~~ArL~~~~g~~~lA~~~l~~L~ 277 (301) T TIGR03362 213 WEELREEARALAAEGGLEAALQRLQQRLAQAREPRERFHWRLLLARLLEQAGKAELAQQLYAALD 277 (301) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 89999999999974799999999997532489979999999999999998698799999999999 No 299 >COG3317 NlpB Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=66.16 E-value=5.4 Score=17.60 Aligned_cols=29 Identities=14% Similarity=0.234 Sum_probs=19.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH Q ss_conf 9989999999999999854036854331179 Q gi|254780592|r 11 FLKKIKCIVFLIPLLLFSSCHLNPRISIPDP 41 (271) Q Consensus 11 f~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 41 (271) +++++ ++.++.++.++||++.+....+.. T Consensus 3 ~~~~l--v~~al~v~~LaaCSs~~~~~~q~~ 31 (342) T COG3317 3 SSAKL--VLGALLVLLLAACSSDSEYKRQVS 31 (342) T ss_pred HHHHH--HHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 17889--999999998741468850003344 No 300 >cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear. Probab=65.87 E-value=7.6 Score=16.72 Aligned_cols=14 Identities=0% Similarity=-0.078 Sum_probs=5.3 Q ss_pred HHHHHHHHHHHHHC Q ss_conf 99999999999957 Q gi|254780592|r 53 LEVTSTIGLQYQSH 66 (271) Q Consensus 53 ~~A~~~~~~al~~~ 66 (271) ..|++...+|++.| T Consensus 4 ~kAi~lv~~Av~~D 17 (75) T cd02684 4 EKAIALVVQAVKKD 17 (75) T ss_pred HHHHHHHHHHHHHH T ss_conf 88999999999988 No 301 >PRK11063 metQ DL-methionine transporter substrate-binding subunit; Provisional Probab=65.26 E-value=7.1 Score=16.87 Aligned_cols=25 Identities=20% Similarity=0.285 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 9998999999999999985403685 Q gi|254780592|r 10 LFLKKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 10 ~f~~~~~~~~~l~~~~~~~~~~~~~ 34 (271) |-+|.+.++..++.++.++||++.. T Consensus 3 ~~~k~~~~~~~l~~~l~l~gCg~~~ 27 (271) T PRK11063 3 FKFKTFAAVGALIGSLALVGCGQDE 27 (271) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 0299999999999999750058985 No 302 >smart00386 HAT HAT (Half-A-TPR) repeats. Present in several RNA-binding proteins. Structurally and sequentially thought to be similar to TPRs. Probab=65.10 E-value=7.9 Score=16.62 Aligned_cols=28 Identities=11% Similarity=0.142 Sum_probs=15.1 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 1999999999999957578799999999 Q gi|254780592|r 51 QLLEVTSTIGLQYQSHTKNKMIGIVYAD 78 (271) Q Consensus 51 ~~~~A~~~~~~al~~~P~~~~~~~~~a~ 78 (271) +.+.|...|++++...|.++..|..++. T Consensus 2 ~~~~~r~iyer~l~~~~~~~~~W~~y~~ 29 (33) T smart00386 2 DIERARKIYERALEKFPKSVELWLKYAE 29 (33) T ss_pred CHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 2889999999999977147999999987 No 303 >PRK09915 putative outer membrane efflux protein MdtP; Provisional Probab=64.83 E-value=7.1 Score=16.88 Aligned_cols=24 Identities=25% Similarity=0.336 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 989999999999999854036854 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPR 35 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~ 35 (271) +|.+.+..++.++++++||++.|. T Consensus 6 ~~r~~~~~~l~~~~lLagC~~~p~ 29 (488) T PRK09915 6 LSRLLLCSILGSTTLISGCALVRK 29 (488) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 769999999999999752668999 No 304 >KOG4151 consensus Probab=64.59 E-value=8 Score=16.56 Aligned_cols=95 Identities=8% Similarity=0.022 Sum_probs=67.9 Q ss_pred HHHHHCCHHHHHHHHHHHHHHCC----CCHHHHHHHHHHHHHH--HCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC Q ss_conf 99842019999999999999575----7879999999999874--10012446665566327521001355553210000 Q gi|254780592|r 45 ENMNHEQLLEVTSTIGLQYQSHT----KNKMIGIVYADVLRRV--GRTAQALAVMRQVAILYPEDQEVLAAYGKSLANAG 118 (271) Q Consensus 45 ~~~~~~~~~~A~~~~~~al~~~P----~~~~~~~~~a~~l~~~--g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~~~~g 118 (271) -..+.+++.++..-|..++..-| .....+.+.+.++.+. |++..++.-..-+....|....++...+.+|...+ T Consensus 62 ~~~~K~d~~~~~~~~~~~~~llp~~~~~~a~~~~~~~s~~m~~~l~~~~~~~~E~~la~~~~p~i~~~Ll~r~~~y~al~ 141 (748) T KOG4151 62 KLFQKRDYEGAMFRYDCAIKLLPKDHHVVATLRSNQASCYMQLGLGEYPKAIPECELALESQPRISKALLKRARKYEALN 141 (748) T ss_pred HHHHHHHHHCCCHHHHHHHEECCCCCHHHHHHHHHHHHHHHHCCCCCHHHHCCCHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 77665404135354412310045532035667778999986248543022237424565516317787734556778787 Q ss_pred CHHHHHHHHHHHHHHHCHHHH Q ss_conf 011258876422110000233 Q gi|254780592|r 119 YLDEGLDAINRAQRPDIPDWQ 139 (271) Q Consensus 119 ~~~~A~~~~~~a~~~~p~~~~ 139 (271) +++-|.+.+.-....+|.... T Consensus 142 k~d~a~rdl~i~~~~~p~~~~ 162 (748) T KOG4151 142 KLDLAVRDLRIVEKMDPSNVS 162 (748) T ss_pred HHHHHHHHHHHHHCCCCCCCH T ss_conf 788899999987347877521 No 305 >PRK13731 conjugal transfer surface exclusion protein TraT; Provisional Probab=64.27 E-value=3.7 Score=18.60 Aligned_cols=24 Identities=21% Similarity=0.328 Sum_probs=20.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999899999999999998540368 Q gi|254780592|r 10 LFLKKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 10 ~f~~~~~~~~~l~~~~~~~~~~~~ 33 (271) |-+|++..+.++.+.+.|+||+.. T Consensus 1 m~~kk~~~~~~~~~~l~lsGC~a~ 24 (243) T PRK13731 1 MKTKKLMMVALVSSTLALSGCGAM 24 (243) T ss_pred CCHHHHHHHHHHHHHHHHCCHHHH T ss_conf 933775799999999997156876 No 306 >KOG2758 consensus Probab=63.86 E-value=8.3 Score=16.47 Aligned_cols=176 Identities=11% Similarity=0.002 Sum_probs=93.2 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC--CCCHHHHHHHHH--HHHHCCHHHHHHHHHHHHH Q ss_conf 999999995757879999999999874100124466655663275--210013555532--1000001125887642211 Q gi|254780592|r 57 STIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYP--EDQEVLAAYGKS--LANAGYLDEGLDAINRAQR 132 (271) Q Consensus 57 ~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p--~~~~~~~~~~~~--~~~~g~~~~A~~~~~~a~~ 132 (271) ..+++-..-.|+..++++.+|...+.-|++..|-.++-....+.+ +....-..+|.. ..-+.+|+.|++.+.+..+ T Consensus 116 ~~L~e~ynf~~e~i~~lykyakfqyeCGNY~gAs~yLY~~r~l~~~~d~n~lsalwGKlASEIL~qnWd~A~edL~rLre 195 (432) T KOG2758 116 QHLQEHYNFTPERIETLYKYAKFQYECGNYSGASDYLYFYRALVSDPDRNYLSALWGKLASEILTQNWDGALEDLTRLRE 195 (432) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998629988999999999999875168531899999999963886652699999999999998648889999999999 Q ss_pred HH---CHH--HHHHHHHHHH----HHHHHHHHHHHHHHHHCCC--------CCCHHHHHHHHHHHHHHHH-HHHHHHHHH Q ss_conf 00---002--3356699999----9974211111000000000--------0000001245777654332-235543599 Q gi|254780592|r 133 PD---IPD--WQLISAKGSV----LAQMGKHSEALIEYERALE--------LSPNESSIVSNIAMSYLLM-GDLKTAEEK 194 (271) Q Consensus 133 ~~---p~~--~~~~~~lg~~----~~~~~~~~~A~~~~~~~~~--------~~p~~~~~~~~~~~~~~~~-g~~~~A~~~ 194 (271) .. +-. +.....+..+ +.-.=++.++.+..-...- +....+-.+..++.+.+.. .+....++. T Consensus 196 ~IDs~~f~~~~~~l~qRtWLiHWslfv~fnhpkgrd~iid~fly~p~YLNaIQt~cPhllRYLatAvvtnk~~rr~~lkd 275 (432) T KOG2758 196 YIDSKSFSTSAQQLQQRTWLIHWSLFVFFNHPKGRDTIIDMFLYQPPYLNAIQTSCPHLLRYLATAVVTNKRRRRNRLKD 275 (432) T ss_pred HHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHCHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 87036545489998888898878777650588705689999834777788998627899999999851033866788999 Q ss_pred HHHHHHCCC-CCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 999995389-85478889999999869989999999986 Q gi|254780592|r 195 LRFASQMIG-ADSRIRQNLALVVGLQGRMKEAYSIASQE 232 (271) Q Consensus 195 ~~~al~~~p-~~~~~~~~La~~~~~~g~~~eA~~~~~~~ 232 (271) +-+.++... +..+.....-.++...-+++.|...++.+ T Consensus 276 lvkVIqqE~ysYkDPiteFl~clyvn~DFdgAq~kl~eC 314 (432) T KOG2758 276 LVKVIQQESYSYKDPITEFLECLYVNYDFDGAQKKLREC 314 (432) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 999998750344782899998895011627899999998 No 307 >smart00745 MIT Microtubule Interacting and Trafficking molecule domain. Probab=63.20 E-value=8.5 Score=16.40 Aligned_cols=13 Identities=23% Similarity=0.473 Sum_probs=4.3 Q ss_pred HCCHHHHHHHHHH Q ss_conf 1001244666556 Q gi|254780592|r 84 GRTAQALAVMRQV 96 (271) Q Consensus 84 g~~~~A~~~~~~a 96 (271) |++++|+.+|.++ T Consensus 22 g~y~eA~~~Y~~a 34 (77) T smart00745 22 GDYEEALELYKKA 34 (77) T ss_pred CCHHHHHHHHHHH T ss_conf 8899999999999 No 308 >PRK11443 hypothetical protein; Provisional Probab=62.69 E-value=5 Score=17.84 Aligned_cols=26 Identities=23% Similarity=0.434 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHH Q ss_conf 99999999998540368543311799 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRISIPDPE 42 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~~~~~~~ 42 (271) .+++.+++++++||+.+|.+..+..+ T Consensus 3 ~~~~~~~~lll~GCq~~p~t~ap~~~ 28 (120) T PRK11443 3 KFIAPLLALLVSGCQIDPYTHAPTLT 28 (120) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 58999999999666789624686556 No 309 >cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear. Probab=62.22 E-value=8.2 Score=16.52 Aligned_cols=14 Identities=43% Similarity=0.415 Sum_probs=5.4 Q ss_pred HCCHHHHHHHHHHH Q ss_conf 10012446665566 Q gi|254780592|r 84 GRTAQALAVMRQVA 97 (271) Q Consensus 84 g~~~~A~~~~~~al 97 (271) |+.++|+.+|++++ T Consensus 22 g~~~~Al~~Y~~gi 35 (79) T cd02679 22 GDKEQALAHYRKGL 35 (79) T ss_pred CCHHHHHHHHHHHH T ss_conf 89999999999999 No 310 >COG3056 Uncharacterized lipoprotein [Cell envelope biogenesis, outer membrane] Probab=62.10 E-value=8.9 Score=16.28 Aligned_cols=25 Identities=20% Similarity=0.348 Sum_probs=17.1 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9899999999999998540368543 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) -+..+.++.+...++|+||+..|.+ T Consensus 13 ~~t~k~L~~laa~~lLagC~a~~~t 37 (204) T COG3056 13 NMTKKILFPLAAIFLLAGCAAPPTT 37 (204) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCE T ss_conf 8899999999999999753799750 No 311 >pfam10579 Rapsyn_N Rapsyn N-terminal myristoylation and linker region. Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane. Probab=61.52 E-value=9.2 Score=16.21 Aligned_cols=20 Identities=0% Similarity=-0.167 Sum_probs=7.8 Q ss_pred HHHHCCHHHHHHHHHHHHHH Q ss_conf 98420199999999999995 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQS 65 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~ 65 (271) .+++.+.+.|+..+.+++++ T Consensus 16 LY~~~~~~~Av~~W~~~L~k 35 (80) T pfam10579 16 LYQSNNTEKALEIWRKVLEK 35 (80) T ss_pred HHHHHHHHHHHHHHHHHHHH T ss_conf 99971289999999999987 No 312 >TIGR03516 ppisom_GldI peptidyl-prolyl isomerase, gliding motility-associated. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldI is a FKBP-type peptidyl-prolyl cis-trans isomerase (pfam00254) linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockout of this gene abolishes the gliding phenotype. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. This family is only found in Bacteroidetes containing the suite of genes proposed to confer the gliding motility phenotype. Probab=61.39 E-value=9.2 Score=16.20 Aligned_cols=51 Identities=18% Similarity=0.185 Sum_probs=28.0 Q ss_pred HHHHHHHHHHHHHCCCC------CCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC Q ss_conf 99999999985403685------4331179999998420199999999999995757 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNP------RISIPDPESLENMNHEQLLEVTSTIGLQYQSHTK 68 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~ 68 (271) +++++.++++.||+... ..+..........|..-...-...+++.++.|+. T Consensus 5 i~~i~~~~~~~sC~~~e~RkPi~~~s~~f~~~S~~rNK~l~a~Ee~~I~~~i~~ds~ 61 (177) T TIGR03516 5 IAVILLLLLLLGCKTPEARRPISRSSGTTIKLSAERNKKLIAAEEAAIKRIISADSI 61 (177) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 478989899872699646688665875149999999899999999999999986985 No 313 >PRK04405 prsA peptidylprolyl isomerase; Provisional Probab=61.19 E-value=9.3 Score=16.18 Aligned_cols=25 Identities=24% Similarity=0.200 Sum_probs=15.4 Q ss_pred HHHHHHHH-HHHHHHHHHHHHCCCCC Q ss_conf 99899999-99999999854036854 Q gi|254780592|r 11 FLKKIKCI-VFLIPLLLFSSCHLNPR 35 (271) Q Consensus 11 f~~~~~~~-~~l~~~~~~~~~~~~~~ 35 (271) ++|++.+. +.+++++.++||++++. T Consensus 4 ~MKK~~l~~~~~~~~l~LaaCss~~~ 29 (298) T PRK04405 4 KMKKWALAAASAGLLLSLAGCSSNNK 29 (298) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 78999999999999999987179997 No 314 >PRK10449 heat-inducible protein; Provisional Probab=60.98 E-value=7.1 Score=16.87 Aligned_cols=24 Identities=29% Similarity=0.567 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 98999999999999985403685433 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~~~ 37 (271) +|++ +.+++..+.++||++++... T Consensus 1 MKk~--l~l~~~~llLagC~s~~~~~ 24 (140) T PRK10449 1 MKKV--VALVALSLLMAGCVSSGKIS 24 (140) T ss_pred CCHH--HHHHHHHHHHHHCCCCCCCC T ss_conf 9034--99999999997523899887 No 315 >PRK12450 foldase protein PrsA; Reviewed Probab=60.28 E-value=9.6 Score=16.08 Aligned_cols=22 Identities=27% Similarity=0.409 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9899999999999998540368 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~ 33 (271) -|-+..++.+.+++.|++|++. T Consensus 5 KK~~~~~~~~~svl~LaAC~s~ 26 (309) T PRK12450 5 NKLITGVVTLATVVTLSACQSS 26 (309) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999999860589 No 316 >PRK10783 mltD membrane-bound lytic murein transglycosylase D; Provisional Probab=60.15 E-value=1.7 Score=20.70 Aligned_cols=23 Identities=17% Similarity=0.313 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 99999999999854036854331 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNPRISI 38 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~~~~~ 38 (271) ++..+++++++++||++.+.... T Consensus 2 k~~~~l~~~~ll~gcq~~~~~~~ 24 (449) T PRK10783 2 KAKAILLASVLLVGCQSSKNDAQ 24 (449) T ss_pred CHHHHHHHHHHHHHCCCCCCCCC T ss_conf 18999999999972557888888 No 317 >pfam07273 DUF1439 Protein of unknown function (DUF1439). This family consists of several hypothetical bacterial proteins of around 190 residues in length. Several members of this family are annotated as being putative lipoproteins and are often known as YceB. The function of this family is unknown. Probab=59.68 E-value=9.9 Score=16.01 Aligned_cols=29 Identities=28% Similarity=0.448 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHH Q ss_conf 99999998540368543311799999984 Q gi|254780592|r 20 FLIPLLLFSSCHLNPRISIPDPESLENMN 48 (271) Q Consensus 20 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~ 48 (271) ++++++.++||.+....+....+..++++ T Consensus 8 ~~~l~~~LsgC~~ls~ysISE~Ein~yL~ 36 (177) T pfam07273 8 ILLLVLLLSGCASLSQYSISEAEINQYLQ 36 (177) T ss_pred HHHHHHHHHCCCCCCCEEECHHHHHHHHH T ss_conf 99999997155756625266999999998 No 318 >pfam07721 TPR_4 Tetratricopeptide repeat. This Pfam entry includes tetratricopeptide-like repeats not detected by the pfam00515, pfam07719 and pfam07720 models. Probab=59.63 E-value=8.7 Score=16.35 Aligned_cols=21 Identities=24% Similarity=0.181 Sum_probs=10.3 Q ss_pred HHHHHHHHHHCCCHHHHHHHH Q ss_conf 889999999869989999999 Q gi|254780592|r 209 RQNLALVVGLQGRMKEAYSIA 229 (271) Q Consensus 209 ~~~La~~~~~~g~~~eA~~~~ 229 (271) ...+|.++...|++++|...+ T Consensus 4 ~~~la~~~~~~G~~~~A~~~l 24 (26) T pfam07721 4 LLALARALLALGDLDEARALL 24 (26) T ss_pred HHHHHHHHHHHCCHHHHHHHH T ss_conf 999999999806778899886 No 319 >cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear. Probab=59.24 E-value=9.7 Score=16.07 Aligned_cols=14 Identities=29% Similarity=0.356 Sum_probs=4.6 Q ss_pred HHCCHHHHHHHHHH Q ss_conf 41001244666556 Q gi|254780592|r 83 VGRTAQALAVMRQV 96 (271) Q Consensus 83 ~g~~~~A~~~~~~a 96 (271) .|++.+|+.+|..+ T Consensus 19 ~~~y~eAl~~Y~~g 32 (77) T cd02683 19 EGRFQEALVCYQEG 32 (77) T ss_pred CCCHHHHHHHHHHH T ss_conf 15699999999999 No 320 >pfam10516 SHNi-TPR SHNi-TPR. SHNi-TPR family members contain a reiterated sequence motif that is an interrupted form of TPR repeat. Probab=58.93 E-value=6.8 Score=17.01 Aligned_cols=25 Identities=12% Similarity=-0.032 Sum_probs=9.1 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 8899999998699899999999862 Q gi|254780592|r 209 RQNLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 209 ~~~La~~~~~~g~~~eA~~~~~~~L 233 (271) +..||.+-....+|++|..-++++| T Consensus 4 ~~~LgEislE~e~F~qA~~D~~~~L 28 (38) T pfam10516 4 YDLLGEISLENENFPQAVEDLRKAL 28 (38) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 8877786540156077899999999 No 321 >PRK12700 flgH flagellar basal body L-ring protein; Reviewed Probab=58.89 E-value=10 Score=15.97 Aligned_cols=18 Identities=28% Similarity=0.355 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHCCCCC Q ss_conf 999999999854036854 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPR 35 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~ 35 (271) .+++.++++++||++.|+ T Consensus 10 ~~~~~~~~ll~GCa~~~~ 27 (230) T PRK12700 10 PVCAALLALAAGCAMIPP 27 (230) T ss_pred HHHHHHHHHHHCCCCCCC T ss_conf 999999999744468999 No 322 >pfam02402 Lysis_col Lysis protein. These small bacterial proteins are required for colicin release and partial cell lysis. This family contains lysis proteins for several different forms of colicin. Bacillus subtilis lytA has been included in this family, the similarity is not highly significant, however it is also a short protein, that is involved in secretion of other proteins (Bateman A pers. obs.). Probab=58.67 E-value=2.9 Score=19.26 Aligned_cols=22 Identities=41% Similarity=0.541 Sum_probs=15.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999985403685 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~ 34 (271) |++..+++++..+++++|+.+. T Consensus 2 kki~~~~i~~~~~~L~aCQaNy 23 (46) T pfam02402 2 KKILFIGILLLTVLLSACQANY 23 (46) T ss_pred CEEEEHHHHHHHHHHHHHHHHH T ss_conf 2420139999999999855504 No 323 >pfam04212 MIT MIT (microtubule interacting and transport) domain. The MIT domain forms an asymmetric three-helix bundle and binds ESCRT-III (endosomal sorting complexes required for transport) substrates. Probab=58.65 E-value=10 Score=15.91 Aligned_cols=11 Identities=27% Similarity=0.450 Sum_probs=3.5 Q ss_pred CCHHHHHHHHH Q ss_conf 99899999999 Q gi|254780592|r 220 GRMKEAYSIAS 230 (271) Q Consensus 220 g~~~eA~~~~~ 230 (271) |++++|..+|. T Consensus 19 g~y~~A~~~Y~ 29 (69) T pfam04212 19 GNYEEALELYK 29 (69) T ss_pred CCHHHHHHHHH T ss_conf 99999999999 No 324 >PRK09859 multidrug efflux system protein MdtE; Provisional Probab=58.59 E-value=6.6 Score=17.08 Aligned_cols=21 Identities=14% Similarity=0.260 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 899999999999998540368 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~ 33 (271) |+...+.+++++++++||+.. T Consensus 4 k~~~li~ll~~~~lL~gC~~~ 24 (385) T PRK09859 4 RRKLLIPLLFCGAMLTACDDK 24 (385) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 067899999999999537999 No 325 >PRK03002 prsA peptidylprolyl isomerase; Reviewed Probab=58.57 E-value=8.4 Score=16.45 Aligned_cols=22 Identities=36% Similarity=0.505 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9899999999999998540368 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~ 33 (271) -|.|.+++.+++++.++||+.. T Consensus 3 kK~i~~~~~~~svl~LaaC~~~ 24 (285) T PRK03002 3 GKHIFIITALISILMLSACGQK 24 (285) T ss_pred HHHHHHHHHHHHHHHHHHHCCC T ss_conf 0189999999999999984579 No 326 >PRK03095 prsA peptidylprolyl isomerase; Reviewed Probab=58.45 E-value=7.9 Score=16.61 Aligned_cols=22 Identities=18% Similarity=0.286 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999985403685 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~ 34 (271) |+..+.+..++++.+++|+++. T Consensus 2 KK~~la~~~~svl~LaaC~~~~ 23 (287) T PRK03095 2 KKAMLALAATSVIALSACGTSS 23 (287) T ss_pred CHHHHHHHHHHHHHHHHCCCCC T ss_conf 0789999999999998516899 No 327 >COG3065 Slp Starvation-inducible outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=58.36 E-value=10 Score=15.98 Aligned_cols=24 Identities=21% Similarity=0.245 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 899999999999998540368543 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) ++.+..++...++.++||.+.|.. T Consensus 6 ~~~~~~l~~~laflLsgC~tiPk~ 29 (191) T COG3065 6 NMKKGALIGTLAFLLSGCVTIPKA 29 (191) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 647999999999997630357831 No 328 >PRK10510 putative outer membrane lipoprotein; Provisional Probab=58.24 E-value=7.7 Score=16.69 Aligned_cols=25 Identities=20% Similarity=0.260 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 8999999999999985403685433 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~~ 37 (271) +.+.+..++..++.++||.++|.+. T Consensus 4 r~~lia~~~a~~l~lsgC~t~p~tg 28 (219) T PRK10510 4 RVYLIAAIVSGALAVSGCTTNPYTG 28 (219) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 6899999999998741356789876 No 329 >TIGR02722 lp_ uncharacterized lipoprotein; InterPro: IPR014094 Members of this protein family are restricted to the Proteobacteria, and all are predicted lipoproteins. In genomes that contain the thiK gene for the salvage enzyme thiamin kinase, the member of this family is encoded nearby.. Probab=58.14 E-value=8.6 Score=16.39 Aligned_cols=18 Identities=28% Similarity=0.360 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 999999999998540368 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~ 33 (271) .++++++++++|+||++. T Consensus 5 ~l~~~~~~Al~L~GC~~~ 22 (215) T TIGR02722 5 ILIFVALLALLLSGCVSQ 22 (215) T ss_pred HHHHHHHHHHHHHCCCCC T ss_conf 789999999998546887 No 330 >TIGR03352 VI_chp_3 type VI secretion lipoprotein, VC_A0113 family. Work by Mougous, et al. (2006), describes IAHP-related loci as a type VI secretion system (PMID:16763151). This protein family is associated with type VI secretion loci, although not treated explicitly by Mougous, et al. Probab=57.98 E-value=7.3 Score=16.82 Aligned_cols=20 Identities=30% Similarity=0.521 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999985403685433 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~ 37 (271) .++++..++++||++.++.. T Consensus 4 ~~~l~~~lllagC~s~~~~~ 23 (146) T TIGR03352 4 AVLLAACLLLAGCSSAPPPK 23 (146) T ss_pred EHHHHHHHHHHHCCCCCCCC T ss_conf 55899999985325999988 No 331 >KOG2581 consensus Probab=56.43 E-value=11 Score=15.68 Aligned_cols=61 Identities=8% Similarity=-0.150 Sum_probs=25.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC--CC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999742111110000000000--00--00012457776543322355435999999953898 Q gi|254780592|r 144 KGSVLAQMGKHSEALIEYERALEL--SP--NESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGA 204 (271) Q Consensus 144 lg~~~~~~~~~~~A~~~~~~~~~~--~p--~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~ 204 (271) +-.+|...+.++.|.....+..-- .. ......+.+|.+..-+++|..|.++|-+|+...|+ T Consensus 215 LLr~yL~n~lydqa~~lvsK~~~pe~~snne~ARY~yY~GrIkaiqldYssA~~~~~qa~rkapq 279 (493) T KOG2581 215 LLRNYLHNKLYDQADKLVSKSVYPEAASNNEWARYLYYLGRIKAIQLDYSSALEYFLQALRKAPQ 279 (493) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCC T ss_conf 99998616788789877650458654532799999999876888740177899999999984852 No 332 >PRK12697 flgH flagellar basal body L-ring protein; Reviewed Probab=56.05 E-value=7.7 Score=16.66 Aligned_cols=24 Identities=17% Similarity=0.391 Sum_probs=19.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 989999999999999854036854 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPR 35 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~ 35 (271) +++....+|.+.++.++||++.+. T Consensus 6 ~~~~~~~~~al~~l~LsGCa~~~~ 29 (227) T PRK12697 6 VRRPGAAACALAALALAGCAQIPR 29 (227) T ss_pred HCCCCHHHHHHHHHHHHCCCCCCC T ss_conf 414038999999999726567899 No 333 >pfam05590 DUF769 Xylella fastidiosa protein of unknown function (DUF769). This family consists of several uncharacterized hypothetical proteins of unknown function from Xylella fastidiosa, the organism that causes Pierce's disease in plants. Probab=55.54 E-value=4.3 Score=18.23 Aligned_cols=23 Identities=26% Similarity=0.424 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999999985403685433 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~~~~ 37 (271) -.+.-+|...+.++||++.|++. T Consensus 6 rL~~SllaAslLLagCSsgPpID 28 (279) T pfam05590 6 RLSCSLLAASLLLAGCSSGPPID 28 (279) T ss_pred CHHHHHHHHHHHHHCCCCCCCCC T ss_conf 40169999999972378899866 No 334 >PRK11023 hypothetical protein; Provisional Probab=55.47 E-value=6.6 Score=17.09 Aligned_cols=21 Identities=29% Similarity=0.548 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCC Q ss_conf 989999999999999854036 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHL 32 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~ 32 (271) ||.++.+.++++++.++||.. T Consensus 1 mk~~~~l~~l~~~~~L~GC~~ 21 (191) T PRK11023 1 MKALSPLAVLISALLLQGCVA 21 (191) T ss_pred CCHHHHHHHHHHHHHHCCCHH T ss_conf 944889999999999726823 No 335 >PRK13883 conjugal transfer protein TrbH; Provisional Probab=55.43 E-value=11 Score=15.62 Aligned_cols=24 Identities=17% Similarity=0.364 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 89999999999999854036854331 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRISI 38 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~~~ 38 (271) +|| +++.+.++.++||.++.+... T Consensus 2 rK~--~~~all~laL~gCAtt~~YGN 25 (156) T PRK13883 2 RKI--LSLALLALALGGCATTSQYGN 25 (156) T ss_pred HHH--HHHHHHHHHHHCCCCCCCCCC T ss_conf 268--999999999703313578877 No 336 >PRK12701 flgH flagellar basal body L-ring protein; Reviewed Probab=54.79 E-value=9.7 Score=16.07 Aligned_cols=21 Identities=14% Similarity=0.229 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999985403685 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~ 34 (271) .+...++|+.+++++||++.. T Consensus 3 r~~~~~~~la~~lL~GC~~~~ 23 (230) T PRK12701 3 RLNIAVSCLATALLFGCEALH 23 (230) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 589999999999975566889 No 337 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=54.52 E-value=12 Score=15.49 Aligned_cols=29 Identities=21% Similarity=0.012 Sum_probs=12.6 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCC Q ss_conf 99999874100124466655663275210 Q gi|254780592|r 76 YADVLRRVGRTAQALAVMRQVAILYPEDQ 104 (271) Q Consensus 76 ~a~~l~~~g~~~~A~~~~~~al~~~p~~~ 104 (271) .|.+-...|++++|...+.++.+..+.+. T Consensus 91 ~Gl~a~~~Gd~~~A~k~~~ka~k~~~~p~ 119 (134) T pfam07219 91 EGLLALAEGDWALAERLARKAAELEDQPP 119 (134) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCH T ss_conf 99999978999999999999876379935 No 338 >pfam07901 DUF1672 Protein of unknown function (DUF1672). This family is composed of hypothetical bacterial proteins of unknown function. Probab=54.36 E-value=7.7 Score=16.69 Aligned_cols=22 Identities=32% Similarity=0.403 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHHCCCCCCC Q ss_conf 9999999999985403685433 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~~~~ 37 (271) +.+.+++++++++||+..+... T Consensus 4 ~i~~il~~~llL~GCs~mn~~~ 25 (304) T pfam07901 4 LIKLILIATLLLSGCSTTNNES 25 (304) T ss_pred HHHHHHHHHHHHCCCCCCCCCC T ss_conf 8999999999971334455542 No 339 >TIGR00752 slp outer membrane lipoprotein, Slp family; InterPro: IPR004658 Slp superfamily members are present in the Gram-negative gamma proteobacteria Escherichia coli (which also contains a close paralog), Haemophilus influenzae and Pasteurella multocida and Vibrio cholerae. The known members of the family to date share a motif LX[GA]C near the N-terminus, which is compatible with the possibility that the protein is modified into a lipoprotein with Cys as the new N-terminus. Slp from E. coli is known to be a lipoprotein of the outer membrane and to be expressed in response to carbon starvation.; GO: 0019867 outer membrane. Probab=53.80 E-value=7.4 Score=16.77 Aligned_cols=25 Identities=16% Similarity=0.252 Sum_probs=18.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCC Q ss_conf 9999999999999854036854331 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNPRISI 38 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~~~~~ 38 (271) .++.+++...++.|+||.+.|..-. T Consensus 2 ~~~g~l~~~l~f~L~gC~~~P~~ik 26 (190) T TIGR00752 2 VKKGLLITALAFLLTGCIAVPKAIK 26 (190) T ss_pred CCHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 2014568889988753146742345 No 340 >KOG1497 consensus Probab=53.16 E-value=13 Score=15.35 Aligned_cols=57 Identities=16% Similarity=0.033 Sum_probs=23.1 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCC--------CCHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 999999998741001244666556632752--------1001355553210000011258876422 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPE--------DQEVLAAYGKSLANAGYLDEGLDAINRA 130 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~--------~~~~~~~~~~~~~~~g~~~~A~~~~~~a 130 (271) ...+|.+|...+++..|-..+.- +.++.. .......+++.|...++..+|.....++ T Consensus 106 rl~LAsiYE~Eq~~~~aaq~L~~-I~~~tg~~~~d~~~kl~l~iriarlyLe~~d~veae~~inRa 170 (399) T KOG1497 106 RLHLASIYEKEQNWRDAAQVLVG-IPLDTGQKAYDVEQKLLLCIRIARLYLEDDDKVEAEAYINRA 170 (399) T ss_pred HHHHHHHHHHHHHHHHHHHHHHC-CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999999876469999998854-276654344125788999999999987158477899998887 No 341 >PRK01326 prsA foldase protein PrsA; Reviewed Probab=52.80 E-value=13 Score=15.32 Aligned_cols=22 Identities=23% Similarity=0.344 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999985403685 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~ 34 (271) |-+..++.+.+++.|+||++.. T Consensus 4 Kl~~~~~~~~svl~LaaCs~~~ 25 (310) T PRK01326 4 KLIAGAVTLLSVATLAACSKTN 25 (310) T ss_pred HHHHHHHHHHHHHHHHHHCCCC T ss_conf 8999999999999998716999 No 342 >PRK01622 OxaA-like protein precursor; Validated Probab=51.52 E-value=13 Score=15.19 Aligned_cols=24 Identities=17% Similarity=0.355 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 899999999999998540368543 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) |.+.+..++..+++++||+.+++. T Consensus 6 ~~~l~~~~~~~~~~lsgc~~~~~~ 29 (266) T PRK01622 6 RAVLISLSLLLVFVLSGCSNAAPI 29 (266) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 999999999999999546899998 No 343 >PRK00249 flgH flagellar basal body L-ring protein; Reviewed Probab=51.07 E-value=11 Score=15.65 Aligned_cols=22 Identities=32% Similarity=0.448 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999998540368543 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~~~ 36 (271) +...++++.+++++||++.++. T Consensus 4 ~~~~~~~~~~~~LsgCa~~~~~ 25 (231) T PRK00249 4 MLIALALLLLLLLSGCASIPPK 25 (231) T ss_pred HHHHHHHHHHHHHHHCCCCCCC T ss_conf 8999999999998753489998 No 344 >TIGR02269 TIGR02269 Myxococcus xanthus paralogous lipoprotein family TIGR02269; InterPro: IPR011755 This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown.. Probab=49.98 E-value=11 Score=15.64 Aligned_cols=24 Identities=25% Similarity=0.436 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHH Q ss_conf 999999999854036854331179 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRISIPDP 41 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~~~~~ 41 (271) +++.+++++|.||+++.+.+.+.. T Consensus 2 l~~~lla~ll~GCa~~~~~~dP~~ 25 (228) T TIGR02269 2 LLLLLLALLLVGCASTAPKSDPPA 25 (228) T ss_pred HHHHHHHHHHHHHCCCCCCCCCHH T ss_conf 579988999875205787787402 No 345 >pfam07079 DUF1347 Protein of unknown function (DUF1347). This family consists of several hypothetical bacterial proteins of around 610 residues in length. Members of this family are highly conserved and seem to be specific to Chlamydia species. The function of this family is unknown. Probab=49.25 E-value=15 Score=14.97 Aligned_cols=70 Identities=19% Similarity=0.077 Sum_probs=43.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 765433223554359999999538985478889999999869989999999986291728665999999998 Q gi|254780592|r 179 AMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILS 250 (271) Q Consensus 179 ~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~ 250 (271) +..+..+|+|+.-.-+-- -+..-..++.++..+|+++...+++.||++++.. |.+....-+...-++++- T Consensus 469 AeyLfa~gdY~kCy~YS~-WLtkvaPS~~~yrLlGLcL~enK~Y~EAwe~l~~-Lp~n~~~~dskvqKAl~l 538 (550) T pfam07079 469 AEYLFAQGDYHKCYLYSL-WLTKVAPSPLAYRLLGLCLLENKEYLEAWEYLHS-LPLNERTYDSKVQKALLL 538 (550) T ss_pred HHHHHHCCCCCHHHHHHH-HHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCCHHHHHHHHHHHH T ss_conf 899975377004365698-8876189788999998888401658999999974-899864116899999999 No 346 >TIGR02887 spore_ger_x_C germination protein, Ger(x)C family; InterPro: IPR008844 The GerAA, -AB, and -AC proteins of the Bacillus subtilis spore are required for the germination response to L-alanine as the sole germinant. Members of GerAC family are thought to be located in the inner spore membrane. Although the function of this family is unclear, they are likely to encode the components of the germination apparatus that respond directly to this germinant, mediating the spore's response .; GO: 0009847 spore germination, 0016020 membrane. Probab=48.75 E-value=11 Score=15.81 Aligned_cols=21 Identities=29% Similarity=0.684 Sum_probs=16.3 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999985403685 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~ 34 (271) ..+++++++++++|+||+..- T Consensus 2 ~~~~~~~~~~~~lL~GCwd~~ 22 (400) T TIGR02887 2 KLKILLLILALLLLTGCWDKR 22 (400) T ss_pred HHHHHHHHHHHHHHCCCCCCH T ss_conf 466789999999832665620 No 347 >COG4941 Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription] Probab=47.72 E-value=15 Score=14.83 Aligned_cols=132 Identities=17% Similarity=0.192 Sum_probs=71.6 Q ss_pred CCHHHHHHHHHHHHHHHC-HHHHHHHHHHHHHHHHH---HHH-HHHHHHHHCC-CCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 001125887642211000-02335669999999742---111-1100000000-00000001245777654332235543 Q gi|254780592|r 118 GYLDEGLDAINRAQRPDI-PDWQLISAKGSVLAQMG---KHS-EALIEYERAL-ELSPNESSIVSNIAMSYLLMGDLKTA 191 (271) Q Consensus 118 g~~~~A~~~~~~a~~~~p-~~~~~~~~lg~~~~~~~---~~~-~A~~~~~~~~-~~~p~~~~~~~~~~~~~~~~g~~~~A 191 (271) +-.+++...+.++..... .-......++.++..-. +.+ ..+..+-.++ ...|+ +.+-.|.+.+.-..--...+ T Consensus 270 ~lI~eg~all~rA~~~~~pGPYqlqAAIaa~HA~a~~aedtDW~~I~aLYdaL~~~apS-PvV~LNRAVAla~~~Gp~ag 348 (415) T COG4941 270 ALIDEGLALLDRALASRRPGPYQLQAAIAALHARARRAEDTDWPAIDALYDALEQAAPS-PVVTLNRAVALAMREGPAAG 348 (415) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCC-CEEEEHHHHHHHHHHHHHHH T ss_conf 88787899999999748998189999999998753335799859999999999984899-85751388999976557868 Q ss_pred HHHHHHHHHC--CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHHHHH Q ss_conf 5999999953--8985478889999999869989999999986291728665999999998 Q gi|254780592|r 192 EEKLRFASQM--IGADSRIRQNLALVVGLQGRMKEAYSIASQELSPEEATRNIKYIKSILS 250 (271) Q Consensus 192 ~~~~~~al~~--~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~~~~~~~~~l~~~l~ 250 (271) +...+..... -.++.-.+-.-|-.+..+|+.+||...|++++..........+++.-+. T Consensus 349 La~ve~L~~~~~L~gy~~~h~~RadlL~rLgr~~eAr~aydrAi~La~~~aer~~l~~r~~ 409 (415) T COG4941 349 LAMVEALLARPRLDGYHLYHAARADLLARLGRVEEARAAYDRAIALARNAAERAFLRQRLD 409 (415) T ss_pred HHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 9999875124355434220877789999818707789999999986688699999999998 No 348 >TIGR02747 TraV type IV conjugative transfer system protein TraV; InterPro: IPR014118 This entry includes TraV, which is a component of conjugative type IV secretion system. TraV is an outer membrane lipoprotein that is believed to interact with the secretin TraK , , . The alignment contains three conserved cysteines in the N-terminal half.. Probab=47.43 E-value=9.8 Score=16.02 Aligned_cols=18 Identities=22% Similarity=0.602 Sum_probs=10.3 Q ss_pred HHHHHHH-HHHHHHHHHCC Q ss_conf 9999999-99999854036 Q gi|254780592|r 15 IKCIVFL-IPLLLFSSCHL 32 (271) Q Consensus 15 ~~~~~~l-~~~~~~~~~~~ 32 (271) +.+++.+ ++++.|+|||+ T Consensus 4 ~~~L~~~~~~~f~LtGCsa 22 (174) T TIGR02747 4 LKVLLLLACVAFLLTGCSA 22 (174) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 7899999999872002577 No 349 >TIGR02508 type_III_yscG type III secretion protein, YscG family; InterPro: IPR013348 YscG is a molecular chaperone for YscE, where both are part of the type III secretion system that in Yersinia is designated Ysc (Yersinia secretion). The secretion system delivers effector proteins, designated Yops (Yersinia outer proteins), in Yersinia. This entry consists of YscG from Yersinia, and functionally equivalent type III secretion proteins in other species: e.g. AscG in Aeromonas and LscG in Photorhabdus luminescens.; GO: 0009405 pathogenesis. Probab=47.38 E-value=16 Score=14.79 Aligned_cols=94 Identities=18% Similarity=0.015 Sum_probs=51.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC Q ss_conf 66999999974211111000000000000000124577765433223554359999999538985478889999999869 Q gi|254780592|r 141 ISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQG 220 (271) Q Consensus 141 ~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g 220 (271) +..++.+-.....+++|--.-............+.......+.++|+|.+|....++.-.. .++.---+|++-.+.| T Consensus 9 LAeiAL~gtG~HcH~EAn~IAdWL~~~~e~~E~v~LIrlsSLmN~G~Y~~Al~lg~~~~ta---yPdLepwlALce~rlG 85 (118) T TIGR02508 9 LAEIALVGTGHHCHQEANTIADWLEETSESEEAVVLIRLSSLMNRGDYQEALQLGEELCTA---YPDLEPWLALCEWRLG 85 (118) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCC---CCCHHHHHHHHHHHHH T ss_conf 9999998459983678989999745899816999999986312755799999724336887---7787789999999987 Q ss_pred CHH-HHHHHHHHHCCCCC Q ss_conf 989-99999998629172 Q gi|254780592|r 221 RMK-EAYSIASQELSPEE 237 (271) Q Consensus 221 ~~~-eA~~~~~~~L~~~~ 237 (271) -.+ .+.+..+=+-+.+| T Consensus 86 l~~Al~~Rl~rLa~s~~p 103 (118) T TIGR02508 86 LLSALEERLLRLAASGDP 103 (118) T ss_pred HHHHHHHHHHHHHCCCCH T ss_conf 999999999987427985 No 350 >PRK10760 murein hydrolase B; Provisional Probab=47.10 E-value=16 Score=14.77 Aligned_cols=20 Identities=20% Similarity=0.590 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999985403685433 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~ 37 (271) +.++...+++++||+.|... T Consensus 5 ~~~~~~~~~l~~css~p~~~ 24 (357) T PRK10760 5 VTLLPLFVLLAACSSKPKPT 24 (357) T ss_pred HHHHHHHHHHHHHCCCCCCC T ss_conf 86899999999851899999 No 351 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=46.95 E-value=12 Score=15.45 Aligned_cols=21 Identities=33% Similarity=0.526 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 98999999999999985403685 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~ 34 (271) ||++ ++++++++++++|.+.. T Consensus 1 Mkk~--~~~~~~~~~~~SCg~~~ 21 (449) T TIGR03525 1 MKKY--LVFAALVVLVYSCGSGD 21 (449) T ss_pred CCHH--HHHHHHHHHHHHCCCCC T ss_conf 9124--89999999872105899 No 352 >COG5633 Predicted periplasmic lipoprotein [General function prediction only] Probab=46.87 E-value=8.6 Score=16.37 Aligned_cols=21 Identities=29% Similarity=0.569 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHCCCCCCCCHH Q ss_conf 999999985403685433117 Q gi|254780592|r 20 FLIPLLLFSSCHLNPRISIPD 40 (271) Q Consensus 20 ~l~~~~~~~~~~~~~~~~~~~ 40 (271) +.++++.|.||++.+...... T Consensus 7 ~~l~~~lLvGCsS~~~i~~~~ 27 (123) T COG5633 7 LSLALLLLVGCSSHQEILVND 27 (123) T ss_pred HHHHHHHHHCCCCCCCCCCCC T ss_conf 999999942047887752255 No 353 >PRK02998 prsA peptidylprolyl isomerase; Reviewed Probab=46.80 E-value=16 Score=14.74 Aligned_cols=20 Identities=15% Similarity=0.227 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999985403685 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~ 34 (271) +..++++.+++.+++|+++. T Consensus 7 ~~~~~~~~~~l~LaaC~s~~ 26 (283) T PRK02998 7 FIGTIISCVVLALSACGSSD 26 (283) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 99999999999998668998 No 354 >pfam12583 TPPII_N Tripeptidyl peptidase II N terminal. This domain family is found in bacteria and eukaryotes, and is approximately 190 amino acids in length. The family is found in association with pfam00082. Tripeptidyl peptidase II (TPPII) is a crucial component of the proteolytic cascade acting downstream of the 26S proteasome in the ubiquitin-proteasome pathway. It is an amino peptidase belonging to the subtilase family removing tripeptides from the free N terminus of oligopeptides. Probab=46.59 E-value=16 Score=14.72 Aligned_cols=21 Identities=14% Similarity=0.186 Sum_probs=7.5 Q ss_pred HHHHHHHHHHHHCCCCCCHHH Q ss_conf 124466655663275210013 Q gi|254780592|r 87 AQALAVMRQVAILYPEDQEVL 107 (271) Q Consensus 87 ~~A~~~~~~al~~~p~~~~~~ 107 (271) +.|..+|+.++..+|++..++ T Consensus 87 E~AE~iY~ev~~~~P~hL~aH 107 (134) T pfam12583 87 ENAEKIYNEVVAAHPKHLQAH 107 (134) T ss_pred HHHHHHHHHHHHHCCHHHHHH T ss_conf 889999999998781159999 No 355 >pfam03304 Mlp Mlp lipoprotein family. The Mlp (for Multicopy Lipoprotein) family of lipoproteins is found in Borrelia species. This family were previously known as 2.9 lipoprotein genes. These surface expressed genes may represent new candidate vaccinogens for Lyme disease. Members of this family generally are downstream of four ORFs called A,B,C and D that are involved in hemolytic activity. Probab=46.42 E-value=9 Score=16.27 Aligned_cols=22 Identities=36% Similarity=0.606 Sum_probs=17.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999985403685 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~ 34 (271) ++|+-+++++.++++.||-++. T Consensus 1 MKIINilfclfll~Ln~Cnsnd 22 (150) T pfam03304 1 MKIINILFCLFLLMLNGCNSND 22 (150) T ss_pred CCEEHHHHHHHHHHHCCCCCCC T ss_conf 9431499999999982656786 No 356 >pfam00938 Lipoprotein_3 Lipoprotein. This family of lipoproteins is Mycoplasma specific. Probab=46.32 E-value=16 Score=14.69 Aligned_cols=56 Identities=16% Similarity=0.253 Sum_probs=27.6 Q ss_pred HHHHHHHHHHHHCCCCCCCC---HHHHHHHHHHH----CCHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 99999999854036854331---17999999842----0199999999999995757879999 Q gi|254780592|r 19 VFLIPLLLFSSCHLNPRISI---PDPESLENMNH----EQLLEVTSTIGLQYQSHTKNKMIGI 74 (271) Q Consensus 19 ~~l~~~~~~~~~~~~~~~~~---~~~~~~~~~~~----~~~~~A~~~~~~al~~~P~~~~~~~ 74 (271) ++++++.++++|+....... ...+....-+. ++=...+..+++.++.||+...-.+ T Consensus 2 lfll~g~~LSACS~a~~~~~~~~~~~~~~~~sk~~el~e~KknlIt~LKksYe~np~~Tt~~L 64 (101) T pfam00938 2 LFLLSGTLLSACSQADLRVLIKELTAQDYDVSKNIELSEGKKNLITSLKKSYESNPKKTTNLL 64 (101) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 599995999997288899999885554777864010330088999999999715908899999 No 357 >PRK12698 flgH flagellar basal body L-ring protein; Reviewed Probab=45.67 E-value=15 Score=14.90 Aligned_cols=19 Identities=26% Similarity=0.588 Sum_probs=12.0 Q ss_pred HHHHHHHHHHHHHCCCCCC Q ss_conf 9999999998540368543 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~ 36 (271) .++++++++++||++.+.. T Consensus 4 ~~l~~~~l~L~gCa~~~~~ 22 (224) T PRK12698 4 YILLALALLLAGCSSTPKK 22 (224) T ss_pred HHHHHHHHHHHCCCCCCCC T ss_conf 9999999998365688988 No 358 >COG2015 Alkyl sulfatase and related hydrolases [Secondary metabolites biosynthesis, transport, and catabolism] Probab=45.64 E-value=17 Score=14.63 Aligned_cols=51 Identities=24% Similarity=0.111 Sum_probs=42.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHH Q ss_conf 457776543322355435999999953898547888999999986998999 Q gi|254780592|r 175 VSNIAMSYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEA 225 (271) Q Consensus 175 ~~~~~~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA 225 (271) ...++.--..+|+|.-+.+.+.+++-.+|++..++...|-++.++|--.|. T Consensus 455 Vl~la~ea~~kGdyrW~a~lln~~VfAdp~n~~Ar~L~Ad~lEQLgYqaE~ 505 (655) T COG2015 455 VLELAREAFDKGDYRWAAELLNQAVFADPGNKAARELQADALEQLGYQAES 505 (655) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCC T ss_conf 999999887546601899997467716976188998877389885002100 No 359 >PRK09578 periplasmic multidrug efflux lipoprotein precursor; Reviewed Probab=45.53 E-value=17 Score=14.62 Aligned_cols=21 Identities=10% Similarity=0.333 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 899999999999998540368 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~ 33 (271) |...++++++++++++||+.. T Consensus 6 r~~~~l~~l~~~~~L~gC~~~ 26 (385) T PRK09578 6 RRRLALAALVAAFALAGCGKG 26 (385) T ss_pred HHHHHHHHHHHHHHHHCCCCC T ss_conf 799999999999998167999 No 360 >PRK09837 copper/silver efflux system outer membrane protein CusC; Provisional Probab=45.45 E-value=16 Score=14.78 Aligned_cols=20 Identities=20% Similarity=0.454 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999985403685433 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~ 37 (271) ++.+.++++++||+..|... T Consensus 6 ll~l~~~l~L~gC~~~P~y~ 25 (460) T PRK09837 6 LLPFCVALALTGCSLAPDYQ 25 (460) T ss_pred HHHHHHHHHHCCCCCCCCCC T ss_conf 99999999982477899999 No 361 >pfam06085 Rz1 Lipoprotein Rz1 precursor. This family consists of several bacteria and phage lipoprotein Rz1 precursors. Rz1 is a proline-rich lipoprotein from bacteriophage lambda which is known to have fusogenic properties. Rz1-induced liposome fusion is thought to be mediated primarily by the generation of local perturbation in the bilayer lipid membrane and to a lesser extent by electrostatic forces. Probab=44.69 E-value=13 Score=15.18 Aligned_cols=24 Identities=25% Similarity=0.502 Sum_probs=15.1 Q ss_pred HHHHHHHH-HHHHHHHHHCCCCCCC Q ss_conf 99999999-9999985403685433 Q gi|254780592|r 14 KIKCIVFL-IPLLLFSSCHLNPRIS 37 (271) Q Consensus 14 ~~~~~~~l-~~~~~~~~~~~~~~~~ 37 (271) +++..++. .+.++.++|.+.|... T Consensus 3 ~lkm~l~~~m~~L~vsaC~S~p~vq 27 (59) T pfam06085 3 KLKMKLCVMMLPLVVSACSSKPPVQ 27 (59) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 2025699999899999826999865 No 362 >PRK02463 OxaA-like protein precursor; Provisional Probab=44.12 E-value=18 Score=14.48 Aligned_cols=27 Identities=19% Similarity=0.199 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCH Q ss_conf 899999999999998540368543311 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRISIP 39 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~~~~ 39 (271) |-..+.+++..+++++||...+....+ T Consensus 7 ~~~~~~~~~~~~l~LsgC~~~~~~~~~ 33 (307) T PRK02463 7 RILFSGLALSMLLTLTGCVGRDKHGNP 33 (307) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 999999999999999634689999997 No 363 >COG5536 BET4 Protein prenyltransferase, alpha subunit [Posttranslational modification, protein turnover, chaperones] Probab=43.96 E-value=18 Score=14.47 Aligned_cols=128 Identities=11% Similarity=-0.022 Sum_probs=79.0 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH--HCCHHHHHHHHHHHHCCCCCCHHHHHHHHHH---HHHCCHH---HH Q ss_conf 99999999999995757879999999999874--1001244666556632752100135555321---0000011---25 Q gi|254780592|r 52 LLEVTSTIGLQYQSHTKNKMIGIVYADVLRRV--GRTAQALAVMRQVAILYPEDQEVLAAYGKSL---ANAGYLD---EG 123 (271) Q Consensus 52 ~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~--g~~~~A~~~~~~al~~~p~~~~~~~~~~~~~---~~~g~~~---~A 123 (271) ++.-.++.+.+++.+|++-..|...--++... ..+..-..+.++.+..+|.+-.+|.-...+. ...+++. +- T Consensus 90 ldneld~~~~~lk~~PK~YqiW~HR~~~Le~~p~~~~~rEl~itkklld~DsrNyH~W~YR~~vl~~ie~~~N~S~~k~e 169 (328) T COG5536 90 LDNELDFLDEALKDNPKNYQIWHHRQWMLELFPKPSWGRELFITKKLLDSDSRNYHVWSYRRWVLRTIEDLFNFSDLKHE 169 (328) T ss_pred HHCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHCCCCCCCCEEEEEEEEEECCHHHCCCHHHHHH T ss_conf 30578999999851884122268999999738886500269999998602665454044576533131431231068888 Q ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHH---H------HHHHHHHHHHHHCCCCCCHHHHHHHHHH Q ss_conf 887642211000023356699999997---4------2111110000000000000001245777 Q gi|254780592|r 124 LDAINRAQRPDIPDWQLISAKGSVLAQ---M------GKHSEALIEYERALELSPNESSIVSNIA 179 (271) Q Consensus 124 ~~~~~~a~~~~p~~~~~~~~lg~~~~~---~------~~~~~A~~~~~~~~~~~p~~~~~~~~~~ 179 (271) .+.-...+..++.+..+|..+-..... . ...++-++..-..+-..|++.+++...- T Consensus 170 ~eytt~~I~tdi~N~SaW~~r~~~~~~~~~~~~visqk~l~~eL~~i~~~if~~p~~~S~w~y~r 234 (328) T COG5536 170 LEYTTSLIETDIYNNSAWHHRYIWIERRFNRGDVISQKYLEKELEYIFDKIFTDPDNQSVWGYLR 234 (328) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 77478998607777388888999999998602621289999999999862304854331156777 No 364 >PRK12788 flgH flagellar basal body L-ring protein; Reviewed Probab=43.89 E-value=15 Score=14.97 Aligned_cols=19 Identities=21% Similarity=0.439 Sum_probs=14.3 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999985403685 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~ 34 (271) +.++.++.+++++||+..+ T Consensus 2 r~l~~~~a~l~LsGCa~~~ 20 (231) T PRK12788 2 RLLVAILACLALAGCANNL 20 (231) T ss_pred CHHHHHHHHHHHHHHCCCC T ss_conf 0899999999976403874 No 365 >COG4669 EscJ Type III secretory pathway, lipoprotein EscJ [Intracellular trafficking and secretion] Probab=42.08 E-value=19 Score=14.29 Aligned_cols=27 Identities=19% Similarity=0.561 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 999999999985403685433117999 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRISIPDPES 43 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~~~~~~~~ 43 (271) ++++++..+.|+||-..--...+..++ T Consensus 7 ~l~~l~l~l~L~gCk~~Ly~gL~e~eA 33 (246) T COG4669 7 YLFLLLLILLLTGCKVDLYTGLSEKEA 33 (246) T ss_pred HHHHHHHHHHHHCCHHHHHCCCCHHHH T ss_conf 999999999983356988707987678 No 366 >COG3417 FlgN Collagen-binding surface adhesin SpaP (antigen I/II family) [General function prediction only] Probab=41.75 E-value=19 Score=14.26 Aligned_cols=17 Identities=24% Similarity=0.446 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 99999999998540368 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLN 33 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~ 33 (271) ...+++++++|+||++. T Consensus 7 ~~~il~~al~l~GCs~~ 23 (200) T COG3417 7 YASILLLALFLSGCSSE 23 (200) T ss_pred HHHHHHHHHHHHHCCCC T ss_conf 99999999998411367 No 367 >KOG0567 consensus Probab=41.71 E-value=19 Score=14.25 Aligned_cols=183 Identities=11% Similarity=0.024 Sum_probs=112.4 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC---HHHHHHHHHHHHHCCH-- Q ss_conf 98420199999999999995757879999999999874100124466655663275210---0135555321000001-- Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVAILYPEDQ---EVLAAYGKSLANAGYL-- 120 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~---~~~~~~~~~~~~~g~~-- 120 (271) .+.+-+...|+..+...+......|.+....|+++-..| .....+++++.. .+|-.. .....+.++....+-. T Consensus 59 ~LgQ~~~~~Av~~l~~vl~desq~pmvRhEAaealga~~-~~~~~~~l~k~~-~dp~~~v~ETc~lAi~rle~~~~~~~~ 136 (289) T KOG0567 59 VLGQMQDEDAVPVLVEVLLDESQEPMVRHEAAEALGAIG-DPESLEILTKYI-KDPCKEVRETCELAIKRLEWKDIIDKI 136 (289) T ss_pred HHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHC-CHHHHHHHHHHH-CCCCCCCCHHHHHHHHHHHHHHCCCCC T ss_conf 334214411357999873334422488889999987514-612478999985-277641005899999999875200344 Q ss_pred ----------HHHH---HHH--HHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH Q ss_conf ----------1258---876--4221100002335669999999742111110000000000000001245777654332 Q gi|254780592|r 121 ----------DEGL---DAI--NRAQRPDIPDWQLISAKGSVLAQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLM 185 (271) Q Consensus 121 ----------~~A~---~~~--~~a~~~~p~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~ 185 (271) +.+- ..+ -+..-.+.+.+...-.++..+.+.-..++|+..+.+.+... +.-.....+..+ .| T Consensus 137 ~~~~p~~SvdPa~p~~~ssv~~lr~~lld~t~~l~~Ry~amF~LRn~g~EeaI~al~~~l~~~--SalfrhEvAfVf-GQ 213 (289) T KOG0567 137 ANSSPYISVDPAPPANLSSVHELRAELLDETKPLFERYRAMFYLRNIGTEEAINALIDGLADD--SALFRHEVAFVF-GQ 213 (289) T ss_pred CCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC--HHHHHHHHHHHH-HH T ss_conf 556842147999863113199999999842035789986425766258589999999751263--399899999998-50 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 2355435999999953898547888999999986998999999998629 Q gi|254780592|r 186 GDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 186 g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) =+...|+..+.+.+.-...++-++..-|.++...++ +++..++++.+. T Consensus 214 l~s~~ai~~L~k~L~d~~E~pMVRhEaAeALGaIa~-e~~~~vL~e~~~ 261 (289) T KOG0567 214 LQSPAAIPSLIKVLLDETEHPMVRHEAAEALGAIAD-EDCVEVLKEYLG 261 (289) T ss_pred CCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCC-HHHHHHHHHHCC T ss_conf 146023699999998620131799999999873167-899999999818 No 368 >COG4764 Uncharacterized protein conserved in bacteria [Function unknown] Probab=41.60 E-value=19 Score=14.24 Aligned_cols=78 Identities=14% Similarity=0.192 Sum_probs=40.0 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHC---CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 999999999854036854331179999998420---19999999999999575787999999999987410012446665 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRISIPDPESLENMNHE---QLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMR 94 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~ 94 (271) .++.+.++.++||.+.|.-..-.- +.-..+.| +..+|..-.++-. -.-+-..+|.+|...|-..-|..-=- T Consensus 8 ~~~~v~lL~LagCaTaP~~~~N~C-avF~qr~g~fnnW~~aA~~~er~~-----GvPVpVlMAtiy~ESgFk~~ArPPRt 81 (197) T COG4764 8 LVFAVVLLALAGCATAPSQVNNVC-AVFDQRDGWFNNWQRAALRTEREY-----GVPVPVLMATIYTESGFKHNARPPRT 81 (197) T ss_pred HHHHHHHHHHHHCCCCCCCCCCHH-HHHHHCCCHHHHHHHHHHHHHHHH-----CCCCHHHHHHHHHHHCCCCCCCCCCC T ss_conf 999999999703036876765256-655506741657999999988872-----99716789998876148777899751 Q ss_pred HHHHCCC Q ss_conf 5663275 Q gi|254780592|r 95 QVAILYP 101 (271) Q Consensus 95 ~al~~~p 101 (271) +.+-.-| T Consensus 82 klLgfIP 88 (197) T COG4764 82 KLLGFIP 88 (197) T ss_pred EEEEEEE T ss_conf 2676640 No 369 >pfam10938 YfdX YfdX protein. YfdX is a protein found in Proteobacteria of unknown function. The protein coding for this gene is regulated by EvgA in E. coli. Probab=41.58 E-value=19 Score=14.24 Aligned_cols=62 Identities=21% Similarity=0.220 Sum_probs=40.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-------CCC-CCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC Q ss_conf 1245777654332235543599999995-------389-8547888999999986998999999998629 Q gi|254780592|r 173 SIVSNIAMSYLLMGDLKTAEEKLRFASQ-------MIG-ADSRIRQNLALVVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 173 ~~~~~~~~~~~~~g~~~~A~~~~~~al~-------~~p-~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~ 234 (271) ......+...+..|+...|...++-+-. .-| .....-...|..+...|++.||...++.+++ T Consensus 76 ~~a~~~A~~~l~~Gd~~~A~~~L~~a~~ev~~~~~~lPL~~~~~av~~A~~lL~~gk~~eA~~aL~~A~~ 145 (155) T pfam10938 76 KAAIKKANELLKKGEKQAARETLKLAGSEVVITVALLPLAQTPAAVKQAASLLDEGKYYEAKAALKEALD 145 (155) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC T ss_conf 9999999999977898999999986366405777736715209999999999977997899999999861 No 370 >PRK02944 OxaA-like protein precursor; Validated Probab=41.31 E-value=19 Score=14.22 Aligned_cols=22 Identities=9% Similarity=0.092 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999985403685 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~ 34 (271) |......++..+++++||+..+ T Consensus 4 ~~~~~~~~~~~~~~lsgC~~~~ 25 (255) T PRK02944 4 KLGLLAMVVALMAILAGCSEVN 25 (255) T ss_pred HHHHHHHHHHHHHHHHCCCCCC T ss_conf 9999999999999996246889 No 371 >TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems. Probab=40.85 E-value=20 Score=14.17 Aligned_cols=23 Identities=17% Similarity=0.173 Sum_probs=11.4 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHC Q ss_conf 99999998699899999999862 Q gi|254780592|r 211 NLALVVGLQGRMKEAYSIASQEL 233 (271) Q Consensus 211 ~La~~~~~~g~~~eA~~~~~~~L 233 (271) .||.+|..+|+.+.|..+++.++ T Consensus 4 DLA~aYiemGD~egAr~lL~EVi 26 (44) T TIGR03504 4 DLARAYIEMGDLEGARELLEEVI 26 (44) T ss_pred HHHHHHHHCCCHHHHHHHHHHHH T ss_conf 89999998298898999999999 No 372 >PRK11548 hypothetical protein; Provisional Probab=40.76 E-value=20 Score=14.16 Aligned_cols=22 Identities=18% Similarity=0.264 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 98999999999999985403685 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~ 34 (271) +|.+ +...+++++.++||++.+ T Consensus 3 ~k~l-~~a~ll~~lllsgCS~~~ 24 (113) T PRK11548 3 CKTL-TAAAAVLLMLTAGCSTLE 24 (113) T ss_pred HHHH-HHHHHHHHHHHCCCCCCC T ss_conf 5788-999999999980115787 No 373 >PRK11679 lipoprotein; Provisional Probab=40.56 E-value=20 Score=14.14 Aligned_cols=21 Identities=14% Similarity=0.369 Sum_probs=14.7 Q ss_pred HHHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999999985403685 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~ 34 (271) ++.-++-+++++++++|++.+ T Consensus 9 ~~~~v~~~~lv~~L~ACss~~ 29 (345) T PRK11679 9 RLAKVAGVSLVLLLAACSSDQ 29 (345) T ss_pred HHHHHHHHHHHHHHHHCCCCH T ss_conf 377788899999998607975 No 374 >PRK12696 flgH flagellar basal body L-ring protein; Reviewed Probab=40.18 E-value=20 Score=14.11 Aligned_cols=20 Identities=20% Similarity=0.343 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999998540368543 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~ 36 (271) +++++.++++++||++.... T Consensus 5 ~l~~~~~~~~L~GC~~~~~~ 24 (238) T PRK12696 5 LLAASCAVLLLSGCNAARQQ 24 (238) T ss_pred HHHHHHHHHHHHCCCCCCCC T ss_conf 99999999995165688888 No 375 >pfam10368 YkyA Putative cell-wall binding lipoprotein. YkyA is a family of proteins containing a lipoprotein signal and a hydrolase domain. It is similar to cell wall binding proteins and might also be recognisable by a host immune defence system. It is thus likely to belong to pathways important for pathogenicity. Probab=40.16 E-value=20 Score=14.11 Aligned_cols=16 Identities=19% Similarity=0.636 Sum_probs=11.2 Q ss_pred HHHHHHHHHHCCCCCC Q ss_conf 9999998540368543 Q gi|254780592|r 21 LIPLLLFSSCHLNPRI 36 (271) Q Consensus 21 l~~~~~~~~~~~~~~~ 36 (271) ++++++|+||.+...+ T Consensus 2 ~~s~~lLaGC~~~~~p 17 (205) T pfam10368 2 ILSILLLTGCTTGKKP 17 (205) T ss_pred CHHHHHHHHHCCCCCH T ss_conf 1899999862699985 No 376 >pfam08139 LPAM_1 Prokaryotic membrane lipoprotein lipid attachment site. In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognizes a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached. Probab=39.69 E-value=21 Score=14.06 Aligned_cols=18 Identities=28% Similarity=0.700 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHHCCCCC Q ss_conf 999999999854036854 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPR 35 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~ 35 (271) +++++.++.++||++... T Consensus 5 ~~l~~~~l~LagCas~~~ 22 (26) T pfam08139 5 LLLLLALLLLAGCASXXX 22 (26) T ss_pred HHHHHHHHHHHCCCCCCC T ss_conf 999999999823320013 No 377 >TIGR02996 rpt_mate_G_obs repeat-companion domain TIGR02996; InterPro: IPR014338 This entry is found in Myxococcus xanthus DK 1622. Most member proteins have extensive repeats similar to the leucine-rich repeat, or another repeat class or a region of low-complexity sequence. This domain is not repeated, and is usually adjacent to the WGR domain.. Probab=39.38 E-value=19 Score=14.27 Aligned_cols=30 Identities=27% Similarity=0.334 Sum_probs=16.1 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHCCH Q ss_conf 999999957578799999999998741001 Q gi|254780592|r 58 TIGLQYQSHTKNKMIGIVYADVLRRVGRTA 87 (271) Q Consensus 58 ~~~~al~~~P~~~~~~~~~a~~l~~~g~~~ 87 (271) .|-++|-.+|.+-...+.||.=|...|+.. T Consensus 4 alL~AI~~~P~dDT~RLvYADWLdE~g~~~ 33 (48) T TIGR02996 4 ALLRAILAHPDDDTPRLVYADWLDEHGDPA 33 (48) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHCCCCCHH T ss_conf 779999739898864655401101578701 No 378 >pfam11153 DUF2931 Protein of unknown function (DUF2931). Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function. Probab=39.03 E-value=20 Score=14.11 Aligned_cols=16 Identities=38% Similarity=0.752 Sum_probs=11.8 Q ss_pred HHHHHHHHHHHHHHCC Q ss_conf 9999999999854036 Q gi|254780592|r 17 CIVFLIPLLLFSSCHL 32 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~ 32 (271) .++++++.+.+++|++ T Consensus 3 ~i~~ll~~lll~aCs~ 18 (202) T pfam11153 3 KILLLLLLLLLTACST 18 (202) T ss_pred HHHHHHHHHHHHHHCC T ss_conf 1489999999876025 No 379 >COG3017 LolB Outer membrane lipoprotein involved in outer membrane biogenesis [Cell envelope biogenesis, outer membrane] Probab=38.62 E-value=21 Score=13.96 Aligned_cols=24 Identities=21% Similarity=0.592 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999899999999999998540368 Q gi|254780592|r 10 LFLKKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 10 ~f~~~~~~~~~l~~~~~~~~~~~~ 33 (271) .+++.+...+..+.++++++|+.. T Consensus 2 ~~~~~~~~~l~~~As~LL~aC~~~ 25 (206) T COG3017 2 PMMKRLLFLLLALASLLLTACTLT 25 (206) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 527778789999999999756566 No 380 >PHA00407 phage lambda Rz1-like protein Probab=37.27 E-value=22 Score=13.83 Aligned_cols=24 Identities=21% Similarity=0.297 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 899999999999998540368543 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) |....-.+++++.+++||++.... T Consensus 32 kaaLIGlllicv~tISGCaSes~l 55 (84) T PHA00407 32 KAALIGLLLICVATISGCASESKL 55 (84) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 999999999999997666502569 No 381 >PRK12407 flgH flagellar basal body L-ring protein; Reviewed Probab=37.07 E-value=22 Score=13.86 Aligned_cols=17 Identities=12% Similarity=0.352 Sum_probs=11.5 Q ss_pred HHHHHHHHHHHHHCCCC Q ss_conf 99999999985403685 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNP 34 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~ 34 (271) +++.+++++++||++.+ T Consensus 4 l~l~~~~l~LsGC~~~~ 20 (220) T PRK12407 4 LILTPMVLALCGCESPA 20 (220) T ss_pred HHHHHHHHHHHCCCCCC T ss_conf 79999999973658888 No 382 >pfam06604 consensus Probab=36.62 E-value=20 Score=14.19 Aligned_cols=22 Identities=18% Similarity=0.393 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9899999999999998540368 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~ 33 (271) ..|+..++++.+++.++||+.. T Consensus 3 ~sk~~~~~~~~~v~~laGCq~~ 24 (181) T pfam06604 3 FSKAGLVAGLAAVLVLAGCQRS 24 (181) T ss_pred CCHHHHHHHHHHHHHHHCCCCC T ss_conf 2067789999999997344533 No 383 >PRK10796 LPS-assembly lipoprotein RlpB; Provisional Probab=36.22 E-value=23 Score=13.73 Aligned_cols=29 Identities=14% Similarity=0.356 Sum_probs=20.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCHHH Q ss_conf 89999999999999854036854331179 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRISIPDP 41 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~ 41 (271) +.++.+++.+++++++||...-+.....+ T Consensus 2 r~l~~l~l~lavll~agCGFhLRg~~~lP 30 (196) T PRK10796 2 RYLATLLLSLAVLVTAGCGWHLRGTTQVP 30 (196) T ss_pred HHHHHHHHHHHHHHHCCCCCEECCCCCCC T ss_conf 56999999999999747671576899998 No 384 >COG5461 Type IV pili component [Cell motility and secretion] Probab=35.95 E-value=24 Score=13.70 Aligned_cols=24 Identities=13% Similarity=0.362 Sum_probs=17.4 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999999985403685433 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~~~~ 37 (271) +++.++.......++||.+.++.. T Consensus 7 Pvk~VL~~a~~allagCAs~d~~~ 30 (224) T COG5461 7 PVKSVLLVAATALLAGCASRDPST 30 (224) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 528899999999863213689655 No 385 >pfam07980 SusD SusD family. This family includes several hypothetical proteins. It also contains RagB, a protein involved in signalling and SusD, an outer membrane protein involved in nutrient binding. Probab=35.61 E-value=24 Score=13.66 Aligned_cols=29 Identities=28% Similarity=0.474 Sum_probs=22.2 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 79999999999874100124466655663 Q gi|254780592|r 70 KMIGIVYADVLRRVGRTAQALAVMRQVAI 98 (271) Q Consensus 70 ~~~~~~~a~~l~~~g~~~~A~~~~~~al~ 98 (271) .++++.+|+++.+.|+..+|.+.++++.+ T Consensus 83 aev~L~~AEA~~~~g~~~~A~~~in~vR~ 111 (211) T pfam07980 83 AEVLLMYAEALNELGGTAEAIEYLNKVRE 111 (211) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99999999999764882899999999998 No 386 >PRK13616 lipoprotein LpqB; Provisional Probab=35.56 E-value=24 Score=13.66 Aligned_cols=25 Identities=28% Similarity=0.405 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHHHHHCCCCCCCCHH Q ss_conf 9999999999985403685433117 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNPRISIPD 40 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~~~~~~~ 40 (271) .+..++.++++++||++.|....+. T Consensus 8 ~~~~~~~~~~llaGCaslP~ss~pq 32 (590) T PRK13616 8 ALAALLAVALLVAGCASLPSSSAPQ 32 (590) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 9999999999861203689999972 No 387 >pfam08631 SPO22 Meiosis protein SPO22/ZIP4 like. SPO22/ZIP4 in yeast is a meiosis specific protein involved in sporulation. It has been shown to regulate crossover distribution by promoting synaptonemal complex formation. Probab=35.39 E-value=24 Score=13.64 Aligned_cols=123 Identities=12% Similarity=0.034 Sum_probs=53.9 Q ss_pred HHHCCHHHHHHHHHHHHHHC----CCCH----HHHHHHHHHHHHHH-CCHHHHHHHHHHHHCCC---CC----------- Q ss_conf 84201999999999999957----5787----99999999998741-00124466655663275---21----------- Q gi|254780592|r 47 MNHEQLLEVTSTIGLQYQSH----TKNK----MIGIVYADVLRRVG-RTAQALAVMRQVAILYP---ED----------- 103 (271) Q Consensus 47 ~~~~~~~~A~~~~~~al~~~----P~~~----~~~~~~a~~l~~~g-~~~~A~~~~~~al~~~p---~~----------- 103 (271) ..+|+++-|..++.|+=... |+.. ...++.|..+...+ +++.|...++++.+.-+ +. T Consensus 4 w~qgd~~~A~~~~~k~~~l~~~~~p~~~~~La~~~yn~G~~ll~~~~~~~~A~~wL~ra~d~l~~~~~~~~~~~~~~eLr 83 (280) T pfam08631 4 WRQGDLDLAEHLLSKAKSLKNSLDPDLTEELARVLYNIGKSLLERKTDFSEAVKWLQRALDLLEKGAKNDQKSTEGSELK 83 (280) T ss_pred HHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH T ss_conf 77558699999999878998668969999999999999999997589879999999999998626653133685488899 Q ss_pred CHHHHHHHHHHHHHCCHHH---HHHHHHHHHHHHCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHCCCCCC Q ss_conf 0013555532100000112---58876422110000233566999999974-211111000000000000 Q gi|254780592|r 104 QEVLAAYGKSLANAGYLDE---GLDAINRAQRPDIPDWQLISAKGSVLAQM-GKHSEALIEYERALELSP 169 (271) Q Consensus 104 ~~~~~~~~~~~~~~g~~~~---A~~~~~~a~~~~p~~~~~~~~lg~~~~~~-~~~~~A~~~~~~~~~~~p 169 (271) ..++..++.++...+..+. |...+.-.....|+.+..+...-.++.+. ++.+...+.+..++.... T Consensus 84 ~~iL~~La~a~l~~~~~es~~ka~~~l~~l~~e~~~~~~~~~L~l~il~~~~~~~~~~~~~L~~mi~~~~ 153 (280) T pfam08631 84 FSILRLLAKALLETGESESVLKALKALSLLESEFPNKPELYLLALEILKKRPGPEEEYEDVLMRMIKSVD 153 (280) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 9999999999844687468999999999998617887248999999997389964899999999998467 No 388 >KOG0292 consensus Probab=35.02 E-value=24 Score=13.61 Aligned_cols=47 Identities=11% Similarity=-0.001 Sum_probs=37.3 Q ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 9842019999999999999575787999999999987410012446665566 Q gi|254780592|r 46 NMNHEQLLEVTSTIGLQYQSHTKNKMIGIVYADVLRRVGRTAQALAVMRQVA 97 (271) Q Consensus 46 ~~~~~~~~~A~~~~~~al~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~al 97 (271) .++-|+++.|.+.-.+. ++++.|.++|+.-.++|+..-|.-.|++.- T Consensus 653 aLe~gnle~ale~akkl-----dd~d~w~rLge~Al~qgn~~IaEm~yQ~~k 699 (1202) T KOG0292 653 ALECGNLEVALEAAKKL-----DDKDVWERLGEEALRQGNHQIAEMCYQRTK 699 (1202) T ss_pred EHHHCCHHHHHHHHHHC-----CCHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 03408899999998733-----867899999999998345488999998763 No 389 >KOG0686 consensus Probab=34.97 E-value=24 Score=13.60 Aligned_cols=173 Identities=14% Similarity=0.040 Sum_probs=87.0 Q ss_pred HHHHHHHHHHHHCCCC------HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC---CCHHHHHHHHHHHHHCCHHHHH Q ss_conf 9999999999957578------799999999998741001244666556632752---1001355553210000011258 Q gi|254780592|r 54 EVTSTIGLQYQSHTKN------KMIGIVYADVLRRVGRTAQALAVMRQVAILYPE---DQEVLAAYGKSLANAGYLDEGL 124 (271) Q Consensus 54 ~A~~~~~~al~~~P~~------~~~~~~~a~~l~~~g~~~~A~~~~~~al~~~p~---~~~~~~~~~~~~~~~g~~~~A~ 124 (271) .-.+.++.-++...+| ..++..+|.-|..-|+.+.|+..|-++-.--.. ....+.++-.+-..+|+|.+.. T Consensus 128 ~~le~L~~eLk~yK~n~iKEsiRra~~Dl~dhy~~cG~l~~Alr~YsR~RdYCTs~khvInm~ln~i~VSI~~~nw~hv~ 207 (466) T KOG0686 128 LKLEKLDNELKSYKDNLIKESIRRALEDLGDHYLDCGQLDNALRCYSRARDYCTSAKHVINMCLNLILVSIYMGNWGHVL 207 (466) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 88999999999862012569999999999999998433888876301334543001899999999999997424314333 Q ss_pred HHHHHHHHHH----C----HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--------CCHHHHHHHHHHHH-HHHHHH Q ss_conf 8764221100----0----02335669999999742111110000000000--------00000124577765-433223 Q gi|254780592|r 125 DAINRAQRPD----I----PDWQLISAKGSVLAQMGKHSEALIEYERALEL--------SPNESSIVSNIAMS-YLLMGD 187 (271) Q Consensus 125 ~~~~~a~~~~----p----~~~~~~~~lg~~~~~~~~~~~A~~~~~~~~~~--------~p~~~~~~~~~~~~-~~~~g~ 187 (271) ....++...- . -.+.....-|......+.+..|...|..+..- .|.+..++..+..+ -+...+ T Consensus 208 sy~~~A~st~~~~~~~~q~v~~kl~C~agLa~L~lkkyk~aa~~fL~~~~~~~d~~~ivtpsdv~iYggLcALAtfdr~~ 287 (466) T KOG0686 208 SYISKAESTPDANENLAQEVPAKLKCAAGLANLLLKKYKSAAKYFLLAEFDHCDYPEIVTPSDVAIYGGLCALATFDRQD 287 (466) T ss_pred HHHHHHHHCCHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEECCHHHHHHHHHHHHCCCCHHH T ss_conf 67777774730245577744841679988999987778999999982787766752114600367887667650478999 Q ss_pred HHH---HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 554---35999999953898547888999999986998999999998 Q gi|254780592|r 188 LKT---AEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 188 ~~~---A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) +.. .-..|+.-++..|.-.++ ++..|. ++|.--.+.+++ T Consensus 288 Lk~~vi~n~~Fk~flel~Pqlr~i---l~~fy~--sky~~cl~~L~~ 329 (466) T KOG0686 288 LKLNVIKNESFKLFLELEPQLREI---LFKFYS--SKYASCLELLRE 329 (466) T ss_pred HHHHHHCCHHHHHHHHCCHHHHHH---HHHHHH--HHHHHHHHHHHH T ss_conf 999987005566677407699999---999862--568899999997 No 390 >TIGR00756 PPR pentatricopeptide repeat domain; InterPro: IPR002885 Pentatricopeptide repeat proteins are characterised by the presence of a tandem array of repeats, where the number of PPR motifs controls the affinity and specificity of the PPR protein for RNA. These proteins occur predominantly in plants, where they appear to play essential roles in RNA/DNA metabolism in mitochondria and chloroplasts . It has been suggested that each of the highly variable PPR proteins is a gene-specific regulator of plant organellar RNA metabolism. PPR proteins may also play a role in organelle biogenesis, probably via binding to organellar transcripts . Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation , and crp1, which is involved in RNA processing . The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organization similar to the human BRCA1 protein.. Probab=34.83 E-value=25 Score=13.59 Aligned_cols=21 Identities=24% Similarity=0.270 Sum_probs=8.2 Q ss_pred HHHHHHHHCCCHHHHHHHHHH Q ss_conf 999999986998999999998 Q gi|254780592|r 211 NLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 211 ~La~~~~~~g~~~eA~~~~~~ 231 (271) .+-..+...|++++|..++++ T Consensus 5 ~li~~~~~~g~~~~a~~~~~~ 25 (35) T TIGR00756 5 TLIDGLCKAGRVEEALELFDE 25 (35) T ss_pred HHHHHHHCCCCHHHHHHHHHH T ss_conf 889998628987899999999 No 391 >PRK12699 flgH flagellar basal body L-ring protein; Reviewed Probab=34.53 E-value=25 Score=13.56 Aligned_cols=22 Identities=23% Similarity=0.387 Sum_probs=14.9 Q ss_pred HHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9999999999998540368543 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~~~ 36 (271) +...++++.+++++||+..++. T Consensus 18 ~~~~~~~~~~~~l~gC~~~~~~ 39 (246) T PRK12699 18 LLGPVLIVMLALVGGCSLPTPA 39 (246) T ss_pred HHHHHHHHHHHHHCCCCCCCCC T ss_conf 4888999999986044689998 No 392 >pfam12262 Lipase_bact_N Bacterial virulence factor lipase N-terminal. This domain family is found in bacteria, and is typically between 258 and 271 amino acids in length. There are two conserved sequence motifs: DGT and DGWST. This family is the N-terminal region of bacterial virulence factor lipase. The N-terminal region contains a potential signalling sequence. Probab=33.29 E-value=26 Score=13.43 Aligned_cols=24 Identities=25% Similarity=0.390 Sum_probs=17.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 989999999999999854036854 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPR 35 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~ 35 (271) ||+....+.+.+++.++||.-... T Consensus 1 MKk~~l~~~i~SAL~LaGCg~~s~ 24 (269) T pfam12262 1 MKKKFLALLLASALLLAGCGDDTE 24 (269) T ss_pred CCHHHHHHHHHHHHHHCCCCCCCC T ss_conf 924799999999864111579965 No 393 >KOG2066 consensus Probab=32.13 E-value=27 Score=13.32 Aligned_cols=33 Identities=12% Similarity=0.164 Sum_probs=15.8 Q ss_pred HHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 957578799999999998741001244666556 Q gi|254780592|r 64 QSHTKNKMIGIVYADVLRRVGRTAQALAVMRQV 96 (271) Q Consensus 64 ~~~P~~~~~~~~~a~~l~~~g~~~~A~~~~~~a 96 (271) +.+..+....-.++..|...|+|..|...+-++ T Consensus 499 ~q~Se~~~L~e~La~LYl~d~~Y~~Al~~ylkl 531 (846) T KOG2066 499 KQNSESTALLEVLAHLYLYDNKYEKALPIYLKL 531 (846) T ss_pred HHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHC T ss_conf 751121369999999998725727799999851 No 394 >PRK10175 hypothetical protein; Provisional Probab=31.73 E-value=16 Score=14.79 Aligned_cols=21 Identities=29% Similarity=0.257 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHCCCCCCC Q ss_conf 999999999985403685433 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~~ 37 (271) .+++.+.++.++||+|...-. T Consensus 3 ~i~~~~~~lllsGC~SIms~t 23 (75) T PRK10175 3 LIVVSIMVTLLSGCGSIISRT 23 (75) T ss_pred EHHHHHHHHHHCCCHHHHCCC T ss_conf 448888999981642443055 No 395 >TIGR03042 PS_II_psbQ_bact photosystem II protein PsbQ. This protein through the member sll1638 from Synechocystis sp. PCC 6803, was shown to be part of the cyanobacteria photosystem II. It is homologous to (but quite diverged from) the chloroplast PsbQ protein, called oxygen-evolving enhancer protein 3 (OEE3). We designate this cyanobacteria protein PsbQ by homology. Probab=31.41 E-value=28 Score=13.24 Aligned_cols=22 Identities=14% Similarity=0.431 Sum_probs=12.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999985403685 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~ 34 (271) +.|.++++++++.++.+|+... T Consensus 2 Rsi~s~~L~~~~~fLvsC~gp~ 23 (142) T TIGR03042 2 RSLASLLLVLLLTFLVSCSGPA 23 (142) T ss_pred HHHHHHHHHHHHHHHHHCCCCC T ss_conf 7699999999999998838998 No 396 >COG2913 OlmA Outer membrane lipoprotein OmlA (small protein A) [Cell envelope biogenesis, outer membrane] Probab=31.28 E-value=28 Score=13.23 Aligned_cols=21 Identities=24% Similarity=0.273 Sum_probs=15.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 899999999999998540368 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~ 33 (271) ..+.++..+++++.+++|++. T Consensus 6 ~~~~~~~a~l~~~als~Cst~ 26 (147) T COG2913 6 TAILAIAALLGAAALSGCSTL 26 (147) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 889999999999983148754 No 397 >pfam03317 ELF ELF protein. This is a family of hypothetical proteins from cereal crops. Probab=30.30 E-value=29 Score=13.13 Aligned_cols=42 Identities=17% Similarity=0.174 Sum_probs=22.3 Q ss_pred HHHHHHHHCCC---CCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCH Q ss_conf 99998540368---5433117999999842019999999999999575787 Q gi|254780592|r 23 PLLLFSSCHLN---PRISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNK 70 (271) Q Consensus 23 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~ 70 (271) -.+.+..|.+- -+++...+++..++.- ...|-..++.||... T Consensus 84 ~~f~lltc~sflitlppeiqdpqalahl~g------lnfylslyeqdp~wv 128 (284) T pfam03317 84 HLFGLLTCASFLITLPPEIQDPQALAHLAG------LNFYLSLYEQDPEWV 128 (284) T ss_pred HHHHHHHHHHEEEECCCCCCCHHHHHHHHC------CHHEEHHHHCCCCHH T ss_conf 997067576603536965478799998703------511002563495099 No 398 >KOG1600 consensus Probab=30.07 E-value=29 Score=13.14 Aligned_cols=34 Identities=21% Similarity=0.082 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCHHH Q ss_conf 3554359999999538985--4788899999998699899 Q gi|254780592|r 187 DLKTAEEKLRFASQMIGAD--SRIRQNLALVVGLQGRMKE 224 (271) Q Consensus 187 ~~~~A~~~~~~al~~~p~~--~~~~~~La~~~~~~g~~~e 224 (271) +++.++.. .+.+++. .++...+|.++...-=.++ T Consensus 274 dyr~ge~~----y~~d~T~~~I~~~a~lGlA~D~K~~s~~ 309 (321) T KOG1600 274 DYRHGEEW----YQLDITWYLIDFFAALGLAYDLKTPSEA 309 (321) T ss_pred HHHHHHHH----HHHCCCHHHHHHHHHHHHHHHCCCCHHH T ss_conf 56746677----5608541999999984067525776299 No 399 >PRK09861 cytoplasmic membrane lipoprotein-28; Provisional Probab=30.03 E-value=29 Score=13.10 Aligned_cols=27 Identities=15% Similarity=0.249 Sum_probs=17.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 56999998999999999999985403685 Q gi|254780592|r 6 CPVILFLKKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 6 ~~~~~f~~~~~~~~~l~~~~~~~~~~~~~ 34 (271) +-++.|++.. ..+++++++++||+... T Consensus 2 ~~~~~~l~~~--a~ll~~~~~l~~C~~~~ 28 (272) T PRK09861 2 KLTTHHLRAG--AALLLAGILLAGCDQSS 28 (272) T ss_pred CCHHHHHHHH--HHHHHHHHHHHHCCCCC T ss_conf 5109999999--99999999987458976 No 400 >TIGR01781 Trep_dent_lipo Treponema denticola clustered lipoprotein; InterPro: IPR010103 This entry represents a family of six predicted lipoproteins from a region of about 20 tandemly arranged genes in the Treponema denticola genome. Two other neighbouring genes share the lipoprotein signal peptide region but do not show more extensive homology. The function of this locus is unknown.. Probab=29.73 E-value=30 Score=13.07 Aligned_cols=22 Identities=32% Similarity=0.495 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCCC Q ss_conf 8999999999999985403685 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~ 34 (271) -|++.+++|+++++|-+|+..- T Consensus 4 dKlKlIFIL~LAvLLFSC~ke~ 25 (463) T TIGR01781 4 DKLKLIFILMLAVLLFSCKKEV 25 (463) T ss_pred CHHHHHHHHHHHHHHHCCCCCC T ss_conf 4046888999999860476211 No 401 >pfam07269 consensus Probab=29.32 E-value=30 Score=13.02 Aligned_cols=32 Identities=13% Similarity=0.180 Sum_probs=16.1 Q ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHH Q ss_conf 999998540368543311799999984201999 Q gi|254780592|r 22 IPLLLFSSCHLNPRISIPDPESLENMNHEQLLE 54 (271) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 54 (271) +.++.++||+++....+... ..-.+|.|++.- T Consensus 7 ~l~l~LAgC~T~d~lat~~G-p~fplNvgrwqP 38 (55) T pfam07269 7 LLILALAGCQTNDVLATCKG-PRFPLNVGRWQP 38 (55) T ss_pred HHHHHHHHCCCCCCCCCCCC-CCCCCCCCCCCC T ss_conf 99999875114671111358-863301245557 No 402 >COG3133 SlyB Outer membrane lipoprotein [Cell envelope biogenesis, outer membrane] Probab=29.27 E-value=30 Score=13.02 Aligned_cols=25 Identities=24% Similarity=0.246 Sum_probs=17.7 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 8999999999999985403685433 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~~ 37 (271) +|...++..+.++.+.||..++..+ T Consensus 2 ~k~~~avA~~~al~L~GCan~d~~S 26 (154) T COG3133 2 IKTALAVAALMALSLTGCANNDTLS 26 (154) T ss_pred CHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 4257899999887400110587436 No 403 >pfam09680 Tiny_TM_bacill Protein of unknown function (Tiny_TM_bacill). This entry represents a family of hypothetical proteins, half of which are 40 residues or less in length. Members are found only in spore-forming species. A Gly-rich variable region is followed by a strongly conserved, highly hydrophobic region, predicted to form a transmembrane helix, ending with an invariant Gly. The consensus for this stretch is FALLVVFILLIIV. Probab=28.80 E-value=29 Score=13.17 Aligned_cols=19 Identities=26% Similarity=0.342 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999854036 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHL 32 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~ 32 (271) ++..++++.+++++.||+- T Consensus 5 gFaLiVVLFILLIIVG~s~ 23 (26) T pfam09680 5 GFALIVVLFILLIIVGASY 23 (26) T ss_pred CEEHHHHHHHHHHHHHHHH T ss_conf 1226999999999983354 No 404 >PRK11162 mltA murein transglycosylase A; Provisional Probab=28.67 E-value=31 Score=12.95 Aligned_cols=21 Identities=24% Similarity=0.463 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHCCCCCC Q ss_conf 999999999998540368543 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~~~ 36 (271) +.+++.++++++++|++.|.- T Consensus 4 ~~~~~~~~~~~l~~c~~~p~~ 24 (362) T PRK11162 4 KYLLMGIVVALLAACSSKPTD 24 (362) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 999999999999760479876 No 405 >PRK13614 lipoprotein LpqB; Provisional Probab=28.42 E-value=31 Score=12.93 Aligned_cols=23 Identities=17% Similarity=0.316 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999999985403685433 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~~~~ 37 (271) ....+++++++.++||++.|... T Consensus 9 ~~~~ll~~~~v~LagCasiP~sg 31 (573) T PRK13614 9 ASLALLVLLGVTLSACAQIPRSG 31 (573) T ss_pred HHHHHHHHHHHHHHHHCCCCCCC T ss_conf 79999999998865404699998 No 406 >TIGR03524 GldJ gliding motility-associated lipoprotein GldJ. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldJ is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldJ abolish the gliding phenotype. GldJ is homologous to GldK. There is a GldJ homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=28.41 E-value=31 Score=12.93 Aligned_cols=19 Identities=32% Similarity=0.518 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHHCCC Q ss_conf 9999999999998540368 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~ 33 (271) +++++++++++++++|+.. T Consensus 7 ~~~~~~~~~~~~~~sC~~~ 25 (559) T TIGR03524 7 FKITFIVFVSLLFVSCKKK 25 (559) T ss_pred HHHHHHHHHHHHHHHCCCC T ss_conf 9999999999997642588 No 407 >pfam07312 DUF1459 Protein of unknown function (DUF1459). This family consists of several hypothetical Caenorhabditis elegans proteins of around 85 residues in length. The function of this family is unknown. Probab=27.76 E-value=30 Score=13.06 Aligned_cols=22 Identities=36% Similarity=0.685 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999899999999999998540368 Q gi|254780592|r 10 LFLKKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 10 ~f~~~~~~~~~l~~~~~~~~~~~~ 33 (271) ||.|.+ +++.+..+++++|+.. T Consensus 1 mfqk~~--ivl~~a~fciss~qv~ 22 (84) T pfam07312 1 MFQKSL--IVLAFALFCISSCQVL 22 (84) T ss_pred CCCHHH--HHHHHHHHHHHHCEEE T ss_conf 930335--6999999986111066 No 408 >PRK04168 hypothetical protein; Provisional Probab=27.63 E-value=18 Score=14.36 Aligned_cols=52 Identities=17% Similarity=0.289 Sum_probs=26.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHC Q ss_conf 899999999999998540368543311799999984201999999999999957 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLNPRISIPDPESLENMNHEQLLEVTSTIGLQYQSH 66 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~ 66 (271) +++..+++++.++.++||.+...... .....-...|.+..+.+.+.+..+.. T Consensus 6 ~~~~~~~~~~~~~~~~g~~~~~~~~~--~~~l~Vf~AGSL~~pf~ei~~~Fe~~ 57 (336) T PRK04168 6 RKIVIVILLLLALVFLGCVNTSEAEP--PGKLKIFHAGSLSVPFEEYEKEFEAY 57 (336) T ss_pred EEHHHHHHHHHHHHHCCCCCCCCCCC--CEEEEEEECCCCHHHHHHHHHHHHHH T ss_conf 22037999999999705557777788--61599997552589999999999987 No 409 >pfam06291 Lambda_Bor Bor protein. This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the Escherichia coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis. Probab=27.58 E-value=32 Score=12.89 Aligned_cols=21 Identities=24% Similarity=0.362 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 98999999999999985403685 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~ 34 (271) ||++. +...+++.++||++.. T Consensus 1 MKk~~--~~~~~allLtgCa~QT 21 (97) T pfam06291 1 MKKML--FAAALALLITGCAQQT 21 (97) T ss_pred CHHHH--HHHHHHHHHCCCCCEE T ss_conf 92259--9999999972133049 No 410 >pfam03032 Brevenin Brevenin/esculentin/gaegurin/rugosin family. This family contains a number of defence peptides secreted from the skin of amphibians, including the opiate-like dermorphins and deltorphins, and the antimicrobial dermoseptins and temporins. Probab=27.55 E-value=29 Score=13.16 Aligned_cols=24 Identities=29% Similarity=0.445 Sum_probs=18.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 999899999999999998540368 Q gi|254780592|r 10 LFLKKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 10 ~f~~~~~~~~~l~~~~~~~~~~~~ 33 (271) +++|+-..+++++..+.+|-|-.. T Consensus 1 ftlKKSlll~fFLG~islSlCe~e 24 (46) T pfam03032 1 FTLKKSLLLVLFLGLVSLSLCEEE 24 (46) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 961053899999865147652443 No 411 >pfam09533 DUF2380 Predicted lipoprotein of unknown function (DUF2380). This family consists of at least 9 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. One appears truncated toward the N-terminus; the others are predicted lipoproteins. The function is unknown. Probab=27.29 E-value=33 Score=12.81 Aligned_cols=24 Identities=17% Similarity=0.338 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHH Q ss_conf 999999999985403685433117 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRISIPD 40 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~~~~~ 40 (271) .+.++++++.+.||++.++..... T Consensus 6 ~~~l~vLal~~~GCasaa~~p~~~ 29 (220) T pfam09533 6 VLWLLVLALLWVGCASAAPTPAQQ 29 (220) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 999999999984501578986666 No 412 >TIGR03044 PS_II_psb27 photosystem II protein Psb27. Members of this family are the Psb27 protein of the cyanobacterial photosynthetic supracomplex, photosystem II. Although most protein components of both cyanobacterial and chloroplast versions of photosystem II are closely related and described together by single HMM families, this family is strictly bacterial. Some uncharacterized proteins with highly divergent sequences, from Arabidopsis, score between trusted and noise cutoffs for this model but are not at this time assigned as functionally equivalent photosystem II proteins. Probab=26.95 E-value=33 Score=12.77 Aligned_cols=51 Identities=16% Similarity=0.139 Sum_probs=25.3 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 999999999985403685433117999999842019999999999999575787999 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNKMIG 73 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~~~~ 73 (271) ..+++...+++.+|+........-.- .+|.-.-+..++.+++...+++.-- T Consensus 12 lal~L~~~l~l~~cs~a~~s~ltG~Y------~~DT~~Vi~tlr~~i~lp~d~pn~~ 62 (135) T TIGR03044 12 LALVLGLCLLLTACSGAAKTRLTGDY------VEDTLAVIQTLREAIDLPDDDPNKS 62 (135) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCH------HHHHHHHHHHHHHHHCCCCCCCCHH T ss_conf 99999999997345677765678752------9989999999999980898998889 No 413 >PRK11459 multidrug resistance outer membrane protein MdtQ; Provisional Probab=26.78 E-value=34 Score=12.75 Aligned_cols=20 Identities=15% Similarity=0.227 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999998540368543 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~ 36 (271) ....+.++++++||++.|.. T Consensus 9 ~~a~l~~~llLaGCa~~~~~ 28 (478) T PRK11459 9 ATASLPLFILLAGCAPMHET 28 (478) T ss_pred HHHHHHHHHHHHCCCCCCCC T ss_conf 89999999998378899999 No 414 >TIGR00380 cobD cobalamin biosynthesis protein CobD; InterPro: IPR004485 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents the CbiB protein, which is involved in cobalamin biosynthesis and porphyrin biosynthesis. It converts cobyric acid to cobinamide by the addition of aminopropanol on the F carboxylic group. It is part of the cob operon .; GO: 0009236 cobalamin biosynthetic process, 0016021 integral to membrane. Probab=26.76 E-value=34 Score=12.75 Aligned_cols=49 Identities=14% Similarity=0.210 Sum_probs=27.5 Q ss_pred HHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCH Q ss_conf 9999985403685433117999999842019999999999999575787 Q gi|254780592|r 22 IPLLLFSSCHLNPRISIPDPESLENMNHEQLLEVTSTIGLQYQSHTKNK 70 (271) Q Consensus 22 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~~~~~~al~~~P~~~ 70 (271) +..+.+++|-+.-.....-.+-.+.+..||.+.|...++-.+-+|.++. T Consensus 89 i~~~Ll~s~is~~sL~~~A~~~~E~~k~GDle~AR~~l~~~VSRdt~~L 137 (322) T TIGR00380 89 IAVILLSSVISVKSLVEAAKKVIESLKEGDLEDARKKLQMIVSRDTEEL 137 (322) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCCCCCC T ss_conf 9999999999999999972202666557982678999876422475534 No 415 >TIGR02522 pilus_cpaD pilus (Caulobacter type) biogenesis lipoprotein CpaD; InterPro: IPR013361 Proteins in this entry consist of a pilus biogenesis protein, CpaD, from Caulobacter, and homologues in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function of the homologues is not known.. Probab=26.73 E-value=34 Score=12.74 Aligned_cols=26 Identities=12% Similarity=0.107 Sum_probs=18.2 Q ss_pred HHHHHHHHHHHHHCCCCCCCCHHHHH Q ss_conf 99999999985403685433117999 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRISIPDPES 43 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~~~~~~~ 43 (271) ++++..++.|+||.++++...+.... T Consensus 2 ~~~~a~~~~LagC~~~~P~~~~~~~p 27 (211) T TIGR02522 2 VLLLALTALLAGCASTDPEIDVGALP 27 (211) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 78988999975305888755534323 No 416 >PRK11653 hypothetical protein; Provisional Probab=26.01 E-value=35 Score=12.66 Aligned_cols=20 Identities=5% Similarity=0.316 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHCCCCCCC Q ss_conf 99999999985403685433 Q gi|254780592|r 18 IVFLIPLLLFSSCHLNPRIS 37 (271) Q Consensus 18 ~~~l~~~~~~~~~~~~~~~~ 37 (271) .+.+..+++|+||...+... T Consensus 25 alav~avF~LagCe~~~e~~ 44 (223) T PRK11653 25 ALAVAAVFMLAGCEKSDETV 44 (223) T ss_pred HHHHHHHHHHHHCCCCCCCH T ss_conf 88988999865047886640 No 417 >PRK10781 rcsF outer membrane lipoprotein; Reviewed Probab=25.61 E-value=35 Score=12.62 Aligned_cols=20 Identities=15% Similarity=0.424 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHHHHCCCCCC Q ss_conf 99999999998540368543 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~~ 36 (271) .+.+.+.++.++||+..... T Consensus 3 ~lpi~llal~ltGCs~l~~~ 22 (133) T PRK10781 3 ALPICLLALMLTGCSMLSRS 22 (133) T ss_pred HHHHHHHHHHHHCCCCCCCC T ss_conf 08999999998431344548 No 418 >TIGR00926 2A1704 Oligopeptide transporter (Peptide:H+ symporter); InterPro: IPR004768 Oligopeptide transporter, peptide:H+ symporter supports the proton-coupled intake of oligopeptides of 2 to 4 amino acids. These proteins may constitute a major route for the absorption of the end products of protein digestion.; GO: 0015198 oligopeptide transporter activity, 0006857 oligopeptide transport, 0016021 integral to membrane. Probab=25.50 E-value=24 Score=13.65 Aligned_cols=26 Identities=15% Similarity=0.400 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCH Q ss_conf 99999999999998540368543311 Q gi|254780592|r 14 KIKCIVFLIPLLLFSSCHLNPRISIP 39 (271) Q Consensus 14 ~~~~~~~l~~~~~~~~~~~~~~~~~~ 39 (271) ||..++-++++++|.+-+.--+...| T Consensus 193 GvPaiLMi~AlivF~~Gs~~YkK~PP 218 (781) T TIGR00926 193 GVPAILMIVALIVFMAGSFMYKKQPP 218 (781) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf 41689999999999830313475917 No 419 >PRK08878 adenosylcobinamide-phosphate synthase; Provisional Probab=25.25 E-value=36 Score=12.58 Aligned_cols=17 Identities=12% Similarity=0.044 Sum_probs=7.3 Q ss_pred HHHHCCCHHHHHHHHHH Q ss_conf 99986998999999998 Q gi|254780592|r 215 VVGLQGRMKEAYSIASQ 231 (271) Q Consensus 215 ~~~~~g~~~eA~~~~~~ 231 (271) .+...|++..|....++ T Consensus 220 ~~~~~~~~~~a~~~~~~ 236 (317) T PRK08878 220 LILLGKNAGHTFQGLLQ 236 (317) T ss_pred HHHHHCCHHHHHHHHHH T ss_conf 99995699999999997 No 420 >pfam10140 essB Predicted membrane protein essB. Members of this family of prokaryotic proteins include the virulence factor essB, which is required for the synthesis and secretion of EsxA and EsxB, both ESAT-6 like proteins. Probab=24.81 E-value=37 Score=12.52 Aligned_cols=80 Identities=19% Similarity=0.112 Sum_probs=39.4 Q ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 97421111100000000000000012457776543322355435-99999995389854788899999998699899999 Q gi|254780592|r 149 AQMGKHSEALIEYERALELSPNESSIVSNIAMSYLLMGDLKTAE-EKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYS 227 (271) Q Consensus 149 ~~~~~~~~A~~~~~~~~~~~p~~~~~~~~~~~~~~~~g~~~~A~-~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~ 227 (271) .-..++.+.+..++..-. ..-+....+.+|..|+...+..... ...-.-+... ++-.+-+==++...|+++||.. T Consensus 233 fi~~dYs~VI~~L~~~~~-~~lp~~~kY~LA~SyV~~e~Ls~~QK~~Ilnnitlk---Sde~yLlYWIyiGrg~~eeAld 308 (359) T pfam10140 233 FLNVDYSKVITTLEKYDP-ESLPKSVKYELAYSYVQVEKLSDDQKENILNNVTLK---SDELYLLYWIYIGRGEFEEALD 308 (359) T ss_pred HHHCCHHHHHHHHHHCCH-HHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCC---CCHHHHHHHHHHCCCCHHHHHH T ss_conf 986258999987620793-249977777898999961666799999998307756---7789999999976876499999 Q ss_pred HHHHH Q ss_conf 99986 Q gi|254780592|r 228 IASQE 232 (271) Q Consensus 228 ~~~~~ 232 (271) +.+.- T Consensus 309 iA~~L 313 (359) T pfam10140 309 IARRL 313 (359) T ss_pred HHHHC T ss_conf 99847 No 421 >PRK11616 hypothetical protein; Provisional Probab=24.51 E-value=37 Score=12.49 Aligned_cols=19 Identities=21% Similarity=0.301 Sum_probs=13.2 Q ss_pred HHHHHHHHHHHHHHHCCCC Q ss_conf 9999999999985403685 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~~~ 34 (271) ...++....+.++||++.. T Consensus 6 ~~f~~~~g~l~~sGCSSvM 24 (109) T PRK11616 6 LAFMICSGMLLLSGCSSVM 24 (109) T ss_pred HHHHHHHHHHHHCCCHHHH T ss_conf 3799985899875751230 No 422 >TIGR00985 3a0801s04tom mitochondrial import receptor subunit; InterPro: IPR016043 Virtually all mitochondrial precursors are imported via the same mechanism : precursors first bind to receptors on the mitochondrial surface, then insert into the translocation channel in the outer membrane. Many outer-membrane proteins participate in the early stages of import, four of which (MAS20, MAS22, MAS37 and MAS70) are components of the receptor. MAS20, which forms a subcomplex with MAS22, seems to interact with most or all mitochondrial precursors, suggesting that the protein binds directly to mitochondrial targeting sequences. The MAS37 and MAS70 components also form a subcomplex, the two subcomplexes possibly binding via their trans- membrane (TM) regions - the TM region of MAS70 promotes oligomerisation of attatched protein domains and shares sequence similarity with the TM region of MAS20 .. Probab=24.34 E-value=37 Score=12.47 Aligned_cols=63 Identities=14% Similarity=0.230 Sum_probs=41.8 Q ss_pred CHHHHHHHHHHHHHHCC--CCHHH-------HHHHHHHHHHHH--CCHHHHHHHHHHHHCCCCCCHHHHHHHHH Q ss_conf 19999999999999575--78799-------999999998741--00124466655663275210013555532 Q gi|254780592|r 51 QLLEVTSTIGLQYQSHT--KNKMI-------GIVYADVLRRVG--RTAQALAVMRQVAILYPEDQEVLAAYGKS 113 (271) Q Consensus 51 ~~~~A~~~~~~al~~~P--~~~~~-------~~~~a~~l~~~g--~~~~A~~~~~~al~~~p~~~~~~~~~~~~ 113 (271) .++.-++..-.-+..+| .+++- -+..|+.|...| +.++|...|-+|+...|...+.+.-+-++ T Consensus 65 k~~~~~r~~~~~~~~~P~p~d~~e~e~fFl~~V~~GE~L~~q~P~~~~e~a~~fy~Al~Vyp~P~~Ll~Iyq~T 138 (155) T TIGR00985 65 KLRKVVREAVAELAKEPDPSDAEEKEAFFLEEVQLGEELLAQQPDKVKEAALHFYKALKVYPQPQDLLSIYQRT 138 (155) T ss_pred HHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHC T ss_conf 99899999998764078999887898799999998778504899517899999998876279924899998642 No 423 >KOG2997 consensus Probab=24.33 E-value=37 Score=12.47 Aligned_cols=24 Identities=17% Similarity=0.278 Sum_probs=11.3 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCH Q ss_conf 741001244666556632752100 Q gi|254780592|r 82 RVGRTAQALAVMRQVAILYPEDQE 105 (271) Q Consensus 82 ~~g~~~~A~~~~~~al~~~p~~~~ 105 (271) +.|..-+|+..+..|+++.|+-.. T Consensus 31 q~G~l~dai~fYR~AlqI~~diEs 54 (366) T KOG2997 31 QDGSLYDAINFYRDALQIVPDIES 54 (366) T ss_pred HCCCHHHHHHHHHHHHCCCCHHHH T ss_conf 347487778887765538800999 No 424 >PRK10974 glycerol-3-phosphate transporter periplasmic binding protein; Provisional Probab=24.15 E-value=38 Score=12.45 Aligned_cols=24 Identities=17% Similarity=0.196 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHCCC Q ss_conf 999899999999-999998540368 Q gi|254780592|r 10 LFLKKIKCIVFL-IPLLLFSSCHLN 33 (271) Q Consensus 10 ~f~~~~~~~~~l-~~~~~~~~~~~~ 33 (271) ||+|.+++-.+. .+++.++|.+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~a~a 25 (439) T PRK10974 1 MFMKSLRSTALGLALGLALSGNAQA 25 (439) T ss_pred CCHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 9557899999999999984455679 No 425 >PRK13859 type IV secretion system lipoprotein VirB7; Provisional Probab=24.14 E-value=38 Score=12.45 Aligned_cols=36 Identities=14% Similarity=0.187 Sum_probs=18.5 Q ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCHHHHH Q ss_conf 9999999854036854331179999998420199999 Q gi|254780592|r 20 FLIPLLLFSSCHLNPRISIPDPESLENMNHEQLLEVT 56 (271) Q Consensus 20 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~A~ 56 (271) ++++++.+++|+++....+...- .-.++.|.++-+- T Consensus 5 lL~l~l~La~CqT~D~lAtckGp-~FplNVgrWqpt~ 40 (55) T PRK13859 5 LLCLALALAGCQTNDTLASCKGP-IFPLNVGRWQPTP 40 (55) T ss_pred HHHHHHHHHHCCCCCCCCCCCCC-CCCCCCCCCCCCH T ss_conf 99999999860046831124588-6331224555786 No 426 >smart00777 Mad3_BUB1_I Mad3/BUB1 hoMad3/BUB1 homology region 1. Proteins containing this domain are checkpoint proteins involved in cell division. This region has been shown to be essential for the binding of the binding of BUB1 and MAD3 to CDC20p. Probab=24.13 E-value=38 Score=12.45 Aligned_cols=43 Identities=16% Similarity=0.155 Sum_probs=28.5 Q ss_pred HHHHHHHHHHHH--CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 543599999995--3898547888999999986998999999998 Q gi|254780592|r 189 KTAEEKLRFASQ--MIGADSRIRQNLALVVGLQGRMKEAYSIASQ 231 (271) Q Consensus 189 ~~A~~~~~~al~--~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~ 231 (271) +.+...|+.... +....+..|...|..+...|++.+|.++|+. T Consensus 80 ~~p~~if~~l~~~~IG~~~AlfYe~~A~~lE~~g~~~~A~~iy~~ 124 (125) T smart00777 80 DEPRELFQFLYSKGIGTKLALFYEEWAQLLEAAGRYKKADEVYQL 124 (125) T ss_pred CCHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHC T ss_conf 999999999998780243699999999999984889889999877 No 427 >PRK13684 Ycf48-like protein; Provisional Probab=24.01 E-value=38 Score=12.43 Aligned_cols=17 Identities=24% Similarity=0.448 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHCC Q ss_conf 99999999999854036 Q gi|254780592|r 16 KCIVFLIPLLLFSSCHL 32 (271) Q Consensus 16 ~~~~~l~~~~~~~~~~~ 32 (271) ..++++++.+.|++|++ T Consensus 10 ~l~l~~~~~~~~~~c~~ 26 (333) T PRK13684 10 NLLLLLALGLVLSGCST 26 (333) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 99999999986322356 No 428 >PRK10523 lipoprotein involved with copper homeostasis and adhesion; Provisional Probab=23.66 E-value=38 Score=12.39 Aligned_cols=25 Identities=16% Similarity=0.278 Sum_probs=18.6 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC Q ss_conf 9899999999999998540368543 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCHLNPRI 36 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~~~~~~ 36 (271) .|.+.++++.+.++.+.||...... T Consensus 3 kk~~~~~~~a~g~~~l~GC~~r~~~ 27 (234) T PRK10523 3 KKAILTALAAVGLFALMGCNNRAEV 27 (234) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCC T ss_conf 5899999999999987634665434 No 429 >COG4851 CamS Protein involved in sex pheromone biosynthesis [General function prediction only] Probab=23.44 E-value=39 Score=12.36 Aligned_cols=21 Identities=29% Similarity=0.335 Sum_probs=14.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCC Q ss_conf 899999999999998540368 Q gi|254780592|r 13 KKIKCIVFLIPLLLFSSCHLN 33 (271) Q Consensus 13 ~~~~~~~~l~~~~~~~~~~~~ 33 (271) |+...+....++++++||.+. T Consensus 2 kktl~i~~ta~vliLs~C~~~ 22 (382) T COG4851 2 KKTLGIAATASVLILSGCFPF 22 (382) T ss_pred CHHHHHHHHHHHHHHHHCCCC T ss_conf 401558999999998611676 No 430 >COG5510 Predicted small secreted protein [Function unknown] Probab=23.04 E-value=40 Score=12.31 Aligned_cols=17 Identities=18% Similarity=0.415 Sum_probs=11.9 Q ss_pred HHHHHHHHHHHHHHCCC Q ss_conf 99999999998540368 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLN 33 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~ 33 (271) .+++++..+.+++|-+. T Consensus 9 i~~vll~s~llaaCNT~ 25 (44) T COG5510 9 IALVLLASTLLAACNTM 25 (44) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999886630 No 431 >pfam05643 DUF799 Putative bacterial lipoprotein (DUF799). This family consists of several bacterial proteins of unknown function. Some of the family members are described as putative lipoproteins. Probab=22.98 E-value=40 Score=12.31 Aligned_cols=19 Identities=26% Similarity=0.466 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHCCCCC Q ss_conf 9999999999854036854 Q gi|254780592|r 17 CIVFLIPLLLFSSCHLNPR 35 (271) Q Consensus 17 ~~~~l~~~~~~~~~~~~~~ 35 (271) .++.+..+++|+||+..++ T Consensus 4 lil~l~~~l~L~gCa~~~~ 22 (215) T pfam05643 4 LILGLAAVLALSACQVQKA 22 (215) T ss_pred HHHHHHHHHHHHHCCCCCC T ss_conf 4999999999862348999 No 432 >TIGR03502 lipase_Pla1_cef extracellular lipase, Pla-1/cef family. Members of this protein family are bacterial lipoproteins largely from the Gammaproteobacteria. Characterized members are expressed in extracellularly and have esterase activity. Members include the lipase Pla-1 from Aeromonas hydrophila (AF092033) and CHO cell elongation factor (cef) from Vibrio hollisae Probab=22.58 E-value=40 Score=12.26 Aligned_cols=20 Identities=20% Similarity=0.343 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHHHHHHHHC Q ss_conf 98999999999999985403 Q gi|254780592|r 12 LKKIKCIVFLIPLLLFSSCH 31 (271) Q Consensus 12 ~~~~~~~~~l~~~~~~~~~~ 31 (271) |||....+.+.+++.++||. T Consensus 1 MKk~~l~~~iasal~LagCG 20 (792) T TIGR03502 1 MKKLLLSLAIASALGLAGCG 20 (792) T ss_pred CCHHHHHHHHHHHHHHHCCC T ss_conf 93378999999987500457 No 433 >PRK09857 hypothetical protein; Provisional Probab=22.54 E-value=41 Score=12.25 Aligned_cols=56 Identities=16% Similarity=0.084 Sum_probs=43.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCC Q ss_conf 54332235543599999995389854788899999998699899999999862917 Q gi|254780592|r 181 SYLLMGDLKTAEEKLRFASQMIGADSRIRQNLALVVGLQGRMKEAYSIASQELSPE 236 (271) Q Consensus 181 ~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~La~~~~~~g~~~eA~~~~~~~L~~~ 236 (271) .....|+...-...++...+..|...+.....|.-+...|+.+++..+.++-|... T Consensus 215 Yll~~~~~~~~~~fi~~la~~~p~~~E~iMTIAE~l~qEG~q~~~~~iAr~mL~~G 270 (292) T PRK09857 215 YILQTGDAVRFNDFIDGVAERSPKHKESLMTIAERLRQEGEQSKALHIAKIMLESG 270 (292) T ss_pred HHHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99861674458999999998584101378659999999999999999999999869 No 434 >pfam01535 PPR PPR repeat. This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. #=GF CC This repeat occurs in PET309, which may be involved in RNA stabilisation. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein that has a domain organisation similar to the human BRCA1 protein. The repeat has been called PPR. Probab=22.16 E-value=41 Score=12.21 Aligned_cols=20 Identities=20% Similarity=0.225 Sum_probs=7.6 Q ss_pred HHHHHHHHCCCHHHHHHHHH Q ss_conf 99999998699899999999 Q gi|254780592|r 211 NLALVVGLQGRMKEAYSIAS 230 (271) Q Consensus 211 ~La~~~~~~g~~~eA~~~~~ 230 (271) .+-..+...|++++|.++++ T Consensus 5 ~li~~~~~~g~~~~a~~~~~ 24 (31) T pfam01535 5 SLISGYCKAGKLEEALELFK 24 (31) T ss_pred HHHHHHHHCCCHHHHHHHHH T ss_conf 99999987799999999999 No 435 >COG4808 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.25 E-value=43 Score=12.09 Aligned_cols=24 Identities=17% Similarity=0.283 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 998999999999999985403685 Q gi|254780592|r 11 FLKKIKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 11 f~~~~~~~~~l~~~~~~~~~~~~~ 34 (271) |.+....++.+++++.++||.-.. T Consensus 4 l~kl~~~~~alil~~sl~gCgdkE 27 (152) T COG4808 4 LNKLFSLVVALVLVFSLAGCGDKE 27 (152) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 887889999999988754048666 No 436 >PRK10477 outer membrane lipoprotein Blc; Provisional Probab=20.85 E-value=44 Score=12.04 Aligned_cols=20 Identities=10% Similarity=0.301 Sum_probs=14.4 Q ss_pred HHHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999985403685 Q gi|254780592|r 15 IKCIVFLIPLLLFSSCHLNP 34 (271) Q Consensus 15 ~~~~~~l~~~~~~~~~~~~~ 34 (271) +..++.+..++++.+|++.. T Consensus 4 ~~~~~~~~~~~~~~~c~~~~ 23 (177) T PRK10477 4 LPVVAAVTAAFLVVACSSPT 23 (177) T ss_pred HHHHHHHHHHHHHHHCCCCC T ss_conf 78999999999997646999 No 437 >PRK08476 F0F1 ATP synthase subunit B'; Validated Probab=20.40 E-value=45 Score=11.98 Aligned_cols=21 Identities=14% Similarity=0.472 Sum_probs=14.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHH Q ss_conf 927875699999899999999 Q gi|254780592|r 1 MIILFCPVILFLKKIKCIVFL 21 (271) Q Consensus 1 ~~~~~~~~~~f~~~~~~~~~l 21 (271) |||-+=|..||..-|..++++ T Consensus 1 ~ml~~n~~~~i~~~i~FliL~ 21 (141) T PRK08476 1 MMLDINPYLMLLTFVVFLLLI 21 (141) T ss_pred CCCCCCHHHHHHHHHHHHHHH T ss_conf 965776779999999999999 No 438 >pfam00244 14-3-3 14-3-3 protein. Probab=20.35 E-value=45 Score=11.98 Aligned_cols=46 Identities=24% Similarity=0.235 Sum_probs=30.3 Q ss_pred HHHHHHHHHHHHH-----CCCCCHHHHHHHHH-----HHHHCCCHHHHHHHHHHHCC Q ss_conf 5543599999995-----38985478889999-----99986998999999998629 Q gi|254780592|r 188 LKTAEEKLRFASQ-----MIGADSRIRQNLAL-----VVGLQGRMKEAYSIASQELS 234 (271) Q Consensus 188 ~~~A~~~~~~al~-----~~p~~~~~~~~La~-----~~~~~g~~~eA~~~~~~~L~ 234 (271) .+.|...|++|++ +.|.+ .++..|++ .|...++.++|..+.+++.. T Consensus 142 ~~~a~~aY~~A~~~A~~~L~pt~-PirLgLaLN~SVF~YEi~~~~~~A~~lAk~Afd 197 (236) T pfam00244 142 ADKALEAYKAALEIAEKELPPTH-PIRLGLALNFSVFYYEILNSPEKACELAKQAFD 197 (236) T ss_pred HHHHHHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99999999999999986289547-999999998999999981898999999999999 No 439 >cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear. Probab=20.32 E-value=45 Score=11.97 Aligned_cols=15 Identities=20% Similarity=0.346 Sum_probs=6.2 Q ss_pred HHCCCHHHHHHHHHH Q ss_conf 986998999999998 Q gi|254780592|r 217 GLQGRMKEAYSIASQ 231 (271) Q Consensus 217 ~~~g~~~eA~~~~~~ 231 (271) ...|+.++|+.+|.. T Consensus 17 DE~g~~~eAieLYt~ 31 (75) T cd02680 17 DEKGNAEEAIELYTE 31 (75) T ss_pred HHCCCHHHHHHHHHH T ss_conf 434688888999999 No 440 >pfam04090 RNA_pol_I_TF RNA polymerase I specific initiation factor. Probab=20.18 E-value=45 Score=11.95 Aligned_cols=137 Identities=9% Similarity=-0.103 Sum_probs=55.6 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCH-HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHH Q ss_conf 999999998741001244666556632752100-1355553210000011258876422110000233566999999974 Q gi|254780592|r 73 GIVYADVLRRVGRTAQALAVMRQVAILYPEDQE-VLAAYGKSLANAGYLDEGLDAINRAQRPDIPDWQLISAKGSVLAQM 151 (271) Q Consensus 73 ~~~~a~~l~~~g~~~~A~~~~~~al~~~p~~~~-~~~~~~~~~~~~g~~~~A~~~~~~a~~~~p~~~~~~~~lg~~~~~~ 151 (271) +..+-......++++.|..+|--++..-.-+.+ .|..-..++.+++......+.++.....-|.... .... T Consensus 44 L~~LLhlnilr~nW~~AYk~F~LLIR~p~VDIR~iW~lG~eIL~~l~~~~~~~~Fl~wl~~~ys~~~~--------fn~~ 115 (192) T pfam04090 44 LTDLLHLNILRRNWKLAYRIFCLLIRLPSVDIRSIWPLGSEILNNLNEETSSEKFLEWLSSFYSSKSS--------FNQS 115 (192) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH--------HCCC T ss_conf 99999999998607899999999981678747860402799999758103289999999987567630--------0278 Q ss_pred HHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHC Q ss_conf 211111000000000-0000001245777654332235543599999995389--8547888999999986 Q gi|254780592|r 152 GKHSEALIEYERALE-LSPNESSIVSNIAMSYLLMGDLKTAEEKLRFASQMIG--ADSRIRQNLALVVGLQ 219 (271) Q Consensus 152 ~~~~~A~~~~~~~~~-~~p~~~~~~~~~~~~~~~~g~~~~A~~~~~~al~~~p--~~~~~~~~La~~~~~~ 219 (271) .++......|+..-. ..|...-. .+=..+...++|.+++..+.+.+=..| ++..+|+-.|.+.... T Consensus 116 ~~~~~~aPvfrsGSrthTP~yvit--~LW~lli~~~~y~~l~~~lsElvL~PPy~~d~~~~fi~alc~l~~ 184 (192) T pfam04090 116 TNYRSLAPVFRSGSRTHTPLYIIT--SLWNLLIKRTEYQELIDKLSELVLEPPYMDDGEIWFIYALCHLVE 184 (192) T ss_pred CCCCCCCCCCCCCCCCCCHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHH T ss_conf 876566875046875545799999--999999963589999999988820799888843688899999997 Done!