Query gi|254781033|ref|YP_003065446.1| HemY domain-containing protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 492 No_of_seqs 366 out of 1705 Neff 8.9 Searched_HMMs 39220 Date Mon May 30 03:46:11 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254781033.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG3898 Uncharacterized membra 100.0 0 0 647.2 48.6 424 1-426 1-429 (531) 2 PRK10747 putative protoheme IX 100.0 0 0 410.3 48.0 384 1-387 1-389 (398) 3 COG3071 HemY Uncharacterized e 100.0 1.8E-41 0 341.7 45.9 384 1-387 1-389 (400) 4 pfam07219 HemY_N HemY protein 99.9 9.3E-25 2.4E-29 211.2 18.2 133 1-135 1-134 (134) 5 PRK11788 hypothetical protein; 99.9 1.4E-20 3.6E-25 178.6 39.0 330 85-419 36-381 (389) 6 TIGR02917 PEP_TPR_lipo putativ 99.9 6.6E-21 1.7E-25 181.1 33.5 295 91-392 390-688 (924) 7 TIGR02917 PEP_TPR_lipo putativ 99.9 5.6E-21 1.4E-25 181.7 33.0 299 86-391 272-585 (924) 8 PRK11447 cellulose synthase su 99.9 1.9E-16 4.8E-21 146.4 40.0 302 82-386 268-665 (1158) 9 COG2956 Predicted N-acetylgluc 99.8 2.8E-15 7.2E-20 137.2 34.9 328 87-419 38-381 (389) 10 TIGR00540 hemY_coli hemY prote 99.8 9.9E-16 2.5E-20 140.8 32.1 377 1-380 1-391 (409) 11 PRK11447 cellulose synthase su 99.8 1.9E-13 4.9E-18 122.9 35.9 309 71-386 339-739 (1158) 12 KOG4626 consensus 99.8 1.8E-15 4.6E-20 138.8 23.8 300 89-396 155-494 (966) 13 KOG4626 consensus 99.7 4.9E-15 1.2E-19 135.4 22.7 261 91-359 225-490 (966) 14 PRK10747 putative protoheme IX 99.7 6.2E-12 1.6E-16 111.1 34.7 342 5-393 2-362 (398) 15 KOG0495 consensus 99.7 5.4E-12 1.4E-16 111.6 30.8 289 95-391 527-849 (913) 16 KOG1126 consensus 99.7 4.8E-13 1.2E-17 119.8 24.9 300 84-390 319-622 (638) 17 PRK11788 hypothetical protein; 99.6 8.4E-12 2.1E-16 110.1 29.2 294 126-425 41-374 (389) 18 KOG0495 consensus 99.6 2E-11 5.1E-16 107.2 30.1 262 123-390 587-882 (913) 19 KOG2002 consensus 99.6 7.6E-12 1.9E-16 110.5 27.5 289 90-386 134-440 (1018) 20 COG3071 HemY Uncharacterized e 99.6 7.4E-10 1.9E-14 94.9 34.1 88 132-219 96-184 (400) 21 KOG2002 consensus 99.6 1.3E-10 3.3E-15 100.9 29.4 255 94-352 280-591 (1018) 22 KOG1126 consensus 99.6 7.6E-11 1.9E-15 102.6 26.9 277 85-367 354-633 (638) 23 TIGR02521 type_IV_pilW type IV 99.5 6E-12 1.5E-16 111.2 20.5 190 190-384 39-241 (247) 24 KOG0547 consensus 99.5 8.3E-11 2.1E-15 102.3 25.9 79 130-208 125-203 (606) 25 KOG2076 consensus 99.5 3.5E-10 8.9E-15 97.5 27.6 322 64-391 118-481 (895) 26 KOG0624 consensus 99.5 7E-09 1.8E-13 87.3 32.2 267 85-357 39-339 (504) 27 COG2956 Predicted N-acetylgluc 99.5 2.1E-09 5.4E-14 91.4 27.8 293 127-425 42-374 (389) 28 TIGR02521 type_IV_pilW type IV 99.4 9.9E-11 2.5E-15 101.7 20.2 198 124-351 41-242 (247) 29 PRK10049 pgaA outer membrane p 99.4 9.3E-08 2.4E-12 78.6 33.6 131 263-394 327-481 (818) 30 PRK10049 pgaA outer membrane p 99.4 1.1E-07 2.8E-12 78.0 32.9 127 233-359 331-480 (818) 31 KOG1125 consensus 99.4 5.9E-11 1.5E-15 103.5 15.1 235 124-358 289-531 (579) 32 COG3063 PilF Tfp pilus assembl 99.4 1.7E-09 4.3E-14 92.2 22.0 156 130-287 45-200 (250) 33 COG3063 PilF Tfp pilus assembl 99.4 9.5E-10 2.4E-14 94.1 20.7 192 156-386 37-234 (250) 34 KOG2003 consensus 99.4 5.2E-09 1.3E-13 88.3 24.2 217 129-350 499-718 (840) 35 KOG0547 consensus 99.3 1.7E-08 4.2E-13 84.4 26.6 277 80-360 272-572 (606) 36 KOG4162 consensus 99.3 1.5E-07 3.7E-12 77.0 31.4 249 138-387 462-782 (799) 37 KOG1173 consensus 99.3 1.8E-08 4.5E-13 84.1 24.6 264 95-364 255-528 (611) 38 KOG1174 consensus 99.3 2.6E-08 6.7E-13 82.8 24.6 276 117-393 193-505 (564) 39 TIGR00990 3a0801s09 mitochondr 99.3 8.1E-09 2.1E-13 86.8 21.4 259 100-362 338-613 (649) 40 KOG1915 consensus 99.3 2.2E-07 5.6E-12 75.7 28.6 303 93-397 216-546 (677) 41 KOG2003 consensus 99.2 6.6E-07 1.7E-11 71.9 30.2 204 162-369 498-705 (840) 42 KOG0624 consensus 99.2 3.6E-08 9.1E-13 81.8 23.1 266 90-361 78-377 (504) 43 KOG1155 consensus 99.2 1.4E-06 3.6E-11 69.4 30.0 25 130-154 272-296 (559) 44 KOG1070 consensus 99.2 2.8E-07 7E-12 74.9 25.8 243 139-387 1443-1699(1710) 45 KOG1155 consensus 99.2 2.1E-06 5.4E-11 68.0 32.1 266 118-387 162-494 (559) 46 pfam04733 Coatomer_E Coatomer 99.2 8.1E-07 2.1E-11 71.2 26.8 262 125-396 6-273 (290) 47 KOG1174 consensus 99.1 2.8E-08 7.2E-13 82.6 18.6 190 166-360 312-506 (564) 48 KOG2076 consensus 99.1 1.3E-06 3.4E-11 69.5 26.6 279 91-371 214-539 (895) 49 KOG1173 consensus 99.1 1.5E-06 3.7E-11 69.2 25.1 262 126-393 250-523 (611) 50 KOG1125 consensus 99.1 3.2E-08 8.1E-13 82.2 16.3 226 161-390 292-529 (579) 51 COG4783 Putative Zn-dependent 99.1 1.5E-06 3.9E-11 69.1 24.8 149 226-374 306-459 (484) 52 KOG1129 consensus 99.0 1.7E-07 4.2E-12 76.6 18.9 296 89-393 152-463 (478) 53 PRK12370 invasion protein regu 99.0 8.2E-06 2.1E-10 63.4 26.5 211 157-397 263-478 (553) 54 KOG1156 consensus 99.0 4.9E-06 1.2E-10 65.1 25.0 296 89-387 46-433 (700) 55 KOG1840 consensus 99.0 3.8E-06 9.7E-11 66.0 24.4 239 149-388 194-479 (508) 56 cd05804 StaR_like StaR_like; a 99.0 9.6E-06 2.4E-10 62.8 33.5 263 85-352 44-334 (355) 57 TIGR00990 3a0801s09 mitochondr 99.0 1E-07 2.6E-12 78.2 15.9 250 134-390 336-607 (649) 58 KOG1127 consensus 98.9 3.3E-07 8.5E-12 74.2 16.9 84 133-216 575-658 (1238) 59 KOG1129 consensus 98.9 1.1E-06 2.9E-11 70.1 17.8 197 189-391 224-427 (478) 60 PRK12370 invasion protein regu 98.8 1.6E-06 4E-11 69.0 18.3 158 135-296 276-444 (553) 61 KOG0548 consensus 98.8 2.8E-05 7.2E-10 59.2 28.4 299 85-387 3-454 (539) 62 COG4783 Putative Zn-dependent 98.8 3E-05 7.7E-10 58.9 26.2 124 257-381 303-430 (484) 63 KOG1070 consensus 98.8 2.4E-05 6.2E-10 59.7 23.2 214 174-389 1444-1664(1710) 64 COG5010 TadD Flp pilus assembl 98.8 4.7E-06 1.2E-10 65.2 19.4 162 124-289 70-231 (257) 65 KOG1840 consensus 98.8 3.9E-05 1E-09 58.1 28.7 169 185-353 196-395 (508) 66 PRK10370 formate-dependent nit 98.8 1.9E-06 4.9E-11 68.3 17.2 152 191-357 27-184 (206) 67 KOG0550 consensus 98.8 3.4E-06 8.6E-11 66.4 18.3 276 86-367 51-363 (486) 68 KOG1156 consensus 98.8 5.1E-05 1.3E-09 57.1 26.1 53 304-356 384-436 (700) 69 COG4235 Cytochrome c biogenesi 98.7 4.3E-05 1.1E-09 57.8 22.8 119 241-359 137-261 (287) 70 COG3898 Uncharacterized membra 98.7 6.1E-05 1.6E-09 56.5 32.9 260 88-359 124-396 (531) 71 PRK04841 transcriptional regul 98.7 6.6E-05 1.7E-09 56.3 31.7 301 86-391 411-764 (903) 72 COG5010 TadD Flp pilus assembl 98.7 1.7E-05 4.3E-10 60.9 19.2 175 141-352 55-229 (257) 73 KOG0548 consensus 98.7 2.2E-05 5.6E-10 60.0 19.3 204 157-361 227-462 (539) 74 KOG3060 consensus 98.6 5.1E-05 1.3E-09 57.2 20.7 172 154-329 52-229 (289) 75 TIGR03302 OM_YfiO outer membra 98.6 1.9E-05 4.8E-10 60.5 18.3 186 157-357 36-235 (235) 76 PRK09782 bacteriophage N4 rece 98.6 6.9E-05 1.8E-09 56.1 21.2 159 232-391 548-709 (987) 77 KOG3060 consensus 98.6 3.3E-05 8.4E-10 58.7 19.4 165 92-262 60-227 (289) 78 KOG2376 consensus 98.6 2.4E-05 6.1E-10 59.8 17.9 124 88-216 83-252 (652) 79 pfam04733 Coatomer_E Coatomer 98.5 0.00018 4.5E-09 53.0 28.8 256 95-367 12-278 (290) 80 PRK11189 lipoprotein NlpI; Pro 98.5 0.00018 4.6E-09 52.9 21.5 186 168-358 79-270 (297) 81 KOG0550 consensus 98.5 2.2E-05 5.7E-10 60.0 16.8 176 122-297 171-358 (486) 82 PRK02603 photosystem I assembl 98.5 6E-06 1.5E-10 64.4 13.7 143 99-265 14-159 (172) 83 PRK10370 formate-dependent nit 98.5 5E-05 1.3E-09 57.2 17.7 103 116-218 77-182 (206) 84 cd00189 TPR Tetratricopeptide 98.5 7.4E-06 1.9E-10 63.7 13.3 95 263-357 3-100 (100) 85 PRK04841 transcriptional regul 98.5 0.00024 6.2E-09 51.9 28.8 29 326-354 732-760 (903) 86 COG4235 Cytochrome c biogenesi 98.5 3.2E-05 8.1E-10 58.8 15.6 126 166-292 134-259 (287) 87 cd00189 TPR Tetratricopeptide 98.4 7.1E-06 1.8E-10 63.8 11.0 59 160-218 6-64 (100) 88 PRK10866 outer membrane protei 98.4 0.00037 9.4E-09 50.5 19.8 178 157-349 35-236 (243) 89 KOG4162 consensus 98.4 0.00044 1.1E-08 49.9 26.4 302 86-391 359-752 (799) 90 PRK02603 photosystem I assembl 98.3 2.6E-05 6.7E-10 59.4 13.1 100 174-274 19-120 (172) 91 KOG2376 consensus 98.3 0.00053 1.4E-08 49.2 27.8 165 241-411 356-542 (652) 92 TIGR03302 OM_YfiO outer membra 98.3 0.00061 1.6E-08 48.8 18.6 187 84-291 33-234 (235) 93 CHL00033 ycf3 photosystem I as 98.2 2.3E-05 5.8E-10 59.9 10.9 122 117-262 28-152 (165) 94 PRK11189 lipoprotein NlpI; Pro 98.2 0.00085 2.2E-08 47.7 22.6 219 79-305 60-282 (297) 95 KOG1127 consensus 98.2 0.00092 2.3E-08 47.4 22.5 219 133-353 471-699 (1238) 96 KOG0553 consensus 98.2 3.7E-05 9.5E-10 58.2 11.0 108 233-371 88-195 (304) 97 COG3118 Thioredoxin domain-con 98.1 0.00082 2.1E-08 47.7 17.1 154 230-383 138-297 (304) 98 KOG3081 consensus 98.1 0.0013 3.3E-08 46.2 22.5 164 229-397 111-280 (299) 99 KOG1128 consensus 98.1 0.00082 2.1E-08 47.7 16.4 128 267-396 431-589 (777) 100 cd05804 StaR_like StaR_like; a 98.1 0.0016 4E-08 45.6 27.7 259 124-386 47-334 (355) 101 COG4105 ComL DNA uptake lipopr 98.1 0.0017 4.4E-08 45.2 17.4 199 120-359 34-238 (254) 102 KOG0553 consensus 98.0 0.0008 2E-08 47.9 15.4 98 193-294 86-183 (304) 103 KOG4340 consensus 98.0 0.00077 2E-08 48.0 15.2 117 94-217 88-207 (459) 104 KOG1128 consensus 98.0 0.0022 5.7E-08 44.4 22.7 48 241-288 534-581 (777) 105 PRK10803 hypothetical protein; 97.9 0.0029 7.4E-08 43.5 18.4 97 267-363 149-254 (262) 106 KOG2053 consensus 97.9 0.0035 8.9E-08 42.8 27.4 41 309-349 457-497 (932) 107 pfam09295 ChAPs ChAPs (Chs5p-A 97.9 0.00066 1.7E-08 48.5 12.6 103 241-346 184-289 (395) 108 COG4700 Uncharacterized protei 97.9 0.0033 8.4E-08 43.0 15.7 143 146-292 82-229 (251) 109 COG4700 Uncharacterized protei 97.8 0.0036 9.3E-08 42.7 15.7 100 123-222 92-194 (251) 110 KOG1915 consensus 97.8 0.0045 1.1E-07 42.0 33.8 291 94-389 83-501 (677) 111 KOG3785 consensus 97.8 0.0046 1.2E-07 41.9 21.4 111 260-370 285-440 (557) 112 TIGR02795 tol_pal_ybgF tol-pal 97.8 0.0011 2.7E-08 46.9 12.5 98 264-361 6-112 (119) 113 KOG0543 consensus 97.8 0.0015 3.7E-08 45.8 13.0 90 299-388 265-355 (397) 114 pfam09295 ChAPs ChAPs (Chs5p-A 97.8 0.0023 5.9E-08 44.2 14.0 121 159-286 174-294 (395) 115 PRK10866 outer membrane protei 97.8 0.0054 1.4E-07 41.4 17.0 177 85-282 33-234 (243) 116 COG2909 MalT ATP-dependent tra 97.7 0.0065 1.7E-07 40.8 24.6 91 124-216 419-525 (894) 117 PRK09782 bacteriophage N4 rece 97.7 0.0066 1.7E-07 40.7 30.3 115 240-355 590-707 (987) 118 TIGR02795 tol_pal_ybgF tol-pal 97.7 0.0012 3E-08 46.6 10.9 102 162-264 10-114 (119) 119 CHL00033 ycf3 photosystem I as 97.6 0.0042 1.1E-07 42.2 13.6 63 229-291 34-99 (165) 120 PRK10803 hypothetical protein; 97.6 0.0039 1E-07 42.5 13.1 98 165-263 153-253 (262) 121 COG1729 Uncharacterized protei 97.6 0.01 2.6E-07 39.2 16.8 101 162-263 149-252 (262) 122 COG1729 Uncharacterized protei 97.5 0.0061 1.6E-07 41.0 13.1 107 229-363 144-253 (262) 123 TIGR02552 LcrH_SycD type III s 97.5 0.006 1.5E-07 41.0 12.6 106 117-222 14-121 (137) 124 TIGR02552 LcrH_SycD type III s 97.4 0.0023 5.9E-08 44.3 10.0 109 145-257 9-117 (137) 125 COG4649 Uncharacterized protei 97.4 0.014 3.6E-07 38.1 14.6 139 232-370 64-216 (221) 126 COG3118 Thioredoxin domain-con 97.4 0.015 3.9E-07 37.9 19.5 129 158-290 138-266 (304) 127 KOG0543 consensus 97.4 0.0095 2.4E-07 39.5 12.7 71 155-225 258-328 (397) 128 KOG1941 consensus 97.3 0.02 5E-07 37.0 17.8 165 126-290 89-276 (518) 129 KOG3785 consensus 97.3 0.021 5.4E-07 36.7 18.1 186 97-296 35-221 (557) 130 COG4785 NlpI Lipoprotein NlpI, 97.3 0.021 5.5E-07 36.7 21.3 218 79-301 60-278 (297) 131 COG2976 Uncharacterized protei 97.2 0.024 6.2E-07 36.3 18.2 90 162-256 97-189 (207) 132 KOG2796 consensus 97.1 0.029 7.4E-07 35.7 15.8 157 228-384 179-353 (366) 133 KOG2796 consensus 97.1 0.029 7.4E-07 35.7 12.7 208 121-328 123-362 (366) 134 COG2909 MalT ATP-dependent tra 97.1 0.032 8.2E-07 35.3 25.3 96 88-183 419-526 (894) 135 KOG2053 consensus 97.0 0.034 8.7E-07 35.1 24.4 121 96-219 21-141 (932) 136 KOG4340 consensus 97.0 0.038 9.8E-07 34.7 14.6 249 91-347 51-332 (459) 137 pfam03704 BTAD Bacterial trans 96.9 0.049 1.2E-06 33.9 12.9 60 156-215 64-123 (146) 138 KOG4555 consensus 96.9 0.039 9.9E-07 34.7 11.4 96 165-260 54-149 (175) 139 COG4105 ComL DNA uptake lipopr 96.8 0.049 1.3E-06 33.9 18.9 192 162-396 42-241 (254) 140 KOG4555 consensus 96.7 0.045 1.1E-06 34.2 10.6 95 127-221 50-148 (175) 141 pfam03704 BTAD Bacterial trans 96.6 0.068 1.7E-06 32.8 11.0 131 156-302 8-145 (146) 142 COG4785 NlpI Lipoprotein NlpI, 96.5 0.077 2E-06 32.4 17.4 179 173-358 84-270 (297) 143 COG0457 NrfG FOG: TPR repeat [ 96.5 0.079 2E-06 32.3 27.3 51 307-357 218-268 (291) 144 KOG4234 consensus 96.5 0.084 2.1E-06 32.1 13.4 56 304-359 147-202 (271) 145 pfam07219 HemY_N HemY protein 96.5 0.087 2.2E-06 32.0 11.2 128 5-169 2-134 (134) 146 pfam06305 DUF1049 Protein of u 96.5 0.036 9.1E-07 35.0 9.0 69 14-89 5-73 (80) 147 COG0457 NrfG FOG: TPR repeat [ 96.4 0.096 2.4E-06 31.6 27.6 83 306-389 182-266 (291) 148 KOG3081 consensus 96.3 0.1 2.6E-06 31.5 22.6 250 96-360 20-277 (299) 149 COG1066 Sms Predicted ATP-depe 96.3 0.0025 6.4E-08 44.0 2.3 34 392-425 6-40 (456) 150 KOG2047 consensus 96.3 0.11 2.7E-06 31.2 31.5 194 129-322 356-581 (835) 151 PRK10153 DNA-binding transcrip 96.3 0.11 2.8E-06 31.2 18.1 148 255-403 324-491 (512) 152 COG4649 Uncharacterized protei 96.2 0.12 2.9E-06 31.0 14.0 145 74-218 48-197 (221) 153 PRK12798 chemotaxis protein; R 96.2 0.12 2.9E-06 31.0 18.9 159 195-355 119-287 (416) 154 PRK11823 DNA repair protein Ra 96.2 0.0031 8E-08 43.2 2.3 32 390-421 4-36 (454) 155 KOG4648 consensus 96.2 0.12 3.1E-06 30.8 10.6 102 163-268 106-207 (536) 156 COG4455 ImpE Protein of avirul 96.1 0.093 2.4E-06 31.7 9.6 57 161-217 8-64 (273) 157 pfam02259 FAT FAT domain. The 96.0 0.14 3.6E-06 30.4 23.9 65 225-289 149-215 (351) 158 KOG0551 consensus 96.0 0.14 3.6E-06 30.3 10.9 99 190-288 83-181 (390) 159 cd01121 Sms Sms (bacterial rad 96.0 0.0046 1.2E-07 41.9 2.5 29 394-422 1-30 (372) 160 KOG2047 consensus 95.8 0.17 4.4E-06 29.7 28.2 221 167-389 490-726 (835) 161 pfam06552 TOM20_plant Plant sp 95.7 0.18 4.6E-06 29.5 9.5 62 308-369 52-124 (186) 162 pfam09613 HrpB1_HrpK Bacterial 95.5 0.22 5.7E-06 28.7 11.1 99 155-258 11-109 (156) 163 PRK10941 putative transcriptio 95.4 0.23 5.8E-06 28.7 9.7 71 300-370 190-261 (269) 164 pfam10300 IML2 Putative mitoch 95.4 0.23 5.8E-06 28.7 21.4 146 239-384 186-354 (446) 165 KOG4234 consensus 95.3 0.25 6.4E-06 28.4 12.8 99 193-291 100-199 (271) 166 COG1592 Rubrerythrin [Energy p 95.2 0.26 6.7E-06 28.2 12.0 22 393-415 134-158 (166) 167 TIGR00416 sms DNA repair prote 95.1 0.011 2.7E-07 39.1 1.7 30 390-419 5-35 (481) 168 KOG3617 consensus 94.9 0.31 7.9E-06 27.7 21.2 165 124-288 862-1108(1416) 169 KOG0545 consensus 94.8 0.33 8.4E-06 27.5 13.2 55 304-358 243-297 (329) 170 PRK10941 putative transcriptio 94.6 0.35 8.9E-06 27.2 8.3 68 267-334 188-258 (269) 171 COG2976 Uncharacterized protei 94.6 0.38 9.7E-06 27.0 16.0 134 86-220 55-191 (207) 172 KOG4648 consensus 94.4 0.4 1E-05 26.8 8.5 109 191-303 100-209 (536) 173 KOG1308 consensus 94.4 0.094 2.4E-06 31.7 5.1 70 303-372 160-229 (377) 174 pfam04348 LppC LppC putative l 94.4 0.42 1.1E-05 26.7 18.1 108 106-214 11-124 (535) 175 COG5416 Uncharacterized integr 94.3 0.25 6.4E-06 28.4 7.2 64 11-81 31-94 (98) 176 KOG0376 consensus 94.3 0.17 4.4E-06 29.6 6.3 108 90-199 10-117 (476) 177 KOG0551 consensus 94.1 0.44 1.1E-05 26.4 8.0 100 266-365 87-193 (390) 178 KOG0376 consensus 94.0 0.14 3.5E-06 30.4 5.2 87 131-219 15-103 (476) 179 pfam10345 Cohesin_load Cohesin 93.8 0.52 1.3E-05 25.9 22.5 117 98-215 75-207 (593) 180 PRK11906 transcriptional regul 93.8 0.53 1.3E-05 25.9 10.1 88 276-363 320-410 (458) 181 pfam09613 HrpB1_HrpK Bacterial 93.8 0.53 1.3E-05 25.9 12.2 61 304-364 23-83 (156) 182 pfam04053 Coatomer_WDAD Coatom 93.6 0.57 1.5E-05 25.6 16.2 156 94-286 265-420 (435) 183 KOG1308 consensus 93.5 0.05 1.3E-06 33.9 2.3 62 158-219 118-179 (377) 184 pfam11146 DUF2905 Protein of u 93.5 0.38 9.8E-06 26.9 6.8 57 1-57 1-64 (65) 185 pfam04348 LppC LppC putative l 93.5 0.58 1.5E-05 25.5 15.2 57 88-144 28-85 (535) 186 pfam05843 Suf Suppressor of fo 93.4 0.61 1.6E-05 25.4 17.5 144 190-336 3-152 (275) 187 KOG3617 consensus 93.3 0.62 1.6E-05 25.3 15.0 149 91-252 807-993 (1416) 188 PRK10153 DNA-binding transcrip 93.3 0.63 1.6E-05 25.2 16.7 123 170-294 350-479 (512) 189 KOG1586 consensus 93.0 0.69 1.8E-05 24.9 18.3 42 330-371 199-244 (288) 190 COG3771 Predicted membrane pro 93.0 0.69 1.8E-05 24.9 9.0 83 1-86 1-83 (97) 191 KOG4642 consensus 92.8 0.74 1.9E-05 24.7 7.4 85 200-288 22-106 (284) 192 KOG4507 consensus 92.7 0.75 1.9E-05 24.7 8.3 92 271-362 618-713 (886) 193 COG3629 DnrI DNA-binding trans 92.7 0.75 1.9E-05 24.6 7.6 59 158-216 157-215 (280) 194 PRK06330 transcript cleavage f 92.5 0.79 2E-05 24.5 21.1 38 164-201 234-271 (906) 195 KOG3824 consensus 91.2 1.1 2.8E-05 23.4 7.2 76 119-194 115-190 (472) 196 TIGR02561 HrpB1_HrpK type III 91.0 1.1 2.8E-05 23.3 13.2 125 167-298 29-156 (190) 197 pfam00515 TPR_1 Tetratricopept 91.0 0.39 9.9E-06 26.9 4.3 32 326-357 2-33 (34) 198 pfam04190 DUF410 Protein of un 90.9 1.1 2.9E-05 23.2 13.6 28 184-211 86-113 (260) 199 KOG1585 consensus 90.4 1.3 3.2E-05 22.9 18.6 198 166-364 43-266 (308) 200 KOG1130 consensus 90.1 1.3 3.3E-05 22.8 16.0 131 160-290 201-345 (639) 201 COG1110 Reverse gyrase [DNA re 90.1 0.086 2.2E-06 32.0 0.3 27 391-417 692-719 (1187) 202 pfam07719 TPR_2 Tetratricopept 89.3 0.41 1E-05 26.7 3.3 32 326-357 2-33 (34) 203 COG4976 Predicted methyltransf 89.1 0.9 2.3E-05 24.1 4.9 60 130-189 5-64 (287) 204 KOG3364 consensus 89.0 1.6 4E-05 22.1 7.3 78 296-373 37-120 (149) 205 COG2912 Uncharacterized conser 88.3 1.7 4.4E-05 21.8 7.7 64 301-364 191-254 (269) 206 smart00028 TPR Tetratricopepti 88.2 0.77 2E-05 24.6 4.1 32 326-357 2-33 (34) 207 KOG2471 consensus 87.5 1.9 4.9E-05 21.5 8.2 28 326-353 336-363 (696) 208 KOG2610 consensus 87.4 1.9 5E-05 21.4 18.1 212 90-303 109-334 (491) 209 KOG2396 consensus 86.8 2.1 5.3E-05 21.2 10.0 67 321-387 456-524 (568) 210 PRK09401 reverse gyrase; Revie 86.7 0.18 4.5E-06 29.6 0.1 27 390-416 674-701 (1176) 211 KOG1914 consensus 86.5 2.2 5.5E-05 21.1 22.2 209 176-390 267-503 (656) 212 pfam09988 DUF2227 Uncharacteri 86.3 1.5 3.8E-05 22.4 4.6 14 26-39 61-74 (174) 213 KOG3807 consensus 85.9 2.3 5.8E-05 20.9 13.3 206 155-378 185-421 (556) 214 pfam04184 ST7 ST7 protein. The 84.5 2.6 6.7E-05 20.4 13.3 206 156-379 170-406 (540) 215 pfam04910 DUF654 Protein of un 84.5 2.6 6.7E-05 20.4 16.6 50 163-216 19-68 (343) 216 TIGR03504 FimV_Cterm FimV C-te 84.2 1.8 4.6E-05 21.7 4.2 29 328-356 2-30 (44) 217 KOG2300 consensus 83.7 2.8 7.2E-05 20.2 16.0 29 265-293 450-478 (629) 218 TIGR01477 RIFIN variant surfac 83.2 2.1 5.4E-05 21.1 4.3 11 84-94 199-209 (395) 219 KOG3616 consensus 82.4 3.1 8E-05 19.8 19.2 22 327-348 884-905 (1636) 220 pfam10602 RPN7 26S proteasome 80.4 3.6 9.3E-05 19.3 12.8 99 156-254 38-141 (174) 221 KOG2422 consensus 80.0 3.7 9.6E-05 19.2 12.3 30 185-214 281-310 (665) 222 pfam07214 DUF1418 Protein of u 80.0 3.8 9.6E-05 19.2 5.3 64 1-65 7-71 (96) 223 TIGR02976 phageshock_pspB phag 79.1 3.5 9E-05 19.4 4.2 59 39-111 1-59 (75) 224 PRK13341 recombination factor 78.6 4.1 0.0001 18.9 7.7 15 373-387 457-471 (726) 225 TIGR03362 VI_chp_7 type VI sec 78.0 4.3 0.00011 18.8 15.4 150 231-384 104-275 (301) 226 PRK00750 lysK lysyl-tRNA synth 77.9 2.1 5.2E-05 21.2 2.7 10 166-175 144-153 (513) 227 COG0790 FOG: TPR repeat, SEL1 77.2 4.5 0.00011 18.6 19.1 47 169-217 92-142 (292) 228 pfam11460 DUF3007 Protein of u 77.0 4.2 0.00011 18.8 4.1 39 48-89 41-79 (104) 229 pfam08424 DUF1740 Protein of u 76.9 4.6 0.00012 18.5 17.6 108 248-355 55-187 (231) 230 KOG4318 consensus 76.4 4.7 0.00012 18.5 17.1 23 266-288 534-556 (1088) 231 pfam01921 tRNA-synt_1f tRNA sy 75.1 2.7 6.9E-05 20.3 2.7 56 331-417 127-182 (355) 232 pfam12273 RCR Chitin synthesis 74.2 2.7 7E-05 20.3 2.6 16 79-94 25-41 (124) 233 KOG2114 consensus 73.8 5.4 0.00014 18.0 11.6 18 95-112 379-396 (933) 234 KOG1538 consensus 73.6 5.4 0.00014 17.9 13.6 116 91-210 592-725 (1081) 235 cd01675 RNR_III Class III ribo 71.7 2.3 5.9E-05 20.9 1.7 50 373-422 492-548 (555) 236 PRK11006 phoR phosphate regulo 71.2 6.1 0.00016 17.6 10.4 52 3-68 9-60 (431) 237 TIGR02264 gmx_para_CXXCG Myxoc 70.1 2.4 6E-05 20.8 1.5 38 402-439 175-214 (247) 238 pfam11808 DUF3329 Domain of un 70.0 6.5 0.00016 17.4 7.9 54 3-70 7-60 (90) 239 KOG2041 consensus 70.0 6.5 0.00016 17.4 25.7 26 87-112 737-762 (1189) 240 KOG2846 consensus 69.5 6.6 0.00017 17.3 5.4 25 393-417 220-253 (328) 241 pfam09670 Cas_Cas02710 CRISPR- 69.4 6.6 0.00017 17.3 12.6 25 267-291 248-272 (379) 242 PRK11619 lytic murein transgly 69.0 6.7 0.00017 17.2 27.4 42 165-206 140-181 (645) 243 cd00729 rubredoxin_SM Rubredox 68.2 2.5 6.5E-05 20.5 1.3 25 392-416 1-28 (34) 244 cd00674 LysRS_core_class_I Thi 67.6 5.3 0.00013 18.1 2.8 59 279-346 81-140 (354) 245 COG1230 CzcD Co/Zn/Cd efflux s 67.3 7.3 0.00019 17.0 4.2 20 19-38 71-90 (296) 246 COG1545 Predicted nucleic-acid 66.8 3.1 7.8E-05 19.9 1.5 50 374-423 10-60 (140) 247 PRK13342 recombination factor 66.7 7.4 0.00019 16.9 10.1 18 198-215 186-203 (417) 248 PRK00564 hypA hydrogenase nick 65.7 3.1 7.9E-05 19.8 1.3 26 392-417 70-99 (117) 249 COG5243 HRD1 HRD ubiquitin lig 65.4 7.8 0.0002 16.7 6.3 66 3-68 2-68 (491) 250 PRK08579 anaerobic ribonucleos 65.2 3 7.7E-05 19.9 1.2 50 371-422 538-596 (623) 251 PRK12775 putative trifunctiona 64.2 3.1 8E-05 19.8 1.1 16 260-275 441-456 (993) 252 PRK05452 anaerobic nitric oxid 63.6 3.2 8.1E-05 19.8 1.1 25 390-414 422-466 (479) 253 TIGR00739 yajC preprotein tran 63.5 8.5 0.00022 16.4 3.4 16 43-58 4-19 (86) 254 pfam10516 SHNi-TPR SHNi-TPR. S 63.4 8.3 0.00021 16.5 3.2 29 326-354 2-30 (38) 255 PRK07111 anaerobic ribonucleos 63.1 3.4 8.7E-05 19.5 1.2 29 393-422 647-676 (703) 256 PRK13455 F0F1 ATP synthase sub 62.9 8.6 0.00022 16.4 9.3 51 38-110 27-77 (184) 257 pfam07721 TPR_4 Tetratricopept 61.9 8 0.0002 16.6 2.9 23 327-349 3-25 (26) 258 pfam05297 Herpes_LMP1 Herpesvi 61.8 4.4 0.00011 18.7 1.5 23 4-26 27-49 (382) 259 PRK12496 hypothetical protein; 61.6 4.6 0.00012 18.5 1.6 20 395-414 131-153 (166) 260 pfam11207 DUF2989 Protein of u 61.6 9.1 0.00023 16.2 7.7 97 278-378 95-197 (203) 261 KOG1310 consensus 61.4 9.2 0.00023 16.2 9.1 53 239-291 424-476 (758) 262 pfam11846 DUF3366 Domain of un 61.3 9.2 0.00023 16.2 13.1 43 144-187 135-177 (193) 263 PRK08475 F0F1 ATP synthase sub 60.7 9.4 0.00024 16.1 7.7 18 1-18 1-18 (170) 264 PRK10591 hypothetical protein; 60.1 9.6 0.00024 16.0 5.8 64 1-65 7-71 (96) 265 COG3914 Spy Predicted O-linked 60.1 9.6 0.00024 16.0 15.0 96 165-260 78-176 (620) 266 KOG1464 consensus 59.8 9.7 0.00025 16.0 15.6 188 164-353 37-259 (440) 267 KOG1550 consensus 59.7 9.7 0.00025 16.0 23.1 95 259-355 287-394 (552) 268 COG4941 Predicted RNA polymera 59.6 9.8 0.00025 16.0 14.7 182 169-362 211-402 (415) 269 PRK08270 anaerobic ribonucleos 59.6 4.3 0.00011 18.7 1.2 19 395-414 627-646 (681) 270 PRK12452 cardiolipin synthetas 59.5 8.5 0.00022 16.4 2.7 22 47-68 60-81 (509) 271 cd00350 rubredoxin_like Rubred 59.4 3.7 9.5E-05 19.2 0.8 24 393-416 1-27 (33) 272 COG2389 Uncharacterized metal- 58.8 10 0.00026 15.9 6.2 12 28-39 115-126 (179) 273 PRK13184 pknD serine/threonine 58.4 10 0.00026 15.8 8.4 120 83-207 510-644 (933) 274 PRK08271 anaerobic ribonucleos 58.4 5.4 0.00014 18.0 1.5 49 373-422 541-597 (625) 275 TIGR00373 TIGR00373 conserved 58.3 6.5 0.00017 17.3 1.9 38 377-415 95-139 (168) 276 pfam01155 HypA Hydrogenase exp 58.1 6.5 0.00017 17.3 1.9 26 392-417 68-96 (112) 277 TIGR02487 NrdD anaerobic ribon 57.8 4.9 0.00012 18.3 1.2 46 371-416 560-619 (655) 278 cd07853 STKc_NLK The catalytic 57.7 5.7 0.00015 17.8 1.5 58 341-411 261-325 (372) 279 TIGR02614 ftsW cell division p 57.5 11 0.00027 15.7 7.5 93 36-131 162-274 (370) 280 COG3369 Zinc finger domain con 57.5 5.3 0.00014 18.0 1.3 26 392-419 30-55 (78) 281 PRK09765 2-O-a-mannosyl-D-glyc 57.4 8.2 0.00021 16.6 2.3 20 305-324 434-454 (638) 282 PRK11466 hybrid sensory histid 56.7 11 0.00028 15.6 12.6 18 3-20 8-25 (912) 283 TIGR02443 TIGR02443 conserved 56.6 4.6 0.00012 18.5 0.9 15 407-421 10-24 (63) 284 PRK06266 transcription initiat 56.5 6.4 0.00016 17.4 1.6 25 392-416 116-146 (178) 285 COG5107 RNA14 Pre-mRNA 3'-end 56.3 11 0.00028 15.6 21.6 116 262-378 399-521 (660) 286 cd02682 MIT_AAA_Arch MIT: doma 55.6 11 0.00029 15.5 4.3 65 244-317 5-69 (75) 287 pfam10938 YfdX YfdX protein. Y 55.5 11 0.00029 15.5 10.6 26 262-287 119-144 (155) 288 pfam09535 Gmx_para_CXXCG Prote 54.9 6.9 0.00018 17.1 1.6 38 401-439 166-204 (237) 289 pfam03094 Mlo Mlo family. A fa 54.6 12 0.0003 15.4 6.2 56 38-100 8-71 (481) 290 pfam11947 DUF3464 Protein of u 54.6 12 0.0003 15.4 3.8 63 1-67 57-120 (149) 291 COG1333 ResB ResB protein requ 54.5 9.5 0.00024 16.0 2.2 63 339-407 378-445 (478) 292 PRK00762 hypA hydrogenase nick 54.2 7.4 0.00019 16.9 1.6 26 392-417 69-103 (124) 293 COG3107 LppC Putative lipoprot 54.2 12 0.0003 15.3 15.6 86 118-203 61-149 (604) 294 COG2816 NPY1 NTP pyrophosphohy 54.1 5.7 0.00014 17.8 1.0 21 395-415 113-138 (279) 295 pfam12583 TPPII_N Tripeptidyl 54.0 12 0.0003 15.3 4.8 44 155-198 71-114 (134) 296 pfam00301 Rubredoxin Rubredoxi 53.4 5.5 0.00014 17.9 0.8 11 393-403 1-12 (47) 297 pfam09526 DUF2387 Probable met 52.7 5.2 0.00013 18.1 0.7 16 407-422 9-24 (61) 298 KOG0890 consensus 52.7 12 0.00032 15.2 19.2 68 322-391 1667-1735(2382) 299 COG2176 PolC DNA polymerase II 52.6 7.5 0.00019 16.9 1.4 27 389-415 910-948 (1444) 300 pfam03850 Tfb4 Transcription f 52.6 6.2 0.00016 17.5 1.0 25 389-413 246-271 (271) 301 PRK03824 hypA hydrogenase nick 52.4 9 0.00023 16.2 1.8 10 408-417 109-118 (135) 302 COG5270 PUA domain (predicted 52.0 9.4 0.00024 16.1 1.9 40 390-430 11-50 (202) 303 pfam00558 Vpu Vpu protein. The 51.9 13 0.00032 15.1 6.7 37 43-88 7-43 (81) 304 TIGR00823 EIIA-LAC PTS system, 51.7 13 0.00033 15.0 3.5 30 84-113 17-46 (99) 305 pfam03833 PolC_DP2 DNA polymer 51.6 8.4 0.00022 16.5 1.6 36 378-415 608-650 (852) 306 pfam04930 FUN14 FUN14 family. 51.5 13 0.00033 15.0 3.6 32 3-34 18-49 (97) 307 pfam11234 DUF3036 Protein of u 50.3 13 0.00034 14.9 8.2 96 2-98 8-105 (155) 308 PRK11029 FtsH protease regulat 50.3 13 0.00034 14.9 2.8 35 1-35 1-37 (334) 309 pfam06957 COPI_C Coatomer (COP 50.1 14 0.00035 14.9 4.2 49 314-362 287-336 (421) 310 COG5159 RPN6 26S proteasome re 49.7 14 0.00035 14.8 17.3 236 126-361 51-321 (421) 311 cd00730 rubredoxin Rubredoxin; 49.4 6.5 0.00017 17.3 0.7 10 394-403 2-12 (50) 312 COG5191 Uncharacterized conser 49.4 14 0.00035 14.8 4.0 70 129-198 116-186 (435) 313 PRK06260 threonine synthase; V 49.2 9.5 0.00024 16.1 1.5 21 240-260 204-224 (400) 314 PRK00241 nudC NADH pyrophospha 49.2 7.4 0.00019 16.9 1.0 22 395-416 102-128 (257) 315 COG4178 ABC-type uncharacteriz 48.9 14 0.00036 14.7 12.0 56 12-67 177-232 (604) 316 PRK06450 threonine synthase; V 48.9 9.4 0.00024 16.1 1.5 10 93-102 50-59 (336) 317 TIGR00756 PPR pentatricopeptid 48.7 14 0.00036 14.7 3.4 28 191-218 3-30 (35) 318 COG0846 SIR2 NAD-dependent pro 48.6 9.5 0.00024 16.1 1.4 24 392-415 121-155 (250) 319 TIGR01405 polC_Gram_pos DNA po 48.2 9.8 0.00025 16.0 1.5 26 389-414 713-750 (1264) 320 PRK09458 pspB phage shock prot 48.1 14 0.00037 14.6 2.4 18 42-59 4-21 (75) 321 pfam06667 PspB Phage shock pro 47.7 15 0.00037 14.6 3.9 21 39-59 1-21 (75) 322 COG3947 Response regulator con 47.6 15 0.00037 14.6 5.9 59 329-389 283-346 (361) 323 PRK05580 primosome assembly pr 47.4 9.6 0.00024 16.0 1.3 10 25-34 30-40 (699) 324 PRK11486 flagellar biosynthesi 47.3 14 0.00037 14.6 2.2 26 40-65 17-42 (124) 325 pfam09323 DUF1980 Domain of un 47.1 15 0.00038 14.5 3.7 14 1-14 1-14 (179) 326 pfam06676 DUF1178 Protein of u 47.0 11 0.00028 15.6 1.5 23 394-416 6-42 (145) 327 TIGR00869 sec62 protein transl 46.9 7.2 0.00018 17.0 0.6 17 19-35 143-160 (259) 328 PRK09934 putative fimbrial pro 46.4 15 0.00039 14.4 4.2 37 1-38 1-37 (171) 329 pfam05140 ResB ResB-like famil 46.3 8.2 0.00021 16.5 0.8 52 1-52 1-65 (437) 330 pfam10009 DUF2252 Uncharacteri 45.7 8.6 0.00022 16.4 0.9 25 276-300 271-295 (383) 331 LOAD_little_fing consensus 45.5 11 0.00028 15.5 1.4 25 391-415 2-27 (31) 332 pfam04781 DUF627 Protein of un 45.5 16 0.0004 14.3 8.5 26 163-188 5-30 (112) 333 COG5336 Uncharacterized protei 45.5 16 0.0004 14.3 5.3 43 3-56 47-89 (116) 334 PRK11059 regulatory protein Cs 45.4 16 0.0004 14.3 10.2 54 38-104 135-188 (642) 335 pfam10112 Halogen_Hydrol 5-bro 45.3 16 0.0004 14.3 11.8 57 136-198 107-166 (199) 336 COG4260 Membrane protease subu 45.3 11 0.00028 15.5 1.4 25 384-413 297-322 (345) 337 KOG1497 consensus 44.5 16 0.00041 14.2 11.9 126 160-285 67-209 (399) 338 PRK12380 hydrogenase nickel in 44.5 11 0.00027 15.7 1.1 26 390-416 68-96 (113) 339 TIGR00934 2a38euk potassium up 44.3 16 0.00042 14.2 3.9 52 307-359 454-505 (1197) 340 PRK10316 hypothetical protein; 43.5 17 0.00043 14.1 8.7 22 265-286 174-195 (209) 341 COG2268 Uncharacterized protei 43.3 17 0.00043 14.1 2.2 10 89-98 137-146 (548) 342 COG5294 Uncharacterized protei 43.1 14 0.00037 14.7 1.7 38 1-38 1-44 (113) 343 COG4031 Predicted metal-bindin 43.1 11 0.00027 15.6 1.0 12 407-418 13-24 (227) 344 pfam10058 DUF2296 Predicted in 42.6 16 0.0004 14.3 1.8 24 393-416 19-51 (51) 345 TIGR02827 RNR_anaer_Bdell anae 42.5 12 0.0003 15.3 1.2 25 392-416 582-607 (638) 346 PRK00448 polC DNA polymerase I 42.3 13 0.00034 14.9 1.4 29 159-187 664-694 (1436) 347 PRK10712 fructose-specific PTS 42.1 17 0.00044 14.0 1.9 20 305-324 369-389 (563) 348 COG0375 HybF Zn finger protein 42.0 12 0.00031 15.2 1.2 27 390-417 68-97 (115) 349 pfam10255 Paf67 RNA polymerase 41.9 18 0.00045 13.9 5.2 63 154-216 122-192 (402) 350 pfam07295 DUF1451 Protein of u 41.8 13 0.00033 15.0 1.3 22 394-415 115-141 (148) 351 pfam08631 SPO22 Meiosis protei 41.6 18 0.00045 13.9 25.3 20 94-113 3-22 (280) 352 pfam08194 DIM DIM protein. Dro 41.3 18 0.00046 13.9 3.6 29 3-31 2-30 (37) 353 TIGR02956 TMAO_torS TMAO reduc 41.2 18 0.00046 13.9 5.3 28 165-192 177-205 (1052) 354 TIGR02738 TrbB type-F conjugat 41.1 9.3 0.00024 16.1 0.4 29 390-418 55-92 (176) 355 COG4956 Integral membrane prot 40.9 18 0.00047 13.8 5.6 19 1-19 1-20 (356) 356 COG1198 PriA Primosomal protei 40.6 15 0.00038 14.6 1.4 19 392-415 434-453 (730) 357 pfam09297 zf-NADH-PPase NADH p 40.6 12 0.0003 15.3 0.9 21 395-415 5-30 (32) 358 COG1579 Zn-ribbon protein, pos 40.6 9 0.00023 16.2 0.3 13 404-416 219-231 (239) 359 TIGR00100 hypA hydrogenase nic 40.5 13 0.00033 15.0 1.1 22 395-416 83-109 (128) 360 KOG2487 consensus 39.6 8.2 0.00021 16.6 -0.0 33 392-426 272-305 (314) 361 COG4781 Membrane domain of mem 39.6 19 0.00049 13.7 6.8 56 5-60 30-85 (340) 362 TIGR02159 PA_CoA_Oxy4 phenylac 39.6 19 0.00049 13.7 2.4 41 378-418 81-123 (152) 363 pfam09538 FYDLN_acid Protein o 39.5 14 0.00037 14.6 1.2 22 393-414 9-34 (104) 364 PRK06251 V-type ATP synthase s 39.3 19 0.00049 13.7 3.9 71 1-71 1-71 (101) 365 COG4232 Thiol:disulfide interc 39.2 19 0.00049 13.7 2.5 28 313-340 464-491 (569) 366 COG3529 Predicted nucleic-acid 39.1 8 0.0002 16.6 -0.1 16 407-422 11-26 (66) 367 TIGR02386 rpoC_TIGR DNA-direct 39.0 14 0.00036 14.7 1.1 25 331-356 453-477 (1552) 368 PRK04023 DNA polymerase II lar 38.8 20 0.0005 13.6 2.1 32 392-425 632-664 (1128) 369 PRK06393 rpoE DNA-directed RNA 38.5 16 0.0004 14.3 1.3 26 395-422 7-34 (64) 370 KOG0706 consensus 38.2 17 0.00043 14.1 1.4 11 24-34 33-43 (454) 371 PRK07418 acetolactate synthase 37.8 16 0.00041 14.3 1.2 25 457-481 547-571 (615) 372 PRK09687 hypothetical protein; 37.6 20 0.00052 13.5 21.9 229 117-362 34-271 (280) 373 TIGR02387 rpoC1_cyan DNA-direc 37.3 12 0.00031 15.2 0.5 15 356-370 406-420 (625) 374 COG2093 DNA-directed RNA polym 36.3 15 0.00038 14.5 0.9 28 395-422 6-35 (64) 375 PRK00068 hypothetical protein; 36.2 21 0.00054 13.3 4.8 12 96-107 131-142 (978) 376 pfam11682 DUF3279 Protein of u 36.1 13 0.00032 15.1 0.5 39 375-414 79-118 (128) 377 TIGR00627 tfb4 transcription f 35.9 14 0.00036 14.7 0.7 78 330-414 213-293 (295) 378 PRK10350 hypothetical protein; 35.8 22 0.00055 13.3 1.9 18 1-18 1-18 (146) 379 PRK03776 phosphoglycerol trans 35.7 22 0.00055 13.3 5.6 21 388-408 544-564 (762) 380 PRK13861 type IV secretion sys 35.5 22 0.00056 13.2 3.0 21 1-21 1-21 (293) 381 PRK10270 hypothetical protein; 35.4 22 0.00056 13.2 3.2 14 1-14 1-14 (340) 382 COG4499 Predicted membrane pro 35.1 22 0.00056 13.2 9.8 12 334-345 322-333 (434) 383 PRK09109 motC flagellar motor 35.1 22 0.00056 13.2 9.7 15 173-187 142-156 (246) 384 TIGR00354 polC DNA polymerase 35.1 17 0.00042 14.2 1.0 247 163-424 395-709 (1173) 385 TIGR02508 type_III_yscG type I 34.8 22 0.00057 13.2 2.9 50 266-315 46-95 (118) 386 PRK03598 hypothetical protein; 34.8 22 0.00057 13.2 3.2 12 1-12 1-12 (331) 387 PRK08456 flagellar motor prote 34.7 22 0.00057 13.2 11.1 17 4-20 6-22 (257) 388 pfam11817 Foie-gras_1 Foie gra 34.7 22 0.00057 13.1 6.1 23 330-352 190-212 (254) 389 COG5415 Predicted integral mem 34.4 23 0.00058 13.1 5.8 44 374-417 164-225 (251) 390 smart00531 TFIIE Transcription 34.0 21 0.00054 13.4 1.3 25 391-415 97-132 (147) 391 PRK07591 threonine synthase; V 33.4 22 0.00056 13.2 1.3 20 240-259 225-244 (422) 392 PRK08565 DNA-directed RNA poly 32.8 20 0.00052 13.5 1.1 22 24-45 197-218 (1101) 393 PRK12482 flagellar motor prote 32.5 24 0.00062 12.9 13.0 30 1-33 1-30 (287) 394 pfam09986 DUF2225 Uncharacteri 32.1 24 0.00062 12.9 8.7 27 327-353 167-193 (214) 395 PRK09966 hypothetical protein; 31.9 25 0.00063 12.8 5.0 13 96-108 68-80 (407) 396 TIGR02745 ccoG_rdxA_fixG cytoc 31.5 25 0.00064 12.8 9.2 96 3-99 5-118 (474) 397 KOG0276 consensus 31.0 25 0.00065 12.7 11.0 46 163-213 646-691 (794) 398 pfam05145 AmoA Putative ammoni 31.0 25 0.00065 12.7 2.2 13 90-102 85-97 (318) 399 KOG4606 consensus 31.0 25 0.00065 12.7 6.8 33 2-34 31-63 (126) 400 TIGR01054 rgy reverse gyrase; 30.8 19 0.00049 13.7 0.7 17 328-344 464-480 (1843) 401 KOG2581 consensus 30.8 26 0.00065 12.7 13.1 25 363-387 379-404 (493) 402 PRK09173 F0F1 ATP synthase sub 30.6 26 0.00066 12.7 6.5 51 38-110 2-52 (159) 403 pfam00320 GATA GATA zinc finge 30.5 26 0.00066 12.7 1.6 18 409-426 1-18 (36) 404 pfam05568 ASFV_J13L African sw 30.3 26 0.00066 12.6 3.1 14 80-93 55-68 (189) 405 PRK03681 hypA hydrogenase nick 30.1 26 0.00067 12.6 1.3 25 393-417 70-98 (114) 406 COG1096 Predicted RNA-binding 30.1 22 0.00057 13.1 1.0 14 402-415 161-174 (188) 407 pfam09946 DUF2178 Predicted me 30.1 26 0.00067 12.6 6.9 31 41-71 27-57 (111) 408 COG3183 Predicted restriction 29.7 12 0.00032 15.1 -0.4 13 403-415 238-250 (272) 409 COG3170 FimV Tfp pilus assembl 29.5 27 0.00068 12.5 4.8 31 330-360 503-533 (755) 410 pfam01616 Orbi_NS3 Orbivirus N 29.2 27 0.00069 12.5 5.3 11 173-183 49-59 (194) 411 COG1862 YajC Preprotein transl 29.2 27 0.00069 12.5 4.5 17 41-57 8-24 (97) 412 COG2256 MGS1 ATPase related to 29.0 27 0.0007 12.5 7.4 41 245-288 252-295 (436) 413 PRK12665 putative monovalent c 28.9 27 0.0007 12.5 6.2 22 16-37 45-66 (524) 414 TIGR02302 aProt_lowcomp conser 28.9 27 0.0007 12.5 6.3 20 165-184 611-630 (899) 415 TIGR03525 GldK gliding motilit 28.1 28 0.00072 12.4 2.2 17 1-17 1-18 (449) 416 PRK08476 F0F1 ATP synthase sub 28.0 28 0.00072 12.3 10.3 26 37-62 3-28 (141) 417 COG4647 AcxC Acetone carboxyla 27.9 18 0.00045 13.9 0.1 25 388-419 109-133 (165) 418 COG1439 Predicted nucleic acid 27.8 26 0.00066 12.7 0.9 24 392-415 138-162 (177) 419 PRK05457 heat shock protein Ht 27.8 29 0.00073 12.3 5.0 21 1-21 1-21 (289) 420 PRK10478 putative fructose-lik 27.7 29 0.00073 12.3 1.6 13 331-343 255-267 (359) 421 TIGR02661 MauD methylamine deh 27.6 19 0.00048 13.8 0.2 12 405-416 82-93 (189) 422 pfam02058 Guanylin Guanylin pr 27.6 29 0.00073 12.3 4.4 27 14-40 9-35 (109) 423 pfam06674 DUF1176 Protein of u 27.6 19 0.00049 13.7 0.2 18 405-423 231-250 (338) 424 KOG3251 consensus 27.6 29 0.00074 12.3 2.8 12 274-285 174-185 (213) 425 TIGR01031 rpmF_bact ribosomal 27.2 24 0.0006 13.0 0.6 21 395-417 29-51 (56) 426 PRK10833 putative assembly pro 27.2 29 0.00075 12.2 2.8 12 68-79 27-38 (617) 427 KOG1482 consensus 27.1 29 0.00075 12.2 5.5 33 11-43 114-146 (379) 428 PRK10633 hypothetical protein; 27.0 29 0.00075 12.2 6.8 34 35-68 38-71 (80) 429 TIGR00328 flhB flagellar biosy 26.9 30 0.00075 12.2 2.9 32 272-303 289-325 (352) 430 pfam00641 zf-RanBP Zn-finger i 26.8 25 0.00063 12.8 0.7 24 392-415 3-27 (30) 431 TIGR00603 rad25 DNA repair hel 26.8 30 0.00076 12.2 1.2 40 392-432 16-57 (756) 432 pfam03745 DUF309 Domain of unk 26.7 30 0.00076 12.2 6.3 29 265-293 4-32 (62) 433 PRK10525 cytochrome o ubiquino 26.7 30 0.00076 12.2 4.8 25 1-26 3-27 (305) 434 PRK08197 threonine synthase; V 26.4 30 0.00077 12.1 1.1 41 356-396 326-373 (402) 435 TIGR02928 TIGR02928 orc1/cdc6 26.4 30 0.00077 12.1 4.9 25 265-289 243-267 (383) 436 pfam09723 CxxC_CxxC_SSSS Zinc 26.3 27 0.0007 12.5 0.9 21 394-414 6-34 (42) 437 TIGR02177 PorB_KorB 2-oxoacid: 26.2 22 0.00057 13.2 0.4 60 355-423 138-201 (302) 438 TIGR00785 dass transporter, di 26.0 31 0.00078 12.1 2.7 34 26-59 204-237 (473) 439 PRK13728 conjugal transfer pro 25.9 25 0.00064 12.8 0.6 31 388-418 51-90 (181) 440 KOG0530 consensus 25.9 31 0.00078 12.1 10.2 79 138-216 96-175 (318) 441 COG4814 Uncharacterized protei 25.9 18 0.00046 13.9 -0.1 22 1-22 1-22 (288) 442 PRK01741 cell division protein 25.5 31 0.00079 12.0 1.3 15 41-55 5-19 (342) 443 COG4320 Uncharacterized protei 25.4 30 0.00077 12.2 0.9 27 277-303 279-305 (410) 444 pfam10956 DUF2756 Protein of u 25.4 31 0.0008 12.0 2.0 18 1-18 1-18 (104) 445 pfam01535 PPR PPR repeat. This 25.3 31 0.0008 12.0 3.4 24 192-215 4-27 (31) 446 TIGR00805 oat sodium-independe 25.2 32 0.0008 12.0 1.4 29 395-423 528-558 (723) 447 KOG0546 consensus 25.2 32 0.00081 12.0 5.6 40 320-359 304-343 (372) 448 COG3364 Zn-ribbon containing p 25.1 30 0.00076 12.2 0.9 22 395-416 4-30 (112) 449 CHL00177 ccs1 c-type cytochrom 25.0 24 0.0006 13.0 0.3 20 4-23 17-36 (427) 450 PRK00349 uvrA excinuclease ABC 25.0 32 0.00081 12.0 3.5 27 388-414 736-769 (944) 451 pfam05817 Ribophorin_II Oligos 25.0 32 0.00081 12.0 6.2 25 324-348 234-258 (636) 452 pfam07509 DUF1523 Protein of u 24.9 32 0.00081 11.9 4.4 17 23-39 111-127 (174) 453 pfam11381 DUF3185 Protein of u 24.7 30 0.00077 12.1 0.8 48 51-98 8-55 (59) 454 PRK12468 flhB flagellar biosyn 24.5 32 0.00083 11.9 2.0 10 331-340 318-327 (383) 455 PRK13130 H/ACA RNA-protein com 24.4 32 0.00083 11.9 1.0 18 396-414 8-25 (54) 456 pfam05249 consensus 24.4 33 0.00083 11.9 6.7 62 3-80 23-84 (305) 457 pfam11157 DUF2937 Protein of u 24.3 33 0.00083 11.9 1.6 41 31-71 121-162 (167) 458 PRK13697 cytochrome c6; Provis 24.2 33 0.00084 11.9 2.8 19 1-19 1-19 (111) 459 COG1645 Uncharacterized Zn-fin 24.2 27 0.00069 12.5 0.5 17 396-413 31-51 (131) 460 PRK13264 3-hydroxyanthranilate 24.2 30 0.00076 12.2 0.7 23 393-415 120-166 (177) 461 COG1337 CRISPR system related 24.2 33 0.00084 11.8 2.7 35 375-417 56-90 (249) 462 KOG0686 consensus 24.1 33 0.00084 11.8 7.1 100 155-254 151-257 (466) 463 KOG4814 consensus 24.0 33 0.00084 11.8 14.2 156 198-353 364-521 (872) 464 TIGR02108 PQQ_syn_pqqB coenzym 23.8 26 0.00066 12.7 0.3 10 392-412 14-23 (314) 465 pfam12297 EVC2_like Ellis van 23.7 33 0.00085 11.8 3.0 39 45-83 67-105 (429) 466 PRK01294 lipase chaperone; Pro 23.7 33 0.00085 11.8 3.2 25 1-25 4-28 (339) 467 KOG0292 consensus 23.7 34 0.00085 11.8 4.8 73 138-210 1068-1140(1202) 468 PTZ00042 stevor; Provisional 23.7 34 0.00085 11.8 3.7 22 42-63 267-288 (304) 469 smart00704 ZnF_CDGSH CDGSH-typ 23.6 34 0.00086 11.8 1.2 19 394-414 15-33 (38) 470 pfam04194 PDCD2_C Programmed c 23.5 34 0.00086 11.8 1.2 34 392-425 89-122 (169) 471 KOG1226 consensus 23.4 34 0.00086 11.7 5.1 51 32-83 708-758 (783) 472 pfam05478 Prominin Prominin. T 23.4 34 0.00086 11.7 5.5 67 4-70 425-491 (809) 473 PRK05696 fliL flagellar basal 23.3 34 0.00087 11.7 2.9 21 1-21 18-38 (168) 474 PRK04949 putative sulfate tran 23.1 34 0.00087 11.7 2.9 66 4-69 35-100 (251) 475 pfam02591 DUF164 Putative zinc 23.1 30 0.00077 12.1 0.6 9 407-415 47-55 (56) 476 TIGR00340 zpr1_rel zinc finger 23.0 30 0.00077 12.2 0.5 12 393-404 29-41 (168) 477 COG5402 Uncharacterized conser 23.0 34 0.00088 11.7 2.5 36 1-36 1-42 (194) 478 KOG4629 consensus 23.0 34 0.00088 11.7 6.6 66 2-67 152-218 (714) 479 pfam08273 Prim_Zn_Ribbon Zinc- 22.9 35 0.00088 11.7 0.9 13 409-421 6-19 (39) 480 pfam10072 consensus 22.9 35 0.00088 11.7 1.7 69 5-73 39-107 (112) 481 pfam08882 Acetone_carb_G Aceto 22.7 35 0.00089 11.6 1.3 42 390-439 59-100 (106) 482 pfam01350 Flavi_NS4A Flaviviru 22.7 33 0.00084 11.9 0.7 59 1-59 49-117 (145) 483 TIGR02873 spore_ylxY probable 22.6 29 0.00075 12.2 0.4 10 414-423 207-216 (269) 484 PRK00481 NAD-dependent deacety 22.4 35 0.0009 11.6 1.6 22 394-415 123-151 (239) 485 PRK00420 hypothetical protein; 22.4 35 0.0009 11.6 1.9 39 377-415 6-45 (107) 486 COG1299 FruA Phosphotransferas 22.4 35 0.0009 11.6 4.3 65 8-73 59-125 (343) 487 KOG3052 consensus 22.4 15 0.00038 14.5 -1.1 83 402-486 99-181 (311) 488 cd01410 SIRT7 SIRT7: Eukaryoti 22.3 36 0.00091 11.6 1.5 21 395-415 97-129 (206) 489 PRK09546 zntB zinc transporter 22.3 36 0.00091 11.6 4.3 65 1-65 263-327 (327) 490 PRK05170 hypothetical protein; 22.3 20 0.00051 13.6 -0.5 14 401-414 17-30 (148) 491 PRK09110 flagellar motor prote 22.3 36 0.00091 11.6 12.2 166 1-174 1-198 (283) 492 pfam11004 DUF2843 Protein of u 22.2 36 0.00091 11.6 1.7 52 352-403 191-243 (281) 493 cd07365 MhpB_like Subunit B of 22.1 36 0.00091 11.6 4.4 90 315-404 195-310 (310) 494 TIGR01654 bact_immun_7tm bacte 22.0 36 0.00092 11.5 5.6 110 4-113 253-365 (745) 495 PRK12336 translation initiatio 21.7 36 0.00093 11.5 0.8 10 408-417 100-109 (201) 496 pfam08702 Fib_alpha Fibrinogen 21.7 36 0.00093 11.5 1.1 26 390-418 1-26 (146) 497 pfam10122 Mu-like_Com Mu-like 21.6 36 0.00093 11.5 1.3 25 409-433 27-51 (51) 498 PRK10861 lsignal peptidase I; 21.6 37 0.00093 11.5 3.7 58 1-58 1-82 (324) 499 PTZ00234 variable surface prot 21.6 17 0.00043 14.1 -1.0 36 40-84 366-401 (433) 500 COG4652 Uncharacterized protei 21.6 28 0.00072 12.4 0.2 71 1-71 207-277 (657) No 1 >COG3898 Uncharacterized membrane-bound protein [Function unknown] Probab=100.00 E-value=0 Score=647.20 Aligned_cols=424 Identities=34% Similarity=0.579 Sum_probs=411.5 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) |||+++|+|+|+++++++.|++++||..++.|.||.||||+.++.++++.++..+.++||++++||.+|+.++|||+.|| T Consensus 1 MiRvlfflilV~alg~gfawLadrPG~lsl~w~G~~~emslm~Aa~~iva~vaav~llwwlv~~iw~sP~t~~Ryfr~rK 80 (531) T COG3898 1 MIRVLFFLILVAALGFGFAWLADRPGELSLIWQGQQYEMSLMVAASIIVALVAAVLLLWWLVRSIWESPYTARRYFRERK 80 (531) T ss_pred CHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 95689999999999867899708986536886213688899999999999999999999999999968088999999987 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 99999999999999721004787999986301346661589999987532222100016899999972688505799986 Q gi|254781033|r 81 YDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSL 160 (492) Q Consensus 81 ~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL 160 (492) +++||++|+.|||+.++||...|+|+.+++.+...+|+++|++||.||++...|+++.+++.|+.|+++|+|+.+|+||| T Consensus 81 RdrgyqALStGliAagAGda~lARkmt~~~~~llssDqepLIhlLeAQaal~eG~~~~Ar~kfeAMl~dPEtRllGLRgL 160 (531) T COG3898 81 RDRGYQALSTGLIAAGAGDASLARKMTARASKLLSSDQEPLIHLLEAQAALLEGDYEDARKKFEAMLDDPETRLLGLRGL 160 (531) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 64478887541244216756899999999876542464589999998898864761879999998736838877767889 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC----CCCHHHHHHHHHHHHHHHH Q ss_conf 545521588699999999888421269899999999999525745699999999950----2310357899999999999 Q gi|254781033|r 161 YFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKN----AKEWNRNRAILLIARSLEN 236 (492) Q Consensus 161 ~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~----~~~~~~~~a~l~~~~a~~~ 236 (492) +.++.+.|+.+.|++|++++....|..+|+...+++..++.|||++|+++++..+.. ....++.+++++++.++.. T Consensus 161 yleAqr~GareaAr~yAe~Aa~~Ap~l~WA~~AtLe~r~~~gdWd~AlkLvd~~~~~~vie~~~aeR~rAvLLtAkA~s~ 240 (531) T COG3898 161 YLEAQRLGAREAARHYAERAAEKAPQLPWAARATLEARCAAGDWDGALKLVDAQRAAKVIEKDVAERSRAVLLTAKAMSL 240 (531) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHH T ss_conf 99998602189999999998763667716799999999866786889999998888775261167789899998888877 Q ss_pred HHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCC-CHHHHHHHHH Q ss_conf 966998999999987543133527999999999997399478999999999832898999999985078-9589999999 Q gi|254781033|r 237 ADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSE-NTVGKLKRAL 315 (492) Q Consensus 237 ~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~-~~~~al~~~~ 315 (492) ++ .|+..++..+.+++|+.|+++|+.++.++.|.+.|+.+|+.++||.+|+..|||+++.+|.+.+++ ...++++|.+ T Consensus 241 ld-adp~~Ar~~A~~a~KL~pdlvPaav~AAralf~d~~~rKg~~ilE~aWK~ePHP~ia~lY~~ar~gdta~dRlkRa~ 319 (531) T COG3898 241 LD-ADPASARDDALEANKLAPDLVPAAVVAARALFRDGNLRKGSKILETAWKAEPHPDIALLYVRARSGDTALDRLKRAK 319 (531) T ss_pred HC-CCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 24-88677889999875038763369999999987424624332488998734998489999988047974889999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEE Q ss_conf 99982689836899999999987998899999887866099989999999999985799899999999996288999527 Q gi|254781033|r 316 RLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWI 395 (492) Q Consensus 316 ~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~ 395 (492) ++.+..|||.|+.+++++.+++.|+|..||...+.+.+..|+.+.|+|||+||+.+.||.+++|+|+.+++++|+||.|+ T Consensus 320 ~L~slk~nnaes~~~va~aAlda~e~~~ARa~Aeaa~r~~pres~~lLlAdIeeAetGDqg~vR~wlAqav~APrdPaW~ 399 (531) T COG3898 320 KLESLKPNNAESSLAVAEAALDAGEFSAARAKAEAAAREAPRESAYLLLADIEEAETGDQGKVRQWLAQAVKAPRDPAWT 399 (531) T ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCC T ss_conf 99852842067889999998722564878888998753072346899987787632586478999999984588997531 Q ss_pred CCCCCCCHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 5698955424224837687364003678755 Q gi|254781033|r 396 SDDGYLSSVWLPLSPISKTLCYFEWKIPTKS 426 (492) Q Consensus 396 c~~g~~~~~W~~~c~~~~~~~~~~W~~p~~~ 426 (492) . +|++++.|.|++|++|++|+|+|+.|+.. T Consensus 400 a-dg~vse~wapvspvtgRLdafewkap~~q 429 (531) T COG3898 400 A-DGVVSEAWAPVSPVTGRLDAFEWKAPTGQ 429 (531) T ss_pred C-CCCCCCCCCCCCCCHHHHHHHHHCCCHHH T ss_conf 4-67344243446874000024540386554 No 2 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=100.00 E-value=0 Score=410.28 Aligned_cols=384 Identities=15% Similarity=0.104 Sum_probs=341.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) |||++++|++++++++++.++.++||||+|.|++|+||||+++++++++++|+++++++|+++.+++.|..++.|+.+|| T Consensus 1 M~r~l~~~~~l~~g~~~g~~~~~~~GYVlI~~~~~~iEtSl~~~~i~~v~~~~~l~~l~~ll~~~~~~~~~~~~w~~~rr 80 (398) T PRK10747 1 MLKVLLLFVLLIAGIVVGPMIAGHQGYVLIQTDNYNIETSVTGLAIILILAMVVLFAIEWLLRRIFRTGAHTRGWFVGRK 80 (398) T ss_pred CCHHHHHHHHHHHHHHHHHHHCCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 90699999999999999999848998699988998678119999999999999999999999999846267788888758 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHH-HH Q ss_conf 99999999999999721004787999986301346661589999987532222100016899999972688505799-98 Q gi|254781033|r 81 YDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAV-YS 159 (492) Q Consensus 81 ~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~-~g 159 (492) ++|+.+.+.+|++++++|||..|+|.+.++.+ .++.|.+++|.+|++|+.+||++.+++|++.+.+..|...+++ .. T Consensus 81 ~~kA~~~t~~Gl~~l~EG~~~~AeK~l~k~a~--~~~~pllnyL~AArAA~~~g~~~~rd~yL~~A~e~~p~a~lav~lt 158 (398) T PRK10747 81 RRRARKQTEQALLKLAEGDYQQVEKLMAKNAD--HAEQPVVNYLLAAEAAQQRGDEARANQHLERAAELAGNDTIPVEIT 158 (398) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCC--CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 99999999999999877799999999986343--5677488999999999988898999999999888589848899999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHH----HHHHHH Q ss_conf 6545521588699999999888421269899999999999525745699999999950231035789999----999999 Q gi|254781033|r 160 LYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILL----IARSLE 235 (492) Q Consensus 160 L~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~----~~~a~~ 235 (492) .+...+++|++++|+..++++.+..|+|+.++..+.++|.+.+||+..+.+++...|.+.........+. ...... T Consensus 159 ~Ael~l~~~q~e~Ala~L~~L~~~~p~~~~VL~Ll~~~y~~~~dW~aL~~LLp~L~K~~~l~~~e~~~L~~~a~~~ll~~ 238 (398) T PRK10747 159 RVRLQLARNENHAARHGVDKLLEVTPRHPEVLRLAEQAYIRTGAWSSLLDIIPSMAKAHVGDEEHRAMLEQQAWIGLMDQ 238 (398) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99999977998999999999986199999999999999998745999999999998757999999999999999999999 Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 99669989999999875431335279999999999973994789999999998328989999999850789589999999 Q gi|254781033|r 236 NADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRAL 315 (492) Q Consensus 236 ~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~ 315 (492) ....+..+.-...|....+.....+.....|++.+++.|+..+|.+++++.++.+||++|+..|..+..+++...++.++ T Consensus 239 ~~~~~~~~~L~~~W~~lp~~~r~~~~l~~~ya~~Li~~g~~~~A~~ll~~~Lkk~wd~~Lv~ly~~l~~~d~~~~l~~~e 318 (398) T PRK10747 239 ARADNGSEGLRNWWKNQSRKTRHQVALQVAMAEHLIECDDHDTAQQIIIDGLKRQYDDRLVLPIPRLKTNNPEQLEKVLR 318 (398) T ss_pred HHHHCCHHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 87323479999999868998756999999999999868996899999999986189989999987567799899999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC Q ss_conf 999826898368999999999879988999998878660999899999999999857998999999999962 Q gi|254781033|r 316 RLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALH 387 (492) Q Consensus 316 ~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~ 387 (492) +|++.+|+|+..++++|++|++.++|++|+++++++++..|+..+|..+|++.+.. |+...+.+...+++. T Consensus 319 ~wlk~~~~~~~Ll~aLGrL~~~~~lwgkA~~yle~sl~l~p~~~~~~~La~l~e~l-g~~~~A~~~yr~gL~ 389 (398) T PRK10747 319 QQIKQVGDRPLLWSTLGQSLMKHGEWQEASLAFRAALKQRPDAYDYAWLADALDRL-HKPEEAAAMRRDGLM 389 (398) T ss_pred HHHHCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CCHHHHHHHHHHHHH T ss_conf 98731899989999999999973509999999999986599999999999999985-997999999999999 No 3 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=100.00 E-value=1.8e-41 Score=341.68 Aligned_cols=384 Identities=17% Similarity=0.141 Sum_probs=342.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) |+|++++|++++++++.+.|+..++|||+|...+|.||||+.++++.+++++++++++.|+++.++++|..++.||..|| T Consensus 1 M~~vl~l~~ll~agi~~g~~~~~qqgyVlI~~~~~~ie~Sl~~lv~~~ii~lvv~~~l~~~l~~v~~~~~~~~~w~~~rK 80 (400) T COG3071 1 MRRVLLLFVLLLAGIGVGLAIAGQQGYVLIQTDNYNIEMSLTTLVIFLIIALVVLYLLEWLLRRVLRTPAHTRGWFSRRK 80 (400) T ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEECCEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 90059999999999999999834678269974460453229999999999999999999999999448198888988889 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHH-HHHH Q ss_conf 999999999999997210047879999863013466615899999875322221000168999999726885057-9998 Q gi|254781033|r 81 YDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREF-AVYS 159 (492) Q Consensus 81 ~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~-~~~g 159 (492) ++|+..++.+|++.+++|||.+|+|+..+..+. .++|.+.++++|.+|++.||.+.+.+|+..+.+.+++..+ .... T Consensus 81 rrra~~~~~egl~~l~eG~~~qAEkl~~rnae~--~e~p~l~~l~aA~AA~qrgd~~~an~yL~eaae~~~~~~l~v~lt 158 (400) T COG3071 81 RRRARKALNEGLLKLFEGDFQQAEKLLRRNAEH--GEQPVLAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAVELT 158 (400) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC--CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 999999999999987067389999999976534--734089999999998863057789899999862589940889999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHH----HHHHHHHHHH Q ss_conf 654552158869999999988842126989999999999952574569999999995023103578----9999999999 Q gi|254781033|r 160 LYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNR----AILLIARSLE 235 (492) Q Consensus 160 L~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~----a~l~~~~a~~ 235 (492) ..+..+.+||++.|+....++++..|.++.++....++|.+.|+|+++...+++..|.+--.+... .....+.-.+ T Consensus 159 rarlll~~~d~~aA~~~v~~ll~~~pr~~~vlrLa~r~y~~~g~~~~ll~~l~~L~ka~~l~~~e~~~le~~a~~glL~q 238 (400) T COG3071 159 RARLLLNRRDYPAARENVDQLLEMTPRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSDEEAARLEQQAWEGLLQQ 238 (400) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 99999867886568988999987286886999999999998511899999889998735788299999999999999998 Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 99669989999999875431335279999999999973994789999999998328989999999850789589999999 Q gi|254781033|r 236 NADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRAL 315 (492) Q Consensus 236 ~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~ 315 (492) ..+.++.+.-...|++-.+.-...++..+.++.-+++.|...+|.++++.+++.++++.|...+..+..+++...++.++ T Consensus 239 ~~~~~~~~gL~~~W~~~pr~lr~~p~l~~~~a~~li~l~~~~~A~~~i~~~Lk~~~D~~L~~~~~~l~~~d~~~l~k~~e 318 (400) T COG3071 239 ARDDNGSEGLKTWWKNQPRKLRNDPELVVAYAERLIRLGDHDEAQEIIEDALKRQWDPRLCRLIPRLRPGDPEPLIKAAE 318 (400) T ss_pred HHCCCCCHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 73620014789999866497605834899999999975876889999999987445856999886518899367999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC Q ss_conf 999826898368999999999879988999998878660999899999999999857998999999999962 Q gi|254781033|r 316 RLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALH 387 (492) Q Consensus 316 ~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~ 387 (492) +|++.+|+++..+++||++|+..++|++|+++|+.+++..|+...|..+|...+.. |+...+.+--+.++. T Consensus 319 ~~l~~h~~~p~L~~tLG~L~~k~~~w~kA~~~leaAl~~~~s~~~~~~la~~~~~~-g~~~~A~~~r~e~L~ 389 (400) T COG3071 319 KWLKQHPEDPLLLSTLGRLALKNKLWGKASEALEAALKLRPSASDYAELADALDQL-GEPEEAEQVRREALL 389 (400) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CCHHHHHHHHHHHHH T ss_conf 99983999814999999999984178999999999972589743699999999981-886889999999998 No 4 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=99.94 E-value=9.3e-25 Score=211.18 Aligned_cols=133 Identities=25% Similarity=0.338 Sum_probs=125.4 Q ss_pred CHHHHHHHHHHHHH-HHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 96799999999999-99999973489848999899766221899999999999999999999999841828899999999 Q gi|254781033|r 1 MLRLIRYFFVISLV-ICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKR 79 (492) Q Consensus 1 M~r~l~~~~~i~~~-~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~r 79 (492) |+|+++|+++++++ ++++.|+.++||||+|.|+||+||||+++++++++++++++++++++++++++.|+++++|+++| T Consensus 1 M~R~l~~~~~~l~~~~~~~~~l~~~pGyV~I~~~~~~ie~Sl~~~~~~l~~~~~~l~~l~~ll~~l~~~p~~~~~~~~~r 80 (134) T pfam07219 1 MLRVLLFLFLVLLLGLFGGAWLAGQPGYVVISYGGYRIETSLTFAAILLLAVLAVLFLLEWLLRLIIRLPRRVRRYFRRR 80 (134) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 97899999999999999999993699959999989988941999999999999999999999999973658999999999 Q ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 99999999999999972100478799998630134666158999998753222210 Q gi|254781033|r 80 NYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQ 135 (492) Q Consensus 80 r~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd 135 (492) |++||++++++|++++++||+..|+|.+.++.+ .+++|++++|++||+|+.+|| T Consensus 81 r~~k~~~al~~Gl~a~~~Gd~~~A~k~~~ka~k--~~~~p~l~~LlaA~aA~~~GD 134 (134) T pfam07219 81 KRRRGRKALSEGLLALAEGDWALAERLARKAAE--LEDQPPLALLLAARAAQAEGD 134 (134) T ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH--CCCCCHHHHHHHHHHHHHCCC T ss_conf 999999999999999978999999999999876--379935799999999988579 No 5 >PRK11788 hypothetical protein; Provisional Probab=99.94 E-value=1.4e-20 Score=178.59 Aligned_cols=330 Identities=15% Similarity=0.112 Sum_probs=268.2 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC----CHHHHHHHH Q ss_conf 9999999999721004787999986301346661589999987532222100016899999972688----505799986 Q gi|254781033|r 85 YKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPA----TREFAVYSL 160 (492) Q Consensus 85 ~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~----~~~~~~~gL 160 (492) -....+|+-.+...+.++|-..+..+.+. .++..=++|..+.-....|+.+.|.+.-+.++..|. .+..++..| T Consensus 36 ~~~Y~~Gln~Ll~~q~dkAid~f~~~~~~--~~~t~e~hlaLG~LfRrrGE~dRAIriHq~Ll~rp~L~~~~r~~a~~EL 113 (389) T PRK11788 36 SRDYFKGLNFLLNEQPDKAIDLFIEMLKV--DPETVELHLALGNLFRRRGEVDRAIRIHQNLLSRPDLTREQRLLALQEL 113 (389) T ss_pred CHHHHHHHHHHHCCCCHHHHHHHHHHHHC--CCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 69999899887258945999999999864--8114999999999999656088999999999728798999999999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHHHHHC Q ss_conf 54552158869999999988842126989999999999952574569999999995023-10357899999999999966 Q gi|254781033|r 161 YFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAK-EWNRNRAILLIARSLENADK 239 (492) Q Consensus 161 ~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~-~~~~~~a~l~~~~a~~~~~~ 239 (492) +.-+++.|=++.|...+.++.+..+....++..|+.+|.+.+||++|++..++..+... ......+..+...+...... T Consensus 114 a~Dy~~aGllDRAE~~f~~l~~~~~~~~~al~~Ll~iye~~~dW~~Ai~~a~~L~~~~~~~~~~~iAhy~CElA~~~~~~ 193 (389) T PRK11788 114 GQDYLKAGLLDRAEELFLQLVDEDDFAEGALRQLLTIYQQEKDWQKAIEVAERLEKLGGDSLRKEIAHFYCELAQQALAR 193 (389) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 99998677499999999999828542699999999999986459999999999987047650699999999999999874 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH--HHHHHHHH--HCCCCHHHHHHHHH Q ss_conf 998999999987543133527999999999997399478999999999832898--99999998--50789589999999 Q gi|254781033|r 240 GDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP--EIANIYTH--LLSENTVGKLKRAL 315 (492) Q Consensus 240 ~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p--~La~a~~~--~~~~~~~~al~~~~ 315 (492) ++.+.|.+.+.+|++.+|+.+.+.++.++++...|++++|.+.+.+....+|.. .++..... -..++..+-++.++ T Consensus 194 ~~~~~A~~~L~kAL~~dp~cvRA~lllg~l~~~~g~~~~Ai~~l~~i~~q~p~~l~~vl~~L~~~y~~l~~~~~~~~~L~ 273 (389) T PRK11788 194 GDLDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQDPEYLPEVLPKLMECYQALGDEAEGLEFLR 273 (389) T ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 89999999999998748243418999999999879999999999999973957799999999999998288889999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHH--HHHHHHCCCHH----HHHHHHHHHHCCC Q ss_conf 99982689836899999999987998899999887866099989999999--99998579989----9999999996288 Q gi|254781033|r 316 RLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLA--QIEQANSHNTD----KILYWTQSALHAM 389 (492) Q Consensus 316 ~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA--~ie~~e~~d~~----~ar~wl~rA~~a~ 389 (492) ++...+|+ +...+.+++.....+..+.|...+.+.++..||-+.+.-+. .+...+.+... ..+..+.+.+ . T Consensus 274 ~~~~~~~~-~~l~l~l~~~l~~~~g~~~A~~~l~~~L~~~Psl~g~~~Ll~~~l~~~~~~~~~~~l~~l~~~v~~~~--~ 350 (389) T PRK11788 274 RAVEEYPG-ADLLLALAQLLEEREGPEAAQALLREQLRRHPSLRGFHRLLDYHLAEAEEGRAKESLELLRDLVGEQL--K 350 (389) T ss_pred HHHHHCCC-CHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH--H T ss_conf 99985899-35999999999976999999999999975299889999999987752447654358999999999997--1 Q ss_pred CCCCEECC-CCCCCHHHCCCCCCCCCCCCCC Q ss_conf 99952756-9895542422483768736400 Q gi|254781033|r 390 PDPLWISD-DGYLSSVWLPLSPISKTLCYFE 419 (492) Q Consensus 390 ~dp~W~c~-~g~~~~~W~~~c~~~~~~~~~~ 419 (492) ..|.+.|. ||+.+..--=.||.|++.+|+. T Consensus 351 ~~~~Y~C~~CGF~~~~~~WqCPsC~~W~Si~ 381 (389) T PRK11788 351 RKPRYRCRNCGFTARTLYWHCPSCKAWETIK 381 (389) T ss_pred CCCCEECCCCCCCCCCEEEECCCCCCCCCCC T ss_conf 7999769999998883145790999867848 No 6 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=99.93 E-value=6.6e-21 Score=181.15 Aligned_cols=295 Identities=19% Similarity=0.145 Sum_probs=170.7 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCH Q ss_conf 99997210047879999863013466615899999875322221000168999999726885057999865455215886 Q gi|254781033|r 91 GLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDL 170 (492) Q Consensus 91 gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~ 170 (492) |...+..||+++|.+++.|+... .|+.+-.+.-.+..-...||...+..-++.+++-++.-..+-.-|...+++.|+. T Consensus 390 G~~~L~~g~~~~A~~yl~ka~~~--~P~~a~~~t~lG~~~L~~Gd~~~Ai~dL~~A~~l~~~~~~a~~~L~~~yL~~~~~ 467 (924) T TIGR02917 390 GEAYLAKGDFEKAAEYLAKASEL--DPENAAARTKLGISKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGEF 467 (924) T ss_pred HHHHHHCCCHHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 99998638967899999876105--8980889999998862068834789999999972777534789999988742048 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99999999888421269899999999999525745699999999950231035789999999999996699899999998 Q gi|254781033|r 171 NSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAI 250 (492) Q Consensus 171 ~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~ 250 (492) ++|+..++++.+..|+++..+..++.+|...||..+|+..|+++++..|++-.. ...++.....+++++.|.+.+. T Consensus 468 dKAla~~~~l~~~~pd~p~~~nl~G~i~l~~~~~~kA~~~f~~AL~i~p~~~~A----~~nLA~~d~~~~~~~~A~~r~~ 543 (924) T TIGR02917 468 DKALAAAKKLEKKQPDNPSLHNLLGAIYLGKKDLAKAREAFEKALKIEPDFFPA----AANLARIDLQEGNPDDAKQRFE 543 (924) T ss_pred HHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHH----HHHHHHHHHHCCCHHHHHHHHH T ss_conf 999999999975288982689999999985178789999999875318778478----9999998874289689999999 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC---HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHH Q ss_conf 754313352799999999999739947899999999983289---89999999850789589999999999826898368 Q gi|254781033|r 251 ESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH---PEIANIYTHLLSENTVGKLKRALRLEEINKESVES 327 (492) Q Consensus 251 ~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~---p~La~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~ 327 (492) +.+..+|.++.+...++.+..+.|+..++...|+++...+|. |.|..+-.++..+++..++..++.....+|++++. T Consensus 544 ~~l~~~P~~~~a~~~La~L~~~~g~~~~~~~~l~~A~~~~~~~~~p~l~L~~~yl~~~~l~kAl~~l~~~~~~~~~~~~~ 623 (924) T TIGR02917 544 KVLTIDPKNLRALLALAELAARTGNEEEAVAWLEKAAEKNPKEIEPALALAQYYLSKGELKKALAILNEAADLAPDSPEA 623 (924) T ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHH T ss_conf 99875743379999999999851896899999999986281777789999999986589878999999998731475789 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCC Q ss_conf 9999999998799889999988786609998-9999999999985799899999999996288999 Q gi|254781033|r 328 LVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSALHAMPDP 392 (492) Q Consensus 328 ~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp 392 (492) ..+||+.++..|+...|.+.++++++..|++ ..+..+|.+.... .|...++.-+.||+.-.||. T Consensus 624 ~~~Lg~~ql~~~d~~~A~~~f~Kl~~~~P~~~~a~~~lA~~~~~~-~n~~~A~~~~~~aL~~~~d~ 688 (924) T TIGR02917 624 LELLGRAQLAAGDLNKAVESFKKLLELQPDSALALLLLADAYAVA-KNYAKALTSLKKALELKPDN 688 (924) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCC T ss_conf 999999999732467799999999986888723899999999986-13799999999986127890 No 7 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=99.93 E-value=5.6e-21 Score=181.71 Aligned_cols=299 Identities=17% Similarity=0.165 Sum_probs=221.6 Q ss_pred HHHHHHHHHHHCCH---HHHHHHHH--------HHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHH Q ss_conf 99999999972100---47879999--------86301346661589999987532222100016899999972688505 Q gi|254781033|r 86 KALYTGLMSIAAHN---IPLARKMH--------SYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATRE 154 (492) Q Consensus 86 ~al~~gl~al~~Gd---~~~A~k~~--------~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~ 154 (492) ....+||+.+-.++ ++.|+-.+ ..+.+..+ +-.-..++++-+....|++..|..||.+.++..|... T Consensus 272 a~y~kAlv~~q~~~naG~~~A~~~~aqvlDhls~~~l~~~p--~~~~a~~lAG~~~y~lg~~~~A~~yl~~~l~~~P~~~ 349 (924) T TIGR02917 272 AAYLKALVDFQKKNNAGYEDARETLAQVLDHLSQAALKSAP--EYLPALLLAGASEYQLGNLEQAEQYLNQILKAAPNSH 349 (924) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCH T ss_conf 89999999998335774489999999999999999996187--7425999999999996138999999999998668866 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 79998654552158869999999988842126989999999999952574569999999995023103578999999999 Q gi|254781033|r 155 FAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSL 234 (492) Q Consensus 155 ~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~ 234 (492) .+.|-|+...++.|+++.|...+..+...+++++.++..++..|.+.||+++|.+.|.++-...|.. +...+..+. T Consensus 350 ~A~rllA~i~L~~g~~~~A~~~l~~~~~~~~~d~~~l~~lG~~~L~~g~~~~A~~yl~ka~~~~P~~----a~~~t~lG~ 425 (924) T TIGR02917 350 QARRLLAAIQLREGRVDEAIATLSPALGLDSDDSALLSLLGEAYLAKGDFEKAAEYLAKASELDPEN----AAARTKLGI 425 (924) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCH----HHHHHHHHH T ss_conf 8999999999862897899999998861588776899999999986389678999998761058980----889999998 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH-HHH--HHHHHCCCCHHHHH Q ss_conf 999669989999999875431335279999999999973994789999999998328989-999--99985078958999 Q gi|254781033|r 235 ENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE-IAN--IYTHLLSENTVGKL 311 (492) Q Consensus 235 ~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~-La~--a~~~~~~~~~~~al 311 (492) ..+..||..++...+..|.+++|+...+-+.++-.|++.|+.++|.+++++..+.+|+.- +.. ...++..++...+. T Consensus 426 ~~L~~Gd~~~Ai~dL~~A~~l~~~~~~a~~~L~~~yL~~~~~dKAla~~~~l~~~~pd~p~~~nl~G~i~l~~~~~~kA~ 505 (924) T TIGR02917 426 SKLSQGDPSEAIADLETAAQLDPELGRADLLLILSYLRSGEFDKALAAAKKLEKKQPDNPSLHNLLGAIYLGKKDLAKAR 505 (924) T ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHH T ss_conf 86206883478999999997277753478999998874204899999999997528898268999999998517878999 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCC Q ss_conf 99999998268983689999999998799889999988786609998-99999999999857998999999999962889 Q gi|254781033|r 312 KRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSALHAMP 390 (492) Q Consensus 312 ~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~ 390 (492) ..|++.++.+|++..+...||++.++.|+.+.|+..++.++..+|.. ++.+.+++|.... |+...+-.||++|....| T Consensus 506 ~~f~~AL~i~p~~~~A~~nLA~~d~~~~~~~~A~~r~~~~l~~~P~~~~a~~~La~L~~~~-g~~~~~~~~l~~A~~~~~ 584 (924) T TIGR02917 506 EAFEKALKIEPDFFPAAANLARIDLQEGNPDDAKQRFEKVLTIDPKNLRALLALAELAART-GNEEEAVAWLEKAAEKNP 584 (924) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHHCH T ss_conf 9999875318778478999999887428968999999999875743379999999999851-896899999999986281 Q ss_pred C Q ss_conf 9 Q gi|254781033|r 391 D 391 (492) Q Consensus 391 d 391 (492) + T Consensus 585 ~ 585 (924) T TIGR02917 585 K 585 (924) T ss_pred H T ss_conf 7 No 8 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=99.88 E-value=1.9e-16 Score=146.39 Aligned_cols=302 Identities=15% Similarity=0.116 Sum_probs=215.6 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC--------- Q ss_conf 99999999999997210047879999863013466615899999875322221000168999999726885--------- Q gi|254781033|r 82 DKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPAT--------- 152 (492) Q Consensus 82 ~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~--------- 152 (492) +.+|.+-.+|+-++..|+...|+..+.++.+.++.|.+.|.-| +.+++.+|++.+|..+|+++.+.++. T Consensus 268 DP~~~Ar~~Gl~aL~~G~l~aAe~~l~~AL~~~PnD~daLGgL--G~v~lRqG~~~eA~~yf~rA~~~d~~~~~~~kW~s 345 (1158) T PRK11447 268 DPAFRARQQGLAAVDSGMGGKAIAELQQAVRANPKDSEALGAL--GQAYSQRGDRARAVAQFEKALALDPHSSNRDKWES 345 (1158) T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHH--HHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 8047899989999977997889999999997199987899888--99999658889999999999970899854688999 Q ss_pred -----HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHH---- Q ss_conf -----05799986545521588699999999888421269899999999999525745699999999950231035---- Q gi|254781033|r 153 -----REFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNR---- 223 (492) Q Consensus 153 -----~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~---- 223 (492) ++-+...-...+++.|++..|...++.+..+.|+++.+...|++++..++||.+|+.++..+++..+.... T Consensus 346 ll~ta~yW~~l~qa~~A~~~g~~~~A~~l~~~A~~~~p~~~~A~l~LaD~~~a~~d~~~AE~~Yr~~L~~~~~N~~AlrG 425 (1158) T PRK11447 346 LLKVNRYWLLIQQGDAALKANNLAQAERLYQQARQVDNTDSYAVLGLGDVAMARKDYAAAERYYQQTLRMDSGNTNAVRG 425 (1158) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 99887899999999999987998999999999985599985599999999998399799999999999748998789999 Q ss_pred ----------------------------------HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf ----------------------------------7899999999999966998999999987543133527999999999 Q gi|254781033|r 224 ----------------------------------NRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKS 269 (492) Q Consensus 224 ----------------------------------~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~ 269 (492) .....+...+...+..|+...|...+++++++.|+.+...+.+|++ T Consensus 426 L~~l~~~q~~~~A~~ll~~l~~~q~~~~~~~~~~l~~~~l~~qA~~~~~~Gd~~~A~~~l~qa~~~~P~dpWlR~~LAr~ 505 (1158) T PRK11447 426 LANIYRQQSPEKALAFIASLSASQRRSIDDIERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPGSVWITYRLAQD 505 (1158) T ss_pred HHHHHHHHCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 99999863999999999837998887763144454679999999999978899999999999997189970499999999 Q ss_pred HHHCCCHHHHHHHHHHHHHHCC-CHHHHHHHHHHC--CCCHHHHHH---------------------------------- Q ss_conf 9973994789999999998328-989999999850--789589999---------------------------------- Q gi|254781033|r 270 LISQNKKRKAEVILEKIWKVNP-HPEIANIYTHLL--SENTVGKLK---------------------------------- 312 (492) Q Consensus 270 l~~~g~~~~A~~~le~a~~~~p-~p~La~a~~~~~--~~~~~~al~---------------------------------- 312 (492) |...|+..+|..++.......| +|+-..+++.+. .++...++. T Consensus 506 ~~~~G~~~~A~~lM~~l~~~~p~~pe~~yA~ALyls~~~~~~~A~a~L~~IP~a~~t~~mr~L~~Rl~~~~~la~A~~l~ 585 (1158) T PRK11447 506 LWQAGQRSQADALMRRLAQQKPNDPEQVYAYGLYLSGSDRDRAALAHLNTLPRAQWNSNIQELAQRLQSDQVLESANRLR 585 (1158) T ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99779988999999999844999989999999998767887999999973994337978999999987788999999999 Q ss_pred ------HHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf ------99999982689836899999999987998899999887866099989-99999999998579989999999999 Q gi|254781033|r 313 ------RALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE-IFLLLAQIEQANSHNTDKILYWTQSA 385 (492) Q Consensus 313 ------~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~-~~~llA~ie~~e~~d~~~ar~wl~rA 385 (492) ....++...|.++...+.+|..+++.|++..|+..+..++..+|... ..+-++++... .||...++..+... T Consensus 586 ~~G~~~~A~alLr~~p~~~~~~l~lAd~~~~~gd~~~A~~~y~~~l~~~P~n~da~Lg~~~~~~a-~g~~~~A~~~l~~l 664 (1158) T PRK11447 586 DSGKEAEAEALLRQQPPSTRIDLTLADWAQQRGDYAAARAAYQRVLTREPGNADARLGLTEVDIA-QGDLAAARAQLAKL 664 (1158) T ss_pred HCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH-CCCHHHHHHHHHHH T ss_conf 76888899999744999844689999999987999999999999970499987889999999986-89999999999763 Q ss_pred H Q ss_conf 6 Q gi|254781033|r 386 L 386 (492) Q Consensus 386 ~ 386 (492) . T Consensus 665 ~ 665 (1158) T PRK11447 665 P 665 (1158) T ss_pred H T ss_conf 2 No 9 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=99.83 E-value=2.8e-15 Score=137.22 Aligned_cols=328 Identities=16% Similarity=0.165 Sum_probs=262.9 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC----HHHHHHHHHH Q ss_conf 999999997210047879999863013466615899999875322221000168999999726885----0579998654 Q gi|254781033|r 87 ALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPAT----REFAVYSLYF 162 (492) Q Consensus 87 al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~----~~~~~~gL~~ 162 (492) -...|+-.+.+...++|...+....+... ...=++|..+.-....|..+.|.+.-+.+.+.|+. +.+++..|+. T Consensus 38 ~Yv~GlNfLLs~Q~dKAvdlF~e~l~~d~--~t~e~~ltLGnLfRsRGEvDRAIRiHQ~L~~spdlT~~qr~lAl~qL~~ 115 (389) T COG2956 38 DYVKGLNFLLSNQPDKAVDLFLEMLQEDP--ETFEAHLTLGNLFRSRGEVDRAIRIHQTLLESPDLTFEQRLLALQQLGR 115 (389) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHCCC--HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH T ss_conf 88867889861696369999999983581--5678988988898863508899999998852899866899999999989 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHHHHCCC Q ss_conf 5521588699999999888421269899999999999525745699999999950231-035789999999999996699 Q gi|254781033|r 163 ESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKE-WNRNRAILLIARSLENADKGD 241 (492) Q Consensus 163 ~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~-~~~~~a~l~~~~a~~~~~~~~ 241 (492) -++..|=++.|...+..+....--.+.++..|+.+|....||++|++..++..+..+. .+-.-+..+...+...+..++ T Consensus 116 Dym~aGl~DRAE~~f~~L~de~efa~~AlqqLl~IYQ~treW~KAIe~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~~ 195 (389) T COG2956 116 DYMAAGLLDRAEDIFNQLVDEGEFAEGALQQLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSD 195 (389) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99985646689999999734634328999999999998368999999999999718852125899999999999740311 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC--HHHH----HHHHHHCCCCHHHHHHHHH Q ss_conf 899999998754313352799999999999739947899999999983289--8999----9999850789589999999 Q gi|254781033|r 242 MIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH--PEIA----NIYTHLLSENTVGKLKRAL 315 (492) Q Consensus 242 ~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~--p~La----~a~~~~~~~~~~~al~~~~ 315 (492) .+.|+..+.+|+..+|..+.+.+++++++...|++.+|.+.++.....+|+ +++. ..|.+ .+++.+.+.++. T Consensus 196 ~d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~~Qn~~yl~evl~~L~~~Y~~--lg~~~~~~~fL~ 273 (389) T COG2956 196 VDRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQ--LGKPAEGLNFLR 273 (389) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHH--HCCHHHHHHHHH T ss_conf 899999999998628200103545438887526669999999999984957799999999999999--478778999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHH--HHHHHCCCHHHHHHHHHHHH--CCCCC Q ss_conf 999826898368999999999879988999998878660999899999999--99985799899999999996--28899 Q gi|254781033|r 316 RLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQ--IEQANSHNTDKILYWTQSAL--HAMPD 391 (492) Q Consensus 316 ~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~--ie~~e~~d~~~ar~wl~rA~--~a~~d 391 (492) +....+++ ++..+.++++-....--+.|+.++..-+...||-+.+.-+-+ |.+++.|--...-.-|.+.+ +-.+. T Consensus 274 ~~~~~~~g-~~~~l~l~~l~e~~eG~~~Aq~~l~~Ql~r~Pt~~gf~rl~~~~l~daeeg~~k~sL~~lr~mvgeql~~~ 352 (389) T COG2956 274 RAMETNTG-ADAELMLADLIELQEGIDAAQAYLTRQLRRKPTMRGFHRLMDYHLADAEEGRAKESLDLLRDMVGEQLRRK 352 (389) T ss_pred HHHHCCCC-CCHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHC T ss_conf 99864478-51999999999971077999999999986298188999998855021466622325999999999997316 Q ss_pred CCEECC-CCCCCHHHCCCCCCCCCCCCCC Q ss_conf 952756-9895542422483768736400 Q gi|254781033|r 392 PLWISD-DGYLSSVWLPLSPISKTLCYFE 419 (492) Q Consensus 392 p~W~c~-~g~~~~~W~~~c~~~~~~~~~~ 419 (492) |.+.|. ||+....--=.||.|....|+. T Consensus 353 ~~YRC~~CGF~a~~l~W~CPsC~~W~Tik 381 (389) T COG2956 353 PRYRCQNCGFTAHTLYWHCPSCRAWETIK 381 (389) T ss_pred CCCEECCCCCCHHEEEEECCCCCCCCCCC T ss_conf 77210016863101354188756522417 No 10 >TIGR00540 hemY_coli hemY protein; InterPro: IPR005254 This is a family of uncharacterised proteins encoded next to a heme-biosynthetic enzyme in two gamma division proteobacteria (Escherichia coli and Haemophilus influenzae). It is known in no other species. The gene symbol hemY is unfortunate in that an unrelated protein, protoporphyrinogen oxidase, is designated as HemG in E. coli but as HemY in Bacillus subtilis. . Probab=99.83 E-value=9.9e-16 Score=140.76 Aligned_cols=377 Identities=15% Similarity=0.052 Sum_probs=311.8 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) |++++++++.++.....+.++.++.||+.+.-.++.+++++..+++.+++.+.+++.+-|.++.++........|+..++ T Consensus 1 ~~~~~~~~~~~~~g~~~gp~~~g~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~~~~~~w~~~~~ 80 (409) T TIGR00540 1 LLKVLLLFLLLLAGLVAGPYIAGHQGYVLIETDNYNIETSVTGLAIFLILALAVLFAFEWGLTRFFRLGAHTYGWFSGRK 80 (409) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 84689999999987774000026555156520330234467899999999999999999998988862110223331024 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHH-HH Q ss_conf 99999999999999721004787999986301346661589999987532222100016899999972688505799-98 Q gi|254781033|r 81 YDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAV-YS 159 (492) Q Consensus 81 ~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~-~g 159 (492) +.++.+...+|++-+..||+..+++...+-.. ....|.+.++..+.+++..||...+.+++....+......+.. .. T Consensus 81 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~--~~~~p~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~~g~d~~~~~~~ 158 (409) T TIGR00540 81 RRKARKQTLEGLLKLDEGDYAKAEKLLGKNAD--HAEEPVLNLLLAAEAAQQRGDEFRANRYLEEAAELAGNDNLPVEIA 158 (409) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCC--HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 55566667887775302126788888631431--0235788888876665521411234567777766315664024567 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHH----HHHHHHH Q ss_conf 654552158869999999988842126989999999999952574569999999995023103578999----9999999 Q gi|254781033|r 160 LYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAIL----LIARSLE 235 (492) Q Consensus 160 L~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l----~~~~a~~ 235 (492) .....+..+....+....++.++..|.++.+.....+.|.+.+-|....+.++...+..-........+ ....... T Consensus 159 ~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~ 238 (409) T TIGR00540 159 RTRLLLARNELPAARHGVDKLLEVAPRNPEVLKLAEEAYLRTGAWSALDDILPNVANAGLGDDEEFKDLESKAWNGLLDE 238 (409) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 78888763001023323567776401336789999988874334678888765555302454367888776665434444 Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH-----HHHHHHHHCCCCHHHH Q ss_conf 99669989999999875431335279999999999973994789999999998328989-----9999998507895899 Q gi|254781033|r 236 NADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE-----IANIYTHLLSENTVGK 310 (492) Q Consensus 236 ~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~-----La~a~~~~~~~~~~~a 310 (492) .......+.....|..-.+.....+...+.....+...+++..+..+.-...+...+.. +..-...+..+++... T Consensus 239 ~~~~~g~~g~~~ww~~~~~~~~~~~~~~~~~~~~~~~c~d~~~~~~~~~~~~~~~~d~~~~~~p~~~~~~~~~~~~~~~~ 318 (409) T TIGR00540 239 KRNDEGSDGLLNWWNNQPRHTRNNVELKVALAEHLIDCDDHDTATELTFDGLKKLGDDTAISLPLLLPIPRLKPEDPEKL 318 (409) T ss_pred HHHCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHCCCCCCCCHHHH T ss_conf 43002214677653012001244567888888887503553036788898887411431000022310001366765789 Q ss_pred HHHHHHHHHHCCCCH--HHHHHHHHHHHHCCCHHHHHHHHH--HHHHCCCCHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 999999998268983--689999999998799889999988--7866099989999999999985799899999 Q gi|254781033|r 311 LKRALRLEEINKESV--ESLVIVSKIALEMGSIDQAHAKAM--LAMKIAPRKEIFLLLAQIEQANSHNTDKILY 380 (492) Q Consensus 311 l~~~~~l~~~~P~~~--e~~~~LA~~~~~~g~~~~Ar~~l~--~al~~~P~~~~~~llA~ie~~e~~d~~~ar~ 380 (492) .+.+++-.+.-.+.+ -....++..++..++|.++...+. .++...|+...+..++.+.+.. ++...+.. T Consensus 319 ~~~~~~~~~~~~~~~~c~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~-~~~~~~~~ 391 (409) T TIGR00540 319 LKLLEKRAKNVGDKPKCLLWRALGYLLVKNGEWIEAADAFKNVAALPERPDAYDLAWLADALDRL-GDPELAEA 391 (409) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC-CCHHHHHH T ss_conf 99999887403520367777776667762120457888877666401001034688999998740-56568888 No 11 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=99.78 E-value=1.9e-13 Score=122.90 Aligned_cols=309 Identities=14% Similarity=0.124 Sum_probs=186.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC Q ss_conf 89999999999999999999999972100478799998630134666158999998753222210001689999997268 Q gi|254781033|r 71 MLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIP 150 (492) Q Consensus 71 ~~~~~~~~rr~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~ 150 (492) .-.+|...-+-.+=+..+.+|--++..|++..|++.+..+.+... ..+.+.+..+..+..+||+..|+++|+.++..+ T Consensus 339 ~~~kW~sll~ta~yW~~l~qa~~A~~~g~~~~A~~l~~~A~~~~p--~~~~A~l~LaD~~~a~~d~~~AE~~Yr~~L~~~ 416 (1158) T PRK11447 339 NRDKWESLLKVNRYWLLIQQGDAALKANNLAQAERLYQQARQVDN--TDSYAVLGLGDVAMARKDYAAAERYYQQTLRMD 416 (1158) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 468899999887899999999999987998999999999985599--985599999999998399799999999999748 Q ss_pred CCHHHHHHHHHH------------------------------------------HHHCCCCHHHHHHHHHHHHHHCCCCH Q ss_conf 850579998654------------------------------------------55215886999999998884212698 Q gi|254781033|r 151 ATREFAVYSLYF------------------------------------------ESCRIGDLNSAQRYATKALDISPDAP 188 (492) Q Consensus 151 ~~~~~~~~gL~~------------------------------------------~al~~gd~~~A~~~~~~a~~~~P~~~ 188 (492) +.+.-+++||+. .++.+||...|.+.+++++...|+++ T Consensus 417 ~~N~~AlrGL~~l~~~q~~~~A~~ll~~l~~~q~~~~~~~~~~l~~~~l~~qA~~~~~~Gd~~~A~~~l~qa~~~~P~dp 496 (1158) T PRK11447 417 SGNTNAVRGLANIYRQQSPEKALAFIASLSASQRRSIDDIERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPGSV 496 (1158) T ss_pred CCCHHHHHHHHHHHHHHCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC T ss_conf 99878999999999863999999999837998887763144454679999999999978899999999999997189970 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHH-HHHHH---------------------------------------H Q ss_conf 9999999999952574569999999995023103-57899---------------------------------------9 Q gi|254781033|r 189 WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWN-RNRAI---------------------------------------L 228 (492) Q Consensus 189 ~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~-~~~a~---------------------------------------l 228 (492) |+...|+++|...|+-..|..+++......+... ...+. . T Consensus 497 WlR~~LAr~~~~~G~~~~A~~lM~~l~~~~p~~pe~~yA~ALyls~~~~~~~A~a~L~~IP~a~~t~~mr~L~~Rl~~~~ 576 (1158) T PRK11447 497 WITYRLAQDLWQAGQRSQADALMRRLAQQKPNDPEQVYAYGLYLSGSDRDRAALAHLNTLPRAQWNSNIQELAQRLQSDQ 576 (1158) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHCCHHHHHHHHHHHHHH T ss_conf 49999999999779988999999999844999989999999998767887999999973994337978999999987788 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC-HHH--HHHHHHHCCC Q ss_conf 9999999996699899999998754313352799999999999739947899999999983289-899--9999985078 Q gi|254781033|r 229 LIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH-PEI--ANIYTHLLSE 305 (492) Q Consensus 229 ~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~-p~L--a~a~~~~~~~ 305 (492) .+..+..+...|+...|...+ +..|........++..+...|+...|...+.......|. ++. +.+-..+..+ T Consensus 577 ~la~A~~l~~~G~~~~A~alL----r~~p~~~~~~l~lAd~~~~~gd~~~A~~~y~~~l~~~P~n~da~Lg~~~~~~a~g 652 (1158) T PRK11447 577 VLESANRLRDSGKEAEAEALL----RQQPPSTRIDLTLADWAQQRGDYAAARAAYQRVLTREPGNADARLGLTEVDIAQG 652 (1158) T ss_pred HHHHHHHHHHCCCHHHHHHHH----HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC T ss_conf 999999999768888999997----4499984468999999998799999999999997049998788999999998689 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC----CCCHHHHHHH---HHHHHHHCCCHHHH Q ss_conf 9589999999999826898368999999999879988999998878660----9998999999---99999857998999 Q gi|254781033|r 306 NTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKI----APRKEIFLLL---AQIEQANSHNTDKI 378 (492) Q Consensus 306 ~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~----~P~~~~~~ll---A~ie~~e~~d~~~a 378 (492) +...+...+..+....++++.....+|++....|+..+|...+++++.. .|+....+++ +.+... .|+...+ T Consensus 653 ~~~~A~~~l~~l~~~~~~d~~~~~~~A~~~~~~G~~~~A~~~~~~l~a~~~~~~~~~~~al~~rd~ar~~~~-~g~~~~A 731 (1158) T PRK11447 653 DLAAARAQLAKLPAFANDSLNTQRRVALAWAALGDTAAAAQTFNRLIPQAKSQPPSMESALVLRDAARFEAQ-TGDPQQA 731 (1158) T ss_pred CHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHH-CCCHHHH T ss_conf 999999999763234899867999999999976999999999999997622599854567877776899983-6998899 Q ss_pred HHHHHHHH Q ss_conf 99999996 Q gi|254781033|r 379 LYWTQSAL 386 (492) Q Consensus 379 r~wl~rA~ 386 (492) ...+..++ T Consensus 732 ~~~~~~a~ 739 (1158) T PRK11447 732 LETYKDAM 739 (1158) T ss_pred HHHHHHHH T ss_conf 99999998 No 12 >KOG4626 consensus Probab=99.76 E-value=1.8e-15 Score=138.76 Aligned_cols=300 Identities=19% Similarity=0.101 Sum_probs=215.6 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHH---HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99999972100478799998630134666158---999998753222210001689999997268850579998654552 Q gi|254781033|r 89 YTGLMSIAAHNIPLARKMHSYVSQQHTFHNEY---LVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESC 165 (492) Q Consensus 89 ~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~---L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al 165 (492) ..|..-...|+...|-..+..+....+..-.. +.+|+.+ +|...+|..-|.++++.+|+-.+++-.|..... T Consensus 155 nla~al~~~~~~~~a~~~~~~alqlnP~l~ca~s~lgnLlka-----~Grl~ea~~cYlkAi~~qp~fAiawsnLg~~f~ 229 (966) T KOG4626 155 NLAAALVTQGDLELAVQCFFEALQLNPDLYCARSDLGNLLKA-----EGRLEEAKACYLKAIETQPCFAIAWSNLGCVFN 229 (966) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHCCCCHHHHHCCHHHHHHH-----HCCCCHHHHHHHHHHHHCCCEEEEEHHCCHHHH T ss_conf 378898745777613799998885095045542244678876-----336133689999998629750453120004776 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH--------------------- Q ss_conf 15886999999998884212698999999999995257456999999999502310357--------------------- Q gi|254781033|r 166 RIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRN--------------------- 224 (492) Q Consensus 166 ~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~--------------------- 224 (492) .+|+.-.|+++++++.+++|+...+...|+.+|...+.++.|+..+.+++...+.+... T Consensus 230 ~~Gei~~aiq~y~eAvkldP~f~dAYiNLGnV~ke~~~~d~Avs~Y~rAl~lrpn~A~a~gNla~iYyeqG~ldlAI~~Y 309 (966) T KOG4626 230 AQGEIWLAIQHYEEAVKLDPNFLDAYINLGNVYKEARIFDRAVSCYLRALNLRPNHAVAHGNLACIYYEQGLLDLAIDTY 309 (966) T ss_pred HCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCHHHCCCEEEEEECCCCHHHHHHHH T ss_conf 04369999999987653797652778658899998722037899999997429730312043479973144278889999 Q ss_pred -H--------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH-- Q ss_conf -8--------99999999999966998999999987543133527999999999997399478999999999832898-- Q gi|254781033|r 225 -R--------AILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP-- 293 (492) Q Consensus 225 -~--------a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p-- 293 (492) + -......+..+.+.|+..++...+.+++++.|....+...++.++.++|+...|.+++++++...||- T Consensus 310 kral~~~P~F~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~hadam~NLgni~~E~~~~e~A~~ly~~al~v~p~~aa 389 (966) T KOG4626 310 KRALELQPNFPDAYNNLANALKDKGSVTEAVDCYNKALRLCPNHADAMNNLGNIYREQGKIEEATRLYLKALEVFPEFAA 389 (966) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCHHHHH T ss_conf 99870399956777579999775054679999999998758763778887989998861221889999999863802455 Q ss_pred ---HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHH Q ss_conf ---99999998507895899999999998268983689999999998799889999988786609998-99999999999 Q gi|254781033|r 294 ---EIANIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQ 369 (492) Q Consensus 294 ---~La~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~ 369 (492) .|+..|- ..|+..+++..++..++..|...+++..+|..+-..|+-+.|...+.+++..+|+- ..+--+|.|.+ T Consensus 390 a~nNLa~i~k--qqgnl~~Ai~~YkealrI~P~fAda~~NmGnt~ke~g~v~~A~q~y~rAI~~nPt~AeAhsNLasi~k 467 (966) T KOG4626 390 AHNNLASIYK--QQGNLDDAIMCYKEALRIKPTFADALSNMGNTYKEMGDVSAAIQCYTRAIQINPTFAEAHSNLASIYK 467 (966) T ss_pred HHHHHHHHHH--HCCCHHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 6422999999--60547789999998871494078998750427788641788999999988639078898754788860 Q ss_pred HHCCCHHHHHHHHHHHHCCCCC-CCEEC Q ss_conf 8579989999999999628899-95275 Q gi|254781033|r 370 ANSHNTDKILYWTQSALHAMPD-PLWIS 396 (492) Q Consensus 370 ~e~~d~~~ar~wl~rA~~a~~d-p~W~c 396 (492) . .|+...+-.-.+.|++-.|| |.=.| T Consensus 468 D-sGni~~AI~sY~~aLklkPDfpdA~c 494 (966) T KOG4626 468 D-SGNIPEAIQSYRTALKLKPDFPDAYC 494 (966) T ss_pred C-CCCCHHHHHHHHHHHCCCCCCCHHHH T ss_conf 3-68818899999998716999943566 No 13 >KOG4626 consensus Probab=99.74 E-value=4.9e-15 Score=135.36 Aligned_cols=261 Identities=15% Similarity=0.091 Sum_probs=186.4 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCH Q ss_conf 99997210047879999863013466615899999875322221000168999999726885057999865455215886 Q gi|254781033|r 91 GLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDL 170 (492) Q Consensus 91 gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~ 170 (492) |-+....|+...|...+.++.+.++.=-+...+|... ....+.++.|..-|+.++...|+..+++-.++..+-.+|+. T Consensus 225 g~~f~~~Gei~~aiq~y~eAvkldP~f~dAYiNLGnV--~ke~~~~d~Avs~Y~rAl~lrpn~A~a~gNla~iYyeqG~l 302 (966) T KOG4626 225 GCVFNAQGEIWLAIQHYEEAVKLDPNFLDAYINLGNV--YKEARIFDRAVSCYLRALNLRPNHAVAHGNLACIYYEQGLL 302 (966) T ss_pred CHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHH--HHHHHCCHHHHHHHHHHHHCCCCCHHHCCCEEEEEECCCCH T ss_conf 0477604369999999987653797652778658899--99872203789999999742973031204347997314427 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99999999888421269899999999999525745699999999950231035789999999999996699899999998 Q gi|254781033|r 171 NSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAI 250 (492) Q Consensus 171 ~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~ 250 (492) +-|+..+++++++.|+.+.+...|+......|+-.+|...+.+++...+.+.. .....+..+.+.+..++|...+. T Consensus 303 dlAI~~Ykral~~~P~F~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p~had----am~NLgni~~E~~~~e~A~~ly~ 378 (966) T KOG4626 303 DLAIDTYKRALELQPNFPDAYNNLANALKDKGSVTEAVDCYNKALRLCPNHAD----AMNNLGNIYREQGKIEEATRLYL 378 (966) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHH----HHHHHHHHHHHHCCCHHHHHHHH T ss_conf 88899999987039995677757999977505467999999999875876377----88879899988612218899999 Q ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHH-----CCCCHHHHHHHHHHHHHHCCCCH Q ss_conf 7543133527999999999997399478999999999832898999999985-----07895899999999998268983 Q gi|254781033|r 251 ESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHL-----LSENTVGKLKRALRLEEINKESV 325 (492) Q Consensus 251 ~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~-----~~~~~~~al~~~~~l~~~~P~~~ 325 (492) +++...|++..+...++.+|..+|+..+|..-+..++...|.. +.+|... +.++...+++.+.+.+..+|... T Consensus 379 ~al~v~p~~aaa~nNLa~i~kqqgnl~~Ai~~YkealrI~P~f--Ada~~NmGnt~ke~g~v~~A~q~y~rAI~~nPt~A 456 (966) T KOG4626 379 KALEVFPEFAAAHNNLASIYKQQGNLDDAIMCYKEALRIKPTF--ADALSNMGNTYKEMGDVSAAIQCYTRAIQINPTFA 456 (966) T ss_pred HHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHH--HHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCHHH T ss_conf 9986380245564229999996054778999999887149407--89987504277886417889999999886390788 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 6899999999987998899999887866099989 Q gi|254781033|r 326 ESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 326 e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~ 359 (492) +++-.||.++-++|+..+|....+.++++.||.+ T Consensus 457 eAhsNLasi~kDsGni~~AI~sY~~aLklkPDfp 490 (966) T KOG4626 457 EAHSNLASIYKDSGNIPEAIQSYRTALKLKPDFP 490 (966) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCC T ss_conf 9875478886036881889999999871699994 No 14 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=99.71 E-value=6.2e-12 Score=111.13 Aligned_cols=342 Identities=10% Similarity=0.031 Sum_probs=223.2 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEE----HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 99999999999999997348984899989976622----18999999999999999999999998418288999999999 Q gi|254781033|r 5 IRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRT----SPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 5 l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~----s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) +-++++++++++|++.-.--.| =.||.+-. ++=+-+..++++++++++++.++.++++.-..+.+..+.++ T Consensus 2 ~r~l~~~~~l~~g~~~g~~~~~-----~~GYVlI~~~~~~iEtSl~~~~i~~v~~~~~l~~l~~ll~~~~~~~~~~~~w~ 76 (398) T PRK10747 2 LKVLLLFVLLIAGIVVGPMIAG-----HQGYVLIQTDNYNIETSVTGLAIILILAMVVLFAIEWLLRRIFRTGAHTRGWF 76 (398) T ss_pred CHHHHHHHHHHHHHHHHHHHCC-----CCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 0699999999999999999848-----99869998899867811999999999999999999999999984626778888 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 99999999999999721004787999986301346661589999987532222100016899999972688505799986 Q gi|254781033|r 81 YDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSL 160 (492) Q Consensus 81 ~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL 160 (492) . -+| .+++.+ ....+..+...||+..|++.+.+.++..++..+.+..- T Consensus 77 ~---------------------~rr-~~kA~~----------~t~~Gl~~l~EG~~~~AeK~l~k~a~~~~~pllnyL~A 124 (398) T PRK10747 77 V---------------------GRK-RRRARK----------QTEQALLKLAEGDYQQVEKLMAKNADHAEQPVVNYLLA 124 (398) T ss_pred H---------------------HHH-HHHHHH----------HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 8---------------------758-999999----------99999999877799999999986343567748899999 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHC Q ss_conf 54552158869999999988842126989-99999999995257456999999999502310357899999999999966 Q gi|254781033|r 161 YFESCRIGDLNSAQRYATKALDISPDAPW-VTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADK 239 (492) Q Consensus 161 ~~~al~~gd~~~A~~~~~~a~~~~P~~~~-a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~ 239 (492) ++.+.+.||++..-++++++.+..|+..- +..+..+++..+|+|++|+.++++..+..|.++. ++.+. ...+... T Consensus 125 ArAA~~~g~~~~rd~yL~~A~e~~p~a~lav~lt~Ael~l~~~q~e~Ala~L~~L~~~~p~~~~---VL~Ll-~~~y~~~ 200 (398) T PRK10747 125 AEAAQQRGDEARANQHLERAAELAGNDTIPVEITRVRLQLARNENHAARHGVDKLLEVTPRHPE---VLRLA-EQAYIRT 200 (398) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH---HHHHH-HHHHHHH T ss_conf 9999988898999999999888589848899999999999779989999999999861999999---99999-9999987 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHH------HHHHHHHHCCCHHHHHHHHHHHHHHCC-----CHHHHHHHHH--HCCCC Q ss_conf 9989999999875431335279999------999999973994789999999998328-----9899999998--50789 Q gi|254781033|r 240 GDMIASYHDAIESLKLCDNSIMASI------CAAKSLISQNKKRKAEVILEKIWKVNP-----HPEIANIYTH--LLSEN 306 (492) Q Consensus 240 ~~~~~A~~~~~~a~~~~p~~~~a~~------~~a~~l~~~g~~~~A~~~le~a~~~~p-----~p~La~a~~~--~~~~~ 306 (492) +|.........+..|.. ...+..+ .+...+.+. ..+....-+++.|+..| +|.+...|+. +..|. T Consensus 201 ~dW~aL~~LLp~L~K~~-~l~~~e~~~L~~~a~~~ll~~~-~~~~~~~~L~~~W~~lp~~~r~~~~l~~~ya~~Li~~g~ 278 (398) T PRK10747 201 GAWSSLLDIIPSMAKAH-VGDEEHRAMLEQQAWIGLMDQA-RADNGSEGLRNWWKNQSRKTRHQVALQVAMAEHLIECDD 278 (398) T ss_pred HHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHHHHHH-HHHCCHHHHHHHHHHCCHHHHCCHHHHHHHHHHHHHCCC T ss_conf 45999999999998757-9999999999999999999998-732347999999986899875699999999999986899 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHH-HHHHHHHHCCCHHHHHHHHHHH Q ss_conf 589999999999826898368999999999879988999998878660999899999-9999998579989999999999 Q gi|254781033|r 307 TVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLL-LAQIEQANSHNTDKILYWTQSA 385 (492) Q Consensus 307 ~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~l-lA~ie~~e~~d~~~ar~wl~rA 385 (492) ..++.+.+++.++.+++..-.. .++++ ..++..+....+++-++.+|+...+++ ++.+... ++.-++|+++++++ T Consensus 279 ~~~A~~ll~~~Lkk~wd~~Lv~-ly~~l--~~~d~~~~l~~~e~wlk~~~~~~~Ll~aLGrL~~~-~~lwgkA~~yle~s 354 (398) T PRK10747 279 HDTAQQIIIDGLKRQYDDRLVL-PIPRL--KTNNPEQLEKVLRQQIKQVGDRPLLWSTLGQSLMK-HGEWQEASLAFRAA 354 (398) T ss_pred HHHHHHHHHHHHHHCCCHHHHH-HHHHC--CCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH-HCHHHHHHHHHHHH T ss_conf 6899999999986189989999-98756--77998999999999873189998999999999997-35099999999999 Q ss_pred HCCCCCCC Q ss_conf 62889995 Q gi|254781033|r 386 LHAMPDPL 393 (492) Q Consensus 386 ~~a~~dp~ 393 (492) ++-.|++. T Consensus 355 l~l~p~~~ 362 (398) T PRK10747 355 LKQRPDAY 362 (398) T ss_pred HHCCCCHH T ss_conf 86599999 No 15 >KOG0495 consensus Probab=99.67 E-value=5.4e-12 Score=111.63 Aligned_cols=289 Identities=15% Similarity=0.087 Sum_probs=138.1 Q ss_pred HHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 72100478799998630134666158999998753222210001689999997268850579998654552158869999 Q gi|254781033|r 95 IAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQ 174 (492) Q Consensus 95 l~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~ 174 (492) ...+-+..|+..++.+.+.-+.+. -..+-+++.-..-|..+.-...|+.+...-|....-+...+......||..+|+ T Consensus 527 ~k~~~~~carAVya~alqvfp~k~--slWlra~~~ek~hgt~Esl~Allqkav~~~pkae~lwlM~ake~w~agdv~~ar 604 (913) T KOG0495 527 EKRPAIECARAVYAHALQVFPCKK--SLWLRAAMFEKSHGTRESLEALLQKAVEQCPKAEILWLMYAKEKWKAGDVPAAR 604 (913) T ss_pred HHCCHHHHHHHHHHHHHHHCCCHH--HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH T ss_conf 756408999999999876365400--899999999883186999999999999738862268998888887517848899 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99998884212698999999999995257456999999999502310357899999999999966998999999987543 Q gi|254781033|r 175 RYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLK 254 (492) Q Consensus 175 ~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~ 254 (492) ..+..+++.+|++..++.+-..+.....+++.|+.+|.++....+..+ +++-.+...-..+..++|+..+.++++ T Consensus 605 ~il~~af~~~pnseeiwlaavKle~en~e~eraR~llakar~~sgTeR-----v~mKs~~~er~ld~~eeA~rllEe~lk 679 (913) T KOG0495 605 VILDQAFEANPNSEEIWLAAVKLEFENDELERARDLLAKARSISGTER-----VWMKSANLERYLDNVEEALRLLEEALK 679 (913) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCH-----HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999983898188898988875224648999999999750288512-----678776899986407999999999988 Q ss_pred CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH-HHHHHHHHHC--CCCHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 133527999999999997399478999999999832898-9999999850--7895899999999998268983689999 Q gi|254781033|r 255 LCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP-EIANIYTHLL--SENTVGKLKRALRLEEINKESVESLVIV 331 (492) Q Consensus 255 ~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p-~La~a~~~~~--~~~~~~al~~~~~l~~~~P~~~e~~~~L 331 (492) ..|+|.-.+.+.+.++.+.++...|...+....+..||. -|....+.+. .++...+...+++....||++...++.. T Consensus 680 ~fp~f~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k~cP~~ipLWllLakleEk~~~~~rAR~ildrarlkNPk~~~lwle~ 759 (913) T KOG0495 680 SFPDFHKLWLMLGQIEEQMENIEMAREAYLQGTKKCPNSIPLWLLLAKLEEKDGQLVRARSILDRARLKNPKNALLWLES 759 (913) T ss_pred HCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 48755899999868999987799999999702564889812999997788872652669999888873499862649999 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCC-------------------------------HHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 99999879988999998878660999-------------------------------89999999999985799899999 Q gi|254781033|r 332 SKIALEMGSIDQAHAKAMLAMKIAPR-------------------------------KEIFLLLAQIEQANSHNTDKILY 380 (492) Q Consensus 332 A~~~~~~g~~~~Ar~~l~~al~~~P~-------------------------------~~~~~llA~ie~~e~~d~~~ar~ 380 (492) .++-+++|..++|+..+.++++..|+ .++.+..+.+.-.+ ....++|. T Consensus 760 Ir~ElR~Gn~~~a~~lmakALQecp~sg~LWaEaI~le~~~~rkTks~DALkkce~dphVLlaia~lfw~e-~k~~kar~ 838 (913) T KOG0495 760 IRMELRAGNKEQAELLMAKALQECPSSGLLWAEAIWLEPRPQRKTKSIDALKKCEHDPHVLLAIAKLFWSE-KKIEKARE 838 (913) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHH-HHHHHHHH T ss_conf 99999748899999999999986875222689998704676652177899985168955399999999988-88999999 Q ss_pred HHHHHHCCCCC Q ss_conf 99999628899 Q gi|254781033|r 381 WTQSALHAMPD 391 (492) Q Consensus 381 wl~rA~~a~~d 391 (492) |+.||++..|| T Consensus 839 Wf~Ravk~d~d 849 (913) T KOG0495 839 WFERAVKKDPD 849 (913) T ss_pred HHHHHHCCCCC T ss_conf 99999702886 No 16 >KOG1126 consensus Probab=99.67 E-value=4.8e-13 Score=119.79 Aligned_cols=300 Identities=15% Similarity=0.101 Sum_probs=237.7 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 99999999999721004787999986301346661589999987532222100016899999972688505799986545 Q gi|254781033|r 84 GYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFE 163 (492) Q Consensus 84 ~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~ 163 (492) +++.+..|...+..=++..|...+.+.-...+ + ....+...+.+....++++++++.|+.+-+..|.+..+.--.-.. T Consensus 319 llr~~~~~~~~~s~y~~~~A~~~~~klp~h~~-n-t~wvl~q~GrayFEl~~Y~~a~~~F~~~r~~~p~rv~~meiyST~ 396 (638) T KOG1126 319 LLRGLGEGYRSLSQYNCREALNLFEKLPSHHY-N-TGWVLSQLGRAYFELIEYDQAERIFSLVRRIEPYRVKGMEIYSTT 396 (638) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-C-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 99999999999888879999999986477537-8-447999988999989878999999999986145411324678888 Q ss_pred HHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 52158869999999988842126989999999999952574569999999995023103578999999999999669989 Q gi|254781033|r 164 SCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMI 243 (492) Q Consensus 164 al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~ 243 (492) .....|.-+--.+++.+...+|+.|..+..++.++..++|.+.|++.+++++...+.+. ..++-.+.+.....+.+ T Consensus 397 LWHLq~~v~Ls~Laq~Li~~~~~sPesWca~GNcfSLQkdh~~Aik~f~RAiQldp~fa----YayTLlGhE~~~~ee~d 472 (638) T KOG1126 397 LWHLQDEVALSYLAQDLIDTDPNSPESWCALGNCFSLQKDHDTAIKCFKRAIQLDPRFA----YAYTLLGHESIATEEFD 472 (638) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCCCCCC----HHHHHCCCHHHHHHHHH T ss_conf 88877567788998988750878707787740256665679999998877654687632----12353077345457777 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHH-H--HHCCCCHHHHHHHHHHHHHH Q ss_conf 99999987543133527999999999997399478999999999832898999999-9--85078958999999999982 Q gi|254781033|r 244 ASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIY-T--HLLSENTVGKLKRALRLEEI 320 (492) Q Consensus 244 ~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~-~--~~~~~~~~~al~~~~~l~~~ 320 (492) .|...+++|++.+|++--|+.-++.+|.+++++..|+-.+++|...+|.......+ + +-+.+....++..+++.... T Consensus 473 ~a~~~fr~Al~~~~rhYnAwYGlG~vy~Kqek~e~Ae~~fqkA~~INP~nsvi~~~~g~~~~~~k~~d~AL~~~~~A~~l 552 (638) T KOG1126 473 KAMKSFRKALGVDPRHYNAWYGLGTVYLKQEKLEFAEFHFQKAVEINPSNSVILCHIGRIQHQLKRKDKALQLYEKAIHL 552 (638) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHEECCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 67787776515780566788862030220410467899887642279610357765259999860132799999998816 Q ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCC Q ss_conf 68983689999999998799889999988786609998-99999999999857998999999999962889 Q gi|254781033|r 321 NKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSALHAMP 390 (492) Q Consensus 321 ~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~ 390 (492) +|.|+-.++-.|.+....+++.+|...|+.+-+..|+. .++.+++.|...-++-+-+.. -+.-|.+-.| T Consensus 553 d~kn~l~~~~~~~il~~~~~~~eal~~LEeLk~~vP~es~v~~llgki~k~~~~~~~Al~-~f~~A~~ldp 622 (638) T KOG1126 553 DPKNPLCKYHRASILFSLGRYVEALQELEELKELVPQESSVFALLGKIYKRLGNTDLALL-HFSWALDLDP 622 (638) T ss_pred CCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCHHHHH-HHHHHHCCCC T ss_conf 878815678877888762236999999999998575057899999999999733058887-6677860798 No 17 >PRK11788 hypothetical protein; Provisional Probab=99.64 E-value=8.4e-12 Score=110.09 Aligned_cols=294 Identities=17% Similarity=0.099 Sum_probs=223.0 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH-----HHHHHHHHHHHH Q ss_conf 875322221000168999999726885057999865455215886999999998884212698-----999999999995 Q gi|254781033|r 126 EVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAP-----WVTEAVVQQYVL 200 (492) Q Consensus 126 ~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~-----~a~~~L~~l~~~ 200 (492) .+-...++.+.++|.+.|..+++.++.....+..|.....+.|+.+.|+..=+.++. .|+.+ .++..|++=|.. T Consensus 41 ~Gln~Ll~~q~dkAid~f~~~~~~~~~t~e~hlaLG~LfRrrGE~dRAIriHq~Ll~-rp~L~~~~r~~a~~ELa~Dy~~ 119 (389) T PRK11788 41 KGLNFLLNEQPDKAIDLFIEMLKVDPETVELHLALGNLFRRRGEVDRAIRIHQNLLS-RPDLTREQRLLALQELGQDYLK 119 (389) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHHH T ss_conf 899887258945999999999864811499999999999965608899999999972-8798999999999999999986 Q ss_pred CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH-----HHHHHHHHHHHCCC Q ss_conf 2574569999999995023103578999999999999669989999999875431335279-----99999999997399 Q gi|254781033|r 201 AKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIM-----ASICAAKSLISQNK 275 (492) Q Consensus 201 ~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~-----a~~~~a~~l~~~g~ 275 (492) .|=.+.|...+.+......... . .+......+...++-..|.+.+.+..+....... ...-+|......|+ T Consensus 120 aGllDRAE~~f~~l~~~~~~~~--~--al~~Ll~iye~~~dW~~Ai~~a~~L~~~~~~~~~~~iAhy~CElA~~~~~~~~ 195 (389) T PRK11788 120 AGLLDRAEELFLQLVDEDDFAE--G--ALRQLLTIYQQEKDWQKAIEVAERLEKLGGDSLRKEIAHFYCELAQQALARGD 195 (389) T ss_pred CCCHHHHHHHHHHHHHCCCCHH--H--HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 7749999999999982854269--9--99999999998645999999999998704765069999999999999987489 Q ss_pred HHHHHHHHHHHHHHCCCH---HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 478999999999832898---99999998507895899999999998268983-68999999999879988999998878 Q gi|254781033|r 276 KRKAEVILEKIWKVNPHP---EIANIYTHLLSENTVGKLKRALRLEEINKESV-ESLVIVSKIALEMGSIDQAHAKAMLA 351 (492) Q Consensus 276 ~~~A~~~le~a~~~~p~p---~La~a~~~~~~~~~~~al~~~~~l~~~~P~~~-e~~~~LA~~~~~~g~~~~Ar~~l~~a 351 (492) ..+|.+.++++.+.+|+- .+..+-.++..+++..+++.++++..++|... ++.-.+-..+...|+-++-...+++. T Consensus 196 ~~~A~~~L~kAL~~dp~cvRA~lllg~l~~~~g~~~~Ai~~l~~i~~q~p~~l~~vl~~L~~~y~~l~~~~~~~~~L~~~ 275 (389) T PRK11788 196 LDAARALLKKALAADPQCVRASILLGDLALAQGDYAAAIEALERVEEQDPEYLPEVLPKLMECYQALGDEAEGLEFLRRA 275 (389) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999999999874824341899999999987999999999999997395779999999999999828888999999999 Q ss_pred HHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEE--------C--CCCCCCH---------------HHC Q ss_conf 66099989999999999985799899999999996288999527--------5--6989554---------------242 Q gi|254781033|r 352 MKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWI--------S--DDGYLSS---------------VWL 406 (492) Q Consensus 352 l~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~--------c--~~g~~~~---------------~W~ 406 (492) ++..|+..+...+++..... ++...|..|+.+.+.-.|+-.+. . ..+...+ .=. T Consensus 276 ~~~~~~~~l~l~l~~~l~~~-~g~~~A~~~l~~~L~~~Psl~g~~~Ll~~~l~~~~~~~~~~~l~~l~~~v~~~~~~~~~ 354 (389) T PRK11788 276 VEEYPGADLLLALAQLLEER-EGPEAAQALLREQLRRHPSLRGFHRLLDYHLAEAEEGRAKESLELLRDLVGEQLKRKPR 354 (389) T ss_pred HHHCCCCHHHHHHHHHHHHC-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCC T ss_conf 98589935999999999976-99999999999997529988999999998775244765435899999999999717999 Q ss_pred CCCCCCC-CCCCCCCCCCCC Q ss_conf 2483768-736400367875 Q gi|254781033|r 407 PLSPISK-TLCYFEWKIPTK 425 (492) Q Consensus 407 ~~c~~~~-~~~~~~W~~p~~ 425 (492) ..|.+|| +-.++.|.=|-- T Consensus 355 Y~C~~CGF~~~~~~WqCPsC 374 (389) T PRK11788 355 YRCRNCGFTARTLYWHCPSC 374 (389) T ss_pred EECCCCCCCCCCEEEECCCC T ss_conf 76999999888314579099 No 18 >KOG0495 consensus Probab=99.63 E-value=2e-11 Score=107.16 Aligned_cols=262 Identities=15% Similarity=0.119 Sum_probs=127.6 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC Q ss_conf 99987532222100016899999972688505799986545521588699999999888421269899999999999525 Q gi|254781033|r 123 YLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAK 202 (492) Q Consensus 123 ~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~g 202 (492) .|+.|-..-..||...|+..+..+.+..|....-...-....-...+++.|+.++.++....|.. .++..-..+..-++ T Consensus 587 wlM~ake~w~agdv~~ar~il~~af~~~pnseeiwlaavKle~en~e~eraR~llakar~~sgTe-Rv~mKs~~~er~ld 665 (913) T KOG0495 587 WLMYAKEKWKAGDVPAARVILDQAFEANPNSEEIWLAAVKLEFENDELERARDLLAKARSISGTE-RVWMKSANLERYLD 665 (913) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCC-HHHHHHHHHHHHHH T ss_conf 99888888751784889999999998389818889898887522464899999999975028851-26787768999864 Q ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 74569999999995023103578999999999999669989999999875431335279999999999973994789999 Q gi|254781033|r 203 EWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVI 282 (492) Q Consensus 203 dw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~ 282 (492) +.++|+.+++..++..+++.. +++..++...+.++...|++.+....+..|..+|.++.++++-...|+.-+|..+ T Consensus 666 ~~eeA~rllEe~lk~fp~f~K----l~lmlGQi~e~~~~ie~aR~aY~~G~k~cP~~ipLWllLakleEk~~~~~rAR~i 741 (913) T KOG0495 666 NVEEALRLLEEALKSFPDFHK----LWLMLGQIEEQMENIEMAREAYLQGTKKCPNSIPLWLLLAKLEEKDGQLVRARSI 741 (913) T ss_pred HHHHHHHHHHHHHHHCCCHHH----HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCHHHHHHH T ss_conf 079999999999884875589----9999868999987799999999702564889812999997788872652669999 Q ss_pred HHHHHHHCCCHHHH---HHHHHHCCCCHH------------------------------HHHHHHHHHHHHCCCCHHHHH Q ss_conf 99999832898999---999985078958------------------------------999999999982689836899 Q gi|254781033|r 283 LEKIWKVNPHPEIA---NIYTHLLSENTV------------------------------GKLKRALRLEEINKESVESLV 329 (492) Q Consensus 283 le~a~~~~p~p~La---~a~~~~~~~~~~------------------------------~al~~~~~l~~~~P~~~e~~~ 329 (492) |+++...||...+. ..-.+++.|+.. .+.......++.-.+|+.+++ T Consensus 742 ldrarlkNPk~~~lwle~Ir~ElR~Gn~~~a~~lmakALQecp~sg~LWaEaI~le~~~~rkTks~DALkkce~dphVLl 821 (913) T KOG0495 742 LDRARLKNPKNALLWLESIRMELRAGNKEQAELLMAKALQECPSSGLLWAEAIWLEPRPQRKTKSIDALKKCEHDPHVLL 821 (913) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCHHHH T ss_conf 88887349986264999999999748899999999999986875222689998704676652177899985168955399 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCHHH-HHHHHHHHHHHCCCHHHHHHHHHHHHCCCC Q ss_conf 9999999879988999998878660999899-999999999857998999999999962889 Q gi|254781033|r 330 IVSKIALEMGSIDQAHAKAMLAMKIAPRKEI-FLLLAQIEQANSHNTDKILYWTQSALHAMP 390 (492) Q Consensus 330 ~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~-~~llA~ie~~e~~d~~~ar~wl~rA~~a~~ 390 (492) ++|++.+....+++||+.++++++.+|+... +...-..+ .++|.+..-.+-+.+-..+.| T Consensus 822 aia~lfw~e~k~~kar~Wf~Ravk~d~d~GD~wa~fykfe-l~hG~eed~kev~~~c~~~EP 882 (913) T KOG0495 822 AIAKLFWSEKKIEKAREWFERAVKKDPDNGDAWAWFYKFE-LRHGTEEDQKEVLKKCETAEP 882 (913) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH-HHHCCHHHHHHHHHHHHCCCC T ss_conf 9999999888899999999999702886412999999999-983777779999998760599 No 19 >KOG2002 consensus Probab=99.63 E-value=7.6e-12 Score=110.47 Aligned_cols=289 Identities=15% Similarity=0.040 Sum_probs=107.9 Q ss_pred HHHHHHHCCHH--HHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHH-HHHHHHHHHHC Q ss_conf 99999721004--7879999863013466615899999875322221000168999999726885057-99986545521 Q gi|254781033|r 90 TGLMSIAAHNI--PLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREF-AVYSLYFESCR 166 (492) Q Consensus 90 ~gl~al~~Gd~--~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~-~~~gL~~~al~ 166 (492) +|+.-+..|+. +.|.+.+..+.+ .++++.|.+|..|..+...||+..+..+|+.++...|...- ...|...-..+ T Consensus 134 ~~~~~l~~~~~~~~~A~a~F~~Vl~--~sp~Nil~LlGkA~i~ynkkdY~~al~yyk~al~inp~~~aD~rIgig~Cf~k 211 (1018) T KOG2002 134 QRGFLLLEGDKSMDDADAQFHFVLK--QSPDNILALLGKARIAYNKKDYRGALKYYKKALRINPACKADVRIGIGHCFWK 211 (1018) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHH--HCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHH T ss_conf 3232220477637779999999996--08853588998999996121389999999999850866678763114667875 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHH Q ss_conf 5886999999998884212698999999999995257456---9999999995023103578999999999999669989 Q gi|254781033|r 167 IGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSR---AITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMI 243 (492) Q Consensus 167 ~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~---A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~ 243 (492) .|+.+.|+..++++++++|.+..++..|+.+.....|-+. +..++.++-+..+..+ +++.-.+......++.. T Consensus 212 l~~~~~a~~a~~ralqLdp~~v~alv~L~~~~l~~~d~~s~~~~~~ll~~ay~~n~~nP----~~l~~LAn~fyfK~dy~ 287 (1018) T KOG2002 212 LGMSEKALLAFERALQLDPTCVSALVALGEVDLNFNDSDSYKKGVQLLQRAYKENNENP----VALNHLANHFYFKKDYE 287 (1018) T ss_pred CCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCC----HHHHHHHHHHHHCCCHH T ss_conf 03436679999999850715599999999999870156778889999888876068995----79999977774135378 Q ss_pred HHHHHHHHHHHCCC---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH----HHHHHHHCCCCHHHHHHHHHH Q ss_conf 99999987543133---52799999999999739947899999999983289899----999998507895899999999 Q gi|254781033|r 244 ASYHDAIESLKLCD---NSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEI----ANIYTHLLSENTVGKLKRALR 316 (492) Q Consensus 244 ~A~~~~~~a~~~~p---~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~L----a~a~~~~~~~~~~~al~~~~~ 316 (492) .+...+..+....- -........|+.+..+|++++|.+++-.+.+.+++.-+ ...-.++..++...+...|++ T Consensus 288 ~v~~la~~ai~~t~~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~GlgQm~i~~~dle~s~~~fEk 367 (1018) T KOG2002 288 RVWHLAEHAIKNTENKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYIKRGDLEESKFCFEK 367 (1018) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCHHHHHHHHHHH T ss_conf 99999999998610147777889999999886226899999999997248887620012125789871327788999999 Q ss_pred HHHHCCCCHHHHHHHHHHHHHCC----CHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 99826898368999999999879----988999998878660999-89999999999985799899999999996 Q gi|254781033|r 317 LEEINKESVESLVIVSKIALEMG----SIDQAHAKAMLAMKIAPR-KEIFLLLAQIEQANSHNTDKILYWTQSAL 386 (492) Q Consensus 317 l~~~~P~~~e~~~~LA~~~~~~g----~~~~Ar~~l~~al~~~P~-~~~~~llA~ie~~e~~d~~~ar~wl~rA~ 386 (492) +++.+|++.+....||.++...+ .-++|..++.++++..|. .+.|+.++.+.+. +|..+.-..+.+|+ T Consensus 368 v~k~~p~~~etm~iLG~Lya~~~~~~~~~d~a~~~l~K~~~~~~~d~~a~l~laql~e~--~d~~~sL~~~~~A~ 440 (1018) T KOG2002 368 VLKQLPNNYETMKILGCLYAHSAKKQEKRDKASNVLGKVLEQTPVDSEAWLELAQLLEQ--TDPWASLDAYGNAL 440 (1018) T ss_pred HHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH--CCHHHHHHHHHHHH T ss_conf 99867650889999886788640125778999999988775053408889999999981--27688999999899 No 20 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=99.58 E-value=7.4e-10 Score=94.94 Aligned_cols=88 Identities=16% Similarity=0.123 Sum_probs=40.2 Q ss_pred HHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHCCCCHHHHHH Q ss_conf 221000168999999726885057999865455215886999999998884212698-9999999999952574569999 Q gi|254781033|r 132 AERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAP-WVTEAVVQQYVLAKEWSRAITF 210 (492) Q Consensus 132 ~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~-~a~~~L~~l~~~~gdw~~A~~~ 210 (492) ..||+..|++......+..+...+.+..-+..+.+.||.+.+-.++.++.+..++.. .+..+..+++..++|++.|+.. T Consensus 96 ~eG~~~qAEkl~~rnae~~e~p~l~~l~aA~AA~qrgd~~~an~yL~eaae~~~~~~l~v~ltrarlll~~~d~~aA~~~ 175 (400) T COG3071 96 FEGDFQQAEKLLRRNAEHGEQPVLAYLLAAEAAQQRGDEDRANRYLAEAAELAGDDTLAVELTRARLLLNRRDYPAAREN 175 (400) T ss_pred HCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 06738999999997653473408999999999886305778989999986258994088999999999867886568988 Q ss_pred HHHHHHCCC Q ss_conf 999995023 Q gi|254781033|r 211 LNQKKKNAK 219 (492) Q Consensus 211 l~~~~~~~~ 219 (492) ++++++..+ T Consensus 176 v~~ll~~~p 184 (400) T COG3071 176 VDQLLEMTP 184 (400) T ss_pred HHHHHHHCC T ss_conf 999987286 No 21 >KOG2002 consensus Probab=99.57 E-value=1.3e-10 Score=100.88 Aligned_cols=255 Identities=12% Similarity=0.063 Sum_probs=133.8 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCCCHHH---HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHCCCC Q ss_conf 9721004787999986301346661589---99998753222210001689999997268850-5799986545521588 Q gi|254781033|r 94 SIAAHNIPLARKMHSYVSQQHTFHNEYL---VYLLEVQIALAERQYNIAHEKLEMMLQIPATR-EFAVYSLYFESCRIGD 169 (492) Q Consensus 94 al~~Gd~~~A~k~~~~a~~~~~~~~~~L---~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~-~~~~~gL~~~al~~gd 169 (492) .++.||+..+..++.-+.... .+.++ .....+++.+.+||++.|..||.+.+..++.. .+.+.||...++..|| T Consensus 280 fyfK~dy~~v~~la~~ai~~t--~~~~~~aes~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~~~l~~~GlgQm~i~~~d 357 (1018) T KOG2002 280 FYFKKDYERVWHLAEHAIKNT--ENKSIKAESFYQLGRSYHAQGDFEKAFKYYMESLKADNDNFVLPLVGLGQMYIKRGD 357 (1018) T ss_pred HHHCCCHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCH T ss_conf 741353789999999999861--014777788999999988622689999999999724888762001212578987132 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC----CCHHHHHHHHHHHHCCCCHHHHH-------------------- Q ss_conf 699999999888421269899999999999525----74569999999995023103578-------------------- Q gi|254781033|r 170 LNSAQRYATKALDISPDAPWVTEAVVQQYVLAK----EWSRAITFLNQKKKNAKEWNRNR-------------------- 225 (492) Q Consensus 170 ~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~g----dw~~A~~~l~~~~~~~~~~~~~~-------------------- 225 (492) .+.+...+++.++..|++......|+-+|...+ .-+.|..++.+.++..+...+.- T Consensus 358 le~s~~~fEkv~k~~p~~~etm~iLG~Lya~~~~~~~~~d~a~~~l~K~~~~~~~d~~a~l~laql~e~~d~~~sL~~~~ 437 (1018) T KOG2002 358 LEESKFCFEKVLKQLPNNYETMKILGCLYAHSAKKQEKRDKASNVLGKVLEQTPVDSEAWLELAQLLEQTDPWASLDAYG 437 (1018) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 77889999999986765088999988678864012577899999998877505340888999999998127688999999 Q ss_pred --------------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHC-----CCCC-----HHHHHHHHHHHHHCCCHHHHHH Q ss_conf --------------999999999999669989999999875431-----3352-----7999999999997399478999 Q gi|254781033|r 226 --------------AILLIARSLENADKGDMIASYHDAIESLKL-----CDNS-----IMASICAAKSLISQNKKRKAEV 281 (492) Q Consensus 226 --------------a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~-----~p~~-----~~a~~~~a~~l~~~g~~~~A~~ 281 (492) ...+...+..++..|+...|...+.+|... +++. +..-..+|+++...++...|.. T Consensus 438 ~A~d~L~~~~~~ip~E~LNNvaslhf~~g~~~~A~~~f~~A~~~~~~~~n~de~~~~~lt~~YNlarl~E~l~~~~~A~e 517 (1018) T KOG2002 438 NALDILESKGKQIPPEVLNNVASLHFRLGNIEKALEHFKSALGKLLEVANKDEGKSTNLTLKYNLARLLEELHDTEVAEE 517 (1018) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999997278898899975889999850768789889988643012207100235106899889999874300067999 Q ss_pred HHHHHHHHCCCHHHHHHHHHH-----CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 999999832898999999985-----07895899999999998268983689999999998799889999988786 Q gi|254781033|r 282 ILEKIWKVNPHPEIANIYTHL-----LSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAM 352 (492) Q Consensus 282 ~le~a~~~~p~p~La~a~~~~-----~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al 352 (492) .+....+.+|. ...+|..+ ..++..++...+...+..+..|++.+-.+|..++...+|..|..-++..+ T Consensus 518 ~Yk~Ilkehp~--YId~ylRl~~ma~~k~~~~ea~~~lk~~l~~d~~np~arsl~G~~~l~k~~~~~a~k~f~~i~ 591 (1018) T KOG2002 518 MYKSILKEHPG--YIDAYLRLGCMARDKNNLYEASLLLKDALNIDSSNPNARSLLGNLHLKKSEWKPAKKKFETIL 591 (1018) T ss_pred HHHHHHHHCCH--HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 99999987840--678998756887740671889999999872355792799998899975555320110799998 No 22 >KOG1126 consensus Probab=99.55 E-value=7.6e-11 Score=102.62 Aligned_cols=277 Identities=11% Similarity=0.087 Sum_probs=198.5 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 99999999997210047879999863013466615899999875322221000168999999726885057999865455 Q gi|254781033|r 85 YKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFES 164 (492) Q Consensus 85 ~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~a 164 (492) +-...-|...+--+|+.+|++.+..+.+..+-.-...-.+ ..---+++.+. +.--.-+.+++.+++....+-.+.... T Consensus 354 wvl~q~GrayFEl~~Y~~a~~~F~~~r~~~p~rv~~meiy-ST~LWHLq~~v-~Ls~Laq~Li~~~~~sPesWca~GNcf 431 (638) T KOG1126 354 WVLSQLGRAYFELIEYDQAERIFSLVRRIEPYRVKGMEIY-STTLWHLQDEV-ALSYLAQDLIDTDPNSPESWCALGNCF 431 (638) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH-HHHHHHHHHHH-HHHHHHHHHHHHCCCCCHHHHHHCCHH T ss_conf 7999988999989878999999999986145411324678-88888877567-788998988750878707787740256 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 21588699999999888421269899999999999525745699999999950231035789999999999996699899 Q gi|254781033|r 165 CRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIA 244 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~ 244 (492) --++|.++|++.++++.+++|+..+++..++--+....+++.|...+..+++..+.+. -.+++.+..++.++..+. T Consensus 432 SLQkdh~~Aik~f~RAiQldp~faYayTLlGhE~~~~ee~d~a~~~fr~Al~~~~rhY----nAwYGlG~vy~Kqek~e~ 507 (638) T KOG1126 432 SLQKDHDTAIKCFKRAIQLDPRFAYAYTLLGHESIATEEFDKAMKSFRKALGVDPRHY----NAWYGLGTVYLKQEKLEF 507 (638) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCHHH----HHHHHHHHHEECCCHHHH T ss_conf 6656799999988776546876321235307734545777767787776515780566----788862030220410467 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH---HHHHHHHCCCCHHHHHHHHHHHHHHC Q ss_conf 999998754313352799999999999739947899999999983289899---99999850789589999999999826 Q gi|254781033|r 245 SYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEI---ANIYTHLLSENTVGKLKRALRLEEIN 321 (492) Q Consensus 245 A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~L---a~a~~~~~~~~~~~al~~~~~l~~~~ 321 (492) |.-.+.+|...+|...-....++..+.+.|+.++|..++++|...+|..-+ -.+-.....++..++++.+|.+.... T Consensus 508 Ae~~fqkA~~INP~nsvi~~~~g~~~~~~k~~d~AL~~~~~A~~ld~kn~l~~~~~~~il~~~~~~~eal~~LEeLk~~v 587 (638) T KOG1126 508 AEFHFQKAVEINPSNSVILCHIGRIQHQLKRKDKALQLYEKAIHLDPKNPLCKYHRASILFSLGRYVEALQELEELKELV 587 (638) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHC T ss_conf 89988764227961035776525999986013279999999881687881567887788876223699999999999857 Q ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 8983689999999998799889999988786609998999999999 Q gi|254781033|r 322 KESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQI 367 (492) Q Consensus 322 P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~i 367 (492) |++...+.++|+.+-+.|+.+.|..++.=+.+++|...--..-+++ T Consensus 588 P~es~v~~llgki~k~~~~~~~Al~~f~~A~~ldpkg~~i~~k~~~ 633 (638) T KOG1126 588 PQESSVFALLGKIYKRLGNTDLALLHFSWALDLDPKGAQIQIKAAI 633 (638) T ss_pred CCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 5057899999999999733058887667786079850010678876 No 23 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=99.54 E-value=6e-12 Score=111.22 Aligned_cols=190 Identities=18% Similarity=0.092 Sum_probs=142.3 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC--CHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 999999999952574569999999995023103578999999999999669--989999999875431335279999999 Q gi|254781033|r 190 VTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKG--DMIASYHDAIESLKLCDNSIMASICAA 267 (492) Q Consensus 190 a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~--~~~~A~~~~~~a~~~~p~~~~a~~~~a 267 (492) +...|+.-|.++||.+.|.+-+++++++.|...... .+++..+...| +.+.|...+.+|+.++|+...+...|| T Consensus 39 ~r~qLal~Yl~~G~~~~Ak~~l~~AL~~dp~~~~a~----~~lA~~Yq~~grP~~~~A~~~f~~Al~~~p~~g~~~NNYG 114 (247) T TIGR02521 39 IRVQLALGYLEQGDLEKAKENLDKALEHDPDDYDAY----LALALYYQQLGRPELEKAEQSFRKALTLKPNNGDVLNNYG 114 (247) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHH----HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCEECHHH T ss_conf 999999999860787899999999987376661289----9999999871799557899999998624899852320145 Q ss_pred HHHHHCC--CHHHHHHHHHHHHHHCCCHHHHH-----HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 9999739--94789999999998328989999-----9998507895899999999998268983689999999998799 Q gi|254781033|r 268 KSLISQN--KKRKAEVILEKIWKVNPHPEIAN-----IYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGS 340 (492) Q Consensus 268 ~~l~~~g--~~~~A~~~le~a~~~~p~p~La~-----a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~ 340 (492) .+|.++| ++++|...+++|.+..--+.... .+..++.++...+.+.|+++++.+|+.+++.+.||++.+..|+ T Consensus 115 ~FLC~~g~~~~~~A~~~F~~A~~~p~Y~~~~~~~eN~G~C~~~~g~~~~A~~~l~~Al~~dp~~p~~lleLa~l~~~~g~ 194 (247) T TIGR02521 115 TFLCQQGNDKYEQAEQQFEAAIEDPLYPQPARSYENAGLCALKAGDTEKAEKYLTRALKIDPQRPESLLELAELYYERGQ 194 (247) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCC T ss_conf 77640489706899999999986688973302455499999861786889999999998587888557999999875077 Q ss_pred HHHHHHHHHHHHHC----CCCHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 88999998878660----999899999999999857998999999999 Q gi|254781033|r 341 IDQAHAKAMLAMKI----APRKEIFLLLAQIEQANSHNTDKILYWTQS 384 (492) Q Consensus 341 ~~~Ar~~l~~al~~----~P~~~~~~llA~ie~~e~~d~~~ar~wl~r 384 (492) +..|+..+++.... .++.+.+.|+.+|+... ||...++.|-.+ T Consensus 195 ~~~A~~~l~ry~~~~G~~~~~a~sL~Lg~riA~~~-Gd~~~~~~y~~q 241 (247) T TIGR02521 195 YAQARALLERYQQTVGVKNQTAESLWLGIRIARAL-GDVAAAKRYGAQ 241 (247) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH-CCHHHHHHHHHH T ss_conf 28999999999987367888873478999999984-429999999998 No 24 >KOG0547 consensus Probab=99.54 E-value=8.3e-11 Score=102.34 Aligned_cols=79 Identities=14% Similarity=0.110 Sum_probs=52.9 Q ss_pred HHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 2222100016899999972688505799986545521588699999999888421269899999999999525745699 Q gi|254781033|r 130 ALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAI 208 (492) Q Consensus 130 A~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~ 208 (492) +..+|+++.|.++|..+++..|+...-+..+.--+...||++.-.+.-.++++++|+...++......+.+.|+..+++ T Consensus 125 ~f~~kkY~eAIkyY~~AI~l~p~epiFYsNraAcY~~lgd~~~Vied~TkALEl~P~Y~KAl~RRA~A~E~lg~~~eal 203 (606) T KOG0547 125 FFRNKKYDEAIKYYTQAIELCPDEPIFYSNRAACYESLGDWEKVIEDCTKALELNPDYVKALLRRASAHEQLGKFDEAL 203 (606) T ss_pred HHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 5302538889999999985089976133108999999850999999878776249088999999878877532388898 No 25 >KOG2076 consensus Probab=99.52 E-value=3.5e-10 Score=97.46 Aligned_cols=322 Identities=12% Similarity=0.003 Sum_probs=190.0 Q ss_pred HHHCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 98418288999999999999-99999999999721004787999986301346661589999987532222100016899 Q gi|254781033|r 64 FFLSCPAMLFHMLHKRNYDK-GYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEK 142 (492) Q Consensus 64 ~i~~~p~~~~~~~~~rr~~k-~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~ 142 (492) +-.+.|++-++.+.+++-.+ ....+.++....+.||...|++.+..+.++.+....+. ...+...+++||...+... T Consensus 118 ~s~~~~k~~~~~r~~~~l~~~l~~ll~eAN~lfarg~~eeA~~i~~EvIkqdp~~~~ay--~tL~~IyEqrGd~eK~l~~ 195 (895) T KOG2076 118 KSTGTKKRGRRSRGKSKLAPELRQLLGEANNLFARGDLEEAEEILMEVIKQDPRNPIAY--YTLGEIYEQRGDIEKALNF 195 (895) T ss_pred ECCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHH--HHHHHHHHHCCCHHHHHHH T ss_conf 11567766788886444689999999998899980778999999999997273651369--9999999980329999999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCH- Q ss_conf 9999726885057999865455215886999999998884212698999999999995257456999999999502310- Q gi|254781033|r 143 LEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEW- 221 (492) Q Consensus 143 ~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~- 221 (492) +..++.-.|....-+..++....+.|.+.+|+-++.++++.+|.+-........+|.+.|+...|..++.++....+.. T Consensus 196 ~llAAHL~p~d~e~W~~ladls~~~~~i~qA~~cy~rAI~~~p~n~~~~~ers~L~~~~G~~~~Am~~f~~l~~~~p~~d 275 (895) T KOG2076 196 WLLAAHLNPKDYELWKRLADLSEQLGNINQARYCYSRAIQANPSNWELIYERSSLYQKTGDLKRAMETFLQLLQLDPPVD 275 (895) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHCCCHH T ss_conf 99999669988689999999998601299999999999860886228999999999984707888999999986289015 Q ss_pred -HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH--CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH---------- Q ss_conf -357899999999999966998999999987543--1335279999999999973994789999999998---------- Q gi|254781033|r 222 -NRNRAILLIARSLENADKGDMIASYHDAIESLK--LCDNSIMASICAAKSLISQNKKRKAEVILEKIWK---------- 288 (492) Q Consensus 222 -~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~--~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~---------- 288 (492) .+-... ....+......++...|.+.+..++. .+....+-...++.+++...+++++...+...-. T Consensus 276 ~er~~d~-i~~~~~~~~~~~~~e~a~~~le~~~s~~~~~~~~ed~ni~ael~l~~~q~d~~~~~i~~~~~r~~e~d~~e~ 354 (895) T KOG2076 276 IERIEDL-IRRVAHYFITHNERERAAKALEGALSKEKDEASLEDLNILAELFLKNKQSDKALMKIVDDRNRESEKDDSEW 354 (895) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHH T ss_conf 8999999-999999999710789999999977741034035627899999999867888766999998601367871654 Q ss_pred ------------HCCCH-----HH-----HHHHHHHCCCCHHHHHHHHHHHHHHC--C-CCHHHHHHHHHHHHHCCCHHH Q ss_conf ------------32898-----99-----99999850789589999999999826--8-983689999999998799889 Q gi|254781033|r 289 ------------VNPHP-----EI-----ANIYTHLLSENTVGKLKRALRLEEIN--K-ESVESLVIVSKIALEMGSIDQ 343 (492) Q Consensus 289 ------------~~p~p-----~L-----a~a~~~~~~~~~~~al~~~~~l~~~~--P-~~~e~~~~LA~~~~~~g~~~~ 343 (492) ..+++ +| ..-..++...+..+.+-.+ +...+ | ++++..+-+++++++.|.+.+ T Consensus 355 ~~~~~~~~~~~~~~~~~~~~s~~l~v~rl~icL~~L~~~e~~e~ll~~--l~~~n~~~~d~~dL~~d~a~al~~~~~~~~ 432 (895) T KOG2076 355 DTDERRREEPNALCEVGKELSYDLRVIRLMICLVHLKERELLEALLHF--LVEDNVWVSDDVDLYLDLADALTNIGKYKE 432 (895) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCCHHHHHHHHHHCCCCCCHHHHHHHH--HHHHCCCHHHHHHHHHHHHHHHHHCCCHHH T ss_conf 002203455532014787777563057675432002443458889888--888337845557899999999983655888 Q ss_pred HHHHHHHHHHCCCCHH--HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC Q ss_conf 9999887866099989--99999999998579989999999999628899 Q gi|254781033|r 344 AHAKAMLAMKIAPRKE--IFLLLAQIEQANSHNTDKILYWTQSALHAMPD 391 (492) Q Consensus 344 Ar~~l~~al~~~P~~~--~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~d 391 (492) |.+++..+....+... +|..+|..+... +....|.+...+++.-.|| T Consensus 433 Al~~l~~i~~~~~~~~~~vw~~~a~c~~~l-~e~e~A~e~y~kvl~~~p~ 481 (895) T KOG2076 433 ALRLLSPITNREGYQNAFVWYKLARCYMEL-GEYEEAIEFYEKVLILAPD 481 (895) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHCCCC T ss_conf 999999986483335513469899999987-4289999999999952898 No 26 >KOG0624 consensus Probab=99.48 E-value=7e-09 Score=87.34 Aligned_cols=267 Identities=15% Similarity=0.090 Sum_probs=179.7 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 99999999997210047879999863013466615899999875322221000168999999726885057999865455 Q gi|254781033|r 85 YKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFES 164 (492) Q Consensus 85 ~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~a 164 (492) .+.|.-|---++.|.+..|...+-++.. ..++..++.+-.|-.....|....+..-+...++..|.-.-+..--.... T Consensus 39 ekhlElGk~lla~~Q~sDALt~yHaAve--~dp~~Y~aifrRaT~yLAmGksk~al~Dl~rVlelKpDF~~ARiQRg~vl 116 (504) T KOG0624 39 EKHLELGKELLARGQLSDALTHYHAAVE--GDPNNYQAIFRRATVYLAMGKSKAALQDLSRVLELKPDFMAARIQRGVVL 116 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHC--CCCHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHCCCCHHHHHHHHCHHH T ss_conf 9999988999973467789999888870--78066999998777776525774003308999861831888888722035 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCCHHHHH---------------HHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 2158869999999988842126989999---------------9999999525745699999999950231035789999 Q gi|254781033|r 165 CRIGDLNSAQRYATKALDISPDAPWVTE---------------AVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILL 229 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~P~~~~a~~---------------~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~ 229 (492) +++|.+++|..-+...+..+|++..... ..+.-+...||...++..+...+...+= .+.+. T Consensus 117 lK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~ql~s~~~~GD~~~ai~~i~~llEi~~W----da~l~ 192 (504) T KOG0624 117 LKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQQLKSASGSGDCQNAIEMITHLLEIQPW----DASLR 192 (504) T ss_pred HHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCC----HHHHH T ss_conf 53252888987999997259873026889988776899999999999884387465599999888853713----36899 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHH------------ Q ss_conf 99999999669989999999875431335279999999999973994789999999998328989999------------ Q gi|254781033|r 230 IARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIAN------------ 297 (492) Q Consensus 230 ~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~------------ 297 (492) ..++..+...+++..|....+.+.+++.++.....-.+.++...|+.+.+.+.++..++.+|+.-+-. T Consensus 193 ~~Rakc~i~~~e~k~AI~Dlk~askLs~DnTe~~ykis~L~Y~vgd~~~sL~~iRECLKldpdHK~Cf~~YKklkKv~K~ 272 (504) T KOG0624 193 QARAKCYIAEGEPKKAIHDLKQASKLSQDNTEGHYKISQLLYTVGDAENSLKEIRECLKLDPDHKLCFPFYKKLKKVVKS 272 (504) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf 99988888538688888889998740466538889899899766118878999999980695333677889999999999 Q ss_pred ---HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHH----HHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf ---9998507895899999999998268983689----99999999879988999998878660999 Q gi|254781033|r 298 ---IYTHLLSENTVGKLKRALRLEEINKESVESL----VIVSKIALEMGSIDQAHAKAMLAMKIAPR 357 (492) Q Consensus 298 ---a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~----~~LA~~~~~~g~~~~Ar~~l~~al~~~P~ 357 (492) +-..++.....+.+...+++++.+|.-+..+ ..+.+.+...+++++|.+.-..+++.+|+ T Consensus 273 les~e~~ie~~~~t~cle~ge~vlk~ep~~~~ir~~~~r~~c~C~~~d~~~~eAiqqC~evL~~d~~ 339 (504) T KOG0624 273 LESAEQAIEEKHWTECLEAGEKVLKNEPEETMIRYNGFRVLCTCYREDEQFGEAIQQCKEVLDIDPD 339 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEHEEEECCCCCCCHHHHHHHHHHHHHCCCH T ss_conf 9989999862249999998789871598656142120000000005667778999999999715903 No 27 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=99.45 E-value=2.1e-09 Score=91.35 Aligned_cols=293 Identities=17% Similarity=0.056 Sum_probs=190.4 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHH-----HHHHHHHHHHHC Q ss_conf 753222210001689999997268850579998654552158869999999988842126989-----999999999952 Q gi|254781033|r 127 VQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPW-----VTEAVVQQYVLA 201 (492) Q Consensus 127 A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~-----a~~~L~~l~~~~ 201 (492) +-....+.+.++|.+.|..+++.++..+.....|.....+.|..|.|+..-+. +-..|+.+. +...|++-|... T Consensus 42 GlNfLLs~Q~dKAvdlF~e~l~~d~~t~e~~ltLGnLfRsRGEvDRAIRiHQ~-L~~spdlT~~qr~lAl~qL~~Dym~a 120 (389) T COG2956 42 GLNFLLSNQPDKAVDLFLEMLQEDPETFEAHLTLGNLFRSRGEVDRAIRIHQT-LLESPDLTFEQRLLALQQLGRDYMAA 120 (389) T ss_pred HHHHHHHCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHH-HHCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 78898616963699999999835815678988988898863508899999998-85289986689999999998999985 Q ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH--HH---HHHHHHHHHCCCH Q ss_conf 574569999999995023103578999999999999669989999999875431335279--99---9999999973994 Q gi|254781033|r 202 KEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIM--AS---ICAAKSLISQNKK 276 (492) Q Consensus 202 gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~--a~---~~~a~~l~~~g~~ 276 (492) |=++.|...+........-...... .....+....+-..|.+.+.+..++.++... .+ .-+|..+....+. T Consensus 121 Gl~DRAE~~f~~L~de~efa~~Alq----qLl~IYQ~treW~KAIe~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~~~ 196 (389) T COG2956 121 GLLDRAEDIFNQLVDEGEFAEGALQ----QLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDV 196 (389) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 6466899999997346343289999----999999983689999999999997188521258999999999997403118 Q ss_pred HHHHHHHHHHHHHCCCH---HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 78999999999832898---9999999850789589999999999826898-3689999999998799889999988786 Q gi|254781033|r 277 RKAEVILEKIWKVNPHP---EIANIYTHLLSENTVGKLKRALRLEEINKES-VESLVIVSKIALEMGSIDQAHAKAMLAM 352 (492) Q Consensus 277 ~~A~~~le~a~~~~p~p---~La~a~~~~~~~~~~~al~~~~~l~~~~P~~-~e~~~~LA~~~~~~g~~~~Ar~~l~~al 352 (492) ++|...+++++..+|.- .+...-.++..|++..+++.+++.+.+||+. ++....|...|.+.|+..+....|.... T Consensus 197 d~A~~~l~kAlqa~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~~Qn~~yl~evl~~L~~~Y~~lg~~~~~~~fL~~~~ 276 (389) T COG2956 197 DRARELLKKALQADKKCVRASIILGRVELAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQLGKPAEGLNFLRRAM 276 (389) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 99999999998628200103545438887526669999999999984957799999999999999478778999999998 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCC-----------------CEECCC-------C-CCCHHHCC Q ss_conf 6099989999999999985799899999999996288999-----------------527569-------8-95542422 Q gi|254781033|r 353 KIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDP-----------------LWISDD-------G-YLSSVWLP 407 (492) Q Consensus 353 ~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp-----------------~W~c~~-------g-~~~~~W~~ 407 (492) +..+....-+.|+++.+...|. ..++..+.+-++..|+- .|..+- | .+-..=.. T Consensus 277 ~~~~g~~~~l~l~~l~e~~eG~-~~Aq~~l~~Ql~r~Pt~~gf~rl~~~~l~daeeg~~k~sL~~lr~mvgeql~~~~~Y 355 (389) T COG2956 277 ETNTGADAELMLADLIELQEGI-DAAQAYLTRQLRRKPTMRGFHRLMDYHLADAEEGRAKESLDLLRDMVGEQLRRKPRY 355 (389) T ss_pred HCCCCCCHHHHHHHHHHHHHCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCC T ss_conf 6447851999999999971077-999999999986298188999998855021466622325999999999997316772 Q ss_pred CCCCCCC-CCCCCCCCCCC Q ss_conf 4837687-36400367875 Q gi|254781033|r 408 LSPISKT-LCYFEWKIPTK 425 (492) Q Consensus 408 ~c~~~~~-~~~~~W~~p~~ 425 (492) -|.+||- --+|-|.-|.- T Consensus 356 RC~~CGF~a~~l~W~CPsC 374 (389) T COG2956 356 RCQNCGFTAHTLYWHCPSC 374 (389) T ss_pred EECCCCCCHHEEEEECCCC T ss_conf 1001686310135418875 No 28 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=99.44 E-value=9.9e-11 Score=101.75 Aligned_cols=198 Identities=17% Similarity=0.135 Sum_probs=160.0 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC--CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC Q ss_conf 998753222210001689999997268850579998654552158--869999999988842126989999999999952 Q gi|254781033|r 124 LLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIG--DLNSAQRYATKALDISPDAPWVTEAVVQQYVLA 201 (492) Q Consensus 124 Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~g--d~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~ 201 (492) +-.|-.+..+||...|.+-++++++.+|....++..|+..+++.| +.+.|...+++++.++|++..++...|..+|++ T Consensus 41 ~qLal~Yl~~G~~~~Ak~~l~~AL~~dp~~~~a~~~lA~~Yq~~grP~~~~A~~~f~~Al~~~p~~g~~~NNYG~FLC~~ 120 (247) T TIGR02521 41 VQLALGYLEQGDLEKAKENLDKALEHDPDDYDAYLALALYYQQLGRPELEKAEQSFRKALTLKPNNGDVLNNYGTFLCQQ 120 (247) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCEECHHHHHHHHC T ss_conf 99999998607878999999999873766612899999999871799557899999998624899852320145776404 Q ss_pred C--CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHH Q ss_conf 5--74569999999995023103578999999999999669989999999875431335279999999999973994789 Q gi|254781033|r 202 K--EWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKA 279 (492) Q Consensus 202 g--dw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A 279 (492) | ++++|...|+++.+... +.. ....+...+.+.+..|+.+.|...+.+|++.+|+.+++...++++.+..|++.+| T Consensus 121 g~~~~~~A~~~F~~A~~~p~-Y~~-~~~~~eN~G~C~~~~g~~~~A~~~l~~Al~~dp~~p~~lleLa~l~~~~g~~~~A 198 (247) T TIGR02521 121 GNDKYEQAEQQFEAAIEDPL-YPQ-PARSYENAGLCALKAGDTEKAEKYLTRALKIDPQRPESLLELAELYYERGQYAQA 198 (247) T ss_pred CCCCHHHHHHHHHHHHHCCC-CCC-CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHH T ss_conf 89706899999999986688-973-3024554999998617868899999999985878885579999998750772899 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 999999998328989999999850789589999999999826898368999999999879988999998878 Q gi|254781033|r 280 EVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLA 351 (492) Q Consensus 280 ~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~a 351 (492) ...|++.-... -..+..++++.+-.+++...||-..+.++-.++ T Consensus 199 ~~~l~ry~~~~----------------------------G~~~~~a~sL~Lg~riA~~~Gd~~~~~~y~~qL 242 (247) T TIGR02521 199 RALLERYQQTV----------------------------GVKNQTAESLWLGIRIARALGDVAAAKRYGAQL 242 (247) T ss_pred HHHHHHHHHHH----------------------------CCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 99999999873----------------------------678888734789999999844299999999985 No 29 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=99.41 E-value=9.3e-08 Score=78.55 Aligned_cols=131 Identities=18% Similarity=0.113 Sum_probs=86.1 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH------H--HHHHHHHHCCCCHHHHHHHHHHHHHHCCC----------- Q ss_conf 9999999997399478999999999832898------9--99999985078958999999999982689----------- Q gi|254781033|r 263 SICAAKSLISQNKKRKAEVILEKIWKVNPHP------E--IANIYTHLLSENTVGKLKRALRLEEINKE----------- 323 (492) Q Consensus 263 ~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p------~--La~a~~~~~~~~~~~al~~~~~l~~~~P~----------- 323 (492) ...+|..|...++..+|+.+++......|.. + ....|+++..++..++.+.++++.+..|- T Consensus 327 ~~avadayL~~~qP~kA~~iy~~l~~~~~~~~~~~~~e~~~~LfYA~~e~e~~~eA~~l~~~~~~~~p~~~~~~g~~~~~ 406 (818) T PRK10049 327 RRWVASAYLDRRQPEKAAPILTELFYHPETIAPDLSDEELADLFYSLLESEQYPGALQVAVHTIETTPYFVRLYGLPTSI 406 (818) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCC T ss_conf 99999999863996899999999973485457653567887789998611118999999998653289604306888898 Q ss_pred ----CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHH-HHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCE Q ss_conf ----83689999999998799889999988786609998999-999999998579989999999999628899952 Q gi|254781033|r 324 ----SVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIF-LLLAQIEQANSHNTDKILYWTQSALHAMPDPLW 394 (492) Q Consensus 324 ----~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~-~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W 394 (492) ..+...+++...+-.++.++|.+.++..+...|-..-+ .-+|+|++.. |-...+..++.++..-.|+... T Consensus 407 PNddy~~~~~l~a~~~~~~ndL~~Ae~~le~l~~~APgN~~L~~~~A~v~r~R-g~pr~Ae~~~~~a~~l~P~~~~ 481 (818) T PRK10049 407 PNDDWLQGQSLLVQSLKASNDLPQAEKRLEELAYTAPGNQGLRIDYASVYQAR-GWPRAAENELKAAESLEPRSLN 481 (818) T ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHCCCCCHH T ss_conf 88279999999999999757849999999999863999879999999999854-9959999999998700864279 No 30 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=99.39 E-value=1.1e-07 Score=77.99 Aligned_cols=127 Identities=12% Similarity=-0.022 Sum_probs=97.5 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHH-----HHHHHHHHHHCCCHHHHHHHHHHHHHHC-------------CCHH Q ss_conf 999996699899999998754313352799-----9999999997399478999999999832-------------8989 Q gi|254781033|r 233 SLENADKGDMIASYHDAIESLKLCDNSIMA-----SICAAKSLISQNKKRKAEVILEKIWKVN-------------PHPE 294 (492) Q Consensus 233 a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a-----~~~~a~~l~~~g~~~~A~~~le~a~~~~-------------p~p~ 294 (492) +-.++..+.+.+|...+.+.....|...+. ..-+--.+++.+++.+|..++++..... |+++ T Consensus 331 adayL~~~qP~kA~~iy~~l~~~~~~~~~~~~~e~~~~LfYA~~e~e~~~eA~~l~~~~~~~~p~~~~~~g~~~~~PNdd 410 (818) T PRK10049 331 ASAYLDRRQPEKAAPILTELFYHPETIAPDLSDEELADLFYSLLESEQYPGALQVAVHTIETTPYFVRLYGLPTSIPNDD 410 (818) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHH T ss_conf 99998639968999999999734854576535678877899986111189999999986532896043068888988827 Q ss_pred HHH-----HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 999-----99985078958999999999982689836899999999987998899999887866099989 Q gi|254781033|r 295 IAN-----IYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 295 La~-----a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~ 359 (492) ... +-..+..++...+-+.++.++...|.|...++.+|.++...|....|...++++...+|... T Consensus 411 y~~~~~l~a~~~~~~ndL~~Ae~~le~l~~~APgN~~L~~~~A~v~r~Rg~pr~Ae~~~~~a~~l~P~~~ 480 (818) T PRK10049 411 WLQGQSLLVQSLKASNDLPQAEKRLEELAYTAPGNQGLRIDYASVYQARGWPRAAENELKAAESLEPRSL 480 (818) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH T ss_conf 9999999999999757849999999999863999879999999999854995999999999870086427 No 31 >KOG1125 consensus Probab=99.37 E-value=5.9e-11 Score=103.49 Aligned_cols=235 Identities=11% Similarity=0.022 Sum_probs=192.5 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 99875322221000168999999726885057999865455215886999999998884212698999999999995257 Q gi|254781033|r 124 LLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKE 203 (492) Q Consensus 124 Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gd 203 (492) +-.+...+..|+...|.-.|++.+..+|.+..+.+.|....-..++-..|+..++++++++|++..++..|+--|.-+|. T Consensus 289 f~eG~~lm~nG~L~~A~LafEAAVkqdP~haeAW~~LG~~qaENE~E~~ai~AL~rcl~LdP~NleaLmaLAVSytNeg~ 368 (579) T KOG1125 289 FKEGCNLMKNGDLSEAALAFEAAVKQDPQHAEAWQKLGITQAENENEQNAISALRRCLELDPTNLEALMALAVSYTNEGL 368 (579) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 89999998638905999999999861928789999961576503420889999999984699519999998888861466 Q ss_pred CHHHHHHHHHHHHCCCCHHHHHHHH---HHHHHHHHHHCCCHHHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHCCCHHH Q ss_conf 4569999999995023103578999---99999999966998999999987543133--527999999999997399478 Q gi|254781033|r 204 WSRAITFLNQKKKNAKEWNRNRAIL---LIARSLENADKGDMIASYHDAIESLKLCD--NSIMASICAAKSLISQNKKRK 278 (492) Q Consensus 204 w~~A~~~l~~~~~~~~~~~~~~a~l---~~~~a~~~~~~~~~~~A~~~~~~a~~~~p--~~~~a~~~~a~~l~~~g~~~~ 278 (492) =..|++.|...++..+.+......- ................-.+.+.++.+..| ..+.....++-+|.-.|++++ T Consensus 369 q~~Al~~L~~Wi~~~p~y~~l~~a~~~~~~~~~~s~~~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~efdr 448 (579) T KOG1125 369 QNQALKMLDKWIRNKPKYVHLVSAGENEDFENTKSFLDSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGEFDR 448 (579) T ss_pred HHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHH T ss_conf 88899999999984862111146676656667767788789999999999999857788884277626789740147888 Q ss_pred HHHHHHHHHHHCCCHHH-HHHHHHH--CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 99999999983289899-9999985--07895899999999998268983689999999998799889999988786609 Q gi|254781033|r 279 AEVILEKIWKVNPHPEI-ANIYTHL--LSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIA 355 (492) Q Consensus 279 A~~~le~a~~~~p~p~L-a~a~~~~--~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~ 355 (492) |..-++.++...|...+ ..-++.. .++...+++.-|.+.+++.|+...+++.||-.|+..|.|.+|.++|-.++... T Consensus 449 aiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNlG~ykEA~~hlL~AL~mq 528 (579) T KOG1125 449 AVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCMNLGAYKEAVKHLLEALSMQ 528 (579) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999704984177898866886077553999999999984188726530101456620566999999999999865 Q ss_pred CCH Q ss_conf 998 Q gi|254781033|r 356 PRK 358 (492) Q Consensus 356 P~~ 358 (492) +.. T Consensus 529 ~ks 531 (579) T KOG1125 529 RKS 531 (579) T ss_pred HCC T ss_conf 315 No 32 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=99.36 E-value=1.7e-09 Score=92.17 Aligned_cols=156 Identities=16% Similarity=0.057 Sum_probs=85.2 Q ss_pred HHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 22221000168999999726885057999865455215886999999998884212698999999999995257456999 Q gi|254781033|r 130 ALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAIT 209 (492) Q Consensus 130 A~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~ 209 (492) ...+||...|++-++++++.+|....++..++..+...|+.+.|.+.++++++++|++..++...+-..|.+|.+++|.. T Consensus 45 YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdVLNNYG~FLC~qg~~~eA~q 124 (250) T COG3063 45 YLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQ 124 (250) T ss_pred HHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHH T ss_conf 99778999999889999884952088999999999990775567999999985298864055210699970898689999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 999999502310357899999999999966998999999987543133527999999999997399478999999999 Q gi|254781033|r 210 FLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIW 287 (492) Q Consensus 210 ~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~ 287 (492) .+++++....-.. .+..+...+.+.+..|+.+.+.+.+.++++.+|++++.....++.+++.|++..|...+++.- T Consensus 125 ~F~~Al~~P~Y~~--~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~l~~a~~~~~~~~y~~Ar~~~~~~~ 200 (250) T COG3063 125 QFERALADPAYGE--PSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELARLHYKAGDYAPARLYLERYQ 200 (250) T ss_pred HHHHHHHCCCCCC--CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 9999871888777--530454457888643770447999999988375787678998798875313018999999997 No 33 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=99.35 E-value=9.5e-10 Score=94.08 Aligned_cols=192 Identities=15% Similarity=0.068 Sum_probs=148.1 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 99986545521588699999999888421269899999999999525745699999999950231035789999999999 Q gi|254781033|r 156 AVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLE 235 (492) Q Consensus 156 ~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~ 235 (492) ....|+..++..||+..|...++++++.+|++..++..++.+|.. T Consensus 37 arlqLal~YL~~gd~~~A~~nlekAL~~DPs~~~a~~~~A~~Yq~----------------------------------- 81 (250) T COG3063 37 ARLQLALGYLQQGDYAQAKKNLEKALEHDPSYYLAHLVRAHYYQK----------------------------------- 81 (250) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH----------------------------------- T ss_conf 999999999977899999988999988495208899999999999----------------------------------- Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC--CH-HHH--HHHHHHCCCCHHHH Q ss_conf 99669989999999875431335279999999999973994789999999998328--98-999--99998507895899 Q gi|254781033|r 236 NADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP--HP-EIA--NIYTHLLSENTVGK 310 (492) Q Consensus 236 ~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p--~p-~La--~a~~~~~~~~~~~a 310 (492) .|+.+.|.+.+++|++++|+.......|+-+|..+|++.+|...++++...-- ++ +-. ..+..++.++...+ T Consensus 82 ---~Ge~~~A~e~YrkAlsl~p~~GdVLNNYG~FLC~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A 158 (250) T COG3063 82 ---LGENDLADESYRKALSLAPNNGDVLNNYGAFLCAQGRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQA 158 (250) T ss_pred ---CCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHH T ss_conf ---07755679999999852988640552106999708986899999999871888777530454457888643770447 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 9999999982689836899999999987998899999887866-099989999999999985799899999999996 Q gi|254781033|r 311 LKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMK-IAPRKEIFLLLAQIEQANSHNTDKILYWTQSAL 386 (492) Q Consensus 311 l~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~-~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~ 386 (492) ...+++.+..+|+++.....+++..+..|++..|+.+++.... ..++.+.+.|-.+|+... ||...+..|=.+-. T Consensus 159 ~~~l~raL~~dp~~~~~~l~~a~~~~~~~~y~~Ar~~~~~~~~~~~~~A~sL~L~iriak~~-gd~~~a~~Y~~qL~ 234 (250) T COG3063 159 EEYLKRALELDPQFPPALLELARLHYKAGDYAPARLYLERYQQRGGAQAESLLLGIRIAKRL-GDRAAAQRYQAQLQ 234 (250) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH-CCHHHHHHHHHHHH T ss_conf 99999998837578767899879887531301899999999702566589999899999996-54889999999999 No 34 >KOG2003 consensus Probab=99.35 E-value=5.2e-09 Score=88.30 Aligned_cols=217 Identities=19% Similarity=0.126 Sum_probs=100.9 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 32222100016899999972688505799986545521588699999999888421269899999999999525745699 Q gi|254781033|r 129 IALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAI 208 (492) Q Consensus 129 aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~ 208 (492) .+...||++.|.+.|+.++..+.....++..+...+...|+.++|+..+-+++.+--++..++..+..+|....|..+|+ T Consensus 499 ~~f~ngd~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~klh~il~nn~evl~qianiye~led~aqai 578 (840) T KOG2003 499 IAFANGDLDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKLHAILLNNAEVLVQIANIYELLEDPAQAI 578 (840) T ss_pred EEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHH T ss_conf 44415868999999999971742889999971123987447899999999999999702999999999999854879999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99999995023103578999999999999669989999999875431335279999999999973994789999999998 Q gi|254781033|r 209 TFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWK 288 (492) Q Consensus 209 ~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~ 288 (492) +++-++...-|..+. .+..++..+-++||...|.+..-+..+.-|.++.....++..|+...-..++..++|++-- T Consensus 579 e~~~q~~slip~dp~----ilskl~dlydqegdksqafq~~ydsyryfp~nie~iewl~ayyidtqf~ekai~y~ekaal 654 (840) T KOG2003 579 ELLMQANSLIPNDPA----ILSKLADLYDQEGDKSQAFQCHYDSYRYFPCNIETIEWLAAYYIDTQFSEKAINYFEKAAL 654 (840) T ss_pred HHHHHHCCCCCCCHH----HHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999985015889889----9999998861036444545664301023676237999999888764778999999988873 Q ss_pred HCCCHHHHH---HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 328989999---99985078958999999999982689836899999999987998899999887 Q gi|254781033|r 289 VNPHPEIAN---IYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAML 350 (492) Q Consensus 289 ~~p~p~La~---a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~ 350 (492) .+|+..-+. +-..-+.|++..++..++....+.|.+.+.+..|.|++-+.|. ..|+++..+ T Consensus 655 iqp~~~kwqlmiasc~rrsgnyqka~d~yk~~hrkfpedldclkflvri~~dlgl-~d~key~~k 718 (840) T KOG2003 655 IQPNQSKWQLMIASCFRRSGNYQKAFDLYKDIHRKFPEDLDCLKFLVRIAGDLGL-KDAKEYADK 718 (840) T ss_pred CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCC-HHHHHHHHH T ss_conf 4853789999999998753648889999999887585501999999998400264-468999999 No 35 >KOG0547 consensus Probab=99.35 E-value=1.7e-08 Score=84.39 Aligned_cols=277 Identities=15% Similarity=0.060 Sum_probs=219.9 Q ss_pred HHHHHHHHHHHHHHHHHCC---HHHHHHHHHHHHCCC--CCCCC---------HHHHHHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 9999999999999997210---047879999863013--46661---------589999987532222100016899999 Q gi|254781033|r 80 NYDKGYKALYTGLMSIAAH---NIPLARKMHSYVSQQ--HTFHN---------EYLVYLLEVQIALAERQYNIAHEKLEM 145 (492) Q Consensus 80 r~~k~~~al~~gl~al~~G---d~~~A~k~~~~a~~~--~~~~~---------~~L~~Ll~A~aA~~~gd~~~A~~~~~~ 145 (492) .-.+++..+..++-++-.| .+..|--...+.... ..... -..++++.+--..+.||..++.+-|.. T Consensus 272 ~~~ksDa~l~~~l~~l~~~~~e~Y~~a~~~~te~~~~~~~~~~~n~~d~~le~~A~al~~~gtF~fL~g~~~~a~~d~~~ 351 (606) T KOG0547 272 KSDKSDAALAEALEALEKGLEEGYLKAYDKATEECLGSESSLSVNEIDAELEYMAEALLLRGTFHFLKGDSLGAQEDFDA 351 (606) T ss_pred CCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 77621055888999998407253788889998886533431112232126789999998761022314881011666999 Q ss_pred HHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHH Q ss_conf 97268850579998654552158869999999988842126989999999999952574569999999995023103578 Q gi|254781033|r 146 MLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNR 225 (492) Q Consensus 146 ~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~ 225 (492) .+..+++..--+..++..+++..|.+.-.+.+.++..++|.++.+....++++.-.+++++|..-+++.+...|.+--. T Consensus 352 ~I~l~~~~~~lyI~~a~~y~d~~~~~~~~~~F~~A~~ldp~n~dvYyHRgQm~flL~q~e~A~aDF~Kai~L~pe~~~~- 430 (606) T KOG0547 352 AIKLDPAFNSLYIKRAAAYADENQSEKMWKDFNKAEDLDPENPDVYYHRGQMRFLLQQYEEAIADFQKAISLDPENAYA- 430 (606) T ss_pred HHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH- T ss_conf 9845765523899999998601130889999899874498898325767789999998999998799876058253379- Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH---------HH Q ss_conf 999999999999669989999999875431335279999999999973994789999999998328989---------99 Q gi|254781033|r 226 AILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE---------IA 296 (492) Q Consensus 226 a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~---------La 296 (492) ....+.....++...+....+.++.+.-|..++.....|.+|..++++.+|.+.+.++....|..+ +. T Consensus 431 ---~iQl~~a~Yr~~k~~~~m~~Fee~kkkFP~~~Evy~~fAeiLtDqqqFd~A~k~YD~ai~LE~~~~~~~v~~~plV~ 507 (606) T KOG0547 431 ---YIQLCCALYRQHKIAESMKTFEEAKKKFPNCPEVYNLFAEILTDQQQFDKAVKQYDKAIELEPREHLIIVNAAPLVH 507 (606) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHH T ss_conf ---99999999998789999999999998688982377899999831776778999888888506420113246054430 Q ss_pred HHHHHH-CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHH Q ss_conf 999985-0789589999999999826898368999999999879988999998878660999899 Q gi|254781033|r 297 NIYTHL-LSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEI 360 (492) Q Consensus 297 ~a~~~~-~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~ 360 (492) .+...+ +.++...+++.+++.++.+|....+...||..-++.|+.++|.+.++++..+.-+..- T Consensus 508 Ka~l~~qwk~d~~~a~~Ll~KA~e~Dpkce~A~~tlaq~~lQ~~~i~eAielFEksa~lArt~~E 572 (606) T KOG0547 508 KALLVLQWKEDINQAENLLRKAIELDPKCEQAYETLAQFELQRGKIDEAIELFEKSAQLARTESE 572 (606) T ss_pred HHHHHHCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 46764351654899999999987259447899999989999886599999999999999885999 No 36 >KOG4162 consensus Probab=99.34 E-value=1.5e-07 Score=77.01 Aligned_cols=249 Identities=16% Similarity=0.143 Sum_probs=158.9 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 1689999997268850579998654552158869999999988842-126989999999999952574569999999995 Q gi|254781033|r 138 IAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDI-SPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK 216 (492) Q Consensus 138 ~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~-~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~ 216 (492) ++.+.++...+.+++....+..+..++--+++.+.|+.++++++++ ..+++.++..|+.+...++++.+|+++.+..+. T Consensus 462 kslqale~av~~d~~dp~~if~lalq~A~~R~l~sAl~~~~eaL~l~~~~~~~~whLLALvlSa~kr~~~Al~vvd~al~ 541 (799) T KOG4162 462 KSLQALEEAVQFDPTDPLVIFYLALQYAEQRQLTSALDYAREALALNRGDSAKAWHLLALVLSAQKRLKEALDVVDAALE 541 (799) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999986189982489999999999976999999999998715776488999999999622416778999999999 Q ss_pred CCCCH--------------HHHHHHH-------HHHH-----------H-------HHHHHCCCHHHHHHHHHHHHHCC- Q ss_conf 02310--------------3578999-------9999-----------9-------99996699899999998754313- Q gi|254781033|r 217 NAKEW--------------NRNRAIL-------LIAR-----------S-------LENADKGDMIASYHDAIESLKLC- 256 (492) Q Consensus 217 ~~~~~--------------~~~~a~l-------~~~~-----------a-------~~~~~~~~~~~A~~~~~~a~~~~- 256 (492) ..+++ +++...+ .+-. + ...+..++..++.+..+++..+. T Consensus 542 E~~~N~~l~~~~~~i~~~~~~~e~~l~t~~~~L~~we~~~~~q~~~~~g~~~~lk~~l~la~~q~~~a~s~sr~ls~l~a 621 (799) T KOG4162 542 EFGDNHVLMDGKIHIELTFNDREEALDTCIHKLALWEAEYGVQQTLDEGKLLRLKAGLHLALSQPTDAISTSRYLSSLVA 621 (799) T ss_pred HHHHHHHHCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHH T ss_conf 74122200101110122116588888889999999873116766541345665412114376565453034689889998 Q ss_pred ------------C--------C-----CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH-HHHHH--HHHHCCCCHH Q ss_conf ------------3--------5-----27999999999997399478999999999832898-99999--9985078958 Q gi|254781033|r 257 ------------D--------N-----SIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP-EIANI--YTHLLSENTV 308 (492) Q Consensus 257 ------------p--------~-----~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p-~La~a--~~~~~~~~~~ 308 (492) | + ....+...+..+...++..++..-|.++.+..|-- ..... ......++-. T Consensus 622 ~~~~~~~se~~Lp~s~~~~~~~~~~~~~~~lwllaa~~~~~~~~~~~a~~CL~Ea~~~~~l~~~~~~~~G~~~~~~~~~~ 701 (799) T KOG4162 622 SQLKSAGSELKLPSSTVLPGPDSLWYLLQKLWLLAADLFLLSGNDDEARSCLLEASKIDPLSASVYYLRGLLLEVKGQLE 701 (799) T ss_pred HHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 65542156101674101578873278999999999999885388328899999987503133899998648999877468 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHH--HHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 99999999998268983689999999998799889999--988786609998-999999999998579989999999999 Q gi|254781033|r 309 GKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHA--KAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSA 385 (492) Q Consensus 309 ~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~--~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA 385 (492) ++...|...+..+|+|+.+..++|+++++.|+-.-|.. .+..+++.+|+. ..|..++.+.+.+ ||...+-+-+.-| T Consensus 702 EA~~af~~Al~ldP~hv~s~~Ala~~lle~G~~~la~~~~~L~dalr~dp~n~eaW~~LG~v~k~~-Gd~~~Aaecf~aa 780 (799) T KOG4162 702 EAKEAFLVALALDPDHVPSMTALAELLLELGSPRLAEKRSLLSDALRLDPLNHEAWYYLGEVFKKL-GDSKQAAECFQAA 780 (799) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CCHHHHHHHHHHH T ss_conf 999999998732888747789999999971884207789999999860887779999999999974-6667899999999 Q ss_pred HC Q ss_conf 62 Q gi|254781033|r 386 LH 387 (492) Q Consensus 386 ~~ 387 (492) +. T Consensus 781 ~q 782 (799) T KOG4162 781 LQ 782 (799) T ss_pred HH T ss_conf 86 No 37 >KOG1173 consensus Probab=99.30 E-value=1.8e-08 Score=84.15 Aligned_cols=264 Identities=11% Similarity=0.061 Sum_probs=171.7 Q ss_pred HHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 72100478799998630134666158999998753222210001689999997268850579998654552158869999 Q gi|254781033|r 95 IAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQ 174 (492) Q Consensus 95 l~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~ 174 (492) .-.+++..-.+.....-...+...+-+..-+ | .....|+....-..=.++++.-|...+.+...+.-++..|..+.|+ T Consensus 255 y~~c~f~~c~kit~~lle~dpfh~~~~~~~i-a-~l~el~~~n~Lf~lsh~LV~~yP~~a~sW~aVg~YYl~i~k~seAR 332 (611) T KOG1173 255 YYGCRFKECLKITEELLEKDPFHLPCLPLHI-A-CLYELGKSNKLFLLSHKLVDLYPSKALSWFAVGCYYLMIGKYSEAR 332 (611) T ss_pred HHCCHHHHHHHHHHHHHHHCCCCCCHHHHHH-H-HHHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCCHHHH T ss_conf 9717199999985998732789851399999-9-9998165216999888798728887731465778999814867999 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99998884212698999999999995257456999999999502310357899999999999966998999999987543 Q gi|254781033|r 175 RYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLK 254 (492) Q Consensus 175 ~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~ 254 (492) .++-|+..++|....++...+-.+...++-++|...+..+-+..+..... .+..+.++...+....|.+.+.+|.. T Consensus 333 ry~SKat~lD~~fgpaWl~fghsfa~e~EhdQAmaaY~tAarl~~G~hlP----~LYlgmey~~t~n~kLAe~Ff~~A~a 408 (611) T KOG1173 333 RYFSKATTLDPTFGPAWLAFGHSFAGEGEHDQAMAAYFTAARLMPGCHLP----SLYLGMEYMRTNNLKLAEKFFKQALA 408 (611) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCH----HHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 99888760376532799987677650105788999999999866677631----78999999874239899999999871 Q ss_pred CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH---------CCCHHHHHH-HHHHCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 13352799999999999739947899999999983---------289899999-99850789589999999999826898 Q gi|254781033|r 255 LCDNSIMASICAAKSLISQNKKRKAEVILEKIWKV---------NPHPEIANI-YTHLLSENTVGKLKRALRLEEINKES 324 (492) Q Consensus 255 ~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~---------~p~p~La~a-~~~~~~~~~~~al~~~~~l~~~~P~~ 324 (492) ..|..+-..--.+-+....+.+.+|.+++++++.. .|+|-+... -+..+.+...+++..+++.+...|++ T Consensus 409 i~P~Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l~~k~ 488 (611) T KOG1173 409 IAPSDPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQKALLLSPKD 488 (611) T ss_pred CCCCCCHHHHHHHHEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 18986044331442033275468899999999887663054322115778767899999766799999999998718776 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 3689999999998799889999988786609998999999 Q gi|254781033|r 325 VESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLL 364 (492) Q Consensus 325 ~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ll 364 (492) ...+-.+|-++.-.|+++.|.+++-+++-+.|+...+.-| T Consensus 489 ~~~~asig~iy~llgnld~Aid~fhKaL~l~p~n~~~~~l 528 (611) T KOG1173 489 ASTHASIGYIYHLLGNLDKAIDHFHKALALKPDNIFISEL 528 (611) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 5688889999999658077999999987148752899999 No 38 >KOG1174 consensus Probab=99.28 E-value=2.6e-08 Score=82.84 Aligned_cols=276 Identities=13% Similarity=0.071 Sum_probs=193.6 Q ss_pred CCHHHHHHHHHHHHHHHHCCHHHHHHHHHH--HHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 615899999875322221000168999999--726885057999865455215886999999998884212698999999 Q gi|254781033|r 117 HNEYLVYLLEVQIALAERQYNIAHEKLEMM--LQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAV 194 (492) Q Consensus 117 ~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~--~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L 194 (492) ..+-+..-..+.+....+++..+...+--+ ...-+.+..-+..++...-..||..+|...++++.-++|.+....... T Consensus 193 ~~dwls~wika~Aq~~~~~hs~a~~t~l~le~~~~lr~NvhLl~~lak~~~~~Gdn~~a~~~Fe~~~~~dpy~i~~MD~Y 272 (564) T KOG1174 193 HFDWLSKWIKALAQMFNFKHSDASQTFLMLHDNTTLRCNEHLMMALGKCLYYNGDYFQAEDIFSSTLCANPDNVEAMDLY 272 (564) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 74189999999999970353036658887876540676088998875565341675689999988750781555568999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHCCCC------------HHH---HHH---------------HHHHHHHHHHHHCCCHHH Q ss_conf 99999525745699999999950231------------035---789---------------999999999996699899 Q gi|254781033|r 195 VQQYVLAKEWSRAITFLNQKKKNAKE------------WNR---NRA---------------ILLIARSLENADKGDMIA 244 (492) Q Consensus 195 ~~l~~~~gdw~~A~~~l~~~~~~~~~------------~~~---~~a---------------~l~~~~a~~~~~~~~~~~ 244 (492) +-+..+.|+.+....+.........- +.+ .++ ..++..+..+.+.+.+.+ T Consensus 273 a~LL~~eg~~e~~~~L~~~Lf~~~~~ta~~wfV~~~~l~~~K~~~rAL~~~eK~I~~~~r~~~alilKG~lL~~~~R~~~ 352 (564) T KOG1174 273 AVLLGQEGGCEQDSALMDYLFAKVKYTASHWFVHAQLLYDEKKFERALNFVEKCIDSEPRNHEALILKGRLLIALERHTQ 352 (564) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCEEEEEEHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHCCHHHHHCCCHHH T ss_conf 99998604876789999999852202324023431014577668999999988761585533678861399885456288 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHH-HHHH-HHC---CCCHHHHHHHHHHHHH Q ss_conf 9999987543133527999999999997399478999999999832898999-9999-850---7895899999999998 Q gi|254781033|r 245 SYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIA-NIYT-HLL---SENTVGKLKRALRLEE 319 (492) Q Consensus 245 A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La-~a~~-~~~---~~~~~~al~~~~~l~~ 319 (492) |.-.++.|..+.|........+...|...|..++|.-.-..+.+..|...-. ..++ .+- +-..+.+.+.+++-++ T Consensus 353 A~IaFR~Aq~Lap~rL~~Y~GL~hsYLA~~~~kEA~~~An~~~~~~~~sA~~LtL~g~~V~~~dp~~rEKAKkf~ek~L~ 432 (564) T KOG1174 353 AVIAFRTAQMLAPYRLEIYRGLFHSYLAQKRFKEANALANWTIRLFQNSARSLTLFGTLVLFPDPRMREKAKKFAEKSLK 432 (564) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHCC T ss_conf 99999998742514578998899999961408999999999999765314467663000115685067999999984203 Q ss_pred HCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 26898368999999999879988999998878660999899999999999857998999999999962889995 Q gi|254781033|r 320 INKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPL 393 (492) Q Consensus 320 ~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~ 393 (492) ++|+...+-..+|+++...|.++.+...|++.+...||...+..++++..+. +..+.+-+....|++-.|... T Consensus 433 ~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~~D~~LH~~Lgd~~~A~-Ne~Q~am~~y~~ALr~dP~~~ 505 (564) T KOG1174 433 INPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLIIFPDVNLHNHLGDIMRAQ-NEPQKAMEYYYKALRQDPKSK 505 (564) T ss_pred CCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHCCCCCCH T ss_conf 5886089999999999862765418999999874256208999999999986-107999999999970596407 No 39 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=99.27 E-value=8.1e-09 Score=86.81 Aligned_cols=259 Identities=13% Similarity=0.041 Sum_probs=210.8 Q ss_pred HHHHHHHHHHHCCC-----CCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 47879999863013-----4666158999998753222210001689999997268850579998654552158869999 Q gi|254781033|r 100 IPLARKMHSYVSQQ-----HTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQ 174 (492) Q Consensus 100 ~~~A~k~~~~a~~~-----~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~ 174 (492) +..|-+.+.+.... ++....++.+=+.+---.+.|++.+|.+=+.+.++-.|.-...+...+..-+..|+.+.+. T Consensus 338 y~~aa~~F~~~Ld~~nede~L~e~~A~~~~~~G~F~~L~~~~~~A~~Dl~K~IEL~P~~~qSYI~~A~~~~~~~~~~~~~ 417 (649) T TIGR00990 338 YEEAARAFEKALDLGNEDEKLGEKEAIALNLRGIFKCLKGKHLEALADLSKSIELDPRVTQSYIKRASINLELGDPDKAE 417 (649) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCHHHH T ss_conf 78999999998404775112114555544124222331288157899877654227873178999999974157842258 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99998884212698999999999995257456999999999502310357899999999999966998999999987543 Q gi|254781033|r 175 RYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLK 254 (492) Q Consensus 175 ~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~ 254 (492) ..+++|++.+++++.++...++++.-.|++++|.+-+++.....++.-- -+..++......+..+.....+.++.+ T Consensus 418 ~~F~~Al~~~~~~~~~YYHRaQl~FI~~~f~~A~KDy~K~~~LDp~~IF----~~IQL~~~~Y~~~~~~~~~~~F~~~~~ 493 (649) T TIGR00990 418 EDFDKALKKNSEDPDIYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIF----SYIQLGVLQYKLGSIASSMALFRRCKK 493 (649) T ss_pred HHHHHHHHCCCCCCCCEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH----HHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 8999998618898751101778998873077675457766432711022----689999988630344347899999986 Q ss_pred CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC--C-------HHHHHHHHHHC---CCCHHHHHHHHHHHHHHCC Q ss_conf 1335279999999999973994789999999998328--9-------89999999850---7895899999999998268 Q gi|254781033|r 255 LCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP--H-------PEIANIYTHLL---SENTVGKLKRALRLEEINK 322 (492) Q Consensus 255 ~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p--~-------p~La~a~~~~~---~~~~~~al~~~~~l~~~~P 322 (492) .-|..+....-++.+|..++++.+|.+-+..+....- + |-+.-+.+..+ ..+..++-+.+|+.+..+| T Consensus 494 ~Fp~~P~V~Nyy~ElL~D~~~F~~A~~~fD~AI~LE~~~~~~~~~v~PL~~Ka~~L~~nP~~~~f~EA~~L~EKA~~~DP 573 (649) T TIGR00990 494 NFPEAPDVYNYYGELLLDKNKFDEAIEKFDLAIELEKELKPIYVNVLPLINKALALFQNPWKQDFIEAENLLEKALIIDP 573 (649) T ss_pred CCCCCCCCCCHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCC T ss_conf 17788885305788861436799999985689998750598076313367789997626521007899999888630488 Q ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 9836899999999987998899999887866099989999 Q gi|254781033|r 323 ESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFL 362 (492) Q Consensus 323 ~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ 362 (492) ....+...||++.++.|+..+|..+++++.++.-|..-.. T Consensus 574 ~~~~A~~~lAQl~LQQ~~v~EAl~~FE~~A~LART~~E~~ 613 (649) T TIGR00990 574 ECDIAVATLAQLLLQQGDVDEALKLFERAAELARTEGELV 613 (649) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4147899999987623468999999987787630167789 No 40 >KOG1915 consensus Probab=99.27 E-value=2.2e-07 Score=75.66 Aligned_cols=303 Identities=17% Similarity=0.041 Sum_probs=227.0 Q ss_pred HHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH--HHHHCCHHHHHHHHHHHHCCC-CHHH-HHHHHHHHHHCCC Q ss_conf 99721004787999986301346661589999987532--222100016899999972688-5057-9998654552158 Q gi|254781033|r 93 MSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIA--LAERQYNIAHEKLEMMLQIPA-TREF-AVYSLYFESCRIG 168 (492) Q Consensus 93 ~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA--~~~gd~~~A~~~~~~~~~~~~-~~~~-~~~gL~~~al~~g 168 (492) .-.-.|+...+++.+.++... ..++..--.|..|.+- ..+..++.++-.|+-+++.=| .+.. -..+....-.+-| T Consensus 216 FE~k~g~~~~aR~VyerAie~-~~~d~~~e~lfvaFA~fEe~qkE~ERar~iykyAld~~pk~raeeL~k~~~~fEKqfG 294 (677) T KOG1915 216 FEEKHGNVALARSVYERAIEF-LGDDEEAEILFVAFAEFEERQKEYERARFIYKYALDHIPKGRAEELYKKYTAFEKQFG 294 (677) T ss_pred HHHHCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHC T ss_conf 987527188899999999998-6137899999999999999888999999999999874686509999999999999716 Q ss_pred CHHH---H-----HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHH-----HH--HHHHH Q ss_conf 8699---9-----99999888421269899999999999525745699999999950231035789-----99--99999 Q gi|254781033|r 169 DLNS---A-----QRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRA-----IL--LIARS 233 (492) Q Consensus 169 d~~~---A-----~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a-----~l--~~~~a 233 (492) |... + .-.+++.++.+|-+-.++-..+.+....|+-+....++++++.+.|.....+. .+ .++.. T Consensus 295 d~~gIEd~Iv~KRk~qYE~~v~~np~nYDsWfdylrL~e~~g~~~~Ire~yErAIanvpp~~ekr~W~RYIYLWinYaly 374 (677) T KOG1915 295 DKEGIEDAIVGKRKFQYEKEVSKNPYNYDSWFDYLRLEESVGDKDRIRETYERAIANVPPASEKRYWRRYIYLWINYALY 374 (677) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 51116888766565379999872987706899999999722778899999999870599446778899999999999999 Q ss_pred HHHHHCCCHHHHHHHHHHHHHCCCC----CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHC--CCCH Q ss_conf 9999669989999999875431335----279999999999973994789999999998328989999999850--7895 Q gi|254781033|r 234 LENADKGDMIASYHDAIESLKLCDN----SIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLL--SENT 307 (492) Q Consensus 234 ~~~~~~~~~~~A~~~~~~a~~~~p~----~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~--~~~~ 307 (492) ..+...|....++.+...+++-|. |.-.++++|...+++.+...|.++|-.+....|.+-+-..|.++. ..+. T Consensus 375 -eEle~ed~ertr~vyq~~l~lIPHkkFtFaKiWlmyA~feIRq~~l~~ARkiLG~AIG~cPK~KlFk~YIelElqL~ef 453 (677) T KOG1915 375 -EELEAEDVERTRQVYQACLDLIPHKKFTFAKIWLMYAQFEIRQLNLTGARKILGNAIGKCPKDKLFKGYIELELQLREF 453 (677) T ss_pred -HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHH T ss_conf -9998312788999999998636743112999999999999987032789999888743497166777787899987657 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CHHHHHHHHHH-HHHHCCCHHHHHHHHHHH Q ss_conf 8999999999982689836899999999987998899999887866099-98999999999-998579989999999999 Q gi|254781033|r 308 VGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAP-RKEIFLLLAQI-EQANSHNTDKILYWTQSA 385 (492) Q Consensus 308 ~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P-~~~~~~llA~i-e~~e~~d~~~ar~wl~rA 385 (492) ..-.+.+++.+.-.|.|...+.-.|++-...|+++.||..++-++.+.- +-...+.-+=| -+-+.|..+++|...+|- T Consensus 454 DRcRkLYEkfle~~Pe~c~~W~kyaElE~~LgdtdRaRaifelAi~qp~ldmpellwkaYIdFEi~~~E~ekaR~LYerl 533 (677) T KOG1915 454 DRCRKLYEKFLEFSPENCYAWSKYAELETSLGDTDRARAIFELAISQPALDMPELLWKAYIDFEIEEGEFEKARALYERL 533 (677) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 89999999998169276589998999998740078888999998338532037899987600454002578899999999 Q ss_pred H-CCCCCCCEECC Q ss_conf 6-28899952756 Q gi|254781033|r 386 L-HAMPDPLWISD 397 (492) Q Consensus 386 ~-~a~~dp~W~c~ 397 (492) + +.+.-++|+.- T Consensus 534 L~rt~h~kvWisF 546 (677) T KOG1915 534 LDRTQHVKVWISF 546 (677) T ss_pred HHHCCCCHHHHHH T ss_conf 8745464678869 No 41 >KOG2003 consensus Probab=99.24 E-value=6.6e-07 Score=71.89 Aligned_cols=204 Identities=12% Similarity=-0.002 Sum_probs=116.1 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 45521588699999999888421269899999999999525745699999999950231035789999999999996699 Q gi|254781033|r 162 FESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGD 241 (492) Q Consensus 162 ~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~ 241 (492) ..+...||++.|...+.+++..+..-..++...+..+..+|+.++|++.+-+..... ...+..++..+..+.-..+ T Consensus 498 n~~f~ngd~dka~~~ykeal~ndasc~ealfniglt~e~~~~ldeald~f~klh~il----~nn~evl~qianiye~led 573 (840) T KOG2003 498 NIAFANGDLDKAAEFYKEALNNDASCTEALFNIGLTAEALGNLDEALDCFLKLHAIL----LNNAEVLVQIANIYELLED 573 (840) T ss_pred CEEEECCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCHHHHCCHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHC T ss_conf 544415868999999999971742889999971123987447899999999999999----7029999999999998548 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH-HHHHHH--HHCCCCHHHHHHHHHHHH Q ss_conf 89999999875431335279999999999973994789999999998328989-999999--850789589999999999 Q gi|254781033|r 242 MIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE-IANIYT--HLLSENTVGKLKRALRLE 318 (492) Q Consensus 242 ~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~-La~a~~--~~~~~~~~~al~~~~~l~ 318 (492) +..|.+.+.++..+-|..+...--++.+|-+.|+..+|-...-..+.-.|..- ...-.+ ++...-...++..+++.. T Consensus 574 ~aqaie~~~q~~slip~dp~ilskl~dlydqegdksqafq~~ydsyryfp~nie~iewl~ayyidtqf~ekai~y~ekaa 653 (840) T KOG2003 574 PAQAIELLMQANSLIPNDPAILSKLADLYDQEGDKSQAFQCHYDSYRYFPCNIETIEWLAAYYIDTQFSEKAINYFEKAA 653 (840) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 79999999985015889889999999886103644454566430102367623799999988876477899999998887 Q ss_pred HHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHH-HHHHHHH Q ss_conf 82689836899999999987998899999887866099989999-9999999 Q gi|254781033|r 319 EINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFL-LLAQIEQ 369 (492) Q Consensus 319 ~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~-llA~ie~ 369 (492) -..|+.....+.+|....+.|+|.+|-+.+.+.-+.-|..-.|+ .+.+|.- T Consensus 654 liqp~~~kwqlmiasc~rrsgnyqka~d~yk~~hrkfpedldclkflvri~~ 705 (840) T KOG2003 654 LIQPNQSKWQLMIASCFRRSGNYQKAFDLYKDIHRKFPEDLDCLKFLVRIAG 705 (840) T ss_pred HCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC T ss_conf 3485378999999999875364888999999988758550199999999840 No 42 >KOG0624 consensus Probab=99.23 E-value=3.6e-08 Score=81.79 Aligned_cols=266 Identities=18% Similarity=0.106 Sum_probs=197.4 Q ss_pred HHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHH---HH---------- Q ss_conf 99999721004787999986301346661589999987532222100016899999972688505---79---------- Q gi|254781033|r 90 TGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATRE---FA---------- 156 (492) Q Consensus 90 ~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~---~~---------- 156 (492) ++-..++-|....|...+.++..+++ +=.-+.+-.+.....+|..+.|..-|...++..+... .+ T Consensus 78 RaT~yLAmGksk~al~Dl~rVlelKp--DF~~ARiQRg~vllK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~ 155 (504) T KOG0624 78 RATVYLAMGKSKAALQDLSRVLELKP--DFMAARIQRGVVLLKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEH 155 (504) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHCCC--CHHHHHHHHCHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 77777652577400330899986183--1888888722035532528889879999972598730268899887768999 Q ss_pred --HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf --998654552158869999999988842126989999999999952574569999999995023103578999999999 Q gi|254781033|r 157 --VYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSL 234 (492) Q Consensus 157 --~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~ 234 (492) ++.....+...||...++.+....+++.|=...........|...|+...|+.-+..+-+...+.... .+.... T Consensus 156 ~~l~~ql~s~~~~GD~~~ai~~i~~llEi~~Wda~l~~~Rakc~i~~~e~k~AI~Dlk~askLs~DnTe~----~ykis~ 231 (504) T KOG0624 156 WVLVQQLKSASGSGDCQNAIEMITHLLEIQPWDASLRQARAKCYIAEGEPKKAIHDLKQASKLSQDNTEG----HYKISQ 231 (504) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH----HHHHHH T ss_conf 9999999988438746559999988885371336899999888885386888888899987404665388----898998 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCHHHH------------HHHHHHHHHCCCHHHHHHHHHHHHHHCCC-HHHHH-H-- Q ss_conf 99966998999999987543133527999------------99999999739947899999999983289-89999-9-- Q gi|254781033|r 235 ENADKGDMIASYHDAIESLKLCDNSIMAS------------ICAAKSLISQNKKRKAEVILEKIWKVNPH-PEIAN-I-- 298 (492) Q Consensus 235 ~~~~~~~~~~A~~~~~~a~~~~p~~~~a~------------~~~a~~l~~~g~~~~A~~~le~a~~~~p~-p~La~-a-- 298 (492) .+..-|+...++...++-++++|+.--.. +--+...++.+.+.++...-++..+.+|. +.+-. . T Consensus 232 L~Y~vgd~~~sL~~iRECLKldpdHK~Cf~~YKklkKv~K~les~e~~ie~~~~t~cle~ge~vlk~ep~~~~ir~~~~r 311 (504) T KOG0624 232 LLYTVGDAENSLKEIRECLKLDPDHKLCFPFYKKLKKVVKSLESAEQAIEEKHWTECLEAGEKVLKNEPEETMIRYNGFR 311 (504) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCEEEEEEH T ss_conf 99766118878999999980695333677889999999999998999986224999999878987159865614212000 Q ss_pred -HHHH--CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf -9985--07895899999999998268983689999999998799889999988786609998999 Q gi|254781033|r 299 -YTHL--LSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIF 361 (492) Q Consensus 299 -~~~~--~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~ 361 (492) ...+ ..++..++++.....+...|++.+.+.-.|++++..+.|+.|..-++++.+.+++.+.. T Consensus 312 ~~c~C~~~d~~~~eAiqqC~evL~~d~~dv~~l~dRAeA~l~dE~YD~AI~dye~A~e~n~sn~~~ 377 (504) T KOG0624 312 VLCTCYREDEQFGEAIQQCKEVLDIDPDDVQVLCDRAEAYLGDEMYDDAIHDYEKALELNESNTRA 377 (504) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 000000566777899999999971590378999999998762677888999899987448420788 No 43 >KOG1155 consensus Probab=99.19 E-value=1.4e-06 Score=69.37 Aligned_cols=25 Identities=16% Similarity=0.228 Sum_probs=12.9 Q ss_pred HHHHHCCHHHHHHHHHHHHCCCCHH Q ss_conf 2222100016899999972688505 Q gi|254781033|r 130 ALAERQYNIAHEKLEMMLQIPATRE 154 (492) Q Consensus 130 A~~~gd~~~A~~~~~~~~~~~~~~~ 154 (492) ...+.|++.+...|+.+.+.+|-+. T Consensus 272 ~y~~rDfD~a~s~Feei~knDPYRl 296 (559) T KOG1155 272 SYNQRDFDQAESVFEEIRKNDPYRL 296 (559) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 8612207889999999985497732 No 44 >KOG1070 consensus Probab=99.19 E-value=2.8e-07 Score=74.87 Aligned_cols=243 Identities=11% Similarity=0.048 Sum_probs=184.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC-CCCH----HHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 6899999972688505799986545521588699999999888421-2698----9999999999952574569999999 Q gi|254781033|r 139 AHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDIS-PDAP----WVTEAVVQQYVLAKEWSRAITFLNQ 213 (492) Q Consensus 139 A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~-P~~~----~a~~~L~~l~~~~gdw~~A~~~l~~ 213 (492) ..+-|..++...|+...-+...|...+..++.+.|++.+++++.-- +.-. -++.+++.+...-|.-+...+++++ T Consensus 1443 saeDferlvrssPNSSi~WI~YMaf~LelsEiekAR~iaerAL~tIN~REeeEKLNiWiA~lNlEn~yG~eesl~kVFeR 1522 (1710) T KOG1070 1443 SAEDFERLVRSSPNSSILWIRYMAFHLELSEIEKARKIAERALKTINFREEEEKLNIWIAYLNLENAYGTEESLKKVFER 1522 (1710) T ss_pred CHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 77889999860998402899999988502456778999999861178641677777899998678752768999999999 Q ss_pred HHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-- Q ss_conf 995023103578999999999999669989999999875431335279999999999973994789999999998328-- Q gi|254781033|r 214 KKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP-- 291 (492) Q Consensus 214 ~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p-- 291 (492) +-+....+. .++.+...+......++|.+.+....+.-.+....+.+|+..++++++...|..+|.++++.-| T Consensus 1523 Acqycd~~~-----V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~ 1597 (1710) T KOG1070 1523 ACQYCDAYT-----VHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLPKQ 1597 (1710) T ss_pred HHHHCCHHH-----HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHH T ss_conf 987433688-----999988799876231048999999999865325689999999851447889999999998646502 Q ss_pred -CHHHHHHHHHH--CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-H---HHHHH Q ss_conf -98999999985--07895899999999998268983689999999998799889999988786609998-9---99999 Q gi|254781033|r 292 -HPEIANIYTHL--LSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-E---IFLLL 364 (492) Q Consensus 292 -~p~La~a~~~~--~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~---~~~ll 364 (492) |-++..-++.+ ..+++......++.++..+|.-.+.+..+.+.-+..|+-.-.|..+++++.+.-+. . .|--. T Consensus 1598 eHv~~IskfAqLEFk~GDaeRGRtlfEgll~ayPKRtDlW~VYid~eik~~~~~~vR~lfeRvi~l~l~~kkmKfffKkw 1677 (1710) T KOG1070 1598 EHVEFISKFAQLEFKYGDAERGRTLFEGLLSAYPKRTDLWSVYIDMEIKHGDIKYVRDLFERVIELKLSIKKMKFFFKKW 1677 (1710) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 36889999999876518843567898877763854302799999988704777999999999986477866769999999 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHC Q ss_conf 99999857998999999999962 Q gi|254781033|r 365 AQIEQANSHNTDKILYWTQSALH 387 (492) Q Consensus 365 A~ie~~e~~d~~~ar~wl~rA~~ 387 (492) .+- +.+.||++.+.+--+||.. T Consensus 1678 Ley-Ek~~Gde~~vE~VKarA~E 1699 (1710) T KOG1070 1678 LEY-EKSHGDEKNVEYVKARAKE 1699 (1710) T ss_pred HHH-HHHCCCHHHHHHHHHHHHH T ss_conf 998-7740862438999999999 No 45 >KOG1155 consensus Probab=99.15 E-value=2.1e-06 Score=67.95 Aligned_cols=266 Identities=9% Similarity=0.086 Sum_probs=159.9 Q ss_pred CHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHH-----------------------------HHHHCCC Q ss_conf 15899999875322221000168999999726885057999865-----------------------------4552158 Q gi|254781033|r 118 NEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLY-----------------------------FESCRIG 168 (492) Q Consensus 118 ~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~-----------------------------~~al~~g 168 (492) .+++.+.|.+-.....|....|...|...+..-|-..-++..|. ....... T Consensus 162 ~D~fllYL~Gvv~k~~~~~s~A~~sfv~~v~~~P~~W~AWleL~~lit~~e~~~~l~~~l~~~~h~M~~~F~~~a~~el~ 241 (559) T KOG1155 162 KDEFLLYLYGVVLKELGLLSLAIDSFVEVVNRYPWFWSAWLELSELITDIEILSILVVGLPSDMHWMKKFFLKKAYQELH 241 (559) T ss_pred CHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 21389998889998613077889999999963875268999999860529999999853753026999999999988988 Q ss_pred CHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH---------------HHHHHHHH Q ss_conf 869999999988842-12698999999999995257456999999999502310357---------------89999999 Q gi|254781033|r 169 DLNSAQRYATKALDI-SPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRN---------------RAILLIAR 232 (492) Q Consensus 169 d~~~A~~~~~~a~~~-~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~---------------~a~l~~~~ 232 (492) +.+.+++-.+..... -|++.-.....+.++-.+.|+++|+..++...+..| ++-. ..+..++. T Consensus 242 q~~e~~~k~e~l~~~gf~~~~~i~~~~A~~~y~~rDfD~a~s~Feei~knDP-YRl~dmdlySN~LYv~~~~skLs~LA~ 320 (559) T KOG1155 242 QHEEALQKKERLSSVGFPNSMYIKTQIAAASYNQRDFDQAESVFEEIRKNDP-YRLDDMDLYSNVLYVKNDKSKLSYLAQ 320 (559) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8889999999887516874177999999988612207889999999985497-732158787677999721178999999 Q ss_pred HH----------------HHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH--- Q ss_conf 99----------------99966998999999987543133527999999999997399478999999999832898--- Q gi|254781033|r 233 SL----------------ENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP--- 293 (492) Q Consensus 233 a~----------------~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p--- 293 (492) .. -+.-.++.+.|...+++|++++|....++.+.+.-+.+..+...|..-+..+.+.+|.. T Consensus 321 ~v~~idKyR~ETCCiIaNYYSlr~eHEKAv~YFkRALkLNp~~~~aWTLmGHEyvEmKNt~AAi~sYRrAvdi~p~DyRA 400 (559) T KOG1155 321 NVSNIDKYRPETCCIIANYYSLRSEHEKAVMYFKRALKLNPKYLSAWTLMGHEYVEMKNTHAAIESYRRAVDINPRDYRA 400 (559) T ss_pred HHHHHCCCCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHH T ss_conf 99875147752324431378887768999999999983284313789986079898356188999999888149256788 Q ss_pred --HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHH Q ss_conf --99999998507895899999999998268983689999999998799889999988786609998-999999999998 Q gi|254781033|r 294 --EIANIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQA 370 (492) Q Consensus 294 --~La~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~ 370 (492) .|..+|.-+ +=+.=++-.+++.....|+|...+.+||+.+.+.++.++|...+..++...-+. .++.-+|.+.+. T Consensus 401 WYGLGQaYeim--~Mh~YaLyYfqkA~~~kPnDsRlw~aLG~CY~kl~~~~eAiKCykrai~~~dte~~~l~~LakLye~ 478 (559) T KOG1155 401 WYGLGQAYEIM--KMHFYALYYFQKALELKPNDSRLWVALGECYEKLNRLEEAIKCYKRAILLGDTEGSALVRLAKLYEE 478 (559) T ss_pred HHHHHHHHHHH--CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 85206899981--3569999999998724999668999999999985568999999988885246433899999999999 Q ss_pred HCCCHHHHHHHHHHHHC Q ss_conf 57998999999999962 Q gi|254781033|r 371 NSHNTDKILYWTQSALH 387 (492) Q Consensus 371 e~~d~~~ar~wl~rA~~ 387 (492) . +|...+-..+.+-+. T Consensus 479 l-~d~~eAa~~yek~v~ 494 (559) T KOG1155 479 L-KDLNEAAQYYEKYVE 494 (559) T ss_pred H-HHHHHHHHHHHHHHH T ss_conf 8-747999999999999 No 46 >pfam04733 Coatomer_E Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex. Probab=99.15 E-value=8.1e-07 Score=71.20 Aligned_cols=262 Identities=10% Similarity=0.028 Sum_probs=158.1 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCC Q ss_conf 98753222210001689999997268850579998654552158869999999988842126989999999999952574 Q gi|254781033|r 125 LEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEW 204 (492) Q Consensus 125 l~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw 204 (492) ...+.+...|++..+......-...+........-+++-++..|+++..+...... ..|. ..+...++.......+- T Consensus 6 f~irn~Fy~G~Yq~~i~ea~~~~~s~~~~~e~~~~~~Ra~IAlg~~~~vl~ei~~~--~~p~-L~av~~lA~y~~~~~~~ 82 (290) T pfam04733 6 FNVRNYFYLGNYQKAINESDVTSLSEEALVERDVYMYRSYLALGSYQIVISEIKES--AATP-LQAVRLLAEYLNSPSRK 82 (290) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHCCCC--CCCH-HHHHHHHHHHHCCCCCH T ss_conf 78999999669999999986368991357999999999999737703788862678--9831-79999999987484114 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 56999999999502310357899999999999966998999999987543133527999999999997399478999999 Q gi|254781033|r 205 SRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILE 284 (492) Q Consensus 205 ~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le 284 (492) +.+...+...+........ .......+..+...|+..+|++.+.+. +..++..+...++...++.+-|.+.++ T Consensus 83 ~~~~~~l~~~~~~~~~~~~--~~~~~~aati~~~~g~~e~AL~~l~~~-----~~lE~~al~VQi~L~~nR~DLA~ke~~ 155 (290) T pfam04733 83 ESILASLKEWVADSHIGSN--STLRLLAAIIFIHEGDFDDALKHLHKG-----ENLEAMALNVQILLKMHRIDLAEQQLK 155 (290) T ss_pred HHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHCC-----CCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 5699999999874015667--899999999999779999999998558-----966799999999997288367999999 Q ss_pred HHHHHCCCH---HHHHHHHHHCCC--CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 999832898---999999985078--958999999999982689836899999999987998899999887866099989 Q gi|254781033|r 285 KIWKVNPHP---EIANIYTHLLSE--NTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 285 ~a~~~~p~p---~La~a~~~~~~~--~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~ 359 (492) +.-+...+. .|+.+|..+..| +..++.-.|+.+....+..+-.+...|-+.+..|+|++|...|+.+++.+|+.. T Consensus 156 ~~q~~~eDs~l~qLa~awv~l~~Ggek~q~A~yif~EL~~~~~~t~lllng~Av~~m~~~~~~eAe~~L~eAl~~d~~d~ 235 (290) T pfam04733 156 KMQQIDEDATLTQLANAWVKLAVGGEKIQDAYYIFQEFSEKYDSTPLLLNGQAVCCMCLGRYEEAESLLKEALDKDAKDP 235 (290) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH T ss_conf 98843510599999999999972647789899999999852289789999999999874787999999999987288988 Q ss_pred HHH-HHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEEC Q ss_conf 999-9999999857998999999999962889995275 Q gi|254781033|r 360 IFL-LLAQIEQANSHNTDKILYWTQSALHAMPDPLWIS 396 (492) Q Consensus 360 ~~~-llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~c 396 (492) ..+ -|.-+....+++......++.+-....|+..|+- T Consensus 236 dtlaNliv~s~~~gK~~e~~~ryl~qL~~~~p~H~~v~ 273 (290) T pfam04733 236 ETLINLVVCALHLGKPAEVSNRNLSQLKLSHPTHPLVK 273 (290) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHH T ss_conf 99999999999809987899999999986299985999 No 47 >KOG1174 consensus Probab=99.13 E-value=2.8e-08 Score=82.59 Aligned_cols=190 Identities=14% Similarity=0.038 Sum_probs=122.3 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 15886999999998884212698999999999995257456999999999502310357899999999999966998999 Q gi|254781033|r 166 RIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIAS 245 (492) Q Consensus 166 ~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A 245 (492) ...+++.|+.+.+|..+.+|++..++..-+.+..+.++.++|.-.++.+....|- +...+-+....++..+...+| T Consensus 312 ~~K~~~rAL~~~eK~I~~~~r~~~alilKG~lL~~~~R~~~A~IaFR~Aq~Lap~----rL~~Y~GL~hsYLA~~~~kEA 387 (564) T KOG1174 312 DEKKFERALNFVEKCIDSEPRNHEALILKGRLLIALERHTQAVIAFRTAQMLAPY----RLEIYRGLFHSYLAQKRFKEA 387 (564) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCHHHHHCCHHHHHCCCHHHHHHHHHHHHHCCHH----HHHHHHHHHHHHHHHCCHHHH T ss_conf 7766899999998876158553367886139988545628899999998742514----578998899999961408999 Q ss_pred HHHHHHHHHCCCCCHHHHHHHH-HHHHHC-CCHHHHHHHHHHHHHHCCCHHHH---HHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 9999875431335279999999-999973-99478999999999832898999---999985078958999999999982 Q gi|254781033|r 246 YHDAIESLKLCDNSIMASICAA-KSLISQ-NKKRKAEVILEKIWKVNPHPEIA---NIYTHLLSENTVGKLKRALRLEEI 320 (492) Q Consensus 246 ~~~~~~a~~~~p~~~~a~~~~a-~~l~~~-g~~~~A~~~le~a~~~~p~p~La---~a~~~~~~~~~~~al~~~~~l~~~ 320 (492) ...+..+.+.-|....+.-+.+ ..+... --..||.+.++++++.+|..--+ .+-.....+...+.++.+++.++. T Consensus 388 ~~~An~~~~~~~~sA~~LtL~g~~V~~~dp~~rEKAKkf~ek~L~~~P~Y~~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~ 467 (564) T KOG1174 388 NALANWTIRLFQNSARSLTLFGTLVLFPDPRMREKAKKFAEKSLKINPIYTPAVNLIAELCQVEGPTKDIIKLLEKHLII 467 (564) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 99999999976531446766300011568506799999998420358860899999999998627654189999998742 Q ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHH Q ss_conf 6898368999999999879988999998878660999899 Q gi|254781033|r 321 NKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEI 360 (492) Q Consensus 321 ~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~ 360 (492) .||. ..+..||........+.+|.+++..+++++|.... T Consensus 468 ~~D~-~LH~~Lgd~~~A~Ne~Q~am~~y~~ALr~dP~~~~ 506 (564) T KOG1174 468 FPDV-NLHNHLGDIMRAQNEPQKAMEYYYKALRQDPKSKR 506 (564) T ss_pred CCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHH T ss_conf 5620-89999999999861079999999999705964077 No 48 >KOG2076 consensus Probab=99.11 E-value=1.3e-06 Score=69.49 Aligned_cols=279 Identities=11% Similarity=0.067 Sum_probs=185.0 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC----CHHHHH-HHHHHHHH Q ss_conf 9999721004787999986301346661589999987532222100016899999972688----505799-98654552 Q gi|254781033|r 91 GLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPA----TREFAV-YSLYFESC 165 (492) Q Consensus 91 gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~----~~~~~~-~gL~~~al 165 (492) +-++..-|++..|+-.+.++.+.++.+.+. ..-.+...+..|+...|-+.|.+++...| .+.... +.-+.... T Consensus 214 adls~~~~~i~qA~~cy~rAI~~~p~n~~~--~~ers~L~~~~G~~~~Am~~f~~l~~~~p~~d~er~~d~i~~~~~~~~ 291 (895) T KOG2076 214 ADLSEQLGNINQARYCYSRAIQANPSNWEL--IYERSSLYQKTGDLKRAMETFLQLLQLDPPVDIERIEDLIRRVAHYFI 291 (895) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHH--HHHHHHHHHHHCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 999986012999999999998608862289--999999999847078889999999862890158999999999999999 Q ss_pred CCCCHHHHHHHHHHHHH--HCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC--C---------------------- Q ss_conf 15886999999998884--212698999999999995257456999999999502--3---------------------- Q gi|254781033|r 166 RIGDLNSAQRYATKALD--ISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNA--K---------------------- 219 (492) Q Consensus 166 ~~gd~~~A~~~~~~a~~--~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~--~---------------------- 219 (492) ..++.+.|.+.++.++. ..--....+..+++++.....|+.++..+....... + T Consensus 292 ~~~~~e~a~~~le~~~s~~~~~~~~ed~ni~ael~l~~~q~d~~~~~i~~~~~r~~e~d~~e~~~~~~~~~~~~~~~~~~ 371 (895) T KOG2076 292 THNERERAAKALEGALSKEKDEASLEDLNILAELFLKNKQSDKALMKIVDDRNRESEKDDSEWDTDERRREEPNALCEVG 371 (895) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCCCCCCC T ss_conf 71078999999997774103403562789999999986788876699999860136787165400220345553201478 Q ss_pred -CHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH--H Q ss_conf -103578999999999999669989999999875431335-279999999999973994789999999998328989--9 Q gi|254781033|r 220 -EWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDN-SIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE--I 295 (492) Q Consensus 220 -~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~-~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~--L 295 (492) +....--+..+......+...+..+++.....-...-|. .+....-.+++|...|++..|.+++.......+... + T Consensus 372 ~~~s~~l~v~rl~icL~~L~~~e~~e~ll~~l~~~n~~~~d~~dL~~d~a~al~~~~~~~~Al~~l~~i~~~~~~~~~~v 451 (895) T KOG2076 372 KELSYDLRVIRLMICLVHLKERELLEALLHFLVEDNVWVSDDVDLYLDLADALTNIGKYKEALRLLSPITNREGYQNAFV 451 (895) T ss_pred CCCCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHH T ss_conf 77775630576754320024434588898888883378455578999999999836558889999999864833355134 Q ss_pred HHHHHH--HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC----------CCHHHHHH Q ss_conf 999998--507895899999999998268983689999999998799889999988786609----------99899999 Q gi|254781033|r 296 ANIYTH--LLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIA----------PRKEIFLL 363 (492) Q Consensus 296 a~a~~~--~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~----------P~~~~~~l 363 (492) ....+. ...+....++..+++++...|+|.+.++.|+.++.+.|++++|-+.+++...-+ |..++... T Consensus 452 w~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~~D~Ri~Lasl~~~~g~~EkalEtL~~~~~~D~~~~e~~a~~~e~ri~~~ 531 (895) T KOG2076 452 WYKLARCYMELGEYEEAIEFYEKVLILAPDNLDARITLASLYQQLGNHEKALETLEQIINPDGRNAEACAWEPERRILAH 531 (895) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCHHHCCCCHHHHHHHH T ss_conf 69899999987428999999999995289860166319999976388889999871345899400222003677789999 Q ss_pred HHHHHHHH Q ss_conf 99999985 Q gi|254781033|r 364 LAQIEQAN 371 (492) Q Consensus 364 lA~ie~~e 371 (492) ...+.... T Consensus 532 r~d~l~~~ 539 (895) T KOG2076 532 RCDILFQV 539 (895) T ss_pred HHHHHHHH T ss_conf 99999980 No 49 >KOG1173 consensus Probab=99.07 E-value=1.5e-06 Score=69.20 Aligned_cols=262 Identities=15% Similarity=0.115 Sum_probs=130.4 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHH-HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCC Q ss_conf 875322221000168999999726885057999-8654552158869999999988842126989999999999952574 Q gi|254781033|r 126 EVQIALAERQYNIAHEKLEMMLQIPATREFAVY-SLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEW 204 (492) Q Consensus 126 ~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~-gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw 204 (492) .|+.....+++..-.+.+..+++.+|-..-.+- .++ .....|+...=.-..-++.+..|+.+-.+..++-.|..-|.. T Consensus 250 ~ad~~y~~c~f~~c~kit~~lle~dpfh~~~~~~~ia-~l~el~~~n~Lf~lsh~LV~~yP~~a~sW~aVg~YYl~i~k~ 328 (611) T KOG1173 250 KADRLYYGCRFKECLKITEELLEKDPFHLPCLPLHIA-CLYELGKSNKLFLLSHKLVDLYPSKALSWFAVGCYYLMIGKY 328 (611) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHHHHH-HHHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHCCC T ss_conf 9999997171999999859987327898513999999-999816521699988879872888773146577899981486 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 56999999999502310357899999999999966998999999987543133527999999999997399478999999 Q gi|254781033|r 205 SRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILE 284 (492) Q Consensus 205 ~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le 284 (492) .+|+..+.+.....+.+.+ .+++.+.....+++.+.|...+..|.++-|+.--..+-.+--+.+.|+..-|.+.+. T Consensus 329 seARry~SKat~lD~~fgp----aWl~fghsfa~e~EhdQAmaaY~tAarl~~G~hlP~LYlgmey~~t~n~kLAe~Ff~ 404 (611) T KOG1173 329 SEARRYFSKATTLDPTFGP----AWLAFGHSFAGEGEHDQAMAAYFTAARLMPGCHLPSLYLGMEYMRTNNLKLAEKFFK 404 (611) T ss_pred HHHHHHHHHHHHCCCCCCH----HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 7999998887603765327----999876776501057889999999998666776317899999987423989999999 Q ss_pred HHHHHCCCHHHHHHHHHH---CCCCHHHHHHHHHHHHHHCC-C------CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 999832898999999985---07895899999999998268-9------8368999999999879988999998878660 Q gi|254781033|r 285 KIWKVNPHPEIANIYTHL---LSENTVGKLKRALRLEEINK-E------SVESLVIVSKIALEMGSIDQAHAKAMLAMKI 354 (492) Q Consensus 285 ~a~~~~p~p~La~a~~~~---~~~~~~~al~~~~~l~~~~P-~------~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~ 354 (492) .+....|...+..-..-+ ..+...++++.++..+..-+ . -......||-++...+.+.+|...+++++.. T Consensus 405 ~A~ai~P~Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p~~~NLGH~~Rkl~~~~eAI~~~q~aL~l 484 (611) T KOG1173 405 QALAIAPSDPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSVLNEKIFWEPTLNNLGHAYRKLNKYEEAIDYYQKALLL 484 (611) T ss_pred HHHHCCCCCCHHHHHHHHEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 98711898604433144203327546889999999988766305432211577876789999976679999999999871 Q ss_pred CCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCC Q ss_conf 9998-99999999999857998999999999962889995 Q gi|254781033|r 355 APRK-EIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPL 393 (492) Q Consensus 355 ~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~ 393 (492) .|.. +++.-.+=|... .|+-++|-+.+-||+--.||.. T Consensus 485 ~~k~~~~~asig~iy~l-lgnld~Aid~fhKaL~l~p~n~ 523 (611) T KOG1173 485 SPKDASTHASIGYIYHL-LGNLDKAIDHFHKALALKPDNI 523 (611) T ss_pred CCCCHHHHHHHHHHHHH-HCCHHHHHHHHHHHHHCCCCCH T ss_conf 87765688889999999-6580779999999871487528 No 50 >KOG1125 consensus Probab=99.06 E-value=3.2e-08 Score=82.19 Aligned_cols=226 Identities=12% Similarity=0.011 Sum_probs=138.2 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCC Q ss_conf 54552158869999999988842126989999999999952574569999999995023103578999999999999669 Q gi|254781033|r 161 YFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKG 240 (492) Q Consensus 161 ~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~ 240 (492) ....++.||...|.-.++.+.+.+|.|..++..|+.++...++=..|+..|.+.++..|.... .+.+++..+..++ T Consensus 292 G~~lm~nG~L~~A~LafEAAVkqdP~haeAW~~LG~~qaENE~E~~ai~AL~rcl~LdP~Nle----aLmaLAVSytNeg 367 (579) T KOG1125 292 GCNLMKNGDLSEAALAFEAAVKQDPQHAEAWQKLGITQAENENEQNAISALRRCLELDPTNLE----ALMALAVSYTNEG 367 (579) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH----HHHHHHHHHHHHH T ss_conf 999986389059999999998619287899999615765034208899999999846995199----9999888886146 Q ss_pred CHHHHHHHHHHHHHCCCCCHHHHHHHHH-------HHHHCCCHHH-HHHHHHHHHHHC--CCHHHHHHHHHH--CCCCHH Q ss_conf 9899999998754313352799999999-------9997399478-999999999832--898999999985--078958 Q gi|254781033|r 241 DMIASYHDAIESLKLCDNSIMASICAAK-------SLISQNKKRK-AEVILEKIWKVN--PHPEIANIYTHL--LSENTV 308 (492) Q Consensus 241 ~~~~A~~~~~~a~~~~p~~~~a~~~~a~-------~l~~~g~~~~-A~~~le~a~~~~--p~p~La~a~~~~--~~~~~~ 308 (492) ...+|.+.+.+.++..|.........-. .......... .+..|+.+.... .+|++-...+-+ ..++.. T Consensus 368 ~q~~Al~~L~~Wi~~~p~y~~l~~a~~~~~~~~~~s~~~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~efd 447 (579) T KOG1125 368 LQNQALKMLDKWIRNKPKYVHLVSAGENEDFENTKSFLDSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGEFD 447 (579) T ss_pred HHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHH T ss_conf 68889999999998486211114667665666776778878999999999999985778888427762678974014788 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCC Q ss_conf 99999999998268983689999999998799889999988786609998999999999998579989999999999628 Q gi|254781033|r 309 GKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHA 388 (492) Q Consensus 309 ~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a 388 (492) .++..|+.++..+|+|.-.+--||-..-...+..+|...+.+++++.|..--.+....|.--.-|....+-..|-.|+.. T Consensus 448 raiDcf~~AL~v~Pnd~~lWNRLGAtLAN~~~s~EAIsAY~rALqLqP~yVR~RyNlgIS~mNlG~ykEA~~hlL~AL~m 527 (579) T KOG1125 448 RAVDCFEAALQVKPNDYLLWNRLGATLANGNRSEEAISAYNRALQLQPGYVRVRYNLGISCMNLGAYKEAVKHLLEALSM 527 (579) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 89999999970498417789886688607755399999999998418872653010145662056699999999999986 Q ss_pred CC Q ss_conf 89 Q gi|254781033|r 389 MP 390 (492) Q Consensus 389 ~~ 390 (492) .+ T Consensus 528 q~ 529 (579) T KOG1125 528 QR 529 (579) T ss_pred HH T ss_conf 53 No 51 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=99.06 E-value=1.5e-06 Score=69.06 Aligned_cols=149 Identities=15% Similarity=0.094 Sum_probs=114.0 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC-HHHHHHHH--HH Q ss_conf 9999999999996699899999998754313352799999999999739947899999999983289-89999999--85 Q gi|254781033|r 226 AILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH-PEIANIYT--HL 302 (492) Q Consensus 226 a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~-p~La~a~~--~~ 302 (492) ....+..+......++.++|+..+...++..|+++....+.+.++++.|+..+|...++++....|+ +-|...|+ .+ T Consensus 306 ~aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~all 385 (484) T COG4783 306 LAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQALL 385 (484) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHH T ss_conf 37788899999983270157899999998489987999999999987588677999999998408985079999999999 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC--CHHHHHHHHHHHHHHCCC Q ss_conf 078958999999999982689836899999999987998899999887866099--989999999999985799 Q gi|254781033|r 303 LSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAP--RKEIFLLLAQIEQANSHN 374 (492) Q Consensus 303 ~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P--~~~~~~llA~ie~~e~~d 374 (492) ..+++.++++.+...+..+|+++..+..||++|-..|+-.+|...........- ..-+-.+|..-.....|. T Consensus 386 ~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~~g~~~~a~~A~AE~~~~~G~~~~A~~~l~~A~~~~~~~~ 459 (484) T COG4783 386 KGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAELGNRAEALLARAEGYALAGRLEQAIIFLMRASQQVKLGF 459 (484) T ss_pred HCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC T ss_conf 56997899999999864487773699999999998476177999999999867887899999999998636884 No 52 >KOG1129 consensus Probab=99.03 E-value=1.7e-07 Score=76.60 Aligned_cols=296 Identities=15% Similarity=0.072 Sum_probs=222.8 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH----CCCCH---HH---HHH Q ss_conf 999999721004787999986301346661589999987532222100016899999972----68850---57---999 Q gi|254781033|r 89 YTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQ----IPATR---EF---AVY 158 (492) Q Consensus 89 ~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~----~~~~~---~~---~~~ 158 (492) +..|.+...|.+-...|+- -.++...|.|..-|--.......|...+.-.-....+ .+.+. +. --. T Consensus 152 taSmaa~~dG~f~nlsRLN----~tkYa~~p~l~kaLFey~fyhenDv~~aH~~~~~~~~~~~a~~s~~~~~~~dwwWk~ 227 (478) T KOG1129 152 TASMAAFNDGKFYNLSRLN----PTKYAERPTLVKALFEYLFYHENDVQKAHSLCQAVLEVERAKPSGSTGCTLDWWWKQ 227 (478) T ss_pred HHHHHCCCCCCEEEHHHCC----CHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH T ss_conf 3555206786402235348----044203758999999999986646899988888888888605666554147689999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 86545521588699999999888421269899999999999525745699999999950231035789999999999996 Q gi|254781033|r 159 SLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 159 gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) .+..-+++.|=+..|.+.++..++..| ++.....|...|.+..+...|+.++.+.+...|.. ...++..++.+.+ T Consensus 228 Q~gkCylrLgm~r~AekqlqssL~q~~-~~dTfllLskvY~ridQP~~AL~~~~~gld~fP~~----VT~l~g~ARi~ea 302 (478) T KOG1129 228 QMGKCYLRLGMPRRAEKQLQSSLTQFP-HPDTFLLLSKVYQRIDQPERALLVIGEGLDSFPFD----VTYLLGQARIHEA 302 (478) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCH----HHHHHHHHHHHHH T ss_conf 998999981682666899998764377-61699999999987235889999986465238832----5666636999999 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-CHHHHHHHHH--HCCCCHHHHHHHHH Q ss_conf 69989999999875431335279999999999973994789999999998328-9899999998--50789589999999 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP-HPEIANIYTH--LLSENTVGKLKRAL 315 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p-~p~La~a~~~--~~~~~~~~al~~~~ 315 (492) .++.+++.+.++..++..|.++++....+.-+.-.|+.+-|.+.+.+.+...- .|+|-...+. +..++..-.+..++ T Consensus 303 m~~~~~a~~lYk~vlk~~~~nvEaiAcia~~yfY~~~PE~AlryYRRiLqmG~~speLf~NigLCC~yaqQ~D~~L~sf~ 382 (478) T KOG1129 303 MEQQEDALQLYKLVLKLHPINVEAIACIAVGYFYDNNPEMALRYYRRILQMGAQSPELFCNIGLCCLYAQQIDLVLPSFQ 382 (478) T ss_pred HHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 97078899999999840876500356655300138986899999999998327776777658889875400303178899 Q ss_pred HHHHHCC---CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCC Q ss_conf 9998268---9836899999999987998899999887866099989999999999985799899999999996288999 Q gi|254781033|r 316 RLEEINK---ESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDP 392 (492) Q Consensus 316 ~l~~~~P---~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp 392 (492) +.++... ...+.++.++..+...||+..|.+.+.-++..+|+....+-...+.+...|+...+|..+.-|..-.||. T Consensus 383 RAlstat~~~~aaDvWYNlg~vaV~iGD~nlA~rcfrlaL~~d~~h~ealnNLavL~~r~G~i~~Arsll~~A~s~~P~m 462 (478) T KOG1129 383 RALSTATQPGQAADVWYNLGFVAVTIGDFNLAKRCFRLALTSDAQHGEALNNLAVLAARSGDILGARSLLNAAKSVMPDM 462 (478) T ss_pred HHHHHCCCCCHHHHHHHCCCEEEEECCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC T ss_conf 99863368641566664001167703424888888898750585238888768998851276387899998765418530 Q ss_pred C Q ss_conf 5 Q gi|254781033|r 393 L 393 (492) Q Consensus 393 ~ 393 (492) . T Consensus 463 ~ 463 (478) T KOG1129 463 A 463 (478) T ss_pred C T ss_conf 0 No 53 >PRK12370 invasion protein regulator; Provisional Probab=99.00 E-value=8.2e-06 Score=63.35 Aligned_cols=211 Identities=11% Similarity=0.003 Sum_probs=120.2 Q ss_pred HHHHHHHHH-CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 998654552-1588699999999888421269899999999999525745699999999950231035789999999999 Q gi|254781033|r 157 VYSLYFESC-RIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLE 235 (492) Q Consensus 157 ~~gL~~~al-~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~ 235 (492) ++|+...+. ...|...|+.+++++..++|+...++..+...|.+..... ..+ T Consensus 263 l~Gl~hl~rft~edn~~A~~LF~~Av~~dP~fA~AyA~la~cy~q~a~l~-----------~~d---------------- 315 (553) T PRK12370 263 LRGKHELNQYTPYSLQQALKLLTQCVNMSPNSIAPYCALAECYLSMAQMG-----------IFD---------------- 315 (553) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHC-----------CCC---------------- T ss_conf 97899997318877999999999987339774466787899999763403-----------688---------------- Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH---HHHHHHHHCCCCHHHHHH Q ss_conf 99669989999999875431335279999999999973994789999999998328989---999999850789589999 Q gi|254781033|r 236 NADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE---IANIYTHLLSENTVGKLK 312 (492) Q Consensus 236 ~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~---La~a~~~~~~~~~~~al~ 312 (492) .+.....|...+.++..++|.++.+..+++++.--.|+..-+...++++...+|+.. ...+|..+..|+..++.+ T Consensus 316 --~~~~~~~A~~~AeravEldp~dp~a~~~lg~~~~L~~d~~~a~~~f~rA~~L~Pnsa~~~Y~~~W~~~maGr~~ea~~ 393 (553) T PRK12370 316 --KQNAMIKAKEHAIKATELDHNNPQALGLLGLINTIHSEYIVGSLLFKQANLLSPISADIKYYYGWNLFMAGQLEEALQ 393 (553) T ss_pred --HHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHHH T ss_conf --288999999999998743878878889999998723552244799987642599842579999999997586899999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC Q ss_conf 99999982689836899999999987998899999887866099-98999999999998579989999999999628899 Q gi|254781033|r 313 RALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAP-RKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPD 391 (492) Q Consensus 313 ~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P-~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~d 391 (492) .+++.++.+|..+...++...+.+-.--.+.|..........+| +..+...|+..--+..|+.+++|.|. +++++... T Consensus 394 ~i~~a~~LdP~~~~~~~~~l~~~~~~~~~d~a~~l~~~~~~q~~~~~~~~~s~~~m~~~~~g~~~~a~~~~-~~~~~~~~ 472 (553) T PRK12370 394 TINECLKLDPTRAAAGITKLWITYYHTGIDDAIRLGDELRSQHLQDNPILLSMQVMFLSLKGKHELARKLT-KEISTQEI 472 (553) T ss_pred HHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHH-HHCCHHHC T ss_conf 99998835986520279999999984231079999998625077878369999999997338389999999-85180311 Q ss_pred CCEECC Q ss_conf 952756 Q gi|254781033|r 392 PLWISD 397 (492) Q Consensus 392 p~W~c~ 397 (492) ...+|- T Consensus 473 ~~~~~v 478 (553) T PRK12370 473 TGLIAV 478 (553) T ss_pred CCHHHH T ss_conf 473245 No 54 >KOG1156 consensus Probab=98.98 E-value=4.9e-06 Score=65.15 Aligned_cols=296 Identities=15% Similarity=0.160 Sum_probs=154.1 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 99999972100478799998630134666158999998753222210001689999997268850579998654552158 Q gi|254781033|r 89 YTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIG 168 (492) Q Consensus 89 ~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~g 168 (492) ..|+.--.-|+...|-...+.+.+.. +....-=|.+ +-......++++|.+-|+.++...+++..-+|-|.....+.+ T Consensus 46 mkGL~L~~lg~~~ea~~~vr~glr~d-~~S~vCwHv~-gl~~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmR 123 (700) T KOG1156 46 MKGLTLNCLGKKEEAYELVRLGLRND-LKSHVCWHVL-GLLQRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMR 123 (700) T ss_pred HCCCHHHCCCCHHHHHHHHHHHHCCC-CCCCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 60432302466188999998774027-6655107888-888741175899999999998339972999999999999998 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC---C-HHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 869999999988842126989999999999952574569999999995023---1-035789999999999996699899 Q gi|254781033|r 169 DLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAK---E-WNRNRAILLIARSLENADKGDMIA 244 (492) Q Consensus 169 d~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~---~-~~~~~a~l~~~~a~~~~~~~~~~~ 244 (492) |++.....=.+.+++.|++...+...+..+...|++..|..+++.-.+... + ....+....+.......+.|.... T Consensus 124 d~~~~~~tr~~LLql~~~~ra~w~~~Avs~~L~g~y~~A~~il~ef~~t~~~~~s~~~~e~se~~Ly~n~i~~E~g~~q~ 203 (700) T KOG1156 124 DYEGYLETRNQLLQLRPSQRASWIGFAVAQHLLGEYKMALEILEEFEKTQNTSPSKEDYEHSELLLYQNQILIEAGSLQK 203 (700) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCHHH T ss_conf 61618888999977224537999999999999877887789999999862049988889999998999999997246889 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH-----HHH--------------HHHHH--- Q ss_conf 99999875431335279999999999973994789999999998328989-----999--------------99985--- Q gi|254781033|r 245 SYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE-----IAN--------------IYTHL--- 302 (492) Q Consensus 245 A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~-----La~--------------a~~~~--- 302 (492) +.+...+--+.--+.......-+.++.+.++..+|..++......+|+.- +-. .|... T Consensus 204 ale~L~~~e~~i~Dkla~~e~ka~l~~kl~~lEeA~~~y~~Ll~rnPdn~~Yy~~l~~~lgk~~d~~~~lk~ly~~ls~~ 283 (700) T KOG1156 204 ALEHLLDNEKQIVDKLAFEETKADLLMKLGQLEEAVKVYRRLLERNPDNLDYYEGLEKALGKIKDMLEALKALYAILSEK 283 (700) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 99999852067777778864288899987418768999999986293367899999999999843399999999987523 Q ss_pred --------------C-CCCHHH-----------------------------HHHHHHHHHHH-----CCC------C--- Q ss_conf --------------0-789589-----------------------------99999999982-----689------8--- Q gi|254781033|r 303 --------------L-SENTVG-----------------------------KLKRALRLEEI-----NKE------S--- 324 (492) Q Consensus 303 --------------~-~~~~~~-----------------------------al~~~~~l~~~-----~P~------~--- 324 (492) . .++... ....+++++.. .+. + T Consensus 284 y~r~e~p~Rlplsvl~~eel~~~vdkyL~~~l~Kg~p~vf~dl~SLyk~p~k~~~le~Lvt~y~~~L~~~~~f~~~D~~~ 363 (700) T KOG1156 284 YPRHECPRRLPLSVLNGEELKEIVDKYLRPLLSKGVPSVFKDLRSLYKDPEKVAFLEKLVTSYQHSLSGTGMFNFLDDGK 363 (700) T ss_pred CCCCCCHHHCCHHHHCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 75322203324777375246899999999987446870654158887261475799999999986327656778233344 Q ss_pred --HH-----HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHC Q ss_conf --36-----89999999998799889999988786609998-99999999999857998999999999962 Q gi|254781033|r 325 --VE-----SLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSALH 387 (492) Q Consensus 325 --~e-----~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~ 387 (492) +. ..+.++.-+...|+++.|..++..++.-.||. ..|++-|+|... .|+...+-.|+..|.. T Consensus 364 ~E~PttllWt~y~laqh~D~~g~~~~A~~yId~AIdHTPTliEly~~KaRI~kH-~G~l~eAa~~l~ea~e 433 (700) T KOG1156 364 QEPPTTLLWTLYFLAQHYDKLGDYEVALEYIDLAIDHTPTLIELYLVKARIFKH-AGLLDEAAAWLDEAQE 433 (700) T ss_pred CCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHH T ss_conf 588418999999999888870557999999988752586099999999999984-4886899998999872 No 55 >KOG1840 consensus Probab=98.98 E-value=3.8e-06 Score=65.98 Aligned_cols=239 Identities=17% Similarity=0.108 Sum_probs=127.3 Q ss_pred CCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH--------CCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC- Q ss_conf 68850579998654552158869999999988842--------126989999999999952574569999999995023- Q gi|254781033|r 149 IPATREFAVYSLYFESCRIGDLNSAQRYATKALDI--------SPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAK- 219 (492) Q Consensus 149 ~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~--------~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~- 219 (492) .+|......+.|...+..+|+++.|....+.++++ .|.-......++.+|...+++.+|..++++++.... T Consensus 194 ~~P~~~~~~~~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~ 273 (508) T KOG1840 194 EDPERLRTLRNLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREE 273 (508) T ss_pred CCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 78437889999999999852077899999999999987037666899999999999999744289999999999999998 Q ss_pred ---CHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC--------CHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf ---103578999999999999669989999999875431335--------279999999999973994789999999998 Q gi|254781033|r 220 ---EWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDN--------SIMASICAAKSLISQNKKRKAEVILEKIWK 288 (492) Q Consensus 220 ---~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~--------~~~a~~~~a~~l~~~g~~~~A~~~le~a~~ 288 (492) ..+...+..+..++..+...|+..+|...+.+|.+.... ........+.++...+++.+|.+++.++.+ T Consensus 274 ~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~~~~~~~~~~~Eea~~l~q~al~ 353 (508) T KOG1840 274 VFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSELAAILQSMNEYEEAKKLLQKALK 353 (508) T ss_pred HCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 53898789999998899998604775778999999999999850368589999998999999885205689999999999 Q ss_pred HC------CCHHHHHHHHH-----HCCCCHHHHHHHHHHHHHHC--------CCCHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 32------89899999998-----50789589999999999826--------8983689999999998799889999988 Q gi|254781033|r 289 VN------PHPEIANIYTH-----LLSENTVGKLKRALRLEEIN--------KESVESLVIVSKIALEMGSIDQAHAKAM 349 (492) Q Consensus 289 ~~------p~p~La~a~~~-----~~~~~~~~al~~~~~l~~~~--------P~~~e~~~~LA~~~~~~g~~~~Ar~~l~ 349 (492) .. -|+.++..+.. ...|...++...+++.+... +...-...-+|..+.+.+.+.+|-..+. T Consensus 354 i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~~~~~~~~~~~~~~~~~l~~la~~~~~~k~~~~a~~l~~ 433 (508) T KOG1840 354 IYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQILRELLGKKDYGVGKPLNQLAEAYEELKKYEEAEQLFE 433 (508) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 99861464105789899899999999446567999999999998861567671454789999999998346022899999 Q ss_pred HHHHC-------CCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCC Q ss_conf 78660-------9998-999999999998579989999999999628 Q gi|254781033|r 350 LAMKI-------APRK-EIFLLLAQIEQANSHNTDKILYWTQSALHA 388 (492) Q Consensus 350 ~al~~-------~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~a 388 (492) .+... .|+. -.|.-++.+.++.| +...+-....+++++ T Consensus 434 ~~~~i~~~~g~~~~~~~~~~~nL~~~Y~~~g-~~e~a~~~~~~~~~~ 479 (508) T KOG1840 434 EAKDIMKLCGPDHPDVTYTYLNLAALYRAQG-NYEAAEELEEKVLNA 479 (508) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHCC-CHHHHHHHHHHHHHH T ss_conf 9988999858899746899988999999735-688899999999899 No 56 >cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure. Probab=98.98 E-value=9.6e-06 Score=62.84 Aligned_cols=263 Identities=13% Similarity=0.083 Sum_probs=168.7 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH----CCCC--HHHHHH Q ss_conf 9999999999721004787999986301346661589999987532222100016899999972----6885--057999 Q gi|254781033|r 85 YKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQ----IPAT--REFAVY 158 (492) Q Consensus 85 ~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~----~~~~--~~~~~~ 158 (492) .+...+++.+...||...|.....+..... |.+.|+.=+ -...+..|+...-+..-...+. ..+. .+.|.+ T Consensus 44 E~~hv~a~~~~~~g~~~~A~~~~~~~l~~~--P~D~lalk~-~~~~f~lg~~~g~~~~~~~~l~~~~p~~~~~~~~~gm~ 120 (355) T cd05804 44 ERAHVEALSAWIAGDLPKALALLEQLLDDY--PRDLLALKL-HLGAFGLGDFSGMRDHVARVLPLWAPENPDYWYLLGML 120 (355) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHC--CCHHHHHHH-HHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHH T ss_conf 999999999998799789999999999869--835999998-76741025413240569999872573489836999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 86545521588699999999888421269899999999999525745699999999950231035789999999999996 Q gi|254781033|r 159 SLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 159 gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) .. -....|++++|.+..+++++++|++.|+..++.-++..+|++++.+..++.....-.....-+.-..--.+..+++ T Consensus 121 AF--~leE~g~~~~Ae~~a~~al~~np~d~wa~HAvAHv~~~~g~~~eG~~~l~~~~~~w~~~~~l~~H~wWH~AL~~Le 198 (355) T cd05804 121 AF--GLEEAGQYDRAEEAARRALELNPDDAWAVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLE 198 (355) T ss_pred HH--HHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99--9998588999999999999639999379999999999838878899999851752577885898999999999887 Q ss_pred CCCHHHHHHHHHHHHHCCCCC------HHHHHHHHHHHHHC-CCHHHH-HHHHHHHHHHCCC-----HHHHHHHHHHCCC Q ss_conf 699899999998754313352------79999999999973-994789-9999999983289-----8999999985078 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNS------IMASICAAKSLISQ-NKKRKA-EVILEKIWKVNPH-----PEIANIYTHLLSE 305 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~------~~a~~~~a~~l~~~-g~~~~A-~~~le~a~~~~p~-----p~La~a~~~~~~~ 305 (492) .|+.++++..+..-+...+.. ..+..++-++...- -+...- ..+........++ .++-.+++....+ T Consensus 199 ~g~~~~~l~iyD~~i~~~~~~~~~~d~~dAaSLL~RLel~G~~~v~~Rw~~la~~~~~~~~~~~~~F~D~H~~~ala~ag 278 (355) T cd05804 199 RGDYEAALAIYDTHIAPSAESDPALDLLDAASLLWRLELAGHVDVGDRWEDLADYAAWHFPDHGLAFNDLHAALALAGAG 278 (355) T ss_pred CCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCC T ss_conf 78988999998887445668706999983999999999739987355799999999874565033199999999998489 Q ss_pred CHHH---HHHHHHHHHHHCCC------CHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 9589---99999999982689------83689999999998799889999988786 Q gi|254781033|r 306 NTVG---KLKRALRLEEINKE------SVESLVIVSKIALEMGSIDQAHAKAMLAM 352 (492) Q Consensus 306 ~~~~---al~~~~~l~~~~P~------~~e~~~~LA~~~~~~g~~~~Ar~~l~~al 352 (492) +... .+...+...+..++ +...-++-|-.++..|+|+.|-+.|..+. T Consensus 279 ~~~~~~~ll~~~~~~~~~~~~~~~~~~~vg~pl~~Al~a~~~gd~~~a~~lL~p~~ 334 (355) T cd05804 279 DKDALDKLLAALKGRASSADDNKQPARDVGLPLAEALYAFAEGNYATALELLGPVR 334 (355) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99999999999998763777320056500489999999998589999999999989 No 57 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=98.97 E-value=1e-07 Score=78.23 Aligned_cols=250 Identities=17% Similarity=0.153 Sum_probs=195.7 Q ss_pred HCCHHHHHHHHHHHHCC-CCHHH--------HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCC Q ss_conf 10001689999997268-85057--------9998654552158869999999988842126989999999999952574 Q gi|254781033|r 134 RQYNIAHEKLEMMLQIP-ATREF--------AVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEW 204 (492) Q Consensus 134 gd~~~A~~~~~~~~~~~-~~~~~--------~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw 204 (492) .-+.++-+.|+..++.. ....+ -++|.+. .-.|+...|+.-+.++.++.|.-...+.-.+.+....|+- T Consensus 336 ~~y~~aa~~F~~~Ld~~nede~L~e~~A~~~~~~G~F~--~L~~~~~~A~~Dl~K~IEL~P~~~qSYI~~A~~~~~~~~~ 413 (649) T TIGR00990 336 ESYEEAARAFEKALDLGNEDEKLGEKEAIALNLRGIFK--CLKGKHLEALADLSKSIELDPRVTQSYIKRASINLELGDP 413 (649) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHH--HHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCC T ss_conf 22789999999984047751121145555441242223--3128815789987765422787317899999997415784 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 56999999999502310357899999999999966998999999987543133527999999999997399478999999 Q gi|254781033|r 205 SRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILE 284 (492) Q Consensus 205 ~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le 284 (492) +++...++++++..+.... .++-++....-.++.++|.+.+.++..++|+++-.++.++-.-.+.|.....+.++. T Consensus 414 ~~~~~~F~~Al~~~~~~~~----~YYHRaQl~FI~~~f~~A~KDy~K~~~LDp~~IF~~IQL~~~~Y~~~~~~~~~~~F~ 489 (649) T TIGR00990 414 DKAEEDFDKALKKNSEDPD----IYYHRAQLHFIKGEFAQAGKDYQKSIDLDPDFIFSYIQLGVLQYKLGSIASSMALFR 489 (649) T ss_pred HHHHHHHHHHHHCCCCCCC----CEEHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 2258899999861889875----110177899887307767545776643271102268999998863034434789999 Q ss_pred HHHHHCC-CHHHHHHHHHHCCC--CHHHHHHHHHHHHHHCCC------CHHHHHHHHHHHHH---CCCHHHHHHHHHHHH Q ss_conf 9998328-98999999985078--958999999999982689------83689999999998---799889999988786 Q gi|254781033|r 285 KIWKVNP-HPEIANIYTHLLSE--NTVGKLKRALRLEEINKE------SVESLVIVSKIALE---MGSIDQAHAKAMLAM 352 (492) Q Consensus 285 ~a~~~~p-~p~La~a~~~~~~~--~~~~al~~~~~l~~~~P~------~~e~~~~LA~~~~~---~g~~~~Ar~~l~~al 352 (492) ++.+..| -|++-.-|+++..+ +..++++.|...+.+-.. +.--++.-+.+..+ .++|.+|...+++++ T Consensus 490 ~~~~~Fp~~P~V~Nyy~ElL~D~~~F~~A~~~fD~AI~LE~~~~~~~~~v~PL~~Ka~~L~~nP~~~~f~EA~~L~EKA~ 569 (649) T TIGR00990 490 RCKKNFPEAPDVYNYYGELLLDKNKFDEAIEKFDLAIELEKELKPIYVNVLPLINKALALFQNPWKQDFIEAENLLEKAL 569 (649) T ss_pred HHHHCCCCCCCCCCHHHHHHHCHHHHHHHHHHHHHHHHHHHCCCCEEEECHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 99861778888530578886143679999998568999875059807631336778999762652100789999988863 Q ss_pred HCCCCHHHH-HHHHHHHHHHCCCHHHHHHHHHHHHCCCC Q ss_conf 609998999-99999999857998999999999962889 Q gi|254781033|r 353 KIAPRKEIF-LLLAQIEQANSHNTDKILYWTQSALHAMP 390 (492) Q Consensus 353 ~~~P~~~~~-~llA~ie~~e~~d~~~ar~wl~rA~~a~~ 390 (492) .++|..... .-||++.-.+ ++...+-.+++++..-.| T Consensus 570 ~~DP~~~~A~~~lAQl~LQQ-~~v~EAl~~FE~~A~LAR 607 (649) T TIGR00990 570 IIDPECDIAVATLAQLLLQQ-GDVDEALKLFERAAELAR 607 (649) T ss_pred CCCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH T ss_conf 04884147899999987623-468999999987787630 No 58 >KOG1127 consensus Probab=98.91 E-value=3.3e-07 Score=74.22 Aligned_cols=84 Identities=13% Similarity=-0.045 Sum_probs=43.2 Q ss_pred HHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 21000168999999726885057999865455215886999999998884212698999999999995257456999999 Q gi|254781033|r 133 ERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLN 212 (492) Q Consensus 133 ~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~ 212 (492) .++..++...|+..++.+|..+....||+..+...|.+..|++.+.++-.++|.+..+.....-+.+..|.+.++++.+. T Consensus 575 a~n~h~aV~~fQsALR~dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~y~~fk~A~~ecd~GkYkeald~l~ 654 (1238) T KOG1127 575 AHNLHGAVCEFQSALRTDPKDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSKYGRFKEAVMECDNGKYKEALDALG 654 (1238) T ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 52155589999998638906578899877887743762007776424574482768999999999987666899999999 Q ss_pred HHHH Q ss_conf 9995 Q gi|254781033|r 213 QKKK 216 (492) Q Consensus 213 ~~~~ 216 (492) .... T Consensus 655 ~ii~ 658 (1238) T KOG1127 655 LIIY 658 (1238) T ss_pred HHHH T ss_conf 9999 No 59 >KOG1129 consensus Probab=98.85 E-value=1.1e-06 Score=70.12 Aligned_cols=197 Identities=15% Similarity=0.095 Sum_probs=113.2 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999999999525745699999999950231035789999999999996699899999998754313352799999999 Q gi|254781033|r 189 WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAK 268 (492) Q Consensus 189 ~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~ 268 (492) |=...++..|.+.|=..+|.+.+...++..+..+ ..++++. .+..-.++..|+..+.+.+..-|..+-...-.|+ T Consensus 224 wWk~Q~gkCylrLgm~r~AekqlqssL~q~~~~d---TfllLsk--vY~ridQP~~AL~~~~~gld~fP~~VT~l~g~AR 298 (478) T KOG1129 224 WWKQQMGKCYLRLGMPRRAEKQLQSSLTQFPHPD---TFLLLSK--VYQRIDQPERALLVIGEGLDSFPFDVTYLLGQAR 298 (478) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCHH---HHHHHHH--HHHHHCCHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 9999998999981682666899998764377616---9999999--9987235889999986465238832566663699 Q ss_pred HHHHCCCHHHHHHHHHHHHHHCCCHH--HHH-HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHH Q ss_conf 99973994789999999998328989--999-999850789589999999999826898368999999999879988999 Q gi|254781033|r 269 SLISQNKKRKAEVILEKIWKVNPHPE--IAN-IYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAH 345 (492) Q Consensus 269 ~l~~~g~~~~A~~~le~a~~~~p~p~--La~-a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar 345 (492) ++...++.+.|.++++.+++.+|..- ++- +-.+...++++-++..++++++..-.++|....+|-.|+-+++++-+. T Consensus 299 i~eam~~~~~a~~lYk~vlk~~~~nvEaiAcia~~yfY~~~PE~AlryYRRiLqmG~~speLf~NigLCC~yaqQ~D~~L 378 (478) T KOG1129 299 IHEAMEQQEDALQLYKLVLKLHPINVEAIACIAVGYFYDNNPEMALRYYRRILQMGAQSPELFCNIGLCCLYAQQIDLVL 378 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHH T ss_conf 99999707889999999984087650035665530013898689999999999832777677765888987540030317 Q ss_pred HHHHHHHHCC--CCH--HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC Q ss_conf 9988786609--998--999999999998579989999999999628899 Q gi|254781033|r 346 AKAMLAMKIA--PRK--EIFLLLAQIEQANSHNTDKILYWTQSALHAMPD 391 (492) Q Consensus 346 ~~l~~al~~~--P~~--~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~d 391 (492) ..+++++... |.. .++--+..+.-. .||-.-+..-+.-|+...++ T Consensus 379 ~sf~RAlstat~~~~aaDvWYNlg~vaV~-iGD~nlA~rcfrlaL~~d~~ 427 (478) T KOG1129 379 PSFQRALSTATQPGQAADVWYNLGFVAVT-IGDFNLAKRCFRLALTSDAQ 427 (478) T ss_pred HHHHHHHHHCCCCCHHHHHHHCCCEEEEE-CCCHHHHHHHHHHHHCCCCC T ss_conf 88999986336864156666400116770-34248888888987505852 No 60 >PRK12370 invasion protein regulator; Provisional Probab=98.84 E-value=1.6e-06 Score=69.01 Aligned_cols=158 Identities=12% Similarity=0.017 Sum_probs=86.0 Q ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC---------CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCH Q ss_conf 0001689999997268850579998654552158---------8699999999888421269899999999999525745 Q gi|254781033|r 135 QYNIAHEKLEMMLQIPATREFAVYSLYFESCRIG---------DLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWS 205 (492) Q Consensus 135 d~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~g---------d~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~ 205 (492) +...|...|.+.+..+|....++-++...+++.. ....|...++++.+++|.++.+.-+++.+.-..+|++ T Consensus 276 dn~~A~~LF~~Av~~dP~fA~AyA~la~cy~q~a~l~~~d~~~~~~~A~~~AeravEldp~dp~a~~~lg~~~~L~~d~~ 355 (553) T PRK12370 276 SLQQALKLLTQCVNMSPNSIAPYCALAECYLSMAQMGIFDKQNAMIKAKEHAIKATELDHNNPQALGLLGLINTIHSEYI 355 (553) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCH T ss_conf 79999999999873397744667878999997634036882889999999999987438788788899999987235522 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 69999999995023103578999999999999669989999999875431335279999999999973994789999999 Q gi|254781033|r 206 RAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEK 285 (492) Q Consensus 206 ~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~ 285 (492) .+..+++++....|.+. ...|..+....-.|+..+|++...++.+++|..+.+.+.+.-+..-....+.|..+-.. T Consensus 356 ~a~~~f~rA~~L~Pnsa----~~~Y~~~W~~~maGr~~ea~~~i~~a~~LdP~~~~~~~~~l~~~~~~~~~d~a~~l~~~ 431 (553) T PRK12370 356 VGSLLFKQANLLSPISA----DIKYYYGWNLFMAGQLEEALQTINECLKLDPTRAAAGITKLWITYYHTGIDDAIRLGDE 431 (553) T ss_pred HHHHHHHHHHCCCCCCH----HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 44799987642599842----57999999999758689999999998835986520279999999984231079999998 Q ss_pred HHHHCC--CHHHH Q ss_conf 998328--98999 Q gi|254781033|r 286 IWKVNP--HPEIA 296 (492) Q Consensus 286 a~~~~p--~p~La 296 (492) -...++ ||-+. T Consensus 432 ~~~q~~~~~~~~~ 444 (553) T PRK12370 432 LRSQHLQDNPILL 444 (553) T ss_pred HHCCCCCCCHHHH T ss_conf 6250778783699 No 61 >KOG0548 consensus Probab=98.84 E-value=2.8e-05 Score=59.19 Aligned_cols=299 Identities=13% Similarity=0.061 Sum_probs=169.4 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 99999999997210047879999863013466615899999875322221000168999999726885057999865455 Q gi|254781033|r 85 YKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFES 164 (492) Q Consensus 85 ~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~a 164 (492) ...-.+|..|+.+||+..|...+..+....+. +..|.-= ..-+....|++..|.+--.+..+..|+=.-|+..+...+ T Consensus 3 ~e~k~kgnaa~s~~d~~~ai~~~t~ai~l~p~-nhvlySn-rsaa~a~~~~~~~al~da~k~~~l~p~w~kgy~r~Gaa~ 80 (539) T KOG0548 3 VELKEKGNAAFSSGDFETAIRLFTEAIMLSPT-NHVLYSN-RSAAYASLGSYEKALKDATKTRRLNPDWAKGYSRKGAAL 80 (539) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCC-CCCHHCC-HHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 67887777551466678999999888706997-5201010-678888775699999889887760884236778757898 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHH-------------------------HHHHHHH----- Q ss_conf 2158869999999988842126989999999999952574569-------------------------9999999----- Q gi|254781033|r 165 CRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRA-------------------------ITFLNQK----- 214 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A-------------------------~~~l~~~----- 214 (492) .-.||+++|+..+.+.++..|++......|.+.+.....-.+. ..++... T Consensus 81 ~~lg~~~eA~~ay~~GL~~d~~n~~L~~gl~~a~~~~~~~~~~~~~p~~~~~l~~~p~t~~~~~~~~~~~~l~~~~~~p~ 160 (539) T KOG0548 81 FGLGDYEEAILAYSEGLEKDPSNKQLKTGLAQAYLEDYAADQLFTKPYFHEKLANLPLTNYSLSDPAYVKILEIIQKNPT 160 (539) T ss_pred HHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHCCHHHHHHHCCHHHHHHHHHHHCCCH T ss_conf 72053899999998775329845999974888656778753120182999876127555556415789999998603827 Q ss_pred -----------------HHC------------------CC---------C--HHHH------HHHHHHHHHHHHHHCCCH Q ss_conf -----------------950------------------23---------1--0357------899999999999966998 Q gi|254781033|r 215 -----------------KKN------------------AK---------E--WNRN------RAILLIARSLENADKGDM 242 (492) Q Consensus 215 -----------------~~~------------------~~---------~--~~~~------~a~l~~~~a~~~~~~~~~ 242 (492) +.. .| . .... .+.-.-..+.......+. T Consensus 161 ~l~~~l~d~r~m~a~~~l~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~d~~ee~~~k~~a~~ek~lgnaaykkk~f 240 (539) T KOG0548 161 SLKLYLNDPRLMKADGQLKGVDELLFYASGIEILASMAEPCKQEHNGFPIIEDNTEERRVKEKAHKEKELGNAAYKKKDF 240 (539) T ss_pred HHHCCCCCHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 65020252799999988726740000113444578999966556787776204689999988645789988998874548 Q ss_pred HHHHHHHHHHHHCCCCCHHHH-----------------------------------------HHHHHHHHHCCCHHHHHH Q ss_conf 999999987543133527999-----------------------------------------999999997399478999 Q gi|254781033|r 243 IASYHDAIESLKLCDNSIMAS-----------------------------------------ICAAKSLISQNKKRKAEV 281 (492) Q Consensus 243 ~~A~~~~~~a~~~~p~~~~a~-----------------------------------------~~~a~~l~~~g~~~~A~~ 281 (492) ..+.+.+..++.++ +.+... ...+..+...+++..+.+ T Consensus 241 ~~a~q~y~~a~el~-~~it~~~n~aA~~~e~~~~~~c~~~c~~a~E~gre~rad~klIak~~~r~g~a~~k~~~~~~ai~ 319 (539) T KOG0548 241 ETAIQHYAKALELA-TDITYLNNIAAVYLERGKYAECIELCEKAVEVGRELRADYKLIAKALARLGNAYTKREDYEGAIK 319 (539) T ss_pred HHHHHHHHHHHHHH-HHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999998675-66678888988997313798744220889987578999999999999986311355776788999 Q ss_pred HHHHHHHHCCCHHHH------------------------H-----HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 999999832898999------------------------9-----99985078958999999999982689836899999 Q gi|254781033|r 282 ILEKIWKVNPHPEIA------------------------N-----IYTHLLSENTVGKLKRALRLEEINKESVESLVIVS 332 (492) Q Consensus 282 ~le~a~~~~p~p~La------------------------~-----a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA 332 (492) .+.+++..+-.|++. . ..-.+..+++..++++|.+.++.+|+++...-..| T Consensus 320 ~~~kaLte~Rt~~~ls~lk~~Ek~~k~~e~~a~~~pe~A~e~r~kGne~Fk~gdy~~Av~~YteAIkr~P~Da~lYsNRA 399 (539) T KOG0548 320 YYQKALTEHRTPDLLSKLKEAEKALKEAERKAYINPEKAEEEREKGNEAFKKGDYPEAVKHYTEAIKRDPEDARLYSNRA 399 (539) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99998663258887877789999999988887318467899998778877535889999999999842984157888899 Q ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHC Q ss_conf 99998799889999988786609998-99999999999857998999999999962 Q gi|254781033|r 333 KIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSALH 387 (492) Q Consensus 333 ~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~ 387 (492) -.++..+.+..|.+..+..++++|+. ..|+--+.+.... .+..++-.-+..++. T Consensus 400 ac~~kL~~~~~aL~Da~~~ieL~p~~~kgy~RKg~al~~m-k~ydkAleay~eale 454 (539) T KOG0548 400 ACYLKLGEYPEALKDAKKCIELDPNFIKAYLRKGAALRAM-KEYDKALEAYQEALE 454 (539) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH T ss_conf 9999873179899889998750931899999999999999-989999999999873 No 62 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=98.83 E-value=3e-05 Score=58.94 Aligned_cols=124 Identities=15% Similarity=0.066 Sum_probs=97.7 Q ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH-HHHHHHHH--HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 3527999999999997399478999999999832898-99999998--50789589999999999826898368999999 Q gi|254781033|r 257 DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP-EIANIYTH--LLSENTVGKLKRALRLEEINKESVESLVIVSK 333 (492) Q Consensus 257 p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p-~La~a~~~--~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~ 333 (492) |....+..-.+..+...|++++|++.+....+.+|++ -+..+-.+ +..++..++++++++.+.+.|+.+-..+.+|. T Consensus 303 ~~~~aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~ 382 (484) T COG4783 303 RGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQ 382 (484) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 36537788899999983270157899999998489987999999999987588677999999998408985079999999 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 9998799889999988786609998-99999999999857998999999 Q gi|254781033|r 334 IALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYW 381 (492) Q Consensus 334 ~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~w 381 (492) +++..|++.+|...|...+..+|+. -.|.++|+-.... ||...+..- T Consensus 383 all~~g~~~eai~~L~~~~~~~p~dp~~w~~LAqay~~~-g~~~~a~~A 430 (484) T COG4783 383 ALLKGGKPQEAIRILNRYLFNDPEDPNGWDLLAQAYAEL-GNRAEALLA 430 (484) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH-CCHHHHHHH T ss_conf 999569978999999998644877736999999999984-761779999 No 63 >KOG1070 consensus Probab=98.80 E-value=2.4e-05 Score=59.66 Aligned_cols=214 Identities=10% Similarity=0.065 Sum_probs=169.6 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 99999888421269899999999999525745699999999950231-03578999999999999669989999999875 Q gi|254781033|r 174 QRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKE-WNRNRAILLIARSLENADKGDMIASYHDAIES 252 (492) Q Consensus 174 ~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~-~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a 252 (492) ..-+++...-+|++.-.+...+..+.+.++-++|+++.+++++.-.- .......++.+.-.....-|..+.-.+.+.+| T Consensus 1444 aeDferlvrssPNSSi~WI~YMaf~LelsEiekAR~iaerAL~tIN~REeeEKLNiWiA~lNlEn~yG~eesl~kVFeRA 1523 (1710) T KOG1070 1444 AEDFERLVRSSPNSSILWIRYMAFHLELSEIEKARKIAERALKTINFREEEEKLNIWIAYLNLENAYGTEESLKKVFERA 1523 (1710) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 78899998609984028999999885024567789999998611786416777778999986787527689999999999 Q ss_pred HHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-CHHHHHHHHHH--CCCCHHHHHHHHHHHHHHCCC--CHHH Q ss_conf 431335279999999999973994789999999998328-98999999985--078958999999999982689--8368 Q gi|254781033|r 253 LKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP-HPEIANIYTHL--LSENTVGKLKRALRLEEINKE--SVES 327 (492) Q Consensus 253 ~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p-~p~La~a~~~~--~~~~~~~al~~~~~l~~~~P~--~~e~ 327 (492) .+.+-. .-....+..+|...++..+|..+++..++..- ...++..|+.. +.++...+...+++.++--|. |.+. T Consensus 1524 cqycd~-~~V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~eHv~~ 1602 (1710) T KOG1070 1524 CQYCDA-YTVHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRKVWIMYADFLLRQNEAEAARELLKRALKSLPKQEHVEF 1602 (1710) T ss_pred HHHCCH-HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 874336-8899998879987623104899999999986532568999999985144788999999999864650236889 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCCC Q ss_conf 9999999998799889999988786609998-9999999999985799899999999996288 Q gi|254781033|r 328 LVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSALHAM 389 (492) Q Consensus 328 ~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~ 389 (492) .---|.+-.+.|+-+.+|..++-.+..+|-. ..|..+.+-+.+ .+|...+|..++|+.+-. T Consensus 1603 IskfAqLEFk~GDaeRGRtlfEgll~ayPKRtDlW~VYid~eik-~~~~~~vR~lfeRvi~l~ 1664 (1710) T KOG1070 1603 ISKFAQLEFKYGDAERGRTLFEGLLSAYPKRTDLWSVYIDMEIK-HGDIKYVRDLFERVIELK 1664 (1710) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHCC T ss_conf 99999987651884356789887776385430279999998870-477799999999998647 No 64 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=98.80 E-value=4.7e-06 Score=65.24 Aligned_cols=162 Identities=11% Similarity=-0.082 Sum_probs=129.4 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 99875322221000168999999726885057999865455215886999999998884212698999999999995257 Q gi|254781033|r 124 LLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKE 203 (492) Q Consensus 124 Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gd 203 (492) ...+-+..+.|+.+.+..+..+.+...++....+.++.....+.|++..|...++++..++|++..++..++-.|.+.|+ T Consensus 70 ~~~a~a~~~~G~a~~~l~~~~~~~~~~~~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l~p~d~~~~~~lgaaldq~Gr 149 (257) T COG5010 70 AKLATALYLRGDADSSLAVLQKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPTDWEAWNLLGAALDQLGR 149 (257) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCC T ss_conf 88777777405555007887554014745499999999999980456889999999843699974666677899997367 Q ss_pred CHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 45699999999950231035789999999999996699899999998754313352799999999999739947899999 Q gi|254781033|r 204 WSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVIL 283 (492) Q Consensus 204 w~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~l 283 (492) .++|..-+.++++..+..+. .....++.+.-.|+...|...+.++.-..+...-+...++.+...+|+.++|+++. T Consensus 150 ~~~Ar~ay~qAl~L~~~~p~----~~nNlgms~~L~gd~~~A~~lll~a~l~~~ad~~v~~NLAl~~~~~g~~~~A~~i~ 225 (257) T COG5010 150 FDEARRAYRQALELAPNEPS----IANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALVVGLQGDFREAEDIA 225 (257) T ss_pred HHHHHHHHHHHHHHCCCCCH----HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHC T ss_conf 56778999999984258844----66658999997277989999999998678876389998999984048867777624 Q ss_pred HHHHHH Q ss_conf 999983 Q gi|254781033|r 284 EKIWKV 289 (492) Q Consensus 284 e~a~~~ 289 (492) .+-... T Consensus 226 ~~e~~~ 231 (257) T COG5010 226 VQELLS 231 (257) T ss_pred CCCCCC T ss_conf 200364 No 65 >KOG1840 consensus Probab=98.79 E-value=3.9e-05 Score=58.07 Aligned_cols=169 Identities=17% Similarity=0.115 Sum_probs=68.4 Q ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC----CCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC---- Q ss_conf 2698999999999995257456999999999502----31035789999999999996699899999998754313---- Q gi|254781033|r 185 PDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNA----KEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLC---- 256 (492) Q Consensus 185 P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~----~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~---- 256 (492) |.-..+...++..|..+|+++.|..+.+.++... ......-+...-..+..+...+...+|...+.+|+..- T Consensus 196 P~~~~~~~~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~ 275 (508) T KOG1840 196 PERLRTLRNLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASMLNILALVYRSLGKYDEAVNLYEEALTIREEVF 275 (508) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHC T ss_conf 43788999999999985207789999999999998703766689999999999999974428999999999999999853 Q ss_pred ----CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH------CCCHHHHHHHHHH-----CCCCHHHHHHHHHHHHHHC Q ss_conf ----352799999999999739947899999999983------2898999999985-----0789589999999999826 Q gi|254781033|r 257 ----DNSIMASICAAKSLISQNKKRKAEVILEKIWKV------NPHPEIANIYTHL-----LSENTVGKLKRALRLEEIN 321 (492) Q Consensus 257 ----p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~------~p~p~La~a~~~~-----~~~~~~~al~~~~~l~~~~ 321 (492) |........++.+|.+.|++++|...++.+... ..|++++..+..+ ..+....+++.+++.+++. T Consensus 276 G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~~~~~~~~~~~Eea~~l~q~al~i~ 355 (508) T KOG1840 276 GEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSELAAILQSMNEYEEAKKLLQKALKIY 355 (508) T ss_pred CCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 89878999999889999860477577899999999999985036858999999899999988520568999999999999 Q ss_pred -----CCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf -----898---36899999999987998899999887866 Q gi|254781033|r 322 -----KES---VESLVIVSKIALEMGSIDQAHAKAMLAMK 353 (492) Q Consensus 322 -----P~~---~e~~~~LA~~~~~~g~~~~Ar~~l~~al~ 353 (492) ++| +.....||+++...|+|.+|++.++++++ T Consensus 356 ~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~ 395 (508) T KOG1840 356 LDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQ 395 (508) T ss_pred HHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 8614641057898998999999994465679999999999 No 66 >PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional Probab=98.79 E-value=1.9e-06 Score=68.26 Aligned_cols=152 Identities=11% Similarity=0.106 Sum_probs=107.8 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 99999999952574569999999995023103578999999999999669989999999875431335279999999999 Q gi|254781033|r 191 TEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSL 270 (492) Q Consensus 191 ~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l 270 (492) ...-...|...+.|++...-+++..+..... ......+.....+..-++.+|++...+.++|+.| T Consensus 27 v~~~~g~Y~~~~~wq~v~~~~~~~~~~l~~~---------------~~~q~~e~~~~~L~~rL~~~Pdd~~gW~~LGr~y 91 (206) T PRK10370 27 VFLCVGSYLLSPKWQAVRAEYQRQADPLRQF---------------ASQQTPEAQLQALQDKIRANPQNSEQWALLGEYY 91 (206) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHCHHHCC---------------CCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 9999754750473999998628750732067---------------8514179999999999983998889999999999 Q ss_pred HHCCCHHHHHHHHHHHHHHCC-CHHHHHHHHHHC---CC--CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHH Q ss_conf 973994789999999998328-989999999850---78--958999999999982689836899999999987998899 Q gi|254781033|r 271 ISQNKKRKAEVILEKIWKVNP-HPEIANIYTHLL---SE--NTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQA 344 (492) Q Consensus 271 ~~~g~~~~A~~~le~a~~~~p-~p~La~a~~~~~---~~--~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~A 344 (492) ...|++..|...++++.+..| ++++...|+... .+ ...+....+++.++.+|+|+.++.++|-.+...++|.+| T Consensus 92 ~~~~~~~~A~~Ay~kA~~L~p~~~~il~~yA~aL~~~~~~~~t~~~~~lL~~AL~lDp~~~~AL~Lla~~AFe~~dY~~A 171 (206) T PRK10370 92 LWRNAYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALMLLASDAFMQADYAQA 171 (206) T ss_pred HHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHH T ss_conf 97689899999999999757998789999999999875987768999999999874989788999999999982649999 Q ss_pred HHHHHHHHHCCCC Q ss_conf 9998878660999 Q gi|254781033|r 345 HAKAMLAMKIAPR 357 (492) Q Consensus 345 r~~l~~al~~~P~ 357 (492) ..++++.++..|. T Consensus 172 I~~Wq~lL~~~~p 184 (206) T PRK10370 172 IELWQKLLDLNSP 184 (206) T ss_pred HHHHHHHHHCCCC T ss_conf 9999999845799 No 67 >KOG0550 consensus Probab=98.78 E-value=3.4e-06 Score=66.36 Aligned_cols=276 Identities=12% Similarity=0.034 Sum_probs=176.6 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99999999972100478799998630134666158999998753222210001689999997268850579998654552 Q gi|254781033|r 86 KALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESC 165 (492) Q Consensus 86 ~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al 165 (492) ++-.+|--....-++..|.+.+..+...- ++..-.+-=.|-..++-|+++.+.-.+++-.+.++...-+........+ T Consensus 51 ~~k~~gn~~yk~k~Y~nal~~yt~Ai~~~--pd~a~yy~nRAa~~m~~~~~~~a~~dar~~~r~kd~~~k~~~r~~~c~~ 128 (486) T KOG0550 51 EAKEEGNAFYKQKTYGNALKNYTFAIDMC--PDNASYYSNRAATLMMLGRFEEALGDARQSVRLKDGFSKGQLREGQCHL 128 (486) T ss_pred HHHHHCCHHHHHHHHHHHHHHHHHHHHHC--CCCHHHHCHHHHHHHHHHHHHHCCCCHHHHEECCCCCCCCCCCHHHHHH T ss_conf 98742521777866888998889999858--4003654416889998776752421166530057874101245224455 Q ss_pred CCCCHHHHHHHHH------------HHHHHCCCC------HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 1588699999999------------888421269------8999999999995257456999999999502310357899 Q gi|254781033|r 166 RIGDLNSAQRYAT------------KALDISPDA------PWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAI 227 (492) Q Consensus 166 ~~gd~~~A~~~~~------------~a~~~~P~~------~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~ 227 (492) ..++..+|...++ ..-.+.|.+ ..+-..-.+++.-.+++++|....-..++..+..... T Consensus 129 a~~~~i~A~~~~~~~~~~~~anal~~~~~~~~s~s~~pac~~a~~lka~cl~~~~~~~~a~~ea~~ilkld~~n~~a--- 205 (486) T KOG0550 129 ALSDLIEAEEKLKSKQAYKAANALPTLEKLAPSHSREPACFKAKLLKAECLAFLGDYDEAQSEAIDILKLDATNAEA--- 205 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH--- T ss_conf 56788989988621016677653122200242346773256788741056543034316778888887516301678--- Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHH------------HHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH- Q ss_conf 999999999966998999999987543133527999------------9999999973994789999999998328989- Q gi|254781033|r 228 LLIARSLENADKGDMIASYHDAIESLKLCDNSIMAS------------ICAAKSLISQNKKRKAEVILEKIWKVNPHPE- 294 (492) Q Consensus 228 l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~------------~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~- 294 (492) ++-++....-..+.+.+...+.++++++|+...+- -.-+.-..+.|++++|...+..++...|+.- T Consensus 206 -l~vrg~~~yy~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le~~k~~gN~~fk~G~y~~A~E~Yteal~idP~n~~ 284 (486) T KOG0550 206 -LYVRGLCLYYNDNADKAINHFQQALRLDPDHQKSKSASMMPKKLEVKKERGNDAFKNGNYRKAYECYTEALNIDPSNKK 284 (486) T ss_pred -HHHCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCC T ss_conf -8861533102320688888876520268025667767640789998875056676344246789988776337932222 Q ss_pred -HHHHH-----HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf -99999-----98507895899999999998268983689999999998799889999988786609998999999999 Q gi|254781033|r 295 -IANIY-----THLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQI 367 (492) Q Consensus 295 -La~a~-----~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~i 367 (492) ++..| ..++.+...+++.-.+..++++|.....++..|+.++..++|++|.+.++++++..-+...-..+-+. T Consensus 285 ~naklY~nra~v~~rLgrl~eaisdc~~Al~iD~syikall~ra~c~l~le~~e~AV~d~~~a~q~~~s~e~r~~l~~A 363 (486) T KOG0550 285 TNAKLYGNRALVNIRLGRLREAISDCNEALKIDSSYIKALLRRANCHLALEKWEEAVEDYEKAMQLEKDCEIRRTLREA 363 (486) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 4689998767664206771656654466663479888999998898889989999999999998630561067899999 No 68 >KOG1156 consensus Probab=98.75 E-value=5.1e-05 Score=57.15 Aligned_cols=53 Identities=15% Similarity=0.039 Sum_probs=27.0 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 78958999999999982689836899999999987998899999887866099 Q gi|254781033|r 304 SENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAP 356 (492) Q Consensus 304 ~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P 356 (492) .++...++..+...+..-|.-+|..++-||++-.+|+.++|...++.+.+.+- T Consensus 384 ~g~~~~A~~yId~AIdHTPTliEly~~KaRI~kH~G~l~eAa~~l~ea~elD~ 436 (700) T KOG1156 384 LGDYEVALEYIDLAIDHTPTLIELYLVKARIFKHAGLLDEAAAWLDEAQELDT 436 (700) T ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 05579999999887525860999999999999844886899998999872452 No 69 >COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones] Probab=98.73 E-value=4.3e-05 Score=57.76 Aligned_cols=119 Identities=13% Similarity=0.135 Sum_probs=88.0 Q ss_pred CHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-CHHHHHHHHHHC-----CCCHHHHHHHH Q ss_conf 989999999875431335279999999999973994789999999998328-989999999850-----78958999999 Q gi|254781033|r 241 DMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP-HPEIANIYTHLL-----SENTVGKLKRA 314 (492) Q Consensus 241 ~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p-~p~La~a~~~~~-----~~~~~~al~~~ 314 (492) +.+.....+...+..+|+++.-+.++++.|...|+...|..-+.++.+..| +|++...|+++. +....+....+ T Consensus 137 ~~~~l~a~Le~~L~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~a~~~~ta~a~~ll 216 (287) T COG4235 137 EMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQAGQQMTAKARALL 216 (287) T ss_pred CHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 59999999999998498872449999999998444347999999998737998899999999999953884449999999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 999982689836899999999987998899999887866099989 Q gi|254781033|r 315 LRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 315 ~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~ 359 (492) ++.++.+|+|..++..||..+.+.|+|.+|...++..++..|... T Consensus 217 ~~al~~D~~~iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~~~ 261 (287) T COG4235 217 RQALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPADD 261 (287) T ss_pred HHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCC T ss_conf 999843973288999999999981459999999999985189998 No 70 >COG3898 Uncharacterized membrane-bound protein [Function unknown] Probab=98.72 E-value=6.1e-05 Score=56.54 Aligned_cols=260 Identities=13% Similarity=0.046 Sum_probs=154.2 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHH-HHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC Q ss_conf 9999999721004787999986301346661589999-987532222100016899999972688505799986545521 Q gi|254781033|r 88 LYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYL-LEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCR 166 (492) Q Consensus 88 l~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~L-l~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~ 166 (492) |..+-.++.+||+..|++.+...... +.--+.-| -.=..++..|+++.+++|-+......|+-.-+.+.++..-.. T Consensus 124 lLeAQaal~eG~~~~Ar~kfeAMl~d---PEtRllGLRgLyleAqr~GareaAr~yAe~Aa~~Ap~l~WA~~AtLe~r~~ 200 (531) T COG3898 124 LLEAQAALLEGDYEDARKKFEAMLDD---PETRLLGLRGLYLEAQRLGAREAARHYAERAAEKAPQLPWAARATLEARCA 200 (531) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHCC---HHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 99988988647618799999987368---388777678899999860218999999999876366771679999999986 Q ss_pred CCCHHHHHHHHHHHHHH---CCCCHHHHHHHH----HHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHC Q ss_conf 58869999999988842---126989999999----99995257456999999999502310357899999999999966 Q gi|254781033|r 167 IGDLNSAQRYATKALDI---SPDAPWVTEAVV----QQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADK 239 (492) Q Consensus 167 ~gd~~~A~~~~~~a~~~---~P~~~~a~~~L~----~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~ 239 (492) .||++.|+++.+...+. .++-..-....+ -......|-..|++.-.+..|..++.-.. ....+..+..+ T Consensus 201 ~gdWd~AlkLvd~~~~~~vie~~~aeR~rAvLLtAkA~s~ldadp~~Ar~~A~~a~KL~pdlvPa----av~AAralf~d 276 (531) T COG3898 201 AGDWDGALKLVDAQRAAKVIEKDVAERSRAVLLTAKAMSLLDADPASARDDALEANKLAPDLVPA----AVVAARALFRD 276 (531) T ss_pred CCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHH----HHHHHHHHHHC T ss_conf 67868899999988887752611677898999988888772488677889999875038763369----99999998742 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHH-HHHHHHHHCCC---HHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 998999999987543133527999999999997399478999-99999983289---89999999850789589999999 Q gi|254781033|r 240 GDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEV-ILEKIWKVNPH---PEIANIYTHLLSENTVGKLKRAL 315 (492) Q Consensus 240 ~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~-~le~a~~~~p~---p~La~a~~~~~~~~~~~al~~~~ 315 (492) ++...+-..+..+.|..|.- ..+.+|. ..+.|+.....- -.++.-...|+ ..+..+-+.+..+++..+....+ T Consensus 277 ~~~rKg~~ilE~aWK~ePHP-~ia~lY~--~ar~gdta~dRlkRa~~L~slk~nnaes~~~va~aAlda~e~~~ARa~Ae 353 (531) T COG3898 277 GNLRKGSKILETAWKAEPHP-DIALLYV--RARSGDTALDRLKRAKKLESLKPNNAESSLAVAEAALDAGEFSAARAKAE 353 (531) T ss_pred CCHHHHHHHHHHHHHCCCCH-HHHHHHH--HHCCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 46243324889987349984-8999998--80479748899999999985284206788999999872256487888899 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHCCCCHH Q ss_conf 99982689836899999999987-998899999887866099989 Q gi|254781033|r 316 RLEEINKESVESLVIVSKIALEM-GSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 316 ~l~~~~P~~~e~~~~LA~~~~~~-g~~~~Ar~~l~~al~~~P~~~ 359 (492) .+....|..... +++|.+.-.. ||-++.|..+.++++. |... T Consensus 354 aa~r~~pres~~-lLlAdIeeAetGDqg~vR~wlAqav~A-PrdP 396 (531) T COG3898 354 AAAREAPRESAY-LLLADIEEAETGDQGKVRQWLAQAVKA-PRDP 396 (531) T ss_pred HHHHHCCHHHHH-HHHHHHHHHCCCCHHHHHHHHHHHHCC-CCCC T ss_conf 875307234689-998778763258647899999998458-8997 No 71 >PRK04841 transcriptional regulator MalT; Provisional Probab=98.71 E-value=6.6e-05 Score=56.28 Aligned_cols=301 Identities=11% Similarity=0.029 Sum_probs=166.4 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCC----CCCHH---HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCH----- Q ss_conf 999999999721004787999986301346----66158---999998753222210001689999997268850----- Q gi|254781033|r 86 KALYTGLMSIAAHNIPLARKMHSYVSQQHT----FHNEY---LVYLLEVQIALAERQYNIAHEKLEMMLQIPATR----- 153 (492) Q Consensus 86 ~al~~gl~al~~Gd~~~A~k~~~~a~~~~~----~~~~~---L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~----- 153 (492) -.+..+++....|++..+...+..+..... ...+. ...++.+..+...|+.+.+....+......+.. T Consensus 411 L~l~~a~~~~~~~~~~~a~~~L~~~~~~~~~~~~~~~~~~~~e~~~lra~~a~~~~~~~~a~~~a~~al~~l~~~~~~~~ 490 (903) T PRK04841 411 LVLLQAWLAQSQHRYSEVNTLLARAEQELKDRNIELDGTLQAEFNALRAQVAINDGDPEEAERLAKLALAELPLTWYYSR 490 (903) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 89999999997869999999999999854136765576788899999999998589999999999999753676347989 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC------HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHH----H Q ss_conf 5799986545521588699999999888421269------89999999999952574569999999995023103----5 Q gi|254781033|r 154 EFAVYSLYFESCRIGDLNSAQRYATKALDISPDA------PWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWN----R 223 (492) Q Consensus 154 ~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~------~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~----~ 223 (492) ..+...+.......|+.+.|+..++.+....... .+.....+.+...+|+...|..++++......... . T Consensus 491 ~~~~~~l~~~~~~~G~l~~a~~~~~~a~~~~~~~~~~~~~~~~~~~~a~i~~~~G~l~~A~~~~~~a~~~~~~~~~~~~~ 570 (903) T PRK04841 491 IVATSVLGEVHHCKGELARALAMMQQTEQMARQHDVYHYALWSLLQQSEILFAQGFLQAAYETQEKAFQLIEEQHLEQLP 570 (903) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCCCC T ss_conf 99999999999977999999999999999999838535799999999999997878999999999999999870455683 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-----CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC----CHH Q ss_conf 7899999999999966998999999987543133-----5279999999999973994789999999998328----989 Q gi|254781033|r 224 NRAILLIARSLENADKGDMIASYHDAIESLKLCD-----NSIMASICAAKSLISQNKKRKAEVILEKIWKVNP----HPE 294 (492) Q Consensus 224 ~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p-----~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p----~p~ 294 (492) ....+....+..+.+.++.+++.....+...... ....+..+.+++....|+..+|...++..-...- |.+ T Consensus 571 ~~~~~~~~~a~~~~e~~~l~~A~~~~~~~l~~~~~~~~~~~l~~~~~lari~~~~g~~~~A~~~l~~~~~l~~~~~~~~~ 650 (903) T PRK04841 571 MHEFLLRIRAQLLWEWARLDEAEQCARKGLEVLSNFQPQQQLQCLAMLARIALARGDLDNARSYLNRLENLLGNGRYHID 650 (903) T ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCCHH T ss_conf 78999999999999848789999999977999873482889999999999999869999999999999999842653244 Q ss_pred ------HHHHHHHHCCCCHHHHHHHHHHHHHHCC------CCHHH--HHHHHHHHHHCCCHHHHHHHHHHHHHC---CCC Q ss_conf ------9999998507895899999999998268------98368--999999999879988999998878660---999 Q gi|254781033|r 295 ------IANIYTHLLSENTVGKLKRALRLEEINK------ESVES--LVIVSKIALEMGSIDQAHAKAMLAMKI---APR 357 (492) Q Consensus 295 ------La~a~~~~~~~~~~~al~~~~~l~~~~P------~~~e~--~~~LA~~~~~~g~~~~Ar~~l~~al~~---~P~ 357 (492) ....+.....++... +.+|+...+ ++... ...++++.+..|++++|...++.+... ... T Consensus 651 ~~~~~~~~~~~~~~~~g~~~~----a~~~l~~~~~~~~~~~~~~~~~~~~~ar~~l~~g~~~~A~~~l~~l~~~a~~~~~ 726 (903) T PRK04841 651 WIANADKVRLIYWQMTGDKEA----IANWLRQAPKPEFANNHFLQGQWRNIARAQILLGQFDEAEIVLEELNENARSLRL 726 (903) T ss_pred HHHHHHHHHHHHHHHCCCHHH----HHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCC T ss_conf 766899999999996799999----9999872887554212677889999999999829979999999999999987275 Q ss_pred H----HHHHHHHHHHHHHCCCHHHHHHHHHHHHC-CCCC Q ss_conf 8----99999999999857998999999999962-8899 Q gi|254781033|r 358 K----EIFLLLAQIEQANSHNTDKILYWTQSALH-AMPD 391 (492) Q Consensus 358 ~----~~~~llA~ie~~e~~d~~~ar~wl~rA~~-a~~d 391 (492) . +.+.+.|.. ....|+...+...+.+|++ |.|+ T Consensus 727 ~~~~~~~~ll~A~a-~~~~g~~~~A~~~l~~AL~lA~~~ 764 (903) T PRK04841 727 MSDLNRNLILLNQL-YWQQGRKSDAQRALLEALKLANRT 764 (903) T ss_pred HHHHHHHHHHHHHH-HHHCCCHHHHHHHHHHHHHHHHHC T ss_conf 34699999999999-998799999999999999976146 No 72 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=98.68 E-value=1.7e-05 Score=60.92 Aligned_cols=175 Identities=14% Similarity=0.049 Sum_probs=103.6 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC Q ss_conf 99999972688505799986545521588699999999888421269899999999999525745699999999950231 Q gi|254781033|r 141 EKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKE 220 (492) Q Consensus 141 ~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~ 220 (492) .++..-...|....+ .++.......||-+..+.+..+.....|+...++..++..+.+.|++..|+..+.++....+. T Consensus 55 al~~~~~~~p~d~~i--~~~a~a~~~~G~a~~~l~~~~~~~~~~~~d~~ll~~~gk~~~~~g~~~~A~~~~rkA~~l~p~ 132 (257) T COG5010 55 ALGAAVLRNPEDLSI--AKLATALYLRGDADSSLAVLQKSAIAYPKDRELLAAQGKNQIRNGNFGEAVSVLRKAARLAPT 132 (257) T ss_pred HHHHHHHCCCCHHHH--HHHHHHHHHCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf 999998068201778--887777774055550078875540147454999999999999804568899999998436999 Q ss_pred HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 03578999999999999669989999999875431335279999999999973994789999999998328989999999 Q gi|254781033|r 221 WNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYT 300 (492) Q Consensus 221 ~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~ 300 (492) .-+ .+...+..+.+.|+.++|+..+.++.++.|..+.+...++-.+.-.|+++.|++++..+...-+ T Consensus 133 d~~----~~~~lgaaldq~Gr~~~Ar~ay~qAl~L~~~~p~~~nNlgms~~L~gd~~~A~~lll~a~l~~~--------- 199 (257) T COG5010 133 DWE----AWNLLGAALDQLGRFDEARRAYRQALELAPNEPSIANNLGMSLLLRGDLEDAETLLLPAYLSPA--------- 199 (257) T ss_pred CHH----HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC--------- T ss_conf 746----6667789999736756778999999984258844666589999972779899999999986788--------- Q ss_pred HHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 8507895899999999998268983689999999998799889999988786 Q gi|254781033|r 301 HLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAM 352 (492) Q Consensus 301 ~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al 352 (492) .+....-.|+.+-...|++++|+....+-+ T Consensus 200 ----------------------ad~~v~~NLAl~~~~~g~~~~A~~i~~~e~ 229 (257) T COG5010 200 ----------------------ADSRVRQNLALVVGLQGDFREAEDIAVQEL 229 (257) T ss_pred ----------------------CCHHHHHHHHHHHHHCCCHHHHHHHCCCCC T ss_conf ----------------------763899989999840488677776242003 No 73 >KOG0548 consensus Probab=98.66 E-value=2.2e-05 Score=60.02 Aligned_cols=204 Identities=12% Similarity=0.070 Sum_probs=142.4 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHH---HHHH Q ss_conf 9986545521588699999999888421269899999999999525745699999999950231035789999---9999 Q gi|254781033|r 157 VYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILL---IARS 233 (492) Q Consensus 157 ~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~---~~~a 233 (492) ...|...+....+++.|++++.++..++ ....-+....-+|...+.+...+..-..+...+.+.+...-.+. ...+ T Consensus 227 ek~lgnaaykkk~f~~a~q~y~~a~el~-~~it~~~n~aA~~~e~~~~~~c~~~c~~a~E~gre~rad~klIak~~~r~g 305 (539) T KOG0548 227 EKELGNAAYKKKDFETAIQHYAKALELA-TDITYLNNIAAVYLERGKYAECIELCEKAVEVGRELRADYKLIAKALARLG 305 (539) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8998899887454899999999998675-666788889889973137987442208899875789999999999999863 Q ss_pred HHHHHCCCHHHHHHHHHHHHHC--------------------------CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 9999669989999999875431--------------------------33527999999999997399478999999999 Q gi|254781033|r 234 LENADKGDMIASYHDAIESLKL--------------------------CDNSIMASICAAKSLISQNKKRKAEVILEKIW 287 (492) Q Consensus 234 ~~~~~~~~~~~A~~~~~~a~~~--------------------------~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~ 287 (492) ..+...++.+.+...+.+++.. +|......-.-+.-++..|++..|.+.+.++. T Consensus 306 ~a~~k~~~~~~ai~~~~kaLte~Rt~~~ls~lk~~Ek~~k~~e~~a~~~pe~A~e~r~kGne~Fk~gdy~~Av~~YteAI 385 (539) T KOG0548 306 NAYTKREDYEGAIKYYQKALTEHRTPDLLSKLKEAEKALKEAERKAYINPEKAEEEREKGNEAFKKGDYPEAVKHYTEAI 385 (539) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 11355776788999999986632588878777899999999888873184678999987788775358899999999998 Q ss_pred HHCCCHHHH---HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 832898999---99998507895899999999998268983689999999998799889999988786609998999 Q gi|254781033|r 288 KVNPHPEIA---NIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIF 361 (492) Q Consensus 288 ~~~p~p~La---~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~ 361 (492) +.+|+.... .++.++..++...+++..+..++++|+..-.++--|.+....++|++|.+.++.+++.+|+..-+ T Consensus 386 kr~P~Da~lYsNRAac~~kL~~~~~aL~Da~~~ieL~p~~~kgy~RKg~al~~mk~ydkAleay~eale~dp~~~e~ 462 (539) T KOG0548 386 KRDPEDARLYSNRAACYLKLGEYPEALKDAKKCIELDPNFIKAYLRKGAALRAMKEYDKALEAYQEALELDPSNAEA 462 (539) T ss_pred HCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 42984157888899999987317989988999875093189999999999999998999999999987349155899 No 74 >KOG3060 consensus Probab=98.63 E-value=5.1e-05 Score=57.18 Aligned_cols=172 Identities=16% Similarity=0.112 Sum_probs=96.4 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 57999865455215886999999998884212698999999999995257456999999999502310357899999999 Q gi|254781033|r 154 EFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARS 233 (492) Q Consensus 154 ~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a 233 (492) +..+--.+..++..|..+-|......+....|++..+...-+.++...|.|++|++.++..+...|.+...+- ... T Consensus 52 w~l~EqV~IAAld~~~~~lAq~C~~~L~~~fp~S~RV~~lkam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~K----RKl 127 (289) T KOG3060 52 WTLYEQVFIAALDTGRDDLAQKCINQLRDRFPGSKRVGKLKAMLLEATGNYKEAIEYYESLLEDDPTDTVIRK----RKL 127 (289) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHH----HHH T ss_conf 8999999999998331378999999999867897067989999999861354599999998446863148999----999 Q ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH-HHHHHHHC-----CCCH Q ss_conf 99996699899999998754313352799999999999739947899999999983289899-99999850-----7895 Q gi|254781033|r 234 LENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEI-ANIYTHLL-----SENT 307 (492) Q Consensus 234 ~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~L-a~a~~~~~-----~~~~ 307 (492) ......|...++.+...+-++.-+.+..++.-++.+|...|++.+|.--+|...-.+||..+ -..|+++. ..+. T Consensus 128 Ailka~GK~l~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~gg~eN~ 207 (289) T KOG3060 128 AILKAQGKNLEAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQGGAENL 207 (289) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 99997188579999999999983576999999999997676899999999999975987189999999999998418889 Q ss_pred HHHHHHHHHHHHHCCCCHHHHH Q ss_conf 8999999999982689836899 Q gi|254781033|r 308 VGKLKRALRLEEINKESVESLV 329 (492) Q Consensus 308 ~~al~~~~~l~~~~P~~~e~~~ 329 (492) .-+.+.+++.++.+|.+..+++ T Consensus 208 ~~arkyy~~alkl~~~~~ral~ 229 (289) T KOG3060 208 ELARKYYERALKLNPKNLRALF 229 (289) T ss_pred HHHHHHHHHHHHHCHHHHHHHH T ss_conf 9999999999872747389999 No 75 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=98.63 E-value=1.9e-05 Score=60.52 Aligned_cols=186 Identities=18% Similarity=0.157 Sum_probs=107.4 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHH---HHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 998654552158869999999988842126989999---99999995257456999999999502310357899999999 Q gi|254781033|r 157 VYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTE---AVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARS 233 (492) Q Consensus 157 ~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~---~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a 233 (492) +...+...+..|++..|...+++.....|.++++.. .++..+-..+++..|...+++-++..|.++...-+. +..+ T Consensus 36 lY~~a~~~~~~~~y~~A~~~fe~l~~~yp~s~~a~~A~l~~a~a~y~~~~y~~A~~~~~~Fi~~yP~~~~~~ya~-y~~a 114 (235) T TIGR03302 36 LYNEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNHPDADYAY-YLRG 114 (235) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH-HHHH T ss_conf 999999999878999999999999986899313799999999999976439999999999999887743499999-9987 Q ss_pred HHHH--------HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCC Q ss_conf 9999--------66998999999987543133527999999999997399478999999999832898999999985078 Q gi|254781033|r 234 LENA--------DKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSE 305 (492) Q Consensus 234 ~~~~--------~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~ 305 (492) .... ++.....|...+.+.++.-|+..-+- +|...+...-..--..++..+-...+.+ T Consensus 115 ~s~~~~~~~~~rDq~~~~~A~~~f~~~i~~yP~S~ya~--------------dA~~~i~~l~~~LA~~e~~ia~~Y~k~~ 180 (235) T TIGR03302 115 LSNYKQIDDSDRDQTAAREAFEAFQELIRRYPNSEYAP--------------DAKKRMDYLRNRLAGKELYVARFYLKRG 180 (235) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99887366544580899999999999998889965799--------------9999999999999999999999999841 Q ss_pred CHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 9589999999999826898---368999999999879988999998878660999 Q gi|254781033|r 306 NTVGKLKRALRLEEINKES---VESLVIVSKIALEMGSIDQAHAKAMLAMKIAPR 357 (492) Q Consensus 306 ~~~~al~~~~~l~~~~P~~---~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~ 357 (492) ....++.+++.+++..|+. ++++..+++++...|+.++|++..+.+-..-|+ T Consensus 181 ~y~aAi~r~~~vl~~Yp~t~~~~eAl~~l~eay~~lg~~~~A~~~~~~l~~n~p~ 235 (235) T TIGR03302 181 AYVAAINRFETVVENYPDTPATEEALARLVEAYLALGLTDLAQDAAAVLGANYPD 235 (235) T ss_pred CCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 7099999999999868998513999999999999869989999999999753989 No 76 >PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional Probab=98.63 E-value=6.9e-05 Score=56.13 Aligned_cols=159 Identities=10% Similarity=-0.006 Sum_probs=75.7 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHH--HHHHCCCCHHH Q ss_conf 9999996699899999998754313352799999999999739947899999999983289899999--99850789589 Q gi|254781033|r 232 RSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANI--YTHLLSENTVG 309 (492) Q Consensus 232 ~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a--~~~~~~~~~~~ 309 (492) .+...+..|++..+...+..+....-+........+......|+...|..-...++...|.-+-..+ -.....++..+ T Consensus 548 aa~~alaAGd~~aa~r~~~~a~~~~a~~~A~~a~i~~~~~~~g~~~~Al~~~r~al~~ap~a~~~~A~~~~~~Q~~d~~q 627 (987) T PRK09782 548 AANTAQAAGNGAARDRWLQQAEQRGLGDNALYWWLHAQRYIPGQPELALNDLTRSLNIAPSANAYVARATIYRQRHNVPA 627 (987) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHH T ss_conf 88877754897899999999875267327789999987630387387888899998608004668999999997368078 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHCC Q ss_conf 9999999998268983689999999998799889999988786609998-999999999998579989999999999628 Q gi|254781033|r 310 KLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDKILYWTQSALHA 388 (492) Q Consensus 310 al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~ar~wl~rA~~a 388 (492) +++.+...+..-|+++...-.+|-...+.++-.+++..++++-...|+. ++-.-++-+... .+|...++..+.+++.. T Consensus 628 a~~~LAeAv~lepd~~~y~A~lGy~la~~~D~Aq~~~~lerAh~~~p~d~~~~~~lAy~~~r-l~D~a~a~~~~r~vID~ 706 (987) T PRK09782 628 AVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLERAHKDLPDDPALIRQLAYVNQR-LDDMAATQHYARLVIDD 706 (987) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHH-HCCHHHHHHHHHHHHHH T ss_conf 99999999817999854115677786139989999999997521577552324788899876-13469999999999865 Q ss_pred CCC Q ss_conf 899 Q gi|254781033|r 389 MPD 391 (492) Q Consensus 389 ~~d 391 (492) ..+ T Consensus 707 ~~~ 709 (987) T PRK09782 707 IDN 709 (987) T ss_pred HHH T ss_conf 420 No 77 >KOG3060 consensus Probab=98.62 E-value=3.3e-05 Score=58.65 Aligned_cols=165 Identities=15% Similarity=0.022 Sum_probs=104.5 Q ss_pred HHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 99972100478799998630134666158999998753222210001689999997268850579998654552158869 Q gi|254781033|r 92 LMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLN 171 (492) Q Consensus 92 l~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~ 171 (492) +.|+-.|....|.+-.......- +...-..-+.|-..+..|++++|.++|..+++++||.++.+..-......+|..- T Consensus 60 IAAld~~~~~lAq~C~~~L~~~f--p~S~RV~~lkam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~KRKlAilka~GK~l 137 (289) T KOG3060 60 IAALDTGRDDLAQKCINQLRDRF--PGSKRVGKLKAMLLEATGNYKEAIEYYESLLEDDPTDTVIRKRKLAILKAQGKNL 137 (289) T ss_pred HHHHHHCCHHHHHHHHHHHHHHC--CCCHHHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCCH T ss_conf 99998331378999999999867--8970679899999998613545999999984468631489999999999718857 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHH---HHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 999999988842126989999999999952574569999999995023103578---99999999999966998999999 Q gi|254781033|r 172 SAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNR---AILLIARSLENADKGDMIASYHD 248 (492) Q Consensus 172 ~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~---a~l~~~~a~~~~~~~~~~~A~~~ 248 (492) .|++.+.+.++..+++.+++.-|.++|...|++..|.-.++..+-..|..+... +.+++..+. ..+...++++ T Consensus 138 ~aIk~ln~YL~~F~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~gg----~eN~~~arky 213 (289) T KOG3060 138 EAIKELNEYLDKFMNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQGG----AENLELARKY 213 (289) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH----HHHHHHHHHH T ss_conf 999999999998357699999999999767689999999999997598718999999999999841----8889999999 Q ss_pred HHHHHHCCCCCHHH Q ss_conf 98754313352799 Q gi|254781033|r 249 AIESLKLCDNSIMA 262 (492) Q Consensus 249 ~~~a~~~~p~~~~a 262 (492) +.+++++.|....+ T Consensus 214 y~~alkl~~~~~ra 227 (289) T KOG3060 214 YERALKLNPKNLRA 227 (289) T ss_pred HHHHHHHCHHHHHH T ss_conf 99998727473899 No 78 >KOG2376 consensus Probab=98.59 E-value=2.4e-05 Score=59.77 Aligned_cols=124 Identities=14% Similarity=0.060 Sum_probs=65.5 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHH------------- Q ss_conf 9999999721004787999986301346661589999987532222100016899999972688505------------- Q gi|254781033|r 88 LYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATRE------------- 154 (492) Q Consensus 88 l~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~------------- 154 (492) +..+.+..-.+..+.|.+...-.. +.+.-++.|.||..-..|+++++.+.|+.+++...... T Consensus 83 fEKAYc~Yrlnk~Dealk~~~~~~-----~~~~~ll~L~AQvlYrl~~ydealdiY~~L~kn~~dd~d~~~r~nl~a~~a 157 (652) T KOG2376 83 FEKAYCEYRLNKLDEALKTLKGLD-----RLDDKLLELRAQVLYRLERYDEALDIYQHLAKNNSDDQDEERRANLLAVAA 157 (652) T ss_pred HHHHHHHHHCCCHHHHHHHHHCCC-----CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 788889987056788999875035-----566188999999999875389999999999855870578999978999887 Q ss_pred ------------------HHHHHHHHHHHCCCCHHHHHHHHHHHHHH-------CCCC--------HHHHHHHHHHHHHC Q ss_conf ------------------79998654552158869999999988842-------1269--------89999999999952 Q gi|254781033|r 155 ------------------FAVYSLYFESCRIGDLNSAQRYATKALDI-------SPDA--------PWVTEAVVQQYVLA 201 (492) Q Consensus 155 ------------------~~~~gL~~~al~~gd~~~A~~~~~~a~~~-------~P~~--------~~a~~~L~~l~~~~ 201 (492) .-+...+......|++.+|++.++++..+ ...+ ..+...|+-++... T Consensus 158 ~l~~~~~q~v~~v~e~syel~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~ 237 (652) T KOG2376 158 ALQVQLLQSVPEVPEDSYELLYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQ 237 (652) T ss_pred HHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 40477987455888542999999999998503489999999999998887631466530668888778999999999994 Q ss_pred CCCHHHHHHHHHHHH Q ss_conf 574569999999995 Q gi|254781033|r 202 KEWSRAITFLNQKKK 216 (492) Q Consensus 202 gdw~~A~~~l~~~~~ 216 (492) |+-++|...+...++ T Consensus 238 Gqt~ea~~iy~~~i~ 252 (652) T KOG2376 238 GQTAEASSIYVDIIK 252 (652) T ss_pred CCHHHHHHHHHHHHH T ss_conf 330878999999987 No 79 >pfam04733 Coatomer_E Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex. Probab=98.55 E-value=0.00018 Score=52.97 Aligned_cols=256 Identities=11% Similarity=0.024 Sum_probs=144.0 Q ss_pred HHCCHHHHHHHHHHHHCCCCCCCCHH---HHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 72100478799998630134666158---999998753222210001689999997268850579998654552158869 Q gi|254781033|r 95 IAAHNIPLARKMHSYVSQQHTFHNEY---LVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLN 171 (492) Q Consensus 95 l~~Gd~~~A~k~~~~a~~~~~~~~~~---L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~ 171 (492) +-.|++..+....... ..++... ..++..++.| .|++.....- ......+...+++.|+...-...+.+ T Consensus 12 Fy~G~Yq~~i~ea~~~---~~s~~~~~e~~~~~~Ra~IA--lg~~~~vl~e---i~~~~~p~L~av~~lA~y~~~~~~~~ 83 (290) T pfam04733 12 FYLGNYQKAINESDVT---SLSEEALVERDVYMYRSYLA--LGSYQIVISE---IKESAATPLQAVRLLAEYLNSPSRKE 83 (290) T ss_pred HHHHHHHHHHHHHHCC---CCCCHHHHHHHHHHHHHHHH--CCCHHHHHHH---CCCCCCCHHHHHHHHHHHHCCCCCHH T ss_conf 9966999999998636---89913579999999999997--3770378886---26789831799999999874841145 Q ss_pred HHHHHHHHHHH--HCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 99999998884--2126989999999999952574569999999995023103578999999999999669989999999 Q gi|254781033|r 172 SAQRYATKALD--ISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDA 249 (492) Q Consensus 172 ~A~~~~~~a~~--~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~ 249 (492) .+...+.+... ..+.++.+....+.++...|+.++|++++.. . .....+ ......++..++.+.|.+.+ T Consensus 84 ~~~~~l~~~~~~~~~~~~~~~~~~aati~~~~g~~e~AL~~l~~----~---~~lE~~--al~VQi~L~~nR~DLA~ke~ 154 (290) T pfam04733 84 SILASLKEWVADSHIGSNSTLRLLAAIIFIHEGDFDDALKHLHK----G---ENLEAM--ALNVQILLKMHRIDLAEQQL 154 (290) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHC----C---CCHHHH--HHHHHHHHHCCCHHHHHHHH T ss_conf 69999999987401566789999999999977999999999855----8---966799--99999999728836799999 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHCC--CHHHHHHHHHHHHHH-CCCHHHH--HHHHHHCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 8754313352799999999999739--947899999999983-2898999--9999850789589999999999826898 Q gi|254781033|r 250 IESLKLCDNSIMASICAAKSLISQN--KKRKAEVILEKIWKV-NPHPEIA--NIYTHLLSENTVGKLKRALRLEEINKES 324 (492) Q Consensus 250 ~~a~~~~p~~~~a~~~~a~~l~~~g--~~~~A~~~le~a~~~-~p~p~La--~a~~~~~~~~~~~al~~~~~l~~~~P~~ 324 (492) +.+.+.+.+.+-.-+.-+-+.+..| ++..|--+++..-.. .+.|-+. .+..++..++..++...++..+.+.|+| T Consensus 155 ~~~q~~~eDs~l~qLa~awv~l~~Ggek~q~A~yif~EL~~~~~~t~lllng~Av~~m~~~~~~eAe~~L~eAl~~d~~d 234 (290) T pfam04733 155 KKMQQIDEDATLTQLANAWVKLAVGGEKIQDAYYIFQEFSEKYDSTPLLLNGQAVCCMCLGRYEEAESLLKEALDKDAKD 234 (290) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC T ss_conf 99884351059999999999997264778989999999985228978999999999987478799999999998728898 Q ss_pred HHHHHHHHHHHHHCCCHH-HHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 368999999999879988-9999988786609998999999999 Q gi|254781033|r 325 VESLVIVSKIALEMGSID-QAHAKAMLAMKIAPRKEIFLLLAQI 367 (492) Q Consensus 325 ~e~~~~LA~~~~~~g~~~-~Ar~~l~~al~~~P~~~~~~llA~i 367 (492) ++....+..+....|.-. ...+++.++-..+|+.+.+.-+.+. T Consensus 235 ~dtlaNliv~s~~~gK~~e~~~ryl~qL~~~~p~H~~v~~~~~k 278 (290) T pfam04733 235 PETLINLVVCALHLGKPAEVSNRNLSQLKLSHPTHPLVKDLNEK 278 (290) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 89999999999980998789999999998629998599999999 No 80 >PRK11189 lipoprotein NlpI; Provisional Probab=98.55 E-value=0.00018 Score=52.93 Aligned_cols=186 Identities=13% Similarity=0.067 Sum_probs=91.8 Q ss_pred CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 88699999999888421269899999999999525745699999999950231035789999999999996699899999 Q gi|254781033|r 168 GDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYH 247 (492) Q Consensus 168 gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~ 247 (492) |=..-|+--+.++++++|+.+.+...|+..+.+.|+++.|.+.++..+...|++.-. .+.++....-.+..+.|.. T Consensus 79 GLr~LAR~DF~qAL~l~P~~adayN~LGiy~tq~~~fD~AyEAFDs~LEL~P~y~YA----~lNRGIAlYYg~R~~LA~~ 154 (297) T PRK11189 79 GLRALARNDFSQALALRPDMADAYNYLGIYLTQEGEFDAAYEAFDSVLELDPTYDYA----YLNRGIALYYGGRYDLAQD 154 (297) T ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHH----HHHHHHHHHHCCCHHHHHH T ss_conf 209999864998975399858999999999999711999999987777269873489----9830278874463788999 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCC--CHHHHHHHHHHHHHHCCCC- Q ss_conf 9987543133527999999999997399478999999999832898999999985078--9589999999999826898- Q gi|254781033|r 248 DAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSE--NTVGKLKRALRLEEINKES- 324 (492) Q Consensus 248 ~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~--~~~~al~~~~~l~~~~P~~- 324 (492) .+.+....+|+.+--++.+ -+-...-+..+|..-|.+..+..-+..-+.....+..| +..+.++.+..-++.+..- T Consensus 155 D~~~fy~~dp~DpYR~LWL-Yl~e~~~d~~~A~~~L~~r~~~~~~~~W~~~iV~~ylg~ise~~~~~~~~~~~~~n~~lA 233 (297) T PRK11189 155 DLLAFYQDDPNDPYRALWL-YLIEQKLDPKQAKENLKQRYEKLDKEQWGWNIVDFYLGKISEEQLFERLKAGATDNTELA 233 (297) T ss_pred HHHHHHHCCCCCCHHHHHH-HHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHH T ss_conf 9999996099982799999-999811699999999999998357122689999999458999999999987273709999 Q ss_pred ---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH Q ss_conf ---3689999999998799889999988786609998 Q gi|254781033|r 325 ---VESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK 358 (492) Q Consensus 325 ---~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~ 358 (492) .|+..-||..+...|++++|..++.-++..+.-. T Consensus 234 ErLcEaYFYLaK~~~~~G~~~~A~~~fKLAla~NVye 270 (297) T PRK11189 234 ERLCETYFYLAKYYLSLGDLDEAANLFKLALANNVYN 270 (297) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHH T ss_conf 9999999999999998799899999999987048258 No 81 >KOG0550 consensus Probab=98.55 E-value=2.2e-05 Score=60.00 Aligned_cols=176 Identities=19% Similarity=0.143 Sum_probs=135.3 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHH----------- Q ss_conf 999987532222100016899999972688505799986545521588699999999888421269899----------- Q gi|254781033|r 122 VYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWV----------- 190 (492) Q Consensus 122 ~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a----------- 190 (492) ..+|.++.....|+++.+...--..++.+++...++.--....--.++.+.|..++++++.+.|++... T Consensus 171 a~~lka~cl~~~~~~~~a~~ea~~ilkld~~n~~al~vrg~~~yy~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le 250 (486) T KOG0550 171 AKLLKAECLAFLGDYDEAQSEAIDILKLDATNAEALYVRGLCLYYNDNADKAINHFQQALRLDPDHQKSKSASMMPKKLE 250 (486) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHH T ss_conf 88741056543034316778888887516301678886153310232068888887652026802566776764078999 Q ss_pred -HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf -9999999995257456999999999502310357899999999999966998999999987543133527999999999 Q gi|254781033|r 191 -TEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKS 269 (492) Q Consensus 191 -~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~ 269 (492) ...-+.-....|.+..|.+.+..++...|.+....+.++..++......+...++.....+++++++..+.+....|.. T Consensus 251 ~~k~~gN~~fk~G~y~~A~E~Yteal~idP~n~~~naklY~nra~v~~rLgrl~eaisdc~~Al~iD~syikall~ra~c 330 (486) T KOG0550 251 VKKERGNDAFKNGNYRKAYECYTEALNIDPSNKKTNAKLYGNRALVNIRLGRLREAISDCNEALKIDSSYIKALLRRANC 330 (486) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 88750566763442467899887763379322224689998767664206771656654466663479888999998898 Q ss_pred HHHCCCHHHHHHHHHHHHHHCCCHHHHH Q ss_conf 9973994789999999998328989999 Q gi|254781033|r 270 LISQNKKRKAEVILEKIWKVNPHPEIAN 297 (492) Q Consensus 270 l~~~g~~~~A~~~le~a~~~~p~p~La~ 297 (492) +...++..+|...++++.+..-++++-. T Consensus 331 ~l~le~~e~AV~d~~~a~q~~~s~e~r~ 358 (486) T KOG0550 331 HLALEKWEEAVEDYEKAMQLEKDCEIRR 358 (486) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCHHH T ss_conf 8899899999999999986305610678 No 82 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=98.54 E-value=6e-06 Score=64.45 Aligned_cols=143 Identities=15% Similarity=0.181 Sum_probs=102.2 Q ss_pred HHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC---CHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 04787999986301346661589999987532222100016899999972688---505799986545521588699999 Q gi|254781033|r 99 NIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPA---TREFAVYSLYFESCRIGDLNSAQR 175 (492) Q Consensus 99 d~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~---~~~~~~~gL~~~al~~gd~~~A~~ 175 (492) .+..+.-...|+.........+.++.-.+.+++..|++.+|.+.|.+.++..+ .+...+..++......|+++.|+. T Consensus 14 ~ftv~ad~~~k~~p~~~~~k~A~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le~dp~D~s~ilyNiGlIY~~~G~~~kAie 93 (172) T PRK02603 14 SFTVMADLILKILPINKKAKEAFVYYRDGMSAQNDGDYAEALENYEEALKLEEDPNDRSYILYNMAIIYASNGEHDKALE 93 (172) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 48999999999815886336899999722577745799999999999872778962089999779999998699889999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 99988842126989999999999952574569999999995023103578999999999999669989999999875431 Q gi|254781033|r 176 YATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKL 255 (492) Q Consensus 176 ~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~ 255 (492) ++.++++++|+.+.++..++.+|..+|+- |.+ .+.. ...+...+.|.+.|.+|+++ T Consensus 94 ~Y~kALeiNP~~~~AlNNlgvIY~k~G~~--a~~-------~~~~---------------~~~~~~fdkAa~yw~kAI~l 149 (172) T PRK02603 94 TYHQALELNPKQPSALNNIAVIYHKRGRK--AEE-------AGDQ---------------DEAEAWFDKAAEYWKQAIRL 149 (172) T ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHH--HHH-------CCCH---------------HHHHHHHHHHHHHHHHHHHH T ss_conf 99999986899789999999999998599--998-------4587---------------78999999999999998850 Q ss_pred CCCCHHHHHH Q ss_conf 3352799999 Q gi|254781033|r 256 CDNSIMASIC 265 (492) Q Consensus 256 ~p~~~~a~~~ 265 (492) +|+.-..+.. T Consensus 150 ~P~nyi~aqn 159 (172) T PRK02603 150 APNNYIEAQN 159 (172) T ss_pred CCCCHHHHHH T ss_conf 9987099999 No 83 >PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional Probab=98.51 E-value=5e-05 Score=57.25 Aligned_cols=103 Identities=12% Similarity=0.018 Sum_probs=46.1 Q ss_pred CCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH---CCCCHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 66158999998753222210001689999997268850579998654552---158869999999988842126989999 Q gi|254781033|r 116 FHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESC---RIGDLNSAQRYATKALDISPDAPWVTE 192 (492) Q Consensus 116 ~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al---~~gd~~~A~~~~~~a~~~~P~~~~a~~ 192 (492) .+++.-..++.++.+...|+++.|...|.++.+-.+.....+.+.+.... ......++...++++++++|+++.++. T Consensus 77 ~Pdd~~gW~~LGr~y~~~~~~~~A~~Ay~kA~~L~p~~~~il~~yA~aL~~~~~~~~t~~~~~lL~~AL~lDp~~~~AL~ 156 (206) T PRK10370 77 NPQNSEQWALLGEYYLWRNAYDNALLAYRQALQLRGENAELYAALATVLYYQAGQHMTPQTREMIDKALALDANEVTALM 156 (206) T ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCHHHHH T ss_conf 99888999999999997689899999999999757998789999999999875987768999999999874989788999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHCC Q ss_conf 99999995257456999999999502 Q gi|254781033|r 193 AVVQQYVLAKEWSRAITFLNQKKKNA 218 (492) Q Consensus 193 ~L~~l~~~~gdw~~A~~~l~~~~~~~ 218 (492) .++-....++||.+|+...++.++.. T Consensus 157 Lla~~AFe~~dY~~AI~~Wq~lL~~~ 182 (206) T PRK10370 157 LLASDAFMQADYAQAIELWQKLLDLN 182 (206) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHCC T ss_conf 99999998264999999999998457 No 84 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=98.50 E-value=7.4e-06 Score=63.74 Aligned_cols=95 Identities=11% Similarity=0.081 Sum_probs=56.3 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH-HHHHH--HHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC Q ss_conf 9999999997399478999999999832898-99999--99850789589999999999826898368999999999879 Q gi|254781033|r 263 SICAAKSLISQNKKRKAEVILEKIWKVNPHP-EIANI--YTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMG 339 (492) Q Consensus 263 ~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p-~La~a--~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g 339 (492) ...++..+.+.|++.+|...++++.+.+|.. ..... ......++...+++.+++.++.+|++++.+..+|.++...| T Consensus 3 ~~~lg~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~~~~~~la~~~~~~~ 82 (100) T cd00189 3 LLNLGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNAKAYYNLGLAYYKLG 82 (100) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCC T ss_conf 99999999984999999999999998599989999999999998598589999999999869898999999999999949 Q ss_pred CHHHHHHHHHHHHHCCCC Q ss_conf 988999998878660999 Q gi|254781033|r 340 SIDQAHAKAMLAMKIAPR 357 (492) Q Consensus 340 ~~~~Ar~~l~~al~~~P~ 357 (492) ++++|...++++++.+|+ T Consensus 83 ~~~~A~~~~~~al~~~P~ 100 (100) T cd00189 83 KYEEALEAYEKALELDPN 100 (100) T ss_pred CHHHHHHHHHHHHHHCCC T ss_conf 999999999999974908 No 85 >PRK04841 transcriptional regulator MalT; Provisional Probab=98.49 E-value=0.00024 Score=51.86 Aligned_cols=29 Identities=17% Similarity=0.395 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 68999999999879988999998878660 Q gi|254781033|r 326 ESLVIVSKIALEMGSIDQAHAKAMLAMKI 354 (492) Q Consensus 326 e~~~~LA~~~~~~g~~~~Ar~~l~~al~~ 354 (492) +.++..+.++...|+..+|.+.+.++++. T Consensus 732 ~~~ll~A~a~~~~g~~~~A~~~l~~AL~l 760 (903) T PRK04841 732 RNLILLNQLYWQQGRKSDAQRALLEALKL 760 (903) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99999999999879999999999999997 No 86 >COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones] Probab=98.45 E-value=3.2e-05 Score=58.78 Aligned_cols=126 Identities=13% Similarity=-0.005 Sum_probs=106.4 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 15886999999998884212698999999999995257456999999999502310357899999999999966998999 Q gi|254781033|r 166 RIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIAS 245 (492) Q Consensus 166 ~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A 245 (492) .....+.-..-++.-+..+|++..-+..|+.+|...++.+.|...|.++.+..++.......+....... .+..+..++ T Consensus 134 ~~~~~~~l~a~Le~~L~~nP~d~egW~~Lg~~ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aeaL~~~-a~~~~ta~a 212 (287) T COG4235 134 AEQEMEALIARLETHLQQNPGDAEGWDLLGRAYMALGRASDALLAYRNALRLAGDNPEILLGLAEALYYQ-AGQQMTAKA 212 (287) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH-CCCCCCHHH T ss_conf 7335999999999999849887244999999999844434799999999873799889999999999995-388444999 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 99998754313352799999999999739947899999999983289 Q gi|254781033|r 246 YHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH 292 (492) Q Consensus 246 ~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~ 292 (492) ...++++++++|+++.+...+|..++++|++.+|...++..++..|- T Consensus 213 ~~ll~~al~~D~~~iral~lLA~~afe~g~~~~A~~~Wq~lL~~lp~ 259 (287) T COG4235 213 RALLRQALALDPANIRALSLLAFAAFEQGDYAEAAAAWQMLLDLLPA 259 (287) T ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 99999998439732889999999999814599999999999851899 No 87 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=98.39 E-value=7.1e-06 Score=63.84 Aligned_cols=59 Identities=20% Similarity=0.248 Sum_probs=25.7 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC Q ss_conf 65455215886999999998884212698999999999995257456999999999502 Q gi|254781033|r 160 LYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNA 218 (492) Q Consensus 160 L~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~ 218 (492) ++....+.|+++.|+..+++++++.|+++.++..++.++...|+++.|+..+.+.++.. T Consensus 6 lg~~~~~~g~~~~A~~~~~~al~~~p~~~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~ 64 (100) T cd00189 6 LGNLYYKLGDYDEALEYYEKALELDPDNADAYYNLAAAYYKLGKYEEALEDYEKALELD 64 (100) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 99999984999999999999998599989999999999998598589999999999869 No 88 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=98.39 E-value=0.00037 Score=50.46 Aligned_cols=178 Identities=13% Similarity=0.056 Sum_probs=104.8 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHH---HHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 998654552158869999999988842126989999---99999995257456999999999502310357899999999 Q gi|254781033|r 157 VYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTE---AVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARS 233 (492) Q Consensus 157 ~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~---~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a 233 (492) +...+..++..|++..|.+.++++....|.++++.. .++..+-..+++++|+..+++-++..|.++....+.. ..+ T Consensus 35 ~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~~A~qA~L~~ayayy~~~~y~~A~~~~~rFi~lyP~~~~v~Ya~Y-l~g 113 (243) T PRK10866 35 IYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFMRLNPTHPNIDYVMY-MRG 113 (243) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHH-HHH T ss_conf 9999999998789999999999999868997678999999999987307789999999999986789997389999-988 Q ss_pred HHHHH------------------CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH Q ss_conf 99996------------------699899999998754313352799999999999739947899999999983289899 Q gi|254781033|r 234 LENAD------------------KGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEI 295 (492) Q Consensus 234 ~~~~~------------------~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~L 295 (492) +.... .+....|.+.+.+.++.-|+..-+.- |..-+...-..--..++ T Consensus 114 l~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~f~~li~~yP~S~Ya~d--------------A~~r~~~lrn~LA~~El 179 (243) T PRK10866 114 LTNMALDDSALQGFFGVDRSDRDPQHARAAFSDFSKLVRGYPNSQYTTD--------------ATKRLVFLKDRLAKYEY 179 (243) T ss_pred HHHHHCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHH--------------HHHHHHHHHHHHHHHHH T ss_conf 9877401358888743256430769999999999999987899845899--------------99999999999999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99999850789589999999999826898---3689999999998799889999988 Q gi|254781033|r 296 ANIYTHLLSENTVGKLKRALRLEEINKES---VESLVIVSKIALEMGSIDQAHAKAM 349 (492) Q Consensus 296 a~a~~~~~~~~~~~al~~~~~l~~~~P~~---~e~~~~LA~~~~~~g~~~~Ar~~l~ 349 (492) -.+--.++.+....++.|++.+++..|+. +|++..+++++...|..++|.+... T Consensus 180 ~Va~fY~kr~~y~AAi~R~~~vi~~Yp~t~~~~eAL~~l~eaY~~Lgl~~~A~~~~~ 236 (243) T PRK10866 180 SVAEYYTKRGAWVAVVNRVEGMLRDYPDTQATRDALPLMENAYRQMQLNAQADKVAK 236 (243) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 999999992656999999999997689996339999999999998699789999999 No 89 >KOG4162 consensus Probab=98.38 E-value=0.00044 Score=49.85 Aligned_cols=302 Identities=13% Similarity=0.067 Sum_probs=176.1 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHH-HHHHHHHHHCCHHHHHHHHHHHHCC-----CCHHHHH-- Q ss_conf 9999999997210047879999863013466615899999-8753222210001689999997268-----8505799-- Q gi|254781033|r 86 KALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLL-EVQIALAERQYNIAHEKLEMMLQIP-----ATREFAV-- 157 (492) Q Consensus 86 ~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll-~A~aA~~~gd~~~A~~~~~~~~~~~-----~~~~~~~-- 157 (492) ...+.++--.++|-...|....+.......-+..+...|+ ..+....-+...++.+|-.++++.- .-...++ T Consensus 359 ~w~~~als~saag~~s~Av~ll~~~~~~~~~ps~~s~~Lmasklc~e~l~~~eegldYA~kai~~~~~~~~~l~~~~~l~ 438 (799) T KOG4162 359 RWYQLALSYSAAGSDSKAVNLLRESLKKSEQPSDISVLLMASKLCIERLKLVEEGLDYAQKAISLLGGQRSHLKPRGYLF 438 (799) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998777887146448888777641565688843699999999875543156678899999987554453111568999 Q ss_pred HHHHH--HHH-------CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 98654--552-------158869999999988842126989999999999952574569999999995023103578999 Q gi|254781033|r 158 YSLYF--ESC-------RIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAIL 228 (492) Q Consensus 158 ~gL~~--~al-------~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l 228 (492) .|+.. .+. +.--..++++.++++.+.+|+++.+..-+.--|..+++-+.|.+...+.++..+... ... T Consensus 439 lGi~y~~~A~~a~~~seR~~~h~kslqale~av~~d~~dp~~if~lalq~A~~R~l~sAl~~~~eaL~l~~~~~---~~~ 515 (799) T KOG4162 439 LGIAYGFQARQANLKSERDALHKKSLQALEEAVQFDPTDPLVIFYLALQYAEQRQLTSALDYAREALALNRGDS---AKA 515 (799) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC---HHH T ss_conf 99887767652788678999999999999999861899824899999999999769999999999987157764---889 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCC---------------------------------- Q ss_conf 9999999996699899999998754313352799999999999739---------------------------------- Q gi|254781033|r 229 LIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQN---------------------------------- 274 (492) Q Consensus 229 ~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g---------------------------------- 274 (492) +.-.+...-..++..+|......++...|++......-..+-...| T Consensus 516 whLLALvlSa~kr~~~Al~vvd~al~E~~~N~~l~~~~~~i~~~~~~~e~~l~t~~~~L~~we~~~~~q~~~~~g~~~~l 595 (799) T KOG4162 516 WHLLALVLSAQKRLKEALDVVDAALEEFGDNHVLMDGKIHIELTFNDREEALDTCIHKLALWEAEYGVQQTLDEGKLLRL 595 (799) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999622416778999999999741222001011101221165888888899999998731167665413456654 Q ss_pred ---------CHHHHH-------HHHHHHHHH------------CCCHH----------HHHHHHHHCCCCHHHHHHHHHH Q ss_conf ---------947899-------999999983------------28989----------9999998507895899999999 Q gi|254781033|r 275 ---------KKRKAE-------VILEKIWKV------------NPHPE----------IANIYTHLLSENTVGKLKRALR 316 (492) Q Consensus 275 ---------~~~~A~-------~~le~a~~~------------~p~p~----------La~a~~~~~~~~~~~al~~~~~ 316 (492) +..+|. ..+..-.+. .|.|+ +..+-.....++..++...+.. T Consensus 596 k~~l~la~~q~~~a~s~sr~ls~l~a~~~~~~~se~~Lp~s~~~~~~~~~~~~~~~lwllaa~~~~~~~~~~~a~~CL~E 675 (799) T KOG4162 596 KAGLHLALSQPTDAISTSRYLSSLVASQLKSAGSELKLPSSTVLPGPDSLWYLLQKLWLLAADLFLLSGNDDEARSCLLE 675 (799) T ss_pred HCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 12114376565453034689889998655421561016741015788732789999999999998853883288999999 Q ss_pred HHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHHCCCHHH--HHHHHHHHHCCCCC Q ss_conf 998268983689999999998799889999988786609998-9999999999985799899--99999999628899 Q gi|254781033|r 317 LEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQANSHNTDK--ILYWTQSALHAMPD 391 (492) Q Consensus 317 l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e~~d~~~--ar~wl~rA~~a~~d 391 (492) ..+..|-.+..+...|..+...|++.+|.+.+..++..+|+. +....+|++....| +..- .|..|.-|++-.|+ T Consensus 676 a~~~~~l~~~~~~~~G~~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Ala~~lle~G-~~~la~~~~~L~dalr~dp~ 752 (799) T KOG4162 676 ASKIDPLSASVYYLRGLLLEVKGQLEEAKEAFLVALALDPDHVPSMTALAELLLELG-SPRLAEKRSLLSDALRLDPL 752 (799) T ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHCCC T ss_conf 875031338999986489998774689999999987328887477899999999718-84207789999999860887 No 90 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=98.35 E-value=2.6e-05 Score=59.41 Aligned_cols=100 Identities=15% Similarity=0.042 Sum_probs=34.3 Q ss_pred HHHHHHHHHHCCCCHHH--HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 99999888421269899--9999999995257456999999999502310357899999999999966998999999987 Q gi|254781033|r 174 QRYATKALDISPDAPWV--TEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIE 251 (492) Q Consensus 174 ~~~~~~a~~~~P~~~~a--~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~ 251 (492) ...+.+++.+.|+...+ ....+..+...|++++|+..+.++++..++... ++..++..+..+...|+.+.|++.+.+ T Consensus 19 ad~~~k~~p~~~~~k~A~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le~dp~D-~s~ilyNiGlIY~~~G~~~kAie~Y~k 97 (172) T PRK02603 19 ADLILKILPINKKAKEAFVYYRDGMSAQNDGDYAEALENYEEALKLEEDPND-RSYILYNMAIIYASNGEHDKALETYHQ 97 (172) T ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 9999998158863368999997225777457999999999998727789620-899997799999986998899999999 Q ss_pred HHHCCCCCHHHHHHHHHHHHHCC Q ss_conf 54313352799999999999739 Q gi|254781033|r 252 SLKLCDNSIMASICAAKSLISQN 274 (492) Q Consensus 252 a~~~~p~~~~a~~~~a~~l~~~g 274 (492) |+.+.|+++.+...+|-++..+| T Consensus 98 ALeiNP~~~~AlNNlgvIY~k~G 120 (172) T PRK02603 98 ALELNPKQPSALNNIAVIYHKRG 120 (172) T ss_pred HHHHCCCCHHHHHHHHHHHHHHH T ss_conf 99868997899999999999985 No 91 >KOG2376 consensus Probab=98.34 E-value=0.00053 Score=49.24 Aligned_cols=165 Identities=15% Similarity=0.102 Sum_probs=95.5 Q ss_pred CHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHCCCHHHHHHHHH--------HHHHHCCCHHHHHH----HHHHCCC-C Q ss_conf 9899999998754313352-7999999999997399478999999--------99983289899999----9985078-9 Q gi|254781033|r 241 DMIASYHDAIESLKLCDNS-IMASICAAKSLISQNKKRKAEVILE--------KIWKVNPHPEIANI----YTHLLSE-N 306 (492) Q Consensus 241 ~~~~A~~~~~~a~~~~p~~-~~a~~~~a~~l~~~g~~~~A~~~le--------~a~~~~p~p~La~a----~~~~~~~-~ 306 (492) ....+.+.+..-....|.. ....++.+.+.+.+|+.+.|..++. .......-|.++.+ |..+... . T Consensus 356 ~~~ka~e~L~~~~~~~p~~s~~v~L~~aQl~is~gn~~~A~~il~~~~~~~~ss~~~~~~~P~~V~aiv~l~~~~~~~~~ 435 (652) T KOG2376 356 KHKKAIELLLQFADGHPEKSKVVLLLRAQLKISQGNPEVALEILSLFLESWKSSILEAKHLPGTVGAIVALYYKIKDNDS 435 (652) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCC T ss_conf 87644999998741587303999999999998658988999999877666665445533570489999999972257750 Q ss_pred HHHHHHHHHHHH-HHCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH--HHHHHHHHHHHHCCCHHHHHH Q ss_conf 589999999999-826898---36899999999987998899999887866099989--999999999985799899999 Q gi|254781033|r 307 TVGKLKRALRLE-EINKES---VESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE--IFLLLAQIEQANSHNTDKILY 380 (492) Q Consensus 307 ~~~al~~~~~l~-~~~P~~---~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~--~~~llA~ie~~e~~d~~~ar~ 380 (492) ..+-+...-+|. +..+.. -..+-.+|..-+..|+-.+|...++++++.+|... .+.+...... =|..++.+ T Consensus 436 a~~vl~~Ai~~~~~~~t~s~~l~~~~~~aa~f~lr~G~~~ea~s~leel~k~n~~d~~~l~~lV~a~~~---~d~eka~~ 512 (652) T KOG2376 436 ASAVLDSAIKWWRKQQTGSIALLSLMREAAEFKLRHGNEEEASSLLEELVKFNPNDTDLLVQLVTAYAR---LDPEKAES 512 (652) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH---CCHHHHHH T ss_conf 899999999999973443167776899976888861756778999999987588618999999999983---28888877 Q ss_pred HHHHHHCCCCCCCEEC-C-CCCCCHHHCCCCCC Q ss_conf 9999962889995275-6-98955424224837 Q gi|254781033|r 381 WTQSALHAMPDPLWIS-D-DGYLSSVWLPLSPI 411 (492) Q Consensus 381 wl~rA~~a~~dp~W~c-~-~g~~~~~W~~~c~~ 411 (492) |.| .-+|++.-+. + +---..+|.+.+.+ T Consensus 513 -l~k--~L~p~~~l~~vdVd~LE~s~ga~~~~~ 542 (652) T KOG2376 513 -LSK--KLPPLKGLKAVDVDALEKSDGAKYSEA 542 (652) T ss_pred -HHH--CCCCCCCCHHCCCHHHHHCCCCCHHHH T ss_conf -762--279854310057247664147533443 No 92 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=98.27 E-value=0.00061 Score=48.78 Aligned_cols=187 Identities=9% Similarity=-0.032 Sum_probs=121.5 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHH-----H Q ss_conf 99999999999721004787999986301346-66158999998753222210001689999997268850579-----9 Q gi|254781033|r 84 GYKALYTGLMSIAAHNIPLARKMHSYVSQQHT-FHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFA-----V 157 (492) Q Consensus 84 ~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~-~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~-----~ 157 (492) .......|+..+..||+..|.+.+.......+ ++....+.+..|.+....|++..|...|+..++.-|+.... + T Consensus 33 ~~~lY~~a~~~~~~~~y~~A~~~fe~l~~~yp~s~~a~~A~l~~a~a~y~~~~y~~A~~~~~~Fi~~yP~~~~~~ya~y~ 112 (235) T TIGR03302 33 AEELYNEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNHPDADYAYYL 112 (235) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 99999999999987899999999999998689931379999999999997643999999999999988774349999999 Q ss_pred HHHHHHH------HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 9865455------2158869999999988842126989999999999952574569999999995023103578999999 Q gi|254781033|r 158 YSLYFES------CRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIA 231 (492) Q Consensus 158 ~gL~~~a------l~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~ 231 (492) .|+.... ..++....|+..++..+...|++.-+-.+...+.. +...+ +.-.+. T Consensus 113 ~a~s~~~~~~~~~rDq~~~~~A~~~f~~~i~~yP~S~ya~dA~~~i~~-----------l~~~L----------A~~e~~ 171 (235) T TIGR03302 113 RGLSNYKQIDDSDRDQTAAREAFEAFQELIRRYPNSEYAPDAKKRMDY-----------LRNRL----------AGKELY 171 (235) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH-----------HHHHH----------HHHHHH T ss_conf 879988736654458089999999999999888996579999999999-----------99999----------999999 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 9999996699899999998754313352---79999999999973994789999999998328 Q gi|254781033|r 232 RSLENADKGDMIASYHDAIESLKLCDNS---IMASICAAKSLISQNKKRKAEVILEKIWKVNP 291 (492) Q Consensus 232 ~a~~~~~~~~~~~A~~~~~~a~~~~p~~---~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p 291 (492) .+.-+...+....|...+...++.-|+. ..+...++..+.+.|..++|.+..+......| T Consensus 172 ia~~Y~k~~~y~aAi~r~~~vl~~Yp~t~~~~eAl~~l~eay~~lg~~~~A~~~~~~l~~n~p 234 (235) T TIGR03302 172 VARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLALGLTDLAQDAAAVLGANYP 234 (235) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC T ss_conf 999999841709999999999986899851399999999999986998999999999975398 No 93 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=98.24 E-value=2.3e-05 Score=59.90 Aligned_cols=122 Identities=16% Similarity=0.186 Sum_probs=92.5 Q ss_pred CCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC---CHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 61589999987532222100016899999972688---505799986545521588699999999888421269899999 Q gi|254781033|r 117 HNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPA---TREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEA 193 (492) Q Consensus 117 ~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~---~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~ 193 (492) .....++...+.+|+..|++.+|.+.|.++++..+ .+...+..+.......|+++.|+.++.++++++|+.+.++.. T Consensus 28 ~k~af~yyr~GmsaQs~GeyaEALe~Y~eAL~Le~D~~Drs~iLYNIGlIy~~nGe~~kALeyY~qALeiNP~~pqA~NN 107 (165) T CHL00033 28 EKEAFTYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNN 107 (165) T ss_pred HHHHHHHHHCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 28899999815101013538999999998623456851266899899999998588889999999999859996899978 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH Q ss_conf 999999525745699999999950231035789999999999996699899999998754313352799 Q gi|254781033|r 194 VVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMA 262 (492) Q Consensus 194 L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a 262 (492) ++-+|..+|. .|.+ .+.. ...+.-.+.|-+.|++|+++.|+.-.- T Consensus 108 lGvIyh~qG~--~a~~-------~~~~---------------~~~~~~fd~aa~~w~~ai~lap~ny~e 152 (165) T CHL00033 108 MAVICHYRGE--QAIE-------QGDS---------------EIAEAWFDQAAEYWKQAIALAPGNYIE 152 (165) T ss_pred HHHHHHHHHH--HHHH-------CCCH---------------HHHHHHHHHHHHHHHHHHCCCCCCCHH T ss_conf 9999999879--9998-------4586---------------789999999999999987038997099 No 94 >PRK11189 lipoprotein NlpI; Provisional Probab=98.24 E-value=0.00085 Score=47.65 Aligned_cols=219 Identities=13% Similarity=-0.009 Sum_probs=154.1 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 99999999999999997210047879999863013466615899999875322221000168999999726885057999 Q gi|254781033|r 79 RNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVY 158 (492) Q Consensus 79 rr~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~ 158 (492) ....++.-...+|++.=.-|=...|+-.+.++.+.++ +.+...+.+. .-..+.|+++.|-+.|...++-+|+...+.+ T Consensus 60 t~e~rA~l~yeRGvlyDSlGLr~LAR~DF~qAL~l~P-~~adayN~LG-iy~tq~~~fD~AyEAFDs~LEL~P~y~YA~l 137 (297) T PRK11189 60 TDEERAQLHYERGVLYDSLGLRALARNDFSQALALRP-DMADAYNYLG-IYLTQEGEFDAAYEAFDSVLELDPTYDYAYL 137 (297) T ss_pred CHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCC-CCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf 9899999999831477641209999864998975399-8589999999-9999971199999998777726987348998 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 86545521588699999999888421269899999999999525745699999999950231035789999999999996 Q gi|254781033|r 159 SLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 159 gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) ......---|.++-|..-+.+..+.+|++|.-..-|.-+..+ -|-.+|...+.+..+...... ........++. T Consensus 138 NRGIAlYYg~R~~LA~~D~~~fy~~dp~DpYR~LWLYl~e~~-~d~~~A~~~L~~r~~~~~~~~-----W~~~iV~~ylg 211 (297) T PRK11189 138 NRGIALYYGGRYDLAQDDLLAFYQDDPNDPYRALWLYLIEQK-LDPKQAKENLKQRYEKLDKEQ-----WGWNIVDFYLG 211 (297) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCC-CCHHHHHHHHHHHHHHCCCHH-----HHHHHHHHHHC T ss_conf 302788744637889999999996099982799999999811-699999999999998357122-----68999999945 Q ss_pred CCCHHHHHHHHHHHHHCCCCCH----HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCC Q ss_conf 6998999999987543133527----999999999997399478999999999832898999999985078 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSI----MASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSE 305 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~----~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~ 305 (492) .-...+..+.+.+..+...... ++..-+|+.+..+|+..+|...++-++..+.|.=+=+-|+.++.. T Consensus 212 ~ise~~~~~~~~~~~~~n~~lAErLcEaYFYLaK~~~~~G~~~~A~~~fKLAla~NVyeFVEhRYA~lEL~ 282 (297) T PRK11189 212 KISEEQLFERLKAGATDNTELAERLCETYFYLAKYYLSLGDLDEAANLFKLALANNVYNFVEHRYALLELA 282 (297) T ss_pred CCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 89999999999872737099999999999999999998799899999999987048258889999999999 No 95 >KOG1127 consensus Probab=98.22 E-value=0.00092 Score=47.37 Aligned_cols=219 Identities=11% Similarity=0.030 Sum_probs=134.8 Q ss_pred HHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 21000168999999726885057999865455215886999999998884212698999999999995257456999999 Q gi|254781033|r 133 ERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLN 212 (492) Q Consensus 133 ~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~ 212 (492) ..+...+...|-+.++.+++-..+.-.|...+...-|...|.+.+++|+++++....+.....+.|....+|+.|....- T Consensus 471 rK~~~~al~ali~alrld~~~apaf~~LG~iYrd~~Dm~RA~kCf~KAFeLDatdaeaaaa~adtyae~~~we~a~~I~l 550 (1238) T KOG1127 471 RKNSALALHALIRALRLDVSLAPAFAFLGQIYRDSDDMKRAKKCFDKAFELDATDAEAAAASADTYAEESTWEEAFEICL 550 (1238) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 41078899999999854642238999988999878989999999988863791234467888887631541889999998 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC Q ss_conf 99950231035789999999999996699899999998754313352799999999999739947899999999983289 Q gi|254781033|r 213 QKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH 292 (492) Q Consensus 213 ~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~ 292 (492) +.-...+. +.+-.-+..++..+++.++...+...+..+++.+|.....+..++..|.+.|.+.-|.+++.++-..+|. T Consensus 551 ~~~qka~a--~~~k~nW~~rG~yyLea~n~h~aV~~fQsALR~dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~ 628 (1238) T KOG1127 551 RAAQKAPA--FACKENWVQRGPYYLEAHNLHGAVCEFQSALRTDPKDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPL 628 (1238) T ss_pred HHHHHCHH--HHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHCCCH T ss_conf 87421547--7877523540434368521555899999986389065788998778877437620077764245744827 Q ss_pred HHHHHHH---HHHCCCCHHHHHHHHHHHHH-------HCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 8999999---98507895899999999998-------2689836899999999987998899999887866 Q gi|254781033|r 293 PEIANIY---THLLSENTVGKLKRALRLEE-------INKESVESLVIVSKIALEMGSIDQAHAKAMLAMK 353 (492) Q Consensus 293 p~La~a~---~~~~~~~~~~al~~~~~l~~-------~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~ 353 (492) .-.+.-+ .+...|.+..++..++..+. ..-+-.|+++..+..+...|-..+|...+++.++ T Consensus 629 s~y~~fk~A~~ecd~GkYkeald~l~~ii~~~s~e~~~q~gLaE~~ir~akd~~~~gf~~kavd~~eksie 699 (1238) T KOG1127 629 SKYGRFKEAVMECDNGKYKEALDALGLIIYAFSLERTGQNGLAESVIRDAKDSAITGFQKKAVDFFEKSIE 699 (1238) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 68999999999987666899999999999998877776311899999998999988886400689999899 No 96 >KOG0553 consensus Probab=98.19 E-value=3.7e-05 Score=58.24 Aligned_cols=108 Identities=13% Similarity=0.088 Sum_probs=61.1 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHH Q ss_conf 99999669989999999875431335279999999999973994789999999998328989999999850789589999 Q gi|254781033|r 233 SLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLK 312 (492) Q Consensus 233 a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~ 312 (492) +....+.+++.+|...|.+|+.+.|.+.-.....|..|.+.|.+..|. + T Consensus 88 GN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyycNRAAAy~~Lg~~~~AV-------------------------------k 136 (304) T KOG0553 88 GNKLMKNKDYQEAVDKYTEAIELDPTNAVYYCNRAAAYSKLGEYEDAV-------------------------------K 136 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHH-------------------------------H T ss_conf 899998625999999999998458876267778999999860117899-------------------------------9 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 99999982689836899999999987998899999887866099989999999999985 Q gi|254781033|r 313 RALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQAN 371 (492) Q Consensus 313 ~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e 371 (492) -+++.+..+|.+..++..||.+++..|++.+|.+.++++++++|+..+|---.++.+.. T Consensus 137 Dce~Al~iDp~yskay~RLG~A~~~~gk~~~A~~aykKaLeldP~Ne~~K~nL~~Ae~~ 195 (304) T KOG0553 137 DCESALSIDPHYSKAYGRLGLAYLALGKYEEAIEAYKKALELDPDNESYKSNLKIAEQK 195 (304) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH T ss_conf 99997762908999999987999806737999999976431688848999889999987 No 97 >COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones] Probab=98.14 E-value=0.00082 Score=47.75 Aligned_cols=154 Identities=13% Similarity=0.094 Sum_probs=82.1 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHH--HHHHH-CCCC Q ss_conf 99999999669989999999875431335279999999999973994789999999998328989999--99985-0789 Q gi|254781033|r 230 IARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIAN--IYTHL-LSEN 306 (492) Q Consensus 230 ~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~--a~~~~-~~~~ 306 (492) +..+....+.++..++...+..+....|.+..+...+++.|+..|+.+.|..+|...-....+.+... +...+ ..-. T Consensus 138 ~~~~~~~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~~~~~~~~l~a~i~ll~qaa 217 (304) T COG3118 138 LAEAKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHGLQAQIELLEQAA 217 (304) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 98766654165445589999999985845663777899999981885899999972850010457888999999999974 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 589999999999826898368999999999879988999998878660999---89999999999985799899999999 Q gi|254781033|r 307 TVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPR---KEIFLLLAQIEQANSHNTDKILYWTQ 383 (492) Q Consensus 307 ~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~---~~~~~llA~ie~~e~~d~~~ar~wl~ 383 (492) .......+++-+..+|+|.+..+.+|..+...|+.++|-++|-..++.+-+ ..+-.-+.++-+.-+..+..+..|-+ T Consensus 218 ~~~~~~~l~~~~aadPdd~~aa~~lA~~~~~~g~~e~Ale~Ll~~l~~d~~~~d~~~Rk~lle~f~~~g~~Dp~~~~~RR 297 (304) T COG3118 218 ATPEIQDLQRRLAADPDDVEAALALADQLHLVGRNEAALEHLLALLRRDRGFEDGEARKTLLELFEAFGPADPLVLAYRR 297 (304) T ss_pred CCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 59977899999870999889999999999973887999999999987503443768999999999963889878999999 No 98 >KOG3081 consensus Probab=98.14 E-value=0.0013 Score=46.17 Aligned_cols=164 Identities=13% Similarity=0.108 Sum_probs=118.0 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH---HHHHHHHHHCC- Q ss_conf 99999999966998999999987543133527999999999997399478999999999832898---99999998507- Q gi|254781033|r 229 LIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP---EIANIYTHLLS- 304 (492) Q Consensus 229 ~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p---~La~a~~~~~~- 304 (492) .+..+..+...++.++|.+.... -....+..+-..++++.++.+-|++.+++.....-+. .|+.+|..+.. T Consensus 111 ~l~aa~i~~~~~~~deAl~~~~~-----~~~lE~~Al~VqI~lk~~r~d~A~~~lk~mq~ided~tLtQLA~awv~la~g 185 (299) T KOG3081 111 LLLAAIIYMHDGDFDEALKALHL-----GENLEAAALNVQILLKMHRFDLAEKELKKMQQIDEDATLTQLAQAWVKLATG 185 (299) T ss_pred HHHHHHHHHCCCCHHHHHHHHHC-----CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCC T ss_conf 99766875427886899999852-----1248899987799998789999999999997113578999999999999606 Q ss_pred -CCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH-HHHHHHHCCCHHHHHHHH Q ss_conf -895899999999998268983689999999998799889999988786609998999999-999998579989999999 Q gi|254781033|r 305 -ENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLL-AQIEQANSHNTDKILYWT 382 (492) Q Consensus 305 -~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ll-A~ie~~e~~d~~~ar~wl 382 (492) +...++.-.|+.+..+.|-.+......|..++..|+|++|...++.++..++.....+.. --.....+.|.......+ T Consensus 186 gek~qdAfyifeE~s~k~~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL~Nliv~a~~~Gkd~~~~~r~l 265 (299) T KOG3081 186 GEKIQDAFYIFEELSEKTPPTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPETLANLIVLALHLGKDAEVTERNL 265 (299) T ss_pred CHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 32454689999998353599768870699998971699999999999985347878999999999998189917788899 Q ss_pred HHHHCCCCCCCEECC Q ss_conf 999628899952756 Q gi|254781033|r 383 QSALHAMPDPLWISD 397 (492) Q Consensus 383 ~rA~~a~~dp~W~c~ 397 (492) ..-....|.-.|+-. T Consensus 266 ~QLk~~~p~h~~vk~ 280 (299) T KOG3081 266 SQLKLSHPEHPFVKH 280 (299) T ss_pred HHHHHCCCCCHHHHH T ss_conf 998740876537787 No 99 >KOG1128 consensus Probab=98.10 E-value=0.00082 Score=47.75 Aligned_cols=128 Identities=16% Similarity=0.104 Sum_probs=88.1 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCHHHHH---------------------------HH-H--HHCCCCHHHHHHHHHH Q ss_conf 9999973994789999999998328989999---------------------------99-9--8507895899999999 Q gi|254781033|r 267 AKSLISQNKKRKAEVILEKIWKVNPHPEIAN---------------------------IY-T--HLLSENTVGKLKRALR 316 (492) Q Consensus 267 a~~l~~~g~~~~A~~~le~a~~~~p~p~La~---------------------------a~-~--~~~~~~~~~al~~~~~ 316 (492) ...|...|+..+|+.++.+-.+..|+|.+.- .| + .....+..+..+.++. T Consensus 431 i~CY~~lg~~~kaeei~~q~lek~~d~~lyc~LGDv~~d~s~yEkawElsn~~sarA~r~~~~~~~~~~~fs~~~~hle~ 510 (777) T KOG1128 431 ILCYLLLGQHGKAEEINRQELEKDPDPRLYCLLGDVLHDPSLYEKAWELSNYISARAQRSLALLILSNKDFSEADKHLER 510 (777) T ss_pred HHHHHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHH T ss_conf 99999842300478999998269985126997531203848999999986565577877525433352128999999998 Q ss_pred HHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEE Q ss_conf 9982689836899999999987998899999887866099989-999999999985799899999999996288999527 Q gi|254781033|r 317 LEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE-IFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWI 395 (492) Q Consensus 317 l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~-~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~ 395 (492) -+..+|-..++++.+|.++++.++|..|..++...+..+|+.. .+.-++...-. .++..+++.-+.+|++...++ |. T Consensus 511 sl~~nplq~~~wf~~G~~ALqlek~q~av~aF~rcvtL~Pd~~eaWnNls~ayi~-~~~k~ra~~~l~EAlKcn~~~-w~ 588 (777) T KOG1128 511 SLEINPLQLGTWFGLGCAALQLEKEQAAVKAFHRCVTLEPDNAEAWNNLSTAYIR-LKKKKRAFRKLKEALKCNYQH-WQ 588 (777) T ss_pred HHHCCCCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCC-CE T ss_conf 7650663255787414899987636789999998962488820555236689999-751488899999986147788-73 Q ss_pred C Q ss_conf 5 Q gi|254781033|r 396 S 396 (492) Q Consensus 396 c 396 (492) - T Consensus 589 i 589 (777) T KOG1128 589 I 589 (777) T ss_pred E T ss_conf 2 No 100 >cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure. Probab=98.10 E-value=0.0016 Score=45.57 Aligned_cols=259 Identities=11% Similarity=-0.029 Sum_probs=144.1 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHH---HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99875322221000168999999726885057999---865455215886999999998884212698999999999995 Q gi|254781033|r 124 LLEVQIALAERQYNIAHEKLEMMLQIPATREFAVY---SLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVL 200 (492) Q Consensus 124 Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~---gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~ 200 (492) .+.+-.+...||...|.+.++..+..-|...++++ +.+..--..|-.+.....+.......|...+.+..+.-.+.+ T Consensus 47 hv~a~~~~~~g~~~~A~~~~~~~l~~~P~D~lalk~~~~~f~lg~~~g~~~~~~~~l~~~~p~~~~~~~~~gm~AF~leE 126 (355) T cd05804 47 HVEALSAWIAGDLPKALALLEQLLDDYPRDLLALKLHLGAFGLGDFSGMRDHVARVLPLWAPENPDYWYLLGMLAFGLEE 126 (355) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH T ss_conf 99999999879978999999999986983599999876741025413240569999872573489836999999999998 Q ss_pred CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC--C--HHHHHHHHHHHHHCCCH Q ss_conf 2574569999999995023103578999999999999669989999999875431335--2--79999999999973994 Q gi|254781033|r 201 AKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDN--S--IMASICAAKSLISQNKK 276 (492) Q Consensus 201 ~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~--~--~~a~~~~a~~l~~~g~~ 276 (492) .|++++|.+.-++++...+...- ...+.+-.+..+|+.++...........-.. + .-.+-..|-.+.+.|++ T Consensus 127 ~g~~~~Ae~~a~~al~~np~d~w----a~HAvAHv~~~~g~~~eG~~~l~~~~~~w~~~~~l~~H~wWH~AL~~Le~g~~ 202 (355) T cd05804 127 AGQYDRAEEAARRALELNPDDAW----AVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLERGDY 202 (355) T ss_pred CCCHHHHHHHHHHHHHCCCCCCH----HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCH T ss_conf 58899999999999963999937----99999999998388788999998517525778858989999999998877898 Q ss_pred HHHHHHHHHHHHHCCC--H--HHHHH----H-HHHCC-CCHHHHHHHHHHHHH-HCCCC--HHHHHHHHHHHHHCCCHHH Q ss_conf 7899999999983289--8--99999----9-98507-895899999999998-26898--3689999999998799889 Q gi|254781033|r 277 RKAEVILEKIWKVNPH--P--EIANI----Y-THLLS-ENTVGKLKRALRLEE-INKES--VESLVIVSKIALEMGSIDQ 343 (492) Q Consensus 277 ~~A~~~le~a~~~~p~--p--~La~a----~-~~~~~-~~~~~al~~~~~l~~-~~P~~--~e~~~~LA~~~~~~g~~~~ 343 (492) +++.+++.......+. + ++..+ | .++.+ .+..++.+.+..... ..|++ +-.-+-.+-++..+|+.+. T Consensus 203 ~~~l~iyD~~i~~~~~~~~~~d~~dAaSLL~RLel~G~~~v~~Rw~~la~~~~~~~~~~~~~F~D~H~~~ala~ag~~~~ 282 (355) T cd05804 203 EAALAIYDTHIAPSAESDPALDLLDAASLLWRLELAGHVDVGDRWEDLADYAAWHFPDHGLAFNDLHAALALAGAGDKDA 282 (355) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHH T ss_conf 89999988874456687069999839999999997399873557999999998745650331999999999984899999 Q ss_pred HHHHHHHHHHCC--------CCHHH-HHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 999988786609--------99899-99999999985799899999999996 Q gi|254781033|r 344 AHAKAMLAMKIA--------PRKEI-FLLLAQIEQANSHNTDKILYWTQSAL 386 (492) Q Consensus 344 Ar~~l~~al~~~--------P~~~~-~~llA~ie~~e~~d~~~ar~wl~rA~ 386 (492) +.+.++..-+.. +...+ .-+.-.|..-..||...+-+-|.... T Consensus 283 ~~~ll~~~~~~~~~~~~~~~~~~~vg~pl~~Al~a~~~gd~~~a~~lL~p~~ 334 (355) T cd05804 283 LDKLLAALKGRASSADDNKQPARDVGLPLAEALYAFAEGNYATALELLGPVR 334 (355) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 9999999998763777320056500489999999998589999999999989 No 101 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=98.06 E-value=0.0017 Score=45.22 Aligned_cols=199 Identities=18% Similarity=0.144 Sum_probs=94.3 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 899999875322221000168999999726885057---99986545521588699999999888421269899999999 Q gi|254781033|r 120 YLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREF---AVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQ 196 (492) Q Consensus 120 ~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~---~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~ 196 (492) +-.+.-.++.....||+.+|.++|+.+..+-|.... +..-++....+.++++.|+..+++-..+.|+|+.+-..+.. T Consensus 34 ~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~~~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~Yl 113 (254) T COG4105 34 ASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADYAYYL 113 (254) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHH T ss_conf 99999999999863788999999999987099981128999999999881355899999999999878999871699999 Q ss_pred HHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCH Q ss_conf 99952574569999999995023103578999999999999669989999999875431335279999999999973994 Q gi|254781033|r 197 QYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKK 276 (492) Q Consensus 197 l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~ 276 (492) .-.. .+ ..-+...+ +......|...+.+.+..-|+..-+. T Consensus 114 kgLs---------~~----~~i~~~~r--------------Dq~~~~~A~~~f~~~i~ryPnS~Ya~------------- 153 (254) T COG4105 114 KGLS---------YF----FQIDDVTR--------------DQSAARAAFAAFKELVQRYPNSRYAP------------- 153 (254) T ss_pred HHHH---------HH----CCCCCCCC--------------CHHHHHHHHHHHHHHHHHCCCCCCHH------------- T ss_conf 9998---------75----26775444--------------77999999999999998788970235------------- Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 789999999998328989999999850789589999999999826898---36899999999987998899999887866 Q gi|254781033|r 277 RKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRLEEINKES---VESLVIVSKIALEMGSIDQAHAKAMLAMK 353 (492) Q Consensus 277 ~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~~l~~~~P~~---~e~~~~LA~~~~~~g~~~~Ar~~l~~al~ 353 (492) .|...+...-...--.++..+--..+.+.+..++.|++.+++..|+. .+++..+.+++...|..++|.....-+-. T Consensus 154 -dA~~~i~~~~d~LA~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~t~~~~eaL~~l~eaY~~lgl~~~a~~~~~vl~~ 232 (254) T COG4105 154 -DAKARIVKLNDALAGHEMAIARYYLKRGAYVAAINRFEEVLENYPDTSAVREALARLEEAYYALGLTDEAKKTAKVLGA 232 (254) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf -5999999999998899999999999905769999999999871455520477899999999982872789999999973 Q ss_pred CCCCHH Q ss_conf 099989 Q gi|254781033|r 354 IAPRKE 359 (492) Q Consensus 354 ~~P~~~ 359 (492) .-|+++ T Consensus 233 N~p~s~ 238 (254) T COG4105 233 NYPDSQ 238 (254) T ss_pred CCCCCC T ss_conf 689882 No 102 >KOG0553 consensus Probab=98.04 E-value=0.0008 Score=47.87 Aligned_cols=98 Identities=13% Similarity=0.123 Sum_probs=59.3 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 99999995257456999999999502310357899999999999966998999999987543133527999999999997 Q gi|254781033|r 193 AVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLIS 272 (492) Q Consensus 193 ~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~ 272 (492) .-+.-....++|++|+..+.++++..|. .++++..++..+...+....|.+.+..++..+|.+..++..++..+.. T Consensus 86 ~eGN~~m~~~~Y~eAv~kY~~AI~l~P~----nAVyycNRAAAy~~Lg~~~~AVkDce~Al~iDp~yskay~RLG~A~~~ 161 (304) T KOG0553 86 NEGNKLMKNKDYQEAVDKYTEAIELDPT----NAVYYCNRAAAYSKLGEYEDAVKDCESALSIDPHYSKAYGRLGLAYLA 161 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCC----CCHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC T ss_conf 9889999862599999999999845887----626777899999986011789999999776290899999998799980 Q ss_pred CCCHHHHHHHHHHHHHHCCCHH Q ss_conf 3994789999999998328989 Q gi|254781033|r 273 QNKKRKAEVILEKIWKVNPHPE 294 (492) Q Consensus 273 ~g~~~~A~~~le~a~~~~p~p~ 294 (492) +|++.+|+..+++++...||.+ T Consensus 162 ~gk~~~A~~aykKaLeldP~Ne 183 (304) T KOG0553 162 LGKYEEAIEAYKKALELDPDNE 183 (304) T ss_pred CCCHHHHHHHHHHHHCCCCCCH T ss_conf 6737999999976431688848 No 103 >KOG4340 consensus Probab=98.04 E-value=0.00077 Score=47.99 Aligned_cols=117 Identities=14% Similarity=0.085 Sum_probs=79.9 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCCCHH-HHHHHHHH--HHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCH Q ss_conf 972100478799998630134666158-99999875--322221000168999999726885057999865455215886 Q gi|254781033|r 94 SIAAHNIPLARKMHSYVSQQHTFHNEY-LVYLLEVQ--IALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDL 170 (492) Q Consensus 94 al~~Gd~~~A~k~~~~a~~~~~~~~~~-L~~Ll~A~--aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~ 170 (492) -..+|-+..|.+...... |++. +...+.-| .--.++|...++-..+++... ....++..+....-+.|++ T Consensus 88 LY~A~i~ADALrV~~~~~-----D~~~L~~~~lqLqaAIkYse~Dl~g~rsLveQlp~e--n~Ad~~in~gCllykegqy 160 (459) T KOG4340 88 LYKACIYADALRVAFLLL-----DNPALHSRVLQLQAAIKYSEGDLPGSRSLVEQLPSE--NEADGQINLGCLLYKEGQY 160 (459) T ss_pred HHHCCCHHHHHHHHHHHC-----CCHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCCC--CCHHHHHCCHHHEECCCCH T ss_conf 997035288999999956-----849899999999999860545553549999857788--8502330301200016548 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC Q ss_conf 99999999888421269899999999999525745699999999950 Q gi|254781033|r 171 NSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKN 217 (492) Q Consensus 171 ~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~ 217 (492) ++|.+-++.+++...-++-+...++..+.+.++++.|++.....+.. T Consensus 161 EaAvqkFqaAlqvsGyqpllAYniALaHy~~~qyasALk~iSEIieR 207 (459) T KOG4340 161 EAAVQKFQAALQVSGYQPLLAYNLALAHYSSRQYASALKHISEIIER 207 (459) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999987305767278999999987312077899999999975 No 104 >KOG1128 consensus Probab=98.01 E-value=0.0022 Score=44.35 Aligned_cols=48 Identities=19% Similarity=0.077 Sum_probs=16.1 Q ss_pred CHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 989999999875431335279999999999973994789999999998 Q gi|254781033|r 241 DMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWK 288 (492) Q Consensus 241 ~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~ 288 (492) +...+.+.+.....+.|+...++..++..+++.++..+|-..+..+.+ T Consensus 534 k~q~av~aF~rcvtL~Pd~~eaWnNls~ayi~~~~k~ra~~~l~EAlK 581 (777) T KOG1128 534 KEQAAVKAFHRCVTLEPDNAEAWNNLSTAYIRLKKKKRAFRKLKEALK 581 (777) T ss_pred HHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 367899999989624888205552366899997514888999999861 No 105 >PRK10803 hypothetical protein; Provisional Probab=97.94 E-value=0.0029 Score=43.49 Aligned_cols=97 Identities=12% Similarity=0.036 Sum_probs=70.7 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCHHHH------HHHHHHCCCCHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHH Q ss_conf 999997399478999999999832898999------9999850789589999999999826898---3689999999998 Q gi|254781033|r 267 AKSLISQNKKRKAEVILEKIWKVNPHPEIA------NIYTHLLSENTVGKLKRALRLEEINKES---VESLVIVSKIALE 337 (492) Q Consensus 267 a~~l~~~g~~~~A~~~le~a~~~~p~p~La------~a~~~~~~~~~~~al~~~~~l~~~~P~~---~e~~~~LA~~~~~ 337 (492) ..+..+.+++++|..-++...+..|+..+. ..-.+...+++.+++..|..+++..|+| +++++-+|.++.. T Consensus 149 ~~Lvl~~k~y~~Ai~aF~~Fi~~yP~s~y~~nA~YWLGe~~y~~~~~~~A~~~F~~vv~~yP~s~K~pdAllklg~~~~~ 228 (262) T PRK10803 149 IALVQDKSRQDDAIVAFQNFIKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKVGVIMQD 228 (262) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH T ss_conf 99999735699999999999987879987743099999999974378999999999998677998678999999999998 Q ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 79988999998878660999899999 Q gi|254781033|r 338 MGSIDQAHAKAMLAMKIAPRKEIFLL 363 (492) Q Consensus 338 ~g~~~~Ar~~l~~al~~~P~~~~~~l 363 (492) .|+.++|+..++++++..|++....+ T Consensus 229 lg~~~~Ak~~l~~vi~~YP~S~aA~l 254 (262) T PRK10803 229 KGDTAKAKAVYQQVIKKYPGTDGAKQ 254 (262) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 59999999999999998859989999 No 106 >KOG2053 consensus Probab=97.89 E-value=0.0035 Score=42.84 Aligned_cols=41 Identities=7% Similarity=0.000 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99999999998268983689999999998799889999988 Q gi|254781033|r 309 GKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAM 349 (492) Q Consensus 309 ~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~ 349 (492) +++-.+|..+..+|.|+...+.+-++|.-.|-++.|.+.+. T Consensus 457 eaI~LLE~glt~s~hnf~~KLlLiriY~~lGa~p~a~~~y~ 497 (932) T KOG2053 457 EAITLLENGLTKSPHNFQTKLLLIRIYSYLGAFPDAYELYK 497 (932) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99999997761399628798899999999447726999998 No 107 >pfam09295 ChAPs ChAPs (Chs5p-Arf1p-binding proteins). ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialized cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they form also interactions with each other. Probab=97.89 E-value=0.00066 Score=48.52 Aligned_cols=103 Identities=12% Similarity=0.069 Sum_probs=43.5 Q ss_pred CHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH-HHHHHHHH--HCCCCHHHHHHHHHHH Q ss_conf 98999999987543133527999999999997399478999999999832898-99999998--5078958999999999 Q gi|254781033|r 241 DMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP-EIANIYTH--LLSENTVGKLKRALRL 317 (492) Q Consensus 241 ~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p-~La~a~~~--~~~~~~~~al~~~~~l 317 (492) ..+.++..+.+.....| ...+.+++++...++.-+|.+++.++++.+|+. +|...-+. +..+++.-+++.+++. T Consensus 184 ~~~~~~~~le~L~~~~p---~v~~llakv~~~~~~E~~av~~i~~~L~~~p~d~~LL~~Qa~FLl~K~~yelAL~~A~~a 260 (395) T pfam09295 184 RYDRALNLLEKLREEEP---EVVSLLARVLLLMDEEIDAVKLMHDALQENPMDYELLCLQAEFLLSKKKDELALECAKRA 260 (395) T ss_pred CHHHHHHHHHHHHHCCC---HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 38989999999875184---499999999996560899999999998618876789999999998179889999999998 Q ss_pred HHHCCCCHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 98268983689999999998799889999 Q gi|254781033|r 318 EEINKESVESLVIVSKIALEMGSIDQAHA 346 (492) Q Consensus 318 ~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~ 346 (492) ....|.+.+++..|++.|+..|+|+.|.- T Consensus 261 v~~~P~~F~~W~~La~~Yi~l~d~e~ALl 289 (395) T pfam09295 261 VNSAPSEFKTWALLVKVYIKLEDWENALL 289 (395) T ss_pred HHCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 73178661889999999998632999999 No 108 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=97.85 E-value=0.0033 Score=43.04 Aligned_cols=143 Identities=20% Similarity=0.193 Sum_probs=87.8 Q ss_pred HHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH Q ss_conf 9726885057999865455215886999999998884-212698999999999995257456999999999502310357 Q gi|254781033|r 146 MLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALD-ISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRN 224 (492) Q Consensus 146 ~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~-~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~ 224 (492) ..++.||...- ..|+......|++.+|..++++++. +.-+.+..+..+.+.+...++...+..+++......+..+.. T Consensus 82 ~~~~ApTvqnr-~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~p 160 (251) T COG4700 82 ELAIAPTVQNR-YRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRSP 160 (251) T ss_pred HHHHCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCC T ss_conf 87646137889-99999999841056668899998546567988999888988886165888888999986059765899 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHH----HHHHHHCCC Q ss_conf 89999999999996699899999998754313352799999999999739947899999----999983289 Q gi|254781033|r 225 RAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVIL----EKIWKVNPH 292 (492) Q Consensus 225 ~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~l----e~a~~~~p~ 292 (492) +..+++++. +...|.+..|...+..+....|+.. +...|+..+.++|+.+++...+ ....+..|| T Consensus 161 d~~Ll~aR~--laa~g~~a~Aesafe~a~~~ypg~~-ar~~Y~e~La~qgr~~ea~aq~~~v~d~~~r~~~H 229 (251) T COG4700 161 DGHLLFART--LAAQGKYADAESAFEVAISYYPGPQ-ARIYYAEMLAKQGRLREANAQYVAVVDTAKRSRPH 229 (251) T ss_pred CCHHHHHHH--HHHCCCCHHHHHHHHHHHHHCCCHH-HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHH T ss_conf 725899999--9845880558999999998588777-89899999997122167888799999999760466 No 109 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=97.83 E-value=0.0036 Score=42.71 Aligned_cols=100 Identities=10% Similarity=-0.003 Sum_probs=67.5 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHH Q ss_conf 999875322221000168999999726885-05799986545521588699999999888421269--899999999999 Q gi|254781033|r 123 YLLEVQIALAERQYNIAHEKLEMMLQIPAT-REFAVYSLYFESCRIGDLNSAQRYATKALDISPDA--PWVTEAVVQQYV 199 (492) Q Consensus 123 ~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~-~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~--~~a~~~L~~l~~ 199 (492) .+-.+.+....|++.+|...|++.+.-.-. ...-+.|++......+++..|...+++..+-+|.. +.....+++.+. T Consensus 92 r~rLa~al~elGr~~EA~~hy~qalsG~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~pd~~Ll~aR~la 171 (251) T COG4700 92 RYRLANALAELGRYHEAVPHYQQALSGIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRSPDGHLLFARTLA 171 (251) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHH T ss_conf 99999999984105666889999854656798899988898888616588888899998605976589972589999998 Q ss_pred HCCCCHHHHHHHHHHHHCCCCHH Q ss_conf 52574569999999995023103 Q gi|254781033|r 200 LAKEWSRAITFLNQKKKNAKEWN 222 (492) Q Consensus 200 ~~gdw~~A~~~l~~~~~~~~~~~ 222 (492) .+|.+.+|...++.++...+... T Consensus 172 a~g~~a~Aesafe~a~~~ypg~~ 194 (251) T COG4700 172 AQGKYADAESAFEVAISYYPGPQ 194 (251) T ss_pred HCCCCHHHHHHHHHHHHHCCCHH T ss_conf 45880558999999998588777 No 110 >KOG1915 consensus Probab=97.82 E-value=0.0045 Score=42.02 Aligned_cols=291 Identities=12% Similarity=0.088 Sum_probs=141.1 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHCCCCHHH Q ss_conf 97210047879999863013466615899999875322221000168999999726885-05799986545521588699 Q gi|254781033|r 94 SIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPAT-REFAVYSLYFESCRIGDLNS 172 (492) Q Consensus 94 al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~-~~~~~~gL~~~al~~gd~~~ 172 (492) -...+|...|+....++....+...+.- |--+..-+.......|+..+..++..-|- ..+-+..+. .-...|+... T Consensus 83 Eesq~e~~RARSv~ERALdvd~r~itLW--lkYae~Emknk~vNhARNv~dRAvt~lPRVdqlWyKY~y-mEE~LgNi~g 159 (677) T KOG1915 83 EESQKEIQRARSVFERALDVDYRNITLW--LKYAEFEMKNKQVNHARNVWDRAVTILPRVDQLWYKYIY-MEEMLGNIAG 159 (677) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH-HHHHHCCCHH T ss_conf 7767899999999999870342233599--999999986405767887898888853308999999999-9999654187 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHH-----------------H------------ Q ss_conf 99999988842126989999999999952574569999999995023103-----------------5------------ Q gi|254781033|r 173 AQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWN-----------------R------------ 223 (492) Q Consensus 173 A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~-----------------~------------ 223 (492) |++.+++...-.|+. .++........+.++|+.|+..+++-+-..|... . T Consensus 160 aRqiferW~~w~P~e-qaW~sfI~fElRykeieraR~IYerfV~~HP~v~~wikyarFE~k~g~~~~aR~VyerAie~~~ 238 (677) T KOG1915 160 ARQIFERWMEWEPDE-QAWLSFIKFELRYKEIERARSIYERFVLVHPKVSNWIKYARFEEKHGNVALARSVYERAIEFLG 238 (677) T ss_pred HHHHHHHHHCCCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHHEECCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 999999997079727-8999999999986578778899988850360289899999998752718889999999999861 Q ss_pred ----------------------HHHHHHHHHHHHHH-----------------HCCCHHHH--------HHHHHHHHHCC Q ss_conf ----------------------78999999999999-----------------66998999--------99998754313 Q gi|254781033|r 224 ----------------------NRAILLIARSLENA-----------------DKGDMIAS--------YHDAIESLKLC 256 (492) Q Consensus 224 ----------------------~~a~l~~~~a~~~~-----------------~~~~~~~A--------~~~~~~a~~~~ 256 (492) .++...+..+.... +-||...- +--+.+-++.+ T Consensus 239 ~d~~~e~lfvaFA~fEe~qkE~ERar~iykyAld~~pk~raeeL~k~~~~fEKqfGd~~gIEd~Iv~KRk~qYE~~v~~n 318 (677) T KOG1915 239 DDEEAEILFVAFAEFEERQKEYERARFIYKYALDHIPKGRAEELYKKYTAFEKQFGDKEGIEDAIVGKRKFQYEKEVSKN 318 (677) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 37899999999999999888999999999999874686509999999999999716511168887665653799998729 Q ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH----------HHH---HHHHHCCCCHHHHHHHHHHHHHHCCC Q ss_conf 35279999999999973994789999999998328989----------999---99985078958999999999982689 Q gi|254781033|r 257 DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE----------IAN---IYTHLLSENTVGKLKRALRLEEINKE 323 (492) Q Consensus 257 p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~----------La~---a~~~~~~~~~~~al~~~~~l~~~~P~ 323 (492) |-+-.++..|.++....|+...-..+++++....|-.. |.. .|.++...+....-+-++..+++-|+ T Consensus 319 p~nYDsWfdylrL~e~~g~~~~Ire~yErAIanvpp~~ekr~W~RYIYLWinYalyeEle~ed~ertr~vyq~~l~lIPH 398 (677) T KOG1915 319 PYNYDSWFDYLRLEESVGDKDRIRETYERAIANVPPASEKRYWRRYIYLWINYALYEELEAEDVERTRQVYQACLDLIPH 398 (677) T ss_pred CCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 87706899999999722778899999999870599446778899999999999999999831278899999999863674 Q ss_pred C----HHHHHHHHHHHHHC---------------------------------CCHHHHHHHHHHHHHCCCCH-HHHHHHH Q ss_conf 8----36899999999987---------------------------------99889999988786609998-9999999 Q gi|254781033|r 324 S----VESLVIVSKIALEM---------------------------------GSIDQAHAKAMLAMKIAPRK-EIFLLLA 365 (492) Q Consensus 324 ~----~e~~~~LA~~~~~~---------------------------------g~~~~Ar~~l~~al~~~P~~-~~~~llA 365 (492) . .-.+++.|.--+.. ++|+..|...++-++-.|.+ ..+.-.| T Consensus 399 kkFtFaKiWlmyA~feIRq~~l~~ARkiLG~AIG~cPK~KlFk~YIelElqL~efDRcRkLYEkfle~~Pe~c~~W~kya 478 (677) T KOG1915 399 KKFTFAKIWLMYAQFEIRQLNLTGARKILGNAIGKCPKDKLFKGYIELELQLREFDRCRKLYEKFLEFSPENCYAWSKYA 478 (677) T ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 31129999999999999870327899998887434971667777878999876578999999999816927658999899 Q ss_pred HHHHHHCCCHHHHHHHHHHHHCCC Q ss_conf 999985799899999999996288 Q gi|254781033|r 366 QIEQANSHNTDKILYWTQSALHAM 389 (492) Q Consensus 366 ~ie~~e~~d~~~ar~wl~rA~~a~ 389 (492) ++|.. -||..++|.-..-|+.-| T Consensus 479 ElE~~-LgdtdRaRaifelAi~qp 501 (677) T KOG1915 479 ELETS-LGDTDRARAIFELAISQP 501 (677) T ss_pred HHHHH-HHHHHHHHHHHHHHHCCC T ss_conf 99987-400788889999983385 No 111 >KOG3785 consensus Probab=97.82 E-value=0.0046 Score=41.94 Aligned_cols=111 Identities=14% Similarity=0.073 Sum_probs=57.9 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHH---------------------HHHHCCCC------------ Q ss_conf 799999999999739947899999999983289899999---------------------99850789------------ Q gi|254781033|r 260 IMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANI---------------------YTHLLSEN------------ 306 (492) Q Consensus 260 ~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a---------------------~~~~~~~~------------ 306 (492) +++.+.++-.+.++|+..+|..+.+..-...|+.-+... +.++.+++ T Consensus 285 PEARlNL~iYyL~q~dVqeA~~L~Kdl~PttP~EyilKgvv~aalGQe~gSreHlKiAqqffqlVG~Sa~ecDTIpGRQs 364 (557) T KOG3785 285 PEARLNLIIYYLNQNDVQEAISLCKDLDPTTPYEYILKGVVFAALGQETGSREHLKIAQQFFQLVGESALECDTIPGRQS 364 (557) T ss_pred HHHHHHHEEEECCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCCCCHHH T ss_conf 27454112322145658999999860699985899999999998631207588999999999985365410245666678 Q ss_pred ----------HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC-CCHHHHH-HHHHHHHH Q ss_conf ----------5899999999998268983689999999998799889999988786609-9989999-99999998 Q gi|254781033|r 307 ----------TVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIA-PRKEIFL-LLAQIEQA 370 (492) Q Consensus 307 ----------~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~-P~~~~~~-llA~ie~~ 370 (492) ..+-+..+...-+..-++....+.+|++....|.+.+|.+.+-++-..+ -+.-.|. .+|+..-. T Consensus 365 mAs~fFL~~qFddVl~YlnSi~sYF~NdD~Fn~N~AQAk~atgny~eaEelf~~is~~~ikn~~~Y~s~LArCyi~ 440 (557) T KOG3785 365 MASYFFLSFQFDDVLTYLNSIESYFTNDDDFNLNLAQAKLATGNYVEAEELFIRISGPEIKNKILYKSMLARCYIR 440 (557) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHH T ss_conf 8999999998879999999898873376035347899998724747787787630682321258999999999871 No 112 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=97.80 E-value=0.0011 Score=46.87 Aligned_cols=98 Identities=16% Similarity=0.087 Sum_probs=56.1 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHH------HHHHHHCCCCHHHHHHHHHHHHHHCCCC---HHHHHHHHHH Q ss_conf 999999997399478999999999832898999------9999850789589999999999826898---3689999999 Q gi|254781033|r 264 ICAAKSLISQNKKRKAEVILEKIWKVNPHPEIA------NIYTHLLSENTVGKLKRALRLEEINKES---VESLVIVSKI 334 (492) Q Consensus 264 ~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La------~a~~~~~~~~~~~al~~~~~l~~~~P~~---~e~~~~LA~~ 334 (492) .-.+...++.|++..|+..+....+..|+..++ ..-+....++..++.+.|..+++..|++ |++++-||.+ T Consensus 6 y~aa~~~lk~gdY~~A~~~F~~F~~~YP~S~y~pnA~YWLGe~~y~~~~~~~A~~~f~~v~~~yp~s~KaPdALLKlg~~ 85 (119) T TIGR02795 6 YDAALLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKLGMS 85 (119) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 99999998538878999999999886879876710547999999997339999999999986458998606899999999 Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 998799889999988786609998999 Q gi|254781033|r 335 ALEMGSIDQAHAKAMLAMKIAPRKEIF 361 (492) Q Consensus 335 ~~~~g~~~~Ar~~l~~al~~~P~~~~~ 361 (492) ....|+.++|+..|+++++.-|++... T Consensus 86 ~~~~g~~~~A~~~l~~V~~~YP~s~aA 112 (119) T TIGR02795 86 LQELGDKEKAKATLQQVIKRYPGSSAA 112 (119) T ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHH T ss_conf 987289889999999998566998788 No 113 >KOG0543 consensus Probab=97.79 E-value=0.0015 Score=45.82 Aligned_cols=90 Identities=16% Similarity=0.151 Sum_probs=70.0 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHH-HHHHHHHHHHCCCHHH Q ss_conf 998507895899999999998268983689999999998799889999988786609998999-9999999985799899 Q gi|254781033|r 299 YTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIF-LLLAQIEQANSHNTDK 377 (492) Q Consensus 299 ~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~-~llA~ie~~e~~d~~~ 377 (492) ..++..+.+..+++...+.+...|+|.-+++-.|++++..++++.|+..++++++.+|++... .-+..+......-..+ T Consensus 265 ~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el~~l~~k~~~~~~k 344 (397) T KOG0543 265 ACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQALLALGEYDLARDDFQKALKLEPSNKAARAELIKLKQKIREYEEK 344 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 88874121889999999988458996267778788988642098999999999974888689999999999999999988 Q ss_pred HHHHHHHHHCC Q ss_conf 99999999628 Q gi|254781033|r 378 ILYWTQSALHA 388 (492) Q Consensus 378 ar~wl~rA~~a 388 (492) .+.-..+...- T Consensus 345 ekk~y~~mF~k 355 (397) T KOG0543 345 EKKMYANMFAK 355 (397) T ss_pred HHHHHHHHHHC T ss_conf 99999998500 No 114 >pfam09295 ChAPs ChAPs (Chs5p-Arf1p-binding proteins). ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialized cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they form also interactions with each other. Probab=97.79 E-value=0.0023 Score=44.23 Aligned_cols=121 Identities=13% Similarity=0.015 Sum_probs=86.7 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 86545521588699999999888421269899999999999525745699999999950231035789999999999996 Q gi|254781033|r 159 SLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 159 gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) +|.....-.+.++.++..++++....| .+...+..++...++--+|++++.+.++..+. .+.++..++.-++. T Consensus 174 ~Ll~~~~~t~~~~~~~~~le~L~~~~p---~v~~llakv~~~~~~E~~av~~i~~~L~~~p~----d~~LL~~Qa~FLl~ 246 (395) T pfam09295 174 GLLKYVRLTKRYDRALNLLEKLREEEP---EVVSLLARVLLLMDEEIDAVKLMHDALQENPM----DYELLCLQAEFLLS 246 (395) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCC---HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC----CHHHHHHHHHHHHH T ss_conf 999999424438989999999875184---49999999999656089999999999861887----67899999999981 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 699899999998754313352799999999999739947899999999 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKI 286 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a 286 (492) .++.+.|++.++++....|+....+..+++.|+..|+++.|.-.|... T Consensus 247 K~~yelAL~~A~~av~~~P~~F~~W~~La~~Yi~l~d~e~ALl~LNS~ 294 (395) T pfam09295 247 KKKDELALECAKRAVNSAPSEFKTWALLVKVYIKLEDWENALLTLNSC 294 (395) T ss_pred CCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 798899999999987317866188999999999863299999999568 No 115 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=97.77 E-value=0.0054 Score=41.38 Aligned_cols=177 Identities=11% Similarity=0.018 Sum_probs=91.1 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC-CCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHH-----HH Q ss_conf 9999999999721004787999986301346-66158999998753222210001689999997268850579-----99 Q gi|254781033|r 85 YKALYTGLMSIAAHNIPLARKMHSYVSQQHT-FHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFA-----VY 158 (492) Q Consensus 85 ~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~-~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~-----~~ 158 (492) ......|+-++..||+..|.+.+.+....-+ ++.-..+.|..+.+....|++..|...++..++.-|+.... ++ T Consensus 33 ~~~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~~A~qA~L~~ayayy~~~~y~~A~~~~~rFi~lyP~~~~v~Ya~Yl~ 112 (243) T PRK10866 33 NEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYAYYKNADLPLAQAAIDRFMRLNPTHPNIDYVMYMR 112 (243) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHH T ss_conf 99999999999878999999999999986899767899999999998730778999999999998678999738999998 Q ss_pred HHHHHH----------------HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHH Q ss_conf 865455----------------2158869999999988842126989999999999952574569999999995023103 Q gi|254781033|r 159 SLYFES----------------CRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWN 222 (492) Q Consensus 159 gL~~~a----------------l~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~ 222 (492) |+.... ...+....|+..+++.+...|++.-+-.+...+..-.. .+ T Consensus 113 gl~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~f~~li~~yP~S~Ya~dA~~r~~~lrn-----------~L------- 174 (243) T PRK10866 113 GLTNMALDDSALQGFFGVDRSDRDPQHARAAFSDFSKLVRGYPNSQYTTDATKRLVFLKD-----------RL------- 174 (243) T ss_pred HHHHHHCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH-----------HH------- T ss_conf 898774013588887432564307699999999999999878998458999999999999-----------99------- Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC---CHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 578999999999999669989999999875431335---279999999999973994789999 Q gi|254781033|r 223 RNRAILLIARSLENADKGDMIASYHDAIESLKLCDN---SIMASICAAKSLISQNKKRKAEVI 282 (492) Q Consensus 223 ~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~---~~~a~~~~a~~l~~~g~~~~A~~~ 282 (492) +.-.+..+.-+...+....|..-++..++.-|+ ...+...+...+...|...+|.+. T Consensus 175 ---A~~El~Va~fY~kr~~y~AAi~R~~~vi~~Yp~t~~~~eAL~~l~eaY~~Lgl~~~A~~~ 234 (243) T PRK10866 175 ---AKYEYSVAEYYTKRGAWVAVVNRVEGMLRDYPDTQATRDALPLMENAYRQMQLNAQADKV 234 (243) T ss_pred ---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf ---999999999999926569999999999976899963399999999999986997899999 No 116 >COG2909 MalT ATP-dependent transcriptional regulator [Transcription] Probab=97.71 E-value=0.0065 Score=40.75 Aligned_cols=91 Identities=16% Similarity=0.179 Sum_probs=45.0 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCC-----------HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHH-- Q ss_conf 99875322221000168999999726885-----------05799986545521588699999999888421269899-- Q gi|254781033|r 124 LLEVQIALAERQYNIAHEKLEMMLQIPAT-----------REFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWV-- 190 (492) Q Consensus 124 Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~-----------~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a-- 190 (492) ++.|-....+..+.++.....++...-+. .+.++++ ..++..||++.|.++++.++..-|.+.+. T Consensus 419 ll~aW~~~s~~r~~ea~~li~~l~~~l~~~~~~~~~~l~ae~~aL~a--~val~~~~~e~a~~lar~al~~L~~~~~~~r 496 (894) T COG2909 419 LLQAWLLASQHRLAEAETLIARLEHFLKAPMHSRQGDLLAEFQALRA--QVALNRGDPEEAEDLARLALVQLPEAAYRSR 496 (894) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH--HHHHHCCCHHHHHHHHHHHHHHCCCCCCHHH T ss_conf 99999999736917899999999987077766312448899999999--9998458778999999999986612254000 Q ss_pred ---HHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf ---99999999952574569999999995 Q gi|254781033|r 191 ---TEAVVQQYVLAKEWSRAITFLNQKKK 216 (492) Q Consensus 191 ---~~~L~~l~~~~gdw~~A~~~l~~~~~ 216 (492) ...++.+..-+|++.+|+.......+ T Consensus 497 ~~~~sv~~~a~~~~G~~~~Al~~~~~a~~ 525 (894) T COG2909 497 IVALSVLGEAAHIRGELTQALALMQQAEQ 525 (894) T ss_pred HHHHHHHHHHHHHHCHHHHHHHHHHHHHH T ss_conf 25654056888873439999998887999 No 117 >PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional Probab=97.71 E-value=0.0066 Score=40.72 Aligned_cols=115 Identities=12% Similarity=0.067 Sum_probs=43.0 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH-HHH--HHHHHHCCCCHHHHHHHHHH Q ss_conf 998999999987543133527999999999997399478999999999832898-999--99998507895899999999 Q gi|254781033|r 240 GDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP-EIA--NIYTHLLSENTVGKLKRALR 316 (492) Q Consensus 240 ~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p-~La--~a~~~~~~~~~~~al~~~~~ 316 (492) |+...+.....+++...|. .......+..+...|+...+.+-|..+....|+. ..- ..|.....++...+...+++ T Consensus 590 g~~~~Al~~~r~al~~ap~-a~~~~A~~~~~~Q~~d~~qa~~~LAeAv~lepd~~~y~A~lGy~la~~~D~Aq~~~~ler 668 (987) T PRK09782 590 GQPELALNDLTRSLNIAPS-ANAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQAALGYALWDSGDIAQSREMLER 668 (987) T ss_pred CCCHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHCCCCHHHHHHHHHH T ss_conf 8738788889999860800-466899999999736807899999999817999854115677786139989999999997 Q ss_pred HHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 998268983689999999998799889999988786609 Q gi|254781033|r 317 LEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIA 355 (492) Q Consensus 317 l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~ 355 (492) .....|+++...-.++-.+-+.++-..|+.++..+++.. T Consensus 669 Ah~~~p~d~~~~~~lAy~~~rl~D~a~a~~~~r~vID~~ 707 (987) T PRK09782 669 AHKDLPDDPALIRQLAYVNQRLDDMAATQHYARLVIDDI 707 (987) T ss_pred HHCCCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 521577552324788899876134699999999998654 No 118 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=97.65 E-value=0.0012 Score=46.57 Aligned_cols=102 Identities=14% Similarity=0.076 Sum_probs=55.1 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 455215886999999998884212698---99999999999525745699999999950231035789999999999996 Q gi|254781033|r 162 FESCRIGDLNSAQRYATKALDISPDAP---WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 162 ~~al~~gd~~~A~~~~~~a~~~~P~~~---~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) ...++.||++.|+..++.=++..|+++ .++.-|++++..+|+.++|...+....+..+.++.. ...++.++....+ T Consensus 10 ~~~lk~gdY~~A~~~F~~F~~~YP~S~y~pnA~YWLGe~~y~~~~~~~A~~~f~~v~~~yp~s~Ka-PdALLKlg~~~~~ 88 (119) T TIGR02795 10 LLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKA-PDALLKLGMSLQE 88 (119) T ss_pred HHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCC-HHHHHHHHHHHHH T ss_conf 999853887899999999988687987671054799999999733999999999998645899860-6899999999987 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 69989999999875431335279999 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSIMASI 264 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~~a~~ 264 (492) .|+.+.|...+.+..+.-|+...+.. T Consensus 89 ~g~~~~A~~~l~~V~~~YP~s~aA~~ 114 (119) T TIGR02795 89 LGDKEKAKATLQQVIKRYPGSSAAKL 114 (119) T ss_pred HCCHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 28988999999999856699878899 No 119 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=97.64 E-value=0.0042 Score=42.22 Aligned_cols=63 Identities=13% Similarity=0.111 Sum_probs=25.6 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 9999999996699899999998754313352---79999999999973994789999999998328 Q gi|254781033|r 229 LIARSLENADKGDMIASYHDAIESLKLCDNS---IMASICAAKSLISQNKKRKAEVILEKIWKVNP 291 (492) Q Consensus 229 ~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~---~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p 291 (492) ++..++.....|+..+|++.+.+|+++.++. ..+....+-++...|++.+|...+.+++..+| T Consensus 34 yyr~GmsaQs~GeyaEALe~Y~eAL~Le~D~~Drs~iLYNIGlIy~~nGe~~kALeyY~qALeiNP 99 (165) T CHL00033 34 YYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNP 99 (165) T ss_pred HHHCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC T ss_conf 998151010135389999999986234568512668998999999985888899999999998599 No 120 >PRK10803 hypothetical protein; Provisional Probab=97.62 E-value=0.0039 Score=42.47 Aligned_cols=98 Identities=13% Similarity=-0.025 Sum_probs=50.9 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 215886999999998884212698---99999999999525745699999999950231035789999999999996699 Q gi|254781033|r 165 CRIGDLNSAQRYATKALDISPDAP---WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGD 241 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~P~~~---~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~ 241 (492) +..++++.|+..++.-++..|++. .++.-|+++|...++|++|...+...++..|.+...- ..++..+......|+ T Consensus 153 l~~k~y~~Ai~aF~~Fi~~yP~s~y~~nA~YWLGe~~y~~~~~~~A~~~F~~vv~~yP~s~K~p-dAllklg~~~~~lg~ 231 (262) T PRK10803 153 QDKSRQDDAIVAFQNFIKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAA-DAMFKVGVIMQDKGD 231 (262) T ss_pred HHHCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCH-HHHHHHHHHHHHCCC T ss_conf 9735699999999999987879987743099999999974378999999999998677998678-999999999998599 Q ss_pred HHHHHHHHHHHHHCCCCCHHHH Q ss_conf 8999999987543133527999 Q gi|254781033|r 242 MIASYHDAIESLKLCDNSIMAS 263 (492) Q Consensus 242 ~~~A~~~~~~a~~~~p~~~~a~ 263 (492) .+.|...+.+.++..|+...+. T Consensus 232 ~~~Ak~~l~~vi~~YP~S~aA~ 253 (262) T PRK10803 232 TAKAKAVYQQVIKKYPGTDGAK 253 (262) T ss_pred HHHHHHHHHHHHHHCCCCHHHH T ss_conf 9999999999999885998999 No 121 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.56 E-value=0.01 Score=39.21 Aligned_cols=101 Identities=10% Similarity=0.059 Sum_probs=51.2 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 455215886999999998884212698---99999999999525745699999999950231035789999999999996 Q gi|254781033|r 162 FESCRIGDLNSAQRYATKALDISPDAP---WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 162 ~~al~~gd~~~A~~~~~~a~~~~P~~~---~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) ..+...||+..|...+.+-++..|++. .++.-|++.+..+|++++|...+....+..+..+..- ..++-.+....+ T Consensus 149 ~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KAp-dallKlg~~~~~ 227 (262) T COG1729 149 LDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAP-DALLKLGVSLGR 227 (262) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCH-HHHHHHHHHHHH T ss_conf 9999838788999999999970999755602499988999862461679999999998389999882-899999999998 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 6998999999987543133527999 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSIMAS 263 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~~a~ 263 (492) .++.++|...+.+..+..|+...+. T Consensus 228 l~~~d~A~atl~qv~k~YP~t~aA~ 252 (262) T COG1729 228 LGNTDEACATLQQVIKRYPGTDAAK 252 (262) T ss_pred HCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 2177999999999998789977999 No 122 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.53 E-value=0.0061 Score=40.95 Aligned_cols=107 Identities=21% Similarity=0.205 Sum_probs=77.2 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCC Q ss_conf 9999999996699899999998754313352---7999999999997399478999999999832898999999985078 Q gi|254781033|r 229 LIARSLENADKGDMIASYHDAIESLKLCDNS---IMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSE 305 (492) Q Consensus 229 ~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~---~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~ 305 (492) .|..+..+...++..+|...+.+-++..|.. ..+.+.++..+..+|++..|..++..+.+..|+ T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~------------- 210 (262) T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPK------------- 210 (262) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC------------- T ss_conf 9999999998387889999999999709997556024999889998624616799999999983899------------- Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 9589999999999826898368999999999879988999998878660999899999 Q gi|254781033|r 306 NTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLL 363 (492) Q Consensus 306 ~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~l 363 (492) +|.-++.++-||....+.|+.++|+..|+++++..|..+.... T Consensus 211 ---------------s~KApdallKlg~~~~~l~~~d~A~atl~qv~k~YP~t~aA~~ 253 (262) T COG1729 211 ---------------SPKAPDALLKLGVSLGRLGNTDEACATLQQVIKRYPGTDAAKL 253 (262) T ss_pred ---------------CCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHH T ss_conf ---------------9988289999999999821779999999999987899779999 No 123 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=97.48 E-value=0.006 Score=41.01 Aligned_cols=106 Identities=15% Similarity=0.051 Sum_probs=87.1 Q ss_pred CCHHHHHH-HHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 61589999-98753222210001689999997268850579998654552158869999999988842126989999999 Q gi|254781033|r 117 HNEYLVYL-LEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVV 195 (492) Q Consensus 117 ~~~~L~~L-l~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~ 195 (492) +.+.|..+ -.|...-.+|++++|...|.-+.-.+++...=..||+.-.+..++++.|+..+..+..++|++|......+ T Consensus 14 ~~~~L~~~Y~~AY~~y~~G~Y~~A~~~F~~L~~~~~~~~~Y~~gLa~c~q~~~~Y~~A~~~Y~~a~~l~~~~P~p~~~~a 93 (137) T TIGR02552 14 DSEQLEAIYALAYNLYQQGRYDEALKLFQLLAAYDPWNADYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRPYYHAA 93 (137) T ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCCHHHHH T ss_conf 99899999999999985366799999999999974303689999999999861678999999999752768695126998 Q ss_pred HHHHHCCCCHHHHHHHHHHHHCCC-CHH Q ss_conf 999952574569999999995023-103 Q gi|254781033|r 196 QQYVLAKEWSRAITFLNQKKKNAK-EWN 222 (492) Q Consensus 196 ~l~~~~gdw~~A~~~l~~~~~~~~-~~~ 222 (492) +.+...|+-..|++.|+....... +.. T Consensus 94 ~C~l~~g~~~~A~~aL~~a~~~~~~~~~ 121 (137) T TIGR02552 94 ECYLALGEPESALKALDLAIEIAGGENP 121 (137) T ss_pred HHHHHHCCHHHHHHHHHHHHHHHCCCCC T ss_conf 9998728978999999999998488996 No 124 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=97.44 E-value=0.0023 Score=44.27 Aligned_cols=109 Identities=12% Similarity=0.064 Sum_probs=52.0 Q ss_pred HHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH Q ss_conf 99726885057999865455215886999999998884212698999999999995257456999999999502310357 Q gi|254781033|r 145 MMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRN 224 (492) Q Consensus 145 ~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~ 224 (492) .+.+.++.....+...+...-.+|++++|...+.-+...+|.+..-+..|+......++|++|+..+.......+..++. T Consensus 9 ~~~~l~~~~L~~~Y~~AY~~y~~G~Y~~A~~~F~~L~~~~~~~~~Y~~gLa~c~q~~~~Y~~A~~~Y~~a~~l~~~~P~p 88 (137) T TIGR02552 9 SLLGLDSEQLEAIYALAYNLYQQGRYDEALKLFQLLAAYDPWNADYWLGLAACCQMLKEYEEAIDAYALAAALDPDDPRP 88 (137) T ss_pred HHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCCC T ss_conf 77289998999999999999853667999999999999743036899999999998616789999999997527686951 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 899999999999966998999999987543133 Q gi|254781033|r 225 RAILLIARSLENADKGDMIASYHDAIESLKLCD 257 (492) Q Consensus 225 ~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p 257 (492) .+..+..++..|++.+|++.+.-+++.+. T Consensus 89 ----~~~~a~C~l~~g~~~~A~~aL~~a~~~~~ 117 (137) T TIGR02552 89 ----YYHAAECYLALGEPESALKALDLAIEIAG 117 (137) T ss_pred ----HHHHHHHHHHHCCHHHHHHHHHHHHHHHC T ss_conf ----26998999872897899999999999848 No 125 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.44 E-value=0.014 Score=38.11 Aligned_cols=139 Identities=14% Similarity=0.048 Sum_probs=94.9 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHH--HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHH-------HHHHHHH Q ss_conf 999999669989999999875431335279--9999999999739947899999999983289899-------9999985 Q gi|254781033|r 232 RSLENADKGDMIASYHDAIESLKLCDNSIM--ASICAAKSLISQNKKRKAEVILEKIWKVNPHPEI-------ANIYTHL 302 (492) Q Consensus 232 ~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~--a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~L-------a~a~~~~ 302 (492) .+......+..++++..+...-+..-+.-| +....+.++.+.|+...|.+.+...-+..+-|.. -.+|..+ T Consensus 64 aAL~lA~~~k~d~Alaaf~~lektg~g~YpvLA~mr~at~~a~kgdta~AV~aFdeia~dt~~P~~~rd~ARlraa~lLv 143 (221) T COG4649 64 AALKLAQENKTDDALAAFTDLEKTGYGSYPVLARMRAATLLAQKGDTAAAVAAFDEIAADTSIPQIGRDLARLRAAYLLV 143 (221) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHH T ss_conf 99999870796689999999986488855699999999998612647889998998852578760136799998789874 Q ss_pred CCCCHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHH----HHHHHHHHH Q ss_conf 07895899999999998-268983689999999998799889999988786609998999----999999998 Q gi|254781033|r 303 LSENTVGKLKRALRLEE-INKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIF----LLLAQIEQA 370 (492) Q Consensus 303 ~~~~~~~al~~~~~l~~-~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~----~llA~ie~~ 370 (492) ..+++.+-..+++.+.. .+|--...+-+||-+++++|+|..|+..++++....-.++.. .+|.++-.. T Consensus 144 D~gsy~dV~srvepLa~d~n~mR~sArEALglAa~kagd~a~A~~~F~qia~Da~aprnirqRAq~mldlI~s 216 (221) T COG4649 144 DNGSYDDVSSRVEPLAGDGNPMRHSAREALGLAAYKAGDFAKAKSWFVQIANDAQAPRNIRQRAQIMLDLIDS 216 (221) T ss_pred CCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHC T ss_conf 3564888998741304899701788999876888732246779999999970146967799999999999860 No 126 >COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones] Probab=97.41 E-value=0.015 Score=37.87 Aligned_cols=129 Identities=15% Similarity=0.080 Sum_probs=92.3 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH Q ss_conf 98654552158869999999988842126989999999999952574569999999995023103578999999999999 Q gi|254781033|r 158 YSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENA 237 (492) Q Consensus 158 ~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~ 237 (492) .+-.......||...|...+..+....|++..+...+++.+...|+.++|..+|...-..............+....... T Consensus 138 ~~~~~~~~~~e~~~~a~~~~~~al~~~~~~~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~~~~~~~~l~a~i~ll~qaa 217 (304) T COG3118 138 LAEAKELIEAEDFGEAAPLLKQALQAAPENSEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHGLQAQIELLEQAA 217 (304) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHH T ss_conf 98766654165445589999999985845663777899999981885899999972850010457888999999999974 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC Q ss_conf 66998999999987543133527999999999997399478999999999832 Q gi|254781033|r 238 DKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVN 290 (492) Q Consensus 238 ~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~ 290 (492) ...+.. .+.+....+|+++.+.+.+|..+...|+...|...|-..++.+ T Consensus 218 ~~~~~~----~l~~~~aadPdd~~aa~~lA~~~~~~g~~e~Ale~Ll~~l~~d 266 (304) T COG3118 218 ATPEIQ----DLQRRLAADPDDVEAALALADQLHLVGRNEAALEHLLALLRRD 266 (304) T ss_pred CCCCHH----HHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC T ss_conf 599778----9999987099988999999999997388799999999998750 No 127 >KOG0543 consensus Probab=97.40 E-value=0.0095 Score=39.46 Aligned_cols=71 Identities=14% Similarity=0.053 Sum_probs=47.0 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHH Q ss_conf 79998654552158869999999988842126989999999999952574569999999995023103578 Q gi|254781033|r 155 FAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNR 225 (492) Q Consensus 155 ~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~ 225 (492) ..+..|+..+++.+++..|+....+++.+.|+|.-++...++.+...++++.|+..+.++++..|.++..+ T Consensus 258 ~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N~KALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~ 328 (397) T KOG0543 258 ACHLNLAACYLKLKEYKEAIESCNKVLELDPNNVKALYRRGQALLALGEYDLARDDFQKALKLEPSNKAAR 328 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 77647998887412188999999998845899626777878898864209899999999997488868999 No 128 >KOG1941 consensus Probab=97.31 E-value=0.02 Score=36.99 Aligned_cols=165 Identities=15% Similarity=0.082 Sum_probs=114.1 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCHH-----HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH------HHHHHH Q ss_conf 87532222100016899999972688505-----7999865455215886999999998884212698------999999 Q gi|254781033|r 126 EVQIALAERQYNIAHEKLEMMLQIPATRE-----FAVYSLYFESCRIGDLNSAQRYATKALDISPDAP------WVTEAV 194 (492) Q Consensus 126 ~A~aA~~~gd~~~A~~~~~~~~~~~~~~~-----~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~------~a~~~L 194 (492) .+.....--+..++..|-+.-+.-|.|+. .....+....+..+.+++++++++++++...++. .+...| T Consensus 89 lar~~e~l~~f~kt~~y~k~~l~lpgt~~~~~~gq~~l~~~~Ahlgls~fq~~Lesfe~A~~~A~~~~D~~LElqvcv~L 168 (518) T KOG1941 89 LARSNEKLCEFHKTISYCKTCLGLPGTRAGQLGGQVSLSMGNAHLGLSVFQKALESFEKALRYAHNNDDAMLELQVCVSL 168 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEEEEHHHHH T ss_conf 99889999985668998878742898785556503565777775137889999999999998765069840354135338 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHCC------CCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC------HHH Q ss_conf 999995257456999999999502------31035789999999999996699899999998754313352------799 Q gi|254781033|r 195 VQQYVLAKEWSRAITFLNQKKKNA------KEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNS------IMA 262 (492) Q Consensus 195 ~~l~~~~gdw~~A~~~l~~~~~~~------~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~------~~a 262 (492) +.++.+.+|+++++-...++.... ..+-..+...++-.+..+...|....|.+...++.+++-.. ..- T Consensus 169 gslf~~l~D~~Kal~f~~kA~~lv~s~~l~d~~~kyr~~~lyhmaValR~~G~LgdA~e~C~Ea~klal~~Gdra~~arc 248 (518) T KOG1941 169 GSLFAQLKDYEKALFFPCKAAELVNSYGLKDWSLKYRAMSLYHMAVALRLLGRLGDAMECCEEAMKLALQHGDRALQARC 248 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHH T ss_conf 89999987446777614769999875275605689999999999999998412113999999999999984777988999 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHC Q ss_conf 9999999997399478999999999832 Q gi|254781033|r 263 SICAAKSLISQNKKRKAEVILEKIWKVN 290 (492) Q Consensus 263 ~~~~a~~l~~~g~~~~A~~~le~a~~~~ 290 (492) ....+++|...|+.+.|-.-+|.+.... T Consensus 249 ~~~~aDIyR~~gd~e~af~rYe~Am~~m 276 (518) T KOG1941 249 LLCFADIYRSRGDLERAFRRYEQAMGTM 276 (518) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 9999999886130766899999999899 No 129 >KOG3785 consensus Probab=97.27 E-value=0.021 Score=36.72 Aligned_cols=186 Identities=12% Similarity=0.093 Sum_probs=96.6 Q ss_pred CCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 1004787999986301346661589999987532222100016899999972688-505799986545521588699999 Q gi|254781033|r 97 AHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPA-TREFAVYSLYFESCRIGDLNSAQR 175 (492) Q Consensus 97 ~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~-~~~~~~~gL~~~al~~gd~~~A~~ 175 (492) .-|+..|..++.-...... ..+.-..+-.|..+..-||+++|...|+.+...+. ...++ ..|+....-.|.+.+|.. T Consensus 35 ~rDytGAislLefk~~~~~-EEE~~~~lWia~C~fhLgdY~~Al~~Y~~~~~~~~~~~el~-vnLAcc~FyLg~Y~eA~~ 112 (557) T KOG3785 35 NRDYTGAISLLEFKLNLDR-EEEDSLQLWIAHCYFHLGDYEEALNVYTFLMNKDDAPAELG-VNLACCKFYLGQYIEAKS 112 (557) T ss_pred CCCCHHHHHHHHHHHCCCH-HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCCC-HHHHHHHHHHHHHHHHHH T ss_conf 2551458999988604441-44678999999988752139999999998853679970102-528999999998999999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 99988842126989999999999952574569999999995023103578999999999999669989999999875431 Q gi|254781033|r 176 YATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKL 255 (492) Q Consensus 176 ~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~ 255 (492) ..++ .|+++--.+.++.+..+.+|-.+....-+..... ... + ++.+-.........+|.+.+++.+.. T Consensus 113 ~~~k----a~k~pL~~RLlfhlahklndEk~~~~fh~~LqD~---~Ed-q----LSLAsvhYmR~HYQeAIdvYkrvL~d 180 (557) T KOG3785 113 IAEK----APKTPLCIRLLFHLAHKLNDEKRILTFHSSLQDT---LED-Q----LSLASVHYMRMHYQEAIDVYKRVLQD 180 (557) T ss_pred HHHH----CCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH---HHH-H----HHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9861----8987599999999998717688999999887546---787-7----76777799999899999999999704 Q ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHH Q ss_conf 33527999999999997399478999999999832898999 Q gi|254781033|r 256 CDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIA 296 (492) Q Consensus 256 ~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La 296 (492) +|.....-+-.|.-+.+..-+.-+..++.-.+...|+..++ T Consensus 181 n~ey~alNVy~ALCyyKlDYydvsqevl~vYL~q~pdStiA 221 (557) T KOG3785 181 NPEYIALNVYMALCYYKLDYYDVSQEVLKVYLRQFPDSTIA 221 (557) T ss_pred CHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHH T ss_conf 86364567889999985561666899999999968995888 No 130 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=97.27 E-value=0.021 Score=36.71 Aligned_cols=218 Identities=15% Similarity=0.062 Sum_probs=130.6 Q ss_pred HHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 99999999999999997210047879999863013466615899999875322221000168999999726885057999 Q gi|254781033|r 79 RNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVY 158 (492) Q Consensus 79 rr~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~ 158 (492) ..+.++.-.+.+|.+.-+-|=+..|+-.+.++.+..+ +.+...++|.. -..+.||++.|-+.|....+-+|+...+.+ T Consensus 60 ~~eeRA~l~fERGvlYDSlGL~~LAR~DftQaLai~P-~m~~vfNyLG~-Yl~~a~~fdaa~eaFds~~ELDp~y~Ya~l 137 (297) T COG4785 60 TDEERAQLLFERGVLYDSLGLRALARNDFSQALAIRP-DMPEVFNYLGI-YLTQAGNFDAAYEAFDSVLELDPTYNYAHL 137 (297) T ss_pred CHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCC-CCHHHHHHHHH-HHHHCCCCHHHHHHHHHHHCCCCCCHHHHH T ss_conf 8788889988612034224079998601456663496-75899989999-998602205799986667104775107776 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHH-HHHHCCCCHHHHHHHHHHHHHHHHH Q ss_conf 865455215886999999998884212698999999999995257456999999-9995023103578999999999999 Q gi|254781033|r 159 SLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLN-QKKKNAKEWNRNRAILLIARSLENA 237 (492) Q Consensus 159 gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~-~~~~~~~~~~~~~a~l~~~~a~~~~ 237 (492) .-....---|.+.-|..-+.+-.+.+|++|.-..-| -+-.+.-|..+|..-+. +..++..+.=.- .+.-+..+... T Consensus 138 NRgi~~YY~gR~~LAq~d~~~fYQ~D~~DPfR~LWL-Yl~E~k~dP~~A~tnL~qR~~~~d~e~WG~-~iV~~yLgkiS- 214 (297) T COG4785 138 NRGIALYYGGRYKLAQDDLLAFYQDDPNDPFRSLWL-YLNEQKLDPKQAKTNLKQRAEKSDKEQWGW-NIVEFYLGKIS- 214 (297) T ss_pred CCCEEEEECCCCHHHHHHHHHHHHCCCCCHHHHHHH-HHHHHCCCHHHHHHHHHHHHHCCCHHHHHH-HHHHHHHHHCC- T ss_conf 044023444751555999999984498980789999-987610786899999999987056765518-79999970403- Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 6699899999998754313352799999999999739947899999999983289899999998 Q gi|254781033|r 238 DKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTH 301 (492) Q Consensus 238 ~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~ 301 (492) .+.-...+...+.+-..+.....+...-+++.+...|+..+|..+++-+...+-+.-+-..|+. T Consensus 215 ~e~l~~~~~a~a~~n~~~Ae~LTEtyFYL~K~~l~~G~~~~A~~LfKLaiannVynfVE~RyA~ 278 (297) T COG4785 215 EETLMERLKADATDNTSLAEHLTETYFYLGKYYLSLGDLDEATALFKLAVANNVYNFVEHRYAL 278 (297) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999986312128999999999999999986131178899999999988799999999999 No 131 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.22 E-value=0.024 Score=36.28 Aligned_cols=90 Identities=19% Similarity=0.168 Sum_probs=34.9 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 455215886999999998884212698---99999999999525745699999999950231035789999999999996 Q gi|254781033|r 162 FESCRIGDLNSAQRYATKALDISPDAP---WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 162 ~~al~~gd~~~A~~~~~~a~~~~P~~~---~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) ......|+++.|...++.++....+.. -+...|+.++.+++..++|+++|+...... . .+.....++-.++. T Consensus 97 k~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~t~~~~~--w---~~~~~elrGDill~ 171 (207) T COG2976 97 KAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLARVQLQQKKADAALKTLDTIKEES--W---AAIVAELRGDILLA 171 (207) T ss_pred HHHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC--H---HHHHHHHHHHHHHH T ss_conf 999863429999999999871452578999999999999998603888999872301110--7---88999974269997 Q ss_pred CCCHHHHHHHHHHHHHCC Q ss_conf 699899999998754313 Q gi|254781033|r 239 KGDMIASYHDAIESLKLC 256 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~ 256 (492) .|+..+|+..|.+++... T Consensus 172 kg~k~~Ar~ay~kAl~~~ 189 (207) T COG2976 172 KGDKQEARAAYEKALESD 189 (207) T ss_pred CCCHHHHHHHHHHHHHCC T ss_conf 286599999999998715 No 132 >KOG2796 consensus Probab=97.13 E-value=0.029 Score=35.67 Aligned_cols=157 Identities=11% Similarity=0.073 Sum_probs=112.2 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH-HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC------HHH---HH Q ss_conf 9999999999669989999999875431335279-9999999999739947899999999983289------899---99 Q gi|254781033|r 228 LLIARSLENADKGDMIASYHDAIESLKLCDNSIM-ASICAAKSLISQNKKRKAEVILEKIWKVNPH------PEI---AN 297 (492) Q Consensus 228 l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~-a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~------p~L---a~ 297 (492) ..+.....++..++..-....+.+..+.+|..-| ..-.+++.-++.|+.+.|....+..-+.+.. ..+ .. T Consensus 179 Vmy~~~~~llG~kEy~iS~d~~~~vi~~~~e~~p~L~s~Lgr~~MQ~GD~k~a~~yf~~vek~~~kL~~~q~~~~V~~n~ 258 (366) T KOG2796 179 VMYSMANCLLGMKEYVLSVDAYHSVIKYYPEQEPQLLSGLGRISMQIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMNS 258 (366) T ss_pred HHHHHHHHHHCCHHHHEEHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 99999999836304511099999998638730679999987888860438889989999998761432340467887634 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHH-------HHHHHHHHHH Q ss_conf 999850789589999999999826898368999999999879988999998878660999899-------9999999998 Q gi|254781033|r 298 IYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEI-------FLLLAQIEQA 370 (492) Q Consensus 298 a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~-------~~llA~ie~~ 370 (492) ++.++-..+...+...+.+....+|.++..-...|-..+-.|+...|.+.++.+++..|+.-. .+-|++++.. T Consensus 259 a~i~lg~nn~a~a~r~~~~i~~~D~~~~~a~NnKALcllYlg~l~DAiK~~e~~~~~~P~~~l~es~~~nL~tmyEL~Ys 338 (366) T KOG2796 259 AFLHLGQNNFAEAHRFFTEILRMDPRNAVANNNKALCLLYLGKLKDALKQLEAMVQQDPRHYLHESVLFNLTTMYELEYS 338 (366) T ss_pred HHEEECCCCHHHHHHHHHHCCCCCCCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHH T ss_conf 42022133359999988504224898702205199999998778899999998761188621014588878989999976 Q ss_pred HCCC-HHHHHHHHHH Q ss_conf 5799-8999999999 Q gi|254781033|r 371 NSHN-TDKILYWTQS 384 (492) Q Consensus 371 e~~d-~~~ar~wl~r 384 (492) ...+ ....+.|.++ T Consensus 339 ~~~~~k~~l~~~ia~ 353 (366) T KOG2796 339 RSMQKKQALLEAVAG 353 (366) T ss_pred HHHHHHHHHHHHHHC T ss_conf 553599999999855 No 133 >KOG2796 consensus Probab=97.11 E-value=0.029 Score=35.69 Aligned_cols=208 Identities=11% Similarity=0.090 Sum_probs=127.1 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHC-------------CCCH-------H-HHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99999875322221000168999999726-------------8850-------5-7999865455215886999999998 Q gi|254781033|r 121 LVYLLEVQIALAERQYNIAHEKLEMMLQI-------------PATR-------E-FAVYSLYFESCRIGDLNSAQRYATK 179 (492) Q Consensus 121 L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~-------------~~~~-------~-~~~~gL~~~al~~gd~~~A~~~~~~ 179 (492) -..++-|.+-+..|+..+....+..+.+. ++.. . .....+....+-.|.+.-....+.+ T Consensus 123 smR~lhAe~~~~lgnpqesLdRl~~L~~~V~~ii~~~e~~~~~ESsv~lW~KRl~~Vmy~~~~~llG~kEy~iS~d~~~~ 202 (366) T KOG2796 123 SMRILHAELQQYLGNPQESLDRLHKLKTVVSKILANLEQGLAEESSIRLWRKRLGRVMYSMANCLLGMKEYVLSVDAYHS 202 (366) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHEEHHHHHH T ss_conf 99999999886569926779999999999999999988256604279999999999999999998363045110999999 Q ss_pred HHHHC-CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH--HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 88421-2698999999999995257456999999999502310357--89999999999996699899999998754313 Q gi|254781033|r 180 ALDIS-PDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRN--RAILLIARSLENADKGDMIASYHDAIESLKLC 256 (492) Q Consensus 180 a~~~~-P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~--~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~ 256 (492) ..+.+ |..+.....|+.+..+-||-+.+...+++.-|.....+.. .....-..+..++..++..++-..+.+.+..+ T Consensus 203 vi~~~~e~~p~L~s~Lgr~~MQ~GD~k~a~~yf~~vek~~~kL~~~q~~~~V~~n~a~i~lg~nn~a~a~r~~~~i~~~D 282 (366) T KOG2796 203 VIKYYPEQEPQLLSGLGRISMQIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMNSAFLHLGQNNFAEAHRFFTEILRMD 282 (366) T ss_pred HHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHEEECCCCHHHHHHHHHHCCCCC T ss_conf 98638730679999987888860438889989999998761432340467887634420221333599999885042248 Q ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHH--------HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 35279999999999973994789999999998328989999--------9998507895899999999998268983689 Q gi|254781033|r 257 DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIAN--------IYTHLLSENTVGKLKRALRLEEINKESVESL 328 (492) Q Consensus 257 p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~--------a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~ 328 (492) |..+.+...-|..+.-.|+...|.|.++.+....|.+.+-. .|-.........-....+-.....||+..+. T Consensus 283 ~~~~~a~NnKALcllYlg~l~DAiK~~e~~~~~~P~~~l~es~~~nL~tmyEL~Ys~~~~~k~~l~~~ia~~~~d~f~~~ 362 (366) T KOG2796 283 PRNAVANNNKALCLLYLGKLKDALKQLEAMVQQDPRHYLHESVLFNLTTMYELEYSRSMQKKQALLEAVAGKEGDSFNTQ 362 (366) T ss_pred CCCHHHHCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCH T ss_conf 98702205199999998778899999998761188621014588878989999976553599999999855287753413 No 134 >COG2909 MalT ATP-dependent transcriptional regulator [Transcription] Probab=97.08 E-value=0.032 Score=35.32 Aligned_cols=96 Identities=17% Similarity=0.102 Sum_probs=37.8 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCCC-H------HHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCC-----CHHH Q ss_conf 9999999721004787999986301346661-5------89999987532222100016899999972688-----5057 Q gi|254781033|r 88 LYTGLMSIAAHNIPLARKMHSYVSQQHTFHN-E------YLVYLLEVQIALAERQYNIAHEKLEMMLQIPA-----TREF 155 (492) Q Consensus 88 l~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~-~------~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~-----~~~~ 155 (492) +.+++.....+.+..|+....++......++ . .-..-|.|+.+...||.+.+.++-+..+..-+ ++.. T Consensus 419 ll~aW~~~s~~r~~ea~~li~~l~~~l~~~~~~~~~~l~ae~~aL~a~val~~~~~e~a~~lar~al~~L~~~~~~~r~~ 498 (894) T COG2909 419 LLQAWLLASQHRLAEAETLIARLEHFLKAPMHSRQGDLLAEFQALRAQVALNRGDPEEAEDLARLALVQLPEAAYRSRIV 498 (894) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHH T ss_conf 99999999736917899999999987077766312448899999999999845877899999999998661225400025 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 9998654552158869999999988842 Q gi|254781033|r 156 AVYSLYFESCRIGDLNSAQRYATKALDI 183 (492) Q Consensus 156 ~~~gL~~~al~~gd~~~A~~~~~~a~~~ 183 (492) .+..++....-.|++++|+.+...+.+. T Consensus 499 ~~sv~~~a~~~~G~~~~Al~~~~~a~~~ 526 (894) T COG2909 499 ALSVLGEAAHIRGELTQALALMQQAEQM 526 (894) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHH T ss_conf 6540568888734399999988879999 No 135 >KOG2053 consensus Probab=97.05 E-value=0.034 Score=35.11 Aligned_cols=121 Identities=16% Similarity=0.091 Sum_probs=73.0 Q ss_pred HCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 21004787999986301346661589999987532222100016899999972688505799986545521588699999 Q gi|254781033|r 96 AAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQR 175 (492) Q Consensus 96 ~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~ 175 (492) ..|++..|-+...+..+. .++.+.+..+.|..-...|..++|...++..-...++....+..+...+.+.|+.+.|.. T Consensus 21 d~~qfkkal~~~~kllkk--~Pn~~~a~vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~tLq~l~~~y~d~~~~d~~~~ 98 (932) T KOG2053 21 DSSQFKKALAKLGKLLKK--HPNALYAKVLKALSLFRLGKGDEALKLLEALYGLKGTDDLTLQFLQNVYRDLGKLDEAVH 98 (932) T ss_pred HHHHHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 267799999999999987--899179999999999981586467888751304799856899999999998742568999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC Q ss_conf 99988842126989999999999952574569999999995023 Q gi|254781033|r 176 YATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAK 219 (492) Q Consensus 176 ~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~ 219 (492) .++++....|+ ...+..++..|.+.+++.+-.+.--+..|..+ T Consensus 99 ~Ye~~~~~~P~-eell~~lFmayvR~~~yk~qQkaa~~LyK~~p 141 (932) T KOG2053 99 LYERANQKYPS-EELLYHLFMAYVREKSYKKQQKAALQLYKNFP 141 (932) T ss_pred HHHHHHHHCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99998710884-88999999999998889999999999998587 No 136 >KOG4340 consensus Probab=96.99 E-value=0.038 Score=34.74 Aligned_cols=249 Identities=11% Similarity=-0.012 Sum_probs=122.3 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCH Q ss_conf 99997210047879999863013466615899999875322221000168999999726885057999865455215886 Q gi|254781033|r 91 GLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDL 170 (492) Q Consensus 91 gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~ 170 (492) |....-..++..|..-..+.... .+...-..|-.||+.-..+-+..|.+....|.+.+.-+...+.--..+.-..+|. T Consensus 51 gyCYY~~Q~f~~AA~CYeQL~ql--~P~~~qYrlY~AQSLY~A~i~ADALrV~~~~~D~~~L~~~~lqLqaAIkYse~Dl 128 (459) T KOG4340 51 GYCYYRLQEFALAAECYEQLGQL--HPELEQYRLYQAQSLYKACIYADALRVAFLLLDNPALHSRVLQLQAAIKYSEGDL 128 (459) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCC T ss_conf 99999999998888999999863--7188889999999999703528899999995684989999999999986054555 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99999999888421269899999999999525745699999999950231035789999999999996699899999998 Q gi|254781033|r 171 NSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAI 250 (492) Q Consensus 171 ~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~ 250 (492) ..++...+..... +....+...+-+..+.|++++|+.-+..++.-..- ...+.+..+..+...++.+.|++... T Consensus 129 ~g~rsLveQlp~e--n~Ad~~in~gCllykegqyEaAvqkFqaAlqvsGy----qpllAYniALaHy~~~qyasALk~iS 202 (459) T KOG4340 129 PGSRSLVEQLPSE--NEADGQINLGCLLYKEGQYEAAVQKFQAALQVSGY----QPLLAYNLALAHYSSRQYASALKHIS 202 (459) T ss_pred CCHHHHHHHCCCC--CCHHHHHCCHHHEECCCCHHHHHHHHHHHHHHCCC----CHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 3549999857788--85023303012000165489999999998730576----72789999999873120778999999 Q ss_pred HHH----HCCCCCH-------------------------HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC---CHHHHHH Q ss_conf 754----3133527-------------------------9999999999973994789999999998328---9899999 Q gi|254781033|r 251 ESL----KLCDNSI-------------------------MASICAAKSLISQNKKRKAEVILEKIWKVNP---HPEIANI 298 (492) Q Consensus 251 ~a~----~~~p~~~-------------------------~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p---~p~La~a 298 (492) +.+ +..|.+. .+..+-+.++.+.++...|...+...-..+- +|--... T Consensus 203 EIieRG~r~HPElgIGm~tegiDvrsvgNt~~lh~Sal~eAfNLKaAIeyq~~n~eAA~eaLtDmPPRaE~elDPvTLHN 282 (459) T KOG4340 203 EIIERGIRQHPELGIGMTTEGIDVRSVGNTLVLHQSALVEAFNLKAAIEYQLRNYEAAQEALTDMPPRAEEELDPVTLHN 282 (459) T ss_pred HHHHHHHHCCCCCCCCCEECCCCHHHCCCHHHHHHHHHHHHHHHHHHEEECCCCHHHHHHHHHCCCCCCCCCCCCHHHHH T ss_conf 99975542397557652102674131365599999999998500542000236778898886259985412378444566 Q ss_pred HHHHC-CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 99850-78958999999999982689836899999999987998899999 Q gi|254781033|r 299 YTHLL-SENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAK 347 (492) Q Consensus 299 ~~~~~-~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~ 347 (492) .+... .+++.+....+.-++.++|=-+|.-..+--++.+..-|+.|... T Consensus 283 ~Al~n~~~~p~~g~~KLqFLL~~nPfP~ETFANlLllyCKNeyf~lAADv 332 (459) T KOG4340 283 QALMNMDARPTEGFEKLQFLLQQNPFPPETFANLLLLYCKNEYFDLAADV 332 (459) T ss_pred HHHHCCCCCCCCCHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 77742668855208999999726999868898899999632677778999 No 137 >pfam03704 BTAD Bacterial transcriptional activator domain. Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats. Probab=96.85 E-value=0.049 Score=33.93 Aligned_cols=60 Identities=10% Similarity=0.090 Sum_probs=39.7 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 999865455215886999999998884212698999999999995257456999999999 Q gi|254781033|r 156 AVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKK 215 (492) Q Consensus 156 ~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~ 215 (492) ++..+....+..|+++.|+..+++++..+|-+..++..++..+...|+-.+|+..+++.. T Consensus 64 a~~~l~~~~~~~g~~~~a~~~~~~~l~~dPl~E~~~~~Lm~a~~~~G~~~~Al~~Y~~~r 123 (146) T pfam03704 64 ALERRIEADLRLGRHEEALAELRALVALHPLRERLHRQLMRALYRSGRQAEALRVYRRLR 123 (146) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 999999999885983021999999998688742999999999998899999999999999 No 138 >KOG4555 consensus Probab=96.85 E-value=0.039 Score=34.71 Aligned_cols=96 Identities=21% Similarity=0.160 Sum_probs=71.4 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 21588699999999888421269899999999999525745699999999950231035789999999999996699899 Q gi|254781033|r 165 CRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIA 244 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~ 244 (492) -..|+.+.|++.+.+++.+.|..+.++....+.+..+|+-++|++-+.+++.....-.+..-..+..++..+.-.|+.+. T Consensus 54 aE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trtacqa~vQRg~lyRl~g~dd~ 133 (175) T KOG4555 54 AEAGDLDGALELFGQALCLAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRTACQAFVQRGLLYRLLGNDDA 133 (175) T ss_pred HHCCCHHHHHHHHHHHHHHCCCCHHHHCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHH T ss_conf 85045678999999999854335386613999998728818789989999986286206999999999999999376188 Q ss_pred HHHHHHHHHHCCCCCH Q ss_conf 9999987543133527 Q gi|254781033|r 245 SYHDAIESLKLCDNSI 260 (492) Q Consensus 245 A~~~~~~a~~~~p~~~ 260 (492) |+..+..+..+...|. T Consensus 134 AR~DFe~AA~LGS~FA 149 (175) T KOG4555 134 ARADFEAAAQLGSKFA 149 (175) T ss_pred HHHHHHHHHHHCCHHH T ss_conf 8886999998678789 No 139 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=96.85 E-value=0.049 Score=33.90 Aligned_cols=192 Identities=14% Similarity=0.180 Sum_probs=82.3 Q ss_pred HHHHCCCCHHHHHHHHHHHHHHCCCCHHHHH---HHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 4552158869999999988842126989999---9999999525745699999999950231035789999999999996 Q gi|254781033|r 162 FESCRIGDLNSAQRYATKALDISPDAPWVTE---AVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENAD 238 (492) Q Consensus 162 ~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~---~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~ 238 (492) ...++.|++++|...+++.....|.+++... .++-.+...++++.|+..+++-++..|.++...-+..+ .+.. T Consensus 42 ~~~L~~gn~~~A~~~fe~l~~~~p~~~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~Yl-kgLs--- 117 (254) T COG4105 42 LTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPNADYAYYL-KGLS--- 117 (254) T ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHH-HHHH--- T ss_conf 999863788999999999987099981128999999999881355899999999999878999871699999-9998--- Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 69989999999875431335279999999999973994789999999998328989999999850789589999999999 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRLE 318 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~~l~ 318 (492) .+........+ +.-.+.|..-+......-|+...+ .+...++..+...+ T Consensus 118 ---------~~~~i~~~~rD--------------q~~~~~A~~~f~~~i~ryPnS~Ya--------~dA~~~i~~~~d~L 166 (254) T COG4105 118 ---------YFFQIDDVTRD--------------QSAARAAFAAFKELVQRYPNSRYA--------PDAKARIVKLNDAL 166 (254) T ss_pred ---------HHCCCCCCCCC--------------HHHHHHHHHHHHHHHHHCCCCCCH--------HHHHHHHHHHHHHH T ss_conf ---------75267754447--------------799999999999999878897023--------55999999999998 Q ss_pred HHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH----HHHHHHHHHHHHHCCCHHHHHHHHHHHH-CCCCCCC Q ss_conf 8268983689999999998799889999988786609998----9999999999985799899999999996-2889995 Q gi|254781033|r 319 EINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK----EIFLLLAQIEQANSHNTDKILYWTQSAL-HAMPDPL 393 (492) Q Consensus 319 ~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~----~~~~llA~ie~~e~~d~~~ar~wl~rA~-~a~~dp~ 393 (492) .+--+.+|+-++..|.|-.|...++.+++.-|+. ..+..|.+.....+-.+.+- . ..+-+ .-.||-. T Consensus 167 ------A~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~t~~~~eaL~~l~eaY~~lgl~~~a~-~-~~~vl~~N~p~s~ 238 (254) T COG4105 167 ------AGHEMAIARYYLKRGAYVAAINRFEEVLENYPDTSAVREALARLEEAYYALGLTDEAK-K-TAKVLGANYPDSQ 238 (254) T ss_pred ------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHH-H-HHHHHHHCCCCCC T ss_conf ------8999999999999057699999999998714555204778999999999828727899-9-9999973689882 Q ss_pred EEC Q ss_conf 275 Q gi|254781033|r 394 WIS 396 (492) Q Consensus 394 W~c 396 (492) |.. T Consensus 239 ~~~ 241 (254) T COG4105 239 WYK 241 (254) T ss_pred CHH T ss_conf 035 No 140 >KOG4555 consensus Probab=96.67 E-value=0.045 Score=34.21 Aligned_cols=95 Identities=16% Similarity=0.004 Sum_probs=40.3 Q ss_pred HHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH----HHHHHHHHHHHHCC Q ss_conf 75322221000168999999726885057999865455215886999999998884212698----99999999999525 Q gi|254781033|r 127 VQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAP----WVTEAVVQQYVLAK 202 (492) Q Consensus 127 A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~----~a~~~L~~l~~~~g 202 (492) +-+....|+.+.|.+.|.+.+.--|.+.-++..-+....-+|+.+.|+.-+++++++..+.. .++...+.+|..+| T Consensus 50 ~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRAQa~RLq~~~e~ALdDLn~AleLag~~trtacqa~vQRg~lyRl~g 129 (175) T KOG4555 50 AIALAEAGDLDGALELFGQALCLAPERASAYNNRAQALRLQGDDEEALDDLNKALELAGDQTRTACQAFVQRGLLYRLLG 129 (175) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHCCHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHC T ss_conf 88988504567899999999985433538661399999872881878998999998628620699999999999999937 Q ss_pred CCHHHHHHHHHHHHCCCCH Q ss_conf 7456999999999502310 Q gi|254781033|r 203 EWSRAITFLNQKKKNAKEW 221 (492) Q Consensus 203 dw~~A~~~l~~~~~~~~~~ 221 (492) +-+.|+.-++.+...+..+ T Consensus 130 ~dd~AR~DFe~AA~LGS~F 148 (175) T KOG4555 130 NDDAARADFEAAAQLGSKF 148 (175) T ss_pred CHHHHHHHHHHHHHHCCHH T ss_conf 6188888699999867878 No 141 >pfam03704 BTAD Bacterial transcriptional activator domain. Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats. Probab=96.56 E-value=0.068 Score=32.79 Aligned_cols=131 Identities=15% Similarity=0.106 Sum_probs=77.4 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 99986545521588699999999888421269899999999999525745699999999950231035789999999999 Q gi|254781033|r 156 AVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLE 235 (492) Q Consensus 156 ~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~ 235 (492) .+..-+..+...||++.|...+++++.+.-+..-.-. ..++|-......- ...........+.. T Consensus 8 ~~~~~a~~a~~~gd~~~a~~~~~~Al~LyrG~~L~~~-------~~~~W~~~~r~~l---------~~~~~~a~~~l~~~ 71 (146) T pfam03704 8 RLVRAGRRALAAGDPAEAARLLRAALALWRGPALADV-------PAGPWLEAERERL---------EELRLRALERRIEA 71 (146) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCC-------CCCHHHHHHHHHH---------HHHHHHHHHHHHHH T ss_conf 9999999988789999999999999985589887789-------8516899999999---------99999999999999 Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-------HCCCHHHHHHHHHH Q ss_conf 99669989999999875431335279999999999973994789999999998-------32898999999985 Q gi|254781033|r 236 NADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWK-------VNPHPEIANIYTHL 302 (492) Q Consensus 236 ~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~-------~~p~p~La~a~~~~ 302 (492) .+..|++.++...+.+++..+|-.-++...+-..+...|+..+|.+.+++... ..|.|.+...|..+ T Consensus 72 ~~~~g~~~~a~~~~~~~l~~dPl~E~~~~~Lm~a~~~~G~~~~Al~~Y~~~r~~L~~ELGv~P~~~~~~l~~~i 145 (146) T pfam03704 72 DLRLGRHEEALAELRALVALHPLRERLHRQLMRALYRSGRQAEALRVYRRLRRRLAEELGVEPGPELRALHQEI 145 (146) T ss_pred HHHCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 98859830219999999986887429999999999988999999999999999999983899989999999976 No 142 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=96.54 E-value=0.077 Score=32.35 Aligned_cols=179 Identities=17% Similarity=0.136 Sum_probs=112.3 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 99999988842126989999999999952574569999999995023103578999999999999669989999999875 Q gi|254781033|r 173 AQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIES 252 (492) Q Consensus 173 A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a 252 (492) |+--+..++.+.|+.+.+..-|+--+...|+++.|.+.++......|.++-.. +.++....-.|.+..|.+.+.+- T Consensus 84 AR~DftQaLai~P~m~~vfNyLG~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya~----lNRgi~~YY~gR~~LAq~d~~~f 159 (297) T COG4785 84 ARNDFSQALAIRPDMPEVFNYLGIYLTQAGNFDAAYEAFDSVLELDPTYNYAH----LNRGIALYYGGRYKLAQDDLLAF 159 (297) T ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCCHHHH----HCCCEEEEECCCCHHHHHHHHHH T ss_conf 86014566634967589998999999860220579998666710477510777----60440234447515559999999 Q ss_pred HHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH----HHHHHHHHHCCCCHHHHHHHHHHHHH----HCCCC Q ss_conf 43133527999999999997399478999999999832898----99999998507895899999999998----26898 Q gi|254781033|r 253 LKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP----EIANIYTHLLSENTVGKLKRALRLEE----INKES 324 (492) Q Consensus 253 ~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p----~La~a~~~~~~~~~~~al~~~~~l~~----~~P~~ 324 (492) ...+|..+--...+ -+-...-+..+|..-+..-.+...+. .++..|..-. +.....+++..-.. ....- T Consensus 160 YQ~D~~DPfR~LWL-Yl~E~k~dP~~A~tnL~qR~~~~d~e~WG~~iV~~yLgki--S~e~l~~~~~a~a~~n~~~Ae~L 236 (297) T COG4785 160 YQDDPNDPFRSLWL-YLNEQKLDPKQAKTNLKQRAEKSDKEQWGWNIVEFYLGKI--SEETLMERLKADATDNTSLAEHL 236 (297) T ss_pred HHCCCCCHHHHHHH-HHHHHCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC--CHHHHHHHHHHHHHHHHHHHHHH T ss_conf 84498980789999-9876107868999999999870567655187999997040--39999999986312128999999 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH Q ss_conf 3689999999998799889999988786609998 Q gi|254781033|r 325 VESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK 358 (492) Q Consensus 325 ~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~ 358 (492) .|...-||..++..|+.++|...+.-++..+-.+ T Consensus 237 TEtyFYL~K~~l~~G~~~~A~~LfKLaiannVyn 270 (297) T COG4785 237 TETYFYLGKYYLSLGDLDEATALFKLAVANNVYN 270 (297) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHH T ss_conf 9999999999861311788999999999887999 No 143 >COG0457 NrfG FOG: TPR repeat [General function prediction only] Probab=96.53 E-value=0.079 Score=32.27 Aligned_cols=51 Identities=20% Similarity=0.279 Sum_probs=22.2 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 589999999999826898368999999999879988999998878660999 Q gi|254781033|r 307 TVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPR 357 (492) Q Consensus 307 ~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~ 357 (492) ...+++.+.......|+.......++..+...+.+..+...+...+...|. T Consensus 218 ~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 268 (291) T COG0457 218 YEEALEYYEKALELDPDNAEALYNLALLLLELGRYEEALEALEKALELDPD 268 (291) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCHH T ss_conf 999999999998729101799999999988388699999999999987988 No 144 >KOG4234 consensus Probab=96.48 E-value=0.084 Score=32.08 Aligned_cols=56 Identities=11% Similarity=0.075 Sum_probs=27.7 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 78958999999999982689836899999999987998899999887866099989 Q gi|254781033|r 304 SENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 304 ~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~ 359 (492) .+.-..+++...+.+.++|.+.-++.-.|.+|-....|+.|.+-+.++++.+|+.+ T Consensus 147 l~k~e~aI~dcsKaiel~pty~kAl~RRAeayek~ek~eealeDyKki~E~dPs~~ 202 (271) T KOG4234 147 LRKWESAIEDCSKAIELNPTYEKALERRAEAYEKMEKYEEALEDYKKILESDPSRR 202 (271) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHH T ss_conf 56689899998765750840679999999998855517888999999987182068 No 145 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=96.46 E-value=0.087 Score=31.97 Aligned_cols=128 Identities=13% Similarity=0.026 Sum_probs=69.4 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEE-EEH---H-HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 999999999999999973489848999899766-221---8-99999999999999999999999841828899999999 Q gi|254781033|r 5 IRYFFVISLVICSFIIVSHYPEDVSITWGNRLY-RTS---P-FVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKR 79 (492) Q Consensus 5 l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~i-e~s---~-~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~r 79 (492) +-++++++++.+++.++.. ....-.||.. +.. + ..+.+.++++++++.+++.+++.+...-..-++.++.+ T Consensus 2 ~R~l~~~~~~l~~~~~~~~----~l~~~pGyV~I~~~~~~ie~Sl~~~~~~l~~~~~~l~~l~~ll~~l~~~p~~~~~~~ 77 (134) T pfam07219 2 LRVLLFLFLVLLLGLFGGA----WLAGQPGYVVISYGGYRIETSLTFAAILLLAVLAVLFLLEWLLRLIIRLPRRVRRYF 77 (134) T ss_pred HHHHHHHHHHHHHHHHHHH----HHHCCCCEEEEEECCEEEEECHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 7899999999999999999----993699959999989988941999999999999999999999999973658999999 Q ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 99999999999999972100478799998630134666158999998753222210001689999997268850579998 Q gi|254781033|r 80 NYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYS 159 (492) Q Consensus 80 r~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~g 159 (492) +++| + +++. ..+..+-.+...||...|++.+.+..+..++..+.+.. T Consensus 78 ~~rr-~----------------------~k~~----------~al~~Gl~a~~~Gd~~~A~k~~~ka~k~~~~p~l~~Ll 124 (134) T pfam07219 78 RRRK-R----------------------RRGR----------KALSEGLLALAEGDWALAERLARKAAELEDQPPLALLL 124 (134) T ss_pred HHHH-H----------------------HHHH----------HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 9999-9----------------------9999----------99999999997899999999999987637993579999 Q ss_pred HHHHHHCCCC Q ss_conf 6545521588 Q gi|254781033|r 160 LYFESCRIGD 169 (492) Q Consensus 160 L~~~al~~gd 169 (492) -++.+..+|| T Consensus 125 aA~aA~~~GD 134 (134) T pfam07219 125 AARAAQAEGD 134 (134) T ss_pred HHHHHHHCCC T ss_conf 9999988579 No 146 >pfam06305 DUF1049 Protein of unknown function (DUF1049). This family consists of several hypothetical bacterial proteins of unknown function. Probab=96.46 E-value=0.036 Score=34.98 Aligned_cols=69 Identities=17% Similarity=0.085 Sum_probs=42.6 Q ss_pred HHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999734898489998997662218999999999999999999999998418288999999999999999999 Q gi|254781033|r 14 VICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALY 89 (492) Q Consensus 14 ~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~ 89 (492) +.++..+...|+..|.+.+..++.++|+...++...++.+++.++. ..+..++..++.||.+|-.+.+. T Consensus 5 ~il~v~Fa~qN~~~V~v~~l~~~~~~pl~l~ll~~f~~G~~lg~L~-------~~~~~l~~r~~~r~l~k~lk~l~ 73 (80) T pfam06305 5 FLLGLAFALQNQQLVTLNYLFAQGEFPLSLLLLIAFALGFLLGWLI-------SLPFYLRLRRRLRRLKKQLKKLE 73 (80) T ss_pred HHHHHHHHHCCCCEEEEEEEEEEEEHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999984997589999999876039999999999999999999-------99999999999999999999999 No 147 >COG0457 NrfG FOG: TPR repeat [General function prediction only] Probab=96.38 E-value=0.096 Score=31.63 Aligned_cols=83 Identities=18% Similarity=0.174 Sum_probs=37.9 Q ss_pred CHHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 958999999999982689-8368999999999879988999998878660999-89999999999985799899999999 Q gi|254781033|r 306 NTVGKLKRALRLEEINKE-SVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPR-KEIFLLLAQIEQANSHNTDKILYWTQ 383 (492) Q Consensus 306 ~~~~al~~~~~l~~~~P~-~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~-~~~~~llA~ie~~e~~d~~~ar~wl~ 383 (492) +...++..+.+.....++ .......++..+...+++.+|...+..++...|+ ...+..++.+.. ..+....+...+. T Consensus 182 ~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 260 (291) T COG0457 182 RYEEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPDNAEALYNLALLLL-ELGRYEEALEALE 260 (291) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH-HCCCHHHHHHHHH T ss_conf 399999999999862801227999999999987525999999999998729101799999999988-3886999999999 Q ss_pred HHHCCC Q ss_conf 996288 Q gi|254781033|r 384 SALHAM 389 (492) Q Consensus 384 rA~~a~ 389 (492) ++.... T Consensus 261 ~~~~~~ 266 (291) T COG0457 261 KALELD 266 (291) T ss_pred HHHHHC T ss_conf 999879 No 148 >KOG3081 consensus Probab=96.34 E-value=0.1 Score=31.47 Aligned_cols=250 Identities=15% Similarity=0.041 Sum_probs=125.0 Q ss_pred HCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 21004787999986301346661589999987532222100016899999972688505799986545521588699999 Q gi|254781033|r 96 AAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQR 175 (492) Q Consensus 96 ~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~ 175 (492) -.|++........+....+ .+.+-..++-.+..| .|.+..-... .....-+-..+++.+.......+..+.-+. T Consensus 20 Y~Gnyq~~ine~~~~~~~~-~~~e~d~y~~raylA--lg~~~~~~~e---I~~~~~~~lqAvr~~a~~~~~e~~~~~~~~ 93 (299) T KOG3081 20 YLGNYQQCINEAEKFSSSK-TDVELDVYMYRAYLA--LGQYQIVISE---IKEGKATPLQAVRLLAEYLELESNKKSILA 93 (299) T ss_pred HHHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHH--CCCCCCCCCC---CCCCCCCHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 7668999988887520456-406899999999997--1553343531---233568768999999988608002677888 Q ss_pred HHHHHHHHCCCCH--HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 9998884212698--99999999999525745699999999950231035789999999999996699899999998754 Q gi|254781033|r 176 YATKALDISPDAP--WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESL 253 (492) Q Consensus 176 ~~~~a~~~~P~~~--~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~ 253 (492) .+.+.....-... -.+..-+.+++..+++++|++++..... ..+..+ .....+.....+.|.+..++.. T Consensus 94 ~l~E~~a~~~~~sn~i~~l~aa~i~~~~~~~deAl~~~~~~~~-------lE~~Al--~VqI~lk~~r~d~A~~~lk~mq 164 (299) T KOG3081 94 SLYELVADSTDGSNLIDLLLAAIIYMHDGDFDEALKALHLGEN-------LEAAAL--NVQILLKMHRFDLAEKELKKMQ 164 (299) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHCCCH-------HHHHHH--HHHHHHHHHHHHHHHHHHHHHH T ss_conf 7999987501441389999766875427886899999852124-------889998--7799998789999999999997 Q ss_pred HCCCCCHH--HHHHHHHHHHHCCCHHHHHHHHHHHHH-HCCCHHHH--HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 31335279--999999999973994789999999998-32898999--99998507895899999999998268983689 Q gi|254781033|r 254 KLCDNSIM--ASICAAKSLISQNKKRKAEVILEKIWK-VNPHPEIA--NIYTHLLSENTVGKLKRALRLEEINKESVESL 328 (492) Q Consensus 254 ~~~p~~~~--a~~~~a~~l~~~g~~~~A~~~le~a~~-~~p~p~La--~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~ 328 (492) ..+.+..- .+..+..+....++...|--+++..-. ..|.|.+. .+..++..+++.++...++..+...+++++.+ T Consensus 165 ~ided~tLtQLA~awv~la~ggek~qdAfyifeE~s~k~~~T~~llnG~Av~~l~~~~~eeAe~lL~eaL~kd~~dpetL 244 (299) T KOG3081 165 QIDEDATLTQLAQAWVKLATGGEKIQDAFYIFEELSEKTPPTPLLLNGQAVCHLQLGRYEEAESLLEEALDKDAKDPETL 244 (299) T ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHCCHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHH T ss_conf 11357899999999999960632454689999998353599768870699998971699999999999985347878999 Q ss_pred HHHHHHHHHCCCHHHHH-HHHHHHHHCCCCHHH Q ss_conf 99999999879988999-998878660999899 Q gi|254781033|r 329 VIVSKIALEMGSIDQAH-AKAMLAMKIAPRKEI 360 (492) Q Consensus 329 ~~LA~~~~~~g~~~~Ar-~~l~~al~~~P~~~~ 360 (492) ..+-..+...|.-+++. +++.++...+|+..- T Consensus 245 ~Nliv~a~~~Gkd~~~~~r~l~QLk~~~p~h~~ 277 (299) T KOG3081 245 ANLIVLALHLGKDAEVTERNLSQLKLSHPEHPF 277 (299) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH T ss_conf 999999998189917788899998740876537 No 149 >COG1066 Sms Predicted ATP-dependent serine protease [Posttranslational modification, protein turnover, chaperones] Probab=96.29 E-value=0.0025 Score=43.98 Aligned_cols=34 Identities=18% Similarity=0.346 Sum_probs=28.8 Q ss_pred CCEECC-CCCCCHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 952756-9895542422483768736400367875 Q gi|254781033|r 392 PLWISD-DGYLSSVWLPLSPISKTLCYFEWKIPTK 425 (492) Q Consensus 392 p~W~c~-~g~~~~~W~~~c~~~~~~~~~~W~~p~~ 425 (492) ..|+|. |||.|..|.--||+||.+|||+=..+.. T Consensus 6 t~f~C~~CG~~s~KW~GkCp~Cg~Wns~vE~~~~~ 40 (456) T COG1066 6 TAFVCQECGYVSPKWLGKCPACGAWNTLVEEVLAA 40 (456) T ss_pred CEEECCCCCCCCCCCCCCCCCCCCCCCEEEEECCC T ss_conf 27884457887862155598878743166730235 No 150 >KOG2047 consensus Probab=96.28 E-value=0.11 Score=31.24 Aligned_cols=194 Identities=11% Similarity=-0.021 Sum_probs=107.7 Q ss_pred HHHHHHCCHHHHHHHHHHHH-CCCCHHHH-----HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC----HHHHHHHHHHH Q ss_conf 32222100016899999972-68850579-----9986545521588699999999888421269----89999999999 Q gi|254781033|r 129 IALAERQYNIAHEKLEMMLQ-IPATREFA-----VYSLYFESCRIGDLNSAQRYATKALDISPDA----PWVTEAVVQQY 198 (492) Q Consensus 129 aA~~~gd~~~A~~~~~~~~~-~~~~~~~~-----~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~----~~a~~~L~~l~ 198 (492) .....|+..+-...|..+.. .+|....| ..+++..+...||.+.|+..++++.+..=.- ..++..-+++. T Consensus 356 V~l~e~~~~~~i~tyteAv~~vdP~ka~Gs~~~Lw~~faklYe~~~~l~~aRvifeka~~V~y~~v~dLa~vw~~waemE 435 (835) T KOG2047 356 VKLYEGNAAEQINTYTEAVKTVDPKKAVGSPGTLWVEFAKLYENNGDLDDARVIFEKATKVPYKTVEDLAEVWCAWAEME 435 (835) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 11104776778788999997169543789702679999999983374888999998841478632678999999999988 Q ss_pred HHCCCCHHHHHHHHHHHHCCCC--------HHHHHHH------HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 9525745699999999950231--------0357899------9999999999669989999999875431335279999 Q gi|254781033|r 199 VLAKEWSRAITFLNQKKKNAKE--------WNRNRAI------LLIARSLENADKGDMIASYHDAIESLKLCDNSIMASI 264 (492) Q Consensus 199 ~~~gdw~~A~~~l~~~~~~~~~--------~~~~~a~------l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~ 264 (492) ...++.+.|+++++++...... ....... ++...+-....-|-...-+..+.+.+.+.--.+.... T Consensus 436 lrh~~~~~Al~lm~~A~~vP~~~~~~~yd~~~pvQ~rlhrSlkiWs~y~DleEs~gtfestk~vYdriidLriaTPqii~ 515 (835) T KOG2047 436 LRHENFEAALKLMRRATHVPTNPELEYYDNSEPVQARLHRSLKIWSMYADLEESLGTFESTKAVYDRIIDLRIATPQIII 515 (835) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 76652899999987630489851555505898589999871999999988999725279999999988887618799999 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCH---HHHHHHHH----HCCC-CHHHHHHHHHHHHHHCC Q ss_conf 99999997399478999999999832898---99999998----5078-95899999999998268 Q gi|254781033|r 265 CAAKSLISQNKKRKAEVILEKIWKVNPHP---EIANIYTH----LLSE-NTVGKLKRALRLEEINK 322 (492) Q Consensus 265 ~~a~~l~~~g~~~~A~~~le~a~~~~p~p---~La~a~~~----~~~~-~~~~al~~~~~l~~~~P 322 (492) .||-+|.+.....++-+++|+.....+-| ++...|.. -.++ ....+...|++.++.-| T Consensus 516 NyAmfLEeh~yfeesFk~YErgI~LFk~p~v~diW~tYLtkfi~rygg~klEraRdLFEqaL~~Cp 581 (835) T KOG2047 516 NYAMFLEEHKYFEESFKAYERGISLFKWPNVYDIWNTYLTKFIKRYGGTKLERARDLFEQALDGCP 581 (835) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCC T ss_conf 999998755899999999973775678832899999999999999668877889999999983599 No 151 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=96.27 E-value=0.11 Score=31.19 Aligned_cols=148 Identities=16% Similarity=0.081 Sum_probs=89.8 Q ss_pred CCCCCHHHHHH--HHHHHHHCC---CHHHHHHHHHHHHHHCCCHHHHHHHHHH-------CCCCHHHHHHHH----HHHH Q ss_conf 13352799999--999999739---9478999999999832898999999985-------078958999999----9999 Q gi|254781033|r 255 LCDNSIMASIC--AAKSLISQN---KKRKAEVILEKIWKVNPHPEIANIYTHL-------LSENTVGKLKRA----LRLE 318 (492) Q Consensus 255 ~~p~~~~a~~~--~a~~l~~~g---~~~~A~~~le~a~~~~p~p~La~a~~~~-------~~~~~~~al~~~----~~l~ 318 (492) .-|....+... -|..|...+ ...+|..+|+++.+..|+..++.+...+ ..+...+++..+ ++.. T Consensus 324 ~lP~~~~aL~~~~~A~~~l~~~d~~~l~~~idLl~~~~~~ePdn~Yv~Ae~yi~~~~~~~l~~~~~~~l~~Ln~~l~~~~ 403 (512) T PRK10153 324 GLPHQGAALMLFYQAHHYLNSGDAKSLNKGSDLLEEILKSEPDFTYALAEKALADHVRQSLQPLDEKQLAALNTELDNIV 403 (512) T ss_pred HCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH T ss_conf 48998899999998515663034788999999999999729983899999999999998549888999999999999987 Q ss_pred H--HCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCC--E Q ss_conf 8--26898368999999999879988999998878660999899999999999857998999999999962889995--2 Q gi|254781033|r 319 E--INKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPL--W 394 (492) Q Consensus 319 ~--~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~--W 394 (492) . ..+..+..+-++|-.++-.|+.++|.+++.++...+-+.=.|.|++.++|.. |+...|-+....|..-.+.+. - T Consensus 404 ~~~~~~~~~riyeAlAl~al~~~~~~~A~~~L~~~~~~~~Svl~YILlgK~aEl~-G~~~~A~e~Y~~Af~~~~s~~tl~ 482 (512) T PRK10153 404 ALPGVNVLSRIYEALAVQALVKGDTDEAYQALNKAIELEMSWLNYVLLGKVAELK-GDNRLAADAYSTAFNLRPGENTYY 482 (512) T ss_pred HCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHCCCCHHHHH T ss_conf 3324688728899999999847999999999986756455699999999999964-996688999999980388389999 Q ss_pred ECCCCCCCH Q ss_conf 756989554 Q gi|254781033|r 395 ISDDGYLSS 403 (492) Q Consensus 395 ~c~~g~~~~ 403 (492) .|+++.-+. T Consensus 483 l~enLvF~S 491 (512) T PRK10153 483 LIENLVFQS 491 (512) T ss_pred HHHHCCCCC T ss_conf 986253036 No 152 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=96.22 E-value=0.12 Score=31.01 Aligned_cols=145 Identities=11% Similarity=0.031 Sum_probs=109.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCH Q ss_conf 99999999999999999999972100478799998630134666158999998753222210001689999997268850 Q gi|254781033|r 74 HMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATR 153 (492) Q Consensus 74 ~~~~~rr~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~ 153 (492) .|+...+-.+.-+++..+|--...|-.+.|...+....+..+...|.|..+-.+-.-.+.|+...|..+|.....+.+.. T Consensus 48 ~yw~~s~as~sgd~flaAL~lA~~~k~d~Alaaf~~lektg~g~YpvLA~mr~at~~a~kgdta~AV~aFdeia~dt~~P 127 (221) T COG4649 48 TYWQTSRASKSGDAFLAALKLAQENKTDDALAAFTDLEKTGYGSYPVLARMRAATLLAQKGDTAAAVAAFDEIAADTSIP 127 (221) T ss_pred EHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCC T ss_conf 01113342432589999999987079668999999998648885569999999999861264788999899885257876 Q ss_pred HH----HHHHHHHHHHCCCCHHHHHHHHHHHH-HHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCC Q ss_conf 57----99986545521588699999999888-4212698999999999995257456999999999502 Q gi|254781033|r 154 EF----AVYSLYFESCRIGDLNSAQRYATKAL-DISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNA 218 (492) Q Consensus 154 ~~----~~~gL~~~al~~gd~~~A~~~~~~a~-~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~ 218 (492) .. +...-+......|-++.-....+.+. ..+|-...+...|+....+.||+.+|...+.+..... T Consensus 128 ~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d~n~mR~sArEALglAa~kagd~a~A~~~F~qia~Da 197 (221) T COG4649 128 QIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGDGNPMRHSAREALGLAAYKAGDFAKAKSWFVQIANDA 197 (221) T ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCC T ss_conf 0136799998789874356488899874130489970178899987688873224677999999997014 No 153 >PRK12798 chemotaxis protein; Reviewed Probab=96.22 E-value=0.12 Score=31.01 Aligned_cols=159 Identities=14% Similarity=0.042 Sum_probs=75.8 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH--HHHHHHHHH-HH Q ss_conf 999995257456999999999502310357899999999999966998999999987543133527--999999999-99 Q gi|254781033|r 195 VQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSI--MASICAAKS-LI 271 (492) Q Consensus 195 ~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~--~a~~~~a~~-l~ 271 (492) +-+....|+-.++.+.|....... .....-..+.+..+. .....++..|...+..+.=+.|+.. ++++...-. .. T Consensus 119 g~laY~~Gr~~ea~~~L~~i~~~~-lp~~lg~~laLv~a~-l~~~~dP~~Al~~lD~aRL~aPGTLvEEAALRR~i~la~ 196 (416) T PRK12798 119 GALAYLSGRGREARKLLSGVDPET-LPAELGAYLALVQGN-LMAATDPATALDRLDQARLLAPGTLVEEAALRRSLFIAA 196 (416) T ss_pred HHHHHHCCCHHHHHHHHCCCCHHH-CCCHHHHHHHHHHHH-HHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 999997698799998851378010-472245779999888-751368899999988999818815899999987769998 Q ss_pred HCCCHHHHHHHHHHHHHHCCCHHHHHHH----HHHC-CCCHHHHHHHHHHHHHH-CC-CCHHHHHHHHHHHHHCCCHHHH Q ss_conf 7399478999999999832898999999----9850-78958999999999982-68-9836899999999987998899 Q gi|254781033|r 272 SQNKKRKAEVILEKIWKVNPHPEIANIY----THLL-SENTVGKLKRALRLEEI-NK-ESVESLVIVSKIALEMGSIDQA 344 (492) Q Consensus 272 ~~g~~~~A~~~le~a~~~~p~p~La~a~----~~~~-~~~~~~al~~~~~l~~~-~P-~~~e~~~~LA~~~~~~g~~~~A 344 (492) ..|+..++..+..+......|.-++..| .... ..+......++...+.. .| ...+.++.+|+.++-.|+-+.| T Consensus 197 ~~g~~~r~~~~a~~Y~rRf~~S~Ya~~F~~~F~~~vv~~~~~~~~~~l~~ils~~d~~~qr~lYL~IAR~A~i~Gk~~~a 276 (416) T PRK12798 197 QLGDADKFEALSRNYLRRFRHSPYAGNFAQRFVLAVVRLSDRIRDERLVEILSEMDPESQRELYLRIARAALIDGKTELA 276 (416) T ss_pred HCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCHHHHH T ss_conf 63767899999999999850680067899999999987403442879999998449687899999999999870428999 Q ss_pred HHHHHHHHHCC Q ss_conf 99988786609 Q gi|254781033|r 345 HAKAMLAMKIA 355 (492) Q Consensus 345 r~~l~~al~~~ 355 (492) +-.-+++..+. T Consensus 277 ~~As~rA~~La 287 (416) T PRK12798 277 RFAAERALKLA 287 (416) T ss_pred HHHHHHHHHHC T ss_conf 99999999745 No 154 >PRK11823 DNA repair protein RadA; Provisional Probab=96.18 E-value=0.0031 Score=43.22 Aligned_cols=32 Identities=16% Similarity=0.201 Sum_probs=27.8 Q ss_pred CCCCEECC-CCCCCHHHCCCCCCCCCCCCCCCC Q ss_conf 99952756-989554242248376873640036 Q gi|254781033|r 390 PDPLWISD-DGYLSSVWLPLSPISKTLCYFEWK 421 (492) Q Consensus 390 ~dp~W~c~-~g~~~~~W~~~c~~~~~~~~~~W~ 421 (492) ....|+|. ||+.+..|.--||+||++|||+=. T Consensus 4 ~k~~y~C~~CG~~~~kW~GkCp~C~~WnTl~E~ 36 (454) T PRK11823 4 AKTAYVCQECGAESPKWLGRCPACGAWNTLVEA 36 (454) T ss_pred CCCEEECCCCCCCCCCCCEECCCCCCCCCEEEE T ss_conf 887179988999788714358798986458987 No 155 >KOG4648 consensus Probab=96.16 E-value=0.12 Score=30.82 Aligned_cols=102 Identities=16% Similarity=0.137 Sum_probs=50.7 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 55215886999999998884212698999999999995257456999999999502310357899999999999966998 Q gi|254781033|r 163 ESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDM 242 (492) Q Consensus 163 ~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~ 242 (492) .+..+|.+++|+..+.+.....|.++-.....+..|.+...+..|..--..++.....+-. .+..++......|.. T Consensus 106 ~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~K----AYSRR~~AR~~Lg~~ 181 (536) T KOG4648 106 TYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVK----AYSRRMQARESLGNN 181 (536) T ss_pred HHHHCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHH T ss_conf 6764254135666653012457888642005889999888777543028898763288999----987777899987757 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999998754313352799999999 Q gi|254781033|r 243 IASYHDAIESLKLCDNSIMASICAAK 268 (492) Q Consensus 243 ~~A~~~~~~a~~~~p~~~~a~~~~a~ 268 (492) .+|.+.+..++++.|......-.+++ T Consensus 182 ~EAKkD~E~vL~LEP~~~ELkK~~a~ 207 (536) T KOG4648 182 MEAKKDCETVLALEPKNIELKKSLAR 207 (536) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 87777788998608561789999998 No 156 >COG4455 ImpE Protein of avirulence locus involved in temperature-dependent protein secretion [General function prediction only] Probab=96.10 E-value=0.093 Score=31.73 Aligned_cols=57 Identities=19% Similarity=0.188 Sum_probs=41.1 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC Q ss_conf 545521588699999999888421269899999999999525745699999999950 Q gi|254781033|r 161 YFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKN 217 (492) Q Consensus 161 ~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~ 217 (492) ....++.+....++..++.-.+.+|.+......|+++++-.|+|++|+.-++-.... T Consensus 8 ~seLL~~~sL~dai~~a~~qVkakPtda~~RhflfqLlcvaGdw~kAl~Ql~l~a~l 64 (273) T COG4455 8 ISELLDDNSLQDAIGLARDQVKAKPTDAGGRHFLFQLLCVAGDWEKALAQLNLAATL 64 (273) T ss_pred HHHHHHHCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHCCHHHHHHHHHHHHHHC T ss_conf 999987313999999999998649866630567787886466499999998888652 No 157 >pfam02259 FAT FAT domain. The FAT domain is named after FRAP, ATM and TRRAP. Probab=96.03 E-value=0.14 Score=30.35 Aligned_cols=65 Identities=14% Similarity=0.034 Sum_probs=45.5 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC--CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 8999999999999669989999999875431335--2799999999999739947899999999983 Q gi|254781033|r 225 RAILLIARSLENADKGDMIASYHDAIESLKLCDN--SIMASICAAKSLISQNKKRKAEVILEKIWKV 289 (492) Q Consensus 225 ~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~--~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~ 289 (492) ....++..+......|....|.....++....+. .++..+.+++++-..|+..+|...|+..... T Consensus 149 ~~~~wlk~a~laRK~g~~~~A~~~l~~l~~~~~~~~~p~v~~e~aKllW~~g~~~~Ai~~L~~~~~~ 215 (351) T pfam02259 149 LAEMWLKFANLARKSGRFSLAEKALLKLLTYDTREDLPNVVIAYAKYLWAKGEQQEAFQKLREFTSC 215 (351) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 8999999999998888998999999998625887899546999999999748899999999999998 No 158 >KOG0551 consensus Probab=96.00 E-value=0.14 Score=30.28 Aligned_cols=99 Identities=19% Similarity=0.147 Sum_probs=87.9 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999999995257456999999999502310357899999999999966998999999987543133527999999999 Q gi|254781033|r 190 VTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKS 269 (492) Q Consensus 190 a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~ 269 (492) -...-+.-|...+++..|...+...++..-..+...++++..++...+.-++.-.++....++++..|+..-+.+.-|.. T Consensus 83 n~KeeGN~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka~~R~Akc 162 (390) T KOG0551 83 NYKEEGNEYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKAYIRGAKC 162 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 88887689877756768999998887641899237899884299999999989999999999861487304434356689 Q ss_pred HHHCCCHHHHHHHHHHHHH Q ss_conf 9973994789999999998 Q gi|254781033|r 270 LISQNKKRKAEVILEKIWK 288 (492) Q Consensus 270 l~~~g~~~~A~~~le~a~~ 288 (492) +.+......|....+.... T Consensus 163 ~~eLe~~~~a~nw~ee~~~ 181 (390) T KOG0551 163 LLELERFAEAVNWCEEGLQ 181 (390) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 9999888887777765311 No 159 >cd01121 Sms Sms (bacterial radA) DNA repair protein. This protein is not related to archael radA any more than is to other RecA-like NTPases. Sms has a role in recombination and recombinational repair and is responsible for the stabilization or processing of branched DNA molecules. Probab=95.99 E-value=0.0046 Score=41.91 Aligned_cols=29 Identities=24% Similarity=0.423 Sum_probs=25.6 Q ss_pred EECC-CCCCCHHHCCCCCCCCCCCCCCCCC Q ss_conf 2756-9895542422483768736400367 Q gi|254781033|r 394 WISD-DGYLSSVWLPLSPISKTLCYFEWKI 422 (492) Q Consensus 394 W~c~-~g~~~~~W~~~c~~~~~~~~~~W~~ 422 (492) |+|. ||+.+..|.--||+||++|||+=.. T Consensus 1 y~C~~CG~~~~kW~GkCp~Cg~WnTl~E~~ 30 (372) T cd01121 1 YVCSECGYVSPKWLGKCPECGEWNTLVEEI 30 (372) T ss_pred CCCCCCCCCCCCCCEECCCCCCCCCEEEEE T ss_conf 999889997887242697999864389996 No 160 >KOG2047 consensus Probab=95.80 E-value=0.17 Score=29.67 Aligned_cols=221 Identities=14% Similarity=0.091 Sum_probs=119.5 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHH-HHHHHHHHHHHHHHCCCHHHH Q ss_conf 5886999999998884212698999999999995257456999999999502310357-899999999999966998999 Q gi|254781033|r 167 IGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRN-RAILLIARSLENADKGDMIAS 245 (492) Q Consensus 167 ~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~-~a~l~~~~a~~~~~~~~~~~A 245 (492) -|-++.-...+++.+.+.--.|......+.+.....-++++.+.+++-+...+-...- --..++..............+ T Consensus 490 ~gtfestk~vYdriidLriaTPqii~NyAmfLEeh~yfeesFk~YErgI~LFk~p~v~diW~tYLtkfi~rygg~klEra 569 (835) T KOG2047 490 LGTFESTKAVYDRIIDLRIATPQIIINYAMFLEEHKYFEESFKAYERGISLFKWPNVYDIWNTYLTKFIKRYGGTKLERA 569 (835) T ss_pred HCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 25279999999988887618799999999998755899999999973775678832899999999999999668877889 Q ss_pred HHHHHHHHHCCCCCH--HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH---HHHHHHHH---CCCCHHHHHHHHHHH Q ss_conf 999987543133527--9999999999973994789999999998328989---99999985---078958999999999 Q gi|254781033|r 246 YHDAIESLKLCDNSI--MASICAAKSLISQNKKRKAEVILEKIWKVNPHPE---IANIYTHL---LSENTVGKLKRALRL 317 (492) Q Consensus 246 ~~~~~~a~~~~p~~~--~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~---La~a~~~~---~~~~~~~al~~~~~l 317 (492) +..+.++++..|..- ...++||++..+-|..+.|+++++++-...+..+ +...|..- .-|-+ .-...|++. T Consensus 570 RdLFEqaL~~Cpp~~aKtiyLlYA~lEEe~GLar~amsiyerat~~v~~a~~l~myni~I~kaae~yGv~-~TR~iYeka 648 (835) T KOG2047 570 RDLFEQALDGCPPEHAKTIYLLYAKLEEEHGLARHAMSIYERATSAVKEAQRLDMYNIYIKKAAEIYGVP-RTREIYEKA 648 (835) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCC-CCHHHHHHH T ss_conf 9999999835998877899999999999850799899999998732798999999999999999982873-207899999 Q ss_pred HHHCCCCHHHHHH--HHHHHHHCCCHHHHHHHHHHHHHC-CCCH--HHHHHHHHHHHHHCCCHHHHHHHHH--HHHCCC Q ss_conf 9826898368999--999999879988999998878660-9998--9999999999985799899999999--996288 Q gi|254781033|r 318 EEINKESVESLVI--VSKIALEMGSIDQAHAKAMLAMKI-APRK--EIFLLLAQIEQANSHNTDKILYWTQ--SALHAM 389 (492) Q Consensus 318 ~~~~P~~~e~~~~--LA~~~~~~g~~~~Ar~~l~~al~~-~P~~--~~~~llA~ie~~e~~d~~~ar~wl~--rA~~a~ 389 (492) +..-|+...-.+. -|.+-...|..+.||.-+.-..+. +|+. ..+.-.-+. +..+||+.-.++-|. |.+.|. T Consensus 649 Ie~Lp~~~~r~mclrFAdlEtklGEidRARaIya~~sq~~dPr~~~~fW~twk~F-EvrHGnedT~keMLRikRsvqa~ 726 (835) T KOG2047 649 IESLPDSKAREMCLRFADLETKLGEIDRARAIYAHGSQICDPRVTTEFWDTWKEF-EVRHGNEDTYKEMLRIKRSVQAT 726 (835) T ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHH-HHHCCCHHHHHHHHHHHHHHHHH T ss_conf 9867827789999989987642424899999999650227986775889888888-77528788999999999998876 No 161 >pfam06552 TOM20_plant Plant specific mitochondrial import receptor subunit TOM20. This family consists of several plant specific mitochondrial import receptor subunit TOM20 (translocase of outer membrane 20 kDa subunit) proteins. Most mitochondrial proteins are encoded by the nuclear genome, and are synthesized in the cytosol. TOM20 is a general import receptor that binds to mitochondrial pre-sequences in the early step of protein import into the mitochondria. Probab=95.66 E-value=0.18 Score=29.48 Aligned_cols=62 Identities=11% Similarity=0.122 Sum_probs=48.1 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHH-----------CCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 899999999998268983689999999998-----------79988999998878660999899999999999 Q gi|254781033|r 308 VGKLKRALRLEEINKESVESLVIVSKIALE-----------MGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQ 369 (492) Q Consensus 308 ~~al~~~~~l~~~~P~~~e~~~~LA~~~~~-----------~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~ 369 (492) .+++..++..+..+|...+++-.||.++.. .+.|++|..++++++..+|++..|.--.++.. T Consensus 52 ~dAisKlEeAL~InP~khdalWclGNA~TS~~Fl~pD~~~A~~~F~kA~~~Fq~AvdeeP~NelYrKsLem~~ 124 (186) T pfam06552 52 QDAISKLEEALVINPKKHDALWCIGNAYTSHAFLTPDQTEAKDNFDKATQFFQQAVDEQPDNDLYRKSLEMAA 124 (186) T ss_pred HHHHHHHHHHHCCCCCCCCHHHEECCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 9999999998713988753333005411015131787899998899999999998634986189999999875 No 162 >pfam09613 HrpB1_HrpK Bacterial type III secretion protein (HrpB1_HrpK). This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia. Probab=95.45 E-value=0.22 Score=28.75 Aligned_cols=99 Identities=15% Similarity=0.069 Sum_probs=64.8 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 79998654552158869999999988842126989999999999952574569999999995023103578999999999 Q gi|254781033|r 155 FAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSL 234 (492) Q Consensus 155 ~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~ 234 (492) -|+..+...++..+|++.+...++-+.-+.|+.+.+-..-+-++...|+|.+|+.++...-...+.....++.+ +. T Consensus 11 ~gLie~~s~al~~~~~~D~e~lL~AlRvLRP~~~e~d~~dg~l~i~rg~w~dA~r~lr~v~~~~~~~~~~kAL~----A~ 86 (156) T pfam09613 11 GGLIEVVSVALRLADIADAEELLEALRILRPRLAELDLFDGWLYIGRGQWDDAARLLRELEGRASHLPLCKALM----AQ 86 (156) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHH----HH T ss_conf 99999999998048877899999999985898326658787787760658999999999862677848999999----99 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 999669989999999875431335 Q gi|254781033|r 235 ENADKGDMIASYHDAIESLKLCDN 258 (492) Q Consensus 235 ~~~~~~~~~~A~~~~~~a~~~~p~ 258 (492) .+...+|+ .=+..+.+++...++ T Consensus 87 CL~a~~D~-~Wr~~A~evLe~~~~ 109 (156) T pfam09613 87 CLFALQDP-AWREYADEVLEEGGD 109 (156) T ss_pred HHHHCCCH-HHHHHHHHHHHHCCC T ss_conf 99874796-799999999861899 No 163 >PRK10941 putative transcriptional regulator; Provisional Probab=95.43 E-value=0.23 Score=28.69 Aligned_cols=71 Identities=15% Similarity=0.089 Sum_probs=59.9 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH-HHHHHH Q ss_conf 98507895899999999998268983689999999998799889999988786609998999999-999998 Q gi|254781033|r 300 THLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLL-AQIEQA 370 (492) Q Consensus 300 ~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ll-A~ie~~ 370 (492) +++..++...+++..+.++...|+++.-..-.|-++.+.|.+..|...++..++..|+.....++ ..|..- T Consensus 190 ~~~~e~~~~~AL~~~e~lL~l~Pddp~EiRDRGli~~qL~c~~~Al~DLe~fle~~P~dp~ae~ik~qi~~l 261 (269) T PRK10941 190 ALMEEKQMELALRASEALLQFNPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRAQIHSI 261 (269) T ss_pred HHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 998511499999999999862969979999999999984994899999999999785972899999999998 No 164 >pfam10300 IML2 Putative mitochondrial outer membrane protein. This is a family of proteins conserved from fungi to humans. Members of this family also carry a TPR_2 domain pfam07719 at their C-terminus. Many of them are annotated as being mitochondrial outer membrane protein IML2. Probab=95.43 E-value=0.23 Score=28.68 Aligned_cols=146 Identities=15% Similarity=0.013 Sum_probs=78.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHH---------HHHHCCCHHHHHHHHHHHHHHCCCHHHHHHH---HHHCCCC Q ss_conf 699899999998754313352799999999---------9997399478999999999832898999999---9850789 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSIMASICAAK---------SLISQNKKRKAEVILEKIWKVNPHPEIANIY---THLLSEN 306 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~---------~l~~~g~~~~A~~~le~a~~~~p~p~La~a~---~~~~~~~ 306 (492) .||.+..++...++.+..--..|...+.-- .-...++..++.++++...+.-|+..+...+ .+...++ T Consensus 186 ~GdRe~GL~~L~~a~~~~~lr~~la~L~LL~Yy~~~~~~~~~~~~~~~~a~~lL~~~~~~yP~s~l~l~~~gR~~~~~g~ 265 (446) T pfam10300 186 SGDREEGLRLLWEASSSPNIRAALALLTLLFYYTGIRQFLGIGEGPLEEAEALLEPSRKRFPNSALWLFFEARIESLKGN 265 (446) T ss_pred CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCCEECCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCC T ss_conf 78599999999988247984179999999999605100046999889999999999885389946899999999997379 Q ss_pred HHHHHHHHHHHHHHCCCCHHH----HHHHHHHHHHCCCHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHCCCH------ Q ss_conf 589999999999826898368----9999999998799889999988786609-99899999999999857998------ Q gi|254781033|r 307 TVGKLKRALRLEEINKESVES----LVIVSKIALEMGSIDQAHAKAMLAMKIA-PRKEIFLLLAQIEQANSHNT------ 375 (492) Q Consensus 307 ~~~al~~~~~l~~~~P~~~e~----~~~LA~~~~~~g~~~~Ar~~l~~al~~~-P~~~~~~llA~ie~~e~~d~------ 375 (492) ...++..++........-.+. ..-++..++-..+|.+|..++..+.+.+ .+...|..++++-..+.+++ T Consensus 266 l~~Ai~~~~~~~~~~~~~kQ~~~lc~~El~w~~~~~~~w~~A~~~~~~L~~~s~WSka~Y~Y~~a~c~~~~~~~~~~~~~ 345 (446) T pfam10300 266 LDEALELFEECIESQSEWKQIHHLCYWELAWCHVFLHRWKQAANYFLLLSDESKWSKALYAYLAAACLLMLGREEEKAEG 345 (446) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 99999999986744654499999999999999999978999999999999845418999999999999865330105679 Q ss_pred HHHHHHHHH Q ss_conf 999999999 Q gi|254781033|r 376 DKILYWTQS 384 (492) Q Consensus 376 ~~ar~wl~r 384 (492) ..+..++.+ T Consensus 346 ~~a~~l~~~ 354 (446) T pfam10300 346 EKAVELFRE 354 (446) T ss_pred HHHHHHHHH T ss_conf 899999997 No 165 >KOG4234 consensus Probab=95.29 E-value=0.25 Score=28.37 Aligned_cols=99 Identities=14% Similarity=0.085 Sum_probs=58.0 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 9999999525745699999999950231-035789999999999996699899999998754313352799999999999 Q gi|254781033|r 193 AVVQQYVLAKEWSRAITFLNQKKKNAKE-WNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLI 271 (492) Q Consensus 193 ~L~~l~~~~gdw~~A~~~l~~~~~~~~~-~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~ 271 (492) .-+.-+...|++.+|...+..++...+. ....+++++..++...+..+....+.....+++.+.|+...+....|.+|. T Consensus 100 ~EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~pty~kAl~RRAeaye 179 (271) T KOG4234 100 KEGNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELNPTYEKALERRAEAYE 179 (271) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 87777651340778999999999868663088899997205889987566898999987657508406799999999988 Q ss_pred HCCCHHHHHHHHHHHHHHCC Q ss_conf 73994789999999998328 Q gi|254781033|r 272 SQNKKRKAEVILEKIWKVNP 291 (492) Q Consensus 272 ~~g~~~~A~~~le~a~~~~p 291 (492) +..++..|..-+++.....| T Consensus 180 k~ek~eealeDyKki~E~dP 199 (271) T KOG4234 180 KMEKYEEALEDYKKILESDP 199 (271) T ss_pred HHHHHHHHHHHHHHHHHHCC T ss_conf 55517888999999987182 No 166 >COG1592 Rubrerythrin [Energy production and conversion] Probab=95.23 E-value=0.26 Score=28.23 Aligned_cols=22 Identities=23% Similarity=0.507 Sum_probs=18.1 Q ss_pred CEECC-CCCCCHHHCC--CCCCCCCC Q ss_conf 52756-9895542422--48376873 Q gi|254781033|r 393 LWISD-DGYLSSVWLP--LSPISKTL 415 (492) Q Consensus 393 ~W~c~-~g~~~~~W~~--~c~~~~~~ 415 (492) .|+|. |||++.. .| +||.||.. T Consensus 134 ~~vC~vCGy~~~g-e~P~~CPiCga~ 158 (166) T COG1592 134 VWVCPVCGYTHEG-EAPEVCPICGAP 158 (166) T ss_pred EEECCCCCCCCCC-CCCCCCCCCCCH T ss_conf 7876878881268-998769999981 No 167 >TIGR00416 sms DNA repair protein RadA; InterPro: IPR004504 RadA/Sms is a highly conserved eubacterial protein that shares sequence similarity with both RecA strand transferase and lon protease. The RadA/Sms family are probable ATP-dependent proteases involved in both DNA repair and degradation of proteins, peptides, glycopeptides. They are classified in as non-peptidase homologues and unassigned peptidases in MEROPS peptidase family S16 (lon protease family, clan SJ). RadA/Sms is involved in recombination and recombinational repair, most likely involving the stabilisation or processing of branched DNA molecules or blocked replication forks because of its genetic redundancy with RecG and RuvABC .; GO: 0003684 damaged DNA binding, 0005524 ATP binding, 0006281 DNA repair. Probab=95.10 E-value=0.011 Score=39.10 Aligned_cols=30 Identities=20% Similarity=0.375 Sum_probs=26.7 Q ss_pred CCCCEECC-CCCCCHHHCCCCCCCCCCCCCC Q ss_conf 99952756-9895542422483768736400 Q gi|254781033|r 390 PDPLWISD-DGYLSSVWLPLSPISKTLCYFE 419 (492) Q Consensus 390 ~dp~W~c~-~g~~~~~W~~~c~~~~~~~~~~ 419 (492) |-..++|. ||+.|+.|.--||.|+.++||. T Consensus 5 Pk~~FvC~~CG~~~p~W~G~C~~C~aw~t~~ 35 (481) T TIGR00416 5 PKSKFVCQSCGAESPKWLGKCPACHAWNTLT 35 (481) T ss_pred CCCCCCCCCCCCCCHHHCCCCCCCCCCCHHH T ss_conf 8774202235774504647388866531377 No 168 >KOG3617 consensus Probab=94.95 E-value=0.31 Score=27.66 Aligned_cols=165 Identities=15% Similarity=0.039 Sum_probs=86.3 Q ss_pred HHHHHHHHHHHCCHHHHHHHHH----------HHHCCCCHH---------HHH-HHHHHHHHCCCCHHHHHHHHHHHH-- Q ss_conf 9987532222100016899999----------972688505---------799-986545521588699999999888-- Q gi|254781033|r 124 LLEVQIALAERQYNIAHEKLEM----------MLQIPATRE---------FAV-YSLYFESCRIGDLNSAQRYATKAL-- 181 (492) Q Consensus 124 Ll~A~aA~~~gd~~~A~~~~~~----------~~~~~~~~~---------~~~-~gL~~~al~~gd~~~A~~~~~~a~-- 181 (492) .-.|+-....+|-..|.+||++ |+...|... ..+ +.........|+.+.|+.++..+- T Consensus 862 y~yA~~Lear~Di~~AleyyEK~~~hafev~rmL~e~p~~~e~Yv~~~~d~~L~~WWgqYlES~GemdaAl~~Y~~A~D~ 941 (1416) T KOG3617 862 YNYAKYLEARRDIEAALEYYEKAGVHAFEVFRMLKEYPKQIEQYVRRKRDESLYSWWGQYLESVGEMDAALSFYSSAKDY 941 (1416) T ss_pred HHHHHHHHHHCCHHHHHHHHHHCCCHHHHHHHHHHHCHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 77878888621288999888751873878899988583889999985065289999999975243157889999876555 Q ss_pred -------------------HHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC------CCC--HHHHHHHHHH---- Q ss_conf -------------------421269899999999999525745699999999950------231--0357899999---- Q gi|254781033|r 182 -------------------DISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKN------AKE--WNRNRAILLI---- 230 (492) Q Consensus 182 -------------------~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~------~~~--~~~~~a~l~~---- 230 (492) ....++..+-..|++.|...|+..+|...+.++..- ..+ .+.+-+.+.+ T Consensus 942 fs~VrI~C~qGk~~kAa~iA~esgd~AAcYhlaR~YEn~g~v~~Av~FfTrAqafsnAIRlcKEnd~~d~L~nlal~s~~ 1021 (1416) T KOG3617 942 FSMVRIKCIQGKTDKAARIAEESGDKAACYHLARMYENDGDVVKAVKFFTRAQAFSNAIRLCKENDMKDRLANLALMSGG 1021 (1416) T ss_pred HHHEEEEEECCCCHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCC T ss_conf 54236676316735889998861417889999998622477888999989988888888887754878899998864285 Q ss_pred ----H--------------HHHHHHHCCCHHHHHHHHH----------HHHHCCCCCHHHH-HHHHHHHHHCCCHHHHHH Q ss_conf ----9--------------9999996699899999998----------7543133527999-999999997399478999 Q gi|254781033|r 231 ----A--------------RSLENADKGDMIASYHDAI----------ESLKLCDNSIMAS-ICAAKSLISQNKKRKAEV 281 (492) Q Consensus 231 ----~--------------~a~~~~~~~~~~~A~~~~~----------~a~~~~p~~~~a~-~~~a~~l~~~g~~~~A~~ 281 (492) . ..+.+...|....|++.+- -+-.++|+.-|.. ...+.++....++.+|.. T Consensus 1022 ~d~v~aArYyEe~g~~~~~AVmLYHkAGm~~kALelAF~tqQf~aL~lIa~DLd~~sDp~ll~RcadFF~~~~qyekAV~ 1101 (1416) T KOG3617 1022 SDLVSAARYYEELGGYAHKAVMLYHKAGMIGKALELAFRTQQFSALDLIAKDLDAGSDPKLLRRCADFFENNQQYEKAVN 1101 (1416) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 26899999999745026678778776120678899887643618999988741899998999999999875789999999 Q ss_pred HHHHHHH Q ss_conf 9999998 Q gi|254781033|r 282 ILEKIWK 288 (492) Q Consensus 282 ~le~a~~ 288 (492) +|-.+-+ T Consensus 1102 lL~~ar~ 1108 (1416) T KOG3617 1102 LLCLARE 1108 (1416) T ss_pred HHHHHHH T ss_conf 9999999 No 169 >KOG0545 consensus Probab=94.84 E-value=0.33 Score=27.46 Aligned_cols=55 Identities=13% Similarity=0.100 Sum_probs=29.9 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH Q ss_conf 7895899999999998268983689999999998799889999988786609998 Q gi|254781033|r 304 SENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK 358 (492) Q Consensus 304 ~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~ 358 (492) .++..+.+......++.+|+|.-+++-.|++....=+-++|++-+.++++++|+. T Consensus 243 ~~e~yevleh~seiL~~~~~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpsl 297 (329) T KOG0545 243 KEEYYEVLEHCSEILRHHPGNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSL 297 (329) T ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHH T ss_conf 8789999998899985088508999998778775368888888999998619566 No 170 >PRK10941 putative transcriptional regulator; Provisional Probab=94.58 E-value=0.35 Score=27.23 Aligned_cols=68 Identities=18% Similarity=0.067 Sum_probs=34.2 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHCCC-HHHH--HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99999739947899999999983289-8999--99998507895899999999998268983689999999 Q gi|254781033|r 267 AKSLISQNKKRKAEVILEKIWKVNPH-PEIA--NIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKI 334 (492) Q Consensus 267 a~~l~~~g~~~~A~~~le~a~~~~p~-p~La--~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~ 334 (492) =..|++..++.+|.+..+..+...|+ |.-. ..+.+...+....++..++..+.+.|+++.+-+....+ T Consensus 188 K~~~~~e~~~~~AL~~~e~lL~l~Pddp~EiRDRGli~~qL~c~~~Al~DLe~fle~~P~dp~ae~ik~qi 258 (269) T PRK10941 188 KAALMEEKQMELALRASEALLQFNPEDPYEIRDRGLIYAQLDCEHVALSDLSYFVEQCPEDPISEMIRAQI 258 (269) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 99998511499999999999862969979999999999984994899999999999785972899999999 No 171 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=94.56 E-value=0.38 Score=26.97 Aligned_cols=134 Identities=11% Similarity=0.031 Sum_probs=86.2 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHH---HHHHHHHH Q ss_conf 999999999721004787999986301346661589999987532222100016899999972688505---79998654 Q gi|254781033|r 86 KALYTGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATRE---FAVYSLYF 162 (492) Q Consensus 86 ~al~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~---~~~~gL~~ 162 (492) ....+.+-++..+......-...-..+...+..-.+..|-.|......|+.+.|...++..+..+.... +.-..|++ T Consensus 55 ~~Y~~~i~~~~ak~~~~~~~~ekf~~~n~~t~Ya~laaL~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRLAr 134 (207) T COG2976 55 AQYQNAIKAVQAKKPKSIAAAEKFVQANGKTIYAVLAALELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRLAR 134 (207) T ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 99999999985577025999999986156358999999999999986342999999999987145257899999999999 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC Q ss_conf 5521588699999999888421269899999999999525745699999999950231 Q gi|254781033|r 163 ESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKE 220 (492) Q Consensus 163 ~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~ 220 (492) ..++.|.++.|+..+..--...= ..-.....++++...||-++|+..+++++...++ T Consensus 135 vq~q~~k~D~AL~~L~t~~~~~w-~~~~~elrGDill~kg~k~~Ar~ay~kAl~~~~s 191 (207) T COG2976 135 VQLQQKKADAALKTLDTIKEESW-AAIVAELRGDILLAKGDKQEARAAYEKALESDAS 191 (207) T ss_pred HHHHHHHHHHHHHHHHCCCCCCH-HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC T ss_conf 99986038889998723011107-8899997426999728659999999999871577 No 172 >KOG4648 consensus Probab=94.44 E-value=0.4 Score=26.78 Aligned_cols=109 Identities=13% Similarity=0.096 Sum_probs=82.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 99999999952574569999999995023103578999999999999669989999999875431335279999999999 Q gi|254781033|r 191 TEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSL 270 (492) Q Consensus 191 ~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l 270 (492) +...+.-|..+|.+++|++.+.+.....+-. .++...++..++.......|......|+.++...+-+.-..+... T Consensus 100 iKE~GN~yFKQgKy~EAIDCYs~~ia~~P~N----pV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~~Y~KAYSRR~~AR 175 (536) T KOG4648 100 IKERGNTYFKQGKYEEAIDCYSTAIAVYPHN----PVYHINRALAYLKQKSFAQAEEDCEAAIALDKLYVKAYSRRMQAR 175 (536) T ss_pred HHHHHHHHHHCCCHHHHHHHHHCCCCCCCCC----CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8876336764254135666653012457888----642005889999888777543028898763288999987777899 Q ss_pred HHCCCHHHHHHHHHHHHHHCCC-HHHHHHHHHHC Q ss_conf 9739947899999999983289-89999999850 Q gi|254781033|r 271 ISQNKKRKAEVILEKIWKVNPH-PEIANIYTHLL 303 (492) Q Consensus 271 ~~~g~~~~A~~~le~a~~~~p~-p~La~a~~~~~ 303 (492) ...|+..+|.+-.|.++...|. .+|-..|+.+. T Consensus 176 ~~Lg~~~EAKkD~E~vL~LEP~~~ELkK~~a~i~ 209 (536) T KOG4648 176 ESLGNNMEAKKDCETVLALEPKNIELKKSLARIN 209 (536) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHC T ss_conf 9877578777778899860856178999999861 No 173 >KOG1308 consensus Probab=94.39 E-value=0.094 Score=31.69 Aligned_cols=70 Identities=14% Similarity=0.073 Sum_probs=48.4 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC Q ss_conf 0789589999999999826898368999999999879988999998878660999899999999999857 Q gi|254781033|r 303 LSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANS 372 (492) Q Consensus 303 ~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~ 372 (492) ....+..+++.+...+..+|+....+...+++....|+|.+|+..|..+++++-+.++-..+-++....+ T Consensus 160 kl~kp~aairD~d~A~ein~Dsa~~ykfrg~A~rllg~~e~aa~dl~~a~kld~dE~~~a~lKeV~p~a~ 229 (377) T KOG1308 160 KLKKPNAAIRDCDFAIEINPDSAKGYKFRGYAERLLGNWEEAAHDLALACKLDYDEANSATLKEVFPNAG 229 (377) T ss_pred ECCCCCHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCHH T ss_conf 2368734654024554168310235530348998720228777799999861654777899987550201 No 174 >pfam04348 LppC LppC putative lipoprotein. This family includes several bacterial outer membrane antigens, whose molecular function is unknown. Probab=94.36 E-value=0.42 Score=26.65 Aligned_cols=108 Identities=12% Similarity=-0.071 Sum_probs=59.0 Q ss_pred HHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHCCCCHHHHHHHHHHH-H Q ss_conf 99863013466615899999875322221000168999999726885---0579998654552158869999999988-8 Q gi|254781033|r 106 MHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPAT---REFAVYSLYFESCRIGDLNSAQRYATKA-L 181 (492) Q Consensus 106 ~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~---~~~~~~gL~~~al~~gd~~~A~~~~~~a-~ 181 (492) ++.++..... +...-..|+++++...+|+...|...+..+....-+ ......-.+..++..|+++.|+..+... . T Consensus 11 YL~~A~~s~~-~~~~~~~L~Aa~a~i~eg~~~qA~~ll~~l~~~~L~~~q~~e~~Ll~A~l~~~~~~~~~Al~~L~~~~~ 89 (535) T pfam04348 11 YLNKASQSQG-EQQNTYKLLAARVLIEENKIAQAQALLRELSVQQLTDEQKLDKSLLEAELAAAKRQNEVALQQLSHQHL 89 (535) T ss_pred HHHHHHHCCH-HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCC T ss_conf 9999984394-779999999999999779999999999974041199999999999999999975999999998557981 Q ss_pred HHCCCCHH--HHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 42126989--9999999999525745699999999 Q gi|254781033|r 182 DISPDAPW--VTEAVVQQYVLAKEWSRAITFLNQK 214 (492) Q Consensus 182 ~~~P~~~~--a~~~L~~l~~~~gdw~~A~~~l~~~ 214 (492) ..-|..-+ .+...+.++...|++-++...+-.. T Consensus 90 ~~L~~~q~~r~~~l~A~~~~~~~~~l~a~r~~i~l 124 (535) T pfam04348 90 SGLSPSQQSRYYEGRARIAEARGDAIEAAKARIQM 124 (535) T ss_pred CCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 10799999999999999999769869999999986 No 175 >COG5416 Uncharacterized integral membrane protein [Function unknown] Probab=94.35 E-value=0.25 Score=28.38 Aligned_cols=64 Identities=8% Similarity=0.037 Sum_probs=39.7 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 99999999997348984899989976622189999999999999999999999984182889999999999 Q gi|254781033|r 11 ISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNY 81 (492) Q Consensus 11 i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~ 81 (492) ++++.+.+.+...|.-.|.+++..|+.+.++...+++.++.-.++ ..+.+.++.++..++.+|. T Consensus 31 ilillLllifav~Nt~~V~~~~lfg~~~~PLilvil~s~v~G~Li-------~~~~~~~Ri~~lrr~~krl 94 (98) T COG5416 31 ILILLLLLIFAVINTDSVEFNYLFGQWELPLILVILGAAVVGALI-------AMFAGIARILQLRREVKRL 94 (98) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECCHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH T ss_conf 999999999997337734788643133304999999999999999-------9998189999999999988 No 176 >KOG0376 consensus Probab=94.30 E-value=0.17 Score=29.63 Aligned_cols=108 Identities=12% Similarity=0.040 Sum_probs=65.1 Q ss_pred HHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCC Q ss_conf 99999721004787999986301346661589999987532222100016899999972688505799986545521588 Q gi|254781033|r 90 TGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGD 169 (492) Q Consensus 90 ~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd 169 (492) ++=.++-..+++.|.-++.|+...+ ++.+...-..+++....+++..|..=+..+++.+|+...++..-+...+..+. T Consensus 10 ean~~l~~~~fd~avdlysKaI~ld--pnca~~~anRa~a~lK~e~~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m~l~~ 87 (476) T KOG0376 10 EANEALKDKVFDVAVDLYSKAIELD--PNCAIYFANRALAHLKVESFGGALHDALKAIELDPTYIKAYVRRGTAVMALGE 87 (476) T ss_pred HHHHHCCCCHHHHHHHHHHHHHHCC--CCCEEEECHHHHHHEEECHHHHHHHHHHHHHHCCCHHHHEEEECCHHHHHHHH T ss_conf 7765412001999999999998528--86324423021234320003367787776354080123303431177875777 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 699999999888421269899999999999 Q gi|254781033|r 170 LNSAQRYATKALDISPDAPWVTEAVVQQYV 199 (492) Q Consensus 170 ~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~ 199 (492) +-+|+..+++...+.|+.+.+...+.+... T Consensus 88 ~~~A~~~l~~~~~l~Pnd~~~~r~~~Ec~~ 117 (476) T KOG0376 88 FKKALLDLEKVKKLAPNDPDATRKIDECNK 117 (476) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 999999998766238676889987899999 No 177 >KOG0551 consensus Probab=94.10 E-value=0.44 Score=26.43 Aligned_cols=100 Identities=13% Similarity=0.154 Sum_probs=74.4 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHH-CCCHHHHH-HH-----HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC Q ss_conf 999999739947899999999983-28989999-99-----985078958999999999982689836899999999987 Q gi|254781033|r 266 AAKSLISQNKKRKAEVILEKIWKV-NPHPEIAN-IY-----THLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEM 338 (492) Q Consensus 266 ~a~~l~~~g~~~~A~~~le~a~~~-~p~p~La~-a~-----~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~ 338 (492) -+.-+.+..+++.|...+.++++. .++|+|-. .| +++..+++..++.-..+.++.+|.|.-+.+--|..++.. T Consensus 87 eGN~~fK~Kryk~A~~~Yt~Glk~kc~D~dlnavLY~NRAAa~~~l~NyRs~l~Dcs~al~~~P~h~Ka~~R~Akc~~eL 166 (390) T KOG0551 87 EGNEYFKEKRYKDAVESYTEGLKKKCADPDLNAVLYTNRAAAQLYLGNYRSALNDCSAALKLKPTHLKAYIRGAKCLLEL 166 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 76898777567689999988876418992378998842999999999899999999998614873044343566899999 Q ss_pred CCHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 998899999887866099989999999 Q gi|254781033|r 339 GSIDQAHAKAMLAMKIAPRKEIFLLLA 365 (492) Q Consensus 339 g~~~~Ar~~l~~al~~~P~~~~~~llA 365 (492) ..|+.|....+..+..+-....+.-|. T Consensus 167 e~~~~a~nw~ee~~~~d~e~K~~~~l~ 193 (390) T KOG0551 167 ERFAEAVNWCEEGLQIDDEAKKAIELR 193 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 888887777765311017789999998 No 178 >KOG0376 consensus Probab=93.96 E-value=0.14 Score=30.45 Aligned_cols=87 Identities=17% Similarity=0.211 Sum_probs=41.2 Q ss_pred HHHHCCHHHHHHHHHHHHCCCCH--HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 22210001689999997268850--5799986545521588699999999888421269899999999999525745699 Q gi|254781033|r 131 LAERQYNIAHEKLEMMLQIPATR--EFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAI 208 (492) Q Consensus 131 ~~~gd~~~A~~~~~~~~~~~~~~--~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~ 208 (492) ...++++.|...|.++++.+++. +.+.|. ...++.+++..|+.-+.++.++.|....+....+......+.+.+|+ T Consensus 15 l~~~~fd~avdlysKaI~ldpnca~~~anRa--~a~lK~e~~~~Al~Da~kaie~dP~~~K~Y~rrg~a~m~l~~~~~A~ 92 (476) T KOG0376 15 LKDKVFDVAVDLYSKAIELDPNCAIYFANRA--LAHLKVESFGGALHDALKAIELDPTYIKAYVRRGTAVMALGEFKKAL 92 (476) T ss_pred CCCCHHHHHHHHHHHHHHCCCCCEEEECHHH--HHHEEECHHHHHHHHHHHHHHCCCHHHHEEEECCHHHHHHHHHHHHH T ss_conf 1200199999999999852886324423021--23432000336778777635408012330343117787577799999 Q ss_pred HHHHHHHHCCC Q ss_conf 99999995023 Q gi|254781033|r 209 TFLNQKKKNAK 219 (492) Q Consensus 209 ~~l~~~~~~~~ 219 (492) ..|+...+..| T Consensus 93 ~~l~~~~~l~P 103 (476) T KOG0376 93 LDLEKVKKLAP 103 (476) T ss_pred HHHHHHHHCCC T ss_conf 99987662386 No 179 >pfam10345 Cohesin_load Cohesin loading factor. Cohesin_load is a common cohesin loading factor protein that is conserved in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes but dispensable in G2 when cohesion has been established. It is referred to as both Ssl3, in pombe, and Scc4, in S.cerevisiae. It complexes with Mis4. Probab=93.80 E-value=0.52 Score=25.87 Aligned_cols=117 Identities=13% Similarity=0.061 Sum_probs=48.0 Q ss_pred CHHHHHHHHHHHHCCC----CCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC----H-HHHHHHH-HHHHHCC Q ss_conf 0047879999863013----466615899999875322221000168999999726885----0-5799986-5455215 Q gi|254781033|r 98 HNIPLARKMHSYVSQQ----HTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPAT----R-EFAVYSL-YFESCRI 167 (492) Q Consensus 98 Gd~~~A~k~~~~a~~~----~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~----~-~~~~~gL-~~~al~~ 167 (492) .|.+.|+..+.|+... +..|--.-...|.++.....+. ..|.......+++-.+ . ....+-+ ....+.. T Consensus 75 ~N~d~Aet~LsK~i~l~~~~~l~dlkf~~q~Lla~~~~ktn~-kaAl~~l~~~I~~~e~~~~~~w~~~frll~~~l~l~~ 153 (593) T pfam10345 75 ENIDLAETYLSKAILIAERNGLEDLKFVCQFLLARLLFKTNP-KAALKSLNKLIEDAETYQHNAWVYAFRLLKIQLLLQS 153 (593) T ss_pred CCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCH-HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC T ss_conf 679999999999999876179457999899999999975281-8899999999999862463329999999999998742 Q ss_pred CCHHHHHHHHHHHHHHCCC--CHHHH--HH--HHHHHHHCCCCHHHHHHHHHHH Q ss_conf 8869999999988842126--98999--99--9999995257456999999999 Q gi|254781033|r 168 GDLNSAQRYATKALDISPD--APWVT--EA--VVQQYVLAKEWSRAITFLNQKK 215 (492) Q Consensus 168 gd~~~A~~~~~~a~~~~P~--~~~a~--~~--L~~l~~~~gdw~~A~~~l~~~~ 215 (492) +|+..|++.+++.....-. +..+. .. -+.+....+...+.+..+.++. T Consensus 154 ~d~~~A~~~L~~l~~~a~~~~~~~~~v~~~l~~~~l~l~~~~~~d~l~~l~~~~ 207 (593) T pfam10345 154 ADLAAALENLQSLAELANRRGDRKLFVVLSLLEALLHLYRGSPKDVLELLQEAI 207 (593) T ss_pred CCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 876899999999998742158689999999999989750398256999999999 No 180 >PRK11906 transcriptional regulator; Provisional Probab=93.80 E-value=0.53 Score=25.86 Aligned_cols=88 Identities=11% Similarity=0.090 Sum_probs=65.6 Q ss_pred HHHHHHHHHHHHHHCC-CHHHHHHHHHHC--CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 4789999999998328-989999999850--7895899999999998268983689999999998799889999988786 Q gi|254781033|r 276 KRKAEVILEKIWKVNP-HPEIANIYTHLL--SENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAM 352 (492) Q Consensus 276 ~~~A~~~le~a~~~~p-~p~La~a~~~~~--~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al 352 (492) ..+|...++++...++ +|......+.+. .++..-.--.|++.--..|+.+..++--+..+.-.|+.++|+..+++.+ T Consensus 320 ~~kA~e~~d~a~~i~~~n~qaLgllgLIsgL~~q~~Vs~~LFeQA~ll~p~s~di~yy~~~~~F~~g~leea~~~I~ksL 399 (458) T PRK11906 320 AQKALELLDYVSDITTVDGKILAIMGLITGLSGQAKVSHILFEQAKIHSTDIASLYYYRALVHFHNEKIEEARICIDKSL 399 (458) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 99999999998836777767889999986134236698999987414699863312133353563374088999999986 Q ss_pred HCCCCHHHHHH Q ss_conf 60999899999 Q gi|254781033|r 353 KIAPRKEIFLL 363 (492) Q Consensus 353 ~~~P~~~~~~l 363 (492) +.+|+.+..-. T Consensus 400 ~leP~r~~a~i 410 (458) T PRK11906 400 QLEPRRRKAVV 410 (458) T ss_pred CCCCHHHCCCH T ss_conf 14940100245 No 181 >pfam09613 HrpB1_HrpK Bacterial type III secretion protein (HrpB1_HrpK). This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia. Probab=93.79 E-value=0.53 Score=25.86 Aligned_cols=61 Identities=11% Similarity=-0.056 Sum_probs=32.2 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 7895899999999998268983689999999998799889999988786609998999999 Q gi|254781033|r 304 SENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLL 364 (492) Q Consensus 304 ~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ll 364 (492) .++..+....+..+--..|+.++.-+.-|.+++..|+|++|.+.|..+.+..|....+..| T Consensus 23 ~~~~~D~e~lL~AlRvLRP~~~e~d~~dg~l~i~rg~w~dA~r~lr~v~~~~~~~~~~kAL 83 (156) T pfam09613 23 LADIADAEELLEALRILRPRLAELDLFDGWLYIGRGQWDDAARLLRELEGRASHLPLCKAL 83 (156) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 4887789999999998589832665878778776065899999999986267784899999 No 182 >pfam04053 Coatomer_WDAD Coatomer WD associated region. This region is composed of WD40 repeats. Probab=93.58 E-value=0.57 Score=25.59 Aligned_cols=156 Identities=12% Similarity=-0.004 Sum_probs=88.4 Q ss_pred HHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 97210047879999863013466615899999875322221000168999999726885057999865455215886999 Q gi|254781033|r 94 SIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSA 173 (492) Q Consensus 94 al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A 173 (492) ++..||++.+.++.+...-. .+.+.... .+.-.+.+|-.+.|. +...+++++| ..+++.|+.+.| T Consensus 265 av~r~d~~~~~~~~~~~~l~--~p~~~~~~--i~~fLe~~G~~e~AL----~~~~D~~~rF-------eLAl~lG~L~~A 329 (435) T pfam04053 265 ALLRKDYEEVLRIIANSNLL--PPKDQGQK--IIRFLEKKGYPELAL----QFVTDPDTRF-------DLALELGNLDVA 329 (435) T ss_pred HHHCCCHHHHHHHHHHCCCC--CCHHHHHH--HHHHHHHCCCHHHHH----HHCCCHHHHH-------HHHHHCCCHHHH T ss_conf 99757788999876421336--87346679--999999779788887----6608904761-------676861999999 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 99999888421269899999999999525745699999999950231035789999999999996699899999998754 Q gi|254781033|r 174 QRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESL 253 (492) Q Consensus 174 ~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~ 253 (492) ++.+.+. +++.-+..|++....+|+.+-|...+.+... .. .+...+...|+.+.-.+....+. T Consensus 330 ~~~A~~~-----~~~~~Wk~Lg~~AL~~g~~~lAe~~~~k~~d----~~--------~LllLy~~tG~~~~L~kl~~~a~ 392 (435) T pfam04053 330 LEIAKEL-----DDEHKWKRLGDAALSQGNIKLAEEAYQKAKD----FD--------KLLLLYLSTGNKEKLKKLAKIAE 392 (435) T ss_pred HHHHHHH-----CCHHHHHHHHHHHHHCCCHHHHHHHHHHHCC----HH--------HHHHHHHHCCCHHHHHHHHHHHH T ss_conf 9999873-----6799999999999974995899999997425----34--------55999998499999999999999 Q ss_pred HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 313352799999999999739947899999999 Q gi|254781033|r 254 KLCDNSIMASICAAKSLISQNKKRKAEVILEKI 286 (492) Q Consensus 254 ~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a 286 (492) .....++ +... +.-.|+..+...+|.+. T Consensus 393 ~~g~~n~--aF~~---~~~lgd~~~cv~lL~~t 420 (435) T pfam04053 393 ERGDYNS--AFQN---ALYLGDVEKCVDILIKT 420 (435) T ss_pred HCCCCCH--HHHH---HHHHCCHHHHHHHHHHC T ss_conf 8476229--9999---98808999999999985 No 183 >KOG1308 consensus Probab=93.53 E-value=0.05 Score=33.86 Aligned_cols=62 Identities=15% Similarity=0.090 Sum_probs=38.2 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC Q ss_conf 98654552158869999999988842126989999999999952574569999999995023 Q gi|254781033|r 158 YSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAK 219 (492) Q Consensus 158 ~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~ 219 (492) ++-+.+++..|.+++|+.++..+..++|.+...+.....+++..+.|..++.-+..++...+ T Consensus 118 k~~A~ealn~G~~~~ai~~~t~ai~lnp~~A~l~~kR~sv~lkl~kp~aairD~d~A~ein~ 179 (377) T KOG1308 118 KVQASEALNDGEFDTAIELFTSAIELNPPLAILYAKRASVFLKLKKPNAAIRDCDFAIEINP 179 (377) T ss_pred HHHHHHHHCCCCHHHHHCCCCCCCCCCCCHHHHCCCCCCEEEECCCCCHHHHHHHHHHCCCC T ss_conf 78889873276433331155312036994344105534100223687346540245541683 No 184 >pfam11146 DUF2905 Protein of unknown function (DUF2905). This is a family of bacterial proteins conserved of unknown function. Probab=93.52 E-value=0.38 Score=26.93 Aligned_cols=57 Identities=18% Similarity=0.192 Sum_probs=40.8 Q ss_pred CHHHHHHHHHHHHHHHHH-HH------HHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999-99------7348984899989976622189999999999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSF-II------VSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWIL 57 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~-~~------l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~l 57 (492) |.|.++++.++.++..+. .| +..-||++.|.-+|.++-.++..-++..+++.+++++ T Consensus 1 m~k~Li~~g~~l~v~G~l~~~l~k~~~lGrLPGDi~i~~~~~~fyfPi~T~illS~vlsli~~l 64 (65) T pfam11146 1 MGRWLIVLGLVLLVLGLLLPWLGKLLGLGRLPGDIRIRRGNFTFYFPLTTSILLSLLLSLLLWL 64 (65) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEECCEEEEEECHHHHHHHHHHHHHHHH T ss_conf 9007999999999999999998662777889867899868989999779999999999999997 No 185 >pfam04348 LppC LppC putative lipoprotein. This family includes several bacterial outer membrane antigens, whose molecular function is unknown. Probab=93.52 E-value=0.58 Score=25.53 Aligned_cols=57 Identities=26% Similarity=0.290 Sum_probs=34.1 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHCCCCCCCC-HHHHHHHHHHHHHHHHCCHHHHHHHH Q ss_conf 9999999721004787999986301346661-58999998753222210001689999 Q gi|254781033|r 88 LYTGLMSIAAHNIPLARKMHSYVSQQHTFHN-EYLVYLLEVQIALAERQYNIAHEKLE 144 (492) Q Consensus 88 l~~gl~al~~Gd~~~A~k~~~~a~~~~~~~~-~~L~~Ll~A~aA~~~gd~~~A~~~~~ 144 (492) |.-.-..+.+|++..|...+.........+. -.-..|+.|+.+..+|++..|...+. T Consensus 28 L~Aa~a~i~eg~~~qA~~ll~~l~~~~L~~~q~~e~~Ll~A~l~~~~~~~~~Al~~L~ 85 (535) T pfam04348 28 LLAARVLIEENKIAQAQALLRELSVQQLTDEQKLDKSLLEAELAAAKRQNEVALQQLS 85 (535) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHC T ss_conf 9999999977999999999997404119999999999999999997599999999855 No 186 >pfam05843 Suf Suppressor of forked protein (Suf). This family consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster Suppressor of forked [Su(f)] protein shares homology with the yeast RNA14 protein and the 77-kDa subunit of human cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and an important role of the GU-rich sequence for this regulation to occur. Probab=93.37 E-value=0.61 Score=25.35 Aligned_cols=144 Identities=16% Similarity=0.096 Sum_probs=88.0 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999999995257456999999999502310357899999999999966998999999987543133527999999999 Q gi|254781033|r 190 VTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKS 269 (492) Q Consensus 190 a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~ 269 (492) ++..++....+.+--.+++..+.++.+...... -+...+..+++.-.+|..-|.+.+.-..|.-++.......|.+. T Consensus 3 ~yi~~m~~~rR~eGi~~aR~vF~~aRk~~~~t~---~vyva~Al~E~~~~~d~~~a~kIFelGlK~f~~~~~y~~~Yld~ 79 (275) T pfam05843 3 VYIQYMRAMRRAEGIKAARKVFKKARKKGRITY---HVYVASALMEYYCSKDPKTAFKIFELGLKLFPEDGEFVLKYLDF 79 (275) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCH---HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 899999999984262889999999860789855---99999999999873780799999999987658978999999999 Q ss_pred HHHCCCHHHHHHHHHHHHHHCC-CHH---HHHHHHHHC--CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 9973994789999999998328-989---999999850--789589999999999826898368999999999 Q gi|254781033|r 270 LISQNKKRKAEVILEKIWKVNP-HPE---IANIYTHLL--SENTVGKLKRALRLEEINKESVESLVIVSKIAL 336 (492) Q Consensus 270 l~~~g~~~~A~~~le~a~~~~p-~p~---La~a~~~~~--~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~ 336 (492) |+..|+...+..++|++....+ .++ +..-|...+ -|+-...++.-++.....|++.......-|-.. T Consensus 80 Li~~nd~~n~R~lFE~~~~~l~~~~e~~~i~~~~~~fEs~~gdL~~i~klE~R~~e~fPed~~l~~f~dRYs~ 152 (275) T pfam05843 80 LISLNDDTNARVLFETVVTKLTPEPEAKPLWKKFIKYESKYGDLSSILKLEKRMFELFPEDPPLELFSDRYSF 152 (275) T ss_pred HHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHCC T ss_conf 9871775018999999985048737779999999999885086999999999999867897279999998625 No 187 >KOG3617 consensus Probab=93.33 E-value=0.62 Score=25.30 Aligned_cols=149 Identities=15% Similarity=0.048 Sum_probs=70.5 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHH-HHHHHHHHHHCCCC Q ss_conf 99997210047879999863013466615899999875322221000168999999726885057-99986545521588 Q gi|254781033|r 91 GLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREF-AVYSLYFESCRIGD 169 (492) Q Consensus 91 gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~-~~~gL~~~al~~gd 169 (492) +.+|.--|=.+.|..++++-.+. +.|+.|+ +.+|...+|.+.-+ ..+.-... .++..+......+| T Consensus 807 AvLAieLgMlEeA~~lYr~ckR~-----DLlNKly-----Qs~g~w~eA~eiAE---~~DRiHLr~Tyy~yA~~Lear~D 873 (1416) T KOG3617 807 AVLAIELGMLEEALILYRQCKRY-----DLLNKLY-----QSQGMWSEAFEIAE---TKDRIHLRNTYYNYAKYLEARRD 873 (1416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHH-----HHCCCHHHHHHHHH---HCCCEEHHHHHHHHHHHHHHHCC T ss_conf 89999873488999999988888-----7999999-----86100888888775---10403124568778788886212 Q ss_pred HHHHHHHHHHH----------HHHCCCCH----------HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC---------- Q ss_conf 69999999988----------84212698----------9999999999952574569999999995023---------- Q gi|254781033|r 170 LNSAQRYATKA----------LDISPDAP----------WVTEAVVQQYVLAKEWSRAITFLNQKKKNAK---------- 219 (492) Q Consensus 170 ~~~A~~~~~~a----------~~~~P~~~----------~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~---------- 219 (492) .+.|+.+++++ +..+|+.. ..+.--++.....|+.+.|+..+..+..... T Consensus 874 i~~AleyyEK~~~hafev~rmL~e~p~~~e~Yv~~~~d~~L~~WWgqYlES~GemdaAl~~Y~~A~D~fs~VrI~C~qGk 953 (1416) T KOG3617 874 IEAALEYYEKAGVHAFEVFRMLKEYPKQIEQYVRRKRDESLYSWWGQYLESVGEMDAALSFYSSAKDYFSMVRIKCIQGK 953 (1416) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHEEEEEECCC T ss_conf 88999888751873878899988583889999985065289999999975243157889999876555542366763167 Q ss_pred CHHH-------HHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 1035-------78999999999999669989999999875 Q gi|254781033|r 220 EWNR-------NRAILLIARSLENADKGDMIASYHDAIES 252 (492) Q Consensus 220 ~~~~-------~~a~l~~~~a~~~~~~~~~~~A~~~~~~a 252 (492) ..+. .....-+-.++.+...|+..+|...+.+| T Consensus 954 ~~kAa~iA~esgd~AAcYhlaR~YEn~g~v~~Av~FfTrA 993 (1416) T KOG3617 954 TDKAARIAEESGDKAACYHLARMYENDGDVVKAVKFFTRA 993 (1416) T ss_pred CHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 3588999886141788999999862247788899998998 No 188 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=93.27 E-value=0.63 Score=25.24 Aligned_cols=123 Identities=16% Similarity=0.144 Sum_probs=54.5 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC----CCCHHHHHHHHHHHHC---CCCHHHHHHHHHHHHHHHHHHCCCH Q ss_conf 69999999988842126989999999999952----5745699999999950---2310357899999999999966998 Q gi|254781033|r 170 LNSAQRYATKALDISPDAPWVTEAVVQQYVLA----KEWSRAITFLNQKKKN---AKEWNRNRAILLIARSLENADKGDM 242 (492) Q Consensus 170 ~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~----gdw~~A~~~l~~~~~~---~~~~~~~~a~l~~~~a~~~~~~~~~ 242 (492) +..|..+++++++..|++..+...+.-.+... +--.+-+..|.+.++. .+.... ....+-+.+...+.+|++ T Consensus 350 l~~~idLl~~~~~~ePdn~Yv~Ae~yi~~~~~~~l~~~~~~~l~~Ln~~l~~~~~~~~~~~-~~riyeAlAl~al~~~~~ 428 (512) T PRK10153 350 LNKGSDLLEEILKSEPDFTYALAEKALADHVRQSLQPLDEKQLAALNTELDNIVALPGVNV-LSRIYEALAVQALVKGDT 428 (512) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCC-CCHHHHHHHHHHHHCCCH T ss_conf 9999999999997299838999999999999985498889999999999999873324688-728899999999847999 Q ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHH Q ss_conf 9999999875431335279999999999973994789999999998328989 Q gi|254781033|r 243 IASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPE 294 (492) Q Consensus 243 ~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~ 294 (492) ++|.....++.... ..+-..++.|+++.-.|+.+.|...+-+|....|.++ T Consensus 429 ~~A~~~L~~~~~~~-~Svl~YILlgK~aEl~G~~~~A~e~Y~~Af~~~~s~~ 479 (512) T PRK10153 429 DEAYQALNKAIELE-MSWLNYVLLGKVAELKGDNRLAADAYSTAFNLRPGEN 479 (512) T ss_pred HHHHHHHHHCHHHH-HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH T ss_conf 99999998675645-5699999999999964996688999999980388389 No 189 >KOG1586 consensus Probab=92.99 E-value=0.69 Score=24.95 Aligned_cols=42 Identities=10% Similarity=0.015 Sum_probs=29.3 Q ss_pred HHHHHHHHC-CCHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHH Q ss_conf 999999987-9988999998878660999---89999999999985 Q gi|254781033|r 330 IVSKIALEM-GSIDQAHAKAMLAMKIAPR---KEIFLLLAQIEQAN 371 (492) Q Consensus 330 ~LA~~~~~~-g~~~~Ar~~l~~al~~~P~---~~~~~llA~ie~~e 371 (492) .-|-+|.-. .|--.++..|++..+.+|+ +|-|-+|.++.+.- T Consensus 199 lkAgLChl~~~D~v~a~~ALeky~~~dP~F~dsREckflk~L~~ai 244 (288) T KOG1586 199 LKAGLCHLCKADEVNAQRALEKYQELDPAFTDSRECKFLKDLLDAI 244 (288) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH T ss_conf 9999886730207789999998875398646408879999999988 No 190 >COG3771 Predicted membrane protein [Function unknown] Probab=92.99 E-value=0.69 Score=24.94 Aligned_cols=83 Identities=11% Similarity=0.040 Sum_probs=40.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) |.-++.|++++++.....+.=++|.-.|++++.--+-|..+..++. .+|.+-..+-|++..++..-.+++..+-.|+ T Consensus 1 mKyil~~vlvlaifliavtlGa~Ndq~v~FNYllAqgef~LSTLla---~lF~~G~~lgwli~g~fy~k~~l~~~~l~rq 77 (97) T COG3771 1 MKYILIFVLVLAIFLIAVTLGAQNDQQVTFNYLLAQGEFRLSTLLA---TLFAAGFALGWLICGLFYLKVRLSLMRLERQ 77 (97) T ss_pred CHHHHHHHHHHHHHHHHHHCCCCCCEEEEEEEHHHCCHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9107999999999999851047886088640004324302999999---9999999999999978999898886899999 Q ss_pred HHHHHH Q ss_conf 999999 Q gi|254781033|r 81 YDKGYK 86 (492) Q Consensus 81 ~~k~~~ 86 (492) -++... T Consensus 78 iKr~~~ 83 (97) T COG3771 78 IKRLEN 83 (97) T ss_pred HHHHHH T ss_conf 999985 No 191 >KOG4642 consensus Probab=92.76 E-value=0.74 Score=24.72 Aligned_cols=85 Identities=15% Similarity=0.128 Sum_probs=41.5 Q ss_pred HCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHH Q ss_conf 52574569999999995023103578999999999999669989999999875431335279999999999973994789 Q gi|254781033|r 200 LAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKA 279 (492) Q Consensus 200 ~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A 279 (492) ..+.+..|+..+.+++...|... ..+...+..++...+.+......++++.+.|+.+-+...++..+.....+.+| T Consensus 22 ~~k~y~~ai~~y~raI~~nP~~~----~Y~tnralchlk~~~~~~v~~dcrralql~~N~vk~h~flg~~~l~s~~~~ea 97 (284) T KOG4642 22 IPKRYDDAIDCYSRAICINPTVA----SYYTNRALCHLKLKHWEPVEEDCRRALQLDPNLVKAHYFLGQWLLQSKGYDEA 97 (284) T ss_pred CHHHHCHHHHHHHHHHHCCCCCC----HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 30111468999999885489730----24454788888762002666617889860728789999988888752366078 Q ss_pred HHHHHHHHH Q ss_conf 999999998 Q gi|254781033|r 280 EVILEKIWK 288 (492) Q Consensus 280 ~~~le~a~~ 288 (492) .+.|.+++. T Consensus 98 I~~Lqra~s 106 (284) T KOG4642 98 IKVLQRAYS 106 (284) T ss_pred HHHHHHHHH T ss_conf 999999998 No 192 >KOG4507 consensus Probab=92.71 E-value=0.75 Score=24.67 Aligned_cols=92 Identities=13% Similarity=0.067 Sum_probs=54.3 Q ss_pred HHCCCHHHHHHHHHHHHHHCCCHH---HH-HHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 973994789999999998328989---99-99998507895899999999998268983689999999998799889999 Q gi|254781033|r 271 ISQNKKRKAEVILEKIWKVNPHPE---IA-NIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHA 346 (492) Q Consensus 271 ~~~g~~~~A~~~le~a~~~~p~p~---La-~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~ 346 (492) ..+|+...|.+-|..++...|... ++ .+-..+..+-..++-+.+-+.+..+...|-.++.+|++++...+.+.|.+ T Consensus 618 r~~gn~~~a~~cl~~a~~~~p~~~~v~~v~la~~~~~~~~~~da~~~l~q~l~~~~sepl~~~~~g~~~l~l~~i~~a~~ 697 (886) T KOG4507 618 RAVGNSTFAIACLQRALNLAPLQQDVPLVNLANLLIHYGLHLDATKLLLQALAINSSEPLTFLSLGNAYLALKNISGALE 697 (886) T ss_pred EECCCCHHHHHHHHHHHHCCHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCCHHHHHHCHHHHHHHHHHHHHH T ss_conf 64278088999999887058265235399999999986221008899999986502675288750237887750488999 Q ss_pred HHHHHHHCCCCHHHHH Q ss_conf 9887866099989999 Q gi|254781033|r 347 KAMLAMKIAPRKEIFL 362 (492) Q Consensus 347 ~l~~al~~~P~~~~~~ 362 (492) ++.++++.+|...+|. T Consensus 698 ~~~~a~~~~~~~~~~~ 713 (886) T KOG4507 698 AFRQALKLTTKCPECE 713 (886) T ss_pred HHHHHHHCCCCCHHHH T ss_conf 9999973499983467 No 193 >COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms] Probab=92.68 E-value=0.75 Score=24.64 Aligned_cols=59 Identities=17% Similarity=0.139 Sum_probs=28.0 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 98654552158869999999988842126989999999999952574569999999995 Q gi|254781033|r 158 YSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK 216 (492) Q Consensus 158 ~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~ 216 (492) ..++......|+.+.+...++++...+|.+...+..+++.|...|+-..|+..+.+..+ T Consensus 157 ~~lae~~~~~~~~~~~~~~l~~Li~~dp~~E~~~~~lm~~y~~~g~~~~ai~~y~~l~~ 215 (280) T COG3629 157 TKLAEALIACGRADAVIEHLERLIELDPYDEPAYLRLMEAYLVNGRQSAAIRAYRQLKK 215 (280) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 99999987312188999999999862964649999999999975880689999999999 No 194 >PRK06330 transcript cleavage factor/unknown domain fusion protein; Validated Probab=92.51 E-value=0.79 Score=24.48 Aligned_cols=38 Identities=11% Similarity=0.101 Sum_probs=23.7 Q ss_pred HHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHC Q ss_conf 52158869999999988842126989999999999952 Q gi|254781033|r 164 SCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLA 201 (492) Q Consensus 164 al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~ 201 (492) +....+++.++..+...++..+++.|+...|.+.|... T Consensus 234 y~~~e~~~~~i~~lK~iL~~d~~~~kaR~~li~~~k~k 271 (906) T PRK06330 234 YKQLENWDEVIYILKKILEHDNKSNKAREELIRFYKEK 271 (906) T ss_pred HHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHH T ss_conf 76127688999999999733501068899999999986 No 195 >KOG3824 consensus Probab=91.16 E-value=1.1 Score=23.42 Aligned_cols=76 Identities=17% Similarity=0.081 Sum_probs=49.9 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 5899999875322221000168999999726885057999865455215886999999998884212698999999 Q gi|254781033|r 119 EYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAV 194 (492) Q Consensus 119 ~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L 194 (492) +++..|-.|+....+|+.+.|...|+.++.-.|+....+...........+.-.|-+++-+++.++|.|..++... T Consensus 115 EA~~Al~~A~~~~~~Gk~ekA~~lfeHAlalaP~~p~~L~e~G~f~E~~~~iv~ADq~Y~~ALtisP~nseALvnR 190 (472) T KOG3824 115 EAILALKAAGRSRKDGKLEKAMTLFEHALALAPTNPQILIEMGQFREMHNEIVEADQCYVKALTISPGNSEALVNR 190 (472) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHEEEEECCCCHHHHHHH T ss_conf 9999999988988643468899999999830899779999976888851446766556420232179855877503 No 196 >TIGR02561 HrpB1_HrpK type III secretion protein HrpB1/HrpK; InterPro: IPR013394 This family of proteins is encoded by genes found within type III secretion operons in a limited range of species including Xanthomonas, Ralstonia and Burkholderia.. Probab=91.02 E-value=1.1 Score=23.31 Aligned_cols=125 Identities=13% Similarity=0.019 Sum_probs=57.7 Q ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHHHHHHCCCHH Q ss_conf 5886999999998884212698999999999995257456999999999502310---3578999999999999669989 Q gi|254781033|r 167 IGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEW---NRNRAILLIARSLENADKGDMI 243 (492) Q Consensus 167 ~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~---~~~~a~l~~~~a~~~~~~~~~~ 243 (492) .+|++.+...+.-+.-+.|+...+-..=+-++...|+|++|..+|.......+.. ..-++. ++. . +....-. T Consensus 29 ~~~~~d~~l~L~AlRvLrP~l~~~d~~dG~L~i~~G~y~dAaR~L~E~~~~~~~~svm~yg~AL--lAl--c-L~~~~D~ 103 (190) T TIGR02561 29 AADPYDIELLLDALRVLRPNLKELDMFDGWLLIARGNYDDAARILRELLASAGAASVMPYGKAL--LAL--C-LSAKGDD 103 (190) T ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHH--HHH--H-HHHCCCH T ss_conf 5788889999999850276506889999999861587589999999872489972043789999--999--8-7432888 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 9999998754313352799999999999739947899999999983289899999 Q gi|254781033|r 244 ASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANI 298 (492) Q Consensus 244 ~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a 298 (492) +=...+.+.+..+++.- .+.+.+.|.++.+.-+|...-++.=...|....+++ T Consensus 104 ~W~~~A~e~L~~~a~~d--av~LVkAL~~~~D~~~A~~~~~~~G~~~~~~s~Ae~ 156 (190) T TIGR02561 104 EWHLHADEVLEEDADAD--AVALVKALEAREDVLKAKEAARRTGAFEPPASVAEA 156 (190) T ss_pred HHHHHHHHHHHCCCCCH--HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHH T ss_conf 88999999874487850--679999886532489999986347985887115655 No 197 >pfam00515 TPR_1 Tetratricopeptide repeat. Probab=90.96 E-value=0.39 Score=26.89 Aligned_cols=32 Identities=19% Similarity=0.308 Sum_probs=22.1 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 68999999999879988999998878660999 Q gi|254781033|r 326 ESLVIVSKIALEMGSIDQAHAKAMLAMKIAPR 357 (492) Q Consensus 326 e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~ 357 (492) +.+..+|.++...|++++|.+.++++++.+|+ T Consensus 2 ~~~~~lG~~y~~~~~~~~A~~~~~~Al~l~p~ 33 (34) T pfam00515 2 KALYNLGNAYFKLGKYDEAIEYYEKALELNPN 33 (34) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 79999899999917899999999998843989 No 198 >pfam04190 DUF410 Protein of unknown function (DUF410). This family of proteins is from Caenorhabditis elegans and has no known function. The protein has some GO references indicating that the protein has a positive regulation of growth rate and is involved in nematode larval development. Probab=90.91 E-value=1.1 Score=23.25 Aligned_cols=28 Identities=11% Similarity=0.114 Sum_probs=21.1 Q ss_pred CCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 1269899999999999525745699999 Q gi|254781033|r 184 SPDAPWVTEAVVQQYVLAKEWSRAITFL 211 (492) Q Consensus 184 ~P~~~~a~~~L~~l~~~~gdw~~A~~~l 211 (492) .-+++..+..++..+.+.++...|+..+ T Consensus 86 ~~Gdp~LH~~ig~~~~~e~~~~~Ae~Hf 113 (260) T pfam04190 86 KYGDPALHTLIAKKLVKEDNVESAENHF 113 (260) T ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 8999999999999998758889998885 No 199 >KOG1585 consensus Probab=90.37 E-value=1.3 Score=22.90 Aligned_cols=198 Identities=13% Similarity=0.032 Sum_probs=82.3 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCHH------HHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHH-HHHHHHHHHHHHHHH Q ss_conf 158869999999988842126989------9999999999525745699999999950231035-789999999999996 Q gi|254781033|r 166 RIGDLNSAQRYATKALDISPDAPW------VTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNR-NRAILLIARSLENAD 238 (492) Q Consensus 166 ~~gd~~~A~~~~~~a~~~~P~~~~------a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~-~~a~l~~~~a~~~~~ 238 (492) ...++++|...+.++.+-.-++.. +...-+-+......|.++..+++++...-.+..+ ..+...+..+...++ T Consensus 43 nAk~feKakdcLlkA~~~yEnnrslfhAAKayEqaamLake~~klsEvvdl~eKAs~lY~E~GspdtAAmaleKAak~le 122 (308) T KOG1585 43 NAKKFEKAKDCLLKASKGYENNRSLFHAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVECGSPDTAAMALEKAAKALE 122 (308) T ss_pred HHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH T ss_conf 51357888999999999988321387888889999999999887279999999999999970995469999999998862 Q ss_pred CCCHHHHHHHHHHHHHCCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHH------HCCCH---HHHHHHHHHC Q ss_conf 699899999998754313352------79999999999973994789999999998------32898---9999999850 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNS------IMASICAAKSLISQNKKRKAEVILEKIWK------VNPHP---EIANIYTHLL 303 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~------~~a~~~~a~~l~~~g~~~~A~~~le~a~~------~~p~p---~La~a~~~~~ 303 (492) ..++++|+..+.++....... .+..-..+++|.+..++.+|...+.+-.. ..+.+ -++....++. T Consensus 123 nv~Pd~AlqlYqralavve~~dr~~ma~el~gk~sr~lVrl~kf~Eaa~a~lKe~~~~~~~~~y~~~~k~~va~ilv~L~ 202 (308) T KOG1585 123 NVKPDDALQLYQRALAVVEEDDRDQMAFELYGKCSRVLVRLEKFTEAATAFLKEGVAADKCDAYNSQCKAYVAAILVYLY 202 (308) T ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 38888999999999999850534878889998754573335775688999998635999986112389999999999720 Q ss_pred CCCHHHHHHHHHHHHHH----CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 78958999999999982----68983689999999998799889999988786609998999999 Q gi|254781033|r 304 SENTVGKLKRALRLEEI----NKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLL 364 (492) Q Consensus 304 ~~~~~~al~~~~~l~~~----~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ll 364 (492) ..++..+.+.++.-.+. .|++..+.-.|- -+++.||-++....+....-.+-+.....++ T Consensus 203 ~~Dyv~aekc~r~~~qip~f~~sed~r~lenLL-~ayd~gD~E~~~kvl~sp~~r~MDneya~l~ 266 (308) T KOG1585 203 AHDYVQAEKCYRDCSQIPAFLKSEDSRSLENLL-TAYDEGDIEEIKKVLSSPTVRNMDNEYAHLN 266 (308) T ss_pred HHHHHHHHHHHCCHHCCCCCCCHHHHHHHHHHH-HHHCCCCHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 888999999835230276646747789999999-8753378899999972717653028999973 No 200 >KOG1130 consensus Probab=90.14 E-value=1.3 Score=22.76 Aligned_cols=131 Identities=13% Similarity=0.057 Sum_probs=80.8 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCC------HHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC--HHHHHHHHHHH Q ss_conf 6545521588699999999888421269------899999999999525745699999999950231--03578999999 Q gi|254781033|r 160 LYFESCRIGDLNSAQRYATKALDISPDA------PWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKE--WNRNRAILLIA 231 (492) Q Consensus 160 L~~~al~~gd~~~A~~~~~~a~~~~P~~------~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~--~~~~~a~l~~~ 231 (492) |...+--.||+++|+.+-+.-+.+.... ..++..++..+.-.|+++.|.+.+...+..... .+...+...++ T Consensus 201 LGNTyYlLGdf~~ai~~H~~RL~ia~efGDrAaeRRA~sNlgN~hiflg~fe~A~ehYK~tl~LAielg~r~vEAQscYS 280 (639) T KOG1130 201 LGNTYYLLGDFDQAIHFHKLRLEIAQEFGDRAAERRAHSNLGNCHIFLGNFELAIEHYKLTLNLAIELGNRTVEAQSCYS 280 (639) T ss_pred CCCEEEEECCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 57635650457999999999999999863188887764043103555304276999999999999996313688898987 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCC------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC Q ss_conf 9999996699899999998754313352------7999999999997399478999999999832 Q gi|254781033|r 232 RSLENADKGDMIASYHDAIESLKLCDNS------IMASICAAKSLISQNKKRKAEVILEKIWKVN 290 (492) Q Consensus 232 ~a~~~~~~~~~~~A~~~~~~a~~~~p~~------~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~ 290 (492) .+..+.-..+...|+.+..+-+.....+ ..+..-++..+...|..++|....++.+... T Consensus 281 LgNtytll~e~~kAI~Yh~rHLaIAqeL~DriGe~RacwSLgna~~alg~h~kAl~fae~hl~~s 345 (639) T KOG1130 281 LGNTYTLLKEVQKAITYHQRHLAIAQELEDRIGELRACWSLGNAFNALGEHRKALYFAELHLRSS 345 (639) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 64489999999999999999999999988764148999998788876421788899999999998 No 201 >COG1110 Reverse gyrase [DNA replication, recombination, and repair] Probab=90.07 E-value=0.086 Score=31.99 Aligned_cols=27 Identities=15% Similarity=-0.112 Sum_probs=21.7 Q ss_pred CCCEEC-CCCCCCHHHCCCCCCCCCCCC Q ss_conf 995275-698955424224837687364 Q gi|254781033|r 391 DPLWIS-DDGYLSSVWLPLSPISKTLCY 417 (492) Q Consensus 391 dp~W~c-~~g~~~~~W~~~c~~~~~~~~ 417 (492) |--|.| +||+.+-+|+..||.||+-+. T Consensus 692 ~tIKrC~dcg~q~~~~~~~cP~Cgs~~v 719 (1187) T COG1110 692 DTIKRCRDCGEQFVDSEDKCPRCGSRNV 719 (1187) T ss_pred HHHHHHHHCCCEECCCCCCCCCCCCCCC T ss_conf 8888885158440443235988788320 No 202 >pfam07719 TPR_2 Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515. Probab=89.31 E-value=0.41 Score=26.73 Aligned_cols=32 Identities=19% Similarity=0.286 Sum_probs=22.9 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 68999999999879988999998878660999 Q gi|254781033|r 326 ESLVIVSKIALEMGSIDQAHAKAMLAMKIAPR 357 (492) Q Consensus 326 e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~ 357 (492) +.+..+|..+...|++++|.+.++++++.+|+ T Consensus 2 ~~~~~~g~~~~~~g~~~~A~~~~~~al~l~p~ 33 (34) T pfam07719 2 EALYNLGLAYYKLGDYEEALEAYEKALELDPN 33 (34) T ss_pred HHHHHCCHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 16875014876216699999999997726999 No 203 >COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only] Probab=89.08 E-value=0.9 Score=24.05 Aligned_cols=60 Identities=13% Similarity=0.155 Sum_probs=30.6 Q ss_pred HHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHH Q ss_conf 222210001689999997268850579998654552158869999999988842126989 Q gi|254781033|r 130 ALAERQYNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPW 189 (492) Q Consensus 130 A~~~gd~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~ 189 (492) ....||.+.+.+.|.+++.-.|.-..|+..+.....+.|+.+.|.+.+++.++++|.... T Consensus 5 ~~~~~D~~aaaely~qal~lap~w~~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~D~~ 64 (287) T COG4976 5 LAESGDAEAAAELYNQALELAPEWAAGWFRLGEYTEKAGEFDAAAAAYEEVLELDPEDHG 64 (287) T ss_pred HCCCCCHHHHHHHHHHHHHCCCHHHHHHHHCCHHHHHCCCHHHHHHHHHHHHCCCCCCCC T ss_conf 103678489999998775047044434554023666304189999999988707962122 No 204 >KOG3364 consensus Probab=88.99 E-value=1.6 Score=22.15 Aligned_cols=78 Identities=13% Similarity=0.067 Sum_probs=52.4 Q ss_pred HHHHHHHCCCCH---HHHHHHHHHHHH-HCC-CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHH-HHHHHHH Q ss_conf 999998507895---899999999998-268-9836899999999987998899999887866099989999-9999999 Q gi|254781033|r 296 ANIYTHLLSENT---VGKLKRALRLEE-INK-ESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFL-LLAQIEQ 369 (492) Q Consensus 296 a~a~~~~~~~~~---~~al~~~~~l~~-~~P-~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~-llA~ie~ 369 (492) ..+|+.+...+. ..-+..++.+++ .+| ...+..+.||-.+.+.++|+++++++...++.+|+++-.. |--.|+. T Consensus 37 ~lAwaLV~S~~~~dv~~GI~iLe~l~~~~~~~~rRe~lyYLAvg~yRlkeY~~s~~yvd~ll~~e~~n~Qa~~Lk~~ied 116 (149) T KOG3364 37 NLAWALVRSRDTEDVQEGIVILEDLLKSAHPERRRECLYYLAVGHYRLKEYSKSLRYVDALLETEPNNRQALELKETIED 116 (149) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 99999974642589997699999986106840001321356777888865899999999999608770789999999999 Q ss_pred HHCC Q ss_conf 8579 Q gi|254781033|r 370 ANSH 373 (492) Q Consensus 370 ~e~~ 373 (492) +-.. T Consensus 117 ~itk 120 (149) T KOG3364 117 KITK 120 (149) T ss_pred HHHH T ss_conf 8745 No 205 >COG2912 Uncharacterized conserved protein [Function unknown] Probab=88.34 E-value=1.7 Score=21.84 Aligned_cols=64 Identities=19% Similarity=0.100 Sum_probs=55.2 Q ss_pred HHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 8507895899999999998268983689999999998799889999988786609998999999 Q gi|254781033|r 301 HLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLL 364 (492) Q Consensus 301 ~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ll 364 (492) ..+......++...++++..+|+++.-..-.|-+|.+.|-+.-|.+.++..++..|+.+.+... T Consensus 191 ~~~e~~~~~al~~~~r~l~l~P~dp~eirDrGliY~ql~c~~vAl~dl~~~~~~~P~~~~a~~i 254 (269) T COG2912 191 LLRELQWELALRVAERLLDLNPEDPYEIRDRGLIYAQLGCYHVALEDLSYFVEHCPDDPIAEMI 254 (269) T ss_pred HHHHHCHHHHHHHHHHHHHHCCCCHHHCCCCHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHH T ss_conf 9984056999999999875189984431670899986488445699999999849993578999 No 206 >smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism. Probab=88.23 E-value=0.77 Score=24.57 Aligned_cols=32 Identities=25% Similarity=0.361 Sum_probs=21.6 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 68999999999879988999998878660999 Q gi|254781033|r 326 ESLVIVSKIALEMGSIDQAHAKAMLAMKIAPR 357 (492) Q Consensus 326 e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~ 357 (492) ..+..+|.++...|++++|.+.++++++.+|+ T Consensus 2 ~~~~~lg~~y~~~~~~~~Ai~~~~~al~~~p~ 33 (34) T smart00028 2 EALYNLGNAYLKLGDYDEALEYYEKALELDPN 33 (34) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCC T ss_conf 28999899999823699999999997735999 No 207 >KOG2471 consensus Probab=87.53 E-value=1.9 Score=21.49 Aligned_cols=28 Identities=21% Similarity=0.008 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 6899999999987998899999887866 Q gi|254781033|r 326 ESLVIVSKIALEMGSIDQAHAKAMLAMK 353 (492) Q Consensus 326 e~~~~LA~~~~~~g~~~~Ar~~l~~al~ 353 (492) +..+..|-+++..|+.-.|-+.+.+++. T Consensus 336 eilYNcG~~~Lh~grPl~AfqCf~~av~ 363 (696) T KOG2471 336 EILYNCGLLYLHSGRPLLAFQCFQKAVH 363 (696) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 4677423799863890799999999999 No 208 >KOG2610 consensus Probab=87.38 E-value=1.9 Score=21.43 Aligned_cols=212 Identities=11% Similarity=0.035 Sum_probs=118.0 Q ss_pred HHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC-CCCH--HHHHHHHHHH-HH Q ss_conf 999997210047879999863013466615899999875322221000168999999726-8850--5799986545-52 Q gi|254781033|r 90 TGLMSIAAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQI-PATR--EFAVYSLYFE-SC 165 (492) Q Consensus 90 ~gl~al~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~-~~~~--~~~~~gL~~~-al 165 (492) ..-+-.+.|+...|.-.-.+..+.. +.+.+..=..-.+....|+...-...+++.+.. .+.- ..=+.|.+.. .. T Consensus 109 ~aai~~~~g~~h~a~~~wdklL~d~--PtDlla~kfsh~a~fy~G~~~~~k~ai~kIip~wn~dlp~~sYv~GmyaFgL~ 186 (491) T KOG2610 109 KAAILWGRGKHHEAAIEWDKLLDDY--PTDLLAVKFSHDAHFYNGNQIGKKNAIEKIIPKWNADLPCYSYVHGMYAFGLE 186 (491) T ss_pred HHHHHHCCCCCCHHHHHHHHHHHHC--CHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 6787633566417788899998758--50355445444678741414421457998433458998578998889986678 Q ss_pred CCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 15886999999998884212698999999999995257456999999999502310357899999999999966998999 Q gi|254781033|r 166 RIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIAS 245 (492) Q Consensus 166 ~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A 245 (492) ..|-+++|.+.++++++++|...|+..++.-+....++..++.+.+.+--..-...--.....+---+..+.+.+....+ T Consensus 187 E~g~y~dAEk~A~ralqiN~~D~Wa~Ha~aHVlem~~r~Keg~eFM~~ted~Wr~s~mlasHNyWH~Al~~iE~aeye~a 266 (491) T KOG2610 187 ECGIYDDAEKQADRALQINRFDCWASHAKAHVLEMNGRHKEGKEFMYKTEDDWRQSWMLASHNYWHTALFHIEGAEYEKA 266 (491) T ss_pred HHCCCHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 72550269999986314787416889889999984020565799998531412123688765068888864101113578 Q ss_pred HHHHHH--HHHCCCCCHHHHHHHH---HHHHHCCCHHHHHH---HH--HHHHHHCCCHHHHHHHHHHC Q ss_conf 999987--5431335279999999---99997399478999---99--99998328989999999850 Q gi|254781033|r 246 YHDAIE--SLKLCDNSIMASICAA---KSLISQNKKRKAEV---IL--EKIWKVNPHPEIANIYTHLL 303 (492) Q Consensus 246 ~~~~~~--a~~~~p~~~~a~~~~a---~~l~~~g~~~~A~~---~l--e~a~~~~p~p~La~a~~~~~ 303 (492) ++.+.. ..++..+..-+...+- .+.......++-.+ .+ ...|-..+.-++...+..-. T Consensus 267 leIyD~ei~k~l~k~Da~a~~~~ld~dgv~~~~d~~~kld~la~~l~d~a~~~~d~~~~itt~~~~~~ 334 (491) T KOG2610 267 LEIYDREIWKRLEKDDAVARDVYLDLDGVDLRSDLWRKLDKLADSLTDKAMWYQDWLFDITTIWALSK 334 (491) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH T ss_conf 88887999987511331455544335567767888887776544311113443213311257765433 No 209 >KOG2396 consensus Probab=86.83 E-value=2.1 Score=21.21 Aligned_cols=67 Identities=22% Similarity=0.219 Sum_probs=51.1 Q ss_pred CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHHHHHHHHHHHH-CCCHHHHHHHHHHHHC Q ss_conf 68983689999999998799889999988786609998-9999999999985-7998999999999962 Q gi|254781033|r 321 NKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIFLLLAQIEQAN-SHNTDKILYWTQSALH 387 (492) Q Consensus 321 ~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~~llA~ie~~e-~~d~~~ar~wl~rA~~ 387 (492) .|+.....-.+-+.+.+.|-..+||........+.|-+ ..+.-|-++|... .-|-.-+|...++|++ T Consensus 456 ~~~~~tl~s~~l~~~~e~~~~~~ark~y~~l~~lpp~sl~l~r~miq~e~~~~sc~l~~~r~~yd~a~~ 524 (568) T KOG2396 456 GADSVTLKSKYLDWAYESGGYKKARKVYKSLQELPPFSLDLFRKMIQFEKEQESCNLANIREYYDRALR 524 (568) T ss_pred CCCEEEHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 774453158999999994134889999988872799518999999997765752470579999999999 No 210 >PRK09401 reverse gyrase; Reviewed Probab=86.71 E-value=0.18 Score=29.57 Aligned_cols=27 Identities=7% Similarity=-0.182 Sum_probs=22.2 Q ss_pred CCCCEECC-CCCCCHHHCCCCCCCCCCC Q ss_conf 99952756-9895542422483768736 Q gi|254781033|r 390 PDPLWISD-DGYLSSVWLPLSPISKTLC 416 (492) Q Consensus 390 ~dp~W~c~-~g~~~~~W~~~c~~~~~~~ 416 (492) -++-|.|. |||...+|...||.||+-+ T Consensus 674 Y~~IkrC~~cg~qft~~~~~cP~Cgs~~ 701 (1176) T PRK09401 674 YNSIKRCRDCGHQFTDESDRCPRCGSTN 701 (1176) T ss_pred EHHHHHHHHCCCCCCCCCCCCCCCCCCC T ss_conf 1667677535860124666688778887 No 211 >KOG1914 consensus Probab=86.52 E-value=2.2 Score=21.09 Aligned_cols=209 Identities=11% Similarity=-0.003 Sum_probs=133.6 Q ss_pred HHHHHHHHCCCCHHHHHHHH-------HHHHHCCCCHHH-------HHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCC Q ss_conf 99988842126989999999-------999952574569-------9999999950231035789999999999996699 Q gi|254781033|r 176 YATKALDISPDAPWVTEAVV-------QQYVLAKEWSRA-------ITFLNQKKKNAKEWNRNRAILLIARSLENADKGD 241 (492) Q Consensus 176 ~~~~a~~~~P~~~~a~~~L~-------~l~~~~gdw~~A-------~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~ 241 (492) ..+..+.--+-++.++.... +++.+.||-..| ..++++..... ......+.++.+.......+ T Consensus 267 ayeQ~ll~l~~~peiWy~~s~yl~~~s~l~~~~~d~~~a~~~t~e~~~~yEr~I~~l---~~~~~~Ly~~~a~~eE~~~~ 343 (656) T KOG1914 267 AYEQCLLYLGYHPEIWYDYSMYLIEISDLLTEKGDVPDAKSLTDEAASIYERAIEGL---LKENKLLYFALADYEESRYD 343 (656) T ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHCC T ss_conf 999999997437999999999999866898874235211345899999999987887---77889999998766887514 Q ss_pred ---HHHHHHHHHHH---HHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH--CCC-HHHHHHHHHHC-CCCHHHHH Q ss_conf ---89999999875---4313352799999999999739947899999999983--289-89999999850-78958999 Q gi|254781033|r 242 ---MIASYHDAIES---LKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKV--NPH-PEIANIYTHLL-SENTVGKL 311 (492) Q Consensus 242 ---~~~A~~~~~~a---~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~--~p~-p~La~a~~~~~-~~~~~~al 311 (492) .+.....+.+. ...+|+++ .+.+-....+..-...|.+++.++-+. .+| --++.++.+.. .++..-+. T Consensus 344 ~n~~~~~~~~~~~ll~~~~~~~tLv--~~~~mn~irR~eGlkaaR~iF~kaR~~~r~~hhVfVa~A~mEy~cskD~~~Af 421 (656) T KOG1914 344 DNKEKKVHEIYNKLLKIEDIDLTLV--YCQYMNFIRRAEGLKAARKIFKKAREDKRTRHHVFVAAALMEYYCSKDKETAF 421 (656) T ss_pred CCHHHHHHHHHHHHHHHHCCCCCEE--HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCHHHHH T ss_conf 3054430799999873221487430--26999999986407889999999853567741356788999998627814889 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC--CCCH--HHHHHHHHHHHHHCCCHHHHHHHHHHHHC Q ss_conf 9999999826898368999999999879988999998878660--9998--99999999999857998999999999962 Q gi|254781033|r 312 KRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKI--APRK--EIFLLLAQIEQANSHNTDKILYWTQSALH 387 (492) Q Consensus 312 ~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~--~P~~--~~~~llA~ie~~e~~d~~~ar~wl~rA~~ 387 (492) +-++--++..+|.++..+...--.+..++-..||..++++++. .|+. ++|.-|.+-|.. -||-..++..-.|-.. T Consensus 422 rIFeLGLkkf~d~p~yv~~YldfL~~lNdd~N~R~LFEr~l~s~l~~~ks~~Iw~r~l~yES~-vGdL~si~~lekR~~~ 500 (656) T KOG1914 422 RIFELGLKKFGDSPEYVLKYLDFLSHLNDDNNARALFERVLTSVLSADKSKEIWDRMLEYESN-VGDLNSILKLEKRRFT 500 (656) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHH T ss_conf 999988875599769999999999983862108999999872158866668999999999872-2549999999999987 Q ss_pred CCC Q ss_conf 889 Q gi|254781033|r 388 AMP 390 (492) Q Consensus 388 a~~ 390 (492) +-+ T Consensus 501 af~ 503 (656) T KOG1914 501 AFP 503 (656) T ss_pred HCC T ss_conf 461 No 212 >pfam09988 DUF2227 Uncharacterized metal-binding protein (DUF2227). Members of this family of hypothetical bacterial proteins possess metal binding properties; however, their exact function has not, as yet, been determined. Probab=86.30 E-value=1.5 Score=22.35 Aligned_cols=14 Identities=7% Similarity=-0.036 Sum_probs=9.8 Q ss_pred CEEEEEECCEEEEE Q ss_conf 84899989976622 Q gi|254781033|r 26 EDVSITWGNRLYRT 39 (492) Q Consensus 26 G~v~i~~~~~~ie~ 39 (492) |.+.+.|.+|..-+ T Consensus 61 G~l~~iW~PY~~~~ 74 (174) T pfam09988 61 GPLRWIWWPYRRLI 74 (174) T ss_pred HHHEEHHHHHHCCC T ss_conf 21411063302024 No 213 >KOG3807 consensus Probab=85.92 E-value=2.3 Score=20.87 Aligned_cols=206 Identities=15% Similarity=0.093 Sum_probs=109.4 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 79998654552158869999999988842126989999999999952574569999999995023103578999999999 Q gi|254781033|r 155 FAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSL 234 (492) Q Consensus 155 ~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~ 234 (492) ..-..++..+.+..++.+-++.+..+++++|.-..++..|++-.. .--.+|.+++.+++|......+.. . T Consensus 185 r~e~eIMQ~AWRERnp~~RI~~A~~ALeIN~eCA~AyvLLAEEEa--~Ti~~AE~l~k~ALka~e~~yr~s--------q 254 (556) T KOG3807 185 RPEDEIMQKAWRERNPPARIKAAYQALEINNECATAYVLLAEEEA--TTIVDAERLFKQALKAGETIYRQS--------Q 254 (556) T ss_pred CHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHH--------H T ss_conf 707899999987319388899899987528013357876636756--479999999999999878676567--------8 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-------CHHHHHHHHHHCCC- Q ss_conf 999669989999999875431335-279999999999973994789999999998328-------98999999985078- Q gi|254781033|r 235 ENADKGDMIASYHDAIESLKLCDN-SIMASICAAKSLISQNKKRKAEVILEKIWKVNP-------HPEIANIYTHLLSE- 305 (492) Q Consensus 235 ~~~~~~~~~~A~~~~~~a~~~~p~-~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p-------~p~La~a~~~~~~~- 305 (492) .....+...++. .+.+-+ .+..--.++-.-.++|+.++|.+.++..-+..| |..|.++..++..- T Consensus 255 q~qh~~~~~da~------~rRDtnvl~YIKRRLAMCARklGrlrEA~K~~RDL~ke~pl~t~lniheNLiEalLE~QAYA 328 (556) T KOG3807 255 QCQHQSPQHEAQ------LRRDTNVLVYIKRRLAMCARKLGRLREAVKIMRDLMKEFPLLTMLNIHENLLEALLELQAYA 328 (556) T ss_pred HHHHHCCCHHHH------HHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 876414411356------61456156888999999998762499999999987640528999888899999999989888 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHH----H---------HHHHHHCCCH---HHHHHHHHHHHHCCCCHHHHHHHHH-H- Q ss_conf 9589999999999826898368999----9---------9999987998---8999998878660999899999999-9- Q gi|254781033|r 306 NTVGKLKRALRLEEINKESVESLVI----V---------SKIALEMGSI---DQAHAKAMLAMKIAPRKEIFLLLAQ-I- 367 (492) Q Consensus 306 ~~~~al~~~~~l~~~~P~~~e~~~~----L---------A~~~~~~g~~---~~Ar~~l~~al~~~P~~~~~~llA~-i- 367 (492) +-..-+.+|..+ .-|.+...-+. . ...+.+.|.- -.|.+.+-++++-+|-..-|+|-.. + T Consensus 329 DvqavLakYDdi--slPkSA~icYTaALLK~RAVa~kFspd~asrRGLS~AE~~AvEAihRAvEFNPHVPkYLLE~kslI 406 (556) T KOG3807 329 DVQAVLAKYDDI--SLPKSAAICYTAALLKTRAVSEKFSPETASRRGLSTAEINAVEAIHRAVEFNPHVPKYLLEMKSLI 406 (556) T ss_pred HHHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCC T ss_conf 999999751004--585157889999999988887523901565515448777899999877525898867999887413 Q ss_pred ---H-HHHCCCHHHH Q ss_conf ---9-9857998999 Q gi|254781033|r 368 ---E-QANSHNTDKI 378 (492) Q Consensus 368 ---e-~~e~~d~~~a 378 (492) | -..+||.++. T Consensus 407 lPPEHiLKRGDSEAl 421 (556) T KOG3807 407 LPPEHILKRGDSEAI 421 (556) T ss_pred CCHHHHHHCCCHHHH T ss_conf 898998643758999 No 214 >pfam04184 ST7 ST7 protein. The ST7 (for suppression of tumorigenicity 7) protein is thought to be a tumour suppressor gene. The molecular function of this protein is uncertain. Probab=84.54 E-value=2.6 Score=20.41 Aligned_cols=206 Identities=14% Similarity=0.110 Sum_probs=109.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 99986545521588699999999888421269899999999999525745699999999950231035789999999999 Q gi|254781033|r 156 AVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLE 235 (492) Q Consensus 156 ~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~ 235 (492) .-...|+.+-++.++..-++.+.++++++|+-..++..|++-.. .---+|.+++.+++|......+.. .. T Consensus 170 p~~eiMQ~AWRERnp~~RIkaA~~ALe~n~dCatAyiLLAEEEa--tTI~eaE~~~kqALK~~E~~yr~s--------q~ 239 (540) T pfam04184 170 PEDTIMQKAWRERNPQARIQAAYQALEINPECATAYVLLAEEEA--TTITEAERLFKQALKAGESTYRSS--------QQ 239 (540) T ss_pred CHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC--CHHHHHHHHHHHHHHHHHHHHHHH--------HH T ss_conf 30899999987419198999999998719741888999864210--469999999999999888887655--------55 Q ss_pred HHHCCCHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-------CHHHHHHHHHHCCC-C Q ss_conf 996699899999998754313-35279999999999973994789999999998328-------98999999985078-9 Q gi|254781033|r 236 NADKGDMIASYHDAIESLKLC-DNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP-------HPEIANIYTHLLSE-N 306 (492) Q Consensus 236 ~~~~~~~~~A~~~~~~a~~~~-p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p-------~p~La~a~~~~~~~-~ 306 (492) ....+...++. .+.+ --.+...-.+|-...+.|+.++|.|..+...+..| |..|.+++.++..- + T Consensus 240 ~qh~~~~~ea~------~rrd~nv~vyikRRlAMCaRklgr~kEAvKm~rDL~ke~p~~~~lniheNLiEalLe~QaYad 313 (540) T pfam04184 240 IASYGQQGDAQ------HRRDTNVLVYIKRRLAMCARKLGRTREAVKMMRDLMKEFPLLSMLNIHENLLEALLELQAYAD 313 (540) T ss_pred HCCCCCCHHHH------HHCCCCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 30557510245------523573143108899999987511899999999998757236776788999999999888899 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHH----HHH---------HHHHCCCH---HHHHHHHHHHHHCCCCHHHHHHHHH-H-- Q ss_conf 589999999999826898368999----999---------99987998---8999998878660999899999999-9-- Q gi|254781033|r 307 TVGKLKRALRLEEINKESVESLVI----VSK---------IALEMGSI---DQAHAKAMLAMKIAPRKEIFLLLAQ-I-- 367 (492) Q Consensus 307 ~~~al~~~~~l~~~~P~~~e~~~~----LA~---------~~~~~g~~---~~Ar~~l~~al~~~P~~~~~~llA~-i-- 367 (492) ...-+.++... .-|.+...-++ .+| .+.+.|.- -.|.+.+-++++-+|-..-|+|-.. + T Consensus 314 vQavLakYDdi--slPkSAticyTaALLKaR~V~dkfspe~a~rrGLs~aE~~AvEAihRAvEFNPHVPkYLLEmkslIl 391 (540) T pfam04184 314 VQAVLAKYDDI--SLPKSATICYTAALLKARAVSDKFSPEAASRRGLSTAEMNAVEAIHRAVEFNPHVPKYLLEMKSLIL 391 (540) T ss_pred HHHHHHHHCCC--CCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHCCCC T ss_conf 99999760235--6863478989999999898873058556765379888889999998765258988668886313248 Q ss_pred --H-HHHCCCHHHHH Q ss_conf --9-98579989999 Q gi|254781033|r 368 --E-QANSHNTDKIL 379 (492) Q Consensus 368 --e-~~e~~d~~~ar 379 (492) | --.+||.+++- T Consensus 392 PpEHiLKRGDSEAiA 406 (540) T pfam04184 392 PPEHILKRGDSEAVA 406 (540) T ss_pred CHHHHHHCCCHHHHH T ss_conf 978876336478899 No 215 >pfam04910 DUF654 Protein of unknown function, DUF654. This family includes a number of poorly characterized eukaryotic proteins. Probab=84.53 E-value=2.6 Score=20.41 Aligned_cols=50 Identities=10% Similarity=-0.047 Sum_probs=26.8 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 552158869999999988842126989999999999952574569999999995 Q gi|254781033|r 163 ESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK 216 (492) Q Consensus 163 ~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~ 216 (492) .+....|++.=...+ +.+|-|...+..+.+++..+||.+.|-+++++++- T Consensus 19 ~~v~s~Dp~~l~~lL----~~~PYHvdtLLq~s~v~~~qgd~~~a~dlieRALf 68 (343) T pfam04910 19 AAVESHDPEALISLL----QKYPYHVDTLLQVSEIFRRQGDHSLANDLIERALF 68 (343) T ss_pred HHHHCCCHHHHHHHH----HHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 999806999999999----85998378999999999984778889999999999 No 216 >TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems. Probab=84.24 E-value=1.8 Score=21.71 Aligned_cols=29 Identities=14% Similarity=0.193 Sum_probs=22.7 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 99999999987998899999887866099 Q gi|254781033|r 328 LVIVSKIALEMGSIDQAHAKAMLAMKIAP 356 (492) Q Consensus 328 ~~~LA~~~~~~g~~~~Ar~~l~~al~~~P 356 (492) .+-||++|++.|+.+.||+.|+.+++... T Consensus 2 KLDLA~aYiemGD~egAr~lL~EVi~eg~ 30 (44) T TIGR03504 2 KLDLARAYIEMGDLEGARELLEEVIEEGD 30 (44) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCC T ss_conf 67899999982988989999999998499 No 217 >KOG2300 consensus Probab=83.68 E-value=2.8 Score=20.15 Aligned_cols=29 Identities=17% Similarity=0.105 Sum_probs=12.6 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCH Q ss_conf 99999997399478999999999832898 Q gi|254781033|r 265 CAAKSLISQNKKRKAEVILEKIWKVNPHP 293 (492) Q Consensus 265 ~~a~~l~~~g~~~~A~~~le~a~~~~p~p 293 (492) .++-+...+|++.+|...+.+.++...+. T Consensus 450 v~glfaf~qn~lnEaK~~l~e~Lkmanae 478 (629) T KOG2300 450 VYGLFAFKQNDLNEAKRFLRETLKMANAE 478 (629) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHCCHH T ss_conf 99999998552899999999987540402 No 218 >TIGR01477 RIFIN variant surface antigen, rifin family; InterPro: IPR006373 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the rifin family of variant surface antigens, encoded by the rif (short for "repetetive interspersed family") genes. This is the largest variant surface antigen family in P. falciparum, with roughly 150 genes present in the genome of this parasite . The function of these proteins is not yet known, but rifins are strongly immunogenic and high levels of anti-rifin antibodies are correlated with rapid parasite clearance and asymptomatic infections . . Probab=83.17 E-value=2.1 Score=21.14 Aligned_cols=11 Identities=18% Similarity=0.323 Sum_probs=4.5 Q ss_pred HHHHHHHHHHH Q ss_conf 99999999999 Q gi|254781033|r 84 GYKALYTGLMS 94 (492) Q Consensus 84 ~~~al~~gl~a 94 (492) ...++..|.-| T Consensus 199 ~~~a~~aG~~A 209 (395) T TIGR01477 199 AAAAIAAGIKA 209 (395) T ss_pred HHHHHHHHHHH T ss_conf 99999999999 No 219 >KOG3616 consensus Probab=82.42 E-value=3.1 Score=19.80 Aligned_cols=22 Identities=14% Similarity=0.043 Sum_probs=8.1 Q ss_pred HHHHHHHHHHHCCCHHHHHHHH Q ss_conf 8999999999879988999998 Q gi|254781033|r 327 SLVIVSKIALEMGSIDQAHAKA 348 (492) Q Consensus 327 ~~~~LA~~~~~~g~~~~Ar~~l 348 (492) .+.-+|.-+-..|+...|..++ T Consensus 884 t~~~f~~e~e~~g~lkaae~~f 905 (1636) T KOG3616 884 THKHFAKELEAEGDLKAAEEHF 905 (1636) T ss_pred HHHHHHHHHHHCCCHHHHHHHH T ss_conf 8999888877535756899988 No 220 >pfam10602 RPN7 26S proteasome subunit RPN7. RPN7 (known as the non ATPase regulatory subunit 6 in higher eukaryotes) is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae to be required for structural integrity. The 26S proteasome is is involved in the ATP-dependent degradation of ubiquitinated proteins. Probab=80.44 E-value=3.6 Score=19.31 Aligned_cols=99 Identities=16% Similarity=0.176 Sum_probs=55.5 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCC--HHHHHHHHHH Q ss_conf 999865455215886999999998884212698---99999999999525745699999999950231--0357899999 Q gi|254781033|r 156 AVYSLYFESCRIGDLNSAQRYATKALDISPDAP---WVTEAVVQQYVLAKEWSRAITFLNQKKKNAKE--WNRNRAILLI 230 (492) Q Consensus 156 ~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~---~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~--~~~~~a~l~~ 230 (492) |+..++..+.+-||.+.|++.+.++.+.-.... .....+.++....+||.-....+.++...... ...++.-+.. T Consensus 38 ~~~~l~~~y~~iGd~~~A~k~y~~~~~~~ts~~~~id~~l~~irv~i~~~d~~~v~~~i~ka~~~~e~ggDw~r~nrlk~ 117 (174) T pfam10602 38 ANLDLGEHYAKIGDLENALKAYSRAREYCTSLGHKIDMCLNLIRVGIFFNDWDHVSKYIEKAKSLIEKGGDWERRNRLKV 117 (174) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHH T ss_conf 99998999999667999999999988317884599999999999999970099999999999856514455878988999 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 999999966998999999987543 Q gi|254781033|r 231 ARSLENADKGDMIASYHDAIESLK 254 (492) Q Consensus 231 ~~a~~~~~~~~~~~A~~~~~~a~~ 254 (492) ..+...+..++..+|-..+.+... T Consensus 118 y~Gl~~l~~r~f~~AA~~fl~~~~ 141 (174) T pfam10602 118 YEGLALLAIRDFKEAAKLFLDSLS 141 (174) T ss_pred HHHHHHHHHCHHHHHHHHHHCCCC T ss_conf 999999971649999999963767 No 221 >KOG2422 consensus Probab=80.01 E-value=3.7 Score=19.21 Aligned_cols=30 Identities=10% Similarity=0.061 Sum_probs=13.6 Q ss_pred CCCHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 269899999999999525745699999999 Q gi|254781033|r 185 PDAPWVTEAVVQQYVLAKEWSRAITFLNQK 214 (492) Q Consensus 185 P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~ 214 (492) |-|...+..+.++...+||.+-+.+++++. T Consensus 281 PYHvdsLLqva~~~r~qgD~e~aadLieR~ 310 (665) T KOG2422 281 PYHVDSLLQVADIFRFQGDREMAADLIERG 310 (665) T ss_pred CCCHHHHHHHHHHHHHHCCHHHHHHHHHHH T ss_conf 840668999999998712245678899989 No 222 >pfam07214 DUF1418 Protein of unknown function (DUF1418). This family consists of several hypothetical Enterobacterial proteins of around 100 residues in length. Members of this family are often described as YbjC. In E. coli the ybjC gene is located downstream of nfsA (which encodes the major oxygen-insensitive nitroreductase). It is thought that nfsA and ybjC form an operon an its promoter is a class I SoxS-dependent promoter. The function of this family is unknown. Probab=79.96 E-value=3.8 Score=19.20 Aligned_cols=64 Identities=9% Similarity=0.110 Sum_probs=32.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEE-EECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999997348984899-98997662218999999999999999999999998 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSI-TWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFF 65 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i-~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i 65 (492) |=|.++++=++..+.+...|+.-|. |+++ .+.+..--.-..+|+.+.+.+=.++.++||....+ T Consensus 7 mPK~VlilE~lGmiLLv~a~LsiNd-yl~lP~~l~tp~a~i~MIflGi~lmiPAAv~ivWR~a~gl 71 (96) T pfam07214 7 LPKSVLILEILGMILLVLAYLSIND-YLSLPGPIASPTAAILMIFLGVGLMIPAAVVVVWRVAQGF 71 (96) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCC-CEECCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7703699999999999999999714-2037522158058999999999998899999999999975 No 223 >TIGR02976 phageshock_pspB phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages . Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one .. Probab=79.08 E-value=3.5 Score=19.41 Aligned_cols=59 Identities=17% Similarity=0.190 Sum_probs=30.1 Q ss_pred EHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHC Q ss_conf 2189999999999999999999999984182889999999999999999999999972100478799998630 Q gi|254781033|r 39 TSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYVS 111 (492) Q Consensus 39 ~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~~gl~al~~Gd~~~A~k~~~~a~ 111 (492) |+.+++++-++++++++.++|.++.+. ++.|.-..+++.-.+...-=...|.|+-.+.. T Consensus 1 M~~~~~~~P~i~f~~fv~~~wl~lHy~--------------~~~~~~~~l~~~d~~~L~~L~~~A~Rl~~Ri~ 59 (75) T TIGR02976 1 MSIFFLAIPLIIFVIFVALLWLLLHYR--------------SKRKTAASLSTDDLQLLQELYAKADRLEERID 59 (75) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHH--------------CCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 915899999999999999999999852--------------36640011157999999999999989999999 No 224 >PRK13341 recombination factor protein RarA/unknown domain fusion protein; Reviewed Probab=78.61 E-value=4.1 Score=18.90 Aligned_cols=15 Identities=20% Similarity=0.392 Sum_probs=7.9 Q ss_pred CCHHHHHHHHHHHHC Q ss_conf 998999999999962 Q gi|254781033|r 373 HNTDKILYWTQSALH 387 (492) Q Consensus 373 ~d~~~ar~wl~rA~~ 387 (492) -++.....|+.|.+. T Consensus 457 ~~~~~~~~w~~r~~~ 471 (726) T PRK13341 457 PDDPELERWLQRQLG 471 (726) T ss_pred CCCHHHHHHHHHHHC T ss_conf 996578999999873 No 225 >TIGR03362 VI_chp_7 type VI secretion-associated protein, VC_A0119 family. This protein family is one of two related families in type VI secretion systems that contain an ImpA-related N-terminal domain (pfam06812). Probab=78.03 E-value=4.3 Score=18.77 Aligned_cols=150 Identities=13% Similarity=0.064 Sum_probs=73.1 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH--CCCHHHHHHHHHHCCCCH- Q ss_conf 99999996699899999998754313352799999999999739947899999999983--289899999998507895- Q gi|254781033|r 231 ARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKV--NPHPEIANIYTHLLSENT- 307 (492) Q Consensus 231 ~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~--~p~p~La~a~~~~~~~~~- 307 (492) ......+..++..+.++.+.+.+..+|-........+..+.+.| +..+...|+..... .-.|.|... .+..|.+ T Consensus 104 ~~y~~~~~q~~~~~Ll~~~E~sl~~~pfWLDgq~~~a~al~~LG-~~~~a~aI~~el~~fL~RlP~L~~L--~F~DGtPF 180 (301) T TIGR03362 104 ADYQELLAQADWAALLQRVEQSLSLAPFWLDGQRLSAQALERLG-YAAVAQAIRDELAAFLERLPGLLEL--KFSDGTPF 180 (301) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC-CHHHHHHHHHHHHHHHHHCCCHHHH--CCCCCCCC T ss_conf 99999997799899999999998827247359999999998768-5999999999999999867760430--23799987 Q ss_pred --HHHHHHHHHHHHH-------------CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCH--HHH--HHHHHHH Q ss_conf --8999999999982-------------68983689999999998799889999988786609998--999--9999999 Q gi|254781033|r 308 --VGKLKRALRLEEI-------------NKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRK--EIF--LLLAQIE 368 (492) Q Consensus 308 --~~al~~~~~l~~~-------------~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~--~~~--~llA~ie 368 (492) ..-...+...... ..++......-|..+.+.+..+.|...++..+...++. +.+ ++||++. T Consensus 181 ad~~T~~WL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~eA~~l~~~~gl~~Al~~L~~~l~~~~s~R~rf~~rL~~ArL~ 260 (301) T TIGR03362 181 ADDETRAWLAQHATRSNAASVAPVAEVGEESDWEELREEARALAAEGGLEAALQRLQQRLAQAREPRERFHWRLLLARLL 260 (301) T ss_pred CCHHHHHHHHHCCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 98889999986034677776677777776644899999999999747999999999975324899799999999999999 Q ss_pred HHHCCCHHHHHHHHHH Q ss_conf 9857998999999999 Q gi|254781033|r 369 QANSHNTDKILYWTQS 384 (492) Q Consensus 369 ~~e~~d~~~ar~wl~r 384 (492) +.. |-..-+..-+.. T Consensus 261 ~~~-g~~~lA~~~l~~ 275 (301) T TIGR03362 261 EQA-GKAELAQQLYAA 275 (301) T ss_pred HHC-CCHHHHHHHHHH T ss_conf 986-987999999999 No 226 >PRK00750 lysK lysyl-tRNA synthetase; Reviewed Probab=77.93 E-value=2.1 Score=21.24 Aligned_cols=10 Identities=20% Similarity=0.341 Sum_probs=4.2 Q ss_pred CCCCHHHHHH Q ss_conf 1588699999 Q gi|254781033|r 166 RIGDLNSAQR 175 (492) Q Consensus 166 ~~gd~~~A~~ 175 (492) +.|.+++++. T Consensus 144 ~sG~f~~~i~ 153 (513) T PRK00750 144 KSGRFDEALL 153 (513) T ss_pred HCCCHHHHHH T ss_conf 4486699999 No 227 >COG0790 FOG: TPR repeat, SEL1 subfamily [General function prediction only] Probab=77.24 E-value=4.5 Score=18.61 Aligned_cols=47 Identities=11% Similarity=0.066 Sum_probs=19.6 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH----CCCCHHHHHHHHHHHHC Q ss_conf 86999999998884212698999999999995----25745699999999950 Q gi|254781033|r 169 DLNSAQRYATKALDISPDAPWVTEAVVQQYVL----AKEWSRAITFLNQKKKN 217 (492) Q Consensus 169 d~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~----~gdw~~A~~~l~~~~~~ 217 (492) |...|..+++ .....+++.+...|+.+|.. ..|..+|...+.++.+. T Consensus 92 ~~~~A~~~~~--~~a~~g~~~a~~~lg~~~~~G~gv~~d~~~A~~~~~~Aa~~ 142 (292) T COG0790 92 DKTKAADWYR--CAAADGLAEALFNLGLMYANGRGVPLDLVKALKYYEKAAKL 142 (292) T ss_pred CHHHHHHHHH--HHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHC T ss_conf 3899999999--98736979999999999855899745899999999999984 No 228 >pfam11460 DUF3007 Protein of unknown function (DUF3007). This is a family of uncharacterized proteins found in bacteria and eukaryotes. Probab=76.96 E-value=4.2 Score=18.83 Aligned_cols=39 Identities=21% Similarity=0.169 Sum_probs=21.2 Q ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999998418288999999999999999999 Q gi|254781033|r 48 LYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALY 89 (492) Q Consensus 48 ~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~ 89 (492) ++++.++.|+.-.++|.+.+. +...-++++.++.|++++ T Consensus 41 llv~~vv~Wv~SYlfRV~~g~---MTy~~Qrk~Ye~a~~~~~ 79 (104) T pfam11460 41 LLVLGVVGWTGSYLFRVVTGK---MTYMQQRRDYREAYDALT 79 (104) T ss_pred HHHHHHHHHHHHHHHHHHCCC---CCHHHHHHHHHHHHHHHH T ss_conf 999999999877876450687---758999999999999876 No 229 >pfam08424 DUF1740 Protein of unknown function (DUF1740). This is a family of eukaryotic proteins of unknown function. Probab=76.85 E-value=4.6 Score=18.54 Aligned_cols=108 Identities=13% Similarity=-0.003 Sum_probs=55.2 Q ss_pred HHHHHHH-CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC-HHHHHHHHHHCCCC-----HHHHHHHHHHHHHH Q ss_conf 9987543-13352799999999999739947899999999983289-89999999850789-----58999999999982 Q gi|254781033|r 248 DAIESLK-LCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPH-PEIANIYTHLLSEN-----TVGKLKRALRLEEI 320 (492) Q Consensus 248 ~~~~a~~-~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~-p~La~a~~~~~~~~-----~~~al~~~~~l~~~ 320 (492) .+.+|++ .+|+.......+-+...+.-+..+-.+-+++....+|+ +.|...|.....++ ..+-.+.|.+.++. T Consensus 55 ilEkAL~~~np~~~~Lll~~l~~~~~~w~~~~l~~~We~~l~~~p~~~~LW~~yL~~~q~~~~~f~~~~v~~~y~~cl~~ 134 (231) T pfam08424 55 ILEKALKEHNPDSERLLLGLLEEGEKVWDTDELLSRWEKVLKQNPGSPSLWRKYLDFRQGDFANFSYSKVRKTYEKCLRA 134 (231) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHH T ss_conf 99999998589978999999999998578799999999999988997599999999986065467088899999999999 Q ss_pred C-----------CCC-------HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 6-----------898-------3689999999998799889999988786609 Q gi|254781033|r 321 N-----------KES-------VESLVIVSKIALEMGSIDQAHAKAMLAMKIA 355 (492) Q Consensus 321 ~-----------P~~-------~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~ 355 (492) . ++. ....+-+.....++|-.+.|-..++..++.+ T Consensus 135 l~~~~~~~~~~~~~~~~~e~~~l~lflr~~~flrqaG~~E~Aval~QA~lE~n 187 (231) T pfam08424 135 LKAAKLKEITSVPAVPELEEVMLYLFLRLCRFLRQAGYTELAVALWQALLEFN 187 (231) T ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH T ss_conf 87500244324555302789999999999999998596899999999999882 No 230 >KOG4318 consensus Probab=76.43 E-value=4.7 Score=18.46 Aligned_cols=23 Identities=13% Similarity=-0.145 Sum_probs=9.2 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99999973994789999999998 Q gi|254781033|r 266 AAKSLISQNKKRKAEVILEKIWK 288 (492) Q Consensus 266 ~a~~l~~~g~~~~A~~~le~a~~ 288 (492) ..+++.+.+....+.++++.-|+ T Consensus 534 l~dLL~r~~~l~dl~tiL~e~ks 556 (1088) T KOG4318 534 LQDLLQRLAILYDLSTILYEDKS 556 (1088) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999876889999999763167 No 231 >pfam01921 tRNA-synt_1f tRNA synthetases class I (K). This family includes only lysyl tRNA synthetases from prokaryotes. Probab=75.10 E-value=2.7 Score=20.31 Aligned_cols=56 Identities=18% Similarity=0.197 Sum_probs=31.7 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEECCCCCCCHHHCCCCC Q ss_conf 99999987998899999887866099989999999999985799899999999996288999527569895542422483 Q gi|254781033|r 331 VSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWISDDGYLSSVWLPLSP 410 (492) Q Consensus 331 LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~c~~g~~~~~W~~~c~ 410 (492) -+.-++..|.|+++.... ++ +..++++-+.+-+..++...|.. +.|+|+ T Consensus 127 s~te~Y~sG~~~~~i~~~---l~--------------------~~~~I~~Il~~~~g~e~~~~y~P--------f~piC~ 175 (355) T pfam01921 127 SATELYKSGLYDEAILIA---LE--------------------NRDEIMEILLPYRGEERQETYSP--------FLPICP 175 (355) T ss_pred EHHHHHHCCCHHHHHHHH---HH--------------------HHHHHHHHHHHHCCCCCCCCEEE--------EEEECC T ss_conf 678855246559999999---98--------------------58999999998528666887368--------988858 Q ss_pred CCCCCCC Q ss_conf 7687364 Q gi|254781033|r 411 ISKTLCY 417 (492) Q Consensus 411 ~~~~~~~ 417 (492) +||+.++ T Consensus 176 ~cGrv~~ 182 (355) T pfam01921 176 KCGRVLT 182 (355) T ss_pred CCCCEEE T ss_conf 8893725 No 232 >pfam12273 RCR Chitin synthesis regulation, resistance to Congo red. RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5. Probab=74.18 E-value=2.7 Score=20.26 Aligned_cols=16 Identities=19% Similarity=0.397 Sum_probs=5.7 Q ss_pred HHHHHHHHHH-HHHHHH Q ss_conf 9999999999-999999 Q gi|254781033|r 79 RNYDKGYKAL-YTGLMS 94 (492) Q Consensus 79 rr~~k~~~al-~~gl~a 94 (492) ||+++|.+=+ -.|+|+ T Consensus 25 RRrr~G~~Pi~GT~Wma 41 (124) T pfam12273 25 RRRRRGLQPIYGTAWLA 41 (124) T ss_pred HHHHCCCCCCCCCCCCC T ss_conf 99875888876776567 No 233 >KOG2114 consensus Probab=73.76 E-value=5.4 Score=17.97 Aligned_cols=18 Identities=6% Similarity=-0.184 Sum_probs=10.4 Q ss_pred HHCCHHHHHHHHHHHHCC Q ss_conf 721004787999986301 Q gi|254781033|r 95 IAAHNIPLARKMHSYVSQ 112 (492) Q Consensus 95 l~~Gd~~~A~k~~~~a~~ 112 (492) ...||++.|.-.+.+... T Consensus 379 y~Kgdf~~A~~qYI~tI~ 396 (933) T KOG2114 379 YGKGDFDEATDQYIETIG 396 (933) T ss_pred HHCCCHHHHHHHHHHHCC T ss_conf 744777889999998706 No 234 >KOG1538 consensus Probab=73.60 E-value=5.4 Score=17.94 Aligned_cols=116 Identities=17% Similarity=0.143 Sum_probs=53.4 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCCC------------CCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHH---- Q ss_conf 99997210047879999863013466------------61589999987532222100016899999972688505---- Q gi|254781033|r 91 GLMSIAAHNIPLARKMHSYVSQQHTF------------HNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATRE---- 154 (492) Q Consensus 91 gl~al~~Gd~~~A~k~~~~a~~~~~~------------~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~---- 154 (492) ++-|+.+-+++.|++.+.+.+..+.+ ...|-. +|.|.....+|+..+|.+.|+.--.....-. T Consensus 592 A~~ALeAL~f~~ARkAY~rVRdl~~L~li~EL~~~k~rge~P~~-iLlA~~~Ay~gKF~EAAklFk~~G~enRAlEmyTD 670 (1081) T KOG1538 592 AMEALEALDFETARKAYIRVRDLRYLELISELEERKKRGETPND-LLLADVFAYQGKFHEAAKLFKRSGHENRALEMYTD 670 (1081) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHH-HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 99998753357788889987053899999999988764898078-89898987755689999999974851267898877 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCC--CCHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 7999865455215886999999998884212--6989999999999952574569999 Q gi|254781033|r 155 FAVYSLYFESCRIGDLNSAQRYATKALDISP--DAPWVTEAVVQQYVLAKEWSRAITF 210 (492) Q Consensus 155 ~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P--~~~~a~~~L~~l~~~~gdw~~A~~~ 210 (492) +-..-++.+.+..|+.++-..+.++-.+=.- +.| ..-+++....|+-++|..+ T Consensus 671 lRMFD~aQE~~~~g~~~eKKmL~RKRA~WAr~~keP---kaAAEmLiSaGe~~KAi~i 725 (1081) T KOG1538 671 LRMFDYAQEFLGSGDPKEKKMLIRKRADWARNIKEP---KAAAEMLISAGEHVKAIEI 725 (1081) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCC---HHHHHHHHCCCCHHHHHHH T ss_conf 777888999863499178899999878876523784---6789877516741446464 No 235 >cd01675 RNR_III Class III ribonucleotide reductase. Ribonucleotide reductase (RNR) catalyzes the reductive synthesis of deoxyribonucleotides from their corresponding ribonucleotides. It provides the precursors necessary for DNA synthesis. RNRs are separated into three classes based on their metallocofactor usage. Class I RNRs, found in eukaryotes, bacteria, and bacteriophage, use a diiron-tyrosyl radical. Class II RNRs, found in bacteria, bacteriophage, algae and archaea, use coenzyme B12 (adenosylcobalamin, AdoCbl). Class III RNRs, found in strict or facultative anaerobic bacteria, bacteriophage, and archaea, use an FeS cluster and S-adenosylmethionine to generate a glycyl radical. Many organisms have more than one class of RNR present in their genomes. All three RNRs have a ten-stranded alpha-beta barrel domain that is structurally similar to the domain of PFL (pyruvate formate lyase). The class III enzyme from phage T4 consists of two subunits, this model covers the larger subunit w Probab=71.68 E-value=2.3 Score=20.86 Aligned_cols=50 Identities=14% Similarity=-0.043 Sum_probs=30.7 Q ss_pred CCHHHHHHHHHHHHCCCCC------CCEECC-CCCCCHHHCCCCCCCCCCCCCCCCC Q ss_conf 9989999999999628899------952756-9895542422483768736400367 Q gi|254781033|r 373 HNTDKILYWTQSALHAMPD------PLWISD-DGYLSSVWLPLSPISKTLCYFEWKI 422 (492) Q Consensus 373 ~d~~~ar~wl~rA~~a~~d------p~W~c~-~g~~~~~W~~~c~~~~~~~~~~W~~ 422 (492) .|..+...-..++..--.. |.=+|. ||+....=.-.||.||+=|.-.|++ T Consensus 492 ~n~ea~~~lv~~~~~~~~~y~~~ntp~~~C~~CG~~~~~~~~~CP~CG~~~~~~~~R 548 (555) T cd01675 492 PNPEALEALVKKAAKRGVIYFGINTPIDICNDCGYIGEGEGFKCPKCGSEDVEVISR 548 (555) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEECCCEECCCCCCCCCCCCCCCCCCCCCCCEEEEE T ss_conf 999999999999998598349982288477899982756787690983967747998 No 236 >PRK11006 phoR phosphate regulon sensor protein; Provisional Probab=71.22 E-value=6.1 Score=17.55 Aligned_cols=52 Identities=15% Similarity=0.184 Sum_probs=27.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 799999999999999999734898489998997662218999999999999999999999998418 Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSC 68 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~ 68 (492) +.++=++++++.+++..|+.+.. +..+++.+++.++.-+|-++++..|+|.. T Consensus 9 ~~~~~l~~~~~~~~~~g~~~~~~--------------~~~l~~~~~~~l~~~~~~~~~l~~wl~~~ 60 (431) T PRK11006 9 RLVLELALFCLPALILGAFFGYL--------------PWFLLASVTGLLIWHFWNLLRLSWWLWVD 60 (431) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999999999998289--------------99999999999999999999999997158 No 237 >TIGR02264 gmx_para_CXXCG Myxococcus xanthus double-CXXCG motif paralogous family; InterPro: IPR011750 This entry consists of at least 10 paralogous proteins from Myxococcus xanthus that lack detectable sequence similarity to any other protein family. An imperfectly conserved CXXCG motif, a probable binding site, appears twice in the multiple sequence alignment.. Probab=70.10 E-value=2.4 Score=20.77 Aligned_cols=38 Identities=11% Similarity=-0.085 Sum_probs=25.7 Q ss_pred CHHHCCCCCCCCCCCCCCCC-C-CCCCCCCCCCCCCCCHH Q ss_conf 54242248376873640036-7-87554456665421000 Q gi|254781033|r 402 SSVWLPLSPISKTLCYFEWK-I-PTKSPEYISSENINFSL 439 (492) Q Consensus 402 ~~~W~~~c~~~~~~~~~~W~-~-p~~~~~~~~~~~~~~~~ 439 (492) ..+|+|-||.||+-....|- . +.-+..++|.+++.+-+ T Consensus 175 p~dr~PpCp~CGs~~~~~~Ps~~~~Lda~sLP~~vDlFRl 214 (247) T TIGR02264 175 PPDREPPCPTCGSSESEGLPSDTYVLDAASLPVDVDLFRL 214 (247) T ss_pred CCCCCCCCCCCCCHHHCCCCCCCHHHHHHCCCCCCEEEEE T ss_conf 5467688754452120478874112122128864105662 No 238 >pfam11808 DUF3329 Domain of unknown function (DUF3329). This family of proteins are functionally uncharacterized. This family is only found in bacteria. Probab=70.03 E-value=6.5 Score=17.37 Aligned_cols=54 Identities=15% Similarity=0.256 Sum_probs=30.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 79999999999999999973489848999899766221899999999999999999999999841828 Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPA 70 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~ 70 (492) +.+.-++++++.+++..|+.+++ ....++.+++.++.-++-+.++.++++..+. T Consensus 7 ~~l~~l~~~l~~~~lvG~~~g~~--------------~~~L~~~l~~~L~w~~~~l~rL~~WL~~~~~ 60 (90) T pfam11808 7 RLLRRLALFLLPALLLGLLFGYL--------------WWALLLGLLALLVWHYYQLYRLHDWLWQDRS 60 (90) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 99999999999999999999559--------------9999999999999999999999999767888 No 239 >KOG2041 consensus Probab=70.00 E-value=6.5 Score=17.36 Aligned_cols=26 Identities=12% Similarity=-0.067 Sum_probs=17.4 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHCC Q ss_conf 99999999721004787999986301 Q gi|254781033|r 87 ALYTGLMSIAAHNIPLARKMHSYVSQ 112 (492) Q Consensus 87 al~~gl~al~~Gd~~~A~k~~~~a~~ 112 (492) .+.++-+...-|+++.|+|.+..+.+ T Consensus 737 ~~q~aei~~~~g~feeaek~yld~dr 762 (1189) T KOG2041 737 EQQRAEISAFYGEFEEAEKLYLDADR 762 (1189) T ss_pred HHHHHHHHHHHCCHHHHHHHHHCCCH T ss_conf 77767676540105676665530213 No 240 >KOG2846 consensus Probab=69.48 E-value=6.6 Score=17.28 Aligned_cols=25 Identities=12% Similarity=-0.087 Sum_probs=15.7 Q ss_pred CEECCCCCCCH------H---HCCCCCCCCCCCC Q ss_conf 52756989554------2---4224837687364 Q gi|254781033|r 393 LWISDDGYLSS------V---WLPLSPISKTLCY 417 (492) Q Consensus 393 ~W~c~~g~~~~------~---W~~~c~~~~~~~~ 417 (492) .-+|..|+.|. + =...|++|+.++. T Consensus 220 ALIC~~C~~HNGla~~ee~~yi~F~C~~Cn~LN~ 253 (328) T KOG2846 220 ALICSQCHHHNGLARKEEYEYITFRCPHCNALNP 253 (328) T ss_pred HHCCHHHCCCCCCCCHHHCCCEEEECCCCCCCCC T ss_conf 4521210201586776556762898755545587 No 241 >pfam09670 Cas_Cas02710 CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia). Probab=69.42 E-value=6.6 Score=17.27 Aligned_cols=25 Identities=12% Similarity=0.101 Sum_probs=15.0 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 9999973994789999999998328 Q gi|254781033|r 267 AKSLISQNKKRKAEVILEKIWKVNP 291 (492) Q Consensus 267 a~~l~~~g~~~~A~~~le~a~~~~p 291 (492) |.--..+|+++.|...+-++++..- T Consensus 248 A~RRa~~gryDDAvaRlYRaLEL~~ 272 (379) T pfam09670 248 AERRAAEGRYDDAAARLYRALELLS 272 (379) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 9988865678899999999999999 No 242 >PRK11619 lytic murein transglycosylase; Provisional Probab=69.02 E-value=6.7 Score=17.21 Aligned_cols=42 Identities=10% Similarity=0.037 Sum_probs=20.6 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 215886999999998884212698999999999995257456 Q gi|254781033|r 165 CRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSR 206 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~ 206 (492) +..|+.+.|.+.+++.=-...+.+.+-..|+....+.|.-.. T Consensus 140 l~tG~~~~A~~~a~~LWl~G~S~P~aCDpLF~~w~~aG~lt~ 181 (645) T PRK11619 140 WATGQSEEAWQGAKELWLTGKSQPNACDKLFSVWRASGKQDP 181 (645) T ss_pred HHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCH T ss_conf 865985789999998873799887157999999998699999 No 243 >cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity. Probab=68.25 E-value=2.5 Score=20.54 Aligned_cols=25 Identities=20% Similarity=0.343 Sum_probs=17.8 Q ss_pred CCEECC-CCCCCHHH--CCCCCCCCCCC Q ss_conf 952756-98955424--22483768736 Q gi|254781033|r 392 PLWISD-DGYLSSVW--LPLSPISKTLC 416 (492) Q Consensus 392 p~W~c~-~g~~~~~W--~~~c~~~~~~~ 416 (492) ..|+|. |||++..= --+||.||.-. T Consensus 1 k~w~C~vCGyi~~G~~aP~~CPvC~~~~ 28 (34) T cd00729 1 KVWVCPVCGYIHEGEEAPEKCPICGAPK 28 (34) T ss_pred CEEECCCCCCEEECCCCCCCCCCCCCCH T ss_conf 9688898998868887987586999967 No 244 >cd00674 LysRS_core_class_I This is the catalytic core domain of lysyl tRNA synthetase (LysRS). This class I enzyme is a monomer, which aminoacylates the 2'-OH of the nucleotide at the 3' of the appropriate tRNA. The core domain is based on the Rossman fold and is responsible for the ATP-dependent formation of the enzyme bound aminoacyl-adenylate. It contains the characteristic class I HIGH and KMSKS motifs, which are involved in ATP binding. The class I LysRS is found only in archaea and some bacteria and has evolved separately from class II LysRS, as the two do not share structural or sequence similarity. Probab=67.61 E-value=5.3 Score=18.05 Aligned_cols=59 Identities=15% Similarity=-0.035 Sum_probs=25.8 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHCCCCHHH-HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 9999999998328989999999850789589-9999999998268983689999999998799889999 Q gi|254781033|r 279 AEVILEKIWKVNPHPEIANIYTHLLSENTVG-KLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHA 346 (492) Q Consensus 279 A~~~le~a~~~~p~p~La~a~~~~~~~~~~~-al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~ 346 (492) ..+.+-+-+..-|+|.= ...+..+ -.+.+...+..--=+++ ..-+.-+++.|.|+++.. T Consensus 81 ~~~ylg~PL~~IPdP~G-------~~~SyaeHf~~~f~~~L~~~gi~~e--~~s~te~Y~sG~f~~~i~ 140 (354) T cd00674 81 LEQYIGMPLSSVPDPFG-------CHESYAEHFNAPFEESLDRFGIEVE--FISASEMYKSGLYDEQIK 140 (354) T ss_pred HHHHCCCCCEECCCCCC-------CCCCHHHHHHHHHHHHHHHCCCEEE--EEEHHHCCCCCCCHHHHH T ss_conf 99862996300689777-------7545999999999999998498289--988566022576279999 No 245 >COG1230 CzcD Co/Zn/Cd efflux system component [Inorganic ion transport and metabolism] Probab=67.30 E-value=7.3 Score=16.96 Aligned_cols=20 Identities=20% Similarity=0.376 Sum_probs=6.8 Q ss_pred HHHHCCCCEEEEEECCEEEE Q ss_conf 99734898489998997662 Q gi|254781033|r 19 IIVSHYPEDVSITWGNRLYR 38 (492) Q Consensus 19 ~~l~~~~G~v~i~~~~~~ie 38 (492) .+++.++.....+||-+++| T Consensus 71 ~~~a~r~~~~~~TfGy~R~e 90 (296) T COG1230 71 IKLARRPATKRFTFGYKRLE 90 (296) T ss_pred HHHHCCCCCCCCCCCHHHHH T ss_conf 99841777787773076899 No 246 >COG1545 Predicted nucleic-acid-binding protein containing a Zn-ribbon [General function prediction only] Probab=66.80 E-value=3.1 Score=19.88 Aligned_cols=50 Identities=10% Similarity=-0.148 Sum_probs=36.0 Q ss_pred CHHHHHHHHHHHHCCCCCCCEECC-CCCCCHHHCCCCCCCCCCCCCCCCCC Q ss_conf 989999999999628899952756-98955424224837687364003678 Q gi|254781033|r 374 NTDKILYWTQSALHAMPDPLWISD-DGYLSSVWLPLSPISKTLCYFEWKIP 423 (492) Q Consensus 374 d~~~ar~wl~rA~~a~~dp~W~c~-~g~~~~~W~~~c~~~~~~~~~~W~~p 423 (492) .....+.|.-+++.-..=..=.|. ||+++-==+..||.|++-+.++|-.= T Consensus 10 ~~~~~~~~f~~~l~~~kl~g~kC~~CG~v~~PPr~~Cp~C~~~~~~E~vel 60 (140) T COG1545 10 TPGEARSKFFKGLKEGKLLGTKCKKCGRVYFPPRAYCPKCGSETELEWVEL 60 (140) T ss_pred CCCCHHHHHHHHHHHCEEEEEECCCCCCEECCCCCCCCCCCCCCCEEEEEE T ss_conf 354115677640131708786837898188198233878899784479993 No 247 >PRK13342 recombination factor protein RarA; Reviewed Probab=66.71 E-value=7.4 Score=16.88 Aligned_cols=18 Identities=17% Similarity=0.209 Sum_probs=10.2 Q ss_pred HHHCCCCHHHHHHHHHHH Q ss_conf 995257456999999999 Q gi|254781033|r 198 YVLAKEWSRAITFLNQKK 215 (492) Q Consensus 198 ~~~~gdw~~A~~~l~~~~ 215 (492) ....||...++..|+.+. T Consensus 186 ~~s~GDaR~aLN~LE~a~ 203 (417) T PRK13342 186 RLADGDARRALNLLELAA 203 (417) T ss_pred HHCCCCHHHHHHHHHHHH T ss_conf 814985999999999998 No 248 >PRK00564 hypA hydrogenase nickel incorporation protein; Provisional Probab=65.71 E-value=3.1 Score=19.85 Aligned_cols=26 Identities=0% Similarity=-0.259 Sum_probs=18.9 Q ss_pred CCEECC-CCCC--CHHHCC-CCCCCCCCCC Q ss_conf 952756-9895--542422-4837687364 Q gi|254781033|r 392 PLWISD-DGYL--SSVWLP-LSPISKTLCY 417 (492) Q Consensus 392 p~W~c~-~g~~--~~~W~~-~c~~~~~~~~ 417 (492) +...|. ||+. ...+.. .||+||+.+. T Consensus 70 ~~~~C~~Cg~~f~~~~~~~~~CP~Cgs~~~ 99 (117) T PRK00564 70 VELECKDCSHVFKPNALDYGVCEKCHSKNV 99 (117) T ss_pred CEEECCCCCCEEECCCCCCCCCCCCCCCCC T ss_conf 789910089988227740685909889997 No 249 >COG5243 HRD1 HRD ubiquitin ligase complex, ER membrane component [Posttranslational modification, protein turnover, chaperones] Probab=65.41 E-value=7.8 Score=16.70 Aligned_cols=66 Identities=17% Similarity=0.193 Sum_probs=46.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHH-HHCC Q ss_conf 79999999999999999973489848999899766221899999999999999999999999-8418 Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRF-FLSC 68 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~-i~~~ 68 (492) |++.+.+...++-++.++.....|--+.++.......|.+-..++..+++..+.++|.++.. ++|. T Consensus 2 r~i~y~l~~~Vl~~l~~~~~~~~s~t~ys~l~~t~~ls~~hi~i~~~~ill~~~l~~~~l~~llFGs 68 (491) T COG5243 2 RFILYVLASLVLFGLSVLLSLYSSATVYSALVMTSQLSPVHITIGLNVILLLFFLIANALKTLLFGS 68 (491) T ss_pred CEEEHHHHHHHHHHHHHHHHHHCCCEEEEEEEEEECCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 3221329999999999999871664345654553126750007999999999999999987774325 No 250 >PRK08579 anaerobic ribonucleoside triphosphate reductase; Provisional Probab=65.24 E-value=3 Score=19.95 Aligned_cols=50 Identities=8% Similarity=0.052 Sum_probs=26.9 Q ss_pred HCCCHHHHHHHHHHHHCCCC------CCCEECC-CCCCC--HHHCCCCCCCCCCCCCCCCC Q ss_conf 57998999999999962889------9952756-98955--42422483768736400367 Q gi|254781033|r 371 NSHNTDKILYWTQSALHAMP------DPLWISD-DGYLS--SVWLPLSPISKTLCYFEWKI 422 (492) Q Consensus 371 e~~d~~~ar~wl~rA~~a~~------dp~W~c~-~g~~~--~~W~~~c~~~~~~~~~~W~~ 422 (492) |..|..+......+..+-.- -|.=+|. |||.. ..| .||+||.=|.-.|++ T Consensus 538 E~~n~~a~~~lv~~~~~~~i~Y~~in~~~d~C~~CGy~~~g~~~--~CP~CG~~~~ev~~R 596 (623) T PRK08579 538 EEPDPEALAKLTKKIMKTKLVYWSYTPAITVCNRCGRSTTGLYT--RCPRCGSEDVEIWSR 596 (623) T ss_pred CCCCHHHHHHHHHHHHHCCCCEEEECCCCCCCCCCCCCCCCCCC--CCCCCCCCCCEEEEE T ss_conf 88999999999999996499579958888335889981777765--590996968727987 No 251 >PRK12775 putative trifunctional 2-polyprenylphenol hydroxylase/glutamate synthase subunit beta/ferritin domain-containing protein; Provisional Probab=64.17 E-value=3.1 Score=19.82 Aligned_cols=16 Identities=25% Similarity=0.092 Sum_probs=9.2 Q ss_pred HHHHHHHHHHHHHCCC Q ss_conf 7999999999997399 Q gi|254781033|r 260 IMASICAAKSLISQNK 275 (492) Q Consensus 260 ~~a~~~~a~~l~~~g~ 275 (492) -|+-+..|..|.+.|. T Consensus 441 GPAGLs~A~~Lar~G~ 456 (993) T PRK12775 441 GPAGLAAAADLTRYGV 456 (993) T ss_pred CHHHHHHHHHHHHCCC T ss_conf 7889999999997799 No 252 >PRK05452 anaerobic nitric oxide reductase flavorubredoxin; Provisional Probab=63.63 E-value=3.2 Score=19.79 Aligned_cols=25 Identities=16% Similarity=0.223 Sum_probs=16.1 Q ss_pred CCCCEECC-CCCCCHH--------------HC-----CCCCCCCC Q ss_conf 99952756-9895542--------------42-----24837687 Q gi|254781033|r 390 PDPLWISD-DGYLSSV--------------WL-----PLSPISKT 414 (492) Q Consensus 390 ~dp~W~c~-~g~~~~~--------------W~-----~~c~~~~~ 414 (492) ..+.|.|. ||.+++. |+ .+||.||- T Consensus 422 ~~~~~~c~vc~~~~~~~~~~~~~~~~~~~~~~~~p~~~~cp~c~~ 466 (479) T PRK05452 422 LGPRMQCSVCQWIYDPALGEPMQDVAPGTPWSEVPDNFLCPECSL 466 (479) T ss_pred CCCEEEEEEEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 654489977204613544787655578872002874436876689 No 253 >TIGR00739 yajC preprotein translocase, YajC subunit; InterPro: IPR003849 This entry describes proteins of unknown function.. Probab=63.51 E-value=8.5 Score=16.45 Aligned_cols=16 Identities=19% Similarity=0.349 Sum_probs=6.7 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254781033|r 43 VILSILYFFLFAWILL 58 (492) Q Consensus 43 ~~i~~~~~~~~~~~ll 58 (492) ..++.+++++++||++ T Consensus 4 ~~l~~lv~~~~~FYFl 19 (86) T TIGR00739 4 TTLLPLVLIFLIFYFL 19 (86) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 1589999999999997 No 254 >pfam10516 SHNi-TPR SHNi-TPR. SHNi-TPR family members contain a reiterated sequence motif that is an interrupted form of TPR repeat. Probab=63.43 E-value=8.3 Score=16.51 Aligned_cols=29 Identities=21% Similarity=0.274 Sum_probs=21.7 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 68999999999879988999998878660 Q gi|254781033|r 326 ESLVIVSKIALEMGSIDQAHAKAMLAMKI 354 (492) Q Consensus 326 e~~~~LA~~~~~~g~~~~Ar~~l~~al~~ 354 (492) +.+..||++.+..++|++|..-+++++++ T Consensus 2 d~~~~LgEislE~e~F~qA~~D~~~~L~l 30 (38) T pfam10516 2 DVYDLLGEISLENENFPQAVEDLRKALEL 30 (38) T ss_pred CHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 18887778654015607789999999999 No 255 >PRK07111 anaerobic ribonucleoside triphosphate reductase; Provisional Probab=63.14 E-value=3.4 Score=19.51 Aligned_cols=29 Identities=10% Similarity=0.038 Sum_probs=17.3 Q ss_pred CEECC-CCCCCHHHCCCCCCCCCCCCCCCCC Q ss_conf 52756-9895542422483768736400367 Q gi|254781033|r 393 LWISD-DGYLSSVWLPLSPISKTLCYFEWKI 422 (492) Q Consensus 393 ~W~c~-~g~~~~~W~~~c~~~~~~~~~~W~~ 422 (492) .=+|. |||.... ...||.||+=|.-.|++ T Consensus 647 ~~~C~~CGy~~~~-~~~CP~CG~~~~~~~~R 676 (703) T PRK07111 647 VDYCKNCGYYGVI-NDKCPKCGSANIQRIRR 676 (703) T ss_pred CCCCCCCCCCCCC-CCCCCCCCCCCCCEEEE T ss_conf 7657889982887-78291997988747987 No 256 >PRK13455 F0F1 ATP synthase subunit B; Provisional Probab=62.93 E-value=8.6 Score=16.37 Aligned_cols=51 Identities=25% Similarity=0.277 Sum_probs=24.1 Q ss_pred EEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 2218999999999999999999999998418288999999999999999999999997210047879999863 Q gi|254781033|r 38 RTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYV 110 (492) Q Consensus 38 e~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~~gl~al~~Gd~~~A~k~~~~a 110 (492) .+++|++|..++++++++|+. .|..+......|+.+ ..+|.+.|++.-..+ T Consensus 27 d~~FWv~IsFvif~~iL~~~~---------vp~~I~~~LD~R~~~-------------I~~dLdeAe~lreEA 77 (184) T PRK13455 27 NTDFIVTLAFLLFIGILVYFK---------VPGMIGGMLDKRAEG-------------IRSELEEARALREEA 77 (184) T ss_pred CCHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHH T ss_conf 954999999999999999995---------589999999999999-------------999999999999999 No 257 >pfam07721 TPR_4 Tetratricopeptide repeat. This Pfam entry includes tetratricopeptide-like repeats not detected by the pfam00515, pfam07719 and pfam07720 models. Probab=61.92 E-value=8 Score=16.63 Aligned_cols=23 Identities=26% Similarity=0.478 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 89999999998799889999988 Q gi|254781033|r 327 SLVIVSKIALEMGSIDQAHAKAM 349 (492) Q Consensus 327 ~~~~LA~~~~~~g~~~~Ar~~l~ 349 (492) ....+|+++...|++++|+..++ T Consensus 3 ~~~~la~~~~~~G~~~~A~~~l~ 25 (26) T pfam07721 3 ALLALARALLALGDLDEARALLE 25 (26) T ss_pred HHHHHHHHHHHHCCHHHHHHHHC T ss_conf 69999999998067788998861 No 258 >pfam05297 Herpes_LMP1 Herpesvirus latent membrane protein 1 (LMP1). This family consists of several latent membrane protein 1 or LMP1s mostly from Epstein-Barr virus. LMP1 of EBV is a 62-65 kDa plasma membrane protein possessing six membrane spanning regions, a short cytoplasmic N-terminus and a long cytoplasmic carboxy tail of 200 amino acids. EBV latent membrane protein 1 (LMP1) is essential for EBV-mediated transformation and has been associated with several cases of malignancies. EBV-like viruses in Cynomolgus monkeys (Macaca fascicularis) have been associated with high lymphoma rates in immunosuppressed monkeys. Probab=61.81 E-value=4.4 Score=18.67 Aligned_cols=23 Identities=17% Similarity=0.496 Sum_probs=9.4 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 99999999999999999734898 Q gi|254781033|r 4 LIRYFFVISLVICSFIIVSHYPE 26 (492) Q Consensus 4 ~l~~~~~i~~~~~~~~~l~~~~G 26 (492) +.+.+++++++...++.+.+-.| T Consensus 27 lallllllAlLfW~~IiMSd~T~ 49 (382) T pfam05297 27 LALLLLLLALLFWLYIVMSDWTG 49 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999999999730244 No 259 >PRK12496 hypothetical protein; Provisional Probab=61.60 E-value=4.6 Score=18.53 Aligned_cols=20 Identities=0% Similarity=-0.130 Sum_probs=12.3 Q ss_pred ECC-CCCCCHH--HCCCCCCCCC Q ss_conf 756-9895542--4224837687 Q gi|254781033|r 395 ISD-DGYLSSV--WLPLSPISKT 414 (492) Q Consensus 395 ~c~-~g~~~~~--W~~~c~~~~~ 414 (492) .|. ||..++. ..-.||.||+ T Consensus 131 ~C~gC~~~~~~~~~~~~C~~CGs 153 (166) T PRK12496 131 VCKGCKKMYPENYPDEVCDVCGS 153 (166) T ss_pred ECCCCCCCCCCCCCCCCCCCCCC T ss_conf 99987871778899883878898 No 260 >pfam11207 DUF2989 Protein of unknown function (DUF2989). Some members in this bacterial family of proteins are annotated as lipoproteins however this cannot be confirmed. Probab=61.60 E-value=9.1 Score=16.20 Aligned_cols=97 Identities=22% Similarity=0.155 Sum_probs=57.4 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC- Q ss_conf 899999999983289899999998507895899999999998268-983689999999998799889999988786609- Q gi|254781033|r 278 KAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRLEEINK-ESVESLVIVSKIALEMGSIDQAHAKAMLAMKIA- 355 (492) Q Consensus 278 ~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~~l~~~~P-~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~- 355 (492) +....|....+...||.++. -+=...+..+++++|-++..... ++++....||.-++ ..+-.++...+-++++.. T Consensus 95 ~~l~~L~~~tk~S~dP~llY--y~Wsr~~d~~A~~~fL~~E~~~~l~t~elq~aLAtyY~-k~d~~Kt~~Ll~~aL~l~~ 171 (203) T pfam11207 95 QELERLQEETKNSNDPYLLY--YHWSRFGDQAALRRFLALEGQPELETPELQYALATYYI-KRDPDKTIQLLYRALELYT 171 (203) T ss_pred HHHHHHHHHHCCCCCCCHHH--HHHHCCCCHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH-CCCHHHHHHHHHHHHHHCC T ss_conf 99999999863599910688--77621597999999998638998898999999999998-0488999999999998058 Q ss_pred ----CCHHHHHHHHHHHHHHCCCHHHH Q ss_conf ----99899999999999857998999 Q gi|254781033|r 356 ----PRKEIFLLLAQIEQANSHNTDKI 378 (492) Q Consensus 356 ----P~~~~~~llA~ie~~e~~d~~~a 378 (492) .+..++.-||-+...++ +...+ T Consensus 172 ~~d~~n~eil~sLas~~~~~~-~~~~A 197 (203) T pfam11207 172 PGDKFNPEILLSLASIYQKQK-NYEQA 197 (203) T ss_pred CCCCCCHHHHHHHHHHHHHHC-CCHHH T ss_conf 767579799999999998714-21343 No 261 >KOG1310 consensus Probab=61.38 E-value=9.2 Score=16.18 Aligned_cols=53 Identities=19% Similarity=0.113 Sum_probs=34.0 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC Q ss_conf 69989999999875431335279999999999973994789999999998328 Q gi|254781033|r 239 KGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNP 291 (492) Q Consensus 239 ~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p 291 (492) .++.-.|+...-+|++++|....++..+++.+.+.+++.+|.......-...| T Consensus 424 ~~d~~~AlrDch~Alrln~s~~kah~~la~aL~el~r~~eal~~~~alq~~~P 476 (758) T KOG1310 424 RGDSYLALRDCHVALRLNPSIQKAHFRLARALNELTRYLEALSCHWALQMSFP 476 (758) T ss_pred CCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 43078998767762567838888899999999997668775312898751693 No 262 >pfam11846 DUF3366 Domain of unknown function (DUF3366). This domain is functionally uncharacterized. This domain is found in bacteria. This presumed domain is about 200 amino acids in length. Probab=61.34 E-value=9.2 Score=16.17 Aligned_cols=43 Identities=16% Similarity=0.156 Sum_probs=25.0 Q ss_pred HHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCC Q ss_conf 99972688505799986545521588699999999888421269 Q gi|254781033|r 144 EMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDA 187 (492) Q Consensus 144 ~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~ 187 (492) ++.++..|... .++.++......||.++|.+..+++..+.|+. T Consensus 135 ~~~l~~~P~p~-~y~~la~~~~~~G~~~~A~~~~~~a~~lyP~~ 177 (193) T pfam11846 135 LKLLRYKPRPA-VYRRLALALALLGKPAEARDWMAQARYLYPLA 177 (193) T ss_pred HHHHHCCCCHH-HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC T ss_conf 99987499759-99999999998699999999999999839972 No 263 >PRK08475 F0F1 ATP synthase subunit B; Validated Probab=60.74 E-value=9.4 Score=16.10 Aligned_cols=18 Identities=17% Similarity=0.344 Sum_probs=11.8 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSF 18 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~ 18 (492) |.|+++|++++...++++ T Consensus 1 m~~~~~~~~~~~~~~~a~ 18 (170) T PRK08475 1 MMKLFFLILLLPLYAFGS 18 (170) T ss_pred CHHHHHHHHHHHHHHHHC T ss_conf 908999999999999863 No 264 >PRK10591 hypothetical protein; Provisional Probab=60.13 E-value=9.6 Score=16.02 Aligned_cols=64 Identities=11% Similarity=0.185 Sum_probs=33.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEE-EECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999997348984899-98997662218999999999999999999999998 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSI-TWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFF 65 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i-~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i 65 (492) |=|.++++=++..+.+...+++-|. |+.+ .|.+..--....+|+.+.+.+=.++.++||....+ T Consensus 7 lPK~VlilE~lGi~LLv~a~Lsind-yl~lp~~l~~~~a~i~Mif~GI~lmiPAav~ivWR~a~~l 71 (96) T PRK10591 7 LPKSVLILEILGMLLLVVALLSIND-YLSLPEPFNTPTAAILMIFLGVLLMLPAAVVIIWRVAKRL 71 (96) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCC-CEECCHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7830199999999999999999736-1237633248308999999999997489999999999863 No 265 >COG3914 Spy Predicted O-linked N-acetylglucosamine transferase, SPINDLY family [Posttranslational modification, protein turnover, chaperones] Probab=60.12 E-value=9.6 Score=16.02 Aligned_cols=96 Identities=9% Similarity=-0.065 Sum_probs=42.5 Q ss_pred HCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHH-HHHCCCCHHHHH--HHHHHHHHHHHHHCCC Q ss_conf 2158869999999988842126989999999999952574569999999-995023103578--9999999999996699 Q gi|254781033|r 165 CRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQ-KKKNAKEWNRNR--AILLIARSLENADKGD 241 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~-~~~~~~~~~~~~--a~l~~~~a~~~~~~~~ 241 (492) ...++...+...++..+..+|.+..+...|+......|.--.+...+.. .....+.+.... ....+..+......+. T Consensus 78 ~~~~~~~~~~~~~~~~l~~~~~~~~~~~~L~~ale~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 157 (620) T COG3914 78 APLADSTLAFLAKRIPLSVNPENCPAVQNLAAALELDGLQFLALADISEIAEWLSPDNAEFLGHLIRFYQLGRYLKLLGR 157 (620) T ss_pred CCCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 42466266899876567549552669998888899842478889999988876074007787667778999899998524 Q ss_pred HHHHHHHHHHHHHCCCCCH Q ss_conf 8999999987543133527 Q gi|254781033|r 242 MIASYHDAIESLKLCDNSI 260 (492) Q Consensus 242 ~~~A~~~~~~a~~~~p~~~ 260 (492) ..++.....++....|.+. T Consensus 158 ~~~~~~~l~~~~d~~p~~~ 176 (620) T COG3914 158 TAEAELALERAVDLLPKYP 176 (620) T ss_pred HHHHHHHHHHHHHHHHHHH T ss_conf 8999999999988543306 No 266 >KOG1464 consensus Probab=59.77 E-value=9.7 Score=15.98 Aligned_cols=188 Identities=12% Similarity=0.085 Sum_probs=79.9 Q ss_pred HHCCCCHHHHHHHHHHHHHHCCCCH-H---HHHHHHHHHHHCCCCHHHHHHHHHHHHCC-------CCHHHHHHHHH--- Q ss_conf 5215886999999998884212698-9---99999999995257456999999999502-------31035789999--- Q gi|254781033|r 164 SCRIGDLNSAQRYATKALDISPDAP-W---VTEAVVQQYVLAKEWSRAITFLNQKKKNA-------KEWNRNRAILL--- 229 (492) Q Consensus 164 al~~gd~~~A~~~~~~a~~~~P~~~-~---a~~~L~~l~~~~gdw~~A~~~l~~~~~~~-------~~~~~~~a~l~--- 229 (492) .++..++++|+..+++.+++.|.-. | ++.....++.+.+++.+....+.+.+..- -......+++- T Consensus 37 ~l~e~~p~~Al~sF~kVlelEgEKgeWGFKALKQmiKI~f~l~~~~eMm~~Y~qlLTYIkSAVTrNySEKsIN~IlDyiS 116 (440) T KOG1464 37 GLKEDEPKEALSSFQKVLELEGEKGEWGFKALKQMIKINFRLGNYKEMMERYKQLLTYIKSAVTRNYSEKSINSILDYIS 116 (440) T ss_pred CCCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH T ss_conf 44436879999999999941664342679999988877750466799999999999999999850420777999999986 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCH----------HHHHHH Q ss_conf 9999999966998999999987543133527999999999997399478999999999832898----------999999 Q gi|254781033|r 230 IARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHP----------EIANIY 299 (492) Q Consensus 230 ~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p----------~La~a~ 299 (492) ++..+.+++. -....+..++.|- ...-....-.-+++++...|++.+-.+++++.-...... .|...| T Consensus 117 tS~~m~LLQ~-FYeTTL~ALkdAK-NeRLWFKTNtKLgkl~fd~~e~~kl~KIlkqLh~SCq~edGedD~kKGtQLLEiY 194 (440) T KOG1464 117 TSKNMDLLQE-FYETTLDALKDAK-NERLWFKTNTKLGKLYFDRGEYTKLQKILKQLHQSCQTEDGEDDQKKGTQLLEIY 194 (440) T ss_pred HHHHHHHHHH-HHHHHHHHHHHHH-CCEEEEECCCHHHHHHEEHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHHHH T ss_conf 4113689999-9999999998512-2413554054376622109988899999999998855556731564440356457 Q ss_pred HH-H----CCCCHHHHHHHHHHHHHHCCC--CHHH----HHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 98-5----078958999999999982689--8368----99999999987998899999887866 Q gi|254781033|r 300 TH-L----LSENTVGKLKRALRLEEINKE--SVES----LVIVSKIALEMGSIDQAHAKAMLAMK 353 (492) Q Consensus 300 ~~-~----~~~~~~~al~~~~~l~~~~P~--~~e~----~~~LA~~~~~~g~~~~Ar~~l~~al~ 353 (492) +. + ...+...-...+++.+..... ||-. +-.=|.++++.|+|.+|..-+-.+.+ T Consensus 195 AlEIQmYT~qKnNKkLK~lYeqalhiKSAIPHPlImGvIRECGGKMHlreg~fe~AhTDFFEAFK 259 (440) T KOG1464 195 ALEIQMYTEQKNNKKLKALYEQALHIKSAIPHPLIMGVIRECGGKMHLREGEFEKAHTDFFEAFK 259 (440) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHH T ss_conf 66766565640558899999998776304885177868987177220014337888768999973 No 267 >KOG1550 consensus Probab=59.75 E-value=9.7 Score=15.97 Aligned_cols=95 Identities=7% Similarity=-0.130 Sum_probs=36.6 Q ss_pred CHHHHHHHHHHHHHCC-----CHHHHHHHHHHHHHHCC-CHHHHHHHHHHCC---CCHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 2799999999999739-----94789999999998328-9899999998507---8958999999999982689836899 Q gi|254781033|r 259 SIMASICAAKSLISQN-----KKRKAEVILEKIWKVNP-HPEIANIYTHLLS---ENTVGKLKRALRLEEINKESVESLV 329 (492) Q Consensus 259 ~~~a~~~~a~~l~~~g-----~~~~A~~~le~a~~~~p-~p~La~a~~~~~~---~~~~~al~~~~~l~~~~P~~~e~~~ 329 (492) .+++...++..|.+.. +..+|.+++.++-...+ +-..........+ .+...+.+.+....+ -+|.++.+ T Consensus 287 ~~~a~~~lg~~Y~~g~~~~~~d~~~A~~~~~~aA~~g~~~a~~~lg~~~~~g~~~~d~~~A~~yy~~Aa~--~G~~~A~~ 364 (552) T KOG1550 287 LPPAQYGLGRLYLQGLGVEKIDYEKALKLYTKAAELGNPDAQYLLGVLYETGTKERDYRRAFEYYSLAAK--AGHILAIY 364 (552) T ss_pred CCHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH--CCCHHHHH T ss_conf 7422479999998189875124999999999998559904899988987627632138899999999997--59989999 Q ss_pred HHHHHHHHC----CCHHHHHHHHHHHHHCC Q ss_conf 999999987----99889999988786609 Q gi|254781033|r 330 IVSKIALEM----GSIDQAHAKAMLAMKIA 355 (492) Q Consensus 330 ~LA~~~~~~----g~~~~Ar~~l~~al~~~ 355 (492) -+|..+..- .+-..|..++.++.+.+ T Consensus 365 ~la~~y~~G~gv~r~~~~A~~~~k~aA~~g 394 (552) T KOG1550 365 RLALCYELGLGVERNLELAFAYYKKAAEKG 394 (552) T ss_pred HHHHHHHHCCCCCCCHHHHHHHHHHHHHCC T ss_conf 999999747786887789999999999827 No 268 >COG4941 Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription] Probab=59.59 E-value=9.8 Score=15.95 Aligned_cols=182 Identities=12% Similarity=0.063 Sum_probs=99.4 Q ss_pred CHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 86999999998884212698999999999995257456999999999502310357899999999999966998999999 Q gi|254781033|r 169 DLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHD 248 (492) Q Consensus 169 d~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~ 248 (492) =-++|+...+-...+-|+.++++-.+..+..+..+-..=.. ..+. .+++..+-...-+.+-.+++... T Consensus 211 Lc~EairLgRll~~L~p~EPE~~GL~ALmll~~sR~~AR~~------~~G~------~vlL~dQDr~lW~r~lI~eg~al 278 (415) T COG4941 211 LCDEAIRLGRLLARLLPGEPEALGLLALMLLQESRRPARFD------ADGE------PVLLEDQDRSLWDRALIDEGLAL 278 (415) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCC------CCCC------EEECCCCCHHHHHHHHHHHHHHH T ss_conf 78999999999998769986888999999998753101028------9998------65221135666658887878999 Q ss_pred HHHHHHCC---CCCHHHHHHHHHHHH---HCCCHHHHHHHHHHHHHHCCCHH--HHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 98754313---352799999999999---73994789999999998328989--99999985078958999999999982 Q gi|254781033|r 249 AIESLKLC---DNSIMASICAAKSLI---SQNKKRKAEVILEKIWKVNPHPE--IANIYTHLLSENTVGKLKRALRLEEI 320 (492) Q Consensus 249 ~~~a~~~~---p~~~~a~~~~a~~l~---~~g~~~~A~~~le~a~~~~p~p~--La~a~~~~~~~~~~~al~~~~~l~~~ 320 (492) ..++.... |-...+.+....... ..-+...-..++.......|.|- |-.+.+..+...+...+..++.+... T Consensus 279 l~rA~~~~~pGPYqlqAAIaa~HA~a~~aedtDW~~I~aLYdaL~~~apSPvV~LNRAVAla~~~Gp~agLa~ve~L~~~ 358 (415) T COG4941 279 LDRALASRRPGPYQLQAAIAALHARARRAEDTDWPAIDALYDALEQAAPSPVVTLNRAVALAMREGPAAGLAMVEALLAR 358 (415) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCCEEEEHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999748998189999999998753335799859999999999984899857513889999765578689999875124 Q ss_pred --CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHH Q ss_conf --689836899999999987998899999887866099989999 Q gi|254781033|r 321 --NKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFL 362 (492) Q Consensus 321 --~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ 362 (492) -.+....+-..|.++.+.|+-.+|+..+++++.+.++...-. T Consensus 359 ~~L~gy~~~h~~RadlL~rLgr~~eAr~aydrAi~La~~~aer~ 402 (415) T COG4941 359 PRLDGYHLYHAARADLLARLGRVEEARAAYDRAIALARNAAERA 402 (415) T ss_pred CCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHH T ss_conf 35543422087778999981870778999999998668869999 No 269 >PRK08270 anaerobic ribonucleoside triphosphate reductase; Provisional Probab=59.56 E-value=4.3 Score=18.73 Aligned_cols=19 Identities=26% Similarity=0.459 Sum_probs=10.1 Q ss_pred ECC-CCCCCHHHCCCCCCCCC Q ss_conf 756-98955424224837687 Q gi|254781033|r 395 ISD-DGYLSSVWLPLSPISKT 414 (492) Q Consensus 395 ~c~-~g~~~~~W~~~c~~~~~ 414 (492) +|. ||++... .-.||+||+ T Consensus 627 ~C~~CGy~~ge-~~~CP~CG~ 646 (681) T PRK08270 627 ICPKHGYLAGE-HEFCPKCGK 646 (681) T ss_pred CCCCCCCCCCC-CCCCCCCCC T ss_conf 67889977788-674939989 No 270 >PRK12452 cardiolipin synthetase; Reviewed Probab=59.53 E-value=8.5 Score=16.43 Aligned_cols=22 Identities=5% Similarity=-0.017 Sum_probs=9.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999998418 Q gi|254781033|r 47 ILYFFLFAWILLFAVSRFFLSC 68 (492) Q Consensus 47 ~~~~~~~~~~ll~~l~~~i~~~ 68 (492) +.+++++++.++--++-+++|. T Consensus 60 aWllvi~~lP~~G~ilYl~fG~ 81 (509) T PRK12452 60 AWFLVLALLPVIGVLLYSIFGR 81 (509) T ss_pred HHHHHHHHHHHHHHHHHHEECC T ss_conf 9999999999999999991098 No 271 >cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity. Probab=59.43 E-value=3.7 Score=19.23 Aligned_cols=24 Identities=13% Similarity=0.131 Sum_probs=15.8 Q ss_pred CEECC-CCCCCHHH--CCCCCCCCCCC Q ss_conf 52756-98955424--22483768736 Q gi|254781033|r 393 LWISD-DGYLSSVW--LPLSPISKTLC 416 (492) Q Consensus 393 ~W~c~-~g~~~~~W--~~~c~~~~~~~ 416 (492) .|+|. |||+++.= --+||+||.-. T Consensus 1 ~~~C~vCGyi~~~~~~p~~CP~Cg~~k 27 (33) T cd00350 1 KYVCPVCGYIYDGEEAPWVCPVCGAPK 27 (33) T ss_pred CCCCCCCCCEEECCCCCCCCCCCCCCH T ss_conf 948886998875786987287889978 No 272 >COG2389 Uncharacterized metal-binding protein [General function prediction only] Probab=58.75 E-value=10 Score=15.85 Aligned_cols=12 Identities=8% Similarity=-0.155 Sum_probs=4.4 Q ss_pred EEEEECCEEEEE Q ss_conf 899989976622 Q gi|254781033|r 28 VSITWGNRLYRT 39 (492) Q Consensus 28 v~i~~~~~~ie~ 39 (492) +-+.+.++.+-. T Consensus 115 ~~l~~l~~~ip~ 126 (179) T COG2389 115 LPLTLLSFDIPR 126 (179) T ss_pred CCCCCCCCCCCC T ss_conf 764102466754 No 273 >PRK13184 pknD serine/threonine-protein kinase; Reviewed Probab=58.43 E-value=10 Score=15.81 Aligned_cols=120 Identities=22% Similarity=0.169 Sum_probs=63.5 Q ss_pred HHHHHHHHHHHHHH-----CC-H--HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHH Q ss_conf 99999999999972-----10-0--4787999986301346661589999987532222100016899999972688505 Q gi|254781033|r 83 KGYKALYTGLMSIA-----AH-N--IPLARKMHSYVSQQHTFHNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPATRE 154 (492) Q Consensus 83 k~~~al~~gl~al~-----~G-d--~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~~~ 154 (492) .||+|.-+.-+.+. .| + +..|...+.+. +.+..-||-+|..|-..+..|++++-.+-|.-++++=+.+. T Consensus 510 eg~ea~fr~git~le~a~~~~~~~~~~~al~~f~~l---h~~~~apleylgkalvyq~~~~~~eeik~~~la~kry~~hp 586 (933) T PRK13184 510 EGYEAQFRLGITLLEKASEQGGDQEFTQALSEFSYL---HGGVAAPLEYLGKALVYQRLGEYNEEIKSLLLALKRYSQHP 586 (933) T ss_pred CCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 007889986389999887604725799999999885---38988757888889999873007899999999997567998 Q ss_pred HH-------HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHH Q ss_conf 79-------998654552158869999999988842126989999999999952574569 Q gi|254781033|r 155 FA-------VYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRA 207 (492) Q Consensus 155 ~~-------~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A 207 (492) .- ...| .+.+. .+...++...--.+...|.+......---+...+++|+.. T Consensus 587 ~i~~l~~h~~~rl-he~~~-~~~~~~~~f~ll~~~~~p~~~~~~~~~~~~~~l~~~~~~~ 644 (933) T PRK13184 587 EIPRLKDHVVYRL-HESLY-KHRREALVFMLLILWIAPEKISVREEERFLRILYHKQQAT 644 (933) T ss_pred CCHHHHHHHHHHH-HHHHH-HHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCC T ss_conf 7367898999999-98876-5168999999999986501037644899999986313057 No 274 >PRK08271 anaerobic ribonucleoside triphosphate reductase; Provisional Probab=58.39 E-value=5.4 Score=17.95 Aligned_cols=49 Identities=12% Similarity=0.036 Sum_probs=29.1 Q ss_pred CCHHHHHHHHHHHHCCCCCCCE-------ECCCCC-CCHHHCCCCCCCCCCCCCCCCC Q ss_conf 9989999999999628899952-------756989-5542422483768736400367 Q gi|254781033|r 373 HNTDKILYWTQSALHAMPDPLW-------ISDDGY-LSSVWLPLSPISKTLCYFEWKI 422 (492) Q Consensus 373 ~d~~~ar~wl~rA~~a~~dp~W-------~c~~g~-~~~~W~~~c~~~~~~~~~~W~~ 422 (492) .+....+..+.++.+- +=|-| +|.+|| ...-=...||.||+-++-.|.+ T Consensus 541 ~~~ea~~~lvk~~~~~-~lpY~titp~fsiC~~chgy~~Ge~~~CP~CG~~~~e~~sR 597 (625) T PRK08271 541 KSEEGYLKLVNIAAKT-GVNYFAINVKITICNEGHAIDKRTGKRCPVCGSEKIDEGTR 597 (625) T ss_pred CCHHHHHHHHHHHHHC-CCCEEEECCCEEECCCCCCCCCCCCCCCCCCCCCCCEEEEE T ss_conf 9999999999999847-99769957671378899988577688398998978507988 No 275 >TIGR00373 TIGR00373 conserved hypothetical protein TIGR00373; InterPro: IPR005241 This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain. . Probab=58.25 E-value=6.5 Score=17.32 Aligned_cols=38 Identities=5% Similarity=0.013 Sum_probs=25.5 Q ss_pred HHHHHHHHHHCCCCCC-CEECCCCCC------CHHHCCCCCCCCCC Q ss_conf 9999999996288999-527569895------54242248376873 Q gi|254781033|r 377 KILYWTQSALHAMPDP-LWISDDGYL------SSVWLPLSPISKTL 415 (492) Q Consensus 377 ~ar~wl~rA~~a~~dp-~W~c~~g~~------~~~W~~~c~~~~~~ 415 (492) -+|. |++++.+.... -.+|.+|++ .-+-...||.||+- T Consensus 95 ~~kk-LreklEfE~nn~ff~CpN~~vrftf~eAme~nFtCP~CG~~ 139 (168) T TIGR00373 95 LVKK-LREKLEFEKNNMFFVCPNMNVRFTFDEAMELNFTCPECGAM 139 (168) T ss_pred HHHH-HHHHHHHHCCCEEEEECCCEEEEEHHHHHCCCCCCCCCCCH T ss_conf 9999-99874231077258713840574042231167988331323 No 276 >pfam01155 HypA Hydrogenase expression/synthesis hypA family. Four conserved cysteines lie either side of the least conserved region. Probab=58.13 E-value=6.5 Score=17.34 Aligned_cols=26 Identities=12% Similarity=-0.120 Sum_probs=19.4 Q ss_pred CCEECC-CCC--CCHHHCCCCCCCCCCCC Q ss_conf 952756-989--55424224837687364 Q gi|254781033|r 392 PLWISD-DGY--LSSVWLPLSPISKTLCY 417 (492) Q Consensus 392 p~W~c~-~g~--~~~~W~~~c~~~~~~~~ 417 (492) +...|. ||+ ..+.....||.||+.+. T Consensus 68 ~~~~C~~Cg~~~~~~~~~~~CP~Cgs~~~ 96 (112) T pfam01155 68 GVARCRDCGQEFELEERFFRCPKCGSLDL 96 (112) T ss_pred CEEECCCCCCEECCCCCCCCCCCCCCCCC T ss_conf 74899899971425777679908979988 No 277 >TIGR02487 NrdD anaerobic ribonucleoside-triphosphate reductase; InterPro: IPR012833 This entry is found in the oxygen-sensitive (anaerobic, class III) ribonucleotide reductase. The mechanism of the enzyme involves a glycine-centred radical , a C-terminal zinc binding site , and a set of conserved active site cysteines and asparagines . This enzyme requires an activating component, NrdG, a radical-SAM domain containing enzyme (IPR012837 from INTERPRO). Together the two form an alpha-2/beta-2 heterodimer.; GO: 0008998 ribonucleoside-triphosphate reductase activity, 0016491 oxidoreductase activity. Probab=57.76 E-value=4.9 Score=18.32 Aligned_cols=46 Identities=11% Similarity=0.141 Sum_probs=29.7 Q ss_pred HCCCHHHHHHHHHHHHCCCCC--------CCEECC-CC-CCCHHHC----CCCCCCCCCC Q ss_conf 579989999999999628899--------952756-98-9554242----2483768736 Q gi|254781033|r 371 NSHNTDKILYWTQSALHAMPD--------PLWISD-DG-YLSSVWL----PLSPISKTLC 416 (492) Q Consensus 371 e~~d~~~ar~wl~rA~~a~~d--------p~W~c~-~g-~~~~~W~----~~c~~~~~~~ 416 (492) +..|..+...-..++.+-.++ |.=+|. || |....-. -.||.||+=| T Consensus 560 ~~~d~~~l~~i~~~~~~~~t~i~Y~~in~~~~~C~~Cg~y~~~~~~t~~g~~CP~CGs~D 619 (655) T TIGR02487 560 AAKDPEALKDIIKKAMKNGTNIGYFGINPPVDVCEDCGEYTGEGLNTEKGDKCPKCGSHD 619 (655) T ss_pred CCCCHHHHHHHHHHHHHCCCCCCEEEECCCCCEECCCCCCHHCCCCCCCCCCCCCCCCCC T ss_conf 546737899999999973885222676787765257653200000011556688988762 No 278 >cd07853 STKc_NLK The catalytic domain of the Serine/Threonine Kinase, Nemo-Like Kinase subfamily. Serine/Threonine Kinases (STKs), Nemo-Like Kinase (NLK) subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The NLK subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. Mitogen-activated protein kinases (MAPKs) are important mediators of cellular responses to extracellular signals. NLK is an atypical MAPK that is not regulated by a MAPK kinase. It functions downstream of the MAPK kinase kinase Tak1, which also plays a role in activating the JNK and p38 MAPKs. The Tak1/NLK pathways are regulated by Wnts, a family of secreted proteins that is critical in the control of asymmetric division and cell polarity. NLK can phosphorylate Probab=57.68 E-value=5.7 Score=17.79 Aligned_cols=58 Identities=16% Similarity=0.138 Sum_probs=34.6 Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEECCCCCCC-------HHHCCCCCC Q ss_conf 88999998878660999899999999999857998999999999962889995275698955-------424224837 Q gi|254781033|r 341 IDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWISDDGYLS-------SVWLPLSPI 411 (492) Q Consensus 341 ~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~c~~g~~~-------~~W~~~c~~ 411 (492) -.+|...+++.+..+|+.|. -.++...++ |+.... -+=..|.|.||+.. .+=.|+|+. T Consensus 261 ~~ea~DLL~kmL~~DP~kRi-----Ta~eaL~HP------yf~~~~--~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~ 325 (372) T cd07853 261 THEAVHLLCRMLVFDPDKRI-----SAADALAHP------YLDEGR--LRYHTCMCKCCYTTSGGRVYTSDFEPSANP 325 (372) T ss_pred CHHHHHHHHHHCCCCHHHCC-----CHHHHHCCC------CCCCCC--CCCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 99999999987778935690-----599986792------706774--223421112124666554568777999997 No 279 >TIGR02614 ftsW cell division protein FtsW; InterPro: IPR013437 FtsW is an integral membrane protein with ten transmembrane segments . In general, it is one of two paralogs involved in peptidoglycan biosynthesis, the other being RodA, and is essential for cell division. The FtsW designation is not used in endospore-forming bacteria (e.g. Bacillus subtilis), where these proteins are designated SpoVE, and three or more RodA/FtsW/SpoVE family paralogs are present. SpoVE acts in spore cortex formation and is dispensable for growth. Biological roles for FtsW in cell division include recruitment of penicillin-binding protein 3 to the division site.; GO: 0009252 peptidoglycan biosynthetic process, 0051301 cell division, 0016021 integral to membrane. Probab=57.51 E-value=11 Score=15.70 Aligned_cols=93 Identities=16% Similarity=0.225 Sum_probs=47.4 Q ss_pred EEEEHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHCCHHHHHHHHHHH-----HH----HHHHHHHHHHHHHH Q ss_conf 6622189999999999999-----------9999999999841828899999999-----99----99999999999997 Q gi|254781033|r 36 LYRTSPFVILSILYFFLFA-----------WILLFAVSRFFLSCPAMLFHMLHKR-----NY----DKGYKALYTGLMSI 95 (492) Q Consensus 36 ~ie~s~~~~i~~~~~~~~~-----------~~ll~~l~~~i~~~p~~~~~~~~~r-----r~----~k~~~al~~gl~al 95 (492) ..-|+.+.+++.++.+|++ +..+..+...+...|++++|-..=- .. ..||| ++|+++|+ T Consensus 162 DfG~~~~~~~~~~~~lf~AG~~~~~~~~~~~~~~~~~~~l~~~~pYR~~Ri~~Fl~P~~Dp~~~~~g~gyQ-l~qsl~A~ 240 (370) T TIGR02614 162 DFGTTVVIFLILLGMLFLAGAPLRYFALLLLLGLLGLVILIVSSPYRMRRILSFLDPWADPFLADQGSGYQ-LTQSLIAL 240 (370) T ss_pred CHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHCCCCCCCCCCCCCCCH-HHHHHHHH T ss_conf 80389999999999999805019999999999999999999854279999998737776640104789604-99999998 Q ss_pred HCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHH Q ss_conf 210047879999863013466615899999875322 Q gi|254781033|r 96 AAHNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIAL 131 (492) Q Consensus 96 ~~Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA~ 131 (492) ++|.+--- =.-.-..|..++| ++.+-..-|-.++ T Consensus 241 g~GG~~G~-GLG~s~qKl~yLP-eAhtDFIfav~gE 274 (370) T TIGR02614 241 GRGGLFGV-GLGNSVQKLFYLP-EAHTDFIFAVIGE 274 (370) T ss_pred HCCCCEEE-CCCCCEECCCCCC-CCCHHHHHHHHHH T ss_conf 67985674-1687541135799-7421599999999 No 280 >COG3369 Zinc finger domain containing protein (CDGSH-type) [Function unknown] Probab=57.48 E-value=5.3 Score=18.03 Aligned_cols=26 Identities=12% Similarity=0.073 Sum_probs=20.8 Q ss_pred CCEECCCCCCCHHHCCCCCCCCCCCCCC Q ss_conf 9527569895542422483768736400 Q gi|254781033|r 392 PLWISDDGYLSSVWLPLSPISKTLCYFE 419 (492) Q Consensus 392 p~W~c~~g~~~~~W~~~c~~~~~~~~~~ 419 (492) ..|+|.||++.. .|.|+-++.--.|+ T Consensus 30 ~~~LCrCG~S~N--KPfCDGtH~k~gf~ 55 (78) T COG3369 30 QAALCRCGHSEN--KPFCDGTHKKTGFD 55 (78) T ss_pred EEEEEECCCCCC--CCCCCCCCCCCCCC T ss_conf 799972367578--88667744340236 No 281 >PRK09765 2-O-a-mannosyl-D-glycerate specific PTS transporten components IIABC; Provisional Probab=57.37 E-value=8.2 Score=16.56 Aligned_cols=20 Identities=10% Similarity=-0.125 Sum_probs=8.4 Q ss_pred CCHHHHH-HHHHHHHHHCCCC Q ss_conf 8958999-9999999826898 Q gi|254781033|r 305 ENTVGKL-KRALRLEEINKES 324 (492) Q Consensus 305 ~~~~~al-~~~~~l~~~~P~~ 324 (492) +.+-..+ ..+..|+..-.+. T Consensus 434 g~pi~~l~~~l~~~L~~m~~~ 454 (638) T PRK09765 434 GEPVAWINNSLTAWLNGLSGS 454 (638) T ss_pred HHHHHHHHHHHHHHHHHCCCC T ss_conf 489999999999999864441 No 282 >PRK11466 hybrid sensory histidine kinase TorS; Provisional Probab=56.72 E-value=11 Score=15.61 Aligned_cols=18 Identities=22% Similarity=0.296 Sum_probs=8.1 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 799999999999999999 Q gi|254781033|r 3 RLIRYFFVISLVICSFII 20 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~ 20 (492) |+++-|.+++++.+++.+ T Consensus 8 rLl~aF~~~a~L~ll~~~ 25 (912) T PRK11466 8 RLWMGFALMALLTLTSTL 25 (912) T ss_pred HHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999 No 283 >TIGR02443 TIGR02443 conserved hypothetical protein; InterPro: IPR012658 Members of this family are small proteins, about 70 residues in length, with a basic triplet near the N-terminus and a probable metal-binding motif CPXCX(18)CXXC. Members are found in various proteobacteria.. Probab=56.62 E-value=4.6 Score=18.54 Aligned_cols=15 Identities=7% Similarity=-0.070 Sum_probs=13.0 Q ss_pred CCCCCCCCCCCCCCC Q ss_conf 248376873640036 Q gi|254781033|r 407 PLSPISKTLCYFEWK 421 (492) Q Consensus 407 ~~c~~~~~~~~~~W~ 421 (492) ..||.|+.-|||.|= T Consensus 10 A~CP~C~~~D~l~~W 24 (63) T TIGR02443 10 AKCPACSAQDTLAMW 24 (63) T ss_pred CCCCCCCCCCCEEEE T ss_conf 728888874540002 No 284 >PRK06266 transcription initiation factor E subunit alpha; Validated Probab=56.48 E-value=6.4 Score=17.40 Aligned_cols=25 Identities=8% Similarity=0.105 Sum_probs=17.4 Q ss_pred CCEECCCCC-C-----CHHHCCCCCCCCCCC Q ss_conf 952756989-5-----542422483768736 Q gi|254781033|r 392 PLWISDDGY-L-----SSVWLPLSPISKTLC 416 (492) Q Consensus 392 p~W~c~~g~-~-----~~~W~~~c~~~~~~~ 416 (492) .-.+|.+|+ . .-+....||.||..= T Consensus 116 ~FY~C~~c~~R~tFeeAme~~F~CP~CG~~L 146 (178) T PRK06266 116 MFFVCPNGHPKFTFDEAMEYGFRCPQCGEML 146 (178) T ss_pred CEEECCCCCCCCCHHHHHHCCCCCCCCCCCC T ss_conf 8788899983005999988399699998866 No 285 >COG5107 RNA14 Pre-mRNA 3'-end processing (cleavage and polyadenylation) factor [RNA processing and modification] Probab=56.31 E-value=11 Score=15.56 Aligned_cols=116 Identities=9% Similarity=-0.007 Sum_probs=79.9 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHC---CCHHHHHHHHHHC-CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99999999997399478999999999832---8989999999850-7895899999999998268983689999999998 Q gi|254781033|r 262 ASICAAKSLISQNKKRKAEVILEKIWKVN---PHPEIANIYTHLL-SENTVGKLKRALRLEEINKESVESLVIVSKIALE 337 (492) Q Consensus 262 a~~~~a~~l~~~g~~~~A~~~le~a~~~~---p~p~La~a~~~~~-~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~ 337 (492) ++..+-....+..-...|.+++.++-+.. +|--+..+|.+.. .+++..+-+-++--+...||++....-.---.+. T Consensus 399 v~C~~~N~v~r~~Gl~aaR~~F~k~rk~~~~~h~vyi~~A~~E~~~~~d~~ta~~ifelGl~~f~d~~~y~~kyl~fLi~ 478 (660) T COG5107 399 VFCVHLNYVLRKRGLEAARKLFIKLRKEGIVGHHVYIYCAFIEYYATGDRATAYNIFELGLLKFPDSTLYKEKYLLFLIR 478 (660) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCEEEEHHHHHHHHCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 78899999998755899999999971257777523431778898861895317899987777379953899999999987 Q ss_pred CCCHHHHHHHHHHHHHC-CCC--HHHHHHHHHHHHHHCCCHHHH Q ss_conf 79988999998878660-999--899999999999857998999 Q gi|254781033|r 338 MGSIDQAHAKAMLAMKI-APR--KEIFLLLAQIEQANSHNTDKI 378 (492) Q Consensus 338 ~g~~~~Ar~~l~~al~~-~P~--~~~~~llA~ie~~e~~d~~~a 378 (492) -++-..||..++++++. .-+ .++|.-|-+-|.. -|+-..+ T Consensus 479 inde~naraLFetsv~r~~~~q~k~iy~kmi~YEs~-~G~lN~v 521 (660) T COG5107 479 INDEENARALFETSVERLEKTQLKRIYDKMIEYESM-VGSLNNV 521 (660) T ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCCHHHH T ss_conf 084777899998768888876666999999989986-0646777 No 286 >cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear. Probab=55.59 E-value=11 Score=15.48 Aligned_cols=65 Identities=17% Similarity=0.186 Sum_probs=33.7 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 99999987543133527999999999997399478999999999832898999999985078958999999999 Q gi|254781033|r 244 ASYHDAIESLKLCDNSIMASICAAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRL 317 (492) Q Consensus 244 ~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~~l 317 (492) .|.+.+..|++.+..- ..-..-.++.+|..+|.+....-|+..+...|-... ..+..|++.+++. T Consensus 5 ~A~kYA~~AVkaD~~G--------~~~~AI~~Y~kAie~L~kl~~LYPds~l~~iY~q~~-~~Y~~Ri~~Lek~ 69 (75) T cd02682 5 MARKYAINAVKAEKEG--------NAEDAITNYKKAIEVLSQIVKNYPDSPTRLIYEQMI-NEYKRRIEVLEKQ 69 (75) T ss_pred HHHHHHHHHHHHHHCC--------CHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH-HHHHHHHHHHHHH T ss_conf 8999999987375347--------788899999999999999998789817999999999-9999999999976 No 287 >pfam10938 YfdX YfdX protein. YfdX is a protein found in Proteobacteria of unknown function. The protein coding for this gene is regulated by EvgA in E. coli. Probab=55.49 E-value=11 Score=15.47 Aligned_cols=26 Identities=27% Similarity=0.328 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 99999999997399478999999999 Q gi|254781033|r 262 ASICAAKSLISQNKKRKAEVILEKIW 287 (492) Q Consensus 262 a~~~~a~~l~~~g~~~~A~~~le~a~ 287 (492) ..+..+..+...|++.+|...|..+. T Consensus 119 ~av~~A~~lL~~gk~~eA~~aL~~A~ 144 (155) T pfam10938 119 AAVKQAASLLDEGKYYEAKAALKEAL 144 (155) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99999999997799789999999986 No 288 >pfam09535 Gmx_para_CXXCG Protein of unknown function (Gmx_para_CXXCG). This entry consists of at least 10 paralogous proteins from Myxococcus xanthus and that lack detectable sequence similarity to any other protein family. An imperfectly conserved CXXCG motif, a probable binding site, appears twice in the multiple sequence alignment. Probab=54.94 E-value=6.9 Score=17.14 Aligned_cols=38 Identities=16% Similarity=0.160 Sum_probs=25.1 Q ss_pred CCHHHCCCCCCCCCCCCCCCC-CCCCCCCCCCCCCCCCHH Q ss_conf 554242248376873640036-787554456665421000 Q gi|254781033|r 401 LSSVWLPLSPISKTLCYFEWK-IPTKSPEYISSENINFSL 439 (492) Q Consensus 401 ~~~~W~~~c~~~~~~~~~~W~-~p~~~~~~~~~~~~~~~~ 439 (492) ...+|.|-|++||+. ++.|- +|.-+..++|.....+-+ T Consensus 166 ~ppd~~ppC~~CG~~-~~~lP~~~iLda~SlP~d~D~FRl 204 (237) T pfam09535 166 LPPDREPPCPTCGRH-SLRLPDTPILDAASLPADVDLFRL 204 (237) T ss_pred CCCCCCCCCCCCCCC-CCCCCCCCEECCCCCCCCCCEEEE T ss_conf 698989998531333-356897401301238776332531 No 289 >pfam03094 Mlo Mlo family. A family of plant integral membrane proteins, first discovered in barley. Mutants lacking wild-type Mlo proteins show broad spectrum resistance to the powdery mildew fungus, and dysregulated cell death control, with spontaneous cell death in response to developmental or abiotic stimuli. Thus wild-type Mlo proteins are thought to be inhibitors of cell death whose deficiency lowers the threshold required to trigger the cascade of events that result in plant cell death. Mlo proteins are localized in the plasma membrane and possess seven transmembrane regions; thus the Mlo family is the only major higher plant family to possess 7 transmembrane domains. It has been suggested that Mlo proteins function as G-protein coupled receptors in plants; however the molecular and biological functions of Mlo proteins remain to be fully determined. Probab=54.63 E-value=12 Score=15.37 Aligned_cols=56 Identities=21% Similarity=0.192 Sum_probs=32.9 Q ss_pred EEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHCCHH Q ss_conf 22189999999999999999999999984182889999999999999999999--------9999721004 Q gi|254781033|r 38 RTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALYT--------GLMSIAAHNI 100 (492) Q Consensus 38 e~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~~--------gl~al~~Gd~ 100 (492) +|+.+...++..+++++.+++-+.+ ..+..|..+++++.-++||.+ |+|.|.-.=+ T Consensus 8 ~TPTWaVA~Vc~v~V~iSl~iEr~l-------H~lgk~Lkk~~kkaL~eALeKiK~ELMLLGFISLLLtv~ 71 (481) T pfam03094 8 ETPTWAVAVVCTVLVLISILLERGL-------HKLGKWLKKRHKKALFEALEKIKAELMLLGFISLLLTVG 71 (481) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 6875199999999999999999999-------999999876182689999999999999999999999977 No 290 >pfam11947 DUF3464 Protein of unknown function (DUF3464). This family of proteins are functionally uncharacterized. This protein is found in bacteria and eukaryotes. Proteins in this family are typically between 137 to 196 amino acids in length. Probab=54.63 E-value=12 Score=15.37 Aligned_cols=63 Identities=11% Similarity=0.034 Sum_probs=33.2 Q ss_pred CH-HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 96-79999999999999999973489848999899766221899999999999999999999999841 Q gi|254781033|r 1 ML-RLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLS 67 (492) Q Consensus 1 M~-r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~ 67 (492) |+ |+++|..+=.+++++... -.|.++.-..+++-..+.+++.+++..+.++.+-+.++..-|. T Consensus 57 M~rR~~~~~GiP~~lg~~~f~----~~y~l~~~~~~d~p~~~~~~~s~~~Fg~gllGisYGilSaSWD 120 (149) T pfam11947 57 MIRRIAFFSGIPTALGMAVFV----VSYLLVSRGIIDVPPWVTLLVSLGFFGLGLLGLSYGILSASWD 120 (149) T ss_pred HHHHHHHHHCCHHHHHHHHHH----HHHHEEECCEEECCCHHHHHHHHHHHHHHHHHHHHEEEECCCC T ss_conf 999999981715888999987----7761026443644706999999999999886561201201448 No 291 >COG1333 ResB ResB protein required for cytochrome c biosynthesis [Posttranslational modification, protein turnover, chaperones] Probab=54.54 E-value=9.5 Score=16.04 Aligned_cols=63 Identities=14% Similarity=0.152 Sum_probs=34.1 Q ss_pred CCHHHHHHHHHHHHHCCCCHHHH-----HHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEECCCCCCCHHHCC Q ss_conf 99889999988786609998999-----9999999985799899999999996288999527569895542422 Q gi|254781033|r 339 GSIDQAHAKAMLAMKIAPRKEIF-----LLLAQIEQANSHNTDKILYWTQSALHAMPDPLWISDDGYLSSVWLP 407 (492) Q Consensus 339 g~~~~Ar~~l~~al~~~P~~~~~-----~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~c~~g~~~~~W~~ 407 (492) ..+.++......-+..+|..... .+|..+-..--- ..-|.|+.. -..-+..|+. |+.+.+|.. T Consensus 378 ~~~~~~~~ftglqV~~Dp~~~~v~vg~~i~Mvglv~s~~~--~~rrvWl~~--~~G~~~V~~~--G~t~t~~~g 445 (478) T COG1333 378 VGFLEAVGFTGLQVSKDPGQPWVYVGGAILMVGLVQSLYW--RHRRVWLRS--LDGTREVEVG--GLTNTDWSG 445 (478) T ss_pred EECCCCCCCCCEEECCCCCCCEEHHHHHHHHHHHHHHHHH--HHHHHHHHH--CCCCEEEEEE--EEECCCCCC T ss_conf 4021102332214136899745047789999999999999--888888855--2784489881--032145765 No 292 >PRK00762 hypA hydrogenase nickel incorporation protein; Provisional Probab=54.24 E-value=7.4 Score=16.90 Aligned_cols=26 Identities=15% Similarity=0.206 Sum_probs=15.8 Q ss_pred CCEECCCCCC--C-----HHHCC--CCCCCCCCCC Q ss_conf 9527569895--5-----42422--4837687364 Q gi|254781033|r 392 PLWISDDGYL--S-----SVWLP--LSPISKTLCY 417 (492) Q Consensus 392 p~W~c~~g~~--~-----~~W~~--~c~~~~~~~~ 417 (492) +.-.|.||+. . ..+.+ .||.||+.+. T Consensus 69 ~~~~C~Cg~~~~~~~~~~~~~~~~~~CP~Cgs~~~ 103 (124) T PRK00762 69 PELECECGYKGVVDEDEIDHYAAVMECPVCGNKHA 103 (124) T ss_pred CEEEEECCCCCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 25996289955533210002567771918879777 No 293 >COG3107 LppC Putative lipoprotein [General function prediction only] Probab=54.19 E-value=12 Score=15.32 Aligned_cols=86 Identities=13% Similarity=-0.007 Sum_probs=43.4 Q ss_pred CHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH-CCCC-HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC-CCCHHHHHHH Q ss_conf 1589999987532222100016899999972-6885-05799986545521588699999999888421-2698999999 Q gi|254781033|r 118 NEYLVYLLEVQIALAERQYNIAHEKLEMMLQ-IPAT-REFAVYSLYFESCRIGDLNSAQRYATKALDIS-PDAPWVTEAV 194 (492) Q Consensus 118 ~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~-~~~~-~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~-P~~~~a~~~L 194 (492) +.....|+++.++..+|+...+...+.++.. -.++ +..-..-.+..++...++..|+..+.+...-. |.+-++..-. T Consensus 61 ~~~~~~llAa~al~~e~k~~qA~~Ll~ql~~~Ltd~Q~~~~~LL~ael~la~~q~~~Al~~L~~~~~~~ls~~Qq~Ry~q 140 (604) T COG3107 61 QQNDWLLLAARALVEEGKTAQAQALLNQLPQELTDAQRAEKSLLAAELALAQKQPAAALQQLAKLLPADLSQNQQARYYQ 140 (604) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHCCHHHHHHHHH T ss_conf 56649999999999747828899999865623798999999999999998422729999998520321167788999999 Q ss_pred HHHHHHCCC Q ss_conf 999995257 Q gi|254781033|r 195 VQQYVLAKE 203 (492) Q Consensus 195 ~~l~~~~gd 203 (492) +.+...+++ T Consensus 141 ~~a~a~ea~ 149 (604) T COG3107 141 ARADALEAR 149 (604) T ss_pred HHHHHHHCC T ss_conf 999998613 No 294 >COG2816 NPY1 NTP pyrophosphohydrolases containing a Zn-finger, probably nucleic-acid-binding [DNA replication, recombination, and repair] Probab=54.09 E-value=5.7 Score=17.81 Aligned_cols=21 Identities=14% Similarity=0.211 Sum_probs=16.1 Q ss_pred ECC-CCC----CCHHHCCCCCCCCCC Q ss_conf 756-989----554242248376873 Q gi|254781033|r 395 ISD-DGY----LSSVWLPLSPISKTL 415 (492) Q Consensus 395 ~c~-~g~----~~~~W~~~c~~~~~~ 415 (492) .|+ ||+ ....|..+|++||.. T Consensus 113 FCg~CG~~~~~~~~g~~~~C~~cg~~ 138 (279) T COG2816 113 FCGRCGTKTYPREGGWARVCPKCGHE 138 (279) T ss_pred CCCCCCCCCCCCCCCEEEECCCCCCC T ss_conf 77778980743567324557988870 No 295 >pfam12583 TPPII_N Tripeptidyl peptidase II N terminal. This domain family is found in bacteria and eukaryotes, and is approximately 190 amino acids in length. The family is found in association with pfam00082. Tripeptidyl peptidase II (TPPII) is a crucial component of the proteolytic cascade acting downstream of the 26S proteasome in the ubiquitin-proteasome pathway. It is an amino peptidase belonging to the subtilase family removing tripeptides from the free N terminus of oligopeptides. Probab=53.95 E-value=12 Score=15.29 Aligned_cols=44 Identities=11% Similarity=0.096 Sum_probs=32.7 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 79998654552158869999999988842126989999999999 Q gi|254781033|r 155 FAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQY 198 (492) Q Consensus 155 ~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~ 198 (492) .++|-+-...+...|++.|.+++++.....|++..++..|++-. T Consensus 71 EsLRDfQ~s~i~K~e~E~AE~iY~ev~~~~P~hL~aHl~liqnl 114 (134) T pfam12583 71 ESLRDFQCSHIVKCDLENAEKIYNEVVAAHPKHLQAHLLLIQNI 114 (134) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHC T ss_conf 99988888999730788899999999987811599999999832 No 296 >pfam00301 Rubredoxin Rubredoxin. Probab=53.38 E-value=5.5 Score=17.93 Aligned_cols=11 Identities=18% Similarity=0.525 Sum_probs=6.8 Q ss_pred CEECC-CCCCCH Q ss_conf 52756-989554 Q gi|254781033|r 393 LWISD-DGYLSS 403 (492) Q Consensus 393 ~W~c~-~g~~~~ 403 (492) .|+|. |||+++ T Consensus 1 ky~C~~CgyiYd 12 (47) T pfam00301 1 KYVCKVCGYVYD 12 (47) T ss_pred CCCCCCCCCEEC T ss_conf 969888995889 No 297 >pfam09526 DUF2387 Probable metal-binding protein (DUF2387). Members of this family are small proteins, about 70 residues in length, with a basic triplet near the N-terminus and a probable metal-binding motif CPXCX(18)CXXC. Members are found in various proteobacteria. Probab=52.74 E-value=5.2 Score=18.07 Aligned_cols=16 Identities=6% Similarity=0.017 Sum_probs=13.5 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 2483768736400367 Q gi|254781033|r 407 PLSPISKTLCYFEWKI 422 (492) Q Consensus 407 ~~c~~~~~~~~~~W~~ 422 (492) .+||.|+..|++.|-. T Consensus 9 AvCP~C~~~D~i~~~~ 24 (61) T pfam09526 9 AVCPKCSAMDTIRMWR 24 (61) T ss_pred CCCCCCCCCCEEEEEE T ss_conf 4189875653798872 No 298 >KOG0890 consensus Probab=52.73 E-value=12 Score=15.15 Aligned_cols=68 Identities=22% Similarity=0.178 Sum_probs=51.9 Q ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHH-CCCCC Q ss_conf 89836899999999987998899999887866099989999999999985799899999999996-28899 Q gi|254781033|r 322 KESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSAL-HAMPD 391 (492) Q Consensus 322 P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~-~a~~d 391 (492) .+-.+.++..|+++..+|.++.|+.++-.+.+.. -+.++...|.+.-. .||...+-..|+.-+ ..-+| T Consensus 1667 ~~~ge~wLqsAriaR~aG~~q~A~nall~A~e~r-~~~i~~E~AK~lW~-~gd~~~Al~~Lq~~l~~~~~~ 1735 (2382) T KOG0890 1667 SRLGECWLQSARIARLAGHLQRAQNALLNAKESR-LPEIVLERAKLLWQ-TGDELNALSVLQEILSKNFPD 1735 (2382) T ss_pred CHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC-CCHHHHHHHHHHHH-HCCHHHHHHHHHHHHHHHCCC T ss_conf 0268999999999986233899999987644315-41679999999986-230887999999999860533 No 299 >COG2176 PolC DNA polymerase III, alpha subunit (gram-positive type) [DNA replication, recombination, and repair] Probab=52.61 E-value=7.5 Score=16.88 Aligned_cols=27 Identities=15% Similarity=0.028 Sum_probs=17.4 Q ss_pred CCCCCEECCCCCCCHHHC------------CCCCCCCCC Q ss_conf 899952756989554242------------248376873 Q gi|254781033|r 389 MPDPLWISDDGYLSSVWL------------PLSPISKTL 415 (492) Q Consensus 389 ~~dp~W~c~~g~~~~~W~------------~~c~~~~~~ 415 (492) |--|.++|..|+-++.-. --||.||.. T Consensus 910 PL~PHY~Cp~Cky~Ef~~d~svgsGfDLpdK~CPkCg~p 948 (1444) T COG2176 910 PLPPHYLCPECKYSEFIDDGSVGSGFDLPDKDCPKCGTP 948 (1444) T ss_pred CCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCCC T ss_conf 899653288874046623787677888899989867994 No 300 >pfam03850 Tfb4 Transcription factor Tfb4. Probab=52.60 E-value=6.2 Score=17.52 Aligned_cols=25 Identities=12% Similarity=0.123 Sum_probs=18.2 Q ss_pred CCCCCEECCCC-CCCHHHCCCCCCCC Q ss_conf 89995275698-95542422483768 Q gi|254781033|r 389 MPDPLWISDDG-YLSSVWLPLSPISK 413 (492) Q Consensus 389 ~~dp~W~c~~g-~~~~~W~~~c~~~~ 413 (492) .-|-.|+|..| .+.-+-.|.|+.|| T Consensus 246 ~VdiGyVCSvCLsIfC~~~~~C~tCg 271 (271) T pfam03850 246 VVDIGYVCSVCLSIFCEIPPICPTCG 271 (271) T ss_pred CCCEEEECHHCCHHHCCCCCCCCCCC T ss_conf 42202585202120137999888889 No 301 >PRK03824 hypA hydrogenase nickel incorporation protein; Provisional Probab=52.39 E-value=9 Score=16.25 Aligned_cols=10 Identities=10% Similarity=0.125 Sum_probs=8.2 Q ss_pred CCCCCCCCCC Q ss_conf 4837687364 Q gi|254781033|r 408 LSPISKTLCY 417 (492) Q Consensus 408 ~c~~~~~~~~ 417 (492) .||.||+.+. T Consensus 109 ~CP~Cgs~~~ 118 (135) T PRK03824 109 KCPKCGSRDF 118 (135) T ss_pred CCCCCCCCCC T ss_conf 2909989886 No 302 >COG5270 PUA domain (predicted RNA-binding domain) [Translation, ribosomal structure and biogenesis] Probab=51.96 E-value=9.4 Score=16.07 Aligned_cols=40 Identities=8% Similarity=-0.061 Sum_probs=25.7 Q ss_pred CCCCEECCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 99952756989554242248376873640036787554456 Q gi|254781033|r 390 PDPLWISDDGYLSSVWLPLSPISKTLCYFEWKIPTKSPEYI 430 (492) Q Consensus 390 ~dp~W~c~~g~~~~~W~~~c~~~~~~~~~~W~~p~~~~~~~ 430 (492) ..|.|-|.+|..+- =.+.|+.||+-..-.--+||.+++.. T Consensus 11 k~~iyWCe~cNlPl-~~~~c~~cg~~~~~l~LTpPaD~R~~ 50 (202) T COG5270 11 KFPIYWCEKCNLPL-LGRRCSVCGSKVEELRLTPPADVRPA 50 (202) T ss_pred CCCEEEHHHCCCCC-CCCCCCCCCCCCEEEEECCCCCCCCC T ss_conf 43064132288743-55524566776007886799874626 No 303 >pfam00558 Vpu Vpu protein. The Vpu protein contains an N-terminal transmembrane spanning region and a C-terminal cytoplasmic region. The HIV-1 Vpu protein stimulates virus production by enhancing the release of viral particles from infected cells. The VPU protein binds specifically to CD4. Probab=51.92 E-value=13 Score=15.06 Aligned_cols=37 Identities=11% Similarity=0.105 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999841828899999999999999999 Q gi|254781033|r 43 VILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKAL 88 (492) Q Consensus 43 ~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al 88 (492) .+++++++++++.+++|.++ +..|++.+|++|-.+.+ T Consensus 7 ~~ii~l~v~liiaIvVW~iv---------~~ey~k~~rqrkI~~l~ 43 (81) T pfam00558 7 IGLIALIVALIINIVVWTIV---------YREYRKIKKQREILRLI 43 (81) T ss_pred HHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999---------99999999999999999 No 304 >TIGR00823 EIIA-LAC PTS system, lactose-specific IIa component; InterPro: IPR003188 The phosphoenolpyruvate-dependent sugar phosphotransferase system (PTS) , is a major carbohydrate transport system in bacteria. The PTS catalyses the phosphorylation of incoming sugar substrates and coupled with translocation across the cell membrane, makes the PTS a link between the uptake and metabolism of sugars. The general mechanism of the PTS is the following: a phosphoryl group from phosphoenolpyruvate (PEP) is transferred via a signal transduction pathway, to enzyme I (EI) which in turn transfers it to a phosphoryl carrier, the histidine protein (HPr). Phospho-HPr then transfers the phosphoryl group to a sugar-specific permease, a membrane-bound complex known as enzyme 2 (EII), which transports the sugar to the cell. EII consists of at least three structurally distinct domains IIA, IIB and IIC . These can either be fused together in a single polypeptide chain or exist as two or three interactive chains, formerly called enzymes II (EII) and III (EIII). The first domain (IIA or EIIA) carries the first permease-specific phosphorylation site, a histidine which is phosphorylated by phospho-HPr. The second domain (IIB or EIIB) is phosphorylated by phospho-IIA on a cysteinyl or histidyl residue, depending on the sugar transported. Finally, the phosphoryl group is transferred from the IIB domain to the sugar substrate concomitantly with the sugar uptake processed by the IIC domain. This third domain (IIC or EIIC) forms the translocation channel and the specific substrate-binding site. An additional transmembrane domain IID, homologous to IIC, can be found in some PTSs, e.g. for mannose , , , . The lactose/cellobiose-specific family are one of four structurally and functionally distinct group IIA PTS system enzymes. This family of proteins normally function as a homotrimer, stabilized by a centrally located metal ion . Separation into subunits is thought to occur after phosphorylation. ; GO: 0005351 sugar:hydrogen ion symporter activity, 0006810 transport, 0009401 phosphoenolpyruvate-dependent sugar phosphotransferase system, 0016020 membrane. Probab=51.72 E-value=13 Score=15.04 Aligned_cols=30 Identities=13% Similarity=0.158 Sum_probs=22.3 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHCCC Q ss_conf 999999999997210047879999863013 Q gi|254781033|r 84 GYKALYTGLMSIAAHNIPLARKMHSYVSQQ 113 (492) Q Consensus 84 ~~~al~~gl~al~~Gd~~~A~k~~~~a~~~ 113 (492) |+..+..++-+.-.||++.|+++..++... T Consensus 17 aRS~a~eAl~AA~~Gdf~~A~~l~~~a~~~ 46 (99) T TIGR00823 17 ARSKALEALKAAKAGDFAKARELVEQAGEE 46 (99) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 999999999988206389999999998999 No 305 >pfam03833 PolC_DP2 DNA polymerase II large subunit DP2. Probab=51.63 E-value=8.4 Score=16.45 Aligned_cols=36 Identities=11% Similarity=-0.156 Sum_probs=20.7 Q ss_pred HHHHHHHHHCCCCC------CCEECC-CCCCCHHHCCCCCCCCCC Q ss_conf 99999999628899------952756-989554242248376873 Q gi|254781033|r 378 ILYWTQSALHAMPD------PLWISD-DGYLSSVWLPLSPISKTL 415 (492) Q Consensus 378 ar~wl~rA~~a~~d------p~W~c~-~g~~~~~W~~~c~~~~~~ 415 (492) .+.-+..|.+-.++ ..-.|. ||..+ |...||+||.- T Consensus 608 ~~R~i~~Aa~~~~g~i~vevg~R~C~~Cg~~t--~~~~C~~CG~~ 650 (852) T pfam03833 608 SRRDIKNAAKKTKGTIEVEVGFRRCPSCGKES--PESTCPKCGSR 650 (852) T ss_pred CHHHHHHHHHHCCCEEEEEEEEEECCCCCCCC--CCCCCCCCCCC T ss_conf 03109999861798289997003888999977--87149999982 No 306 >pfam04930 FUN14 FUN14 family. This family of short proteins are found in eukaryotes and some archaea. Although the function of these proteins is not known they may contain transmembrane helices. Probab=51.50 E-value=13 Score=15.01 Aligned_cols=32 Identities=3% Similarity=-0.007 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 79999999999999999973489848999899 Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYPEDVSITWGN 34 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~ 34 (492) |+..++.++....++..-++.+.||+.|+|.- T Consensus 18 K~~k~~a~~~G~~~l~lq~l~~~G~I~vnw~k 49 (97) T pfam04930 18 KVGKLAAIAVGGGILLLQYLAQKGYITVNWKK 49 (97) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCEEEECHHH T ss_conf 99999999999999999999979978966799 No 307 >pfam11234 DUF3036 Protein of unknown function (DUF3036). Some members in this family of proteins are annotated as yoaS. Currently no function is known. Probab=50.32 E-value=13 Score=14.88 Aligned_cols=96 Identities=17% Similarity=0.135 Sum_probs=47.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 6799999999999999999734898489998997662218--99999999999999999999999841828899999999 Q gi|254781033|r 2 LRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSP--FVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKR 79 (492) Q Consensus 2 ~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~--~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~r 79 (492) .|+.++++.+..+++...|+-.-.+...=...++.+...+ ....+..+.++++++-.|++++.+-+....-....+.- T Consensus 8 Lkivi~ligi~~Lal~i~~~p~la~~~~~~~p~~~~~~~p~l~~~y~~~ipf~~aLy~~~kLL~~I~~n~aFS~~sv~~l 87 (155) T pfam11234 8 LKIVIVLIGIPVLALCIFLVPLLANDTAELGPEIAYLLYPVLLGVYLGAIPFYVALYQAFKLLRYIDRNTAFSELSVRAL 87 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 99999999699999999998998032688586505999899999999999999999999999988615762169999999 Q ss_pred HHHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999997210 Q gi|254781033|r 80 NYDKGYKALYTGLMSIAAH 98 (492) Q Consensus 80 r~~k~~~al~~gl~al~~G 98 (492) |.=| +-++.-+.+.+..+ T Consensus 88 k~Ik-~ca~~is~l~~~~~ 105 (155) T pfam11234 88 KNIK-YCAIAISLLYFALL 105 (155) T ss_pred HHHH-HHHHHHHHHHHHHH T ss_conf 9999-99999999999998 No 308 >PRK11029 FtsH protease regulator HflC; Provisional Probab=50.27 E-value=13 Score=14.87 Aligned_cols=35 Identities=20% Similarity=0.362 Sum_probs=19.9 Q ss_pred CHHHHHHHHHHHHHHHHHHH-HHCCCC-EEEEEECCE Q ss_conf 96799999999999999999-734898-489998997 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFII-VSHYPE-DVSITWGNR 35 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~-l~~~~G-~v~i~~~~~ 35 (492) |.++++++++++++++...+ +.+..- .|++.||-+ T Consensus 1 m~~~ilviii~vlv~l~~s~yiV~e~e~aVVlrFGk~ 37 (334) T PRK11029 1 MRKSVIAIIIIVLVVLYMSVFVVKEGERGITLRFGKV 37 (334) T ss_pred CCHHHHHHHHHHHHHHHHEEEEECCCEEEEEEECCCE T ss_conf 9125999999999999856899757708999966955 No 309 >pfam06957 COPI_C Coatomer (COPI) alpha subunit C-terminus. This family represents the C-terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit. Coatomer (COPI) is a large cytosolic protein complex which forms a coat around vesicles budding from the Golgi apparatus. Such coatomer-coated vesicles have been proposed to play a role in many distinct steps of intracellular transport. Note that many family members also contain the pfam04053 domain. Probab=50.08 E-value=14 Score=14.85 Aligned_cols=49 Identities=10% Similarity=0.115 Sum_probs=34.1 Q ss_pred HHHHHHHCCCCHHHHHH-HHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 99999826898368999-99999987998899999887866099989999 Q gi|254781033|r 314 ALRLEEINKESVESLVI-VSKIALEMGSIDQAHAKAMLAMKIAPRKEIFL 362 (492) Q Consensus 314 ~~~l~~~~P~~~e~~~~-LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~ 362 (492) |-...++.|.|...-+. --.++.+.++|..|.....+++++.|+..+.. T Consensus 287 YFThc~LQp~H~~LaLr~AM~~afK~KNy~TAasFArRLLel~p~~~~a~ 336 (421) T pfam06957 287 YFTHCKLQPVHMILTLRSALNLFFKLKNFKTAASFARRLLELAPKPEVAQ 336 (421) T ss_pred HHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 98541674899999999999999986229999999999986499889999 No 310 >COG5159 RPN6 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones] Probab=49.73 E-value=14 Score=14.81 Aligned_cols=236 Identities=10% Similarity=0.058 Sum_probs=93.1 Q ss_pred HHHHHHHHHCCHHHHHHHHH----HHHCCCCH-HHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHCCCCH------HHHHH Q ss_conf 87532222100016899999----97268850-579998654552-15886999999998884212698------99999 Q gi|254781033|r 126 EVQIALAERQYNIAHEKLEM----MLQIPATR-EFAVYSLYFESC-RIGDLNSAQRYATKALDISPDAP------WVTEA 193 (492) Q Consensus 126 ~A~aA~~~gd~~~A~~~~~~----~~~~~~~~-~~~~~gL~~~al-~~gd~~~A~~~~~~a~~~~P~~~------~a~~~ 193 (492) ..+..-..|++..-.+.... |.+....+ .-.+|.|....- .....+.-+..+....+-..+.. ..-.. T Consensus 51 l~~lyv~~g~~~~l~~~i~~sre~m~~ftk~k~~KiirtLiekf~~~~dsl~dqi~v~~~~iewA~rEkr~fLr~~Le~K 130 (421) T COG5159 51 LFKLYVSKGDYCSLGDTITSSREAMEDFTKPKITKIIRTLIEKFPYSSDSLEDQIKVLTALIEWADREKRKFLRLELECK 130 (421) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99899716773428888774389999734056899999999966789854788999999999999888889999999999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHHCCC----CHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHH--CCCCCHHHH-HH Q ss_conf 99999952574569999999995023----1035789999999999996-6998999999987543--133527999-99 Q gi|254781033|r 194 VVQQYVLAKEWSRAITFLNQKKKNAK----EWNRNRAILLIARSLENAD-KGDMIASYHDAIESLK--LCDNSIMAS-IC 265 (492) Q Consensus 194 L~~l~~~~gdw~~A~~~l~~~~~~~~----~~~~~~a~l~~~~a~~~~~-~~~~~~A~~~~~~a~~--~~p~~~~a~-~~ 265 (492) +.-++.+.|.+.+|+.++...+.... ...-....++.+....... -......+..++-+.. ..|....+. -+ T Consensus 131 li~l~y~~~~YsdalalIn~ll~ElKk~DDK~~Li~vhllESKvyh~irnv~KskaSLTaArt~Ans~YCPpqlqa~lDL 210 (421) T COG5159 131 LIYLLYKTGKYSDALALINPLLHELKKYDDKINLITVHLLESKVYHEIRNVSKSKASLTAARTLANSAYCPPQLQAQLDL 210 (421) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHHHHHCCCCCEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHH T ss_conf 99999813527779998879999998616764201012566889999971024355778999885114788888878888 Q ss_pred -HHHHHHHCCCHHHHHHHHHHHHHHCCC--HH----HHHHHH---HHCCCCHH--HHHHHHHHHHH-HCCCCHHHHHHHH Q ss_conf -999999739947899999999983289--89----999999---85078958--99999999998-2689836899999 Q gi|254781033|r 266 -AAKSLISQNKKRKAEVILEKIWKVNPH--PE----IANIYT---HLLSENTV--GKLKRALRLEE-INKESVESLVIVS 332 (492) Q Consensus 266 -~a~~l~~~g~~~~A~~~le~a~~~~p~--p~----La~a~~---~~~~~~~~--~al~~~~~l~~-~~P~~~e~~~~LA 332 (492) -+-+.....++.-|...+-.+++.... .+ ...-|. .+..+... .++-+-+.-++ ......++...++ T Consensus 211 ~sGIlhcdd~dyktA~SYF~Ea~Egft~l~~d~kAc~sLkYmlLSkIMlN~~~evk~vl~~K~t~~~y~~r~I~am~ava 290 (421) T COG5159 211 LSGILHCDDRDYKTASSYFIEALEGFTLLKMDVKACVSLKYMLLSKIMLNRREEVKAVLRNKNTLKHYDDRMIRAMLAVA 290 (421) T ss_pred HCCCEEECCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 63621204555105899999998240234242999999999999999985499999998160267663145689999999 Q ss_pred HHHH--HCCCHHHHHHHHHHHHHCCCCHHHH Q ss_conf 9999--8799889999988786609998999 Q gi|254781033|r 333 KIAL--EMGSIDQAHAKAMLAMKIAPRKEIF 361 (492) Q Consensus 333 ~~~~--~~g~~~~Ar~~l~~al~~~P~~~~~ 361 (492) +++- ...+|..|.....+-+..+|=-|.+ T Consensus 291 ea~~NRsL~df~~aL~qY~~el~~D~~iRsH 321 (421) T COG5159 291 EAFGNRSLKDFSDALAQYSDELHQDSFIRSH 321 (421) T ss_pred HHHCCCCHHHHHHHHHHHHHHHCCCHHHHHH T ss_conf 9837874766999999850875048778999 No 311 >cd00730 rubredoxin Rubredoxin; nonheme iron binding domains containing a [Fe(SCys)4] center. Rubredoxins are small nonheme iron proteins. The iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc. They are believed to be involved in electron transfer. Probab=49.41 E-value=6.5 Score=17.34 Aligned_cols=10 Identities=20% Similarity=0.461 Sum_probs=5.2 Q ss_pred EECC-CCCCCH Q ss_conf 2756-989554 Q gi|254781033|r 394 WISD-DGYLSS 403 (492) Q Consensus 394 W~c~-~g~~~~ 403 (492) |+|. |||+++ T Consensus 2 y~C~~CgyiYd 12 (50) T cd00730 2 YECRICGYIYD 12 (50) T ss_pred CCCCCCCCEEC T ss_conf 79888993889 No 312 >COG5191 Uncharacterized conserved protein, contains HAT (Half-A-TPR) repeat [General function prediction only] Probab=49.37 E-value=14 Score=14.77 Aligned_cols=70 Identities=14% Similarity=0.158 Sum_probs=48.6 Q ss_pred HHHHHHCCHHHHHHHHHHHHCCCCHHHHH-HHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 32222100016899999972688505799-98654552158869999999988842126989999999999 Q gi|254781033|r 129 IALAERQYNIAHEKLEMMLQIPATREFAV-YSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQY 198 (492) Q Consensus 129 aA~~~gd~~~A~~~~~~~~~~~~~~~~~~-~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~ 198 (492) -+...|-+..-...|.+.++..|+...-+ ..-..+..-.++.+.++..+.++++.+|++|..+...+.+. T Consensus 116 Y~~k~k~y~~~~nI~~~~l~khP~nvdlWI~~c~~e~~~~ani~s~Ra~f~~glR~N~~~p~iw~eyfr~E 186 (435) T COG5191 116 YVIKKKMYGEMKNIFAECLTKHPLNVDLWIYCCAFELFEIANIESSRAMFLKGLRMNSRSPRIWIEYFRME 186 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCEEEEEECCCHHHHHCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHH T ss_conf 99998789899999999996199874056563000354415679899999865314889842999999999 No 313 >PRK06260 threonine synthase; Validated Probab=49.19 E-value=9.5 Score=16.06 Aligned_cols=21 Identities=5% Similarity=-0.035 Sum_probs=9.1 Q ss_pred CCHHHHHHHHHHHHHCCCCCH Q ss_conf 998999999987543133527 Q gi|254781033|r 240 GDMIASYHDAIESLKLCDNSI 260 (492) Q Consensus 240 ~~~~~A~~~~~~a~~~~p~~~ 260 (492) |...-+.+.+.+.....|+.+ T Consensus 204 G~kTi~~Ei~eQl~~~~PD~v 224 (400) T PRK06260 204 GQKTIAFEIYDQLDGEVPDRV 224 (400) T ss_pred CHHHHHHHHHHHHCCCCCCEE T ss_conf 066899999998268899989 No 314 >PRK00241 nudC NADH pyrophosphatase; Reviewed Probab=49.18 E-value=7.4 Score=16.91 Aligned_cols=22 Identities=18% Similarity=0.306 Sum_probs=16.4 Q ss_pred ECC-CCC----CCHHHCCCCCCCCCCC Q ss_conf 756-989----5542422483768736 Q gi|254781033|r 395 ISD-DGY----LSSVWLPLSPISKTLC 416 (492) Q Consensus 395 ~c~-~g~----~~~~W~~~c~~~~~~~ 416 (492) .|+ ||. ....|...|++||..- T Consensus 102 fC~~CG~~t~~~~~g~~~~C~~C~~~~ 128 (257) T PRK00241 102 FCGYCGHPTHPSKTEWAMLCPHCRERY 128 (257) T ss_pred CCCCCCCCCCCCCCCEEEECCCCCCCC T ss_conf 456567366338785069889998631 No 315 >COG4178 ABC-type uncharacterized transport system, permease and ATPase components [General function prediction only] Probab=48.92 E-value=14 Score=14.72 Aligned_cols=56 Identities=13% Similarity=0.202 Sum_probs=34.9 Q ss_pred HHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999973489848999899766221899999999999999999999999841 Q gi|254781033|r 12 SLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLS 67 (492) Q Consensus 12 ~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~ 67 (492) ..++.....+-+..|.+.+..+|+.++.+-..+.++++-+++..++.+++-+-+.+ T Consensus 177 islisF~~iLw~lsg~~~~~~~g~~~~I~g~mv~~~i~Ya~~~s~~~~~iGr~Li~ 232 (604) T COG4178 177 ISLISFTGILWSLSGGLAFTIGGFSFGIPGYMVWAVIIYAISASLLTHLIGRPLIR 232 (604) T ss_pred HHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCCC T ss_conf 99999999999706762586236640141489999999999999999997166502 No 316 >PRK06450 threonine synthase; Validated Probab=48.87 E-value=9.4 Score=16.08 Aligned_cols=10 Identities=20% Similarity=0.577 Sum_probs=5.0 Q ss_pred HHHHCCHHHH Q ss_conf 9972100478 Q gi|254781033|r 93 MSIAAHNIPL 102 (492) Q Consensus 93 ~al~~Gd~~~ 102 (492) +.|.+|+... T Consensus 50 vsLgeG~TPl 59 (336) T PRK06450 50 ISLGEGETPI 59 (336) T ss_pred EECCCCCCCC T ss_conf 4327887861 No 317 >TIGR00756 PPR pentatricopeptide repeat domain; InterPro: IPR002885 Pentatricopeptide repeat proteins are characterised by the presence of a tandem array of repeats, where the number of PPR motifs controls the affinity and specificity of the PPR protein for RNA. These proteins occur predominantly in plants, where they appear to play essential roles in RNA/DNA metabolism in mitochondria and chloroplasts . It has been suggested that each of the highly variable PPR proteins is a gene-specific regulator of plant organellar RNA metabolism. PPR proteins may also play a role in organelle biogenesis, probably via binding to organellar transcripts . Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation , and crp1, which is involved in RNA processing . The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organization similar to the human BRCA1 protein.. Probab=48.67 E-value=14 Score=14.70 Aligned_cols=28 Identities=11% Similarity=0.226 Sum_probs=17.7 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCC Q ss_conf 9999999995257456999999999502 Q gi|254781033|r 191 TEAVVQQYVLAKEWSRAITFLNQKKKNA 218 (492) Q Consensus 191 ~~~L~~l~~~~gdw~~A~~~l~~~~~~~ 218 (492) ...++..+++.|++++|..++.+..+.+ T Consensus 3 yn~li~~~~~~g~~~~a~~~~~~M~~~g 30 (35) T TIGR00756 3 YNTLIDGLCKAGRVEEALELFDEMKERG 30 (35) T ss_pred HHHHHHHHHCCCCHHHHHHHHHHHHHCC T ss_conf 8988999862898789999999998778 No 318 >COG0846 SIR2 NAD-dependent protein deacetylases, SIR2 family [Transcription] Probab=48.61 E-value=9.5 Score=16.06 Aligned_cols=24 Identities=13% Similarity=-0.208 Sum_probs=15.7 Q ss_pred CCEECC-CCCCCHHHC----------CCCCCCCCC Q ss_conf 952756-989554242----------248376873 Q gi|254781033|r 392 PLWISD-DGYLSSVWL----------PLSPISKTL 415 (492) Q Consensus 392 p~W~c~-~g~~~~~W~----------~~c~~~~~~ 415 (492) ..|.|. ||.....|. |.||.||.. T Consensus 121 ~~~~C~~C~~~~~~~~~~~~~~~~~~p~C~~Cg~~ 155 (250) T COG0846 121 KRVRCSKCGNQYYDEDVIKFIEDGLIPRCPKCGGP 155 (250) T ss_pred EEEEECCCCCCCCHHHHHHHHCCCCCCCCCCCCCC T ss_conf 13576687573652143331034799868767996 No 319 >TIGR01405 polC_Gram_pos DNA polymerase III, alpha subunit, Gram-positive type; InterPro: IPR006308 These are the polypeptide chains of DNA polymerase III. Full-length homologs of this protein are restricted to the Gram-positive lineages, including the Mycoplasmas. This protein is designated alpha chain and given the gene symbol polC, but is not a full-length homolog of other polC genes. The N-terminal region of about 200 amino acids is rich in low-complexity sequence and poorly alignable. ; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0005737 cytoplasm. Probab=48.19 E-value=9.8 Score=15.95 Aligned_cols=26 Identities=12% Similarity=-0.013 Sum_probs=16.1 Q ss_pred CCCCCEECCCCCCCHH------------HCCCCCCCCC Q ss_conf 8999527569895542------------4224837687 Q gi|254781033|r 389 MPDPLWISDDGYLSSV------------WLPLSPISKT 414 (492) Q Consensus 389 ~~dp~W~c~~g~~~~~------------W~~~c~~~~~ 414 (492) |--|..+|..|+-++- =---||.||. T Consensus 713 PL~PHY~Cp~Cky~Ef~~D~~~~~GfDLp~K~CP~Cga 750 (1264) T TIGR01405 713 PLPPHYLCPNCKYSEFVTDGSVGSGFDLPDKDCPKCGA 750 (1264) T ss_pred CCCCCCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCC T ss_conf 57687508787355300378778877685788888887 No 320 >PRK09458 pspB phage shock protein B; Provisional Probab=48.14 E-value=14 Score=14.64 Aligned_cols=18 Identities=11% Similarity=0.239 Sum_probs=6.3 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999 Q gi|254781033|r 42 FVILSILYFFLFAWILLF 59 (492) Q Consensus 42 ~~~i~~~~~~~~~~~ll~ 59 (492) .|+++-++++++++.-+| T Consensus 4 ~fl~vP~iiF~~fVaPiW 21 (75) T PRK09458 4 LFLAIPLIIFVLFVAPIW 21 (75) T ss_pred EEEHHHHHHHHHHHHHHH T ss_conf 543281999999999999 No 321 >pfam06667 PspB Phage shock protein B. This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one. Probab=47.68 E-value=15 Score=14.59 Aligned_cols=21 Identities=14% Similarity=0.127 Sum_probs=7.5 Q ss_pred EHHHHHHHHHHHHHHHHHHHH Q ss_conf 218999999999999999999 Q gi|254781033|r 39 TSPFVILSILYFFLFAWILLF 59 (492) Q Consensus 39 ~s~~~~i~~~~~~~~~~~ll~ 59 (492) ||..|+.+-++++++++..+| T Consensus 1 M~~~fl~vPliiF~ifVaP~W 21 (75) T pfam06667 1 MSMEFLAVPLIIFMLFVAPIW 21 (75) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 963898997999999999999 No 322 >COG3947 Response regulator containing CheY-like receiver and SARP domains [Signal transduction mechanisms] Probab=47.60 E-value=15 Score=14.58 Aligned_cols=59 Identities=19% Similarity=0.116 Sum_probs=35.2 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCH-HHH-HHHHHHHHHHCCCHHHHHH---HHHHHHCCC Q ss_conf 999999998799889999988786609998-999-9999999985799899999---999996288 Q gi|254781033|r 329 VIVSKIALEMGSIDQAHAKAMLAMKIAPRK-EIF-LLLAQIEQANSHNTDKILY---WTQSALHAM 389 (492) Q Consensus 329 ~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~-~~~-~llA~ie~~e~~d~~~ar~---wl~rA~~a~ 389 (492) -..++.|..+|.+.+|-+.-+++++.+|=. ..+ .+|+-+ .. .||.-.+.. -+.+-+.+. T Consensus 283 gkva~~yle~g~~neAi~l~qr~ltldpL~e~~nk~lm~~l-a~-~gD~is~~khyerya~vleae 346 (361) T COG3947 283 GKVARAYLEAGKPNEAIQLHQRALTLDPLSEQDNKGLMASL-AT-LGDEISAIKHYERYAEVLEAE 346 (361) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHH-HH-HCCCHHHHHHHHHHHHHHHHH T ss_conf 99999999759927899999877202704467899999999-98-065043423899999999999 No 323 >PRK05580 primosome assembly protein PriA; Validated Probab=47.43 E-value=9.6 Score=16.03 Aligned_cols=10 Identities=30% Similarity=0.697 Sum_probs=4.5 Q ss_pred CCE-EEEEECC Q ss_conf 984-8999899 Q gi|254781033|r 25 PED-VSITWGN 34 (492) Q Consensus 25 ~G~-v~i~~~~ 34 (492) +|. |.+-|++ T Consensus 30 ~G~rV~VpFg~ 40 (699) T PRK05580 30 PGVRVRVPFGG 40 (699) T ss_pred CCEEEEEECCC T ss_conf 97089996189 No 324 >PRK11486 flagellar biosynthesis protein FliO; Provisional Probab=47.25 E-value=14 Score=14.63 Aligned_cols=26 Identities=4% Similarity=-0.055 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 18999999999999999999999998 Q gi|254781033|r 40 SPFVILSILYFFLFAWILLFAVSRFF 65 (492) Q Consensus 40 s~~~~i~~~~~~~~~~~ll~~l~~~i 65 (492) ++...++.+++++.+++++.|+++.+ T Consensus 17 ~Llqv~~~L~~Vi~lI~~~aWL~rR~ 42 (124) T PRK11486 17 PLLQVSGALIAIIALILAAAWLVKRL 42 (124) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 69999999999999999999999982 No 325 >pfam09323 DUF1980 Domain of unknown function (DUF1980). Members of this family are found in a set of prokaryotic hypothetical proteins. Their exact function, has not, as yet, been defined. Probab=47.13 E-value=15 Score=14.53 Aligned_cols=14 Identities=21% Similarity=0.190 Sum_probs=7.5 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 96799999999999 Q gi|254781033|r 1 MLRLIRYFFVISLV 14 (492) Q Consensus 1 M~r~l~~~~~i~~~ 14 (492) |+|.++++.+-.+. T Consensus 1 mir~lILlgf~~l~ 14 (179) T pfam09323 1 MIRFLILLGFGFLF 14 (179) T ss_pred CHHHHHHHHHHHHH T ss_conf 99999999999999 No 326 >pfam06676 DUF1178 Protein of unknown function (DUF1178). This family consists of several hypothetical bacterial proteins of around 150 residues in length. The function of this family is unknown. Probab=47.00 E-value=11 Score=15.61 Aligned_cols=23 Identities=17% Similarity=0.258 Sum_probs=16.4 Q ss_pred EECCCCCCCHHHC--------------CCCCCCCCCC Q ss_conf 2756989554242--------------2483768736 Q gi|254781033|r 394 WISDDGYLSSVWL--------------PLSPISKTLC 416 (492) Q Consensus 394 W~c~~g~~~~~W~--------------~~c~~~~~~~ 416 (492) -.|++||..+-|= -.||.||+-+ T Consensus 6 L~C~~~H~FEgWF~ss~~fe~Q~~~gli~CP~Cgs~~ 42 (145) T pfam06676 6 LRCDHGHRFEGWFASSADFDKQQARGLVSCPVCGSTE 42 (145) T ss_pred EECCCCCCCCCCCCCHHHHHHHHHCCCCCCCCCCCCE T ss_conf 7679999564145688999999986991799999983 No 327 >TIGR00869 sec62 protein translocation protein, Sec62 family; InterPro: IPR011553 Members of the NSCC2 family have been sequenced from various fungal and animal species including Saccharomyces cerevisiae, Drosophila melanogaster and Homo sapiens. These proteins are the Sec62 proteins, believed to be associated with the Sec61 and Sec63 constituents of the general protein secretary systems of yeast microsomes. They are also the non-selective cation (NS) channels of the mammalian cytoplasmic membrane. The yeast Sec62 protein has been shown to be essential for cell growth. The mammalian NS channel proteins have been implicated in platelet derived growth factor(PGDF) dependent single channel current in fibroblasts. These channels are essentially closed in serum deprived tissue-culture cells and are specifically opened by exposure to PDGF. These channels are reported to exhibit equal selectivity for Na+, K+ and Cs+ with low permeability to Ca2+, and no permeability to anions. This family of Sec62 proteins is restricted to the Ascomycota. ; GO: 0008565 protein transporter activity, 0015031 protein transport, 0030176 integral to endoplasmic reticulum membrane. Probab=46.89 E-value=7.2 Score=16.98 Aligned_cols=17 Identities=6% Similarity=0.006 Sum_probs=9.1 Q ss_pred HH-HHCCCCEEEEEECCE Q ss_conf 99-734898489998997 Q gi|254781033|r 19 II-VSHYPEDVSITWGNR 35 (492) Q Consensus 19 ~~-l~~~~G~v~i~~~~~ 35 (492) .| +.-+.|.=.++.|+. T Consensus 143 LWP~fmR~G~wYlS~G~l 160 (259) T TIGR00869 143 LWPIFMRRGSWYLSLGAL 160 (259) T ss_pred CCHHHHHCCCHHHHHHHH T ss_conf 121456355120045578 No 328 >PRK09934 putative fimbrial protein; Provisional Probab=46.42 E-value=15 Score=14.45 Aligned_cols=37 Identities=14% Similarity=0.105 Sum_probs=14.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEE Q ss_conf 96799999999999999999734898489998997662 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYR 38 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie 38 (492) |.|+++.++..++. ......+...|.+.|.|.|..++ T Consensus 1 mkk~~l~~~~~l~~-~~~~~aa~~~g~g~i~F~G~I~~ 37 (171) T PRK09934 1 MRRVFIAIFCGLLW-SPLSQAASPLGEINIELRGNVVD 37 (171) T ss_pred CHHHHHHHHHHHHH-HHHHHHCCCCCEEEEEEEEEEEE T ss_conf 92689999999998-63442113588058999999980 No 329 >pfam05140 ResB ResB-like family. This family includes both ResB and cytochrome c biogenesis proteins. Mutations in ResB indicate that they are essential for growth. ResB is predicted to be a transmembrane protein. Probab=46.32 E-value=8.2 Score=16.54 Aligned_cols=52 Identities=8% Similarity=0.076 Sum_probs=26.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCC---------C----EEEEEECCEEEEEHHHHHHHHHHHHH Q ss_conf 9679999999999999999973489---------8----48999899766221899999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYP---------E----DVSITWGNRLYRTSPFVILSILYFFL 52 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~---------G----~v~i~~~~~~ie~s~~~~i~~~~~~~ 52 (492) |.--+++++++++++..++.+-.+. | .+.-.++=+.+=.|.+|..+.+++++ T Consensus 1 mr~ai~LL~llaiaSiiGTvipQ~~~~~~Y~~~yg~~~~~i~~~Lgl~~vY~S~WF~~ll~lL~i 65 (437) T pfam05140 1 LRLAISLLLLLAIASIIGTVIPQREPVAFYIENYGPFLGPIIDRLGLFDVYSSWWFLALLALLFV 65 (437) T ss_pred CCHHHHHHHHHHHHHHCCEECCCCCCHHHHHHHHCCHHHHHHHHCCCCCEECCHHHHHHHHHHHH T ss_conf 90499999999999751035567996899998969589999997599830667999999999999 No 330 >pfam10009 DUF2252 Uncharacterized protein conserved in bacteria (DUF2252). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=45.74 E-value=8.6 Score=16.40 Aligned_cols=25 Identities=12% Similarity=0.108 Sum_probs=13.4 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 4789999999998328989999999 Q gi|254781033|r 276 KRKAEVILEKIWKVNPHPEIANIYT 300 (492) Q Consensus 276 ~~~A~~~le~a~~~~p~p~La~a~~ 300 (492) ..++.++..-.-..+++++....|. T Consensus 271 ~~~g~RVv~gQR~lq~~sD~fLG~~ 295 (383) T pfam10009 271 EHEGERVVEGQRALQAASDPFLGWT 295 (383) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEE T ss_conf 8727899999998660788623348 No 331 >LOAD_little_fing consensus Probab=45.54 E-value=11 Score=15.52 Aligned_cols=25 Identities=12% Similarity=0.041 Sum_probs=20.3 Q ss_pred CCCEECC-CCCCCHHHCCCCCCCCCC Q ss_conf 9952756-989554242248376873 Q gi|254781033|r 391 DPLWISD-DGYLSSVWLPLSPISKTL 415 (492) Q Consensus 391 dp~W~c~-~g~~~~~W~~~c~~~~~~ 415 (492) ...|+|. |++..-.|...|..|+.- T Consensus 2 ~gdW~C~~C~~~Nf~~r~~C~~C~~~ 27 (31) T LOAD_little_fi 2 PGDWICPKCTFLNFARRSSCNRCGAP 27 (31) T ss_pred CCCCCCCCCCCEECCCCCCCCCCCCC T ss_conf 97876787978245225813488896 No 332 >pfam04781 DUF627 Protein of unknown function (DUF627). This family represents the N-terminal region of several plant proteins of unknown function. Probab=45.49 E-value=16 Score=14.35 Aligned_cols=26 Identities=12% Similarity=0.058 Sum_probs=14.3 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCH Q ss_conf 55215886999999998884212698 Q gi|254781033|r 163 ESCRIGDLNSAQRYATKALDISPDAP 188 (492) Q Consensus 163 ~al~~gd~~~A~~~~~~a~~~~P~~~ 188 (492) -..+.||.-.|+++.++.....+++. T Consensus 5 ~~~~~GnhiKAL~iied~is~h~~~~ 30 (112) T pfam04781 5 DLFAKGDYIKALEIIEDSISDHGKDE 30 (112) T ss_pred HHHHCCCHHHHHHHHHHHHHHCCCCC T ss_conf 99876575999999999998726764 No 333 >COG5336 Uncharacterized protein conserved in bacteria [Function unknown] Probab=45.46 E-value=16 Score=14.34 Aligned_cols=43 Identities=21% Similarity=0.121 Sum_probs=26.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHH Q ss_conf 799999999999999999734898489998997662218999999999999999 Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWI 56 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ 56 (492) |+..=||-=.++.++..|+.|+ -.-||++++|+.+++-|.+-+ T Consensus 47 klssefIsGilVGa~iG~llD~-----------~agTsPwglIv~lllGf~AG~ 89 (116) T COG5336 47 KLSSEFISGILVGAGIGWLLDK-----------FAGTSPWGLIVFLLLGFGAGV 89 (116) T ss_pred HHHHHHHHHHHHHHHHHHHHHH-----------HCCCCCHHHHHHHHHHHHHHH T ss_conf 8999998788999999999997-----------617886799999999998999 No 334 >PRK11059 regulatory protein CsrD; Provisional Probab=45.43 E-value=16 Score=14.34 Aligned_cols=54 Identities=19% Similarity=0.096 Sum_probs=25.6 Q ss_pred EEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 2218999999999999999999999998418288999999999999999999999997210047879 Q gi|254781033|r 38 RTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLAR 104 (492) Q Consensus 38 e~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~~gl~al~~Gd~~~A~ 104 (492) ..|+....++.+.+++++.++++.++|+ |++=.|.+.|..=---...|+.+.+. T Consensus 135 ~~s~~~~~~~~~~~~~~~~~l~~~~~wl-------------~~ql~g~e~Le~R~~~il~g~~~~~~ 188 (642) T PRK11059 135 FYSLYATAILTLAIGFIVLFLFLGVRWL-------------RRQLAGQELLEERARRILNGEREQAV 188 (642) T ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHCCCHHHHHHHHHHHHCCCHHHHC T ss_conf 1579999999999999999999999999-------------98720769999999988669767644 No 335 >pfam10112 Halogen_Hydrol 5-bromo-4-chloroindolyl phosphate hydrolysis protein. Members of this family of prokaryotic proteins mediate the hydrolysis of 5-bromo-4-chloroindolyl phosphate bonds. Probab=45.35 E-value=16 Score=14.33 Aligned_cols=57 Identities=11% Similarity=0.154 Sum_probs=26.4 Q ss_pred CHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHC---CCCHHHHHHHHHHH Q ss_conf 0016899999972688505799986545521588699999999888421---26989999999999 Q gi|254781033|r 136 YNIAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDIS---PDAPWVTEAVVQQY 198 (492) Q Consensus 136 ~~~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~---P~~~~a~~~L~~l~ 198 (492) ...++..|+... .+|.++...+.-+..+ .+.+.+..++-..+. -++......+-+.. T Consensus 107 ~~i~r~If~~v~-~~P~~~~~a~~F~~~~-----Lp~~v~l~ekY~~L~~qpvk~~e~~~~l~~t~ 166 (199) T pfam10112 107 LRISRRIFKAVV-KEPKRFYKAERFLYSH-----LPNAVELTEKYAELSKQPVKNKDIYQKLEETR 166 (199) T ss_pred HHHHHHHHHHHH-HCHHHHHHHHHHHHHH-----CHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHH T ss_conf 999999999998-6979888889999988-----60899999999999868799999999999999 No 336 >COG4260 Membrane protease subunit, stomatin/prohibitin family [Amino acid transport and metabolism] Probab=45.30 E-value=11 Score=15.50 Aligned_cols=25 Identities=20% Similarity=0.211 Sum_probs=17.7 Q ss_pred HHHCCCCCCCEECC-CCCCCHHHCCCCCCCC Q ss_conf 99628899952756-9895542422483768 Q gi|254781033|r 384 SALHAMPDPLWISD-DGYLSSVWLPLSPISK 413 (492) Q Consensus 384 rA~~a~~dp~W~c~-~g~~~~~W~~~c~~~~ 413 (492) .|-.+.|-..|.|. |+... .|++|| T Consensus 297 qaqaa~pa~t~~~~r~~k~n-----fc~ncG 322 (345) T COG4260 297 QAQAAAPAATWPCARCAKLN-----FCLNCG 322 (345) T ss_pred HHHHCCCCCCCCCHHCCCCC-----CCCCCC T ss_conf 46541875568510005655-----033468 No 337 >KOG1497 consensus Probab=44.52 E-value=16 Score=14.24 Aligned_cols=126 Identities=14% Similarity=0.091 Sum_probs=75.4 Q ss_pred HHHHHHCCCCHHHHHHHHHHHH-HHCCCCH-------HHHHHHHHHHHHCCCCHHHHHHHHHHHHCC-C--CHHHHHHHH Q ss_conf 6545521588699999999888-4212698-------999999999995257456999999999502-3--103578999 Q gi|254781033|r 160 LYFESCRIGDLNSAQRYATKAL-DISPDAP-------WVTEAVVQQYVLAKEWSRAITFLNQKKKNA-K--EWNRNRAIL 228 (492) Q Consensus 160 L~~~al~~gd~~~A~~~~~~a~-~~~P~~~-------~a~~~L~~l~~~~gdw~~A~~~l~~~~~~~-~--~~~~~~a~l 228 (492) +....+..-..+.........+ .+.|.-. -....|+.+|...++|..|-..+...-++. + ........+ T Consensus 67 l~~~~l~~l~~e~~Kei~~~~l~~iq~rvisfeEqv~~irl~LAsiYE~Eq~~~~aaq~L~~I~~~tg~~~~d~~~kl~l 146 (399) T KOG1497 67 LFDVELSILEDELRKEISHFTLEKIQPRVISFEEQVASIRLHLASIYEKEQNWRDAAQVLVGIPLDTGQKAYDVEQKLLL 146 (399) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHH T ss_conf 99988433988999999997777415220569999999999999999876469999998854276654344125788999 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH-----HCCCC-CHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 9999999996699899999998754-----31335-279999999999973994789999999 Q gi|254781033|r 229 LIARSLENADKGDMIASYHDAIESL-----KLCDN-SIMASICAAKSLISQNKKRKAEVILEK 285 (492) Q Consensus 229 ~~~~a~~~~~~~~~~~A~~~~~~a~-----~~~p~-~~~a~~~~a~~l~~~g~~~~A~~~le~ 285 (492) ....+..+++.+|..++.....++. ..+++ .+..-+++|+.+-..+++-+|.+-+-. T Consensus 147 ~iriarlyLe~~d~veae~~inRaSil~a~~~Ne~Lqie~kvc~ARvlD~krkFlEAAqrYye 209 (399) T KOG1497 147 CIRIARLYLEDDDKVEAEAYINRASILQAESSNEQLQIEYKVCYARVLDYKRKFLEAAQRYYE 209 (399) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999987158477899998887786640368999999999999998888899999999999 No 338 >PRK12380 hydrogenase nickel incorporation protein HybF; Provisional Probab=44.46 E-value=11 Score=15.71 Aligned_cols=26 Identities=12% Similarity=-0.098 Sum_probs=18.2 Q ss_pred CCCCEEC-CCCC--CCHHHCCCCCCCCCCC Q ss_conf 9995275-6989--5542422483768736 Q gi|254781033|r 390 PDPLWIS-DDGY--LSSVWLPLSPISKTLC 416 (492) Q Consensus 390 ~dp~W~c-~~g~--~~~~W~~~c~~~~~~~ 416 (492) |-.. .| +||+ ..++....||.||+++ T Consensus 68 p~~~-~C~~C~~~~~~~~~~~~CP~Cgs~~ 96 (113) T PRK12380 68 PAQA-WCWDCSQVVEIHQHDAQCPHCHGER 96 (113) T ss_pred CCEE-ECCCCCCEEECCCCCCCCCCCCCCC T ss_conf 8579-9655999872478888390697996 No 339 >TIGR00934 2a38euk potassium uptake protein, Trk family; InterPro: IPR004773 The proteins of the Trk family are derived from Gram-negative and Gram-positive bacteria, yeast and Triticum aestivum (Wheat). The proteins of Escherichia coli K12 TrkH and TrkG as well as several yeast proteins have been functionally characterised. The E. coli TrkH and TrkG proteins are complexed to two peripheral membrane proteins, TrkA, an NAD-binding protein, and TrkE, an ATP-binding protein. This complex forms the potassium uptake system. This family is specific for the eukaryotic Trk system.; GO: 0015079 potassium ion transmembrane transporter activity, 0006813 potassium ion transport, 0016021 integral to membrane. Probab=44.34 E-value=16 Score=14.22 Aligned_cols=52 Identities=13% Similarity=0.104 Sum_probs=40.1 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 58999999999982689836899999999987998899999887866099989 Q gi|254781033|r 307 TVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 307 ~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~ 359 (492) |...-+|+...-.+||.....+.--+---+-++.+++|+..=+ -++..|+.+ T Consensus 454 PRk~skRvst~DDLnP~s~~~~~kk~sk~Y~mkH~PkA~~i~~-~~~r~~~~~ 505 (1197) T TIGR00934 454 PRKISKRVSTFDDLNPKSIVLYRKKASKKYLMKHLPKARLIQE-QVERRPSTS 505 (1197) T ss_pred CHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHH-HHCCCCCCC T ss_conf 4334311101121575311000001133458672543442454-313578877 No 340 >PRK10316 hypothetical protein; Provisional Probab=43.46 E-value=17 Score=14.12 Aligned_cols=22 Identities=23% Similarity=0.327 Sum_probs=10.9 Q ss_pred HHHHHHHHCCCHHHHHHHHHHH Q ss_conf 9999999739947899999999 Q gi|254781033|r 265 CAAKSLISQNKKRKAEVILEKI 286 (492) Q Consensus 265 ~~a~~l~~~g~~~~A~~~le~a 286 (492) ..+..+...|++-+|-..|+++ T Consensus 174 ~~A~~ll~~gkyyeAn~aLk~a 195 (209) T PRK10316 174 ADAQKLLDKGKYYEANLALKGA 195 (209) T ss_pred HHHHHHHHCCCHHHHHHHHHHH T ss_conf 9999998669844789999974 No 341 >COG2268 Uncharacterized protein conserved in bacteria [Function unknown] Probab=43.28 E-value=17 Score=14.10 Aligned_cols=10 Identities=0% Similarity=-0.429 Sum_probs=3.1 Q ss_pred HHHHHHHHCC Q ss_conf 9999997210 Q gi|254781033|r 89 YTGLMSIAAH 98 (492) Q Consensus 89 ~~gl~al~~G 98 (492) .+..-..+.| T Consensus 137 ~~~~~~~l~~ 146 (548) T COG2268 137 EQLAEDTLEG 146 (548) T ss_pred HHHHHHHHHH T ss_conf 9999999999 No 342 >COG5294 Uncharacterized protein conserved in bacteria [Function unknown] Probab=43.11 E-value=14 Score=14.65 Aligned_cols=38 Identities=16% Similarity=0.099 Sum_probs=22.3 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCC------CEEEEEECCEEEE Q ss_conf 9679999999999999999973489------8489998997662 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYP------EDVSITWGNRLYR 38 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~------G~v~i~~~~~~ie 38 (492) |.|+++.++.+++++.++.++..+. -+|.+...+-.++ T Consensus 1 MKkil~~ilall~~ii~a~~~~~~~~~d~fnpyvk~~~~y~k~~ 44 (113) T COG5294 1 MKKILIGILALLLIIIGALFIFYNINYDRFNPYVKITDSYAKVE 44 (113) T ss_pred CCCHHHHHHHHHHHHHHHHEEEEECCCCCCCCEEEEECCCEECC T ss_conf 90439999999999986621898213588772289862530313 No 343 >COG4031 Predicted metal-binding protein [General function prediction only] Probab=43.09 E-value=11 Score=15.64 Aligned_cols=12 Identities=0% Similarity=-0.261 Sum_probs=4.7 Q ss_pred CCCCCCCCCCCC Q ss_conf 248376873640 Q gi|254781033|r 407 PLSPISKTLCYF 418 (492) Q Consensus 407 ~~c~~~~~~~~~ 418 (492) ..|++||+-++. T Consensus 13 ~~Cl~Cg~~~av 24 (227) T COG4031 13 AFCLNCGRRHAV 24 (227) T ss_pred CHHCCCCCCCEE T ss_conf 110036996435 No 344 >pfam10058 DUF2296 Predicted integral membrane metal-binding protein (DUF2296). This domain, found in various hypothetical bacterial and eukaryotic metal-binding proteins, has no known function. Probab=42.56 E-value=16 Score=14.32 Aligned_cols=24 Identities=17% Similarity=0.033 Sum_probs=14.1 Q ss_pred CEECCC-----CCC----CHHHCCCCCCCCCCC Q ss_conf 527569-----895----542422483768736 Q gi|254781033|r 393 LWISDD-----GYL----SSVWLPLSPISKTLC 416 (492) Q Consensus 393 ~W~c~~-----g~~----~~~W~~~c~~~~~~~ 416 (492) .-+|.. |.. .+.++..|++||.++ T Consensus 19 aLIC~~C~~hNG~a~~~~~~~~~y~C~~C~~~N 51 (51) T pfam10058 19 ALICSKCHSHNGLAPKEEFEEVTYRCPHCGALN 51 (51) T ss_pred EEECHHHHHHCCCCCCCCCHHEEEECCCCCCCC T ss_conf 148523453288678886021378898777659 No 345 >TIGR02827 RNR_anaer_Bdell anaerobic ribonucleoside-triphosphate reductase; InterPro: IPR014192 Members of this entry belong to the class III anaerobic ribonucleoside-triphosphate reductases (RNR). These glycine-radical-containing enzymes are oxygen-sensitive and operate under anaerobic conditions. The genes for this family are paired with genes for an activating protein that creates a glycine radical. Members of this entry are related to IPR012833 from INTERPRO, a functionally equivalent protein set; no genome has members in both entries. Identification as RNR is supported by gene pairing with the activating protein, lack of other anaerobic RNR, and presence of an upstream regulatory element strongly conserved upstream of most RNR operons .. Probab=42.48 E-value=12 Score=15.32 Aligned_cols=25 Identities=8% Similarity=-0.011 Sum_probs=19.4 Q ss_pred CCEECC-CCCCCHHHCCCCCCCCCCC Q ss_conf 952756-9895542422483768736 Q gi|254781033|r 392 PLWISD-DGYLSSVWLPLSPISKTLC 416 (492) Q Consensus 392 p~W~c~-~g~~~~~W~~~c~~~~~~~ 416 (492) +.=+|. |+++...=.-.||.||+-+ T Consensus 582 kitvCn~c~~idkR~L~~C~~Cgs~~ 607 (638) T TIGR02827 582 KITVCNECHHIDKRTLERCPECGSAN 607 (638) T ss_pred CEEEECCCCCCCCCCCCCCCCCCCCC T ss_conf 40232478764676300078876412 No 346 >PRK00448 polC DNA polymerase III PolC; Validated Probab=42.33 E-value=13 Score=14.93 Aligned_cols=29 Identities=17% Similarity=0.185 Sum_probs=16.3 Q ss_pred HHHHHHHCCCCHHHHHHHHHHH--HHHCCCC Q ss_conf 8654552158869999999988--8421269 Q gi|254781033|r 159 SLYFESCRIGDLNSAQRYATKA--LDISPDA 187 (492) Q Consensus 159 gL~~~al~~gd~~~A~~~~~~a--~~~~P~~ 187 (492) |....++..|+.++|.+.+... +++.|.. T Consensus 664 GEv~~~il~~~~eea~~~a~~YDyiEI~p~~ 694 (1436) T PRK00448 664 GEVFDAMLQKGDEEVEEIAKFYDFIEIQPPA 694 (1436) T ss_pred CHHHHHHHCCCHHHHHHHHHHCCCEEECCCH T ss_conf 7999999818999999999753713347833 No 347 >PRK10712 fructose-specific PTS system IIBC component; Provisional Probab=42.07 E-value=17 Score=14.03 Aligned_cols=20 Identities=10% Similarity=-0.006 Sum_probs=8.7 Q ss_pred CCHHH-HHHHHHHHHHHCCCC Q ss_conf 89589-999999999826898 Q gi|254781033|r 305 ENTVG-KLKRALRLEEINKES 324 (492) Q Consensus 305 ~~~~~-al~~~~~l~~~~P~~ 324 (492) +.+-. -...+..|++.-.+. T Consensus 369 g~Pia~l~~~L~~~L~~m~~~ 389 (563) T PRK10712 369 GKPVAGILEGLTHWLQTMGTA 389 (563) T ss_pred HHHHHHHHHHHHHHHHHCCCC T ss_conf 899999999999999856641 No 348 >COG0375 HybF Zn finger protein HypA/HybF (possibly regulating hydrogenase expression) [General function prediction only] Probab=42.02 E-value=12 Score=15.24 Aligned_cols=27 Identities=15% Similarity=0.106 Sum_probs=18.4 Q ss_pred CCCCEECC-CC--CCCHHHCCCCCCCCCCCC Q ss_conf 99952756-98--955424224837687364 Q gi|254781033|r 390 PDPLWISD-DG--YLSSVWLPLSPISKTLCY 417 (492) Q Consensus 390 ~dp~W~c~-~g--~~~~~W~~~c~~~~~~~~ 417 (492) |.-.| |. |+ ...+.|...||.||+.+. T Consensus 68 p~~~~-C~~C~~~~~~e~~~~~CP~C~s~~~ 97 (115) T COG0375 68 PAECW-CLDCGQEVELEELDYRCPKCGSINL 97 (115) T ss_pred CCEEE-ECCCCCEECCHHHEEECCCCCCCCE T ss_conf 44789-4239976021553108889999855 No 349 >pfam10255 Paf67 RNA polymerase I-associated factor PAF67. RNA polymerase I is a multisubunit enzyme and its transcription competence is dependent on the presence of PAF67. This family of proteins is conserved from worms to humans. Probab=41.85 E-value=18 Score=13.95 Aligned_cols=63 Identities=16% Similarity=0.133 Sum_probs=46.7 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHH--------HHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 5799986545521588699999999888--------42126989999999999952574569999999995 Q gi|254781033|r 154 EFAVYSLYFESCRIGDLNSAQRYATKAL--------DISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK 216 (492) Q Consensus 154 ~~~~~gL~~~al~~gd~~~A~~~~~~a~--------~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~ 216 (492) ..++.||.+..--.||+..|++.++..- +..+-+.....-++-.|...++|.+|++++...+. T Consensus 122 YFSlIgLlRlhvLLGDY~~Alk~L~~Idl~~~~l~~kV~~~~vs~~YyvGFaYlMlrRY~DAir~f~~iL~ 192 (402) T pfam10255 122 YFSIIGLLRVHVLLGDYYLALKVLEPIDLSKKGLYSKVPAAYVSLYYYVGFAYLMLRRYADAIRVFSQILI 192 (402) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCHHHCCCCHHHEEHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999998700568999997524144632010247404021245889999998658999999999999 No 350 >pfam07295 DUF1451 Protein of unknown function (DUF1451). This family consists of several hypothetical bacterial proteins of around 160 residues in length. Members of this family contain four highly conserved cysteine resides toward the C-terminal region of the protein. The function of this family is unknown. Probab=41.75 E-value=13 Score=15.03 Aligned_cols=22 Identities=14% Similarity=0.258 Sum_probs=14.1 Q ss_pred EECC-CCCC----CHHHCCCCCCCCCC Q ss_conf 2756-9895----54242248376873 Q gi|254781033|r 394 WISD-DGYL----SSVWLPLSPISKTL 415 (492) Q Consensus 394 W~c~-~g~~----~~~W~~~c~~~~~~ 415 (492) -+|. ||+. +..=-|-||+||.- T Consensus 115 LvC~~Cg~~~~~~~p~~ip~Cp~Cg~~ 141 (148) T pfam07295 115 LVCENCGHMLTFYHPSVIPPCPKCGHT 141 (148) T ss_pred EEECCCCCEEEEECCCCCCCCCCCCCC T ss_conf 772368987887468768898779998 No 351 >pfam08631 SPO22 Meiosis protein SPO22/ZIP4 like. SPO22/ZIP4 in yeast is a meiosis specific protein involved in sporulation. It has been shown to regulate crossover distribution by promoting synaptonemal complex formation. Probab=41.60 E-value=18 Score=13.92 Aligned_cols=20 Identities=15% Similarity=0.145 Sum_probs=12.9 Q ss_pred HHHCCHHHHHHHHHHHHCCC Q ss_conf 97210047879999863013 Q gi|254781033|r 94 SIAAHNIPLARKMHSYVSQQ 113 (492) Q Consensus 94 al~~Gd~~~A~k~~~~a~~~ 113 (492) |...||.+.|+.++.|+... T Consensus 3 Aw~qgd~~~A~~~~~k~~~l 22 (280) T pfam08631 3 AWRQGDLDLAEHLLSKAKSL 22 (280) T ss_pred CHHHCCHHHHHHHHHHHHHH T ss_conf 17755869999999987899 No 352 >pfam08194 DIM DIM protein. Drosophila immune-induced molecules (DIMs) are short proteins induced during the immune response of Drosophila. This family includes DIMs 1 to 4 that have masses below 5 kDa. Probab=41.28 E-value=18 Score=13.88 Aligned_cols=29 Identities=21% Similarity=0.360 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEE Q ss_conf 79999999999999999973489848999 Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYPEDVSIT 31 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~G~v~i~ 31 (492) |.+-+.+.+.+++..+.-.--+||.|+|. T Consensus 2 K~lsl~~~l~LLa~La~a~pl~pG~V~IN 30 (37) T pfam08194 2 KFLSLAFALGLLATLAFAVPLNPGNVIIN 30 (37) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCEEEEC T ss_conf 08999999999999997257999738986 No 353 >TIGR02956 TMAO_torS TMAO reductase sytem sensor TorS; InterPro: IPR014302 This protein, TorS, is part of a regulatory system for the torCAD operon that encodes the pterin molybdenum cofactor-containing enzyme trimethylamine-N-oxide (TMAO) reductase (TorA), a cognate chaperone (TorD), and a penta-haem cytochrome (TorC). TorS works together with the inducer-binding protein TorT and the response regulator TorR. TorS contains histidine kinase ATPase (IPR003594 from INTERPRO), HAMP (IPR003660 from INTERPRO), phosphoacceptor (IPR003661 from INTERPRO), and phosphotransfer (IPR008207 from INTERPRO) domains and a response regulator receiver domain (IPR001789 from INTERPRO).. Probab=41.18 E-value=18 Score=13.87 Aligned_cols=28 Identities=7% Similarity=0.031 Sum_probs=18.7 Q ss_pred HCCCCH-HHHHHHHHHHHHHCCCCHHHHH Q ss_conf 215886-9999999988842126989999 Q gi|254781033|r 165 CRIGDL-NSAQRYATKALDISPDAPWVTE 192 (492) Q Consensus 165 l~~gd~-~~A~~~~~~a~~~~P~~~~a~~ 192 (492) +..|+. +++.+.++...+.+=+..+=+. T Consensus 177 ~e~~~~l~q~~~aLD~lie~DLdl~e~l~ 205 (1052) T TIGR02956 177 IESGKNLEQVYQALDDLIEVDLDLAERLN 205 (1052) T ss_pred HHCCCHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 43356289999865679888666899999 No 354 >TIGR02738 TrbB type-F conjugative transfer system pilin assembly thiol-disulfide isomerase TrbB; InterPro: IPR014109 This entry includes thiol-disulphide isomerase (TrbB), a protein that is involved in conjugative transfer of plasmid DNA, specifically in the F-type system. TrbB has been predicted to contain a thioredoxin fold, it also contains a conserved pair of cysteines and has been shown to function as a thiol disulphide isomerase by complementation of Escherichia coli DsbA mutants . TtbB is believed to be involved in pilin assembly . TrbB is closely related to TraF (IPR014110 from INTERPRO) though somewhat longer lacking the cysteine motif and is apparently not functional as a disulphide bond isomerase.. Probab=41.10 E-value=9.3 Score=16.13 Aligned_cols=29 Identities=21% Similarity=0.541 Sum_probs=20.7 Q ss_pred CCCCEEC-CCCC--CCHHH------CCCCCCCCCCCCC Q ss_conf 9995275-6989--55424------2248376873640 Q gi|254781033|r 390 PDPLWIS-DDGY--LSSVW------LPLSPISKTLCYF 418 (492) Q Consensus 390 ~dp~W~c-~~g~--~~~~W------~~~c~~~~~~~~~ 418 (492) |-|.|.- .+|+ -.++| +..||.|++|+-. T Consensus 55 p~P~Wy~Ls~~r~~~~~~~~~vfF~~S~CPyCh~FAP~ 92 (176) T TIGR02738 55 PQPVWYRLSNGRKALLDDYALVFFYQSTCPYCHQFAPV 92 (176) T ss_pred CCCEEEEECHHHHHHHHCCEEEEEECCCCCCCCCCCHH T ss_conf 89500110336785444350788862889640022657 No 355 >COG4956 Integral membrane protein (PIN domain superfamily) [General function prediction only] Probab=40.87 E-value=18 Score=13.84 Aligned_cols=19 Identities=26% Similarity=0.443 Sum_probs=8.5 Q ss_pred CHHHHHHHHHHHHH-HHHHH Q ss_conf 96799999999999-99999 Q gi|254781033|r 1 MLRLIRYFFVISLV-ICSFI 19 (492) Q Consensus 1 M~r~l~~~~~i~~~-~~~~~ 19 (492) |.+.++.++++.+. ++|.. T Consensus 1 ml~~ii~l~~~i~g~~lG~~ 20 (356) T COG4956 1 MLKWIIILLFIIIGAVLGFA 20 (356) T ss_pred CHHHHHHHHHHHHHHHHHHH T ss_conf 98899999999997566674 No 356 >COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair] Probab=40.65 E-value=15 Score=14.57 Aligned_cols=19 Identities=26% Similarity=0.522 Sum_probs=9.1 Q ss_pred CCEECC-CCCCCHHHCCCCCCCCCC Q ss_conf 952756-989554242248376873 Q gi|254781033|r 392 PLWISD-DGYLSSVWLPLSPISKTL 415 (492) Q Consensus 392 p~W~c~-~g~~~~~W~~~c~~~~~~ 415 (492) |.=.|. |||+ .-||+|+.. T Consensus 434 ~~l~C~~Cg~v-----~~Cp~Cd~~ 453 (730) T COG1198 434 PLLLCRDCGYI-----AECPNCDSP 453 (730) T ss_pred CEEECCCCCCC-----CCCCCCCCC T ss_conf 40042568980-----248999951 No 357 >pfam09297 zf-NADH-PPase NADH pyrophosphatase zinc ribbon domain. This domain is found in between two duplicated NUDIX domains. It has a zinc ribbon structure. Probab=40.63 E-value=12 Score=15.31 Aligned_cols=21 Identities=14% Similarity=0.227 Sum_probs=15.0 Q ss_pred ECC-CCC----CCHHHCCCCCCCCCC Q ss_conf 756-989----554242248376873 Q gi|254781033|r 395 ISD-DGY----LSSVWLPLSPISKTL 415 (492) Q Consensus 395 ~c~-~g~----~~~~W~~~c~~~~~~ 415 (492) .|. ||. ....|..+||+||.. T Consensus 5 fC~~CG~~t~~~~~g~~r~C~~Cg~~ 30 (32) T pfam09297 5 FCGRCGAPTEPAEGGWARVCPSCGHE 30 (32) T ss_pred CCCCCCCCCCCCCCCCEEECCCCCCC T ss_conf 42446884750778517668876766 No 358 >COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only] Probab=40.61 E-value=9 Score=16.24 Aligned_cols=13 Identities=8% Similarity=0.077 Sum_probs=9.7 Q ss_pred HHCCCCCCCCCCC Q ss_conf 2422483768736 Q gi|254781033|r 404 VWLPLSPISKTLC 416 (492) Q Consensus 404 ~W~~~c~~~~~~~ 416 (492) +=-..||+||+|= T Consensus 219 d~iv~CP~CgRIL 231 (239) T COG1579 219 DEIVFCPYCGRIL 231 (239) T ss_pred CCCCCCCCCCHHH T ss_conf 9975287611367 No 359 >TIGR00100 hypA hydrogenase nickel insertion protein HypA; InterPro: IPR000688 Bacterial membrane-bound nickel-dependent hydrogenases requires a number of accessory proteins which are involved in their maturation. The exact role of these proteins is not yet clear, but some seem to be required for the incorporation of the nickel ions . One of these proteins is generally known as hypA. It is a protein of about 12 to 14 kDa that contains, in its C-terminal region, four conserved cysteines that form a zinc-finger like motif. Escherichia coli has two proteins that belong to this family, hypA and hybF. A homologue, MJ0214, has also been found in a number of archaeal species, including the genome of Methanococcus jannaschii. ; GO: 0016151 nickel ion binding, 0006464 protein modification process. Probab=40.53 E-value=13 Score=15.01 Aligned_cols=22 Identities=9% Similarity=-0.071 Sum_probs=16.1 Q ss_pred ECCCCCCCH-HHCC----CCCCCCCCC Q ss_conf 756989554-2422----483768736 Q gi|254781033|r 395 ISDDGYLSS-VWLP----LSPISKTLC 416 (492) Q Consensus 395 ~c~~g~~~~-~W~~----~c~~~~~~~ 416 (492) .|...+..+ +|.. .||.|++.. T Consensus 83 ~C~~~~~~~~~~~~~F~~~CP~C~g~~ 109 (128) T TIGR00100 83 DCSEEVEPEIDELDEFEVRCPKCHGIR 109 (128) T ss_pred ECCCCCEECCCCCCCEEEECCCCCCCE T ss_conf 088012001013213255658998951 No 360 >KOG2487 consensus Probab=39.64 E-value=8.2 Score=16.56 Aligned_cols=33 Identities=9% Similarity=0.125 Sum_probs=23.0 Q ss_pred CCEECCCC-CCCHHHCCCCCCCCCCCCCCCCCCCCC Q ss_conf 95275698-955424224837687364003678755 Q gi|254781033|r 392 PLWISDDG-YLSSVWLPLSPISKTLCYFEWKIPTKS 426 (492) Q Consensus 392 p~W~c~~g-~~~~~W~~~c~~~~~~~~~~W~~p~~~ 426 (492) -.|+|.-| .+.-...|+|+.|+. .|.-+.+|.- T Consensus 272 iG~VCSVCLSVfC~~~PiC~~C~s--~F~~t~~Pv~ 305 (314) T KOG2487 272 IGFVCSVCLSVFCRFVPICKTCKS--KFSFTKYPVK 305 (314) T ss_pred EEEEHHHHHHHHHCCCCCCCHHHH--HCCCCCCCCC T ss_conf 503448778776078876611343--2133457665 No 361 >COG4781 Membrane domain of membrane-anchored glycerophosphoryl diester phosphodiesterase [Energy production and conversion] Probab=39.61 E-value=19 Score=13.70 Aligned_cols=56 Identities=20% Similarity=0.224 Sum_probs=30.5 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999997348984899989976622189999999999999999999 Q gi|254781033|r 5 IRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFA 60 (492) Q Consensus 5 l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~ 60 (492) +..++........+..+.+..|.-.++.+++---.|-..++++++++.+++.++.. T Consensus 30 i~~~v~~p~l~~~f~fiL~VtG~~~is~nN~~~ils~p~~vl~Ll~~l~ll~~~vY 85 (340) T COG4781 30 IVEFVIVPFLSWLFSFILDVTGQPFISYNNVGEILSHPPLVLLLLALLFLLLLLVY 85 (340) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHCCCHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999986398612173799997485899999999999999999 No 362 >TIGR02159 PA_CoA_Oxy4 phenylacetate-CoA oxygenase, PaaJ subunit; InterPro: IPR011883 Phenylacetate-CoA oxygenase is comprised of a five gene complex responsible for the hydroxylation of phenylacetate-CoA (PA-CoA) as the second catabolic step in phenylacetic acid (PA) degradation , . Although the exact function of this enzyme has not been determined, it has been shown to be required for phenylacetic acid degradation and has been proposed to function in a multicomponent oxygenase acting on phenylacetate-CoA . . Probab=39.56 E-value=19 Score=13.69 Aligned_cols=41 Identities=7% Similarity=-0.297 Sum_probs=26.7 Q ss_pred HHHHHHHHHCCCCCCCEECC-C-CCCCHHHCCCCCCCCCCCCC Q ss_conf 99999999628899952756-9-89554242248376873640 Q gi|254781033|r 378 ILYWTQSALHAMPDPLWISD-D-GYLSSVWLPLSPISKTLCYF 418 (492) Q Consensus 378 ar~wl~rA~~a~~dp~W~c~-~-g~~~~~W~~~c~~~~~~~~~ 418 (492) +|+=|++--=|||.+.-.-. . --.+..|.--||-||++||= T Consensus 81 are~Lr~yGIAPP~~~~~~~~s~~l~~~~~~v~CPRCgS~~T~ 123 (152) T TIGR02159 81 AREKLREYGIAPPAGAEVVGVSLVLSLEPPSVQCPRCGSADTT 123 (152) T ss_pred HHHHHHHCCCCCCCCCCCCCCEEECCCCCCCCCCCCCCCCCEE T ss_conf 3688885088748766746310001246877788887752113 No 363 >pfam09538 FYDLN_acid Protein of unknown function (FYDLN_acid). Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown. Probab=39.50 E-value=14 Score=14.64 Aligned_cols=22 Identities=9% Similarity=-0.186 Sum_probs=13.6 Q ss_pred CEECC-CCCC---CHHHCCCCCCCCC Q ss_conf 52756-9895---5424224837687 Q gi|254781033|r 393 LWISD-DGYL---SSVWLPLSPISKT 414 (492) Q Consensus 393 ~W~c~-~g~~---~~~W~~~c~~~~~ 414 (492) ..+|. ||.. ...=-.+||.||. T Consensus 9 kr~c~~c~~~fydl~k~p~~cp~cg~ 34 (104) T pfam09538 9 KRTCPTCGKRFYDLNKDPIVCPKCGE 34 (104) T ss_pred CCCCCCCCCCCCCCCCCCCCCCCCCC T ss_conf 65478888821127989871899888 No 364 >PRK06251 V-type ATP synthase subunit K; Validated Probab=39.28 E-value=19 Score=13.66 Aligned_cols=71 Identities=15% Similarity=0.117 Sum_probs=45.9 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAM 71 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~ 71 (492) |.|.++.+.++.++.-..+-.+..|+.-.-.+.+-.|-..+.+.+..+..-+.+-..--.-+..+-..|.. T Consensus 1 ~~rtli~~lll~l~~~~~~aaaqa~~~~~~~~~~~~iGAgLavGLA~iGaG~AvG~~gaAaig~iaE~~e~ 71 (101) T PRK06251 1 MRRTLLLLLLLPLLVSSYVAAAQAPGDTSQGFAGINIGAGLAVGLAAIGAGVAVGMAAAAGIGVLTERRDM 71 (101) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCHHH T ss_conf 93024999999999422677732877542420033443348888888500132000027566343037465 No 365 >COG4232 Thiol:disulfide interchange protein [Posttranslational modification, protein turnover, chaperones / Energy production and conversion] Probab=39.21 E-value=19 Score=13.66 Aligned_cols=28 Identities=7% Similarity=-0.127 Sum_probs=11.8 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHCCC Q ss_conf 9999998268983689999999998799 Q gi|254781033|r 313 RALRLEEINKESVESLVIVSKIALEMGS 340 (492) Q Consensus 313 ~~~~l~~~~P~~~e~~~~LA~~~~~~g~ 340 (492) .+.+++..+++.+......|.-|+..+. T Consensus 464 ~L~~~la~~~~~pVmlDfyAdWCvtCK~ 491 (569) T COG4232 464 ELDQALAEAKAKPVMLDFYADWCVTCKE 491 (569) T ss_pred HHHHHHHHCCCCCEEEEEEHHHHHHHHH T ss_conf 9999998588996799622468787676 No 366 >COG3529 Predicted nucleic-acid-binding protein containing a Zn-ribbon domain [General function prediction only] Probab=39.13 E-value=8 Score=16.63 Aligned_cols=16 Identities=6% Similarity=-0.006 Sum_probs=13.3 Q ss_pred CCCCCCCCCCCCCCCC Q ss_conf 2483768736400367 Q gi|254781033|r 407 PLSPISKTLCYFEWKI 422 (492) Q Consensus 407 ~~c~~~~~~~~~~W~~ 422 (492) .+||.|...|+++|=+ T Consensus 11 A~CPaC~~~Dtl~mW~ 26 (66) T COG3529 11 AVCPACQAQDTLAMWR 26 (66) T ss_pred CCCCCCCHHHHHHHHH T ss_conf 7685321056899998 No 367 >TIGR02386 rpoC_TIGR DNA-directed RNA polymerase, beta' subunit; InterPro: IPR012754 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. This entry represents the beta-prime subunit, RpoC, found in most bacteria. It excludes some, mainly cyanobacterial, species where RpoC is replaced by two homologous proteins that include an additional domain. One arm of the "claw" is predominantly formed by this subunit, the other being predominantly formed by the beta subunit. The active site of the enzyme is defined by three invariant aspartate residues within the beta-prime subunit .; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=38.97 E-value=14 Score=14.74 Aligned_cols=25 Identities=16% Similarity=0.106 Sum_probs=9.7 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 99999987998899999887866099 Q gi|254781033|r 331 VSKIALEMGSIDQAHAKAMLAMKIAP 356 (492) Q Consensus 331 LA~~~~~~g~~~~Ar~~l~~al~~~P 356 (492) .|+-.+..++ ++==..|+..++.+| T Consensus 453 ~Ak~~ie~~d-peVWdvLE~VIK~hP 477 (1552) T TIGR02386 453 SAKKMIEQED-PEVWDVLEEVIKEHP 477 (1552) T ss_pred HHHHHHHHCC-CCHHHHHHHHHCCCC T ss_conf 9999987428-702368999852488 No 368 >PRK04023 DNA polymerase II large subunit; Validated Probab=38.84 E-value=20 Score=13.61 Aligned_cols=32 Identities=13% Similarity=0.000 Sum_probs=20.7 Q ss_pred CCEECC-CCCCCHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 952756-9895542422483768736400367875 Q gi|254781033|r 392 PLWISD-DGYLSSVWLPLSPISKTLCYFEWKIPTK 425 (492) Q Consensus 392 p~W~c~-~g~~~~~W~~~c~~~~~~~~~~W~~p~~ 425 (492) ..-.|. ||.. .|...|++||......|.-|.. T Consensus 632 g~R~Cp~Cg~e--T~~~~C~~CG~~T~~~~~c~~C 664 (1128) T PRK04023 632 GNRKCPSCGKE--TFYRRCPFCGTHTEPVYRCPRC 664 (1128) T ss_pred EEEECCCCCCC--CCCCCCCCCCCCCCCCCCCCCC T ss_conf 20288999983--5755787779966543247766 No 369 >PRK06393 rpoE DNA-directed RNA polymerase subunit E''; Validated Probab=38.48 E-value=16 Score=14.33 Aligned_cols=26 Identities=15% Similarity=0.074 Sum_probs=17.7 Q ss_pred ECC-CCCCCHHHCCCCCCCCCCC-CCCCCC Q ss_conf 756-9895542422483768736-400367 Q gi|254781033|r 395 ISD-DGYLSSVWLPLSPISKTLC-YFEWKI 422 (492) Q Consensus 395 ~c~-~g~~~~~W~~~c~~~~~~~-~~~W~~ 422 (492) .|. |.++.++ ..||+||+-+ |..|.- T Consensus 7 ACr~C~~i~~~--~~CP~Cgs~~~t~dW~G 34 (64) T PRK06393 7 ACKKCKRLTPE--KTCPVHGDEKTTTEWFG 34 (64) T ss_pred HHHHCCCCCCC--CCCCCCCCCCCCCCCCE T ss_conf 36547751788--87899989878877564 No 370 >KOG0706 consensus Probab=38.22 E-value=17 Score=14.13 Aligned_cols=11 Identities=36% Similarity=0.507 Sum_probs=4.6 Q ss_pred CCCEEEEEECC Q ss_conf 89848999899 Q gi|254781033|r 24 YPEDVSITWGN 34 (492) Q Consensus 24 ~~G~v~i~~~~ 34 (492) +|--.+|+||- T Consensus 33 nPtWaSVTYGI 43 (454) T KOG0706 33 NPTWASVTYGI 43 (454) T ss_pred CCCCEEECCEE T ss_conf 99836651107 No 371 >PRK07418 acetolactate synthase 3 catalytic subunit; Reviewed Probab=37.79 E-value=16 Score=14.28 Aligned_cols=25 Identities=20% Similarity=0.210 Sum_probs=12.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 4436410146887544567888786 Q gi|254781033|r 457 KNHLPSIKKVSSFEDSTIHPLDPHI 481 (492) Q Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (492) ....|.........++..-|+-|+- T Consensus 547 ~~~gP~lidV~vd~~e~~~p~~~~~ 571 (615) T PRK07418 547 AHDGPVLIDVHVRRDENCYPMVPPG 571 (615) T ss_pred HCCCCEEEEEEECCCCCCCCCCCCC T ss_conf 0899789999988866716987989 No 372 >PRK09687 hypothetical protein; Provisional Probab=37.64 E-value=20 Score=13.48 Aligned_cols=229 Identities=10% Similarity=-0.038 Sum_probs=107.8 Q ss_pred CCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCC--HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCH---HHH Q ss_conf 615899999875322221000168999999726885--057999865455215886999999998884212698---999 Q gi|254781033|r 117 HNEYLVYLLEVQIALAERQYNIAHEKLEMMLQIPAT--REFAVYSLYFESCRIGDLNSAQRYATKALDISPDAP---WVT 191 (492) Q Consensus 117 ~~~~L~~Ll~A~aA~~~gd~~~A~~~~~~~~~~~~~--~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~---~a~ 191 (492) +.++....-++.+-...|. +.+....-.+++++.. |..+...|..+...+...+.+...+.+++..+++.. .+. T Consensus 34 D~n~~VR~~AA~aLg~ig~-~~a~~~li~llkd~n~~VR~~Aa~ALGqlg~~~~~s~~ai~~L~~lll~D~~~~VR~sAa 112 (280) T PRK09687 34 DHNSLKRISSARVLQLRGG-QEAFRLAVELCSSKNYIERDLGAFILSQLGMCKRHEDQSFNILLNLLLNDKSANVRATAI 112 (280) T ss_pred CCCHHHHHHHHHHHHCCCC-HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 6988889999999880484-669999999873787889999999998448862002777999999987099778999999 Q ss_pred HHHHHHHHHCCC-CHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 999999995257-4569999999995023103578999999999999669989999999875431335279999999999 Q gi|254781033|r 192 EAVVQQYVLAKE-WSRAITFLNQKKKNAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSL 270 (492) Q Consensus 192 ~~L~~l~~~~gd-w~~A~~~l~~~~~~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l 270 (492) .+|+.......- +..+...+........ .+-+ .+.+..+ ..-..+.+...+.++++-.-..+.-+ ++..+ T Consensus 113 ~ALG~~~s~~~~~~~~~~~~l~~~~~d~d-~~VR-----~~AA~AL-g~i~se~AI~~Li~lL~D~d~~VR~~--Aa~AL 183 (280) T PRK09687 113 NATGHRCKKNPIYSPKIVEQSQLTAFDKS-VNVR-----RATAFAL-SVINNELAIPLLINLLKDPNGDVRNW--AAFGL 183 (280) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHHHCCC-HHHH-----HHHHHHH-HHCCCHHHHHHHHHHHHCCCHHHHHH--HHHHH T ss_conf 99874313440211347889899862665-2198-----9999999-70374768999999973798779999--99997 Q ss_pred HHCC-CHHHHHHHHHHHHHHCCCHHH--HHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 9739-947899999999983289899--9999985078958999999999982689836899999999987998899999 Q gi|254781033|r 271 ISQN-KKRKAEVILEKIWKVNPHPEI--ANIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAK 347 (492) Q Consensus 271 ~~~g-~~~~A~~~le~a~~~~p~p~L--a~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~ 347 (492) ...+ +...+...|-+.+. ..++++ -.+++...- .....+.-+-..+ .++..+...++++=+.|. .++... T Consensus 184 g~i~~~s~~~~~~Li~~L~-D~d~~VR~eAa~aLg~~-~~e~~i~~Li~~L----e~~~V~~~a~eAlg~igs-~~~ip~ 256 (280) T PRK09687 184 NINEYDSPDIRDAFVEMLS-DKNEEIRGEAIIGLAYR-KDKRVISVLIDEL----EKNTVYDLIIEAAGELGD-KRLLPV 256 (280) T ss_pred HCCCCCCHHHHHHHHHHHC-CCCHHHHHHHHHHHHCC-CCHHHHHHHHHHH----HCCCHHHHHHHHHHHCCC-HHHHHH T ss_conf 6144687668999999855-99777899999998534-7476799999998----475088999999986088-867899 Q ss_pred HHHHHHCCCCHHHHH Q ss_conf 887866099989999 Q gi|254781033|r 348 AMLAMKIAPRKEIFL 362 (492) Q Consensus 348 l~~al~~~P~~~~~~ 362 (492) |.+++...++..+.. T Consensus 257 L~~Ll~~~~d~~~~~ 271 (280) T PRK09687 257 LDTLLYKFDDNKIIT 271 (280) T ss_pred HHHHHHHCCCHHHHH T ss_conf 999998787266999 No 373 >TIGR02387 rpoC1_cyan DNA-directed RNA polymerase, gamma subunit; InterPro: IPR012755 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme . The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length . The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise: RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors. RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs. Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kD, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. The RNA polymerase gamma subunit, encoded by the rpoC1 gene, is found in cyanobacteria and corresponds to the N-terminal region the beta' subunit, encoded by rpoC, in other bacteria. The equivalent subunit in plastids and chloroplasts is designated beta', while the product of the rpoC2 gene is designated beta''.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006350 transcription. Probab=37.31 E-value=12 Score=15.23 Aligned_cols=15 Identities=20% Similarity=0.401 Sum_probs=7.2 Q ss_pred CCHHHHHHHHHHHHH Q ss_conf 998999999999998 Q gi|254781033|r 356 PRKEIFLLLAQIEQA 370 (492) Q Consensus 356 P~~~~~~llA~ie~~ 370 (492) .+.+++..+-++-.+ T Consensus 406 ~d~~~~~vl~evi~G 420 (625) T TIGR02387 406 ADDEIWSVLEEVITG 420 (625) T ss_pred CCHHHHHHHHHHHCC T ss_conf 436899999987468 No 374 >COG2093 DNA-directed RNA polymerase, subunit E'' [Transcription] Probab=36.29 E-value=15 Score=14.52 Aligned_cols=28 Identities=14% Similarity=0.109 Sum_probs=20.1 Q ss_pred ECCCC-CCCHHHCCCCCCCCCCC-CCCCCC Q ss_conf 75698-95542422483768736-400367 Q gi|254781033|r 395 ISDDG-YLSSVWLPLSPISKTLC-YFEWKI 422 (492) Q Consensus 395 ~c~~g-~~~~~W~~~c~~~~~~~-~~~W~~ 422 (492) .|.+| ++.++=.-+||+||+-+ |.+|.- T Consensus 6 AC~~Ck~l~~~d~e~CP~Cgs~~~te~W~G 35 (64) T COG2093 6 ACKNCKRLTPEDTEICPVCGSTDLTEEWFG 35 (64) T ss_pred HHHHCCCCCCCCCCCCCCCCCCCCCHHHCC T ss_conf 876346237887740878899400010162 No 375 >PRK00068 hypothetical protein; Validated Probab=36.25 E-value=21 Score=13.32 Aligned_cols=12 Identities=8% Similarity=-0.224 Sum_probs=6.1 Q ss_pred HCCHHHHHHHHH Q ss_conf 210047879999 Q gi|254781033|r 96 AAHNIPLARKMH 107 (492) Q Consensus 96 ~~Gd~~~A~k~~ 107 (492) +.|+|..-.... T Consensus 131 a~~~W~~~Llf~ 142 (978) T PRK00068 131 AQSYWYRIQLFL 142 (978) T ss_pred HHHHHHHHHHHH T ss_conf 998999999986 No 376 >pfam11682 DUF3279 Protein of unknown function (DUF3279). This family of proteins with unknown function appears to be restricted to Enterobacteriaceae. Probab=36.13 E-value=13 Score=15.09 Aligned_cols=39 Identities=10% Similarity=-0.073 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHCCCCCCCEECC-CCCCCHHHCCCCCCCCC Q ss_conf 89999999999628899952756-98955424224837687 Q gi|254781033|r 375 TDKILYWTQSALHAMPDPLWISD-DGYLSSVWLPLSPISKT 414 (492) Q Consensus 375 ~~~ar~wl~rA~~a~~dp~W~c~-~g~~~~~W~~~c~~~~~ 414 (492) ..+.+++..-+...++-..|-|- |++-. .=.-.||.||. T Consensus 79 ik~Lq~~v~~~~pv~~~~~W~CvmC~~~y-~G~K~C~~C~t 118 (128) T pfam11682 79 IKRLQQMVPDAYPVPRVASWHCVMCHHDY-YGEKYCPACGT 118 (128) T ss_pred HHHHHHHCCCCCCCCCCCCEEEEECCCCC-CCCEECCCCCC T ss_conf 99999853223456613506787338702-67880565799 No 377 >TIGR00627 tfb4 transcription factor tfb4; InterPro: IPR004600 Members of this family are part of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. The core-TFIIH basal transcription factor complex has six subunits, this is the p34 subunit.; GO: 0016251 general RNA polymerase II transcription factor activity, 0006281 DNA repair, 0006355 regulation of transcription DNA-dependent, 0000439 core TFIIH complex. Probab=35.91 E-value=14 Score=14.75 Aligned_cols=78 Identities=15% Similarity=0.147 Sum_probs=47.2 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEECCCC-CCCHHHCC- Q ss_conf 9999999879988999998878660999899999999999857998999999999962889995275698-95542422- Q gi|254781033|r 330 IVSKIALEMGSIDQAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWISDDG-YLSSVWLP- 407 (492) Q Consensus 330 ~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~c~~g-~~~~~W~~- 407 (492) .=+..|++......=.++|--.+=.||+.|..+. ..... +=|..++-.--.|-+ +=.|+|.=| .+--+..| T Consensus 213 ~TGG~YL~v~~~~~LL~yL~~~~l~D~~~R~~l~--~p~~~-~VDyRAsC~ch~~lv----~~GfvCSvCLSvlc~~~pG 285 (295) T TIGR00627 213 ITGGVYLKVEKPKGLLQYLMTAMLPDPDLRAVLV--KPNHA-SVDYRASCFCHKKLV----EIGFVCSVCLSVLCEYTPG 285 (295) T ss_pred HHCCEEEEECCCHHHHHHHHHHHCCCCCCCCCCC--CCCCC-CCCHHHHHHHCCCEE----EECCEEEEEHEECCCCCCC T ss_conf 6387457427874689999998578902112202--78278-833346566455545----4302513211000688888 Q ss_pred -CCCCCCC Q ss_conf -4837687 Q gi|254781033|r 408 -LSPISKT 414 (492) Q Consensus 408 -~c~~~~~ 414 (492) .|+.|+. T Consensus 286 n~C~~C~~ 293 (295) T TIGR00627 286 NICKTCKT 293 (295) T ss_pred CCCCCCCC T ss_conf 83686667 No 378 >PRK10350 hypothetical protein; Provisional Probab=35.81 E-value=22 Score=13.27 Aligned_cols=18 Identities=22% Similarity=0.198 Sum_probs=8.9 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSF 18 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~ 18 (492) |.|++++..++-+++++- T Consensus 1 MKrlLll~alLPfa~~AQ 18 (146) T PRK10350 1 MKRLLLLTALLPFVGFAQ 18 (146) T ss_pred CHHHHHHHHHHHHHHHHC T ss_conf 904799999867888616 No 379 >PRK03776 phosphoglycerol transferase I; Provisional Probab=35.66 E-value=22 Score=13.26 Aligned_cols=21 Identities=14% Similarity=0.477 Sum_probs=13.3 Q ss_pred CCCCCCEECCCCCCCHHHCCC Q ss_conf 889995275698955424224 Q gi|254781033|r 388 AMPDPLWISDDGYLSSVWLPL 408 (492) Q Consensus 388 a~~dp~W~c~~g~~~~~W~~~ 408 (492) +.....|+..|-.+-..|+|. T Consensus 544 ~~~~fvwvd~c~k~~~~w~p~ 564 (762) T PRK03776 544 PRDNFVWVDRCYKMAQLWAPE 564 (762) T ss_pred CCCCEEEEEHHHHHHHCCCHH T ss_conf 767068840111233305704 No 380 >PRK13861 type IV secretion system protein VirB9; Provisional Probab=35.47 E-value=22 Score=13.24 Aligned_cols=21 Identities=10% Similarity=0.048 Sum_probs=8.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999997 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIV 21 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l 21 (492) |||.+++.+++++.+....|. T Consensus 1 mmk~l~~~~~~~l~~~~~a~A 21 (293) T PRK13861 1 MIKKLFLTLACLLFAAIGALA 21 (293) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 908999999999986126678 No 381 >PRK10270 hypothetical protein; Provisional Probab=35.37 E-value=22 Score=13.22 Aligned_cols=14 Identities=14% Similarity=0.413 Sum_probs=7.1 Q ss_pred CHHHHHHHHHHHHH Q ss_conf 96799999999999 Q gi|254781033|r 1 MLRLIRYFFVISLV 14 (492) Q Consensus 1 M~r~l~~~~~i~~~ 14 (492) |.|+++++++++++ T Consensus 1 Mkk~~~~~~~l~~~ 14 (340) T PRK10270 1 MKKVLLIILLLLVV 14 (340) T ss_pred CCHHHHHHHHHHHH T ss_conf 90289999999999 No 382 >COG4499 Predicted membrane protein [Function unknown] Probab=35.12 E-value=22 Score=13.20 Aligned_cols=12 Identities=25% Similarity=0.208 Sum_probs=4.5 Q ss_pred HHHHCCCHHHHH Q ss_conf 999879988999 Q gi|254781033|r 334 IALEMGSIDQAH 345 (492) Q Consensus 334 ~~~~~g~~~~Ar 345 (492) +++..|+|++|. T Consensus 322 i~~GRGe~~eAi 333 (434) T COG4499 322 IYSGRGEFKEAI 333 (434) T ss_pred HHHCCCCHHHHH T ss_conf 984685178886 No 383 >PRK09109 motC flagellar motor protein; Reviewed Probab=35.10 E-value=22 Score=13.19 Aligned_cols=15 Identities=13% Similarity=0.084 Sum_probs=5.3 Q ss_pred HHHHHHHHHHHCCCC Q ss_conf 999999888421269 Q gi|254781033|r 173 AQRYATKALDISPDA 187 (492) Q Consensus 173 A~~~~~~a~~~~P~~ 187 (492) +.+.++.+-...|.. T Consensus 142 ~~~~~~~~g~~aPaf 156 (246) T PRK09109 142 AAKVFESMGGYAPTI 156 (246) T ss_pred HHHHHHHHHHHHHHH T ss_conf 778999999872676 No 384 >TIGR00354 polC DNA polymerase II, large subunit DP2; InterPro: IPR004475 This family represents the large subunit, DP2, of a two subunit novel archaebacterial replicative DNA polymerase first characterised for Pyrococcus furiosus. The structure of DP2 appears to be organised as a ~950 residue component separated from a ~300 residue component by a ~150 residue intein. The other subunit, DP1, has sequence similarity to the eukaryotic DNA polymerase delta small subunit.; GO: 0003677 DNA binding, 0003887 DNA-directed DNA polymerase activity, 0006260 DNA replication, 0006308 DNA catabolic process. Probab=35.06 E-value=17 Score=14.16 Aligned_cols=247 Identities=15% Similarity=0.091 Sum_probs=110.2 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHH----------HHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 552158869999999988842126989999999999952574569----------9999999950231035789999999 Q gi|254781033|r 163 ESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRA----------ITFLNQKKKNAKEWNRNRAILLIAR 232 (492) Q Consensus 163 ~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A----------~~~l~~~~~~~~~~~~~~a~l~~~~ 232 (492) ..|++||.-.- ..+++|-.+. +.......|++++..=||+=+= .+.+.+.+...+.++-.+- +.+.. T Consensus 395 VkLkNGDViri-~t~e~A~~vr-~eVeeILfLGd~LvnyGdFLeNNHpL~Pa~wceEWwiq~~~~~~kyPGddP-lN~~Y 471 (1173) T TIGR00354 395 VKLKNGDVIRI-DTLEKAKAVR-SEVEEILFLGDVLVNYGDFLENNHPLLPASWCEEWWIQELLKAGKYPGDDP-LNLEY 471 (1173) T ss_pred EEEECCCEEEE-CCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHCCCCCCCCC-CCCCC T ss_conf 58417887882-5768888887-789988876578876300432475557776635799999996878778885-33200 Q ss_pred HHHHHHCCCHHH--HHHHHHH-HHHCCCCCHHHH-------HHHHHHHHHCCCH------------------HHHHHHHH Q ss_conf 999996699899--9999987-543133527999-------9999999973994------------------78999999 Q gi|254781033|r 233 SLENADKGDMIA--SYHDAIE-SLKLCDNSIMAS-------ICAAKSLISQNKK------------------RKAEVILE 284 (492) Q Consensus 233 a~~~~~~~~~~~--A~~~~~~-a~~~~p~~~~a~-------~~~a~~l~~~g~~------------------~~A~~~le 284 (492) .+.....-...+ |.+...+ -..+.|++...+ +.+-|-+...|+. +++.++|| T Consensus 472 ~r~~~~~~~~eeGdA~k~~~E~~vPLHPryTY~wHDvsved~~~LRnwl~~Gk~~~lEGK~~W~~dll~~ed~~~KrILE 551 (1173) T TIGR00354 472 LRDDYKELEEEEGDALKLAEEFKVPLHPRYTYLWHDVSVEDLRLLRNWLSDGKEDKLEGKKVWKVDLLVEEDKKAKRILE 551 (1173) T ss_pred CHHHHCCCCCCHHHHHHHHHHCCCCCCCCCCEEEECCCHHHHHHHHHHHHCCCCCCCCCCEEEEEEHHHHHCCHHHHHHH T ss_conf 00020067702113553544257977886412441123889999999973388764455212332000000141344456 Q ss_pred HHHHH-----------CCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHC-CCCHHHHHHHH----------HHHHHCCCHH Q ss_conf 99983-----------28989999999850789589999999999826-89836899999----------9999879988 Q gi|254781033|r 285 KIWKV-----------NPHPEIANIYTHLLSENTVGKLKRALRLEEIN-KESVESLVIVS----------KIALEMGSID 342 (492) Q Consensus 285 ~a~~~-----------~p~p~La~a~~~~~~~~~~~al~~~~~l~~~~-P~~~e~~~~LA----------~~~~~~g~~~ 342 (492) -..-. ++.|-|.....+ ..+..+-++.+++.+... .|..+....++ ++=-+.|+-+ T Consensus 552 ~lg~~H~Vr~s~vvIe~y~Pllys~~~~--~~~kkdL~e~l~K~lE~sGkd~~e~~N~~sp~~vr~ka~~yiGaR~GRPE 629 (1173) T TIGR00354 552 LLGVEHKVRESKVVIEEYRPLLYSADID--VLGKKDLVEKLEKVLESSGKDSLELVNRVSPVKVRKKALSYIGARMGRPE 629 (1173) T ss_pred HHCCCCEEECCEEEEECCCHHHHCCCCC--CCCCHHHHHHCCCCCCCCCCHHHHHHHCCCCEEEECCCCEEECCCCCCCC T ss_conf 4357731205547851343002203421--01221035420100111231245664255861572466605111048877 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHH--HCCCCC------CCEECCCCCCCHHHCCCCCCCCC Q ss_conf 9999988786609998999999999998579989999999999--628899------95275698955424224837687 Q gi|254781033|r 343 QAHAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSA--LHAMPD------PLWISDDGYLSSVWLPLSPISKT 414 (492) Q Consensus 343 ~Ar~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA--~~a~~d------p~W~c~~g~~~~~W~~~c~~~~~ 414 (492) +|++ -+..|...++.= | -.-||+...+.....+. -+|..+ -.-.|.+|-...- ..+||.||. T Consensus 630 Kake-----RKM~Ppph~LFP---i-G~aGG~~R~I~~A~~~~~synakkG~i~Vei~~~~CP~Cgk~s~-~~~Cp~CG~ 699 (1173) T TIGR00354 630 KAKE-----RKMKPPPHVLFP---I-GKAGGSVRDIKDAVEKTKSYNAKKGEIEVEIAIAKCPSCGKESL-YRVCPVCGE 699 (1173) T ss_pred CCCC-----CCCCCCCCCCCC---C-CCCCCCCCCHHHHHHHCCCCCCCCCCEEEEECCCCCCCCCCCCE-EEECCCCCC T ss_conf 7888-----888432863326---5-55788724769998605750333661268750121887664000-014577885 Q ss_pred CCCCCCCCCC Q ss_conf 3640036787 Q gi|254781033|r 415 LCYFEWKIPT 424 (492) Q Consensus 415 ~~~~~W~~p~ 424 (492) ---+.---|. T Consensus 700 ~te~~~~gPs 709 (1173) T TIGR00354 700 KTELDEYGPS 709 (1173) T ss_pred EEEECCCCCC T ss_conf 4544577885 No 385 >TIGR02508 type_III_yscG type III secretion protein, YscG family; InterPro: IPR013348 YscG is a molecular chaperone for YscE, where both are part of the type III secretion system that in Yersinia is designated Ysc (Yersinia secretion). The secretion system delivers effector proteins, designated Yops (Yersinia outer proteins), in Yersinia. This entry consists of YscG from Yersinia, and functionally equivalent type III secretion proteins in other species: e.g. AscG in Aeromonas and LscG in Photorhabdus luminescens.; GO: 0009405 pathogenesis. Probab=34.81 E-value=22 Score=13.16 Aligned_cols=50 Identities=16% Similarity=0.037 Sum_probs=19.9 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 99999973994789999999998328989999999850789589999999 Q gi|254781033|r 266 AAKSLISQNKKRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRAL 315 (492) Q Consensus 266 ~a~~l~~~g~~~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~ 315 (492) ....|..+|+|.+|.++.++.-...||-+-..+..+.+.|....-.+|+. T Consensus 46 rlsSLmN~G~Y~~Al~lg~~~~tayPdLepwlALce~rlGl~~Al~~Rl~ 95 (118) T TIGR02508 46 RLSSLMNRGDYQEALQLGEELCTAYPDLEPWLALCEWRLGLLSALEERLL 95 (118) T ss_pred HHHHHCCHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 98631275579999972433688777877899999999879999999999 No 386 >PRK03598 hypothetical protein; Provisional Probab=34.79 E-value=22 Score=13.16 Aligned_cols=12 Identities=8% Similarity=0.058 Sum_probs=4.6 Q ss_pred CHHHHHHHHHHH Q ss_conf 967999999999 Q gi|254781033|r 1 MLRLIRYFFVIS 12 (492) Q Consensus 1 M~r~l~~~~~i~ 12 (492) |.|-++|+++++ T Consensus 1 mkk~~~~~~~~~ 12 (331) T PRK03598 1 MKKPVVIGLAVV 12 (331) T ss_pred CCCHHHHHHHHH T ss_conf 983389999999 No 387 >PRK08456 flagellar motor protein MotA; Validated Probab=34.74 E-value=22 Score=13.15 Aligned_cols=17 Identities=6% Similarity=0.251 Sum_probs=6.1 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254781033|r 4 LIRYFFVISLVICSFII 20 (492) Q Consensus 4 ~l~~~~~i~~~~~~~~~ 20 (492) ++-+++.+.++.+++.. T Consensus 6 iiGli~~~~~i~~g~~l 22 (257) T PRK08456 6 ILGMVLAVASISVGDIL 22 (257) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999998 No 388 >pfam11817 Foie-gras_1 Foie gras liver health family 1. Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first. THe C-terminus of this region contains TPR repeats. Probab=34.67 E-value=22 Score=13.14 Aligned_cols=23 Identities=13% Similarity=0.033 Sum_probs=12.1 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 99999998799889999988786 Q gi|254781033|r 330 IVSKIALEMGSIDQAHAKAMLAM 352 (492) Q Consensus 330 ~LA~~~~~~g~~~~Ar~~l~~al 352 (492) -+|+-++..|+|++|.+.++.+. T Consensus 190 ~mA~Ey~~~g~~~~Al~ll~~~~ 212 (254) T pfam11817 190 EMAEEYLRLGDWSKALELLDPVA 212 (254) T ss_pred HHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99999997689999999999999 No 389 >COG5415 Predicted integral membrane metal-binding protein [General function prediction only] Probab=34.36 E-value=23 Score=13.11 Aligned_cols=44 Identities=9% Similarity=0.001 Sum_probs=23.9 Q ss_pred CHHHHHHHHHHHHC--CCCC-------CCEECCCCCCC------HH---HCCCCCCCCCCCC Q ss_conf 98999999999962--8899-------95275698955------42---4224837687364 Q gi|254781033|r 374 NTDKILYWTQSALH--AMPD-------PLWISDDGYLS------SV---WLPLSPISKTLCY 417 (492) Q Consensus 374 d~~~ar~wl~rA~~--a~~d-------p~W~c~~g~~~------~~---W~~~c~~~~~~~~ 417 (492) +..-.++|++|.+. +.-+ ..-+|..|+-| .+ =..+||+|+..+- T Consensus 164 ~~~~~~~w~D~V~~vl~G~ne~~~~~~~alIC~~C~hhngl~~~~ek~~~efiC~~Cn~~n~ 225 (251) T COG5415 164 KKEDSDAWFDKVISVLAGGNELDLSPFKALICPQCHHHNGLYRLAEKPIIEFICPHCNHKND 225 (251) T ss_pred CCCCCHHHHHHHHHHHHCCCCCCCCCHHHHCCCCCCCCCCCCCCCCCCCHHEECCCCHHHCC T ss_conf 75551478998888872788566674265445502354665564434530016356421058 No 390 >smart00531 TFIIE Transcription initiation factor IIE. Probab=33.96 E-value=21 Score=13.35 Aligned_cols=25 Identities=4% Similarity=-0.209 Sum_probs=15.5 Q ss_pred CCCEECC-CCCCC--HH---H-----CCCCCCCCCC Q ss_conf 9952756-98955--42---4-----2248376873 Q gi|254781033|r 391 DPLWISD-DGYLS--SV---W-----LPLSPISKTL 415 (492) Q Consensus 391 dp~W~c~-~g~~~--~~---W-----~~~c~~~~~~ 415 (492) ....+|. ||... .+ . ...||.||.. T Consensus 97 ~~~Y~C~~C~~ryt~~eA~~l~~~~~~F~Cp~Cg~~ 132 (147) T smart00531 97 NAYYKCPNCQSKYTFLEANQLLDMDGTFTCPRCGEE 132 (147) T ss_pred CCEEECCCCCCEECHHHHHHHCCCCCCEECCCCCCE T ss_conf 987889999978629999884189983788999999 No 391 >PRK07591 threonine synthase; Validated Probab=33.43 E-value=22 Score=13.23 Aligned_cols=20 Identities=15% Similarity=-0.014 Sum_probs=7.9 Q ss_pred CCHHHHHHHHHHHHHCCCCC Q ss_conf 99899999998754313352 Q gi|254781033|r 240 GDMIASYHDAIESLKLCDNS 259 (492) Q Consensus 240 ~~~~~A~~~~~~a~~~~p~~ 259 (492) |...-+.+.+.+.-...|+. T Consensus 225 G~KTia~Ei~eqlg~~~PD~ 244 (422) T PRK07591 225 GSKTLGYEVAEQLGWRLPDQ 244 (422) T ss_pred HHHHHHHHHHHHCCCCCCCE T ss_conf 26689999999819999985 No 392 >PRK08565 DNA-directed RNA polymerase subunit beta; Provisional Probab=32.84 E-value=20 Score=13.49 Aligned_cols=22 Identities=0% Similarity=-0.032 Sum_probs=11.0 Q ss_pred CCCEEEEEECCEEEEEHHHHHH Q ss_conf 8984899989976622189999 Q gi|254781033|r 24 YPEDVSITWGNRLYRTSPFVIL 45 (492) Q Consensus 24 ~~G~v~i~~~~~~ie~s~~~~i 45 (492) ..|.+.+.+.....++++.+++ T Consensus 197 ~~g~i~~~i~~~~~~IPi~ill 218 (1101) T PRK08565 197 KDGTIYVSIPAVPGKIPFVILM 218 (1101) T ss_pred CCCEEEEEECCCCCEEEHHHHH T ss_conf 7987999987835476399999 No 393 >PRK12482 flagellar motor protein MotA; Provisional Probab=32.49 E-value=24 Score=12.89 Aligned_cols=30 Identities=17% Similarity=0.447 Sum_probs=18.6 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEEC Q ss_conf 967999999999999999997348984899989 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWG 33 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~ 33 (492) |..++-+++.+.++.+++.+-.++ +...|. T Consensus 1 M~~iiGliv~~~~v~gg~~l~GG~---l~~l~q 30 (287) T PRK12482 1 MQKLFGLLVIMGCVFGGYLMSGGS---LSSIWQ 30 (287) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCC---CHHHHC T ss_conf 920999999999999999973897---259868 No 394 >pfam09986 DUF2225 Uncharacterized protein conserved in bacteria (DUF2225). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=32.15 E-value=24 Score=12.85 Aligned_cols=27 Identities=7% Similarity=0.174 Sum_probs=17.6 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 899999999987998899999887866 Q gi|254781033|r 327 SLVIVSKIALEMGSIDQAHAKAMLAMK 353 (492) Q Consensus 327 ~~~~LA~~~~~~g~~~~Ar~~l~~al~ 353 (492) ..+++|++..+.|++++|.+.+.+++. T Consensus 167 l~YLigeL~rRlG~~~eA~~wfs~vi~ 193 (214) T pfam09986 167 LMYLIGELSRRLGNKEEALRWFSKVIG 193 (214) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHC T ss_conf 999999999994999999999999963 No 395 >PRK09966 hypothetical protein; Provisional Probab=31.89 E-value=25 Score=12.82 Aligned_cols=13 Identities=15% Similarity=0.082 Sum_probs=5.3 Q ss_pred HCCHHHHHHHHHH Q ss_conf 2100478799998 Q gi|254781033|r 96 AAHNIPLARKMHS 108 (492) Q Consensus 96 ~~Gd~~~A~k~~~ 108 (492) .=||-..|+..+. T Consensus 68 vF~D~~aA~e~L~ 80 (407) T PRK09966 68 VFADGPAATETLA 80 (407) T ss_pred CCCCHHHHHHHHH T ss_conf 1288999999999 No 396 >TIGR02745 ccoG_rdxA_fixG cytochrome c oxidase accessory protein CcoG; InterPro: IPR014116 Member of this ferredoxin-like protein family, include FixG, CcoG and RdxA,. They are found exclusively in species with an operon encoding the cbb3 type of cytochrome c oxidase (cco-cbb3), and near the cco-cbb3 operon in about half the cases. The cco-cbb3 is found in a variety of proteobacteria and almost nowhere else; it is associated with oxygen use under microaerobic conditions. Some (but not all) of these proteobacteria are also nitrogen-fixing, hence the gene symbol fixG. FixG was shown essential for functional cco-cbb3 expression in Bradyrhizobium japonicum.. Probab=31.55 E-value=25 Score=12.78 Aligned_cols=96 Identities=11% Similarity=0.141 Sum_probs=49.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCC------CEEEEEECCEEEEE--------HHHHHHHHHHHHHHHHHHHHHHHHHHH-- Q ss_conf 79999999999999999973489------84899989976622--------189999999999999999999999984-- Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYP------EDVSITWGNRLYRT--------SPFVILSILYFFLFAWILLFAVSRFFL-- 66 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~------G~v~i~~~~~~ie~--------s~~~~i~~~~~~~~~~~ll~~l~~~i~-- 66 (492) |..++ .+++++.++..|+.-+. --|++++...++.. -+.++..+++++++.+.++..+...+| T Consensus 5 R~~~~-~~~~~~~~~~Pwi~w~gG~~~P~QavLld~~~r~F~~Fg~~fwpqe~~Ll~~lLi~aalgLFfiTt~~GRvWCG 83 (474) T TIGR02745 5 RYVIL-AVLLLIFLILPWIRWNGGPGLPNQAVLLDLAHRRFYFFGITFWPQEFYLLAGLLIIAALGLFFITTLAGRVWCG 83 (474) T ss_pred HHHHH-HHHHHHHHHHCEEECCCCCCCCCCEEECCCCCCEEEEECCEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 88899-99999998825352287667776525402331665531403440368899999999999999999988777632 Q ss_pred -CCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf -18288999-9999999999999999999972100 Q gi|254781033|r 67 -SCPAMLFH-MLHKRNYDKGYKALYTGLMSIAAHN 99 (492) Q Consensus 67 -~~p~~~~~-~~~~rr~~k~~~al~~gl~al~~Gd 99 (492) .+|..+.+ .....=+++-...-.+.-+-+..++ T Consensus 84 y~CPQTVw~v~~f~~~E~~~eG~rn~r~~kl~~~p 118 (474) T TIGR02745 84 YACPQTVWTVDLFIVIERKIEGDRNKRSMKLDKAP 118 (474) T ss_pred CCCCCHHHHHHHHHHHHHHHHCHHHHHHCCCCCCC T ss_conf 57871788889999987775031223211102243 No 397 >KOG0276 consensus Probab=31.03 E-value=25 Score=12.72 Aligned_cols=46 Identities=17% Similarity=0.097 Sum_probs=19.6 Q ss_pred HHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 552158869999999988842126989999999999952574569999999 Q gi|254781033|r 163 ESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQ 213 (492) Q Consensus 163 ~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~ 213 (492) .+++.|+.+.|.+++.++ ++..-+..|+++....++..-|.+.+.+ T Consensus 646 lal~lgrl~iA~~la~e~-----~s~~Kw~~Lg~~al~~~~l~lA~EC~~~ 691 (794) T KOG0276 646 LALKLGRLDIAFDLAVEA-----NSEVKWRQLGDAALSAGELPLASECFLR 691 (794) T ss_pred HHHHCCCHHHHHHHHHHH-----CCHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 445527188899987752-----3667799999998641663458999975 No 398 >pfam05145 AmoA Putative ammonia monooxygenase. This family are annotated by COGS as putative ammonia monooxygenase enzymes. Probab=30.99 E-value=25 Score=12.71 Aligned_cols=13 Identities=8% Similarity=0.059 Sum_probs=6.0 Q ss_pred HHHHHHHCCHHHH Q ss_conf 9999972100478 Q gi|254781033|r 90 TGLMSIAAHNIPL 102 (492) Q Consensus 90 ~gl~al~~Gd~~~ 102 (492) .++.+..-|-..+ T Consensus 85 TA~~~~~PGgls~ 97 (318) T pfam05145 85 TAFFGSMPGGASA 97 (318) T ss_pred HHHHHHCCCHHHH T ss_conf 9999808645999 No 399 >KOG4606 consensus Probab=30.99 E-value=25 Score=12.71 Aligned_cols=33 Identities=12% Similarity=0.045 Sum_probs=21.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECC Q ss_conf 679999999999999999973489848999899 Q gi|254781033|r 2 LRLIRYFFVISLVICSFIIVSHYPEDVSITWGN 34 (492) Q Consensus 2 ~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~ 34 (492) .|+++.++-++..+.|+.+....|....+.+.. T Consensus 31 WRi~lvi~svc~gaigawywl~dpet~~vp~~~ 63 (126) T KOG4606 31 WRIALVIFSVCTGAIGAWYWLIDPETQKVPFFT 63 (126) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEEHHH T ss_conf 246799999999863255420273205641388 No 400 >TIGR01054 rgy reverse gyrase; InterPro: IPR005736 DNA topoisomerases regulate the number of topological links between two DNA strands (i.e. change the number of superhelical turns) by catalysing transient single- or double-strand breaks, crossing the strands through one another, then resealing the breaks. These enzymes have several functions: to remove DNA supercoils during transcription and DNA replication; for strand breakage during recombination; for chromosome condensation; and to disentangle intertwined DNA during mitosis , . DNA topoisomerases are divided into two classes: type I enzymes (5.99.1.2 from EC; topoisomerases I, III and V) break single-strand DNA, and type II enzymes (5.99.1.3 from EC; topoisomerases II, IV and VI) break double-strand DNA . Type I topoisomerases are ATP-independent enzymes (except for reverse gyrase), and can be subdivided according to their structure and reaction mechanisms: type IA (bacterial and archaeal topoisomerase I, topoisomerase III and reverse gyrase) and type IB (eukaryotic topoisomerase I and topoisomerase V). These enzymes are primarily responsible for relaxing positively and/or negatively supercoiled DNA, except for reverse gyrase, which can introduce positive supercoils into DNA. Reverse gyrase is a type IA topoisomerase that is unique among these enzymes in its requirement for ATP. Reverse gyrase is a hyperthermophile-specific enzyme that acts as a renaturase by positively supercoiling DNA, and by annealing complementary single-strand circles . Hyperthermophilic organisms must protect themselves against heat-induced degradation, and reverse gyrase acts to reduce the rate of double-strand DNA breakage, a function that does not require ATP hydrolysis and which is independent of its positive supercoiling abilities. Reverse gyrase achieves this by recognising nicked DNA and recruiting a protein coat to the site of damage . More information about this protein can be found at Protein of the Month: DNA Topoisomerase .; GO: 0003677 DNA binding, 0003916 DNA topoisomerase activity, 0006265 DNA topological change, 0006268 DNA unwinding during replication, 0005694 chromosome. Probab=30.85 E-value=19 Score=13.66 Aligned_cols=17 Identities=18% Similarity=0.260 Sum_probs=6.6 Q ss_pred HHHHHHHHHHCCCHHHH Q ss_conf 99999999987998899 Q gi|254781033|r 328 LVIVSKIALEMGSIDQA 344 (492) Q Consensus 328 ~~~LA~~~~~~g~~~~A 344 (492) ++.+..+.....+-+.. T Consensus 464 LLSl~~~~~~e~~i~~~ 480 (1843) T TIGR01054 464 LLSLLALILKEERIESL 480 (1843) T ss_pred HHHHHHHHHCCCCHHHH T ss_conf 99999985143203578 No 401 >KOG2581 consensus Probab=30.77 E-value=26 Score=12.69 Aligned_cols=25 Identities=20% Similarity=0.275 Sum_probs=15.6 Q ss_pred HHHHHHHHHCCCHHH-HHHHHHHHHC Q ss_conf 999999985799899-9999999962 Q gi|254781033|r 363 LLAQIEQANSHNTDK-ILYWTQSALH 387 (492) Q Consensus 363 llA~ie~~e~~d~~~-ar~wl~rA~~ 387 (492) .+.+|....+=|.+. +..-..||.+ T Consensus 379 Sl~DIA~kL~l~Seed~EyiVakAIR 404 (493) T KOG2581 379 SLQDIAKKLGLNSEEDAEYIVAKAIR 404 (493) T ss_pred CHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 59999999669982027999999987 No 402 >PRK09173 F0F1 ATP synthase subunit B; Validated Probab=30.59 E-value=26 Score=12.67 Aligned_cols=51 Identities=16% Similarity=0.296 Sum_probs=22.5 Q ss_pred EEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 2218999999999999999999999998418288999999999999999999999997210047879999863 Q gi|254781033|r 38 RTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAHNIPLARKMHSYV 110 (492) Q Consensus 38 e~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~~gl~al~~Gd~~~A~k~~~~a 110 (492) ..++|++|..+++++++.++. .|..+......|+.+ ..++...|++.-..+ T Consensus 2 d~~FW~~I~fvi~~~ll~~~~---------~p~~i~~~Ld~R~~~-------------I~~~l~~Ae~~~~eA 52 (159) T PRK09173 2 DATFWAFVGLVLFLALVVYLK---------VPGMIGRSLDERADR-------------IKNELDEARRLREEA 52 (159) T ss_pred CCHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHH-------------HHHHHHHHHHHHHHH T ss_conf 812999999999999999997---------789999999999999-------------999999999889999 No 403 >pfam00320 GATA GATA zinc finger. This domain uses four cysteine residues to coordinate a zinc ion. This domain binds to DNA. Two GATA zinc fingers are found in the GATA transcription factors. However there are several proteins which only contains a single copy of the domain. Probab=30.48 E-value=26 Score=12.65 Aligned_cols=18 Identities=11% Similarity=0.012 Sum_probs=13.9 Q ss_pred CCCCCCCCCCCCCCCCCC Q ss_conf 837687364003678755 Q gi|254781033|r 409 SPISKTLCYFEWKIPTKS 426 (492) Q Consensus 409 c~~~~~~~~~~W~~p~~~ 426 (492) |.+||.-.|-.|+.-|.. T Consensus 1 C~~C~tt~Tp~WR~~~~g 18 (36) T pfam00320 1 CSNCGTTKTPLWRRGPDG 18 (36) T ss_pred CCCCCCCCCCCCCCCCCC T ss_conf 969789799645669989 No 404 >pfam05568 ASFV_J13L African swine fever virus J13L protein. This family consists of several African swine fever virus J13L proteins. Probab=30.34 E-value=26 Score=12.64 Aligned_cols=14 Identities=14% Similarity=0.000 Sum_probs=5.4 Q ss_pred HHHHHHHHHHHHHH Q ss_conf 99999999999999 Q gi|254781033|r 80 NYDKGYKALYTGLM 93 (492) Q Consensus 80 r~~k~~~al~~gl~ 93 (492) |++|+-.++.+--+ T Consensus 55 rkkkaaaaieeedi 68 (189) T pfam05568 55 RKKKAAAAIEEEDI 68 (189) T ss_pred CHHHHHHHHHHHCC T ss_conf 03676532004203 No 405 >PRK03681 hypA hydrogenase nickel incorporation protein; Validated Probab=30.14 E-value=26 Score=12.61 Aligned_cols=25 Identities=4% Similarity=-0.125 Sum_probs=14.6 Q ss_pred CEECC-CCCC-C--HHHCCCCCCCCCCCC Q ss_conf 52756-9895-5--424224837687364 Q gi|254781033|r 393 LWISD-DGYL-S--SVWLPLSPISKTLCY 417 (492) Q Consensus 393 ~W~c~-~g~~-~--~~W~~~c~~~~~~~~ 417 (492) .=.|. ||.. . ...-..||.||+.+. T Consensus 70 ~~~C~~C~~~~~~~~~~~~~CP~Cgs~~~ 98 (114) T PRK03681 70 ECWCETCQQYVTLLTQRVRRCPQCHGDML 98 (114) T ss_pred EEECCCCCCEEEECCCCCCCCCCCCCCCC T ss_conf 89965599855406776773908839975 No 406 >COG1096 Predicted RNA-binding protein (consists of S1 domain and a Zn-ribbon domain) [Translation, ribosomal structure and biogenesis] Probab=30.13 E-value=22 Score=13.14 Aligned_cols=14 Identities=7% Similarity=-0.121 Sum_probs=6.7 Q ss_pred CHHHCCCCCCCCCC Q ss_conf 54242248376873 Q gi|254781033|r 402 SSVWLPLSPISKTL 415 (492) Q Consensus 402 ~~~W~~~c~~~~~~ 415 (492) ...-...||+||.- T Consensus 161 ~~~~~l~Cp~Cg~t 174 (188) T COG1096 161 KKGNMLKCPNCGNT 174 (188) T ss_pred ECCCEEECCCCCCE T ss_conf 84759988877987 No 407 >pfam09946 DUF2178 Predicted membrane protein (DUF2178). This domain, found in various hypothetical archaeal proteins, has no known function. Probab=30.06 E-value=26 Score=12.60 Aligned_cols=31 Identities=13% Similarity=0.056 Sum_probs=15.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 8999999999999999999999998418288 Q gi|254781033|r 41 PFVILSILYFFLFAWILLFAVSRFFLSCPAM 71 (492) Q Consensus 41 ~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~ 71 (492) +.++++++++..++.+++.+=+..+..+.+. T Consensus 27 ~~~a~~~v~~gi~~~~~~~~rv~~vvEDER~ 57 (111) T pfam09946 27 PLVAVLAVIAGILAIYFLKSRVDGVVEDERI 57 (111) T ss_pred EEHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 2058999999999999999998888768999 No 408 >COG3183 Predicted restriction endonuclease [Defense mechanisms] Probab=29.66 E-value=12 Score=15.15 Aligned_cols=13 Identities=23% Similarity=0.432 Sum_probs=9.3 Q ss_pred HHHCCCCCCCCCC Q ss_conf 4242248376873 Q gi|254781033|r 403 SVWLPLSPISKTL 415 (492) Q Consensus 403 ~~W~~~c~~~~~~ 415 (492) .+-.|+||||++. T Consensus 238 tDL~plCpNCH~m 250 (272) T COG3183 238 TDLAPLCPNCHKM 250 (272) T ss_pred HHHHHCCCCHHHH T ss_conf 4020028548899 No 409 >COG3170 FimV Tfp pilus assembly protein FimV [Cell motility and secretion / Intracellular trafficking and secretion] Probab=29.48 E-value=27 Score=12.53 Aligned_cols=31 Identities=16% Similarity=0.074 Sum_probs=19.1 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCHHH Q ss_conf 9999999879988999998878660999899 Q gi|254781033|r 330 IVSKIALEMGSIDQAHAKAMLAMKIAPRKEI 360 (492) Q Consensus 330 ~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~~ 360 (492) .-+-.++.-|.+..+.+.+...++.+|..-. T Consensus 503 ~~ad~yiaygr~~qa~E~lre~~~~~per~~ 533 (755) T COG3170 503 AEADIYIAYGRDVQAEELLRETLEQHPERLD 533 (755) T ss_pred CCCCHHHHHCCCCCCHHHHHHHHHHCCHHHH T ss_conf 6763021201223538999999873862566 No 410 >pfam01616 Orbi_NS3 Orbivirus NS3. The function of this Orbivirus non structural protein is uncertain. However it may play a role on release of the virus from infected cells. Probab=29.22 E-value=27 Score=12.50 Aligned_cols=11 Identities=36% Similarity=0.096 Sum_probs=3.7 Q ss_pred HHHHHHHHHHH Q ss_conf 99999988842 Q gi|254781033|r 173 AQRYATKALDI 183 (492) Q Consensus 173 A~~~~~~a~~~ 183 (492) |...+.+++.- T Consensus 49 Af~a~aEAlrD 59 (194) T pfam01616 49 AFGAAAEALRD 59 (194) T ss_pred HHHHHHHHHCC T ss_conf 99999999637 No 411 >COG1862 YajC Preprotein translocase subunit YajC [Intracellular trafficking and secretion] Probab=29.18 E-value=27 Score=12.49 Aligned_cols=17 Identities=18% Similarity=0.307 Sum_probs=7.2 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 89999999999999999 Q gi|254781033|r 41 PFVILSILYFFLFAWIL 57 (492) Q Consensus 41 ~~~~i~~~~~~~~~~~l 57 (492) ....++.++++++++|+ T Consensus 8 ~~~~ll~~vl~~~ifyF 24 (97) T COG1862 8 GLVLLLPLVLIFAIFYF 24 (97) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 07799999999999998 No 412 >COG2256 MGS1 ATPase related to the helicase subunit of the Holliday junction resolvase [DNA replication, recombination, and repair] Probab=28.96 E-value=27 Score=12.47 Aligned_cols=41 Identities=12% Similarity=0.053 Sum_probs=16.1 Q ss_pred HHHHHHHHHHCCCCCHHHHHH-HHHHHHHCCC-H-HHHHHHHHHHHH Q ss_conf 999998754313352799999-9999997399-4-789999999998 Q gi|254781033|r 245 SYHDAIESLKLCDNSIMASIC-AAKSLISQNK-K-RKAEVILEKIWK 288 (492) Q Consensus 245 A~~~~~~a~~~~p~~~~a~~~-~a~~l~~~g~-~-~~A~~~le~a~~ 288 (492) ....+.+.++-+ .+.+++- +++ ++..|+ . --|.+++.-+++ T Consensus 252 liSA~hKSvRGS--D~dAALyylaR-mi~~GeDp~yiARRlv~~AsE 295 (436) T COG2256 252 LISALHKSVRGS--DPDAALYYLAR-MIEAGEDPLYIARRLVRIASE 295 (436) T ss_pred HHHHHHHHHCCC--CCCHHHHHHHH-HHHCCCCHHHHHHHHHHHHHH T ss_conf 999999861268--83789999999-996688789999999999874 No 413 >PRK12665 putative monovalent cation/H+ antiporter subunit D; Reviewed Probab=28.94 E-value=27 Score=12.46 Aligned_cols=22 Identities=5% Similarity=0.107 Sum_probs=14.5 Q ss_pred HHHHHHHCCCCEEEEEECCEEE Q ss_conf 9999973489848999899766 Q gi|254781033|r 16 CSFIIVSHYPEDVSITWGNRLY 37 (492) Q Consensus 16 ~~~~~l~~~~G~v~i~~~~~~i 37 (492) +...+..++.|...+..++|.- T Consensus 45 ~~ll~~~~~~g~~~~~~g~w~~ 66 (524) T PRK12665 45 AVLLYLVDRDGTLAVQVGGWAA 66 (524) T ss_pred HHHHHHHHCCCCEEEEEECCCC T ss_conf 9999998279847998754777 No 414 >TIGR02302 aProt_lowcomp conserved hypothetical protein TIGR02302; InterPro: IPR012683 Members of this family are bacterial proteins from the alphaproteobacteria. Each has 2-3 predicted transmembrane helices near the N-terminus and a long C-terminal region that includes stretches of Gln/Gly-rich low complexity sequence, predicted to be outside the membrane. Bradyrhizobium japonicum contains two tandem copies that are, together, homologous to the single members found in other species. . Probab=28.91 E-value=27 Score=12.46 Aligned_cols=20 Identities=20% Similarity=0.336 Sum_probs=11.2 Q ss_pred HCCCCHHHHHHHHHHHHHHC Q ss_conf 21588699999999888421 Q gi|254781033|r 165 CRIGDLNSAQRYATKALDIS 184 (492) Q Consensus 165 l~~gd~~~A~~~~~~a~~~~ 184 (492) -++|+.++|.+++.++..+= T Consensus 611 aRsG~rdqA~QlLsqLq~mm 630 (899) T TIGR02302 611 ARSGDRDQAKQLLSQLQRMM 630 (899) T ss_pred CCCCCHHHHHHHHHHHHHHH T ss_conf 06788789999999999999 No 415 >TIGR03525 GldK gliding motility-associated lipoprotein GldK. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldK is a lipoprotein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae. Knockouts of GldK abolish the gliding phenotype. GldK is homologous to GldJ. There is a GldK homolog in Cytophaga hutchinsonii and several other species that has a different, shorter architecture and is represented by a separate model. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility. Probab=28.06 E-value=28 Score=12.35 Aligned_cols=17 Identities=24% Similarity=0.542 Sum_probs=10.3 Q ss_pred CHHHHHHHHHHHHH-HHH Q ss_conf 96799999999999-999 Q gi|254781033|r 1 MLRLIRYFFVISLV-ICS 17 (492) Q Consensus 1 M~r~l~~~~~i~~~-~~~ 17 (492) |.|+++|+++++++ .|+ T Consensus 1 Mkk~~~~~~~~~~~~SCg 18 (449) T TIGR03525 1 MKKYLVFAALVVLVYSCG 18 (449) T ss_pred CCHHHHHHHHHHHHHHCC T ss_conf 912489999999872105 No 416 >PRK08476 F0F1 ATP synthase subunit B'; Validated Probab=28.00 E-value=28 Score=12.35 Aligned_cols=26 Identities=19% Similarity=0.418 Sum_probs=15.8 Q ss_pred EEEHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 62218999999999999999999999 Q gi|254781033|r 37 YRTSPFVILSILYFFLFAWILLFAVS 62 (492) Q Consensus 37 ie~s~~~~i~~~~~~~~~~~ll~~l~ 62 (492) ++.+++.++..++.|+++++++..++ T Consensus 3 l~~n~~~~i~~~i~FliL~~iL~kfl 28 (141) T PRK08476 3 LDINPYLMLLTFVVFLLLIVILNSWL 28 (141) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 57767799999999999999999995 No 417 >COG4647 AcxC Acetone carboxylase, gamma subunit [Secondary metabolites biosynthesis, transport, and catabolism] Probab=27.88 E-value=18 Score=13.93 Aligned_cols=25 Identities=28% Similarity=0.472 Sum_probs=12.9 Q ss_pred CCCCCCEECCCCCCCHHHCCCCCCCCCCCCCC Q ss_conf 88999527569895542422483768736400 Q gi|254781033|r 388 AMPDPLWISDDGYLSSVWLPLSPISKTLCYFE 419 (492) Q Consensus 388 a~~dp~W~c~~g~~~~~W~~~c~~~~~~~~~~ 419 (492) -.|||+|.-- =..+||.||-+--.+ T Consensus 109 ~ipdp~wme~-------reficpecg~l~eve 133 (165) T COG4647 109 DIPDPQWMEI-------REFICPECGILHEVE 133 (165) T ss_pred CCCCCCHHHH-------HHHHCCCCCCEEEEC T ss_conf 7999316788-------776186642302321 No 418 >COG1439 Predicted nucleic acid-binding protein, consists of a PIN domain and a Zn-ribbon module [General function prediction only] Probab=27.81 E-value=26 Score=12.66 Aligned_cols=24 Identities=8% Similarity=-0.182 Sum_probs=15.8 Q ss_pred CCEECC-CCCCCHHHCCCCCCCCCC Q ss_conf 952756-989554242248376873 Q gi|254781033|r 392 PLWISD-DGYLSSVWLPLSPISKTL 415 (492) Q Consensus 392 p~W~c~-~g~~~~~W~~~c~~~~~~ 415 (492) ..-+|. |+.+.+.=.-+||.||.- T Consensus 138 w~~rC~GC~~~f~~~~~~Cp~CG~~ 162 (177) T COG1439 138 WRLRCHGCKRIFPEPKDFCPICGSP 162 (177) T ss_pred EEEEEECCCEECCCCCCCCCCCCCC T ss_conf 5689845752508988807789991 No 419 >PRK05457 heat shock protein HtpX; Provisional Probab=27.76 E-value=29 Score=12.32 Aligned_cols=21 Identities=24% Similarity=0.358 Sum_probs=13.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999997 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIV 21 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l 21 (492) |+|+++|++.-.++.+...++ T Consensus 1 ~~r~~lf~~~~~~v~~~~~~~ 21 (289) T PRK05457 1 MKRIFLFLLTNLAVMLVLSIV 21 (289) T ss_pred CHHHHHHHHHHHHHHHHHHHH T ss_conf 965999999999999999999 No 420 >PRK10478 putative fructose-like permease EIIC subunit 2; Provisional Probab=27.70 E-value=29 Score=12.31 Aligned_cols=13 Identities=15% Similarity=0.281 Sum_probs=4.5 Q ss_pred HHHHHHHCCCHHH Q ss_conf 9999998799889 Q gi|254781033|r 331 VSKIALEMGSIDQ 343 (492) Q Consensus 331 LA~~~~~~g~~~~ 343 (492) ++-..+-.|-.+- T Consensus 255 mG~~~ITEGAIPF 267 (359) T PRK10478 255 MGCVGVTEGAIPF 267 (359) T ss_pred HHHHHHCCCCHHH T ss_conf 8998816451066 No 421 >TIGR02661 MauD methylamine dehydrogenase accessory protein MauD; InterPro: IPR013478 MauD appears critical to proper formation of the small subunit of methylamine dehydrogenase, which has both an unusual tryptophan tryptophylquinone cofactor and multiple disulphide bonds. MauD shares sequence similarity, including a CPxC motif, with a number of thiol:disulphide interchange proteins. In MauD mutants, the small subunit apparently does not form properly and is rapidly degraded .; GO: 0030416 methylamine metabolic process. Probab=27.64 E-value=19 Score=13.75 Aligned_cols=12 Identities=25% Similarity=0.412 Sum_probs=7.0 Q ss_pred HCCCCCCCCCCC Q ss_conf 422483768736 Q gi|254781033|r 405 WLPLSPISKTLC 416 (492) Q Consensus 405 W~~~c~~~~~~~ 416 (492) =.|.||.|.++= T Consensus 82 TaPsCPvC~KL~ 93 (189) T TIGR02661 82 TAPSCPVCDKLL 93 (189) T ss_pred CCCCCCHHHHHH T ss_conf 388885322432 No 422 >pfam02058 Guanylin Guanylin precursor. Probab=27.63 E-value=29 Score=12.30 Aligned_cols=27 Identities=7% Similarity=0.039 Sum_probs=19.2 Q ss_pred HHHHHHHHHCCCCEEEEEECCEEEEEH Q ss_conf 999999973489848999899766221 Q gi|254781033|r 14 VICSFIIVSHYPEDVSITWGNRLYRTS 40 (492) Q Consensus 14 ~~~~~~~l~~~~G~v~i~~~~~~ie~s 40 (492) +.+...|+....|.|.|..+|+.+..- T Consensus 9 l~ll~~~~~~~~~aV~Vqdg~fsf~LE 35 (109) T pfam02058 9 LALLLALALQGTQAVYVQDGGFSFSLE 35 (109) T ss_pred HHHHHHHHHHHCCCEEEEECCEEEEHH T ss_conf 999999998206817998899986289 No 423 >pfam06674 DUF1176 Protein of unknown function (DUF1176). This family consists of several hypothetical bacterial proteins of around 340 residues in length. Members of this family contain six highly conserved cysteine residues. The function of this family is unknown. Probab=27.60 E-value=19 Score=13.69 Aligned_cols=18 Identities=6% Similarity=-0.116 Sum_probs=7.2 Q ss_pred HCCCCCCCCCCCC--CCCCCC Q ss_conf 4224837687364--003678 Q gi|254781033|r 405 WLPLSPISKTLCY--FEWKIP 423 (492) Q Consensus 405 W~~~c~~~~~~~~--~~W~~p 423 (492) |.-.| -+|.+|+ ..|..+ T Consensus 231 v~~~C-~~gAYN~~~~~w~v~ 250 (338) T pfam06674 231 VSIPC-WRGAYNTGYGAWLVD 250 (338) T ss_pred EEEEC-CCCCCCCCCEEEEEC T ss_conf 87663-665145775599968 No 424 >KOG3251 consensus Probab=27.56 E-value=29 Score=12.29 Aligned_cols=12 Identities=8% Similarity=0.224 Sum_probs=4.0 Q ss_pred CCHHHHHHHHHH Q ss_conf 994789999999 Q gi|254781033|r 274 NKKRKAEVILEK 285 (492) Q Consensus 274 g~~~~A~~~le~ 285 (492) |-...-.++|++ T Consensus 174 GlSn~ti~lIeR 185 (213) T KOG3251 174 GLSNQTIRLIER 185 (213) T ss_pred CCCHHHHHHHHH T ss_conf 870989999999 No 425 >TIGR01031 rpmF_bact ribosomal protein L32; InterPro: IPR002677 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites , . About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits. Many of ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome , . Ribosomal protein L32p is part of the 50S ribosomal subunit. This family is found in both prokaryotes and eukaryotes. Ribosomal protein L32 of yeast binds to and regulates the splicing and the translation of the transcript of its own gene }.; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0015934 large ribosomal subunit. Probab=27.17 E-value=24 Score=12.98 Aligned_cols=21 Identities=14% Similarity=0.083 Sum_probs=12.0 Q ss_pred ECC-CC-CCCHHHCCCCCCCCCCCC Q ss_conf 756-98-955424224837687364 Q gi|254781033|r 395 ISD-DG-YLSSVWLPLSPISKTLCY 417 (492) Q Consensus 395 ~c~-~g-~~~~~W~~~c~~~~~~~~ 417 (492) .|. || +... --+|+.||..+. T Consensus 29 ~C~~CGe~~~p--HrvC~~CG~Y~g 51 (56) T TIGR01031 29 VCPNCGEYKLP--HRVCPSCGYYKG 51 (56) T ss_pred CCCCCCCCCCC--EEECCCCCCCCC T ss_conf 26778885355--020288886475 No 426 >PRK10833 putative assembly protein; Provisional Probab=27.15 E-value=29 Score=12.24 Aligned_cols=12 Identities=17% Similarity=0.163 Sum_probs=6.1 Q ss_pred CHHHHHHHHHHH Q ss_conf 828899999999 Q gi|254781033|r 68 CPAMLFHMLHKR 79 (492) Q Consensus 68 ~p~~~~~~~~~r 79 (492) .|..++.+-... T Consensus 27 dPNd~k~~I~~q 38 (617) T PRK10833 27 NPNDFRAYMVKQ 38 (617) T ss_pred CHHHHHHHHHHH T ss_conf 889979999999 No 427 >KOG1482 consensus Probab=27.15 E-value=29 Score=12.24 Aligned_cols=33 Identities=9% Similarity=0.049 Sum_probs=24.4 Q ss_pred HHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHH Q ss_conf 999999999973489848999899766221899 Q gi|254781033|r 11 ISLVICSFIIVSHYPEDVSITWGNRLYRTSPFV 43 (492) Q Consensus 11 i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~ 43 (492) ..+..+.+.|+...|-+=.++||..++|+--.+ T Consensus 114 sf~isl~slw~s~~pa~~r~sfG~~R~Evlgal 146 (379) T KOG1482 114 SFIISLFSLWLSSRPATKRMSFGFHRAEVLGAL 146 (379) T ss_pred HHHHHHHHHHHCCCCCCCCEECCEEHHHHHHHH T ss_conf 999999899870379987231143447777789 No 428 >PRK10633 hypothetical protein; Provisional Probab=26.96 E-value=29 Score=12.22 Aligned_cols=34 Identities=15% Similarity=0.210 Sum_probs=26.9 Q ss_pred EEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 7662218999999999999999999999998418 Q gi|254781033|r 35 RLYRTSPFVILSILYFFLFAWILLFAVSRFFLSC 68 (492) Q Consensus 35 ~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~ 68 (492) +..-++.++....++.-++...++|..++++++. T Consensus 38 ~~~GlP~WF~~SCi~~p~lf~~l~~~mVk~~Fkd 71 (80) T PRK10633 38 GFTGLPRWFEMACLLTPLLFILLCWAMVKFIFRD 71 (80) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 5567709999999999999999999999998607 No 429 >TIGR00328 flhB flagellar biosynthetic protein FlhB; InterPro: IPR006136 The flhB and flhA genes constitute an operon called flhB operon on the Salmonella typhimurium chromosome. They, together with FliI and FliH, may constitute the export apparatus of flagellin, the component protein of flagellar filament. The flhB gene encodes a highly hydrophobic polypeptide with several potential membrane-spanning segments, suggesting that it may be an integral membrane protein .; GO: 0009306 protein secretion, 0016020 membrane. Probab=26.88 E-value=30 Score=12.21 Aligned_cols=32 Identities=22% Similarity=0.204 Sum_probs=19.3 Q ss_pred HCCCHHHHHHHHHHHHHHC----CCHHHHHH-HHHHC Q ss_conf 7399478999999999832----89899999-99850 Q gi|254781033|r 272 SQNKKRKAEVILEKIWKVN----PHPEIANI-YTHLL 303 (492) Q Consensus 272 ~~g~~~~A~~~le~a~~~~----p~p~La~a-~~~~~ 303 (492) ..|...-|.++-+-|-+.+ .+|.||.+ |.+.+ T Consensus 289 AKG~D~~AlkI~eiA~en~v~~~e~PpLARaLY~~ve 325 (352) T TIGR00328 289 AKGVDELALKIKEIAEENNVPIVENPPLARALYRQVE 325 (352) T ss_pred ECCCCHHHHHHHHHHHHCCCCEECCCHHHHHHHHHCC T ss_conf 7788568899999998869114408735875211054 No 430 >pfam00641 zf-RanBP Zn-finger in Ran binding protein and others. Probab=26.84 E-value=25 Score=12.82 Aligned_cols=24 Identities=4% Similarity=-0.180 Sum_probs=18.8 Q ss_pred CCEECCC-CCCCHHHCCCCCCCCCC Q ss_conf 9527569-89554242248376873 Q gi|254781033|r 392 PLWISDD-GYLSSVWLPLSPISKTL 415 (492) Q Consensus 392 p~W~c~~-g~~~~~W~~~c~~~~~~ 415 (492) -.|.|.. +....-|...|-.|+.. T Consensus 3 g~W~C~~C~v~N~~~~~~C~aC~~p 27 (30) T pfam00641 3 GDWDCSSCLVQNFATSTKCVACQAP 27 (30) T ss_pred CCCCCCEEEECCCHHCCEEEECCCC T ss_conf 6441880466254320233553788 No 431 >TIGR00603 rad25 DNA repair helicase rad25; InterPro: IPR001161 Xeroderma pigmentosum (XP) is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair , . XP-G can be corrected by a 133 Kd nuclear protein, XPGC . XPGC is an acidic protein that confers normal UV resistance in expressing cells . It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms , . XPGC cleaves one strand of the duplex at the border with the single-stranded region . XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases , ; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C-terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved. This entry represents XP group B (XP-B) give rise to both XP and Cockayne syndrome . The DNA/RNA helicase domain IPR001650 from INTERPRO is also present in this group of proteins.; GO: 0003677 DNA binding, 0004003 ATP-dependent DNA helicase activity, 0005524 ATP binding, 0006289 nucleotide-excision repair, 0005634 nucleus. Probab=26.79 E-value=30 Score=12.19 Aligned_cols=40 Identities=28% Similarity=0.525 Sum_probs=16.8 Q ss_pred CCEECC-CCCC-CHHHCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 952756-9895-5424224837687364003678755445666 Q gi|254781033|r 392 PLWISD-DGYL-SSVWLPLSPISKTLCYFEWKIPTKSPEYISS 432 (492) Q Consensus 392 p~W~c~-~g~~-~~~W~~~c~~~~~~~~~~W~~p~~~~~~~~~ 432 (492) |-|+|. ||++ -+..+|+-..-..|= +.=..|+.+|..++. T Consensus 16 PLW~~P~DG~IfLE~FSP~yk~A~DFL-vaIaEPvcRP~~iHE 57 (756) T TIGR00603 16 PLWVAPIDGHIFLESFSPLYKQAQDFL-VAIAEPVCRPEHIHE 57 (756) T ss_pred CEEECCCCCCEEEEHHHHHHHHHCCCE-EEEECCCCCCCCCCC T ss_conf 647716788076300215677431311-454067788786443 No 432 >pfam03745 DUF309 Domain of unknown function (DUF309). This domain is found in eubacterial and archaebacterial proteins of unknown function. The proteins contain a motif HXXXEXX(W/Y) where X can be any amino acid. This motif is likely to be functionally important and may be involved in metal binding. Probab=26.73 E-value=30 Score=12.19 Aligned_cols=29 Identities=28% Similarity=0.528 Sum_probs=19.5 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCH Q ss_conf 99999997399478999999999832898 Q gi|254781033|r 265 CAAKSLISQNKKRKAEVILEKIWKVNPHP 293 (492) Q Consensus 265 ~~a~~l~~~g~~~~A~~~le~a~~~~p~p 293 (492) ..+..+...|++=+|-.++|..|...+.| T Consensus 4 ~~g~~lfn~g~~~eaHEvlE~~W~~~~~~ 32 (62) T pfam03745 4 REGVELFNAGDYFEAHEVLEELWKAAPGP 32 (62) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCH T ss_conf 99999986788999999999998758971 No 433 >PRK10525 cytochrome o ubiquinol oxidase subunit II; Provisional Probab=26.69 E-value=30 Score=12.18 Aligned_cols=25 Identities=12% Similarity=0.126 Sum_probs=10.7 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 96799999999999999999734898 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPE 26 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G 26 (492) |.|.+-.+-.+++++ +...|++..+ T Consensus 3 ~~~~~~~~~~l~l~~-~~~lLsGC~~ 27 (305) T PRK10525 3 LRKYNKSLGWLSLFA-GTVLLSGCNS 27 (305) T ss_pred HHHHHHHHHHHHHHH-HHHHHCCCCC T ss_conf 667777899999999-9998667876 No 434 >PRK08197 threonine synthase; Validated Probab=26.45 E-value=30 Score=12.15 Aligned_cols=41 Identities=7% Similarity=0.033 Sum_probs=16.7 Q ss_pred CCHHHHHHHHHHHHHHCC----CHHHHHHHHHHHHC---CCCCCCEEC Q ss_conf 998999999999998579----98999999999962---889995275 Q gi|254781033|r 356 PRKEIFLLLAQIEQANSH----NTDKILYWTQSALH---AMPDPLWIS 396 (492) Q Consensus 356 P~~~~~~llA~ie~~e~~----d~~~ar~wl~rA~~---a~~dp~W~c 396 (492) .+.++...+..+.+.++- ..+.+-..+.|+.+ ..+|-.=+| T Consensus 326 sD~Ei~~A~~~La~~eGi~~EpasA~alAgl~kl~e~G~i~~~~~VV~ 373 (402) T PRK08197 326 DDEAILAALSELAREEGLFACPEGAAALAAAKQLRESGWLKSGERVVL 373 (402) T ss_pred CHHHHHHHHHHHHHHCCEEECCHHHHHHHHHHHHHHCCCCCCCCEEEE T ss_conf 999999999999985592777369999999999998299999996999 No 435 >TIGR02928 TIGR02928 orc1/cdc6 family replication initiation protein; InterPro: IPR014277 This set of DNA binding proteins shows homology to the origin recognition complex subunit 1/cell division control protein 6 family in eukaryotes. The proteins in this entry are found exclusively in the archaea. Several members may be found in a genome and interact with each other.. Probab=26.35 E-value=30 Score=12.14 Aligned_cols=25 Identities=28% Similarity=0.319 Sum_probs=15.6 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHH Q ss_conf 9999999739947899999999983 Q gi|254781033|r 265 CAAKSLISQNKKRKAEVILEKIWKV 289 (492) Q Consensus 265 ~~a~~l~~~g~~~~A~~~le~a~~~ 289 (492) ++|-.-.+.|+.|+|..+|+.+-.. T Consensus 243 cAA~aAq~hGDAR~AiDLLR~AGe~ 267 (383) T TIGR02928 243 CAALAAQEHGDARKAIDLLRVAGEI 267 (383) T ss_pred HHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 9998620678789999999987687 No 436 >pfam09723 CxxC_CxxC_SSSS Zinc ribbon domain. This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence. Probab=26.34 E-value=27 Score=12.46 Aligned_cols=21 Identities=19% Similarity=0.208 Sum_probs=11.4 Q ss_pred EECC-CCCCCHHH-------CCCCCCCCC Q ss_conf 2756-98955424-------224837687 Q gi|254781033|r 394 WISD-DGYLSSVW-------LPLSPISKT 414 (492) Q Consensus 394 W~c~-~g~~~~~W-------~~~c~~~~~ 414 (492) ..|. ||+..+.+ ...||.||. T Consensus 6 y~C~~Cg~~fe~~~~~~~~~~~~CP~C~~ 34 (42) T pfam09723 6 YRCEDCGHTFEVLQKISDAPLATCPECGS 34 (42) T ss_pred EEECCCCCEEEEEEECCCCCCCCCCCCCC T ss_conf 89488999899998657998876979999 No 437 >TIGR02177 PorB_KorB 2-oxoacid:acceptor oxidoreductase, beta subunit, pyruvate/2-ketoisovalerate family; InterPro: IPR011896 A number of anaerobic and microaerophilic species lack pyruvate dehydrogenase and have instead a four subunit, oxygen-sensitive pyruvate oxidoreductase, with either ferredoxins or flavodoxins used as the acceptor. Several related four-subunit enzymes may exist in the same species. This entry describes the beta subunits, representing mostly pyruvate and 2-ketoisovalerate specific enzymes.. Probab=26.19 E-value=22 Score=13.17 Aligned_cols=60 Identities=17% Similarity=0.267 Sum_probs=38.5 Q ss_pred CCCHHHHHHHHH----HHHHHCCCHHHHHHHHHHHHCCCCCCCEECCCCCCCHHHCCCCCCCCCCCCCCCCCC Q ss_conf 999899999999----999857998999999999962889995275698955424224837687364003678 Q gi|254781033|r 355 APRKEIFLLLAQ----IEQANSHNTDKILYWTQSALHAMPDPLWISDDGYLSSVWLPLSPISKTLCYFEWKIP 423 (492) Q Consensus 355 ~P~~~~~~llA~----ie~~e~~d~~~ar~wl~rA~~a~~dp~W~c~~g~~~~~W~~~c~~~~~~~~~~W~~p 423 (492) +|=+.+.+..|. |+|+=.+|..--.+-+-.|++= .|+.-=|=.-=|+..+.++|.+|-.+ T Consensus 138 ~~~NP~lLAiA~GytFVARgfs~d~~hLKeiiK~A~~H---------~G~AlvDiLQPCvTyN~inT~~WY~~ 201 (302) T TIGR02177 138 DPVNPLLLAIAAGYTFVARGFSGDVAHLKEIIKEAINH---------KGYALVDILQPCVTYNKINTYEWYKE 201 (302) T ss_pred CCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHCC---------CCCEEEEECCCCCCCCCCCCHHHHHH T ss_conf 54138999998085274588887489999999997328---------98458761178887466666576764 No 438 >TIGR00785 dass transporter, divalent anion:Na+ symporter (DASS) family; InterPro: IPR001898 Integral membrane proteins that mediate the intake of a wide variety of molecules with the concomitant uptake of sodium ions (sodium symporters) can be grouped, on the basis of sequence and functional similarities into a number of distinct families. One of these families currently consists of the following proteins: Mammalian sodium/sulphate cotransporter . Mammalian renal sodium/dicarboxylate cotransporter , which transports succinate and citrate. Mammalian intestinal sodium/dicarboxylate cotransporter. Chlamydomonas reinhardtii putative sulphur deprivation response regulator SAC1 . Caenorhabditis elegans hypothetical proteins B0285.6, F31F6.6, K08E5.2 and R107.1. Escherichia coli hypothetical protein yfbS. Haemophilus influenzae hypothetical protein HI0608. Synechocystis sp. (strain PCC 6803) hypothetical protein sll0640. Methanococcus jannaschii hypothetical protein MJ0672. These transporters are proteins of from 430 to 620 amino acids which are highly hydrophobic and which probably contain about 12 transmembrane regions.; GO: 0005215 transporter activity, 0006814 sodium ion transport, 0016020 membrane. Probab=25.96 E-value=31 Score=12.09 Aligned_cols=34 Identities=18% Similarity=0.354 Sum_probs=16.7 Q ss_pred CEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHH Q ss_conf 8489998997662218999999999999999999 Q gi|254781033|r 26 EDVSITWGNRLYRTSPFVILSILYFFLFAWILLF 59 (492) Q Consensus 26 G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~ 59 (492) .+..|.|.+|-.-.+++..++++++...++|.++ T Consensus 204 ~g~~isw~~W~~~~~lP~~~~llll~~~llY~~f 237 (473) T TIGR00785 204 LGIEISWGDWFLAGFLPLGIILLLLVPLLLYVLF 237 (473) T ss_pred HCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 2000476678999452599999999999986526 No 439 >PRK13728 conjugal transfer protein TrbB; Provisional Probab=25.95 E-value=25 Score=12.78 Aligned_cols=31 Identities=19% Similarity=0.472 Sum_probs=22.8 Q ss_pred CCCCCCEEC-CCCC-C-CHHHC------CCCCCCCCCCCC Q ss_conf 889995275-6989-5-54242------248376873640 Q gi|254781033|r 388 AMPDPLWIS-DDGY-L-SSVWL------PLSPISKTLCYF 418 (492) Q Consensus 388 a~~dp~W~c-~~g~-~-~~~W~------~~c~~~~~~~~~ 418 (492) ..++|.|.- .+|- + ..+|. ..||.|+.|+-. T Consensus 51 ~~~~p~~~~LsdGr~vnl~dy~lV~F~rS~CPyChqFAP~ 90 (181) T PRK13728 51 EKPAPRWFRLSNGRQVNLADWKVVLFMQGHCPYCHQFDPV 90 (181) T ss_pred CCCCCCCEECCCCCEECCCCEEEEEEECCCCCHHHHHCHH T ss_conf 7899962588999686243418999973889257765889 No 440 >KOG0530 consensus Probab=25.94 E-value=31 Score=12.08 Aligned_cols=79 Identities=9% Similarity=0.014 Sum_probs=32.1 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 1689999997268850579998654552158869-999999988842126989999999999952574569999999995 Q gi|254781033|r 138 IAHEKLEMMLQIPATREFAVYSLYFESCRIGDLN-SAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKK 216 (492) Q Consensus 138 ~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~-~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~ 216 (492) +-.+++...+.+.+........--......||+. .-+....+.+..+.+|-.++.-.--+...-++|+.=+......++ T Consensus 96 ~El~~l~eI~e~npKNYQvWHHRr~ive~l~d~s~rELef~~~~l~~DaKNYHaWshRqW~~r~F~~~~~EL~y~~~Lle 175 (318) T KOG0530 96 KELEYLDEIIEDNPKNYQVWHHRRVIVELLGDPSFRELEFTKLMLDDDAKNYHAWSHRQWVLRFFKDYEDELAYADELLE 175 (318) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999987386521477999999998069411037899999711111002668999999987507889999999998 No 441 >COG4814 Uncharacterized protein with an alpha/beta hydrolase fold [General function prediction only] Probab=25.86 E-value=18 Score=13.86 Aligned_cols=22 Identities=5% Similarity=-0.059 Sum_probs=12.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9679999999999999999973 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVS 22 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~ 22 (492) |.|+++.++++++.++++.|+. T Consensus 1 MKK~f~~Ivill~g~~i~g~i~ 22 (288) T COG4814 1 MKKVFFSIVILLAGALILGWII 22 (288) T ss_pred CCEEEEEHHHHHHHHHHHHHHE T ss_conf 9517876069999999976521 No 442 >PRK01741 cell division protein ZipA; Provisional Probab=25.52 E-value=31 Score=12.03 Aligned_cols=15 Identities=7% Similarity=0.031 Sum_probs=5.4 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 899999999999999 Q gi|254781033|r 41 PFVILSILYFFLFAW 55 (492) Q Consensus 41 ~~~~i~~~~~~~~~~ 55 (492) ++++|++.+++++++ T Consensus 5 lILIIvGaIAI~ALL 19 (342) T PRK01741 5 TILIILGILALVALV 19 (342) T ss_pred EHHHHHHHHHHHHHH T ss_conf 549999999999999 No 443 >COG4320 Uncharacterized protein conserved in bacteria [Function unknown] Probab=25.44 E-value=30 Score=12.15 Aligned_cols=27 Identities=11% Similarity=0.069 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHC Q ss_conf 789999999998328989999999850 Q gi|254781033|r 277 RKAEVILEKIWKVNPHPEIANIYTHLL 303 (492) Q Consensus 277 ~~A~~~le~a~~~~p~p~La~a~~~~~ 303 (492) .+|.++...+-.+.-+|+|..-|..+. T Consensus 279 ~eaaRV~~~araq~~sp~ll~~~v~ig 305 (410) T COG4320 279 HEAARVARTARAQLLSPDLLIGYVSIG 305 (410) T ss_pred HHHHHHHHHHHHHCCCHHHCCCEEEEC T ss_conf 467788765555305865547558834 No 444 >pfam10956 DUF2756 Protein of unknown function (DUF2756). Some members in this family of proteins are annotated yhhA however currently no function is known. The family appears to be restricted to Enterobacteriaceae. Probab=25.43 E-value=31 Score=12.02 Aligned_cols=18 Identities=17% Similarity=0.263 Sum_probs=9.5 Q ss_pred CHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSF 18 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~ 18 (492) |.|++++..++-++++.- T Consensus 1 MKrll~l~allPf~~~Aq 18 (104) T pfam10956 1 MKRLLLLAALLPFAVLAQ 18 (104) T ss_pred CHHHHHHHHHHHHHHHHC T ss_conf 916999999867888715 No 445 >pfam01535 PPR PPR repeat. This repeat has no known function. It is about 35 amino acids long and found in up to 18 copies in some proteins. This family appears to be greatly expanded in plants. #=GF CC This repeat occurs in PET309, which may be involved in RNA stabilisation. This domain occurs in crp1 that is involved in RNA processing. This repeat is associated with a predicted plant protein that has a domain organisation similar to the human BRCA1 protein. The repeat has been called PPR. Probab=25.25 E-value=31 Score=11.99 Aligned_cols=24 Identities=17% Similarity=0.385 Sum_probs=11.0 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 999999995257456999999999 Q gi|254781033|r 192 EAVVQQYVLAKEWSRAITFLNQKK 215 (492) Q Consensus 192 ~~L~~l~~~~gdw~~A~~~l~~~~ 215 (492) ..+...+.+.|++++|.+++.+.. T Consensus 4 n~li~~~~~~g~~~~a~~~~~~M~ 27 (31) T pfam01535 4 NSLISGYCKAGKLEEALELFKEMK 27 (31) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 999999987799999999999998 No 446 >TIGR00805 oat sodium-independent organic anion transporter; InterPro: IPR004156 This family consists of several eukaryotic Organic-Anion-Transporting Polypeptides (OATPs). Several have been identified mostly in human and rat. Different OATPs vary in tissue distribution and substrate specificity. Since the numbering of different OATPs in particular species was based originally on the order of discovery, similarly numbered OATPs in humans and rats did not necessarily correspond in function, tissue distribution and substrate specificity (in spite of the name, some OATPs also transport organic cations and neutral molecules) so a scheme of using digits for rat OATPs and letters for human ones was introduced . Prostaglandin transporter (PGT) proteins are also considered to be OATP family members. In addition, the methotrexate transporter OATK is closely related to OATPs. This family also includes several predicted proteins from Caenorhabditis elegans and Drosophila melanogaster. This similarity was not previously noted. Note: Members of this family are described (in the Swiss-Prot database) as belonging to the SLC21 family of transporters.; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane. Probab=25.18 E-value=32 Score=11.98 Aligned_cols=29 Identities=14% Similarity=0.014 Sum_probs=18.8 Q ss_pred ECC-CCC-CCHHHCCCCCCCCCCCCCCCCCC Q ss_conf 756-989-55424224837687364003678 Q gi|254781033|r 395 ISD-DGY-LSSVWLPLSPISKTLCYFEWKIP 423 (492) Q Consensus 395 ~c~-~g~-~~~~W~~~c~~~~~~~~~~W~~p 423 (492) -|. +|+ ....|.|||.+.|.--.|..-.| T Consensus 528 ~Cn~~C~C~~~~~~PVC~~~g~Grq~~Y~Sp 558 (723) T TIGR00805 528 DCNRQCSCDSSFFDPVCGDNGDGRQLAYLSP 558 (723) T ss_pred HCCCCCCCCCCCCCCCCCCCCCCCEEEEECC T ss_conf 6034678988766884687989545766275 No 447 >KOG0546 consensus Probab=25.17 E-value=32 Score=11.98 Aligned_cols=40 Identities=13% Similarity=0.052 Sum_probs=21.6 Q ss_pred HCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH Q ss_conf 2689836899999999987998899999887866099989 Q gi|254781033|r 320 INKESVESLVIVSKIALEMGSIDQAHAKAMLAMKIAPRKE 359 (492) Q Consensus 320 ~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~~~P~~~ 359 (492) ..+.....+.-.+..+....++++|.+.++.+....|+.. T Consensus 304 ~~~s~tka~~Rr~~~~~~~~~~~~a~~~~~~a~~~~p~d~ 343 (372) T KOG0546 304 DERSKTKAHYRRGQAYKLLKNYDEALEDLKKAKQKAPNDK 343 (372) T ss_pred CCHHHCCHHHHHHHHHHHHHCHHHHHHHHHHHHCCCCCHH T ss_conf 6811182888877688764345666778887542285449 No 448 >COG3364 Zn-ribbon containing protein [General function prediction only] Probab=25.14 E-value=30 Score=12.19 Aligned_cols=22 Identities=14% Similarity=-0.118 Sum_probs=14.9 Q ss_pred ECC-CCCCCHH----HCCCCCCCCCCC Q ss_conf 756-9895542----422483768736 Q gi|254781033|r 395 ISD-DGYLSSV----WLPLSPISKTLC 416 (492) Q Consensus 395 ~c~-~g~~~~~----W~~~c~~~~~~~ 416 (492) .|. ||++.++ -..=||+||.=- T Consensus 4 ~CtrCG~vf~~g~~~il~GCp~CG~nk 30 (112) T COG3364 4 QCTRCGEVFDDGSEEILSGCPKCGCNK 30 (112) T ss_pred EECCCCCCCCCCCHHHHCCCCCCCCHH T ss_conf 214044563651299881572016311 No 449 >CHL00177 ccs1 c-type cytochrome biogenensis protein; Validated Probab=25.04 E-value=24 Score=12.98 Aligned_cols=20 Identities=15% Similarity=0.258 Sum_probs=10.8 Q ss_pred HHHHHHHHHHHHHHHHHHHC Q ss_conf 99999999999999999734 Q gi|254781033|r 4 LIRYFFVISLVICSFIIVSH 23 (492) Q Consensus 4 ~l~~~~~i~~~~~~~~~l~~ 23 (492) -++++++++++...++.+-. T Consensus 17 aI~Ll~~iaiaS~iGTvipQ 36 (427) T CHL00177 17 AIILLLIIALFSAIGTVIEQ 36 (427) T ss_pred HHHHHHHHHHHHHCCCCCCC T ss_conf 99999999999861311557 No 450 >PRK00349 uvrA excinuclease ABC subunit A; Reviewed Probab=25.03 E-value=32 Score=11.96 Aligned_cols=27 Identities=11% Similarity=-0.006 Sum_probs=12.9 Q ss_pred CCCCCCEECC-CCCCC------HHHCCCCCCCCC Q ss_conf 8899952756-98955------424224837687 Q gi|254781033|r 388 AMPDPLWISD-DGYLS------SVWLPLSPISKT 414 (492) Q Consensus 388 a~~dp~W~c~-~g~~~------~~W~~~c~~~~~ 414 (492) .+-+.+=.|. .|++. ++..-.||.|+. T Consensus 736 ~~gGrC~~C~G~G~~~~~m~Fl~d~~~~C~~C~G 769 (944) T PRK00349 736 VKGGRCEACQGDGVIKIEMHFLPDVYVPCDVCKG 769 (944) T ss_pred CCCCCCCCCCCCCCEEEEHHCCCCCEEECCCCCC T ss_conf 9896488765447289841028985347999898 No 451 >pfam05817 Ribophorin_II Oligosaccharyltransferase subunit Ribophorin II. This family contains eukaryotic Ribophorin II (RPN2) proteins. The mammalian oligosaccharyltransferase (OST) is a protein complex that effects the cotranslational N-glycosylation of newly synthesized polypeptides, and is composed of the following proteins: ribophorins I and II (RI and RII), OST48, and Dadl, N33/IAP, OST4, STT3. The family also includes the SWP1 protein from yeast. In yeast the oligosaccharyltransferase complex is composed 7 or 8 subunits, SWP1, being one of them. Probab=24.98 E-value=32 Score=11.96 Aligned_cols=25 Identities=20% Similarity=0.200 Sum_probs=11.4 Q ss_pred CHHHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 8368999999999879988999998 Q gi|254781033|r 324 SVESLVIVSKIALEMGSIDQAHAKA 348 (492) Q Consensus 324 ~~e~~~~LA~~~~~~g~~~~Ar~~l 348 (492) ..+=-..+|...+..+.++.++..+ T Consensus 234 ~~dQ~vklaNy~LSrk~vqt~k~a~ 258 (636) T pfam05817 234 PEEQIVKLANYFLSRKSVQSAKDAF 258 (636) T ss_pred CHHHHHHHHHHHHCCCCCCCHHHHH T ss_conf 9899999999983277878747899 No 452 >pfam07509 DUF1523 Protein of unknown function (DUF1523). Probab=24.91 E-value=32 Score=11.95 Aligned_cols=17 Identities=24% Similarity=0.149 Sum_probs=8.9 Q ss_pred CCCCEEEEEECCEEEEE Q ss_conf 48984899989976622 Q gi|254781033|r 23 HYPEDVSITWGNRLYRT 39 (492) Q Consensus 23 ~~~G~v~i~~~~~~ie~ 39 (492) ++|..|.|..=||++++ T Consensus 111 ~~~~wV~I~hYGWR~~~ 127 (174) T pfam07509 111 DDPQWVEITYYGWRINM 127 (174) T ss_pred CCCCEEEEEEEEHHHHH T ss_conf 79847999973311478 No 453 >pfam11381 DUF3185 Protein of unknown function (DUF3185). Some members in this bacterial family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known. Probab=24.67 E-value=30 Score=12.13 Aligned_cols=48 Identities=15% Similarity=0.100 Sum_probs=21.1 Q ss_pred HHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 999999999999998418288999999999999999999999997210 Q gi|254781033|r 51 FLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKGYKALYTGLMSIAAH 98 (492) Q Consensus 51 ~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~~~al~~gl~al~~G 98 (492) ++....++++-+...-+..+.++|.+--+--+|..--+.-|.++...| T Consensus 8 lv~GivLl~~Gy~~~~S~~sq~~r~~TG~~td~t~~~ligG~va~ivG 55 (59) T pfam11381 8 LVLGIVLLYFGYQASDSLGSQATRAFTGSPTDKTMWYLIGGVVAVIVG 55 (59) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHCHHHHHHH T ss_conf 986599998621010367788999971999336588876256878622 No 454 >PRK12468 flhB flagellar biosynthesis protein FlhB; Reviewed Probab=24.51 E-value=32 Score=11.89 Aligned_cols=10 Identities=0% Similarity=0.178 Sum_probs=4.0 Q ss_pred HHHHHHHCCC Q ss_conf 9999998799 Q gi|254781033|r 331 VSKIALEMGS 340 (492) Q Consensus 331 LA~~~~~~g~ 340 (492) |||..+...+ T Consensus 318 LAR~Ly~~~e 327 (383) T PRK12468 318 LARALFRHSE 327 (383) T ss_pred HHHHHHHCCC T ss_conf 9999987177 No 455 >PRK13130 H/ACA RNA-protein complex component Nop10p; Reviewed Probab=24.45 E-value=32 Score=11.89 Aligned_cols=18 Identities=6% Similarity=-0.143 Sum_probs=0.0 Q ss_pred CCCCCCCHHHCCCCCCCCC Q ss_conf 5698955424224837687 Q gi|254781033|r 396 SDDGYLSSVWLPLSPISKT 414 (492) Q Consensus 396 c~~g~~~~~W~~~c~~~~~ 414 (492) |..|..+.- .-+||+||. T Consensus 8 C~~Cg~YTL-~~~CP~CG~ 25 (54) T PRK13130 8 CPKCGRYTL-KETCPECGG 25 (54) T ss_pred CCCCCCEEC-CCCCCCCCC T ss_conf 863564303-252866887 No 456 >pfam05249 consensus Probab=24.40 E-value=33 Score=11.88 Aligned_cols=62 Identities=19% Similarity=0.256 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 799999999999999999734898489998997662218999999999999999999999998418288999999999 Q gi|254781033|r 3 RLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 3 r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) |++--++++.+.+.+..|+....-.+.+.+. .++...+..++.++++ ++.-....||++.|+ T Consensus 23 ~i~p~vl~~~~~s~~v~~~~~~~~~~~~~~~-----~~p~tl~G~aLsllLv-----------FR~n~aYdRwWEaRk 84 (305) T pfam05249 23 AISSRVILCFLFSLVVILVAGWNSPVSIPFL-----PTPFSLPGIALALLLV-----------FRTNAAYDRFWEGRK 84 (305) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC-----CCHHHHHHHHHHHHHH-----------HCCCHHHHHHHHHHH T ss_conf 8999999999999999999984566677768-----8628899999999998-----------537148999999999 No 457 >pfam11157 DUF2937 Protein of unknown function (DUF2937). This family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=24.32 E-value=33 Score=11.87 Aligned_cols=41 Identities=12% Similarity=-0.199 Sum_probs=0.0 Q ss_pred EECCEEEEEHH-HHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 98997662218-999999999999999999999998418288 Q gi|254781033|r 31 TWGNRLYRTSP-FVILSILYFFLFAWILLFAVSRFFLSCPAM 71 (492) Q Consensus 31 ~~~~~~ie~s~-~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~ 71 (492) +|.+|+.-+++ .-.++.-++.+++.++++.++..++..|.+ T Consensus 121 t~~~f~pavplt~~gi~~g~vg~l~~~~l~~ll~~l~~~~~r 162 (167) T pfam11157 121 TWQAFSPAVPLTPEGIVFGLVGGLLGALLAELLLGLLRLLFR 162 (167) T ss_pred HHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 999688868188999999999999999999999999886523 No 458 >PRK13697 cytochrome c6; Provisional Probab=24.23 E-value=33 Score=11.86 Aligned_cols=19 Identities=11% Similarity=0.324 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHH Q ss_conf 9679999999999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFI 19 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~ 19 (492) |.|++.++++.+++..++. T Consensus 1 Mk~l~~~~l~~~~~~~~~~ 19 (111) T PRK13697 1 MKKILKLVLLTLLLLTFAF 19 (111) T ss_pred CHHHHHHHHHHHHHHHHHC T ss_conf 9789999999999999980 No 459 >COG1645 Uncharacterized Zn-finger containing protein [General function prediction only] Probab=24.21 E-value=27 Score=12.50 Aligned_cols=17 Identities=12% Similarity=0.122 Sum_probs=0.0 Q ss_pred CC-CC---CCCHHHCCCCCCCC Q ss_conf 56-98---95542422483768 Q gi|254781033|r 396 SD-DG---YLSSVWLPLSPISK 413 (492) Q Consensus 396 c~-~g---~~~~~W~~~c~~~~ 413 (492) |. || +. .+=.-+||+|| T Consensus 31 Cp~Cg~PLF~-KdG~v~CPvC~ 51 (131) T COG1645 31 CPKCGTPLFR-KDGEVFCPVCG 51 (131) T ss_pred CCCCCCCCEE-ECCEEECCCCC T ss_conf 8655883163-08958777777 No 460 >PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional Probab=24.18 E-value=30 Score=12.18 Aligned_cols=23 Identities=17% Similarity=0.215 Sum_probs=0.0 Q ss_pred CEECC-CCC-CCHHHCCC----------------------CCCCCCC Q ss_conf 52756-989-55424224----------------------8376873 Q gi|254781033|r 393 LWISD-DGY-LSSVWLPL----------------------SPISKTL 415 (492) Q Consensus 393 ~W~c~-~g~-~~~~W~~~----------------------c~~~~~~ 415 (492) .|.|+ |++ +++.|-.+ |++||.. T Consensus 120 ~WyC~~C~~~l~e~~f~~~di~~ql~pv~~~f~~see~RTC~~CG~v 166 (177) T PRK13264 120 EWYCDECNHKVHRVEVQLKNIVTDLPPVFDHFYASEELRTCKQCGHV 166 (177) T ss_pred EEECCCCCCEEEEEEEEEECHHHHHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 89758989889999999637788779999998567134718777772 No 461 >COG1337 CRISPR system related protein, RAMP superfamily [Defense mechanisms] Probab=24.15 E-value=33 Score=11.85 Aligned_cols=35 Identities=11% Similarity=0.115 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHCCCCCCCEECCCCCCCHHHCCCCCCCCCCCC Q ss_conf 8999999999962889995275698955424224837687364 Q gi|254781033|r 375 TDKILYWTQSALHAMPDPLWISDDGYLSSVWLPLSPISKTLCY 417 (492) Q Consensus 375 ~~~ar~wl~rA~~a~~dp~W~c~~g~~~~~W~~~c~~~~~~~~ 417 (492) .++.|..+++.++. ++.+.+.|+ .++ |++|..|++ T Consensus 56 KG~~Rsl~E~~l~~--~~~~~~~c~--~d~----~~v~r~fg~ 90 (249) T COG1337 56 KGKFRSLLERLLRK--NPIGYHSCG--LDE----CVVCRLFGS 90 (249) T ss_pred HHHHHHHHHHHHCC--CCCCEECCC--CCC----CHHHHHHCC T ss_conf 68999999998545--987202267--556----703344245 No 462 >KOG0686 consensus Probab=24.12 E-value=33 Score=11.84 Aligned_cols=100 Identities=12% Similarity=0.056 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHH---HHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHC----CCCHHHHHHH Q ss_conf 7999865455215886999999998---88421269899999999999525745699999999950----2310357899 Q gi|254781033|r 155 FAVYSLYFESCRIGDLNSAQRYATK---ALDISPDAPWVTEAVVQQYVLAKEWSRAITFLNQKKKN----AKEWNRNRAI 227 (492) Q Consensus 155 ~~~~gL~~~al~~gd~~~A~~~~~~---a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~l~~~~~~----~~~~~~~~a~ 227 (492) .|+.-++..++..|+.+.|++.+-+ ..............+..+-.-.|+|........++... .........- T Consensus 151 ra~~Dl~dhy~~cG~l~~Alr~YsR~RdYCTs~khvInm~ln~i~VSI~~~nw~hv~sy~~~A~st~~~~~~~~q~v~~k 230 (466) T KOG0686 151 RALEDLGDHYLDCGQLDNALRCYSRARDYCTSAKHVINMCLNLILVSIYMGNWGHVLSYISKAESTPDANENLAQEVPAK 230 (466) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHCCCC T ss_conf 99999999999843388887630133454300189999999999999742431433367777774730245577744841 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 999999999966998999999987543 Q gi|254781033|r 228 LLIARSLENADKGDMIASYHDAIESLK 254 (492) Q Consensus 228 l~~~~a~~~~~~~~~~~A~~~~~~a~~ 254 (492) +....+...+..++...|.+.+..+.. T Consensus 231 l~C~agLa~L~lkkyk~aa~~fL~~~~ 257 (466) T KOG0686 231 LKCAAGLANLLLKKYKSAAKYFLLAEF 257 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 679988999987778999999982787 No 463 >KOG4814 consensus Probab=24.03 E-value=33 Score=11.83 Aligned_cols=156 Identities=11% Similarity=0.054 Sum_probs=0.0 Q ss_pred HHHCCCCHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCC Q ss_conf 9952574569999999995--02310357899999999999966998999999987543133527999999999997399 Q gi|254781033|r 198 YVLAKEWSRAITFLNQKKK--NAKEWNRNRAILLIARSLENADKGDMIASYHDAIESLKLCDNSIMASICAAKSLISQNK 275 (492) Q Consensus 198 ~~~~gdw~~A~~~l~~~~~--~~~~~~~~~a~l~~~~a~~~~~~~~~~~A~~~~~~a~~~~p~~~~a~~~~a~~l~~~g~ 275 (492) ..+.++|..++..+..-++ ..+..++.-+-+....+..++...+.+.|.+.+.+|-+.+|..+--....-......|+ T Consensus 364 ~F~~~~Y~~s~~~y~~Sl~~i~~D~~~~~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~~d~~~~l~q~~~~~~~~~E~~ 443 (872) T KOG4814 364 LFKMEKYVVSIRFYKLSLKDIISDNYSDRFAKIQRALQVCYLKLEQLDNAVEVYQEAEEVDRQSPLCQLLMLQSFLAEDK 443 (872) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 75377689999999998874475046578999999999998259988779999999986073238899999999998322 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 478999999999832898999999985078958999999999982689836899999999987998899999887866 Q gi|254781033|r 276 KRKAEVILEKIWKVNPHPEIANIYTHLLSENTVGKLKRALRLEEINKESVESLVIVSKIALEMGSIDQAHAKAMLAMK 353 (492) Q Consensus 276 ~~~A~~~le~a~~~~p~p~La~a~~~~~~~~~~~al~~~~~l~~~~P~~~e~~~~LA~~~~~~g~~~~Ar~~l~~al~ 353 (492) ..+|...+.+.....-+..--..+.-...+.|..-+..+..+.-.+.++..+--.+-.+++...+-..-...-+.+++ T Consensus 444 Se~AL~~~~~~~s~~~~~~~~~~~l~~~~~~PTt~lsv~~~l~~~n~Q~I~sE~~~~~l~~~~~~q~~v~~~~~e~L~ 521 (872) T KOG4814 444 SEEALTCLQKIKSSEDEKSTDALILAVAECKPTTDLSVQGLLMIFNSQTISSESSSQTLSILYTLQMIVKVSEEEPLE 521 (872) T ss_pred HHHHHHHHHHHHHHHHCCCCCHHHHHHCCCCCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 177899999987754021220237886058982589999999998400341322678888766678999888887899 No 464 >TIGR02108 PQQ_syn_pqqB coenzyme PQQ biosynthesis protein B; InterPro: IPR011842 This entry describes coenzyme PQQ biosynthesis protein B, a gene required for the biosynthesis of pyrrolo-quinoline-quinone (coenzyme PQQ). PQQ is required for some glucose dehydrogenases and alcohol dehydrogenases. Note that this gene appears to be required for PQQ in biosynthesis in Methylobacterium extorquens (under the name pqqG) and in Klebsiella pneumoniae but that the equivalent pqqV in Acinetobacter calcoaceticus is not necessary for heterologous expression of PQQ biosynthesis in Escherichia coli. Based on this latter finding, it is suggested that PqqB might be a transporter or a PQQ-dependent enzyme rather than a PQQ biosynthesis enzyme .; GO: 0006810 transport, 0018189 pyrroloquinoline quinone biosynthetic process. Probab=23.78 E-value=26 Score=12.65 Aligned_cols=10 Identities=30% Similarity=0.713 Sum_probs=0.0 Q ss_pred CCEECCCCCCCHHHCCCCCCC Q ss_conf 952756989554242248376 Q gi|254781033|r 392 PLWISDDGYLSSVWLPLSPIS 412 (492) Q Consensus 392 p~W~c~~g~~~~~W~~~c~~~ 412 (492) |||-|.| |+| T Consensus 14 PQWNC~C-----------~~C 23 (314) T TIGR02108 14 PQWNCNC-----------PNC 23 (314) T ss_pred CCCCCCC-----------HHH T ss_conf 7525687-----------034 No 465 >pfam12297 EVC2_like Ellis van Creveld protein 2 like protein. This family of proteins is found in eukaryotes. Proteins in this family are typically between 571 and 1310 amino acids in length. There are two conserved sequence motifs: LPA and ELH. EVC2 is implicated in Ellis van Creveld chondrodysplastic dwarfism in humans. Mutations in this protein can give rise to this congenital condition. LIMBIN is a protein which shares around 80% sequence homology with EVC2 and it is implicated in a similar condition in bovine chondrodysplastic dwarfism. Probab=23.72 E-value=33 Score=11.79 Aligned_cols=39 Identities=18% Similarity=0.148 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 999999999999999999998418288999999999999 Q gi|254781033|r 45 LSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDK 83 (492) Q Consensus 45 i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k 83 (492) ..+++++|++.+++.++..++.+-...+...+-.|.+.+ T Consensus 67 ~agf~vaf~vsiVlt~~~~f~l~rt~~~~~~~l~r~r~~ 105 (429) T pfam12297 67 AAGFVVAFLVSIVLTLLAFFLLGRTRCLQGRMLSRGRSK 105 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH T ss_conf 889999999999999999999987411013531011343 No 466 >PRK01294 lipase chaperone; Provisional Probab=23.70 E-value=33 Score=11.78 Aligned_cols=25 Identities=12% Similarity=0.199 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCC Q ss_conf 9679999999999999999973489 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYP 25 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~ 25 (492) |.|+.+++++.++++++.+++...+ T Consensus 4 ~~~~~~~~~~~~~~~~~~~~~~~~~ 28 (339) T PRK01294 4 MRKVALYLIIGAAAVLAVWYWLGLP 28 (339) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 8999999999999999999980656 No 467 >KOG0292 consensus Probab=23.69 E-value=34 Score=11.78 Aligned_cols=73 Identities=16% Similarity=0.190 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 1689999997268850579998654552158869999999988842126989999999999952574569999 Q gi|254781033|r 138 IAHEKLEMMLQIPATREFAVYSLYFESCRIGDLNSAQRYATKALDISPDAPWVTEAVVQQYVLAKEWSRAITF 210 (492) Q Consensus 138 ~A~~~~~~~~~~~~~~~~~~~gL~~~al~~gd~~~A~~~~~~a~~~~P~~~~a~~~L~~l~~~~gdw~~A~~~ 210 (492) +-..||...-=.+..+.++++.-+....+.+++.+|-..+.+.+++.|.-+.+....-.......+...+.++ T Consensus 1068 ElAaYFt~~~Lqp~H~ilalrtA~n~ffK~kN~ktAs~fa~rLlel~~~~~~A~q~rki~~a~eknp~Da~~l 1140 (1202) T KOG0292 1068 ELAAYFTHCKLQPMHRILALRTAMNVFFKLKNLKTAAEFARRLLELAPSPPVAEQARKIKQAAEKNPTDAYEL 1140 (1202) T ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCCCCC T ss_conf 9999861388771789999999999998741388899999999861799828999999987753596444436 No 468 >PTZ00042 stevor; Provisional Probab=23.68 E-value=34 Score=11.78 Aligned_cols=22 Identities=23% Similarity=0.274 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254781033|r 42 FVILSILYFFLFAWILLFAVSR 63 (492) Q Consensus 42 ~~~i~~~~~~~~~~~ll~~l~~ 63 (492) +.+++++++.++++++-+|+.+ T Consensus 267 IaalVLlilaVvLIILYIWLyr 288 (304) T PTZ00042 267 IAALVLLILAVVLIILYIWLYR 288 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999998 No 469 >smart00704 ZnF_CDGSH CDGSH-type zinc finger. Function unknown. Probab=23.60 E-value=34 Score=11.77 Aligned_cols=19 Identities=16% Similarity=0.123 Sum_probs=0.0 Q ss_pred EECCCCCCCHHHCCCCCCCCC Q ss_conf 275698955424224837687 Q gi|254781033|r 394 WISDDGYLSSVWLPLSPISKT 414 (492) Q Consensus 394 W~c~~g~~~~~W~~~c~~~~~ 414 (492) |.|.||.+.. +|.|+..+. T Consensus 15 ~~C~Cg~S~~--~PfCDgsH~ 33 (38) T smart00704 15 ALCRCGRSKN--FPYCDGSHK 33 (38) T ss_pred EEEECCCCCC--CCCCCCCCC T ss_conf 9985677689--974478450 No 470 >pfam04194 PDCD2_C Programmed cell death protein 2, C-terminal putative domain. Probab=23.49 E-value=34 Score=11.75 Aligned_cols=34 Identities=26% Similarity=0.473 Sum_probs=0.0 Q ss_pred CCEECCCCCCCHHHCCCCCCCCCCCCCCCCCCCC Q ss_conf 9527569895542422483768736400367875 Q gi|254781033|r 392 PLWISDDGYLSSVWLPLSPISKTLCYFEWKIPTK 425 (492) Q Consensus 392 p~W~c~~g~~~~~W~~~c~~~~~~~~~~W~~p~~ 425 (492) |.|+.........=-|.|++||+--.|+.+.-|. T Consensus 89 PL~~s~~~~~~~~~iP~C~~Cg~~R~FEfQlmP~ 122 (169) T pfam04194 89 PLWYSDADAPGELLIPRCSNCGSERVFEFQIMPQ 122 (169) T ss_pred EEEECCCCCCCCCCCCCCCCCCCCCEEEEEECHH T ss_conf 7865689876655588887799865775775367 No 471 >KOG1226 consensus Probab=23.42 E-value=34 Score=11.75 Aligned_cols=51 Identities=8% Similarity=-0.048 Sum_probs=0.0 Q ss_pred ECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH Q ss_conf 8997662218999999999999999999999998418288999999999999 Q gi|254781033|r 32 WGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDK 83 (492) Q Consensus 32 ~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k 83 (492) +.+..|-+.+...+.+++++.++++++|. +...++.-+-+.++-..|.+.| T Consensus 708 p~~~~~~~i~lg~v~~ivligl~llliwk-ll~~~~DrrE~akFe~er~~a~ 758 (783) T KOG1226 708 PPGPNILAIVLGVVAGIVLIGLALLLIWK-LLTTIHDRREFAKFEKERLNAK 758 (783) T ss_pred CCCCCEEEEHHHHHHHHHHHHHHHHHHHH-HHHEECCCHHHHHHHHHHHHHH T ss_conf 99876766468899999999999999999-7633113077652548887502 No 472 >pfam05478 Prominin Prominin. The prominins are an emerging family of proteins that among the multispan membrane proteins display a novel topology. Mouse prominin and human prominin (mouse)-like 1 (PROML1) are predicted to contain five membrane spanning domains, with an N-terminal domain exposed to the extracellular space followed by four, alternating small cytoplasmic and large extracellular, loops and a cytoplasmic C-terminal domain. The exact function of prominin is unknown although in humans defects in PROM1, the gene coding for prominin, cause retinal degeneration. Probab=23.41 E-value=34 Score=11.74 Aligned_cols=67 Identities=10% Similarity=0.013 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHH Q ss_conf 9999999999999999973489848999899766221899999999999999999999999841828 Q gi|254781033|r 4 LIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPA 70 (492) Q Consensus 4 ~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~ 70 (492) ++++++++.++.+.+.+=.+.+.+-.=.--.-..--+...+.++++.+|+.+++++-++.++.|... T Consensus 425 llLIv~~l~lGLlcG~cG~~r~~~p~~r~c~~~tGg~~Lm~gV~liFlf~~~lmlv~l~~FlvG~n~ 491 (809) T pfam05478 425 VLLVVLCNLLGLLCGICGLRRPDTPSDRCCVSNTGGRFLMAGVGFSFLFSWFLMLVVLALFLVGGNV 491 (809) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999999999998612467877888766323560189999999999999999999999999950001 No 473 >PRK05696 fliL flagellar basal body-associated protein FliL; Reviewed Probab=23.32 E-value=34 Score=11.73 Aligned_cols=21 Identities=10% Similarity=-0.040 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999997 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIV 21 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l 21 (492) +|-|++..++++++++++.|+ T Consensus 18 Liiii~~~vlll~ggggg~~~ 38 (168) T PRK05696 18 LIIIIVVGVLLALGGGGAAWF 38 (168) T ss_pred EEHHHHHHHHHHHHHHHHHHH T ss_conf 204799999999999999999 No 474 >PRK04949 putative sulfate transport protein CysZ; Validated Probab=23.12 E-value=34 Score=11.70 Aligned_cols=66 Identities=11% Similarity=0.120 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH Q ss_conf 999999999999999997348984899989976622189999999999999999999999984182 Q gi|254781033|r 4 LIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCP 69 (492) Q Consensus 4 ~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p 69 (492) ++-++++.++...+..|+.+--..+.=...+|.-=.+-...++.++++++++.++...+..+.++| T Consensus 35 liNi~Lf~~~~~~l~~~~~~~~~~~~~~lp~Wl~~L~~llw~l~~l~~l~~~~~~ft~i~~li~aP 100 (251) T PRK04949 35 LVNILLFGGAFWWLFTQLDAWIDWLMSQLPDWLSWLSYLLWPLAVLSVLLVFSFTFSTLANWIAAP 100 (251) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999983163199999999999999999999999999999999727 No 475 >pfam02591 DUF164 Putative zinc ribbon domain. Probab=23.12 E-value=30 Score=12.13 Aligned_cols=9 Identities=11% Similarity=0.117 Sum_probs=0.0 Q ss_pred CCCCCCCCC Q ss_conf 248376873 Q gi|254781033|r 407 PLSPISKTL 415 (492) Q Consensus 407 ~~c~~~~~~ 415 (492) ..||+||++ T Consensus 47 ~~Cp~CgRi 55 (56) T pfam02591 47 VRCPHCGRI 55 (56) T ss_pred EECCCCCCC T ss_conf 799798863 No 476 >TIGR00340 zpr1_rel zinc finger protein ZPR1 homolog; InterPro: IPR004470 Zinc finger (Znf) domains are relatively small protein motifs that bind one or more zinc atoms, and which usually contain multiple finger-like protrusions that make tandem contacts with their target molecule. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis, however they are now recognised to bind DNA, RNA, protein and/or lipid substrates , , , , . Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few . Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. (Note that in certain cases, some Znf domains have diverged such that they still maintain their core structure, but have lost their ability to bind zinc, using other means such as salt bridges or binding to other metals to stabilise the finger-like folds. These domains can show strong sequence identity to zinc-binding motifs, and may therefore be included in Znf entries). This entry represents ZPR1-type zinc finger domains. ZPR1 was shown experimentally to bind approximately two moles of zinc, and has two copies of a domain homologous to this protein, each containing a putative zinc finger of the form CXXCX(25)CXXC. ZPR1 binds the tyrosine kinase domain of epidermal growth factor receptor but is displaced by receptor activation and autophosphorylation after which it redistributes in part to the nucleus. The proteins described by this family by analogy may be suggested to play a role in signal transduction as proven for other Z-finger binding proteins. Deficiencies in ZPR1 may contribute to neurodegenerative disorders. ZPR1 appears to be down-regulated in patients with spinal muscular atrophy (SMA), a disease characterised by degeneration of the alpha-motor neurons in the spinal cord that can arise from mutations affecting the expression of Survival Motor Neurons (SMN) . ZPR1 interacts with complexes formed by SMN , and may act as a modifier that effects the severity of SMA. More information about these proteins can be found at Protein of the Month: Zinc Fingers .; GO: 0008270 zinc ion binding. Probab=23.04 E-value=30 Score=12.15 Aligned_cols=12 Identities=33% Similarity=0.396 Sum_probs=0.0 Q ss_pred CEECC-CCCCCHH Q ss_conf 52756-9895542 Q gi|254781033|r 393 LWISD-DGYLSSV 404 (492) Q Consensus 393 ~W~c~-~g~~~~~ 404 (492) .|+|. |||.+.+ T Consensus 29 ~~~Ce~CgYR~~D 41 (168) T TIGR00340 29 TLICEKCGYRSAD 41 (168) T ss_pred EEEECCCCCCEEE T ss_conf 7663268970000 No 477 >COG5402 Uncharacterized conserved protein [Function unknown] Probab=23.00 E-value=34 Score=11.69 Aligned_cols=36 Identities=17% Similarity=0.026 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHH------HHCCCCEEEEEECCEE Q ss_conf 96799999999999999999------7348984899989976 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFII------VSHYPEDVSITWGNRL 36 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~------l~~~~G~v~i~~~~~~ 36 (492) |+|+.+++++.+++++++.. +-...|.-.|-.+.|+ T Consensus 1 m~rv~~l~~lal~iaiggGi~t~~yAl~~~~gfgsiriG~W~ 42 (194) T COG5402 1 MFRVPFLVILALIIAIGGGIATAYYALDATAGFGSIRIGPWE 42 (194) T ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCEEECCCC T ss_conf 941279999999999645475522336553662326614532 No 478 >KOG4629 consensus Probab=22.98 E-value=34 Score=11.68 Aligned_cols=66 Identities=17% Similarity=0.185 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHH-HHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 6799999999999999-99973489848999899766221899999999999999999999999841 Q gi|254781033|r 2 LRLIRYFFVISLVICS-FIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLS 67 (492) Q Consensus 2 ~r~l~~~~~i~~~~~~-~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~ 67 (492) |-.+.++.++..+++. ..-.....+.+...++.|.+..+......++...++.+.+.+-.-.+++. T Consensus 152 i~~I~~~~iv~~lv~~l~i~~~~~~~~~~~~~~kw~~~~~v~~~~~lv~~~~~~~vvf~~~~n~~~r 218 (714) T KOG4629 152 ITWILLVLIVSSLVCSLGIHVHRLVTLWSLILWKWLVTLLVRITAVLVSSWFAALVVFLIESNFLRR 218 (714) T ss_pred HHHHHHHHHHHHHHHHHHHHEECCCCEEEEEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999866210000331037777651003369999999997437999998750027899 No 479 >pfam08273 Prim_Zn_Ribbon Zinc-binding domain of primase-helicase. Probab=22.92 E-value=35 Score=11.67 Aligned_cols=13 Identities=15% Similarity=0.186 Sum_probs=0.0 Q ss_pred CCCCCCCCCCC-CC Q ss_conf 83768736400-36 Q gi|254781033|r 409 SPISKTLCYFE-WK 421 (492) Q Consensus 409 c~~~~~~~~~~-W~ 421 (492) ||+||.-|.|. |. T Consensus 6 CP~CGg~Drfr~fd 19 (39) T pfam08273 6 CPNCGGSDRFRIFD 19 (39) T ss_pred CCCCCCCCCCEECC T ss_conf 77777857525655 No 480 >pfam10072 consensus Probab=22.87 E-value=35 Score=11.67 Aligned_cols=69 Identities=9% Similarity=-0.039 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 999999999999999973489848999899766221899999999999999999999999841828899 Q gi|254781033|r 5 IRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLF 73 (492) Q Consensus 5 l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~ 73 (492) ++=+++.+.+++.+.-+.---..+.|.|.=|.-.=-....+...+.+...++..+++.+.+.+.|.-+. T Consensus 39 li~lll~~~~all~~~~gl~~l~~liv~~~w~~~Rl~aL~~lt~~~~~~a~~~~~~~~~~lr~~~~lf~ 107 (112) T pfam10072 39 LLSLLLMAGLALVFAAFGLVSLTVLVVIAFWDTYRLAALGGLTALYLGAALICLLRARRKLRRASTLFE 107 (112) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCH T ss_conf 999999999999999999999999999853824899999999999999999999999998416997706 No 481 >pfam08882 Acetone_carb_G Acetone carboxylase gamma subunit. Acetone carboxylase is the key enzyme of bacterial acetone metabolism, catalysing the condensation of acetone and CO(2) to form acetoacetate. Probab=22.71 E-value=35 Score=11.65 Aligned_cols=42 Identities=26% Similarity=0.250 Sum_probs=0.0 Q ss_pred CCCCEECCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHH Q ss_conf 99952756989554242248376873640036787554456665421000 Q gi|254781033|r 390 PDPLWISDDGYLSSVWLPLSPISKTLCYFEWKIPTKSPEYISSENINFSL 439 (492) Q Consensus 390 ~dp~W~c~~g~~~~~W~~~c~~~~~~~~~~W~~p~~~~~~~~~~~~~~~~ 439 (492) |||.|+--- ...||.||..--.+ ..||..|....-+.....+ T Consensus 59 Pdp~W~~ir-------EyyCP~Cgt~~evE-~~~Pg~P~~hD~epDid~f 100 (106) T pfam08882 59 PDPEWQRIR-------EYYCPGCGTLHEVE-YLPPGHPPIHDFEPDIDAF 100 (106) T ss_pred CCCCCEEEE-------EEECCCCCCCEEEC-CCCCCCCEEEECCCCHHHH T ss_conf 998838999-------99779864311453-3699998056344489999 No 482 >pfam01350 Flavi_NS4A Flavivirus non-structural protein NS4A. Flaviviruses encode a single polyprotein. This is cleaved into three structural and seven non-structural proteins. The NS4A protein is small and poorly conserved among the Flaviviruses. NS4A contains multiple hydrophobic potential membrane spanning regions. NS4A has only been found in cells infected by Kunjin virus. Probab=22.70 E-value=33 Score=11.85 Aligned_cols=59 Identities=10% Similarity=0.092 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCE----------EEEEECCEEEEEHHHHHHHHHHHHHHHHHHHH Q ss_conf 967999999999999999997348984----------89998997662218999999999999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPED----------VSITWGNRLYRTSPFVILSILYFFLFAWILLF 59 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~----------v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~ 59 (492) +..++.++++.....+.+.|+..+.|. +.-.|.=|.-.++......+++++|+++++++ T Consensus 49 l~till~~ll~~~T~Gv~~~lm~~kgi~rm~lG~~vm~~~~~llw~ggv~~~~IAg~~lv~filmvVLi 117 (145) T pfam01350 49 LETILLVALLGLATLGVFLLLMARKGIGRMGLGMVVLLVAGGLLWMGGVPPGKIAGVLLIFFLLLVVLI 117 (145) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHEEC T ss_conf 999999999999999999987627786635799999999999998428967885899999999996044 No 483 >TIGR02873 spore_ylxY probable sporulation protein, polysaccharide deacetylase family; InterPro: IPR014228 Members of this entry include YlxY, and are related to IPR014132 from INTERPRO, which represents a subset of the polysaccharide deacetylase family found in a species if, and only if, the species forms endospores e.g. Bacillus subtilis or Clostridium tetani. Proteins in this entry are likewise restricted to spore-formers, but are not universally found among endospore-forming species.. Probab=22.62 E-value=29 Score=12.22 Aligned_cols=10 Identities=40% Similarity=0.836 Sum_probs=0.0 Q ss_pred CCCCCCCCCC Q ss_conf 7364003678 Q gi|254781033|r 414 TLCYFEWKIP 423 (492) Q Consensus 414 ~~~~~~W~~p 423 (492) ..||+.|+.| T Consensus 207 tVDTIDWk~P 216 (269) T TIGR02873 207 TVDTIDWKNP 216 (269) T ss_pred EECCCCCCCC T ss_conf 4311247788 No 484 >PRK00481 NAD-dependent deacetylase; Provisional Probab=22.45 E-value=35 Score=11.61 Aligned_cols=22 Identities=14% Similarity=-0.019 Sum_probs=0.0 Q ss_pred EECC-CCCCCHHHC------CCCCCCCCC Q ss_conf 2756-989554242------248376873 Q gi|254781033|r 394 WISD-DGYLSSVWL------PLSPISKTL 415 (492) Q Consensus 394 W~c~-~g~~~~~W~------~~c~~~~~~ 415 (492) ..|. |++..+... |.||.||.. T Consensus 123 ~~C~~C~~~~~~~~~~~~~~p~C~~Cgg~ 151 (239) T PRK00481 123 ARCTKCGQTYDLDEVDLEPPPRCPKCGGI 151 (239) T ss_pred EEECCCCCCCCHHHHHCCCCCCCCCCCCE T ss_conf 17089989851666302579988567997 No 485 >PRK00420 hypothetical protein; Validated Probab=22.44 E-value=35 Score=11.61 Aligned_cols=39 Identities=8% Similarity=-0.098 Sum_probs=0.0 Q ss_pred HHHHHHHHHHCCCCCCCEECCCC-CCCHHHCCCCCCCCCC Q ss_conf 99999999962889995275698-9554242248376873 Q gi|254781033|r 377 KILYWTQSALHAMPDPLWISDDG-YLSSVWLPLSPISKTL 415 (492) Q Consensus 377 ~ar~wl~rA~~a~~dp~W~c~~g-~~~~~W~~~c~~~~~~ 415 (492) ++-+.|.+..+.-..+.-.|+.- +.+.+-.-+||+||+. T Consensus 6 ~~a~ll~~Ga~ml~~~C~~Cg~plf~~k~G~~~Cp~cgk~ 45 (107) T PRK00420 6 KAAELLRSGAKMLDKHCPVCGLPLFELKDGEVVCPNHGKV 45 (107) T ss_pred HHHHHHHHHHHHHHHHCCCCCCCEEECCCCCEECCCCCCE T ss_conf 9999999577763513765798405748987768989874 No 486 >COG1299 FruA Phosphotransferase system, fructose-specific IIC component [Carbohydrate transport and metabolism] Probab=22.44 E-value=35 Score=11.61 Aligned_cols=65 Identities=14% Similarity=0.033 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHCCCCEEEEEECCEEEE--EHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 9999999999999734898489998997662--21899999999999999999999999841828899 Q gi|254781033|r 8 FFVISLVICSFIIVSHYPEDVSITWGNRLYR--TSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLF 73 (492) Q Consensus 8 ~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie--~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~ 73 (492) |.+-.+.+.++.=++|+||-+--..+|+--. +-...++..++.-|++-|++.++.+++. .|+.+. T Consensus 59 lmvpvlagyia~SiadrpgLapG~igg~~a~~~~g~aGFlG~ii~GflAGy~v~~l~k~~~-lP~~l~ 125 (343) T COG1299 59 LMVPVLAAYIAYSIADRPGLAPGFIGGLLANSVIGGAGFLGAIIAGFLAGYVVKWLKKYIE-LPKSLD 125 (343) T ss_pred HHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH-CCHHHH T ss_conf 9999999989988605423542678777743244310256789999999999999999940-867664 No 487 >KOG3052 consensus Probab=22.36 E-value=15 Score=14.51 Aligned_cols=83 Identities=8% Similarity=-0.047 Sum_probs=0.0 Q ss_pred CHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCHHHCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 54242248376873640036787554456665421000002356555444555674436410146887544567888786 Q gi|254781033|r 402 SSVWLPLSPISKTLCYFEWKIPTKSPEYISSENINFSLEMAYPADDLQSMLNNGKKNHLPSIKKVSSFEDSTIHPLDPHI 481 (492) Q Consensus 402 ~~~W~~~c~~~~~~~~~~W~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (492) +..++-||-.|++++-+.|+. --....+.+.+-..+......|.+.+....-..+.....--+.+-+.++.+.+.--- T Consensus 99 yqVYkqVCaaCHSm~~iayR~--lVgv~~TeeEaKa~A~eiev~Dgpdd~G~~~~RPGkLsD~~P~PYpNeeAAraANnG 176 (311) T KOG3052 99 YQVYKQVCAACHSMDLIAYRH--LVGVCYTEEEAKAMAAEIEVVDGPDDEGEMFTRPGKLSDYFPSPYPNEEAARAANNG 176 (311) T ss_pred HHHHHHHHHHHHHHHHHHHHH--HHHEECCHHHHHHHHHHEEECCCCCCCCCCCCCCCCCCCCCCCCCCCHHHHHHHHCC T ss_conf 899999998850127888876--651001389998766433412487877882158862012489999987998874089 Q ss_pred CCCCC Q ss_conf 89899 Q gi|254781033|r 482 RQPDD 486 (492) Q Consensus 482 ~~p~~ 486 (492) ..|.| T Consensus 177 A~PPD 181 (311) T KOG3052 177 AYPPD 181 (311) T ss_pred CCCCC T ss_conf 99997 No 488 >cd01410 SIRT7 SIRT7: Eukaryotic and prokaryotic group (class4) which includes human sirtuin SIRT6, SIRT7, and several bacterial homologs; and are members of the SIR2 family of proteins, silent information regulator 2 (Sir2) enzymes which catalyze NAD+-dependent protein/histone deacetylation. Sir2 proteins have been shown to regulate gene silencing, DNA repair, metabolic enzymes, and life span. Probab=22.28 E-value=36 Score=11.58 Aligned_cols=21 Identities=5% Similarity=-0.268 Sum_probs=0.0 Q ss_pred ECC-CCCCCHHHC-----------CCCCCCCCC Q ss_conf 756-989554242-----------248376873 Q gi|254781033|r 395 ISD-DGYLSSVWL-----------PLSPISKTL 415 (492) Q Consensus 395 ~c~-~g~~~~~W~-----------~~c~~~~~~ 415 (492) .|. |++.+..+. |.||.||.+ T Consensus 97 ~C~~C~~~~~~~~~~~~~~~~~~~p~C~~Cgg~ 129 (206) T cd01410 97 VCKSCGPEYVRDDVVETRGDKETGRRCHACGGI 129 (206) T ss_pred ECCCCCCCCCHHHHHHHHHCCCCCCCCCCCCCC T ss_conf 888999927579999887544789988233897 No 489 >PRK09546 zntB zinc transporter; Reviewed Probab=22.27 E-value=36 Score=11.58 Aligned_cols=65 Identities=6% Similarity=-0.055 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFF 65 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i 65 (492) |=|.+..+-+++.+.+-.+++++-=|.=+=...|-.-....+.++++++++.+++++.++--+|+ T Consensus 263 ~N~~~~~Lsvia~IFLPltfltGl~GMN~~gmPg~~~~~gf~~~~~~m~~~~~~~~~~frr~~WL 327 (327) T PRK09546 263 LNRRTYTMSLMAMVFLPSTFLTGLFGVNLGGIPGGGWPFGFSIFCILLVVLIGGVAWWLKRSKWL 327 (327) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999999999999987999872214677999887886079999999999999999999983779 No 490 >PRK05170 hypothetical protein; Provisional Probab=22.26 E-value=20 Score=13.55 Aligned_cols=14 Identities=14% Similarity=0.178 Sum_probs=0.0 Q ss_pred CCHHHCCCCCCCCC Q ss_conf 55424224837687 Q gi|254781033|r 401 LSSVWLPLSPISKT 414 (492) Q Consensus 401 ~~~~W~~~c~~~~~ 414 (492) ..++|...|+.||+ T Consensus 17 s~~EWEsLCDGCGk 30 (148) T PRK05170 17 SDAEWESLCDGCGK 30 (148) T ss_pred CHHHHHHHCCCHHH T ss_conf 99999998502386 No 491 >PRK09110 flagellar motor protein MotA; Validated Probab=22.25 E-value=36 Score=11.58 Aligned_cols=166 Identities=14% Similarity=0.076 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRN 80 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr 80 (492) |+.++-+++.+.++..++.+-.++. +.-+..+....|+.-.+..+++..-+..+..+++.-...-....-++ T Consensus 1 M~~iiGlv~~~~~i~~g~~l~Gg~~--------~~~~~~~s~lIV~Ggtlga~li~~p~~~i~~~~~~~~~~f~~~~~~~ 72 (283) T PRK09110 1 MLIIIGYIVVLGSVFGGYLLAGGHL--------GALIQPFELLIIGGAALGAFIVGNPGKAIKATLKALPKVFKGPKYKK 72 (283) T ss_pred CHHHHHHHHHHHHHHHHHHHCCCCC--------HHHHCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCCH T ss_conf 9209999999999999999548984--------28846447995887899999997889999999999999956878981 Q ss_pred H----------HHHHHHHHHHHHHHHC--------------------CHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHH Q ss_conf 9----------9999999999999721--------------------004787999986301346661589999987532 Q gi|254781033|r 81 Y----------DKGYKALYTGLMSIAA--------------------HNIPLARKMHSYVSQQHTFHNEYLVYLLEVQIA 130 (492) Q Consensus 81 ~----------~k~~~al~~gl~al~~--------------------Gd~~~A~k~~~~a~~~~~~~~~~L~~Ll~A~aA 130 (492) . +-+..+=.+|++++-. +....=-+..-+..-....+.+.+-.++..+.. T Consensus 73 ~~~~e~i~~l~~la~~aRr~GllaLE~~ie~p~~s~if~~~p~~~~d~~~~~fi~dgl~l~vdg~~~~~~i~~iLe~ei~ 152 (283) T PRK09110 73 ADYMDLLALLYELLRKARQEGMMALEAHIENPEESPIFAKYPKILADHHLVEFICDYLRLMISGNMNPHEIEALMDEEIE 152 (283) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHCCCCCCHHHHHCHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 43899999999999999862499998774484213677723155315037899998789996589998999999999999 Q ss_pred HHHHCCHHHHHHHHHHHHCCCCH--HHHHHHHHHHHHCCCCHHHHH Q ss_conf 22210001689999997268850--579998654552158869999 Q gi|254781033|r 131 LAERQYNIAHEKLEMMLQIPATR--EFAVYSLYFESCRIGDLNSAQ 174 (492) Q Consensus 131 ~~~gd~~~A~~~~~~~~~~~~~~--~~~~~gL~~~al~~gd~~~A~ 174 (492) ....++....+.|+.+.+..|+- .-++.||...-...++....+ T Consensus 153 ~~~~r~~~~~~~~~~~g~~aPa~GmiGtviGLI~~l~~l~~~p~~l 198 (283) T PRK09110 153 THHHEAEVPAHALQTVADALPAFGIVAAVLGVVKTMGSIDQPPAVL 198 (283) T ss_pred HHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 9999998788999999987728889999999999998448998999 No 492 >pfam11004 DUF2843 Protein of unknown function (DUF2843). This bacterial family of proteins has no known function. Probab=22.20 E-value=36 Score=11.57 Aligned_cols=52 Identities=17% Similarity=0.193 Sum_probs=0.0 Q ss_pred HHCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCEEC-CCCCCCH Q ss_conf 660999899999999999857998999999999962889995275-6989554 Q gi|254781033|r 352 MKIAPRKEIFLLLAQIEQANSHNTDKILYWTQSALHAMPDPLWIS-DDGYLSS 403 (492) Q Consensus 352 l~~~P~~~~~~llA~ie~~e~~d~~~ar~wl~rA~~a~~dp~W~c-~~g~~~~ 403 (492) +...+.+..=.+|..+-..-..|..=-+.--...+.-||+-.|+| .|...|. T Consensus 191 iTK~~RS~YDh~ML~LHD~mK~D~~YQ~~~pq~~~~FppgstWi~FsDqv~HA 243 (281) T pfam11004 191 ITKRPRSRYDHLMLQLHDLMKADLEYQKDAPQVTMAFPPGSSWIVFSDQVSHA 243 (281) T ss_pred CCCCCCCHHHHHHHHHHHHHHCCHHHHHCCCCEEECCCCCCEEEEEECCCHHH T ss_conf 88787876889999998875237888736986231269998689961740777 No 493 >cd07365 MhpB_like Subunit B of the Class III Extradiol ring-cleavage dioxygenase, 2,3-dihydroxyphenylpropionate 1,2-dioxygenase (MhpB), which catalyzes the oxidization and subsequent ring-opening of 2,3-dihydroxyphenylpropionate. 2,3-dihydroxyphenylpropionate 1,2-dioxygenase (MhpB) catalyzes the oxidization and subsequent ring-opening of 2,3-dihydroxyphenylpropionate, yielding the product 2-hydroxy-6-oxo-nona-2,4-diene 1,9-dicarboxylate. It is an essential enzyme in the beta-phenylpropionic degradation pathway, in which beta-phenylpropionic is first hydrolyzed to produce 2,3-dihydroxyphenylpropionate. The enzyme is a member of the class III extradiol dioxygenase family, a group of enzymes which use a non-heme Fe(II) to cleave aromatic rings between a hydroxylated carbon and an adjacent non-hydroxylated carbon. LigAB-like class III enzymes are usually composed of two subunits, designated A and B, which form a tetramer composed of two copies of each subunit. This model represents the ca Probab=22.07 E-value=36 Score=11.55 Aligned_cols=90 Identities=16% Similarity=0.159 Sum_probs=0.0 Q ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCC-----------HHHH--HHHHHHHHHCCCCHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 99998268983689999999998799-----------8899--9998878660999899999999999857998999999 Q gi|254781033|r 315 LRLEEINKESVESLVIVSKIALEMGS-----------IDQA--HAKAMLAMKIAPRKEIFLLLAQIEQANSHNTDKILYW 381 (492) Q Consensus 315 ~~l~~~~P~~~e~~~~LA~~~~~~g~-----------~~~A--r~~l~~al~~~P~~~~~~llA~ie~~e~~d~~~ar~w 381 (492) +.++.-.+-.++.+...-+.-+++++ .... ++.|+...+-++..=.-+-..++.+.-++....+|.| T Consensus 195 ~~li~g~~~t~e~r~~~~~rvi~a~~~~~~g~~~~~~lN~ewD~~fLd~l~~gd~~~l~~~t~~ei~~~aG~gg~EirtW 274 (310) T cd07365 195 ERLIAGRNPTPEARAARQQRVIAAAKAFAAGDSTLMPLNPEWDRAFLDLLASGDLAALDAMTNDEIAAQAGNSAHEVRTW 274 (310) T ss_pred HHHHCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHCCCHHHHHHHCCCCCHHHHHH T ss_conf 77742689987888888888998777640475446778989999999999759999997489999998728751999999 Q ss_pred HHH--HHCCCCC-----------CCEECCCCCCCHH Q ss_conf 999--9628899-----------9527569895542 Q gi|254781033|r 382 TQS--ALHAMPD-----------PLWISDDGYLSSV 404 (492) Q Consensus 382 l~r--A~~a~~d-----------p~W~c~~g~~~~~ 404 (492) +.- |+.+..+ |.|+|+.|.++.+ T Consensus 275 iaa~aAl~a~g~~~~~~~~Y~pipEWi~G~gv~~a~ 310 (310) T cd07365 275 VAAFAALAAAGRYRAESRYYRPIPEWIAGFGVMTAE 310 (310) T ss_pred HHHHHHHHCCCCCCCCEEEEEEHHHHHHHCCEECCC T ss_conf 999988513478766435664768887432333069 No 494 >TIGR01654 bact_immun_7tm bacteriocin-associated integral membrane protein; InterPro: IPR006541 These sequences represent a family of integral membrane proteins, most of which are about 650 residues in size and predicted to span the membrane seven times. Nearly half of the members of this family are found in association with a member of the lactococcin 972 family of bacteriocins (IPR006540 from INTERPRO) . Others may be associated with uncharacterised proteins that may also act as bacteriocins. Although this protein is suggested to be an immunity protein, and the bacteriocin is suggested to be exported by a Sec-dependent process, the role of this protein is unclear. . Probab=22.00 E-value=36 Score=11.54 Aligned_cols=110 Identities=11% Similarity=-0.039 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHC--CCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 99999999999999999734--8984899989976622189999999999999999999999984182889999999999 Q gi|254781033|r 4 LIRYFFVISLVICSFIIVSH--YPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNY 81 (492) Q Consensus 4 ~l~~~~~i~~~~~~~~~l~~--~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~ 81 (492) +++..+++.+......|..+ .++++-+.|....+.+.++.++..++-++..+.+--.-+.........++...--.-= T Consensus 253 ~l~~~l~~~l~~~~~l~~~gL~~~~~~~~~~~~~~l~~~~lll~~~~~~~l~~l~~~~~~l~~~ik~~~~~K~~~~~~~v 332 (745) T TIGR01654 253 LLISALLILLLSSLVLFIKGLLYTILLELFIFALSLLVLLLLLLLLIISFLTLLLLSKLSLANVIKGKSSFKLLLIVNLV 332 (745) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 98999999999999998656778989999999999999999999999999999999887788874165313599999999 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHH-HHHHCCC Q ss_conf 9999999999999721004787999-9863013 Q gi|254781033|r 82 DKGYKALYTGLMSIAAHNIPLARKM-HSYVSQQ 113 (492) Q Consensus 82 ~k~~~al~~gl~al~~Gd~~~A~k~-~~~a~~~ 113 (492) -|.-..+.-+..-...++.-..... ..+.... T Consensus 333 ~K~~~l~~~~~~~~~~~~~~~~~~~i~~~~~~~ 365 (745) T TIGR01654 333 CKLVLLVLVIVTLITILSSLKDLNEILKKSEQE 365 (745) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999998899999999999986544 No 495 >PRK12336 translation initiation factor IF-2 subunit beta; Provisional Probab=21.72 E-value=36 Score=11.50 Aligned_cols=10 Identities=0% Similarity=-0.263 Sum_probs=0.0 Q ss_pred CCCCCCCCCC Q ss_conf 4837687364 Q gi|254781033|r 408 LSPISKTLCY 417 (492) Q Consensus 408 ~c~~~~~~~~ 417 (492) .|+.|++.|| T Consensus 100 ~C~~C~sPDT 109 (201) T PRK12336 100 ICSECGRPDT 109 (201) T ss_pred ECCCCCCCCC T ss_conf 9789999886 No 496 >pfam08702 Fib_alpha Fibrinogen alpha/beta chain family. Fibrinogen is a protein involved in platelet aggregation and is essential for the coagulation of blood. This domain forms part of the central coiled coiled region of the protein which is formed from two sets of three non-identical chains (alpha, beta and gamma). Probab=21.70 E-value=36 Score=11.50 Aligned_cols=26 Identities=19% Similarity=0.280 Sum_probs=0.0 Q ss_pred CCCCEECCCCCCCHHHCCCCCCCCCCCCC Q ss_conf 99952756989554242248376873640 Q gi|254781033|r 390 PDPLWISDDGYLSSVWLPLSPISKTLCYF 418 (492) Q Consensus 390 ~dp~W~c~~g~~~~~W~~~c~~~~~~~~~ 418 (492) ||..| ||+..++|-+.||..=++-.| T Consensus 1 ~~~~~---c~~~de~~G~~CPTgC~i~~~ 26 (146) T pfam08702 1 PDADW---CCCLDEDFGSYCPTGCGIQDL 26 (146) T ss_pred CCCCC---CCCCCCCCCCCCCCCCCHHHH T ss_conf 99998---727686667889974426589 No 497 >pfam10122 Mu-like_Com Mu-like prophage protein Com. Members of this family of proteins comprise the translational regulator of mom. Probab=21.65 E-value=36 Score=11.49 Aligned_cols=25 Identities=32% Similarity=0.308 Sum_probs=0.0 Q ss_pred CCCCCCCCCCCCCCCCCCCCCCCCC Q ss_conf 8376873640036787554456665 Q gi|254781033|r 409 SPISKTLCYFEWKIPTKSPEYISSE 433 (492) Q Consensus 409 c~~~~~~~~~~W~~p~~~~~~~~~~ 433 (492) ||-||.++-+.=..|..+|.+...+ T Consensus 27 CpRCgtlNhvka~s~~~~P~~~~~e 51 (51) T pfam10122 27 CPRCGTLNHVKATSLTESPRYECRE 51 (51) T ss_pred CCCCCCCCCCCCCCCCCCCHHHCCC T ss_conf 6866751100036777692423339 No 498 >PRK10861 lsignal peptidase I; Provisional Probab=21.63 E-value=37 Score=11.49 Aligned_cols=58 Identities=9% Similarity=0.111 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHC------------------------CCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHH Q ss_conf 96799999999999999999734------------------------898489998997662218999999999999999 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSH------------------------YPEDVSITWGNRLYRTSPFVILSILYFFLFAWI 56 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~------------------------~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ 56 (492) |.-.+-++++++.++.|..|+.| ......+.=..+-+|.+-.++-++++++++=.+ T Consensus 1 m~~~f~lil~~~t~~tG~~w~~d~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~e~~~s~fpvi~~v~ilRsF 80 (324) T PRK10861 1 MANMFALILVIATLVTGILWCVDKFKFAPKRRARQAAAQAATGDSLDNATLKKVAPKPGWLETGASVFPVLAIVLIVRSF 80 (324) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCCHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 94248999999999999999999886246778888776653035542455653047861555667799999999999988 Q ss_pred HH Q ss_conf 99 Q gi|254781033|r 57 LL 58 (492) Q Consensus 57 ll 58 (492) ++ T Consensus 81 l~ 82 (324) T PRK10861 81 IY 82 (324) T ss_pred HE T ss_conf 32 No 499 >PTZ00234 variable surface protein Vir12; Provisional Probab=21.58 E-value=17 Score=14.12 Aligned_cols=36 Identities=17% Similarity=0.143 Sum_probs=0.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHH Q ss_conf 189999999999999999999999984182889999999999999 Q gi|254781033|r 40 SPFVILSILYFFLFAWILLFAVSRFFLSCPAMLFHMLHKRNYDKG 84 (492) Q Consensus 40 s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~~~~~~~~rr~~k~ 84 (492) +.+.++.++..+||+++ .+.-+++..++.+|+++|+ T Consensus 366 NIIMaaAILGtIfFLFY---------YN~SS~lkS~~~KrKrKKk 401 (433) T PTZ00234 366 HSIVGASIIGVLVFLFF---------FFKSTPIRSQTNKGEKKKR 401 (433) T ss_pred HHHHHHHHHHHHHHHHH---------EECCCCCCCCCCCCCCCCC T ss_conf 99999999999999863---------0036664345787666667 No 500 >COG4652 Uncharacterized protein conserved in bacteria [Function unknown] Probab=21.56 E-value=28 Score=12.37 Aligned_cols=71 Identities=10% Similarity=-0.072 Sum_probs=0.0 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHCCCCEEEEEECCEEEEEHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHH Q ss_conf 96799999999999999999734898489998997662218999999999999999999999998418288 Q gi|254781033|r 1 MLRLIRYFFVISLVICSFIIVSHYPEDVSITWGNRLYRTSPFVILSILYFFLFAWILLFAVSRFFLSCPAM 71 (492) Q Consensus 1 M~r~l~~~~~i~~~~~~~~~l~~~~G~v~i~~~~~~ie~s~~~~i~~~~~~~~~~~ll~~l~~~i~~~p~~ 71 (492) |.-++.+.++..++.++..|+.+.-+...+.+.++-+-.|..+++.+.+++-+..+++..-......+-++ T Consensus 207 i~~~s~~~i~~~L~~~~il~i~~l~f~~ll~f~~f~vl~~~l~~~~Is~lls~i~~l~ls~~sL~~~IKgK 277 (657) T COG4652 207 IKGWSLLTIISRLLGFSILWILGLLFLSLLFFRRFGVLFSLLFFLMISILLSLISLLLLSRLSLANLIKGK 277 (657) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 67889999999998789999999886699999999999999999999999999999999775689986378 Done!