Query gi|254780421|ref|YP_003064834.1| hypothetical protein CLIBASIA_01530 [Candidatus Liberibacter asiaticus str. psy62] Match_columns 218 No_of_seqs 126 out of 1065 Neff 8.5 Searched_HMMs 39220 Date Sun May 29 17:35:19 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780421.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 COG4649 Uncharacterized protei 100.0 2.5E-31 6.4E-36 212.3 18.6 206 9-215 14-220 (221) 2 COG2976 Uncharacterized protei 100.0 3.1E-26 7.9E-31 180.8 24.4 205 1-211 1-207 (207) 3 pfam10858 DUF2659 Protein of u 99.7 4.6E-15 1.2E-19 111.7 16.1 198 12-210 15-219 (220) 4 PRK11788 hypothetical protein; 99.3 2.8E-09 7.2E-14 75.8 19.6 161 30-193 9-172 (389) 5 PRK10866 outer membrane protei 99.1 1E-08 2.6E-13 72.4 15.0 174 20-198 3-212 (243) 6 TIGR03302 OM_YfiO outer membra 99.1 1.4E-08 3.6E-13 71.5 15.2 144 51-195 31-200 (235) 7 TIGR02917 PEP_TPR_lipo putativ 99.1 2.1E-08 5.3E-13 70.5 15.6 149 52-209 269-428 (924) 8 COG2956 Predicted N-acetylgluc 99.0 1.6E-07 4.1E-12 64.9 18.6 166 27-195 6-174 (389) 9 PRK10370 formate-dependent nit 99.0 1.2E-07 3.2E-12 65.6 17.5 166 22-195 17-185 (206) 10 TIGR02917 PEP_TPR_lipo putativ 98.9 2.2E-07 5.7E-12 64.1 15.5 130 55-190 31-160 (924) 11 PRK10803 hypothetical protein; 98.9 8.5E-08 2.2E-12 66.7 12.5 116 92-209 144-262 (262) 12 TIGR02521 type_IV_pilW type IV 98.7 1.2E-06 3E-11 59.6 14.8 156 47-208 31-192 (247) 13 COG3063 PilF Tfp pilus assembl 98.7 1.7E-06 4.4E-11 58.6 15.6 154 46-206 28-184 (250) 14 PRK10747 putative protoheme IX 98.7 2E-05 5.1E-10 52.0 22.2 154 46-205 77-230 (398) 15 TIGR02795 tol_pal_ybgF tol-pal 98.7 2.1E-07 5.5E-12 64.2 9.6 102 54-156 3-105 (119) 16 PRK11788 hypothetical protein; 98.7 1.1E-05 2.8E-10 53.6 18.1 133 55-191 109-243 (389) 17 COG4105 ComL DNA uptake lipopr 98.6 2.3E-05 5.8E-10 51.6 17.4 166 50-216 31-224 (254) 18 TIGR02795 tol_pal_ybgF tol-pal 98.5 2.1E-06 5.3E-11 58.1 10.3 107 91-198 4-113 (119) 19 COG1729 Uncharacterized protei 98.4 4.9E-06 1.3E-10 55.8 11.1 111 92-203 144-257 (262) 20 COG4235 Cytochrome c biogenesi 98.4 8.1E-05 2.1E-09 48.2 17.2 188 21-213 88-278 (287) 21 COG2976 Uncharacterized protei 98.4 0.00013 3.4E-09 46.9 17.4 159 11-209 8-168 (207) 22 PRK11447 cellulose synthase su 98.4 0.00018 4.5E-09 46.1 17.8 145 58-205 275-430 (1158) 23 COG4783 Putative Zn-dependent 98.4 0.00012 3E-09 47.2 16.8 152 50-209 303-472 (484) 24 TIGR03302 OM_YfiO outer membra 98.3 0.00018 4.5E-09 46.1 15.8 129 61-190 78-231 (235) 25 KOG4626 consensus 98.3 0.00014 3.5E-09 46.8 15.0 133 64-203 365-498 (966) 26 cd00189 TPR Tetratricopeptide 98.3 2.7E-05 7E-10 51.1 11.4 97 91-192 2-98 (100) 27 COG2956 Predicted N-acetylgluc 98.2 0.00044 1.1E-08 43.6 16.3 150 48-204 49-254 (389) 28 PRK02603 photosystem I assembl 98.2 0.00016 4.2E-09 46.3 13.9 123 67-191 13-149 (172) 29 PRK11447 cellulose synthase su 98.2 0.00058 1.5E-08 42.9 19.8 149 57-213 356-513 (1158) 30 KOG2376 consensus 98.1 0.00027 6.9E-09 45.0 14.1 146 57-209 83-271 (652) 31 COG3071 HemY Uncharacterized e 98.0 0.00094 2.4E-08 41.6 21.7 158 42-205 73-230 (400) 32 KOG0624 consensus 98.0 0.00098 2.5E-08 41.5 15.2 132 66-200 85-228 (504) 33 COG1729 Uncharacterized protei 98.0 0.00018 4.7E-09 46.0 11.2 100 56-156 144-244 (262) 34 KOG1126 consensus 97.9 0.00045 1.2E-08 43.6 12.6 120 64-190 500-619 (638) 35 PRK10049 pgaA outer membrane p 97.9 0.0011 2.8E-08 41.2 14.1 137 61-200 297-450 (818) 36 KOG3060 consensus 97.9 0.0021 5.4E-08 39.4 14.9 145 60-211 59-203 (289) 37 KOG4626 consensus 97.8 0.0013 3.4E-08 40.7 13.4 120 65-191 298-417 (966) 38 KOG0553 consensus 97.8 0.00083 2.1E-08 42.0 12.1 105 95-204 87-191 (304) 39 PRK10866 outer membrane protei 97.8 0.0014 3.5E-08 40.6 13.0 123 64-187 80-237 (243) 40 TIGR02552 LcrH_SycD type III s 97.7 0.00072 1.8E-08 42.3 10.9 112 91-209 20-137 (137) 41 COG4649 Uncharacterized protei 97.7 0.00099 2.5E-08 41.5 11.0 131 23-157 32-197 (221) 42 KOG2002 consensus 97.7 0.0041 1E-07 37.7 14.8 126 65-193 248-373 (1018) 43 COG5010 TadD Flp pilus assembl 97.7 0.0028 7.2E-08 38.7 12.8 101 93-198 70-171 (257) 44 KOG0543 consensus 97.6 0.0011 2.9E-08 41.1 10.1 106 96-202 215-331 (397) 45 PRK02603 photosystem I assembl 97.6 0.0028 7.3E-08 38.7 12.0 100 51-150 33-136 (172) 46 KOG2076 consensus 97.6 0.0056 1.4E-07 36.8 15.2 134 51-189 411-553 (895) 47 KOG0547 consensus 97.6 0.0043 1.1E-07 37.6 12.8 116 66-190 441-565 (606) 48 COG3118 Thioredoxin domain-con 97.5 0.0064 1.6E-07 36.5 16.5 155 52-213 133-289 (304) 49 TIGR02521 type_IV_pilW type IV 97.5 0.0042 1.1E-07 37.6 12.3 131 63-198 81-219 (247) 50 COG4783 Putative Zn-dependent 97.5 0.007 1.8E-07 36.2 13.1 77 128-205 307-384 (484) 51 KOG2002 consensus 97.5 0.0019 4.9E-08 39.7 10.0 60 91-153 309-368 (1018) 52 KOG1840 consensus 97.4 0.0084 2.1E-07 35.7 13.6 102 88-190 198-311 (508) 53 COG4700 Uncharacterized protei 97.4 0.0092 2.3E-07 35.5 15.7 124 89-216 89-213 (251) 54 PRK10747 putative protoheme IX 97.4 0.0092 2.4E-07 35.5 15.5 54 65-119 130-183 (398) 55 cd00189 TPR Tetratricopeptide 97.4 0.0052 1.3E-07 37.0 11.3 93 57-155 4-96 (100) 56 CHL00033 ycf3 photosystem I as 97.4 0.007 1.8E-07 36.2 11.7 102 87-190 29-137 (165) 57 KOG0624 consensus 97.4 0.01 2.6E-07 35.2 16.2 146 53-204 38-198 (504) 58 PRK10049 pgaA outer membrane p 97.2 0.017 4.3E-07 33.9 16.9 141 64-208 335-490 (818) 59 pfam09976 DUF2133 Uncharacteri 97.2 0.00091 2.3E-08 41.7 5.4 41 10-50 2-42 (43) 60 COG4499 Predicted membrane pro 97.1 0.018 4.7E-07 33.7 12.7 132 17-165 216-349 (434) 61 COG3063 PilF Tfp pilus assembl 97.1 0.012 3.1E-07 34.7 10.9 124 65-193 81-204 (250) 62 KOG0550 consensus 97.1 0.02 5.2E-07 33.3 14.7 149 52-204 202-362 (486) 63 KOG2376 consensus 97.1 0.021 5.4E-07 33.3 18.3 45 68-113 356-400 (652) 64 KOG2076 consensus 97.1 0.021 5.5E-07 33.2 14.2 73 131-203 418-490 (895) 65 PRK10153 DNA-binding transcrip 97.0 0.024 6.2E-07 32.9 14.1 143 52-196 330-479 (512) 66 KOG1840 consensus 97.0 0.025 6.3E-07 32.9 14.4 126 65-190 253-395 (508) 67 pfam10300 IML2 Putative mitoch 97.0 0.026 6.7E-07 32.7 11.9 121 65-188 228-355 (446) 68 KOG4340 consensus 97.0 0.023 5.8E-07 33.1 10.9 128 55-191 80-207 (459) 69 KOG4234 consensus 97.0 0.025 6.4E-07 32.8 11.1 101 95-196 101-202 (271) 70 KOG3060 consensus 96.8 0.033 8.4E-07 32.1 14.6 137 59-204 92-231 (289) 71 PRK10803 hypothetical protein; 96.8 0.014 3.7E-07 34.3 8.7 100 56-156 144-245 (262) 72 COG3071 HemY Uncharacterized e 96.6 0.046 1.2E-06 31.2 14.3 84 65-154 130-214 (400) 73 KOG1173 consensus 96.6 0.048 1.2E-06 31.1 13.5 145 60-208 387-535 (611) 74 pfam04348 LppC LppC putative l 96.5 0.059 1.5E-06 30.5 16.3 101 88-189 23-125 (535) 75 COG3898 Uncharacterized membra 96.5 0.059 1.5E-06 30.5 19.3 151 45-201 76-227 (531) 76 KOG1125 consensus 96.4 0.064 1.6E-06 30.3 12.4 143 62-209 328-509 (579) 77 KOG2003 consensus 96.2 0.08 2.1E-06 29.7 13.1 175 10-189 363-585 (840) 78 COG5010 TadD Flp pilus assembl 96.2 0.081 2.1E-06 29.6 16.1 133 66-205 79-211 (257) 79 KOG1126 consensus 96.2 0.087 2.2E-06 29.5 15.6 118 88-210 420-537 (638) 80 KOG0543 consensus 96.0 0.1 2.6E-06 29.1 13.3 101 88-192 256-356 (397) 81 KOG0547 consensus 95.9 0.11 2.9E-06 28.8 13.0 87 96-187 122-208 (606) 82 CHL00033 ycf3 photosystem I as 95.9 0.12 2.9E-06 28.7 10.3 99 52-150 30-132 (165) 83 pfam10140 essB Predicted membr 95.9 0.12 3E-06 28.7 13.9 129 15-156 188-316 (359) 84 pfam04733 Coatomer_E Coatomer 95.9 0.12 3.1E-06 28.6 15.1 145 56-213 105-251 (290) 85 KOG2003 consensus 95.8 0.12 3.2E-06 28.5 12.5 94 93-190 423-518 (840) 86 TIGR00990 3a0801s09 mitochondr 95.8 0.13 3.3E-06 28.4 11.3 184 6-198 21-236 (649) 87 pfam09295 ChAPs ChAPs (Chs5p-A 95.8 0.13 3.4E-06 28.3 12.4 113 66-188 182-294 (395) 88 COG4105 ComL DNA uptake lipopr 95.4 0.17 4.4E-06 27.6 12.1 123 64-187 82-229 (254) 89 TIGR02552 LcrH_SycD type III s 95.4 0.16 4E-06 27.9 8.5 96 51-152 16-111 (137) 90 KOG1129 consensus 95.2 0.2 5.2E-06 27.2 9.5 52 67-120 270-321 (478) 91 TIGR00990 3a0801s09 mitochondr 95.1 0.23 5.8E-06 26.9 9.9 27 163-189 577-603 (649) 92 KOG1125 consensus 95.0 0.24 6.2E-06 26.7 13.2 91 95-190 436-526 (579) 93 COG4700 Uncharacterized protei 94.0 0.41 1E-05 25.3 8.2 129 56-191 92-222 (251) 94 TIGR03504 FimV_Cterm FimV C-te 93.6 0.2 5.2E-06 27.2 5.6 40 168-209 5-44 (44) 95 pfam09986 DUF2225 Uncharacteri 93.6 0.48 1.2E-05 24.9 12.6 83 124-206 115-211 (214) 96 KOG4648 consensus 93.5 0.5 1.3E-05 24.8 9.4 100 95-202 103-205 (536) 97 pfam07219 HemY_N HemY protein 92.3 0.73 1.9E-05 23.7 10.3 58 45-104 77-134 (134) 98 KOG4162 consensus 91.9 0.81 2.1E-05 23.5 13.7 99 89-193 684-785 (799) 99 KOG1156 consensus 91.8 0.85 2.2E-05 23.3 17.2 160 44-206 85-263 (700) 100 PRK10920 putative uroporphyrin 91.6 0.88 2.2E-05 23.3 14.8 28 23-50 38-65 (389) 101 pfam12072 DUF3552 Domain of un 91.4 0.8 2E-05 23.5 6.3 29 23-51 2-30 (201) 102 pfam10602 RPN7 26S proteasome 89.8 1.3 3.3E-05 22.2 13.8 104 89-193 36-144 (174) 103 PRK04778 septation ring format 89.2 1.4 3.6E-05 21.9 17.1 167 23-191 4-206 (569) 104 pfam00515 TPR_1 Tetratricopept 89.2 0.7 1.8E-05 23.8 4.4 29 164-192 3-31 (34) 105 pfam03704 BTAD Bacterial trans 88.7 1.5 3.9E-05 21.7 12.5 75 129-204 64-144 (146) 106 COG3629 DnrI DNA-binding trans 88.1 1.7 4.3E-05 21.5 11.0 99 91-190 117-215 (280) 107 PRK11189 lipoprotein NlpI; Pro 87.9 1.7 4.4E-05 21.4 16.2 132 62-197 108-271 (297) 108 KOG1070 consensus 87.9 1.7 4.4E-05 21.4 16.4 148 55-207 1531-1679(1710) 109 KOG4555 consensus 87.8 1.8 4.5E-05 21.4 12.2 100 91-196 45-148 (175) 110 KOG1155 consensus 87.7 1.8 4.6E-05 21.3 14.4 32 163-194 433-464 (559) 111 KOG0687 consensus 87.0 2 5E-05 21.1 11.6 105 86-190 101-209 (393) 112 PRK01844 hypothetical protein; 86.8 1.5 3.9E-05 21.8 4.9 32 20-51 3-34 (72) 113 smart00028 TPR Tetratricopepti 86.8 1.1 2.8E-05 22.7 4.1 30 164-193 3-32 (34) 114 pfam06295 DUF1043 Protein of u 86.7 2 5.2E-05 21.0 5.9 25 67-91 50-74 (128) 115 pfam07719 TPR_2 Tetratricopept 85.9 0.9 2.3E-05 23.2 3.3 29 164-192 3-31 (34) 116 KOG1497 consensus 85.7 2.3 5.8E-05 20.7 11.4 115 85-199 99-221 (399) 117 KOG2300 consensus 85.6 2.3 5.9E-05 20.6 15.5 138 65-203 335-486 (629) 118 pfam00957 Synaptobrevin Synapt 85.2 2.4 6.1E-05 20.5 5.3 31 8-38 53-83 (89) 119 PRK11677 cytochrome d ubiquino 85.0 2.5 6.3E-05 20.5 6.7 25 67-91 54-78 (134) 120 PRK03598 hypothetical protein; 85.0 0.38 9.7E-06 25.5 1.1 13 176-188 159-171 (331) 121 KOG0548 consensus 84.7 2.5 6.5E-05 20.4 18.8 137 51-194 315-458 (539) 122 KOG2053 consensus 84.6 2.6 6.5E-05 20.4 10.9 150 57-211 47-241 (932) 123 pfam06160 EzrA Septation ring 84.5 2.6 6.6E-05 20.3 15.2 165 24-191 2-202 (559) 124 cd05804 StaR_like StaR_like; a 83.5 2.9 7.3E-05 20.1 17.0 135 54-192 44-216 (355) 125 pfam06864 PAP_PilO Pilin acces 82.7 3.1 7.8E-05 19.9 6.3 31 21-51 164-195 (414) 126 pfam10345 Cohesin_load Cohesin 81.7 3.4 8.5E-05 19.6 17.4 25 163-187 181-205 (593) 127 pfam10828 DUF2570 Protein of u 81.3 3.5 8.9E-05 19.5 7.4 28 21-49 2-29 (110) 128 COG1698 Uncharacterized protei 81.0 3.5 9E-05 19.5 8.3 86 95-211 5-90 (93) 129 KOG1128 consensus 80.9 3.6 9.1E-05 19.5 6.3 100 91-195 487-586 (777) 130 KOG1174 consensus 80.9 3.6 9.1E-05 19.5 12.9 82 105-192 420-501 (564) 131 PRK09782 bacteriophage N4 rece 80.3 3.7 9.5E-05 19.4 15.3 119 66-192 589-707 (987) 132 KOG0495 consensus 79.8 3.9 9.9E-05 19.3 12.9 50 140-190 664-713 (913) 133 KOG4782 consensus 79.4 3.9 9.9E-05 19.2 4.5 36 12-49 47-83 (108) 134 KOG1586 consensus 78.8 4.2 0.00011 19.1 13.5 135 52-190 72-223 (288) 135 TIGR02267 TIGR02267 Myxococcus 77.2 4.7 0.00012 18.8 6.7 50 160-210 74-123 (123) 136 PRK04841 transcriptional regul 76.3 4.9 0.00013 18.6 13.8 25 165-189 734-758 (903) 137 KOG3081 consensus 75.4 5.2 0.00013 18.5 16.0 110 98-213 146-257 (299) 138 PRK11556 multidrug efflux syst 74.2 3.3 8.3E-05 19.7 3.0 27 17-45 3-29 (415) 139 KOG2005 consensus 71.4 6.5 0.00017 17.9 11.8 137 67-209 448-589 (878) 140 PRK10476 multidrug resistance 71.1 1.2 3E-05 22.4 0.1 23 22-44 13-35 (348) 141 pfam05568 ASFV_J13L African sw 70.9 3.2 8.2E-05 19.8 2.3 44 24-67 34-77 (189) 142 KOG0545 consensus 70.4 6.8 0.00017 17.7 11.3 110 91-201 180-303 (329) 143 PRK04330 hypothetical protein; 69.3 7.2 0.00018 17.6 8.0 81 100-211 6-86 (88) 144 PRK12370 invasion protein regu 68.7 7.4 0.00019 17.5 13.2 134 53-191 258-401 (553) 145 COG5187 RPN7 26S proteasome re 68.3 7.5 0.00019 17.5 11.0 109 85-193 111-223 (412) 146 COG3170 FimV Tfp pilus assembl 67.2 7.9 0.0002 17.3 5.3 44 164-209 711-754 (755) 147 pfam04053 Coatomer_WDAD Coatom 66.3 8.3 0.00021 17.2 5.4 45 138-188 321-365 (435) 148 PRK07021 fliL flagellar basal 66.0 4.9 0.00013 18.6 2.4 23 24-46 17-39 (159) 149 COG3763 Uncharacterized protei 65.9 8.4 0.00022 17.2 4.8 26 23-48 6-31 (71) 150 KOG2047 consensus 65.8 8.5 0.00022 17.2 9.6 25 166-190 252-276 (835) 151 COG3167 PilO Tfp pilus assembl 65.5 8.6 0.00022 17.1 4.9 36 17-52 19-54 (211) 152 PRK09458 pspB phage shock prot 64.6 8.9 0.00023 17.0 4.0 22 65-86 50-71 (75) 153 KOG0376 consensus 64.0 9.1 0.00023 16.9 6.0 29 163-191 73-101 (476) 154 pfam04011 LemA LemA family. Th 63.9 9.2 0.00023 16.9 14.0 45 165-211 121-165 (186) 155 pfam09402 MSC Man1-Src1p-C-ter 63.6 9.3 0.00024 16.9 12.0 100 11-119 199-298 (325) 156 KOG2062 consensus 63.6 9.3 0.00024 16.9 14.3 161 42-209 490-651 (929) 157 KOG1127 consensus 63.4 9.4 0.00024 16.9 18.7 126 69-199 508-633 (1238) 158 KOG0860 consensus 63.3 9.4 0.00024 16.9 5.2 29 9-37 80-108 (116) 159 pfam12273 RCR Chitin synthesis 62.0 9 0.00023 17.0 3.1 23 22-44 1-23 (124) 160 COG4968 PilE Tfp pilus assembl 62.0 9.9 0.00025 16.7 6.5 36 23-58 13-48 (139) 161 PRK11619 lytic murein transgly 61.1 10 0.00026 16.6 13.5 124 57-190 245-374 (645) 162 KOG1915 consensus 60.7 10 0.00027 16.6 10.5 71 49-121 259-354 (677) 163 PRK02201 putative inner membra 59.6 11 0.00028 16.5 4.3 31 4-34 16-46 (392) 164 PRK05696 fliL flagellar basal 59.6 7.5 0.00019 17.5 2.4 12 31-42 29-40 (168) 165 COG4093 Uncharacterized protei 59.5 11 0.00028 16.4 3.8 38 15-52 10-47 (338) 166 pfam03685 UPF0147 Uncharacteri 59.3 11 0.00028 16.4 8.9 79 102-211 5-83 (85) 167 pfam09543 DUF2379 Protein of u 58.6 11 0.00029 16.4 5.7 51 159-210 72-122 (122) 168 PRK08455 fliL flagellar basal 58.6 6.1 0.00016 18.0 1.8 17 23-39 21-37 (180) 169 TIGR03319 YmdA_YtgF conserved 58.3 12 0.00029 16.3 6.8 28 24-51 1-28 (514) 170 PRK10559 p-hydroxybenzoic acid 57.3 12 0.00031 16.2 4.5 30 17-47 5-34 (310) 171 pfam03672 UPF0154 Uncharacteri 56.7 12 0.00031 16.2 3.6 23 27-49 3-25 (64) 172 pfam11297 DUF3098 Protein of u 55.9 11 0.00028 16.5 2.7 21 17-37 4-24 (64) 173 COG4785 NlpI Lipoprotein NlpI, 54.2 13 0.00034 15.9 6.6 126 61-190 107-265 (297) 174 TIGR00247 TIGR00247 conserved 54.1 14 0.00035 15.9 3.5 29 22-50 14-42 (373) 175 PRK10270 hypothetical protein; 53.0 14 0.00036 15.8 3.4 19 28-46 10-28 (340) 176 KOG4403 consensus 51.8 15 0.00037 15.7 6.8 25 22-47 215-239 (575) 177 TIGR02976 phageshock_pspB phag 51.7 15 0.00038 15.7 5.0 55 31-86 14-71 (75) 178 PRK12785 fliL flagellar basal 51.2 4.7 0.00012 18.8 0.2 17 27-43 32-48 (167) 179 KOG4814 consensus 50.8 15 0.00039 15.6 9.4 99 92-191 357-457 (872) 180 pfam04415 DUF515 Protein of un 50.6 15 0.00039 15.6 5.7 34 18-51 27-60 (416) 181 PRK00106 hypothetical protein; 49.1 16 0.00041 15.4 6.0 22 23-44 3-24 (535) 182 KOG3785 consensus 48.5 17 0.00042 15.3 13.9 54 133-188 434-487 (557) 183 KOG1585 consensus 47.7 17 0.00043 15.3 12.5 152 52-207 69-236 (308) 184 pfam02009 Rifin_STEVOR Rifin/s 47.1 17 0.00044 15.2 4.0 37 8-44 86-124 (289) 185 PRK10927 essential cell divisi 45.6 4.9 0.00012 18.6 -0.4 28 20-47 32-59 (319) 186 PRK09379 membrane-bound transc 45.4 18 0.00047 15.1 4.0 29 18-47 3-31 (303) 187 TIGR00771 DcuC transporter, an 45.3 19 0.00047 15.0 6.0 46 14-59 133-178 (390) 188 pfam06667 PspB Phage shock pro 45.1 19 0.00048 15.0 3.9 21 66-86 51-71 (75) 189 COG3107 LppC Putative lipoprot 44.4 19 0.00049 15.0 14.0 91 92-184 66-158 (604) 190 PRK07193 fliF flagellar MS-rin 44.4 19 0.00049 15.0 4.5 20 196-215 250-269 (550) 191 pfam06786 UPF0253 Uncharacteri 44.3 19 0.00049 14.9 3.9 30 108-137 30-59 (66) 192 KOG2796 consensus 44.0 19 0.0005 14.9 12.3 85 65-153 224-312 (366) 193 PRK03427 cell division protein 43.5 9.7 0.00025 16.8 0.8 24 24-49 10-33 (331) 194 PRK01741 cell division protein 43.5 10 0.00025 16.7 0.9 23 24-48 8-30 (342) 195 COG4941 Predicted RNA polymera 43.0 20 0.00051 14.8 11.6 135 70-210 273-412 (415) 196 COG4976 Predicted methyltransf 42.9 20 0.00052 14.8 5.5 56 64-121 6-61 (287) 197 PRK04964 hypothetical protein; 42.3 21 0.00053 14.8 3.8 33 181-213 30-62 (66) 198 pfam10516 SHNi-TPR SHNi-TPR. S 41.9 21 0.00054 14.7 3.3 26 165-190 4-29 (38) 199 TIGR03362 VI_chp_7 type VI sec 41.6 21 0.00054 14.7 8.1 62 130-191 216-279 (301) 200 pfam03748 FliL Flagellar basal 41.5 21 0.00054 14.7 3.1 22 23-44 6-27 (145) 201 KOG1550 consensus 41.4 21 0.00054 14.7 11.5 80 104-192 308-394 (552) 202 TIGR01495 ETRAMP early transcr 40.9 21 0.00054 14.7 2.3 25 24-48 70-94 (99) 203 pfam10037 MRP-S27 Mitochondria 40.1 22 0.00057 14.5 7.0 72 62-137 110-184 (417) 204 PRK10506 hypothetical protein; 40.0 22 0.00057 14.5 6.1 39 24-62 13-51 (155) 205 pfam07006 DUF1310 Protein of u 39.9 4.8 0.00012 18.7 -1.2 13 70-82 39-51 (122) 206 pfam06097 DUF945 Bacterial pro 39.7 23 0.00058 14.5 3.3 26 24-49 5-30 (460) 207 COG3105 Uncharacterized protei 39.6 23 0.00058 14.5 6.8 22 21-42 8-29 (138) 208 COG5294 Uncharacterized protei 39.2 12 0.00031 16.2 0.8 24 22-45 2-25 (113) 209 pfam04375 HemX HemX. This fami 39.2 23 0.00059 14.5 14.4 10 103-112 147-156 (372) 210 COG5547 Small integral membran 39.1 23 0.00059 14.4 3.1 19 17-35 3-21 (62) 211 PRK00523 hypothetical protein; 39.0 23 0.00059 14.4 4.6 23 29-51 13-35 (73) 212 pfam07721 TPR_4 Tetratricopept 38.6 24 0.0006 14.4 2.4 23 164-186 3-25 (26) 213 pfam10938 YfdX YfdX protein. Y 38.5 24 0.0006 14.4 6.4 62 129-190 77-145 (155) 214 PRK12757 cell division protein 38.0 7.3 0.00019 17.5 -0.5 22 25-46 2-23 (249) 215 PRK10941 putative transcriptio 37.7 24 0.00062 14.3 8.3 75 133-211 187-263 (269) 216 COG3115 ZipA Cell division pro 36.9 25 0.00064 14.2 2.4 22 24-47 9-30 (324) 217 PRK05886 yajC preprotein trans 36.9 25 0.00064 14.2 4.7 18 42-59 20-37 (108) 218 TIGR01477 RIFIN variant surfac 36.9 25 0.00064 14.2 3.2 42 68-109 214-257 (395) 219 PRK01294 lipase chaperone; Pro 36.7 25 0.00065 14.2 3.5 17 29-45 12-28 (339) 220 pfam09670 Cas_Cas02710 CRISPR- 35.8 26 0.00067 14.1 14.3 63 58-120 136-200 (379) 221 TIGR03061 pip_yhgE_Nterm YhgE/ 35.3 27 0.00068 14.1 3.4 33 15-47 5-37 (164) 222 cd00280 TRFH Telomeric Repeat 34.6 27 0.0007 14.0 7.9 67 105-172 85-155 (200) 223 TIGR00847 ccoS cytochrome oxid 33.9 28 0.00071 13.9 2.9 27 24-50 5-31 (53) 224 pfam10969 DUF2771 Protein of u 33.8 28 0.00072 13.9 2.5 25 22-46 4-28 (161) 225 PRK10893 hypothetical protein; 33.7 22 0.00056 14.6 1.4 20 21-40 5-24 (190) 226 COG2979 Uncharacterized protei 33.3 29 0.00073 13.9 3.0 25 182-206 194-218 (225) 227 TIGR00756 PPR pentatricopeptid 33.0 29 0.00074 13.8 3.6 29 167-195 5-33 (35) 228 KOG1464 consensus 32.8 29 0.00074 13.8 5.6 103 92-194 148-263 (440) 229 pfam10694 DUF2500 Protein of u 32.6 25 0.00065 14.2 1.6 20 26-46 5-24 (107) 230 pfam06120 Phage_HK97_TLTM Tail 32.5 30 0.00075 13.8 9.9 56 9-65 9-64 (311) 231 COG1422 Predicted membrane pro 32.3 30 0.00076 13.8 7.6 39 41-79 68-106 (201) 232 PTZ00202 tuzin; Provisional 32.0 30 0.00077 13.7 7.4 49 9-57 295-349 (664) 233 PRK08881 rpsN 30S ribosomal pr 31.5 31 0.00078 13.7 4.2 32 178-209 20-51 (101) 234 PRK00269 zipA cell division pr 31.5 22 0.00057 14.5 1.1 20 24-45 10-29 (295) 235 TIGR02588 TIGR02588 conserved 31.3 31 0.00079 13.7 2.5 26 23-48 5-30 (122) 236 PRK06007 fliF flagellar MS-rin 30.8 32 0.0008 13.6 4.1 36 10-45 12-47 (540) 237 TIGR02161 napC_nirT periplasmi 30.5 32 0.00081 13.6 3.8 34 103-137 93-126 (185) 238 TIGR02833 spore_III_AB stage I 30.3 32 0.00082 13.6 10.3 91 24-118 2-95 (170) 239 PRK07718 fliL flagellar basal 29.8 33 0.00084 13.5 2.2 23 21-43 6-28 (142) 240 PRK10617 cytochrome c-type pro 29.6 33 0.00084 13.5 5.1 44 15-58 13-57 (200) 241 PRK06770 hypothetical protein; 29.5 33 0.00085 13.5 8.9 21 18-38 2-22 (185) 242 COG5110 RPN1 26S proteasome re 29.5 33 0.00085 13.5 9.8 152 29-181 396-564 (881) 243 PRK10834 hypothetical protein; 29.1 34 0.00086 13.4 3.9 33 16-49 1-33 (239) 244 PRK11086 sensory histidine kin 28.8 34 0.00087 13.4 7.3 18 136-153 358-375 (541) 245 COG1566 EmrA Multidrug resista 28.3 35 0.00089 13.4 4.3 32 15-47 9-40 (352) 246 PHA02291 hypothetical protein 28.3 35 0.00089 13.3 2.6 25 21-45 4-28 (131) 247 pfam11381 DUF3185 Protein of u 28.1 23 0.00059 14.4 0.7 20 26-45 3-22 (59) 248 PRK10856 hypothetical protein; 27.7 36 0.00091 13.3 2.6 15 31-46 123-137 (332) 249 pfam10039 DUF2275 Predicted in 27.4 36 0.00092 13.3 1.9 22 14-35 28-49 (199) 250 COG0457 NrfG FOG: TPR repeat [ 27.3 36 0.00092 13.2 17.0 127 65-194 107-234 (291) 251 PTZ00046 rifin; Provisional 27.3 36 0.00092 13.2 3.9 39 7-46 129-169 (348) 252 pfam11044 TMEMspv1-c74-12 Plec 27.2 36 0.00093 13.2 4.4 24 26-49 8-31 (49) 253 KOG1463 consensus 26.9 37 0.00094 13.2 12.1 91 123-213 124-222 (411) 254 pfam09577 Spore_YpjB Sporulati 26.8 37 0.00094 13.2 12.3 135 55-211 7-161 (232) 255 PRK12800 fliF flagellar MS-rin 26.6 37 0.00095 13.2 3.5 20 196-215 253-272 (572) 256 PRK04561 tatA twin arginine tr 26.4 38 0.00096 13.1 1.6 15 19-33 5-19 (75) 257 pfam10177 DUF2371 Uncharacteri 25.8 18 0.00045 15.2 -0.2 32 11-42 32-63 (142) 258 KOG1308 consensus 25.6 39 0.00099 13.0 6.7 72 44-117 105-176 (377) 259 KOG4507 consensus 25.4 39 0.001 13.0 7.0 93 92-191 609-705 (886) 260 pfam11780 DUF3318 Protein of u 25.2 40 0.001 13.0 2.9 23 30-52 102-124 (142) 261 TIGR01411 tatAE twin arginine- 24.7 40 0.001 12.9 1.8 13 21-33 6-18 (48) 262 pfam11446 DUF2897 Protein of u 24.3 41 0.001 12.9 1.9 17 21-37 4-20 (55) 263 pfam11337 DUF3139 Protein of u 24.2 41 0.0011 12.9 3.1 24 39-62 25-48 (85) 264 PRK00720 tatA twin arginine tr 24.0 41 0.001 12.9 1.4 15 19-33 5-19 (71) 265 COG5261 IQG1 Protein involved 23.9 42 0.0011 12.8 5.0 22 27-48 606-627 (1054) 266 COG5116 RPN2 26S proteasome re 23.9 42 0.0011 12.8 11.0 194 9-209 446-648 (926) 267 PRK09687 hypothetical protein; 23.8 42 0.0011 12.8 17.6 131 70-212 89-222 (280) 268 PRK10209 acid-resistance membr 23.6 43 0.0011 12.8 4.5 36 4-39 3-39 (190) 269 TIGR03007 pepcterm_ChnLen poly 23.6 43 0.0011 12.8 2.0 95 14-116 8-110 (510) 270 TIGR03545 conserved hypothetic 23.4 43 0.0011 12.8 5.2 25 161-187 279-303 (554) 271 TIGR02830 spore_III_AG stage I 23.4 41 0.001 12.9 1.3 25 20-44 2-26 (193) 272 pfam12575 DUF3753 Protein of u 23.2 43 0.0011 12.8 1.7 17 23-39 50-66 (72) 273 TIGR02889 spore_YpeB germinati 23.0 44 0.0011 12.7 6.0 37 24-61 3-41 (465) 274 TIGR00380 cobD cobalamin biosy 23.0 44 0.0011 12.7 2.4 147 61-215 111-272 (322) 275 COG1704 LemA Uncharacterized c 23.0 44 0.0011 12.7 13.0 29 164-192 120-148 (185) 276 PRK13184 pknD serine/threonine 22.9 44 0.0011 12.7 10.0 95 103-198 781-877 (933) 277 pfam09716 ETRAMP Malarial earl 22.9 44 0.0011 12.7 3.1 17 28-44 62-78 (84) 278 pfam03097 BRO1 BRO1-like domai 22.8 44 0.0011 12.7 6.3 32 159-190 236-267 (374) 279 pfam09911 DUF2140 Uncharacteri 22.7 23 0.00059 14.5 -0.1 26 20-45 2-27 (187) 280 pfam07423 DUF1510 Protein of u 22.6 45 0.0011 12.7 3.4 30 9-42 7-36 (214) 281 pfam09782 NDUF_B6 NADH:ubiquin 22.4 26 0.00067 14.1 0.1 29 19-47 83-111 (152) 282 TIGR02831 spo_II_M stage II sp 22.1 46 0.0012 12.6 7.4 65 17-81 7-71 (201) 283 COG2959 HemX Uncharacterized e 21.8 46 0.0012 12.6 12.4 21 32-52 44-64 (391) 284 PRK04335 cell division protein 21.6 43 0.0011 12.8 1.1 23 24-48 9-31 (319) 285 pfam11743 DUF3301 Protein of u 21.5 47 0.0012 12.6 4.0 23 35-57 9-32 (97) 286 pfam06936 Selenoprotein_S Sele 21.5 47 0.0012 12.6 8.8 28 9-36 24-51 (190) 287 PRK01470 tatA twin arginine tr 21.3 31 0.0008 13.6 0.4 15 19-33 4-18 (53) 288 COG3005 TorC Nitrate/TMAO redu 21.2 48 0.0012 12.5 1.9 29 29-57 19-47 (190) 289 COG5353 Uncharacterized protei 20.9 46 0.0012 12.6 1.1 24 22-45 7-31 (161) 290 TIGR01655 yxeA_fam conserved h 20.9 48 0.0012 12.5 2.9 25 21-45 1-25 (121) 291 pfam06916 DUF1279 Protein of u 20.9 47 0.0012 12.6 1.2 34 12-45 1-34 (88) 292 TIGR02162 torC trimethylamine- 20.9 31 0.0008 13.6 0.3 43 15-57 2-46 (394) 293 COG3197 FixS Uncharacterized p 20.9 48 0.0012 12.5 2.5 12 33-44 15-26 (58) 294 pfam02116 STE2 Fungal pheromon 20.7 49 0.0012 12.5 4.2 16 18-33 36-51 (283) 295 TIGR02902 spore_lonB ATP-depen 20.7 23 0.00058 14.5 -0.5 28 26-53 3-30 (532) 296 COG4640 Predicted membrane pro 20.6 49 0.0012 12.4 3.1 39 35-77 64-102 (465) 297 PRK11519 tyrosine kinase; Prov 20.5 49 0.0013 12.4 3.2 28 9-37 19-46 (720) No 1 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=100.00 E-value=2.5e-31 Score=212.30 Aligned_cols=206 Identities=31% Similarity=0.403 Sum_probs=189.4 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH Q ss_conf 58899-99999975199999999999999999999999999999999999999998413876669999999731255206 Q gi|254780421|r 9 SYKGK-CKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLY 87 (218) Q Consensus 9 ~~~~~-~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~y 87 (218) -+-|+ -++|++...++|.+.+++++|. +||.+|.+||++...+.++.|..++...+.++.++|...|..+.+.+-.+| T Consensus 14 lRsEq~~~lw~rfgp~v~giailvVlGt-ag~~gy~yw~~s~as~sgd~flaAL~lA~~~k~d~Alaaf~~lektg~g~Y 92 (221) T COG4649 14 LRSEQAKTLWKRFGPAVIGIAILVVLGT-AGYVGYTYWQTSRASKSGDAFLAALKLAQENKTDDALAAFTDLEKTGYGSY 92 (221) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHCC-CCCEEEEHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCC T ss_conf 9999999999971109999999998323-420100011133424325899999999870796689999999986488855 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999999999951100577899999987312689578999999999963026998999999753114642356899999 Q gi|254780421|r 88 NPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEI 167 (218) Q Consensus 88 a~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~el 167 (218) ..||+|..|.+..++|+...|+..|..+..+.+.|.+.+++||+|.++++++++.|++..+++.++..+.+|+++++.|. T Consensus 93 pvLA~mr~at~~a~kgdta~AV~aFdeia~dt~~P~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d~n~mR~sArEA 172 (221) T COG4649 93 PVLARMRAATLLAQKGDTAAAVAAFDEIAADTSIPQIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGDGNPMRHSAREA 172 (221) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 69999999999861264788999899885257876013679999878987435648889987413048997017889998 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHH Q ss_conf 999999889989999999999835589989999999999950034032 Q gi|254780421|r 168 LGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANIIASDQR 215 (218) Q Consensus 168 lg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~~~~~ 215 (218) +|++.++.||+..|++.|+.|.++...|.++|+|++++|..|.+...+ T Consensus 173 LglAa~kagd~a~A~~~F~qia~Da~aprnirqRAq~mldlI~s~g~a 220 (221) T COG4649 173 LGLAAYKAGDFAKAKSWFVQIANDAQAPRNIRQRAQIMLDLIDSSGKA 220 (221) T ss_pred HHHHHHHCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHCCCCC T ss_conf 768887322467799999999701469677999999999998602558 No 2 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=99.96 E-value=3.1e-26 Score=180.80 Aligned_cols=205 Identities=16% Similarity=0.157 Sum_probs=173.8 Q ss_pred CCCCCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 9754112458899-999999751999999999999999999999999999999999999999984138766699999997 Q gi|254780421|r 1 MESNNDANSYKGK-CKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKI 79 (218) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l 79 (218) |+.--+-+.--+. -.-|+.++|++|+++|+++++++| |.||+.+|..+...+|..|++++...+.+.. +.....+++ T Consensus 1 M~~~~~E~qql~~ik~wwkeNGk~li~gviLg~~~lfG-W~ywq~~q~~q~~~AS~~Y~~~i~~~~ak~~-~~~~~~ekf 78 (207) T COG2976 1 MAYYLEEQQQLEAIKDWWKENGKALIVGVILGLGGLFG-WRYWQSHQVEQAQEASAQYQNAIKAVQAKKP-KSIAAAEKF 78 (207) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC-HHHHHHHHH T ss_conf 96315379999999999997790789999999999998-8999998788888999999999999855770-259999999 Q ss_pred HHCC-CCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCC Q ss_conf 3125-520699999999999951100577899999987312689578999999999963026998999999753114642 Q gi|254780421|r 80 LSQD-NKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSN 158 (218) Q Consensus 80 ~~~~-~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~ 158 (218) .... .+.|+.++.|.+|+..++.|++++|+..|++.+.. +.|..++.++.+|+++++++++++|++++.|..+.+++ T Consensus 79 ~~~n~~t~Ya~laaL~lAk~~ve~~~~d~A~aqL~~~l~~-t~De~lk~l~~lRLArvq~q~~k~D~AL~~L~t~~~~~- 156 (207) T COG2976 79 VQANGKTIYAVLAALELAKAEVEANNLDKAEAQLKQALAQ-TKDENLKALAALRLARVQLQQKKADAALKTLDTIKEES- 156 (207) T ss_pred HHHCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC-CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC- T ss_conf 8615635899999999999998634299999999998714-52578999999999999998603888999872301110- Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 35689999999999988998999999999983558998999999999995003 Q gi|254780421|r 159 PMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANIIA 211 (218) Q Consensus 159 ~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~ 211 (218) |.+...+++||+++..||.++||..|++.++.. .++..|+.++|+|.+|++ T Consensus 157 -w~~~~~elrGDill~kg~k~~Ar~ay~kAl~~~-~s~~~~~~lqmKLn~L~~ 207 (207) T COG2976 157 -WAAIVAELRGDILLAKGDKQEARAAYEKALESD-ASPAAREILQMKLNNLSG 207 (207) T ss_pred -HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHCCC T ss_conf -788999974269997286599999999998715-774889999958874279 No 3 >pfam10858 DUF2659 Protein of unknown function (DUF2659). This bacterial family of proteins has no known function. Probab=99.68 E-value=4.6e-15 Score=111.66 Aligned_cols=198 Identities=18% Similarity=0.211 Sum_probs=170.0 Q ss_pred HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHHCCCCCHHH Q ss_conf 99-9999997519999999999999999999999999999999999999999841-387666999999973125520699 Q gi|254780421|r 12 GK-CKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFN-SNKLDDARSSFEKILSQDNKLYNP 89 (218) Q Consensus 12 ~~-~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~-~~~~~~A~~~l~~l~~~~~~~ya~ 89 (218) +| +.-||+.--.+|+..++++. .......++..|.+..++.++.|...+.... .++.+.+...++.+...+++.... T Consensus 15 ~krliffkkll~iviiisl~~it-imvi~nnnk~~~i~nnqkngdi~vk~i~l~~~~~n~ela~~tlenlv~~sNTkike 93 (220) T pfam10858 15 EKRLIFFKKLLPIVIIISLIAIT-IMVINNNNKARQIANNQKNGDIFVKSVDLESTKNNSELAVQTLENLVSNSNTKIQE 93 (220) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH-HHHHHCCCHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHHHHHHHCCCHHHHH T ss_conf 78999999889999999999999-99997155577888514366545564462003575888899999998613447999 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCH-----HHHHHHHHHHCCCCCHHHHHH Q ss_conf 999999999951100577899999987312689578999999999963026998-----999999753114642356899 Q gi|254780421|r 90 LSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSY-----EEISKILQKLSEPSNPMHQFA 164 (218) Q Consensus 90 lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~-----d~a~~~l~~~~~~~~~~~~~a 164 (218) +|++.++...+..|++.+|...++.++++.......++.||+.++.+.++-++. +...+.|....+|+.||++.| T Consensus 94 IaaLe~va~kIs~~~y~eak~llNkIi~nk~ySei~tsyaRi~~c~l~Idd~nldi~dkekL~k~l~~f~d~~kPFwatA 173 (220) T pfam10858 94 IAHLEQVAIKISVGSFSEAKDLLNKIIENKEYSEITTSYARINWCSLVIDDNNLDIDDKEKLIKYLNYFDDESKPFWATA 173 (220) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHEECCCCCCCCCHHHHHHHHHHHCCCCCCHHHHH T ss_conf 99999999998211348999999999831309999999999988862233777673419999999873048999479899 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC Q ss_conf 9999999998899899999999998355899899999999999500 Q gi|254780421|r 165 NEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANII 210 (218) Q Consensus 165 ~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~ 210 (218) .+..+....+.|+..+|.+.++.++.+.++++.++.|+.+++.+++ T Consensus 174 ~i~~AiwdiKnn~~~~A~k~Lk~ll~snn~Sd~lKdqakaLl~nld 219 (220) T pfam10858 174 SIYKAIWDIKNNMDNKAEKNLEALIRSNNSSDLLKDQAKALLQNLE 219 (220) T ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCC T ss_conf 9999999886386189999999998422115889999999986357 No 4 >PRK11788 hypothetical protein; Provisional Probab=99.28 E-value=2.8e-09 Score=75.82 Aligned_cols=161 Identities=14% Similarity=0.151 Sum_probs=134.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHH Q ss_conf 9999999999999999999999---9999999999984138766699999997312552069999999999995110057 Q gi|254780421|r 30 LMILSLAIWFYLFDGSHEKKKN---IVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTK 106 (218) Q Consensus 30 ~~i~~~~~~~~~~~~~q~~~~~---~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~ 106 (218) +.|..+.|||-.-........+ ..+..|...+...-++..|+|+..|-++++..+.+.. .+|-++.++...|+.+ T Consensus 9 lpva~~~GW~~~r~~~~~~~~~~~~~l~~~Y~~Gln~Ll~~q~dkAid~f~~~~~~~~~t~e--~hlaLG~LfRrrGE~d 86 (389) T PRK11788 9 LPVAAAYGWYMGRRSARQLLQEQANRLSRDYFKGLNFLLNEQPDKAIDLFIEMLKVDPETVE--LHLALGNLFRRRGEVD 86 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHH--HHHHHHHHHHHCCCHH T ss_conf 99999999999860675555653122669999899887258945999999999864811499--9999999999656088 Q ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 78999999873126895789999999999630269989999997531146423568999999999998899899999999 Q gi|254780421|r 107 NAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFE 186 (218) Q Consensus 107 ~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~ 186 (218) .|+.+=+.++..++.+...+..|.+-++.-++..|=+|.|...+..+... +++...++..+..+|...+|.++|++.++ T Consensus 87 RAIriHq~Ll~rp~L~~~~r~~a~~ELa~Dy~~aGllDRAE~~f~~l~~~-~~~~~~al~~Ll~iye~~~dW~~Ai~~a~ 165 (389) T PRK11788 87 RAIRIHQNLLSRPDLTREQRLLALQELGQDYLKAGLLDRAEELFLQLVDE-DDFAEGALRQLLTIYQQEKDWQKAIEVAE 165 (389) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999972879899999999999999998677499999999999828-54269999999999998645999999999 Q ss_pred HHHHCCC Q ss_conf 9983558 Q gi|254780421|r 187 ELAKDNN 193 (218) Q Consensus 187 ~i~~~~~ 193 (218) ++..... T Consensus 166 ~L~~~~~ 172 (389) T PRK11788 166 RLEKLGG 172 (389) T ss_pred HHHHCCC T ss_conf 9987047 No 5 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=99.09 E-value=1e-08 Score=72.36 Aligned_cols=174 Identities=11% Similarity=0.011 Sum_probs=124.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHH Q ss_conf 75199999999999999999999999999999999999999998413876669999999731255-20699999999999 Q gi|254780421|r 20 CYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASI 98 (218) Q Consensus 20 ~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~ 98 (218) .+|.+++.+++.++..++... + .........+.|..++...+++++..|+..|+.+...+| +.++..|.+.+|.+ T Consensus 3 ~~k~l~~~~~l~l~l~gCs~~--~--~~~~~~~~~~~Y~~A~~~~~~~~y~~A~~~fe~l~~~yP~s~~A~qA~L~~aya 78 (243) T PRK10866 3 RMKYLVAAATLSLFLAGCSGS--K--EEVPDNPPNEIYATAQQKLQDGNWKQAITQLEALDNRYPFGPYSQQVQLDLIYA 78 (243) T ss_pred HHHHHHHHHHHHHHHHHCCCC--C--CCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 589999999999999876899--8--888899999999999999987899999999999998689976789999999999 Q ss_pred HHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC---------------CCH---HHHHHHHHHHCC--CCC Q ss_conf 951100577899999987312689578999999999963026---------------998---999999753114--642 Q gi|254780421|r 99 LVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDT---------------SSY---EEISKILQKLSE--PSN 158 (218) Q Consensus 99 ~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~---------------~~~---d~a~~~l~~~~~--~~~ 158 (218) ++..++++.|+..++..+...+..+ ..+.|.+..+...+.. .++ -+|...+..+.. |++ T Consensus 79 yy~~~~y~~A~~~~~rFi~lyP~~~-~v~Ya~Yl~gl~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~f~~li~~yP~S 157 (243) T PRK10866 79 YYKNADLPLAQAAIDRFMRLNPTHP-NIDYVMYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFSDFSKLVRGYPNS 157 (243) T ss_pred HHHHCCHHHHHHHHHHHHHHCCCCC-CHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHHCCCC T ss_conf 8730778999999999998678999-738999998898774013588887432564307699999999999999878998 Q ss_pred HHHHHHHH--------------HHHHHHHHCCCHHHHHHHHHHHHHC-CCCCHHH Q ss_conf 35689999--------------9999999889989999999999835-5899899 Q gi|254780421|r 159 PMHQFANE--------------ILGISALKFGKVQKAKTIFEELAKD-NNSPFGI 198 (218) Q Consensus 159 ~~~~~a~e--------------llg~~~~~~Gd~~~A~~~y~~i~~~-~~~p~~~ 198 (218) +|...+.+ ..|..|.+.|.+..|..-++.+++. +++|..- T Consensus 158 ~Ya~dA~~r~~~lrn~LA~~El~Va~fY~kr~~y~AAi~R~~~vi~~Yp~t~~~~ 212 (243) T PRK10866 158 QYTTDATKRLVFLKDRLAKYEYSVAEYYTKRGAWVAVVNRVEGMLRDYPDTQATR 212 (243) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHH T ss_conf 4589999999999999999999999999992656999999999997689996339 No 6 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=99.08 E-value=1.4e-08 Score=71.52 Aligned_cols=144 Identities=18% Similarity=0.112 Sum_probs=113.8 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 9999999999998413876669999999731255-206999999999999511005778999999873126895789999 Q gi|254780421|r 51 NIVGENFAQALELFNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAA 129 (218) Q Consensus 51 ~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA 129 (218) ..+.+.|..++...+++++..|+..|+.+...+| +.++..|.+.+|..++..|++..|+..++..+...+.++ ..+.| T Consensus 31 ~~~~~lY~~a~~~~~~~~y~~A~~~fe~l~~~yp~s~~a~~A~l~~a~a~y~~~~y~~A~~~~~~Fi~~yP~~~-~~~ya 109 (235) T TIGR03302 31 RPAEELYNEAKEALDSGDYTEAIKYFEALESRYPFSPYAEQAQLDLAYAYYKSGDYAEAIAAADRFIRLHPNHP-DADYA 109 (235) T ss_pred CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC-HHHHH T ss_conf 89999999999999878999999999999986899313799999999999976439999999999999887743-49999 Q ss_pred HHHHHHHHHCCC--------CHHHHHHHHHHHCC--CCCHHHHHH--------------HHHHHHHHHHCCCHHHHHHHH Q ss_conf 999999630269--------98999999753114--642356899--------------999999999889989999999 Q gi|254780421|r 130 TLQAASILVDTS--------SYEEISKILQKLSE--PSNPMHQFA--------------NEILGISALKFGKVQKAKTIF 185 (218) Q Consensus 130 ~l~la~~~~~~~--------~~d~a~~~l~~~~~--~~~~~~~~a--------------~ellg~~~~~~Gd~~~A~~~y 185 (218) .+..+..+..+. ...+|+..++.+.. |++++...+ ....|..|++.|++..|...| T Consensus 110 ~y~~a~s~~~~~~~~~rDq~~~~~A~~~f~~~i~~yP~S~ya~dA~~~i~~l~~~LA~~e~~ia~~Y~k~~~y~aAi~r~ 189 (235) T TIGR03302 110 YYLRGLSNYKQIDDSDRDQTAAREAFEAFQELIRRYPNSEYAPDAKKRMDYLRNRLAGKELYVARFYLKRGAYVAAINRF 189 (235) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 99987998873665445808999999999999988899657999999999999999999999999999841709999999 Q ss_pred HHHHHC-CCCC Q ss_conf 999835-5899 Q gi|254780421|r 186 EELAKD-NNSP 195 (218) Q Consensus 186 ~~i~~~-~~~p 195 (218) +.+++. +++| T Consensus 190 ~~vl~~Yp~t~ 200 (235) T TIGR03302 190 ETVVENYPDTP 200 (235) T ss_pred HHHHHHCCCCC T ss_conf 99998689985 No 7 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=99.07 E-value=2.1e-08 Score=70.45 Aligned_cols=149 Identities=14% Similarity=0.165 Sum_probs=117.5 Q ss_pred HHHHHHHHHHHHHHCCC---HHHHHHHH--------HHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC Q ss_conf 99999999999841387---66699999--------99731255206999999999999511005778999999873126 Q gi|254780421|r 52 IVGENFAQALELFNSNK---LDDARSSF--------EKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDL 120 (218) Q Consensus 52 ~as~~y~~al~~~~~~~---~~~A~~~l--------~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~ 120 (218) +....|..++..++.++ ++.|+..+ +.++...|+-+. +.++.+.+.+..|+++.|..+++.++...+ T Consensus 269 ~p~a~y~kAlv~~q~~~naG~~~A~~~~aqvlDhls~~~l~~~p~~~~--a~~lAG~~~y~lg~~~~A~~yl~~~l~~~P 346 (924) T TIGR02917 269 SPQAAYLKALVDFQKKNNAGYEDARETLAQVLDHLSQAALKSAPEYLP--ALLLAGASEYQLGNLEQAEQYLNQILKAAP 346 (924) T ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 846899999999983357744899999999999999999961877425--999999999996138999999999998668 Q ss_pred CCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 89578999999999963026998999999753114642356899999999999889989999999999835589989999 Q gi|254780421|r 121 APLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGIST 200 (218) Q Consensus 121 ~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ 200 (218) .+ ..|+--++.+.+.+|++++++..|.+....++ -.+..+.++|..|++.||+++|.++|+++....+. +--. T Consensus 347 ~~----~~A~rllA~i~L~~g~~~~A~~~l~~~~~~~~-~d~~~l~~lG~~~L~~g~~~~A~~yl~ka~~~~P~--~a~~ 419 (924) T TIGR02917 347 NS----HQARRLLAAIQLREGRVDEAIATLSPALGLDS-DDSALLSLLGEAYLAKGDFEKAAEYLAKASELDPE--NAAA 419 (924) T ss_pred CC----HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC--HHHH T ss_conf 86----68999999999862897899999998861588-77689999999998638967899999876105898--0889 Q ss_pred HHHHHHHHH Q ss_conf 999999950 Q gi|254780421|r 201 RSQMILANI 209 (218) Q Consensus 201 ra~~~l~~l 209 (218) +.+.=|..| T Consensus 420 ~t~lG~~~L 428 (924) T TIGR02917 420 RTKLGISKL 428 (924) T ss_pred HHHHHHHHH T ss_conf 999998862 No 8 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=99.02 E-value=1.6e-07 Score=64.93 Aligned_cols=166 Identities=16% Similarity=0.230 Sum_probs=136.0 Q ss_pred HHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 9999999999999999999---9999999999999999984138766699999997312552069999999999995110 Q gi|254780421|r 27 LLILMILSLAIWFYLFDGS---HEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKG 103 (218) Q Consensus 27 ~~i~~i~~~~~~~~~~~~~---q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g 103 (218) .+++-|++..||+-.-.+- +.+++.+.|.-|...+...-++..|.|+..|-.+....+.++. +++-++.++..-| T Consensus 6 fLlLPvaaa~GWymgrrsaqq~~~~qa~~Lsr~Yv~GlNfLLs~Q~dKAvdlF~e~l~~d~~t~e--~~ltLGnLfRsRG 83 (389) T COG2956 6 FLLLPVAAAYGWYMGRRSAQQDKQDQANRLSRDYVKGLNFLLSNQPDKAVDLFLEMLQEDPETFE--AHLTLGNLFRSRG 83 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCHHHH--HHHHHHHHHHHCC T ss_conf 99999999877899636777647888845538888678898616963699999999835815678--9889888988635 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHH Q ss_conf 05778999999873126895789999999999630269989999997531146423568999999999998899899999 Q gi|254780421|r 104 DTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKT 183 (218) Q Consensus 104 ~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~ 183 (218) ..+.|+..-+.+++.++.+..-+.+|.+.++.-++..|=+|.|...+..+... +.|..-|+-.+-.+|....+..+|++ T Consensus 84 EvDRAIRiHQ~L~~spdlT~~qr~lAl~qL~~Dym~aGl~DRAE~~f~~L~de-~efa~~AlqqLl~IYQ~treW~KAIe 162 (389) T COG2956 84 EVDRAIRIHQTLLESPDLTFEQRLLALQQLGRDYMAAGLLDRAEDIFNQLVDE-GEFAEGALQQLLNIYQATREWEKAID 162 (389) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 08899999998852899866899999999989999856466899999997346-34328999999999998368999999 Q ss_pred HHHHHHHCCCCC Q ss_conf 999998355899 Q gi|254780421|r 184 IFEELAKDNNSP 195 (218) Q Consensus 184 ~y~~i~~~~~~p 195 (218) .-+++......+ T Consensus 163 ~A~~L~k~~~q~ 174 (389) T COG2956 163 VAERLVKLGGQT 174 (389) T ss_pred HHHHHHHCCCCC T ss_conf 999999718852 No 9 >PRK10370 formate-dependent nitrite reductase complex subunit NrfG; Provisional Probab=99.00 E-value=1.2e-07 Score=65.64 Aligned_cols=166 Identities=10% Similarity=0.070 Sum_probs=107.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 1999999999999999999999999999999--99999999998413876669999999731255206999999999999 Q gi|254780421|r 22 KWIAPLLILMILSLAIWFYLFDGSHEKKKNI--VGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASIL 99 (218) Q Consensus 22 k~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~--as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~ 99 (218) +.++.++++++++.+|++.....||..+.+. .++.. .........+..+..++.-+...|+. .-.-+.+++.+ T Consensus 17 ~~l~~~~~~~v~~~~g~Y~~~~~wq~v~~~~~~~~~~l---~~~~~~q~~e~~~~~L~~rL~~~Pdd--~~gW~~LGr~y 91 (206) T PRK10370 17 KMLTTLTILMVFLCVGSYLLSPKWQAVRAEYQRQADPL---RQFASQQTPEAQLQALQDKIRANPQN--SEQWALLGEYY 91 (206) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCHH---HCCCCCCHHHHHHHHHHHHHHHCCCC--HHHHHHHHHHH T ss_conf 99999999999999754750473999998628750732---06785141799999999999839988--89999999999 Q ss_pred HHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCH Q ss_conf 51100577899999987312689578999999999963026-99899999975311464235689999999999988998 Q gi|254780421|r 100 VAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDT-SSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKV 178 (218) Q Consensus 100 ~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~-~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~ 178 (218) +..|+++.|+..|+..+.-.+.+..+ ++-+--+..+.+. ..-+++...+.....- +|....++.++|..++..||+ T Consensus 92 ~~~~~~~~A~~Ay~kA~~L~p~~~~i--l~~yA~aL~~~~~~~~t~~~~~lL~~AL~l-Dp~~~~AL~Lla~~AFe~~dY 168 (206) T PRK10370 92 LWRNAYDNALLAYRQALQLRGENAEL--YAALATVLYYQAGQHMTPQTREMIDKALAL-DANEVTALMLLASDAFMQADY 168 (206) T ss_pred HHCCCHHHHHHHHHHHHHHCCCCHHH--HHHHHHHHHHHCCCCCCHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHCCH T ss_conf 97689899999999999757998789--999999999875987768999999999874-989788999999999982649 Q ss_pred HHHHHHHHHHHHCCCCC Q ss_conf 99999999998355899 Q gi|254780421|r 179 QKAKTIFEELAKDNNSP 195 (218) Q Consensus 179 ~~A~~~y~~i~~~~~~p 195 (218) .+|+.++++++...++. T Consensus 169 ~~AI~~Wq~lL~~~~p~ 185 (206) T PRK10370 169 AQAIELWQKLLDLNSPR 185 (206) T ss_pred HHHHHHHHHHHHCCCCC T ss_conf 99999999998457999 No 10 >TIGR02917 PEP_TPR_lipo putative PEP-CTERM system TPR-repeat lipoprotein; InterPro: IPR014266 This protein family occurs in strictly within a subset of Gram-negative bacterial species with the proposed PEP-CTERM/exosortase system , analogous to the LPXTG/sortase system common in Gram-positive bacteria. The proteins in this entry occur in a species if, and only if, a transmembrane histidine kinase (IPR014265 from INTERPRO) and a DNA-binding response regulator (IPR014264 from INTERPRO) also occur. The present of tetratricopeptide repeats (TPR) suggests they may be involved in protein-protein interaction, possibly for the regulation of PEP-CTERM protein expression, since many PEP-CTERM proteins in these genomes are preceded by a proposed DNA binding site for the response regulator.. Probab=98.90 E-value=2.2e-07 Score=64.06 Aligned_cols=130 Identities=16% Similarity=0.199 Sum_probs=70.9 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 99999999841387666999999973125520699999999999951100577899999987312689578999999999 Q gi|254780421|r 55 ENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAA 134 (218) Q Consensus 55 ~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la 134 (218) +.+.++-...+.|++..|+-.|+..+...++. .-|+|++|++|.+.||...|...|+...+ -..++ . .+...++ T Consensus 31 ~l~~~Ak~yl~~~~~~aAiI~LKNAL~~~p~~--~EARflLg~~Y~~~Gd~~~AEKEL~kA~~-lg~~~--~-~~~p~LA 104 (924) T TIGR02917 31 ELIEEAKKYLQKNKYKAAIIQLKNALQKDPND--AEARFLLGKIYLAQGDAAAAEKELRKALS-LGYPK--E-QVLPLLA 104 (924) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHCCCCC--HHHHHHHHHHHHHHCCHHHHHHHHHHHHH-CCCCH--H-HHHHHHH T ss_conf 99999999986189972267757877408674--78899999999984598888899999985-68998--6-5539999 Q ss_pred HHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 96302699899999975311464235689999999999988998999999999983 Q gi|254780421|r 135 SILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 135 ~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) +.++.++++++++..+......+.+-..-.+.++|.+++..|+.++|+..|+.++. T Consensus 105 ra~l~~~~~~~~~~~~~~~~~~~~~~~a~ll~~~G~A~L~~g~~~~A~~~~~~Al~ 160 (924) T TIGR02917 105 RAYLLQGKFQQVLDELEAKTSLDDEEAAELLALRGLAYLGLGELEEAQKSYEQALA 160 (924) T ss_pred HHHHHCCCHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 99985595789998547887787066999999999998850787899999999876 No 11 >PRK10803 hypothetical protein; Provisional Probab=98.87 E-value=8.5e-08 Score=66.67 Aligned_cols=116 Identities=15% Similarity=0.140 Sum_probs=94.6 Q ss_pred HHHHHHHH-HHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCC--CCCHHHHHHHHHH Q ss_conf 99999999-51100577899999987312689578999999999963026998999999753114--6423568999999 Q gi|254780421|r 92 NMYIASIL-VAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSE--PSNPMHQFANEIL 168 (218) Q Consensus 92 ~l~lA~~~-~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~--~~~~~~~~a~ell 168 (218) .+..|-.+ .+.+++++|+..|+..+.+.+ +..+..-|.|-++.+++..+++++|+..+..+.. |+++..+.++.-+ T Consensus 144 ~Y~~A~~Lvl~~k~y~~Ai~aF~~Fi~~yP-~s~y~~nA~YWLGe~~y~~~~~~~A~~~F~~vv~~yP~s~K~pdAllkl 222 (262) T PRK10803 144 DYNAAIALVQDKSRQDDAIVAFQNFIKKYP-DSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKSPKAADAMFKV 222 (262) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCC-CCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 899999999973569999999999998787-9987743099999999974378999999999998677998678999999 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 99999889989999999999835589989999999999950 Q gi|254780421|r 169 GISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 169 g~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) |.++.+.||.++|+..|++++..-+ .+..-..|+..|..| T Consensus 223 g~~~~~lg~~~~Ak~~l~~vi~~YP-~S~aA~lAk~~L~~l 262 (262) T PRK10803 223 GVIMQDKGDTAKAKAVYQQVIKKYP-GTDGAKQAQKRLNAM 262 (262) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHCC T ss_conf 9999985999999999999999885-998999999998629 No 12 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=98.74 E-value=1.2e-06 Score=59.58 Aligned_cols=156 Identities=19% Similarity=0.236 Sum_probs=122.0 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCC--HHHHHHHHHHHHHHHCCCCHH Q ss_conf 999999999999999984138766699999997312552069999999999995110--057789999998731268957 Q gi|254780421|r 47 EKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKG--DTKNAAEIFLKVANDDLAPLA 124 (218) Q Consensus 47 ~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g--~~~~A~~~l~~i~~~~~~~~~ 124 (218) ......+....+-++..++.|+.+.|...+++=++..|+.| .|+..+|-++...| +.+.|...|+..+.-++.+.. T Consensus 31 ~~~~~~a~~r~qLal~Yl~~G~~~~Ak~~l~~AL~~dp~~~--~a~~~lA~~Yq~~grP~~~~A~~~f~~Al~~~p~~g~ 108 (247) T TIGR02521 31 TDREKAAKIRVQLALGYLEQGDLEKAKENLDKALEHDPDDY--DAYLALALYYQQLGRPELEKAEQSFRKALTLKPNNGD 108 (247) T ss_pred CCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCC T ss_conf 02799999999999999860787899999999987376661--2899999999871799557899999998624899852 Q ss_pred HHHHHHHHHHHHHHCCC--CHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHH Q ss_conf 89999999999630269--98999999753114642-35689999999999988998999999999983-5589989999 Q gi|254780421|r 125 VRYAATLQAASILVDTS--SYEEISKILQKLSEPSN-PMHQFANEILGISALKFGKVQKAKTIFEELAK-DNNSPFGIST 200 (218) Q Consensus 125 ~~~lA~l~la~~~~~~~--~~d~a~~~l~~~~~~~~-~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~-~~~~p~~~r~ 200 (218) +.=.-+..+-.++ ++++|...|..-..+.. +-...+.|-+|...++.|+.++|+..|++++. |+.-|.++.. T Consensus 109 ----~~NNYG~FLC~~g~~~~~~A~~~F~~A~~~p~Y~~~~~~~eN~G~C~~~~g~~~~A~~~l~~Al~~dp~~p~~lle 184 (247) T TIGR02521 109 ----VLNNYGTFLCQQGNDKYEQAEQQFEAAIEDPLYPQPARSYENAGLCALKAGDTEKAEKYLTRALKIDPQRPESLLE 184 (247) T ss_pred ----EECHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHH T ss_conf ----3201457764048970689999999998668897330245549999986178688999999999858788855799 Q ss_pred HHHHHHHH Q ss_conf 99999995 Q gi|254780421|r 201 RSQMILAN 208 (218) Q Consensus 201 ra~~~l~~ 208 (218) -++..+.. T Consensus 185 La~l~~~~ 192 (247) T TIGR02521 185 LAELYYER 192 (247) T ss_pred HHHHHHHC T ss_conf 99998750 No 13 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=98.74 E-value=1.7e-06 Score=58.59 Aligned_cols=154 Identities=17% Similarity=0.187 Sum_probs=116.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCC-HH Q ss_conf 99999999999999999841387666999999973125520699999999999951100577899999987312689-57 Q gi|254780421|r 46 HEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAP-LA 124 (218) Q Consensus 46 q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~-~~ 124 (218) +....+.+....+-++...+.++...|...+++-+...++. .++++.+|.++...|+.+.|-+.|+..+.-.+-+ .+ T Consensus 28 ~~~~~~aa~arlqLal~YL~~gd~~~A~~nlekAL~~DPs~--~~a~~~~A~~Yq~~Ge~~~A~e~YrkAlsl~p~~GdV 105 (250) T COG3063 28 QTDRNEAAKARLQLALGYLQQGDYAQAKKNLEKALEHDPSY--YLAHLVRAHYYQKLGENDLADESYRKALSLAPNNGDV 105 (250) T ss_pred CCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCH T ss_conf 22588999999999999997789999998899998849520--8899999999999077556799999998529886405 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHHHH Q ss_conf 89999999999630269989999997531146-4235689999999999988998999999999983-558998999999 Q gi|254780421|r 125 VRYAATLQAASILVDTSSYEEISKILQKLSEP-SNPMHQFANEILGISALKFGKVQKAKTIFEELAK-DNNSPFGISTRS 202 (218) Q Consensus 125 ~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~-~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~-~~~~p~~~r~ra 202 (218) +.. -+..+-.++.|+++...|..-..+ .=+-.+...|-+|...+++|+++.|++.|++.+. +++-|...-..+ T Consensus 106 LNN-----YG~FLC~qg~~~eA~q~F~~Al~~P~Y~~~s~t~eN~G~Cal~~gq~~~A~~~l~raL~~dp~~~~~~l~~a 180 (250) T COG3063 106 LNN-----YGAFLCAQGRPEEAMQQFERALADPAYGEPSDTLENLGLCALKAGQFDQAEEYLKRALELDPQFPPALLELA 180 (250) T ss_pred HHH-----HHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 521-----069997089868999999998718887775304544578886437704479999999883757876789987 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780421|r 203 QMIL 206 (218) Q Consensus 203 ~~~l 206 (218) +..+ T Consensus 181 ~~~~ 184 (250) T COG3063 181 RLHY 184 (250) T ss_pred HHHH T ss_conf 9887 No 14 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=98.73 E-value=2e-05 Score=51.97 Aligned_cols=154 Identities=12% Similarity=0.089 Sum_probs=118.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 99999999999999999841387666999999973125520699999999999951100577899999987312689578 Q gi|254780421|r 46 HEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAV 125 (218) Q Consensus 46 q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~ 125 (218) ...+..++.......+...-+|++..|.+.+.+-.+.++. ..+..+..|......|+.+..-..+....+..+.... T Consensus 77 ~~rr~~kA~~~t~~Gl~~l~EG~~~~AeK~l~k~a~~~~~--pllnyL~AArAA~~~g~~~~rd~yL~~A~e~~p~a~l- 153 (398) T PRK10747 77 VGRKRRRARKQTEQALLKLAEGDYQQVEKLMAKNADHAEQ--PVVNYLLAAEAAQQRGDEARANQHLERAAELAGNDTI- 153 (398) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHCCCCCC--HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHH- T ss_conf 8758999999999999998777999999999863435677--4889999999999888989999999998885898488- Q ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99999999996302699899999975311464235689999999999988998999999999983558998999999999 Q gi|254780421|r 126 RYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMI 205 (218) Q Consensus 126 ~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~ 205 (218) ...+..+.+++++++++.|...|..+... .|-....+.++..+|...||.+.-......+-...-.++.--.+.+.. T Consensus 154 --av~lt~Ael~l~~~q~e~Ala~L~~L~~~-~p~~~~VL~Ll~~~y~~~~dW~aL~~LLp~L~K~~~l~~~e~~~L~~~ 230 (398) T PRK10747 154 --PVEITRVRLQLARNENHAARHGVDKLLEV-TPRHPEVLRLAEQAYIRTGAWSSLLDIIPSMAKAHVGDEEHRAMLEQQ 230 (398) T ss_pred --HHHHHHHHHHHHCCCHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf --99999999999779989999999999861-999999999999999987459999999999987579999999999999 No 15 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=98.68 E-value=2.1e-07 Score=64.18 Aligned_cols=102 Identities=22% Similarity=0.288 Sum_probs=88.2 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 9999999998413876669999999731255-206999999999999511005778999999873126895789999999 Q gi|254780421|r 54 GENFAQALELFNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQ 132 (218) Q Consensus 54 s~~y~~al~~~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~ 132 (218) ...|..+...+..|+++.|+..|..++..+| ++|++=|++=++.+++..|++.+|+..|..++..-+ ....+.=|.|+ T Consensus 3 ~~~y~aa~~~lk~gdY~~A~~~F~~F~~~YP~S~y~pnA~YWLGe~~y~~~~~~~A~~~f~~v~~~yp-~s~KaPdALLK 81 (119) T TIGR02795 3 EAYYDAALLVLKAGDYADAIQAFQAFLKKYPKSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYP-KSPKAPDALLK 81 (119) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCHHHHHH T ss_conf 67899999998538878999999999886879876710547999999997339999999999986458-99860689999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHCCC Q ss_conf 999630269989999997531146 Q gi|254780421|r 133 AASILVDTSSYEEISKILQKLSEP 156 (218) Q Consensus 133 la~~~~~~~~~d~a~~~l~~~~~~ 156 (218) ++.++...++.++|..+|+.+... T Consensus 82 lg~~~~~~g~~~~A~~~l~~V~~~ 105 (119) T TIGR02795 82 LGMSLQELGDKEKAKATLQQVIKR 105 (119) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 999998728988999999999856 No 16 >PRK11788 hypothetical protein; Provisional Probab=98.67 E-value=1.1e-05 Score=53.59 Aligned_cols=133 Identities=17% Similarity=0.147 Sum_probs=64.2 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHH-- Q ss_conf 999999998413876669999999731255206999999999999511005778999999873126895789999999-- Q gi|254780421|r 55 ENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQ-- 132 (218) Q Consensus 55 ~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~-- 132 (218) ..|.-+.+....|-+|.|...|.++.+.+ .|..-|.-.+..++...+++++|++....+..... ...-..+|.+. T Consensus 109 a~~ELa~Dy~~aGllDRAE~~f~~l~~~~--~~~~~al~~Ll~iye~~~dW~~Ai~~a~~L~~~~~-~~~~~~iAhy~CE 185 (389) T PRK11788 109 ALQELGQDYLKAGLLDRAEELFLQLVDED--DFAEGALRQLLTIYQQEKDWQKAIEVAERLEKLGG-DSLRKEIAHFYCE 185 (389) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCC--CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCCHHHHHHHHHH T ss_conf 99999999986774999999999998285--42699999999999986459999999999987047-6506999999999 Q ss_pred HHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 99963026998999999753114642356899999999999889989999999999835 Q gi|254780421|r 133 AASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 133 la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) +|...+..++.+.|.+.+..-..- .|-...|..++|+++..+|++.+|++.|+.+... T Consensus 186 lA~~~~~~~~~~~A~~~L~kAL~~-dp~cvRA~lllg~l~~~~g~~~~Ai~~l~~i~~q 243 (389) T PRK11788 186 LAQQALARGDLDAARALLKKALAA-DPQCVRASILLGDLALAQGDYAAAIEALERVEEQ 243 (389) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 999998748999999999999874-8243418999999999879999999999999973 No 17 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=98.57 E-value=2.3e-05 Score=51.63 Aligned_cols=166 Identities=14% Similarity=0.083 Sum_probs=117.0 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 99999999999998413876669999999731255-20699999999999951100577899999987312689578999 Q gi|254780421|r 50 KNIVGENFAQALELFNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYA 128 (218) Q Consensus 50 ~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~l 128 (218) ..-+++.|.+++..++.|++++|+..|+.+...++ +.|+..+.+.++..++..++++.|+...+..+...+..+. .+. T Consensus 31 ~~p~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~p~~~~~~qa~l~l~yA~Yk~~~y~~A~~~~drFi~lyP~~~n-~dY 109 (254) T COG4105 31 NLPASELYNEGLTELQKGNYEEAIKYFEALDSRHPFSPYSEQAQLDLAYAYYKNGEYDLALAYIDRFIRLYPTHPN-ADY 109 (254) T ss_pred CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCC-HHH T ss_conf 7889999999999986378899999999998709998112899999999988135589999999999987899987-169 Q ss_pred HHHHHHHHHHCCCC-----HH---HHHHHHHHHCC--CCCHHHHHHHH--------------HHHHHHHHCCCHHHHHHH Q ss_conf 99999996302699-----89---99999753114--64235689999--------------999999988998999999 Q gi|254780421|r 129 ATLQAASILVDTSS-----YE---EISKILQKLSE--PSNPMHQFANE--------------ILGISALKFGKVQKAKTI 184 (218) Q Consensus 129 A~l~la~~~~~~~~-----~d---~a~~~l~~~~~--~~~~~~~~a~e--------------llg~~~~~~Gd~~~A~~~ 184 (218) +.+-.+...+...+ +. ++...+..+.. |+++|...+.. ..|..|++.|....|... T Consensus 110 ~~YlkgLs~~~~i~~~~rDq~~~~~A~~~f~~~i~ryPnS~Ya~dA~~~i~~~~d~LA~~Em~IaryY~kr~~~~AA~nR 189 (254) T COG4105 110 AYYLKGLSYFFQIDDVTRDQSAARAAFAAFKELVQRYPNSRYAPDAKARIVKLNDALAGHEMAIARYYLKRGAYVAAINR 189 (254) T ss_pred HHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHH T ss_conf 99999998752677544477999999999999998788970235599999999999889999999999990576999999 Q ss_pred HHHHHHCCCCCHHHHH---HHHHHHHHHCCCHHHH Q ss_conf 9999835589989999---9999999500340320 Q gi|254780421|r 185 FEELAKDNNSPFGIST---RSQMILANIIASDQRN 216 (218) Q Consensus 185 y~~i~~~~~~p~~~r~---ra~~~l~~l~~~~~~~ 216 (218) ++.++.+-...+..+. |..+..-.++-.++++ T Consensus 190 ~~~v~e~y~~t~~~~eaL~~l~eaY~~lgl~~~a~ 224 (254) T COG4105 190 FEEVLENYPDTSAVREALARLEEAYYALGLTDEAK 224 (254) T ss_pred HHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHH T ss_conf 99998714555204778999999999828727899 No 18 >TIGR02795 tol_pal_ybgF tol-pal system protein YbgF; InterPro: IPR014162 Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, and a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.. Probab=98.49 E-value=2.1e-06 Score=58.08 Aligned_cols=107 Identities=19% Similarity=0.200 Sum_probs=92.3 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCC--CCCHHHHHHHHHH Q ss_conf 99999999951100577899999987312689578999999999963026998999999753114--6423568999999 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSE--PSNPMHQFANEIL 168 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~--~~~~~~~~a~ell 168 (218) +.+..|...+.+||++.|+..|+..+.+-+ ++.+..=|.+=++..+..++++++|...+..+.. |.++..+-++--+ T Consensus 4 ~~y~aa~~~lk~gdY~~A~~~F~~F~~~YP-~S~y~pnA~YWLGe~~y~~~~~~~A~~~f~~v~~~yp~s~KaPdALLKl 82 (119) T TIGR02795 4 AYYDAALLVLKAGDYADAIQAFQAFLKKYP-KSTYAPNAHYWLGEAYYAQGDYADAAKAFLAVVKKYPKSPKAPDALLKL 82 (119) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCC-CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHH T ss_conf 789999999853887899999999988687-9876710547999999997339999999999986458998606899999 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCC-CCCHHH Q ss_conf 999998899899999999998355-899899 Q gi|254780421|r 169 GISALKFGKVQKAKTIFEELAKDN-NSPFGI 198 (218) Q Consensus 169 g~~~~~~Gd~~~A~~~y~~i~~~~-~~p~~~ 198 (218) |.+..+.|+.++|+..|++|+..= .+|... T Consensus 83 g~~~~~~g~~~~A~~~l~~V~~~YP~s~aA~ 113 (119) T TIGR02795 83 GMSLQELGDKEKAKATLQQVIKRYPGSSAAK 113 (119) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHH T ss_conf 9999872898899999999985669987889 No 19 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.43 E-value=4.9e-06 Score=55.75 Aligned_cols=111 Identities=18% Similarity=0.162 Sum_probs=89.2 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCC--CCCHHHHHHHHHHH Q ss_conf 9999999951100577899999987312689578999999999963026998999999753114--64235689999999 Q gi|254780421|r 92 NMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSE--PSNPMHQFANEILG 169 (218) Q Consensus 92 ~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~--~~~~~~~~a~ellg 169 (218) .+..|--++..||+..|...|...+.+-+ ...+..-|.|-++..++..+++++|...+..... |.+|..+-++.-+| T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP-~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~KApdallKlg 222 (262) T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKYP-NSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSPKAPDALLKLG 222 (262) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 99999999983878899999999997099-97556024999889998624616799999999983899998828999999 Q ss_pred HHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHHHHH Q ss_conf 999988998999999999983-5589989999999 Q gi|254780421|r 170 ISALKFGKVQKAKTIFEELAK-DNNSPFGISTRSQ 203 (218) Q Consensus 170 ~~~~~~Gd~~~A~~~y~~i~~-~~~~p~~~r~ra~ 203 (218) .+..+.|+.++|+..|++++. -+.+|...+.+.. T Consensus 223 ~~~~~l~~~d~A~atl~qv~k~YP~t~aA~~Ak~~ 257 (262) T COG1729 223 VSLGRLGNTDEACATLQQVIKRYPGTDAAKLAKVA 257 (262) T ss_pred HHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 99998217799999999999878997799999998 No 20 >COG4235 Cytochrome c biogenesis factor [Posttranslational modification, protein turnover, chaperones] Probab=98.43 E-value=8.1e-05 Score=48.21 Aligned_cols=188 Identities=13% Similarity=0.098 Sum_probs=117.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH-HHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 519999999999999999999999999999999-99999999-9841387666999999973125520699999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAIWFYLFDGSHEKKKNIV-GENFAQAL-ELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASI 98 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~a-s~~y~~al-~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~ 98 (218) .||...+++++++...+.+..|-.|+.-...+. ......-. ......+.+..+..++.-+...|+.-. --..++.+ T Consensus 88 ~~~~~~~~v~~~~v~~~~y~~~g~y~~vg~~~q~~~r~~~~~a~~~~~~~~~~l~a~Le~~L~~nP~d~e--gW~~Lg~~ 165 (287) T COG4235 88 KKWLLARVVALVAVLALSYLLLGLYQAVGAPEQPADRLADPLAQPPAEQEMEALIARLETHLQQNPGDAE--GWDLLGRA 165 (287) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHHHCCCCCCCCCCCHHHHHHHHHHHHHHCCCCCH--HHHHHHHH T ss_conf 3010578999999999999986576524785430034543201677733599999999999984988724--49999999 Q ss_pred HHHCCHHHHHHHHHHHHHHHCCCCHHH-HHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 951100577899999987312689578-9999999999630269989999997531146423568999999999998899 Q gi|254780421|r 99 LVAKGDTKNAAEIFLKVANDDLAPLAV-RYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGK 177 (218) Q Consensus 99 ~~~~g~~~~A~~~l~~i~~~~~~~~~~-~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd 177 (218) ++..|+...|...|...+.-...++.+ -.+|.. -+..-+...-.++...|.....- +|....++.++|..+..+|| T Consensus 166 ym~~~~~~~A~~AY~~A~rL~g~n~~~~~g~aea--L~~~a~~~~ta~a~~ll~~al~~-D~~~iral~lLA~~afe~g~ 242 (287) T COG4235 166 YMALGRASDALLAYRNALRLAGDNPEILLGLAEA--LYYQAGQQMTAKARALLRQALAL-DPANIRALSLLAFAAFEQGD 242 (287) T ss_pred HHHHCCHHHHHHHHHHHHHHCCCCHHHHHHHHHH--HHHHCCCCCCHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHHCCC T ss_conf 9984443479999999987379988999999999--99953884449999999999843-97328899999999998145 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCH Q ss_conf 899999999998355899899999999999500340 Q gi|254780421|r 178 VQKAKTIFEELAKDNNSPFGISTRSQMILANIIASD 213 (218) Q Consensus 178 ~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~~~ 213 (218) +.+|...++.++...++..+-|..++..+....... T Consensus 243 ~~~A~~~Wq~lL~~lp~~~~rr~~ie~~ia~~~~~~ 278 (287) T COG4235 243 YAEAAAAWQMLLDLLPADDPRRSLIERSIARALAQR 278 (287) T ss_pred HHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCC T ss_conf 999999999998518999823789999999987410 No 21 >COG2976 Uncharacterized protein conserved in bacteria [Function unknown] Probab=98.38 E-value=0.00013 Score=46.85 Aligned_cols=159 Identities=13% Similarity=0.066 Sum_probs=110.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 89999999975199999999999999999999999999999999999999998413876669999999731255206999 Q gi|254780421|r 11 KGKCKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPL 90 (218) Q Consensus 11 ~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~l 90 (218) .+-|.-.|.-+|--...++++++.++++++.|++||.++.+.+.++=.. ++.++.... T Consensus 8 ~qql~~ik~wwkeNGk~li~gviLg~~~lfGW~ywq~~q~~q~~~AS~~---------Y~~~i~~~~------------- 65 (207) T COG2976 8 QQQLEAIKDWWKENGKALIVGVILGLGGLFGWRYWQSHQVEQAQEASAQ---------YQNAIKAVQ------------- 65 (207) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHH------------- T ss_conf 9999999999997790789999999999998899999878888899999---------999999985------------- Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC--CCCCHHHHHHHHHH Q ss_conf 9999999995110057789999998731268957899999999996302699899999975311--46423568999999 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLS--EPSNPMHQFANEIL 168 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~--~~~~~~~~~a~ell 168 (218) .++. +.+......+..++ ...+..++.+.++....+.+++++|+..|+... +.++.+..++.-.+ T Consensus 66 -----------ak~~-~~~~~~ekf~~~n~-~t~Ya~laaL~lAk~~ve~~~~d~A~aqL~~~l~~t~De~lk~l~~lRL 132 (207) T COG2976 66 -----------AKKP-KSIAAAEKFVQANG-KTIYAVLAALELAKAEVEANNLDKAEAQLKQALAQTKDENLKALAALRL 132 (207) T ss_pred -----------CCCC-HHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf -----------5770-25999999986156-3589999999999999863429999999999871452578999999999 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 99999889989999999999835589989999999999950 Q gi|254780421|r 169 GISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 169 g~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) +.+.+.+|.+++|.+.+..+-+. +...|...+..++ T Consensus 133 Arvq~q~~k~D~AL~~L~t~~~~-----~w~~~~~elrGDi 168 (207) T COG2976 133 ARVQLQQKKADAALKTLDTIKEE-----SWAAIVAELRGDI 168 (207) T ss_pred HHHHHHHHHHHHHHHHHHCCCCC-----CHHHHHHHHHHHH T ss_conf 99999860388899987230111-----0788999974269 No 22 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=98.37 E-value=0.00018 Score=46.14 Aligned_cols=145 Identities=14% Similarity=0.146 Sum_probs=93.6 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC-CH--HHHH------- Q ss_conf 9999984138766699999997312552069999999999995110057789999998731268-95--7899------- Q gi|254780421|r 58 AQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLA-PL--AVRY------- 127 (218) Q Consensus 58 ~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~-~~--~~~~------- 127 (218) ...+...+.|+...|...|...+...++-...+..| +.+++.+|+..+|...|......... +. -++. T Consensus 275 ~~Gl~aL~~G~l~aAe~~l~~AL~~~PnD~daLGgL--G~v~lRqG~~~eA~~yf~rA~~~d~~~~~~~kW~sll~ta~y 352 (1158) T PRK11447 275 QQGLAAVDSGMGGKAIAELQQAVRANPKDSEALGAL--GQAYSQRGDRARAVAQFEKALALDPHSSNRDKWESLLKVNRY 352 (1158) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHH--HHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHH T ss_conf 989999977997889999999997199987899888--999996588899999999999708998546889999988789 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHHHHHHH Q ss_conf 999999996302699899999975311464235689999999999988998999999999983-558998999999999 Q gi|254780421|r 128 AATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK-DNNSPFGISTRSQMI 205 (218) Q Consensus 128 lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~-~~~~p~~~r~ra~~~ 205 (218) -..+..+...+..+++++|...+.....-+ |-...+.-.+|+++..+||+.+|...|+.++. +++.+..+|..+... T Consensus 353 W~~l~qa~~A~~~g~~~~A~~l~~~A~~~~-p~~~~A~l~LaD~~~a~~d~~~AE~~Yr~~L~~~~~N~~AlrGL~~l~ 430 (1158) T PRK11447 353 WLLIQQGDAALKANNLAQAERLYQQARQVD-NTDSYAVLGLGDVAMARKDYAAAERYYQQTLRMDSGNTNAVRGLANIY 430 (1158) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHH T ss_conf 999999999998799899999999998559-998559999999999839979999999999974899878999999999 No 23 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=98.37 E-value=0.00012 Score=47.25 Aligned_cols=152 Identities=15% Similarity=0.175 Sum_probs=104.5 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 9999999999999841387666999999973125520-699999999999951100577899999987312689578999 Q gi|254780421|r 50 KNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKL-YNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYA 128 (218) Q Consensus 50 ~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~-ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~l 128 (218) ....+..|..++.....+.++.|...+..++...|+. |. .-....++++.|+.++|.+.++..+.-.+.. .+ T Consensus 303 ~~~~aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~P~N~~~---~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~----~~ 375 (484) T COG4783 303 RGGLAAQYGRALQTYLAGQYDEALKLLQPLIAAQPDNPYY---LELAGDILLEANKAKEAIERLKKALALDPNS----PL 375 (484) T ss_pred CCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCHHH---HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC----CH T ss_conf 3653778889999998327015789999999848998799---9999999987588677999999998408985----07 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHH-----------------HHHHHHHHHC Q ss_conf 99999996302699899999975311464235689999999999988998999-----------------9999999835 Q gi|254780421|r 129 ATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKA-----------------KTIFEELAKD 191 (218) Q Consensus 129 A~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A-----------------~~~y~~i~~~ 191 (218) -.+.++..++..+++.+++..|+......+ -.+.-+.+++.+|-.+||..+| +..+..+... T Consensus 376 l~~~~a~all~~g~~~eai~~L~~~~~~~p-~dp~~w~~LAqay~~~g~~~~a~~A~AE~~~~~G~~~~A~~~l~~A~~~ 454 (484) T COG4783 376 LQLNLAQALLKGGKPQEAIRILNRYLFNDP-EDPNGWDLLAQAYAELGNRAEALLARAEGYALAGRLEQAIIFLMRASQQ 454 (484) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCCHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 999999999956997899999999864487-7736999999999984761779999999998678878999999999986 Q ss_pred CCCCHHHHHHHHHHHHHH Q ss_conf 589989999999999950 Q gi|254780421|r 192 NNSPFGISTRSQMILANI 209 (218) Q Consensus 192 ~~~p~~~r~ra~~~l~~l 209 (218) ......-..|+.-++..+ T Consensus 455 ~~~~~~~~aR~dari~~~ 472 (484) T COG4783 455 VKLGFPDWARADARIDQL 472 (484) T ss_pred CCCCCHHHHHHHHHHHHH T ss_conf 368847899889999998 No 24 >TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795). Probab=98.27 E-value=0.00018 Score=46.15 Aligned_cols=129 Identities=14% Similarity=0.124 Sum_probs=102.7 Q ss_pred HHHHHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHH--------CCHHHHHHHHHHHHHHHCCCCHHHHHHHH- Q ss_conf 998413876669999999731255-20699999999999951--------10057789999998731268957899999- Q gi|254780421|r 61 LELFNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASILVA--------KGDTKNAAEIFLKVANDDLAPLAVRYAAT- 130 (218) Q Consensus 61 l~~~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~~~--------~g~~~~A~~~l~~i~~~~~~~~~~~~lA~- 130 (218) -.....+++++|+..++++++.+| +....-|.+..|..+++ +....+|+..++.+++.-+.. .+..-|+ T Consensus 78 ~a~y~~~~y~~A~~~~~~Fi~~yP~~~~~~ya~y~~a~s~~~~~~~~~rDq~~~~~A~~~f~~~i~~yP~S-~ya~dA~~ 156 (235) T TIGR03302 78 YAYYKSGDYAEAIAAADRFIRLHPNHPDADYAYYLRGLSNYKQIDDSDRDQTAAREAFEAFQELIRRYPNS-EYAPDAKK 156 (235) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHCCCC-HHHHHHHH T ss_conf 99997643999999999999988774349999999879988736654458089999999999999888996-57999999 Q ss_pred -------------HHHHHHHHCCCCHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf -------------9999963026998999999753114--64235689999999999988998999999999983 Q gi|254780421|r 131 -------------LQAASILVDTSSYEEISKILQKLSE--PSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 131 -------------l~la~~~~~~~~~d~a~~~l~~~~~--~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) +..|..++..+.+..|+.+++.+.. |++++..-++..++.+|.+.|..++|++.++.+.. T Consensus 157 ~i~~l~~~LA~~e~~ia~~Y~k~~~y~aAi~r~~~vl~~Yp~t~~~~eAl~~l~eay~~lg~~~~A~~~~~~l~~ 231 (235) T TIGR03302 157 RMDYLRNRLAGKELYVARFYLKRGAYVAAINRFETVVENYPDTPATEEALARLVEAYLALGLTDLAQDAAAVLGA 231 (235) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 999999999999999999999841709999999999986899851399999999999986998999999999975 No 25 >KOG4626 consensus Probab=98.25 E-value=0.00014 Score=46.81 Aligned_cols=133 Identities=12% Similarity=0.075 Sum_probs=85.4 Q ss_pred HHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 41387666999999973125520699999999999951100577899999987312689578999999999963026998 Q gi|254780421|r 64 FNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSY 143 (218) Q Consensus 64 ~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~ 143 (218) .+.++.+.|...+..-.+-.+... .|+-.+|.++-.+|++++|+..|+.++.- .+.+.+ |.-.++..+-+.++. T Consensus 365 ~E~~~~e~A~~ly~~al~v~p~~a--aa~nNLa~i~kqqgnl~~Ai~~YkealrI---~P~fAd-a~~NmGnt~ke~g~v 438 (966) T KOG4626 365 REQGKIEEATRLYLKALEVFPEFA--AAHNNLASIYKQQGNLDDAIMCYKEALRI---KPTFAD-ALSNMGNTYKEMGDV 438 (966) T ss_pred HHHCCCHHHHHHHHHHHHHCHHHH--HHHHHHHHHHHHCCCHHHHHHHHHHHHHC---CCHHHH-HHHHCCHHHHHHHHH T ss_conf 886122188999999986380245--56422999999605477899999988714---940789-987504277886417 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHHHHH Q ss_conf 99999975311464235689999999999988998999999999983-5589989999999 Q gi|254780421|r 144 EEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK-DNNSPFGISTRSQ 203 (218) Q Consensus 144 d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~-~~~~p~~~r~ra~ 203 (218) +.|+.....-..-...| +-|+--+|.+|...|+..+|+..|+..+. .+|.|+....++- T Consensus 439 ~~A~q~y~rAI~~nPt~-AeAhsNLasi~kDsGni~~AI~sY~~aLklkPDfpdA~cNllh 498 (966) T KOG4626 439 SAAIQCYTRAIQINPTF-AEAHSNLASIYKDSGNIPEAIQSYRTALKLKPDFPDAYCNLLH 498 (966) T ss_pred HHHHHHHHHHHHCCCHH-HHHHHHHHHHHHCCCCCHHHHHHHHHHHCCCCCCCHHHHHHHH T ss_conf 88999999988639078-8987547888603688188999999987169999435667999 No 26 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=98.25 E-value=2.7e-05 Score=51.13 Aligned_cols=97 Identities=23% Similarity=0.229 Sum_probs=80.8 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999999951100577899999987312689578999999999963026998999999753114642356899999999 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGI 170 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~ 170 (218) +.+.++..++..|++++|+..|+.++...+.+. .+.+.++.++...+++++|+..+.....-. |-....+..+|. T Consensus 2 ~~~~lg~~~~~~g~~~~A~~~~~~al~~~p~~~----~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~-~~~~~~~~~la~ 76 (100) T cd00189 2 ALLNLGNLYYKLGDYDEALEYYEKALELDPDNA----DAYYNLAAAYYKLGKYEEALEDYEKALELD-PDNAKAYYNLGL 76 (100) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH----HHHHHHHHHHHHCCCHHHHHHHHHHHHHHC-CCCHHHHHHHHH T ss_conf 899999999984999999999999998599989----999999999998598589999999999869-898999999999 Q ss_pred HHHHCCCHHHHHHHHHHHHHCC Q ss_conf 9998899899999999998355 Q gi|254780421|r 171 SALKFGKVQKAKTIFEELAKDN 192 (218) Q Consensus 171 ~~~~~Gd~~~A~~~y~~i~~~~ 192 (218) ++...|+.++|.+.|++++.-. T Consensus 77 ~~~~~~~~~~A~~~~~~al~~~ 98 (100) T cd00189 77 AYYKLGKYEEALEAYEKALELD 98 (100) T ss_pred HHHHCCCHHHHHHHHHHHHHHC T ss_conf 9999499999999999999749 No 27 >COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism] Probab=98.18 E-value=0.00044 Score=43.64 Aligned_cols=150 Identities=21% Similarity=0.236 Sum_probs=85.3 Q ss_pred HHHHHHHHHHHHHHHHH---------------HCCCHHHHHHHHHHHHHCCCCCHH--HHHHHHHHHHHHHCCHHHHHHH Q ss_conf 99999999999999984---------------138766699999997312552069--9999999999951100577899 Q gi|254780421|r 48 KKKNIVGENFAQALELF---------------NSNKLDDARSSFEKILSQDNKLYN--PLSNMYIASILVAKGDTKNAAE 110 (218) Q Consensus 48 ~~~~~as~~y~~al~~~---------------~~~~~~~A~~~l~~l~~~~~~~ya--~lA~l~lA~~~~~~g~~~~A~~ 110 (218) .+..++-+.|...+... +-|..|.|+..-+.+.++++.++. .+|.+.+|+-++..|=++-|.. T Consensus 49 ~Q~dKAvdlF~e~l~~d~~t~e~~ltLGnLfRsRGEvDRAIRiHQ~L~~spdlT~~qr~lAl~qL~~Dym~aGl~DRAE~ 128 (389) T COG2956 49 NQPDKAVDLFLEMLQEDPETFEAHLTLGNLFRSRGEVDRAIRIHQTLLESPDLTFEQRLLALQQLGRDYMAAGLLDRAED 128 (389) T ss_pred CCCCHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 69636999999998358156789889888988635088999999988528998668999999999899998564668999 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHH---------------------------------------HHHHHH Q ss_conf 99998731268957899999999996302699899---------------------------------------999975 Q gi|254780421|r 111 IFLKVANDDLAPLAVRYAATLQAASILVDTSSYEE---------------------------------------ISKILQ 151 (218) Q Consensus 111 ~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~---------------------------------------a~~~l~ 151 (218) .|..+++.. .++.-|+-.+..|+-.+.+.++ |...+. T Consensus 129 ~f~~L~de~----efa~~AlqqLl~IYQ~treW~KAIe~A~~L~k~~~q~~~~eIAqfyCELAq~~~~~~~~d~A~~~l~ 204 (389) T COG2956 129 IFNQLVDEG----EFAEGALQQLLNIYQATREWEKAIDVAERLVKLGGQTYRVEIAQFYCELAQQALASSDVDRARELLK 204 (389) T ss_pred HHHHHHCCH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999973463----4328999999999998368999999999999718852125899999999999740311899999999 Q ss_pred HHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 31146423568999999999998899899999999998355899899999999 Q gi|254780421|r 152 KLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQM 204 (218) Q Consensus 152 ~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~ 204 (218) .-... +|-...+..++|+++..+|++++|++.|+.++... |..+..-.++ T Consensus 205 kAlqa-~~~cvRAsi~lG~v~~~~g~y~~AV~~~e~v~~Qn--~~yl~evl~~ 254 (389) T COG2956 205 KALQA-DKKCVRASIILGRVELAKGDYQKAVEALERVLEQN--PEYLSEVLEM 254 (389) T ss_pred HHHHH-CCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC--HHHHHHHHHH T ss_conf 99862-82001035454388875266699999999999849--5779999999 No 28 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=98.17 E-value=0.00016 Score=46.33 Aligned_cols=123 Identities=15% Similarity=0.087 Sum_probs=88.1 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 87666999999973125520699999999999951100577899999987312689578999999999963026998999 Q gi|254780421|r 67 NKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEI 146 (218) Q Consensus 67 ~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a 146 (218) ..+..+...+.+++.-.+....+.+-+..+-.+...|++.+|++.|.+.+.=. .++.-+..+.+.++.++..++++++| T Consensus 13 k~ftv~ad~~~k~~p~~~~~k~A~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le-~dp~D~s~ilyNiGlIY~~~G~~~kA 91 (172) T PRK02603 13 KSFTVMADLILKILPINKKAKEAFVYYRDGMSAQNDGDYAEALENYEEALKLE-EDPNDRSYILYNMAIIYASNGEHDKA 91 (172) T ss_pred HHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCC-CCCCCHHHHHHHHHHHHHHCCCHHHH T ss_conf 24899999999981588633689999972257774579999999999987277-89620899997799999986998899 Q ss_pred HHHHHHHCCCCCHHHHHHHHHHHHHHHHCCC--------------HHHHHHHHHHHHHC Q ss_conf 9997531146423568999999999998899--------------89999999999835 Q gi|254780421|r 147 SKILQKLSEPSNPMHQFANEILGISALKFGK--------------VQKAKTIFEELAKD 191 (218) Q Consensus 147 ~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd--------------~~~A~~~y~~i~~~ 191 (218) +........-. |-.+.++--+|.+|.++|. +++|.+.|.+++.- T Consensus 92 ie~Y~kALeiN-P~~~~AlNNlgvIY~k~G~~a~~~~~~~~~~~~fdkAa~yw~kAI~l 149 (172) T PRK02603 92 LETYHQALELN-PKQPSALNNIAVIYHKRGRKAEEAGDQDEAEAWFDKAAEYWKQAIRL 149 (172) T ss_pred HHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999868-99789999999999998599998458778999999999999998850 No 29 >PRK11447 cellulose synthase subunit BcsC; Provisional Probab=98.15 E-value=0.00058 Score=42.95 Aligned_cols=149 Identities=16% Similarity=0.094 Sum_probs=98.3 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 99999984138766699999997312552069999999999995110057789999998731268957899999999996 Q gi|254780421|r 57 FAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASI 136 (218) Q Consensus 57 y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~ 136 (218) ..++-...+.++++.|...++......++. ..|.+.+|.++..+|++..|...|+.++...+.+. -|.-.++.+ T Consensus 356 l~qa~~A~~~g~~~~A~~l~~~A~~~~p~~--~~A~l~LaD~~~a~~d~~~AE~~Yr~~L~~~~~N~----~AlrGL~~l 429 (1158) T PRK11447 356 IQQGDAALKANNLAQAERLYQQARQVDNTD--SYAVLGLGDVAMARKDYAAAERYYQQTLRMDSGNT----NAVRGLANI 429 (1158) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH----HHHHHHHHH T ss_conf 999999998799899999999998559998--55999999999983997999999999997489987----899999999 Q ss_pred HHCCCCHHHHHHHHHHHCCCCC-----H---HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHH Q ss_conf 3026998999999753114642-----3---56899999999999889989999999999835-5899899999999999 Q gi|254780421|r 137 LVDTSSYEEISKILQKLSEPSN-----P---MHQFANEILGISALKFGKVQKAKTIFEELAKD-NNSPFGISTRSQMILA 207 (218) Q Consensus 137 ~~~~~~~d~a~~~l~~~~~~~~-----~---~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~-~~~p~~~r~ra~~~l~ 207 (218) +..++ .+++...+..+...-. . .....+-..+..++.+||...|++.|++.+.. ++.|. +|-+.--.+- T Consensus 430 ~~~q~-~~~A~~ll~~l~~~q~~~~~~~~~~l~~~~l~~qA~~~~~~Gd~~~A~~~l~qa~~~~P~dpW-lR~~LAr~~~ 507 (1158) T PRK11447 430 YRQQS-PEKALAFIASLSASQRRSIDDIERSLQNDRLAQQAEALENQGKWAQAAELQRQRLALDPGSVW-ITYRLAQDLW 507 (1158) T ss_pred HHHHC-HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHH T ss_conf 98639-999999998379988877631444546799999999999788999999999999971899704-9999999999 Q ss_pred HHCCCH Q ss_conf 500340 Q gi|254780421|r 208 NIIASD 213 (218) Q Consensus 208 ~l~~~~ 213 (218) ..+-.+ T Consensus 508 ~~G~~~ 513 (1158) T PRK11447 508 QAGQRS 513 (1158) T ss_pred HCCCHH T ss_conf 779988 No 30 >KOG2376 consensus Probab=98.12 E-value=0.00027 Score=44.99 Aligned_cols=146 Identities=17% Similarity=0.191 Sum_probs=94.2 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCH------------- Q ss_conf 9999998413876669999999731255206999999999999511005778999999873126895------------- Q gi|254780421|r 57 FAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPL------------- 123 (218) Q Consensus 57 y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~------------- 123 (218) |..|-..+..+..|+|+..++.+-.. + .-..-..|.+++..|+++++...|+.++.++..+. T Consensus 83 fEKAYc~Yrlnk~Dealk~~~~~~~~-~----~~ll~L~AQvlYrl~~ydealdiY~~L~kn~~dd~d~~~r~nl~a~~a 157 (652) T KOG2376 83 FEKAYCEYRLNKLDEALKTLKGLDRL-D----DKLLELRAQVLYRLERYDEALDIYQHLAKNNSDDQDEERRANLLAVAA 157 (652) T ss_pred HHHHHHHHHCCCHHHHHHHHHCCCCC-C----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH T ss_conf 78888998705678899987503556-6----188999999999875389999999999855870578999978999887 Q ss_pred ----------------HHHHHHHHHHHHHHHCCCCHHHHHHHHHHH--------CCCCCH------HHHHHHHHHHHHHH Q ss_conf ----------------789999999999630269989999997531--------146423------56899999999999 Q gi|254780421|r 124 ----------------AVRYAATLQAASILVDTSSYEEISKILQKL--------SEPSNP------MHQFANEILGISAL 173 (218) Q Consensus 124 ----------------~~~~lA~l~la~~~~~~~~~d~a~~~l~~~--------~~~~~~------~~~~a~ellg~~~~ 173 (218) .+. -.|..+++++..+++.+|++.|..- ..++.. --...+-.++.++. T Consensus 158 ~l~~~~~q~v~~v~e~sye--l~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ 235 (652) T KOG2376 158 ALQVQLLQSVPEVPEDSYE--LLYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQ 235 (652) T ss_pred HHHHHHHHHCCCCCCCHHH--HHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHH T ss_conf 4047798745588854299--99999999985034899999999999988876314665306688887789999999999 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 889989999999999835589989999999999950 Q gi|254780421|r 174 KFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 174 ~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) .+|++++|.++|..++.....+.+....+.--|..+ T Consensus 236 ~~Gqt~ea~~iy~~~i~~~~~D~~~~Av~~NNLva~ 271 (652) T KOG2376 236 LQGQTAEASSIYVDIIKRNPADEPSLAVAVNNLVAL 271 (652) T ss_pred HHCCHHHHHHHHHHHHHHCCCCCHHHHHHHCCHHHH T ss_conf 943308789999999872688822789996032542 No 31 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=98.05 E-value=0.00094 Score=41.63 Aligned_cols=158 Identities=15% Similarity=0.143 Sum_probs=122.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC Q ss_conf 99999999999999999999984138766699999997312552069999999999995110057789999998731268 Q gi|254780421|r 42 FDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLA 121 (218) Q Consensus 42 ~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~ 121 (218) |.+....+..++.....+++.....|++..|...+.+=.+.++- ..++.+.-|...-.-||.+.+-..+..+.+-... T Consensus 73 ~~w~~~rKrrra~~~~~egl~~l~eG~~~qAEkl~~rnae~~e~--p~l~~l~aA~AA~qrgd~~~an~yL~eaae~~~~ 150 (400) T COG3071 73 RGWFSRRKRRRARKALNEGLLKLFEGDFQQAEKLLRRNAEHGEQ--PVLAYLLAAEAAQQRGDEDRANRYLAEAAELAGD 150 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 88988889999999999999987067389999999976534734--0899999999988630577898999998625899 Q ss_pred CHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 95789999999999630269989999997531146423568999999999998899899999999998355899899999 Q gi|254780421|r 122 PLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTR 201 (218) Q Consensus 122 ~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~r 201 (218) +.. ...+..+.++++.+++..|...+..+.... |-..-.+-+.-.+|++.|+..+.-.+..++.+..-.++.-..| T Consensus 151 ~~l---~v~ltrarlll~~~d~~aA~~~v~~ll~~~-pr~~~vlrLa~r~y~~~g~~~~ll~~l~~L~ka~~l~~~e~~~ 226 (400) T COG3071 151 DTL---AVELTRARLLLNRRDYPAARENVDQLLEMT-PRHPEVLRLALRAYIRLGAWQALLAILPKLRKAGLLSDEEAAR 226 (400) T ss_pred CHH---HHHHHHHHHHHHCCCCHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 408---899999999986788656898899998728-6886999999999998511899999889998735788299999 Q ss_pred HHHH Q ss_conf 9999 Q gi|254780421|r 202 SQMI 205 (218) Q Consensus 202 a~~~ 205 (218) .+.. T Consensus 227 le~~ 230 (400) T COG3071 227 LEQQ 230 (400) T ss_pred HHHH T ss_conf 9999 No 32 >KOG0624 consensus Probab=98.00 E-value=0.00098 Score=41.53 Aligned_cols=132 Identities=14% Similarity=0.141 Sum_probs=97.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHH----HHH-------HHHH Q ss_conf 38766699999997312552069999999999995110057789999998731268957899----999-------9999 Q gi|254780421|r 66 SNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRY----AAT-------LQAA 134 (218) Q Consensus 66 ~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~----lA~-------l~la 134 (218) -|+.--|+..|.++++-.++.++ |+++.+.+++.+|.++.|..-|+.++...+....-.. ++. .... T Consensus 85 mGksk~al~Dl~rVlelKpDF~~--ARiQRg~vllK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~ql 162 (504) T KOG0624 85 MGKSKAALQDLSRVLELKPDFMA--ARIQRGVVLLKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQQL 162 (504) T ss_pred HCCCCCCHHHHHHHHHCCCCHHH--HHHHHCHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 25774003308999861831888--8887220355325288898799999725987302688998877689999999999 Q ss_pred HHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC-CCCCHHHHH Q ss_conf 963026998999999753114642356899999999999889989999999999835-589989999 Q gi|254780421|r 135 SILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKD-NNSPFGIST 200 (218) Q Consensus 135 ~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~-~~~p~~~r~ 200 (218) ......++...++..+.-+.. -.||.+.-.+.++..|...|+...|+...+.+... .+.+...-. T Consensus 163 ~s~~~~GD~~~ai~~i~~llE-i~~Wda~l~~~Rakc~i~~~e~k~AI~Dlk~askLs~DnTe~~yk 228 (504) T KOG0624 163 KSASGSGDCQNAIEMITHLLE-IQPWDASLRQARAKCYIAEGEPKKAIHDLKQASKLSQDNTEGHYK 228 (504) T ss_pred HHHHCCCCHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 988438746559999988885-371336899999888885386888888899987404665388898 No 33 >COG1729 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.98 E-value=0.00018 Score=46.01 Aligned_cols=100 Identities=24% Similarity=0.341 Sum_probs=87.8 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 99999998413876669999999731255-20699999999999951100577899999987312689578999999999 Q gi|254780421|r 56 NFAQALELFNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAA 134 (218) Q Consensus 56 ~y~~al~~~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la 134 (218) .|..++..+..+++..|...|..+++.+| ++|.+-|+|-++..++.+|++++|...|..++.+.+..+ ...=+.+.++ T Consensus 144 ~Y~~A~~~~ksgdy~~A~~~F~~fi~~YP~s~~~~nA~yWLGe~~y~qg~y~~Aa~~f~~~~k~~P~s~-KApdallKlg 222 (262) T COG1729 144 LYNAALDLYKSGDYAEAEQAFQAFIKKYPNSTYTPNAYYWLGESLYAQGDYEDAAYIFARVVKDYPKSP-KAPDALLKLG 222 (262) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCC-CCHHHHHHHH T ss_conf 999999999838788999999999970999755602499988999862461679999999998389999-8828999999 Q ss_pred HHHHCCCCHHHHHHHHHHHCCC Q ss_conf 9630269989999997531146 Q gi|254780421|r 135 SILVDTSSYEEISKILQKLSEP 156 (218) Q Consensus 135 ~~~~~~~~~d~a~~~l~~~~~~ 156 (218) .++...++.|+|-..|..+... T Consensus 223 ~~~~~l~~~d~A~atl~qv~k~ 244 (262) T COG1729 223 VSLGRLGNTDEACATLQQVIKR 244 (262) T ss_pred HHHHHHCCHHHHHHHHHHHHHH T ss_conf 9999821779999999999987 No 34 >KOG1126 consensus Probab=97.94 E-value=0.00045 Score=43.59 Aligned_cols=120 Identities=21% Similarity=0.190 Sum_probs=66.0 Q ss_pred HHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 41387666999999973125520699999999999951100577899999987312689578999999999963026998 Q gi|254780421|r 64 FNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSY 143 (218) Q Consensus 64 ~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~ 143 (218) ..+++++.|...|++-.+-.|..-..+.++ +..+...|+.++|+..|+..+--...++ +.++.-+.+++..+++ T Consensus 500 ~Kqek~e~Ae~~fqkA~~INP~nsvi~~~~--g~~~~~~k~~d~AL~~~~~A~~ld~kn~----l~~~~~~~il~~~~~~ 573 (638) T KOG1126 500 LKQEKLEFAEFHFQKAVEINPSNSVILCHI--GRIQHQLKRKDKALQLYEKAIHLDPKNP----LCKYHRASILFSLGRY 573 (638) T ss_pred ECCCHHHHHHHHHHHHHCCCCCCHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHCCCCCC----HHHHHHHHHHHHHCCH T ss_conf 204104678998876422796103577652--5999986013279999999881687881----5678877888762236 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99999975311464235689999999999988998999999999983 Q gi|254780421|r 144 EEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 144 d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) +++...|+.+.. --|.-...+.++|.+|.+.|..+.|..+|.-+.+ T Consensus 574 ~eal~~LEeLk~-~vP~es~v~~llgki~k~~~~~~~Al~~f~~A~~ 619 (638) T KOG1126 574 VEALQELEELKE-LVPQESSVFALLGKIYKRLGNTDLALLHFSWALD 619 (638) T ss_pred HHHHHHHHHHHH-HCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHC T ss_conf 999999999998-5750578999999999997330588876677860 No 35 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=97.91 E-value=0.0011 Score=41.21 Aligned_cols=137 Identities=15% Similarity=0.141 Sum_probs=95.2 Q ss_pred HHHHHCCCHHHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHH Q ss_conf 998413876669999999731255--2069999999999995110057789999998731268957-8999999999963 Q gi|254780421|r 61 LELFNSNKLDDARSSFEKILSQDN--KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLA-VRYAATLQAASIL 137 (218) Q Consensus 61 l~~~~~~~~~~A~~~l~~l~~~~~--~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~-~~~lA~l~la~~~ 137 (218) ..+..-+++.+++..++.+..++. +.|+--+ .|..++..+.+++|..+|..++........ .-.--...+.+-+ T Consensus 297 ~aL~~r~r~~~vi~eYq~L~a~~~~~P~Ya~~a---vadayL~~~qP~kA~~iy~~l~~~~~~~~~~~~~e~~~~LfYA~ 373 (818) T PRK10049 297 GALLVRDRYADVISEYQRLKAEGQKIPDYARRW---VASAYLDRRQPEKAAPILTELFYHPETIAPDLSDEELADLFYSL 373 (818) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHH---HHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHH T ss_conf 999987659999999999974689885799999---99999863996899999999973485457653567887789998 Q ss_pred HCCCCHHHHHHHHHHHCCCCCHH--------------HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 02699899999975311464235--------------6899999999999889989999999999835589989999 Q gi|254780421|r 138 VDTSSYEEISKILQKLSEPSNPM--------------HQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGIST 200 (218) Q Consensus 138 ~~~~~~d~a~~~l~~~~~~~~~~--------------~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ 200 (218) ++.+++++|...++.+....+++ ...+..+++....-.||..+|.+.++.++...+-+..++. T Consensus 374 ~e~e~~~eA~~l~~~~~~~~p~~~~~~g~~~~~PNddy~~~~~l~a~~~~~~ndL~~Ae~~le~l~~~APgN~~L~~ 450 (818) T PRK10049 374 LESEQYPGALQVAVHTIETTPYFVRLYGLPTSIPNDDWLQGQSLLVQSLKASNDLPQAEKRLEELAYTAPGNQGLRI 450 (818) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHH T ss_conf 61111899999999865328960430688889888279999999999999757849999999999863999879999 No 36 >KOG3060 consensus Probab=97.85 E-value=0.0021 Score=39.44 Aligned_cols=145 Identities=20% Similarity=0.077 Sum_probs=100.4 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC Q ss_conf 99984138766699999997312552069999999999995110057789999998731268957899999999996302 Q gi|254780421|r 60 ALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVD 139 (218) Q Consensus 60 al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~ 139 (218) .+.+.+.+..+.|..-...+....|+++... -..|-.+-..|.+++|++.|+.++++++.+..++ -|-..+.-. T Consensus 59 ~IAAld~~~~~lAq~C~~~L~~~fp~S~RV~--~lkam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~----KRKlAilka 132 (289) T KOG3060 59 FIAALDTGRDDLAQKCINQLRDRFPGSKRVG--KLKAMLLEATGNYKEAIEYYESLLEDDPTDTVIR----KRKLAILKA 132 (289) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHCCCCHHHH--HHHHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHH----HHHHHHHHH T ss_conf 9999983313789999999998678970679--8999999986135459999999844686314899----999999997 Q ss_pred CCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 699899999975311464235689999999999988998999999999983558998999999999995003 Q gi|254780421|r 140 TSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANIIA 211 (218) Q Consensus 140 ~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~ 211 (218) +|+.-++++-+....... +-..-+.--++.+|+..|++++|.--|++++-..+..+-+-+|....+=.++| T Consensus 133 ~GK~l~aIk~ln~YL~~F-~~D~EAW~eLaeiY~~~~~f~kA~fClEE~ll~~P~n~l~f~rlae~~Yt~gg 203 (289) T KOG3060 133 QGKNLEAIKELNEYLDKF-MNDQEAWHELAEIYLSEGDFEKAAFCLEELLLIQPFNPLYFQRLAEVLYTQGG 203 (289) T ss_pred CCCCHHHHHHHHHHHHHH-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH T ss_conf 188579999999999983-57699999999999767689999999999997598718999999999999841 No 37 >KOG4626 consensus Probab=97.83 E-value=0.0013 Score=40.73 Aligned_cols=120 Identities=19% Similarity=0.174 Sum_probs=79.2 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 13876669999999731255206999999999999511005778999999873126895789999999999630269989 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYE 144 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d 144 (218) +.|..|-|+..+++-++-.+.. ..|.=.+|..+-+.|+..+|..-|+..+.-.. ...+ +...++.++.+.+..+ T Consensus 298 eqG~ldlAI~~Ykral~~~P~F--~~Ay~NlanALkd~G~V~ea~~cYnkaL~l~p---~had-am~NLgni~~E~~~~e 371 (966) T KOG4626 298 EQGLLDLAIDTYKRALELQPNF--PDAYNNLANALKDKGSVTEAVDCYNKALRLCP---NHAD-AMNNLGNIYREQGKIE 371 (966) T ss_pred CCCCHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCC---CCHH-HHHHHHHHHHHHCCCH T ss_conf 1442788899999987039995--67775799997750546799999999987587---6377-8887989998861221 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 99999753114642356899999999999889989999999999835 Q gi|254780421|r 145 EISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 145 ~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) +|.........- .|--+-+.--+|.+|..+|++++|+..|++++.- T Consensus 372 ~A~~ly~~al~v-~p~~aaa~nNLa~i~kqqgnl~~Ai~~YkealrI 417 (966) T KOG4626 372 EATRLYLKALEV-FPEFAAAHNNLASIYKQQGNLDDAIMCYKEALRI 417 (966) T ss_pred HHHHHHHHHHHH-CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC T ss_conf 889999999863-8024556422999999605477899999988714 No 38 >KOG0553 consensus Probab=97.81 E-value=0.00083 Score=41.98 Aligned_cols=105 Identities=15% Similarity=0.153 Sum_probs=78.2 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99999511005778999999873126895789999999999630269989999997531146423568999999999998 Q gi|254780421|r 95 IASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALK 174 (218) Q Consensus 95 lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~ 174 (218) .+.-+++.+++.+|+..|...|+-.+.+.++-. .-+..+...+.++.|++....-..-+ |...-+.-.+|.+|+. T Consensus 87 eGN~~m~~~~Y~eAv~kY~~AI~l~P~nAVyyc----NRAAAy~~Lg~~~~AVkDce~Al~iD-p~yskay~RLG~A~~~ 161 (304) T KOG0553 87 EGNKLMKNKDYQEAVDKYTEAIELDPTNAVYYC----NRAAAYSKLGEYEDAVKDCESALSID-PHYSKAYGRLGLAYLA 161 (304) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHH----HHHHHHHHHCCHHHHHHHHHHHHHCC-HHHHHHHHHHHHHHHC T ss_conf 889999862599999999999845887626777----89999998601178999999977629-0899999998799980 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 899899999999998355899899999999 Q gi|254780421|r 175 FGKVQKAKTIFEELAKDNNSPFGISTRSQM 204 (218) Q Consensus 175 ~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~ 204 (218) +|++.+|.+.|+++++-.+....++.-... T Consensus 162 ~gk~~~A~~aykKaLeldP~Ne~~K~nL~~ 191 (304) T KOG0553 162 LGKYEEAIEAYKKALELDPDNESYKSNLKI 191 (304) T ss_pred CCCHHHHHHHHHHHHCCCCCCHHHHHHHHH T ss_conf 673799999997643168884899988999 No 39 >PRK10866 outer membrane protein assembly complex subunit YfiO; Provisional Probab=97.80 E-value=0.0014 Score=40.62 Aligned_cols=123 Identities=9% Similarity=0.050 Sum_probs=64.6 Q ss_pred HHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHH------------------HHCCHHHHHHHHHHHHHHHCCCCHH Q ss_conf 413876669999999731255-206999999999999------------------5110057789999998731268957 Q gi|254780421|r 64 FNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASIL------------------VAKGDTKNAAEIFLKVANDDLAPLA 124 (218) Q Consensus 64 ~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~------------------~~~g~~~~A~~~l~~i~~~~~~~~~ 124 (218) ...+++++|+..++.++..+| +...+-|.+..+-.. .+.+...+|...++.+++.-+ ++. T Consensus 80 y~~~~y~~A~~~~~rFi~lyP~~~~v~Ya~Yl~gl~~~~~~~~~~~~~~~~~~~~rD~~~~~~A~~~f~~li~~yP-~S~ 158 (243) T PRK10866 80 YKNADLPLAQAAIDRFMRLNPTHPNIDYVMYMRGLTNMALDDSALQGFFGVDRSDRDPQHARAAFSDFSKLVRGYP-NSQ 158 (243) T ss_pred HHHCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCHHHHHHHHCCCHHHCCHHHHHHHHHHHHHHHHHCC-CCH T ss_conf 7307789999999999986789997389999988987740135888874325643076999999999999998789-984 Q ss_pred HHHHHH--------------HHHHHHHHCCCCHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 899999--------------9999963026998999999753114--64235689999999999988998999999999 Q gi|254780421|r 125 VRYAAT--------------LQAASILVDTSSYEEISKILQKLSE--PSNPMHQFANEILGISALKFGKVQKAKTIFEE 187 (218) Q Consensus 125 ~~~lA~--------------l~la~~~~~~~~~d~a~~~l~~~~~--~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~ 187 (218) +..-|+ +..|..++..+.+..|+.+++.+.. +++++..-|+..+..+|...|..++|.+.-.- T Consensus 159 Ya~dA~~r~~~lrn~LA~~El~Va~fY~kr~~y~AAi~R~~~vi~~Yp~t~~~~eAL~~l~eaY~~Lgl~~~A~~~~~v 237 (243) T PRK10866 159 YTTDATKRLVFLKDRLAKYEYSVAEYYTKRGAWVAVVNRVEGMLRDYPDTQATRDALPLMENAYRQMQLNAQADKVAKI 237 (243) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 5899999999999999999999999999926569999999999976899963399999999999986997899999999 No 40 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=97.75 E-value=0.00072 Score=42.34 Aligned_cols=112 Identities=18% Similarity=0.151 Sum_probs=84.2 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHH--HHCCCCCHHHHHHHHHH Q ss_conf 9999999995110057789999998731268957899999999996302699899999975--31146423568999999 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQ--KLSEPSNPMHQFANEIL 168 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~--~~~~~~~~~~~~a~ell 168 (218) .-+..|-..+.+|++.+|...|.++.-=+..+ .+ -.+.||......+++++|+.... ....+++| ...... T Consensus 20 ~~Y~~AY~~y~~G~Y~~A~~~F~~L~~~~~~~--~~--Y~~gLa~c~q~~~~Y~~A~~~Y~~a~~l~~~~P---~p~~~~ 92 (137) T TIGR02552 20 AIYALAYNLYQQGRYDEALKLFQLLAAYDPWN--AD--YWLGLAACCQMLKEYEEAIDAYALAAALDPDDP---RPYYHA 92 (137) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHH--HH--HHHHHHHHHHHHHCHHHHHHHHHHHHHCCCCCC---CCHHHH T ss_conf 99999999985366799999999999974303--68--999999999986167899999999975276869---512699 Q ss_pred HHHHHHCCCHHHHHHHHHHHH---H-CCCCCHHHHHHHHHHHHHH Q ss_conf 999998899899999999998---3-5589989999999999950 Q gi|254780421|r 169 GISALKFGKVQKAKTIFEELA---K-DNNSPFGISTRSQMILANI 209 (218) Q Consensus 169 g~~~~~~Gd~~~A~~~y~~i~---~-~~~~p~~~r~ra~~~l~~l 209 (218) |+.++..|+...|++.++.++ . +.+....+++||+.++..| T Consensus 93 a~C~l~~g~~~~A~~aL~~a~~~~~~~~~~~~~l~~rA~~~l~~l 137 (137) T TIGR02552 93 AECYLALGEPESALKALDLAIEIAGGENPEYSELKERAEAMLESL 137 (137) T ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHC T ss_conf 899987289789999999999984889963178999999988319 No 41 >COG4649 Uncharacterized protein conserved in bacteria [Function unknown] Probab=97.69 E-value=0.00099 Score=41.48 Aligned_cols=131 Identities=15% Similarity=0.162 Sum_probs=85.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-------------HHHH--------------------HHHHHHHHHHCCCH Q ss_conf 99999999999999999999999999999-------------9999--------------------99999998413876 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDGSHEKKKN-------------IVGE--------------------NFAQALELFNSNKL 69 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~~q~~~~~-------------~as~--------------------~y~~al~~~~~~~~ 69 (218) -+-+.+|++..|.+||-+|..+. .++.. +..+ .+..+....++++. T Consensus 32 giailvVlGtag~~gy~yw~~s~-as~sgd~flaAL~lA~~~k~d~Alaaf~~lektg~g~YpvLA~mr~at~~a~kgdt 110 (221) T COG4649 32 GIAILVVLGTAGYVGYTYWQTSR-ASKSGDAFLAALKLAQENKTDDALAAFTDLEKTGYGSYPVLARMRAATLLAQKGDT 110 (221) T ss_pred HHHHHHHHCCCCCEEEEHHCCCC-CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHCCCH T ss_conf 99999983234201000111334-24325899999999870796689999999986488855699999999998612647 Q ss_pred HHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 66999999973125520--6999999999999511005778999999873126895789999999999630269989999 Q gi|254780421|r 70 DDARSSFEKILSQDNKL--YNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEIS 147 (218) Q Consensus 70 ~~A~~~l~~l~~~~~~~--ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~ 147 (218) +.|+..|+++....+.+ -..+|++.-|-++++.|.+++-......+..+ ..++|..|+--++.-....+++..+. T Consensus 111 a~AV~aFdeia~dt~~P~~~rd~ARlraa~lLvD~gsy~dV~srvepLa~d---~n~mR~sArEALglAa~kagd~a~A~ 187 (221) T COG4649 111 AAAVAAFDEIAADTSIPQIGRDLARLRAAYLLVDNGSYDDVSSRVEPLAGD---GNPMRHSAREALGLAAYKAGDFAKAK 187 (221) T ss_pred HHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHCCCC---CCHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf 889998998852578760136799998789874356488899874130489---97017889998768887322467799 Q ss_pred HHHHHHCCCC Q ss_conf 9975311464 Q gi|254780421|r 148 KILQKLSEPS 157 (218) Q Consensus 148 ~~l~~~~~~~ 157 (218) +.+.-+.++. T Consensus 188 ~~F~qia~Da 197 (221) T COG4649 188 SWFVQIANDA 197 (221) T ss_pred HHHHHHHCCC T ss_conf 9999997014 No 42 >KOG2002 consensus Probab=97.67 E-value=0.0041 Score=37.66 Aligned_cols=126 Identities=14% Similarity=0.131 Sum_probs=94.3 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 13876669999999731255206999999999999511005778999999873126895789999999999630269989 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYE 144 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d 144 (218) +...+..+...+...-+. .++-+.+...+|..++..|++..+..+..-.+.+.. ...+..-.-|.+++.+-.+|+++ T Consensus 248 d~~s~~~~~~ll~~ay~~--n~~nP~~l~~LAn~fyfK~dy~~v~~la~~ai~~t~-~~~~~aes~Y~~gRs~Ha~Gd~e 324 (1018) T KOG2002 248 DSDSYKKGVQLLQRAYKE--NNENPVALNHLANHFYFKKDYERVWHLAEHAIKNTE-NKSIKAESFYQLGRSYHAQGDFE 324 (1018) T ss_pred CHHHHHHHHHHHHHHHHH--CCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHCCHH T ss_conf 567788899998888760--689957999997777413537899999999998610-14777788999999988622689 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 9999975311464235689999999999988998999999999983558 Q gi|254780421|r 145 EISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNN 193 (218) Q Consensus 145 ~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~ 193 (218) +|..+.......++.+..+.+..+|..++..|++..|..-|++++...+ T Consensus 325 kA~~yY~~s~k~~~d~~~l~~~GlgQm~i~~~dle~s~~~fEkv~k~~p 373 (1018) T KOG2002 325 KAFKYYMESLKADNDNFVLPLVGLGQMYIKRGDLEESKFCFEKVLKQLP 373 (1018) T ss_pred HHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHCHHHHHHHHHHHHHHHCC T ss_conf 9999999997248887620012125789871327788999999998676 No 43 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=97.65 E-value=0.0028 Score=38.69 Aligned_cols=101 Identities=16% Similarity=0.196 Sum_probs=65.2 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 99999995110057789999998731268957899999999996302699899999975311464235689999999999 Q gi|254780421|r 93 MYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISA 172 (218) Q Consensus 93 l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~ 172 (218) ...+..+...|+.+.+......+...+..+.... .. ++..++.++++..|+..+........+ ...++..+|.+| T Consensus 70 ~~~a~a~~~~G~a~~~l~~~~~~~~~~~~d~~ll---~~-~gk~~~~~g~~~~A~~~~rkA~~l~p~-d~~~~~~lgaal 144 (257) T COG5010 70 AKLATALYLRGDADSSLAVLQKSAIAYPKDRELL---AA-QGKNQIRNGNFGEAVSVLRKAARLAPT-DWEAWNLLGAAL 144 (257) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHCCCCCCHHHH---HH-HHHHHHHHCCHHHHHHHHHHHHCCCCC-CHHHHHHHHHHH T ss_conf 8877777740555500788755401474549999---99-999999804568899999998436999-746666778999 Q ss_pred HHCCCHHHHHHHHHHHHHC-CCCCHHH Q ss_conf 9889989999999999835-5899899 Q gi|254780421|r 173 LKFGKVQKAKTIFEELAKD-NNSPFGI 198 (218) Q Consensus 173 ~~~Gd~~~A~~~y~~i~~~-~~~p~~~ 198 (218) -+.|++++|+..|.++++- ++.|..+ T Consensus 145 dq~Gr~~~Ar~ay~qAl~L~~~~p~~~ 171 (257) T COG5010 145 DQLGRFDEARRAYRQALELAPNEPSIA 171 (257) T ss_pred HHCCCHHHHHHHHHHHHHHCCCCCHHH T ss_conf 973675677899999998425884466 No 44 >KOG0543 consensus Probab=97.59 E-value=0.0011 Score=41.12 Aligned_cols=106 Identities=19% Similarity=0.141 Sum_probs=60.2 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHH----CCCC-------HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 99995110057789999998731----2689-------578999999999963026998999999753114642356899 Q gi|254780421|r 96 ASILVAKGDTKNAAEIFLKVAND----DLAP-------LAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFA 164 (218) Q Consensus 96 A~~~~~~g~~~~A~~~l~~i~~~----~~~~-------~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a 164 (218) +..++..|++..|+..|..++.- .+.+ ..++....+.++..++..+.+.+|+...+....-++ -..-| T Consensus 215 Gn~~fK~gk~~~A~~~Yerav~~l~~~~~~~~ee~~~~~~~k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~-~N~KA 293 (397) T KOG0543 215 GNVLFKEGKFKLAKKRYERAVSFLEYRRSFDEEEQKKAEALKLACHLNLAACYLKLKEYKEAIESCNKVLELDP-NNVKA 293 (397) T ss_pred HHHHHHHCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCHHH T ss_conf 15887512199999999999987511346997899999888877764799888741218899999999884589-96267 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 99999999988998999999999983558998999999 Q gi|254780421|r 165 NEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRS 202 (218) Q Consensus 165 ~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra 202 (218) +..+|.+++..|+++.|+..|++++.-.+.+-.++..+ T Consensus 294 LyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el 331 (397) T KOG0543 294 LYRRGQALLALGEYDLARDDFQKALKLEPSNKAARAEL 331 (397) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 77878898864209899999999997488868999999 No 45 >PRK02603 photosystem I assembly protein Ycf3; Provisional Probab=97.58 E-value=0.0028 Score=38.66 Aligned_cols=100 Identities=13% Similarity=0.161 Sum_probs=62.5 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC-CCHHHHHH Q ss_conf 9999999999998413876669999999731255206-999999999999511005778999999873126-89578999 Q gi|254780421|r 51 NIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLY-NPLSNMYIASILVAKGDTKNAAEIFLKVANDDL-APLAVRYA 128 (218) Q Consensus 51 ~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~y-a~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~-~~~~~~~l 128 (218) .+|-..|-..+..-+.|++++|+.-+++-+.-.++.+ ...+.+.++-++.+.|+.++|++.|...++-++ .+..+..+ T Consensus 33 k~A~~Yy~~G~~~q~~G~Y~EAle~Y~kAL~Le~dp~D~s~ilyNiGlIY~~~G~~~kAie~Y~kALeiNP~~~~AlNNl 112 (172) T PRK02603 33 KEAFVYYRDGMSAQNDGDYAEALENYEEALKLEEDPNDRSYILYNMAIIYASNGEHDKALETYHQALELNPKQPSALNNI 112 (172) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 68999997225777457999999999998727789620899997799999986998899999999998689978999999 Q ss_pred HHHH--HHHHHHCCCCHHHHHHHH Q ss_conf 9999--999630269989999997 Q gi|254780421|r 129 ATLQ--AASILVDTSSYEEISKIL 150 (218) Q Consensus 129 A~l~--la~~~~~~~~~d~a~~~l 150 (218) +.+. .+....+.++.|++...+ T Consensus 113 gvIY~k~G~~a~~~~~~~~~~~~f 136 (172) T PRK02603 113 AVIYHKRGRKAEEAGDQDEAEAWF 136 (172) T ss_pred HHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 999999859999845877899999 No 46 >KOG2076 consensus Probab=97.57 E-value=0.0056 Score=36.82 Aligned_cols=134 Identities=15% Similarity=0.099 Sum_probs=96.6 Q ss_pred HHHHHHHHHHH-HHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 99999999999-98413876669999999731255206999999999999511005778999999873126895789999 Q gi|254780421|r 51 NIVGENFAQAL-ELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAA 129 (218) Q Consensus 51 ~~as~~y~~al-~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA 129 (218) ....+.|..+. ...+.+++.+|+..|..+.+.....- ...-+.+|..+++.|..+.|+..|..++...+.+ + =+ T Consensus 411 ~d~~dL~~d~a~al~~~~~~~~Al~~l~~i~~~~~~~~-~~vw~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~--~--D~ 485 (895) T KOG2076 411 SDDVDLYLDLADALTNIGKYKEALRLLSPITNREGYQN-AFVWYKLARCYMELGEYEEAIEFYEKVLILAPDN--L--DA 485 (895) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--H--HH T ss_conf 55578999999999836558889999999864833355-1346989999998742899999999999528986--0--16 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHCCCCC------HHHH--HHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 99999963026998999999753114642------3568--999999999998899899999999998 Q gi|254780421|r 130 TLQAASILVDTSSYEEISKILQKLSEPSN------PMHQ--FANEILGISALKFGKVQKAKTIFEELA 189 (218) Q Consensus 130 ~l~la~~~~~~~~~d~a~~~l~~~~~~~~------~~~~--~a~ellg~~~~~~Gd~~~A~~~y~~i~ 189 (218) +++++.++...|++++|...|..+..+++ .|.. -.+..+.+++...|+.++=...-..++ T Consensus 486 Ri~Lasl~~~~g~~EkalEtL~~~~~~D~~~~e~~a~~~e~ri~~~r~d~l~~~gk~E~fi~t~~~Lv 553 (895) T KOG2076 486 RITLASLYQQLGNHEKALETLEQIINPDGRNAEACAWEPERRILAHRCDILFQVGKREEFINTASTLV 553 (895) T ss_pred HHHHHHHHHHCCCHHHHHHHHHCCCCCCCCCHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 63199999763888899998713458994002220036777899999999998001999999999999 No 47 >KOG0547 consensus Probab=97.57 E-value=0.0043 Score=37.56 Aligned_cols=116 Identities=22% Similarity=0.259 Sum_probs=52.9 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC---------CHHHHHHHHHHHHHH Q ss_conf 38766699999997312552069999999999995110057789999998731268---------957899999999996 Q gi|254780421|r 66 SNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLA---------PLAVRYAATLQAASI 136 (218) Q Consensus 66 ~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~---------~~~~~~lA~l~la~~ 136 (218) .+..++....|+..+...|+.-..-..| |.++.+++++++|++.|+..++-.+. +-+.+.+..++ T Consensus 441 ~~k~~~~m~~Fee~kkkFP~~~Evy~~f--AeiLtDqqqFd~A~k~YD~ai~LE~~~~~~~v~~~plV~Ka~l~~q---- 514 (606) T KOG0547 441 QHKIAESMKTFEEAKKKFPNCPEVYNLF--AEILTDQQQFDKAVKQYDKAIELEPREHLIIVNAAPLVHKALLVLQ---- 514 (606) T ss_pred HHHHHHHHHHHHHHHHHCCCCCHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHC---- T ss_conf 8789999999999998688982377899--9998317767789998888885064201132460544304676435---- Q ss_pred HHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 302699899999975311464235689999999999988998999999999983 Q gi|254780421|r 137 LVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 137 ~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) -.++.+.++..+..-.+- +|-.-.|.+.+|.+.+.+|+.++|++.|++.+. T Consensus 515 --wk~d~~~a~~Ll~KA~e~-Dpkce~A~~tlaq~~lQ~~~i~eAielFEksa~ 565 (606) T KOG0547 515 --WKEDINQAENLLRKAIEL-DPKCEQAYETLAQFELQRGKIDEAIELFEKSAQ 565 (606) T ss_pred --HHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf --165489999999998725-944789999998999988659999999999999 No 48 >COG3118 Thioredoxin domain-containing protein [Posttranslational modification, protein turnover, chaperones] Probab=97.53 E-value=0.0064 Score=36.47 Aligned_cols=155 Identities=16% Similarity=0.156 Sum_probs=117.4 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHH Q ss_conf 99999999999841387666999999973125520699999999999951100577899999987312689578999999 Q gi|254780421|r 52 IVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATL 131 (218) Q Consensus 52 ~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l 131 (218) ...+.+.++....+.++..++...|+......+.+ .-+.+.+|..+++.|+.+.|.+.|..+-.+...+....-.|++ T Consensus 133 ~~e~~~~~~~~~~~~e~~~~a~~~~~~al~~~~~~--~~~~~~la~~~l~~g~~e~A~~iL~~lP~~~~~~~~~~l~a~i 210 (304) T COG3118 133 EEEEALAEAKELIEAEDFGEAAPLLKQALQAAPEN--SEAKLLLAECLLAAGDVEAAQAILAALPLQAQDKAAHGLQAQI 210 (304) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC--CHHHHHHHHHHHHCCCHHHHHHHHHHCCCCCHHHHHHHHHHHH T ss_conf 77888987666541654455899999999858456--6377789999998188589999997285001045788899999 Q ss_pred HHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH--CCCCCHHHHHHHHHHHHHH Q ss_conf 99996302699899999975311464235689999999999988998999999999983--5589989999999999950 Q gi|254780421|r 132 QAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK--DNNSPFGISTRSQMILANI 209 (218) Q Consensus 132 ~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~--~~~~p~~~r~ra~~~l~~l 209 (218) .+..-.-...+..+....+.. .|+ ..-+...+++.+...|+.++|-+++-.++. ........|.|.-.++..+ T Consensus 211 ~ll~qaa~~~~~~~l~~~~aa--dPd---d~~aa~~lA~~~~~~g~~e~Ale~Ll~~l~~d~~~~d~~~Rk~lle~f~~~ 285 (304) T COG3118 211 ELLEQAAATPEIQDLQRRLAA--DPD---DVEAALALADQLHLVGRNEAALEHLLALLRRDRGFEDGEARKTLLELFEAF 285 (304) T ss_pred HHHHHHHCCCCHHHHHHHHHH--CCC---CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHC T ss_conf 999997459977899999870--999---889999999999973887999999999987503443768999999999963 Q ss_pred CCCH Q ss_conf 0340 Q gi|254780421|r 210 IASD 213 (218) Q Consensus 210 ~~~~ 213 (218) +-.| T Consensus 286 g~~D 289 (304) T COG3118 286 GPAD 289 (304) T ss_pred CCCC T ss_conf 8898 No 49 >TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW; InterPro: IPR013360 Proteins in this entry are designated PilF and PilW . This outer membrane protein is required both for pilus stability and for pilus functions such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.. Probab=97.53 E-value=0.0042 Score=37.59 Aligned_cols=131 Identities=20% Similarity=0.236 Sum_probs=95.4 Q ss_pred HHHCC--CHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCC--HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 84138--766699999997312552069999999999995110--05778999999873126895789999999999630 Q gi|254780421|r 63 LFNSN--KLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKG--DTKNAAEIFLKVANDDLAPLAVRYAATLQAASILV 138 (218) Q Consensus 63 ~~~~~--~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g--~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~ 138 (218) .-+.+ +.+.|...|++=+.-.|.....+=.| +.-+..+| ++++|...|...++++..+...+ +--.++.+.+ T Consensus 81 Yq~~grP~~~~A~~~f~~Al~~~p~~g~~~NNY--G~FLC~~g~~~~~~A~~~F~~A~~~p~Y~~~~~--~~eN~G~C~~ 156 (247) T TIGR02521 81 YQQLGRPELEKAEQSFRKALTLKPNNGDVLNNY--GTFLCQQGNDKYEQAEQQFEAAIEDPLYPQPAR--SYENAGLCAL 156 (247) T ss_pred HHHHCCCCHHHHHHHHHHHHHCCCCCCCEECHH--HHHHHHCCCCCHHHHHHHHHHHHHCCCCCCCHH--HHHHHHHHHH T ss_conf 987179955789999999862489985232014--577640489706899999999986688973302--4554999998 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC----CCCCHHH Q ss_conf 26998999999753114642356899999999999889989999999999835----5899899 Q gi|254780421|r 139 DTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKD----NNSPFGI 198 (218) Q Consensus 139 ~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~----~~~p~~~ 198 (218) ..++.++|...|.....-+ |-++.++--++.++...|++..|+..|++-... ...|.++ T Consensus 157 ~~g~~~~A~~~l~~Al~~d-p~~p~~lleLa~l~~~~g~~~~A~~~l~ry~~~~G~~~~~a~sL 219 (247) T TIGR02521 157 KAGDTEKAEKYLTRALKID-PQRPESLLELAELYYERGQYAQARALLERYQQTVGVKNQTAESL 219 (247) T ss_pred HCCCHHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCCCCCHHHH T ss_conf 6178688999999999858-78885579999998750772899999999998736788887347 No 50 >COG4783 Putative Zn-dependent protease, contains TPR repeats [General function prediction only] Probab=97.50 E-value=0.007 Score=36.23 Aligned_cols=77 Identities=26% Similarity=0.248 Sum_probs=41.9 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHHHHHHH Q ss_conf 999999996302699899999975311464235689999999999988998999999999983-558998999999999 Q gi|254780421|r 128 AATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK-DNNSPFGISTRSQMI 205 (218) Q Consensus 128 lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~-~~~~p~~~r~ra~~~ 205 (218) -++|..+......++++++.+.|..+.... |..+..+++.++++.+.|+.++|.+.+++++. +++.|.-.-..++.+ T Consensus 307 aa~YG~A~~~~~~~~~d~A~~~l~~L~~~~-P~N~~~~~~~~~i~~~~nk~~~A~e~~~kal~l~P~~~~l~~~~a~al 384 (484) T COG4783 307 AAQYGRALQTYLAGQYDEALKLLQPLIAAQ-PDNPYYLELAGDILLEANKAKEAIERLKKALALDPNSPLLQLNLAQAL 384 (484) T ss_pred HHHHHHHHHHHHHCCCCHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHH T ss_conf 778889999998327015789999999848-998799999999998758867799999999840898507999999999 No 51 >KOG2002 consensus Probab=97.48 E-value=0.0019 Score=39.71 Aligned_cols=60 Identities=15% Similarity=0.171 Sum_probs=24.1 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 999999999511005778999999873126895789999999999630269989999997531 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKL 153 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~ 153 (218) +.|++|+.+...|++++|...|...+.-.+.. .-++.++++..++-.++.+.++..+..+ T Consensus 309 s~Y~~gRs~Ha~Gd~ekA~~yY~~s~k~~~d~---~~l~~~GlgQm~i~~~dle~s~~~fEkv 368 (1018) T KOG2002 309 SFYQLGRSYHAQGDFEKAFKYYMESLKADNDN---FVLPLVGLGQMYIKRGDLEESKFCFEKV 368 (1018) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCC---CCCCCCCHHHHHHHHCHHHHHHHHHHHH T ss_conf 89999999886226899999999997248887---6200121257898713277889999999 No 52 >KOG1840 consensus Probab=97.44 E-value=0.0084 Score=35.74 Aligned_cols=102 Identities=15% Similarity=0.110 Sum_probs=73.8 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC---C--CCHHHHHHHHHHHHHHHHCCCCHHHHHHHH-------HHHCC Q ss_conf 99999999999951100577899999987312---6--895789999999999630269989999997-------53114 Q gi|254780421|r 88 NPLSNMYIASILVAKGDTKNAAEIFLKVANDD---L--APLAVRYAATLQAASILVDTSSYEEISKIL-------QKLSE 155 (218) Q Consensus 88 a~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~---~--~~~~~~~lA~l~la~~~~~~~~~d~a~~~l-------~~~~~ 155 (218) ..-...+++..+..+|+++.|+..+++.++.- . ..+.+.... -.++..++..+++++|+..+ ....+ T Consensus 198 ~~~~~~~La~~y~~~g~~e~A~~l~k~Al~~l~k~~G~~hl~va~~l-~~~a~~y~~~~k~~eAv~ly~~AL~i~e~~~G 276 (508) T KOG1840 198 RLRTLRNLAEMYAVQGRLEKAEPLCKQALRILEKTSGLKHLVVASML-NILALVYRSLGKYDEAVNLYEEALTIREEVFG 276 (508) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH-HHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCC T ss_conf 78899999999998520778999999999999870376668999999-99999999744289999999999999998538 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 64235689999999999988998999999999983 Q gi|254780421|r 156 PSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 156 ~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) +..|-.+..+.-++.+|.++|++++|...++.+++ T Consensus 277 ~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~ 311 (508) T KOG1840 277 EDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALE 311 (508) T ss_pred CCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 98789999998899998604775778999999999 No 53 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=97.41 E-value=0.0092 Score=35.51 Aligned_cols=124 Identities=16% Similarity=0.115 Sum_probs=91.4 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHH-HHHHHHH Q ss_conf 999999999995110057789999998731268957899999999996302699899999975311464235-6899999 Q gi|254780421|r 89 PLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPM-HQFANEI 167 (218) Q Consensus 89 ~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~-~~~a~el 167 (218) ..-++.+|..+.+-|+..+|...|++.+.-.=.+ ...-.+.++.-++..+++..+...|+.+-.-...+ .+.-.-+ T Consensus 89 vqnr~rLa~al~elGr~~EA~~hy~qalsG~fA~---d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa~r~pd~~Ll 165 (251) T COG4700 89 VQNRYRLANALAELGRYHEAVPHYQQALSGIFAH---DAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPAFRSPDGHLL 165 (251) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCC---CHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCCCCCCHHH T ss_conf 7889999999998410566688999985465679---8899988898888616588888899998605976589972589 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCHHHH Q ss_conf 9999998899899999999998355899899999999999500340320 Q gi|254780421|r 168 LGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANIIASDQRN 216 (218) Q Consensus 168 lg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~~~~~~ 216 (218) .|..+..+|+.+.|+..|+.++.--+-| .-|.+--++|+..+--++++ T Consensus 166 ~aR~laa~g~~a~Aesafe~a~~~ypg~-~ar~~Y~e~La~qgr~~ea~ 213 (251) T COG4700 166 FARTLAAQGKYADAESAFEVAISYYPGP-QARIYYAEMLAKQGRLREAN 213 (251) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHCCCH-HHHHHHHHHHHHHCCHHHHH T ss_conf 9999984588055899999999858877-78989999999712216788 No 54 >PRK10747 putative protoheme IX biogenesis protein; Provisional Probab=97.40 E-value=0.0092 Score=35.48 Aligned_cols=54 Identities=9% Similarity=0.009 Sum_probs=24.7 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC Q ss_conf 1387666999999973125520699999999999951100577899999987312 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDD 119 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~ 119 (218) +.++.+..-..++.-.+..+ .......+..|...+++|+++.|.+.+..+.... T Consensus 130 ~~g~~~~rd~yL~~A~e~~p-~a~lav~lt~Ael~l~~~q~e~Ala~L~~L~~~~ 183 (398) T PRK10747 130 QRGDEARANQHLERAAELAG-NDTIPVEITRVRLQLARNENHAARHGVDKLLEVT 183 (398) T ss_pred HCCCHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC T ss_conf 88898999999999888589-8488999999999997799899999999998619 No 55 >cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C Probab=97.39 E-value=0.0052 Score=37.03 Aligned_cols=93 Identities=19% Similarity=0.145 Sum_probs=74.7 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 99999984138766699999997312552069999999999995110057789999998731268957899999999996 Q gi|254780421|r 57 FAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASI 136 (218) Q Consensus 57 y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~ 136 (218) |.-+....+.+++++|+..+++.++..+.. ..+.+.+|.++...|++++|+..++.++.-.+.+. .+.+.++.+ T Consensus 4 ~~lg~~~~~~g~~~~A~~~~~~al~~~p~~--~~~~~~l~~~~~~~~~~~~A~~~~~~~~~~~~~~~----~~~~~la~~ 77 (100) T cd00189 4 LNLGNLYYKLGDYDEALEYYEKALELDPDN--ADAYYNLAAAYYKLGKYEEALEDYEKALELDPDNA----KAYYNLGLA 77 (100) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCH----HHHHHHHHH T ss_conf 999999998499999999999999859998--99999999999985985899999999998698989----999999999 Q ss_pred HHCCCCHHHHHHHHHHHCC Q ss_conf 3026998999999753114 Q gi|254780421|r 137 LVDTSSYEEISKILQKLSE 155 (218) Q Consensus 137 ~~~~~~~d~a~~~l~~~~~ 155 (218) +...+++++|+..+..... T Consensus 78 ~~~~~~~~~A~~~~~~al~ 96 (100) T cd00189 78 YYKLGKYEEALEAYEKALE 96 (100) T ss_pred HHHCCCHHHHHHHHHHHHH T ss_conf 9994999999999999997 No 56 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=97.37 E-value=0.007 Score=36.25 Aligned_cols=102 Identities=12% Similarity=0.053 Sum_probs=62.5 Q ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 69999999999995110057789999998731268957899999999996302699899999975311464235689999 Q gi|254780421|r 87 YNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANE 166 (218) Q Consensus 87 ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~e 166 (218) -.+++.+.-+-.....|++.+|++.|.+.+.- ..++.-+....+.++.++..+++++.|+.....-.+- .|..+.++. T Consensus 29 k~af~yyr~GmsaQs~GeyaEALe~Y~eAL~L-e~D~~Drs~iLYNIGlIy~~nGe~~kALeyY~qALei-NP~~pqA~N 106 (165) T CHL00033 29 KEAFTYYRDGMSAQSEGEYAEALQNYYEAMRL-EIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALER-NPFLPQALN 106 (165) T ss_pred HHHHHHHHCCCHHHCCCCHHHHHHHHHHHHCC-CCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH-CCCCHHHHH T ss_conf 88999998151010135389999999986234-5685126689989999999858888999999999985-999689997 Q ss_pred HHHHHHH-------HCCCHHHHHHHHHHHHH Q ss_conf 9999999-------88998999999999983 Q gi|254780421|r 167 ILGISAL-------KFGKVQKAKTIFEELAK 190 (218) Q Consensus 167 llg~~~~-------~~Gd~~~A~~~y~~i~~ 190 (218) -+|.+|. ++|+.+.|...|+++.+ T Consensus 107 NlGvIyh~qG~~a~~~~~~~~~~~~fd~aa~ 137 (165) T CHL00033 107 NMAVICHYRGEQAIEQGDSEIAEAWFDQAAE 137 (165) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 8999999987999984586789999999999 No 57 >KOG0624 consensus Probab=97.36 E-value=0.01 Score=35.18 Aligned_cols=146 Identities=19% Similarity=0.236 Sum_probs=97.0 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHH Q ss_conf 99999999998413876669999999731255206999999999999511005778999999873126895789999999 Q gi|254780421|r 53 VGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQ 132 (218) Q Consensus 53 as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~ 132 (218) +..-+.-.-..+-.+.+..|++.+-.-++-.++.|- |-|..|..+..-|+...|+.-+..+++-. |..+ -||+. T Consensus 38 vekhlElGk~lla~~Q~sDALt~yHaAve~dp~~Y~--aifrRaT~yLAmGksk~al~Dl~rVlelK--pDF~--~ARiQ 111 (504) T KOG0624 38 VEKHLELGKELLARGQLSDALTHYHAAVEGDPNNYQ--AIFRRATVYLAMGKSKAALQDLSRVLELK--PDFM--AARIQ 111 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH--HHHHHHHHHHHHCCCCCCHHHHHHHHHCC--CCHH--HHHHH T ss_conf 999999889999734677899998888707806699--99987777765257740033089998618--3188--88887 Q ss_pred HHHHHHCCCCHHHHHHHHHHHCCCCCH--HHHHHHHHHHHH------------HHHCCCHHHHHHHHHHHHHCCCCCHHH Q ss_conf 999630269989999997531146423--568999999999------------998899899999999998355899899 Q gi|254780421|r 133 AASILVDTSSYEEISKILQKLSEPSNP--MHQFANEILGIS------------ALKFGKVQKAKTIFEELAKDNNSPFGI 198 (218) Q Consensus 133 la~~~~~~~~~d~a~~~l~~~~~~~~~--~~~~a~ellg~~------------~~~~Gd~~~A~~~y~~i~~~~~~p~~~ 198 (218) -+.+++.+++++.|...+..+...++. ...-+++-++.+ +...||...|++....+++-..=..++ T Consensus 112 Rg~vllK~Gele~A~~DF~~vl~~~~s~~~~~eaqskl~~~~e~~~l~~ql~s~~~~GD~~~ai~~i~~llEi~~Wda~l 191 (504) T KOG0624 112 RGVVLLKQGELEQAEADFDQVLQHEPSNGLVLEAQSKLALIQEHWVLVQQLKSASGSGDCQNAIEMITHLLEIQPWDASL 191 (504) T ss_pred HCHHHHHCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHH T ss_conf 22035532528889879999972598730268899887768999999999998843874655999998888537133689 Q ss_pred H-HHHHH Q ss_conf 9-99999 Q gi|254780421|r 199 S-TRSQM 204 (218) Q Consensus 199 r-~ra~~ 204 (218) | .|+.- T Consensus 192 ~~~Rakc 198 (504) T KOG0624 192 RQARAKC 198 (504) T ss_pred HHHHHHH T ss_conf 9999888 No 58 >PRK10049 pgaA outer membrane protein PgaA; Provisional Probab=97.17 E-value=0.017 Score=33.85 Aligned_cols=141 Identities=9% Similarity=0.017 Sum_probs=82.0 Q ss_pred HHCCCHHHHHHHHHHHHHCCCCCHHHH---HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC------------CHHHHHH Q ss_conf 413876669999999731255206999---9999999995110057789999998731268------------9578999 Q gi|254780421|r 64 FNSNKLDDARSSFEKILSQDNKLYNPL---SNMYIASILVAKGDTKNAAEIFLKVANDDLA------------PLAVRYA 128 (218) Q Consensus 64 ~~~~~~~~A~~~l~~l~~~~~~~ya~l---A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~------------~~~~~~l 128 (218) +..+..++|+..+..+....+..+... -...+--.+++++++++|...++.++...+. +..+. - T Consensus 335 L~~~qP~kA~~iy~~l~~~~~~~~~~~~~e~~~~LfYA~~e~e~~~eA~~l~~~~~~~~p~~~~~~g~~~~~PNddy~-~ 413 (818) T PRK10049 335 LDRRQPEKAAPILTELFYHPETIAPDLSDEELADLFYSLLESEQYPGALQVAVHTIETTPYFVRLYGLPTSIPNDDWL-Q 413 (818) T ss_pred HHCCCHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCCCCCHHHH-H T ss_conf 863996899999999973485457653567887789998611118999999998653289604306888898882799-9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 99999996302699899999975311464235689999999999988998999999999983558998999999999995 Q gi|254780421|r 129 ATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILAN 208 (218) Q Consensus 129 A~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~ 208 (218) +....+...+-.++...|.+.++.+... .|...-.+-.+|.++...|....|...++++.- ..|.+...++.+.... T Consensus 414 ~~~l~a~~~~~~ndL~~Ae~~le~l~~~-APgN~~L~~~~A~v~r~Rg~pr~Ae~~~~~a~~--l~P~~~~~~~~qa~~a 490 (818) T PRK10049 414 GQSLLVQSLKASNDLPQAEKRLEELAYT-APGNQGLRIDYASVYQARGWPRAAENELKAAES--LEPRSLNLEVAQAWTA 490 (818) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHH--CCCCCHHHHHHHHHHH T ss_conf 9999999999757849999999999863-999879999999999854995999999999870--0864279998899998 No 59 >pfam09976 DUF2133 Uncharacterized protein conserved in bacteria (DUF2133). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=97.16 E-value=0.00091 Score=41.73 Aligned_cols=41 Identities=7% Similarity=-0.141 Sum_probs=31.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 88999999997519999999999999999999999999999 Q gi|254780421|r 10 YKGKCKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKK 50 (218) Q Consensus 10 ~~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~ 50 (218) .-+++|-|-+.|+..|++++++++++++||.+|+++|.+++ T Consensus 2 q~e~ik~wwk~~G~~ii~~vvl~~~g~~G~~~w~~~q~~qa 42 (43) T pfam09976 2 QLERLKAWWKRYGKWIIVGAVLALAGVAGWRGWQKYQASQA 42 (43) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 68899999998179999999999999999999999998701 No 60 >COG4499 Predicted membrane protein [Function unknown] Probab=97.14 E-value=0.018 Score=33.66 Aligned_cols=132 Identities=17% Similarity=0.223 Sum_probs=84.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 99975199999999999999999999999999999999999999998413876669999999731255206999999999 Q gi|254780421|r 17 GKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIA 96 (218) Q Consensus 17 ~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA 96 (218) ....+||+.+|++++.+.+++|+.|+-+....+.+.. ..+-..+-++++++.+..++.. .+....--+++.+| T Consensus 216 k~~ifk~~giGliillvl~li~~~Y~~f~~~p~qeai----~~a~~aFL~~nY~qVittLe~y---dp~klPksv~Y~LA 288 (434) T COG4499 216 KYTIFKYFGIGLIILLVLLLIYFTYYYFSNQPKQEAI----ITANTAFLKNNYDQVITTLENY---DPEKLPKSVQYILA 288 (434) T ss_pred CCEEHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHH----HHHHHHHHHCCHHHHHHHCCCC---CHHHCCHHHHHHHH T ss_conf 5111146888699999999999999999717268889----9889999853589886302348---91237688899998 Q ss_pred HHHHHCCHHHHHHH--HHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 99951100577899--9999873126895789999999999630269989999997531146423568999 Q gi|254780421|r 97 SILVAKGDTKNAAE--IFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFAN 165 (218) Q Consensus 97 ~~~~~~g~~~~A~~--~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ 165 (218) ..++.-.+...... +++.+. ..++ -.+.+-.++.+.|.+++|+..-..+..++ |..+|+ T Consensus 289 ~SYV~~e~L~~~kkeNi~Nnis--lkSd------~~~llYWi~~GRGe~~eAinIAr~L~D~d--~~~~Al 349 (434) T COG4499 289 VSYVNLEDLTTTKKENILNNIS--LKSD------DNYLLYWIYSGRGEFKEAINIARNLDDND--LTLLAL 349 (434) T ss_pred HHHHHCCCCCHHHHHHHHHCCC--CCCC------HHHHHHHHHHCCCCHHHHHHHHHHCCCCH--HHHHHH T ss_conf 8774003441578998862046--5666------45778999846851788866987178622--599999 No 61 >COG3063 PilF Tfp pilus assembly protein PilF [Cell motility and secretion / Intracellular trafficking and secretion] Probab=97.13 E-value=0.012 Score=34.74 Aligned_cols=124 Identities=19% Similarity=0.162 Sum_probs=74.4 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 13876669999999731255206999999999999511005778999999873126895789999999999630269989 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYE 144 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d 144 (218) ..+..+.|-..+++-++..+..-.++=.+ +.-+-.+|.+++|...|...++++.....-... ..++.+.+..++++ T Consensus 81 ~~Ge~~~A~e~YrkAlsl~p~~GdVLNNY--G~FLC~qg~~~eA~q~F~~Al~~P~Y~~~s~t~--eN~G~Cal~~gq~~ 156 (250) T COG3063 81 KLGENDLADESYRKALSLAPNNGDVLNNY--GAFLCAQGRPEEAMQQFERALADPAYGEPSDTL--ENLGLCALKAGQFD 156 (250) T ss_pred HCCCHHHHHHHHHHHHHCCCCCCCHHHHH--HHHHHHCCCHHHHHHHHHHHHHCCCCCCCCHHH--HHHHHHHHHCCCCH T ss_conf 90775567999999985298864055210--699970898689999999987188877753045--44578886437704 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 9999975311464235689999999999988998999999999983558 Q gi|254780421|r 145 EISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNN 193 (218) Q Consensus 145 ~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~ 193 (218) .+...|..-..-+ |-.+....-++....+.|++..|+..++....... T Consensus 157 ~A~~~l~raL~~d-p~~~~~~l~~a~~~~~~~~y~~Ar~~~~~~~~~~~ 204 (250) T COG3063 157 QAEEYLKRALELD-PQFPPALLELARLHYKAGDYAPARLYLERYQQRGG 204 (250) T ss_pred HHHHHHHHHHHHC-CCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCC T ss_conf 4799999998837-57876789987988753130189999999970256 No 62 >KOG0550 consensus Probab=97.08 E-value=0.02 Score=33.35 Aligned_cols=149 Identities=15% Similarity=0.080 Sum_probs=103.4 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHH----------HHHHHHHHCCHHHHHHHHHHHHHHHCCC Q ss_conf 999999999998413876669999999731255206999999----------9999995110057789999998731268 Q gi|254780421|r 52 IVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNM----------YIASILVAKGDTKNAAEIFLKVANDDLA 121 (218) Q Consensus 52 ~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l----------~lA~~~~~~g~~~~A~~~l~~i~~~~~~ 121 (218) .+-..|.......-+.+.+.++..+++-+.-++.-+..-... .-+.-.+.+|.+.+|.+.|...+.-++ T Consensus 202 n~~al~vrg~~~yy~~~~~ka~~hf~qal~ldpdh~~sk~~~~~~k~le~~k~~gN~~fk~G~y~~A~E~Yteal~idP- 280 (486) T KOG0550 202 NAEALYVRGLCLYYNDNADKAINHFQQALRLDPDHQKSKSASMMPKKLEVKKERGNDAFKNGNYRKAYECYTEALNIDP- 280 (486) T ss_pred HHHHHHHCCCCCCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCC- T ss_conf 1678886153310232068888887652026802566776764078999887505667634424678998877633793- Q ss_pred CHHHHHHHHHH--HHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 95789999999--9996302699899999975311464235689999999999988998999999999983558998999 Q gi|254780421|r 122 PLAVRYAATLQ--AASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGIS 199 (218) Q Consensus 122 ~~~~~~lA~l~--la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r 199 (218) ......|.+. .+...+-.+...+++...+....=++.| .-++..+|..++..+++++|++.|++++...-+ ..+| T Consensus 281 -~n~~~naklY~nra~v~~rLgrl~eaisdc~~Al~iD~sy-ikall~ra~c~l~le~~e~AV~d~~~a~q~~~s-~e~r 357 (486) T KOG0550 281 -SNKKTNAKLYGNRALVNIRLGRLREAISDCNEALKIDSSY-IKALLRRANCHLALEKWEEAVEDYEKAMQLEKD-CEIR 357 (486) T ss_pred -CCCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC-CCHH T ss_conf -2222468999876766420677165665446666347988-899999889888998999999999999863056-1067 Q ss_pred HHHHH Q ss_conf 99999 Q gi|254780421|r 200 TRSQM 204 (218) Q Consensus 200 ~ra~~ 204 (218) +-... T Consensus 358 ~~l~~ 362 (486) T KOG0550 358 RTLRE 362 (486) T ss_pred HHHHH T ss_conf 89999 No 63 >KOG2376 consensus Probab=97.07 E-value=0.021 Score=33.26 Aligned_cols=45 Identities=16% Similarity=0.173 Sum_probs=22.2 Q ss_pred CHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 7666999999973125520699999999999951100577899999 Q gi|254780421|r 68 KLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFL 113 (218) Q Consensus 68 ~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~ 113 (218) ..-.++..+..+.+..+.. +....|.++.+.+.+|++..|+..+. T Consensus 356 ~~~ka~e~L~~~~~~~p~~-s~~v~L~~aQl~is~gn~~~A~~il~ 400 (652) T KOG2376 356 KHKKAIELLLQFADGHPEK-SKVVLLLRAQLKISQGNPEVALEILS 400 (652) T ss_pred HHHHHHHHHHHHHCCCCCH-HHHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 8764499999874158730-39999999999986589889999998 No 64 >KOG2076 consensus Probab=97.06 E-value=0.021 Score=33.23 Aligned_cols=73 Identities=15% Similarity=0.225 Sum_probs=31.0 Q ss_pred HHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 9999963026998999999753114642356899999999999889989999999999835589989999999 Q gi|254780421|r 131 LQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQ 203 (218) Q Consensus 131 l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~ 203 (218) +-++..+++.+.+.+|+..+.++.+....-...-...+|..|...|.+++|.+.|.+++...+..-+.|-+.. T Consensus 418 ~d~a~al~~~~~~~~Al~~l~~i~~~~~~~~~~vw~~~a~c~~~l~e~e~A~e~y~kvl~~~p~~~D~Ri~La 490 (895) T KOG2076 418 LDLADALTNIGKYKEALRLLSPITNREGYQNAFVWYKLARCYMELGEYEEAIEFYEKVLILAPDNLDARITLA 490 (895) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 9999999836558889999999864833355134698999999874289999999999952898601663199 No 65 >PRK10153 DNA-binding transcriptional activator CadC; Provisional Probab=97.00 E-value=0.024 Score=32.88 Aligned_cols=143 Identities=16% Similarity=0.085 Sum_probs=86.6 Q ss_pred HHHHHHHHHHHHHHCCCH---HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH--CCHHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 999999999998413876---66999999973125520699999999999951--1005778999999873126895789 Q gi|254780421|r 52 IVGENFAQALELFNSNKL---DDARSSFEKILSQDNKLYNPLSNMYIASILVA--KGDTKNAAEIFLKVANDDLAPLAVR 126 (218) Q Consensus 52 ~as~~y~~al~~~~~~~~---~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~--~g~~~~A~~~l~~i~~~~~~~~~~~ 126 (218) .+=..|.++......++. ..++..++++++..|+.-..+|..+++...+. ++...+-+..|..-+.....-+... T Consensus 330 ~aL~~~~~A~~~l~~~d~~~l~~~idLl~~~~~~ePdn~Yv~Ae~yi~~~~~~~l~~~~~~~l~~Ln~~l~~~~~~~~~~ 409 (512) T PRK10153 330 AALMLFYQAHHYLNSGDAKSLNKGSDLLEEILKSEPDFTYALAEKALADHVRQSLQPLDEKQLAALNTELDNIVALPGVN 409 (512) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCCC T ss_conf 99999998515663034788999999999999729983899999999999998549888999999999999987332468 Q ss_pred HHHHHH--HHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH Q ss_conf 999999--9996302699899999975311464235689999999999988998999999999983558998 Q gi|254780421|r 127 YAATLQ--AASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPF 196 (218) Q Consensus 127 ~lA~l~--la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~ 196 (218) ..+|+. ++...+.++++++|...|....... ...+..-++|-++.-.|+.+.|.+.|-++..-..++. T Consensus 410 ~~~riyeAlAl~al~~~~~~~A~~~L~~~~~~~--~Svl~YILlgK~aEl~G~~~~A~e~Y~~Af~~~~s~~ 479 (512) T PRK10153 410 VLSRIYEALAVQALVKGDTDEAYQALNKAIELE--MSWLNYVLLGKVAELKGDNRLAADAYSTAFNLRPGEN 479 (512) T ss_pred CCCHHHHHHHHHHHHCCCHHHHHHHHHHCHHHH--HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH T ss_conf 872889999999984799999999998675645--5699999999999964996688999999980388389 No 66 >KOG1840 consensus Probab=97.00 E-value=0.025 Score=32.85 Aligned_cols=126 Identities=21% Similarity=0.187 Sum_probs=66.4 Q ss_pred HCCCHHHHHHHHHHHHH-----CC-CCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC----CCHHHHHHHHHHHH Q ss_conf 13876669999999731-----25-5206999999999999511005778999999873126----89578999999999 Q gi|254780421|r 65 NSNKLDDARSSFEKILS-----QD-NKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDL----APLAVRYAATLQAA 134 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~-----~~-~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~----~~~~~~~lA~l~la 134 (218) +.+++++|+..++.-.. .| .+.-.+.....+|.++...|++++|...++.+++=.. ....--......++ T Consensus 253 ~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~EA~~~~e~Al~I~~~~~~~~~~~v~~~l~~~~ 332 (508) T KOG1840 253 SLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAEAEEYCERALEIYEKLLGASHPEVAAQLSELA 332 (508) T ss_pred HHCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 74428999999999999999853898789999998899998604775778999999999999850368589999998999 Q ss_pred HHHHCCCCHHHHHHHHH-------HHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 96302699899999975-------311464235689999999999988998999999999983 Q gi|254780421|r 135 SILVDTSSYEEISKILQ-------KLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 135 ~~~~~~~~~d~a~~~l~-------~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) .+....+.++++..+++ ....+..+.-+-....+|-.|+..|++.+|.+.|++++. T Consensus 333 ~~~~~~~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~ai~ 395 (508) T KOG1840 333 AILQSMNEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKKAIQ 395 (508) T ss_pred HHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH T ss_conf 999885205689999999999998614641057898998999999994465679999999999 No 67 >pfam10300 IML2 Putative mitochondrial outer membrane protein. This is a family of proteins conserved from fungi to humans. Members of this family also carry a TPR_2 domain pfam07719 at their C-terminus. Many of them are annotated as being mitochondrial outer membrane protein IML2. Probab=96.96 E-value=0.026 Score=32.66 Aligned_cols=121 Identities=17% Similarity=0.178 Sum_probs=95.2 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHH Q ss_conf 13876669999999731255206999999999999511005778999999873126895789999999999630269989 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYE 144 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d 144 (218) ...+...+...++.....+|++ .+=.|..|++..-+|+.++|+..|+..++..+.-.-++.+-.+-++..++-..+++ T Consensus 228 ~~~~~~~a~~lL~~~~~~yP~s--~l~l~~~gR~~~~~g~l~~Ai~~~~~~~~~~~~~kQ~~~lc~~El~w~~~~~~~w~ 305 (446) T pfam10300 228 GEGPLEEAEALLEPSRKRFPNS--ALWLFFEARIESLKGNLDEALELFEECIESQSEWKQIHHLCYWELAWCHVFLHRWK 305 (446) T ss_pred CCCCHHHHHHHHHHHHHHCCCC--HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9988999999999988538994--68999999999973799999999998674465449999999999999999997899 Q ss_pred HHHHHHHHHCCCCCHH-HHHHHHHHHHHHHHCCCHH------HHHHHHHHH Q ss_conf 9999975311464235-6899999999999889989------999999999 Q gi|254780421|r 145 EISKILQKLSEPSNPM-HQFANEILGISALKFGKVQ------KAKTIFEEL 188 (218) Q Consensus 145 ~a~~~l~~~~~~~~~~-~~~a~ellg~~~~~~Gd~~------~A~~~y~~i 188 (218) +|...+..+... +.| ++....+.|..+...++.+ +|.+.+.++ T Consensus 306 ~A~~~~~~L~~~-s~WSka~Y~Y~~a~c~~~~~~~~~~~~~~~a~~l~~~v 355 (446) T pfam10300 306 QAANYFLLLSDE-SKWSKALYAYLAAACLLMLGREEEKAEGEKAVELFREV 355 (446) T ss_pred HHHHHHHHHHHH-CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHC T ss_conf 999999999984-54189999999999998653301056798999999977 No 68 >KOG4340 consensus Probab=96.95 E-value=0.023 Score=33.08 Aligned_cols=128 Identities=15% Similarity=0.143 Sum_probs=96.2 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf 99999999841387666999999973125520699999999999951100577899999987312689578999999999 Q gi|254780421|r 55 ENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAA 134 (218) Q Consensus 55 ~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la 134 (218) -.+..+....+..-+.+|+.....+.+. +.......-..+.+....+|+..+..+.++.......+ ..+..+ T Consensus 80 YrlY~AQSLY~A~i~ADALrV~~~~~D~--~~L~~~~lqLqaAIkYse~Dl~g~rsLveQlp~en~Ad------~~in~g 151 (459) T KOG4340 80 YRLYQAQSLYKACIYADALRVAFLLLDN--PALHSRVLQLQAAIKYSEGDLPGSRSLVEQLPSENEAD------GQINLG 151 (459) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHCCC--HHHHHHHHHHHHHHHCCCCCCCCHHHHHHHCCCCCCHH------HHHCCH T ss_conf 9999999999703528899999995684--98999999999998605455535499998577888502------330301 Q ss_pred HHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 963026998999999753114642356899999999999889989999999999835 Q gi|254780421|r 135 SILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 135 ~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) ++....++++.|+++++.-..-+ -|.+.-..-++.++++.|+++.|.++-.+|++. T Consensus 152 CllykegqyEaAvqkFqaAlqvs-GyqpllAYniALaHy~~~qyasALk~iSEIieR 207 (459) T KOG4340 152 CLLYKEGQYEAAVQKFQAALQVS-GYQPLLAYNLALAHYSSRQYASALKHISEIIER 207 (459) T ss_pred HHEECCCCHHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 20001654899999999987305-767278999999987312077899999999975 No 69 >KOG4234 consensus Probab=96.95 E-value=0.025 Score=32.81 Aligned_cols=101 Identities=17% Similarity=0.102 Sum_probs=74.1 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHH Q ss_conf 99999511005778999999873126-89578999999999963026998999999753114642356899999999999 Q gi|254780421|r 95 IASILVAKGDTKNAAEIFLKVANDDL-APLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISAL 173 (218) Q Consensus 95 lA~~~~~~g~~~~A~~~l~~i~~~~~-~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~ 173 (218) .+.-++.+|++.+|...|...++--+ .+...+.+-...-+...+...+.+.++........-+ |-.--|++.++.+|. T Consensus 101 EGN~~F~ngdyeeA~skY~~Ale~cp~~~~e~rsIly~Nraaa~iKl~k~e~aI~dcsKaiel~-pty~kAl~RRAeaye 179 (271) T KOG4234 101 EGNELFKNGDYEEANSKYQEALESCPSTSTEERSILYSNRAAALIKLRKWESAIEDCSKAIELN-PTYEKALERRAEAYE 179 (271) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC-CHHHHHHHHHHHHHH T ss_conf 7777651340778999999999868663088899997205889987566898999987657508-406799999999988 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCH Q ss_conf 88998999999999983558998 Q gi|254780421|r 174 KFGKVQKAKTIFEELAKDNNSPF 196 (218) Q Consensus 174 ~~Gd~~~A~~~y~~i~~~~~~p~ 196 (218) +..++++|.+.|.+|+...+... T Consensus 180 k~ek~eealeDyKki~E~dPs~~ 202 (271) T KOG4234 180 KMEKYEEALEDYKKILESDPSRR 202 (271) T ss_pred HHHHHHHHHHHHHHHHHHCCCHH T ss_conf 55517888999999987182068 No 70 >KOG3060 consensus Probab=96.84 E-value=0.033 Score=32.06 Aligned_cols=137 Identities=18% Similarity=0.160 Sum_probs=97.6 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHH Q ss_conf 99998413876669999999731255206999999999999511005778999999873126895789999999999630 Q gi|254780421|r 59 QALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILV 138 (218) Q Consensus 59 ~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~ 138 (218) .++..-.++.+++|+..++.++++.|+.-+..- ....+.-..|+..+|+..++..++.=-.|.. |=.-++-+++ T Consensus 92 kam~lEa~~~~~~A~e~y~~lL~ddpt~~v~~K--RKlAilka~GK~l~aIk~ln~YL~~F~~D~E----AW~eLaeiY~ 165 (289) T KOG3060 92 KAMLLEATGNYKEAIEYYESLLEDDPTDTVIRK--RKLAILKAQGKNLEAIKELNEYLDKFMNDQE----AWHELAEIYL 165 (289) T ss_pred HHHHHHHHHCHHHHHHHHHHHHCCCCCHHHHHH--HHHHHHHHCCCCHHHHHHHHHHHHHHCCCHH----HHHHHHHHHH T ss_conf 999999861354599999998446863148999--9999999718857999999999998357699----9999999997 Q ss_pred CCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCC---CHHHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 26998999999753114642356899999999999889---9899999999998355899899999999 Q gi|254780421|r 139 DTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFG---KVQKAKTIFEELAKDNNSPFGISTRSQM 204 (218) Q Consensus 139 ~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~G---d~~~A~~~y~~i~~~~~~p~~~r~ra~~ 204 (218) ..+++++|.--+..+.- ..||.+...-.+|++++-+| +...|+++|.+.++..+ .+.|....+ T Consensus 166 ~~~~f~kA~fClEE~ll-~~P~n~l~f~rlae~~Yt~gg~eN~~~arkyy~~alkl~~--~~~ral~GI 231 (289) T KOG3060 166 SEGDFEKAAFCLEELLL-IQPFNPLYFQRLAEVLYTQGGAENLELARKYYERALKLNP--KNLRALFGI 231 (289) T ss_pred HHHHHHHHHHHHHHHHH-CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH--HHHHHHHHH T ss_conf 67689999999999997-5987189999999999998418889999999999987274--738999999 No 71 >PRK10803 hypothetical protein; Provisional Probab=96.77 E-value=0.014 Score=34.30 Aligned_cols=100 Identities=19% Similarity=0.345 Sum_probs=63.5 Q ss_pred HHHHHHHHH-HCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 999999984-13876669999999731255-2069999999999995110057789999998731268957899999999 Q gi|254780421|r 56 NFAQALELF-NSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQA 133 (218) Q Consensus 56 ~y~~al~~~-~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~l 133 (218) .|..+..++ +.++++.|+..|+.+++.+| +.|++-|+|-++.+++..|++++|+..|..++..-+.. ....-|.+++ T Consensus 144 ~Y~~A~~Lvl~~k~y~~Ai~aF~~Fi~~yP~s~y~~nA~YWLGe~~y~~~~~~~A~~~F~~vv~~yP~s-~K~pdAllkl 222 (262) T PRK10803 144 DYNAAIALVQDKSRQDDAIVAFQNFIKKYPDSTYQPNANYWLGQLNYNKGKKDDAAYYFASVVKNYPKS-PKAADAMFKV 222 (262) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCC-CCCHHHHHHH T ss_conf 899999999973569999999999998787998774309999999997437899999999999867799-8678999999 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCC Q ss_conf 99630269989999997531146 Q gi|254780421|r 134 ASILVDTSSYEEISKILQKLSEP 156 (218) Q Consensus 134 a~~~~~~~~~d~a~~~l~~~~~~ 156 (218) +.++...++.++|...+..+... T Consensus 223 g~~~~~lg~~~~Ak~~l~~vi~~ 245 (262) T PRK10803 223 GVIMQDKGDTAKAKAVYQQVIKK 245 (262) T ss_pred HHHHHHCCCHHHHHHHHHHHHHH T ss_conf 99999859999999999999998 No 72 >COG3071 HemY Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=96.65 E-value=0.046 Score=31.16 Aligned_cols=84 Identities=17% Similarity=0.147 Sum_probs=42.2 Q ss_pred HCCCHHHHHHHHHHHHHC-CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 138766699999997312-5520699999999999951100577899999987312689578999999999963026998 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQ-DNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSY 143 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~-~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~ 143 (218) +-++.+.+-..+.+..+- ++++ ....+..+++...+||++.|.....+..+..+-++.. -++ ..+.++..+++ T Consensus 130 qrgd~~~an~yL~eaae~~~~~~--l~v~ltrarlll~~~d~~aA~~~v~~ll~~~pr~~~v---lrL-a~r~y~~~g~~ 203 (400) T COG3071 130 QRGDEDRANRYLAEAAELAGDDT--LAVELTRARLLLNRRDYPAARENVDQLLEMTPRHPEV---LRL-ALRAYIRLGAW 203 (400) T ss_pred HCCCHHHHHHHHHHHHCCCCCCH--HHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHH---HHH-HHHHHHHHCCH T ss_conf 63057789899999862589940--8899999999986788656898899998728688699---999-99999985118 Q ss_pred HHHHHHHHHHC Q ss_conf 99999975311 Q gi|254780421|r 144 EEISKILQKLS 154 (218) Q Consensus 144 d~a~~~l~~~~ 154 (218) ..+...+..+. T Consensus 204 ~~ll~~l~~L~ 214 (400) T COG3071 204 QALLAILPKLR 214 (400) T ss_pred HHHHHHHHHHH T ss_conf 99999889998 No 73 >KOG1173 consensus Probab=96.63 E-value=0.048 Score=31.08 Aligned_cols=145 Identities=18% Similarity=0.146 Sum_probs=100.5 Q ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH---CCCCH-HHHHHHHHHHHH Q ss_conf 99984138766699999997312552069999999999995110057789999998731---26895-789999999999 Q gi|254780421|r 60 ALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVAND---DLAPL-AVRYAATLQAAS 135 (218) Q Consensus 60 al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~---~~~~~-~~~~lA~l~la~ 135 (218) .++....++...|..-|..-..-.|+ -++..=.++-+.+..+.+.+|+..|+..+.. ..... .+.. -...++. T Consensus 387 gmey~~t~n~kLAe~Ff~~A~ai~P~--Dplv~~Elgvvay~~~~y~~A~~~f~~~l~~ik~~~~e~~~w~p-~~~NLGH 463 (611) T KOG1173 387 GMEYMRTNNLKLAEKFFKQALAIAPS--DPLVLHELGVVAYTYEEYPEALKYFQKALEVIKSVLNEKIFWEP-TLNNLGH 463 (611) T ss_pred HHHHHHHCCHHHHHHHHHHHHHCCCC--CCHHHHHHHHEEEHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHH-HHHHHHH T ss_conf 99998742398999999998711898--60443314420332754688999999998876630543221157-7876789 Q ss_pred HHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 6302699899999975311464235689999999999988998999999999983558998999999999995 Q gi|254780421|r 136 ILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILAN 208 (218) Q Consensus 136 ~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~ 208 (218) .+-..+.+++|+..++....-. |-....+-..|.+|...|+++.|+++|.+.+--.+...-+++-....+.+ T Consensus 464 ~~Rkl~~~~eAI~~~q~aL~l~-~k~~~~~asig~iy~llgnld~Aid~fhKaL~l~p~n~~~~~lL~~aie~ 535 (611) T KOG1173 464 AYRKLNKYEEAIDYYQKALLLS-PKDASTHASIGYIYHLLGNLDKAIDHFHKALALKPDNIFISELLKLAIED 535 (611) T ss_pred HHHHHHHHHHHHHHHHHHHHCC-CCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHH T ss_conf 9999766799999999998718-77656888899999996580779999999871487528999999999975 No 74 >pfam04348 LppC LppC putative lipoprotein. This family includes several bacterial outer membrane antigens, whose molecular function is unknown. Probab=96.48 E-value=0.059 Score=30.50 Aligned_cols=101 Identities=11% Similarity=0.033 Sum_probs=74.2 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC--CHHHHHHH Q ss_conf 9999999999995110057789999998731268957899999999996302699899999975311464--23568999 Q gi|254780421|r 88 NPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPS--NPMHQFAN 165 (218) Q Consensus 88 a~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~--~~~~~~a~ 165 (218) ..--.+..|++++.+|+...|...+..+.. ...++..+.-..+..+.+.+.+++++.|+..|....... .......+ T Consensus 23 ~~~~~L~Aa~a~i~eg~~~qA~~ll~~l~~-~~L~~~q~~e~~Ll~A~l~~~~~~~~~Al~~L~~~~~~~L~~~q~~r~~ 101 (535) T pfam04348 23 QNTYKLLAARVLIEENKIAQAQALLRELSV-QQLTDEQKLDKSLLEAELAAAKRQNEVALQQLSHQHLSGLSPSQQSRYY 101 (535) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCCCCHHHHHHHH T ss_conf 999999999999977999999999997404-1199999999999999999975999999998557981107999999999 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 999999998899899999999998 Q gi|254780421|r 166 EILGISALKFGKVQKAKTIFEELA 189 (218) Q Consensus 166 ellg~~~~~~Gd~~~A~~~y~~i~ 189 (218) ++++.++...|+.-+|...+-.+. T Consensus 102 ~l~A~~~~~~~~~l~a~r~~i~l~ 125 (535) T pfam04348 102 EGRARIAEARGDAIEAAKARIQMD 125 (535) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 999999997698699999999862 No 75 >COG3898 Uncharacterized membrane-bound protein [Function unknown] Probab=96.48 E-value=0.059 Score=30.49 Aligned_cols=151 Identities=15% Similarity=0.121 Sum_probs=96.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHH Q ss_conf 99999999999999999984138766699999997312552069999999999995110057789999998731268957 Q gi|254780421|r 45 SHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLA 124 (218) Q Consensus 45 ~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~ 124 (218) ....+...........+...-.|+...|.+.-.+-.+.-++.-.+|.++..|....-.|++++|...|+.-+.+ ++ T Consensus 76 fr~rKRdrgyqALStGliAagAGda~lARkmt~~~~~llssDqepLIhlLeAQaal~eG~~~~Ar~kfeAMl~d----PE 151 (531) T COG3898 76 FRERKRDRGYQALSTGLIAAGAGDASLARKMTARASKLLSSDQEPLIHLLEAQAALLEGDYEDARKKFEAMLDD----PE 151 (531) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCC----HH T ss_conf 99987644788875412442167568999999998765424645899999988988647618799999987368----38 Q ss_pred HHHHHHHHHHHHHHC-CCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 899999999996302-69989999997531146423568999999999998899899999999998355899899999 Q gi|254780421|r 125 VRYAATLQAASILVD-TSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTR 201 (218) Q Consensus 125 ~~~lA~l~la~~~~~-~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~r 201 (218) .+. .-+|--|+..+ .+..+.+..+-..-.. --|....+....-......||.+.|.+..+.-....-...+.-.| T Consensus 152 tRl-lGLRgLyleAqr~GareaAr~yAe~Aa~-~Ap~l~WA~~AtLe~r~~~gdWd~AlkLvd~~~~~~vie~~~aeR 227 (531) T COG3898 152 TRL-LGLRGLYLEAQRLGAREAARHYAERAAE-KAPQLPWAARATLEARCAAGDWDGALKLVDAQRAAKVIEKDVAER 227 (531) T ss_pred HHH-HHHHHHHHHHHHCCCHHHHHHHHHHHHH-HCCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCHHHHHH T ss_conf 877-7678899999860218999999999876-366771679999999986678688999999888877526116778 No 76 >KOG1125 consensus Probab=96.42 E-value=0.064 Score=30.29 Aligned_cols=143 Identities=11% Similarity=0.095 Sum_probs=78.1 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC-------------------- Q ss_conf 984138766699999997312552069999999999995110057789999998731268-------------------- Q gi|254780421|r 62 ELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLA-------------------- 121 (218) Q Consensus 62 ~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~-------------------- 121 (218) ...++++-..++..|.+-++-.++.. -|.+-+|-.+..+|--..|...|..|+..+.. T Consensus 328 ~qaENE~E~~ai~AL~rcl~LdP~Nl--eaLmaLAVSytNeg~q~~Al~~L~~Wi~~~p~y~~l~~a~~~~~~~~~~s~~ 405 (579) T KOG1125 328 TQAENENEQNAISALRRCLELDPTNL--EALMALAVSYTNEGLQNQALKMLDKWIRNKPKYVHLVSAGENEDFENTKSFL 405 (579) T ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHCCCCCCCCCCCCCCCCC T ss_conf 76503420889999999984699519--9999988888614668889999999998486211114667665666776778 Q ss_pred -CHHHHHHHHH------------------HHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHH Q ss_conf -9578999999------------------999963026998999999753114642356899999999999889989999 Q gi|254780421|r 122 -PLAVRYAATL------------------QAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAK 182 (218) Q Consensus 122 -~~~~~~lA~l------------------~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~ 182 (218) ...+..+..+ .++.++--.+++|.++.-++...... |-.++-+--+|...-.-....+|+ T Consensus 406 ~~~~l~~i~~~fLeaa~~~~~~~DpdvQ~~LGVLy~ls~efdraiDcf~~AL~v~-Pnd~~lWNRLGAtLAN~~~s~EAI 484 (579) T KOG1125 406 DSSHLAHIQELFLEAARQLPTKIDPDVQSGLGVLYNLSGEFDRAVDCFEAALQVK-PNDYLLWNRLGATLANGNRSEEAI 484 (579) T ss_pred CHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHCCCCCCHHHH T ss_conf 8789999999999999857788884277626789740147888999999997049-841778988668860775539999 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 999999835589989999999999950 Q gi|254780421|r 183 TIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 183 ~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) +.|++++... |..+|.|.++-++.+ T Consensus 485 sAY~rALqLq--P~yVR~RyNlgIS~m 509 (579) T KOG1125 485 SAYNRALQLQ--PGYVRVRYNLGISCM 509 (579) T ss_pred HHHHHHHHCC--CCEEEEEHHHHHHHH T ss_conf 9999998418--872653010145662 No 77 >KOG2003 consensus Probab=96.25 E-value=0.08 Score=29.67 Aligned_cols=175 Identities=15% Similarity=0.092 Sum_probs=84.6 Q ss_pred HHHH-HHHHHHHHH-----HHHHHHHHHH----HHHHHHHHH-HHHHHHHHHHHH--HHHHHHHHHHHHCCCHHHHHHHH Q ss_conf 8899-999999751-----9999999999----999999999-999999999999--99999999984138766699999 Q gi|254780421|r 10 YKGK-CKMGKCCYK-----WIAPLLILMI----LSLAIWFYL-FDGSHEKKKNIV--GENFAQALELFNSNKLDDARSSF 76 (218) Q Consensus 10 ~~~~-~~~~~~~yk-----~ii~~~i~~i----~~~~~~~~~-~~~~q~~~~~~a--s~~y~~al~~~~~~~~~~A~~~l 76 (218) .|.- +|.|.+..| +||-++-++. --+..|..| ......++.... --....+.....+++++.|+..+ T Consensus 363 i~nd~lk~~ek~~ka~aek~i~ta~kiiapvi~~~fa~g~dwcle~lk~s~~~~la~dlei~ka~~~lk~~d~~~aieil 442 (840) T KOG2003 363 IKNDHLKNMEKENKADAEKAIITAAKIIAPVIAPDFAAGCDWCLESLKASQHAELAIDLEINKAGELLKNGDIEGAIEIL 442 (840) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 75689998987524447888998888741302532000328999998875312355654346888888626778899999 Q ss_pred HHHHHCCCCCHHHHHHHHH-----------------------------------HHHHHHCCHHHHHHHHHHHHHHHCCC Q ss_conf 9973125520699999999-----------------------------------99995110057789999998731268 Q gi|254780421|r 77 EKILSQDNKLYNPLSNMYI-----------------------------------ASILVAKGDTKNAAEIFLKVANDDLA 121 (218) Q Consensus 77 ~~l~~~~~~~ya~lA~l~l-----------------------------------A~~~~~~g~~~~A~~~l~~i~~~~~~ 121 (218) +-+.+..+.+.+..|.=.- +.+.+.+||+++|.+.|+..+++.++ T Consensus 443 kv~~~kdnk~~saaa~nl~~l~flqggk~~~~aqqyad~aln~dryn~~a~~nkgn~~f~ngd~dka~~~ykeal~ndas 522 (840) T KOG2003 443 KVFEKKDNKTASAAANNLCALRFLQGGKDFADAQQYADIALNIDRYNAAALTNKGNIAFANGDLDKAAEFYKEALNNDAS 522 (840) T ss_pred HHHHHCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHCCCCEEEECCCHHHHHHHHHHHHCCCHH T ss_conf 99873231256777521689999862610667787778774202247777506775444158689999999999717428 Q ss_pred CHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 95789999999999630269989999997531146423568999999999998899899999999998 Q gi|254780421|r 122 PLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELA 189 (218) Q Consensus 122 ~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~ 189 (218) -. -|.+..+..+-..++.++++.-+-.+..-- -...-.+..++.+|....|...|++.|-++- T Consensus 523 --c~--ealfniglt~e~~~~ldeald~f~klh~il-~nn~evl~qianiye~led~aqaie~~~q~~ 585 (840) T KOG2003 523 --CT--EALFNIGLTAEALGNLDEALDCFLKLHAIL-LNNAEVLVQIANIYELLEDPAQAIELLMQAN 585 (840) T ss_pred --HH--HHHHHHCCCHHHHCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHCHHHHHHHHHHHC T ss_conf --89--999971123987447899999999999999-7029999999999998548799999999850 No 78 >COG5010 TadD Flp pilus assembly protein TadD, contains TPR repeats [Intracellular trafficking and secretion] Probab=96.24 E-value=0.081 Score=29.65 Aligned_cols=133 Identities=15% Similarity=0.121 Sum_probs=85.7 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 38766699999997312552069999999999995110057789999998731268957899999999996302699899 Q gi|254780421|r 66 SNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEE 145 (218) Q Consensus 66 ~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~ 145 (218) .++.+.....+.++...+++--..+.. +++.++.+|++..|+..+.....-.+.+. + +.-.++.++...++.++ T Consensus 79 ~G~a~~~l~~~~~~~~~~~~d~~ll~~--~gk~~~~~g~~~~A~~~~rkA~~l~p~d~--~--~~~~lgaaldq~Gr~~~ 152 (257) T COG5010 79 RGDADSSLAVLQKSAIAYPKDRELLAA--QGKNQIRNGNFGEAVSVLRKAARLAPTDW--E--AWNLLGAALDQLGRFDE 152 (257) T ss_pred CCCCCCHHHHHHHHHCCCCCCHHHHHH--HHHHHHHHCCHHHHHHHHHHHHCCCCCCH--H--HHHHHHHHHHHCCCHHH T ss_conf 055550078875540147454999999--99999980456889999999843699974--6--66677899997367567 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 999975311464235689999999999988998999999999983558998999999999 Q gi|254780421|r 146 ISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMI 205 (218) Q Consensus 146 a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~ 205 (218) |...+.....- .+-.....--+|..++-.||++.|+..+.....++..++.+++-.-.. T Consensus 153 Ar~ay~qAl~L-~~~~p~~~nNlgms~~L~gd~~~A~~lll~a~l~~~ad~~v~~NLAl~ 211 (257) T COG5010 153 ARRAYRQALEL-APNEPSIANNLGMSLLLRGDLEDAETLLLPAYLSPAADSRVRQNLALV 211 (257) T ss_pred HHHHHHHHHHH-CCCCCHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHH T ss_conf 78999999984-258844666589999972779899999999986788763899989999 No 79 >KOG1126 consensus Probab=96.18 E-value=0.087 Score=29.46 Aligned_cols=118 Identities=20% Similarity=0.110 Sum_probs=80.0 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999999999951100577899999987312689578999999999963026998999999753114642356899999 Q gi|254780421|r 88 NPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEI 167 (218) Q Consensus 88 a~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~el 167 (218) ++-+.--++..+--++|.+.|+.-++..+.-+ +.+. .|--.++.-...+..+|.|.+.+..-..-+ |-.+-|+.. T Consensus 420 sPesWca~GNcfSLQkdh~~Aik~f~RAiQld---p~fa-YayTLlGhE~~~~ee~d~a~~~fr~Al~~~-~rhYnAwYG 494 (638) T KOG1126 420 SPESWCALGNCFSLQKDHDTAIKCFKRAIQLD---PRFA-YAYTLLGHESIATEEFDKAMKSFRKALGVD-PRHYNAWYG 494 (638) T ss_pred CCHHHHHHCCHHHHHHHHHHHHHHHHHHHCCC---CCCC-HHHHHCCCHHHHHHHHHHHHHHHHHHHCCC-CHHHHHHHH T ss_conf 70778774025666567999999887765468---7632-123530773454577776778777651578-056678886 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC Q ss_conf 9999998899899999999998355899899999999999500 Q gi|254780421|r 168 LGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANII 210 (218) Q Consensus 168 lg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~ 210 (218) +|.+|+++++++.|.-+|+++++-.+.+.-+---+-++...++ T Consensus 495 lG~vy~Kqek~e~Ae~~fqkA~~INP~nsvi~~~~g~~~~~~k 537 (638) T KOG1126 495 LGTVYLKQEKLEFAEFHFQKAVEINPSNSVILCHIGRIQHQLK 537 (638) T ss_pred HHHHEECCCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 2030220410467899887642279610357765259999860 No 80 >KOG0543 consensus Probab=96.05 E-value=0.1 Score=29.07 Aligned_cols=101 Identities=16% Similarity=0.130 Sum_probs=75.6 Q ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999999999951100577899999987312689578999999999963026998999999753114642356899999 Q gi|254780421|r 88 NPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEI 167 (218) Q Consensus 88 a~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~el 167 (218) ...+++.+|..+...+++.+|+..-+.++...+.+ .+ |.||-+..++..++++.|...++....-.+..+....++ T Consensus 256 k~~~~lNlA~c~lKl~~~~~Ai~~c~kvLe~~~~N--~K--ALyRrG~A~l~~~e~~~A~~df~ka~k~~P~Nka~~~el 331 (397) T KOG0543 256 KLACHLNLAACYLKLKEYKEAIESCNKVLELDPNN--VK--ALYRRGQALLALGEYDLARDDFQKALKLEPSNKAARAEL 331 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC--HH--HHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCHHHHHHH T ss_conf 87776479988874121889999999988458996--26--777878898864209899999999997488868999999 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 9999998899899999999998355 Q gi|254780421|r 168 LGISALKFGKVQKAKTIFEELAKDN 192 (218) Q Consensus 168 lg~~~~~~Gd~~~A~~~y~~i~~~~ 192 (218) .-..--..--.+..++.|..+.... T Consensus 332 ~~l~~k~~~~~~kekk~y~~mF~k~ 356 (397) T KOG0543 332 IKLKQKIREYEEKEKKMYANMFAKL 356 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999999988999999985002 No 81 >KOG0547 consensus Probab=95.94 E-value=0.11 Score=28.76 Aligned_cols=87 Identities=13% Similarity=-0.011 Sum_probs=37.7 Q ss_pred HHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHC Q ss_conf 99995110057789999998731268957899999999996302699899999975311464235689999999999988 Q gi|254780421|r 96 ASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKF 175 (218) Q Consensus 96 A~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~ 175 (218) +..++.+|++++||..|.+.|+-.+.++++- -..+..+...++++++++.-..-.+- .|-..-++..++.++... T Consensus 122 GN~~f~~kkY~eAIkyY~~AI~l~p~epiFY----sNraAcY~~lgd~~~Vied~TkALEl-~P~Y~KAl~RRA~A~E~l 196 (606) T KOG0547 122 GNKFFRNKKYDEAIKYYTQAIELCPDEPIFY----SNRAACYESLGDWEKVIEDCTKALEL-NPDYVKALLRRASAHEQL 196 (606) T ss_pred HHHHHHCCCHHHHHHHHHHHHHCCCCCCHHH----HHHHHHHHHHHHHHHHHHHHHHHHHC-CCHHHHHHHHHHHHHHHH T ss_conf 0455302538889999999985089976133----10899999985099999987877624-908899999987887753 Q ss_pred CCHHHHHHHHHH Q ss_conf 998999999999 Q gi|254780421|r 176 GKVQKAKTIFEE 187 (218) Q Consensus 176 Gd~~~A~~~y~~ 187 (218) |++++|...... T Consensus 197 g~~~eal~D~tv 208 (606) T KOG0547 197 GKFDEALFDVTV 208 (606) T ss_pred CCHHHHHHHHHH T ss_conf 238889886569 No 82 >CHL00033 ycf3 photosystem I assembly protein Ycf3 Probab=95.91 E-value=0.12 Score=28.70 Aligned_cols=99 Identities=11% Similarity=0.165 Sum_probs=65.2 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHH-HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC-CCHHHHHHH Q ss_conf 9999999999984138766699999997312552069-99999999999511005778999999873126-895789999 Q gi|254780421|r 52 IVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYN-PLSNMYIASILVAKGDTKNAAEIFLKVANDDL-APLAVRYAA 129 (218) Q Consensus 52 ~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya-~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~-~~~~~~~lA 129 (218) ++=..|-..|..-+.|++.+|+.-..+-+.-..+.|. .-..+.++-++..+|++++|++.|.+.++-++ .+..+..++ T Consensus 30 ~af~yyr~GmsaQs~GeyaEALe~Y~eAL~Le~D~~Drs~iLYNIGlIy~~nGe~~kALeyY~qALeiNP~~pqA~NNlG 109 (165) T CHL00033 30 EAFTYYRDGMSAQSEGEYAEALQNYYEAMRLEIDPYDRSYILYNIGLIHTSNGEHTKALEYYFQALERNPFLPQALNNMA 109 (165) T ss_pred HHHHHHHCCCHHHCCCCHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHH T ss_conf 89999981510101353899999999862345685126689989999999858888999999999985999689997899 Q ss_pred HHH--HHHHHHCCCCHHHHHHHH Q ss_conf 999--999630269989999997 Q gi|254780421|r 130 TLQ--AASILVDTSSYEEISKIL 150 (218) Q Consensus 130 ~l~--la~~~~~~~~~d~a~~~l 150 (218) .+. .+....++++.+.+...+ T Consensus 110 vIyh~qG~~a~~~~~~~~~~~~f 132 (165) T CHL00033 110 VICHYRGEQAIEQGDSEIAEAWF 132 (165) T ss_pred HHHHHHHHHHHHCCCHHHHHHHH T ss_conf 99999879999845867899999 No 83 >pfam10140 essB Predicted membrane protein essB. Members of this family of prokaryotic proteins include the virulence factor essB, which is required for the synthesis and secretion of EsxA and EsxB, both ESAT-6 like proteins. Probab=95.90 E-value=0.12 Score=28.67 Aligned_cols=129 Identities=16% Similarity=0.126 Sum_probs=76.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 99999751999999999999999999999999999999999999999984138766699999997312552069999999 Q gi|254780421|r 15 KMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMY 94 (218) Q Consensus 15 ~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~ 94 (218) |-..+.+||+.++++++.+.++++..|+-+.....-+ .+.++-..+-+.++...+..++.. .+.....-+++. T Consensus 188 Kkk~~~~K~~~i~~~~l~v~l~~~~~Y~~f~~~P~q~----~i~~a~~~fi~~dYs~VI~~L~~~---~~~~lp~~~kY~ 260 (359) T pfam10140 188 KKKWKIFKYLGIGLIALLVPLIIPLIYLLFFAQPKQE----AILEANTAFLNVDYSKVITTLEKY---DPESLPKSVKYE 260 (359) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHHH----HHHHHHHHHHHCCHHHHHHHHHHC---CHHHCCHHHHHH T ss_conf 5468999999999999999999999999998170488----999999999862589999876207---932499777778 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC Q ss_conf 99999511005778999999873126895789999999999630269989999997531146 Q gi|254780421|r 95 IASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEP 156 (218) Q Consensus 95 lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~ 156 (218) +|..++...+..+... +.+.++-+.. .=-.+.+-.++++.|+.++|+..-..+.++ T Consensus 261 LA~SyV~~e~Ls~~QK--~~Ilnnitlk----Sde~yLlYWIyiGrg~~eeAldiA~~L~D~ 316 (359) T pfam10140 261 LAYSYVQVEKLSDDQK--ENILNNVTLK----SDELYLLYWIYIGRGEFEEALDIARRLDDN 316 (359) T ss_pred HHHHHHHCCCCCHHHH--HHHHHCCCCC----CCHHHHHHHHHHCCCCHHHHHHHHHHCCCH T ss_conf 9899996166679999--9998307756----778999999997687649999999847978 No 84 >pfam04733 Coatomer_E Coatomer epsilon subunit. This family represents the epsilon subunit of the coatomer complex, which is involved in the regulation of intracellular protein trafficking between the endoplasmic reticulum and the Golgi complex. Probab=95.87 E-value=0.12 Score=28.58 Aligned_cols=145 Identities=14% Similarity=0.186 Sum_probs=94.0 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH Q ss_conf 99999998413876669999999731255206999999999999511005778999999873126895789999999999 Q gi|254780421|r 56 NFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAAS 135 (218) Q Consensus 56 ~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~ 135 (218) ....+......|++++|+..+.. +. ... +......++...++.+-|...++..- +-+.|.++..+|.- +.. T Consensus 105 ~~~aati~~~~g~~e~AL~~l~~----~~-~lE--~~al~VQi~L~~nR~DLA~ke~~~~q-~~~eDs~l~qLa~a-wv~ 175 (290) T pfam04733 105 RLLAAIIFIHEGDFDDALKHLHK----GE-NLE--AMALNVQILLKMHRIDLAEQQLKKMQ-QIDEDATLTQLANA-WVK 175 (290) T ss_pred HHHHHHHHHHCCCHHHHHHHHHC----CC-CHH--HHHHHHHHHHHCCCHHHHHHHHHHHH-HCCCCHHHHHHHHH-HHH T ss_conf 99999999977999999999855----89-667--99999999997288367999999988-43510599999999-999 Q ss_pred HHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH-CCCCCHHHHHHHHHHH-HHHCCCH Q ss_conf 6302699899999975311464235689999999999988998999999999983-5589989999999999-9500340 Q gi|254780421|r 136 ILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK-DNNSPFGISTRSQMIL-ANIIASD 213 (218) Q Consensus 136 ~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~-~~~~p~~~r~ra~~~l-~~l~~~~ 213 (218) +.....++.++.-.++.+....++ .++-+-.++..+..+|++++|....+++++ +++-|+.+ ++++. ....|.+ T Consensus 176 l~~Ggek~q~A~yif~EL~~~~~~-t~lllng~Av~~m~~~~~~eAe~~L~eAl~~d~~d~dtl---aNliv~s~~~gK~ 251 (290) T pfam04733 176 LAVGGEKIQDAYYIFQEFSEKYDS-TPLLLNGQAVCCMCLGRYEEAESLLKEALDKDAKDPETL---INLVVCALHLGKP 251 (290) T ss_pred HHHCCHHHHHHHHHHHHHHHHCCC-CHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHH---HHHHHHHHHHCCC T ss_conf 972647789899999999852289-789999999999874787999999999987288988999---9999999980998 No 85 >KOG2003 consensus Probab=95.84 E-value=0.12 Score=28.49 Aligned_cols=94 Identities=18% Similarity=0.186 Sum_probs=38.7 Q ss_pred HHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHC--CCCHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999995110057789999998731268957899999999996302--6998999999753114642356899999999 Q gi|254780421|r 93 MYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVD--TSSYEEISKILQKLSEPSNPMHQFANEILGI 170 (218) Q Consensus 93 l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~--~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~ 170 (218) +..|..++.+||++.|++.++-+-... .-....|.-.++.+.+- ..++.++-.+- ++.-....|.+.++--.|- T Consensus 423 i~ka~~~lk~~d~~~aieilkv~~~kd---nk~~saaa~nl~~l~flqggk~~~~aqqya-d~aln~dryn~~a~~nkgn 498 (840) T KOG2003 423 INKAGELLKNGDIEGAIEILKVFEKKD---NKTASAAANNLCALRFLQGGKDFADAQQYA-DIALNIDRYNAAALTNKGN 498 (840) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCC---CHHHHHHHHHHHHHHHHHCCCCHHHHHHHH-HHHHCCCCCCHHHHHCCCC T ss_conf 468888886267788999999987323---125677752168999986261066778777-8774202247777506775 Q ss_pred HHHHCCCHHHHHHHHHHHHH Q ss_conf 99988998999999999983 Q gi|254780421|r 171 SALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 171 ~~~~~Gd~~~A~~~y~~i~~ 190 (218) +.+..||+++|.+.|.++++ T Consensus 499 ~~f~ngd~dka~~~ykeal~ 518 (840) T KOG2003 499 IAFANGDLDKAAEFYKEALN 518 (840) T ss_pred EEEECCCHHHHHHHHHHHHC T ss_conf 44415868999999999971 No 86 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=95.80 E-value=0.13 Score=28.40 Aligned_cols=184 Identities=10% Similarity=-0.064 Sum_probs=104.9 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHH-HHHHH- Q ss_conf 124-5889999999975199999999999999999999999999------------------99999999999-99984- Q gi|254780421|r 6 DAN-SYKGKCKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEK------------------KKNIVGENFAQ-ALELF- 64 (218) Q Consensus 6 ~~~-~~~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~------------------~~~~as~~y~~-al~~~- 64 (218) -.+ |.-.|+++|-+..|.+|..++.+.+.++|..++|...+.+ ++++.++.-.. .-... T Consensus 21 P~~ss~WDr~~~~~~~~K~~v~~~~~~~~~~~gA~~~Y~~~~~~~~~~~~~~~~K~E~~~~K~~~K~~~~g~~~~~~~~~ 100 (649) T TIGR00990 21 PADSSLWDRVSNFISENKTVVYTIAGVTVVVIGAGVVYYLRKGQQQRESKPKISKKERRKRKQAEKETEGGKAEEKKSTA 100 (649) T ss_pred CCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCCCHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 77521788788886412057666667888876666787653050330357888613343222455542111345530157 Q ss_pred ----------HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHH Q ss_conf ----------1387666999999973125520699999999999951100577899999987312689578999999999 Q gi|254780421|r 65 ----------NSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAA 134 (218) Q Consensus 65 ----------~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la 134 (218) +.+..|..- --.+..+....|+. +.=..+.-.+.+.++++||..|.|.+.-. .||++- ... + T Consensus 101 p~~~~v~~~~~~~E~D~~~--~~~l~~~~~~~YA~-~LK~~GN~~y~~K~f~~AI~~Y~~A~~~K-~DPVyY--sN~--~ 172 (649) T TIGR00990 101 PKNAPVEPADELPEIDESS--VVNLSEEERKKYAA-KLKEKGNKAYRNKDFNKAIKLYSKAILLK-PDPVYY--SNR--A 172 (649) T ss_pred CCCCCCCCCCCCCCCCHHH--HHHCCHHHHHHHHH-HHHHHHHHHHCCCCHHHHHHHHHHHHCCC-CCCCCC--CHH--H T ss_conf 8887634310267877256--53047878999999-98974456640578768999977640037-897521--016--8 Q ss_pred HHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH-HHCCCCCHHH Q ss_conf 963026998999999753114642356899999999999889989999999999-8355899899 Q gi|254780421|r 135 SILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEEL-AKDNNSPFGI 198 (218) Q Consensus 135 ~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i-~~~~~~p~~~ 198 (218) .++...++.++++..-..-..-. |-..-++..++.++-..|++..|.-.+... +.+.--...+ T Consensus 173 AC~~~~~~~~kVV~~~T~AL~l~-p~Y~K~L~RRa~A~~~Lgky~~AllD~~~~~~~~~F~~~~~ 236 (649) T TIGR00990 173 ACYNALGDLEKVVEDTTAALELD-PDYVKALNRRANAYDGLGKYADALLDLTASCIIDGFRNEQS 236 (649) T ss_pred HHHHHHHHHHHHHHHHHHHHHCC-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHH T ss_conf 99998999999999878974007-04788886554343113479999997889876234022247 No 87 >pfam09295 ChAPs ChAPs (Chs5p-Arf1p-binding proteins). ChAPs (Chs5p-Arf1p-binding proteins) are required for the export of specialized cargo from the Golgi. They physically interact with Chs3, Chs5 and the small GTPase Arf1, and they form also interactions with each other. Probab=95.75 E-value=0.13 Score=28.29 Aligned_cols=113 Identities=13% Similarity=0.156 Sum_probs=81.3 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 38766699999997312552069999999999995110057789999998731268957899999999996302699899 Q gi|254780421|r 66 SNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEE 145 (218) Q Consensus 66 ~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~ 145 (218) ++.++.++..|+++....+.- ...+|++++..++..+|+..+...+.+..-+..+ ..-.+..+++.++++- T Consensus 182 t~~~~~~~~~le~L~~~~p~v-----~~llakv~~~~~~E~~av~~i~~~L~~~p~d~~L----L~~Qa~FLl~K~~yel 252 (395) T pfam09295 182 TKRYDRALNLLEKLREEEPEV-----VSLLARVLLLMDEEIDAVKLMHDALQENPMDYEL----LCLQAEFLLSKKKDEL 252 (395) T ss_pred CCCHHHHHHHHHHHHHCCCHH-----HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH----HHHHHHHHHHCCCHHH T ss_conf 443898999999987518449-----9999999996560899999999998618876789----9999999981798899 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 9999753114642356899999999999889989999999999 Q gi|254780421|r 146 ISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEEL 188 (218) Q Consensus 146 a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i 188 (218) |+..-..... -.|-....++.++.+|.+.||++.|.-..+.. T Consensus 253 AL~~A~~av~-~~P~~F~~W~~La~~Yi~l~d~e~ALl~LNS~ 294 (395) T pfam09295 253 ALECAKRAVN-SAPSEFKTWALLVKVYIKLEDWENALLTLNSC 294 (395) T ss_pred HHHHHHHHHH-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9999999873-17866188999999999863299999999568 No 88 >COG4105 ComL DNA uptake lipoprotein [General function prediction only] Probab=95.45 E-value=0.17 Score=27.59 Aligned_cols=123 Identities=11% Similarity=0.053 Sum_probs=66.2 Q ss_pred HHCCCHHHHHHHHHHHHHCCC-CCHHHHHHHHHHHHHH--------HCCHHHHHHHHHHHHHHHCCCCHHHHHHHHH--- Q ss_conf 413876669999999731255-2069999999999995--------1100577899999987312689578999999--- Q gi|254780421|r 64 FNSNKLDDARSSFEKILSQDN-KLYNPLSNMYIASILV--------AKGDTKNAAEIFLKVANDDLAPLAVRYAATL--- 131 (218) Q Consensus 64 ~~~~~~~~A~~~l~~l~~~~~-~~ya~lA~l~lA~~~~--------~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l--- 131 (218) ..+++++.|+..+++++..+| +.=..-+.+..+-.++ ++.-...|+..++.++.+-+ ++.+..-|.. T Consensus 82 Yk~~~y~~A~~~~drFi~lyP~~~n~dY~~YlkgLs~~~~i~~~~rDq~~~~~A~~~f~~~i~ryP-nS~Ya~dA~~~i~ 160 (254) T COG4105 82 YKNGEYDLALAYIDRFIRLYPTHPNADYAYYLKGLSYFFQIDDVTRDQSAARAAFAAFKELVQRYP-NSRYAPDAKARIV 160 (254) T ss_pred HHCCCHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHCC-CCCCHHHHHHHHH T ss_conf 813558999999999998789998716999999998752677544477999999999999998788-9702355999999 Q ss_pred -----------HHHHHHHCCCCHHHHHHHHHHHCCC--CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf -----------9999630269989999997531146--4235689999999999988998999999999 Q gi|254780421|r 132 -----------QAASILVDTSSYEEISKILQKLSEP--SNPMHQFANEILGISALKFGKVQKAKTIFEE 187 (218) Q Consensus 132 -----------~la~~~~~~~~~d~a~~~l~~~~~~--~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~ 187 (218) ..+..++..+.+..|+.+++.+... ..+..--++..+..+|...|-.++|.+.=.- T Consensus 161 ~~~d~LA~~Em~IaryY~kr~~~~AA~nR~~~v~e~y~~t~~~~eaL~~l~eaY~~lgl~~~a~~~~~v 229 (254) T COG4105 161 KLNDALAGHEMAIARYYLKRGAYVAAINRFEEVLENYPDTSAVREALARLEEAYYALGLTDEAKKTAKV 229 (254) T ss_pred HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCHHHHHHHHHH T ss_conf 999998899999999999905769999999999871455520477899999999982872789999999 No 89 >TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD; InterPro: IPR005415 The type III secretion system of Gram-negative bacteria is used to transport virulence factors from the pathogen directly into the host cell and is only triggered when the bacterium comes into close contact with the host. Effector proteins secreted by the type III system do not possess a secretion signal, and are considered unique because of this. Yersinia spp. secrete effector proteins called YopB and YopD that facilitate the spread of other translocated proteins through the type III needle and the host cell cytoplasm . Both are believed to act as pore translocases, forming apertures in the host cell membrane and allowing the bacterium easy access to its cytoplasm. YopD also acts as a negative regulator of the Yersinia low-calcium response, and in turn is controlled by a chaperone, SycD . This protein also regulates YopB secretion. SycD is located on the Yop pathogenicity island of Yersinia spp., and is speculated to prevent a premature interaction between YopB, YopD and the calcium-response LcrV protein . It has been speculated that a type III secretion mechanism also exists in Chlamydial species. With the sequencing of the Chlamydia trachomatis genome, several proteins similar to characterised type III proteins have emerged, including a SycD homologue . The Pseudomonas aeruginosa gene PcrH is also similar to the Yersinia chaperone, suggesting a comparable function. Proteins in this entry are found in type III secretion operons. LcrH, from Yersinia is believed to have a regulatory function in the low-calcium response of the secretion system ,. The same protein is also known as SycD (SYC = Specific Yop Chaperone) for its chaperone role . In Pseudomonas, where the homolog is known as PcrH, the chaperone role has been demonstrated and the regulatory role appears to be absent . ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array .; GO: 0006950 response to stress. Probab=95.41 E-value=0.16 Score=27.87 Aligned_cols=96 Identities=17% Similarity=0.125 Sum_probs=71.7 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHH Q ss_conf 99999999999984138766699999997312552069999999999995110057789999998731268957899999 Q gi|254780421|r 51 NIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAAT 130 (218) Q Consensus 51 ~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~ 130 (218) +.....|..|-...+.|++++|...|.-|.--.+..+. -.+-+|......+++++|+..|.-...=. .+.+ ..- T Consensus 16 ~~L~~~Y~~AY~~y~~G~Y~~A~~~F~~L~~~~~~~~~--Y~~gLa~c~q~~~~Y~~A~~~Y~~a~~l~-~~~P---~p~ 89 (137) T TIGR02552 16 EQLEAIYALAYNLYQQGRYDEALKLFQLLAAYDPWNAD--YWLGLAACCQMLKEYEEAIDAYALAAALD-PDDP---RPY 89 (137) T ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCHHHH--HHHHHHHHHHHHHCHHHHHHHHHHHHHCC-CCCC---CCH T ss_conf 89999999999998536679999999999997430368--99999999998616789999999997527-6869---512 Q ss_pred HHHHHHHHCCCCHHHHHHHHHH Q ss_conf 9999963026998999999753 Q gi|254780421|r 131 LQAASILVDTSSYEEISKILQK 152 (218) Q Consensus 131 l~la~~~~~~~~~d~a~~~l~~ 152 (218) ++.+..++.-+++..|.+-|.. T Consensus 90 ~~~a~C~l~~g~~~~A~~aL~~ 111 (137) T TIGR02552 90 YHAAECYLALGEPESALKALDL 111 (137) T ss_pred HHHHHHHHHHCCHHHHHHHHHH T ss_conf 6998999872897899999999 No 90 >KOG1129 consensus Probab=95.23 E-value=0.2 Score=27.17 Aligned_cols=52 Identities=13% Similarity=-0.050 Sum_probs=30.1 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC Q ss_conf 876669999999731255206999999999999511005778999999873126 Q gi|254780421|r 67 NKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDL 120 (218) Q Consensus 67 ~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~ 120 (218) +....|+..+...++..|-... -.+-+|+++-.-++.++|.++|+.++.-++ T Consensus 270 dQP~~AL~~~~~gld~fP~~VT--~l~g~ARi~eam~~~~~a~~lYk~vlk~~~ 321 (478) T KOG1129 270 DQPERALLVIGEGLDSFPFDVT--YLLGQARIHEAMEQQEDALQLYKLVLKLHP 321 (478) T ss_pred CCHHHHHHHHHHHHHCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 3588999998646523883256--666369999999707889999999984087 No 91 >TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor; InterPro: IPR005687 The mitochondrial protein translocase (MPT) family, which brings nuclearly encoded preproteins into mitochondria, is very complex with 19 currently identified protein constituents. These proteins include several chaperone proteins, four proteins of the outer membrane translocase (Tom) import receptor, five proteins of the Tom channel complex, five proteins of the inner membrane translocase (Tim) and three "motor" proteins. This family is specific for the Tom70 proteins. ; GO: 0015450 P-P-bond-hydrolysis-driven protein transmembrane transporter activity, 0006886 intracellular protein transport, 0005741 mitochondrial outer membrane. Probab=95.08 E-value=0.23 Score=26.88 Aligned_cols=27 Identities=33% Similarity=0.382 Sum_probs=10.6 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 999999999998899899999999998 Q gi|254780421|r 163 FANEILGISALKFGKVQKAKTIFEELA 189 (218) Q Consensus 163 ~a~ellg~~~~~~Gd~~~A~~~y~~i~ 189 (218) .|.-.++...+.||+..+|.+.|++.+ T Consensus 577 ~A~~~lAQl~LQQ~~v~EAl~~FE~~A 603 (649) T TIGR00990 577 IAVATLAQLLLQQGDVDEALKLFERAA 603 (649) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 789999998762346899999998778 No 92 >KOG1125 consensus Probab=94.96 E-value=0.24 Score=26.69 Aligned_cols=91 Identities=15% Similarity=0.124 Sum_probs=52.1 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99999511005778999999873126895789999999999630269989999997531146423568999999999998 Q gi|254780421|r 95 IASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALK 174 (218) Q Consensus 95 lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~ 174 (218) ++-++.-.|+++.|+.-|+..+...+-+.. -==|++..+......++|+.....-..-. |-...++.-+|..+.. T Consensus 436 LGVLy~ls~efdraiDcf~~AL~v~Pnd~~----lWNRLGAtLAN~~~s~EAIsAY~rALqLq-P~yVR~RyNlgIS~mN 510 (579) T KOG1125 436 LGVLYNLSGEFDRAVDCFEAALQVKPNDYL----LWNRLGATLANGNRSEEAISAYNRALQLQ-PGYVRVRYNLGISCMN 510 (579) T ss_pred HHHHHHCCHHHHHHHHHHHHHHHCCCCHHH----HHHHHHHHHCCCCCCHHHHHHHHHHHHCC-CCEEEEEHHHHHHHHH T ss_conf 678974014788899999999704984177----89886688607755399999999998418-8726530101456620 Q ss_pred CCCHHHHHHHHHHHHH Q ss_conf 8998999999999983 Q gi|254780421|r 175 FGKVQKAKTIFEELAK 190 (218) Q Consensus 175 ~Gd~~~A~~~y~~i~~ 190 (218) .|.+.+|.++|-.++. T Consensus 511 lG~ykEA~~hlL~AL~ 526 (579) T KOG1125 511 LGAYKEAVKHLLEALS 526 (579) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 5669999999999998 No 93 >COG4700 Uncharacterized protein conserved in bacteria containing a divergent form of TPR repeats [Function unknown] Probab=93.98 E-value=0.41 Score=25.30 Aligned_cols=129 Identities=13% Similarity=0.101 Sum_probs=90.1 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHH--HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 9999999841387666999999973125520699--99999999995110057789999998731268957899999999 Q gi|254780421|r 56 NFAQALELFNSNKLDDARSSFEKILSQDNKLYNP--LSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQA 133 (218) Q Consensus 56 ~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~--lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~l 133 (218) .|--+....+.|+..+|...+++-.. +.|+. .-.+-+|...+.-+++..+...++.+.+.++. .-..=-.+.. T Consensus 92 r~rLa~al~elGr~~EA~~hy~qals---G~fA~d~a~lLglA~Aqfa~~~~A~a~~tLe~l~e~~pa--~r~pd~~Ll~ 166 (251) T COG4700 92 RYRLANALAELGRYHEAVPHYQQALS---GIFAHDAAMLLGLAQAQFAIQEFAAAQQTLEDLMEYNPA--FRSPDGHLLF 166 (251) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHC---CCCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCC--CCCCCCHHHH T ss_conf 99999999984105666889999854---656798899988898888616588888899998605976--5899725899 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 9963026998999999753114642356899999999999889989999999999835 Q gi|254780421|r 134 ASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 134 a~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) ++.+-.++.++++.+-+....+--+-+ -+.-.-+....++|...+|+..|..+-++ T Consensus 167 aR~laa~g~~a~Aesafe~a~~~ypg~--~ar~~Y~e~La~qgr~~ea~aq~~~v~d~ 222 (251) T COG4700 167 ARTLAAQGKYADAESAFEVAISYYPGP--QARIYYAEMLAKQGRLREANAQYVAVVDT 222 (251) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHCCCH--HHHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 999984588055899999999858877--78989999999712216788879999999 No 94 >TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems. Probab=93.65 E-value=0.2 Score=27.19 Aligned_cols=40 Identities=18% Similarity=0.395 Sum_probs=29.0 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 999999889989999999999835589989999999999950 Q gi|254780421|r 168 LGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 168 lg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) ++.+|...||.+.||+..++++... +...+.+|+.+|+.| T Consensus 5 LA~aYiemGD~egAr~lL~EVi~eg--~~~q~~eA~~LL~~l 44 (44) T TIGR03504 5 LARAYIEMGDLEGARELLEEVIEEG--DEAQRQEARALLAQL 44 (44) T ss_pred HHHHHHHCCCHHHHHHHHHHHHHHC--CHHHHHHHHHHHHHC T ss_conf 9999998298898999999999849--999999999998709 No 95 >pfam09986 DUF2225 Uncharacterized protein conserved in bacteria (DUF2225). This domain, found in various hypothetical bacterial proteins, has no known function. Probab=93.58 E-value=0.48 Score=24.86 Aligned_cols=83 Identities=11% Similarity=0.119 Sum_probs=54.4 Q ss_pred HHHHHHHHHHHHHHHCCCCHHH-------HHHHHHH-HC---CCCCHH-HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 7899999999996302699899-------9999753-11---464235-6899999999999889989999999999835 Q gi|254780421|r 124 AVRYAATLQAASILVDTSSYEE-------ISKILQK-LS---EPSNPM-HQFANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 124 ~~~~lA~l~la~~~~~~~~~d~-------a~~~l~~-~~---~~~~~~-~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) .....--+++|.++-+.++.++ |...+.. .. .+..+. ......++|.+..+.|+.++|...|.+++.+ T Consensus 115 s~~A~~~LriAWlyR~~~~~e~E~~fl~~Al~~y~~ay~~e~~~~~~~~e~~l~YLigeL~rRlG~~~eA~~wfs~vi~~ 194 (214) T pfam09986 115 SKKAMLCLRLAWLYRLLGDEEKELLFLKKALELYKEAYEKEDFPIEGIDEATLMYLIGELSRRLGNKEEALRWFSKVIGS 194 (214) T ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCC T ss_conf 89999999999986225885899999999999999999606688777118899999999999949999999999999639 Q ss_pred CCC--CHHHHHHHHHHH Q ss_conf 589--989999999999 Q gi|254780421|r 192 NNS--PFGISTRSQMIL 206 (218) Q Consensus 192 ~~~--p~~~r~ra~~~l 206 (218) +.. |+.+..+++..= T Consensus 195 ~~~s~~~~i~~~aRe~w 211 (214) T pfam09986 195 PKASKPSKLKDLAREQW 211 (214) T ss_pred CCCCCCHHHHHHHHHHH T ss_conf 66663188999989888 No 96 >KOG4648 consensus Probab=93.50 E-value=0.5 Score=24.77 Aligned_cols=100 Identities=12% Similarity=-0.012 Sum_probs=69.9 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCC---HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHH Q ss_conf 9999951100577899999987312689---5789999999999630269989999997531146423568999999999 Q gi|254780421|r 95 IASILVAKGDTKNAAEIFLKVANDDLAP---LAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGIS 171 (218) Q Consensus 95 lA~~~~~~g~~~~A~~~l~~i~~~~~~~---~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~ 171 (218) .+..++.+|++++|+.-|-..+.-...+ ..-|.+|-+++-.....+.+-..|+..=. .| .-|.-.+|.+ T Consensus 103 ~GN~yFKQgKy~EAIDCYs~~ia~~P~NpV~~~NRA~AYlk~K~FA~AE~DC~~AiaLd~-------~Y-~KAYSRR~~A 174 (536) T KOG4648 103 RGNTYFKQGKYEEAIDCYSTAIAVYPHNPVYHINRALAYLKQKSFAQAEEDCEAAIALDK-------LY-VKAYSRRMQA 174 (536) T ss_pred HHHHHHHCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH-------HH-HHHHHHHHHH T ss_conf 633676425413566665301245788864200588999988877754302889876328-------89-9998777789 Q ss_pred HHHCCCHHHHHHHHHHHHHCCCCCHHHHHHH Q ss_conf 9988998999999999983558998999999 Q gi|254780421|r 172 ALKFGKVQKAKTIFEELAKDNNSPFGISTRS 202 (218) Q Consensus 172 ~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra 202 (218) ....|+..+|++.|+.++...+.+..+|-.. T Consensus 175 R~~Lg~~~EAKkD~E~vL~LEP~~~ELkK~~ 205 (536) T KOG4648 175 RESLGNNMEAKKDCETVLALEPKNIELKKSL 205 (536) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 9987757877777889986085617899999 No 97 >pfam07219 HemY_N HemY protein N-terminus. This family represents the N-terminus (approximately 150 residues) of bacterial HemY porphyrin biosynthesis proteins. This is a membrane protein involved in a late step of protoheme IX synthesis. Probab=92.31 E-value=0.73 Score=23.73 Aligned_cols=58 Identities=17% Similarity=0.187 Sum_probs=35.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCH Q ss_conf 999999999999999999841387666999999973125520699999999999951100 Q gi|254780421|r 45 SHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGD 104 (218) Q Consensus 45 ~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~ 104 (218) ++.++..++.....+++.....|+.++|.+.+.+-.+..+++ .+..+..|+....+|| T Consensus 77 ~~~rr~~k~~~al~~Gl~a~~~Gd~~~A~k~~~ka~k~~~~p--~l~~LlaA~aA~~~GD 134 (134) T pfam07219 77 FRRRKRRRGRKALSEGLLALAEGDWALAERLARKAAELEDQP--PLALLLAARAAQAEGD 134 (134) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCC T ss_conf 999999999999999999997899999999999987637993--5799999999988579 No 98 >KOG4162 consensus Probab=91.92 E-value=0.81 Score=23.45 Aligned_cols=99 Identities=21% Similarity=0.201 Sum_probs=46.0 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHH-HCCCCHHHHHHHHHHHHHHHHCCCCHHHHHH--HHHHHCCCCCHHHHHHH Q ss_conf 99999999999511005778999999873-1268957899999999996302699899999--97531146423568999 Q gi|254780421|r 89 PLSNMYIASILVAKGDTKNAAEIFLKVAN-DDLAPLAVRYAATLQAASILVDTSSYEEISK--ILQKLSEPSNPMHQFAN 165 (218) Q Consensus 89 ~lA~l~lA~~~~~~g~~~~A~~~l~~i~~-~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~--~l~~~~~~~~~~~~~a~ 165 (218) +...+..+..+...|...+|.+.|...+. ++...+.+..+|. +++..|++--+.. .+.....- +|..+.++ T Consensus 684 ~~~~~~~G~~~~~~~~~~EA~~af~~Al~ldP~hv~s~~Ala~-----~lle~G~~~la~~~~~L~dalr~-dp~n~eaW 757 (799) T KOG4162 684 ASVYYLRGLLLEVKGQLEEAKEAFLVALALDPDHVPSMTALAE-----LLLELGSPRLAEKRSLLSDALRL-DPLNHEAW 757 (799) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHH-----HHHHHCCCCHHHHHHHHHHHHHH-CCCCHHHH T ss_conf 8999986489998774689999999987328887477899999-----99971884207789999999860-88777999 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 9999999988998999999999983558 Q gi|254780421|r 166 EILGISALKFGKVQKAKTIFEELAKDNN 193 (218) Q Consensus 166 ellg~~~~~~Gd~~~A~~~y~~i~~~~~ 193 (218) ..+|-+..++||.+.|.+-|+..+.-.+ T Consensus 758 ~~LG~v~k~~Gd~~~Aaecf~aa~qLe~ 785 (799) T KOG4162 758 YYLGEVFKKLGDSKQAAECFQAALQLEE 785 (799) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHCC T ss_conf 9999999974666789999999986444 No 99 >KOG1156 consensus Probab=91.76 E-value=0.85 Score=23.34 Aligned_cols=160 Identities=16% Similarity=0.097 Sum_probs=99.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHH-HHHHHHH--------------HHHHCCCCCHHHHHHHHHHHHHHHCCHHHHH Q ss_conf 999999999999999999984138766-6999999--------------9731255206999999999999511005778 Q gi|254780421|r 44 GSHEKKKNIVGENFAQALELFNSNKLD-DARSSFE--------------KILSQDNKLYNPLSNMYIASILVAKGDTKNA 108 (218) Q Consensus 44 ~~q~~~~~~as~~y~~al~~~~~~~~~-~A~~~l~--------------~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A 108 (218) ++..+...++---|..|+....++..- .-+..++ .+..-.++--+ ..+-.|..++..|+...| T Consensus 85 ~R~dK~Y~eaiKcy~nAl~~~~dN~qilrDlslLQ~QmRd~~~~~~tr~~LLql~~~~ra--~w~~~Avs~~L~g~y~~A 162 (700) T KOG1156 85 QRSDKKYDEAIKCYRNALKIEKDNLQILRDLSLLQIQMRDYEGYLETRNQLLQLRPSQRA--SWIGFAVAQHLLGEYKMA 162 (700) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH T ss_conf 741175899999999998339972999999999999998616188889999772245379--999999999998778877 Q ss_pred HHHHHHHHHHCC----CCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 999999873126----8957899999999996302699899999975311464235689999999999988998999999 Q gi|254780421|r 109 AEIFLKVANDDL----APLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTI 184 (218) Q Consensus 109 ~~~l~~i~~~~~----~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~ 184 (218) ..+++....... ....-+..+.+....+..+.+.++++.+.+...... -.......|..|+++.+.|+.++|... T Consensus 163 ~~il~ef~~t~~~~~s~~~~e~se~~Ly~n~i~~E~g~~q~ale~L~~~e~~-i~Dkla~~e~ka~l~~kl~~lEeA~~~ 241 (700) T KOG1156 163 LEILEEFEKTQNTSPSKEDYEHSELLLYQNQILIEAGSLQKALEHLLDNEKQ-IVDKLAFEETKADLLMKLGQLEEAVKV 241 (700) T ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 8999999986204998888999999899999999724688999999852067-777778864288899987418768999 Q ss_pred HHHHHHCCCCCHHHHHHHHHHH Q ss_conf 9999835589989999999999 Q gi|254780421|r 185 FEELAKDNNSPFGISTRSQMIL 206 (218) Q Consensus 185 y~~i~~~~~~p~~~r~ra~~~l 206 (218) |..++...+-.-......+..+ T Consensus 242 y~~Ll~rnPdn~~Yy~~l~~~l 263 (700) T KOG1156 242 YRRLLERNPDNLDYYEGLEKAL 263 (700) T ss_pred HHHHHHHCCHHHHHHHHHHHHH T ss_conf 9999862933678999999999 No 100 >PRK10920 putative uroporphyrinogen III C-methyltransferase; Provisional Probab=91.63 E-value=0.88 Score=23.25 Aligned_cols=28 Identities=11% Similarity=0.227 Sum_probs=15.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDGSHEKKK 50 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~~q~~~~ 50 (218) ++.+.+|+++++++||++||.++|..+- T Consensus 38 ~l~~lai~l~la~g~g~y~~~~qq~~~~ 65 (389) T PRK10920 38 ILSAVAIAIALAAGAGLYGWGKQQATNQ 65 (389) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999999 No 101 >pfam12072 DUF3552 Domain of unknown function (DUF3552). This presumed domain is functionally uncharacterized. This domain is found in bacteria, archaea and eukaryotes. This domain is about 200 amino acids in length. This domain is found associated with pfam00013, pfam01966. This domain has a single completely conserved residue A that may be functionally important. Probab=91.44 E-value=0.8 Score=23.50 Aligned_cols=29 Identities=21% Similarity=0.327 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDGSHEKKKN 51 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~~q~~~~~ 51 (218) +||++++.+++|+++||++.......+.. T Consensus 2 ~Ii~~i~~~~iG~~~G~~~~~~~~~~~~~ 30 (201) T pfam12072 2 IIILAIIALVVGFAIGYFVRKSIAEAKIS 30 (201) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 23999999999999999999999997899 No 102 >pfam10602 RPN7 26S proteasome subunit RPN7. RPN7 (known as the non ATPase regulatory subunit 6 in higher eukaryotes) is one of the lid subunits of the 26S proteasome and has been shown in Saccharomyces cerevisiae to be required for structural integrity. The 26S proteasome is is involved in the ATP-dependent degradation of ubiquitinated proteins. Probab=89.80 E-value=1.3 Score=22.22 Aligned_cols=104 Identities=13% Similarity=0.143 Sum_probs=72.8 Q ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHH---HHCCCCCHH--HHH Q ss_conf 999999999995110057789999998731268957899999999996302699899999975---311464235--689 Q gi|254780421|r 89 PLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQ---KLSEPSNPM--HQF 163 (218) Q Consensus 89 ~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~---~~~~~~~~~--~~~ 163 (218) ..++..+|..+..-||.+.|...|....+....+...-+ ..+.+.++.+..+++.-+.+.+. .+...++.| +.. T Consensus 36 R~~~~~l~~~y~~iGd~~~A~k~y~~~~~~~ts~~~~id-~~l~~irv~i~~~d~~~v~~~i~ka~~~~e~ggDw~r~nr 114 (174) T pfam10602 36 RMANLDLGEHYAKIGDLENALKAYSRAREYCTSLGHKID-MCLNLIRVGIFFNDWDHVSKYIEKAKSLIEKGGDWERRNR 114 (174) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 999999899999966799999999998831788459999-9999999999970099999999999856514455878988 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 999999999988998999999999983558 Q gi|254780421|r 164 ANEILGISALKFGKVQKAKTIFEELAKDNN 193 (218) Q Consensus 164 a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~ 193 (218) ..-..|...+..+++.+|-..|-...-.-. T Consensus 115 lk~y~Gl~~l~~r~f~~AA~~fl~~~~tf~ 144 (174) T pfam10602 115 LKVYEGLALLAIRDFKEAAKLFLDSLSTFT 144 (174) T ss_pred HHHHHHHHHHHHCHHHHHHHHHHCCCCCCC T ss_conf 999999999971649999999963767888 No 103 >PRK04778 septation ring formation regulator EzrA; Provisional Probab=89.19 E-value=1.4 Score=21.94 Aligned_cols=167 Identities=14% Similarity=0.097 Sum_probs=76.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC---------CCHHHHHHHHHHHHHCCC----CCHHH Q ss_conf 99999999999999999999999999999999999999998413---------876669999999731255----20699 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNS---------NKLDDARSSFEKILSQDN----KLYNP 89 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~---------~~~~~A~~~l~~l~~~~~----~~ya~ 89 (218) ++++++|++++++.+++++.+.+..++....-......+...-. +-..+....|+...+... ..... T Consensus 4 ~iii~iivlvii~~~~~~~~rr~~~k~I~~Le~rK~~l~~~pv~~el~kvk~L~l~Get~~~fe~w~~~w~ei~~~~l~~ 83 (569) T PRK04778 4 ILIIAIVVIIIIAYLVGYILRKRNYKRIDELEERKQELEDLPVNEELEKVKKLNLTGQSETKFEEWRQDWDEIVTNSFPD 83 (569) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 17899999999999999999999998999999999998769879999988808677065999999998999998842356 Q ss_pred -HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC-------------CCHHHHHHHHHHHCC Q ss_conf -999999999951100577899999987312689578999999999963026-------------998999999753114 Q gi|254780421|r 90 -LSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDT-------------SSYEEISKILQKLSE 155 (218) Q Consensus 90 -lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~-------------~~~d~a~~~l~~~~~ 155 (218) --.+.-|.-+.+..++.+|...+..+-. ..+..-..+..++.+.-.+-. ..|.+..+.+..... T Consensus 84 iee~l~~ae~~~~~~rf~kak~~l~~~~~--~l~~~e~~i~~i~~~l~~L~~~e~~nr~~v~~lk~~y~~lrk~ll~~~~ 161 (569) T PRK04778 84 IEELLFEAEDLNDKFRFFKAKHEINEIES--LLDEIEEDIEQILEELQELLESEEKNREEVEQLKDKYRELRKSLLANSF 161 (569) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 99999999999872528999999999999--9999999999999999999986899899999999999999999998522 Q ss_pred CCCHHHHHHHHHHHH---------HHHHCCCHHHHHHHHHHHHHC Q ss_conf 642356899999999---------999889989999999999835 Q gi|254780421|r 156 PSNPMHQFANEILGI---------SALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 156 ~~~~~~~~a~ellg~---------~~~~~Gd~~~A~~~y~~i~~~ 191 (218) .-+|-....-..+.. -.-..||+.+|++.+.++-.. T Consensus 162 ~yG~a~~~LE~~L~~le~~f~~f~~lt~~Gd~~~A~evl~~l~~~ 206 (569) T PRK04778 162 SFGPALDELEKQLENLEEEFSQFVELTESGDYVEAREVLDKLEEE 206 (569) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 103369999999999999899999998688989999999999999 No 104 >pfam00515 TPR_1 Tetratricopeptide repeat. Probab=89.18 E-value=0.7 Score=23.85 Aligned_cols=29 Identities=31% Similarity=0.307 Sum_probs=24.6 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 99999999998899899999999998355 Q gi|254780421|r 164 ANEILGISALKFGKVQKAKTIFEELAKDN 192 (218) Q Consensus 164 a~ellg~~~~~~Gd~~~A~~~y~~i~~~~ 192 (218) ++..+|.+|..+|++++|++.|++++.-. T Consensus 3 ~~~~lG~~y~~~~~~~~A~~~~~~Al~l~ 31 (34) T pfam00515 3 ALYNLGNAYFKLGKYDEAIEYYEKALELN 31 (34) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCC T ss_conf 99998999999178999999999988439 No 105 >pfam03704 BTAD Bacterial transcriptional activator domain. Found in the DNRI/REDD/AFSR family of regulators. This region of AFSR along with the C terminal region is capable of independently directing actinorhodin production. This family contains TPR repeats. Probab=88.74 E-value=1.5 Score=21.75 Aligned_cols=75 Identities=15% Similarity=0.111 Sum_probs=56.7 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH------CCCCCHHHHHHH Q ss_conf 99999996302699899999975311464235689999999999988998999999999983------558998999999 Q gi|254780421|r 129 ATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK------DNNSPFGISTRS 202 (218) Q Consensus 129 A~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~------~~~~p~~~r~ra 202 (218) +...++......+++++++..+.....- .|+.=.++..+..++...|+..+|...|+..-. .-++++..+..- T Consensus 64 a~~~l~~~~~~~g~~~~a~~~~~~~l~~-dPl~E~~~~~Lm~a~~~~G~~~~Al~~Y~~~r~~L~~ELGv~P~~~~~~l~ 142 (146) T pfam03704 64 ALERRIEADLRLGRHEEALAELRALVAL-HPLRERLHRQLMRALYRSGRQAEALRVYRRLRRRLAEELGVEPGPELRALH 142 (146) T ss_pred HHHHHHHHHHHCCCCCCHHHHHHHHHHH-CCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHH T ss_conf 9999999998859830219999999986-887429999999999988999999999999999999983899989999999 Q ss_pred HH Q ss_conf 99 Q gi|254780421|r 203 QM 204 (218) Q Consensus 203 ~~ 204 (218) +. T Consensus 143 ~~ 144 (146) T pfam03704 143 QE 144 (146) T ss_pred HH T ss_conf 97 No 106 >COG3629 DnrI DNA-binding transcriptional activator of the SARP family [Signal transduction mechanisms] Probab=88.08 E-value=1.7 Score=21.48 Aligned_cols=99 Identities=19% Similarity=0.122 Sum_probs=70.2 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999999951100577899999987312689578999999999963026998999999753114642356899999999 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGI 170 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~ 170 (218) ..+..+.-++..|....--....|+......-......+...++-.+...+.++.+...++.+..- .||.=.+++.+-. T Consensus 117 ~~~~~~~~~~~~g~~~~d~~f~~WV~~~R~~l~e~~~~~l~~lae~~~~~~~~~~~~~~l~~Li~~-dp~~E~~~~~lm~ 195 (280) T COG3629 117 LRFEQAGELLSEGPVLGDDRFDEWVLEQRRALEELFIKALTKLAEALIACGRADAVIEHLERLIEL-DPYDEPAYLRLME 195 (280) T ss_pred HHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC-CCCCHHHHHHHHH T ss_conf 799987887643876788149999999999999999999999999987312188999999999862-9646499999999 Q ss_pred HHHHCCCHHHHHHHHHHHHH Q ss_conf 99988998999999999983 Q gi|254780421|r 171 SALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 171 ~~~~~Gd~~~A~~~y~~i~~ 190 (218) +|+..|+...|+..|..+.. T Consensus 196 ~y~~~g~~~~ai~~y~~l~~ 215 (280) T COG3629 196 AYLVNGRQSAAIRAYRQLKK 215 (280) T ss_pred HHHHCCCCHHHHHHHHHHHH T ss_conf 99975880689999999999 No 107 >PRK11189 lipoprotein NlpI; Provisional Probab=87.91 E-value=1.7 Score=21.42 Aligned_cols=132 Identities=17% Similarity=0.126 Sum_probs=70.9 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHH----------- Q ss_conf 984138766699999997312552069999999999995110057789999998731268957899999----------- Q gi|254780421|r 62 ELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAAT----------- 130 (218) Q Consensus 62 ~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~----------- 130 (218) -..+.+++|.|...|+..++=.|+- ..|.+..+-.+.-.|..+.|..-+...-...+.| ++|.+=. T Consensus 108 y~tq~~~fD~AyEAFDs~LEL~P~y--~YA~lNRGIAlYYg~R~~LA~~D~~~fy~~dp~D-pYR~LWLYl~e~~~d~~~ 184 (297) T PRK11189 108 YLTQEGEFDAAYEAFDSVLELDPTY--DYAYLNRGIALYYGGRYDLAQDDLLAFYQDDPND-PYRALWLYLIEQKLDPKQ 184 (297) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCC-CHHHHHHHHHHCCCCHHH T ss_conf 9999711999999987777269873--4899830278874463788999999999609998-279999999981169999 Q ss_pred ------------------HHHHHHHHCCCCHHHHHHHHHHHCCCCCHHH---HHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf ------------------9999963026998999999753114642356---8999999999998899899999999998 Q gi|254780421|r 131 ------------------LQAASILVDTSSYEEISKILQKLSEPSNPMH---QFANEILGISALKFGKVQKAKTIFEELA 189 (218) Q Consensus 131 ------------------l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~---~~a~ellg~~~~~~Gd~~~A~~~y~~i~ 189 (218) ..++..+++....++....+..-...+..+. --+..-+|-.+..+|+.++|...|+-.+ T Consensus 185 A~~~L~~r~~~~~~~~W~~~iV~~ylg~ise~~~~~~~~~~~~~n~~lAErLcEaYFYLaK~~~~~G~~~~A~~~fKLAl 264 (297) T PRK11189 185 AKENLKQRYEKLDKEQWGWNIVDFYLGKISEEQLFERLKAGATDNTELAERLCETYFYLAKYYLSLGDLDEAANLFKLAL 264 (297) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 99999999983571226899999994589999999999872737099999999999999999998799899999999987 Q ss_pred HCCCCCHH Q ss_conf 35589989 Q gi|254780421|r 190 KDNNSPFG 197 (218) Q Consensus 190 ~~~~~p~~ 197 (218) . .++=.- T Consensus 265 a-~NVyeF 271 (297) T PRK11189 265 A-NNVYNF 271 (297) T ss_pred H-CCCHHH T ss_conf 0-482588 No 108 >KOG1070 consensus Probab=87.88 E-value=1.7 Score=21.41 Aligned_cols=148 Identities=14% Similarity=0.131 Sum_probs=98.0 Q ss_pred HHHHHHHHHH-HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 9999999984-138766699999997312552069999999999995110057789999998731268957899999999 Q gi|254780421|r 55 ENFAQALELF-NSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQA 133 (218) Q Consensus 55 ~~y~~al~~~-~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~l 133 (218) ..|......+ ..++.++|...|+.+.+.....-+ .....+..++.+++.++|-..++..+.-.+......-+ -.- T Consensus 1531 ~V~~~L~~iy~k~ek~~~A~ell~~m~KKF~q~~~--vW~~y~~fLl~~ne~~aa~~lL~rAL~~lPk~eHv~~I--skf 1606 (1710) T KOG1070 1531 TVHLKLLGIYEKSEKNDEADELLRLMLKKFGQTRK--VWIMYADFLLRQNEAEAARELLKRALKSLPKQEHVEFI--SKF 1606 (1710) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHH--HHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHH--HHH T ss_conf 89999887998762310489999999998653256--89999999851447889999999998646502368899--999 Q ss_pred HHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99630269989999997531146423568999999999998899899999999998355899899999999999 Q gi|254780421|r 134 ASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILA 207 (218) Q Consensus 134 a~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~ 207 (218) |.+.+..++++.....+..+.... |-+-.-+....+.-.++|+.+-+|..|++++...-+|-..+-.-.+-|+ T Consensus 1607 AqLEFk~GDaeRGRtlfEgll~ay-PKRtDlW~VYid~eik~~~~~~vR~lfeRvi~l~l~~kkmKfffKkwLe 1679 (1710) T KOG1070 1607 AQLEFKYGDAERGRTLFEGLLSAY-PKRTDLWSVYIDMEIKHGDIKYVRDLFERVIELKLSIKKMKFFFKKWLE 1679 (1710) T ss_pred HHHHHHCCCCHHHHHHHHHHHHHC-CCCHHHHHHHHHHHHCCCCHHHHHHHHHHHHHCCCCHHHHHHHHHHHHH T ss_conf 998765188435678988777638-5430279999998870477799999999998647786676999999999 No 109 >KOG4555 consensus Probab=87.82 E-value=1.8 Score=21.38 Aligned_cols=100 Identities=23% Similarity=0.084 Sum_probs=66.2 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCC-CCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHC---CCCCHHHHHHHH Q ss_conf 999999999511005778999999873126-8957899999999996302699899999975311---464235689999 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDL-APLAVRYAATLQAASILVDTSSYEEISKILQKLS---EPSNPMHQFANE 166 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~-~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~---~~~~~~~~~a~e 166 (218) ..=..+-.+.+.|+.+.|++.|.+.+.-.+ -++.+..-| ..+--+++.++++..|+.-. ++...-.-.+.- T Consensus 45 ~LEl~~valaE~g~Ld~AlE~F~qal~l~P~raSayNNRA-----Qa~RLq~~~e~ALdDLn~AleLag~~trtacqa~v 119 (175) T KOG4555 45 ELELKAIALAEAGDLDGALELFGQALCLAPERASAYNNRA-----QALRLQGDDEEALDDLNKALELAGDQTRTACQAFV 119 (175) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHCCHH-----HHHHHCCCHHHHHHHHHHHHHHCCCCCHHHHHHHH T ss_conf 9999988988504567899999999985433538661399-----99987288187899899999862862069999999 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCCH Q ss_conf 999999988998999999999983558998 Q gi|254780421|r 167 ILGISALKFGKVQKAKTIFEELAKDNNSPF 196 (218) Q Consensus 167 llg~~~~~~Gd~~~A~~~y~~i~~~~~~p~ 196 (218) .+|.+|..+|+.+.||..|+.+.. .-+|. T Consensus 120 QRg~lyRl~g~dd~AR~DFe~AA~-LGS~F 148 (175) T KOG4555 120 QRGLLYRLLGNDDAARADFEAAAQ-LGSKF 148 (175) T ss_pred HHHHHHHHHCCHHHHHHHHHHHHH-HCCHH T ss_conf 999999993761888886999998-67878 No 110 >KOG1155 consensus Probab=87.66 E-value=1.8 Score=21.33 Aligned_cols=32 Identities=13% Similarity=0.078 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 99999999999889989999999999835589 Q gi|254780421|r 163 FANEILGISALKFGKVQKAKTIFEELAKDNNS 194 (218) Q Consensus 163 ~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~ 194 (218) .-+-.+|..|.+.++.++|++-|..++.-.++ T Consensus 433 Rlw~aLG~CY~kl~~~~eAiKCykrai~~~dt 464 (559) T KOG1155 433 RLWVALGECYEKLNRLEEAIKCYKRAILLGDT 464 (559) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHCCCC T ss_conf 89999999999855689999999888852464 No 111 >KOG0687 consensus Probab=86.96 E-value=2 Score=21.08 Aligned_cols=105 Identities=13% Similarity=0.069 Sum_probs=77.8 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCH--HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHH--H Q ss_conf 06999999999999511005778999999873126895--7899999999996302699899999975311464235--6 Q gi|254780421|r 86 LYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPL--AVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPM--H 161 (218) Q Consensus 86 ~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~--~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~--~ 161 (218) .+..-|.+..|..+..-||.+.|.+.+....++.-+-. .---...+|++..+++..=..+.+....++...++.| + T Consensus 101 ~ev~ea~~~kaeYycqigDkena~~~~~~t~~ktvs~g~kiDVvf~~iRlglfy~D~~lV~~~iekak~liE~GgDWeRr 180 (393) T KOG0687 101 SEVREAMLRKAEYYCQIGDKENALEALRKTYEKTVSLGHKIDVVFYKIRLGLFYLDHDLVTESIEKAKSLIEEGGDWERR 180 (393) T ss_pred HHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHHH T ss_conf 78999999999999986038779999999988776405303568888877886210889998899989999827886666 Q ss_pred HHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 89999999999988998999999999983 Q gi|254780421|r 162 QFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 162 ~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) ....-..|......-|+.+|-..|-..+. T Consensus 181 NRlKvY~Gly~msvR~Fk~Aa~Lfld~vs 209 (393) T KOG0687 181 NRLKVYQGLYCMSVRNFKEAADLFLDSVS 209 (393) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 56889888999999737889999998704 No 112 >PRK01844 hypothetical protein; Provisional Probab=86.79 E-value=1.5 Score=21.75 Aligned_cols=32 Identities=9% Similarity=0.092 Sum_probs=23.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 75199999999999999999999999999999 Q gi|254780421|r 20 CYKWIAPLLILMILSLAIWFYLFDGSHEKKKN 51 (218) Q Consensus 20 ~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~ 51 (218) .+=||+++++.+++|+++||+.-..+-++... T Consensus 3 ~~~~Ili~i~~li~G~~~GffiaRk~m~~ylk 34 (72) T PRK01844 3 IWLGILVGVVALVAGVALGFFIARKYMMNYLQ 34 (72) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 59999999999999999999999999999886 No 113 >smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism. Probab=86.79 E-value=1.1 Score=22.67 Aligned_cols=30 Identities=27% Similarity=0.262 Sum_probs=24.4 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 999999999988998999999999983558 Q gi|254780421|r 164 ANEILGISALKFGKVQKAKTIFEELAKDNN 193 (218) Q Consensus 164 a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~ 193 (218) ++..+|.+|...|++++|.+.|++++...+ T Consensus 3 ~~~~lg~~y~~~~~~~~Ai~~~~~al~~~p 32 (34) T smart00028 3 ALYNLGNAYLKLGDYDEALEYYEKALELDP 32 (34) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHCCCC T ss_conf 899989999982369999999999773599 No 114 >pfam06295 DUF1043 Protein of unknown function (DUF1043). This family consists of several hypothetical bacterial proteins of unknown function. Probab=86.70 E-value=2 Score=20.99 Aligned_cols=25 Identities=16% Similarity=0.348 Sum_probs=11.9 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 8766699999997312552069999 Q gi|254780421|r 67 NKLDDARSSFEKILSQDNKLYNPLS 91 (218) Q Consensus 67 ~~~~~A~~~l~~l~~~~~~~ya~lA 91 (218) +.+......+.++.+++..-|..+| T Consensus 50 ~HF~~tA~Ll~~l~~~Y~~ly~HlA 74 (128) T pfam06295 50 DHFAQSAELLETLGKDYHKLYQHMA 74 (128) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999 No 115 >pfam07719 TPR_2 Tetratricopeptide repeat. This Pfam entry includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by pfam00515. Probab=85.86 E-value=0.9 Score=23.18 Aligned_cols=29 Identities=24% Similarity=0.307 Sum_probs=23.9 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 99999999998899899999999998355 Q gi|254780421|r 164 ANEILGISALKFGKVQKAKTIFEELAKDN 192 (218) Q Consensus 164 a~ellg~~~~~~Gd~~~A~~~y~~i~~~~ 192 (218) ++..+|.++...|++++|++.|++++... T Consensus 3 ~~~~~g~~~~~~g~~~~A~~~~~~al~l~ 31 (34) T pfam07719 3 ALYNLGLAYYKLGDYEEALEAYEKALELD 31 (34) T ss_pred HHHHCCHHHHHCCCHHHHHHHHHHHHCCC T ss_conf 68750148762166999999999977269 No 116 >KOG1497 consensus Probab=85.73 E-value=2.3 Score=20.69 Aligned_cols=115 Identities=13% Similarity=0.087 Sum_probs=74.9 Q ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHC---CCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH----HCCCC Q ss_conf 20699999999999951100577899999987312---689578999999999963026998999999753----11464 Q gi|254780421|r 85 KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDD---LAPLAVRYAATLQAASILVDTSSYEEISKILQK----LSEPS 157 (218) Q Consensus 85 ~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~---~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~----~~~~~ 157 (218) .-+..--++.+|.+|-..+++.+|...+-.+-.+. ..+...+..-.+|+++++++-++..++....+. ..... T Consensus 99 eEqv~~irl~LAsiYE~Eq~~~~aaq~L~~I~~~tg~~~~d~~~kl~l~iriarlyLe~~d~veae~~inRaSil~a~~~ 178 (399) T KOG1497 99 EEQVASIRLHLASIYEKEQNWRDAAQVLVGIPLDTGQKAYDVEQKLLLCIRIARLYLEDDDKVEAEAYINRASILQAESS 178 (399) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCC T ss_conf 99999999999999987646999999885427665434412578899999999998715847789999888778664036 Q ss_pred CH-HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 23-5689999999999988998999999999983558998999 Q gi|254780421|r 158 NP-MHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGIS 199 (218) Q Consensus 158 ~~-~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r 199 (218) ++ ......-.-+.+.=..+++-+|-.-|-++....-.|.+.+ T Consensus 179 Ne~Lqie~kvc~ARvlD~krkFlEAAqrYyels~~ki~~e~~~ 221 (399) T KOG1497 179 NEQLQIEYKVCYARVLDYKRKFLEAAQRYYELSQRKIVDESER 221 (399) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHH T ss_conf 8999999999999998888899999999999889973365889 No 117 >KOG2300 consensus Probab=85.56 E-value=2.3 Score=20.63 Aligned_cols=138 Identities=13% Similarity=0.063 Sum_probs=97.3 Q ss_pred HCCCHHHHHHHHHHHHH---CCCCC-----HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 13876669999999731---25520-----69999999999995110057789999998731268957899999999996 Q gi|254780421|r 65 NSNKLDDARSSFEKILS---QDNKL-----YNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASI 136 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~---~~~~~-----ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~ 136 (218) -.+++.+|+.......+ ..|+. .++.-++.++-.-..-|-++.|...+......... ..+.....+.+|.. T Consensus 335 v~~~~~~al~~i~dm~~w~~r~p~~~Llr~~~~~ih~LlGlys~sv~~~enAe~hf~~a~k~t~~-~dl~a~~nlnlAi~ 413 (629) T KOG2300 335 VRGDYVEALEEIVDMKNWCTRFPTPLLLRAHEAQIHMLLGLYSHSVNCYENAEFHFIEATKLTES-IDLQAFCNLNLAIS 413 (629) T ss_pred HHCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHH T ss_conf 70788999999999999987189457777737899999716763102477899999999986207-78999999869999 Q ss_pred HHCCCCHHHHHHHHHHHCCCCC-H-----HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHH Q ss_conf 3026998999999753114642-3-----56899999999999889989999999999835589989999999 Q gi|254780421|r 137 LVDTSSYEEISKILQKLSEPSN-P-----MHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQ 203 (218) Q Consensus 137 ~~~~~~~d~a~~~l~~~~~~~~-~-----~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~ 203 (218) |+..++-+..-+.++.+..++. + ..+....+.|.....+|++.+|+-...+.++..+.-+..|--+. T Consensus 414 YL~~~~~ed~y~~ld~i~p~nt~s~ssq~l~a~~~~v~glfaf~qn~lnEaK~~l~e~Lkmanaed~~rL~a~ 486 (629) T KOG2300 414 YLRIGDAEDLYKALDLIGPLNTNSLSSQRLEASILYVYGLFAFKQNDLNEAKRFLRETLKMANAEDLNRLTAC 486 (629) T ss_pred HHHHCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 9971208789999986289877762499998779999999999855289999999998754040257788878 No 118 >pfam00957 Synaptobrevin Synaptobrevin. Probab=85.21 E-value=2.4 Score=20.53 Aligned_cols=31 Identities=23% Similarity=0.068 Sum_probs=20.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4588999999997519999999999999999 Q gi|254780421|r 8 NSYKGKCKMGKCCYKWIAPLLILMILSLAIW 38 (218) Q Consensus 8 ~~~~~~~~~~~~~yk~ii~~~i~~i~~~~~~ 38 (218) ++.+=|-+||-.+||+.++.++++++++... T Consensus 53 ~a~~l~r~~~w~~~k~~~ii~~iv~~~i~ii 83 (89) T pfam00957 53 QARKLKRKMWWKNMKLKIILGLIVLILILII 83 (89) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999998989999999999999999 No 119 >PRK11677 cytochrome d ubiquinol oxidase subunit III; Provisional Probab=85.04 E-value=2.5 Score=20.48 Aligned_cols=25 Identities=8% Similarity=0.324 Sum_probs=11.8 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 8766699999997312552069999 Q gi|254780421|r 67 NKLDDARSSFEKILSQDNKLYNPLS 91 (218) Q Consensus 67 ~~~~~A~~~l~~l~~~~~~~ya~lA 91 (218) +.+......+.++.+++..-|..+| T Consensus 54 ~HF~~tA~Ll~~l~~~Y~~lyqHlA 78 (134) T PRK11677 54 DHFARSAELLDTMAKDYRQLYQHMA 78 (134) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999999999999999999 No 120 >PRK03598 hypothetical protein; Provisional Probab=85.04 E-value=0.38 Score=25.49 Aligned_cols=13 Identities=8% Similarity=0.250 Sum_probs=5.6 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9989999999999 Q gi|254780421|r 176 GKVQKAKTIFEEL 188 (218) Q Consensus 176 Gd~~~A~~~y~~i 188 (218) ...+.|+..+..+ T Consensus 159 a~l~~a~~~l~~~ 171 (331) T PRK03598 159 ATLKSAQDKLSQY 171 (331) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999998 No 121 >KOG0548 consensus Probab=84.74 E-value=2.5 Score=20.40 Aligned_cols=137 Identities=16% Similarity=0.079 Sum_probs=84.0 Q ss_pred HHHHHHHHHHHHHHHCCCH-------HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCH Q ss_conf 9999999999998413876-------669999999731255206999999999999511005778999999873126895 Q gi|254780421|r 51 NIVGENFAQALELFNSNKL-------DDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPL 123 (218) Q Consensus 51 ~~as~~y~~al~~~~~~~~-------~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~ 123 (218) ..+-..|..++....+.+. +++....+...--++. . .......+.-++..||+..|+..|.+.|...+.+. T Consensus 315 ~~ai~~~~kaLte~Rt~~~ls~lk~~Ek~~k~~e~~a~~~pe-~-A~e~r~kGne~Fk~gdy~~Av~~YteAIkr~P~Da 392 (539) T KOG0548 315 EGAIKYYQKALTEHRTPDLLSKLKEAEKALKEAERKAYINPE-K-AEEEREKGNEAFKKGDYPEAVKHYTEAIKRDPEDA 392 (539) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHCHH-H-HHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHH T ss_conf 889999999866325888787778999999998888731846-7-89999877887753588999999999984298415 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 78999999999963026998999999753114642356899999999999889989999999999835589 Q gi|254780421|r 124 AVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNS 194 (218) Q Consensus 124 ~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~ 194 (218) .+ . -..+..++..+.+..+++-...-...+++| .-+.-..|.++....++++|.+.|++.+...+. T Consensus 393 ~l--Y--sNRAac~~kL~~~~~aL~Da~~~ieL~p~~-~kgy~RKg~al~~mk~ydkAleay~eale~dp~ 458 (539) T KOG0548 393 RL--Y--SNRAACYLKLGEYPEALKDAKKCIELDPNF-IKAYLRKGAALRAMKEYDKALEAYQEALELDPS 458 (539) T ss_pred HH--H--HHHHHHHHHHHHHHHHHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCH T ss_conf 78--8--889999998731798998899987509318-999999999999999899999999998734915 No 122 >KOG2053 consensus Probab=84.63 E-value=2.6 Score=20.37 Aligned_cols=150 Identities=18% Similarity=0.144 Sum_probs=68.4 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH Q ss_conf 99999984138766699999997312552069999999999995110057789999998731268957899999999996 Q gi|254780421|r 57 FAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASI 136 (218) Q Consensus 57 y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~ 136 (218) -..++.....|+.++|...++.+...+.+.-.++. .+-..+.+.|+.+++...|+.+....+. ...+..+..+++ T Consensus 47 vLkaLsl~r~gk~~ea~~~Le~~~~~~~~D~~tLq--~l~~~y~d~~~~d~~~~~Ye~~~~~~P~---eell~~lFmayv 121 (932) T KOG2053 47 VLKALSLFRLGKGDEALKLLEALYGLKGTDDLTLQ--FLQNVYRDLGKLDEAVHLYERANQKYPS---EELLYHLFMAYV 121 (932) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHCCCCCCCHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCC---HHHHHHHHHHHH T ss_conf 99999999815864678887513047998568999--9999999874256899999998710884---889999999999 Q ss_pred HHC---------------------------------CCCHHH---------HHHHHHHHCCCCCHHHHHH-HHHHHHHHH Q ss_conf 302---------------------------------699899---------9999753114642356899-999999999 Q gi|254780421|r 137 LVD---------------------------------TSSYEE---------ISKILQKLSEPSNPMHQFA-NEILGISAL 173 (218) Q Consensus 137 ~~~---------------------------------~~~~d~---------a~~~l~~~~~~~~~~~~~a-~ellg~~~~ 173 (218) -.. ..+++. |....+.+..+++++...+ ..+.-+++. T Consensus 122 R~~~yk~qQkaa~~LyK~~pk~~yyfWsV~Slilqs~~~~~~~~~~i~l~LA~~m~~~~l~~~gk~~s~aE~~Lyl~iL~ 201 (932) T KOG2053 122 REKSYKKQQKAALQLYKNFPKRAYYFWSVISLILQSIFSENELLDPILLALAEKMVQKLLEKKGKIESEAEIILYLLILE 201 (932) T ss_pred HHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHH T ss_conf 98889999999999998587664058999999998225886321103278899999998614775103899999999998 Q ss_pred HCCCHHHHHHHHH-H-HHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 8899899999999-9-983558998999999999995003 Q gi|254780421|r 174 KFGKVQKAKTIFE-E-LAKDNNSPFGISTRSQMILANIIA 211 (218) Q Consensus 174 ~~Gd~~~A~~~y~-~-i~~~~~~p~~~r~ra~~~l~~l~~ 211 (218) .+|++++|-+.+. . +......|..+..+...++..+.- T Consensus 202 ~~~k~~eal~~l~~~la~~l~~~~~~l~~~~~dllk~l~~ 241 (932) T KOG2053 202 LQGKYQEALEFLAITLAEKLTSANLYLENKKLDLLKLLNR 241 (932) T ss_pred HCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 6246788999997888874265446899988899998467 No 123 >pfam06160 EzrA Septation ring formation regulator, EzrA. During the bacterial cell cycle, the tubulin-like cell-division protein FtsZ polymerizes into a ring structure that establishes the location of the nascent division site. EzrA modulates the frequency and position of FtsZ ring formation. Probab=84.52 E-value=2.6 Score=20.34 Aligned_cols=165 Identities=13% Similarity=0.125 Sum_probs=72.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC---------CCHHHHHHHHHHHHHCCC----CCHHH- Q ss_conf 9999999999999999999999999999999999999998413---------876669999999731255----20699- Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNS---------NKLDDARSSFEKILSQDN----KLYNP- 89 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~---------~~~~~A~~~l~~l~~~~~----~~ya~- 89 (218) ||+++|+++++.++++ +++.+..++....-+.....+...-. +=..+....|+...+... ...+. T Consensus 2 ii~iivivii~~~~~~-~~rk~~~k~I~~Le~rk~~l~~~pv~~el~kvk~L~l~Ges~~~f~~w~~~w~ei~~~~l~~i 80 (559) T pfam06160 2 IIGIIVIVIVAYLAGY-FLRKKIYKQIDELEERKEELMNLPVNEELSKVKKLNLTGESEETFEEWRQKWDDIVTNSLPDI 80 (559) T ss_pred EEEHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4408999999999999-998889999999999999987498899999988298970569999999999999988622569 Q ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC-------------CCHHHHHHHHHHHCCC Q ss_conf 999999999951100577899999987312689578999999999963026-------------9989999997531146 Q gi|254780421|r 90 LSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDT-------------SSYEEISKILQKLSEP 156 (218) Q Consensus 90 lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~-------------~~~d~a~~~l~~~~~~ 156 (218) --.+.-|.-+.+..++.+|...+..+-+. .+..-..+..++.+.-.+-. ..|.+..+.+...... T Consensus 81 e~~l~~ae~~~~~~rf~ka~~~i~~~~~~--i~~~e~~i~~i~~~l~~L~~~e~~nr~~v~~lk~~y~~lrk~ll~~~~~ 158 (559) T pfam06160 81 EELLFEAEELNDKFRFLKAKKAIKEIEEL--LDTIEEDIEQILEELNELKESEEKNRKEVEELKDKYRELRKTLLAKSFS 158 (559) T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99999999998606289999999999999--9999999999999999999878988999999999999999999985222 Q ss_pred CCHHHHHHHHHHH---------HHHHHCCCHHHHHHHHHHHHHC Q ss_conf 4235689999999---------9999889989999999999835 Q gi|254780421|r 157 SNPMHQFANEILG---------ISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 157 ~~~~~~~a~ellg---------~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) -+|-....-..+. .-.-..||+.+|++.+.++-.. T Consensus 159 yG~a~~~LE~~L~~ie~~F~~f~~lt~~Gd~~~A~e~l~~l~~~ 202 (559) T pfam06160 159 YGPALDELEKQLDELEEEFEQFVELTESGDYLEAREVLLKLEEE 202 (559) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 01368999999999999899999998689989999999999999 No 124 >cd05804 StaR_like StaR_like; a well-conserved protein found in bacteria, plants, and animals. A family member from Streptomyces toyocaensis, StaR is part of a gene cluster involved in the biosynthesis of glycopeptide antibiotics (GPAs), specifically A47934. It has been speculated that StaR could be a flavoprotein hydroxylating a tyrosine sidechain. Some family members have been annotated as proteins containing tetratricopeptide (TPR) repeats, which may at least indicate mostly alpha-helical secondary structure. Probab=83.46 E-value=2.9 Score=20.06 Aligned_cols=135 Identities=16% Similarity=0.130 Sum_probs=92.4 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHH----HH-------------------------------HHHHH Q ss_conf 99999999984138766699999997312552069999----99-------------------------------99999 Q gi|254780421|r 54 GENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLS----NM-------------------------------YIASI 98 (218) Q Consensus 54 s~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA----~l-------------------------------~lA~~ 98 (218) --.+.+++...-.++...|...++.++..+|.-.-.+. .| ..|-. T Consensus 44 E~~hv~a~~~~~~g~~~~A~~~~~~~l~~~P~D~lalk~~~~~f~lg~~~g~~~~~~~~l~~~~p~~~~~~~~~gm~AF~ 123 (355) T cd05804 44 ERAHVEALSAWIAGDLPKALALLEQLLDDYPRDLLALKLHLGAFGLGDFSGMRDHVARVLPLWAPENPDYWYLLGMLAFG 123 (355) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHCCCCCCCCHHHHHHHHHH T ss_conf 99999999999879978999999999986983599999876741025413240569999872573489836999999999 Q ss_pred HHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCC---CCCHHHHHHHHHHHHHHHHC Q ss_conf 951100577899999987312689578999999999963026998999999753114---64235689999999999988 Q gi|254780421|r 99 LVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSE---PSNPMHQFANEILGISALKF 175 (218) Q Consensus 99 ~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~---~~~~~~~~a~ellg~~~~~~ 175 (218) +.+.|.+++|.+.=+..++-++.+. . |.=-.+..+..++..++.+..++.... ..+++...-.=-.++.++.. T Consensus 124 leE~g~~~~Ae~~a~~al~~np~d~-w---a~HAvAHv~~~~g~~~eG~~~l~~~~~~w~~~~~l~~H~wWH~AL~~Le~ 199 (355) T cd05804 124 LEEAGQYDRAEEAARRALELNPDDA-W---AVHAVAHVLEMQGRFKEGIAFMESWRDTWDCSSMLRGHNWWHLALFYLER 199 (355) T ss_pred HHHCCCHHHHHHHHHHHHHCCCCCC-H---HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHHC T ss_conf 9985889999999999996399993-7---99999999998388788999998517525778858989999999998877 Q ss_pred CCHHHHHHHHHHHHHCC Q ss_conf 99899999999998355 Q gi|254780421|r 176 GKVQKAKTIFEELAKDN 192 (218) Q Consensus 176 Gd~~~A~~~y~~i~~~~ 192 (218) |+++++...|..-+-.. T Consensus 200 g~~~~~l~iyD~~i~~~ 216 (355) T cd05804 200 GDYEAALAIYDTHIAPS 216 (355) T ss_pred CCHHHHHHHHHHHHCCC T ss_conf 89889999988874456 No 125 >pfam06864 PAP_PilO Pilin accessory protein (PilO). This family consists of several enterobacterial PilO proteins. The function of PilO is unknown although it has been suggested that it is a cytoplasmic protein in the absence of other Pil proteins, but PilO protein is translocated to the outer membrane in the presence of other Pil proteins. Alternatively, PilO protein may form a complex with other Pil protein(s). PilO has been predicted to function as a component of the pilin transport apparatus and thin-pilus basal body. This family does not seem to be related to pfam04350. Probab=82.70 E-value=3.1 Score=19.88 Aligned_cols=31 Identities=10% Similarity=0.114 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH Q ss_conf 519999999999-9999999999999999999 Q gi|254780421|r 21 YKWIAPLLILMI-LSLAIWFYLFDGSHEKKKN 51 (218) Q Consensus 21 yk~ii~~~i~~i-~~~~~~~~~~~~~q~~~~~ 51 (218) .++++.++++++ +++.++|.+|+.+++++.. T Consensus 164 ~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~ 195 (414) T pfam06864 164 PQFILAGLLSLLAGTAWAVWQYWQRQQEEAAA 195 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 06999999999999999999999976789999 No 126 >pfam10345 Cohesin_load Cohesin loading factor. Cohesin_load is a common cohesin loading factor protein that is conserved in fungi. It is associated with the cohesin complex and is required in G1 for cohesin binding to chromosomes but dispensable in G2 when cohesion has been established. It is referred to as both Ssl3, in pombe, and Scc4, in S.cerevisiae. It complexes with Mis4. Probab=81.69 E-value=3.4 Score=19.65 Aligned_cols=25 Identities=12% Similarity=0.149 Sum_probs=9.1 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 9999999999988998999999999 Q gi|254780421|r 163 FANEILGISALKFGKVQKAKTIFEE 187 (218) Q Consensus 163 ~a~ellg~~~~~~Gd~~~A~~~y~~ 187 (218) .+.-+.+.+.+..|....+.+..+. T Consensus 181 ~~~l~~~~l~l~~~~~~d~l~~l~~ 205 (593) T pfam10345 181 VLSLLEALLHLYRGSPKDVLELLQE 205 (593) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHH T ss_conf 9999999897503982569999999 No 127 >pfam10828 DUF2570 Protein of unknown function (DUF2570). This is a family of proteins with unknown function. Probab=81.25 E-value=3.5 Score=19.55 Aligned_cols=28 Identities=21% Similarity=0.326 Sum_probs=18.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 51999999999999999999999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAIWFYLFDGSHEKK 49 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~~~~~~~~~q~~~ 49 (218) .|||.+++.++|+|+.| |.|+++.+... T Consensus 2 ~~~i~~~l~~lilgL~g-wlw~QS~~Id~ 29 (110) T pfam10828 2 SKYIKATLAALILGLGG-WSWYQSQKIDS 29 (110) T ss_pred CHHHHHHHHHHHHHHHH-HHHHHHHHHHH T ss_conf 06899999999999999-99999888899 No 128 >COG1698 Uncharacterized protein conserved in archaea [Function unknown] Probab=81.03 E-value=3.5 Score=19.50 Aligned_cols=86 Identities=20% Similarity=0.295 Sum_probs=59.9 Q ss_pred HHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 99999511005778999999873126895789999999999630269989999997531146423568999999999998 Q gi|254780421|r 95 IASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALK 174 (218) Q Consensus 95 lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~ 174 (218) .+.....++.+......+..++.|...|...|..| .+++..|.... .+|. . T Consensus 5 ~~~~~~~~e~i~q~~~lL~~Ii~DttVPRNIRraA--------------~~a~e~L~~e~--e~p~-v------------ 55 (93) T COG1698 5 QGLMNDSEEKINQVMQLLDEIIQDTTVPRNIRRAA--------------EEAKEALNNEG--ESPA-V------------ 55 (93) T ss_pred HCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHH--------------HHHHHHHCCCC--CCCH-H------------ T ss_conf 30124149999999999999981576667899999--------------99999980888--8811-6------------ Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 8998999999999983558998999999999995003 Q gi|254780421|r 175 FGKVQKAKTIFEELAKDNNSPFGISTRSQMILANIIA 211 (218) Q Consensus 175 ~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~ 211 (218) .-..|+..++++.+|++.|..-|..+=..++.|.. T Consensus 56 --RaAtaIsiLeeisnDPNmP~h~RT~iw~vis~LE~ 90 (93) T COG1698 56 --RAATAISILEEISNDPNMPLHARTLIWNVISQLET 90 (93) T ss_pred --HHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf --79889999999713999854999999999999986 No 129 >KOG1128 consensus Probab=80.94 E-value=3.6 Score=19.48 Aligned_cols=100 Identities=12% Similarity=0.091 Sum_probs=56.2 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHH Q ss_conf 99999999951100577899999987312689578999999999963026998999999753114642356899999999 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGI 170 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~ 170 (218) |+...|.....++++.++...++.-.+-.+.- + -.-++.++..+...++..+.+.+.--.. ..|....+.-.+.- T Consensus 487 A~r~~~~~~~~~~~fs~~~~hle~sl~~nplq--~--~~wf~~G~~ALqlek~q~av~aF~rcvt-L~Pd~~eaWnNls~ 561 (777) T KOG1128 487 AQRSLALLILSNKDFSEADKHLERSLEINPLQ--L--GTWFGLGCAALQLEKEQAAVKAFHRCVT-LEPDNAEAWNNLST 561 (777) T ss_pred HHHHHCCCCCCCCCHHHHHHHHHHHHHCCCCC--H--HHHHHCCHHHHHHHHHHHHHHHHHHHHH-CCCCCCHHHHHHHH T ss_conf 87752543335212899999999876506632--5--5787414899987636789999998962-48882055523668 Q ss_pred HHHHCCCHHHHHHHHHHHHHCCCCC Q ss_conf 9998899899999999998355899 Q gi|254780421|r 171 SALKFGKVQKAKTIFEELAKDNNSP 195 (218) Q Consensus 171 ~~~~~Gd~~~A~~~y~~i~~~~~~p 195 (218) +|.+.|+..+|...+++++.-...| T Consensus 562 ayi~~~~k~ra~~~l~EAlKcn~~~ 586 (777) T KOG1128 562 AYIRLKKKKRAFRKLKEALKCNYQH 586 (777) T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 9999751488899999986147788 No 130 >KOG1174 consensus Probab=80.92 E-value=3.6 Score=19.48 Aligned_cols=82 Identities=16% Similarity=0.013 Sum_probs=52.2 Q ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHH Q ss_conf 57789999998731268957899999999996302699899999975311464235689999999999988998999999 Q gi|254780421|r 105 TKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTI 184 (218) Q Consensus 105 ~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~ 184 (218) -++|+..++..+.- .+.+- -|...++-+..-++.+.++++.|.......+. ..-+-.+|++...++.+.+|.++ T Consensus 420 rEKAKkf~ek~L~~---~P~Y~-~AV~~~AEL~~~Eg~~~D~i~LLe~~L~~~~D--~~LH~~Lgd~~~A~Ne~Q~am~~ 493 (564) T KOG1174 420 REKAKKFAEKSLKI---NPIYT-PAVNLIAELCQVEGPTKDIIKLLEKHLIIFPD--VNLHNHLGDIMRAQNEPQKAMEY 493 (564) T ss_pred HHHHHHHHHHHCCC---CCCHH-HHHHHHHHHHHHHCCCCCHHHHHHHHHHHCCC--HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 79999999842035---88608-99999999998627654189999998742562--08999999999986107999999 Q ss_pred HHHHHHCC Q ss_conf 99998355 Q gi|254780421|r 185 FEELAKDN 192 (218) Q Consensus 185 y~~i~~~~ 192 (218) |+.++... T Consensus 494 y~~ALr~d 501 (564) T KOG1174 494 YYKALRQD 501 (564) T ss_pred HHHHHCCC T ss_conf 99997059 No 131 >PRK09782 bacteriophage N4 receptor, outer membrane subunit; Provisional Probab=80.32 E-value=3.7 Score=19.35 Aligned_cols=119 Identities=9% Similarity=0.036 Sum_probs=75.9 Q ss_pred CCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 38766699999997312552069999999999995110057789999998731268957899999999996302699899 Q gi|254780421|r 66 SNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEE 145 (218) Q Consensus 66 ~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~ 145 (218) .++...|......-+...+.. ....-.+.++...|+...++.-|...+.-.+-.+.++ --+++...+.++... T Consensus 589 ~g~~~~Al~~~r~al~~ap~a---~~~~A~~~~~~Q~~d~~qa~~~LAeAv~lepd~~~y~----A~lGy~la~~~D~Aq 661 (987) T PRK09782 589 PGQPELALNDLTRSLNIAPSA---NAYVARATIYRQRHNVPAAVSDLRAALELEPNNSNYQ----AALGYALWDSGDIAQ 661 (987) T ss_pred CCCCHHHHHHHHHHHHCCCCH---HHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCCCCC----HHHHHHHHCCCCHHH T ss_conf 387387888899998608004---6689999999973680789999999981799985411----567778613998999 Q ss_pred HHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 99997531146423568999999999998899899999999998355 Q gi|254780421|r 146 ISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDN 192 (218) Q Consensus 146 a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~ 192 (218) ....|..-..+ .|-.+-..+.++.++-+.||.+.|+-.++.+++.. T Consensus 662 ~~~~lerAh~~-~p~d~~~~~~lAy~~~rl~D~a~a~~~~r~vID~~ 707 (987) T PRK09782 662 SREMLERAHKD-LPDDPALIRQLAYVNQRLDDMAATQHYARLVIDDI 707 (987) T ss_pred HHHHHHHHHCC-CCCCCCHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 99999975215-77552324788899876134699999999998654 No 132 >KOG0495 consensus Probab=79.85 E-value=3.9 Score=19.26 Aligned_cols=50 Identities=14% Similarity=0.150 Sum_probs=28.0 Q ss_pred CCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 699899999975311464235689999999999988998999999999983 Q gi|254780421|r 140 TSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 140 ~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) +++.++++..|+.-......| +-....+|.++...++.+.||+.|..-.. T Consensus 664 ld~~eeA~rllEe~lk~fp~f-~Kl~lmlGQi~e~~~~ie~aR~aY~~G~k 713 (913) T KOG0495 664 LDNVEEALRLLEEALKSFPDF-HKLWLMLGQIEEQMENIEMAREAYLQGTK 713 (913) T ss_pred HHHHHHHHHHHHHHHHHCCCH-HHHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 640799999999998848755-89999986899998779999999970256 No 133 >KOG4782 consensus Probab=79.44 E-value=3.9 Score=19.24 Aligned_cols=36 Identities=19% Similarity=0.075 Sum_probs=23.0 Q ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999-99999751999999999999999999999999999 Q gi|254780421|r 12 GKC-KMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKK 49 (218) Q Consensus 12 ~~~-~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~ 49 (218) +|| -.+.++||.+|-+++++.+.+. .|||..+..+| T Consensus 47 ekV~~~~aknykN~is~a~i~alVia--IY~YTfYSikQ 83 (108) T KOG4782 47 EKVKEIFAKNYKNHISFAGIGALVIA--IYGYTFYSIKQ 83 (108) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHEEEEHHH T ss_conf 99999998602201236777789998--64320010039 No 134 >KOG1586 consensus Probab=78.81 E-value=4.2 Score=19.06 Aligned_cols=135 Identities=16% Similarity=0.180 Sum_probs=68.0 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHH--------HHHHHHHHC-CHHHHHHHHHHHHHHHCCCC Q ss_conf 999999999998413876669999999731255206999999--------999999511-00577899999987312689 Q gi|254780421|r 52 IVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNM--------YIASILVAK-GDTKNAAEIFLKVANDDLAP 122 (218) Q Consensus 52 ~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l--------~lA~~~~~~-g~~~~A~~~l~~i~~~~~~~ 122 (218) .++..|..+-..+...+..+|+.-++.-++- |..+.+| .+|.++-.. .++++|+..|+...+=-..+ T Consensus 72 Daat~YveA~~cykk~~~~eAv~cL~~aieI----yt~~Grf~~aAk~~~~iaEiyEsdl~d~ekaI~~YE~Aae~yk~e 147 (288) T KOG1586 72 DAATTYVEAANCYKKVDPEEAVNCLEKAIEI----YTDMGRFTMAAKHHIEIAEIYESDLQDFEKAIAHYEQAAEYYKGE 147 (288) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCH T ss_conf 3788999998876215848899999999999----985227778775111399998622777999999999999997001 Q ss_pred HH--HHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCC--C---HHHHHHHH-HHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 57--899999999996302699899999975311464--2---35689999-999999988998999999999983 Q gi|254780421|r 123 LA--VRYAATLQAASILVDTSSYEEISKILQKLSEPS--N---PMHQFANE-ILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 123 ~~--~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~--~---~~~~~a~e-llg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) .. ...-..+..+......+++.+++..+..+.... + .|..--.. .-|..++-.+|.-.+....++-.+ T Consensus 148 es~ssANKC~lKvA~yaa~leqY~~Ai~iyeqva~~s~~n~LLKys~KdyflkAgLChl~~~D~v~a~~ALeky~~ 223 (288) T KOG1586 148 ESVSSANKCLLKVAQYAAQLEQYSKAIDIYEQVARSSLDNNLLKYSAKDYFLKAGLCHLCKADEVNAQRALEKYQE 223 (288) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 5552688999999999999999999999999999986036487767888999999886730207789999998875 No 135 >TIGR02267 TIGR02267 Myxococcus xanthus paralogous family TIGR02267; InterPro: IPR011753 This family consists of at least 7 paralogs in Myxococcus xanthus, a member of the Deltaproteobacteria. The function is unknown.. Probab=77.17 E-value=4.7 Score=18.76 Aligned_cols=50 Identities=12% Similarity=0.087 Sum_probs=32.5 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC Q ss_conf 568999999999998899899999999998355899899999999999500 Q gi|254780421|r 160 MHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANII 210 (218) Q Consensus 160 ~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~ 210 (218) -+.+-.-..+.-+...||.+.||..+..+++-..+| -+|..++..|+.|+ T Consensus 74 ~RL~rAL~~a~~~~~AGD~d~AR~~l~d~LavE~VP-~YR~~a~~~l~~Ld 123 (123) T TIGR02267 74 RRLSRALTEAVRHRDAGDLDGARALLEDVLAVEVVP-LYRELAQVQLEALD 123 (123) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHCC T ss_conf 899999999986300268688859999877403785-07899999986209 No 136 >PRK04841 transcriptional regulator MalT; Provisional Probab=76.31 E-value=4.9 Score=18.62 Aligned_cols=25 Identities=16% Similarity=0.070 Sum_probs=11.2 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 9999999998899899999999998 Q gi|254780421|r 165 NEILGISALKFGKVQKAKTIFEELA 189 (218) Q Consensus 165 ~ellg~~~~~~Gd~~~A~~~y~~i~ 189 (218) .-+.+.++...|+..+|...+.+++ T Consensus 734 ~ll~A~a~~~~g~~~~A~~~l~~AL 758 (903) T PRK04841 734 LILLNQLYWQQGRKSDAQRALLEAL 758 (903) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 9999999998799999999999999 No 137 >KOG3081 consensus Probab=75.41 E-value=5.2 Score=18.47 Aligned_cols=110 Identities=17% Similarity=0.220 Sum_probs=55.5 Q ss_pred HHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCC Q ss_conf 99511005778999999873126895789999999999630269989999997531146423568999999999998899 Q gi|254780421|r 98 ILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGK 177 (218) Q Consensus 98 ~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd 177 (218) ++......+-|...++.-. +-+.+.++..+|.- +..+.....+..+|--.++.+..+.+| ...-+..++-..+.+|+ T Consensus 146 I~lk~~r~d~A~~~lk~mq-~ided~tLtQLA~a-wv~la~ggek~qdAfyifeE~s~k~~~-T~~llnG~Av~~l~~~~ 222 (299) T KOG3081 146 ILLKMHRFDLAEKELKKMQ-QIDEDATLTQLAQA-WVKLATGGEKIQDAFYIFEELSEKTPP-TPLLLNGQAVCHLQLGR 222 (299) T ss_pred HHHHHHHHHHHHHHHHHHH-CCCHHHHHHHHHHH-HHHHHCCCHHHHHHHHHHHHHHCCCCC-CHHHHCCHHHHHHHHCC T ss_conf 9998789999999999997-11357899999999-999960632454689999998353599-76887069999897169 Q ss_pred HHHHHHHHHHHHHC-CCCCHHHHHHHHHHH-HHHCCCH Q ss_conf 89999999999835-589989999999999-9500340 Q gi|254780421|r 178 VQKAKTIFEELAKD-NNSPFGISTRSQMIL-ANIIASD 213 (218) Q Consensus 178 ~~~A~~~y~~i~~~-~~~p~~~r~ra~~~l-~~l~~~~ 213 (218) +++|....+.+++. ..-|..+ ++++. +.+.|.| T Consensus 223 ~eeAe~lL~eaL~kd~~dpetL---~Nliv~a~~~Gkd 257 (299) T KOG3081 223 YEEAESLLEEALDKDAKDPETL---ANLIVLALHLGKD 257 (299) T ss_pred HHHHHHHHHHHHHCCCCCHHHH---HHHHHHHHHHCCC T ss_conf 9999999999985347878999---9999999981899 No 138 >PRK11556 multidrug efflux system subunit MdtA; Provisional Probab=74.16 E-value=3.3 Score=19.71 Aligned_cols=27 Identities=15% Similarity=0.806 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99975199999999999999999999999 Q gi|254780421|r 17 GKCCYKWIAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 17 ~~~~yk~ii~~~i~~i~~~~~~~~~~~~~ 45 (218) .+..++|+|++++++. ++++||+ |... T Consensus 3 ~~~~~~~~~~~~~~~~-~~~~~~~-~~~~ 29 (415) T PRK11556 3 GSYKFRWVIVIVVVIA-AIAAFWF-WHGR 29 (415) T ss_pred CCHHHHHHHHHHHHHH-HHHHHHH-HCCC T ss_conf 4115589999999999-9999987-4047 No 139 >KOG2005 consensus Probab=71.37 E-value=6.5 Score=17.87 Aligned_cols=137 Identities=19% Similarity=0.155 Sum_probs=88.1 Q ss_pred CCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHH Q ss_conf 87666999999973125520699999999999951100577899999987312689578999999999963026998999 Q gi|254780421|r 67 NKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEI 146 (218) Q Consensus 67 ~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a 146 (218) ++-+-|...+...+.+.++....-+-|-++-.+ ...+.++-...+..+..+.+.+.....+|.+.++.++...-+.|-+ T Consensus 448 ne~dpalALLsdyv~~~~s~~ri~aIlGLglay-aGsq~e~Vl~lL~Pi~~d~~~~~ev~~~aslsLG~IfvGscn~dvt 526 (878) T KOG2005 448 NECDPALALLSDYLQSSSSIHRIGAILGLGLAY-AGSQREEVLELLSPIMFDTKSPMEVVAFASLSLGMIFVGSCNEDVT 526 (878) T ss_pred CCCCHHHHHHHHHCCCCCCEEEHHHHHHHHHHH-CCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHCCEEEEECCCHHHH T ss_conf 446779999987405887212048665217765-4776689999976775178986367888875301168704774889 Q ss_pred HHHHHHHCCCCC-----HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 999753114642-----356899999999999889989999999999835589989999999999950 Q gi|254780421|r 147 SKILQKLSEPSN-----PMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 147 ~~~l~~~~~~~~-----~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) -..++.+-+.+. +|.-.-.-.+|.+++.+++..+|... .+. ..+..++..+++.+..- T Consensus 527 s~ilqtlmekse~El~d~~~RFL~LGL~llflgkqe~~d~~~e---~~~--~i~~~~~~~~~~lv~~c 589 (878) T KOG2005 527 SSILQTLMEKSETELEDQWFRFLALGLALLFLGKQESVDAVVE---TIK--AIEGPIRKHESILVKSC 589 (878) T ss_pred HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHCCCCHHHHHHH---HHH--HHHHHHHHHHHHHHHHH T ss_conf 9999999975466540469999999999998455324899999---999--74057888999999886 No 140 >PRK10476 multidrug resistance protein MdtN; Provisional Probab=71.08 E-value=1.2 Score=22.42 Aligned_cols=23 Identities=13% Similarity=0.284 Sum_probs=12.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 19999999999999999999999 Q gi|254780421|r 22 KWIAPLLILMILSLAIWFYLFDG 44 (218) Q Consensus 22 k~ii~~~i~~i~~~~~~~~~~~~ 44 (218) +.+|+.++++++++++||+||.. T Consensus 13 ~~~i~~~~~~~~~~~~~~~~~~~ 35 (348) T PRK10476 13 AKVIALAIVALALVALVYVIWRT 35 (348) T ss_pred CEEHHHHHHHHHHHHHHHHHEEC T ss_conf 12278999999999987651324 No 141 >pfam05568 ASFV_J13L African swine fever virus J13L protein. This family consists of several African swine fever virus J13L proteins. Probab=70.93 E-value=3.2 Score=19.76 Aligned_cols=44 Identities=18% Similarity=0.401 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC Q ss_conf 99999999999999999999999999999999999999984138 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSN 67 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~ 67 (218) |++++|++|++++...|.+.+++++.+....+.=.+.+..++.. T Consensus 34 iliaivvliiiiivliylfssrkkkaaaaieeediqfinpyqdq 77 (189) T pfam05568 34 ILIAIVVLIIIIIVLIYLFSSRKKKAAAAIEEEDIQFINPYQDQ 77 (189) T ss_pred HHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHCCEEECCCCCC T ss_conf 99999999999999999972203676532004203020733200 No 142 >KOG0545 consensus Probab=70.41 E-value=6.8 Score=17.74 Aligned_cols=110 Identities=13% Similarity=0.025 Sum_probs=76.4 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHH--------HCCCCHHHHHHHH------HHHHHHHHCCCCHHHHHHHHHHHCCC Q ss_conf 999999999511005778999999873--------1268957899999------99999630269989999997531146 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVAN--------DDLAPLAVRYAAT------LQAASILVDTSSYEEISKILQKLSEP 156 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~--------~~~~~~~~~~lA~------l~la~~~~~~~~~d~a~~~l~~~~~~ 156 (218) +.-+.+.-++..|++.+|...|...+. ..+.++.+..+.+ +..+..++..+++-+++..-+.+... T Consensus 180 ~l~q~GN~lfk~~~ykEA~~~YreAi~~l~~L~lkEkP~e~eW~eLdk~~tpLllNy~QC~L~~~e~yevleh~seiL~~ 259 (329) T KOG0545 180 VLHQEGNRLFKLGRYKEASSKYREAIICLRNLQLKEKPGEPEWLELDKMITPLLLNYCQCLLKKEEYYEVLEHCSEILRH 259 (329) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC T ss_conf 89875014666221778999999999999988740489981899998763377886777874487899999988999850 Q ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHH Q ss_conf 423568999999999998899899999999998355899899999 Q gi|254780421|r 157 SNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTR 201 (218) Q Consensus 157 ~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~r 201 (218) . |-..-|+..+|-+....=+.++|+..|++++...++-.+...| T Consensus 260 ~-~~nvKA~frRakAhaa~Wn~~eA~~D~~~vL~ldpslasvVsr 303 (329) T KOG0545 260 H-PGNVKAYFRRAKAHAAVWNEAEAKADLQKVLELDPSLASVVSR 303 (329) T ss_pred C-CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 8-8508999998778775368888888999998619566789999 No 143 >PRK04330 hypothetical protein; Provisional Probab=69.33 E-value=7.2 Score=17.60 Aligned_cols=81 Identities=25% Similarity=0.331 Sum_probs=54.5 Q ss_pred HHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHH Q ss_conf 51100577899999987312689578999999999963026998999999753114642356899999999999889989 Q gi|254780421|r 100 VAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQ 179 (218) Q Consensus 100 ~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~ 179 (218) ..+..+..++..++++++|.+.|..+|..| .++...|. +++..... +.. T Consensus 6 ~~e~~i~~~~~~L~~I~~D~sVPRNIRr~a--------------~ea~~~L~---ne~~~~~v--------------Raa 54 (88) T PRK04330 6 DNEEKIKQAIEMLERIINDTTVPRNIRRAA--------------TEAKEILL---NEEESPGV--------------RAA 54 (88) T ss_pred CHHHHHHHHHHHHHHHHCCCCCCHHHHHHH--------------HHHHHHHH---CCCCCHHH--------------HHH T ss_conf 389999999999999853898876899999--------------99999986---77856457--------------899 Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 99999999983558998999999999995003 Q gi|254780421|r 180 KAKTIFEELAKDNNSPFGISTRSQMILANIIA 211 (218) Q Consensus 180 ~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~ 211 (218) .|...++++.+|++.|...|..+=+.++.|.. T Consensus 55 ~aIs~LDeIs~DPNmP~h~RT~Iw~ivS~LE~ 86 (88) T PRK04330 55 TAISILDEISNDPNMPLHTRTLIWNIVSQLET 86 (88) T ss_pred HHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 99999997623999981899999999999852 No 144 >PRK12370 invasion protein regulator; Provisional Probab=68.69 E-value=7.4 Score=17.51 Aligned_cols=134 Identities=16% Similarity=0.095 Sum_probs=62.6 Q ss_pred HHHHHHHHHHHH---HCCCHHHHHHHHHHHHHCCCCC---HHHHHHHHHHHHHHHCCHHHHHHHHH----HHHHHHCCCC Q ss_conf 999999999984---1387666999999973125520---69999999999995110057789999----9987312689 Q gi|254780421|r 53 VGENFAQALELF---NSNKLDDARSSFEKILSQDNKL---YNPLSNMYIASILVAKGDTKNAAEIF----LKVANDDLAP 122 (218) Q Consensus 53 as~~y~~al~~~---~~~~~~~A~~~l~~l~~~~~~~---ya~lA~l~lA~~~~~~g~~~~A~~~l----~~i~~~~~~~ 122 (218) +.+.|...+... ...+...|+..|++-.+..|.. |+.++.-+.......-.+.+++...- ....+-++.+ T Consensus 258 A~~~Yl~Gl~hl~rft~edn~~A~~LF~~Av~~dP~fA~AyA~la~cy~q~a~l~~~d~~~~~~~A~~~AeravEldp~d 337 (553) T PRK12370 258 STMVYLRGKHELNQYTPYSLQQALKLLTQCVNMSPNSIAPYCALAECYLSMAQMGIFDKQNAMIKAKEHAIKATELDHNN 337 (553) T ss_pred HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCC T ss_conf 89999978999973188779999999999873397744667878999997634036882889999999999987438788 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 578999999999963026998999999753114642356899999999999889989999999999835 Q gi|254780421|r 123 LAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 123 ~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) + .|...++.+.--.++.+-+...++.-.. -+|..+.+....|-.+.-.|+.++|+...++.+.. T Consensus 338 p----~a~~~lg~~~~L~~d~~~a~~~f~rA~~-L~Pnsa~~~Y~~~W~~~maGr~~ea~~~i~~a~~L 401 (553) T PRK12370 338 P----QALGLLGLINTIHSEYIVGSLLFKQANL-LSPISADIKYYYGWNLFMAGQLEEALQTINECLKL 401 (553) T ss_pred H----HHHHHHHHHHHHCCCCHHHHHHHHHHHC-CCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC T ss_conf 7----8889999998723552244799987642-59984257999999999758689999999998835 No 145 >COG5187 RPN7 26S proteasome regulatory complex component, contains PCI domain [Posttranslational modification, protein turnover, chaperones] Probab=68.31 E-value=7.5 Score=17.47 Aligned_cols=109 Identities=13% Similarity=0.109 Sum_probs=82.0 Q ss_pred CCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCH--HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHH-- Q ss_conf 206999999999999511005778999999873126895--7899999999996302699899999975311464235-- Q gi|254780421|r 85 KLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPL--AVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPM-- 160 (218) Q Consensus 85 ~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~--~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~-- 160 (218) -++..-|.-.+|..+-.-+|.+.+.+......++.-+.. .---+.++|++.++-+..=.++.++.+.++...++.| T Consensus 111 E~e~~ea~~n~aeyY~qi~D~~ng~~~~~~~~~~a~stg~KiDv~l~kiRlg~~y~d~~vV~e~lE~~~~~iEkGgDWeR 190 (412) T COG5187 111 ETEGSEADRNIAEYYCQIMDIQNGFEWMRRLMRDAMSTGLKIDVFLCKIRLGLIYGDRKVVEESLEVADDIIEKGGDWER 190 (412) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHCCCHHH T ss_conf 05789999889999998740664899999999988751640466887787887401099999999987889870787776 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCC Q ss_conf 689999999999988998999999999983558 Q gi|254780421|r 161 HQFANEILGISALKFGKVQKAKTIFEELAKDNN 193 (218) Q Consensus 161 ~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~ 193 (218) +....-..|......-++.+|-..+-.++..-. T Consensus 191 rNRyK~Y~Gi~~m~~RnFkeAa~Ll~d~l~tF~ 223 (412) T COG5187 191 RNRYKVYKGIFKMMRRNFKEAAILLSDILPTFE 223 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 656899988999999856788999998833334 No 146 >COG3170 FimV Tfp pilus assembly protein FimV [Cell motility and secretion / Intracellular trafficking and secretion] Probab=67.20 E-value=7.9 Score=17.33 Aligned_cols=44 Identities=16% Similarity=0.271 Sum_probs=33.6 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 9999999999889989999999999835589989999999999950 Q gi|254780421|r 164 ANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 164 a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) .-+.++.+|+..||.+.|+++.++++...+.+ ...+++.++..| T Consensus 711 ~~~dla~ayi~~gd~e~A~~~l~~vl~~g~~s--~~aea~~L~~~~ 754 (755) T COG3170 711 IPLDLARAYIEIGDRDGAREILQEVLAEGDPS--QLAEAQALLQEL 754 (755) T ss_pred CHHHHHHHHHHHCCHHHHHHHHHHHHHCCCCH--HHHHHHHHHHHC T ss_conf 46789999987158888999999998604812--678999998531 No 147 >pfam04053 Coatomer_WDAD Coatomer WD associated region. This region is composed of WD40 repeats. Probab=66.25 E-value=8.3 Score=17.21 Aligned_cols=45 Identities=18% Similarity=0.310 Sum_probs=26.6 Q ss_pred HCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 026998999999753114642356899999999999889989999999999 Q gi|254780421|r 138 VDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEEL 188 (218) Q Consensus 138 ~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i 188 (218) ++.++.+.|.+....+.++ ..+..+|+.++.+|+.+-|.+.|++. T Consensus 321 l~lG~L~~A~~~A~~~~~~------~~Wk~Lg~~AL~~g~~~lAe~~~~k~ 365 (435) T pfam04053 321 LELGNLDVALEIAKELDDE------HKWKRLGDAALSQGNIKLAEEAYQKA 365 (435) T ss_pred HHCCCHHHHHHHHHHHCCH------HHHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 8619999999999873679------99999999999749958999999974 No 148 >PRK07021 fliL flagellar basal body-associated protein FliL; Reviewed Probab=65.95 E-value=4.9 Score=18.60 Aligned_cols=23 Identities=13% Similarity=0.160 Sum_probs=10.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSH 46 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q 46 (218) ||++++++++++++|.+||-..+ T Consensus 17 ii~~~~lll~~~g~g~~~~~~~~ 39 (159) T PRK07021 17 ILLILITLAAAAGAGYSYWLSKQ 39 (159) T ss_pred HHHHHHHHHHHHHHHHHHHHCCC T ss_conf 89999999999999999995477 No 149 >COG3763 Uncharacterized protein conserved in bacteria [Function unknown] Probab=65.86 E-value=8.4 Score=17.16 Aligned_cols=26 Identities=15% Similarity=0.221 Sum_probs=16.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDGSHEK 48 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~~q~~ 48 (218) +|++.++.+++|+++|++.-..+-++ T Consensus 6 ail~ivl~ll~G~~~G~fiark~~~k 31 (71) T COG3763 6 AILLIVLALLAGLIGGFFIARKQMKK 31 (71) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999999999 No 150 >KOG2047 consensus Probab=65.78 E-value=8.5 Score=17.15 Aligned_cols=25 Identities=24% Similarity=0.321 Sum_probs=11.2 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 9999999988998999999999983 Q gi|254780421|r 166 EILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 166 ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) --+++.|.+.|.+++|+..|++.+. T Consensus 252 ~SLAdYYIr~g~~ekarDvyeeai~ 276 (835) T KOG2047 252 CSLADYYIRSGLFEKARDVYEEAIQ 276 (835) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999873116777899999987 No 151 >COG3167 PilO Tfp pilus assembly protein PilO [Cell motility and secretion / Intracellular trafficking and secretion] Probab=65.47 E-value=8.6 Score=17.12 Aligned_cols=36 Identities=11% Similarity=0.094 Sum_probs=23.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999751999999999999999999999999999999 Q gi|254780421|r 17 GKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNI 52 (218) Q Consensus 17 ~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~ 52 (218) |----|-++..++++.+.+.|||++|...+.+-... T Consensus 19 Wpl~~rlv~~lL~~~~V~~lGy~f~~s~k~eel~~~ 54 (211) T COG3167 19 WPLAPRLVFCLLAVAAVLGLGYAFYLSGKLEELEEL 54 (211) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHH T ss_conf 200379999999999999999999854078889999 No 152 >PRK09458 pspB phage shock protein B; Provisional Probab=64.56 E-value=8.9 Score=17.01 Aligned_cols=22 Identities=18% Similarity=0.360 Sum_probs=14.8 Q ss_pred HCCCHHHHHHHHHHHHHCCCCC Q ss_conf 1387666999999973125520 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQDNKL 86 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~~~~ 86 (218) ...+.++.+..++.+++...+. T Consensus 50 ~A~~m~eRI~tLE~ILDae~P~ 71 (75) T PRK09458 50 KAEKMRERIQTLEAILDAEHPN 71 (75) T ss_pred HHHHHHHHHHHHHHHHCCCCCC T ss_conf 9999999999999997266988 No 153 >KOG0376 consensus Probab=64.04 E-value=9.1 Score=16.95 Aligned_cols=29 Identities=24% Similarity=0.209 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 99999999999889989999999999835 Q gi|254780421|r 163 FANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 163 ~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) .+...+|.+..+.+.+.+|...|+..... T Consensus 73 K~Y~rrg~a~m~l~~~~~A~~~l~~~~~l 101 (476) T KOG0376 73 KAYVRRGTAVMALGEFKKALLDLEKVKKL 101 (476) T ss_pred HEEEECCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 30343117787577799999999876623 No 154 >pfam04011 LemA LemA family. The members of this family are related to the LemA protein. LemA contains an amino terminal predicted transmembrane helix. It has been predicted that the small amino terminus is extracellular. The exact molecular function of this protein is uncertain. Probab=63.87 E-value=9.2 Score=16.93 Aligned_cols=45 Identities=16% Similarity=0.175 Sum_probs=23.3 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 99999999988998999999999983558998999999999995003 Q gi|254780421|r 165 NEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANIIA 211 (218) Q Consensus 165 ~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~ 211 (218) ..+...+..-+.+...||..|+.....-+ ..+++.=..+++.+-| T Consensus 121 ~~L~~~l~~~E~~Ia~aR~~YN~~V~~yN--~~i~~FP~~lvA~~~g 165 (186) T pfam04011 121 LELQSQLEGTENRIAVARRDYNEAVQEYN--TKIRQFPSVIVAKLFG 165 (186) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHCCHHHHHHHCC T ss_conf 99999999999999999999999999998--8788382999998759 No 155 >pfam09402 MSC Man1-Src1p-C-terminal domain. MAN1 is an integral protein of the inner nuclear membrane which binds to chromatin associated proteins and plays a role in nuclear organisation. The C terminal nucleoplasmic region forms a DNA binding winged helix and binds to Smad. This C-terminal tail is also found in S. cerevisiae and is thought to consist of three conserved helices followed by two downstream strands. Probab=63.56 E-value=9.3 Score=16.90 Aligned_cols=100 Identities=13% Similarity=0.146 Sum_probs=57.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 89999999975199999999999999999999999999999999999999998413876669999999731255206999 Q gi|254780421|r 11 KGKCKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPL 90 (218) Q Consensus 11 ~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~l 90 (218) .-.++.|-..|+.+|++++++++++....+.|..++. ......+.+..+++..++... ... .+....+|-.. T Consensus 199 ~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~r~~~~~~-~~~~v~~lv~~vi~~L~~q~~----~~~---~~~~~~p~i~~ 270 (325) T pfam09402 199 RRFVRLFLKRYRLLLLGIAILLLVVFYLRYRIRKKRE-EKARVEELVKKIIDKLKQQAA----LSD---EDSSEEPYVSI 270 (325) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHH----HHC---CCCCCCCCCCH T ss_conf 6799999999999999999999999999999999999-999999999999999999998----615---78888987123 Q ss_pred HHHHHHHHHHHCCHHHHHHHHHHHHHHHC Q ss_conf 99999999951100577899999987312 Q gi|254780421|r 91 SNMYIASILVAKGDTKNAAEIFLKVANDD 119 (218) Q Consensus 91 A~l~lA~~~~~~g~~~~A~~~l~~i~~~~ 119 (218) .++. -.++...++...-..+++.+.+.- T Consensus 271 ~qLR-D~lL~~~~~~~~~~~lW~~V~~~v 298 (325) T pfam09402 271 PQLR-DDLLRDEHRLKRRNRLWEKVVSKL 298 (325) T ss_pred HHHH-HHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 5768-887167766168999999999998 No 156 >KOG2062 consensus Probab=63.56 E-value=9.3 Score=16.90 Aligned_cols=161 Identities=22% Similarity=0.242 Sum_probs=99.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC Q ss_conf 99999999999999999999984138766699999997312552069999999999995110057789999998731268 Q gi|254780421|r 42 FDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLA 121 (218) Q Consensus 42 ~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~ 121 (218) ..+-++.|.++.-.-..-.+....=+..++|-...+++....++....-..|.+|-.+..-|+ .+|+..+..+.-.+.- T Consensus 490 ~~Ya~ETQHeki~RGl~vGiaL~~ygrqe~Ad~lI~el~~dkdpilR~~Gm~t~alAy~GTgn-nkair~lLh~aVsD~n 568 (929) T KOG2062 490 LTYAQETQHEKIIRGLAVGIALVVYGRQEDADPLIKELLRDKDPILRYGGMYTLALAYVGTGN-NKAIRRLLHVAVSDVN 568 (929) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCHHHHHHHHHCCCC-HHHHHHHHCCCCCCCC T ss_conf 998512158889998877589998610444479999985277567664019999999865675-0777876110125775 Q ss_pred CHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHH-HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHH Q ss_conf 957899999999996302699899999975311464235-6899999999999889989999999999835589989999 Q gi|254780421|r 122 PLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPM-HQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGIST 200 (218) Q Consensus 122 ~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~-~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ 200 (218) +. .|..|.+.++.+. -.+++.....+..+...-+|+ ++-+...+|.+..-.|+. +|+...+-+.. |+..-+|+ T Consensus 569 DD-VrRaAVialGFVl--~~dp~~~~s~V~lLses~N~HVRyGaA~ALGIaCAGtG~~-eAi~lLepl~~--D~~~fVRQ 642 (929) T KOG2062 569 DD-VRRAAVIALGFVL--FRDPEQLPSTVSLLSESYNPHVRYGAAMALGIACAGTGLK-EAINLLEPLTS--DPVDFVRQ 642 (929) T ss_pred HH-HHHHHHHHHEEEE--ECCHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCCH-HHHHHHHHHHC--CHHHHHHH T ss_conf 38-9999998700258--4685653489998765047166655899875674488728-99998767631--83988988 Q ss_pred HHHHHHHHH Q ss_conf 999999950 Q gi|254780421|r 201 RSQMILANI 209 (218) Q Consensus 201 ra~~~l~~l 209 (218) -|-+-++.| T Consensus 643 gAlIa~amI 651 (929) T KOG2062 643 GALIALAMI 651 (929) T ss_pred HHHHHHHHH T ss_conf 789999999 No 157 >KOG1127 consensus Probab=63.45 E-value=9.4 Score=16.88 Aligned_cols=126 Identities=14% Similarity=0.054 Sum_probs=75.0 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHH Q ss_conf 66699999997312552069999999999995110057789999998731268957899999999996302699899999 Q gi|254780421|r 69 LDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISK 148 (218) Q Consensus 69 ~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~ 148 (218) .-.|.+-|++--+-.++-|.+. -..+..+.+..+.+.|.+......++.+ ...+.-.-.+.+..+++.++...++. T Consensus 508 m~RA~kCf~KAFeLDatdaeaa--aa~adtyae~~~we~a~~I~l~~~qka~--a~~~k~nW~~rG~yyLea~n~h~aV~ 583 (1238) T KOG1127 508 MKRAKKCFDKAFELDATDAEAA--AASADTYAEESTWEEAFEICLRAAQKAP--AFACKENWVQRGPYYLEAHNLHGAVC 583 (1238) T ss_pred HHHHHHHHHHHHCCCCHHHHHH--HHHHHHHHCCCCHHHHHHHHHHHHHHCH--HHHHHHHHHHCCCCCCCCCCHHHHHH T ss_conf 9999999988863791234467--8888876315418899999988742154--77877523540434368521555899 Q ss_pred HHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHH Q ss_conf 975311464235689999999999988998999999999983558998999 Q gi|254780421|r 149 ILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGIS 199 (218) Q Consensus 149 ~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r 199 (218) -++.-..- +|-..-..+.+|.+|.+-|.+..|.+.|.++..-.+.+...| T Consensus 584 ~fQsALR~-dPkD~n~W~gLGeAY~~sGry~~AlKvF~kAs~LrP~s~y~~ 633 (1238) T KOG1127 584 EFQSALRT-DPKDYNLWLGLGEAYPESGRYSHALKVFTKASLLRPLSKYGR 633 (1238) T ss_pred HHHHHHCC-CCHHHHHHHHHHHHHHHCCCEEHHHHHHHHHHHCCCHHHHHH T ss_conf 99998638-906578899877887743762007776424574482768999 No 158 >KOG0860 consensus Probab=63.30 E-value=9.4 Score=16.87 Aligned_cols=29 Identities=28% Similarity=0.114 Sum_probs=16.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 58899999999751999999999999999 Q gi|254780421|r 9 SYKGKCKMGKCCYKWIAPLLILMILSLAI 37 (218) Q Consensus 9 ~~~~~~~~~~~~yk~ii~~~i~~i~~~~~ 37 (218) +.|=|=|||-++-|+.++..+++++.++. T Consensus 80 A~klkrk~wWkn~Km~~il~~v~~i~l~i 108 (116) T KOG0860 80 AVKLKRKMWWKNCKMRIILGLVIIILLVV 108 (116) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999878999999999999999 No 159 >pfam12273 RCR Chitin synthesis regulation, resistance to Congo red. RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5. Probab=62.02 E-value=9 Score=17.00 Aligned_cols=23 Identities=9% Similarity=0.327 Sum_probs=11.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 19999999999999999999999 Q gi|254780421|r 22 KWIAPLLILMILSLAIWFYLFDG 44 (218) Q Consensus 22 k~ii~~~i~~i~~~~~~~~~~~~ 44 (218) +|++++++|+++.++++.+...+ T Consensus 1 RWv~fav~i~~~~i~~f~~~~~n 23 (124) T pfam12273 1 RWVLFAIFIIALLILFFLTARIN 23 (124) T ss_pred CEEEHHHHHHHHHHHHHHHHHHH T ss_conf 91108999999999999999873 No 160 >COG4968 PilE Tfp pilus assembly protein PilE [Cell motility and secretion / Intracellular trafficking and secretion] Probab=62.01 E-value=9.9 Score=16.72 Aligned_cols=36 Identities=11% Similarity=0.097 Sum_probs=24.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFA 58 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~ 58 (218) -+|.++|+.|++++.+-.|.++....+..++-..+. T Consensus 13 LmIvVaIv~ILa~IAyPSY~~yv~rs~R~~a~A~L~ 48 (139) T COG4968 13 LMIVVAIVGILALIAYPSYQNYVLRSRRSAAKAALL 48 (139) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999987076899999989999999999 No 161 >PRK11619 lytic murein transglycosylase; Provisional Probab=61.09 E-value=10 Score=16.62 Aligned_cols=124 Identities=15% Similarity=0.145 Sum_probs=51.0 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHCC--CCC-HHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHH Q ss_conf 999999841387666999999973125--520-69999999999995110057789999998731268957899999999 Q gi|254780421|r 57 FAQALELFNSNKLDDARSSFEKILSQD--NKL-YNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQA 133 (218) Q Consensus 57 y~~al~~~~~~~~~~A~~~l~~l~~~~--~~~-ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~l 133 (218) ...++.-.-..+.+.|...+..+.... +.. ...+... +|-.++.....+++..-+..+......+.... -++|. T Consensus 245 ~~~al~RLAR~Dp~~A~~~~~~~~~~~~~s~~~~~~l~~~-ia~rl~~~~~~~e~~~W~d~~~~~~~~~~l~E--~rvR~ 321 (645) T PRK11619 245 AAVAFASVARQDAENARLMIPSLVQAQKLNEDQIQELRDI-VAWRLMGNDVTDEQAKWRDDAIMRSQSTSLIE--RRVRM 321 (645) T ss_pred HHHHHHHHHHHCHHHHHHHHHHHHHHCCCCHHHHHHHHHH-HHHHHHHCCCCHHHHHHHHHHHHHCCCHHHHH--HHHHH T ss_conf 9999999998499999999999987546999999999999-99999743379888866876653138849999--99999 Q ss_pred HHHHHCCCCHHHHHHHHH---HHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 996302699899999975---311464235689999999999988998999999999983 Q gi|254780421|r 134 ASILVDTSSYEEISKILQ---KLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 134 a~~~~~~~~~d~a~~~l~---~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) + +-.++...+..-+. +-......|++ =+|.++...|+.++|.+.|+.++. T Consensus 322 A---Lr~~DW~~v~~~I~~Lp~~~q~~~~WqY----W~ARal~~~G~~~~A~~~~~~La~ 374 (645) T PRK11619 322 A---LGTGDRRGLNTWLARLPMEAKEKDEWRY----WQADLLLERGREAEAKEILHQLMQ 374 (645) T ss_pred H---HHHCCHHHHHHHHHHCCHHHCCCHHHHH----HHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 9---9855699999999968997627777799----999999973998889999999860 No 162 >KOG1915 consensus Probab=60.66 E-value=10 Score=16.57 Aligned_cols=71 Identities=14% Similarity=0.160 Sum_probs=34.5 Q ss_pred HHHHHHHHHHHHHHHHHCCCHHHHHHH-------------------------HHHHHHCCCCCHHHHHHHHHHHHHHHCC Q ss_conf 999999999999998413876669999-------------------------9997312552069999999999995110 Q gi|254780421|r 49 KKNIVGENFAQALELFNSNKLDDARSS-------------------------FEKILSQDNKLYNPLSNMYIASILVAKG 103 (218) Q Consensus 49 ~~~~as~~y~~al~~~~~~~~~~A~~~-------------------------l~~l~~~~~~~ya~lA~l~lA~~~~~~g 103 (218) .-+.+...|.-+++.+-.++.++.-+. +++.+...+..|. +-|-..++.-..| T Consensus 259 E~ERar~iykyAld~~pk~raeeL~k~~~~fEKqfGd~~gIEd~Iv~KRk~qYE~~v~~np~nYD--sWfdylrL~e~~g 336 (677) T KOG1915 259 EYERARFIYKYALDHIPKGRAEELYKKYTAFEKQFGDKEGIEDAIVGKRKFQYEKEVSKNPYNYD--SWFDYLRLEESVG 336 (677) T ss_pred HHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHCC T ss_conf 99999999999987468650999999999999971651116888766565379999872987706--8999999997227 Q ss_pred HHHHHHHHHHHHHHHCCC Q ss_conf 057789999998731268 Q gi|254780421|r 104 DTKNAAEIFLKVANDDLA 121 (218) Q Consensus 104 ~~~~A~~~l~~i~~~~~~ 121 (218) +.+.-.+.|...|.+.+. T Consensus 337 ~~~~Ire~yErAIanvpp 354 (677) T KOG1915 337 DKDRIRETYERAIANVPP 354 (677) T ss_pred CHHHHHHHHHHHHCCCCC T ss_conf 788999999998705994 No 163 >PRK02201 putative inner membrane protein translocase component YidC; Provisional Probab=59.62 E-value=11 Score=16.46 Aligned_cols=31 Identities=29% Similarity=0.442 Sum_probs=14.2 Q ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4112458899999999751999999999999 Q gi|254780421|r 4 NNDANSYKGKCKMGKCCYKWIAPLLILMILS 34 (218) Q Consensus 4 ~~~~~~~~~~~~~~~~~yk~ii~~~i~~i~~ 34 (218) +||.|.-|.|.+-|++.+||+=+.+.+.+++ T Consensus 16 ~~~~~~~k~~k~~~k~~~K~~Ki~~~~fl~~ 46 (392) T PRK02201 16 SNLNNKKKTKKKIFKKVWKWLKIFFYLFLFG 46 (392) T ss_pred HHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 4421003458899999999999999999999 No 164 >PRK05696 fliL flagellar basal body-associated protein FliL; Reviewed Probab=59.57 E-value=7.5 Score=17.48 Aligned_cols=12 Identities=0% Similarity=0.061 Sum_probs=4.3 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254780421|r 31 MILSLAIWFYLF 42 (218) Q Consensus 31 ~i~~~~~~~~~~ 42 (218) ++.+++++|++. T Consensus 29 l~ggggg~~~~l 40 (168) T PRK05696 29 ALGGGGAAWFFM 40 (168) T ss_pred HHHHHHHHHHHH T ss_conf 999999999995 No 165 >COG4093 Uncharacterized protein conserved in bacteria [Function unknown] Probab=59.47 E-value=11 Score=16.45 Aligned_cols=38 Identities=8% Similarity=0.260 Sum_probs=26.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999751999999999999999999999999999999 Q gi|254780421|r 15 KMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNI 52 (218) Q Consensus 15 ~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~ 52 (218) +-.++.+-|..+++++++.+..+||+|--.+-+++... T Consensus 10 ~~~rkr~~wl~i~ivv~~g~ySaGWFylA~rle~~~~a 47 (338) T COG4093 10 SATRKRLFWLVIAIVVLIGAYSAGWFYLADRLEQQADA 47 (338) T ss_pred CCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 86432234899999998877501076677699999987 No 166 >pfam03685 UPF0147 Uncharacterized protein family (UPF0147). This family of small proteins have no known function. Probab=59.30 E-value=11 Score=16.43 Aligned_cols=79 Identities=23% Similarity=0.315 Sum_probs=52.8 Q ss_pred CCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHH Q ss_conf 10057789999998731268957899999999996302699899999975311464235689999999999988998999 Q gi|254780421|r 102 KGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKA 181 (218) Q Consensus 102 ~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A 181 (218) +..+..++..++.+++|.+.|..+|..| .++...|. +++..... +...| T Consensus 5 e~~i~~~~~~L~~I~~D~sVPRNIRr~a--------------~e~~~~L~---ne~~~~~v--------------Raa~a 53 (85) T pfam03685 5 EEKIKQAIEMLDRIINDTTVPRNIRRAA--------------TDAKAALL---NEEESPAV--------------RAATA 53 (85) T ss_pred HHHHHHHHHHHHHHHCCCCCCHHHHHHH--------------HHHHHHHH---CCCCCHHH--------------HHHHH T ss_conf 9999999999999843898876899999--------------99999987---88856457--------------89999 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHHHHHCC Q ss_conf 999999983558998999999999995003 Q gi|254780421|r 182 KTIFEELAKDNNSPFGISTRSQMILANIIA 211 (218) Q Consensus 182 ~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~ 211 (218) ...++++.+|++.|...|..+=+.++.|.. T Consensus 54 is~LdeIs~DPNmP~h~RT~Iw~ivS~LE~ 83 (85) T pfam03685 54 ISILDEISNDPNMPIHTRTLIWNIVSQLET 83 (85) T ss_pred HHHHHHHHCCCCCCHHHHHHHHHHHHHHHH T ss_conf 999997623999981899999999999862 No 167 >pfam09543 DUF2379 Protein of unknown function (DUF2379). This family consists of at least 7 paralogs in Myxococcus xanthus and 6 in Stigmatella aurantiaca, both members of the Deltaproteobacteria. The function is unknown. Probab=58.61 E-value=11 Score=16.36 Aligned_cols=51 Identities=6% Similarity=-0.054 Sum_probs=39.7 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC Q ss_conf 3568999999999998899899999999998355899899999999999500 Q gi|254780421|r 159 PMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANII 210 (218) Q Consensus 159 ~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~ 210 (218) .++.......+.-+...||++.|+.....++.-. +-+-+|..++..|..|+ T Consensus 72 S~RL~~Al~r~~~~~daGDldgARq~m~dvLaVE-VVP~YR~~Ae~qL~~ld 122 (122) T pfam09543 72 SDRLMDALHRAYRLRDAGDLDGARQEMRDVLAVE-VVPHYREIAEEQLEDLD 122 (122) T ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHCCC T ss_conf 9999999999998615328688999999878876-41999999999876149 No 168 >PRK08455 fliL flagellar basal body-associated protein FliL; Reviewed Probab=58.60 E-value=6.1 Score=18.04 Aligned_cols=17 Identities=12% Similarity=0.116 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWF 39 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~ 39 (218) +||+++++++++++|+. T Consensus 21 iIii~vvvlll~~gG~~ 37 (180) T PRK08455 21 IIIIGVVVLLLLIGGVI 37 (180) T ss_pred EEEHHHHHHHHHHHHHH T ss_conf 73499999999999999 No 169 >TIGR03319 YmdA_YtgF conserved hypothetical protein YmdA/YtgF. Probab=58.25 E-value=12 Score=16.32 Aligned_cols=28 Identities=32% Similarity=0.341 Sum_probs=17.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKKKN 51 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~~~ 51 (218) |+++++.+++|++.||+++.....++.. T Consensus 1 Ii~~ii~lviG~~iGy~l~k~~~~~~i~ 28 (514) T TIGR03319 1 ILLALVALIVGLIIGYLLRKRIAEKKLG 28 (514) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9068999999999999999999999999 No 170 >PRK10559 p-hydroxybenzoic acid efflux subunit AaeA; Provisional Probab=57.27 E-value=12 Score=16.22 Aligned_cols=30 Identities=17% Similarity=0.145 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9997519999999999999999999999999 Q gi|254780421|r 17 GKCCYKWIAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 17 ~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) -++..+..|..+ ++++++++++++|.++.. T Consensus 5 ~rk~~r~~i~i~-~v~~~~~~~~~~w~~~~~ 34 (310) T PRK10559 5 TRKISRTAITVV-LVILAFIAIFNAWVYYTE 34 (310) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHCC T ss_conf 899999999999-999999999999986268 No 171 >pfam03672 UPF0154 Uncharacterized protein family (UPF0154). This family contains a set of short bacterial proteins of unknown function. Probab=56.69 E-value=12 Score=16.16 Aligned_cols=23 Identities=4% Similarity=0.324 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999 Q gi|254780421|r 27 LLILMILSLAIWFYLFDGSHEKK 49 (218) Q Consensus 27 ~~i~~i~~~~~~~~~~~~~q~~~ 49 (218) .++.+++|+++||++-..+-++. T Consensus 3 iil~ll~G~~~Gff~ark~~~k~ 25 (64) T pfam03672 3 IVLCLLAGFILGYFISRKIMKKQ 25 (64) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 28999998999999999999999 No 172 >pfam11297 DUF3098 Protein of unknown function (DUF3098). This bacterial family of proteins has no known function. Probab=55.86 E-value=11 Score=16.46 Aligned_cols=21 Identities=24% Similarity=0.351 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999751999999999999999 Q gi|254780421|r 17 GKCCYKWIAPLLILMILSLAI 37 (218) Q Consensus 17 ~~~~yk~ii~~~i~~i~~~~~ 37 (218) .|+||..+++++.++++|.+. T Consensus 4 ~k~NY~ll~iG~~vI~~Gfil 24 (64) T pfam11297 4 GKKNYILMAIGIAVIALGFIL 24 (64) T ss_pred CCHHHHHHHHHHHHHHHHHHH T ss_conf 616899999999999999994 No 173 >COG4785 NlpI Lipoprotein NlpI, contains TPR repeats [General function prediction only] Probab=54.24 E-value=13 Score=15.91 Aligned_cols=126 Identities=19% Similarity=0.215 Sum_probs=55.8 Q ss_pred HHHHHCCCHHHHHHHHHHHHHCCCC-CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHH--------- Q ss_conf 9984138766699999997312552-069999999999995110057789999998731268957899999--------- Q gi|254780421|r 61 LELFNSNKLDDARSSFEKILSQDNK-LYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAAT--------- 130 (218) Q Consensus 61 l~~~~~~~~~~A~~~l~~l~~~~~~-~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~--------- 130 (218) +-....+++|.|...|+.+.+-.|+ .|+ ++..+-.+.=.|.+.-|...+...-.+++.| ++|.+=. T Consensus 107 ~Yl~~a~~fdaa~eaFds~~ELDp~y~Ya---~lNRgi~~YY~gR~~LAq~d~~~fYQ~D~~D-PfR~LWLYl~E~k~dP 182 (297) T COG4785 107 IYLTQAGNFDAAYEAFDSVLELDPTYNYA---HLNRGIALYYGGRYKLAQDDLLAFYQDDPND-PFRSLWLYLNEQKLDP 182 (297) T ss_pred HHHHHCCCCHHHHHHHHHHHCCCCCCHHH---HHCCCEEEEECCCCHHHHHHHHHHHHCCCCC-HHHHHHHHHHHHCCCH T ss_conf 99986022057999866671047751077---7604402344475155599999998449898-0789999987610786 Q ss_pred --------------------HHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHH---HHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf --------------------99999630269989999997531146423568---9999999999988998999999999 Q gi|254780421|r 131 --------------------LQAASILVDTSSYEEISKILQKLSEPSNPMHQ---FANEILGISALKFGKVQKAKTIFEE 187 (218) Q Consensus 131 --------------------l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~---~a~ellg~~~~~~Gd~~~A~~~y~~ 187 (218) ..++..+++....+.+.+++.....++..+.- -....+|-.++..|+.++|...|.- T Consensus 183 ~~A~tnL~qR~~~~d~e~WG~~iV~~yLgkiS~e~l~~~~~a~a~~n~~~Ae~LTEtyFYL~K~~l~~G~~~~A~~LfKL 262 (297) T COG4785 183 KQAKTNLKQRAEKSDKEQWGWNIVEFYLGKISEETLMERLKADATDNTSLAEHLTETYFYLGKYYLSLGDLDEATALFKL 262 (297) T ss_pred HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH T ss_conf 89999999998705676551879999970403999999998631212899999999999999998613117889999999 Q ss_pred HHH Q ss_conf 983 Q gi|254780421|r 188 LAK 190 (218) Q Consensus 188 i~~ 190 (218) .+. T Consensus 263 aia 265 (297) T COG4785 263 AVA 265 (297) T ss_pred HHH T ss_conf 998 No 174 >TIGR00247 TIGR00247 conserved hypothetical protein TIGR00247; InterPro: IPR003770 This family contains several aminodeoxychorismate lyase proteins. Aminodeoxychorismate lyase is a pyridoxal 5'-phosphate-dependent enzyme that converts 4-aminodeoxychorismate to pyruvate and p-aminobenzoate, a precursor of folic acid in bacteria . . Probab=54.05 E-value=14 Score=15.89 Aligned_cols=29 Identities=31% Similarity=0.296 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 19999999999999999999999999999 Q gi|254780421|r 22 KWIAPLLILMILSLAIWFYLFDGSHEKKK 50 (218) Q Consensus 22 k~ii~~~i~~i~~~~~~~~~~~~~q~~~~ 50 (218) |-|+++.|+++++++|+...|.-..-... T Consensus 14 kri~i~~~~vll~~~gs~~~y~~~~L~~~ 42 (373) T TIGR00247 14 KRIIILLLLVLLLIIGSILLYKISLLEKS 42 (373) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 68999999999999988998877752001 No 175 >PRK10270 hypothetical protein; Provisional Probab=53.01 E-value=14 Score=15.79 Aligned_cols=19 Identities=26% Similarity=0.436 Sum_probs=7.6 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999 Q gi|254780421|r 28 LILMILSLAIWFYLFDGSH 46 (218) Q Consensus 28 ~i~~i~~~~~~~~~~~~~q 46 (218) ++++++++++|+.+|...+ T Consensus 10 ~l~~~l~i~~g~~~~~~~~ 28 (340) T PRK10270 10 LLLVVLGIAAGVGVWKVRH 28 (340) T ss_pred HHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999987410 No 176 >KOG4403 consensus Probab=51.83 E-value=15 Score=15.67 Aligned_cols=25 Identities=16% Similarity=0.361 Sum_probs=13.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 19999999999999999999999999 Q gi|254780421|r 22 KWIAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 22 k~ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) |-+|+++++++..+.+||. |...+. T Consensus 215 KD~iLv~lili~v~gcw~a-y~Qnk~ 239 (575) T KOG4403 215 KDFILVVLILIGVGGCWFA-YRQNKK 239 (575) T ss_pred HHHHHHHHHHHHHHHHHHH-HHHHHH T ss_conf 5689999999986055545-443167 No 177 >TIGR02976 phageshock_pspB phage shock protein B; InterPro: IPR009554 This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages . Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one .. Probab=51.67 E-value=15 Score=15.65 Aligned_cols=55 Identities=18% Similarity=0.378 Sum_probs=30.4 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH--CCCHHHHHHHHHHHHHCCCCC Q ss_conf 999999999999999-99999999999999999841--387666999999973125520 Q gi|254780421|r 31 MILSLAIWFYLFDGS-HEKKKNIVGENFAQALELFN--SNKLDDARSSFEKILSQDNKL 86 (218) Q Consensus 31 ~i~~~~~~~~~~~~~-q~~~~~~as~~y~~al~~~~--~~~~~~A~~~l~~l~~~~~~~ 86 (218) +|+.+..|. +..+. +.+....++..=.+.+.... .++.++-+..+|.++++.+++ T Consensus 14 ~~fv~~~wl-~lHy~~~~~~~~~l~~~d~~~L~~L~~~A~Rl~~Ri~tLE~iLd~e~P~ 71 (75) T TIGR02976 14 VIFVALLWL-LLHYRSKRKTAASLSTDDLQLLQELYAKADRLEERIDTLERILDAEHPN 71 (75) T ss_pred HHHHHHHHH-HHHHHCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCC T ss_conf 999999999-9985236640011157999999999999989999999999987204843 No 178 >PRK12785 fliL flagellar basal body-associated protein FliL; Reviewed Probab=51.16 E-value=4.7 Score=18.76 Aligned_cols=17 Identities=12% Similarity=0.141 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254780421|r 27 LLILMILSLAIWFYLFD 43 (218) Q Consensus 27 ~~i~~i~~~~~~~~~~~ 43 (218) ++++++++++++|+++. T Consensus 32 ~~vlll~gggg~~f~~~ 48 (167) T PRK12785 32 AAVLLLGGGGGGWFFFF 48 (167) T ss_pred HHHHHHHHHHHHEEEEE T ss_conf 99999973114014664 No 179 >KOG4814 consensus Probab=50.84 E-value=15 Score=15.57 Aligned_cols=99 Identities=12% Similarity=0.059 Sum_probs=65.5 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHH--HHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 99999999511005778999999873126895789999999--9996302699899999975311464235689999999 Q gi|254780421|r 92 NMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQ--AASILVDTSSYEEISKILQKLSEPSNPMHQFANEILG 169 (218) Q Consensus 92 ~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~--la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg 169 (218) ...-|+-+++..++..++.-|..-+..-+.|..-+..|++. ++..|+...+.|+|.+..+.--. ..|-.++-+...- T Consensus 357 LWn~A~~~F~~~~Y~~s~~~y~~Sl~~i~~D~~~~~FaK~qR~l~~CYL~L~QLD~A~E~~~EAE~-~d~~~~l~q~~~~ 435 (872) T KOG4814 357 LWNTAKKLFKMEKYVVSIRFYKLSLKDIISDNYSDRFAKIQRALQVCYLKLEQLDNAVEVYQEAEE-VDRQSPLCQLLML 435 (872) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-CCCCCHHHHHHHH T ss_conf 998888775377689999999998874475046578999999999998259988779999999986-0732388999999 Q ss_pred HHHHHCCCHHHHHHHHHHHHHC Q ss_conf 9999889989999999999835 Q gi|254780421|r 170 ISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 170 ~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) -+.+..|+..+|.....++.+. T Consensus 436 ~~~~~E~~Se~AL~~~~~~~s~ 457 (872) T KOG4814 436 QSFLAEDKSEEALTCLQKIKSS 457 (872) T ss_pred HHHHHHCCHHHHHHHHHHHHHH T ss_conf 9999832217789999998775 No 180 >pfam04415 DUF515 Protein of unknown function (DUF515). Family of hypothetical Archaeal proteins. Probab=50.64 E-value=15 Score=15.55 Aligned_cols=34 Identities=18% Similarity=0.205 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9975199999999999999999999999999999 Q gi|254780421|r 18 KCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKN 51 (218) Q Consensus 18 ~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~ 51 (218) ....+-+|+++|++++.++.-+-.|.+.+.+..+ T Consensus 27 err~riiIga~il~iIIl~~~~~~Y~~~~~~~~q 60 (416) T pfam04415 27 ERRKRLIIGALVLSVIILLLVFAIYYYIENKAYQ 60 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 3344100689999999999999999998727789 No 181 >PRK00106 hypothetical protein; Provisional Probab=49.15 E-value=16 Score=15.41 Aligned_cols=22 Identities=18% Similarity=0.247 Sum_probs=13.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDG 44 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~ 44 (218) ++|++++.+++|+++||..+.. T Consensus 3 ~ii~~i~~~liG~~iG~~~~~~ 24 (535) T PRK00106 3 NIILLVVSALIGLVLGYALISI 24 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 0499999999999989999999 No 182 >KOG3785 consensus Probab=48.47 E-value=17 Score=15.34 Aligned_cols=54 Identities=17% Similarity=0.202 Sum_probs=20.0 Q ss_pred HHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 99963026998999999753114642356899999999999889989999999999 Q gi|254780421|r 133 AASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEEL 188 (218) Q Consensus 133 la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i 188 (218) +++.++-.++|.-|-...-....++..|. -+-+.+.-.++.|.+=-|-+.|..+ T Consensus 434 LArCyi~nkkP~lAW~~~lk~~t~~e~fs--LLqlIAn~CYk~~eFyyaaKAFd~l 487 (557) T KOG3785 434 LARCYIRNKKPQLAWDMMLKTNTPSERFS--LLQLIANDCYKANEFYYAAKAFDEL 487 (557) T ss_pred HHHHHHHCCCCHHHHHHHHHCCCCHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99998716994077888873589266779--9999999999877889898765588 No 183 >KOG1585 consensus Probab=47.70 E-value=17 Score=15.27 Aligned_cols=152 Identities=11% Similarity=0.035 Sum_probs=76.5 Q ss_pred HHHHHHHHHHHHH-HCCCHHHHHHHHHHHH----HCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCC-HHH Q ss_conf 9999999999984-1387666999999973----125520699999999999951100577899999987312689-578 Q gi|254780421|r 52 IVGENFAQALELF-NSNKLDDARSSFEKIL----SQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAP-LAV 125 (218) Q Consensus 52 ~as~~y~~al~~~-~~~~~~~A~~~l~~l~----~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~-~~~ 125 (218) .++-.|.++..+. +.....++...+++-. +.|++--+ -..+..|.-..++-++++|+.+|+..++--..+ ... T Consensus 69 hAAKayEqaamLake~~klsEvvdl~eKAs~lY~E~GspdtA-AmaleKAak~lenv~Pd~AlqlYqralavve~~dr~~ 147 (308) T KOG1585 69 HAAKAYEQAAMLAKELSKLSEVVDLYEKASELYVECGSPDTA-AMALEKAAKALENVKPDDALQLYQRALAVVEEDDRDQ 147 (308) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH-HHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHCCCHHH T ss_conf 888889999999999887279999999999999970995469-9999999988623888899999999999985053487 Q ss_pred HHHHHHH-HHHHHHCCCCHHHHHHHHHHHCC-------CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH--CCCCC Q ss_conf 9999999-99963026998999999753114-------64235689999999999988998999999999983--55899 Q gi|254780421|r 126 RYAATLQ-AASILVDTSSYEEISKILQKLSE-------PSNPMHQFANEILGISALKFGKVQKAKTIFEELAK--DNNSP 195 (218) Q Consensus 126 ~~lA~l~-la~~~~~~~~~d~a~~~l~~~~~-------~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~--~~~~p 195 (218) ...-+++ .+++++-...++++...+..... -+++.+ +.-..-++++-..|+..|...|+.-.. +-..| T Consensus 148 ma~el~gk~sr~lVrl~kf~Eaa~a~lKe~~~~~~~~~y~~~~k--~~va~ilv~L~~~Dyv~aekc~r~~~qip~f~~s 225 (308) T KOG1585 148 MAFELYGKCSRVLVRLEKFTEAATAFLKEGVAADKCDAYNSQCK--AYVAAILVYLYAHDYVQAEKCYRDCSQIPAFLKS 225 (308) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHH--HHHHHHHHHHHHHHHHHHHHHHCCHHCCCCCCCH T ss_conf 88899987545733357756889999986359999861123899--9999999972088899999983523027664674 Q ss_pred HHHHHHHHHHHH Q ss_conf 899999999999 Q gi|254780421|r 196 FGISTRSQMILA 207 (218) Q Consensus 196 ~~~r~ra~~~l~ 207 (218) +..|. .+.+|. T Consensus 226 ed~r~-lenLL~ 236 (308) T KOG1585 226 EDSRS-LENLLT 236 (308) T ss_pred HHHHH-HHHHHH T ss_conf 77899-999998 No 184 >pfam02009 Rifin_STEVOR Rifin/stevor family. Several multicopy gene families have been described in Plasmodium falciparum, including the stevor family of subtelomeric open reading frames and the rif interspersed repetitive elements. Both families contain three predicted transmembrane segments. It has been proposed that stevor and rif are members of a larger superfamily that code for variant surface antigens. Probab=47.12 E-value=17 Score=15.21 Aligned_cols=37 Identities=8% Similarity=-0.256 Sum_probs=15.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHH Q ss_conf 4588999999997519999999999999999--999999 Q gi|254780421|r 8 NSYKGKCKMGKCCYKWIAPLLILMILSLAIW--FYLFDG 44 (218) Q Consensus 8 ~~~~~~~~~~~~~yk~ii~~~i~~i~~~~~~--~~~~~~ 44 (218) +|+-+||--+--.-++.+++.|..-+|++|| .+.|.. T Consensus 86 KSladKvEK~CL~Cg~~LGggVap~~Gl~ggiav~~wk~ 124 (289) T pfam02009 86 KSLAEKVEKECLRCGFGLGGGVAPKSGLYGAIAVNEWTK 124 (289) T ss_pred CCHHHHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHHH T ss_conf 569999987516562656775056246668713553589 No 185 >PRK10927 essential cell division protein; Provisional Probab=45.56 E-value=4.9 Score=18.63 Aligned_cols=28 Identities=4% Similarity=0.119 Sum_probs=18.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7519999999999999999999999999 Q gi|254780421|r 20 CYKWIAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 20 ~yk~ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) ..+-++.++++++|+++||+||..++.. T Consensus 32 ~s~~~~ala~avlv~FiGGLyFith~K~ 59 (319) T PRK10927 32 VSPAMVAIAAAVLVTFIGGLYFITHHKK 59 (319) T ss_pred CCHHHHHHHHHHHHHHHCCEEEEECCCC T ss_conf 6658999999999997500478870587 No 186 >PRK09379 membrane-bound transcriptional regulator LytR; Provisional Probab=45.45 E-value=18 Score=15.05 Aligned_cols=29 Identities=17% Similarity=0.078 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 997519999999999999999999999999 Q gi|254780421|r 18 KCCYKWIAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 18 ~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) |+..+|+++.++++++++ +++.++-+++. T Consensus 3 Kkil~~i~~v~~vlll~~-g~~~~~~y~~~ 31 (303) T PRK09379 3 KKILFWILGIIGVLIIGG-GIYAYNVYSSV 31 (303) T ss_pred CEEEHHHHHHHHHHHHHH-HHHHHHHHHHH T ss_conf 266036999999999999-99999999998 No 187 >TIGR00771 DcuC transporter, anaerobic C4-dicarboxylate uptake C (DcuC) family; InterPro: IPR004669 Escherichia coli contains four different secondary carriers (DcuA, DcuB, DcuC, and DctA) for C4-dicarboxylates , , , DcuA is used for aerobic growth on C4-dicarboxylates , , whereas the Dcu carriers (encoded by the dcuA, dcuB, and dcuC genes) are used under anaerobic conditions and form a distinct family of carriers , , , , , . Each of the Dcu carriers is able to catalyze the uptake, antiport, and possibly also efflux of C4-dicarboxylates. DcuB is the major C4-dicarboxylate carrier for fumarate respiration with high fumarate-succinate exchange activity. It is synthesized only in the absence of oxygen and nitrate and in the presence of C4-dicarboxylates , , , . DcuA is expressed constitutively in aerobic and anaerobic growth and can substitute for DcuB , . These proteins are members of the C4-dicarboxylate Uptake C (DcuC) family. DcuC has 12 GES predicted transmembrane regions, is induced only under anaerobic conditions, and is not repressed by glucose. DcuC may therefore function as a succinate efflux system during anaerobic glucose fermentation. However, when overexpressed, it can replace either DcuA or DcuB in catalyzing fumarate-succinate exchange and fumarate uptake , . DcuC shows the same transport modes as DcuA and DcuB (exchange, uptake, and presumably efflux of C4-dicarboxylates) . ; GO: 0015556 C4-dicarboxylate transmembrane transporter activity, 0015740 C4-dicarboxylate transport, 0016021 integral to membrane. Probab=45.33 E-value=19 Score=15.04 Aligned_cols=46 Identities=17% Similarity=0.353 Sum_probs=34.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999997519999999999999999999999999999999999999 Q gi|254780421|r 14 CKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQ 59 (218) Q Consensus 14 ~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~ 59 (218) ++...=.||+.+.+.+..|++.+...+||++..++|....+++-.+ T Consensus 133 ms~a~y~F~~~lp~ai~aii~~~iahfF~qr~~dkk~~~i~~e~~~ 178 (390) T TIGR00771 133 MSLADYAFKYTLPVAIAAIILVAIAHFFKQRALDKKEAYIAHEKEE 178 (390) T ss_pred CHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 1010124545537899999999999999998765788764565677 No 188 >pfam06667 PspB Phage shock protein B. This family consists of several bacterial phage shock protein B (PspB) sequences. The phage shock protein (psp) operon is induced in response to heat, ethanol, osmotic shock and infection by filamentous bacteriophages. Expression of the operon requires the alternative sigma factor sigma54 and the transcriptional activator PspF. In addition, PspA plays a negative regulatory role, and the integral-membrane proteins PspB and PspC play a positive one. Probab=45.05 E-value=19 Score=15.02 Aligned_cols=21 Identities=24% Similarity=0.447 Sum_probs=14.4 Q ss_pred CCCHHHHHHHHHHHHHCCCCC Q ss_conf 387666999999973125520 Q gi|254780421|r 66 SNKLDDARSSFEKILSQDNKL 86 (218) Q Consensus 66 ~~~~~~A~~~l~~l~~~~~~~ 86 (218) .+..++.+..++.+++..++. T Consensus 51 A~rm~~RI~tLE~ILDae~P~ 71 (75) T pfam06667 51 AEKLQERIQTLERILDAESPN 71 (75) T ss_pred HHHHHHHHHHHHHHHCCCCCC T ss_conf 999999999999997366989 No 189 >COG3107 LppC Putative lipoprotein [General function prediction only] Probab=44.44 E-value=19 Score=14.96 Aligned_cols=91 Identities=18% Similarity=0.085 Sum_probs=64.3 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCH--HHHHHHHHHH Q ss_conf 99999999511005778999999873126895789999999999630269989999997531146423--5689999999 Q gi|254780421|r 92 NMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNP--MHQFANEILG 169 (218) Q Consensus 92 ~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~--~~~~a~ellg 169 (218) .+..+++++++|+...|..++..+-. ...+.-+.-..+..+.+.+.+.++..|...|..+..-..+ -+...+..++ T Consensus 66 ~llAa~al~~e~k~~qA~~Ll~ql~~--~Ltd~Q~~~~~LL~ael~la~~q~~~Al~~L~~~~~~~ls~~Qq~Ry~q~~a 143 (604) T COG3107 66 LLLAARALVEEGKTAQAQALLNQLPQ--ELTDAQRAEKSLLAAELALAQKQPAAALQQLAKLLPADLSQNQQARYYQARA 143 (604) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHCCC--CCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHCCHHHCCHHHHHHHHHHHH T ss_conf 99999999974782889999986562--3798999999999999998422729999998520321167788999999999 Q ss_pred HHHHHCCCHHHHHHH Q ss_conf 999988998999999 Q gi|254780421|r 170 ISALKFGKVQKAKTI 184 (218) Q Consensus 170 ~~~~~~Gd~~~A~~~ 184 (218) ++....|+.-.|... T Consensus 144 ~a~ea~~~~~~a~ra 158 (604) T COG3107 144 DALEARGDSIDAARA 158 (604) T ss_pred HHHHCCCCHHHHHHH T ss_conf 998613215899999 No 190 >PRK07193 fliF flagellar MS-ring protein; Reviewed Probab=44.37 E-value=19 Score=14.95 Aligned_cols=20 Identities=5% Similarity=0.112 Sum_probs=15.4 Q ss_pred HHHHHHHHHHHHHHCCCHHH Q ss_conf 89999999999950034032 Q gi|254780421|r 196 FGISTRSQMILANIIASDQR 215 (218) Q Consensus 196 ~~~r~ra~~~l~~l~~~~~~ 215 (218) ..+++|++.+|.-+-|.+.+ T Consensus 250 ~~l~~~i~~~L~~i~G~~n~ 269 (550) T PRK07193 250 QSTIKNAASMLTPVLGAGNF 269 (550) T ss_pred HHHHHHHHHHHHHHCCCCCE T ss_conf 99999999998853489846 No 191 >pfam06786 UPF0253 Uncharacterized protein family (UPF0253). Probab=44.33 E-value=19 Score=14.95 Aligned_cols=30 Identities=33% Similarity=0.407 Sum_probs=13.9 Q ss_pred HHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 899999987312689578999999999963 Q gi|254780421|r 108 AAEIFLKVANDDLAPLAVRYAATLQAASIL 137 (218) Q Consensus 108 A~~~l~~i~~~~~~~~~~~~lA~l~la~~~ 137 (218) ++..|+.+..+.+.|...|.-|-+-.+.++ T Consensus 30 alk~Ln~iAad~~Lp~~vRe~AAfAAANLl 59 (66) T pfam06786 30 AIKALNDIAADEALPEDVREKAAYAAANLL 59 (66) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHH T ss_conf 999999886364478889999999998876 No 192 >KOG2796 consensus Probab=44.01 E-value=19 Score=14.92 Aligned_cols=85 Identities=14% Similarity=0.116 Sum_probs=34.8 Q ss_pred HCCCHHHHHHHHHHHHHCC----CCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCC Q ss_conf 1387666999999973125----520699999999999951100577899999987312689578999999999963026 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQD----NKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDT 140 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~----~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~ 140 (218) +-|+...+...|+.+.++. +..-..+.....|-++..++++..|...+..++..++.++. |.-..|.+++-. T Consensus 224 Q~GD~k~a~~yf~~vek~~~kL~~~q~~~~V~~n~a~i~lg~nn~a~a~r~~~~i~~~D~~~~~----a~NnKALcllYl 299 (366) T KOG2796 224 QIGDIKTAEKYFQDVEKVTQKLDGLQGKIMVLMNSAFLHLGQNNFAEAHRFFTEILRMDPRNAV----ANNNKALCLLYL 299 (366) T ss_pred HCCCHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHEEECCCCHHHHHHHHHHCCCCCCCCHH----HHCHHHHHHHHH T ss_conf 6043888998999999876143234046788763442022133359999988504224898702----205199999998 Q ss_pred CCHHHHHHHHHHH Q ss_conf 9989999997531 Q gi|254780421|r 141 SSYEEISKILQKL 153 (218) Q Consensus 141 ~~~d~a~~~l~~~ 153 (218) ++.-+|++.+..+ T Consensus 300 g~l~DAiK~~e~~ 312 (366) T KOG2796 300 GKLKDALKQLEAM 312 (366) T ss_pred HHHHHHHHHHHHH T ss_conf 7788999999987 No 193 >PRK03427 cell division protein ZipA; Provisional Probab=43.49 E-value=9.7 Score=16.79 Aligned_cols=24 Identities=21% Similarity=0.176 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKK 49 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~ 49 (218) ||+|+|+|+.+++-| +|..++++. T Consensus 10 IIvGaIAIiALL~HG--lWtsRKErs 33 (331) T PRK03427 10 IIVGAIAIIALLVHG--FWTSRKERS 33 (331) T ss_pred HHHHHHHHHHHHHHH--CEECCCCCC T ss_conf 999999999999852--022345544 No 194 >PRK01741 cell division protein ZipA; Provisional Probab=43.45 E-value=10 Score=16.71 Aligned_cols=23 Identities=22% Similarity=0.251 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEK 48 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~ 48 (218) ||+|+|+++.+++-| +|.+++++ T Consensus 8 IIvGaIAI~ALL~HG--LWTsRKEr 30 (342) T PRK01741 8 IILGILALVALVAHG--LWSNRREK 30 (342) T ss_pred HHHHHHHHHHHHHHC--CCCCCCCC T ss_conf 999999999999851--21134445 No 195 >COG4941 Predicted RNA polymerase sigma factor containing a TPR repeat domain [Transcription] Probab=43.04 E-value=20 Score=14.82 Aligned_cols=135 Identities=10% Similarity=0.088 Sum_probs=79.6 Q ss_pred HHHHHHHHHHHHCC-CCCHHHHHHHHHHHH---HHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHH Q ss_conf 66999999973125-520699999999999---95110057789999998731268957899999999996302699899 Q gi|254780421|r 70 DDARSSFEKILSQD-NKLYNPLSNMYIASI---LVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEE 145 (218) Q Consensus 70 ~~A~~~l~~l~~~~-~~~ya~lA~l~lA~~---~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~ 145 (218) ++....+++-..+. ++.|...|.+.-... ..+.-|+..-..+|+.+..-.++| + ..+.-+...-....++. T Consensus 273 ~eg~all~rA~~~~~pGPYqlqAAIaa~HA~a~~aedtDW~~I~aLYdaL~~~apSP-v----V~LNRAVAla~~~Gp~a 347 (415) T COG4941 273 DEGLALLDRALASRRPGPYQLQAAIAALHARARRAEDTDWPAIDALYDALEQAAPSP-V----VTLNRAVALAMREGPAA 347 (415) T ss_pred HHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCCCCCHHHHHHHHHHHHHHCCCC-E----EEEHHHHHHHHHHHHHH T ss_conf 878999999997489981899999999987533357998599999999999848998-5----75138899997655786 Q ss_pred HHHHHHHHCCCCC-HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC Q ss_conf 9999753114642-3568999999999998899899999999998355899899999999999500 Q gi|254780421|r 146 ISKILQKLSEPSN-PMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANII 210 (218) Q Consensus 146 a~~~l~~~~~~~~-~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~ 210 (218) .+..+..+..... .-...++-.+|+...+.|...+|++.|+..+.-.. ....|..+.+.++-+. T Consensus 348 gLa~ve~L~~~~~L~gy~~~h~~RadlL~rLgr~~eAr~aydrAi~La~-~~aer~~l~~r~~~l~ 412 (415) T COG4941 348 GLAMVEALLARPRLDGYHLYHAARADLLARLGRVEEARAAYDRAIALAR-NAAERAFLRQRLDRLA 412 (415) T ss_pred HHHHHHHHHCCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHCC-CHHHHHHHHHHHHHHH T ss_conf 8999987512435543422087778999981870778999999998668-8699999999998763 No 196 >COG4976 Predicted methyltransferase (contains TPR repeat) [General function prediction only] Probab=42.85 E-value=20 Score=14.81 Aligned_cols=56 Identities=14% Similarity=0.119 Sum_probs=26.3 Q ss_pred HHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCC Q ss_conf 4138766699999997312552069999999999995110057789999998731268 Q gi|254780421|r 64 FNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLA 121 (218) Q Consensus 64 ~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~ 121 (218) .+.++.+.+...+.+...--+.-. ..-|.++...-..|+.+.|...|.++++-.+. T Consensus 6 ~~~~D~~aaaely~qal~lap~w~--~gwfR~g~~~ekag~~daAa~a~~~~L~ldp~ 61 (287) T COG4976 6 AESGDAEAAAELYNQALELAPEWA--AGWFRLGEYTEKAGEFDAAAAAYEEVLELDPE 61 (287) T ss_pred CCCCCHHHHHHHHHHHHHCCCHHH--HHHHHCCHHHHHCCCHHHHHHHHHHHHCCCCC T ss_conf 036784899999987750470444--34554023666304189999999988707962 No 197 >PRK04964 hypothetical protein; Provisional Probab=42.29 E-value=21 Score=14.75 Aligned_cols=33 Identities=24% Similarity=0.436 Sum_probs=15.2 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCCH Q ss_conf 999999998355899899999999999500340 Q gi|254780421|r 181 AKTIFEELAKDNNSPFGISTRSQMILANIIASD 213 (218) Q Consensus 181 A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~~~ 213 (218) +.+.++.+..+...|..+|+++--.-++|.-+| T Consensus 30 ~~k~Ln~iAad~~Lp~~vRe~AAyAAANLliSD 62 (66) T PRK04964 30 VLKVLNEVAADDALPESVREKAAYAAANLLVSD 62 (66) T ss_pred HHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHH T ss_conf 999999885363257889999999998756543 No 198 >pfam10516 SHNi-TPR SHNi-TPR. SHNi-TPR family members contain a reiterated sequence motif that is an interrupted form of TPR repeat. Probab=41.86 E-value=21 Score=14.71 Aligned_cols=26 Identities=15% Similarity=0.182 Sum_probs=20.8 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99999999988998999999999983 Q gi|254780421|r 165 NEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 165 ~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) ..++|.+-+..+++++|.+.|++.++ T Consensus 4 ~~~LgEislE~e~F~qA~~D~~~~L~ 29 (38) T pfam10516 4 YDLLGEISLENENFPQAVEDLRKALE 29 (38) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHH T ss_conf 88777865401560778999999999 No 199 >TIGR03362 VI_chp_7 type VI secretion-associated protein, VC_A0119 family. This protein family is one of two related families in type VI secretion systems that contain an ImpA-related N-terminal domain (pfam06812). Probab=41.61 E-value=21 Score=14.69 Aligned_cols=62 Identities=18% Similarity=0.198 Sum_probs=47.7 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHH-CCCCCH-HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 999999630269989999997531-146423-56899999999999889989999999999835 Q gi|254780421|r 130 TLQAASILVDTSSYEEISKILQKL-SEPSNP-MHQFANEILGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 130 ~l~la~~~~~~~~~d~a~~~l~~~-~~~~~~-~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) ....+.-.++.+..+.|+..|+.- ....++ -+.+.+.+++.++...|..+-|...|+.+... T Consensus 216 ~~~eA~~l~~~~gl~~Al~~L~~~l~~~~s~R~rf~~rL~~ArL~~~~g~~~lA~~~l~~L~~~ 279 (301) T TIGR03362 216 LREEARALAAEGGLEAALQRLQQRLAQAREPRERFHWRLLLARLLEQAGKAELAQQLYAALDQQ 279 (301) T ss_pred HHHHHHHHHHCCCHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHH T ss_conf 9999999997479999999999753248997999999999999999869879999999999999 No 200 >pfam03748 FliL Flagellar basal body-associated protein FliL. This FliL protein controls the rotational direction of the flagella during chemotaxis. FliL is a cytoplasmic membrane protein associated with the basal body. Probab=41.53 E-value=21 Score=14.68 Aligned_cols=22 Identities=18% Similarity=0.122 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDG 44 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~ 44 (218) .++++++++++++++++++... T Consensus 6 ~~i~~~~l~~~~~~~~~~~~~~ 27 (145) T pfam03748 6 LLILGLLLLAAGGGGGFYFLLK 27 (145) T ss_pred HHHHHHHHHHHHHHHHHHHHCC T ss_conf 5899999999999999988503 No 201 >KOG1550 consensus Probab=41.44 E-value=21 Score=14.67 Aligned_cols=80 Identities=13% Similarity=0.091 Sum_probs=31.7 Q ss_pred HHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCC---CHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHC----C Q ss_conf 05778999999873126895789999999999630269---9899999975311464235689999999999988----9 Q gi|254780421|r 104 DTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTS---SYEEISKILQKLSEPSNPMHQFANEILGISALKF----G 176 (218) Q Consensus 104 ~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~---~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~----G 176 (218) |...|...|.........+ +.++++.++.... +...|...+..-...+. ..+..-++..|..- - T Consensus 308 d~~~A~~~~~~aA~~g~~~------a~~~lg~~~~~g~~~~d~~~A~~yy~~Aa~~G~---~~A~~~la~~y~~G~gv~r 378 (552) T KOG1550 308 DYEKALKLYTKAAELGNPD------AQYLLGVLYETGTKERDYRRAFEYYSLAAKAGH---ILAIYRLALCYELGLGVER 378 (552) T ss_pred CHHHHHHHHHHHHHCCCHH------HHHHHHHHHHCCCCCCCHHHHHHHHHHHHHCCC---HHHHHHHHHHHHHCCCCCC T ss_conf 4999999999998559904------899988987627632138899999999997599---8999999999974778688 Q ss_pred CHHHHHHHHHHHHHCC Q ss_conf 9899999999998355 Q gi|254780421|r 177 KVQKAKTIFEELAKDN 192 (218) Q Consensus 177 d~~~A~~~y~~i~~~~ 192 (218) +...|...|.+++... T Consensus 379 ~~~~A~~~~k~aA~~g 394 (552) T KOG1550 379 NLELAFAYYKKAAEKG 394 (552) T ss_pred CHHHHHHHHHHHHHCC T ss_conf 7789999999999827 No 202 >TIGR01495 ETRAMP early transcribed membrane protein (ETRAMP) family; InterPro: IPR006389 These sequences represent a family of proteins from the malaria parasite Plasmodium falciparum, several of which have been shown to be expressed specifically in the ring stage as well as the rodent parasite Plasmodium yoelii . A homolog from Plasmodium chabaudi was localized to the parasitophorous vacuole membrane . Members have an initial hydrophobic, Phe/Tyr-rich stretch long enough to span the membrane, a highly charged region rich in Lys, a second putative transmembrane region, and a second highly charged, low complexity sequence region. Some members have up to 100 residues of additional C-terminal sequence. These genes have been shown to be found in the sub-telomeric regions of both Plasmodium falciparum and P. yoelii chromosomes.. Probab=40.93 E-value=21 Score=14.68 Aligned_cols=25 Identities=4% Similarity=0.042 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEK 48 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~ 48 (218) -+++++.+.+|.+.|+.+|.+...+ T Consensus 70 ~~as~lAlL~g~~~GlG~y~~~K~~ 94 (99) T TIGR01495 70 SIASGLALLVGAGVGLGYYYKKKNK 94 (99) T ss_pred HHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 9999999999999999998752366 No 203 >pfam10037 MRP-S27 Mitochondrial 28S ribosomal protein S27. Members of this family of small ribosomal proteins possess one of three conserved blocks of sequence found in proteins that stimulate the dissociation of guanine nucleotides from G-proteins, leaving open the possibility that MRP-S27 might be a functional partner of GTP-binding ribosomal proteins. Probab=40.12 E-value=22 Score=14.54 Aligned_cols=72 Identities=11% Similarity=0.103 Sum_probs=39.1 Q ss_pred HHHHCCCHHHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHHH Q ss_conf 9841387666999999973125--520699999999999951100577899999987312689-578999999999963 Q gi|254780421|r 62 ELFNSNKLDDARSSFEKILSQD--NKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAP-LAVRYAATLQAASIL 137 (218) Q Consensus 62 ~~~~~~~~~~A~~~l~~l~~~~--~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~-~~~~~lA~l~la~~~ 137 (218) ...+.+..++++..+..-+..| ++.|. .-.+-..++.+|++..|...--.+......+ +..+.++.+. .+-+ T Consensus 110 q~L~~~~~d~lL~~L~nr~~YGiF~D~ft---~nlLmD~fl~~~~~~~Aa~Va~~lmlQE~~~~~~~~~L~ly~-~~~y 184 (417) T pfam10037 110 QCLKYQAPDKALYTLRNRIQYGIFPDDFT---ANLLLDSFLKKGNFKSAASVVTELMLQESFDNESTRILSLYV-LYSY 184 (417) T ss_pred HHHHCCCHHHHHHHHHCCCCCCCCCCHHH---HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCHHHHHHHHHH-HHHH T ss_conf 99946888889999857120450554358---999999999657778899999999999870678899999999-9999 No 204 >PRK10506 hypothetical protein; Provisional Probab=40.05 E-value=22 Score=14.54 Aligned_cols=39 Identities=10% Similarity=0.053 Sum_probs=18.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALE 62 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~ 62 (218) +|..+|+.|+..++...|.++.+..+...++......+. T Consensus 13 lvvi~ii~il~~~a~p~~~~~~q~~~L~~~a~~l~~fL~ 51 (155) T PRK10506 13 LVAMTLVVILSAWGLYGWQYWQQQQRLWQTAQQLRDFLL 51 (155) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999998877779999999999999999999999 No 205 >pfam07006 DUF1310 Protein of unknown function (DUF1310). This family consists of several hypothetical proteins of around 125 residues in length. Members of this family seem to be specific to Listeria and Streptococcus species. The function of this family is unknown. Probab=39.91 E-value=4.8 Score=18.67 Aligned_cols=13 Identities=38% Similarity=0.585 Sum_probs=7.0 Q ss_pred HHHHHHHHHHHHC Q ss_conf 6699999997312 Q gi|254780421|r 70 DDARSSFEKILSQ 82 (218) Q Consensus 70 ~~A~~~l~~l~~~ 82 (218) ++|...+++-+.. T Consensus 39 ~Eak~~~E~~Lk~ 51 (122) T pfam07006 39 EEAKKVFEKGLKA 51 (122) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999999985 No 206 >pfam06097 DUF945 Bacterial protein of unknown function (DUF945). This family consists of several hypothetical bacterial proteins of unknown function. Probab=39.70 E-value=23 Score=14.50 Aligned_cols=26 Identities=12% Similarity=0.045 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKK 49 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~ 49 (218) +.+++|++++++++|+.||...+..+ T Consensus 5 va~gviv~l~a~~~g~~wytG~~~E~ 30 (460) T pfam06097 5 VAIGVIVALGAVWTGGPWYTGKQAEA 30 (460) T ss_pred EEEHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 62098999999985346576789999 No 207 >COG3105 Uncharacterized protein conserved in bacteria [Function unknown] Probab=39.56 E-value=23 Score=14.49 Aligned_cols=22 Identities=18% Similarity=0.116 Sum_probs=10.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5199999999999999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAIWFYLF 42 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~~~~~~ 42 (218) |-+.++++||+|+++...-.+. T Consensus 8 W~~a~igLvvGi~IG~li~Rlt 29 (138) T COG3105 8 WEYALIGLVVGIIIGALIARLT 29 (138) T ss_pred HHHHHHHHHHHHHHHHHHHHHC T ss_conf 9999999999999999999982 No 208 >COG5294 Uncharacterized protein conserved in bacteria [Function unknown] Probab=39.23 E-value=12 Score=16.17 Aligned_cols=24 Identities=17% Similarity=0.200 Sum_probs=13.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 199999999999999999999999 Q gi|254780421|r 22 KWIAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 22 k~ii~~~i~~i~~~~~~~~~~~~~ 45 (218) |.|+++++.+++.++++.++|..- T Consensus 2 Kkil~~ilall~~ii~a~~~~~~~ 25 (113) T COG5294 2 KKILIGILALLLIIIGALFIFYNI 25 (113) T ss_pred CCHHHHHHHHHHHHHHHHEEEEEC T ss_conf 043999999999998662189821 No 209 >pfam04375 HemX HemX. This family consists of several bacterial HemX proteins. The hemX gene is not essential for haem synthesis in B. subtilis. HemX is a polytopic membrane protein which by an unknown mechanism down-regulates the level of HemA. Probab=39.16 E-value=23 Score=14.45 Aligned_cols=10 Identities=20% Similarity=0.405 Sum_probs=4.0 Q ss_pred CHHHHHHHHH Q ss_conf 0057789999 Q gi|254780421|r 103 GDTKNAAEIF 112 (218) Q Consensus 103 g~~~~A~~~l 112 (218) +|...|...+ T Consensus 147 ~Dv~~A~~lL 156 (372) T pfam04375 147 QDVSTATALL 156 (372) T ss_pred CCHHHHHHHH T ss_conf 8989999999 No 210 >COG5547 Small integral membrane protein [Function unknown] Probab=39.09 E-value=23 Score=14.44 Aligned_cols=19 Identities=11% Similarity=-0.179 Sum_probs=12.8 Q ss_pred HHHHHHHHHHHHHHHHHHH Q ss_conf 9997519999999999999 Q gi|254780421|r 17 GKCCYKWIAPLLILMILSL 35 (218) Q Consensus 17 ~~~~yk~ii~~~i~~i~~~ 35 (218) |.+-|||-|++.+++++.. T Consensus 3 flk~fkypIIgglvglliA 21 (62) T COG5547 3 FLKKFKYPIIGGLVGLLIA 21 (62) T ss_pred HHHHHCCCHHHHHHHHHHH T ss_conf 8887163359999999999 No 211 >PRK00523 hypothetical protein; Provisional Probab=38.98 E-value=23 Score=14.43 Aligned_cols=23 Identities=13% Similarity=0.222 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999 Q gi|254780421|r 29 ILMILSLAIWFYLFDGSHEKKKN 51 (218) Q Consensus 29 i~~i~~~~~~~~~~~~~q~~~~~ 51 (218) +.+++|+++||++-..+-+++.. T Consensus 13 l~li~G~~~Gff~ark~~~k~lk 35 (73) T PRK00523 13 LSLIVGGIIGYFVSKKYFKKQLR 35 (73) T ss_pred HHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999999999999886 No 212 >pfam07721 TPR_4 Tetratricopeptide repeat. This Pfam entry includes tetratricopeptide-like repeats not detected by the pfam00515, pfam07719 and pfam07720 models. Probab=38.55 E-value=24 Score=14.39 Aligned_cols=23 Identities=26% Similarity=0.414 Sum_probs=14.0 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 99999999998899899999999 Q gi|254780421|r 164 ANEILGISALKFGKVQKAKTIFE 186 (218) Q Consensus 164 a~ellg~~~~~~Gd~~~A~~~y~ 186 (218) +...+|..+...||.++|+..++ T Consensus 3 ~~~~la~~~~~~G~~~~A~~~l~ 25 (26) T pfam07721 3 ALLALARALLALGDLDEARALLE 25 (26) T ss_pred HHHHHHHHHHHHCCHHHHHHHHC T ss_conf 69999999998067788998861 No 213 >pfam10938 YfdX YfdX protein. YfdX is a protein found in Proteobacteria of unknown function. The protein coding for this gene is regulated by EvgA in E. coli. Probab=38.52 E-value=24 Score=14.39 Aligned_cols=62 Identities=21% Similarity=0.190 Sum_probs=37.9 Q ss_pred HHHHHHHHHHCCCCHHHHHHHHHHHCCCCC------HHH-HHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 999999963026998999999753114642------356-89999999999988998999999999983 Q gi|254780421|r 129 ATLQAASILVDTSSYEEISKILQKLSEPSN------PMH-QFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 129 A~l~la~~~~~~~~~d~a~~~l~~~~~~~~------~~~-~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) +.+..+.-.+..|+...+...|.....+-. |.. .-..--.+..++..|++.+|...+..+.+ T Consensus 77 ~a~~~A~~~l~~Gd~~~A~~~L~~a~~ev~~~~~~lPL~~~~~av~~A~~lL~~gk~~eA~~aL~~A~~ 145 (155) T pfam10938 77 AAIKKANELLKKGEKQAARETLKLAGSEVVITVALLPLAQTPAAVKQAASLLDEGKYYEAKAALKEALD 145 (155) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHCCCCEEEEEEECCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHC T ss_conf 999999999977898999999986366405777736715209999999999977997899999999861 No 214 >PRK12757 cell division protein FtsN; Provisional Probab=38.02 E-value=7.3 Score=17.54 Aligned_cols=22 Identities=9% Similarity=0.274 Sum_probs=13.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254780421|r 25 APLLILMILSLAIWFYLFDGSH 46 (218) Q Consensus 25 i~~~i~~i~~~~~~~~~~~~~q 46 (218) +..+++++++++||.||..++. T Consensus 2 lalA~avvv~F~gGLyFit~~k 23 (249) T PRK12757 2 VALAVALLVVFVGGLYFITHNK 23 (249) T ss_pred CHHHHHHHHHHHHHHEEEECCC T ss_conf 1028999999876441455678 No 215 >PRK10941 putative transcriptional regulator; Provisional Probab=37.67 E-value=24 Score=14.31 Aligned_cols=75 Identities=13% Similarity=0.104 Sum_probs=51.8 Q ss_pred HHHHHHCCCCHHHHHHHHHHHCC--CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHC Q ss_conf 99963026998999999753114--6423568999999999998899899999999998355899899999999999500 Q gi|254780421|r 133 AASILVDTSSYEEISKILQKLSE--PSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANII 210 (218) Q Consensus 133 la~~~~~~~~~d~a~~~l~~~~~--~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~ 210 (218) +-..++.+.+++.|+...+.+.. |++|+ -.--+|.+|.+.|....|...++.-++..+.. +....+++.+..|. T Consensus 187 LK~~~~~e~~~~~AL~~~e~lL~l~Pddp~---EiRDRGli~~qL~c~~~Al~DLe~fle~~P~d-p~ae~ik~qi~~l~ 262 (269) T PRK10941 187 LKAALMEEKQMELALRASEALLQFNPEDPY---EIRDRGLIYAQLDCEHVALSDLSYFVEQCPED-PISEMIRAQIHSIE 262 (269) T ss_pred HHHHHHHHCCHHHHHHHHHHHHHHCCCCHH---HHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCC-HHHHHHHHHHHHHH T ss_conf 999998511499999999999862969979---99999999998499489999999999978597-28999999999986 Q ss_pred C Q ss_conf 3 Q gi|254780421|r 211 A 211 (218) Q Consensus 211 ~ 211 (218) . T Consensus 263 ~ 263 (269) T PRK10941 263 H 263 (269) T ss_pred H T ss_conf 0 No 216 >COG3115 ZipA Cell division protein [Cell division and chromosome partitioning] Probab=36.88 E-value=25 Score=14.23 Aligned_cols=22 Identities=27% Similarity=0.179 Sum_probs=10.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) ||+|+|++|.+++-| +|.++.+ T Consensus 9 IIvG~IAIiaLLvhG--lWtsRkE 30 (324) T COG3115 9 IIVGAIAIIALLVHG--LWTSRKE 30 (324) T ss_pred HHHHHHHHHHHHHHH--HHHCCHH T ss_conf 999999999999863--3311011 No 217 >PRK05886 yajC preprotein translocase subunit YajC; Validated Probab=36.88 E-value=25 Score=14.23 Aligned_cols=18 Identities=11% Similarity=0.224 Sum_probs=6.8 Q ss_pred HHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999 Q gi|254780421|r 42 FDGSHEKKKNIVGENFAQ 59 (218) Q Consensus 42 ~~~~q~~~~~~as~~y~~ 59 (218) +-.+.+++..+.-..+.+ T Consensus 20 ~~iRpQkKrqk~~~~m~~ 37 (108) T PRK05886 20 FASRRQRKAMQATIDLHE 37 (108) T ss_pred HHCCHHHHHHHHHHHHHH T ss_conf 931689999999999998 No 218 >TIGR01477 RIFIN variant surface antigen, rifin family; InterPro: IPR006373 Malaria is still a major cause of mortality in many areas of the world. Plasmodium falciparum causes the most severe human form of the disease and is responsible for most fatalities. Severe cases of malaria can occur when the parasite invades and then proliferates within red blood cell erythrocytes. The parasite produces many variant antigenic proteins, encoded by multigene families, which are present on the surface of the infected erythrocyte and play important roles in virulence. A crucial survival mechanism for the malaria parasite is its ability to evade the immune response by switching these variant surface antigens. The high virulence of P. falciparum relative to other malarial parasites is in large part due to the fact that in this organism many of these surface antigens mediate the binding of infected erythrocytes to the vascular endothelium (cytoadherence) and non-infected erythrocytes (rosetting). This can lead to the accumulation of infected cells in the vasculature of a variety of organs, blocking the blood flow and reducing the oxygen supply. Clinical symptoms of severe infection can include fever, progressive anaemia, multi-organ dysfunction and coma. For more information see . This entry represents the rifin family of variant surface antigens, encoded by the rif (short for "repetetive interspersed family") genes. This is the largest variant surface antigen family in P. falciparum, with roughly 150 genes present in the genome of this parasite . The function of these proteins is not yet known, but rifins are strongly immunogenic and high levels of anti-rifin antibodies are correlated with rapid parasite clearance and asymptomatic infections . . Probab=36.86 E-value=25 Score=14.23 Aligned_cols=42 Identities=17% Similarity=0.082 Sum_probs=17.3 Q ss_pred CHHHHHHHHHHHHH--CCCCCHHHHHHHHHHHHHHHCCHHHHHH Q ss_conf 76669999999731--2552069999999999995110057789 Q gi|254780421|r 68 KLDDARSSFEKILS--QDNKLYNPLSNMYIASILVAKGDTKNAA 109 (218) Q Consensus 68 ~~~~A~~~l~~l~~--~~~~~ya~lA~l~lA~~~~~~g~~~~A~ 109 (218) -.+.++..+.++.. -+.=+...+..+.-+..|.+.-.+-+++ T Consensus 214 G~~~vI~~l~~~~~~~~~~l~~~~l~~i~~~~nY~~~~~l~~~i 257 (395) T TIGR01477 214 GINVVIEGLKKILGKTLSTLTGKKLKQIINASNYTNPSLLSDAI 257 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH T ss_conf 99999999999987888763246787404765654167899999 No 219 >PRK01294 lipase chaperone; Provisional Probab=36.66 E-value=25 Score=14.21 Aligned_cols=17 Identities=12% Similarity=0.167 Sum_probs=6.7 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254780421|r 29 ILMILSLAIWFYLFDGS 45 (218) Q Consensus 29 i~~i~~~~~~~~~~~~~ 45 (218) +++++++.++|+||... T Consensus 12 ~~~~~~~~~~~~~~~~~ 28 (339) T PRK01294 12 IIGAAAVLAVWYWLGLP 28 (339) T ss_pred HHHHHHHHHHHHHHCCC T ss_conf 99999999999980656 No 220 >pfam09670 Cas_Cas02710 CRISPR-associated protein (Cas_Cas02710). Members of this family are found, exclusively in the vicinity of CRISPR repeats and other CRISPR-associated (cas) genes, in Methanothermobacter thermautotrophicus (Methanobacterium thermoformicicum), Thermus thermophilus (Deinococcus-Thermus), Chloroflexus aurantiacus (Chloroflexi), and Thermomicrobium roseum (Thermomicrobia). Probab=35.78 E-value=26 Score=14.12 Aligned_cols=63 Identities=24% Similarity=0.216 Sum_probs=50.3 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHC--CCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCC Q ss_conf 9999984138766699999997312--55206999999999999511005778999999873126 Q gi|254780421|r 58 AQALELFNSNKLDDARSSFEKILSQ--DNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDL 120 (218) Q Consensus 58 ~~al~~~~~~~~~~A~~~l~~l~~~--~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~ 120 (218) ..+..+.+.+++..|...|+.+... ....|..+..+-.+-.+++.-++.+|...+...+.... T Consensus 136 ~~a~~lfn~~~y~~A~~~l~~l~~rl~~~~~~~~~~~l~~~y~~WD~f~~~~A~~~L~~~~~~~~ 200 (379) T pfam09670 136 RQAIRLFNKGDYGSALRILEDLARRLPGRKDYAAYKLLSEGYLAWDSFDHEEAADYLSKVLALLL 200 (379) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHH T ss_conf 99999884568899999999998737974579999999999999865589999999999887541 No 221 >TIGR03061 pip_yhgE_Nterm YhgE/Pip N-terminal domain. This family contains the N-terminal domain of a family of multiple membrane-spanning proteins of Gram-positive bacteria. One member was shown to be a host protein essential for phage infection, so many members of this family are called "phage infection protein". A separate model, TIGR03062, represents the conserved C-terminal domain. The domains are separated by regions highly variable in both length and sequence, often containing extended heptad repeats as described in model TIGR03057. Probab=35.29 E-value=27 Score=14.07 Aligned_cols=33 Identities=21% Similarity=0.137 Sum_probs=23.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999997519999999999999999999999999 Q gi|254780421|r 15 KMGKCCYKWIAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 15 ~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) ++++.-.-+++++++++|-.+.+|++.|-.|.- T Consensus 5 ~~~~~~~~~~~~~~~~liP~ly~~~~l~a~wdP 37 (164) T TIGR03061 5 RLRKNKLLRIALIAIMLIPLLYGGLFLWAFWDP 37 (164) T ss_pred HHHHCCHHHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 876382899999999999999999998463687 No 222 >cd00280 TRFH Telomeric Repeat binding Factor or TTAGGG Repeat binding Factor, central (dimerization) domain Homology; TRFH. Telomeres are protein/DNA complexes that make up the physical ends of eukaryotic linear chromosomes and are essential for chromosome stability, protecting the chromosome ends from degradation and end-to-end fusion. Proteins TRF1, TRF2 and Taz1 bind telomeric DNA and are also involved in recruiting interacting proteins, TIN2, and Rap1, to the telomeres. It has also been demonstrated that PARP1 associates with TRF2 and is capable of poly(ADP-ribosyl)ation of TRF2, which affects binding of TRF2 to telomeric DNA. TRF1, TRF2 and Taz1 proteins contain three functional domains: an N-terminal acidic domain, a central TRF-specific/dimerization domain, and a C-terminal DNA binding domain with a single Myb-like repeat. Homodimerization, a prerequisite to DNA binding, results in the juxtaposition of two Myb DNA binding domains. Probab=34.64 E-value=27 Score=14.01 Aligned_cols=67 Identities=13% Similarity=0.218 Sum_probs=45.0 Q ss_pred HHHHHHHHHHHHHHCCCC----HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 577899999987312689----57899999999996302699899999975311464235689999999999 Q gi|254780421|r 105 TKNAAEIFLKVANDDLAP----LAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPMHQFANEILGISA 172 (218) Q Consensus 105 ~~~A~~~l~~i~~~~~~~----~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~a~ellg~~~ 172 (218) .+.|+..++.+-++-+.+ ..++.+-+...+.+.+..+.+++|...|..+.+++++... -..++..+- T Consensus 85 LESAl~v~~~I~kE~~~~e~l~e~I~~lik~qaV~VCmek~~Fk~A~eilkr~f~~~~~~k~-r~kLl~iI~ 155 (200) T cd00280 85 LESALMVLESIEKEFSLPETLHEEIRKLIKEQAVAVCMENGEFKKAEEVLKRLFSDPESQKL-RMKLLMIIR 155 (200) T ss_pred HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHCCCCCHHH-HHHHHHHHH T ss_conf 89999999999985177076899999999999999999815548899999998559984167-899999997 No 223 >TIGR00847 ccoS cytochrome oxidase maturation protein, cbb3-type; InterPro: IPR004714 Cytochrome cbb3 oxidases are found almost exclusively in Proteobacteria, and represent a distinctive class of proton-pumping respiratory haem-copper oxidases (HCO) that lack many of the key structural features that contribute to the reaction cycle of the intensely studied mitochondrial cytochrome c oxidase (CcO). Expression of cytochrome cbb3 oxidase allows human pathogens to colonise anoxic tissues and agronomically important diazotrophs to sustain nitrogen fixation . Genes encoding a cytochrome cbb3 oxidase were initially designated fixNOQP (ccoNOQP), the ccoNOQP operon is always found close to a second gene cluster, known as fixGHIS (ccoGHIS) whose expression is necessary for the assembly of a functional cbb3 oxidase. On the basis of their derived amino acid sequences each of the four proteins encoded by the ccoGHIS operon are thought to be membrane-bound. It has been suggested that they may function in concert as a multi-subunit complex, possibly playing a role in the uptake and metabolism of copper required for the assembly of the binuclear centre of cytochrome cbb3 oxidase. . Probab=33.94 E-value=28 Score=13.94 Aligned_cols=27 Identities=7% Similarity=-0.080 Sum_probs=13.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKKK 50 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~~ 50 (218) +|...|.+++|++|.+.|+-+-+..|. T Consensus 5 ~~L~Pisl~lG~~GL~AFlWslksGQy 31 (53) T TIGR00847 5 TILIPISLLLGGVGLVAFLWSLKSGQY 31 (53) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 778899999999999999985311478 No 224 >pfam10969 DUF2771 Protein of unknown function (DUF2771). This bacterial family of proteins has no known function. Probab=33.75 E-value=28 Score=13.92 Aligned_cols=25 Identities=4% Similarity=-0.141 Sum_probs=17.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 1999999999999999999999999 Q gi|254780421|r 22 KWIAPLLILMILSLAIWFYLFDGSH 46 (218) Q Consensus 22 k~ii~~~i~~i~~~~~~~~~~~~~q 46 (218) .-+++++++++++...|+.+|+... T Consensus 4 ~~~l~a~v~v~~~~~~gv~~W~~~~ 28 (161) T pfam10969 4 LALLLAGLVVVLAAVAGVGVWLLVR 28 (161) T ss_pred EEHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 2024888999888865423776278 No 225 >PRK10893 hypothetical protein; Provisional Probab=33.69 E-value=22 Score=14.58 Aligned_cols=20 Identities=30% Similarity=0.566 Sum_probs=11.3 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 51999999999999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAIWFY 40 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~~~~ 40 (218) -+|+++.+.+++++++||+. T Consensus 5 ~~w~~~~l~~~a~~l~~w~~ 24 (190) T PRK10893 5 RRWVIIVLSLAALVLIGWNM 24 (190) T ss_pred HHHHHHHHHHHHHHHHEEEE T ss_conf 77999999999999850551 No 226 >COG2979 Uncharacterized protein conserved in bacteria [Function unknown] Probab=33.28 E-value=29 Score=13.87 Aligned_cols=25 Identities=12% Similarity=0.142 Sum_probs=16.7 Q ss_pred HHHHHHHHHCCCCCHHHHHHHHHHH Q ss_conf 9999999835589989999999999 Q gi|254780421|r 182 KTIFEELAKDNNSPFGISTRSQMIL 206 (218) Q Consensus 182 ~~~y~~i~~~~~~p~~~r~ra~~~l 206 (218) |.+++.+......|+.++.-++.-+ T Consensus 194 r~YL~~La~~L~L~dalvd~lE~qv 218 (225) T COG2979 194 RSYLNALAGALGLPDALVDHLERQV 218 (225) T ss_pred HHHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 9999999987289889999999998 No 227 >TIGR00756 PPR pentatricopeptide repeat domain; InterPro: IPR002885 Pentatricopeptide repeat proteins are characterised by the presence of a tandem array of repeats, where the number of PPR motifs controls the affinity and specificity of the PPR protein for RNA. These proteins occur predominantly in plants, where they appear to play essential roles in RNA/DNA metabolism in mitochondria and chloroplasts . It has been suggested that each of the highly variable PPR proteins is a gene-specific regulator of plant organellar RNA metabolism. PPR proteins may also play a role in organelle biogenesis, probably via binding to organellar transcripts . Examples of PPR repeat-containing proteins include PET309 P32522 from SWISSPROT, which may be involved in RNA stabilisation , and crp1, which is involved in RNA processing . The repeat is associated with a predicted plant protein O49549 from SWISSPROT that has a domain organization similar to the human BRCA1 protein.. Probab=33.00 E-value=29 Score=13.84 Aligned_cols=29 Identities=28% Similarity=0.394 Sum_probs=19.7 Q ss_pred HHHHHHHHCCCHHHHHHHHHHHHHCCCCC Q ss_conf 99999998899899999999998355899 Q gi|254780421|r 167 ILGISALKFGKVQKAKTIFEELAKDNNSP 195 (218) Q Consensus 167 llg~~~~~~Gd~~~A~~~y~~i~~~~~~p 195 (218) .+-..+.+.|+.++|.+.|++.....-.| T Consensus 5 ~li~~~~~~g~~~~a~~~~~~M~~~g~~p 33 (35) T TIGR00756 5 TLIDGLCKAGRVEEALELFDEMKERGIKP 33 (35) T ss_pred HHHHHHHCCCCHHHHHHHHHHHHHCCCCC T ss_conf 88999862898789999999998778978 No 228 >KOG1464 consensus Probab=32.83 E-value=29 Score=13.83 Aligned_cols=103 Identities=16% Similarity=0.203 Sum_probs=52.4 Q ss_pred HHHHHHHHHHCCHHHHHHHHHHHHHHHCC---CCHHHHH---HHHHHH--HHHHHCCCCHHHHHHHHH-HHC----CCCC Q ss_conf 99999999511005778999999873126---8957899---999999--996302699899999975-311----4642 Q gi|254780421|r 92 NMYIASILVAKGDTKNAAEIFLKVANDDL---APLAVRY---AATLQA--ASILVDTSSYEEISKILQ-KLS----EPSN 158 (218) Q Consensus 92 ~l~lA~~~~~~g~~~~A~~~l~~i~~~~~---~~~~~~~---lA~l~l--a~~~~~~~~~d~a~~~l~-~~~----~~~~ 158 (218) ...+++++++-|++.+-...+.++-..-. .+...+. +-.+.. ..++-.+.+......... .+. -|.+ T Consensus 148 NtKLgkl~fd~~e~~kl~KIlkqLh~SCq~edGedD~kKGtQLLEiYAlEIQmYT~qKnNKkLK~lYeqalhiKSAIPHP 227 (440) T KOG1464 148 NTKLGKLYFDRGEYTKLQKILKQLHQSCQTEDGEDDQKKGTQLLEIYALEIQMYTEQKNNKKLKALYEQALHIKSAIPHP 227 (440) T ss_pred CCHHHHHHEEHHHHHHHHHHHHHHHHHHCCCCCCHHHHCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCCCCH T ss_conf 54376622109988899999999998855556731564440356457667665656405588999999987763048851 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 356899999999999889989999999999835589 Q gi|254780421|r 159 PMHQFANEILGISALKFGKVQKAKTIFEELAKDNNS 194 (218) Q Consensus 159 ~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~ 194 (218) --.+..+|.=|--+++.|.+.+|...|=++-..-+. T Consensus 228 lImGvIRECGGKMHlreg~fe~AhTDFFEAFKNYDE 263 (440) T KOG1464 228 LIMGVIRECGGKMHLREGEFEKAHTDFFEAFKNYDE 263 (440) T ss_pred HHHHHHHHCCCCCCCCCCHHHHHHHHHHHHHHCCCC T ss_conf 778689871772200143378887689999733343 No 229 >pfam10694 DUF2500 Protein of unknown function (DUF2500). The members of this family are largely confined to the Gammaproteobacteria. The function is not known. Probab=32.63 E-value=25 Score=14.19 Aligned_cols=20 Identities=30% Similarity=0.504 Sum_probs=8.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999 Q gi|254780421|r 26 PLLILMILSLAIWFYLFDGSH 46 (218) Q Consensus 26 ~~~i~~i~~~~~~~~~~~~~q 46 (218) +.++++|++++++. +++... T Consensus 5 ~iv~~iii~~~~~~-~~~~~~ 24 (107) T pfam10694 5 ILVIIAIIGLAIFV-FVRQIR 24 (107) T ss_pred EHHHHHHHHHHHHH-HHHHHH T ss_conf 03899999999999-999987 No 230 >pfam06120 Phage_HK97_TLTM Tail length tape measure protein. This family consists of the tail length tape measure protein from bacteriophage HK97 and related sequences from Escherichia coli O157:H7. Probab=32.52 E-value=30 Score=13.79 Aligned_cols=56 Identities=13% Similarity=-0.012 Sum_probs=27.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 588999999997519999999999999999999999999999999999999999841 Q gi|254780421|r 9 SYKGKCKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFN 65 (218) Q Consensus 9 ~~~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~ 65 (218) +.-++++.+..--=-+|++..-++..+.|.| |+-|+...|+.+.+-.|..-++.+. T Consensus 9 ~v~~~l~~ga~g~l~lvGG~pGlvmlaAGA~-Y~~YQ~qeQAresA~~yA~~ideIr 64 (311) T pfam06120 9 AVGGLLKGGAMGLLSLIGGLPGILMLGAGAW-YALYQNQEQARESAIDYADTIEEIR 64 (311) T ss_pred HHHHHHHHCHHHHHHHHCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9988987220567777246178999998899-9999989999999999997699999 No 231 >COG1422 Predicted membrane protein [Function unknown] Probab=32.29 E-value=30 Score=13.77 Aligned_cols=39 Identities=10% Similarity=0.078 Sum_probs=19.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHH Q ss_conf 999999999999999999999984138766699999997 Q gi|254780421|r 41 LFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKI 79 (218) Q Consensus 41 ~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l 79 (218) .-+...-++.++...+|++........+.+.+++.+++. T Consensus 68 liD~ekm~~~qk~m~efq~e~~eA~~~~d~~~lkkLq~~ 106 (201) T COG1422 68 LIDQEKMKELQKMMKEFQKEFREAQESGDMKKLKKLQEK 106 (201) T ss_pred HCCHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHH T ss_conf 602999999999999999999999980889999999999 No 232 >PTZ00202 tuzin; Provisional Probab=31.99 E-value=30 Score=13.74 Aligned_cols=49 Identities=8% Similarity=0.043 Sum_probs=22.5 Q ss_pred HHHHHHHHHHHH-----HHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 588999999997-----5199-999999999999999999999999999999999 Q gi|254780421|r 9 SYKGKCKMGKCC-----YKWI-APLLILMILSLAIWFYLFDGSHEKKKNIVGENF 57 (218) Q Consensus 9 ~~~~~~~~~~~~-----yk~i-i~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y 57 (218) -++||||----. |+-+ -..+-++.+..+++|++-+|..+++.-+...+- T Consensus 295 l~rERVKERd~RY~l~KYsGvvSA~~A~LGv~svFgWN~knyr~qQRs~Ql~~Av 349 (664) T PTZ00202 295 LRRERVKDTNFRYALAKYKGTMSCIAGVLVVAYVFTANLRAYRRQQRGHQLRTAI 349 (664) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999988788999998752178789999989999865467788777678999999 No 233 >PRK08881 rpsN 30S ribosomal protein S14; Reviewed Probab=31.55 E-value=31 Score=13.69 Aligned_cols=32 Identities=9% Similarity=0.140 Sum_probs=16.1 Q ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 89999999999835589989999999999950 Q gi|254780421|r 178 VQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 178 ~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) +..-+..|..++.+...|...|..++..|..| T Consensus 20 ~~~kR~~lK~~~~n~~l~~e~r~~a~~kL~~L 51 (101) T PRK08881 20 YAAKRAELKAIIDDENLSEEERFEARLKLQKL 51 (101) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHC T ss_conf 99999999999857799999999999998746 No 234 >PRK00269 zipA cell division protein ZipA; Reviewed Probab=31.53 E-value=22 Score=14.54 Aligned_cols=20 Identities=15% Similarity=-0.210 Sum_probs=9.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~ 45 (218) ||+|+|+++.+++=| ||..+ T Consensus 10 IIVGalaI~AlLlHG--w~r~R 29 (295) T PRK00269 10 IVIGIIVIAGILFDG--WRRMR 29 (295) T ss_pred HHHHHHHHHHHHHHH--HEECC T ss_conf 999999999999865--02113 No 235 >TIGR02588 TIGR02588 conserved hypothetical protein TIGR02588; InterPro: IPR013417 The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Nostoc sp. PCC 7120, Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.. Probab=31.29 E-value=31 Score=13.67 Aligned_cols=26 Identities=19% Similarity=0.421 Sum_probs=20.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWFYLFDGSHEK 48 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~~~~~~~q~~ 48 (218) |+.+++-+++++.+.||..|+....+ T Consensus 5 Wv~~gis~l~l~~~~G~~~yd~l~~~ 30 (122) T TIGR02588 5 WVTFGISVLILAALLGLVIYDWLSEE 30 (122) T ss_pred EEHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 21278999999999999877544169 No 236 >PRK06007 fliF flagellar MS-ring protein; Reviewed Probab=30.82 E-value=32 Score=13.62 Aligned_cols=36 Identities=8% Similarity=-0.135 Sum_probs=20.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 889999999975199999999999999999999999 Q gi|254780421|r 10 YKGKCKMGKCCYKWIAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 10 ~~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~ 45 (218) +++..+.+-+.=|.+++++++++++++++..+|... T Consensus 12 ~~~~~~~l~~~qki~l~~~~~~~i~~~~~l~~~~~~ 47 (540) T PRK06007 12 LLEFLKKLSKLRKIALIGAAAAVIAAIVALVLWASR 47 (540) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 999997069888999999999999999999984169 No 237 >TIGR02161 napC_nirT periplasmic nitrate (or nitrite) reductase c-type cytochrome, NapC/NirT family; InterPro: IPR011885 This entry contains NapC, a predicted membrane-anchored four-heme c-type cytochrome that forms one component of the periplasmic nitrate reductase along with NapA, NapB, NapD, NapE, and NapF subunits. A single known exception at this time is NirT, which is instead a component of a nitrite reductase. This family excludes TorC subunits of trimethylamine N-oxide (TMAO) reductases .; GO: 0020037 heme binding, 0006118 electron transport, 0016021 integral to membrane. Probab=30.48 E-value=32 Score=13.58 Aligned_cols=34 Identities=12% Similarity=0.036 Sum_probs=18.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHH Q ss_conf 00577899999987312689578999999999963 Q gi|254780421|r 103 GDTKNAAEIFLKVANDDLAPLAVRYAATLQAASIL 137 (218) Q Consensus 103 g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~ 137 (218) -+...+.+.|..+.-.-+...-|.+ -|+.+|.-. T Consensus 93 RK~~As~Ev~~~~~G~idt~~KF~~-~R~~lA~~E 126 (185) T TIGR02161 93 RKVQASKEVYAKLFGTIDTREKFEA-KRLELAERE 126 (185) T ss_pred HHHHHHHHHHHHHCCCCCCHHHHHH-HHHHHHHHH T ss_conf 9999989998877044340788999-999999998 No 238 >TIGR02833 spore_III_AB stage III sporulation protein AB; InterPro: IPR014198 This entry represents the stage III sporulation protein AB, which is encoded in a spore formation operon: spoIIIAABCDEFGH that is under sigma G regulation . A comparative genome analysis of all sequenced genomes of Firmicutes shows that the proteins are strictly conserved among the sub-set of endospore-forming species.. Probab=30.30 E-value=32 Score=13.56 Aligned_cols=91 Identities=18% Similarity=0.189 Sum_probs=49.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCC---CHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999999999999984138---766699999997312552069999999999995 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSN---KLDDARSSFEKILSQDNKLYNPLSNMYIASILV 100 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~---~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~ 100 (218) ++++++++....+.||.|=+.+.++-.+ =..+..|+...+++ ...=....++++....+++ .-.-|..+.-.+ T Consensus 2 llGa~Liv~~~T~iGF~~a~r~~~R~~~--Lr~l~~AL~~L~~EI~Y~~TPL~ea~~~iA~~~~~p--v~~lF~~~s~~L 77 (170) T TIGR02833 2 LLGAILIVLSSTWIGFEYANRFKERPRQ--LRQLKNALQSLEAEIVYGHTPLPEAFKKIAKKSPKP--VNKLFESASERL 77 (170) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHCCCH--HHHHHHHHHHHH T ss_conf 6899999997679999999889887999--999999999999730178764689999999741524--899999999872 Q ss_pred HCCHHHHHHHHHHHHHHH Q ss_conf 110057789999998731 Q gi|254780421|r 101 AKGDTKNAAEIFLKVAND 118 (218) Q Consensus 101 ~~g~~~~A~~~l~~i~~~ 118 (218) +.+....+-+.|...++. T Consensus 78 ~~~~~~~v~~Aw~~~l~e 95 (170) T TIGR02833 78 KEGEGETVYEAWKKALNE 95 (170) T ss_pred CCCCCCCHHHHHHHHHHH T ss_conf 268999988899999987 No 239 >PRK07718 fliL flagellar basal body-associated protein FliL; Reviewed Probab=29.79 E-value=33 Score=13.51 Aligned_cols=23 Identities=17% Similarity=0.095 Sum_probs=11.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 51999999999999999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAIWFYLFD 43 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~~~~~~~ 43 (218) -+.++++++++++++++++++.. T Consensus 6 i~Imiiil~~l~l~G~~a~~~~~ 28 (142) T PRK07718 6 IKIMLIILIVIALLGTAALVVVI 28 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHEEE T ss_conf 66999999999999999872464 No 240 >PRK10617 cytochrome c-type protein NapC; Provisional Probab=29.56 E-value=33 Score=13.49 Aligned_cols=44 Identities=14% Similarity=0.126 Sum_probs=20.7 Q ss_pred HHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999997519-99999999999999999999999999999999999 Q gi|254780421|r 15 KMGKCCYKW-IAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFA 58 (218) Q Consensus 15 ~~~~~~yk~-ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~ 58 (218) ++|.-..+. -+...+++++|+++|..+|...+..-....++.|= T Consensus 13 r~w~~~~~ps~~~lg~l~~iG~~~Gii~w~gf~~~me~Tnt~eFC 57 (200) T PRK10617 13 RLWKWWRTPSRLALGTLLLIGFVGGIIFWGGFNTGMEKANTEEFC 57 (200) T ss_pred HHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHH T ss_conf 999997581577999999999999999999999999982884144 No 241 >PRK06770 hypothetical protein; Provisional Probab=29.54 E-value=33 Score=13.48 Aligned_cols=21 Identities=24% Similarity=0.523 Sum_probs=12.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 997519999999999999999 Q gi|254780421|r 18 KCCYKWIAPLLILMILSLAIW 38 (218) Q Consensus 18 ~~~yk~ii~~~i~~i~~~~~~ 38 (218) ++.+||+++.+.+++++++.. T Consensus 2 Kk~~kwi~ii~~~~Vl~ig~t 22 (185) T PRK06770 2 KKLFKWIGIIAGMAVLGIGVT 22 (185) T ss_pred HHHHHHHHHHHHHHHHHHHHH T ss_conf 059999999999999999888 No 242 >COG5110 RPN1 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones] Probab=29.52 E-value=33 Score=13.48 Aligned_cols=152 Identities=14% Similarity=0.102 Sum_probs=86.5 Q ss_pred HHHHHHHHHHHHHHHHHHH-H---HHHHHHHHHHHHHHH-----HHC---CCHHHHHHHHHHHHHCCCCCHHHHHHHHHH Q ss_conf 9999999999999999999-9---999999999999998-----413---876669999999731255206999999999 Q gi|254780421|r 29 ILMILSLAIWFYLFDGSHE-K---KKNIVGENFAQALEL-----FNS---NKLDDARSSFEKILSQDNKLYNPLSNMYIA 96 (218) Q Consensus 29 i~~i~~~~~~~~~~~~~q~-~---~~~~as~~y~~al~~-----~~~---~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA 96 (218) ++-+++.+|-.+.|+.-.. + +..=+.+.|..+-.. ... +..+-|...+.....++++.. ..|.+.-- T Consensus 396 liSa~aSIG~i~~WN~d~gl~~Ldkyly~de~~~KaGaLLGig~s~~~v~~E~~palalLs~yl~s~s~k~-~~aaiLGl 474 (881) T COG5110 396 LISAFASIGVIESWNSDKGLETLDKYLYADESYRKAGALLGIGLSGLRVFEERPPALALLSNYLQSSSSKH-VIAAILGL 474 (881) T ss_pred HHHEEECCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCEEEEEECCCCCCCCCCHHHHHHHHHCCCCCHHH-HHHHHHHH T ss_conf 22101000113122047669999888744741012131012200233223446619999997524874477-99998624 Q ss_pred HHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCC-----HHHHHHHHHHHHH Q ss_conf 99951100577899999987312689578999999999963026998999999753114642-----3568999999999 Q gi|254780421|r 97 SILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSN-----PMHQFANEILGIS 171 (218) Q Consensus 97 ~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~-----~~~~~a~ellg~~ 171 (218) .+.+...+.++-.+.++.+.-+.+.|.....+|.+.++.+....-+-|-.-..++.....+. .|.-.-.-.+|.. T Consensus 475 g~afsGt~~eevl~lL~Pi~~std~pie~~~~asltLg~vFvGtcngD~ts~ilqtf~Er~~~e~~tqw~RFlaLgLa~L 554 (881) T COG5110 475 GAAFSGTQAEEVLELLQPIMFSTDSPIEVVFFASLTLGSVFVGTCNGDLTSLILQTFVERGKIESETQWFRFLALGLASL 554 (881) T ss_pred HHHHCCCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHEEEECCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 77635776799999851454078970889999997654268614685388999999998501312548999999999999 Q ss_pred HHHCCCHHHH Q ss_conf 9988998999 Q gi|254780421|r 172 ALKFGKVQKA 181 (218) Q Consensus 172 ~~~~Gd~~~A 181 (218) +..++|...+ T Consensus 555 f~g~~d~~d~ 564 (881) T COG5110 555 FYGRKDQVDD 564 (881) T ss_pred HCCCCCHHHH T ss_conf 7365323688 No 243 >PRK10834 hypothetical protein; Provisional Probab=29.14 E-value=34 Score=13.44 Aligned_cols=33 Identities=15% Similarity=0.084 Sum_probs=20.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999751999999999999999999999999999 Q gi|254780421|r 16 MGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKK 49 (218) Q Consensus 16 ~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~ 49 (218) |||.....+++.++++++.+++ ..+|-.++.+. T Consensus 1 m~Krl~~~~~~~~~~~~~~i~~-~d~wv~~~~~~ 33 (239) T PRK10834 1 MLKRVFYSLLVLIGLLLLTVLG-LDRWMSWKTAP 33 (239) T ss_pred CHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHCCC T ss_conf 9799999999999999999999-87988544067 No 244 >PRK11086 sensory histidine kinase DcuS; Provisional Probab=28.84 E-value=34 Score=13.41 Aligned_cols=18 Identities=22% Similarity=0.386 Sum_probs=9.7 Q ss_pred HHHCCCCHHHHHHHHHHH Q ss_conf 630269989999997531 Q gi|254780421|r 136 ILVDTSSYEEISKILQKL 153 (218) Q Consensus 136 ~~~~~~~~d~a~~~l~~~ 153 (218) -+++.+.++++...+... T Consensus 358 Gll~l~~~d~~~~~i~~~ 375 (541) T PRK11086 358 GLLHLKSYDQLEDYILKT 375 (541) T ss_pred HHHHHCHHHHHHHHHHHH T ss_conf 888643089999999999 No 245 >COG1566 EmrA Multidrug resistance efflux pump [Defense mechanisms] Probab=28.35 E-value=35 Score=13.35 Aligned_cols=32 Identities=13% Similarity=0.029 Sum_probs=16.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999997519999999999999999999999999 Q gi|254780421|r 15 KMGKCCYKWIAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 15 ~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) +.-++.++.++..+++++.+ +.+|+||..+.. T Consensus 9 ~~~~k~~~~~~~~~~~~~a~-~~~~~~~~~~~~ 40 (352) T COG1566 9 KAKRKRFKLPLTLLVVLAAV-LYGGLIWLYYRN 40 (352) T ss_pred HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHCC T ss_conf 44556778899999999999-999999997246 No 246 >PHA02291 hypothetical protein Probab=28.29 E-value=35 Score=13.35 Aligned_cols=25 Identities=20% Similarity=0.309 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5199999999999999999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~~~~~~~~~ 45 (218) -+||...+++++++....-++|.+. T Consensus 4 K~~iFYiL~~~VL~~si~sY~~sS~ 28 (131) T PHA02291 4 KASIFYILVVIVLAFSISSYYISSF 28 (131) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHEE T ss_conf 0103789999999999988742105 No 247 >pfam11381 DUF3185 Protein of unknown function (DUF3185). Some members in this bacterial family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known. Probab=28.12 E-value=23 Score=14.45 Aligned_cols=20 Identities=15% Similarity=0.481 Sum_probs=9.6 Q ss_pred HHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999 Q gi|254780421|r 26 PLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 26 ~~~i~~i~~~~~~~~~~~~~ 45 (218) ++++++++|.+..++.|+.. T Consensus 3 iGi~Llv~GivLl~~Gy~~~ 22 (59) T pfam11381 3 IGIALLVLGIVLLYFGYQAS 22 (59) T ss_pred EEEEHHHHHHHHHHHHHHHC T ss_conf 77746986599998621010 No 248 >PRK10856 hypothetical protein; Provisional Probab=27.66 E-value=36 Score=13.28 Aligned_cols=15 Identities=7% Similarity=0.282 Sum_probs=5.5 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9999999999999999 Q gi|254780421|r 31 MILSLAIWFYLFDGSH 46 (218) Q Consensus 31 ~i~~~~~~~~~~~~~q 46 (218) +++++++.| ||+.++ T Consensus 123 vvl~l~~~w-~~~~~~ 137 (332) T PRK10856 123 VVIGLTGAW-WWQNHK 137 (332) T ss_pred HHHHHHHHH-HHCCCC T ss_conf 999988887-620333 No 249 >pfam10039 DUF2275 Predicted integral membrane protein (DUF2275). This domain, found in various hypothetical bacterial proteins and in the RNA polymerase sigma factor, has no known function. Probab=27.44 E-value=36 Score=13.25 Aligned_cols=22 Identities=5% Similarity=-0.023 Sum_probs=15.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999997519999999999999 Q gi|254780421|r 14 CKMGKCCYKWIAPLLILMILSL 35 (218) Q Consensus 14 ~~~~~~~yk~ii~~~i~~i~~~ 35 (218) .++|.++|+-+-+.+|+++++. T Consensus 28 ~R~f~~~~ra~ala~l~v~i~A 49 (199) T pfam10039 28 FRLFLKLKRALALAALIVFILA 49 (199) T ss_pred HHHHHHHHHHHHHHHHHHHHHH T ss_conf 9988999999999999999998 No 250 >COG0457 NrfG FOG: TPR repeat [General function prediction only] Probab=27.34 E-value=36 Score=13.24 Aligned_cols=127 Identities=25% Similarity=0.242 Sum_probs=71.1 Q ss_pred HCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHH-HHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCH Q ss_conf 138766699999997312552069999999999-9951100577899999987312689578999999999963026998 Q gi|254780421|r 65 NSNKLDDARSSFEKILSQDNKLYNPLSNMYIAS-ILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSY 143 (218) Q Consensus 65 ~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~-~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~ 143 (218) ..+++..+...+.......+.. ......... .+...|+.+.+...+...+. ............+.........+.+ T Consensus 107 ~~~~~~~a~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 183 (291) T COG0457 107 ALGKYEEALELLEKALALDPDP--DLAEALLALGALYELGDYEEALELYEKALE-LDPELNELAEALLALGALLEALGRY 183 (291) T ss_pred HHHHHHHHHHHHHHHHHCCCCC--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9843999999999987336564--269999999999986409999999999987-4944467899999999898870339 Q ss_pred HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC Q ss_conf 999999753114642356899999999999889989999999999835589 Q gi|254780421|r 144 EEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNS 194 (218) Q Consensus 144 d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~ 194 (218) +.++..+.................++..+...|+...|...+..+...... T Consensus 184 ~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~ 234 (291) T COG0457 184 EEALELLEKALKLNPDDDAEALLNLGLLYLKLGKYEEALEYYEKALELDPD 234 (291) T ss_pred HHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHCCC T ss_conf 999999999986280122799999999998752599999999999872910 No 251 >PTZ00046 rifin; Provisional Probab=27.33 E-value=36 Score=13.24 Aligned_cols=39 Identities=10% Similarity=-0.091 Sum_probs=17.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHH Q ss_conf 24588999999997519999999999999999--99999999 Q gi|254780421|r 7 ANSYKGKCKMGKCCYKWIAPLLILMILSLAIW--FYLFDGSH 46 (218) Q Consensus 7 ~~~~~~~~~~~~~~yk~ii~~~i~~i~~~~~~--~~~~~~~q 46 (218) .+|+-+||-=|--+-+..++ .|+.-+|++|| .+.|.... T Consensus 129 EKSlAdKvEK~CLkCg~~LG-gVap~~Gllggi~~~~Wk~~a 169 (348) T PTZ00046 129 EKSLADKVEKGCLRCGCGLG-GVAPSWGLLGGLAYNAWKKAA 169 (348) T ss_pred HHHHHHHHHHHHHHCCCCCC-CCCCCCCCCCCHHHHHHHHHH T ss_conf 12498999998986388566-526332311305899888999 No 252 >pfam11044 TMEMspv1-c74-12 Plectrovirus spv1-c74 ORF 12 transmembrane protein. This is a family of proteins expressed by Plectroviruses. The plectroviruses are single-stranded DNA viruses belonging to the Inoviridae. Except that it is a putative transmembrane protein the function is not known. Probab=27.23 E-value=36 Score=13.23 Aligned_cols=24 Identities=17% Similarity=0.261 Sum_probs=11.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999 Q gi|254780421|r 26 PLLILMILSLAIWFYLFDGSHEKK 49 (218) Q Consensus 26 ~~~i~~i~~~~~~~~~~~~~q~~~ 49 (218) +..|+++++.+.|+...-++..+| T Consensus 8 ifsvviil~if~~~glsiyqki~q 31 (49) T pfam11044 8 IFSVVIILGVFAWIGLSIYQKIRQ 31 (49) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999999999 No 253 >KOG1463 consensus Probab=26.88 E-value=37 Score=13.19 Aligned_cols=91 Identities=13% Similarity=0.112 Sum_probs=59.3 Q ss_pred HHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCC-----CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH---HHHCCCC Q ss_conf 5789999999999630269989999997531146-----4235689999999999988998999999999---9835589 Q gi|254780421|r 123 LAVRYAATLQAASILVDTSSYEEISKILQKLSEP-----SNPMHQFANEILGISALKFGKVQKAKTIFEE---LAKDNNS 194 (218) Q Consensus 123 ~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~-----~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~---i~~~~~~ 194 (218) ..+|.--.-|+++++.+.++|-+|+....++..+ +.+...--+-+.--.|....+..+|+..+.. ..+.... T Consensus 124 tFLRq~Learli~Ly~d~~~YteAlaL~~~L~rElKKlDDK~lLvev~llESK~y~~l~Nl~KakasLTsART~AnaiYc 203 (411) T KOG1463 124 TFLRQSLEARLIRLYNDTKRYTEALALINDLLRELKKLDDKILLVEVHLLESKAYHALRNLPKAKASLTSARTTANAIYC 203 (411) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCEEEEHHHHHHHHHHHHCCHHHHHHHHHHHHHHCCCCC T ss_conf 99999999999999985678999999889999999850550030121014458899883125678888888875032016 Q ss_pred CHHHHHHHHHHHHHHCCCH Q ss_conf 9899999999999500340 Q gi|254780421|r 195 PFGISTRSQMILANIIASD 213 (218) Q Consensus 195 p~~~r~ra~~~l~~l~~~~ 213 (218) |+.++.-+-+.-..+-+.+ T Consensus 204 pPqlQa~lDLqSGIlha~e 222 (411) T KOG1463 204 PPQLQATLDLQSGILHAAE 222 (411) T ss_pred CHHHHHHHHHHCCCEEECC T ss_conf 8888877788526311000 No 254 >pfam09577 Spore_YpjB Sporulation protein YpjB (SpoYpjB). These proteins are found in the endospore-forming bacteria which include Bacillus species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set. Probab=26.82 E-value=37 Score=13.19 Aligned_cols=135 Identities=11% Similarity=0.051 Sum_probs=72.4 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCCCC-CH----HHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHH Q ss_conf 9999999984138766699999997312552-06----999999999999511005778999999873126895789999 Q gi|254780421|r 55 ENFAQALELFNSNKLDDARSSFEKILSQDNK-LY----NPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAA 129 (218) Q Consensus 55 ~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~-~y----a~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA 129 (218) ..-..++....++++++|...+.-+.+...+ .+ -....+.... --+++|+..+... +.+...-++..+ T Consensus 7 ~lsD~~Lql~K~~~yeeAkqvL~yFs~~F~~~~~~~~~~t~~elR~vT-----~~~~~a~~al~a~--~~~~~eki~~~t 79 (232) T pfam09577 7 DLSDTVLQLTKQQKYEEAKQVLQYFSKQFLSADYEDRELTMDELRQVT-----LAYEDAVKALTAE--DMPDQEKIRAVT 79 (232) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH-----HHHHHHHHHHHHC--CCCHHHHHHHHH T ss_conf 558999999873008899999999999987463544676899999999-----9999999998604--798799999999 Q ss_pred HHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHHH-HHHHH-----HHHHHHCCCHHHHHHHHHHHHHC---------CCC Q ss_conf 9999996302699899999975311464235689-99999-----99999889989999999999835---------589 Q gi|254780421|r 130 TLQAASILVDTSSYEEISKILQKLSEPSNPMHQF-ANEIL-----GISALKFGKVQKAKTIFEELAKD---------NNS 194 (218) Q Consensus 130 ~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~~~~-a~ell-----g~~~~~~Gd~~~A~~~y~~i~~~---------~~~ 194 (218) ++|++.-.+. ++..|.+.. -.-+. ---+...||...=++.++..++. -+. T Consensus 80 ~fRL~vDA~~---------------s~~qPLW~~~e~~vM~~f~~mk~a~~~~d~~~f~~~~n~Fl~~Y~~I~PSl~Idl 144 (232) T pfam09577 80 QFRLVVDAIV---------------SKSQPLWGSLEKPVMDAFSAMKEAAQKEDSEAFQEKLNEFLSLYDMIYPSLTIDV 144 (232) T ss_pred HHHHHHHHHC---------------CCCCCHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHCCCCEEEEC T ss_conf 9999998870---------------6676438988999999999999999838889999999999998722064036656 Q ss_pred CHHHHHHHHHHHHHHCC Q ss_conf 98999999999995003 Q gi|254780421|r 195 PFGISTRSQMILANIIA 211 (218) Q Consensus 195 p~~~r~ra~~~l~~l~~ 211 (218) |+..-+|+...++.|+. T Consensus 145 ~~~~~q~v~~~i~~l~~ 161 (232) T pfam09577 145 SEKQLQRVDSHLSVLEN 161 (232) T ss_pred CHHHHHHHHHHHHHHHH T ss_conf 99999989999999970 No 255 >PRK12800 fliF flagellar MS-ring protein; Reviewed Probab=26.56 E-value=37 Score=13.16 Aligned_cols=20 Identities=10% Similarity=0.144 Sum_probs=16.2 Q ss_pred HHHHHHHHHHHHHHCCCHHH Q ss_conf 89999999999950034032 Q gi|254780421|r 196 FGISTRSQMILANIIASDQR 215 (218) Q Consensus 196 ~~~r~ra~~~l~~l~~~~~~ 215 (218) ..+++|++.+|.-+-|.+.+ T Consensus 253 ~~~~~rI~~~L~pivG~gn~ 272 (572) T PRK12800 253 SSYNQRIRELLEPMTGPGRV 272 (572) T ss_pred HHHHHHHHHHHHHHCCCCCE T ss_conf 99999999998866278825 No 256 >PRK04561 tatA twin arginine translocase protein A; Provisional Probab=26.37 E-value=38 Score=13.15 Aligned_cols=15 Identities=20% Similarity=0.844 Sum_probs=8.6 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 975199999999999 Q gi|254780421|r 19 CCYKWIAPLLILMIL 33 (218) Q Consensus 19 ~~yk~ii~~~i~~i~ 33 (218) +.+.|+|+++||+++ T Consensus 5 Siwh~lIv~~ivvll 19 (75) T PRK04561 5 SIWHWLVVLVIVLLV 19 (75) T ss_pred CHHHHHHHHHHHHHH T ss_conf 389999999999998 No 257 >pfam10177 DUF2371 Uncharacterized conserved protein (DUF2371). This is a family of proteins conserved from nematodes to humans. The function is not known. Probab=25.82 E-value=18 Score=15.19 Aligned_cols=32 Identities=9% Similarity=0.256 Sum_probs=17.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 89999999975199999999999999999999 Q gi|254780421|r 11 KGKCKMGKCCYKWIAPLLILMILSLAIWFYLF 42 (218) Q Consensus 11 ~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~ 42 (218) |+|+|+---.-...+++++++++|.+.....| T Consensus 32 rGKlrl~S~sg~~~~lG~lVllVGiaMAvlGY 63 (142) T pfam10177 32 RGKLRLVSASGRALILGVLILLVGIAMAVLGY 63 (142) T ss_pred EEEEEECCHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 53789716868999983999998588862101 No 258 >KOG1308 consensus Probab=25.58 E-value=39 Score=13.05 Aligned_cols=72 Identities=17% Similarity=0.115 Sum_probs=43.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHH Q ss_conf 99999999999999999998413876669999999731255206999999999999511005778999999873 Q gi|254780421|r 44 GSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVAN 117 (218) Q Consensus 44 ~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~ 117 (218) ...+.....+++.=.+++..+..+..++|+..+..-+.-.++. .+-+-..+.++++..+...|+.-+...+. T Consensus 105 e~tee~~dqa~d~k~~A~ealn~G~~~~ai~~~t~ai~lnp~~--A~l~~kR~sv~lkl~kp~aairD~d~A~e 176 (377) T KOG1308 105 EITEEMMDQANDKKVQASEALNDGEFDTAIELFTSAIELNPPL--AILYAKRASVFLKLKKPNAAIRDCDFAIE 176 (377) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCCCCCCCH--HHHCCCCCCEEEECCCCCHHHHHHHHHHC T ss_conf 6449999998888788898732764333311553120369943--44105534100223687346540245541 No 259 >KOG4507 consensus Probab=25.38 E-value=39 Score=13.02 Aligned_cols=93 Identities=18% Similarity=0.183 Sum_probs=43.9 Q ss_pred HHHHHHHH-HHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHH---HHCCCCCHHHHHHHHH Q ss_conf 99999999-5110057789999998731268957899999999996302699899999975---3114642356899999 Q gi|254780421|r 92 NMYIASIL-VAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQ---KLSEPSNPMHQFANEI 167 (218) Q Consensus 92 ~l~lA~~~-~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~---~~~~~~~~~~~~a~el 167 (218) .+.+|.++ .-.|+..-|++-++-...+.+.. ..+....++.+++.-+-..+|...|. .+. .+.| +.... T Consensus 609 ~ln~aglywr~~gn~~~a~~cl~~a~~~~p~~---~~v~~v~la~~~~~~~~~~da~~~l~q~l~~~-~sep---l~~~~ 681 (886) T KOG4507 609 ILNEAGLYWRAVGNSTFAIACLQRALNLAPLQ---QDVPLVNLANLLIHYGLHLDATKLLLQALAIN-SSEP---LTFLS 681 (886) T ss_pred EEECCCCEEEECCCCHHHHHHHHHHHHCCHHH---HCCCHHHHHHHHHHHHHHCCHHHHHHHHHHHC-CCCC---HHHHH T ss_conf 86245315664278088999999887058265---23539999999998622100889999998650-2675---28875 Q ss_pred HHHHHHHCCCHHHHHHHHHHHHHC Q ss_conf 999999889989999999999835 Q gi|254780421|r 168 LGISALKFGKVQKAKTIFEELAKD 191 (218) Q Consensus 168 lg~~~~~~Gd~~~A~~~y~~i~~~ 191 (218) +|..++...|.++|.++|+.+++. T Consensus 682 ~g~~~l~l~~i~~a~~~~~~a~~~ 705 (886) T KOG4507 682 LGNAYLALKNISGALEAFRQALKL 705 (886) T ss_pred HCHHHHHHHHHHHHHHHHHHHHHC T ss_conf 023788775048899999999734 No 260 >pfam11780 DUF3318 Protein of unknown function (DUF3318). This is a bacterial family of uncharacterized proteins. Probab=25.22 E-value=40 Score=13.00 Aligned_cols=23 Identities=9% Similarity=-0.115 Sum_probs=10.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999 Q gi|254780421|r 30 LMILSLAIWFYLFDGSHEKKKNI 52 (218) Q Consensus 30 ~~i~~~~~~~~~~~~~q~~~~~~ 52 (218) +++.++.++|..|+-||+.+.++ T Consensus 102 ~~la~GLa~~a~~riwq~~~~~~ 124 (142) T pfam11780 102 LGLAGGLAAWAAYRIWQQNKSVP 124 (142) T ss_pred HHHHHHHHHHHHHHHHHHCCCCH T ss_conf 99999999999999999835736 No 261 >TIGR01411 tatAE twin arginine-targeting protein translocase, TatA/E family; InterPro: IPR006312 This family represents TatA and TatE. The Tat (twin-arginine translocation) system is a Sec-independent exporter for folded proteins, often with a redox cofactor already bound, across the bacterial inner membrane. Functionally equivalent systems are found in the chloroplast and some in archaeal species. The signal peptide recognized by the Tat system is modeled by IPR006311 from INTERPRO. ; GO: 0015031 protein transport, 0016021 integral to membrane. Probab=24.71 E-value=40 Score=12.95 Aligned_cols=13 Identities=15% Similarity=0.634 Sum_probs=6.5 Q ss_pred HHHHHHHHHHHHH Q ss_conf 5199999999999 Q gi|254780421|r 21 YKWIAPLLILMIL 33 (218) Q Consensus 21 yk~ii~~~i~~i~ 33 (218) +.|+|+.+|++++ T Consensus 6 ~~~liiL~v~~Ll 18 (48) T TIGR01411 6 WHLLIILVVILLL 18 (48) T ss_pred HHHHHHHHHHHHH T ss_conf 9999999998864 No 262 >pfam11446 DUF2897 Protein of unknown function (DUF2897). This is a bacterial family of uncharacterized proteins. Probab=24.33 E-value=41 Score=12.90 Aligned_cols=17 Identities=18% Similarity=0.444 Sum_probs=9.3 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 51999999999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAI 37 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~ 37 (218) +-|+|+.+|++++++-. T Consensus 4 ~~WlIiilv~gvIvgnl 20 (55) T pfam11446 4 EVWLIIILVLGVIVGNL 20 (55) T ss_pred EHHHHHHHHHHHHHHHH T ss_conf 11789999999998259 No 263 >pfam11337 DUF3139 Protein of unknown function (DUF3139). This family of proteins with unknown function appears to be restricted to Firmicutes. Probab=24.25 E-value=41 Score=12.89 Aligned_cols=24 Identities=8% Similarity=0.009 Sum_probs=13.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999999 Q gi|254780421|r 39 FYLFDGSHEKKKNIVGENFAQALE 62 (218) Q Consensus 39 ~~~~~~~q~~~~~~as~~y~~al~ 62 (218) |+.-+.++..++.++-+-|.+... T Consensus 25 ~f~~~~y~~~~a~~~vd~yl~~~g 48 (85) T pfam11337 25 FFAGNQYQRHKALKAIDWYLAKQG 48 (85) T ss_pred HHHCCCCCHHHHHHHHHHHHHHCC T ss_conf 861552331668999999999828 No 264 >PRK00720 tatA twin arginine translocase protein A; Provisional Probab=24.02 E-value=41 Score=12.91 Aligned_cols=15 Identities=20% Similarity=0.786 Sum_probs=7.8 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 975199999999999 Q gi|254780421|r 19 CCYKWIAPLLILMIL 33 (218) Q Consensus 19 ~~yk~ii~~~i~~i~ 33 (218) ..+-|+|+++|++++ T Consensus 5 siw~~lIi~~vVllL 19 (71) T PRK00720 5 SIWHWLIVLAVVLLL 19 (71) T ss_pred CHHHHHHHHHHHHHH T ss_conf 299999999999999 No 265 >COG5261 IQG1 Protein involved in regulation of cellular morphogenesis/cytokinesis [Cell division and chromosome partitioning / Signal transduction mechanisms] Probab=23.90 E-value=42 Score=12.85 Aligned_cols=22 Identities=18% Similarity=0.141 Sum_probs=14.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999 Q gi|254780421|r 27 LLILMILSLAIWFYLFDGSHEK 48 (218) Q Consensus 27 ~~i~~i~~~~~~~~~~~~~q~~ 48 (218) .+++=+++.+|++.+|.+.... T Consensus 606 f~~~~~lg~Ig~f~~~~fv~e~ 627 (1054) T COG5261 606 FAEIDSLGLIGGFFFLRFVNEA 627 (1054) T ss_pred HHHHHHHHHHHHHHHHHCCCHH T ss_conf 9989887564323543100286 No 266 >COG5116 RPN2 26S proteasome regulatory complex component [Posttranslational modification, protein turnover, chaperones] Probab=23.85 E-value=42 Score=12.84 Aligned_cols=194 Identities=14% Similarity=0.078 Sum_probs=107.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHH Q ss_conf 58899999999751999999999999999--------9999999999999999999999999841387666999999973 Q gi|254780421|r 9 SYKGKCKMGKCCYKWIAPLLILMILSLAI--------WFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKIL 80 (218) Q Consensus 9 ~~~~~~~~~~~~yk~ii~~~i~~i~~~~~--------~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~ 80 (218) +.-++||--.++=.-+.+-+-++-.|..+ .-.-..+-++.|.++.-.-..-.+.++.-+..+.+-....++. T Consensus 446 eiye~lKe~l~nD~a~~geAa~~gMGl~mLgt~s~eai~dm~tya~ETqhe~i~Rglgig~aLi~ygrqe~add~I~ell 525 (926) T COG5116 446 EIYEKLKELLKNDRALLGEAAVYGMGLLMLGTWSVEAIEDMRTYAGETQHERIKRGLGIGFALILYGRQEMADDYINELL 525 (926) T ss_pred HHHHHHHHHHHCCHHHHHHHHHHCCCEEEECCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999973232566666664251134367878999999987041156667764201125765534988889999985 Q ss_pred HCCCCCHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHH Q ss_conf 12552069999999999995110057789999998731268957899999999996302699899999975311464235 Q gi|254780421|r 81 SQDNKLYNPLSNMYIASILVAKGDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQKLSEPSNPM 160 (218) Q Consensus 81 ~~~~~~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~~~~~~~~~ 160 (218) .+.++....-..|-+|-++..-|+..--..++--.+.|.. +.. |..|.+-++.+. -.+++-....++.+.....++ T Consensus 526 ~d~ds~lRy~G~fs~alAy~GTgn~~vv~~lLh~avsD~n-DDV-rRAAViAlGfvc--~~D~~~lv~tvelLs~shN~h 601 (926) T COG5116 526 YDKDSILRYNGVFSLALAYVGTGNLGVVSTLLHYAVSDGN-DDV-RRAAVIALGFVC--CDDRDLLVGTVELLSESHNFH 601 (926) T ss_pred CCCHHHHHHCCHHHHHHHHHCCCCCHHHHHHHEEECCCCC-HHH-HHHHHHHEEEEE--ECCCCHHHHHHHHHHHCCCHH T ss_conf 3750776631188887888557850567665223113676-088-888878400267--448404567889865414645 Q ss_pred -HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf -6899999999999889989999999999835589989999999999950 Q gi|254780421|r 161 -HQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANI 209 (218) Q Consensus 161 -~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l 209 (218) +.-....+|.+..-.|+. .|...++.+..+ +.+-+||-+.+-++.| T Consensus 602 VR~g~AvaLGiacag~G~~-~a~diL~~L~~D--~~dfVRQ~AmIa~~mI 648 (926) T COG5116 602 VRAGVAVALGIACAGTGDK-VATDILEALMYD--TNDFVRQSAMIAVGMI 648 (926) T ss_pred HHHHHHHHHHHHHCCCCCH-HHHHHHHHHHHC--CHHHHHHHHHHHHHHH T ss_conf 4243687755564388618-899999988508--1899988879998888 No 267 >PRK09687 hypothetical protein; Provisional Probab=23.80 E-value=42 Score=12.84 Aligned_cols=131 Identities=17% Similarity=0.073 Sum_probs=54.2 Q ss_pred HHHHHHHHHHH-HCCCCCHHHHHHHHHHHHHHHCCH-HHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHH Q ss_conf 66999999973-125520699999999999951100-5778999999873126895789999999999630269989999 Q gi|254780421|r 70 DDARSSFEKIL-SQDNKLYNPLSNMYIASILVAKGD-TKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEIS 147 (218) Q Consensus 70 ~~A~~~l~~l~-~~~~~~ya~lA~l~lA~~~~~~g~-~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~ 147 (218) +.++..+..+. ++.++....-|-.-++.....+-- ...++..+..+..+++. ..|.-|.+-++.+ .-+.++ T Consensus 89 ~~ai~~L~~lll~D~~~~VR~sAa~ALG~~~s~~~~~~~~~~~~l~~~~~d~d~--~VR~~AA~ALg~i-----~se~AI 161 (280) T PRK09687 89 DQSFNILLNLLLNDKSANVRATAINATGHRCKKNPIYSPKIVEQSQLTAFDKSV--NVRRATAFALSVI-----NNELAI 161 (280) T ss_pred HHHHHHHHHHHHCCCCHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHCCCH--HHHHHHHHHHHHC-----CCHHHH T ss_conf 777999999987099778999999998743134402113478898998626652--1989999999703-----747689 Q ss_pred HHHHHHCCCCCH-HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHCCC Q ss_conf 997531146423-56899999999999889989999999999835589989999999999950034 Q gi|254780421|r 148 KILQKLSEPSNP-MHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPFGISTRSQMILANIIAS 212 (218) Q Consensus 148 ~~l~~~~~~~~~-~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~~~r~ra~~~l~~l~~~ 212 (218) ..|-.+..+..+ -+..|...+|.+- .+.+++++.+-+.++| ..+.+|..|-.-|+.++.. T Consensus 162 ~~Li~lL~D~d~~VR~~Aa~ALg~i~---~~s~~~~~~Li~~L~D--~d~~VR~eAa~aLg~~~~e 222 (280) T PRK09687 162 PLLINLLKDPNGDVRNWAAFGLNINE---YDSPDIRDAFVEMLSD--KNEEIRGEAIIGLAYRKDK 222 (280) T ss_pred HHHHHHHHCCCHHHHHHHHHHHHCCC---CCCHHHHHHHHHHHCC--CCHHHHHHHHHHHHCCCCH T ss_conf 99999973798779999999976144---6876689999998559--9777899999998534747 No 268 >PRK10209 acid-resistance membrane protein; Provisional Probab=23.57 E-value=43 Score=12.81 Aligned_cols=36 Identities=17% Similarity=0.068 Sum_probs=21.7 Q ss_pred CCHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 4112458899-99999975199999999999999999 Q gi|254780421|r 4 NNDANSYKGK-CKMGKCCYKWIAPLLILMILSLAIWF 39 (218) Q Consensus 4 ~~~~~~~~~~-~~~~~~~yk~ii~~~i~~i~~~~~~~ 39 (218) |-|.++.+.= -.+-|+..||.++..++++++++... T Consensus 3 ~~~~~~l~~~~~~~lkk~~~~~~i~gvLl~l~Gi~ai 39 (190) T PRK10209 3 NIDKATLLKFDEEMLKKQRRAIQIIAVLLFIVGLLCL 39 (190) T ss_pred CCCHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 7678999756999999717999999999999999999 No 269 >TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily; InterPro: IPR014345 Members of this entry belong to the family of polysaccharide chain length determinant proteins. They are found in species that encode the PEP-CTERM/exosortase system and are predicted to act in protein sorting in a number of Gram-negative bacteria, they are also found near the epsH homologue that is the putative exosortase gene.. Probab=23.55 E-value=43 Score=12.81 Aligned_cols=95 Identities=12% Similarity=0.172 Sum_probs=44.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHH-----CCCHHHHHHHHHHHHHCCCC Q ss_conf 9999997519999999999999999999999999999999999999---999841-----38766699999997312552 Q gi|254780421|r 14 CKMGKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQ---ALELFN-----SNKLDDARSSFEKILSQDNK 85 (218) Q Consensus 14 ~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~---al~~~~-----~~~~~~A~~~l~~l~~~~~~ 85 (218) +-+|+..|=+++++.+|.++|++..+.+=+.|+ ..|..|.. .+...- +-+.|..+..+..-+=+-++ T Consensus 8 ~~iw~~rw~~~~~Aw~V~~~gw~~v~~lP~~Y~-----a~a~VYVdTqsvlrpLl~G~a~~P~~~q~iril~~~LlSRp~ 82 (510) T TIGR03007 8 KGIWRRRWLFVAVAWVVMIVGWVVVYVLPDRYE-----ASARVYVDTQSVLRPLLKGLAVTPNVDQKIRILSRTLLSRPN 82 (510) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHEECCCCCE-----EEEEEEEECCCHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHH T ss_conf 999999999999999999987671214067401-----436888843201656416673438678898887767443457 Q ss_pred CHHHHHHHHHHHHHHHCCHHHHHHHHHHHHH Q ss_conf 0699999999999951100577899999987 Q gi|254780421|r 86 LYNPLSNMYIASILVAKGDTKNAAEIFLKVA 116 (218) Q Consensus 86 ~ya~lA~l~lA~~~~~~g~~~~A~~~l~~i~ 116 (218) ..-..+. ..+-....+.++-......+- T Consensus 83 -l~~V~r~--~DlD~~~~~pa~~E~li~~l~ 110 (510) T TIGR03007 83 -LEKVIRM--LDLDVGAKSPAQLEALITKLQ 110 (510) T ss_pred -HHHHHHH--HCCCCCCCCHHHHHHHHHHHH T ss_conf -8999988--526777699899999999876 No 270 >TIGR03545 conserved hypothetical protein TIGR03545. This model represents a relatively rare but broadly distributed uncharacterized protein family, distributed in 1-2 percent of bacterial genomes, all of which have outer membranes. In many of these genomes, it is part of a two-gene pair. Probab=23.42 E-value=43 Score=12.79 Aligned_cols=25 Identities=20% Similarity=0.371 Sum_probs=9.8 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 689999999999988998999999999 Q gi|254780421|r 161 HQFANEILGISALKFGKVQKAKTIFEE 187 (218) Q Consensus 161 ~~~a~ellg~~~~~~Gd~~~A~~~y~~ 187 (218) ..+...+.|+-.. +-.+.+..+|+. T Consensus 279 ~~l~~~L~g~~i~--~~~~~~~~~y~~ 303 (554) T TIGR03545 279 KNFAVDLFGEEIR--RYLQTAYDYYDQ 303 (554) T ss_pred HHHHHHHHHHHHH--HHHHHHHHHHHH T ss_conf 9999999869999--999999999998 No 271 >TIGR02830 spore_III_AG stage III sporulation protein AG; InterPro: IPR014195 This entry represents the stage III sporulation protein AG, which is encoded in a spore formation operon: spoIIIAABCDEFGH under the control of sigma G . A comparative genome analysis of all sequenced genomes of Firmicutes shows that the proteins are strictly conserved among the sub-set of endospore-forming species.. Probab=23.39 E-value=41 Score=12.92 Aligned_cols=25 Identities=20% Similarity=0.323 Sum_probs=15.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 7519999999999999999999999 Q gi|254780421|r 20 CYKWIAPLLILMILSLAIWFYLFDG 44 (218) Q Consensus 20 ~yk~ii~~~i~~i~~~~~~~~~~~~ 44 (218) .++|+++.++++++..+...+|... T Consensus 2 ~~~~l~~L~liG~~ll~~s~~F~~~ 26 (193) T TIGR02830 2 KLKYLLVLLLIGLLLLLVSSFFSSS 26 (193) T ss_pred CCCCEEHHHHHHHHHHHHHHHCCCC T ss_conf 6301442478999999998761785 No 272 >pfam12575 DUF3753 Protein of unknown function (DUF3753). This family of proteins is found in viruses. Proteins in this family are approximately 70 amino acids in length. There is a conserved YLK sequence motif. There are two completely conserved residues (D and F) that may be functionally important. Probab=23.18 E-value=43 Score=12.76 Aligned_cols=17 Identities=12% Similarity=0.286 Sum_probs=6.4 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254780421|r 23 WIAPLLILMILSLAIWF 39 (218) Q Consensus 23 ~ii~~~i~~i~~~~~~~ 39 (218) ++|+++++++++.+..+ T Consensus 50 ~lii~ii~v~ii~ll~f 66 (72) T pfam12575 50 ILIITIIAVVIIVLLTF 66 (72) T ss_pred HHHHHHHHHHHHHHHHH T ss_conf 99999999999999999 No 273 >TIGR02889 spore_YpeB germination protein YpeB; InterPro: IPR014239 Proteins in this entry include the sporulation protein YpeB from Bacillus subtilis. YpeB and the putative spore-cortex-lytic enzyme SleB are required for normal germination. These proteins appear to be restricted to endospore-forming species in the Firmicutes lineage of bacteria, and have been found in all such species to date except Clostridium perfringens. The matching phenotypes of mutants in SleB (called a lytic transglycosylase) and YpeB suggests that YpeB is necessary for SleB to function.. Probab=23.01 E-value=44 Score=12.74 Aligned_cols=37 Identities=14% Similarity=0.185 Sum_probs=16.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHH Q ss_conf 999999999999999999999999999--99999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEKKK--NIVGENFAQAL 61 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~~~--~~as~~y~~al 61 (218) +.++++.++|++++.|- ++..+++.. ..+-..|+.++ T Consensus 3 ~~~a~~~~~vv~~~~Wg-y~~~~ek~~l~~~L~~~Y~R~F 41 (465) T TIGR02889 3 ILTAFVAIAVVGLGAWG-YLQTQEKNDLRNYLQAQYQRAF 41 (465) T ss_pred HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHH T ss_conf 88888999999999999-8776788999999852221679 No 274 >TIGR00380 cobD cobalamin biosynthesis protein CobD; InterPro: IPR004485 Cobalamin (vitamin B12) is a structurally complex cofactor, consisting of a modified tetrapyrrole with a centrally chelated cobalt. Cobalamin is usually found in one of two biologically active forms: methylcobalamin and adocobalamin. Most prokaryotes, as well as animals, have cobalamin-dependent enzymes, whereas plants and fungi do not appear to use it. In bacteria and archaea, these include methionine synthase, ribonucleotide reductase, glutamate and methylmalonyl-CoA mutases, ethanolamine ammonia lyase, and diol dehydratase . In mammals, cobalamin is obtained through the diet, and is required for methionine synthase and methylmalonyl-CoA mutase . There are at least two distinct cobalamin biosynthetic pathways in bacteria : Aerobic pathway that requires oxygen and in which cobalt is inserted late in the pathway ; found in Pseudomonas denitrificans and Rhodobacter capsulatus. Anaerobic pathway in which cobalt insertion is the first committed step towards cobalamin synthesis ; found in Salmonella typhimurium, Bacillus megaterium, and Propionibacterium freudenreichii shermanii. Either pathway can be divided into two parts: (1) corrin ring synthesis (differs in aerobic and anaerobic pathways) and (2) adenosylation of corrin ring, attachment of aminopropanol arm, and assembly of the nucleotide loop (common to both pathways) . There are about 30 enzymes involved in either pathway, where those involved in the aerobic pathway are prefixed Cob and those of the anaerobic pathway Cbi. Several of these enzymes are pathway-specific: CbiD, CbiG, and CbiK are specific to the anaerobic route of S. typhimurium, whereas CobE, CobF, CobG, CobN, CobS, CobT, and CobW are unique to the aerobic pathway of P. denitrificans. This entry represents the CbiB protein, which is involved in cobalamin biosynthesis and porphyrin biosynthesis. It converts cobyric acid to cobinamide by the addition of aminopropanol on the F carboxylic group. It is part of the cob operon .; GO: 0009236 cobalamin biosynthetic process, 0016021 integral to membrane. Probab=22.98 E-value=44 Score=12.74 Aligned_cols=147 Identities=19% Similarity=0.186 Sum_probs=64.2 Q ss_pred HHHHHCCCHHHHHHHHHHHHHCCCCCHHHHHHHHHHHH-HHHCCHHHHHHHHH-HH---HHHHCCCCH------HHHHHH Q ss_conf 99841387666999999973125520699999999999-95110057789999-99---873126895------789999 Q gi|254780421|r 61 LELFNSNKLDDARSSFEKILSQDNKLYNPLSNMYIASI-LVAKGDTKNAAEIF-LK---VANDDLAPL------AVRYAA 129 (218) Q Consensus 61 l~~~~~~~~~~A~~~l~~l~~~~~~~ya~lA~l~lA~~-~~~~g~~~~A~~~l-~~---i~~~~~~~~------~~~~lA 129 (218) ++...+||.+.|.+.++-+.....+..+.-- +.-|.+ -..++=.|.-+.-+ -. ++ =..+- .||.+- T Consensus 111 ~E~~k~GDle~AR~~l~~~VSRdt~~Ls~e~-i~sA~vESlaEN~vDgv~apLFY~llGil--fGl~GPWPlA~~Yravn 187 (322) T TIGR00380 111 IESLKEGDLEDARKKLQMIVSRDTEELSEEQ-ILSAAVESLAENIVDGVTAPLFYALLGIL--FGLPGPWPLAFVYRAVN 187 (322) T ss_pred CCCCCCCCCHHHHHHHHHHHCCCCCCCCHHH-HHHHHHHHHHHCHHHHHHHHHHHHHHHHH--CCCCCCHHHHHHHHHHH T ss_conf 6665579826789998764224755345356-77788655330113333789999999986--47877147999999997 Q ss_pred HHHHHHHHHCCCCH---HHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCCH-HHHHHHHHH Q ss_conf 99999963026998---99999975311464235689999999999988998999999999983558998-999999999 Q gi|254780421|r 130 TLQAASILVDTSSY---EEISKILQKLSEPSNPMHQFANEILGISALKFGKVQKAKTIFEELAKDNNSPF-GISTRSQMI 205 (218) Q Consensus 130 ~l~la~~~~~~~~~---d~a~~~l~~~~~~~~~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~~~~~p~-~~r~ra~~~ 205 (218) ++- |++-...++| -..-.||+++.+-= |=+..---+.--+..-.|+..+|-..+.+=....++|+ +. ++-. T Consensus 188 TLD-AMvGY~~~~Y~~~GkfaARLdDilnyi-PsRl~G~Llv~~l~~l~~~~k~a~~~~lk~~~~~PSPNSG~---~~A~ 262 (322) T TIGR00380 188 TLD-AMVGYKKEKYEDLGKFAARLDDILNYI-PSRLTGVLLVIALLFLEGKKKGAFAILLKDARKVPSPNSGY---TMAA 262 (322) T ss_pred HHH-HHHCCCCCCCCCCCCCHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCCCCHHHHHHHHCCCCCCCCCCHH---HHHH T ss_conf 753-210246764310043011278898755-67899999999999835881124322211068893798668---9999 Q ss_pred HHHHCCCHHH Q ss_conf 9950034032 Q gi|254780421|r 206 LANIIASDQR 215 (218) Q Consensus 206 l~~l~~~~~~ 215 (218) ++.+.++... T Consensus 263 ~A~~L~v~Le 272 (322) T TIGR00380 263 LAAALGVQLE 272 (322) T ss_pred HHHHCCCEEC T ss_conf 9987063106 No 275 >COG1704 LemA Uncharacterized conserved protein [Function unknown] Probab=22.98 E-value=44 Score=12.74 Aligned_cols=29 Identities=14% Similarity=0.165 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHCCCHHHHHHHHHHHHHCC Q ss_conf 99999999998899899999999998355 Q gi|254780421|r 164 ANEILGISALKFGKVQKAKTIFEELAKDN 192 (218) Q Consensus 164 a~ellg~~~~~~Gd~~~A~~~y~~i~~~~ 192 (218) .+++...+-.-++...-||..|.....+- T Consensus 120 f~~Lq~ql~~tEn~Ia~aR~~YN~av~~y 148 (185) T COG1704 120 FLELQSQLEGTENRIAVARRLYNEAVRDY 148 (185) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999998857899999999999999999 No 276 >PRK13184 pknD serine/threonine-protein kinase; Reviewed Probab=22.94 E-value=44 Score=12.73 Aligned_cols=95 Identities=14% Similarity=0.048 Sum_probs=47.7 Q ss_pred CHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHH-HCCCC-CHHHHHHHHHHHHHHHHCCCHHH Q ss_conf 00577899999987312689578999999999963026998999999753-11464-23568999999999998899899 Q gi|254780421|r 103 GDTKNAAEIFLKVANDDLAPLAVRYAATLQAASILVDTSSYEEISKILQK-LSEPS-NPMHQFANEILGISALKFGKVQK 180 (218) Q Consensus 103 g~~~~A~~~l~~i~~~~~~~~~~~~lA~l~la~~~~~~~~~d~a~~~l~~-~~~~~-~~~~~~a~ellg~~~~~~Gd~~~ 180 (218) +..+.-+..++. +.+...+..+...-.-.....++=..+..++-+.|.. ...+. ..-...+..+-|-...-.++..- T Consensus 781 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~i~~~l~~~~~~~~~~ll~~~~~~~~~~~~~s~~~~lygc~l~~t~~~~~ 859 (933) T PRK13184 781 EEGESIIDALQL-IYDPVSEEEYHDHLLVYEIQAHLWNRDLEKAYKLLNDNYPEDEELDEYSEAFVLYGCYLALTGDRSA 859 (933) T ss_pred HHHHHHHHHHHH-HHHCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHCCCCHHHHHHHHHHHHCCCHHH T ss_conf 347689999999-8741365554314014446888863038999988860188767410133487787668765387378 Q ss_pred HHHHHHHHHHCCCCCHHH Q ss_conf 999999998355899899 Q gi|254780421|r 181 AKTIFEELAKDNNSPFGI 198 (218) Q Consensus 181 A~~~y~~i~~~~~~p~~~ 198 (218) |..+|....++.-.|..+ T Consensus 860 a~~hfs~~r~~~~~~~~~ 877 (933) T PRK13184 860 AKAHFSRCREDALIPKCL 877 (933) T ss_pred HHHHHHHCCCCCCCCHHH T ss_conf 999874304445453344 No 277 >pfam09716 ETRAMP Malarial early transcribed membrane protein (ETRAMP). These sequences represent a family of proteins from the malaria parasite Plasmodium falciparum, several of which have been shown to be expressed specifically in the ring stage as well as the rodent parasite Plasmodium yoelii. A homologue from Plasmodium chabaudi was localized to the parasitophorous vacuole membrane. Members have an initial hydrophobic, Phe/Tyr-rich, stretch long enough to span the membrane, a highly charged region rich in Lys, a second putative transmembrane region and a second highly charged, low complexity sequence region. Some members have up to 100 residues of additional C-terminal sequence. These genes have been shown to be found in the sub-telomeric regions of both Plasmodium falciparum and P. yoelii chromosomes. Probab=22.93 E-value=44 Score=12.73 Aligned_cols=17 Identities=6% Similarity=0.167 Sum_probs=6.3 Q ss_pred HHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999 Q gi|254780421|r 28 LILMILSLAIWFYLFDG 44 (218) Q Consensus 28 ~i~~i~~~~~~~~~~~~ 44 (218) .+.+.++.++|+.+|.+ T Consensus 62 ~lalli~~~~G~g~y~k 78 (84) T pfam09716 62 GLALLIGTALGLGYYKK 78 (84) T ss_pred HHHHHHHHHHHHHHHHC T ss_conf 99999998888765412 No 278 >pfam03097 BRO1 BRO1-like domain. This domain is found in a number proteins including Rhophilin and BRO1. It is known to have a role in endosomal targeting. ESCRT-III subunit Snf7 binds to a conserved hydrophobic patch in the BRO1 domain that is required for protein complex formation and for the protein-sorting function of BRO1. Probab=22.78 E-value=44 Score=12.71 Aligned_cols=32 Identities=9% Similarity=0.000 Sum_probs=24.8 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 35689999999999988998999999999983 Q gi|254780421|r 159 PMHQFANEILGISALKFGKVQKAKTIFEELAK 190 (218) Q Consensus 159 ~~~~~a~ellg~~~~~~Gd~~~A~~~y~~i~~ 190 (218) .|.+.++...|......+++-+++..++.+.. T Consensus 236 ~f~a~A~y~~a~~~~e~~~~Ge~ia~L~~A~~ 267 (374) T pfam03097 236 YFKALAHYYHALALEEKKKYGEAIARLQKALS 267 (374) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH T ss_conf 99999999999998988647699999999999 No 279 >pfam09911 DUF2140 Uncharacterized protein conserved in bacteria (DUF2140). This domain, found in various hypothetical prokaryotic proteins, has no known function. Probab=22.73 E-value=23 Score=14.47 Aligned_cols=26 Identities=15% Similarity=0.180 Sum_probs=17.2 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 75199999999999999999999999 Q gi|254780421|r 20 CYKWIAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 20 ~yk~ii~~~i~~i~~~~~~~~~~~~~ 45 (218) .|||..++++.++++++++...+-.. T Consensus 2 ~WKwaF~~Llal~l~~~~~i~~~~~~ 27 (187) T pfam09911 2 KWKWAFFVLLALNLASIAVVFFRLFA 27 (187) T ss_pred HHHHHHHHHHHHHHHHHHEEEEEEEC T ss_conf 08999999999999997126789974 No 280 >pfam07423 DUF1510 Protein of unknown function (DUF1510). This family consists of several hypothetical bacterial proteins of around 200 residues in length. The function of this family is unknown. Probab=22.59 E-value=45 Score=12.69 Aligned_cols=30 Identities=13% Similarity=0.212 Sum_probs=12.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5889999999975199999999999999999999 Q gi|254780421|r 9 SYKGKCKMGKCCYKWIAPLLILMILSLAIWFYLF 42 (218) Q Consensus 9 ~~~~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~ 42 (218) +.+-|.+++.+ |.|++|+++++++++..|. T Consensus 7 ~k~Rk~n~vLN----iaI~iV~llIiiva~~lf~ 36 (214) T pfam07423 7 QKRRKINRVLN----IAIGIVVVLIIIVAYQLFF 36 (214) T ss_pred HHHHHHHHHHH----HHHHHHHHHHHHHHHHEEE T ss_conf 77764534557----9999999999997664023 No 281 >pfam09782 NDUF_B6 NADH:ubiquinone oxidoreductase, NDUFB6/B17 subunit. Members of this family mediate the transfer of electrons from NADH to the respiratory chain. The immediate electron acceptor for the enzyme is believed to be ubiquinone, the reaction that occurs being: NADH + ubiquinone = NAD(+) + ubiquinol. Probab=22.39 E-value=26 Score=14.10 Aligned_cols=29 Identities=21% Similarity=0.343 Sum_probs=17.9 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 97519999999999999999999999999 Q gi|254780421|r 19 CCYKWIAPLLILMILSLAIWFYLFDGSHE 47 (218) Q Consensus 19 ~~yk~ii~~~i~~i~~~~~~~~~~~~~q~ 47 (218) ..|.+.++-+..+++++.+++||+.|++. T Consensus 83 ~~~R~~~~kl~~~l~~~~~~~YY~KY~~~ 111 (152) T pfam09782 83 KAYRFWVFKLLLVLVPVWIGHYYFKYHVN 111 (152) T ss_pred EEEEEEHHHHHHHHHHHHHHHHHEEEECC T ss_conf 67763078999999999996641255336 No 282 >TIGR02831 spo_II_M stage II sporulation protein M; InterPro: IPR014196 This entry represents the stage II sporulation protein M, which is encoded in a spore formation operon . SpoIIM is one of the three genes (spoIID, spoIIM and spoIIP, , , ), under the control of sigma E, that have been shown to be essential for the engulfment of the forespore by the mother cell. Their products are involved in degradation of the septal peptidoglycan and mutations in spoIID, spoIIM or spoIIP block sporulation at morphological stage II, prior to the stage of engulfment. These three genes are absolutely conserved (sometimes even duplicated) in all endospore formers . . Probab=22.06 E-value=46 Score=12.62 Aligned_cols=65 Identities=9% Similarity=0.040 Sum_probs=47.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHH Q ss_conf 99975199999999999999999999999999999999999999998413876669999999731 Q gi|254780421|r 17 GKCCYKWIAPLLILMILSLAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFEKILS 81 (218) Q Consensus 17 ~~~~yk~ii~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~~l~~ 81 (218) .|-+.--++...+.+.+|.+.|-+.-++-+..+.++....+.+......+++.-.....|.+-.. T Consensus 7 ~~en~~lY~~~~~lf~~G~vfGa~~Vns~~~~~k~dL~~Yl~~F~~~~~~g~~~~~~~~l~qsl~ 71 (201) T TIGR02831 7 LKENFWLYIFVLLLFLVGIVFGAYTVNSMSIVQKEDLYFYLSNFFGQLSGGESASYTEILRQSLF 71 (201) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHH T ss_conf 87416999999999999999999998720077699999999999886417944373026999999 No 283 >COG2959 HemX Uncharacterized enzyme of heme biosynthesis [Coenzyme metabolism] Probab=21.82 E-value=46 Score=12.59 Aligned_cols=21 Identities=10% Similarity=-0.118 Sum_probs=10.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHH Q ss_conf 999999999999999999999 Q gi|254780421|r 32 ILSLAIWFYLFDGSHEKKKNI 52 (218) Q Consensus 32 i~~~~~~~~~~~~~q~~~~~~ 52 (218) +..++|||+||+.++....++ T Consensus 44 LgLGagg~~f~QqQ~~~~~~~ 64 (391) T COG2959 44 LGLGAGGYYFGQQQNVLQTQE 64 (391) T ss_pred HHHCHHHHHHHHHHHHHHHHH T ss_conf 984405788889999998899 No 284 >PRK04335 cell division protein ZipA; Provisional Probab=21.62 E-value=43 Score=12.81 Aligned_cols=23 Identities=17% Similarity=0.052 Sum_probs=12.7 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999 Q gi|254780421|r 24 IAPLLILMILSLAIWFYLFDGSHEK 48 (218) Q Consensus 24 ii~~~i~~i~~~~~~~~~~~~~q~~ 48 (218) ||+|+|+++.+++-| +|.+++++ T Consensus 9 IIvGaIAIiALL~HG--lWtsRKEr 31 (319) T PRK04335 9 IVVGALAIAALLFHG--LWTSKKEG 31 (319) T ss_pred HHHHHHHHHHHHHHC--CCCCCCCC T ss_conf 999999999999840--11004455 No 285 >pfam11743 DUF3301 Protein of unknown function (DUF3301). This family is conserved in Proteobacteria, but the function is not known. Probab=21.49 E-value=47 Score=12.55 Aligned_cols=23 Identities=0% Similarity=0.055 Sum_probs=10.7 Q ss_pred HHHHHHHHHHHHHHH-HHHHHHHH Q ss_conf 999999999999999-99999999 Q gi|254780421|r 35 LAIWFYLFDGSHEKK-KNIVGENF 57 (218) Q Consensus 35 ~~~~~~~~~~~q~~~-~~~as~~y 57 (218) ++++|+||+..+.+. +...+..+ T Consensus 9 ~~~~~~ww~~~~~~E~A~~~a~~~ 32 (97) T pfam11743 9 AFVAALWWQQRKQRERALQAARRY 32 (97) T ss_pred HHHHHHHHHCCCHHHHHHHHHHHH T ss_conf 999999997220779999999999 No 286 >pfam06936 Selenoprotein_S Selenoprotein S (SelS). This family consists of several mammalian selenoprotein S (SelS) sequences. SelS is a plasma membrane protein and is present in a variety of tissues and cell types. The function of this family is unknown. Probab=21.47 E-value=47 Score=12.55 Aligned_cols=28 Identities=18% Similarity=-0.009 Sum_probs=19.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5889999999975199999999999999 Q gi|254780421|r 9 SYKGKCKMGKCCYKWIAPLLILMILSLA 36 (218) Q Consensus 9 ~~~~~~~~~~~~yk~ii~~~i~~i~~~~ 36 (218) .+.--|-.|...|+|+|+.++|++..++ T Consensus 24 ~l~~tvg~~l~~YGWyiL~~~v~ly~l~ 51 (190) T pfam06936 24 FLHFTVGELLSKYGWYLLLGCVGIYLLI 51 (190) T ss_pred HHHHHHHHHHHHCCHHHHHHHHHHHHHH T ss_conf 9999999999963779999999999999 No 287 >PRK01470 tatA twin arginine translocase protein A; Provisional Probab=21.32 E-value=31 Score=13.62 Aligned_cols=15 Identities=27% Similarity=0.467 Sum_probs=7.2 Q ss_pred HHHHHHHHHHHHHHH Q ss_conf 975199999999999 Q gi|254780421|r 19 CCYKWIAPLLILMIL 33 (218) Q Consensus 19 ~~yk~ii~~~i~~i~ 33 (218) ..+-|+|+++|++++ T Consensus 4 S~w~~~iV~~iv~~L 18 (53) T PRK01470 4 SFSHLLIVLLIIFVL 18 (53) T ss_pred CHHHHHHHHHHHHHH T ss_conf 399999999999999 No 288 >COG3005 TorC Nitrate/TMAO reductases, membrane-bound tetraheme cytochrome c subunit [Energy production and conversion] Probab=21.18 E-value=48 Score=12.51 Aligned_cols=29 Identities=14% Similarity=0.298 Sum_probs=12.3 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 99999999999999999999999999999 Q gi|254780421|r 29 ILMILSLAIWFYLFDGSHEKKKNIVGENF 57 (218) Q Consensus 29 i~~i~~~~~~~~~~~~~q~~~~~~as~~y 57 (218) +++++++++|+.+|...+.--....++.| T Consensus 19 ~l~~~gfv~G~~~w~~~~~~~~~tnt~eF 47 (190) T COG3005 19 TLLLIGFVVGILFWGGFNVGLELTNTEEF 47 (190) T ss_pred HHHHHHHHHHHEEHHHHHHHHHHCCCCHH T ss_conf 99999888741010102578874687277 No 289 >COG5353 Uncharacterized protein conserved in bacteria [Function unknown] Probab=20.92 E-value=46 Score=12.60 Aligned_cols=24 Identities=21% Similarity=0.407 Sum_probs=10.0 Q ss_pred HHHHH-HHHHHHHHHHHHHHHHHHH Q ss_conf 19999-9999999999999999999 Q gi|254780421|r 22 KWIAP-LLILMILSLAIWFYLFDGS 45 (218) Q Consensus 22 k~ii~-~~i~~i~~~~~~~~~~~~~ 45 (218) .||++ .++++++.+.+..+||... T Consensus 7 ~~i~ii~viflai~~s~~~~~~~s~ 31 (161) T COG5353 7 IIIIIILVIFLAIILSIALFFWKSM 31 (161) T ss_pred EEEHHHHHHHHHHHHHHHHHHHHHC T ss_conf 6224688999999999999875751 No 290 >TIGR01655 yxeA_fam conserved hypothetical protein; InterPro: IPR006542 These are a family of small (about 115 amino acids) uncharacterised proteins with N-terminal signal sequences, found exclusively in Gram-positive organisms. Most genomes that have any members of this family have at least two members.. Probab=20.90 E-value=48 Score=12.48 Aligned_cols=25 Identities=24% Similarity=0.191 Sum_probs=14.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 5199999999999999999999999 Q gi|254780421|r 21 YKWIAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 21 yk~ii~~~i~~i~~~~~~~~~~~~~ 45 (218) +|.++++++++|++++.....+-+. T Consensus 1 MKk~~~~Lla~i~~~~i~~~~~~~~ 25 (121) T TIGR01655 1 MKKVLAGLLALIVVIIIVGLLFFHP 25 (121) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHCC T ss_conf 9355688999999999988766403 No 291 >pfam06916 DUF1279 Protein of unknown function (DUF1279). This family represents the C-terminus (approx. 120 residues) of a number of eukaryotic proteins of unknown function. Probab=20.86 E-value=47 Score=12.55 Aligned_cols=34 Identities=15% Similarity=-0.083 Sum_probs=21.4 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999975199999999999999999999999 Q gi|254780421|r 12 GKCKMGKCCYKWIAPLLILMILSLAIWFYLFDGS 45 (218) Q Consensus 12 ~~~~~~~~~yk~ii~~~i~~i~~~~~~~~~~~~~ 45 (218) +|+|..-+.|+|+.+++-+.+-....+..|.--+ T Consensus 1 qr~K~l~k~YG~~al~vh~~~s~~~~~~~y~~v~ 34 (88) T pfam06916 1 QRFKKLFKKYGKVALGVHFSISLASFGIFYLLVS 34 (88) T ss_pred CHHHHHHHHHCEEEEEEHHHHHHHHHHHHHHHHH T ss_conf 9278999984938212035589999999999997 No 292 >TIGR02162 torC trimethylamine-N-oxide reductase c-type cytochrome TorC; InterPro: IPR009154 This family includes TorC, a pentahemic c-type cytochrome subunit of periplasmic reductases for trimethylamine-N-oxide (TMAO). The N-terminal half is closely related to tetrahemic NapC (or NirT) subunits of periplasmic nitrate (or nitrite) reductases; some species have both TMAO and nitrate reductase complexes .; GO: 0005506 iron ion binding, 0009055 electron carrier activity, 0020037 heme binding, 0006118 electron transport, 0009276 1-2nm peptidoglycan-based cell wall, 0016021 integral to membrane. Probab=20.86 E-value=31 Score=13.63 Aligned_cols=43 Identities=19% Similarity=0.194 Sum_probs=19.2 Q ss_pred HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999975199--999999999999999999999999999999999 Q gi|254780421|r 15 KMGKCCYKWI--APLLILMILSLAIWFYLFDGSHEKKKNIVGENF 57 (218) Q Consensus 15 ~~~~~~yk~i--i~~~i~~i~~~~~~~~~~~~~q~~~~~~as~~y 57 (218) |||+...|.- |..+|++++|+++|.-+|+....--..+.++.| T Consensus 2 ~~w~~~~rPS~k~slGvL~~~G~iaG~~~w~~F~~~~~~T~T~EF 46 (394) T TIGR02162 2 KIWKAFTRPSAKISLGVLVVIGFIAGAILWQGFNVVLEKTSTEEF 46 (394) T ss_pred CHHHHHHCHHHHHHHHHHHHHHHHEEEEEECCCCCCEECCCCCCE T ss_conf 113444040588999999985053021245343110025665203 No 293 >COG3197 FixS Uncharacterized protein, possibly involved in nitrogen fixation [Inorganic ion transport and metabolism] Probab=20.85 E-value=48 Score=12.47 Aligned_cols=12 Identities=8% Similarity=0.130 Sum_probs=4.4 Q ss_pred HHHHHHHHHHHH Q ss_conf 999999999999 Q gi|254780421|r 33 LSLAIWFYLFDG 44 (218) Q Consensus 33 ~~~~~~~~~~~~ 44 (218) ++++.+.++|-- T Consensus 15 ~~v~l~~flWav 26 (58) T COG3197 15 GAVGLGAFLWAV 26 (58) T ss_pred HHHHHHHHHHHC T ss_conf 999999999851 No 294 >pfam02116 STE2 Fungal pheromone mating factor STE2 GPCR. Probab=20.70 E-value=49 Score=12.45 Aligned_cols=16 Identities=13% Similarity=0.156 Sum_probs=6.9 Q ss_pred HHHHHHHHHHHHHHHH Q ss_conf 9975199999999999 Q gi|254780421|r 18 KCCYKWIAPLLILMIL 33 (218) Q Consensus 18 ~~~yk~ii~~~i~~i~ 33 (218) -.+|+--+++.++.++ T Consensus 36 ~I~yG~qiGA~~illi 51 (283) T pfam02116 36 AINYGAQIGASLLLLI 51 (283) T ss_pred HHHHHHHHHHHHHHHH T ss_conf 9999989999999999 No 295 >TIGR02902 spore_lonB ATP-dependent protease LonB; InterPro: IPR014251 This entry represents LonB, a paralog of the ATP-dependent protease La (LonA, IPR004815 from INTERPRO). LonB proteins are unassigned peptidases belonging to the MEROPS peptidase family S16 (lon protease family, clan SJ) and are found strictly, and almost universally, in endospore-forming bacteria. This protease was shown, in Bacillus subtilis, to be expressed specifically in the forespore during sporulation, under control of sigmaF . The lonB gene, despite being located immediately upstream of lonA, was shown to be monocistronic. LonB appears to be involved in the post-translation control of sigmaH, but lonB mutation did not produce an obvious sporulation defect under the conditions tested . Note that additional paralogs of LonA and LonB occur in the Clostridium lineage and these are excluded from this entry. . Probab=20.68 E-value=23 Score=14.51 Aligned_cols=28 Identities=18% Similarity=0.419 Sum_probs=16.0 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 9999999999999999999999999999 Q gi|254780421|r 26 PLLILMILSLAIWFYLFDGSHEKKKNIV 53 (218) Q Consensus 26 ~~~i~~i~~~~~~~~~~~~~q~~~~~~a 53 (218) +.++=+|+..+.|.|||+.-+.||-++. T Consensus 3 ~~~~q~if~iiIG~YFfn~LK~Qq~nk~ 30 (532) T TIGR02902 3 FAIVQLIFAIIIGLYFFNLLKNQQTNKI 30 (532) T ss_pred EEEHHHHHHHHHHHHHHHHHHCCCCCEE T ss_conf 6642589999999999975440534404 No 296 >COG4640 Predicted membrane protein [Function unknown] Probab=20.64 E-value=49 Score=12.44 Aligned_cols=39 Identities=15% Similarity=0.273 Sum_probs=18.1 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHHHHHHH Q ss_conf 9999999999999999999999999999841387666999999 Q gi|254780421|r 35 LAIWFYLFDGSHEKKKNIVGENFAQALELFNSNKLDDARSSFE 77 (218) Q Consensus 35 ~~~~~~~~~~~q~~~~~~as~~y~~al~~~~~~~~~~A~~~l~ 77 (218) +++.++++...+.+. ...-+.|.+|+. .++.++...++. T Consensus 64 lii~~~~fgk~fssp-~~~vk~le~Ai~---~~Dk~kvatLl~ 102 (465) T COG4640 64 LIIILFFFGKNFSSP-EAQVKILENAIE---NNDKQKVATLLS 102 (465) T ss_pred HHHHHHHHHHCCCCH-HHHHHHHHHHHH---CCCHHHHHHHHC T ss_conf 999999975403888-899999999874---153989998842 No 297 >PRK11519 tyrosine kinase; Provisional Probab=20.54 E-value=49 Score=12.43 Aligned_cols=28 Identities=7% Similarity=-0.035 Sum_probs=12.6 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 58899999999751999999999999999 Q gi|254780421|r 9 SYKGKCKMGKCCYKWIAPLLILMILSLAI 37 (218) Q Consensus 9 ~~~~~~~~~~~~yk~ii~~~i~~i~~~~~ 37 (218) .+..=++.. ..++|.|+++++++++++. T Consensus 19 Dl~~ll~~l-~rrrw~Il~~~~i~~~~a~ 46 (720) T PRK11519 19 DIGRLVGTV-IEARWWVIGITAVFALCAV 46 (720) T ss_pred CHHHHHHHH-HHHHHHHHHHHHHHHHHHH T ss_conf 499999999-9744999999999999999 Done!